2. Field-Effect-Transistor Switch
A metal–oxide–semiconductor–field–effect–transistor (MOSFET) is the archetypal (electronic) binary switch that encodes the bits 0 and 1 in its two conductance states, high (ON) and low (OFF). In the ON state, charges flood into the channel providing a conduction path between the source and the drain to turn the transistor on. In the OFF state, these charges are driven out of the channel in order to disrupt the conduction path and turn the transistor off. Therefore, the two states are ultimately encoded in two
different amounts of charge (
Q1 and
Q2) in the channel. The switching action changes the amount of charge from
Q1 to
Q2, or vice versa, resulting in the (time-averaged) flow of a current:
where Δ
t is the amount of time it takes for the channel charge to change from
Q1 to
Q2 (or vice versa). This current will cause energy dissipation of the amount:
where
R is the resistance in the path of the current and
. We can think of Δ
V as the amount of voltage needed to be imposed at the transistor’s gate in order to change the charge in the channel by the amount Δ
Q. Note that the energy dissipation given in Equation (2) is
not independent of the switching time, because Δ
V depends on the switching time for a fixed Δ
Q and
R . We can rewrite the energy dissipation in Equation (2) as
, which clearly shows that for a fixed Δ
Q and
R, we will dissipate
more energy if we switch
faster (smaller Δ
t). Therefore, a more meaningful quantity to benchmark energy-efficiency is the
energy–delay product which is
. For a fixed
R, we can reduce this quantity by reducing Δ
Q, but that increasingly blurs the distinction between
Q1 and
Q2 and thereby impairs our ability to distinguish between bits 0 and 1. If Δ
Q is too small, then thermal generation and recombination can randomly change the amount of charge in the channel by an amount comparable to Δ
Q, causing random switching. Therefore, a larger Δ
Q translates to both stronger error-resilience and better reliability. This makes it obvious that there is a direct relation between reliability and energy–delay product; if we reduce the energy dissipation or energy-delay product by reducing Δ
Q, then we will invariably make the switch less reliable.
We can make this argument a little more precise by noting that
, where
is the gate capacitance. The thermal voltage fluctuation at the gate terminal is given by
, where
kT is the thermal energy [
3]. Hence, the thermal charge fluctuation in the channel is:
This quantity must be much smaller than the Δ
Q one needs to switch the conductance state of the transistor, and hence
. Let us define a quantity
η such that
. Clearly,
η is a measure of the ‘switching reliability’; the larger its value, the more reliable is the switch. From Equation (2), we can now obtain:
which immediately shows that we have to tolerate more energy dissipation,
Ed, and larger energy–delay product,
EdΔ
t, if we desire more reliability (i.e., a larger
η) [
4].
In some specific cases, such as a field–effect–transistor, we may be able to derive a relation between the energy dissipation/energy–delay product and the error probability. Consider the conduction-band diagram in the channel of an
n-channel field–effect–transistor along the direction of drain current flow as shown in
Figure 1. In the OFF state, there is a potential barrier at the source–channel junction which prevents electrons in the source contact from entering the channel and turning the transistor ON. This barrier has to be lowered by the applied gate potential Δ
V in order to allow electrons to enter the channel when the transistor has to be turned ON. Therefore, this barrier should be approximately equal to the quantity
. It is clear then that the transistor can spontaneously turn ON
while in the non-conducting state (causing a switching error that results in a bit error), if electrons can enter the channel from the source by thermionic emission over the barrier. The probability of entering the channel in this fashion, which is roughly
, is then the switching error probability
p. From Equation (2), we then get that the energy dissipation can be written as:
and the energy–delay product can be written as:
where
is the gate charging time. Equations (5) and (6) show the direct dependences of the energy dissipation and energy–delay product on the error-probability
p. These two equations clearly show that lower energy dissipation or lower energy–delay product are associated with higher switching error probability in a transistor switch.
To provide an estimate of how much energy we must dissipate to maintain any semblance of reliability, we will assume that we can tolerate a maximum error probability of 10
−15 [
5]. In modern day FINFETs, the gate capacitance may vary between 10 and 20 aF for a 0.5 μm wide gate [
6], and τ is on the order of 100 ps. Using the values in Equations (5) and (6), we get that the minimum energy dissipation and the minimum energy–delay product that we can expect in this type of FET device, while maintaining minimum acceptable reliability, are ~10 aJ and 10
−27 J-s, respectively.
3. Nanomagnetic Switches
Next, we consider a magnetic switch. We first point out that there are two types of switching errors: “unintentional switching” (i.e., switching that takes place when it is
not desired) and “failure to switch” (i.e., switching that does
not take place when it is desired). The previous example with the field–effect–transistor is of the first type and relates to errors caused by unintentional switching. Magnetic switches, fashioned out of nanomagnets (e.g., magnetic tunnel junctions), can also experience unintentional switching due to stray magnetic fields, thermal agitations, etc. The relationship between the energy dissipation and error probability for unintentional switching was derived in ref. [
7], which deduced that the minimum energy dissipation is related to the error probability according to:
where
Tc is the clock period and τ
0 is the inverse of the attempt frequency (which is in the range of 1 ps–1 ns) [
8].
To maintain p = 10−15 at a clock frequency of 1 GHz, the minimum energy dissipation would be 0.14–0.17 aJ according to Equation (7). We caution that this is an overly optimistic estimate, since it assumes that the minimum energy dissipation needed to switch is the anisotropy energy barrier within the nanomagnet and no consideration has been made of additional energy losses due to Gilbert damping, etc.
Next, we address “failure to switch” in a magnetic switch (i.e., switching not taking place when switching is desired and a switching stimulus is provided). Unlike the electronic switch, this case is not amenable to any analytical treatment and hence we will resort to simulations, using a specific example.
A bistable nanomagnetic switch can be fashioned out of a ferromagnetic elliptical disk where, because of the elliptical shape, the magnetization can point only along the major axis, either pointing to the left or to the right (as shown in
Figure 2a). This type of nanomagnet is said to possess in-plane magnetic anisotropy (IPA). In thinner nanomagnets, the surface anisotropy may be dominant and the magnetization can point perpendicular to the surface (either up or down). This type of nanomagnet is said to possess perpendicular magnetic anisotropy (PMA) (
Figure 2b). Either type makes a binary switch if we encode the bit information in the magnetization orientation which can point in just two directions. In this paper, we will consider only the IPA nanomagnetic switch, although the results will apply equally to PMA nanomagnets.
The IPA nanomagnet can be vertically integrated as the “soft” layer into a three-layer stack consisting of a “hard” ferromagnetic layer and an insulating (non-magnetic) spacer to form a magnetic tunnel junction (MTJ) (as shown in
Figure 2c). The hard layer is permanently magnetized in one of its two stable directions. When the soft layer’s magnetization is parallel to that of the hard layer, the MTJ resistance (measured between the two ferromagnetic layers) is low. If the two magnetizations are antiparallel, the resistance is high. Thus, the MTJ acts as a binary switch, much like the transistor, whose two resistance states (i.e., high and low) encode the binary bits 0 and 1. The difference between the transistor and the MTJ is that the former is
volatile (since charges leak out when the device is powered off), while the MTJ is
non-volatile (since the bit information is encoded in magnetization (spins) and not charge).
In order to make the magnetizations of the hard and soft layers mutually parallel (ON state), we can employ spin–transfer–torque [
2]. To do this, we apply a voltage across the MTJ with the negative polarity of the battery connected to the hard layer. This will inject spin-polarized electrons from the hard layer into the soft layer, whose spins are mostly aligned along the magnetization orientation of the hard layer. These injected electrons will transfer their spin angular momenta to the resident electrons in the soft layer, whose spins will then gradually turn in the direction of the injected spins and will magnetize the soft layer in a direction parallel to the magnetization of the hard layer. This is how the MTJ is turned “on”. In order to turn it “off”, we will reverse the polarity of the battery. This will inject electrons from the soft layer into the hard layer. However, because of spin-dependent tunneling through the spacer, those electrons whose spin polarizations are parallel to the magnetization of the hard layer will be preferentially injected. As these spins exit the soft layer their population is quickly depleted, leaving the opposite spins as the majority in the soft layer. That aligns the magnetization of the soft layer antiparallel to that of the hard layer. When that happens, the MTJ turns off.
For any given magnitude of injected current (with a given degree of spin polarization
), we can calculate the switching error probability (at room temperature) associated with spin–transfer–torque switching of an MTJ by carrying out
Landau–Lifshitz–Gilbert–Langevin simulation (also known as stochastic Landau–Lifshitz–Gilbert (s–LLG) simulation). To do this, we solve the following equation:
The last term in the right-hand side of Equation (8) is the field-like spin transfer torque exerted by the injected current
Is and the second to last term is the Slonczewski torque exerted by the same current. The coefficients
a and
b depend on device configurations and, following [
9], we will use the values
a = 1,
b = 0.3. Here,
is the time-varying magnetization vector in the soft layer normalized to unity,
mx(
t),
my(
t), and
mz(
t) are its time-varying components along the x-, y-, and
z-axis, respectively (see
Figure 2 for the Cartesian axes),
is the demagnetizing field in the soft layer due to its elliptical shape, and
is the random magnetic field due to thermal noise [
10]. The different parameters in Equation (3) are:
(gyromagnetic ratio),
α is the Gilbert damping constant,
is the magnetic permeability of free space,
Ms is the saturation magnetization of the cobalt soft layer,
kT is the thermal energy, Ω is the volume of the soft layer which is given by
, Δ
t is the time step used in the simulation, and
,
and
are three uncorrelated Gaussians with zero mean and unit standard deviation [
10]. The quantities
are calculated from the dimensions of the elliptical soft layer following the prescription of ref. [
11]. The nanomagnet soft layer is assumed to be made of cobalt with saturation magnetization
Ms = 8 × 10
5 A/m and
α = 0.01. Its major axis = 800 nm, minor axis = 700 nm and thickness = 2.2 nm. We assume that the spin polarization in the injected current is
ξ, which we take to be 30%. The spin current is given by
.
Using the vector identity
, we can recast the vector equation in Equation (8) as [
12]:
This vector equation can be recast as three coupled scalar equations in the three Cartesian components of the magnetization vector [
12]:
where
In our s–LLG simulations, we considered six different switching currents of 0.5, 1.0, 5.0, 10.0, 15.0 and 20.0, corresponding to current densities of A/m2, A/m2, A/m2, A/m2, A/m2 and A/m2, respectively. We generated 1000 switching trajectories for each current by solving Equation (10). We start with the initial condition and ran each trajectory for 20 ns with a time step of 0.1 ps. After 20 ns, each trajectory ended with a value of either close to +1 (switching success) or −1 (switching failure). The error probability is the fraction of trajectories that resulted in failure.
In
Figure 3, we plot the error probability (at room temperature) as a function of the current injected. Keeping in mind that the bulk of the energy dissipated is proportional to the square of the current, we see that the error probability decreases monotonically with increasing current or increasing energy dissipation. This shows that energy efficiency can only be purchased at the cost of reliability. In this respect, the magnetic switch shows the same trait as the electronic switch. In both cases, we have to expend more energy during switching if we wish to increase switching reliability.
In
Figure 4, we show the error probability as a function of switching time (pulse width of the injected current) for a fixed magnitude of the current. The current strength chosen for this plot was 10 mA. In this simulation, we turned off the current after different intervals of time and continued the simulation for 20 ns to see whether the value of
my ended up close to +1 (success) or −1 (failure). Again, the simulation duration of 20 ns was sufficient to ensure that for each simulated trajectory,
my ended up close to either +1 (success) or −1 (failure) at the end of the simulation. One thousand switching trajectories were generated for each pulse width, and the error probability is the fraction of trajectories that result in failure. We observe that the error probability decreased with increasing current pulse width (longer passage of current, or slower switching), as expected.
4. All-Optical Switches
Optical switches can be switched with a variety of agents such as electric fields, strain, another light beam, etc. A comprehensive study of all switching modalities will be beyond the scope of this article. Hence, we will focus only on all-optical switching [
13,
14]. We will study the simple case of a non-linear mirror switch shown in
Figure 5.
Consider a ray incident from a linear medium onto a non-linear medium at an angle θ
l. If the angle of incidence exceeds the critical angle for this pair of media, the ray suffers total internal reflection. Otherwise, it is refracted into the non-linear medium. The critical angle is given by the expression:
where
is the refractive index of the medium of incidence and
is the refractive index of the (non-linear) medium of refraction
. Since the latter medium is non-linear, its refractive index depends on the intensity of light incident on it, so that:
, where
I is the light intensity (related to the energy dissipation in switching). The switch works in the following way. Initially, the ray is incident at an angle less than the critical angle
. Then, as the intensity is increased and the critical angle decreases according to Equation (11), at some point the angle of incidence exceeds the critical angle
and the ray suffers total internal reflection.
At any incident angle θ
i, the minimum intensity needed to switch is given by the relation:
The probability of switching will go up rapidly with θi exceeding θc, (i.e., with the difference θi − θc). Therefore, the probability will go up rapidly with increasing value of the quantity , which shows that the probability of successful switching increases with increasing intensity I, or increasing energy dissipation. Thus, there is once again a trade-off between energy dissipation and reliability.