Physical and Technological Limitations of NanoCMOS Devices to the End of the Roadmap and Beyond

Simon Deleonibus*, Olivier Faynot, Barbara de Salvo, Thomas Ernst, Cyrille Le Royer, Thierry Poiroux and Maud Vinet

CEA-LETI/MINATEC CEA-Grenoble, 17 rue des Martyrs 38054
Grenoble Cedex 09 France.

*sdeleonibus@cea.fr

Since the end of the 1990s, the microelectronics industry has been facing new challenges as far as CMOS devices scaling is concerned. Linear scaling will be possible in the future if new materials are introduced in CMOS device structures or if new device architectures are implemented. Innovations in the electronics history have been possible because of the strong association between devices and materials research. The demand for low voltage, low power and high performance are the great challenges for the engineering of sub 50 nm gate length CMOS devices because of the increasing interest and necessities of Nomadic Electronic Systems. Functional CMOS devices in the range of 5 nm channel length have been demonstrated. In this chapter, alternative architectures that allow increase to devices’ drivability and reduce power consumption are reviewed such as multigate, multichannel architectures and nanowires. The issues in the field of gate stack, channel, substrate, as well as source and drain engineering are addressed. HiK gate dielectric and metal gate are among the most strategic options to implement for power consumption and low supply voltage management. By introducing new materials (Ge, Carbon based materials, III–V semiconductors,
HiK, ...), Si based CMOS will be scaled beyond the ITRS as the future System-on-Chip Platform integrating also new disruptive devices. For these devices, the low parasitics required to obtain high performance circuits, makes competition against logic CMOS extremely challenging.

1. International Technology Roadmap of Semiconductors
   Acceleration and Issues

Since 1994, the International Technology Roadmap for Semiconductor (ITRS)\(^1\) (Fig. 1) has accelerated the scaling of CMOS devices to lower dimensions continuously despite the difficulties that appear in device optimization.

However, technical roadblocks in lithography principally, economics and physical limitations have slowed down the evolution. Also, for the first time, since the introduction of poly gate in CMOS devices process, showstoppers other than lithography appear to be attracting special attention and require some breakthrough or evolution if we want to continue scaling at the same rate. Design will also be affected by this evolution.

Fig. 1. ITRS forecast evolution since 1994 for MPU devices (HP devices).\(^1\) The half pitch (technology node) appears as a parameter. The minimum physical gate length is given in brackets.

6 S. Deleonibus et al.
Which are the main showstoppers for CMOS scaling? In this paper, we focus on the possible solutions to investigate and guidelines for research in the next years in order to propose solutions to enhance CMOS performance before we need to skip to alternative devices. In other words, how can we offer a second life to CMOS?

To that respect, the roadmap distinguishes today three types of products: High Performance (HP) (Fig. 1), Low Operating Power (LOP) and Low Standby Power (LSTP) devices. In the HP case, a historical fact will happen by the 32 nm node: the contribution of static power dissipation will become higher than the dynamic power contribution to the total power consumption! This main fact could affect the MOSFET saturation current as can be observed on historical trends of smallest gate length devices. Multi-gate devices could improve somewhat this evolution (see Section 4.2.2.) by improving the ratio between saturation current and leakage current. In this paper, we will analyze the various mechanisms giving rise to leakage current in a MOS device and that can impact consumption of final devices. Gate leakage current is already a concern. A High Dielectric Constant (HiK) gate insulator will be needed in order to limit static consumption (see Section 4.2).

In Section 2 of this review, we will first analyze the main limitations and showstoppers affecting bulk CMOS scaling. In Section 3, the issues in lowering supply voltage to reduce power dissipation are identified. In Section 4, the limitations to scaling must be taken into account in the device optimization in terms of gate stack, channel and source and drain engineering as well as new devices architectures (FDSOI or multigate devices). The alternative possibilities offered by new materials for enhancement of device transport properties or power dissipation are reviewed in Sections 5 and 6. Finally, in Section 7, we review the applications demonstrated by single or few electronics in the field of memories or possible alternatives to CMOS.

2. Limitations and Showstoppers Coming from CMOS Scaling

CMOS device engineering consist of minimizing leakage current together with maximizing the output current. In sub 100 nm CMOS devices, non stationary transport gains more importance as compared to diffusive transport.
2.1. **Origin of leakage current in CMOS devices**

Several mechanisms can generate devices leakage in ultra small MOSFETs, which can be sorted in two categories:

a) Classical type.
- Drain Induced Barrier Lowering (DIBL) is due to the capacitive coupling between source and drain.
- Short Channel Effect (SCE) due to the charge sharing in the channel in the short channel devices at low $V_{ds}$.
- Punch-Through between source and drain due to the extension of source space charge to the drain.

b) Tunneling currents
- Direct tunneling through the gate dielectric.
- Field assisted tunneling at the drain to channel edge. This effect occurs if electric field is high and tunneling is enhanced through the thinnest part of the barrier.
- Direct tunneling from source to drain. This effect will occur in silicon for a thicker barrier than on $\text{SiO}_2$ because the maximum barrier height is lower (1.15 eV in Si versus 3.2 eV in $\text{SiO}_2$).

2.2. **Issues related to non stationary transport**

Velocity overshoot and ballistic transport are the mechanisms that will enhance drivability in sub 50 nm channel lengths devices. However, the impact of Coulomb scattering by dopants on transport is non negligible even in the 5 nm range channel lengths.\(^3,^4\) Superhalo doping is efficient to improve SCE and DIBL in 16 nm finished gate length (Fig. 2)\(^5\) but will degrade the channel transport properties\(^5\) by dopant Coulomb scattering (Fig. 3(a)) and high transverse electric field.

The degradation of transport properties can be observed on short channel mobility measurement by using a specific method with direct $L_{eff}$ measurement\(^6\) (Fig. 3(b)). A mobility degradation of a factor 2 to 3 or more can be measured on the most aggressive nano-scaled bulk technologies. The ITRS target of a transconductance increase by a factor 2\(^1\) is still very challenging on such gate length even if an enhancement is reported on long channels. Furthermore, for such gate lengths access resistance due to extension scaling is an issue (Fig. 3(a))\(^4\).

---

8 S. Deleonibus et al.
3. Issues in Supply Voltage Down Scaling

In the future, the electronics market will require portable objects used in daily life and consequently low standby power dissipation and low active power consumption will be needed. Scaling down of supply voltage is an essential leverage to decrease power dissipation. However, it raises several questions about the possible lower limits.
The power dissipation $P$ of a MOSFET is due to static and dynamic contributions expressed by:

$$P = P_{\text{stat}} + P_{\text{dyn}}$$  \hspace{1cm} (1)

$$P_{\text{stat}} = V_{dd} \times I_{\text{off}}$$  \hspace{1cm} (2.1)

and

$$P_{\text{dyn}} = CV_{dd}^2 f$$  \hspace{1cm} (2.2)

$P$ is the total power dissipation; $P_{\text{stat}}$ and $P_{\text{dyn}}$ are the static and the dynamic power dissipations respectively. The strong impact of supply voltage on power dissipation appearing in (1), (2.1) and (2.2), will also preclude a strategy of threshold voltage value adjustment depending on the application.

Information theory and statistical mechanics as well as the electrostatics of the device will set the limits of switching of binary devices. Moreover, dopant fluctuations will affect the control of device characteristics substantially: that is why low doping of CMOS channel will help in the down scaling of supply voltage.

3.1. **Fundamental limits of binary devices switching**

Quantum mechanics illustrates that switching involves non linear devices that would demonstrate a gain. That could occur with or without wavefunction phase changing. The Quantum limit on switching energy will be given by the Heisenberg’s uncertainty principle:

$$E \geq \frac{\hbar}{\tau}$$

which gives a minimum switching energy of $E_{\text{min}} = 10^{-5} aJ$ considering $\tau = 10$ ps, $\hbar = \frac{2\pi}{\tau}$ is Planck’s constant equal to $6.34 \times 10^{-34}$ J.s.

The second principle of thermodynamics imposes the maximization of entropy at temperature $T$. Applied to information theory this has a consequence on the minimal energy that a system, based on binary states of each bit of information, will require to switch from one state to the other: $E \geq kTLn (2)$ with entropy $S = kLn (2)$ linked the quantity of information available in such a system. Thus:

$$E \geq 3 \times 10^{-3} aJ \text{ at } T = 300 \text{ K}$$

If the system has a large number of gates $N$, with a response time $\tau$ that could switch at an average rate time $\tau_{\text{mbf}}$, then the mean time
Between failures (MTBF) is given by the expression: $\tau_{\text{mbf}} = \frac{\tau}{N} P = \frac{\tau}{N} e^{\frac{E}{kT}}$

$P = e^{-\frac{E}{kT}}$ is the switching probability of a single gate. We can demonstrate that the minimum switching energy is given by:

$$E \geq kT \ln \left( \frac{N \tau_{\text{mbf}}}{\tau} \right).$$

If we consider $N = 10^9$, $\tau = 10$ ps and MTBF = 1000 h (i.e. $3.6 \times 10^6$ s), then we get: $E \geq 0.25 \, \text{aJ}$.

Among the three limitations mentioned above, the latter is the largest one.

In order to estimate the associated minimal switching voltage $V_{\text{min}}$ one must consider the capacitive load $C_L$ associated to a switching gate. We will then extract $V_{\text{min}}$ from the following relation:

$$kT \ln \left( \frac{N \tau_{\text{mbf}}}{\tau} \right) = C_L V_{\text{min}}^2,$$

and get

$$V_{\text{min}} = \left( \frac{kT \ln \left( \frac{N \tau_{\text{mbf}}}{\tau} \right)}{C_L} \right)^{1/2}.$$

At $T = 300$ K, $V_{\text{min}} = 10$ mV will be the limit if the load capacitance is in the range $0.4$ fF (corresponding to $1$ nm gate oxide thickness).

### 3.2. Issues related with decananometer gate length devices

In the decananometer range (less than 100 nm), besides classical 2 dimensional electrostatic effects, tunneling currents will contribute significantly to MOSFET leakage. In the following, we review the principal parasitic effects that could limit ultimate MOSFETs operation.

#### 3.2.1. Direct tunneling through SiO$_2$ gate dielectric

is significant for a thickness less than 2.5 nm. It contributes to the leakage component of power consumption. Less than 1.4 nm thin SiO$_2$ is usable without affecting devices reliability.$^{3,7-9}$

#### 3.2.2. High doping levels in the channel

reaching more than $5 \times 10^{18}$ cm$^{-3}$ enhances Fowler-Nordheim field assisted tunneling reverse current in sources and drains up to values of $1$ A/cm$^2$ (under $1$ V)$^{10}$.
3.2.3. **Direct tunneling from source to drain** is easily measurable for very short channel lengths\(^4,5\) lower than 10 nm. It will affect subthreshold leakage substantially at room temperature for channel lengths less than 5 nm.

3.2.4. **Classical small dimension effects** are more severe than the fundamental limits of switching (quantum fluctuations, energy equipartition, or thermal fluctuations). A minimum value is required for threshold voltage due to:

- **subthreshold inversion.** For ideal fully-depleted SOI (FDSOI) 59.87 mV/dec subthreshold swing can be obtained at 300 K. The limit \(V_T\) value is 180 mV precluding a supply voltage \(V_S\) lower than 0.50 V. Impact Ionization MOS (I-MOS) would allow reducing subthreshold swing to 5 mV/dec. However, performance and reliability remain issues.\(^11\)
- **short channel effect** due to the charge sharing along the transistor channel following the relation:

\[
\Delta V_T = -4 \Phi_F \frac{C_{W, x_j}}{C_{ox} L} \left( \left( 1 + 2 \frac{W}{x_j} \right)^{1/2} - 1 \right)
\]

\[
= -4 \Phi_F \frac{\varepsilon \ t_{ox}}{\varepsilon_{ox} L W} \left( \left( 1 + 2 \frac{W}{x_j} \right)^{1/2} - 1 \right)
\]

(3)

Here \(V_T\) is expressed by:

\[
V_T = V_{FB} + 2 \Phi_F - \frac{Q_B}{C_{ox}}
\]

(4)

where

\[
V_{FB} = \Phi_{MS} - \frac{Q_{ox}}{C_{ox}}
\]

(5)

and

\[
C_{ox} = \frac{\varepsilon_{ox}}{t_{ox}}; \quad \Phi_{MS} = \Phi_M - \Phi_s
\]

(5.1)

\(\Delta V_T\) is the threshold voltage decay; \(t_{ox}\) is the gate dielectric thickness; \(\varepsilon\) and \(\varepsilon_{ox}\) are the silicon and gate dielectric constant respectively; \(L\) is the channel length; \(X_j\) is the drain or source junction depth; \(W\) is the space charge region depth; \(V_T\) is the threshold voltage; \(V_{FB}\) the flatband voltage; \(\Phi_F\) the distance from Fermi level to the intrinsic Fermi level; \(Q_B\) the gate controlled charge; \(C_{ox}\) is the unit area capacitance of the gate insulator. \(\Phi_{MS}\)
is the difference between the workfunctions of the gate and the semiconductor; $Q_{ox}$ is the oxide charge density; $\varphi_M$ and $\varphi_S$ are the metal and the semiconductor workfunction.

Gate depletion and quantum confinement in the inversion layer will play an important role on short channel effect by adding their contribution to the gate to channel capacitance $C_G$. SCE is the main limitation to minimal design rule. For low $V_T$ values it can be of the order of $V_T$. In order to maintain inverter delay degradation to less than 30%, we must observe the condition $V_T = -\frac{V_{DD}}{3}$. $V_{DD}$ is the supply voltage.

- **Drain Induced Barrier Lowering (DIBL)**

Classically, DIBL is due to the capacitive coupling between drain and source resulting in a barrier lowering on the source side. An eased charge injection from the source allows an increased control of the channel charge by the source and drain electrodes and reduces the threshold voltage. This effect (thus $\Delta V_T$) increases with increasing Vds and decreasing L. A simple model shows that:

$$\Delta V_T = -\gamma \frac{V_{ds}}{L^2} (\gamma \text{ is in the range of } 0.01 \mu m^2)$$

### 3.3. Variability from statistical dopant fluctuations and Line Edge Roughness

The effect of dopant fluctuations has already been considered by Shockley in 1961. Recently, special attention has been paid to this subject because the number of dopants in the channel of a MOSFET tends to decrease with scaling of devices geometry. The random placement of dopants in the MOSFETs channel by ion implantation will affect devices characteristics for geometries lower than 50 nm. The discrete nature of dopant distribution can give rise to asymmetrical device characteristics which will impact seriously the building of a complete integrated system with a large number of devices.

Dopant fluctuations and Fowler Nordheim limitation of leakage at high electric fields will encourage the use of low doped thin SOI.

Atomistic, *ab initio* approaches are used to simulate the contribution of the discrete number of dopants to the parameter variability as well as the Line Edge Roughness which becomes an important source of dispersion brought by ultimate lithography resist or the underlying gate material.
roughness. These contributions will be added to the films interface rough-
ness and thickness fluctuations to affect transport properties or noise figures
at the level of a device or a complete integrated system.

4. Technological Options to MOSFET Optimization

In Sub Sections 4.1, 4.3, the possible solutions to overcome the physical
limitations encountered in classical scaling are reviewed through gate stack
and channel/substrate engineering as well as source and drain engineering.
Mastering and improvement of transport properties by strained channels
and substrate engineering will be of primary importance in the future and
not only limited to threshold voltage adjustment as it was the case in the
past. The gate stack will also be reviewed on the electrical properties side
as well as on the defect density view point. Source and drain engineering
has to be addressed not only on the dopant activation side but also on the
architecture side: access resistance to the channel can drastically reduce
any advantage brought from channel transport properties optimization.
In Sub Section 4.2, we review the alternative architecture candidates
to replace bulk devices by leveraging the trade off between performance
and power consumption. Power dissipation limitation will be the hardest
challenge to face in the future whereas portable devices and systems will
drive the market in the nanoelectronics era. That is why thin films and
Multigate architectures are major alternative approaches to extend CMOS
life to the end of the roadmap and possibly beyond.

4.1. Gate stack and channel/substrate engineering

Threshold voltage management issues in classical bulk MOSFET will guide
its scaling.

Gate and channel engineering must be optimized together because both
physical characteristics affect the nominal $V_T$ value of expression (4) which
can be written as:

$$V_T = V_{FB} + 2\phi_F - Q_B/C_G$$  \hspace{1cm} (6)

(gate depletion and channel quantum effects are taken into account).

Low $V_T$ values will result from:

• Tuning surface doping concentration (see Section 4.1.1)
• Strained channel engineering (see Section 4.1.2)
4.1.1. Tuning surface doping concentration as low as possible. Excellent localization of the dopant profile is needed to minimize junction parasitic capacitance and body effect. Selective Si epitaxy of the channel has also been demonstrated to achieve almost ideal retrograde profiles. Selective epitaxial Si:C acts as a Boron diffusion barrier and thus help to improve drastically short channel effect (Fig. 4(a)) as well as low field mobility. Multibarrier channels, using an alternated Si/SiGeC epitaxial channel structure, have been proven to be efficient in optimizing short channel effects immunity compatible with high devices drivability (Fig. 4(b)). These solutions can give a longer breath to bulk CMOS devices scaling.

Fig. 4. Introduction of Carbonated silicon in MOSFET channel: (a) Influence on short channel effect; (b) Optimization by a multibarrier channel.
4.1.2. Strained channel engineering

4.1.2.1. Global strain

Strained SiGe, SiGe$_x$C$_y$ based alloys or strained Si epitaxy have been studied to increase the channel mobility by introducing compressive or tensile strain to enhance hole or electron effective mass respectively. In order to achieve such channel architectures, bulk relaxed SiGe pseudo substrates obtained by graded SiGe buffer were intensively developed during the last decades. High-quality pseudomorphic silicon layer with very high biaxial-strain values (typically 1.2–1.5 MPa or more) can be grown on those substrates. The resulting degeneracy leverage on the conduction bands leads to effective electron mass reduction and mobility increase up to around 80%.

The quality of those substrates has been spectacularly improved. Independently of possible remaining defects (dislocation pile ups, stacking faults, etch pits) a major limitation remains: the reported gain in current enhancement decreases with gate length reduction (Fig. 5). This $I_{ON}$ gain decrease with $L$ was attributed to self heating (monitored pulse drain

![Graph showing gain in drain current vs. gate lengths at various voltages](image)

Fig. 5. Gain in drain current vs. gate lengths at VGT = VDS = −1.3 V for [ALIE98]; at VGT = −0.5 V VDS = −2 V for [LING02]; and at VGT = −1 V VDS = −1.5 V for [COLL02]; VGT = (VG.– VT) for [COLL02'] (see Refs. 28 and 24).
current measurement) due to low thermal conductivity of SiGe. But some authors have pointed out that even at low drain voltage (insensitive to self-heating) the gain current loss is still relevant. Both possible S/D implantation damages and lateral strain S/D relaxations may explain the loss on mobility increase on those short channel strained devices.

However, high quality gate insulator and subthreshold characteristics optimization require a Si cap layer on top of the channel and low thermal budget. Ultimately, a HiK gate insulator is needed in these architectures.

In parallel, high quality strained silicon on insulator substrate, with or without SiGe for dual channel operation has been developed. SiGe condensation technique can lead to high quality SiGe on Insulator (SGOI) whereas high quality SGOI and sSOI substrated by Smartcut were reported.

### 4.1.2.2. Process induced strain

Process induced strain is the most mature option for today’s IC and is proposed in the 65 nm and 45 nm platforms. In those technologies, external strain, mostly uni-axial, is applied by various means. The most currently used approach is the compressive or tensile contact etch stop layer to obtain respectively tensile channel nMOS or compressive channel pMOS. Recent studies quantify by direct measurements the mobility enhancement on short channels with process induced strain showing a direct correlation between low and high $V_d$ regime.

### 4.1.2.3. Other substrate solutions

Unstrained solutions may use the chemical composition of the substrate or the crystalline surface or transport orientation.

Changing surface silicon orientation or transport orientation can lead to mobility improvement by a factor 2 or more. The (110) surface orientation lead to an improvement for hole. Dual channel with (100) orientation for electrons and (110) orientation for holes was reported. Germanium and Germanium-on-insulator were proposed as unstrained substrates. One of the higher channel mobility improvement by using column IV elements is compressive Germanium with more than a factor 10 of hole inversion charge mobility improvement which could bring a solution for dual channel optimization.
4.1.3. Choosing the gate material

Ideal transfer CMOS inverters characteristics requires symmetry of threshold voltage for \( n \) and \( p \) channel devices (i.e. \( V_{TP} = -V_{TN} \)). Several alternatives have been envisaged:

- **The use of \( n^+ \) poly gate for \( n\text{MOSFET} \) and \( p^+ \) poly gate for \( p\text{MOSFET} \).** This solution suffers from Boron penetration into \( \text{SiO}_2 \) coming from the \( p^+ \) doped gate. Nitrided \( \text{SiO}_2 \) limits this effect without avoiding it: trapping centers are created near or at the \( \text{SiO}_2/\text{Si} \) interface decreasing carrier mobility.

- **The use of metal gate material.** No gate depletion is observed in this case. The use of midgap gate (TiN for example) on bulk silicon or partially depleted SOI will be dedicated to supply voltages higher than 1 V. Workfunction engineering for dual metal gates is challenging: the highest CMOS performance/lowest leakage current trade off can be obtained. It is mandatory on low doped FDSOI.

Several approaches have been proposed for metal gate integration. The classical process integration, so called direct gate, requires the protection of the metal gate material from ion implantation as well as from oxidation during the dopant activation anneal. TiN has often been chosen as a gate material because it is available as a standard in the industry. Alternatives such as the damascene gate (Fig. 6) have been achieved in order to avoid the issue of source and drain activation temperature. It is noteworthy that, thanks to the damascene architecture, High Frequency and Multi threshold devices could be embedded in Systems On Chip. Complete silicidation of polysilicon gate has been demonstrated to lead to metallic behavior of both \( n \) and \( p \) gates. However, integration with HiK dielectrics gives rise to the so called Fermi level pinning similar to what is obtained with polysilicon gates.

4.1.4. Gate dielectric engineering

The gate leakage due to direct tunneling in standard \( \text{SiO}_2 \) or \( \text{SiO}_3\text{N}_y \) is one major show stopper. It will impact directly the static power dissipation \( P_{\text{stat}} \) according to relation (2.1) Let us consider a circuit with active area of the order of 1 cm\(^2\) and gate oxide \( \text{SiO}_2 \) \( t_{ox} = 1.2 \text{ nm} \). Considering the contribution of gate leakage to \( I_{off} \) under the condition \( V_{dd} = 0.5 \text{ V} \), then \( P_{\text{stat}}(0.5 \text{ V}) = 5 \text{ W} \). We would get \( P_{\text{stat}}(1.5 \text{ V}) = 750 \text{ W} \) if \( V_{dd} = 1.5 \text{ V} \)!! This
results as a major show stopper for scaling of CMOS technology. That is why High K will be urgently needed in the near future. Besides affecting static power, gate leakage also impacts negatively delay time and affects the functionality of logic circuits.

4.1.4.1. From SiO$_2$ to High K gate dielectrics

A decrease of devices performance has been reported if SiO$_2$ thickness is lower than 1.3 nm suggesting a surface roughness limited mobility process due to the proximity of sub-oxide. The strong band bending due to quantum mechanical corrections affects the lower limit of supply voltage in the constant field scaling approach. Solutions compatible with silicon gate are also investigated to keep compatibility with a standard CMOS process flow: HfSiO$_x$, ZrSiO$_x$ are given much attention as good candidates. These solutions are dielectric thickness budget consuming (SiO$_x$ interface) and Fermi level pinning occurs at the HiK/poly gate interface.

Very low leakage current has been reported by using HfO$_2$ of 1.3 nm Equivalent Oxide Thickness (EOT) combined with a TiN gate integrated on 45 nm CMOS by a damascene process (Fig. 6). Electron mobility degradation is reported compared to SiO$_2$ gate dielectric attributed to stress induced phonon scattering (Fig. 7(a)). These materials have a smaller bandgap than SiO$_2$; thus trapping is a strong reliability issue. That is why
a SiON interface could be helpful to reduce the leakage current thanks to the higher bandgap of SiON.

La$_2$O$_3$ films with EOT as thin as $0.61$ nm have been proven to demonstrate very low leakage current as low as $J = 5.5 \times 10^{-4}$ A.cm$^{-2}$ compatible with high interface quality and acceptable mobility values (Fig. 7(b)). These results are obtained on low temperature end of process and aluminum gate. Integration into a direct gate process is still an issue.

4.1.4.2. Combining gate stack and channel workfunction engineering

Specific technological optimization may be necessary to maximize the transport gain in short channels. In particular, maintaining the high stress of 1.2 or more GPa in a nanoscaled device and reducing ion implantation damages are among the main challenges. Meanwhile, the combination of strained Si and SiGe channel can be a promising solution for future applications. For instance, it was shown that both surface conduction and hole mobility enhancement (65% at high transverse electric field) could be achieved by using selective SiGe for PMOS coupled with high-k and metal gate$^{33,53}$ (Fig. 8).

Even in the case of low gain in short channel $I_{ON}$ values,$^{33}$ it is possible to adjust $V_T$ by locally strained layers by using a mid gap metal gate.
4.2. Architecture alternatives to improve CMOS performances and integration

4.2.1. Fully depleted SOI devices

In order to obtain the lowest subthreshold slope (60 mV/dec) and acceptable DIBL on FDSOI a practical rule is used: $T_{Si} \leq L_{gate}/4.54$ The spreading of potential into the buried oxide, due to the coupling with the top gate, increases the coupling between source and drain and thus DIBL. Ultra-low SOI films thickness is difficult to control. That is why partially depleted SOI has been proposed.54,55 Because of complete isolation of the SOI devices as well as lower junction capacitance, improved figures of merit are obtained as compared to bulk.54 The threshold voltage is dependent on Si film thickness whenever the film thickness becomes lower than the space charge region. $V_T$ is then expressed as54:

$$V_T = V_{FB} + 2\varphi_F + \frac{qN_AT_{Si}}{2C_{ox}}$$ (7.1)

In the case of a low doped channel, expression (7.1) can be simplified as the well known relation:

$$V_T = \left( \varphi_M - \frac{E_i}{q} \right) + \frac{kT}{q} \ln \left( \frac{2.C_{ox}.kT}{q^2n_iT_{Si}} \right)$$ (7.2)
$N_A$ is the acceptor concentration; $T_{Si}$ is the silicon thickness; $C_{ox}$ is the gate insulator capacitance; $E_i$ is the semiconductor intrinsic Fermi level energy; $n_i$ is the intrinsic carrier concentration.

Scaling of FD devices encounters some limitations due to the quantum confinement of carriers in ultra thin films and its incidence on the threshold voltage value:\textsuperscript{56} the increase of the fundamental level of the conduction band will increase flat band voltage and $V_T$ consequently.

The functionality of ultra small 6 nm gate length devices on 7 nm thin Si film was demonstrated.\textsuperscript{57} However, the electrical performances of these devices are extremely sensitive to the SOI film thickness variations due to the fact that a compromise must be found between series resistance minimization and DIBL.\textsuperscript{58}

Combination of strained channels and SOI could result in optimized trade off between short channel effects reduction and enhanced transport properties. A Si and SiGe Dual strained channels on insulator architecture has been demonstrated functional down to gate lengths of 15 nm (Fig. 9).\textsuperscript{34,37}

For sub 100 nm range channel lengths and widths, the strain induced by the environing thin films affects devices characteristics. The loss of global strain observed in short channels is recovered by the lateral strain induced on the narrow active areas (Fig. 10(a)).\textsuperscript{34,59,60} This effect has been evidenced quite clearly on FDSOI films\textsuperscript{34,59} where the biaxial and uniaxial strain are additive effects which balance the loss of strain that could be induced by

**Fig. 9.** (a) Cross sectional TEM pictures of the co-integrated dual channels MOSFETs on Insulator with a HfO$_2$/TiN/Poly/NiSi gate stack;\textsuperscript{34,37} (b) Strained Dual channels CMOS Process Flow.\textsuperscript{34}
With multi gate devices (Fig. 11(b)), short channel effects and leakage for controlling short channel can be relaxed compared to single gate FD occurs by volume inversion due to the coupling of both gates. The conditions and high drivability can be obtained. In the saturation regime, transport current can be drastically reduced because 60 mV/dec subthreshold swing vs. inversion charge along orientations: (b)

SOI material should allow to realize attractive devices like multi gated MOSFETs\(^\text{61}\) that will extend further scaling of FD devices which are limited by the quantum confinement and splitting of allowed energy bands as well as DIBL via the coupling of the gate with buried oxide\(^\text{56}\) (Fig. 11(a)). With multi gate devices (Fig. 11(b)), short channel effects and leakage current can be drastically reduced because 60 mV/dec subthreshold swing and high drivability can be obtained. In the saturation regime, transport occurs by volume inversion due to the coupling of both gates. The conditions for controlling short channel can be relaxed compared to single gate FD

4.2.2. Multigate devices

SOI material should allow to realize attractive devices like multi gated MOSFETs\(^\text{61}\) that will extend further scaling of FD devices which are limited by the quantum confinement and splitting of allowed energy bands as well as DIBL via the coupling of the gate with buried oxide\(^\text{56}\) (Fig. 11(a)). With multi gate devices (Fig. 11(b)), short channel effects and leakage current can be drastically reduced because 60 mV/dec subthreshold swing and high drivability can be obtained. In the saturation regime, transport occurs by volume inversion due to the coupling of both gates. The conditions for controlling short channel can be relaxed compared to single gate FD

= Fig. 10. A piezoelectric model is applied to describe the effects induced by strain on the MOSFET electrical behaviour of: (a) short and narrow devices on SOI. Experimental gm, max enhancement vs. device width is compared to the piezoelectric model. Inset: Approximation of the used piezo-electric model.\(^\text{34}\) Short and narrow n-channel electron mobility vs. inversion charge along orientations: (b) \(\{110\}\); (c) \(\{100\}\).\(^\text{39,60}\)

source and drain and the process steps to implant contacts architecture. For electrons, these effects are more pronounced on \(\{110\}\) than on \(\{100\}\) (Figs. 10(b) and 10(c)).\(^\text{60}\)
devices. Nevertheless, the control of thin SOI and design of high density circuits with these devices have to be demonstrated.

Another main feature of these devices is to bring a solution to the channel dopant fluctuation issue in small volume. Reducing the film thickness to the minimum, allows using nearly intrinsic Si films because bulk punch-through is no more a problem. Adjusting \( V_T \) to match the overdrive defined by \( (V_s - V_T) \) with a low supply voltage \( V_S \) index will require adjusting the gate workfunction \( \phi_M \) according to relation (5.1). That is why, workfunction engineering on metal gate and HiK stacks is mandatory for low \( V_S \) applications.

Among the various studies published on multi-gate devices, many architectures have been proposed in which the channel is controlled by two or more gates.

In planar architectures, the structure can be non self-aligned, i.e. fabricated with one photo-lithography step for each gate, or self-aligned, using only one lithography step to define both gates. The non self-aligned architecture by wafer bonding is the most straightforward approach to fabricate planar double gate. The success of this approach depends on the lithography capability to align very short gates one to the other. Figure 11(b) shows a 10 nm non self-aligned planar double gate transistor, fabricated thanks to the use of wafer bonding and e-beam lithography. Notice that a quasi-perfect gate alignment, with an accuracy of a few nanometers, could have been achieved using other techniques.

Fig. 11. (a) Threshold voltage dependence of SOI devices as a function of SOI thickness for different values of channel doping; (b) TEM cross-section of a 10 nm planar bonded double gate transistor with TiN metal gate.

24 S. Deleonibus et al.
be achieved thanks to the self-aligned regeneration of the alignment marks after the bonding step.74

Several approaches have been proposed to fabricate self-aligned planar double gate MOSFETs. The first one consisted in patterning a narrow silicon active area on a SOI substrate, etching a localized cavity under this active area into the buried oxide, and its filling by the gate material.75 After gate patterning, the silicon active area is surrounded by the gate. Another gate-all-around (GAA) architecture, based on the silicon-on-nothing (SON) process, has been proposed more recently76 and demonstrated down to very short gate lengths. This approach relies on successive epitaxial growth of crystalline SiGe and Si layers. The SiGe layer is then selectively etched to form a tunnel below the silicon film, and this tunnel is filled by the gate material.

In the PAGODA architecture,77 the unpatterned back gate stack is deposited and encapsulated before wafer bonding. After initial substrate removal, the front gate is patterned and silicon spacers recrystallized from the channel are formed and silicided. These silicided spacers are used as a hard-mask for back gate etching and undercut.

The process flow proposed in78 starts also from back gate stack deposition and wafer bonding. The whole stack, comprising the front gate, the channel and the back gate is then patterned. Insulated layers are formed beside the gates by use of oxidation rate difference between the gate and the channel materials. Source/drain regions are then regenerated by lateral epitaxial regrowth from the channel edges.

The key technological issues of the planar architectures are the precise controls of the very thin film thickness and of the back gate dimension, since the back gate is not directly accessible from the top of the wafer. However, with the planar bonded architectures it is possible to bias the front and back gate independently74 (Figs. 12(a) and (b)). That allows the use of different transistors families with several threshold voltages values available on the same chip by using one single type of device. The electrical characteristics of the devices can fulfill the specifications of the 3 families of devices proposed in the ITRS[1], so-called High Performance (HP), Low Operating Power (LOP) and Low Standby Power (LSTP)74 (Fig. 12(b)). Moreover, the planar bonded Double Gate devices are co integratable with single gate FDSOI and allow a metallic Ground plane by using the backside gate. The planar bonded architecture approach brings a unique innovative option to future Systems On Chip.79
Fig. 12. (a) Tunable threshold voltage of the devices as a function of back gate voltage; (b) $I_{\text{off}}$ vs. $I_{\text{on}}$ of tunable DG MOS (adjustable $V_{bg}$–$V_{fg}$) and tunable DG MOS operating in FD mode (adjustable $V_{bg}$) from Low-stand-by-power (LSTP) to High-performance (HP) –90 nm node.70

On the other hand, structures with fingered vertical channel, such as FinFET80 (Fig. 13(a)), Trigate81 (Fig. 13(b)), Ω-FET82 (Fig. 14(a)), Π-Gate83 and nanowire-FET84 have been extensively studied. Fabrication of FinFETs relies on high aspect ratio fin definition and short gate patterning on this topography (Fig. 13(a)). Conversely to planar devices, the conduction takes place on the vertical sidewalls of the fin. The conduction width is thus twice the fin height ($h_{\text{fin}}$). As the fin height is limited to typically 50 to 100 nm, FinFETs are usually designed as multifinger transistors, with a conduction width quantified by $2h_{\text{fin}}$. In order to obtain the same drive current per silicon area as planar double gate transistors, the spacing between the fingers has to be lower than the fin height.

Fig. 13. (a) Schematic of a FinFET device. (b) Left: SEM top-view of a 20 nm gate length multifinger Trigate device. Right: Schematic cross-section of one Trigate fin.
Thus, one key technological issue lies in the multi-fin definition. Dense array of narrow fins have to be patterned, with a good control of the fin width and shape. The use of spacers as hard-mask for fin patterning seems unavoidable, as it allows to double the fin density and to design sub-10 nm wide fins.

Another approach consists in designing the fin with roughly a square cross-section (Fig. 13(b)). In that case, the channel is controlled by the gate on three sides. This device, so called Trigate, has a conduction width given by twice the fin height plus the fin width. Trigate is still a multifinger device, and the spacing between fins has to be lower than $h_{\text{fin}} + w_{\text{fin}}/2$ to obtain higher drive currents per silicon area than with planar devices. This limit is far more strict for Trigate than for FinFET, since the fin height must be as low as the fin width in order to operate in trigate mode, and comparable to the gate length to benefit from a good electrostatic channel control.

The $\Omega$-FET and $\Pi$-Gate architectures are basically similar to Trigate, but their channel control is close to that of a quadruple-gate device, thanks to the extension of the gate below the fin into the buried oxide. The best electrostatic control can be achieved theoretically in a cylindrical channel completely surrounded by the gate (Fig. 14(b)). The most advanced practical realization of such a device is the 5 nm gate length nanowire-FET.

Thanks to their better electrostatics control, multiple gate transistors are likely to allow a triple drive current with respect to single gate transistors at a given off-state current.
To illustrate this, we have plotted on Fig. 15 the ratio of the drive currents obtained experimentally on 20 nm co-integrated single gate and double gate devices. The drive current of the double gate transistor is 1230 \mu A/\mu m for an off-state current of 1 \mu A/\mu m at V_{dd}=1.2 V, which can be considered as a high performance device.

Two cases can be considered:

1. Both devices have the same film thickness of 10 nm. The single gate transistor suffers from much more electrostatic control loss and the drive current ratio at I_{off} = 1 \mu A/\mu m is between 3.4 and 4.0.

2. Both devices exhibit roughly the same electrostatic control (sub-threshold swing and DIBL respectively lower than 100 mV/dec and 250 mV/V). The film thickness is reduced to 6 nm for the single gate transistor. The current ratio is still around 3, because of the increased access resistances due to a thinner film for the single gate device.

Furthermore, if we consider loading capacitances (for example wires and junctions) in addition to intrinsic gate capacitance in the previous discussion, the multiple gate device advantage over single gate is further increased, because of the higher drive currents delivered by the multiple gate architectures.

Finally, since each added gate allows a better device scalability\(^\text{79,87,89}\), the advantage of multiple gate devices is more and more evident as the gate length is reduced.

\(^{73}\)
Several critical issues are associated with the use of thin film or narrow fin devices. An intrinsic limitation is the mobility reduction observed for film thickness below 5 to 7 nm. This effect is partly due to an increased phonon scattering mechanisms on thin films and can be further accentuated by a more pronounced impact of the surface roughness.

In addition, devices with ultra-thin films are sensitive to thickness fluctuations through short channel effects variations. The scaling length derived in for low-doped double gate transistors is given by the expression:

$$\lambda = \frac{t_{Si}}{2} \sqrt{\frac{1}{2} + \frac{2.C_{Si}}{C_{ox}}}$$

(8)

For an EOT of 1 nm, $\delta\lambda/\lambda$ is about 70% of $\delta t_{Si}/t_{Si}$. As short channel effects depend on $L/\lambda$, a fluctuation of 1 nm on a film thickness of 7 nm is equivalent to a gate length variation of 10%.

4.2.3. Multichannels Multigated devices for improved output current and integration density. Paving the way to the use of Nanowires

The increase of devices drivability could be obtained by multiplying the number of channels. Increasing the drivability capabilities while keeping high integration density is possible by stacking devices in parallel. The exploitation of the third dimension is an elegant and efficient way to achieve such a goal. Several teams have recently published results on multichannel architectures. Figure 16 shows a 3-level CMOS Nanobeams stack of 30 to 70 nm widths: these devices demonstrate up to $3 \times I_{ON}$ increase compared to 1 level trigate. A high current density/surface is obtained thanks to 3D integration. Starting from a SOI substrate, a (Si/SiGe) superlattice is grown. After the silicon nitride deposition, the superlattices are etched anisotropically in order to pattern stacked fins. Then the SiGe is selectively removed between the Si nanowires isotropically.

If the channel width reaches nanometer range dimensions, the quantized width, imposed by the nanowires structure, may reduce significantly the driving current and/or the design flexibility compared to planar architectures. This limitation can be overcome by 3D approaches. The 3D Gate-All-Around (GAA) architecture requires some specific integration strategy. 3D Nano-Wire-GAA architectures (NWG) can be integrated by a damascene-gate FinFET to obtain suspended nanowires with GAA HO2/TiN/Poly gate.
Electronic Device Architectures for the Nano-CMOS Era

Fig. 16. Left: Three stacked levels nanobeam matrix after the Fin etch and the SiGe removal. Right: Cross sectional TEM pictures perpendicular to the beams a) of one stacked Si channels, Inset: $3 \times 50 = 150$ beams b) of one Si channel: excellent Si crystalline quality is obtained; HfO$_2$, TiN and Poly-Si conformity is achieved.\(^9^5\)

Photo-resist trimming and optimized hydrogen annealing are employed to obtain rounded and continuous suspended nanowires.\(^9^6\) Hydrogen annealing was used intentionally for 3-D profile transformation by rounding sharp corners while diminishing surface roughness\(^9^7\) which improves electrical characteristics of FinFETs.\(^9^8\) In Fig. 17 an example of stack made of up to

Fig. 17. TEM cross section of the multilayers nanowires. (a) before annealing — not rounded nanowire (b) annealed at 850°C — rounded nanowires. The lower Si nanowires are on SiO$_2$. Every wire is capped with SiO$_2$, Si$_3$N$_4$ and W for TEM imaging convenience.\(^9^6\)
4 Nanobeams is shown: subsequent resist trimming and hydrogen anneal at 850°C gives a rounded shape to the Nanobeams which will turn out to behave as nanowires.96 Zipping between beams appears as a basic limit when we increase the wire density. This phenomenon is related to the smaller distance between beams when the number of beams is increased. In order to avoid strain relaxations (and thus misfit dislocations) in the initially grown super-lattice, the SiGe thickness between Si layers is decreased for an increasing number of beams. Capillary forces can induce sticking of the beams during the wet surface preparation step prior to the HfO2 deposition. We showed that a shorter beam length avoids zipping when increasing the beams density.95

4.3. Source and drain engineering

Low energy (<1 keV)49 and heavy molecules (BF3,99 B10H14,100 …) have been extensively studied to replace Boron to achieve p+ shallow junctions. Plasma doping is investigated as an alternative to obtain as implanted p+ junction depths lower than 10 nm.101,102 Transient Enhanced Diffusion (TED) is still the limiting process to reach the specified final junction depths (Fig. 18). Fast ramp up and down — so called spike or Flash annealing102 — must be combined with Low Energy Ion Implantation102 to reduce TED

Fig. 18. P+ Sheet resistance as a function of junction depth on bulk or Si thickness for SOI.101–104
as much as possible, by reducing the role played by extended and dopant defects. Excimer Laser Anneal (Fig. 18)\textsuperscript{103,104} has demonstrated the best trade off between low sheet resistance and junction depth shallowness: highest solid solubility combined with fast processing can be achieved. Low sheet resistance combined with low silicon consumption can be obtained with monosilicides (NiSi, PtSi) instead of disilicides (TiSi\textsubscript{2}, CoSi\textsubscript{2}).\textsuperscript{105}

The same behavior will apply to SOI as well as bulk substrates (Fig. 18). However, on SOI films, several issues are linked with the access resistance optimization. As the film thickness decreases, achieving silicon doping becomes more and more challenging, because on one hand the square resistance of the silicon film increases in $1/t_{Si}$ as shown on Fig. 18. On the other hand, increasing dose and/or energy leads to surface silicon amorphization\textsuperscript{73}: as long as the whole layer is not damaged, activation annealing allows the recrystallization of the film giving thus an active doping process window which is very narrow for a 5 nm thick silicon film. The surface species diffusion velocity during high thermal processes being very sensitive to high temperature treatments\textsuperscript{73,106} as silicon thickness decreases.

Devices on thin SOI will require raised sources and drains by epitaxial growth to facilitate further silicidation: pre-anneal before epitaxial growth can lead to a destabilization which dramatically transforms the continuous silicon film into silicon solid droplets on the buried oxide as shown on Fig. 19(a). Therefore selective epitaxy of raised source/drain requires technological developments such as temperature optimization, modulation of the interface energy between silicon and buried oxide to ensure that the silicon film will keep its integrity during the whole fabrication process. Figure 19(b) illustrates results obtained when the temperature of the pre-anneal is lowered (down to 650°C).

Silicidation process also requires technological optimization. Indeed diffusive metals have been introduced to suppress the voiding that occurs in the silicon films when silicon diffuses into the silicide. One way to overcome these technological difficulties could be to design MOS transistors with metallic source and drain either based on Schottky barriers\textsuperscript{107} or modified Schottky barrier.\textsuperscript{108} In both cases, selective epitaxy can be suppressed as source and drain are made out of metal. The key issue in this option is to find metals for N and PMOS with adjusted work function to design either adequate Schottky barrier or low specific resistance ohmic contacts.

\textsuperscript{32} S. Deleonibus et al.
5. Exploiting Non-Stationary Transport or CMOS on Semiconductors other than Silicon?

The introduction of strained channels is limited by saturation velocity values at high electric fields. Under these conditions, non stationary transport can occur for very short channels and devices performances can benefit from velocity overshoot. Unless transport is limited by surface roughness or impurity scattering, ballistic transport can offer a new degree of freedom to the increase of devices performance in sub 100 nm Si channel length devices. If the low field mobility is high, then the mean free path of carriers becomes comparable to or higher than the channel length: ballistic transport is likely to be taken into account. These transport properties can be enhanced whenever undoped or nearly undoped channels can be used. Architectures based on ultra thin bodies like Fully Depleted SOI or Multigate devices can ease the exploitation of these phenomena due to the fact that short channel doping can be minimized while keeping low short channel leakage. Reduction of channel length and supply voltage poses the issue of new scaling paradigms through the exploitation of non stationary effects. Germanium and GaAs for example have low field carrier drift velocities higher than in silicon. However, at high electric fields the reverse...
situation occurs. Still the energy relaxation time is higher in Germanium than it is in silicon thus velocity overshoot may occur for less aggressive channel lengths. Limitations will however come from integration of the new materials which could request new gate dielectrics. Typically, High K materials are needed to fabricate Ge based CMOS devices due to the Ge oxides instabilities. In these devices, hole mobility has been reported to be improved whereas electron mobility enhancement is still an issue (see Section 6.2). Germanium offers the unique possibility for low temperature dopant activation.114,115

6. Optimization of Carrier Transport and Power Dissipation

6.1. Electrostatics, transport and self heating issues

The best choice to maximize the CMOS integration density is obtained under the condition $\mu_n = \mu_p$ ($\mu_n$ and $\mu_p$ are respectively the n-channel and p channel mobilities). Dual channels obtained from strained epitaxial layers could be a possible approach40 (see Section 4.1.3). As far as a monolithic solution can be found, this unique condition occurs in the case of C-diamond (Table 1). However, $n$ dopant activation in this material is still limited116 whereas, recently progress has been made for $p$ doping.117 However, ohmic contacts of metal to diamond need to be optimized. Moreover, C-diamond is far the highest thermal conducting material (10 times the thermal conductivity of silicon or 50 times the thermal conductivity of Al2O3) and could be integrated as a buried layer to limit self heating in future Semiconductor On Insulator substrates. The dielectric constant of

<table>
<thead>
<tr>
<th>Material</th>
<th>$\mu_n$ (cm$^2$V$^{-1}$s$^{-1}$)</th>
<th>$\mu_p$ (cm$^2$V$^{-1}$s$^{-1}$)</th>
<th>$V_{sat}$ (10$^7$ cm/s)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Si</td>
<td>1400</td>
<td>500</td>
<td>0.86</td>
</tr>
<tr>
<td>Ge</td>
<td>3900</td>
<td>1900</td>
<td>0.60</td>
</tr>
<tr>
<td>GaAs</td>
<td>8900</td>
<td>400</td>
<td>0.72</td>
</tr>
<tr>
<td>C Diamond</td>
<td>1800</td>
<td>1800</td>
<td>2.7</td>
</tr>
<tr>
<td>4HSiC</td>
<td>900</td>
<td>120</td>
<td>2.0</td>
</tr>
<tr>
<td>InSb</td>
<td>78000</td>
<td>750</td>
<td>5.0</td>
</tr>
</tbody>
</table>

Table 1. Electrons, holes bulk mobilities and saturation velocities (at 300 K) of mostly used semiconductor materials.

S. Deleonibus et al.
Table 2. Electrons affinity, bandgap, maximum valence band level, thermal conductivity and dielectric constant for various pertinent mostly used semiconductors and High K materials.

<table>
<thead>
<tr>
<th>Material</th>
<th>Electron Affinity (V)</th>
<th>Gap (V)</th>
<th>Ev (V)</th>
<th>Thermal Conductivity $\sigma_{th}$ (W/m/K)</th>
<th>Dielectric constant $K$</th>
</tr>
</thead>
<tbody>
<tr>
<td>Si</td>
<td>4.05</td>
<td>1.12</td>
<td>5.17</td>
<td>141</td>
<td>11.9</td>
</tr>
<tr>
<td>Ge</td>
<td>4.13</td>
<td>0.66</td>
<td>4.79</td>
<td>59.9</td>
<td>16</td>
</tr>
<tr>
<td>GaAs</td>
<td>4.07</td>
<td>1.42</td>
<td>5.49</td>
<td>46</td>
<td>12.5</td>
</tr>
<tr>
<td>C diamond</td>
<td>0</td>
<td>5.47</td>
<td>5.47</td>
<td>$&gt;2000$</td>
<td>5.7</td>
</tr>
<tr>
<td>4H SiC</td>
<td>3.55</td>
<td>3.00</td>
<td>6.55</td>
<td>500</td>
<td>6.52</td>
</tr>
<tr>
<td>InSb</td>
<td>4.59</td>
<td>0.16</td>
<td>4.75</td>
<td>16.0</td>
<td></td>
</tr>
<tr>
<td>SiO$_2$</td>
<td>1.10</td>
<td>9.00</td>
<td>10.1</td>
<td>1.38</td>
<td>3.9</td>
</tr>
<tr>
<td>Si$_3$N$_4$</td>
<td>2.00</td>
<td>5.00</td>
<td>7.00</td>
<td>30.1</td>
<td>7.5</td>
</tr>
<tr>
<td>Al$_2$O$_3$</td>
<td>1.92</td>
<td>6.2</td>
<td>8.12</td>
<td>25.1</td>
<td>10</td>
</tr>
<tr>
<td>HfO$_2$</td>
<td>2.07</td>
<td>5.6</td>
<td>7.67</td>
<td>11.4</td>
<td>24</td>
</tr>
<tr>
<td>ZrO$_2$</td>
<td>2.07</td>
<td>5.5</td>
<td>7.57</td>
<td>1.30</td>
<td>24</td>
</tr>
<tr>
<td>AlN</td>
<td>2.00</td>
<td>6.2</td>
<td>8.20</td>
<td>175</td>
<td>8.9</td>
</tr>
<tr>
<td>BeO</td>
<td>2.00</td>
<td>10.6</td>
<td>12.6</td>
<td>260</td>
<td>6.7</td>
</tr>
</tbody>
</table>

C-diamond ($K_C = 5.7$) offers the best compromise between HiK and SiO$_2$ to control short channel effect according to relation (3).

However, the isolation on the valence band side is difficult (Table 2): the C/Si barrier height is far less than the SiO$_2$/Si barrier height (0.30 eV for C/Si instead of 4.93 eV for SiO$_2$/Si!). That is why a HiK insulator is needed. Among the best candidates, BeO or AlN offer a good compromise in terms of short channel effect ($K_{BeO} = 6.7$ or $K_{AlN} = 8.9$) and thermal conductivity (Table 2). Furthermore, their valence band is at least at $-6.2$ or $-10.6$ eV from vacuum. Thus a good isolation is obtained for holes whereas for C-diamond by itself would not be a good insulator on the valence band side.

Thus the integration of C-diamond has to be combined with HiK buried insulators if we wish to integrate it on silicon as a possible solution to limit power dissipation and suppress self-heating of CMOS devices (Fig. 20).$^{118}$

6.2. Germanium on insulator: a second life for germanium?

Germanium was initially used to fabricate microelectronics through the realization of the first transistor. Many interesting properties can be
accounted to Ge: larger low electric field mobility values than in Si as well as smaller $\mu_n/\mu_p$ ratio (see Table 1), despite lower saturation velocity at high fields. However, Ge has a higher energy relaxation time which potentially relaxes linear gate length scaling constraint to gain performance as compared to Si.

Due to its compatibility with silicon processing and its availability in many fabs, Ge has recently been given much interest again as a promising candidate for high performance MOSFETs. Thanks to High-K materials, the non stable native Ge oxide is not a limitation anymore for the use of Ge in the CMOS technology. Low band gap materials show high diode leakage current. The impact of this leakage on MOS characteristics (IOFF, bulk leakage) is a severe limitation for the use of bulk Ge for CMOS devices. Thus, a more realistic use of Ge for CMOS is Germanium On Insulator (GeOI) Fully Depleted MOSFETs since the bulk leakage is suppressed by the BOX and S/D leakage can be reduced by using ultra thin Germanium in a device operating in the Fully Depleted regime. We have realized Fully Depleted deep sub-micron (gate length down to 0.25 $\mu$m) Ge p-MOSFETs on Ultra Thin Germanium-On-Insulator (GeOI) wafers. The Ge layer obtained by hetero-epitaxy on Si wafers is transferred using the Smart-CutTM process to fabricate 200 mm GeOI wafers with Ge thickness down to 60 nm (Fig. 21).
A full CMOS compatible p-MOSFET process was implemented with HfO$_2$/TiN gate stack. An ION/IOFF ratio higher than $10^3$ and a 300 mV/decade sub-threshold slope are measured. These results suggest that both the quality of the Ge layer and the gate stack have to be improved. Nevertheless ION vs. LG state-of-the-art values reported in Fig. 22 for Ge and GeOI devices illustrate the excellent performances of our devices.$^{115,120–122}$ We have also performed TCAD simulations of GeOI

Fig. 22. Comparison of the ION performance of our GeOI-P-MOSFETs ($L_{G_{\text{min}}} = 0.25 \, \mu m$) with literature. The ON current is measured for $V_{DS} = -1.5 \, V$, $V_{GS-VT} = -2 \, V$. TCAD simulations of GeOI devices show good agreement with the electrical results.$^{113}$
MOSFET structures using a Ge CVT mobility model. The CVT parameters were theoretically calculated or adapted by calibration. From these simulations the ION current values for LG down to 0.25 μm have been extracted, and show a good agreement with our electrical results and also with literature data.115,120–122

7. Alternative CMOS or Alternative to CMOS on Silicon?

Many research teams are making efforts on Single Electron Transistors (SET) operation based on the Coulomb blockade principle. Demonstration of CMOS inverter operation at 27 K has been achieved by using a Vertical Pattern Dependent Oxidation (V-PADOX) process.123 No solution has been found that could compete with CMOS devices. Some possibilities to achieve memory functional devices by using single electron trapping by a Coulomb blockade effect for DRAM,124 or Non-Volatile applications125–127 have been pointed out. This effect supposes that the Coulomb energy: e²/2C (9) is larger than the thermal energy of electrons kT (e is the electron charge; C is the capacitance of the quantum box). This energy is necessary to localize the electrons in a Coulomb box provided that tunneling is the limiting process: implicitly, one has to use very low capacitance and sufficiently high tunneling resistance. However, the Coulomb blockade process will be self limiting due to charge repulsion which reduces the speed of the charge transfer. Non-Volatile Memory (NVM) applications can be envisaged by using trapping in nanometer size Si Nanocrystals (SiNc)126: Al2O3 has been chosen as the tunnel insulator due to the increased dot density as compared to other materials (in the range of 10¹² cm⁻²), with reasonable interface states density (less than 10¹¹ cm⁻²). Whether the involved writing or erase mechanisms are due or not to single electron transfer has been a controversial debate. In large area devices, with a large amount of randomly distributed SiNc, it is very difficult to identify whether the single electron transfer is occurring or not, due to the large distribution of dot sizes and consequently of Coulomb energies. It is thus very important to use a device of the smallest size possible, containing only one dot or a low number of dots, to get a high sensitivity to single electron transfer. Such a result has been obtained at room temperature on 20 nm × 20 nm Non-Volatile Memory Silicon wire based on Silicon quantum dots (Fig. 23(a))128: current spikes on the writing or erasing characteristics have been identified as single electron trapping or detrapping respectively. Coulomb blockade oscillations can be observed
at 75 and 20 K, demonstrating that Coulomb blockade is possible in such devices.

Flash memories in the sub-90 nm nodes (Fig. 24(b)) because of superior NOR type architectures show a larger tolerance to threshold voltage fluctuations compared to the resistance quantum: \((e^2/h)\). This effect has already been reported on 50 nm gate length N channel MOS transistors at 4.2 K making CMOS transistors attractive as single electron devices candidates. As gate length is scaled down to 20 nm, access resistance becomes larger and channel conductance oscillations appear at higher temperatures (here 75 K) (Fig. 23(b)).

The Si-Nc technology (Fig. 24(a)) offers new scaling possibilities to Flash memories in the sub-90 nm nodes (Fig. 24(b)) because of superior Stress Induced Leakage(SILC) immunity of the tunnel oxide. Thus NOR type architectures show a larger tolerance to threshold voltage fluctuations than NAND type devices; if one considers a Si-Nc density of 10^12 cm^-2, NOR type can be scaled down to the 35 nm node whereas NAND type would reach the 65 nm node (Fig. 24(b)). The stored charge discreteness makes these devices much sensitive to stochastic fluctuations of writing and erasure characteristics of 20 nm × 20 nm MOSFET inserted. (b) Drain current oscillations in a Lg = 20 nm MOSFET at 75 and 20 K, demonstrating that Coulomb blockade is possible in such devices.

Fig. 23. Devices characteristics evidencing Single Electron phenomena. (a) Writing and erase characteristics of 20 nm × 20 nm/W×L devices at room temperature. Top view of 20 nm × 20 nm nanowire inserted. (b) Drain current oscillations in a Lg = 20 nm MOSFET at 75 and 20 K, demonstrating that Coulomb blockade is possible in such devices.
retention times\textsuperscript{131}: the use of limited number of electrons makes the Si-nc devices more attractive for low voltage, low power operation (Fig. 25).\textsuperscript{131} Double bit operation has also been demonstrated.\textsuperscript{127,132} This solution is compatible with high standard retention times and endurance cycles,\textsuperscript{127} down to gate lengths of 35 nm.\textsuperscript{132} The use of High K as a coupling dielectric between the control gate and the SiNc will enhance the coupling ratio and thus allows their integration in NAND architectures.\textsuperscript{133}

More generally, discrete traps memories are of interest to address the scaling of NVM via the SONOS architectures\textsuperscript{134} for embedded architectures (see also Chapters 7 to 9 of this book). These architectures are challenged by an increasing interest of Resistor Phase Change memories devices (Chapter 7).

8. Conclusions

By the end and beyond the end of the roadmap, power consumption will be the greatest issue whatever the application. We reviewed the physical limitations of MOSFET that will be encountered in the optimization of the performance versus leakage trade off and screened the different possibilities on the architecture or material sides. Multigate devices using strained channels will be widely used for high performance CMOS. Si based alloys or compatible semiconductors will be introduced to enhance the possibilities of future Systems on Chip. New materials including HiK dielectrics, Ge
Fig. 25. Si-nc allow: (a) lower number of electrons per bit for programming: that reduces the programming voltages and power consumption.\(^\text{131}\) (b) Double bit operation: transfer characteristics of a scaled SOI device charged consecutively on drain, source and on both sides with the same stressing conditions. Four clear states are apparent also if the two pockets of charge are very close to one another.\(^\text{132}\)

and C-based materials could be integrated to optimize integration density of logic circuits as well as for limitation of short channel effects and power dissipation. New devices architectures requiring a low number of electrons for operation have good potentials in low power, low voltage Flash memories applications by the use of silicon nanocrystals. Single electronics will be a major study subject to optimize the use of ultra small devices.

9. Acknowledgements

We wish to warmly thank the members of the LETI — Electronics Nanodevices Laboratory and Nanotechnologies Division for their various contributions to this chapter as well as the LETI Silicon Technologies Platform for wafer processing. Many of these studies were carried out thanks to the funding by industrial collaborations with STMicroelectronics, Freescale, NXP, Texas Instruments, ATMEcL, as well as in the frame of European Commission programs in FP4, FP5 and FP6, MEDEA+, Basic Research French National Programs RTB, ANR and LETI-Carnot Institute labels.

References

18. F. Ducroquet, T. Ernst, J.-M. Hartmann, O. Weber, F. Andrieu, P. Holliger, F. Laugier, P. Rivallin, G. Guégan, D. Lafond, C. Laviron,


<table>
<thead>
<tr>
<th>Reference</th>
<th>Authors</th>
<th>Journal / Conference</th>
<th>Pages</th>
<th>Year</th>
</tr>
</thead>
</table>


Electronic Device Architectures for the Nano-CMOS Era
