1. Introduction
Alumina thin film is studied as a model atomic layer deposition (ALD) process, owing to its high dielectric constant, high thermal stability and good adhesion on various wafer surfaces [
1]. ALD has already been extensively adopted in silicon microelectronics and thin-film fabrications due to its highly self-limited depositions, perfectly uniform thin film surface geometry and accurately controlled film thickness [
1]. Alumina thin film can be deposited by ALD using various precursors, including Trimethylaluminum (TMA), Dimethylaluminum isopropoxide (DMAI), Aluminum chloride (AlCl
3) as Al sources, and water, ozone (O
3) and H
2O
2 as oxygen sources [
2]. The most studied precursors of alumina ALD are TMA and water, because of TMA’s high volatility and reactivity, allowing for efficient deposition across a wide temperature range [
3,
4].
Low throughput has always been a challenge for ALD applications due to its intrinsic nature of depositing materials at atomic level [
5]. For instance, the typical growth rate for alumina ALD is around 1.1 Å/cycle depending on the deposition parameters [
5,
6,
7,
8]. Despite the concept of spatial ALD being adopted to improve the throughput and productivity of ALD process in recent years [
7,
9,
10], considering traditional thermal ALD reactors are still extensively used in research and industrial settings, improving growth rate is still a pressing problem in the ALD community. The growth rate of ALD process is not only determined by the reactor designs, gas delivery systems, substrate placement, orientation and precursors [
11,
12], but also largely affected by process parameters, such as temperature, pulsing time, purging time, inert gas flow rate and chamber pressure [
13,
14].
Researchers have been exploring alumina ALD process by tuning those parameters to achieve higher growth rate. Groner et al. studied Al
2O
3 films deposited by ALD and the growth rate varied between 1.0 and 1.3 Å/cycle at temperatures from 33 to 177 °C using TMA and H
2O [
15]. Mousa et al. presented a study on the effects of temperature and gas flow rate on film growth of Al
2O
3 ALD using TMA and water, and the results showed higher growth rate (~1.5 Å/cycle) was observed when the gas flow rate was 5 slm, but the growth rate declined as the temperature was higher than 150 °C [
16]. A study of the effect of deposition temperature and subsequent annealing time of ALD deposited Al
2O
3 films on silicon surface passivation using TMA and water showed that the film growth rate increased steadily from 0.8, 0.95 to 1.0 Å/cycle with the deposition temperature rises from 100, 200 to 300 °C [
17].
Ylivaara et al. showed alumina ALD GPC increased from 0.73 to 0.94 Å/cycle when increasing ALD temperature from 110 to 250 °C, and after that it decreased to 0.90 Å/cycle when temperature increased to 300 °C [
18]. Li and Ren presented their work on the effects of temperature and purging time of Al
2O
3 ALD process, and the growth rate slightly increased from 0.68 to 0.79 Å/cycle when deposition rate was increased from 100 to 150 °C. The purging time showed nonsignificant on growth rate as it increased from 3 to 9 s, the growth rate was kept at 0.78–0.79 Å/cycle [
19]. Suyeon Kim et al. examined the growth rate and dielectric strength of Al
2O
3 ALD films at low temperatures (lower than 150 °C) and it showed that the growth rate of the Al
2O
3 films increased from 0.9 to 1.1 Å/cycle as temperature increased to 150 °C and then saturated beyond 150 °C [
5]. Karnopp et al. studied the influence of TMA and water flow rate on the deposition rate of Al
2O
3 ALD at two different temperatures, and it showed that the deposition process reached saturation state as the precursor flow rate increased, and increasing the temperature to 200 °C slightly decreased the growth rate [
20].
Review of the literature concludes that deposition temperature is the most studied factor in Al
2O
3 ALD, while other process factors such as pulsing time, purging time, inert gas flow rate and chamber pressure are much less investigated. Karnopp et al. studied the effect of the precursor flow rates on Al
2O
3 ALD deposition process [
20] and Li and Ren included purge times in their study [
19]. Despite deposition temperature being the most studied process parameter, no research was found to statistically confirm the significance of deposition temperature effect. Furthermore, none of the above work considered those process parameters all together systematically, and the interaction effects between the process factors were left uninvestigated.
Meanwhile, most of the reviewed publications adopted the one-factor-at-a-time (OFAT) method. In the OFAT method, a single process parameter, such as deposition temperature, is varied at different levels, sequentially, while other factors remain constant [
21,
22]. OFAT method requires a large number of deposition experiments in order to investigate all the process factors one by one, and this makes ALD experiments extremely time-consuming and costly.
Another major issue of OFAT method is the ignoring of interaction effects [
22]. Interactions extensively exist in experiments when the effect of one parameter depends on the levels of another parameter [
23]. For example, in Li and Ren’s work, the temperature effect on the growth rate was only investigated at a fixed level of purging time, and the interaction between the two factors was not studied [
19]. Moreover, OFAT experiments are usually sequentially carried out, and hence, without randomization of the experiments order, the effect of extraneous parameters could be introduced into the response [
22]. Lastly, in the published alumina ALD experiments, no work mentioned that replication was obtained, considering replication allows us to obtain an estimate of the experimental error and a more precise estimate of the effect of the parameters [
22].
Statistical approaches can solve those problems by implementing carefully designed experiments and statistical analysis using design of experiments (DOE) methodologies. DOE applies statistics in all the experimental activities from planning, designing and implementing, to analyzing the experimental results [
21]. Various DOE methods are available for different experimental goals, including full factorial designs, fractional factorial designs and response surface methodology (RSM). In a full factorial design, all possible combinations of factors and their levels are tested, so it offers comprehensive understanding of the main effects and all the interaction effects. However, full factorial designs become very costly and even impractical as the number of experiments grow exponentially with more factors. It is ideal for detailed and thorough analysis and optimization of a small number of factors (e.g., 2–4) [
22].
Fractional factorial designs reduce the experimental costs and save experimental runs by testing a subset of combinations, but it loses information on some interactions. It is often used in screening experiments with many factors (e.g., 5–10) to identify the most influential ones, especially when experimental resources are limited. RSM is mainly used for optimization by modeling continuous factor effects with polynomial equations to find ideal response conditions when key factors have been already identified using methods like factorial designs. It often involves a series of designed experiments that fit a second-order polynomial to describe the relationship between the input factors and the response. It is a deal for fine tuning processes, optimizing product designs or determining the ideal conditions for achieving a target response [
22,
24]. To fully investigate a process such as ALD process, the best approach is to conduct a factorial DOE first to identify the significant effects, and then RSM can be used for further fine tuning of the processes [
22].
DOE methods are not extensively adopted in ALD research, due to the lack of deeper statistical knowledge and confidence in allowing computer software to decide experiments [
21]. Despite Al
2O
3 thermal ALD is one of the most studied ALD processes, very few papers used DOE to investigate its process parameters. Shendokar et al. used a two-level factorial design of experiments for the predictive evaluation of Al
2O
3 ALD process, in which two level of TMA pulsing time, water pulsing time, ozone pulsing time and 100–200 °C deposition temperature were investigated [
25]. A research work presented by Dogan et al. used a two-level, eight-run fractional factorial method to identify the main parameters that affect the defect density of Al
2O
3 ALD thin film, and then the Bayesian optimization (BO) method was utilized to find the optimum values of the selected process parameters for rapid and adaptive optimization of the defect density [
26].
As discussed earlier, a thorough understanding of the effects of ALD process parameters and their interactions using factorial designs is necessary before establishing optimal process parameters using methods such as RSM. As revealed by the literature review, little work has been done to study the main effects and interaction effects of Al2O3 ALD process. Therefore, in this work, a two-level (24) full factorial DOE method is utilized to systematically investigate the main effects of four ALD process parameters (deposition temperature, inert gas flow rate, pulsing time and purging time) and their interactions on the growth rate of Al2O3 ALD thin films.
Despite two-level factorial DOE not capturing curvature effects or nonlinear effects, our primary goal in this study is to identify the significant factors and interactions. Considering multi-level factorial designs or response surface designs requiring significantly more experimental runs, a two-level factorial DOE can provide meaningful insights with desirable experimental efficiency [
22,
23]. With two replicates for each experimental run, 32 Al
2O
3 thin film samples are deposited using TMA and water as precursors by a commercial thermal ALD reactor. Statistical analysis is performed to identify the significant process parameters and their effects on growth rate. DOE analysis also reveals the potential interaction effects between the parameters, and response optimization is performed to find the optimal process condition for higher Al
2O
3 thin film growth rate. In addition, the work is expected to clear the obstacles of using statistical approaches in ALD experiments and lays out the factorial DOE fundamentals and analysis framework to promote the applications of statistical approaches in ALD community.
3. Experimental Details
In this study, Al2O3 thin films are deposited on silicon (100) substrates with sizes approximately 10 mm × 10 mm. The aluminum precursor is 98% TMA from Strem Chemicals, Inc. (Newburyport, MA, USA), while 99.999% water (also from Strem Chemicals, Inc.) serves as an oxidizer. The substrate preparation process involves the following steps. The substrates are first immersed in acetone for 10 min to remove organic residues from the surface, then rinsed with methanol to eliminate any remaining acetone, followed by a final rinse with deionized water to ensure the removal of all residual solvents, and finally, they are dried using nitrogen gas before being placed in the ALD reactor for deposition.
The experiments are conducted by the Arradiance GEMStar XT Thermal ALD reactor (Arradiance, Littleton, MA, USA) as shown in
Figure 1a, which is equipped with four precursor bottle ports, with water assigned to Port 2 and TMA to Port 4 as shown in
Figure 1c. The reactor can accommodate wafers up to 200 mm in diameter as shown in
Figure 1b and operate at temperatures up to 300 °C. Precursors are introduced into the chamber through inlet holes and exit through outlet pipe, which is connected to a vacuum pump. To ensure adequate precursor delivery, both the TMA (Port 4) and water (Port 2) bottles are heated to 30 °C. During experiments, the chamber pressure is maintained around 100 mTorr.
Argon (Ar) is used as the purging gas, with its flow rate regulated by a mass flow controller. In this study, Ar flow rates of 10 and 20 sccm are tested as controlling factors. The chamber temperature is maintained at either 175 or 275 °C, while the pulsing times for TMA and water are set the same, at 20 and 40 ms, based on the manufacturer’s recommendation for matched precursor injections. Additionally, the purging time is controlled between 10 and 20 s as another controlling factor in the experiments.
The levels of the controlling factors are summarized in
Table 4. The levels of the factors were selected around the manufacturer’s recommended values (175 °C for deposition temperature, 10 sccm for Ar flow rate, 20 ms for both TMA and water pulsing times and 10 s for purging time). To be specific, in this study the recommended settings are set as low level of the factors, which serve as a well-established baseline to ensure the process operates within known and stable conditions. A higher level of the four factors (275 °C for deposition temperature, 20 sccm for Ar flow rate, 40 ms for both TMA and water pulsing time and 20 s for purging time) is selected to test the main and interaction effects of the four factors on deposition rate.
Silicon substrates are placed at the center of the wafer holder as shown in
Figure 1b and two separate substrates are deposited for each experimental setting for 200 cycles, and the deposited film thickness is measured using an Alpha-SE spectroscopic ellipsometer (J. A. Woollam, Lincoln, NE, USA).
4. Results and Discussions
4.1. Experiment Results
In this research, 32 experimental runs are conducted following the 24 full factorial DOE. The experiment runs and results with measured GPC, fitted GPC and residuals are presented in
Table 5. All the 32 experimental runs are randomized by MiniTab (version 20.3) as presented in
Table 5. Precursors and silicon substrates are from the same batch, so there is only one block in the experiments.
Figure 2 presents cube plots with the mean GPC values of two replicates for the 16 treatment combinations. The three axes of the cubes are temperature, flow rate and pulsing time, and the left cube is when purging time is at low level, and right one is when purging time is at high level. The cube plots reveal that the highest GPC is ~1.411 Å/cycle observed when the deposition temperature and flow rate are set at low levels, and pulsing time and purging time are set at high levels. The lowest average thickness of alumina thin film is ~0.949 Å/cycle when the temperature, flow rate and purging time are set at high level, and pulsing time is set at low level.
4.2. Regression Model
By performing a factorial regression, the residuals can be calculated and the normality assumption of the random error in the effect model can be verified. The regression model of the data is shown below,
GPC | (Å/cycle) = | 1.1668 −0.0929 Deposition Temperature −0.0278 Flow Rate +0.0243 Pulsing Time −0.0122 Purging Time −0.0222 Deposition Temperature × Flow Rate +0.0250 Deposition Temperature × Pulsing Time −0.0398 Deposition Temperature × Purging Time +0.0100 Flow Rate × Pulsing Time −0.0199 Flow Rate × Purging Time +0.0340 Pulsing Time × Purging Time |
Note that all the factors and up to two-way interactions are included in the model, as three-way or higher interactions are not practical or important in real-life settings [
23]. The response is Al
2O
3 film growth rate, and the coefficient of each effect is estimated as half of the effect as shown in
Table 6. The
R2 value is 78.64%, which is acceptable considering the goal of the experiments is to identify the significant factors [
22]. The relatively low
R2 value is caused by missing three-way or higher interactions in the regression model.
Before performing ANOVA, the assumptions of normality and homoscedasticity must be verified. In DOE, the assumption of normality can be verified by either graphical methods, such as histogram of residuals and normal probability plot, or statistical tests, such as Ryan–Joiner test, Shapiro–Wilk test, Kolmogorov–Smirnov test and Anderson–Darling test. Histogram of residuals is shown in
Figure 3a, and it is hard to tell if a normal distribution is resembled due to the small size of the data. The normal probability plot as shown in
Figure 3b shows the residuals roughly follow along a line, confirming that the normality assumption is satisfied.
Like the Shapiro–Wilk test, the Ryan–Joiner test is suitable for small sample sizes (
n ≤ 50), and it is performed to further validate the normality. The Ryan–Joiner test probability plot is shown in
Figure 3c, in which the data points closely follow the straight reference line, which implies that the experimental data is normally distributed. The Ryan–Joiner coefficient (RJ) is 0.986, which is very close to 1, and the
p-value is >0.05 (fail to reject normality), both further validating the normality assumption.
The assumption of homogeneity of variances (homoscedasticity) assumes that the spread of residuals should be constant across all factor levels. Graphical methods such as residuals vs. fitted values plot and Levene’s test plot can be used to verify the homogeneity of variances. The plot of residuals vs. fitted values in
Figure 3d shows that the spread of residuals appears random without any patterns, and hence the homoscedasticity is validated.
4.3. Significance of Factors and Interactions
With the normality and homoscedasticity assumptions verified, the ANOVA of the regression model is performed to determine the significance of the main effects and up to two-way interaction effects. The ANOVA results are summarized in
Table 7. Each factor only has two levels, and hence the degree of freedom (DF in
Table 7) of all the effects is one, and the four main effects form the linear portion of the model and six two-way interactions resemble the nonlinear portion of the model. Therefore, the model has 10 DFs. With a total of 32 runs in the experiments, there are 31 DFs. Therefore, the error has 21 DFs. Note that mean squares are the sum of squares divided by DF, and
F-Value is evaluated by Equation (24).
ANOVA shows that the effects of deposition temperature, and two interactions between deposition temperature and purging time, pulsing time and purging time have p-values less than 0.05, and therefore, they are concluded as significant factors. Flow rate, pulsing time, purging time and other interactions are tested nonsignificant in this study.
The normal probability plot of these effects as shown in
Figure 4 can also be used to confirm the significance of the effects.
Figure 4 shows the normal plot of the standardized effects, which are calculated by dividing the effects using the standard deviation of the data. The standardized effects allow for comparison between different factors even if their units are dissimilar. Like random effects, nonsignificant main and interaction effects tend to fall along a straight line, while significant effects fall off the straight line. The normal probability plot of the effects yields the same results from ANOVA, confirming that factor A, interactions AD and CD are significant.
Figure 5 shows the Pareto Chart of the standardized effects, which is more intuitive to examine the significance of the effects. The Pareto chart shows the absolute values of the standardized effects from the largest effect to the smallest effect with a reference line to indicate which effects are statistically significant. The reference line is determined based on the selected significance level, typically 0.05 for a 95% confidence level. Any effect that extends past this reference line is potentially significant. The Pareto Chart reaches the same conclusions about the significance of the effects. It is worth noting that temperature is the most significant factor, which surpassed the reference significantly, while the two interactions are relatively weaker, as they are close to the reference line.
4.4. Main Effects and Interaction Effects
To examine how the factors influence the Al
2O
3 thin film deposition rate, the main and interaction effect plots are constructed as shown in
Figure 6 and
Figure 7, respectively. The main effect plots indicate that the inert gas flow rate, pulsing time and purging time have much less effect on GPC compared to deposition temperature.
Deposition temperature is tested as the only significant main factor in Al
2O
3 ALD process, and the deposition temperature effect plot shows that Al
2O
3 growth rate drops significantly from 1.26 to 1.07 Å/cycle, as temperature increases from 175 to 275 °C. The result matches with the research work in the literature [
15,
16,
17,
20]. The decreasing growth rate is primarily due to the reduction of OH groups on the substrate surface. As temperature increases, the temperature-dependent desorption of water from the substrate surface is enhanced, particularly at lower temperatures. This reduction in surface OH groups limits the available reactive sites for precursor adsorption, and therefore a decrease of the overall deposition rate is observed [
27,
28].
Flow rate effect on growth rate is tested not statistically significant, but the effect plots show that increasing Ar gas flow rate from 10 to 50 sccm slightly decreases the average growth rate from 1.19 to 1.14 Å/cycle. Inert gas in ALD process serves two purposes: carrying precursors into the reactor chamber (positively affecting growth rate) and purging the precursor residuals out of the system (negatively influencing the surface reactions). The overall effect of Ar gas flow rate on the deposition process is balanced by the two effects, which makes the overall effect of inert gas flow rate nonsignificant.
Pulsing time is also concluded as a nonsignificant factor in our study. The effect plot indicates longer pulsing time slightly increases the average growth rate from 1.14 to 1.19 Å/cycle. This is attributed to the extra precursors injected into the reactor with longer pulsing time. Increasing pulsing time allows for more precursors to enter the reactor chamber, providing more reactants for surface reactions, and subsequently enhancing the deposition rate until saturation is reached [
28,
29,
30]. Therefore, the effect of pulsing time relies on the fact whether the growth is saturated with sufficient precursors being injected into the chamber [
28,
30]. In our study, the relatively weak effect of pulsing time can be attributed to the fact that the selected levels are centered around the manufacturer’s recommended saturation pulsing time (20 ms for TMA and water). Therefore, increasing pulsing time beyond the saturation time can only result in a slight increase in growth rate.
The effect of purging time is also very weak, and the average GPC only decreases from 1.18 to 1.15 Å/cycle as the purging time increases from 10 to 20 s. The result matches with the work done by Li and Ren in which it showed the purging time is a nonsignificant factor on growth rate as the purge time increased from 3 to 9 s, and the growth rate was between 0.78–0.79 Å/cycle [
19]. The decrease in deposition rate is primarily attributed to precursor residuals being more effectively removed from the chamber, which reduces gas-phase chemical vapor deposition (CVD)-type depositions [
31]. Since CVD reactions compromise the surface quality of thin film in the ALD process, the reduction in deposition rate at a longer purging time is compensated by minimizing unwanted CVD depositions.
One of the advantages of DOE is its ability to assess interaction effects in the experiments.
Figure 7 presents the two-way interaction effect plots in this study. Visual examination of parallelism of the two lines in the interaction plots indicates the existence of potential interaction effects between all the process factors. However, ANOVA shows that only two interactions, temperature × purging time and pulsing time × purging time, are statistically significant.
The interaction between temperature and purging time shows that the effect of deposition temperature on the deposition rate depends on the level of purging time. The decreasing effect of deposition temperature on growth rate is more pronounced when the purging time is at a high level (20 s). This can be attributed to the fact that with longer purging time, the desorpted water molecules at higher temperatures are more effectively cleaned out, resulting in lower growth rates because of excessive loss of the OH groups from the surface. At higher temperatures, volatile byproducts may desorb more quickly and potentially reduces the required purging time. At lower temperatures, byproducts may desorb more slowly, needing longer purging time to prevent contamination or unwanted reactions in subsequent cycles.
The interaction between pulsing time and purging time is also tested statistically significant by ANOVA. It is discussed that the effect of pulsing time on growth rate is positive. However, it is interesting to note that when purging time is shorter, increasing pulsing time decreases the growth rate slightly as shown in the interaction plot. Similarly, when pulsing time is shorter (20 ms), increasing purging time decreases the growth rate, but at higher level of pulsing time, it increases the growth rate. If purging is insufficient, residual precursor may cause unintended chemical reactions, contamination or non-ideal growth. On the other hand, excessive purging can reduce precursor adsorption efficiency, and results in an incomplete monolayer and lower growth rate. The interaction effects give us more insights into the effect of a factor when it is influenced by other factors in ALD processes.
4.5. Process Parameters Optimization
To find the optimal process parameter settings for growth rate, a reduced regression model is constructed by removing the nonsignificant effects. Since interactions between temperature, pulsing time and purging time are significant, these three factors, together with interaction between temperature and purging time and interaction between pulsing time and purging time are included in the reduced regression model shown below,
GPC (Å/cycle) | = | 1.1668 −0.0929 Deposition temperature +0.0243 Pulsing time −0.0122 Purging time −0.0398 Deposition temperature × Purging time + 0.0340 Pulsing time × Purging time |
The response optimization method in Minitab is used to identify the optimal settings of input factors that yield the best possible response in this factorial design. The method applies a desirability function, where each response is assigned a desirability value (D, ranging from zero to one), based on how well it meets the goal. The best combination is identified by maximizing the overall desirability score. The desirability value is shown as 0.7922 in
Figure 8, which means that the response is achieving 79.22% of the ideal target. The predicated maximum growth rate is 1.3207 Å/cycle as shown in
Figure 8, when deposition temperature is set at low level, while pulsing time and purging time are set as high level. Since flow rate is not significant, it is excluded in the reduced model, and to save resources, it can be set as low level, 10 sccm.
The cube plot of fitted means from the reduced regression model is shown in
Figure 9. The cube plot yields the same result as the experimental data presented in
Figure 2, indicating the optimal parameter setting is when temperature is set at low level, 175 °C, pulsing time is set at high level 40 ms for both TMA and water, and purging time is set at high level, 20 s.
Figure 10 shows the contour plots and surface plots of the two significant interaction effects. When pulsing time is held at a high level as shown in
Figure 10a,c, the higher GPC region is observed when deposition temperature is at low level and purging time is at high level. When temperature is fixed at a lower level, the higher GPC region is found when purging time and pulsing time are both at high level shown in
Figure 10b,d.
It is noted that the effects of those factors and interactions on Al
2O
3 ALD growth could be nonlinear and complex. For example, previous work on Al
2O
3 ALD thin films using TMA and water showed that lowering substrates temperature too much, i.e., less than 150 °C, decreases the deposition rate significantly as there is not enough energy to initiate the surface reactions [
1,
6]. The effect of temperature on growth rate is often nonlinear due to competing effects; increasing temperature can enhance surface reactions and precursor decomposition, but excessive heat may lead to precursor desorption or decomposition before surface reactions, which thus results in reduction of GPC.
Increasing TMA and water pulsing time does not always increase the deposition rate. As discussed earlier, when the substrate surface receives sufficient precursors, the surface reaction will be saturated and the deposition rate will remain constant. A two-level design, with only a low and a high setting for each factor, is not capable of identifying intermediate optimal or nonlinear trends unless center points or higher-order terms are included. To address this limitation, future research can be done by extending the design to a central composite design (CCD) or response surface methodology (RSM) design, which introduces center points and allows modeling of quadratic terms, so that a more accurate exploration of potential nonlinearities can be done and a precise optimization of process parameters like temperature in ALD systems can be conducted. Nonetheless, the two-level factorial DOE offers invaluable information about the significance of the factors and their interactions for further investigations.
To further optimize Al2O3 ALD process for higher process efficiency, alternative precursor choices and novel reactor designs can also be explored. Selecting precursors with higher volatility, improved ligand structures and reduced decomposition byproducts can minimize contamination and improve growth kinetics. Novel reactor designs, such as spatial ALD for high-throughput processing, plasma-enhanced ALD for lower-temperature deposition and batch or roll-to-roll systems for scalability, offer improved deposition control.