1. Introduction
Deposition for industrial products refers to the accumulation of high-performance metal or compound thin films on low-cost or specialized substrates, aiming to achieve product features such as appearance, wear resistance, and electrical conductivity requirements. Typically, techniques like Chemical Vapor Deposition (CVD) or Physical Vapor Deposition (PVD) are employed to achieve the film deposition. Due to the issue of toxic gas generation at high temperatures in CVD, PVD is currently predominantly used as the primary coating process.
PVD primarily uses the phase transition phenomenon of materials to heat or stimulate them, enabling the process of depositing thin film. PVD uses the physical energy from arcs or plasmas to bombard the surface of the cathode target material. This imparts kinetic energy to the target’s atoms, causing them to escape and adhere to the substrate intended for coating. As a result, cathode targets are divided into arc targets and sputtering targets (plasma targets). Arc targets involve the excitation of atoms through arcs, known as Arc Ion Plating (AIP), allowing direct interaction with gases. They are predominantly used for surface hardening coatings, such as TiN, TiC, TiCN, and more. On the other hand, sputtering targets use plasma-excited atoms, also referred to as Magnetron Sputtering. This method finds extensive application in various optoelectronic and semiconductor industries. Sputtered film layers exhibit fine smoothness, precise control over film thickness, and excellent repeatability. They can serve as targets for a wide range of materials, from conductors to insulators. In the context of semiconductors, the purpose of metal sputtering is to create metal wires that transmit information on chips. This involves the gradual deposition of atomic layers from the target material’s surface onto the semiconductor chip. Subsequently, the film is etched to create metal wires at the nanometer scale, connecting billions of micro-sized transistors within the chip. This interconnection facilitates the transmission of signals.
PVD can be categorized into three types: Vacuum Evaporation, Ion Plating, and Sputtering [
1,
2,
3].
Vacuum Evaporation: When metals are heated within a vacuum, they transform into gas and evaporate. Vacuum Evaporation exploits this principle. Processing is commonly carried out in a vacuum environment. Both metals and various compounds can be used as coating materials. Its applications include mirrors, reflective surfaces, plastic components, and more. However, it is rarely used for purposes like surface hardening of metals; instead, it is mainly employed for decorative objects.
Ion Plating: This technique involves bombarding the target material with arcs, causing the atoms of the target material to be excited. These excited atoms then react with reactive gases, resulting in the deposition of compounds onto the surface of the workpiece. After achieving high vacuum within the chamber, inert gas is introduced and biased pressure is applied to generate argon ions (Ar+) and negatively charged electrons (e−). The positively charged argon ions will collide with the negatively biased substrate, effectively cleaning its surface. Subsequently, reactive gases are introduced, creating a plasma between the target material and the substrate, initiating the coating process. This method offers rapid film deposition and improved adhesion, making it suitable for coating cutting tools and similar applications.
Sputtering: Sputtering involves the phenomenon where high-energy particles bombard a target material, causing atoms within the target to be ejected. In this principle, the target material acts as the cathode, the substrate as the anode, and when a high voltage is applied, ionized Ar+ ions near the cathode collide with it. The molecules or atoms knocked out by the impact of Ar+ ions are then deposited onto the substrate, forming a thin film. Sputtering PVD boasts advantages like strong adhesion, high uniformity, rapid deposition speed, and the ability to coat a wide range of materials, including alloys, metals, and insulators. Nowadays, sputtering deposition processes have become a pivotal technology in various applications, such as semiconductors, LCD panels, solar panels, optical components, and more.
Under standard environmental conditions, the resistance value of a sputtered Ti thin film with the same thickness should be consistent. However, in actual production processes, variations in process conditions can lead to discrepancies between the actual and design resistance values. To enhance the resistance stability of the final product, when maintaining the same thin film thickness, it’s desirable to minimize the differences between the actual resistance value and the designed resistance value.
The case study is derived from a dynamic random-access memory (DRAM) manufacturing company located in the Hsinchu Science Park, Taiwan. Dynamic Random-Access Memory (DRAM) is a type of volatile memory used in computers and electronic devices for storing data temporarily. It stores each bit of data in a separate capacitor within an integrated circuit. DRAM requires periodic refreshing of stored data to maintain integrity. Known for its high density and cost-effectiveness compared to static RAM (SRAM), DRAM is widely used in applications where large amounts of memory are needed, such as in main system memory for computers and smartphones. Its performance is critical for efficient data processing and multitasking. In the PVD sputtering process conducted by the case company, the Ti thin film produced experiences a significant resistance deviation due to suboptimal levels of the control variables being set.
The resistance of the Ti thin film is primarily determined by the thickness of the Ti thin film and the quality of the film produced during the sputtering process. Currently, the case company deals with abnormal Ti thin film resistance mainly by adjusting the film thickness to achieve the desired design resistance. For instance, if the Ti thin film resistance is too high, a process engineer would increase the thickness of the Ti thin film to meet the design resistance. However, excessive adjustments to the Ti thin film thickness can lead to deviations from the customer’s intended design structure in the DRAM devices, resulting in an increase in customer complaints. Furthermore, the case company applies various Ti thin film thicknesses in different product structures. If different Ti thin film resistances require distinct control variable settings, it would also escalate the complexity of manufacturing management. The purpose of this study is to enhance the resistance consistency of the Ti thin film, allowing it to achieve the desired design values for different film thicknesses.
The ideal function of this study is for the relationship between Ti thin film resistances and designed resistances to be directly proportional, representing a dynamic quality characteristic, as shown in
Figure 1. The current status of the case company in terms of Ti thin film resistances and their corresponding design resistances for different Ti thin film thicknesses is presented in
Table 1. From
Table 1, it is evident that when the Ti thin film thickness is 100 Å, 150 Å, 200 Å, 300 Å, and 500 Å, their respective ideal Ti thin film resistance (designed resistance value) should be 42 Ω, 28 Ω, 21 Ω, 14 Ω, and 8.4 Ω. However, the actual Ti thin film resistances for the case company are measured at 162 Ω, 88.6 Ω, 55.1 Ω, 23.1 Ω, and 9.12 Ω.
Figure 2 shows the deviations between the current resistances and the designed resistance values for various thicknesses.
Confronting this issue, the company formed a dedicated group of process engineers. Employing a trial-and-error methodology, these engineers worked towards improving the resistance characteristics of the sputtered Ti thin film. Regrettably, this trial and error strategy yielded only marginal improvements, proving both time and resource-intensive. Given the limitations, the case company couldn’t feasibly carry out extensive experimentation for control variable adjustments using this method. Consequently, we introduce an innovative technique in this study to finely adjust control variables and optimize sputtered Ti thin film resistance value, all while minimizing the need for exhaustive on-site trials.
Our suggested approach TSDTM-ANN-GA involves the application of a two-stage dynamic Taguchi method (TSDTM), integrating an artificial neural network (ANN) and a genetic algorithm (GA). This framework is designed to mitigate the shortcomings of the traditional full factorial design and grid search method, ultimately streamlining the process of optimizing sputtered Ti thin film resistance.
2. Methodologies
- A.
Taguchi method
Genichi Taguchi introduced the Taguchi method in the 1950s. This experimental design technique consistently produces favorable results and has remained effective over time. Over time, the Taguchi method has gained widespread application in the industrial sphere, predominantly for quality control purposes.
The Taguchi method proposes the utilization of an orthogonal array for experimental design. This choice streamlines the experimental process by minimizing the necessary number of trials to achieve a comprehensive and dependable dataset. Moreover, the employment of an orthogonal array highlights the reproducibility of the experiments [
4].
The pivotal idea behind experimental design in the Taguchi method involves configuring control variable values using an orthogonal array (OA). An OA is represented as The meanings of the symbols within the OA are as follows:
L: stands for “Latin square”
a: represents the overall count of treatments
b: indicates the quantity of levels, encompassing distinct values for each control variable.
c: represents the upper limit of control variables that can be organized.
Table 2 shows the
L8(2
7) OA. Columns A through G denote the control variables. The table descripts the allocation of various levels for each control variable, with levels 1 and 2 corresponding to differing values of their respective variables. The
L8(2
7) OA consists of seven columns and eight rows, indicating an experimental design that involves seven production control variables and eight unique treatments.
Taguchi suggests that minimizing quality loss happens when a quality feature is in line with a designated target value. Greater quality corresponds to reduced quality loss. In order to quantify this loss, it is crucial to evaluate the connection between a quality characteristic and the resultant quality loss [
4]. Taguchi strongly promoted the utilization of the signal-to-noise (
S/N) ratio as a means to evaluate the effectiveness of Taguchi experimental design. Measured in decibels, the
S/N ratio gauges the quality of control variable settings [
5].
The goal of the dynamic Taguchi method is to treat the system’s quality feature as a function and optimize it to generate a range of outputs. Unlike static cases with a single output value as the optimization objective, dynamic scenarios involve multiple outputs to optimize. Dynamic Taguchi method is suitable when there’s an ideal relationship between input factors within a certain range and the output. Thus, dynamic factors (signal factors) are adjustment factors chosen based on engineering knowledge before conducting experiments. This is where dynamic situations significantly differ from the typical static case.
Various functional relationships might exist between the output and signal factors, determining the type of dynamic problem. The most common and effective form is the linear form, where signal factors act as proportionality factors. The zero-point proportional form is the primary type.
In the case of the zero-point proportional form, a simple linear relationship exists between the output (
y) and the signal factor (
M). Ideal experimental data points fall on a straight line passing through the origin:
where
The symbol β represents the slope of the line, and M represents the signal factor from repeated experiments under various conditions.
The closer the ideal behavior of the dynamic quality feature matches
, the better. The formula for using the
S/N ratio is outlined as follows:
where
J. Sun and J. Kainz (2023) adopted the Taguchi method to explore how four parameters in the hybrid pulse power characteristic (HPPC) test influence the performance of the equivalent circuit model (ECM) for Li-ion batteries, identifying optimal parameter combinations for predictable charge/discharge patterns [
6]. S. Antoniou et al. (2023) used the Taguchi method to analyze the effect of leading-edge slat design parameters on high-angle-of-attack stability in a Blended-Wing-Body UAV [
7]. K. S. Garud et al. (2023) applied the Taguchi method and computational fluid dynamics, this study explores the impact of non-Newtonian fluid models and factors like geometry and operation on shear stress in microfluidic cerebrovascular channels [
8]. R. Kruzel et al. (2023) investigates spark plasma sintering (SPS) for sinters production, widely adopted in practical industries, optimizing a Si
3N
4–Al
2O
3–ZrO
2 composite using the Taguchi method [
9]. Presently, within Taiwan, the Taguchi method is employed by several semiconductor companies for quality control purposes.
- B.
Artificial Neural Network
The artificial neural network (ANN) is a method of data analysis that facilitates the generation of mathematical models for the purposes of prediction, decision-making, and diagnosing engineering and medical challenges. This technique involves using a dataset that includes input parameters and output variables to construct and refine these models [
10].
The training process of an ANN entails iterative modifications to the weights of the network’s connections, symbolizing the intensity of connections between neurons. Elevated weight values indicate a higher significance for the output variables, as they influence the likelihood of activation. The amalgamation of multiple neurons gives rise to the construction of an ANN [
11]. Shown in
Figure 3 is an ANN consisting of three layers of neurons. The fundamental objective of ANN training is to ensure accurate correspondence between each input parameter and its corresponding output variable. The training process aims to minimize the disparity between desired target values and the output values generated by the ANN. Convergence is attained when this disparity stabilizes, indicating the completion of training. For assessing the ANN’s performance, validation using a distinct set of untrained samples is imperative to gauge the error between predicted values and known output variables [
12]. A pivotal element in ANN training is the learning rate, a crucial hyperparameter influencing the convergence pace of the ANN. A higher learning rate accelerates convergence, while a lower rate decelerates the process [
13,
14]. It’s apparent that hyperparameters significantly impact the prediction model’s effectiveness.
Artificial neural network has been extensively employed in various studies for modeling and prediction purposes. N. Matera et al. (2023) employed a network of artificial neural network to train and predict the global hourly electrical power generated by various PV modules [
15]. H. Azgomi et al. (2023) proposed an automatic apple disease diagnosis using a neural network, achieving 73.7% accuracy with color and texture features [
16]. T. Calisir et al. (2023) introduced an artificial neural network model with 8 design parameters as inputs, achieving high accuracy in predicting heat transfer and weight for panel radiator production [
17]. L. P. Lingamdinne et al. (2023) applied artificial neural networks to investigate the adsorption mechanism, pH effect, and desorption studies, yielding promising predictions for Pb(II) and As(V) removal [
18].
- C.
Genetic Algorithm
In 1975, John Holland introduced the genetic algorithm (GA), which has since undergone significant refinement. Demonstrating its prowess, the GA has emerged as a potent search technique for identifying globally optimal solutions. Its foundation lies in the principle of natural selection, favoring the survival of the most adept entities in biological systems. Leveraging this concept, the GA selects the most appropriate population for continued existence, culminating in a self-adapting optimization methodology that efficiently explores intricate, high-dimensional spaces.
The GA finds extensive utility in research involving hyperparameter optimization in machine learning models and engineering control variable refinement [
19]. The evolutionary trajectory of the GA involves three primary stages: chromosome reproduction and duplication, crossover, and mutation. Operating within the solution domain of engineering quandaries, the GA, through numerous iterations, identifies the most versatile solution that fulfills all stipulated constraints. This corresponds to the chromosome exhibiting the highest fitness function value, signifying the globally optimal solution [
20]. The algorithmic procedure of the GA is outlined as follows:
- (1)
Population Initialization: Generate a set of potential solutions at random, forming the initial population state.
- (2)
Evaluate Population: Assess the performance metric for each potential solution, denoted as the fitness function value.
- (3)
Parent Selection: Choose solutions with superior fitness values as parents. Commonly, methods like roulette wheel selection or elite preservation are employed to determine parent selection.
- (4)
Crossover: Randomly pick two parents for crossover, yielding new chromosomes through the process.
- (5)
Mutation: Apply mutations to offspring with a designated probability, preventing convergence into local optima.
- (6)
Next Generation Selection: Elect a subset of offspring and parents to form the subsequent population generation.
- (7)
Iterate Steps 2 to 6: Continuously repeat steps 2 to 6 until the convergence criterion is met or the maximum iteration limit is achieved.
- (8)
Final Selection of Optimal Solution: Ultimately, the global optimal solution will be identified.
Genetic algorithms have been widely used in numerous studies for modeling and prediction purposes. A. Neumann et al. (2023) reviewed Genetic Algorithm (GA) use in Engineer-To-Order (ETO) planning and scheduling problems, highlighting key ETO characteristics, constraints, and objectives. Common encoding formats and genetic operators are identified, emphasizing multi-objective approaches [
21]. M. G. Altarabichi et al. (2023) proposed a two-stage surrogate-assisted evolutionary approach using Genetic Algorithm (GA) for efficient feature selection in large datasets [
22]. R. M. Aziz et al. (2023) proposed a novel approach using Genetic Algorithm (GA) and deep learning for reliable detection of fraudulent transactions in Ethereum smart contracts, achieving high accuracy and outperforming various popular techniques [
23]. R. Ghezelbash et al. (2023) adopted the genetic algorithm to improve mineral prospectivity mapping through unsupervised clustering and supervised machine learning methods [
24].
- D.
The Suggested approach
This paper introduces the utilization of Taguchi method’s OAs to facilitate an efficient experimental plan for the case company. Our focus is on employing the dynamic Taguchi method to optimize the control variable settings for attaining an elevated yield ratio in the manufacturing of DRAM devices. Initially, the first-stage dynamic Taguchi method is employed to gather valuable experimental data and identify significant control variables. Subsequently, the second-stage dynamic Taguchi method is employed to determine a favorable setting for these control variables. Nonetheless, this setting, restricted by predetermined levels of distinct control variables, might not fulfill the ultimate optimal control variable setting.
To address this, the pursuit of the global optimal setting entails exploring all potential combinations of control variable values within their respective ranges. To accomplish this, an artificial neural network is established to create a predictive model for the sputtered Ti thin film resistance across various control variable values. The predictive model forms the core of a GA, facilitating the exploration of the finest global control variable setting. This is accomplished through the evaluation of the fitness function linked to the anticipated Ti thin film resistance. The suggested approach’s flowchart is depicted in
Figure 4.
3. Case Study
This study centers on a semiconductor company that specializes in the production of dynamic random-access memory (DRAM) devices. However, the observed discrepancy between the sputtered Ti thin film resistance and the intended designed resistance values of 8.4 Ω, 14 Ω, 21 Ω, 28 Ω, and 42 Ω has prompted the need for a solution in quality control. To address this challenge, we present an approach comprising a two-stage dynamic Taguchi method, an ANN, and a GA. The goal is to minimize the variance between the Ti thin film resistances of the DRAM devices and their intended target designed resistance values.
In this study, the concept of ideal sputtered Ti thin film resistance (M
i) is introduced as the signal factor, while the corresponding measured Ti thin film resistance (y
i) serves as the response variable. In the most ideal scenario, the slope of the line in
Figure 1 is 1 (i.e.,
β = 1). To align with this, we consider the ideal Ti thin film resistance (M
i) as the signal factor, with the respective ideal values set as M
1 = 8.4 Ω, M
2 = 14 Ω, M
3 = 21 Ω, M
4 = 28 Ω, and M
5 = 42 Ω.
- A.
Details of the Suggested approach
- (1)
Adopting first-stage dynamic Taguchi method to choose the important control variables and determine the optimal directions for setting their levels
The resistance of the Ti thin film is influenced by the sputtering process, as shown in
Figure 5. This figure encompasses various sub-processes, each associated with its respective production control variables. The production control variables for each sub-process are shows in
Table 3.
Drawing from engineering expertise, the team of process engineers has identified seven control variables for improving the Ti thin film resistance. These control variables encompass Degas temperature, Degas pressure, Ar flow, DC power, DC power ramp rate, Cryo pump 1st temperature, and Cryo pump 2nd temperature. The chosen control variables, along with their respective levels for the first-stage dynamic Taguchi method, are presented in
Table 4. To uphold stringent quality control, the semiconductor company conducted Ti thin film resistance sampling from ten predetermined positions on a wafer, shown in
Figure 6. Given the intricacy of our experimental design challenge involving seven control variables and two levels, we have opted for the
L8(2
7) orthogonal array.
Process engineers were tasked with executing the first-stage dynamic Taguchi method to identify pivotal control variables. During this phase, a total of eight experiments were conducted, involving varied levels of settings for each control factor. The sputtered Ti thin film resistance values of ten DRAM devices for each experiment, structured in the Taguchi orthogonal array
L8(2
7), are displayed in
Table 5. In this phase, employing a full factorial design to ascertain optimal settings for seven control variables, each featuring two levels, would necessitate a significant 128 experiments (2
7 = 128). Nevertheless, through the utilization of the
L8(2
7) orthogonal array, this study adeptly executed a compact set of only 8 experiments, effectively capturing a wide range of control variable settings.
Given the primary emphasis of this study on enhancing the resistance value of DRAM devices—a dynamic quality feature—the signal-to-noise ratio (
S/N) for each experiment was computed using Equation (3). The resulting
S/N ratios are tabulated in the final row of
Table 5.
Table 6 shows the outcomes of the variable response table, focusing on the
S/N ratio of DRAM devices. In parallel,
Figure 7 presents the results through a variable response chart. Additionally,
Table 7 offers a comprehensive ANOVA (analysis of variance) breakdown for the same dataset. Analyzing the findings in
Table 6, the impact of each production control variable on the
S/N ratio follows this sequence, descending from high to low: E(0.76) > C(0.73) > D(0.50) > A(0.45) > G(0.16) > B(0.11) > F(0.07). Moving to
Table 7, it is evident that the
p-values associated with production control variables A, C, D, and E fall below the 0.05 threshold. This signifies that these four control variables play a significant role in shaping the
S/N ratio. They emerge as crucial factors that wield notable influence on the
S/N ratio. Guided by the insights from
Figure 7, a better setting for the control variables surfaces: A setting of A = 250 °C, C = 60 sccm, D = 4000 W, and E = 4000 W/s stands out as a favorable combination to enhance the
S/N ratio.
Based on the data detailed in
Table 8, the descending order of the impact of control variables on the slope
β of DRAM devices can be elucidated as follows: A(0.198) > D(0.104) > C(0.070) > F(0.016) > E(0.005) > G(0.001) > B(0.001). Adding to this insight,
Table 9 furnishes the
p values, highlighting that control variables A, C, and D all reside below the critical threshold of 0.05. As a result, it is evident that these three control variables wield substantial influence on the slope
β of DRAM devices. They emerge as vital contributors that significantly shape
β. Taking these discoveries into account, a more optimal setting for the control variables becomes evident: A setting of A = 250 °C, C = 60 sccm, and D = 4000 W for
β, as presented in
Figure 8.
To delve deeper into finding the most effective control variable setting to improve the resistance value of the DRAM devices, an additional Taguchi experimental design, denoted as L9(34), is set to be undertaken. Building on the analysis outcomes of the SN ratio and β from the preceding experiment, the focus narrows down to four pivotal control variables: A (Degas temperature), C (Ar flow), D (DC power), and E (DC power ramp rate). This new round of experimentation aims to recollect pertinent data within a range of experiments that might include the optimal solution, with the intent of pinpointing the optimal levels for control variables as suggested by the first-stage dynamic Taguchi method.
- (2)
Adopting second-stage dynamic Taguchi method to gather dataset and determine the local optimal control variable setting
In this stage, we employ the second dynamic Taguchi method to identify the local optimal control variable configuration. It’s essential to emphasize that the term “local optimal setting” specifically denotes the most optimal setting attained exclusively through the implementation of the second-stage dynamic Taguchi method. The specific control variables and their respective levels earmarked for second-stage dynamic Taguchi method are shown in
Table 10. For the purpose of this second-stage dynamic Taguchi method, the
L9(3
4) orthogonal array was chosen as the framework for the experimental design. In this stage, using a full factorial design to determine the local optimal setting for four control variables, each characterized by three levels, would demand a considerable 81 experiments (3
4 = 81). However, by harnessing the
L9(3
4) orthogonal array, this study efficiently conducted a concise series of 9 experiments, encompassing a diverse spectrum of control variable settings.
Table 11 provides information on the resistances of sputtered Ti thin films and their corresponding
S/N ratios across various settings. In
Table 12, we focus on the
S/N ratio for DRAM devices, as shown in the variable response table. Complementing these tables,
Figure 9 represents the data through a variable response chart. Additionally,
Table 13 furnishes a comprehensive breakdown of the ANOVA results for the same dataset. Upon closer examination of the data presented in
Table 6, the impact of each production control variable on the
S/N ratio follows a descending order from high to low: E (3.62) > C (1.93) > A (0.90) > D (0.86). Transitioning to
Table 13, it becomes evident that the
p values associated with production control variables C and E are below the 0.05 threshold. This signifies the significant influence these four control variables exert on shaping the
S/N ratio, establishing them as pivotal factors. Drawing insights from
Figure 9, an better setting for the control variables emerges: C = 60 sccm and E = 4000 W/s. This setting is identified as favorable for enhancing the
S/N ratio, guided by the representation of the data.
Based on the data provided in
Table 14, we can elucidate the order of influence of control variables on the slope
β of DRAM devices as follows: D (0.066) > A (0.054) > C (0.041) > E (0.006). This insight is complemented by
Table 15, which presents the
p values. Notably, control variables A, C, and D all have
p values below the critical threshold of 0.05. This confirms that these three control variables play a substantial role in shaping the slope
β of DRAM devices, establishing them as crucial contributors. Considering these findings, a better setting for the control variables emerges: A = 250 °C, C = 55 sccm, and D = 4000 W for β, as indicated in
Figure 10. It’s worth noting that
Figure 9 reveals minimal disparity between the
S/N ratios for C
1 and C
2 settings, suggesting that control factor C has a limited impact on the
S/N ratio. Consequently, we identify the A
2C
1D
1E
1 setting as the locally optimal configuration for the control variables. Additionally,
Table 13 highlights the insignificance of factors C and E on
S/N ratios, while
Table 15 confirms the lack of significance of factors A, C, and D on the slope
β.
Following this, we aim to estimate the
S/N ratio and
β values using the local optimal control variables. Since variables A and D exert a negligible influence on the
S/N ratio, we have excluded them from our
S/N ratio prediction. The mean of nine
S/N ratios is depicted in
Table 11 as
, which equals −11.48 dB. We can also represent the mean of
S/N ratio for C1 as
, amounting to −10.86 dB, and the mean of
S/N ratio for E1 as
, which stands at −9.55 dB. Following dynamic Taguchi method, we calculate the predicted
S/N ratio for the local optimal control variable setting can be computed as:
Likewise, since factors E have minimal impact on the slope
β, their inclusion is disregarded in the prediction of
β. The average of the nine
β values is
ohms. The average
β for A2 is
ohms, for C1 it’s
ohms, and for D1 it’s
ohms. Thus, the predicted slope
for the local optimal control variable setting can be computed as:
To validate the precision of the projected
S/N ratio and
β values, the case company in question conducted a series of five confirmation trials.
Table 16 presents a detailed summary of the results derived from these confirmation trials. To enhance the reliability of our predictions, we calculated confidence intervals at a 95% for both the predicted
S/N ratio. This crucial step was accomplished by employing Equation (7) as detailed in reference [
20].
where
: the F statistic;
α: producer’s risk;
ν: degrees of freedom for the variance associated with the combined error;
Ve: variance of combined error;
nobs: effective observation number
; and
r: number of replicates.
In order to determine the confidence intervals at a 95% for both the
S/N ratio and
β, the subsequent calculations of the following components are carried out:
The confidence interval at a 95% for the
S/N ratio ranges from −8.93 to −6.84. To confirm the better settings, the semiconductor company performed Ti thin film resistance sampling at three predetermined positions on a wafer, as shown in
Figure 11. Analysis of the confirmation trial results, presented in
Table 16, reveals a mean S/N ratio of −7.94 dB. Significantly, these value comfortably lie within its corresponding confidence intervals. This alignment between the experimental results and their respective confidence intervals provides strong evidence supporting the accuracy and reliability of our projections for the
S/N ratio. It underscores the robustness of our conclusions, reaffirming the validity of the anticipated
S/N ratio values derived from the analysis.
Table 16.
Confirmation trials for two-stage dynamic Taguchi method.
Table 16.
Confirmation trials for two-stage dynamic Taguchi method.
No | Trial #1 | Trial #2 | Trial #3 | Trial #4 | Trial #5 | Average |
---|
M1 = 8.4 Ω | N1 | 8.87 | 8.88 | 8.89 | 8.89 | 8.89 | |
N2 | 8.89 | 8.90 | 8.87 | 8.86 | 8.89 | |
N3 | 8.88 | 8.87 | 8.86 | 8.88 | 8.88 | |
M2 = 14 Ω | N1 | 14.67 | 14.58 | 14.79 | 15.11 | 14.80 | |
N2 | 15.05 | 14.52 | 15.02 | 14.69 | 14.88 | |
N3 | 15.30 | 14.98 | 14.94 | 14.82 | 14.86 | |
M3 = 21 Ω | N1 | 22.32 | 23.25 | 22.28 | 22.90 | 22.20 | |
N2 | 22.37 | 23.73 | 23.65 | 22.69 | 22.75 | |
N3 | 21.05 | 21.88 | 24.65 | 21.51 | 21.86 | |
M4 = 28 Ω | N1 | 27.86 | 29.49 | 31.74 | 31.06 | 28.62 | |
N2 | 32.59 | 26.97 | 28.78 | 29.78 | 28.14 | |
N3 | 27.89 | 32.07 | 30.80 | 30.41 | 29.15 | |
M5 = 42 Ω | N1 | 44.83 | 39.23 | 39.10 | 38.13 | 42.13 | |
N2 | 42.49 | 44.48 | 34.69 | 36.93 | 32.58 | |
N3 | 36.48 | 35.35 | 41.71 | 40.67 | 44.88 | |
M1 = 8.4 Ω | Average | 8.88 | 8.88 | 8.87 | 8.88 | 8.88 | 8.879 |
M2 = 14 Ω | Average | 15.01 | 14.69 | 14.92 | 14.87 | 14.85 | 14.869 |
M3 = 21 Ω | Average | 21.91 | 22.95 | 23.52 | 22.37 | 22.27 | 22.605 |
M4 = 28 Ω | Average | 29.45 | 29.51 | 30.44 | 30.42 | 28.64 | 29.690 |
M5 = 42 Ω | Average | 41.26 | 39.68 | 38.50 | 38.58 | 39.86 | 39.577 |
β | 1.01 | 1.00 | 1.00 | 0.99 | 0.99 | 0.999 |
S | 2.16 | 2.58 | 2.80 | 2.23 | 2.75 | 2.505 |
S/N ratio | −6.57 | −8.24 | −8.96 | −7.06 | −8.88 | −7.940 |
Table 17 presents a comparison between the outcomes achieved prior to and subsequent to the implementation of the dynamic Taguchi method. Notably, the differences observed between the average resistances of the sputtered Ti thin films and the designed resistance values have demonstrated considerable improvements of 33.5%, 90.5%, 95.3%, 97.2%, and 98.0% for five different thicknesses. This improvement is shown in
Figure 12, which showcases the contrast between the scenarios before and after the application of the dynamic Taguchi method. It is evident from the figure that the resistance values of the sputtered Ti thin films post the dynamic Taguchi method exhibit a remarkable proximity to the idealized design resistance values.
- (3)
Applying ANN for constructing a predictive model
Using the information from
Table 13 and
Table 15, pivotal control variables can be discerned, namely A (Degas temp.), C (Ar flow), D (DC power), and E (DC power ramp rate). These selected variables are slated to function as the input parameters for the ANN. The disparity between the sputtered Ti thin film resistance value and the designed target resistance value will be employed as the output variable. Additionally, the experimental data derived from the
L9(3
4) setting in
Table 11 will be harnessed as the dataset for training the artificial neural network.
In this research, an artificial neural network architecture featuring three hidden layers is put forth. To effectively train this network, an allocation of 80% of the dataset is arbitrarily designated as the training dataset, while the remaining 20% is earmarked for use as the test dataset. The pursuit of optimal hyperparameters entails employing a grid search methodology. This process encompasses the exploration of diverse settings for hyperparameters, including the solver, activation function, the quantity of nodes within each hidden layer, learning rate, and momentum rate. The solver is subjected to assessment using the options [lbfgs, adam, sgd], while the activation function is scrutinized with the choices [logistic, tanh, relu]. Furthermore, the learning rate undergoes evaluation using values of [0.1, 0.2, 0.3, 0.4], and the momentum rate is explored across values of [0.6, 0.7, 0.8, 0.9]. Simultaneously, the number of nodes situated within each hidden layer is preset within the range of [1 to 10].
The artificial neural network undergoes training over a span of 1000 iterations. The criterion employed to finalize the weights is based on the attainment of the lowest root mean square error (RMSE) across the test dataset. Following the execution of the grid search methodology, the optimal setting for the hyperparameters are as follows: the solver is configured as adam, the activation function is specified as tanh, the learning rate is determined as 0.4, the momentum rate is defined at 0.6, and the quantity of nodes integrated within the three hidden layers are set to 5, 3, and 10, respectively. Consequently, the optimal architecture for the neural network is represented as 4-5-3-10-1, as shown in
Figure 13.
- (4)
Applying GA to find the global optimal control variable setting
In the framework of GA, the four control variables are represented by binary strings that correspond to their values. The boundaries for each control factor are established in accordance with their respective ranges, outlined in
Table 10. The fitness function, articulated through Equation (8), plays a pivotal role in the GA process. When implementing the GA, the values assigned to a given control variable are initiated via random generation within the predefined range. This range spans from the lowest to the highest levels associated with the particular control factor, as detailed in
Table 10.
The GA commences by establishing its initial generation, where 30 initial populations are randomly generated. Each chromosome signifies a unique setting of the control variables. Subsequently, the GA proceeds to execute a sequence of crossover and mutation processes, thereby facilitating the evolutionary progression of the population. For the crossover operations, a rate of 0.85 is employed, while the probability of mutation is designated at 0.05. The iteration process continues until it reaches a maximum of 500 iterations. We also defined the allowable ranges of production variable using “Level 1” and “Level 3” as specified in
Table 10.
In this paper, we replicated the experimentation process by conducting the genetic algorithm (GA) procedure a total of ten trials. The outcomes of these trials are outlined in
Table 18. Notably,
Table 18 presents the paramount global fitness value as 0.95459, accompanied by the corresponding variable settings of A = 245 °C, C = 55 sccm, D = 5911 W, and E = 4009 W/s. Furthermore, the statistical summaries of the fitness values across the experiments are shown in
Table 19. This table shows the maximum, minimum, mean, and standard deviation values of the fitness outcomes. The considerably low standard deviation across the experiments underscores the stability and reliability of the GA results.
In GA, we conducted 30 initial population tests and 500 iterations, resulting in a total of 15,000 tests. In contrast, a grid search would require tests, significantly reducing the number of modeling tests.
- B.
Conducting the Confirmation Trials
To substantiate the authenticity of the global optimal control variable setting found by GA, the case company carried out a set of five confirmation trials using the GA-identified optimal setting. The outcomes of these trials are chronicled in
Table 20. Moreover, a comprehensive juxtaposition of the average resistances of the sputtered Ti thin films prior to and subsequent to the implementation of the suggested approach (ANN_GA) is presented in
Table 21. This table reveals that the difference between the average resistances and the designed resistance values are 0.10 Ω, 0.54 Ω, 1.40 Ω, 1.11 Ω, and 1.46 Ω, respectively. These differences have been further improved by 86.8%, 94.1%, 95.9%, 98.2%, and 98.8% for five different thicknesses.
Figure 14 serves as a representation of the contrast between the outcomes prior to and subsequent to the adoption of the suggested approach. After applying the suggested approach, the resistance values of the sputtered Ti thin films exhibit a significant convergence towards the ideal resistance values.
4. Conclusions
Due to resistance value deviations in Ti thin films produced during the PVD process for DRAM, this study aims to minimize these deviations across multiple film thicknesses with minimal resource utilization. To achieve this goal, we propose the TSDTM-ANN-GA framework.
In the first-stage dynamic Taguchi method, we utilized the L8(27) orthogonal array to conduct experiments and identify the key control variables affecting the resistance values of Ti thin films. Typically, a full factorial experiment with seven control variables and two levels would require 128 experiments. However, using the L8(27) orthogonal array effectively reduces this to just 8 experiments. The first-stage dynamic Taguchi method drastically curtails the experimental workload, achieving an 16X reduction relative to the full factorial experiment. Through the use of variable response tables and analysis of variance (ANOVA), we identified four key control variables: A (Degas temperature), C (Ar flow), D (DC power), and E (DC power ramp rate).
Next, the second-stage dynamic Taguchi method was executed using the L9(34) orthogonal array to re-collect experimental data for the four key control variables. Traditionally, a full factorial experiment with four control variables at three levels each would require 81 experiments. However, the L9(34) orthogonal array simplifies this to just 9 experiments. The second-stage dynamic Taguchi method significantly reduces the experimental workload, achieving a 9-fold reduction compared to a full factorial experiment. Using the L9(34) array, the optimal combination settings for the four key control variables were determined to be A (Degas temperature) = 250 °C, C (Ar flow) = 55 sccm, D (DC power) = 4000 W, and E (DC power ramp rate) = 4000 W/s. This optimal combination significantly reduced the resistance value deviations in the Ti thin films. For the five different Ti film thicknesses, the resistance deviation percentages were reduced by 33.5%, 90.5%, 95.3%, 97.2%, and 98.0%, respectively.
The dataset obtained from the L9(34) experiments was used as training and testing data for the artificial neural network (ANN). In constructing the network, the four control variables were used as inputs. Hyperparameters were optimized using a grid search method, resulting in a learning rate of 0.4, a momentum rate of 0.6, and three hidden layers with 5, 3, and 10 nodes, respectively. The final ANN architecture was 4-5-3-10-1. Subsequently, a genetic algorithm (GA) was employed to determine the global optimal control variable settings in the parameter space. In the GA, a total of 15,000 tests were conducted. In contrast, a grid search would require 4 × 108 tests, significantly reducing the modeling effort and making it suitable for low-power computers. The globally optimal solution identified was A (Degas temperature) = 245 °C, C (Ar flow) = 55 sccm, D (DC power) = 5911 W, and E (DC power ramp rate) = 4009 W/s. Final validation experiments showed that the proposed TSDTM-ANN-GA framework reduced the deviations between the Ti thin film resistance values and their design values by 86.8%, 94.1%, 95.9%, 98.2%, and 98.8%, respectively.
The proposed method effectively reduced resistance deviations for the five Ti thin film thicknesses and simplified manufacturing management, allowing the required design values to be achieved under the same manufacturing conditions. This framework can efficiently operate on resource-limited and low-power computers, achieving the goal of real-time dynamic production parameter adjustments and enabling DRAM manufacturing companies to improve product quality promptly.
This study presents an integrated framework that combines the Taguchi method, neural networks, and genetic algorithms. This framework demonstrates considerable flexibility, allowing for potential integration with other machine learning techniques (such as decision trees, SVM, etc.) or heuristic algorithms (like particle swarm optimization, simulated annealing, etc.) in future studies.
Further optimization can be achieved by refining each method individually. For example, the Taguchi method can be enhanced by considering the interaction effects among variables. Additionally, the ANN structure can be improved to more accurately predict the impact of process parameters on resistance stability, and the GA can be used to more effectively search for optimal parameter combinations. Incorporating self-adjusting algorithms could also be considered, enabling the TSDTM-ANN-GA framework to dynamically adapt and learn in response to process variations, thereby further enhancing the resistance stability of the thin films.
To adapt the framework for real-time dynamic production parameter adjustments, it is essential to enhance its data processing and feedback mechanisms to enable real-time monitoring and response. Integrating Internet of Things (IoT) technology could facilitate instant data collection and analysis. Additionally, faster algorithms may need to be developed or implemented to ensure timely and accurate parameter adjustments during the manufacturing process. Furthermore, the models within the framework, such as ANN and GA, should be capable of quickly adapting to changes in production conditions, thereby supporting real-time dynamic adjustments in various environments.
The framework has the potential for scalability and can be applied to other semiconductor manufacturing processes. However, modifications may be necessary, including adjusting input variables and response variables to meet the specific requirements of different processes. Additionally, the training data for the ANN and the objective functions for the GA may need to be redesigned and calibrated according to the new process goals to ensure the accuracy and relevance of the optimization results.
The primary limitation of this research is that relying on a single dataset may restrict the generalizability of the results. Therefore, future studies should incorporate independent datasets from various semiconductor manufacturing processes to enhance the framework’s validity and reliability.
The proposed framework can also be extended to deep learning-based image product defect detection tasks. Notably, its suitability for deployment in manufacturing edge artificial intelligence (edge AI) devices is highlighted, offering more efficient training and accurate inference results for image defect detection tasks in resource-constrained environments.