Next Article in Journal
Assessment of Progressive Product Innovation on Key Environmental Indicators: Pampers® Baby Wipes from 2007–2013
Next Article in Special Issue
Government Support, FDI Clustering and Semiconductor Sustainability in China: Case Studies of Shanghai, Suzhou and Wuxi in the Yangtze Delta
Previous Article in Journal
Challenges for Sustainable Development and Its Legal Response in China: A Perspective for Social Transformation
Previous Article in Special Issue
Strengthening the Competitiveness and Sustainability of a Semiconductor Manufacturer with Cloud Manufacturing
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Enhancing the Effectiveness of Cycle Time Estimation in Wafer Fabrication-Efficient Methodology and Managerial Implications

Department of Industrial Engineering and Systems Management, Feng Chia University, 100, Wenhwa Road, Seatwen, Taichung City 407, Taiwan
*
Author to whom correspondence should be addressed.
Sustainability 2014, 6(8), 5107-5128; https://doi.org/10.3390/su6085107
Submission received: 19 May 2014 / Revised: 28 July 2014 / Accepted: 28 July 2014 / Published: 11 August 2014
(This article belongs to the Special Issue Competitive and Sustainable Semiconductor Manufacturing)

Abstract

:
Cycle time management plays an important role in improving the performance of a wafer fabrication factory. It starts from the estimation of the cycle time of each job in the wafer fabrication factory. Although this topic has been widely investigated, several issues still need to be addressed, such as how to classify jobs suitable for the same estimation mechanism into the same group. In contrast, in most existing methods, jobs are classified according to their attributes. However, the differences between the attributes of two jobs may not be reflected on their cycle times. The bi-objective nature of classification and regression tree (CART) makes it especially suitable for tackling this problem. However, in CART, the cycle times of jobs of a branch are estimated with the same value, which is far from accurate. For these reason, this study proposes a joint use of principal component analysis (PCA), CART, and back propagation network (BPN), in which PCA is applied to construct a series of linear combinations of original variables to form new variables that are as unrelated to each other as possible. According to the new variables, jobs are classified using CART before estimating their cycle times with BPNs. A real case was used to evaluate the effectiveness of the proposed methodology. The experimental results supported the superiority of the proposed methodology over some existing methods. In addition, the managerial implications of the proposed methodology are also discussed with an example.

1. Introduction

Wafer fabrication is a complex and time-consuming process. First, photoresist patterns are photo-masked onto the surface of a wafer. Then, the wafer is exposed to short-wave ultraviolet light. The unexposed areas are etched away and cleaned. Subsequently, hot chemical vapors are deposited onto the desired zones, so that ions can be implanted to form specific patterns at the required depth. These steps are repeated hundreds of times, depending on the complexity of the desired circuits and connections that are usually measured in fractions of micrometers.
This study aims to estimate the cycle time of a job in a wafer fabrication factory. The cycle time (flow time, manufacturing lead time) of a job is the time required for the job to go through the factory. Therefore, it is subject to future demand, capacity constraints, the factory congestion level, the quality of job scheduling, and many other factors. As a result, the cycle time of a job is highly uncertain. According to the competitive semiconductor manufacturing (CSM) survey, the best-performing wafer fabrication factory of memory products achieved an average cycle time of about two days per layer of circuitry [1]. Cycle time management activities include cycle time estimation, internal due date assignment, job sequencing and scheduling, and cycle time reduction (see Figure 1). In practice, shortening the cycle times of jobs is considered an effective way to improve the responsiveness to changes in demand [2]. In addition, according to [3], the number of defects per die has a positive relationship with the cycle time, which means reducing the cycle time can improve product quality. Further, estimating the cycle time of a job helps establish the internal due date for the job. At semiconductor manufacturing factories, the quantities allocated to customers are distributed to daily time buckets according to the available to promise (ATP), which is calculated according to the historical cycle time. For these reasons, estimating and shortening the cycle times of jobs is a very important task to maintain a competitive edge in this industry [4].
Figure 1. Cycle time management activities.
Figure 1. Cycle time management activities.
Sustainability 06 05107 g001
In the literature, various types of methods have been proposed to estimate the cycle time of a job in a factory. For example, probability-based statistical methods, such as queuing theory and regression, have been proposed. Furthermore, in these studies, some restrictive assumptions were made, such as exponential processing time distribution [5]. Recently, Pearn et al. [6] fitted the waiting time of a job in a wafer fabrication factory with a Gamma distribution. After adding the waiting time to the release time, the cycle time can be derived. This is one of the most important tasks in controlling a wafer fabrication factory. However, the fitted distribution became invalid quickly, making some cycle time estimates far from accurate [7]. Wu [8] constructed a Petri net to estimate the stage cycle time of dual-arm cluster tools with wafer revisiting. Hsieh et al. [9] modeled the response surface between the cycle time of normal lots and the percentage of hot lots in semiconductor manufacturing. Chen [10] fitted a fuzzy linear regression (FLR) equation to estimate the cycle time of a job in a wafer fabrication factory. A precise range of the cycle time was also determined. For the same purposes, Chien et al. [11] fitted a nonlinear regression equation instead. Chen [7] applied classification and regression tree (CART) to estimate the cycle time of each job in a wafer fabrication factory. Principal component analysis (PCA) was also applied to generate independent variables from the original ones, which then served as the new inputs to CART.
The application of artificial neural networks (ANNs) is also a mainstream in this field. For example, a self-organization map (SOM) was developed in Chen [12] to classify the jobs in a wafer fabrication factory into a number of categories. Chen [13] and Chang and Hsieh [14] have constructed back propagation networks (BPNs) (or feed-forward neural networks, FNNs) to estimate the cycle time of a job based on the attributes of the job and the current factory conditions. These studies indicated that linear methods are incapable of estimating the cycle time of a job, which supported the application of nonlinear methods such as ANNs. In addition, to improve the effectiveness of an ANN approach, classifying jobs before (or after) estimating the cycle times have been shown to be a viable strategy. To this end, several classifiers were applied, such as k-means (kM) [15], fuzzy c-means (FCM) [16], and SOM [12,17]. A common feature of these classifiers is that all attributes of a job are considered at the same time. In contrast, there are classifiers that consider only some of the job attributes, such as CART and fuzzy inference systems (FISs). The joint use of CART and BPN for estimating the cycle time of a job has rarely been discussed in this field. Chen [18] proposed a BPN tree approach in which the jobs of a branch are separated into two parts for either part a BPN is constructed to estimate the cycle times of jobs. However, Chen’s approach relies on extensive and iterative BPN re-learning.
Genetic algorithm (GA) or genetic programming (GP) have also been applied to optimize the parameters of the existing FISs or ANNs to estimate the cycle time of a job, e.g., Chang et al. [19], and Nguyen et al. [20].
This study proposes a hybrid principal component analysis (PCA), CART, and BPN approach to estimate the cycle time of a job in a wafer fabrication factory. The significance of doing so is six-fold:
(1)
Hybrid algorithms have been shown to be more effective than (pure) algorithms.
(2)
Although variable replacement has been applied to forecasting in many industries, it has not been applied to job cycle time forecasting for semiconductor manufacturers. This study applies PCA to enhance the forecasting performance of the CART-BPN approach.
(3)
In CART, the cycle times of jobs of a branch are estimated with the same value, which is not accurate. In the proposed PCA-CART-BPN approach, a BPN is established for each branch to estimate the cycle times of jobs, which is expected to enhance the estimation accuracy.
(4)
Although clusterwise models, such as SUPPORT and treed Gaussian process models, have been used in various fields, they have not been applied to estimating the cycle time of a job in a manufacturing system.
(5)
Compared with the existing methods, the proposed PCA-CART-BPN approach classify jobs based on fewer job attributes, which is relatively easy to implement. It is also possible to assign more jobs to a branch.
(6)
The existing classifiers in this field, such as kM, FCM, and SOM, classify jobs based on their attributes rather than their compatibilities with the estimation mechanism. However, the compatibility with the estimation mechanism is important, and may be more influential to the estimation performance. In this regard, CART considers the estimation performance in classifying jobs, which makes it more suitable for the same purpose.
In the proposed PCA-CART-BPN approach, first the original variables are replaced according to the results of PCA analysis. Then, jobs are classified using CART. Finally, a BPN is constructed for each category to estimate the cycle times of jobs within the category.
Table 1 is used to compare the proposed PCA-CART-BPN approach with some existing methods in this field. Some methods are easy to use because there have been a lot of software that can seamlessly complete all the necessary steps.
Table 1. A comparison of the proposed methodology with some existing methods.
Table 1. A comparison of the proposed methodology with some existing methods.
MethodVariable ReplacementClassifierEstimation MethodEasiness to UseAccuracy
BPN [14]--BPNEasyModerate
SOM-BPN [17]-SOMBPNModerateHigh
Pearn et al. [6]--Gamma Distribution FittingEasyLow
SOM-FBPN [12]-SOMFBPNDifficultHigh
kM-FBPN [15]-kMFBPNDifficultHigh
PCA-CART [7]PCACARTCARTEasyModerate
The proposed methodologyPCACARTBPNEasyHigh
The procedure of the proposed PCA-CART-BPN approach is detailed in Section 2, followed by an application to a real case in Section 3. Several existing methods were also applied to the same case for a comparison. Then, the advantages and/or disadvantages of each method were discussed. The managerial implications of the proposed methodology are discussed in Section 4. Finally, Section 5 summarizes the findings of this study, and puts forward some directions that can be explored in future studies.

2. Methodology

The proposed methodology is composed of five steps, as illustrated in Figure 2.
Figure 2. The procedure for the proposed PCA-CART-BPN approach.
Figure 2. The procedure for the proposed PCA-CART-BPN approach.
Sustainability 06 05107 g002

2.1. Variable Replacement Using PCA

PCA replaces the original variables by new variables that are independent of each other; these new variables become new inputs to be used in the CART-BPN that estimates job cycle times.
PCA comprises four steps:
(1)
Raw data standardization: The original variables may have excessively large numerical differences and dimensional conflicts. To standardize the dimensions and differences, apply Equations (1) through (3):
Sustainability 06 05107 i001
Sustainability 06 05107 i002
Sustainability 06 05107 i003
where xi and σxi denote the mean and standard deviation of job attribute i.
(2)
The correlation matrix R:
Sustainability 06 05107 i004
where X* is the standardized data matrix. The eigenvalues and eigenvectors of R are λ1 ~ λm and u1 ~ um, respectively (λ1 ≥ λ2 ≥ … ≥ λm).
(3)
Determination of the number of principal components: The variance contribution rate is:
Sustainability 06 05107 i005
and the accumulated variance contribution rate is:
Sustainability 06 05107 i006
Choose the smallest p value such that ηƩ(p) ≥ 85% ~ 90%.
(4)
Formation of the following matrixes:
Um×p = [u1,u2,…,up]
Sustainability 06 05107 i007

2.2. Job Classification Using CART

CART, introduced by Breiman et al. [21], is a statistical procedure primarily used for object classification. The objective is to classify a group of objects into two or more populations. CART can handle both categorical and continuous data. CART uses an exhaustive search in splitting with an objective to improve the impurity measure, i.e., the reduction in the residual sum of squares.
The procedure of CART is composed of stages including tree growing, stopping, and pruning. The first stage is to grow the tree using a recursive partitioning technique that selects variables and split points according to a pre-specified criterion. Criteria to this end include Gini, towing, ordered towing, and maximum deviance reduction [21].
Tree growing stops if any of the following conditions is satisfied:
(1)
The improvement in the performance measure has become insignificant with more branches.
(2)
A certain number of nodes have been generated.
(3)
The depth of the tree has reached a certain level.
A large tree may overfit, while a small tree may not reflect the inherent structure of the data. Cost-complexity pruning is usually used to tackle this issue. The cost-complexity of a tree T is the sum of sum of squared error (SSE) and the penalty on the complexity/size of the tree:
Cα(T) = SSE + α|T|
Sustainability 06 05107 i008
where CTj and yj are the cycle time estimate and actual cycle time of job j, respectively. The results of cost-complexity pruning are a nested subset of trees starting from the largest tree and ending with the smallest tree (with only a single node). The effectiveness of a subtree can be evaluated by cross-validation or using another (testing) data.
In the traditional CART approach, the cycle times of jobs assigned to a branch (b) are estimated with the same value y(b) that is equal to the average of the historical cycle times:
Sustainability 06 05107 i009
where G(b) represents the set of jobs of branch b. However, such a treatment is far from accurate. For this reason, in this study, a BPN is constructed for each branch to estimate the cycle times of jobs assigned to this branch:
yj = BPNb(zj)
where BPNb is the BPN constructed for branch b to estimate the job cycle times; zj is the vector of the new attributes of job j. The comparison of CART, CART-BPN, and the proposed methodology is illustrated in Figure 3, in which Δk is the boundary value used to split the data along the new attribute k.
Figure 3. Comparison of CART and the proposed methodology.
Figure 3. Comparison of CART and the proposed methodology.
Sustainability 06 05107 g003

2.3. Job Cycle Time Estimation within Each Node Using BPN

2.3.1. Rationale

In a wafer fabrication factory, the relationship between the cycle time and attributes of a job has been shown to be a nonlinear one [11]. Consider the simplest example. Sometimes wafers are processed piece by piece, while in other steps tens of wafers are processed as a whole. As a result, the relationship between the cycle time and size of a job cannot be fitted with a linear equation. BPN is a well-known tool for fitting nonlinear relationships, so is CART. A combination of CART and BPN is natural, and has potential for improving the performance of estimating the cycle time of a job.
Another question is why jobs with similar attributes have very different cycle times. That is because most of the attributes of a job were determined when the job was released into the factory. However, the cycle time of a job depends on the future conditions of the factory. That may be much different even for jobs with similar attributes. That explains the incapability of the existing methods based on job classification.

2.3.2. Procedure

Subsequently, a BPN is constructed to estimate the cycle times of jobs. The BPN is configured as follows. There are K inputs to the BPN including the new attributes of a job. A lot of past studies have shown that a BPN with a single hidden layer can achieve a satisfactory approximation performance [15,16,17,22]. In addition, several ways have been proposed in the literature to determine the number of neurons in the hidden layer, e.g., [22,23,24]. In this study, common neural network learning parameters, including number of hidden layers, number of hidden layer units, learning rate, and learning time, all are determined using a trial-and-error method. The activation/transformation functions for the input and hidden layers are the linear activation function and the hyperbolic tangent sigmoid function, respectively:
(linear activation) f(x) = x
Sustainability 06 05107 i010
Inputs to the BPN are multiplied by the weights of the connections between the input and hidden layers, then are summed on each neuron in the hidden layer. After being compared with the threshold on the neuron, only significant signals will be transformed and outputted as:
Sustainability 06 05107 i011
where:
Sustainability 06 05107 i012
Sustainability 06 05107 i013
hjl is the outputted signal from hidden-layer neuron l for job j; Sustainability 06 05107 i014 is the threshold on hidden-layer neuron l; Sustainability 06 05107 i015 is the weight of the connection between input-layer neuron k and hidden-layer neuron l.
Signals outputted from the hidden-layer neurons are transmitted to the neuron in the output layer in the same manner. Finally, the output from the BPN is generated as:
Sustainability 06 05107 i016
where:
Sustainability 06 05107 i017
Sustainability 06 05107 i018
θ0 is the threshold on the output-layer neuron; Sustainability 06 05107 i019 is the weight of the connection between hidden-layer neuron l and the output-layer neuron.
Subsequently, a lot of algorithms can be applied to train a BPN, such as the gradient descent (GD) algorithm, the conjugate gradient algorithm, the Levenberg-Marquardt (LM) algorithm, the Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm, and others. For a recent comparison of these algorithms, refer to [22]. Among these algorithms, the LM algorithm has a faster convergence speed, therefore is applied in the proposed methodology, as described below.
First, placing the inputs and the BPN parameters in vectors:
Zj = [Zj1…Zjk]
and
Sustainability 06 05107 i020
respectively. Then, the network output oj can be represented with:
oj = f(zj,β)
Substituting Equation (25) into Equation (14),
Sustainability 06 05107 i021
To find out the optimize values of β, an iterative procedure is used in the LM algorithm:
(1)
Specify the initial values of β, e.g. β = [1, 1, ... , 1].
(2)
Replace β by β + δ, where δ = Sustainability 06 05107 i022:
oj = f(zj,β+δ)
(3)
Approximating the right-hand side by its linearization gives:
f(zj,β+δ) ≈ f(zj,β) + Jjδ
where:
Jj = მf(zj,β)/β
is the gradient vector of f with respect to β. SSE becomes:
Sustainability 06 05107 i023
(4)
The optimal value of δ can be obtained by taking the derivative of SSE with respect to δ and setting the result to zero. For details refer to [25].
(5)
Return to step (2).

3. Applications

A real case containing the data of 120 jobs from a wafer fabrication factory located in Taichung Scientific Park, Taiwan, was used to evaluate the effectiveness of the proposed methodology (see Table 2). There are tens of dynamic random access memory (DRAM) products in the wafer fabrication factory. Six attributes of a job, including job size, factory utilization, the queue length on the route, the queue length before the bottleneck, work-in-process (WIP), and the average waiting time, are indicated with xj = [xj1 xj2 xj3 xj4 xj5 xj6]. The average waiting time is the average of the waiting times of the three most recently completed jobs. That measures the extent of delay that each job is likely to face. The six attributes were chosen from about twenty candidates after the backward elimination of regression analysis. The proposed PCA-CART-BPN approach was implemented on a PC with an Intel Dual CPU E2200 2.2 GHz and 2.0G RAM. The BPN was implemented with the Neural Network Toolbox of MATLAB 2006a.
Table 2. The collected data.
Table 2. The collected data.
jxj1xj2xj3xj4xj5 (h)xj6CTj (h)
12412231588079984.2%953
223122516466514294.8%1248
325123215471837388.4%1299
423128216581314892.9%976
522135218276038993.1%1189
62412931827606996.2%1095
...
12022131915977732688.8%1285
At first, a Pareto comparison of percentage of variability explained by each principal component is shown in Figure 4. An individual component by itself was able to explain less than 30% of the variance; the present study proposed to explain 85%–90% of the variance, and thus p was set to 5. The five principal components shown in Figure 4 explained approximately 85% of the total variability in the standardized data; therefore, the five-component analysis was a reasonable reduction of dimensions.
Figure 4. The Pareto analysis chart.
Figure 4. The Pareto analysis chart.
Sustainability 06 05107 g004
The coordinates of the original data were calculated in terms of the new coordinate system to produce component scores. Table 3 shows the component scores, which were used as new inputs to the BPN.
Table 3. New inputs to the BPN.
Table 3. New inputs to the BPN.
jzj1zj2zj3zj4zj5
10.5470.855−1.640−1.345−0.591
2−0.6490.361−0.8500.047−1.115
3−0.596−1.170−1.9840.388−1.015
40.2181.062−1.090−0.4680.885
50.296−0.0731.1781.1981.764
6−1.3182.1270.1580.2140.305
1200.930−0.846−0.677−0.3611.238
Subsequently, the new data were divided into two parts—the training data (the first 90 jobs) and the testing data (the remaining 30 jobs) (see Table 2). The training data were not normalized before creating the CART tree, but were normalized into [0.1, 0.9] before they were learned by the BPNs. In fact, data normalization is not helpful for CART, but is conducive to the fast convergence of the BPN. Only the performance to the testing data was evaluated. In addition, four-fold cross validation was applied to effectively reduce bias and variability.
Subsequently, CART was applied to classify jobs according to the new attributes. The impurity measure in CART was Gini. The popular 1-SE rule was applied for the CART tree pruning. In this way, the smallest tree which cross-validation cost was less than the minimum cross-validation cost plus the standard deviation of the cross-validation cost was chosen. The results are shown in Figure 5.
Figure 5. The classification results using PCA-CART. (A) 1st run; (B) 2nd run; (C) 3rd run; (D) 4th run.
Figure 5. The classification results using PCA-CART. (A) 1st run; (B) 2nd run; (C) 3rd run; (D) 4th run.
Sustainability 06 05107 g005
Obviously, the number of job groups is much more than that obtained using the existing classifier like kM, FCM, or SOM. The classification results were also different from those using CART alone without PCA (see Figure 6). However, some groups contained too few jobs to train a BPN. For this reason, only for groups with more than five jobs, BPNs were applied to improve the cycle times.
Jobs in each group were used to train the BPN of the group. In this study, a trail-and-error method was used to test the optimal number of neurons. When the learning rate (η) was 0.1, testing various numbers of neurons affected the prediction error. After repeatedly testing five times, when the number of hidden layer neurons was set to 8, the minimal prediction error (minimal RMSE) was obtained (Figure 7). To define the optimal parameter values of the epochs and learning rate (η) and to obtain the best prediction results, the changes in the prediction error when using various combinations of learning rates (η) and epochs were analyzed. The results indicated the minimal RMSE was achieved when the epochs were 50,000 and the learning rate (η) was 0.9 (Figure 8). However, that resulted in a very lengthy learning process. In addition, when the learning rate (η) was 0.9, setting the epochs to 30,000 was a favorable choice. For this reason, in the experiment, the learning rate (η) and epochs were set to 0.9 and 30,000, respectively.
Figure 6. The classification results using CART alone without PCA. (A) 1st run; (B) 2nd run; (C) 3rd run; (D) 4th run.
Figure 6. The classification results using CART alone without PCA. (A) 1st run; (B) 2nd run; (C) 3rd run; (D) 4th run.
Sustainability 06 05107 g006
After training, the BPN can be used to estimate the cycle times of jobs in the testing data that belong to this group. The estimation accuracy of the PCA-CART-BPN method was evaluated in terms of three criteria: the MAE, mean absolute percentage error (MAPE), and root mean squared error (RMSE). Table 4 summarizes the results. The performances of the CART, CART-BPN, and PCA-BPN methods are shown in Table 5, Table 6 and Table 7 for a comparison.
Figure 7. Various numbers of neurons affected the prediction error.
Figure 7. Various numbers of neurons affected the prediction error.
Sustainability 06 05107 g007
Figure 8. Effects of the number of epochs.
Figure 8. Effects of the number of epochs.
Sustainability 06 05107 g008
Table 4. The estimation accuracy of the PCA-CART-BPN method (four-fold cross-validation).
Table 4. The estimation accuracy of the PCA-CART-BPN method (four-fold cross-validation).
MAE (h)MAPERMSE (h)
First Run32.82.8%51.7
Second Run30.22.7%48.5
Third Run37.03.3%54.5
Fourth Run35.83.2%54.5
Average34.03.0%52.3
Table 5. The estimation accuracy of the CART method (four-fold cross-validation).
Table 5. The estimation accuracy of the CART method (four-fold cross-validation).
MAE (h)MAPERMSE (h)
First Run1159.2%100
Second Run907.4%87
Third Run1169.6%99
Fourth Run13211.0%109
Average1139.3%99
Table 6. The estimation accuracy of the CART-BPN method (four-fold cross-validation).
Table 6. The estimation accuracy of the CART-BPN method (four-fold cross-validation).
MAE (h)MAPERMSE (h)
First Run90.17.3%89.3
Second Run74.85.7%76.0
Third Run41.03.4%55.3
Fourth Run77.16.2%82.1
Average90.27.2%89.5
Table 7. The estimation accuracy of the PCA-BPN method (four-fold cross-validation).
Table 7. The estimation accuracy of the PCA-BPN method (four-fold cross-validation).
MAE (h)MAPERMSE (h)
First Run41.73.7%60.3
Second Run38.63.4%55.8
Third Run40.83.5%59.9
Fourth Run33.92.9%53.6
Average38.73.4%57.4
To evaluate which method among the multiple linear regression (MLR), BPN, CART, CBR, CART-BPN, PCA-BPN, kM-BPN, and PCA-CART-BPN methods was more accurate, the estimation performance of the eight methods was arranged, as shown in Table 8, and the RMSE, MAE, and MAPE were employed to determine estimation accuracy. However, we did not compare with methods based on expert collaboration, such as Chen [7] and Chen [10], because there is no collaboration part in the proposed methodology. In addition, some hybrid methods including kM-FBPN [15], FCM-BPN and SOM-FBPN [12] are similar in nature to kM-BPN, and therefore were not compared. Further, the symmetric-partitioning and incremental-relearning classification and BPN approach proposed by Chen [18] was not compared because the execution time was more than 5 min. Only the estimation performances to the testing/validation data were compared.
Table 8. Comparisons of the performances of various methods.
Table 8. Comparisons of the performances of various methods.
Performance MeasureMLRBPNCARTCBRCART-BPNPCA-BPNkM-BPNPCA-CART-BPN
RMSE (h)89.514413216875.757.411952.3
MAE (h)90.211711314070.738.79934.0
MAPE7.2%9.1%8.9%11.2%5.6%3.4%7.7%3.0%
(1)
As evident from Table 5 and Table 6, the most obvious advantage of CART-BPN over CART was regarding MAPE, which achieved 10%. The CART-BPN prediction error was smaller, whereas the CART prediction error was larger; thus, the CART-BPN predictions were more accurate. To validate the more accurate prediction of the CART-BPN, the paired t test was employed.
H0: When estimating the job cycle time, the estimation accuracy of the CART-BPN method is the same as that obtained from using the CART method.
H1: When estimating the job cycle time, the CART-BPN method is more accurate than the CART method.
As expected, the CART-BPN forecasting method was superior to the CART method. Table 9 summarizes the results of a comparative analysis. Therefore, a nonlinear analysis is indeed beneficial to the job cycle time estimation problem.
Table 9. CART-BPN and CART of the paired t test.
Table 9. CART-BPN and CART of the paired t test.
CART-BPNCART
Mean117.3283.8
Variance6779.723928.02
Number of observations3030
Pearson’s Correlation Coefficient 0.2363
Degree of freedom 29
t statistic −2.0193
P(T ≤ t) one-sided 0.0263
P(T ≤ t) two-sided 0.0527
(2)
Of these, the PCA-CART-BPN estimation efficacy exceeded that of the statistical regression model (MLR) by 41%. To make sure that such a difference is statistically significant, a paired t test was used:
H0: When estimating the job cycle time, the PCA-CART-BPN methodology for estimating accuracy is the same as using the statistical regression approach.
H1: When estimating the job cycle time, the PCA-CART-BPN methodology is more accurate than the statistical regression approach.
The comparison results are summarized in Table 10. As a result, the advantage of the PCA-CART-BPN methodology over statistical regression was statistically significant at α = 0.1.
Table 10. PCA-CART-BPN and regression of the paired t test.
Table 10. PCA-CART-BPN and regression of the paired t test.
PCA-CART-BPNMLR
Mean94.5337.32
Variance2609.491554.66
Number of observations3030
Pearson’s Correlation Coefficient 0.0287
Degree of freedom 29
t statistic −4.9247
P(T ≤ t) one-sided 0.0000156
P(T ≤ t) two-sided 0.0000312
(3)
When exploring the PCA-CART-BPN and grouping CART-BPN and kM-BPN models, utilizing the PCA-CART-BPN method reduced the MAE by 52% and 66%, respectively. Therefore, PCA-CART was a better job classifier than kM and CART.
(4)
The most obvious advantage of the proposed methodology over CART was up to 70% in terms of MAE.
(5)
The most obvious advantage of the proposed methodology over CART-BPN was up to 52% in terms of MAE, which confirmed the effectiveness of variable replacement using PCA.
(6)
If the differences between the attributes of two jobs could be fully reflected on their cycle times, CBR would be very effective. However, the results did not support this viewpoint.
(7)
The same problem happened to kM-BPN. Nevertheless, the performance of kM-BPN was quite close to that of the proposed methodology, which can be attributed to the approximation ability of BPN.
(8)
The execution times of the eight methods were compared in Table 11. Direct methods like MLR were really fast. Iterative methods like CART, CBR, and BPN were a bit slow. Nevertheless, training a BPN using the LM algorithm was also efficient. The execution times of some hybrid methods (CART-BPN, PCA-BPN, kM-BPN, and PCA-CART-BPN) were approximately equal, which supported the reasonability of using the proposed methodology instead of CART-BPN, PCA-BPN, or kM-BPN.
Table 11. The comparison results.
Table 11. The comparison results.
Execution Time (s)
MLR<1
BPN16
CART5
CBR15
CART-BPN127
PCA-BPN130
kM-BPN132
PCA-CART-BPN132

4. Managerial Implications

Estimating and shortening the cycle time of each job is an important task to maintain a competitive edge in the DRAM industry. For example, the famous DRAM maker, Samsung, implemented the short cycle time and low inventory (SLIM) method to estimate the cycle times and WIP levels for various manufacturing steps, so that a more effective control of the factory was possible. As a result, the average cycle times of some DRAM products were reduced from more than 80 days to less than 30 days, bringing Samsung a benefit of about 1 billion US$ [23].
Incorrectly estimating the cycle times of jobs increases the difficulties in projecting the monthly output of a wafer fabrication factory [10], which then misleads the production planning personnel in making the release plan. If many wafers are incorrectly outputted from a wafer fabrication factory during a month with very low average selling prices (ASPs), the wafer fabrication factory will suffer considerable losses. To illustrate this, an example is given as follows. There are two products, A and B, in the factory. The ASPs of the two products are shown in Table 12. The gross dies and wafer yields of the two products are the same: 500 dies per wafer and 95%. The monthly capacity of the factory is 10,000 pieces of wafers distributed between the two products according to their ASPs. The actual cycle times of the two products are two and three months, respectively. The release plan based on the correct cycle times is shown in Table 13. The yearly revenues (months 1 to 12) are 205 million US$. If the cycle time of product A is mistaken as three months, the release plan will be Table 14. The yearly revenues reduce to 203 million US$, resulting in a loss of two million dollars per year.
Table 12. The average selling prices (ASPs) of the two products.
Table 12. The average selling prices (ASPs) of the two products.
MonthASP (US$) (Product A)ASP (US$) (Product B)
−104.83.5
−113.43.4
−122.53.5
13.93.5
24.23.6
32.33.5
43.93.4
54.53.5
64.43.5
72.63.3
82.23.5
94.03.5
102.63.6
112.83.5
124.73.4
+12.73.5
+23.53.5
+34.23.3
Table 13. The correct release plan.
Table 13. The correct release plan.
MonthRelease (Product A)Release (Product B)
−1041395861
−1152364764
−1254414559
139656035
252854715
356054395
455364464
542905710
638606140
753084692
842535747
943965604
+1057034297
+1143795621
+1249915009
Table 14. The incorrect release plan.
Table 14. The incorrect release plan.
MonthRelease (Product A)Release (Product B)
−1052364764
−1154414559
−1239656035
152854715
256054395
355364464
442905710
538606140
653084692
742535747
843965604
957034297
+1043795621
+1149915009
+1254224578

5. Conclusions and Future Research Directions

Cycle time reduction continues to be one of the major concerns in controlling a wafer fabrication factory. On one hand, long cycle times lead to the accumulation in WIP. On the other hand, the risk of wafer contamination also increases when the cycle time of a job is long. Cycle time estimation, which is considered as a prerequisite for achieving that, has received much attention.
In this study, the PCA-CART-BPN approach is proposed to estimate the cycle time of a job in a wafer fabrication factory. First, PCA is applied to construct a series of linear combinations of original variables to form new variables that are as unrelated to each other as possible. According to the new variables, jobs are classified using CART before estimating their cycle times with BPNs. The advantage of such a treatment is the consideration of the estimation performance when classifying the jobs. To the contrary, in the existing methods such as kM-BPN, SOM-FBPN, and others, job classification only considers the differences between the attributes of jobs.
After evaluating the estimation performance of the proposed PCA-CART-BPN approach with a real case:
(1)
The advantage of the proposed PCA-CART-BPN approach over the existing methods was quite obvious. Three measures of the estimation accuracy, especially MAE, were improved after applying PCA-CART-BPN.
(2)
In addition, PCA, CART, and BPN are all mature methods supported by a lot of optimization packages (see Table 15), which is conducive to the joint use of the two methods.
(3)
On the other hand, the efficiency of PCA-CART-BPN was not poorer than that of the existing hybrid methods like CART-BPN, PCA-BPN, and kM-BPN.
Table 15. Some software packages for implementing PCA, CART, and BPN.
Table 15. Some software packages for implementing PCA, CART, and BPN.
MethodSoftware
PCAMATLAB, SAS, SPSS, Stata
CARTCART (Salford Systems), MATLAB, SAS
BPNBrian, GENESIS, MATLAB (Neural Network Toolbox), NEURON
In future studies, a collaboration mechanism can be incorporated into the PCA-CART-BPN approach, so that multiple experts can estimate the cycle time of a job collaboratively.

Acknowledgment

This work was supported by National Science Council of Taiwan.

Author Contributions

Toly Chen proposed the PCA-CART-BPN approach, collected the data, and wrote most of the content of this paper. Yu-Cheng Wang analyzed the data, prepared the figures of the results, and helped revise the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Leachman, R.C.; Hodges, D.A. Benchmarking semiconductor manufacturing. IEEE Trans. Semicond. Manuf. 1996, 9, 158–169. [Google Scholar] [CrossRef]
  2. Inoue, T.; Ishii, Y.; Igarashi, K.; Takagi, H.; Muneta, T.; Imaoka, K. Conveyer Belt Model to Analyze Cycle Time Conditions in a Semiconductor Manufacturing Line. In Proceedings of the IEEE International Symposium on Semiconductor Manufacturing Conference, Tokyo, Japan, 25–27 September 2006; pp. 74–77.
  3. Wein, L.M. On the relationship between yield and cycle time in semiconductor wafer fabrication. IEEE Trans. Semicond. Manuf. 1992, 5, 156–158. [Google Scholar] [CrossRef]
  4. Chen, T. A systematic cycle time reduction procedure for enhancing the competitiveness and sustainability of a semiconductor manufacturer. Sustainability 2013, 5, 4637–4652. [Google Scholar] [CrossRef]
  5. Chen, H.; Harrison, J.M.; Mandelbaum, A.; van Ackere, A.; Wein, L.M. Empirical evaluation of a queueing network model for semiconductor wafer fabrication. Oper. Res. 1988, 36, 202–215. [Google Scholar] [CrossRef]
  6. Pearn, W.L.; Chung, S.L.; Lai, C.M. Due-date assignment for wafer fabrication under demand variate environment. IEEE Trans. Semicond. Manuf. 2007, 20, 165–175. [Google Scholar] [CrossRef]
  7. Chen, T. An effective fuzzy collaborative forecasting approach for predicting the job cycle time in wafer fabrication. Comput. Ind. Eng. 2013, 66, 834–848. [Google Scholar] [CrossRef]
  8. Wu, N.; Chu, F.; Chu, C.; Zhou, M. Petri net modeling and cycle-time analysis of dual-arm cluster tools with wafer revisiting. IEEE Trans. Syst. Man Cybernet. Syst. 2013, 43, 196–207. [Google Scholar] [CrossRef]
  9. Hsieh, L.Y.; Chang, K.-H.; Chien, C.-F. Efficient development of cycle time response surfaces using progressive simulation metamodeling. Int. J. Prod. Res. 2014, 52, 3097–3109. [Google Scholar] [CrossRef]
  10. Chen, T. A fuzzy-neural knowledge-based system for job completion time prediction and internal due date assignment in a wafer fabrication plant. Int. J. Syst. Sci. 2009, 40, 889–902. [Google Scholar] [CrossRef]
  11. Chien, C.F.; Hsu, C.Y.; Hsiao, C.W. Manufacturing intelligence to forecast and reduce semiconductor cycle time. J. Intell. Manuf. 2012, 23, 2281–2294. [Google Scholar] [CrossRef]
  12. Chen, T. A hybrid look-ahead SOM-FBPN and FIR system for wafer lot output time prediction and achievability evaluation. Int. J. Adv. Manuf. Technol. 2007, 35, 575–586. [Google Scholar] [CrossRef]
  13. Chen, T. A fuzzy back propagation network for output time prediction in a wafer fab. Appl. Soft Comput. 2003, 2, 211–222. [Google Scholar] [CrossRef]
  14. Chang, P.-C.; Hsieh, J.-C. A neural networks approach for due-date assignment in a wafer fabrication factory. Int. J. Ind. Eng. 2003, 10, 55–61. [Google Scholar]
  15. Chen, T. An intelligent hybrid system for wafer lot output time prediction. Adv. Eng. Inf. 2007, 21, 55–65. [Google Scholar] [CrossRef]
  16. Chen, T.; Wang, Y.C. An iterative procedure for optimizing the performance of the fuzzy-neural job cycle time estimation approach in a wafer fabrication factory. Math. Probl. Eng. 2013, 2013, 1–15. [Google Scholar]
  17. Chen, T. A hybrid SOM-BPN approach to lot output time prediction in a wafer fab. Neur. Process. Lett. 2006, 24, 271–288. [Google Scholar] [CrossRef]
  18. Chen, T. The symmetric-partitioning and incremental-relearning classification and back-propagation-network tree approach for cycle time estimation in wafer fabrication. Symmetry 2014, 6, 409–426. [Google Scholar] [CrossRef]
  19. Breiman, I.; Friedman, J.H.; Olsen, R.A.; Stone, C.J. Classification and Regression Trees; Wadsworth: Pacific Growe, CA, USA, 1984. [Google Scholar]
  20. Steinburg, D.; Colla, P. Classification and Regression Trees; Salford Systems: San Diego, CA, USA, 1997. [Google Scholar]
  21. Firoze, A.; Arifin, M.S.; Rahman, R.M. Bangla user adaptive word Speech recognition: approaches and comparisons. Int. J. Fuzzy Syst. Appl. 2013, 3, 1–36. [Google Scholar]
  22. Rogers, L.L.; Dowla, F.U. Optimization of groundwater remediation using artificial neural networks with parallel solute transport modeling. Water Resourc. Res. 1994, 30, 457–481. [Google Scholar] [CrossRef]
  23. Ranganathan, A. The Levenberg-Marquardt Algorithm. Available online: http://www.scribd.com/doc/10093320/Levenberg-Marquardt-Algorithm (accessed on 28 February 2014).
  24. Leachman, R.C.; Kang, J.; Lin, V. SLIM: Short cycle time and low inventory in manufacturing at Samsung Electronics. Interfaces 2002, 32, 61–77. [Google Scholar] [CrossRef]
  25. Chang, P.-C.; Hsieh, J.-C.; Liao, T.W. Evolving fuzzy rules for due-date assignment problem in semiconductor manufacturing factory. J. Intell. Manuf. 2005, 16, 549–557. [Google Scholar] [CrossRef]
  26. Nguyen, S.; Zhang, M.; Johnston, M.; Tan, K.C. Genetic programming for evolving due-date assignment models in job shop environments. Evol. Comput. 2014, 22, 105–138. [Google Scholar] [CrossRef]

Share and Cite

MDPI and ACS Style

Chen, T.; Wang, Y.-C. Enhancing the Effectiveness of Cycle Time Estimation in Wafer Fabrication-Efficient Methodology and Managerial Implications. Sustainability 2014, 6, 5107-5128. https://doi.org/10.3390/su6085107

AMA Style

Chen T, Wang Y-C. Enhancing the Effectiveness of Cycle Time Estimation in Wafer Fabrication-Efficient Methodology and Managerial Implications. Sustainability. 2014; 6(8):5107-5128. https://doi.org/10.3390/su6085107

Chicago/Turabian Style

Chen, Toly, and Yu-Cheng Wang. 2014. "Enhancing the Effectiveness of Cycle Time Estimation in Wafer Fabrication-Efficient Methodology and Managerial Implications" Sustainability 6, no. 8: 5107-5128. https://doi.org/10.3390/su6085107

Article Metrics

Back to TopTop