Next Article in Journal
Existence Results and Finite-Time Stability of a Fractional (p,q)-Integro-Difference System
Previous Article in Journal
Analyzing Curvature Properties and Geometric Solitons of the Twisted Sasaki Metric on the Tangent Bundle over a Statistical Manifold
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

High-Precision Quality Prediction Based on Two-Dimensional Extended Windows

College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(9), 1396; https://doi.org/10.3390/math12091396
Submission received: 27 March 2024 / Revised: 21 April 2024 / Accepted: 29 April 2024 / Published: 3 May 2024

Abstract

:
A PLS-based quality prediction method is proposed for batch processes using two-dimensional extended windows. To realize the adoption of information in the directions of sampling time and batch, a newly defined region of support (ROS), called the k-i-back-extended region of support (KIBROS), is proposed; it establishes an extended window by adding two regions of data to the traditional ROS to include all possible important data for quality prediction. Based on the new ROS, extended windows are established, and different models are proposed using the extended windows for batch process quality prediction. Furthermore, using the typical injection molding batch process as an example, the proposed quality prediction method is experimentally verified, proving that the proposed methods have higher prediction accuracy than the traditional method and that the prediction stability is also improved.

1. Introduction

Precise variable prediction modeling depends on a good understanding of process characteristics. Industrial processes usually have characteristic changes in two directions. The first is the time direction within a production period, where the main conditions do not change, but some of them shift due to complex factors. During the production period, typically only one kind of production is provided. The characteristic changes in this direction lead to a series of phases. The second direction of change is the repeat production direction, where the main conditions can be maintained to obtain the same products, but the time-varying problem may still occur; however, the main conditions can be changed systematically to obtain different production types.
Multivariate statistical modeling methods are typical data-driven modeling methods, of which principal component analysis (PCA) [1] and partial least squares (PLS) [2] have been applied widely in the fields of industrial process modeling, monitoring, and control [3,4,5]. For batch processes, multilinear principal component analysis (MPCA) and multiway partial least squares regression (MPLS) are classic methods that address the three-dimensional matrix by enfolding it into a two-dimensional matrix [6,7]. Following their development, many research works have been presented [8,9,10,11,12]. Among them, some papers have focused on multiphase characteristics, where different statistical models are built for different phases [11,12]. Additionally, the relationships between phases were investigated for better modeling and prediction [13]. For both the continuous processes and the batch processes, the sliding window method is typically used to address the time-varying problem [12], and multiple models are usually built for multiple modes [14,15,16,17]. As mentioned above, the characteristic changes in the time direction and the repeat production direction have been topics of interest for researchers. However, these research works tend to deal with one problem each; that is, they tend to deal with the characteristic change problems in only one direction. This makes it difficult to apply these methods because the characteristics usually change in the two directions simultaneously. Thus, it is necessary to investigate a uniform modeling method that can address characteristic changes in both directions simultaneously.
Recently, a complete set of process modeling methods was developed to solve the process characteristics evolution problem in both the intra-batch and inter-batch directions [18], where the problems in the two directions are firstly handled separately and then the two strategies are combined. This kind of approach is flexible for selecting appropriate modeling methods in the two directions, and new strategies can be developed based on the scheme [19]. However, the separation of the methods is unavoidable. It is difficult to provide a comprehensive explanation of process characteristics using two unrelated methods. In addition, when the methods for both directions need to be combined, the method structure is relatively complex and difficult to comprehend and implement. Additionally, the idea of tracing the evolution problem of the process characteristics in both the intra-batch and inter-batch directions was also adopted and combined with multimode analysis. A two-dimensional, two-layer quality regression model was established to monitor multiphase, multimode batch processes [20]. With this method, two-dimensional regression traces the intra-batch and inter-batch characteristics, while the two-layer structure establishes the relationship between the target process and the historical modes and phases. The two-layer structure is more suitable for multimode and multiphase characteristics but is not suitable for slow time-varying characteristics in the two directions. Therefore, it is necessary to develop a set of modeling strategies to trace the slow time-varying intra-batch and inter-batch characteristics simultaneously.
This paper presents a comprehensive and thorough analysis of the region of support (ROS). In this study, which considers industrial processes with slow time-varying characteristics in both the time direction and the repeat production direction, a modeling algorithm using two-dimensional extended windows is proposed for quality prediction. In a previous work [21], the ROS was defined as the data before the time point of the current batch and the previous batches, which include the data of the previous sampling times in the same batch (KROS) and the data of the previous sampling times in the historical batches (KIROS). In this paper, the ROS is redefined and extended to four sub-regions, including all the possible regions in the time direction and the batch direction, which include the data of the previous sampling times in the same batch (KROS), the data of the same sampling time in the historical batches (IROS), the data of the previous sampling times in the historical batches (KIROS), and the data of the later sampling times in the historical batches (BROS). The data of the previous batches and those of the previous sampling times are considered to be correlated with the current sampling data; thus, a more precise online prediction of product quality can be provided. On this basis, the i-extended window, k-extended window, i-k-extended window, back-extended window, and i-k-back-extended window are proposed. Finally, quality prediction models are constructed using the extended windows. The injection molding process is used as an example for experimental verification. Suggestions are given for the usage of the extended window-based quality prediction methods.
The remainder of this paper includes the following: In the second section, the quality prediction algorithm, the partial least squares (PLS) method, and the establishment of the sliding window are first introduced. Then, the modeling strategy with the extended ROS—specifically, the two-dimensional i-k-back-extended ROS—is proposed, followed by the quality prediction method and two evaluation indicators. The application of the proposed method to the injection modeling process is introduced in the third section. The fourth section presents the conclusion.

2. Methodology

2.1. Partial Least Squares (PLS) Method

PLS is a classical regression method for quality prediction in industrial processes. It not only carries out regression modeling but also simplifies the structure of the data and analyzes the correlation between two groups of variables [16,22]. Using the PLS algorithm, the correlations between the process variables and the quality variables can be transformed into a latent variable space for research. It should be noted that the inputs for PLS should be two-dimensional matrices. PLS is a classical multivariate statistical method and is more suitable for addressing application backgrounds with fewer samples, like the injection molding process in this paper, compared to deep learning methods. PLS is efficient, and its prediction effect can meet the accuracy requirements; so, it was adopted in this work. Generally, deep learning methods require a large amount of data and are not suitable for processes with this background. However, a suitable deep learning method may be chosen as a substitute for processes with a large amount of data. For processes with limited data, a deep learning method with a reasonable algorithm may also be chosen as a substitute to address the limited data problem.
The main formula of PLS capturing the relationship between X ˜ and Y ˜ is as follows:
X ˜ = T P T + E = a = 1 A t a p a T + E Y ˜ = U Q T + F = a = 1 A u a q a T + F
where T and U represent the score matrices, P and Q represent the load matrices, and E and F represent the residual matrices.
The regression model based on PLS is as follows:
Y ˜ = X ˜ B
where B is the regression coefficient matrix.

2.2. Establishment of Sliding Window

For a typical batch process, the process data are often stored in the form of a three-dimensional matrix, which is denoted as X ¯ ( I × J x × K ) , where I represents the number of batches, J x represents the number of process variables, and K represents the number of sampling points in a batch. Meanwhile, at the end of one batch operation, the quality data can be obtained, which are stored in the form of a two-dimensional matrix, denoted as Y ¯ ( I × J y ) , where J y represents the number of quality variables. Usually, a batch process needs to be divided into several phases using some of the production knowledge of the specific process, and the critical-to-quality phases and critical-to-quality variables are identified. These phases are then treated separately. Within the phases, the slow time-varying problem is characterized by the relationship change between the process variables and the quality variables from batch to batch. The key step of the analysis is to divide the slow time-varying batch process into different modes and to build different models for them. The sliding window method was adopted in this study; a demonstration diagram is shown in Figure 1. In the figure, I represents the batch direction, j represents the process variable direction, and k represents the direction of the sampling points in a batch. The first window is established using the process data of the first I w batch, where the process variable matrix is denoted as X ( I w × J x × K ) and the corresponding quality data are denoted as Y ( I w × J y ) . When sliding, the window moves down by L batches each time; that is, the second window is formed using the data of the ( L + 1 ) -th to the ( L + I w ) -th batches. It is advised that L be made 1 [23].
The process data in the sliding window can be divided into K time slices, and the k-th can be represented as X k ( I w × J x ) . Traditionally, a PLS model can be built between the time-slice data X k and the quality data Y . Next, each element in the time-slice data X k , x i , k ( 1 × J x ) , is extended using novel ROSs.

2.3. Two-Dimensional Extended ROS

The schematic diagram of the ROS proposed in this paper is shown in Figure 2. For convenience, for the process data, X ( I w × J x × K ) , the variable direction is not shown in the figure, and each white circle represents a sample point x i , k ( 1 × J x ) . The previous neighbors of x i , k may affect it, including not only past points in the time direction, x i , k 1 , x i , k 2 ,…, x i , 1 , but also past points in the batch direction, x i 1 , k , x i 2 , k ,…, x 1 , k , past points in both the time direction and the batch direction, x i 1 , k 1 , x i 1 , k 2 ,…, x i 1 , 1 , x i 2 , k 1 ,…, x i 2 , 1 ,…, x 1 , 1 , and even past points of the past batches but with time indexes larger than k, x i 1 , k + 1 , x i 1 , k + 2 ,…, x i 1 , K , x i 2 , k + 1 ,…, x i 2 , K ,…, x 1 , k + 1 ,…, x 1 , K . The region covering the above points which may influence the contribution of the current point to the final quality is defined as the novel ROS. Thus, these four sub-regions of data are called the k-extended region of support (KROS), the i-extended region of support (IROS), the k-i-extended region of support (KIROS), and the back-extended region of support (BROS). Their specific locations are shown in Figure 2. The novel k-i-back-extended region of support (KIBROS) includes the above four ROSs.
In the cited paper [21], the ROS was defined as the data before the time point of the current batch and previous batches, including the k-extended and k-i-extended ROSs in Figure 2. That is, in the previous work, only the two parts on the left side of Figure 2 were defined as ROSs. In this paper, the ROS is redefined and extended to four sub-regions, including all the possible regions in the time direction and the batch direction. Correspondingly, the observed element x i , k is extended to the four specific ROSs.
In previous batch process research, the variable data of the k-th sampling time in each batch were formed into a time-slice matrix, X k ( I × J x ) , and used to determine the relationship between the sampling process data and the quality data. Then, all the time-slice matrices were used to obtain the final prediction quality [24]. In this work, all the samples in the ROSs were considered at the same time when each sampling point was modeled. Thus, the established model contains the corresponding information, and the prediction accuracy can be further improved. Based on the above data in the ROSs, a two-dimensional extended matrix is proposed, whose specific description is as follows.
Step 1. Extension of the samples in the k-extended region of support (KROS).
The past points in the time direction; that is, the samples in the KROS, x i , k 1 , x i , k 2 ,…, x i , 1 , were used to construct the extended ROS in the time direction. And the extended process data matrix can be obtained as shown below, while the quality data vector remains since it represents the final quality for the whole batch. It should be noted that for the final quality, which can only be obtained at the end of each batch process, the k index is not necessary.
x i , k F = [ x i , k F + 1 , , x i , k 1 , x i , k ]
y i F = y i
where F represents the total number of sampling times included; in other words, a total of F 1 sampling times are extended forward.
Step 2. Extension of the samples in the i-extended region of support (IROS).
The following is the method used to build the extended ROS in the batch direction. For the variable vector of the i-th batch at the k-th sampling time, x i , k ( 1 × J ) , the variable vector of the (i-1)-th batch, x i 1 , k , is spliced at the back of x i , k , and the variable vectors of the (i-2)-th, (i-3)-th,..., (i-N+1)-th batches are spliced behind the matrix in turn, where N is the number of batches included. The extended process variable matrix is shown below, and similarly, the quality matrix is established.
X i , k N = x i , k x i 1 , k x i N + 1 , k
Y i N = y i y i 1 y i N + 1
Step 3. Extension of the samples in the k-i-extended region of support (KIROS).
Combining the previously proposed KROS and the IROS, the subsequently proposed method can further realize the construction of a two-dimensional extended matrix based on the time-batch evolution information.
First, for the i-th batch, the data are extended in the direction of the sampling time according to the method mentioned in Step 1 to obtain a matrix x i , k F .
Then, the i-th to the (i–− N + 1)-th batches in the IROS are extended according to the time direction, and the following extended process data matrix is obtained, as well as the extended quality data matrix:
X i , k F , N = x i , k F + 1 x i , k 1 x i , k x i 1 , k F + 1 x i 1 , k 1 x i 1 , k x i N + 1 , k F + 1 x i N + 1 , k 1 x i N + 1 , k
Y i F , N = y i y i 1 y i N + 1
Step 4. Extension of the samples in the back-extended region of support (BROS).
Since, in the above work, only the data before the k-th sample time were adopted, it was necessary to further consider the data information after the k-th sample time to further improve the accuracy of the quality prediction. The construction method for the backward extended matrix is symmetric with the two-dimensional extended matrix based on the time-batch evolution information proposed in the previous step. In addition, since it is not possible to obtain the information of the current i-th batch after the k-th sample time during modeling, and considering the similarity of the processes between batches in the batch process, the data after the k-th time of the (i–−1)-th batch are used as an alternative; then, the corresponding position of the current batch is filled. A schematic diagram of the specific process is shown in Figure 3. In this figure, different batches are represented by different colors and since the I-th batch is the same with the (I-1)-th batch, the same color is used for them.
A backward extended process data matrix and a backward extended quality data matrix can be constructed using the abovementioned method in the batch and sampling time directions, as shown below:
X i , k B = x i 1 , k + 1 x i 1 , k + 2 x i 1 , k + B x i 1 , k + 1 x i 2 , k + 1 x i 1 , k + 2 x i 2 , k + 2 x i 1 , k + B x i 2 , k + B x i N + 1 , k + 1 x i N + 1 , k + 2 x i N + 1 , k + B
Y i B = y i y i 1 y i 2 y i N + 1
where B is the number of backward samples included.
Step 5. Extension of the samples in the k-i-back-extended region of support (KIBROS).
Using sampled data from the whole range of ROSs, a method for constructing a k-i-back-extended matrix based on time-batch evolution information is subsequently proposed. First, the process data at the k-th sampling time in the i-th batch currently considered, x i , k , are extended in the direction of the sampling time to obtain X i , k F , N , B , expressed as Equation (11), followed by the quality data:
X i , k F , N , B = x i , k F + 1 x i , k 1 x i , k x i 1 , k F + 1 x i 1 , k 1 x i 1 , k x i N + 1 , k F + 1 x i N + 1 , k 1 x i N + 1 , k x i 1 , k + 1 x i 1 , k + 2 x i 1 , k + B x i 1 , k + 1 x i 1 , k + 2 x i 1 , k + B x i N + 1 , k + 1 x i N + 1 , k + 2 x i N + 1 , k + B
Y i F , N , B = y i y i 1 y i N + 1
The parameters that determine the size of the expanded matrix are F, N, and B, as mentioned above. During actual construction, these three parameters can be determined one by one to achieve the best prediction effect.
As previously mentioned, the quality prediction model is built between the time-slice data X k and the quality data Y . In this work, each element in the time-slice data X k , x i , k ( 1 × J x ) , was extended using the proposed x i , k F , X i , k N , X i , k F , N , X i , k B , and X i , k F , N , B , according to the proposed new ROSs, obtaining the extended time-slice data X k , e x . Similarly, each element in the quality data Y , y i , was extended using the proposed y i F , Y i N , Y i F , N , Y i B , and Y i F , N , B , obtaining the extended quality data Y e x . Subsequently, the quality perdition model was built.

2.4. Quality Prediction Method and Evaluation Indicators

Following the construction of the extended data matrix within each sliding window, PLS was used to finish the quality prediction. As mentioned above, the PLS model based on an extended window is expressed as follows:
X c , k , e x = T c , k , e x P c , k , e x T + E c , k , e x = a = 1 A t c , k , e x , a p c , k , e x , a T + E c , k , e x Y c , e x = U c , k , e x Q c , k , e x T + F c , k , e x = a = 1 A u c , k , e x , a q c , k , e x , a T + F c , k , e x
where X c , k , e x represents the extended time-slice data matrices, Y c , e x represents the extended quality data matrices, and c represents the c-th phase for the multiphase processes. The regression model is as follows:
Y c , e x = X c , k , e x B c , k , e x
For the batch whose quality is predicted online, the process variable matrix is denoted as X k . The extended matrix of this test batch was established using the method proposed in Section 2.3, denoted as X c , k , e x , and it was substituted into the following formula to obtain the prediction quality.
Y ˜ c , k , e x = X c , k , e x B c , k , e x
A few indicators were used to evaluate the performance of the prediction methods based on the model. The prediction accuracy, R k , c 2 , of the quality prediction model of the k-th sampling time within the c-th phase was calculated as follows:
R k , c 2 = i = 1 I ( y ˜ i , c , k , e x y ¯ ) 2 i = 1 I ( y i y ¯ ) 2
where y i is the measured value of the quality variable of the i-th batch, y ˜ i , c , k , e x is the predicted value of one of the quality variables of the k-th sampling time within the c-th phase, and y ¯ is the average value of the quality variable measured at the end of the i-th batch. The value range of R k , c 2 is 0–1. If R k , c 2 approaches 1, it indicates that the precision of the quality prediction model is high. If R k , c 2 approaches 0, it means that the change in this phase does not explain the change in quality well.
Furthermore, R c 2 ¯ is proposed to indicate the average impact of the c-th phase:
R c 2 ¯ = k = k s t a r t k e n d R k , c 2 / ( k e n d k s t a r t )
where k s t a r t and k e n d represent the starting and last sampling points in the c-th phase, separately. In this study, the offline quality analysis was carried out based on the comparison of R c 2 ¯ and R k , c 2 .
The root mean square error (RMSE) was used to verify the prediction accuracy of the model, which is expressed using the following formula:
R M S E = i = 1 r d i 2 / r
where r is the number of measurement groups, and d i is the deviation between one group of the predicted values and the real value of quality. As can be seen in the formula, the smaller the RMSE, the more accurate the prediction results. The comparison of the online prediction precision of each method was based on the evaluation of the RMSE.

3. Illustration and Discussion

3.1. Introduction of Injection Molding Process

Injection molding technology is an important plastic processing method, and the injection molding process is a typical batch process. During the injection molding process, there are often drifts in the operating conditions and other unknown perturbations that can lead to unstable product quality. Therefore, determining how to accurately predict the product quality to adjust the process parameters in time so that the product quality can meet the requirements is the basic research direction of quality control for the injection molding process. A schematic diagram of a typical injection molding machine is shown in Figure 4 [12].
According to the characteristics of injection molding processes, the entire injection molding process is divided into four phases, namely the injection phase, the pressure holding phase, the plasticization phase, and the cooling phase. These four phases are considered to be the critical-to-quality operation phases that determine the product quality. During the injection molding process, many process variables will affect the final product quality, and the values of these process variables can be measured by the corresponding sensors. At the end of each batch, the product quality information, that is, the weight of each product, can also be measured. The key process variables involved in this experiment are listed in Table 1.

3.2. Variable Analysis of Injection Molding Process and Experiment Condition

The material used in this work was high-density polyethylene (HDPE), and the specific operating conditions are shown in Table 2 [12]. In the experiment, the sampling information of 100 batches of a real injection molding process was used. Each batch contained 919 sampling points.
Based on the knowledge of the injection molding process, all the sampling points in a batch were divided into four phases. The sample points of the injection phase were samples 1–220; the sample points of the packing–holding phase were samples 221–519; the sample points of the plasticizing phase were samples 520–729; the sample points of the cooling phase were samples 730–919.
The method mentioned in Section 2.2 was used to establish the sliding windows. The number of batches in each window was 30, the sliding step size was L = 1, and a total of 71 sliding windows were established.

3.3. Application of Prediction Model and Identification of Related Parameters

3.3.1. Application of One-Dimensional Extended Window Model Using the Samples in KROS Only

A prediction model using k-extended data matrices in the KROS based on sampling time information was constructed within each sliding window using the method mentioned in Section 2. The number of sampling times for the forward extension, F, changed from 0 to 10, and for the c-th phase, the indicator R c 2 ¯ values were analyzed to assess the predictive ability of the model and are given in Table 3.
It can be seen that in the different phases, the changing trend of R c 2 ¯ roughly increased, and at the same time, this increasing trend gradually slowed down. Therefore, to balance the prediction accuracy and the algorithm complexity, it was necessary to set a threshold ε and to stop the extension when the rate of change was less than this threshold. The expression for the change rate can be written as:
Δ = R c , F + 1 2 ¯ R c , F 2 ¯ R c , F 2 ¯
where R c , f 2 ¯ represents the value of R c 2 ¯ in a situation where the extended window includes the F-1 forward sampling time for the modeling. If ε is taken as 0.0025, then F = 6. Therefore, in the following discussion, F is assigned a value of 6. Taking sliding window 30 and sliding window 50 as examples, the R k , c 2 of each sampling time and the mean value of the whole sampling time are given in Figure 5.
Figure 5 shows the performance of the quality prediction model constructed using the traditional method and the method based on the k-extended window method. Here, the traditional method does not consider the ROS. Since, theoretically, all four possible ROSs should be analyzed individually, which is the main contribution of this work, the original two ROSs are not analyzed as a unit. As can be seen in Figure 5, using the extended window modeling in the sampling time direction, from the perspective of sampling times, the average performance was boosted, and most R k , c 2 values of the sampling points were improved. The discussion above verifies the rationality of the proposed method based on the k-extended window method.

3.3.2. Application of Extended Window Model Using the Samples in IROS and KIROS

Similarly, the prediction model using the i-extended data matrices in the IROS based on batch information was constructed within each sliding window using the method mentioned in Section 2, as well as the prediction model using the k-i-extended data matrices in the KIROS. The number of sampling times for the forward extension, F, was uniformly set to 0 and 6, and on this basis, the number of extended batches, N, changed from 0 to 10. The indicator R c 2 ¯ values were analyzed to assess the predictive ability of the model and are given in Table 4.
It can be easily seen in the table that the R c 2 ¯ values roughly increased, and at the same time, the increase rates gradually decreased. Here, the threshold ε was 0.015, and N was set to 5. From the above table, the following conclusions can be drawn. First, when 10 batches were extended downward, the R c 2 ¯ values of Phase 1, Phase 2, and Phase 4 all reached more than 0.9. Compared to the extended window in the sampling time direction, the extended window in the batch direction achieved a better prediction effect. Second, comparing the cases where F was 0 and F was 6 in the table, it was found that the R c 2 ¯ of the latter was higher than the former under the condition of the same batch extension quantity. That is, when the windows in the direction of both the sampling time and the batch were used in combination, the effect was superimposed. This illustrates the rationality of constructing a two-dimensional extended window based on time-batch evolution information. Window 30 and window 50 were also selected to obtain the R k , c 2 values of each sampling time, as shown in Figure 6.
Figure 6 shows the performance of the quality prediction model constructed using three methods: the traditional method, the method based on the i-extended window, and the method based on the k-i-extended method. It can be seen in Figure 6 that the R k , c 2 values at most of the sampling points were improved due to the construction of the extended window. Furthermore, the R k , c 2 values of the two-dimensional extended window were better than that of the one-dimensional extended window. In addition, it can be observed that the gain in R k , c 2 due to the extension of the sampling time direction was significantly smaller than that of the extended batch. In the injection molding process, the correlation between two neighboring batches is relatively strong, which is in line with relevant production knowledge. In conclusion, the above discussion justifies the prediction method based on the i-extended window and the k-i-extended window.

3.3.3. Application of Extended Window Prediction Model Using the Samples in BROS

The method described in Section 2.3 was used to construct an extended window based on the backward time information of historical batches. During the analysis, the number of backward sampling times for backward extension, B, varied from 0 to 10. At the same time, the additivity of the extension in the batch direction was studied, and the number of expanded batches, N, was set to 1 and 5. The R c 2 ¯ values are given in Table 5.
The following conclusions can be drawn from the above table. For the proposed backward extended window, the R c 2 ¯ decreased in the initial stage of the gradual increase in the extension point. After about five sampling times, the R c 2 ¯ began to rise. In some cases, the R c 2 ¯ increased rapidly until it exceeded the initial state, such as N = 0, Phase 1, while some increased slowly. First, the decrease in the R c 2 ¯ in the initial situation occurred. As proposed in the method, since the data after the k-th sampling time of the current i-th batch could not be obtained during the online quality prediction, to fill in the missing data, the corresponding time information of the (i − 1)-th batch was selected to replace these missing data. This approximate substitution resulted in a slight loss in the R c 2 ¯ . As a result of this substitution, the time correlation of the process variable data with the quality data of the current batch was reduced. This phenomenon became more obvious as the extended batch increased, which can be seen in the comparison between N = 1 and i = 5. However, as the number of backward sampling times included continued to increase, the R c 2 ¯ value gradually increased. In some cases, an extension of fewer than 10 sampling times exceeded the initial situation, which proves that the backward moment is indeed related to the currently considered sampling time. The positive effect of the backward extended window is reflected. Combined with the technological characteristics of the injection molding process, the reasons for this phenomenon are explained as follows. During the backward extension, the sampling k-th time involved gradually increases. As the data increase, they become obviously closer to the current batch, and the correlation becomes stronger. At the same time, with the continuous growth at the data scale, the relationship between the process variables becomes gradually significant. These two effects gradually recover the R c 2 ¯ values of the prediction model. As for the differences in the growth speed among the different phases, they are related to the differences among the phases in the actual injection molding process. Finally, the suitable size of the proposed backward extended window was obtained. Through the extension of the backward sampling time of the historical batch, with the continuous addition of sampling points, the R c 2 ¯ values show an unstable upward trend. In other words, unlike the forward extension discussed earlier, the trend of the R c 2 ¯ changes in this case did not appear to slow down significantly. The reason is that the backward sampling points on the historical batches are all relatively far apart for the k-th sampling time of the i-th batch considered when modeling. This long distance weakens the difference between the backward extension points. Therefore, in the process of continuous extension, the goodness of fit can be uniformly larger. In addition, the sampling time k* of the historical batch was observed. Because batches of the injection molding process are carried out successively, when k* is far from k, it is closer to batch i; when k* is close to k, it is farther from batch i. This shows that there is a check-and-balance relationship between the sampling time and the batch, which brings additional uncertainty and, therefore, leads to an increase in goodness-of-fit fluctuations. Based on the above discussion, a larger point of the backward extension performs better. The main purpose was to overcome the negative impact caused by the extension points that were too small. At the same time, to consider the complexity of the algorithm, the number of backward extensions was selected as 10.
Window 30 and window 50 were also selected to obtain the R k , c 2 of each sampling point using the traditional method, the backward extended window method with one extended batch, and the backward extended window method with five extended batches. The results are shown in Figure 7.
As shown in Figure 7, compared to those of the traditional method, the R k , c 2 values of the model built using the back-extended window were improved; at the same time, when more batches were expanded, the R k , c 2 values of the model improved. This matches the conclusion obtained previously; that is, the backward extension of the data at the sampling time can help to improve the prediction model. The benefits obtained by the extension of the batch and the extension of the backward sampling times can also be superimposed.

3.3.4. Application of the k-i-Back-Extended Window Model

In this section, the rationality of the k-i-back-extended window based on the time-batch evolution information is verified. The sets of parameter combinations are listed in Table 6, and the corresponding R c 2 ¯ values are analyzed separately.
It can be seen in Table 6 that with the continuous extension of the data, the R c 2 ¯ values continue to increase, which proves that the R c 2 ¯ obtained using the extension in the various directions discussed above can be continuously increased.

3.4. Online Quality Analysis and Prediction

Online quality prediction was conducted using the proposed model prediction method based on the time-batch evolution information. For a better comparison, in addition to using the proposed k-i-back-extended window model, the traditional method, the k-extended window method, the i-extended window method, the k-i-extended window method, and the back-extended window method were used to perform the task of quality prediction. The parameter settings for these experiments are shown in the Table 7.
An analysis was also performed using the sliding windows, and the middle batch of each window was selected as the test batch to verify the accuracy of each prediction method. First, to analyze the stability of the prediction model with the proposed ROS, four typical test batches were chosen to present the online prediction results. For the four test batches, online quality prediction was performed, and the predicted weight results obtained using the six methods at each sampling time are plotted in Figure 8.
It can be clearly seen in Figure 8 that the prediction performance of the traditional method was more unstable compared to the proposed methods based on extended windows, especially in window 51 and window 71. The prediction results of the proposed method are more stable than the traditional method; that is, the prediction methods of the four extended window models based on the proposed method are significantly better than the traditional method regarding stability. To further understand the accuracy gained using the proposed four models, the mean RMSE values corresponding to all the sampling times in each test batch are listed in Table 8.
It can be seen that in the selected test batches given above, the prediction method based on the k-i-back-extended window generally achieved the best results. The prediction method based on this kind of extended window had the lowest RMSE on average. In the 11th window, its RMSE indicator reached a low value of 2.587 × 10−4. In addition, it should be noted that in the four methods for each test batch, the RMSE could only reach the lowest value with an extension in the direction of the sampling time. But this advantage is obviously unstable. For example, for batch 31, the prediction model that only used the sampling time to extend the window was not even as good as the traditional method. In other words, when only a one-dimensional extension is performed, the accuracy advantage may only be obtained in individual test batches. Based on the above, the effects of the five methods proposed in this paper on quality prediction were proven.
Therefore, combining the R c 2 ¯ index and R k , c 2 index obtained in the previous section, the following conclusions can be drawn. Firstly, compared to the traditional method, an extension in the direction of the batches or in the direction of the historical sampling time can achieve the purpose of improving the accuracy, and an extension in either the sampling time or batch direction is meaningful. Secondly, by considering the evolution information of the two directions in the model at the same time, the establishment of an extended window based on the time-batch evolution information of the sampling in both directions can further improve the accuracy and effectively improve the stability of the prediction model. In addition, in the actual production process, several modeling methods based on the extended window proposed above are suggested. When the production is in pursuit of the obtainment of high-precision products quickly and easily, it is recommended to use the one-dimensional extended window. When the production accuracy requirement is very high, then the two-dimensional windows should be used to build quality prediction models.

4. Conclusions

This paper proposes a new quality prediction strategy for batch processes. In the proposed method, newly defined ROSs are used to build five different extended windows, which contain the data information of historical batches and historical sampling times. The quality prediction models of these extended windows have richer batch-time operation evolution information, and the evolution information results in higher prediction accuracy. When all the sampled data in the KIBROS is considered in the model, the k-i-back-extended window is established, and the highest accuracy is obtained in the application to the injection molding process. For complex production situations, when product quality accuracy is critical, the two-dimensional KIBROS extended window is recommended. However, the application background of this work is the injection molding process, where sample data are limited. Although the representative multivariable statistical method, PLS, offered satisfactory modeling results in this work, deep learning methods are also recommended for general processes with a large amount of data to adapt the method to real-time monitoring and control applications.

Author Contributions

Conceptualization, L.Z.; methodology, L.Z. and J.Y.; software, J.Y.; validation, J.Y.; formal analysis, J.Y.; investigation, J.Y.; resources, L.Z.; data curation, L.Z.; writing—original draft preparation, L.Z. and J.Y.; writing—review and editing, L.Z.; visualization, J.Y.; supervision, L.Z.; project administration, L.Z.; funding acquisition, L.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (No. 61503069) and the Fundamental Research Funds for the Central Universities (N2404010).

Data Availability Statement

The data will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Dunteman, G. Principal Component Analysis; SAGE Publication Inc.: Los Angeles, CA, USA, 1989. [Google Scholar]
  2. Geladi, P.; Kowalshi, B. Partial least squares regression: A tutorial. Anal. Chim. Acta 1986, 185, 1–17. [Google Scholar] [CrossRef]
  3. Wang, X. Data Mining and Knowledge Discovery for Process Monitoring and Control; Springer: Berlin/Heidelberg, Germany, 1999. [Google Scholar]
  4. Kourti, T.; MacGregor, J. Process analysis, monitoring and diagnosis, using multivariate projection methods. Chemom. Intell. Lab. Syst. 1995, 28, 3–21. [Google Scholar] [CrossRef]
  5. Wang, H. Partial Least Squares Regression Method and Its Application; National Defense Industry Press: Beijing, China, 1999. [Google Scholar]
  6. Nomikos, P.; MacGregor, J. Monitoring batch processes using multiway principal component analysis. AIChE J. 1994, 40, 1361–1375. [Google Scholar] [CrossRef]
  7. Nomikos, P.; MacGregor, J. Multi-way partial least squares in monitoring batch processes. Chemom. Intell. Lab. Syst. 1995, 30, 97–108. [Google Scholar] [CrossRef]
  8. Wold, S.; Kettaneh, N.; Fridén, H.; Holmberg, A. Modelling and diagnostics of batch processes and analogous kinetic experiments. Chemom. Intell. Lab. Syst. 1998, 44, 331–340. [Google Scholar] [CrossRef]
  9. Duchesne, C.; MacGregor, J. Multivariate analysis and optimization of process variable trajectories for batch process. Chemom. Intell. Lab. Syst. 2000, 51, 125–137. [Google Scholar] [CrossRef]
  10. Cho, H.; Kim, K. A method for predicting future observations in the monitoring of a batch process. Qual. Technol. 2003, 35, 59–69. [Google Scholar] [CrossRef]
  11. Lu, N.; Wang, F.; Gao, F. Sub-PCA modeling and online monitoring strategy for batch processes. AIChE J. 2004, 50, 255–259. [Google Scholar] [CrossRef]
  12. Zhao, L.; Zhao, C.; Gao, F. Inter-batch-evolution-traced process monitoring based on inter-batch mode division for multi-phase batch processes. Chemom. Intell. Lab. Syst. 2014, 138, 178–192. [Google Scholar] [CrossRef]
  13. Zhao, L.; Wang, F.; Chang, Y.; Wang, S.; Gao, F. Phase-based recursive regression for quality prediction of multi-phase batch processes. In Proceedings of the 13th IEEE International Conference on Control and Automation, Ohrid, Macedonia, 3–6 July 2017; pp. 283–288. [Google Scholar]
  14. Hwang, D.; Han, C. Real-time monitoring for a process with multiple operating modes. Control Eng. Pract. 1999, 7, 891–902. [Google Scholar] [CrossRef]
  15. Chen, J.; Liu, J. Mixture principal component analysis models for process monitoring. Ind. Eng. Chem. Res. 1999, 38, 1478–1488. [Google Scholar] [CrossRef]
  16. Zhao, L.; Zhao, C.; Gao, F. Between-mode quality analysis based multi-mode batch process quality prediction. Ind. Eng. Chem. Res. 2014, 53, 15629–15638. [Google Scholar] [CrossRef]
  17. Zhao, L.; Huang, X. Slow time-varying batch process quality prediction based on batch augmentation analysis. Sensors 2022, 22, 512. [Google Scholar] [CrossRef] [PubMed]
  18. Zhao, L.; Huang, X.; Yu, H. Quality-analysis-based process monitoring for multi-phase multi-mode batch processes. Processes 2021, 9, 1321. [Google Scholar] [CrossRef]
  19. Zhao, L.; Yang, J. Batch process monitoring based on quality-related time-batch 2D evolution information. Sensors 2022, 22, 2235. [Google Scholar] [CrossRef] [PubMed]
  20. Zhao, L.; Huang, X. Two-dimensional, two-layer quality regression model based batch process monitoring. Processes 2022, 10, 43. [Google Scholar] [CrossRef]
  21. Yao, Y.; Lu, N.; Gao, F. Two-dimensional dynamic PCA with auto-selected support region. IFAC Proc. Vol. 2007, 40, 69–74. [Google Scholar] [CrossRef]
  22. You, L.X.; Chen, J. Autogenerated multilocal PLS models without pre-classification for quality monitoring of nonlinear processes with unevenly distributed data. Ind. Eng. Chem. Res. 2022, 16, 5898–5913. [Google Scholar] [CrossRef]
  23. Zou, M.; Zhao, L.; Wang, S.; Chang, Y.; Wang, F. Quality analysis and prediction for start-up process of injection molding processes. IFAC PapersOnLine 2018, 51, 233–238. [Google Scholar] [CrossRef]
  24. Jia, R.; Mao, Z.; Wang, F. KPLS model based product quality control for batch processes. CIESC J. 2013, 64, 1332–1339. [Google Scholar]
Figure 1. Schematic diagram of sliding window.
Figure 1. Schematic diagram of sliding window.
Mathematics 12 01396 g001
Figure 2. Schematic diagram of novel ROS.
Figure 2. Schematic diagram of novel ROS.
Mathematics 12 01396 g002
Figure 3. Backward extension in the direction of the batch.
Figure 3. Backward extension in the direction of the batch.
Mathematics 12 01396 g003
Figure 4. Schematic diagram of the injection molding machine.
Figure 4. Schematic diagram of the injection molding machine.
Mathematics 12 01396 g004
Figure 5. R k , c 2 based on the KROS in (a) sliding window 30 and (b) sliding window 50.
Figure 5. R k , c 2 based on the KROS in (a) sliding window 30 and (b) sliding window 50.
Mathematics 12 01396 g005
Figure 6. R k , c 2 based on the IROS and KIROS in (a) sliding window 30 and (b) sliding window 50.
Figure 6. R k , c 2 based on the IROS and KIROS in (a) sliding window 30 and (b) sliding window 50.
Mathematics 12 01396 g006
Figure 7. R k , c 2 based on the BROS in (a) sliding window 30 and (b) window 50.
Figure 7. R k , c 2 based on the BROS in (a) sliding window 30 and (b) window 50.
Mathematics 12 01396 g007
Figure 8. Predicted results of each sampling time in (a) sliding window 11, (b) sliding window 31, (c) sliding window 51, and (d) sliding window 71.
Figure 8. Predicted results of each sampling time in (a) sliding window 11, (b) sliding window 31, (c) sliding window 51, and (d) sliding window 71.
Mathematics 12 01396 g008
Table 1. Process variables of the injection molding process.
Table 1. Process variables of the injection molding process.
No.Variable DescriptionUnit
1Nozzle temperature°C
2Screw speedMm/s
3Cylinder pressureBar
4Plasticizing pressureBar
5SV1 valve opening%
6SV2 valve opening%
Table 2. Operating condition settings for the injection molding process.
Table 2. Operating condition settings for the injection molding process.
Operating ParameterSet Value
MaterialHigh-density polyethylene (HDPE)
Injection velocity24 mm/s
Packing pressure200 bar
Mold cooling water25 °C
Barrel temperature230 °C
Packing time3 s
Cooling time15 s
Table 3. R c 2 ¯ based on the k-extended window.
Table 3. R c 2 ¯ based on the k-extended window.
FPhase 1Phase 2Phase 3Phase 4Whole
00.66740.67010.72360.75510.7031
10.67350.67180.72840.75900.7116
20.67840.67390.73160.76210.7149
30.68340.67600.73490.76480.7177
40.68810.67790.73780.76680.7202
50.69240.67940.74080.76870.7224
60.69610.68050.74380.76990.7244
70.69930.68130.74700.77090.7264
80.70210.68200.75000.77140.7282
90.70460.68260.75290.77180.7299
100.70680.68330.75570.77200.7315
Table 4. R c 2 ¯ based on the k-extended window and k-i-extended window.
Table 4. R c 2 ¯ based on the k-extended window and k-i-extended window.
FNPhase 1Phase 2Phase 3Phase 4Whole
000.66740.67010.72360.75510.7031
10.71250.69480.77810.77270.7281
20.76200.75260.81800.81930.7596
30.79510.80700.84580.83230.7934
40.83590.82800.86860.84450.8129
50.85850.85910.89110.86410.8332
60.87230.87950.89150.87880.8468
70.89030.88790.78270.90140.8576
80.90110.90630.81210.91190.8709
90.90980.91650.82710.92030.8804
100.91920.91960.83920.92520.8850
600.69610.68050.74380.76990.7244
10.73430.69870.79770.79040.7409
20.77850.75780.83930.83130.7699
30.80880.81150.86520.84190.8033
40.84650.83210.88410.85380.8214
50.86690.86280.90500.87260.8409
60.87990.88300.90230.88630.8540
70.89740.89100.78860.90850.8649
80.90720.90950.81750.91780.8778
90.91540.91930.83120.92560.8868
100.92460.92230.84230.92950.8913
Table 5. R c 2 ¯ values based on the back-extended window.
Table 5. R c 2 ¯ values based on the back-extended window.
FBPhase 1Phase 2Phase 3Phase 4Whole
100.71250.69480.77810.77270.7281
10.70840.69030.77450.76780.7268
20.70620.68730.77260.76590.7273
30.70440.68510.77190.76620.7279
40.70500.68360.77180.76820.7288
50.70690.68290.77210.77020.7299
60.70930.68270.77240.77240.7311
70.71170.68290.77290.77430.7322
80.71400.68330.77350.77590.7334
90.71620.68410.77410.77730.7343
100.71790.68490.77480.77820.7349
500.85850.85910.89110.86410.8332
10.85030.85280.88900.85640.8277
20.84670.84820.88790.85290.8251
30.84500.84600.88750.85070.8244
40.84430.84480.88790.84880.8245
50.84430.84420.88850.84710.8250
60.84490.84390.88970.84550.8257
70.84600.84390.89100.84430.8266
80.84750.84410.89250.84320.8275
90.84930.84440.89410.84220.8285
100.85120.84480.89570.84140.8296
Table 6. Sets of parameter combinations and corresponding R c 2 ¯ values.
Table 6. Sets of parameter combinations and corresponding R c 2 ¯ values.
No.FNB R c 2 ¯
10000.6539
26000.6827
30500.8493
40160.6941
56500.8584
66160.7229
70560.8376
86560.8646
Table 7. Quality prediction methods used for comparison.
Table 7. Quality prediction methods used for comparison.
MethodFNB
Traditional000
k-extended window600
i-extended window050
k-i-extended window650
back-extended window0110
k-i-back-extended window6510
Table 8. Comparison of RMSE values gained using the different prediction methods.
Table 8. Comparison of RMSE values gained using the different prediction methods.
MethodBatch 11Batch 31Batch 51Batch 71Mean
Traditional0.01510.00460.23180.02240.0685
k-extended window0.01410.00490.00160.00640.0068
i-extended window6.516 × 10−40.00370.00310.00270.0025
k-i-extended window5.403 × 10−40.00370.00320.00270.0025
back-extended window0.00920.00140.00310.00550.0048
k-i-back-extended window2.587 × 10−40.00310.00390.00240.0024
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, L.; Yang, J. High-Precision Quality Prediction Based on Two-Dimensional Extended Windows. Mathematics 2024, 12, 1396. https://doi.org/10.3390/math12091396

AMA Style

Zhao L, Yang J. High-Precision Quality Prediction Based on Two-Dimensional Extended Windows. Mathematics. 2024; 12(9):1396. https://doi.org/10.3390/math12091396

Chicago/Turabian Style

Zhao, Luping, and Jiayang Yang. 2024. "High-Precision Quality Prediction Based on Two-Dimensional Extended Windows" Mathematics 12, no. 9: 1396. https://doi.org/10.3390/math12091396

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop