Article

Nonlinear Dynamic Process Monitoring Based on Two-Step Dynamic Local Kernel Principal Component Analysis

Hairong Fang, Wenhua Tao, Shan Lu, Zhijiang Lou, Yonghui Wang and Yuanfei Xue
1 School of Information and Control Engineering, Liaoning Petrochemical University, Fushun 113005, China
2 Institute of Intelligence Science and Engineering, Shenzhen Polytechnic, Shenzhen 518055, China
3 Faculty of Engineering, Technology & Built Environment, UCSI University, Kuala Lumpur 56000, Malaysia
* Author to whom correspondence should be addressed.
Processes 2022, 10(5), 925; https://doi.org/10.3390/pr10050925
Submission received: 17 March 2022 / Revised: 27 April 2022 / Accepted: 27 April 2022 / Published: 7 May 2022

Abstract

Nonlinearity can cause a model deviation problem and is therefore a challenging issue for process monitoring. To handle this issue, local kernel principal component analysis (LKPCA) was proposed, and it achieves satisfactory performance in static process monitoring. In a dynamic process, however, the expectation value of each variable changes over time and cannot be replaced with a constant value. Consequently, the local data structure used in LKPCA is wrong, which causes a model deviation problem. In this paper, we propose a new two-step dynamic local kernel principal component analysis (TSD-LKPCA), which first extracts the static components of the process data and then analyzes them with LKPCA. In this way, TSD-LKPCA handles nonlinearity and dynamic features simultaneously.

1. Introduction

With the development of modern industrialization, chemical processes tend to be large-scale and complex. As such, fault detection technology [1,2] becomes increasingly essential because it has the potential to decrease the economic loss caused by process faults.
The data-driven process monitoring method [3,4], which analyzes the process data without requiring a precise analytical model of the system, is an effective means of ensuring industrial safety due to its simplicity and adaptability. Typical data-driven approaches include principal component analysis (PCA) [5,6,7,8,9,10], partial least squares (PLS) [11,12], and independent component analysis (ICA) [13,14]. Among these approaches, PCA is the most commonly used method [15]. The basic principle of PCA is to extract the principal components (PCs) by projecting high-dimensional data into a low-dimensional PC space with a linear transformation, which retains the majority of the variance of the original data [16].
However, PCA is inherently a linear approach, and hence, it cannot handle nonlinearity in process data. To handle this issue, Lee et al. proposed kernel PCA (KPCA) [17,18,19,20,21], the main idea of which is to map the input space into a high-dimensional feature space via a nonlinear kernel function. According to the survey paper [22], KPCA has become the mainstream nonlinear method in recent decades, and many improved versions have since been proposed. Jiang et al. combined PCA and KPCA to handle the linear and nonlinear relationships in process data [18]. Lahdhiri et al. proposed a reduced-rank scheme to lower the computational complexity of KPCA [23]. Peng et al. combined KPCA with a Gaussian mixture model (GMM) to handle nonlinearity and multimode features simultaneously [24].
However, in KPCA, all the training data are used for calculating the kernel function values without considering the inner relationships among the different data points. As such, KPCA is a global structure analysis technique, and it ignores the detailed local structure information, which is important for data dimensionality reduction and feature extraction. To address this issue, Deng et al. proposed a novel KPCA based on local structure analysis, referred to as local KPCA (LKPCA) [25]. LKPCA introduces local data structure analysis into the global optimization function of KPCA and solves it by generalized eigenvalue decomposition [26]. The local data structure in LKPCA is obtained by the k-nearest neighbor (KNN) method [27,28], which is based on the Euclidean distances between samples.
LKPCA is a static algorithm, which assumes that the expectation value of each variable is fixed and does not change over time. However, in a dynamic process, the expectation value of each variable changes over time, and hence, Euclidean distances are not suitable for assessing the similarity between samples at different sample times. As such, the local data structure obtained by KNN is wrong in a dynamic process, which causes a model deviation problem.
To handle nonlinearity and the dynamic feature simultaneously, this paper introduces the two-step dynamic scheme [29] into LKPCA, yielding two-step dynamic LKPCA (TSD-LKPCA). The two-step dynamic scheme is adopted to model the dynamic structure of the process data, after which the static components are extracted. As the static components are time-uncorrelated, they can be monitored by LKPCA, which addresses the model deviation problem of LKPCA.
The main contributions of this paper are as follows: (a) the drawback of the traditional LKPCA is analyzed; (b) a new two-step dynamic scheme is proposed for LKPCA, which handles the dynamic feature of the process and inherits LKPCA's nonlinear processing ability. Test results on a numerical model show that TSD-LKPCA achieves a 100% fault detection rate for both simulated fault types and successfully addresses the false alarm problem caused by the dynamic feature. In addition, test results on the Tennessee Eastman (TE) process [30,31] show that TSD-LKPCA achieves the best fault detection rate in 14 of the 21 faults, including a 100% fault detection rate for fault 5. In both tests, TSD-LKPCA performs much better than LKPCA, PCA, and KPCA.
The remainder of this paper is organized as follows. KPCA and LKPCA are briefly discussed in Section 2. Then, TSD-LKPCA is presented in Section 3. The superiority of the proposed method is demonstrated in Section 4 by several tests. Finally, the conclusion, limitations, and future research are presented in Section 5.

2. Methods

2.1. Notations and Symbols

Table 1 and Table 2 list the symbols and acronyms used in this manuscript.

2.2. Kernel Principal Component Analysis (KPCA)

Assume that the training dataset $X = [x(1)\ x(2)\ \cdots\ x(n)]^T \in \mathbb{R}^{n \times m}$ contains $n$ samples of $m$ variables, and let $\phi(\cdot)$ denote the nonlinear mapping function from the original nonlinear data space to the new linear feature space. To avoid computing with the function $\phi(\cdot)$ explicitly, KPCA defines an $n \times n$ kernel matrix $K = \{K_{ij}\}$ as
$$K_{ij} = \left\langle \phi(x(i)), \phi(x(j)) \right\rangle = \Gamma(x(i), x(j)). \tag{1}$$
In Equation (1), $\Gamma(\cdot,\cdot)$ is the kernel function [32], e.g., the radial basis kernel
$$\Gamma(x(i), x(j)) = \exp\!\left(-\frac{\|x(i) - x(j)\|^2}{\sigma}\right), \tag{2}$$
where $\sigma$ is a weight parameter.
The matrix $K$ can be mean centered as
$$\bar{K} = K - EK - KE + EKE, \tag{3}$$
where
$$E = \frac{1}{n}\begin{bmatrix} 1 & \cdots & 1 \\ \vdots & \ddots & \vdots \\ 1 & \cdots & 1 \end{bmatrix} \in \mathbb{R}^{n \times n}. \tag{4}$$
Finally, eigenvalue decomposition is applied to $\bar{K}$ as
$$\bar{K} p_i = \alpha_i p_i, \tag{5}$$
where $p = (p_1, p_2, \ldots, p_k) \in \mathbb{R}^{n \times k}$ is the projection matrix, $\alpha_i$ is the variance of the $i$th PC, and $k$ is the number of PCs. For a new data sample $y \in \mathbb{R}^{1 \times m}$, the score vector is
$$t = \Gamma(y, X) p, \tag{6}$$
and $t$ can be monitored with the $T^2$ and $SPE$ indices [33].
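For concreteness, the following is a minimal NumPy sketch of the computations in Equations (1)–(6). It is an illustration under our own naming conventions (rbf_kernel, kpca_train, and kpca_scores are not from the original paper), and it omits the eigenvector rescaling and score centering used in full KPCA implementations.

```python
import numpy as np

def rbf_kernel(X, Y, sigma):
    """Radial basis kernel of Equation (2): exp(-||x - y||^2 / sigma)."""
    sq_dist = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq_dist / sigma)

def kpca_train(X, sigma, k):
    """Build and center the kernel matrix (Equations (1) and (3)) and
    extract the k leading eigenpairs (Equation (5))."""
    n = X.shape[0]
    K = rbf_kernel(X, X, sigma)
    E = np.ones((n, n)) / n                      # Equation (4)
    K_bar = K - E @ K - K @ E + E @ K @ E        # Equation (3)
    alpha, P = np.linalg.eigh(K_bar)             # eigenvalues in ascending order
    idx = np.argsort(alpha)[::-1][:k]            # keep the k largest
    return K_bar, P[:, idx], alpha[idx]

def kpca_scores(y, X, sigma, P):
    """Score vector t for a new sample y (Equation (6)); centering of the
    new kernel vector is omitted here for brevity."""
    return rbf_kernel(np.atleast_2d(y), X, sigma) @ P
```

The $T^2$ index then follows from the scores as $t\,\mathrm{diag}(\alpha)^{-1} t^T$.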

2.3. Local Kernel Principal Component Analysis (LKPCA)

The high-dimensional data $\phi(x)$ are often embedded in the ambient space on a low-dimensional manifold, and the goal of a local structure analysis is to determine the best linear approximation that keeps nearby points as close together as feasible [34]. In other words, if $\phi(x(i))$ and $\phi(x(j))$ are nearby points, then $\phi(x(i))^T p$ and $\phi(x(j))^T p$ in the feature space should be nearby points as well. As such, the following neighborhood matrix is defined to describe whether two points are nearby:
$$S_{ij} = \begin{cases} 1, & \text{if } \phi(x(i)) \text{ and } \phi(x(j)) \text{ are nearby} \\ 0, & \text{else}. \end{cases} \tag{7}$$
To determine whether $\phi(x(i))$ and $\phi(x(j))$ are nearby, the k-nearest neighbor (KNN) approach is utilized: if the Euclidean distance between samples $x(i)$ and $x(j)$ is small, then the two samples are considered to be nearby.
The local optimization goal is as follows:
$$\min_{\alpha} J_L = \min_{\alpha} \alpha^T K L K \alpha. \tag{8}$$
In Equation (8), $L$ is the Laplacian matrix, calculated as $L = U - S$, where $S = \{S_{ij}\}$ is the neighbor matrix and $U$ is the diagonal matrix whose $i$th diagonal element is $U_{ii} = \sum_{j=1}^{n} S_{ij}$.
Hence, the global–local optimization goal is as follows:
$$\max_{\alpha} J_{LG} = \frac{\alpha^T K K \alpha}{\alpha^T K L K \alpha}. \tag{9}$$
The above equation can be solved by applying generalized eigenvalue decomposition to
$$K K \alpha = \lambda \left( K L K + \varepsilon I \right) \alpha, \tag{10}$$
where $\varepsilon$ is a small regularization parameter introduced to avoid matrix singularity problems, and $I$ is the identity matrix.
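As a sketch of how Equations (7)–(10) can be implemented, the snippet below builds the KNN neighborhood matrix, forms the Laplacian $L = U - S$, and solves the generalized eigenvalue problem with SciPy. The function names and the symmetrization of $S$ are our assumptions, not the authors' code.

```python
import numpy as np
from scipy.linalg import eigh
from scipy.spatial.distance import cdist

def knn_neighborhood(X, k_n):
    """Neighborhood matrix S of Equation (7): S_ij = 1 if x(j) is among the
    k_n nearest neighbors of x(i) in Euclidean distance (symmetrized)."""
    dist = cdist(X, X)
    np.fill_diagonal(dist, np.inf)           # a point is not its own neighbor
    S = np.zeros_like(dist)
    nearest = np.argsort(dist, axis=1)[:, :k_n]
    rows = np.repeat(np.arange(X.shape[0]), k_n)
    S[rows, nearest.ravel()] = 1.0
    return np.maximum(S, S.T)

def lkpca_solve(K_bar, S, k, eps=1e-6):
    """Solve K K alpha = lambda (K L K + eps I) alpha, Equation (10)."""
    L = np.diag(S.sum(axis=1)) - S           # Laplacian: L = U - S
    A = K_bar @ K_bar                        # numerator of Equation (9)
    B = K_bar @ L @ K_bar + eps * np.eye(K_bar.shape[0])
    lam, alpha = eigh(A, B)                  # generalized eigenproblem
    idx = np.argsort(lam)[::-1][:k]          # keep the k largest ratios
    return alpha[:, idx], lam[idx]
```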

2.4. Two-Step Dynamic Scheme

The two-step dynamic scheme was first proposed in [35]. Assume
$$x(t) = x(t-1) A_1 + \cdots + x(t-q) A_q + \tilde{x}(t) = \bar{X}(t) \tilde{A} + \tilde{x}(t), \tag{11}$$
where $\bar{X}(t) = [x(t-1)\ \cdots\ x(t-q)]$ and $\tilde{A} = [A_1^T\ \cdots\ A_q^T]^T$. The parameter $q$ is the time lag, and $\tilde{x}$ is the static component.
The first step calculates the difference between the two data samples $x(t)$ and $x(t-D)$ as
$$\Delta x(t) = x(t) - x(t-D), \tag{12}$$
where $D$ is the time difference. The dynamic matrix $\tilde{A}$ is then estimated from $\Delta x(t)$ and $[\Delta x(t-1)\ \cdots\ \Delta x(t-q)]$ with the least squares algorithm [36], and the static components are extracted as
$$\tilde{x}(t) = x(t) - \bar{X}(t) \tilde{A}. \tag{13}$$
The second step monitors $\tilde{x}(t)$ with traditional process monitoring methods.
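A least-squares sketch of the two steps, under the row-vector convention of Equation (11), might look as follows; the helper name and the default values of q and D are illustrative assumptions.

```python
import numpy as np

def extract_static_components(X, q=2, D=10):
    """Traditional two-step scheme: estimate A_tilde from differenced data
    (Equation (12)) and extract the static components (Equation (13)).
    X is an n x m data matrix with one sample per row."""
    dX = X[D:] - X[:-D]                          # Equation (12)
    n = dX.shape[0]
    # Regressors [dx(t-1) ... dx(t-q)], stacked row-wise for t = q, ..., n-1
    Phi = np.hstack([dX[q - 1 - j : n - 1 - j] for j in range(q)])
    A_tilde, *_ = np.linalg.lstsq(Phi, dX[q:], rcond=None)   # (q*m) x m
    # Apply the same lagged regressors to the raw data to get x_tilde(t)
    N = X.shape[0]
    X_bar = np.hstack([X[q - 1 - j : N - 1 - j] for j in range(q)])
    return X[q:] - X_bar @ A_tilde               # Equation (13)
```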

3. The Proposed Method

3.1. Analysis of the Performance of LKPCA in a Dynamic Process

In addition to nonlinearity, the dynamic feature [37,38] is a common issue in process monitoring. When LKPCA is applied to a dynamic process, it achieves a low detection rate and a large false alarm rate. The reason for this phenomenon is that in LKPCA the nearby points are determined by the Euclidean distance of the variable values $x(i)$, rather than that of the deviations $x(i) - E(x(i))$.
As shown in Figure 1, for a static process the expectation $E(x(i))$ remains unchanged, i.e., $E(x(i)) = \mu$, and hence, the Euclidean distance between the two sample values $x(i)$ and $x(j)$ can be used to represent the Euclidean distance between the fluctuations at sample times $i$ and $j$, i.e.,
$$\|x(i) - x(j)\| = \|(x(i) - \mu) - (x(j) - \mu)\|. \tag{14}$$
As a result, the nearby points in Figure 1 (marked with red circles) have close fluctuations, and hence, they are clustered as nearby points by KNN.
As shown in Figure 2, for a dynamic process the expectation $E(X)$ is not fixed and changes over time, i.e., $E(x(i)) \neq E(x(j))$. In this situation, the Euclidean distance between the two sample values $x(i)$ and $x(j)$ is not equal to the Euclidean distance between the fluctuations at sample times $i$ and $j$. In Figure 2a, the clustering of nearby points based on the Euclidean distances of the original variable values is wrong because it ignores the change of $E(X)$, so data with close sample times are more likely to be clustered into one group. As the process goes on, the expectation values of the variables deviate considerably from those of the training data, and hence, the false alarm rate becomes very large. In Figure 2b, the clustering is right because each cluster consists of data samples from different sample times and is not influenced by the changing expectation value. As such, handling the changing expectation value is the key to addressing the dynamic process feature.
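The effect can be seen in a few lines: with a drifting mean, raw-value distances group samples by time rather than by fluctuation, whereas deviation distances recover the intended similarity. This toy example (a linear trend and an arbitrary noise level) is our own illustration, not data from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
trend = np.linspace(0.0, 5.0, n)            # drifting expectation E(x(t))
x = trend + rng.normal(0.0, 0.1, size=n)    # dynamic process samples

# Raw-value distances are dominated by the trend, so KNN groups samples
# that are close in time rather than samples with similar fluctuations.
print(abs(x[10] - x[11]))     # small: adjacent in time
print(abs(x[10] - x[150]))    # large: the trend difference dominates

# Distances between deviations x(i) - E(x(i)) compare the fluctuations
# themselves, matching the static case of Equation (14).
dev = x - trend
print(abs(dev[10] - dev[150]))  # back to the noise scale
```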

3.2. New Two-Step Dynamic Scheme

To handle the dynamic issue, the key step is to extract the static components from the dynamic data. One drawback of the traditional two-step dynamic scheme is that $x(t)$ and $x(t-D)$ remain correlated even if $D$ is very large, and hence, the estimated dynamic matrix $\tilde{A}$ may be wrong. In this paper, we introduce another independent sample dataset as
$$y(t) = y(t-1) A_1 + \cdots + y(t-q) A_q + \tilde{y}(t) = \bar{Y}(t) \tilde{A} + \tilde{y}(t), \tag{15}$$
and calculate the difference between the two datasets as
$$\Delta z(t) = x(t) - y(t) = \left( \bar{X}(t) - \bar{Y}(t) \right) \tilde{A} + \left( \tilde{x}(t) - \tilde{y}(t) \right) = \Delta \bar{Z}(t) \tilde{A} + \Delta \tilde{z}(t), \tag{16}$$
where $\bar{X}(t) - \bar{Y}(t)$ is the component correlated with $\Delta z$ and $\Delta \tilde{z}(t) = \tilde{x}(t) - \tilde{y}(t)$ is the static component, which is independent of $\Delta z$. Since $\tilde{x}(t)$ and $\tilde{y}(t)$ follow the same statistical distribution, the expectation of $\Delta \tilde{z}(t)$ is zero, and $\tilde{A}$ can be estimated with the least squares algorithm. The remaining steps are the same as in the traditional two-step dynamic scheme.
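A sketch of the new scheme follows, assuming two independent, equal-length normal datasets X and Y from the same process; the helper name is ours, and the regression simply mirrors the traditional scheme with $\Delta z(t)$ in place of $\Delta x(t)$.

```python
import numpy as np

def extract_static_components_new(X, Y, q=2):
    """New two-step scheme (Equations (15) and (16)): difference two
    independent realizations so the residual has zero expectation, estimate
    A_tilde by least squares, then extract x_tilde via Equation (13)."""
    dZ = X - Y                                   # Equation (16), left-hand side
    n = dZ.shape[0]
    Phi = np.hstack([dZ[q - 1 - j : n - 1 - j] for j in range(q)])
    A_tilde, *_ = np.linalg.lstsq(Phi, dZ[q:], rcond=None)
    X_bar = np.hstack([X[q - 1 - j : n - 1 - j] for j in range(q)])
    return X[q:] - X_bar @ A_tilde               # static components x_tilde
```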

3.3. Two-Step Dynamic Local Kernel Principal Component Analysis (TSD-LKPCA)

As shown in Figure 3, the static components extracted from $X$ are independent of the changing variable expectation, and hence, $\tilde{x}(t)$ can be used for LKPCA.
The details of TSD-LKPCA are as follows.
Step 1. Process the original data with the new two-step dynamic scheme and extract the static components $\tilde{x}(t)$ with Equation (13).
Step 2. Cluster $\tilde{x}(t)$ into nearby groups by using KNN and then calculate the neighborhood matrix $S_{ij}$ by using Equation (7).
Step 3. Process $\tilde{x}(t)$ with the kernel function and calculate the projection directions by using Equation (10).
Step 4. Extract the PCs by using Equation (6) and then monitor them with the $T^2$ and $SPE$ indices.
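Putting Steps 1–4 together, a minimal monitoring sketch might look as follows. It reuses the illustrative helpers from the earlier sketches (rbf_kernel, knn_neighborhood, extract_static_components_new, lkpca_solve), and X_train, Y_train, and the parameter values are placeholders, not the authors' code.

```python
import numpy as np

# Step 1: extract static components with the new two-step dynamic scheme
X_s = extract_static_components_new(X_train, Y_train, q=2)

# Step 2: neighborhood matrix from KNN on the static components
S = knn_neighborhood(X_s, k_n=15)

# Step 3: kernelize, center (Equation (3)), and solve Equation (10)
n = X_s.shape[0]
K = rbf_kernel(X_s, X_s, sigma=50_000)
E = np.ones((n, n)) / n
K_bar = K - E @ K - K @ E + E @ K @ E
alpha, lam = lkpca_solve(K_bar, S, k=3)

# Step 4: extract the PCs and a T^2-type statistic for monitoring
T = K_bar @ alpha
T2 = np.sum(T ** 2 / lam, axis=1)
```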

4. Simulation Results

In this section, we test the performance of TSD-LKPCA based on a numerically simulated dynamic nonlinear process and a TE process. The monitoring results acquired by this approach are compared to those obtained by PCA, KPCA, and LKPCA in terms of detection rate and false alarm rate.

4.1. Numerical Model Test

To verify the superiority of TSD-LKPCA in a dynamic nonlinear process, the following mathematical model is designed to test the effectiveness of the algorithm:
$$\begin{cases} x_1(t) = \chi + e_1 + x_1(t-1) \\ x_2(t) = \chi^2 - 3\chi + e_2 + 0.8\, x_1(t) \\ x_3(t) = -\chi^3 + 3\chi^2 + e_3 + 0.8\, x_2(t) + 0.6\, x_1(t), \end{cases} \tag{17}$$
where $\chi$ is a uniformly distributed random variable; $e_1$, $e_2$, and $e_3$ are Gaussian random noise with zero mean and variance 0.01; and $x_1$, $x_2$, $x_3$ are the monitored variables. The process generates 860 samples of normal data for off-line training and another 960 samples for testing. The test data introduce a fault at the 450th sampling point.
There are two types of faults, as follows (a data-generation sketch is given after the list):
  • Fault 1: a step fault occurs in variable $x_1$ with an amplitude of −6;
  • Fault 2: the coefficient 0.8 in the $x_2$ equation changes to 0.5.
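As mentioned above the list, the following is a generation sketch for Equation (17). The sample counts, noise variance, and fault definitions follow the text; the random seed and the range of the uniform variable χ are assumptions, since the text does not specify them.

```python
import numpy as np

rng = np.random.default_rng(1)

def simulate(n, step=0.0, coeff=0.8, fault_at=None):
    """Generate n samples from Equation (17); apply a step fault on x1
    and/or change the x2 coefficient from the fault_at-th sample onward."""
    X = np.zeros((n, 3))
    for t in range(1, n):
        chi = rng.uniform(0.01, 2.0)                # assumed range for chi
        e = rng.normal(0.0, np.sqrt(0.01), size=3)  # zero mean, variance 0.01
        faulty = fault_at is not None and t >= fault_at
        f = step if faulty else 0.0
        c = coeff if faulty else 0.8
        X[t, 0] = chi + e[0] + X[t - 1, 0] + f
        X[t, 1] = chi ** 2 - 3 * chi + e[1] + c * X[t, 0]
        X[t, 2] = -chi ** 3 + 3 * chi ** 2 + e[2] + 0.8 * X[t, 1] + 0.6 * X[t, 0]
    return X

X_train = simulate(860)                              # normal training data
X_fault1 = simulate(960, step=-6.0, fault_at=450)    # fault 1: step on x1
X_fault2 = simulate(960, coeff=0.5, fault_at=450)    # fault 2: coefficient change
```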
For the three nonlinear approaches, KPCA, LKPCA, and TSD-LKPCA, the kernel width parameter $\sigma$ is set to 50,000 by cross-validation. The neighborhood relation parameter $k_n$ of the LKPCA and TSD-LKPCA methods is 15. The lag parameter $q$ of the dynamic process model is set to 2. The control limits of the four methods are based on a 99% confidence limit. These parameters are also used in Section 4.2.
Table 3 shows the false alarm rates and fault detection rates of the four algorithms. Due to the nonlinear characteristics of the process, the $T^2$ false alarm rates of the three nonlinear methods (KPCA, LKPCA, and TSD-LKPCA) are much lower than that of the linear method PCA. Because KPCA and LKPCA are static methods, they regard the change of the expectation value in the training data as normal data fluctuation, so their control limits are very high and they are insensitive to faults; at the same time, they regard the change of the expectation values in the testing data as a fault-induced deviation, resulting in many false alarms. In contrast, because TSD-LKPCA effectively handles the nonlinear and dynamic characteristics of the process, it achieves a 100% detection rate for both faults. The best result in each item is marked in underlined bold.
The monitoring charts for faults 1 and 2 are depicted in Figure 4 and Figure 5. In these two figures, the blue line indicates the statistics, the red line represents the corresponding control limit, and the orange line marks the fault occurrence time. The $T^2$ statistics of PCA, KPCA, and LKPCA change over time in both figures, while those of TSD-LKPCA do not, because TSD-LKPCA monitors the static components, whose expectation value is fixed. Because there is a deviation between the initial process data and the mean value of the training data, this deviation is regarded as a fault, and hence, the $T^2$ and $SPE$ statistics of the PCA, KPCA, and LKPCA algorithms initially exceed the control limits, resulting in false alarms. The TSD-LKPCA method handles the dynamic characteristics and hence avoids this issue. For fault 1, as shown in Figure 4, the PCA, KPCA, and LKPCA algorithms detect the fault only after a period of time, which causes a detection delay. For fault 2, as shown in Figure 5, the mean and variance of the $T^2$ statistics of the PCA, KPCA, and LKPCA methods change over time because of the dynamic feature, so these methods are insensitive to the fault. In contrast, the TSD-LKPCA method detects the fault immediately, demonstrating its high fault detection sensitivity for processes with nonlinear dynamic characteristics.

4.2. Tennessee Eastman (TE) Process Test

The Tennessee Eastman (TE) process is based on a simulation model of an actual industrial process and is frequently used as a publicly available data source for testing process monitoring methods. The process has 12 operational variables, 22 continuous measured variables, and 19 component variables. The training dataset consists of 960 data samples collected under normal operating conditions. The TE process also provides 21 distinct types of faults (shown in Table 4); each fault dataset contains 960 samples, with the fault occurring from the 161st data point to the end.
As the TE process is a nonlinear and dynamic simulation model, it is adopted to test the performance of PCA, KPCA, LKPCA, and TSD-LKPCA. For the method comparison, 315 samples of a normal process dataset were utilized for model training. Table 5 displays the detection results for the 21 faults. TSD-LKPCA achieves the best fault detection rate in 14 of the 21 faults and has much lower false alarm rates than PCA and KPCA. In particular, TSD-LKPCA achieves a 100% fault detection rate for fault 5, whereas those of the other methods are lower than 45%. TSD-LKPCA also performs much better than the other methods for faults 10, 16, 19, and 20. For fairness, we take the difference between the fault detection rate and the false alarm rate as an additional index, which indicates that TSD-LKPCA achieves a much higher fault detection rate at the same false alarm rate. The best result in each item is marked in underlined bold.
Figure 6 provides the monitoring charts of the algorithms for fault 5, a sudden change in the inlet temperature of the condenser cooling water that lasts from the 161st data point to the end. According to the detection curves of PCA (Figure 6a), KPCA (Figure 6b), and LKPCA (Figure 6c), the process appears to return to normal after the 400th sample. In contrast, TSD-LKPCA (Figure 6d) still alarms after that time, and hence, it achieves a 100% fault detection rate. This result also demonstrates that the TSD-LKPCA method outperforms PCA, KPCA, and LKPCA.

5. Conclusions, Limitations, and Future Research

This paper proposed the TSD-LKPCA method for handling dynamic and nonlinear features simultaneously. The main contribution is a novel two-step dynamic scheme that is integrated naturally into the LKPCA technique. As such, TSD-LKPCA can successfully extract the static components of the data and handle their nonlinear features by LKPCA. The test results on a dynamic and nonlinear numerical model show that TSD-LKPCA successfully detects both simulated faults, and the test results on the TE process show that TSD-LKPCA achieves a fault detection rate more than 9% higher than those of PCA, KPCA, and LKPCA. As such, TSD-LKPCA is a promising method.
Another contribution of this paper is a detailed analysis of the influence of the dynamic feature on the clustering of nearby points, which shows that LKPCA is not applicable to dynamic processes.
However, it is worth noting that the selection of the parameter $\sigma$ in the Gaussian kernel function and the parameter $k_n$ in the KNN method remains an open problem. More accurate results could be obtained with more advanced parameter optimization methods, and designing such a parameter optimization algorithm will be considered in future work.

Author Contributions

Conceptualization, H.F.; methodology, H.F. and Z.L.; validation, W.T. and S.L.; formal analysis, H.F.; resources, Y.W. and Y.X.; writing—original draft preparation, H.F.; writing—review and editing, Z.L. and W.T.; visualization, H.F. and W.T.; supervision, S.L.; project administration, Z.L.; funding acquisition, S.L., Z.L. and Y.X. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Innovation Team of the Department of Education of Guangdong Province, China (2020KCXTD041), the School Level Scientific Research Project of SZPT, China (6022310005K), and Young Talents by the Department of Education of Guangdong Province, China (2021KQNCX210).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Gao, Z.; Liu, X. An Overview on Fault Diagnosis, Prognosis and Resilient Control for Wind Turbine Systems. Processes 2021, 9, 300.
  2. Aslam, M.; Bantan, R.A.R.; Khan, N. Monitoring the Process Based on Belief Statistic for Neutrosophic Gamma Distributed Product. Processes 2019, 7, 209.
  3. Quiñones-Grueiro, M.; Prieto-Moreno, A.; Verde, C.; Llanes-Santiago, O. Data-driven monitoring of multimode continuous processes: A review. Chemom. Intell. Lab. Syst. 2019, 189, 56–71.
  4. Qin, S.J. Survey on data-driven industrial process monitoring and diagnosis. Annu. Rev. Control 2012, 36, 220–234.
  5. Abdi, H.; Williams, L.J. Principal component analysis. Wiley Interdiscip. Rev. Comput. Stat. 2010, 2, 433–459.
  6. Zhou, P.; Zhang, R.; Xie, J.; Liu, J.; Wang, H.; Chai, T. Data-driven monitoring and diagnosing of abnormal furnace conditions in blast furnace ironmaking: An integrated PCA-ICA method. IEEE Trans. Ind. Electron. 2020, 68, 622–631.
  7. Lou, Z.; Wang, Y.; Lu, S.; Sun, P. Process Monitoring Using a Novel Robust PCA Scheme. Ind. Eng. Chem. Res. 2021, 60, 4397–4404.
  8. Song, Y.; Liu, J.; Zhang, L.; Wu, D. Improvement of Fast Kurtogram Combined with PCA for Multiple Weak Fault Features Extraction. Processes 2020, 8, 1059.
  9. Lei, Y.; Jiang, W.; Jiang, A.; Zhu, Y.; Niu, H.; Zhang, S. Fault diagnosis method for hydraulic directional valves integrating PCA and XGBoost. Processes 2019, 7, 589.
  10. Fu, Y.; Gao, Z.; Liu, Y.; Zhang, A.; Yin, X. Actuator and Sensor Fault Classification for Wind Turbine Systems Based on Fast Fourier Transform and Uncorrelated Multi-Linear Principal Component Analysis Techniques. Processes 2020, 8, 1066.
  11. Kresta, J.V.; MacGregor, J.F.; Marlin, T.E. Multivariate statistical monitoring of process operating performance. Can. J. Chem. Eng. 1991, 69, 35–47.
  12. Dong, J.; Zhang, K.; Huang, Y.; Li, G.; Peng, K. Adaptive total PLS based quality-relevant process monitoring with application to the Tennessee Eastman process. Neurocomputing 2015, 154, 77–85.
  13. Kano, M.; Tanaka, S.; Hasebe, S.; Hashimoto, I.; Ohno, H. Monitoring independent components for fault detection. AIChE J. 2003, 49, 969–976.
  14. Fan, J.; Qin, S.J.; Wang, Y. Online monitoring of nonlinear multivariate industrial processes using filtering KICA–PCA. Control Eng. Pract. 2014, 22, 205–216.
  15. Cartocci, N.; Napolitano, M.R.; Crocetti, F.; Costante, G.; Valigi, P.; Fravolini, M.L. Data-Driven Fault Diagnosis Techniques: Non-Linear Directional Residual vs. Machine-Learning-Based Methods. Sensors 2022, 22, 2635.
  16. Dunia, R.; Qin, S.J. Joint diagnosis of process and sensor faults using principal component analysis. Control Eng. Pract. 1998, 6, 457–469.
  17. Lee, J.-M.; Yoo, C.; Choi, S.W.; Vanrolleghem, P.A.; Lee, I.-B. Nonlinear process monitoring using kernel principal component analysis. Chem. Eng. Sci. 2004, 59, 223–234.
  18. Jiang, Q.; Yan, X. Parallel PCA–KPCA for nonlinear process monitoring. Control Eng. Pract. 2018, 80, 17–25.
  19. Zhang, Y. Enhanced statistical analysis of nonlinear processes using KPCA, KICA and SVM. Chem. Eng. Sci. 2009, 64, 801–811.
  20. Zeng, L.; Long, W.; Li, Y. A Novel Method for Gas Turbine Condition Monitoring Based on KPCA and Analysis of Statistics T2 and SPE. Processes 2019, 7, 124.
  21. Zhou, M.; Zhang, Q.; Liu, Y.; Sun, X.; Cai, Y.; Pan, H. An Integration Method Using Kernel Principal Component Analysis and Cascade Support Vector Data Description for Pipeline Leak Detection with Multiple Operating Modes. Processes 2019, 7, 648.
  22. Wang, Y.; Si, Y.; Huang, B.; Lou, Z. Survey on the theoretical research and engineering applications of multivariate statistics process monitoring algorithms: 2008–2017. Can. J. Chem. Eng. 2018, 96, 2073–2085.
  23. Lahdhiri, H.; Elaissi, I.; Taouali, O.; Harakat, M.F.; Messaoud, H. Nonlinear process monitoring based on new reduced Rank-KPCA method. Stoch. Environ. Res. Risk Assess. 2018, 32, 1833–1848.
  24. Peng, G.; Huang, K.; Wang, H. Dynamic multimode process monitoring using recursive GMM and KPCA in a hot rolling mill process. Syst. Sci. Control Eng. 2021, 9, 592–601.
  25. Deng, X.; Tian, X.; Chen, S. Modified kernel principal component analysis based on local structure analysis and its application to nonlinear process fault diagnosis. Chemom. Intell. Lab. Syst. 2013, 127, 195–209.
  26. Parra, L.; Sajda, P. Blind source separation via generalized eigenvalue decomposition. J. Mach. Learn. Res. 2003, 4, 1261–1269.
  27. Kramer, O. K-nearest neighbors. In Dimensionality Reduction with Unsupervised Nearest Neighbors; Springer: Berlin/Heidelberg, Germany, 2013; pp. 13–23.
  28. Wang, J.; Zhou, Z.; Li, Z.; Du, S. A Novel Fault Detection Scheme Based on Mutual k-Nearest Neighbor Method: Application on the Industrial Processes with Outliers. Processes 2022, 10, 497.
  29. Lou, Z.; Tuo, J.; Wang, Y. Two-step principal component analysis for dynamic processes. In Proceedings of the 2017 6th International Symposium on Advanced Control of Industrial Processes (AdCONIP), Taipei, Taiwan, 28–31 May 2017; pp. 73–77.
  30. Lou, Z.; Wang, Y.; Si, Y.; Lu, S. A novel multivariate statistical process monitoring algorithm: Orthonormal subspace analysis. Automatica 2022, 138, 110148.
  31. Chiang, L.H.; Russell, E.L.; Braatz, R.D. Fault Detection and Diagnosis in Industrial Systems; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2000.
  32. Xu, Y.; Zhang, D.; Song, F.; Yang, J.-Y.; Jing, Z.; Li, M. A method for speeding up feature extraction based on KPCA. Neurocomputing 2007, 70, 1056–1061.
  33. Lou, Z.; Shen, D.; Wang, Y. Preliminary-summation-based principal component analysis for non-Gaussian processes. Chemom. Intell. Lab. Syst. 2015, 146, 270–289.
  34. Deng, X.; Cai, P.; Cao, Y.; Wang, P. Two-step localized kernel principal component analysis based incipient fault diagnosis for nonlinear industrial processes. Ind. Eng. Chem. Res. 2020, 59, 5956–5968.
  35. Lou, Z.; Shen, D.; Wang, Y. Two-step principal component analysis for dynamic processes monitoring. Can. J. Chem. Eng. 2018, 96, 160–170.
  36. Han, M.; Zhang, S.; Xu, M.; Qiu, T.; Wang, N. Multivariate chaotic time series online prediction based on improved kernel recursive least squares algorithm. IEEE Trans. Cybern. 2018, 49, 1160–1172.
  37. Ma, X.; Si, Y.; Yuan, Z.; Qin, Y.; Wang, Y. Multistep dynamic slow feature analysis for industrial process monitoring. IEEE Trans. Instrum. Meas. 2020, 69, 9535–9548.
  38. Cong, Y.; Zhou, L.; Song, Z.; Ge, Z. Multirate dynamic process monitoring based on multirate linear Gaussian state-space model. IEEE Trans. Autom. Sci. Eng. 2019, 16, 1708–1719.
Figure 1. Cluster results for a static process.
Figure 2. Cluster results for a dynamic process: (a) clustered by variable value and (b) clustered by variable deviation.
Figure 3. Static components extracted from the dynamic process data.
Figure 4. Monitoring charts of fault 1 in the numerical example: (a) PCA, (b) KPCA, (c) LKPCA, (d) TSD-LKPCA.
Figure 5. Monitoring charts of fault 2 in the numerical example: (a) PCA, (b) KPCA, (c) LKPCA, (d) TSD-LKPCA.
Figure 6. Monitoring charts of fault 5: (a) PCA, (b) KPCA, (c) LKPCA, (d) TSD-LKPCA.
Table 1. Symbols and notations used in the present work.

Symbol | Description
$X$ | Training data
$\phi(\cdot)$ | Nonlinear mapping function
$\Gamma(\cdot,\cdot)$ | Kernel function
$K$ | Kernel matrix
$\sigma$ | Weight parameter
$E$ | $n \times n$ mean-centering matrix with all elements equal to $1/n$
$p$ | Projection vector
$\alpha$ | Eigenvalue
$k$ | Number of principal components
$S_{ij}$ | Local neighborhood relationship
$L$ | Laplacian matrix
$S$ | Neighbor matrix
$U$ | Diagonal matrix
$\varepsilon$ | Small regularization parameter
$I$ | Identity matrix
$q$ | Time lag
$D$ | Time difference
$\chi$ | Uniformly distributed random variable
$k_n$ | Neighborhood relation parameter
Table 2. Acronyms used in the present work.

Acronym | Description
GMM | Gaussian mixture model
ICA | Independent component analysis
KNN | K-nearest neighbor
KPCA | Kernel principal component analysis
LKPCA | Local kernel principal component analysis
PCA | Principal component analysis
PCs | Principal components
PLS | Partial least squares
TE | Tennessee Eastman
TSD-LKPCA | Two-step dynamic local kernel principal component analysis
Table 3. Monitoring results (%) of PCA, KPCA, LKPCA, and TSD-LKPCA.

Index | PCA $T^2$ | PCA $SPE$ | KPCA $T^2$ | KPCA $SPE$ | LKPCA $T^2$ | LKPCA $SPE$ | TSD-LKPCA $T^2$ | TSD-LKPCA $SPE$
Fault 1 | 98.04 | 27.45 | 96.67 | 96.67 | 96.67 | 96.67 | 100.00 | 100.00
Fault 2 | 0.00 | 100.00 | 0.00 | 100.00 | 0.00 | 100.00 | 100.00 | 100.00
Average | 49.02 | 62.73 | 48.43 | 98.34 | 48.43 | 98.34 | 100.00 | 100.00
False alarm rates | 15.11 | 0.22 | 2.11 | 0.77 | 52.11 | 1.67 | 1.11 | 0.45
Table 4. Fault descriptions for the Tennessee Eastman (TE) process.

No. | Description | Type
1 | A/C feed ratio, B composition constant | Step
2 | B composition, A/C ratio constant | Step
3 | D feed temperature | Step
4 | Reactor cooling water inlet temperature | Step
5 | Condenser cooling water inlet temperature | Step
6 | A feed loss | Step
7 | C header pressure loss (reduced availability) | Step
8 | A, B, C feed composition | Random variation
9 | D feed temperature | Random variation
10 | C feed temperature | Random variation
11 | Reactor cooling water inlet temperature | Random variation
12 | Condenser cooling water inlet temperature | Random variation
13 | Reaction kinetics | Slow drift
14 | Reactor cooling water valve | Sticking
15 | Condenser cooling water valve | Sticking
16–21 | Unknown | Unknown
Table 5. Monitoring results on the TE process (%).

Index | PCA $T^2$ | PCA $SPE$ | KPCA $T^2$ | KPCA $SPE$ | LKPCA $T^2$ | LKPCA $SPE$ | TSD-LKPCA $T^2$ | TSD-LKPCA $SPE$
Fault 1 | 99.10 | 99.00 | 99.25 | 99.00 | 99.25 | 99.38 | 99.63 | 99.75
Fault 2 | 98.40 | 95.80 | 98.63 | 98.88 | 98.50 | 98.75 | 99.13 | 98.13
Fault 3 | 0.90 | 2.60 | 3.00 | 21.50 | 0.63 | 5.50 | 1.25 | 1.50
Fault 4 | 20.90 | 100.00 | 54.38 | 98.88 | 5.25 | 83.75 | 84.13 | 76.50
Fault 5 | 24.30 | 20.90 | 27.88 | 40.88 | 23.50 | 27.63 | 100.00 | 100.00
Fault 6 | 99.10 | 100.00 | 99.13 | 99.50 | 99.00 | 99.38 | 100.00 | 100.00
Fault 7 | 100.00 | 100.00 | 100.00 | 100.00 | 99.25 | 100.00 | 88.63 | 100.00
Fault 8 | 96.90 | 83.60 | 97.50 | 96.25 | 97.00 | 97.75 | 97.13 | 97.63
Fault 9 | 1.80 | 1.80 | 3.63 | 18.25 | 1.75 | 4.13 | 1.38 | 0.63
Fault 10 | 29.90 | 25.80 | 41.13 | 53.63 | 33.88 | 33.25 | 80.50 | 82.13
Fault 11 | 40.60 | 74.90 | 56.50 | 69.75 | 19.13 | 65.88 | 69.38 | 61.25
Fault 12 | 98.40 | 89.50 | 98.63 | 96.50 | 98.00 | 95.38 | 99.13 | 99.38
Fault 13 | 93.60 | 95.30 | 94.50 | 96.00 | 94.25 | 95.25 | 94.75 | 98.50
Fault 14 | 99.30 | 100.00 | 99.50 | 100.00 | 79.13 | 100.00 | 99.63 | 99.88
Fault 15 | 1.40 | 3.00 | 5.25 | 22.13 | 2.63 | 6.88 | 1.13 | 1.63
Fault 16 | 13.50 | 27.40 | 25.00 | 43.50 | 17.50 | 20.00 | 80.38 | 80.88
Fault 17 | 76.40 | 95.40 | 80.38 | 93.25 | 70.63 | 84.75 | 94.50 | 96.25
Fault 18 | 89.30 | 90.10 | 89.00 | 92.75 | 88.63 | 89.13 | 89.88 | 93.00
Fault 19 | 11.00 | 12.50 | 11.13 | 30.63 | 0.50 | 23.00 | 57.00 | 54.25
Fault 20 | 31.80 | 49.80 | 36.13 | 61.25 | 29.00 | 38.00 | 85.63 | 63.88
Fault 21 | 39.30 | 47.30 | 40.00 | 62.38 | 30.00 | 48.75 | 50.88 | 33.38
Average | 55.52 | 62.60 | 60.03 | 71.19 | 51.78 | 62.69 | 74.96 | 73.26
False alarm rates | 2.32 | 12.20 | 0.51 | 5.33 | 0.00 | 1.61 | 0.36 | 1.13
Difference | 53.20 | 50.40 | 59.52 | 65.86 | 51.78 | 61.08 | 74.60 | 72.13