Distributed Robust Dictionary Pair Learning and Its Application to Aluminum Electrolysis Industrial Process

Wang, Jingkun; Chen, Xiaofang; Deng, Ziqing; Zhang, Hongliang; Zeng, Jing

doi:10.3390/pr10091850

Open AccessArticle

Distributed Robust Dictionary Pair Learning and Its Application to Aluminum Electrolysis Industrial Process

by

Jingkun Wang

^1,2,

Xiaofang Chen

³,

Ziqing Deng

^3,*,

Hongliang Zhang

^1,2 and

Jing Zeng

³

¹

School of Metallurgy and Environment, Central South University, Changsha 410083, China

²

National Engineering Research Center for Low-Carbon Nonferrous Metallurgy, Changsha 410083, China

³

School of Automation, Central South University, Changsha 410083, China

^*

Author to whom correspondence should be addressed.

Processes 2022, 10(9), 1850; https://doi.org/10.3390/pr10091850

Submission received: 23 August 2022 / Revised: 6 September 2022 / Accepted: 9 September 2022 / Published: 14 September 2022

(This article belongs to the Special Issue Process Monitoring and Fault Diagnosis)

Download

Browse Figures

Versions Notes

Abstract

:

In modern industrial systems, high-dimensional process data provide rich information for process monitoring. To make full use of local information of industrial process, a distributed robust dictionary pair learning (DRDPL) is proposed for refined process monitoring. Firstly, the global system is divided into several sub-blocks based on the reliable prior knowledge of industrial processes, which achieves dimensionality reduction and reduces process complexity. Secondly, a robust dictionary pair learning (RDPL) method is developed to build a local monitoring model for each sub-block. The sparse constraint with

l_{2, 1}

norm is added to the analytical dictionary, and a low rank constraint is applied to the synthetical dictionary, so as to obtain robust dictionary pairs. Then, Bayesian inference method is introduced to fuse local monitoring information to global anomaly detection, and the block contribution index and variable contribution index are used to realize anomaly isolation. Finally, the effectiveness of the proposed method is verified by a numerical simulation experiment and Tennessee Eastman benchmark tests, and the proposed method is then successfully applied to a real-world aluminum electrolysis process.

Keywords:

high-dimension; distributed robust dictionary pair learning; process monitoring; aluminum electrolysis

1. Introduction

With the continuous development of information technology, the degree of automation and integration of industrial processes have been continuously improved, and the structure of industrial systems has become more complex. There are more and more factors affecting the stable operation of industrial production, which makes anomaly detection and isolation of industrial processes face challenges [1,2]. At present, industrial process monitoring technology has attracted great attention from academia and industry. The excellent properties of aluminum make it widely used in practical fields, including construction, electrical, packaging, medicine, transportation and so on. Electrolytic cell is the core equipment of aluminum production industry. Moreover, aluminum electrolysis process mainly depends on biggish and large pre-baked anode electrolytic cell. For the large-scale electrolytic cell, once the abnormal cell condition occurs, the efficient and stable operation of the electrolytic cell may be destroyed, causing huge economic losses [3]. Therefore, effective process monitoring technology is of profound significance to ensure the green and stable operation of electrolytic cells and reduce the production cost of enterprises. Existing aluminum electrolysis process monitoring methods can be divided into three categories: mechanistic model-based methods [4,5], knowledge-based methods [6,7], and data-driven methods [8,9,10]. The effectiveness of the mechanism model-based process monitoring method for aluminum electrolysis depends on the measurements, like cell resistance, cell voltage, and aluminum level. However, some process parameters such as electrolyte temperature and molecular ratio cannot be measured directly in actual aluminum electrolytic production. Meanwhile, individual differences in empirical knowledge and lack of systematic theoretical basis lead to the failure to ensure the effectiveness of knowledge-based process monitoring methods in aluminum electrolysis. In addition, due to the reduction on the cost of sensors, the rapid development of technology and the application of advanced computer technology, modern industrial systems use a large number of sensors to obtain rich process information, and data-driven process monitoring methods are widely studied and applied [11,12].

In recent years, dictionary learning has been proposed as an effective statistical machine learning method. Compared with traditional data-driven methods, dictionary learning methods have good generalization ability, which are successfully applied in many fields, such as pattern recognition, image processing, and computer vision [13]. Dictionary learning seeks the linear combination of atoms to reconstruct the original data. And the learned dictionaries are over-complete and not restricted by orthogonal, which makes dictionary atoms adapt to the training data more flexibly to ensure the high precision of learning methods. According to different sparse coding methods, traditional dictionary learning methods include analytical dictionary learning [14,15] and synthetical dictionary learning [16,17]. Analytical dictionary learning can directly build the required dictionary from the precise and fast transformation base, and its predefined encoding method makes the computational complexity low, but it is relatively limited in monitoring modeling ability. Synthetical dictionary learning requires sparse reconstruction by

l_{0}

or

l_{1}

norm, which leads to much higher computational complexity. However, synthetical dictionary learning is developing in industrial process monitoring with the effective ability of local modeling. Meanwhile, to combine the advantages of analytical dictionary learning and synthetical dictionary learning, Gu et al. [18] combined analytical dictionaries and synthetical dictionaries into a learning framework and proposed dictionary pair learning (DPL) method. Zhang et al. [19] integrated coefficient learning and salient feature extraction into a unified model and proposed a self-expressed local adaptive potential dictionary pair learning method. Sun et al. [20] proposed a structured robust adaptive dictionary pair learning framework for discriminative sparse representation learning, realizing the strong representation ability of available samples.

However, small anomalies of real industrial processes are often hidden in high-dimensional data. Commonly used dimensionality reduction methods include principal component analysis (PCA) [21], partial least squares (PLS) [22], local retained projection (LPP) [23], and other multivariate statistical monitoring methods. These methods project the highly correlated high-dimensional process data into the low-dimensional subspace by selecting representative principal elements, but this way destroys the global structure of the original data matrix. In addition, traditional process monitoring methods often build global monitoring model and ignore the local behavior of industrial processes. In order to carry out regional and refined process monitoring, distributed process monitoring technologies have emerged in the era of big data [24]. Distributed process monitoring technologies combine the prior knowledge of industrial process to divide the high-dimensional data into blocks, and build monitoring sub-models for each block, then fuse the monitoring results of sub-blocks to the global monitoring results. For example, Zhu et al. [25] decomposed large-scale processes into distributed blocks with prior process knowledge, and proposed a distributed parallel data processing strategy based on MapReduce framework. Xu et al. [26] proposed a distributed principal component analysis method for angle-relevant variable selection, and realized whole plant process monitoring by the reduction of process variables and the extraction of potential features. Huang et al. [27] proposed a distributed dictionary learning (DDL) method for fault detection and fault isolation to achieve efficient industrial process monitoring. Thus, existing process monitoring methods for high-dimensional data can be divided into two categories, including projected methods and distributed learning methods. The schematic diagram of the two methods is shown in Figure 1. In fact, industrial process data are always contaminated by noise or outliers, which adversely affects process monitoring efficiency. Therefore, robust process monitoring methods are particularly important.

To solve the problem of noise and outliers in high-dimensional industrial process data, a process monitoring method based on distributed robust dictionary pair learning (DRDPL) is proposed in this article. The main contributions of this article are described as follows. The global system is divided into several sub-blocks with the reliable prior knowledge of industrial process, which achieves dimensionality reduction and reduces process complexity. A robust dictionary pair learning (RDPL) method is proposed to build local process monitoring models for each sub-block. A sparse metric based on

l_{2, 1}

norm is used to encode the reconstruction errors so as to avoid the costly computation of

l_{0}

and

l_{1}

norm. To reduce the interferences of noise and outliers, a sparse constraint with

l_{2, 1}

norm is added to the analytical dictionary, and a low rank constraint is applied to the synthetical dictionary, providing a robust dictionary pair. Then, Bayesian inference method is introduced to fuse local monitoring information for global anomaly detection, and the block contribution index and variable contribution index are used to realize anomaly isolation, obtaining the interpretable location results of anomaly sources.

The remainder of this article is organized as follows. Section 2 describe the proposed method in detail. Section 3 presents a numerical simulation experiment, Tennessee Eastman benchmark tests, and a real-world aluminum electrolysis process monitoring. Finally, the conclusion is given in Section 4.

2. Methodology

In this section, we will introduce the process monitoring method based on distributed robust dictionary pair learning (DRDPL) in detail. The proposed method is mainly divided into three stages, including distributed robust dictionary pair learning (RDPL), anomaly detection, anomaly isolation. In the distributed robust dictionary pair learning stage, training samples are divided in dimensions with the prior process knowledge. RDPL sub-models are built for each sub-block, learning robust synthetical dictionary, robust analytical dictionary, and control threshold. In the stage of anomaly detection, testing samples are divided into blocks with the division of training samples, then the local monitoring information obtained by the learned synthetical dictionary and analytical dictionary is fused to the global one based on Bayesian inference, so as to detect whether the testing sample is abnormal. In the anomaly isolation stage, the contribution plot method is used to define the block contribution index and variable contribution index to locate abnormal sources, where the block contribution index is used to achieve block-level anomaly isolation, and the accurate anomaly location of variables in each block is realized by the variable contribution index. The schematic diagram of the proposed process monitoring method is shown in Figure 2.

2.1. Distributed Robust Dictionary Pair Learning

To solve the deficiency of traditional dictionary learning, dictionary pair learning combines synthetical dictionary and analytical dictionary to reduce the computational burden of

l_{0}

or

l_{1}

norm constraint and enhance the reconstruction ability of dictionary learning [18]. The structure diagram of dictionary pair learning is shown in Figure 3, and its general model is formulated as follows:

\begin{matrix} \{D^{*}, P^{*}\} = arg min_{D, P} \{{∥X - D P X∥}_{F}^{2} + ψ (D, P, X, Y)\} \end{matrix}

(1)

where

X \in R^{p \times n}

is p-dimensional data matrix,

D \in R^{p \times m}

represents the synthetical dictionary,

P \in R^{m \times p}

is the analytical dictionary, m is the number of dictionary atoms.

{∥X - D P X∥}_{F}^{2}

denotes the reconstruction error term of dictionary pair learning,

ψ (D, P, X, Y)

are some discriminative functions, and Y stands for the label matrix of X. In the dictionary pair learning model, the representation coefficients A can be obtained by linear projection instead of nonlinear sparse coding with

l_{0}

or

l_{1}

norm. That is, we can learn an analytical dictionary P, such that A can be analytically obtained as

A = P X \in R^{m \times n}

. Based on this, dictionary pair learning method learns such an analytical dictionary P together with the synthetical dictionary D, then the data matrix X can be reconstructed by D, P, and X, i.e.,

X \approx D P X

, where D is used to reconstruct X, and P is applied to analytically code X.

Dictionary pair learning described above has been improved and used in industrial process monitoring. However, the process data collected in practical industrial systems often contain noise and outliers, which bring difficulties to process monitoring. Figure 4 shows the impact of noise and outliers on process monitoring results, where

α

is the control threshold, a value lower than

α

is considered normal, while a value higher than

α

is detected as an anomaly. From Figure 4a, we can observe that normal samples and abnormal samples can be correctly detected without the interferences of noise and outlier. In Figure 4b, we can see that when process data contains noise and outliers, false positives will appear in abnormal detection results of process monitoring.

Therefore, to address the problem of process monitoring performance degradation caused by outliers and noise, we propose a robust dictionary pair learning method for industrial process monitoring. In addition, the proposed process monitoring method based on distributed robust dictionary pairs is developed for high-dimensional process data. The prior process knowledge is used to divide the training samples into blocks in the dimension direction, i.e.,

X = {[X_{1}, \dots, X_{K}, \dots, X_{N}]}^{T}

. The RDPL model of the Kth block in training samples

X_{K} \in R^{d_{K} \times n}

is denoted as follows:

\begin{matrix} arg min_{D_{K}, P_{K}} \{{∥X_{K}^{T} - X_{K}^{T} P_{K} D_{K}^{T}∥}_{2, 1} + α {∥P_{K}∥}_{2, 1} + β r a n k (D_{K})\}, s . t . P_{K}^{T} X_{K} \geq 0 \end{matrix}

(2)

where

D_{K} \in R^{d_{K} \times m}

represents the synthetical dictionary and

P_{K} \in R^{d_{K} \times m}

represents the analytical dictionary for the Kth block, respectively. m denotes the number of dictionary atoms. The first term is the reconstruction function of data, the second term is the sparse regularization of analytical dictionary, and the third term is the low-rank constraint of synthetical dictionary.

α

and

β

are the positive parameters used to balance the terms. Besides, the constraint

P_{K}^{T} X_{K} \geq 0

is imposed to ensure that the coding coefficient

P_{K}^{T} X_{K}

is non-negative.

By introducing analytical coding matrix

A_{K}

, the non-convex problem of Equation (2) is relaxed and transformed into the optimization function as follows:

\begin{matrix} arg min_{D_{K}, P_{K}, A_{K}} \{{∥X_{K}^{T} - A_{K}^{T} D_{K}^{T}∥}_{2, 1} + λ {∥A_{K}^{T} - X_{K}^{T} P_{K}∥}_{2, 1} + α {∥P_{K}∥}_{2, 1} + β r a n k (D_{K})\}, s . t . A_{K} \geq 0 \end{matrix}

(3)

where

A_{K} \approx P_{K}^{T} X_{K}

, and

λ

is a scalar constant. The optimization of the objective function in Equation (3) is conducted in the following steps.

Numbered lists can be added as follows:

(1): Fix $D_{K}$ and $P_{K}$ , update $A_{K}$

Firstly, we fix the synthetical dictionary

D_{K}

and the analytical dictionary

P_{K}

, the problem with respect to the analytical coding matrix

A_{K}

can be reformulated as follows:

\begin{matrix} arg min_{A_{K}} \{{∥X_{K}^{T} - A_{K}^{T} D_{K}^{T}∥}_{2, 1} + λ {∥A_{K}^{T} - X_{K}^{T} P_{K}∥}_{2, 1}\}, s . t . A_{K} \geq 0 \end{matrix}

(4)

Based on the definition of

l_{2, 1}

norm [28], we have

{∥X_{K}^{T} - A_{K}^{T} D_{K}^{T}∥}_{2, 1} = 2 t r [(X_{K} - D_{K} A_{K}) \cdot

U_{K} (X_{K}^{T} - A_{K}^{T} D_{K}^{T})]

, where

U_{K}

is a diagonal matrix with the

(i, i)

th diagonal entries

U_{K}^{i i} = 1 / [2 {∥{(X_{K}^{T} - A_{K}^{T} D_{K}^{T})}^{i}∥}_{2}]

,

{(X_{K}^{T} - A_{K}^{T} D_{K}^{T})}^{i}

is the ith row vector of

X_{K}^{T} - A_{K}^{T} D_{K}^{T}

. In fact, since

{∥{(X_{K}^{T} - A_{K}^{T} D_{K}^{T})}^{i}∥}_{2}

may be equal to 0, we approximate

2 {∥{(X_{K}^{T} - A_{K}^{T} D_{K}^{T})}^{i}∥}_{2} + τ

instead.

τ

is a small value to avoid singular values and to make the inversion more stable. Similarly,

{∥A_{K}^{T} - X_{K}^{T} P_{K}∥}_{2, 1} = 2 t r [(A_{K} - P_{K}^{T} X_{K}) V_{K} (A_{K}^{T} - X_{K}^{T} P_{K})]

, where

V_{K}

is a diagonal matrix with the

(i, i)

th diagonal entries

V_{K}^{i i} = 1 / [2 {∥{(A_{K}^{T} - X_{K}^{T} P_{K})}^{i}∥}_{2}]

,

{(A_{K}^{T} - X_{K}^{T} P_{K})}^{i}

is the ith row vector of

A_{K}^{T} - X_{K}^{T} P_{K}

. We use

2 {∥{(A_{K}^{T} - X_{K}^{T} P_{K})}^{i}∥}_{2} + τ

to approximate

2 {∥{(A_{K}^{T} - X_{K}^{T} P_{K})}^{i}∥}_{2}

. Then, the problem with respect to

A_{K}

can be reformulated as follows:

\begin{matrix} arg min_{A_{K}} \{\begin{matrix} 2 t r [(X_{K} - D_{K} A_{K}) U_{K} (X_{K}^{T} - A_{K}^{T} D_{K}^{T})] \\ + 2 λ t r [(A_{K} - P_{K}^{T} X_{K}) V_{K} (A_{K}^{T} - X_{K}^{T} P_{K})] \end{matrix}\}, s . t . A_{K} \geq 0 \end{matrix}

(5)

Let

ψ_{K, r c}

be the Lagrange multiplier for

A_{K, r c} \geq 0

and

Ψ = [ψ_{K, r c}]

[20], the Lagrange function

ζ

can be deduced as follows:

\begin{matrix} \begin{matrix} ζ = 2 t r [(X_{K} - D_{K} A_{K}) U_{K} (X_{K}^{T} - A_{K}^{T} D_{K}^{T})] \\ + 2 λ t r [(A_{K} - P_{K}^{T} X_{K}) V_{K} (A_{K}^{T} - X_{K}^{T} P_{K})] + t r (Ψ A_{K}^{T}) \end{matrix} \end{matrix}

(6)

The partial derivatives of

ζ

with respect to

A_{K}

in Equation (6) are computed as follows:

\begin{matrix} \frac{\partial ζ}{\partial A_{K}} = - 4 D_{K}^{T} X_{K} U_{K} + 4 D_{K}^{T} D_{K} A_{K} U_{K} + 4 λ A_{K} V_{K} - 4 λ P_{K}^{T} X_{K} V_{K} + Ψ \end{matrix}

(7)

By the definition of KTT condition [29], the equation with respect to

A_{K, r c}

is obtained as follows:

\begin{matrix} \begin{matrix} - {(4 D_{K}^{T} X_{K} U_{K})}_{r c} A_{K, r c} + 4 {(D_{K}^{T} D_{K} A_{K} U_{K})}_{r c} A_{K, r c} + 4 λ {(A_{K} V_{K})}_{r c} A_{K, r c} \\ - 4 λ {(P_{K}^{T} X_{K} V_{K})}_{r c} A_{K, r c} = 0 \end{matrix} \end{matrix}

(8)

Thus, we can obtain that the elements in rth row and cth column of

A_{K}

are updated as follows:

\begin{matrix} A_{K, r c}^{t + 1} \leftarrow A_{K, r c}^{t} \frac{{(D_{K}^{t T} X_{K} U_{K}^{t} + λ P_{K}^{t T} X_{K} V_{K}^{t})}_{r c}}{{(D_{K}^{t T} D_{K}^{t} A_{K}^{t} U_{K}^{t} + λ A_{K}^{t} V_{K}^{t})}_{r c}} \end{matrix}

(9)

(2): Fix $A_{K}$ and $D_{K}$ , update $P_{K}$

Secondly, after the analytical coding matrix

A_{K}

is updated, we can update the analytical dictionary

P_{K}

. By removing the terms that irrelevant to

P_{K}

, the problem in Equation (3) is reformulated as follows:

\begin{matrix} arg min_{P_{K}} \{λ {∥A_{K}^{T} - X_{K}^{T} P_{K}∥}_{2, 1} + α {∥P_{K}∥}_{2, 1}\} \end{matrix}

(10)

Similarly, we have

{∥P_{K}∥}_{2, 1} = 2 t r (P_{K}^{T} M_{K} P_{K})

, where

M_{K}

is a diagonal matrix with the

(i, i)

th diagonal entries

M_{K}^{i i} = 1 / [2 {∥P_{K}^{i}∥}_{2}]

,

P_{K}^{i}

is the ith row vector of

P_{K}

. We use

2 {∥P_{K}^{i}∥}_{2} + τ

to approximate

2 {∥P_{K}^{i}∥}_{2}

. Then, the problem with respect to

P_{K}

can be converted as follows:

\begin{matrix} arg min_{P_{K}} \{2 λ t r [(A_{K} - P_{K}^{T} X_{K}) V_{K} (A_{K}^{T} - X_{K}^{T} P_{K})] + 2 α t r (P_{K}^{T} M_{K} P_{K})\} \end{matrix}

(11)

Let the partial derivative of Equation (11) with respect to

P_{K}

be 0, and we can obtain the closed-form solution of

P_{K}

as follows:

\begin{matrix} P_{K}^{t + 1} = {(λ X_{K} V_{K}^{t} X_{K}^{T} + α M_{K}^{t} + τ I)}^{- 1} \cdot (λ X_{K} V_{K}^{t} A_{K}^{(t + 1) T}) \end{matrix}

(12)

(3): Fix $A_{K}$ and $P_{K}$ , update $D_{K}$

Finally, after the analytical dictionary

P_{K}

is calculated, we can update the synthetical dictionary

D_{K}

. By removing the terms that irrelevant to

D_{K}

, the problem with respect to

D_{K}

is expressed as follows:

\begin{matrix} arg min_{D_{K}} \{{∥X_{K}^{T} - A_{K}^{T} D_{K}^{T}∥}_{2, 1} + β r a n k (D_{K})\} \end{matrix}

(13)

Obviously, the optimization problem in Equation (13) is an NP-hard problem. Therefore, we use the low-rank function with the nuclear norm constraint to relax the optimization problem as follows [30]:

\begin{matrix} arg min_{D_{K}} \{{∥X_{K}^{T} - A_{K}^{T} D_{K}^{T}∥}_{2, 1} + β {∥D_{K}∥}_{*}\} \end{matrix}

(14)

where

{∥D_{K}∥}_{*}

is the nuclear of

D_{K}

. To reduce the computational complexity, we use

{∥R_{K}∥}_{F}^{2} + {∥S_{K}∥}_{F}^{2}

to replace

∥D_{K}∥_{*}

[31], where

R_{K} \in R^{d_{K} \times d_{K}}

and

S_{K} \in R^{d_{K} \times m}

. Thus, the optimization problem of Equation (14) can be converted as follows:

\begin{matrix} arg min_{D_{K}} \{{∥X_{K}^{T} - A_{K}^{T} D_{K}^{T}∥}_{2, 1} + \frac{β}{2} ({∥R_{K}∥}_{F}^{2} + {∥S_{K}∥}_{F}^{2})\}, s . t . D_{K} = R_{K} S_{K} \end{matrix}

(15)

We use the inexact ALM algorithm [30] to solve the optimization problem in Equation (15), and the augmented Lagrange function is formulated as follows:

\begin{matrix} L = {∥X_{K}^{T} - A_{K}^{T} D_{K}^{T}∥}_{2, 1} + \frac{β}{2} ({∥R_{K}∥}_{F}^{2} + {∥S_{K}∥}_{F}^{2}) + \frac{γ}{2} {∥D_{K} - R_{K} S_{K}∥}_{F}^{2} + 〈μ, D_{K} - R_{K} S_{K}〉 \end{matrix}

(16)

where

γ > 0

is a penalty parameter, and

μ

is a Lagrange multiplier.

To solve the optimization problem in Equation (16), we minimize the augmented Lagrange function by iterative updating as follows:

(1): Fix $R_{K}$ and $S_{K}$ , update $D_{K}$

By removing the terms of Equation (16) that irrelevant to

D_{K}

, the optimization problem with respect to

D_{K}

can be reformulated as follows:

\begin{matrix} arg min_{D_{K}} {∥X_{K}^{T} - A_{K}^{T} D_{K}^{T}∥}_{2, 1} + \frac{γ}{2} {∥D_{K} - R_{K} S_{K}∥}_{F}^{2} + 〈μ, D_{K} - R_{K} S_{K}〉 \end{matrix}

(17)

Let the partial derivative of Equation (17) with respect to

D_{K}

be 0, and we can update the synthetical dictionary

D_{K}

as follows:

\begin{matrix} D_{K}^{*} = (4 X_{K} U_{K} A_{K}^{T} + R_{K} S_{K} - μ I) \cdot {(4 A_{K} U_{K} A_{K}^{T} + γ I)}^{- 1} \end{matrix}

(18)

(2): Fix $D_{K}$ and $S_{K}$ , update $R_{K}$

By removing the terms of Equation (16) that irrelevant to

R_{K}

, the optimization problem with respect to

R_{K}

can be reformulated as follows:

\begin{matrix} arg min_{R_{K}} \frac{β}{2} {∥R_{K}∥}_{F}^{2} + \frac{γ}{2} {∥D_{K} - R_{K} S_{K}∥}_{F}^{2} + 〈μ, D_{K} - R_{K} S_{K}〉 \end{matrix}

(19)

Let the partial derivative of Equation (19) with respect to

R_{K}

be 0, and we can update the variable matrix

R_{K}

as follows:

\begin{matrix} R_{K}^{*} = (γ D_{K} S_{K}^{T} + μ S_{K}^{T}) \cdot {(β I + γ S_{K} S_{K}^{T})}^{- 1} \end{matrix}

(20)

(3): Fix $D_{K}$ and $R_{K}$ , update $S_{K}$

By removing the terms of Equation (16) that irrelevant to

S_{K}

, the optimization problem with respect to

S_{K}

can be reformulated as follows:

\begin{matrix} arg min_{S_{K}} \frac{β}{2} {∥S_{K}∥}_{F}^{2} + \frac{γ}{2} {∥D_{K} - R_{K} S_{K}∥}_{F}^{2} + 〈μ, D_{K} - R_{K} S_{K}〉 \end{matrix}

(21)

Let the partial derivative of Equation (21) with respect to

S_{K}

be 0, and we can update the variable matrix

S_{K}

as follows:

\begin{matrix} S_{K}^{*} = {(β I + γ R_{K}^{T} R_{K})}^{- 1} \cdot (γ R_{K}^{T} D_{K} + μ R_{K}^{T}) \end{matrix}

(22)

Thus, the iterative updating process of the synthetical dictionary

D_{K}

is summarized as follows:

\begin{matrix} \{\begin{matrix} D_{K}^{(r + 1)} = (4 X_{K} U_{K} A_{K}^{T} + R_{K}^{r} S_{K}^{r} - μ I) \cdot {(4 A_{K} U_{K} A_{K}^{T} + γ I)}^{- 1} \\ R_{K}^{(r + 1)} = (γ D_{K}^{(r + 1)} S_{K}^{r T} + μ S_{K}^{r T}) \cdot {(β I + γ S_{K}^{r} S_{K}^{r T})}^{- 1} \\ S_{K}^{(r + 1)} = {(β I + γ R_{K}^{(r + 1) T} R_{K}^{(r + 1)})}^{- 1} \cdot (γ R_{K}^{(r + 1) T} D_{K}^{(r + 1)} + μ R_{K}^{(r + 1) T}) \end{matrix} \end{matrix}

(23)

To fully introduce the proposed method, Algorithm 1 describes the optimization of RDPL, which stops optimizing each variable when the algorithm reaches the maximum iteration T.

Algorithm 1 Robust Dictionary Pair Learning

1:: Input: The training samples $X_{K}$ of the Kth sub-block, the parameters $α$ , $β$ , $λ$ , $γ$ , $μ$ and $τ$ .
2:: Step 1: Initialize the synthetical dictionary $D_{K}^{(0)}$ and the analytical dictionary $P_{K}^{(0)}$ as random matrixes with unit Frobenius norm, set $t = 0$ .
3:: Step 2: Repeat until $t > T - 1$ ;
4:: Step 2.1: Fix the analytical dictionary $P_{K}$ and the synthetical dictionary $D_{K}$ , update the analytical coding matrix $A_{K}^{(t + 1)}$ by Equation (9);
5:: Step 2.2: Fix the synthetical dictionary $D_{K}$ and the analytical coding matrix $A_{K}$ , update the analytical dictionary $P_{K}^{(t + 1)}$ by Equation (12);
6:: Step 2.3: Fix analytical coding matrix $A_{K}$ and the analytical dictionary $P_{K}$ , update the synthetical dictionary $D_{K}^{(t + 1)}$ by Equation (23);
7:: Step 2.4: Set $t = t + 1$ .
8:: Output: The analytical dictionary $P_{K}^{*}$ and the synthetical dictionary $D_{K}^{*}$ of the Kth sub-block.

By building RDPL model through Algorithm 1, we can calculate the reconstruction error of training samples in the Kth sub-block as follows:

\begin{matrix} E_{K} = {∥X_{K} - D_{K}^{*} P_{K}^{* T} X_{K}∥}_{F}^{2} \end{matrix}

(24)

Then, the control threshold

C_{K}

of the Kth sub-block can be obtained by the kernel density estimation (KDE) method [32], and the univariate kernel density estimation is conducted as follow:

\begin{matrix} f_{H} (x) = \frac{1}{M H} \sum_{i = 1}^{M} K (\frac{x - E_{K}^{i}}{H}) \end{matrix}

(25)

where x represents the data point under consideration, M is the number of training samples, H represents the bandwidth,

E_{K}^{i}

is the reconstruction error of the th sample in the Kth sub-block, and

K (\cdot)

is the uniform kernel function.

2.2. Bayesian Inference Based Anomaly Detection

Testing samples

Y_{n e w}

are divided into

Y_{n e w} = [y_{1}, \dots, y_{K}, \dots, y_{N}]

by the means of division in training samples, and the reconstruction error of the Kth sub-block in testing samples is calculated according to the corresponding RDPL model as follows:

\begin{matrix} E_{y_{K}} = {∥y_{K} - D_{K}^{*} P_{K}^{* T} y_{K}∥}_{F}^{2} \end{matrix}

(26)

To fuse the local monitoring statistics of sub-blocks to the global monitoring information in industrial processes, the Bayesian inference method [33] is introduced to convert the reconstruction error

E_{y_{K}}

of sub-block

y_{K}

in testing samples into normal possibility

P_{K} (y_{K} |N)

and anomaly possibility

P_{K} (y_{K} |A)

, which are expressed as follows:

\begin{matrix} P_{K} (y_{K} |N) = e^{- \frac{E_{y_{K}}}{C_{K}}} \end{matrix}

(27)

\begin{matrix} P_{K} (y_{K} |A) = e^{- \frac{C_{K}}{E_{y_{K}}}} \end{matrix}

(28)

The conditional probability of normal sub-blocks and the conditional probability of abnormal sub-blocks are defined as

P_{K} (N)

and

P_{K} (A)

by significance level

α

, respectively, i.e.,

P_{K} (N) = 1 - α

and

P_{K} (A) = α

. And the posterior probability of abnormal sub-blocks based on Bayesian inference method is calculated as follows:

\begin{matrix} P_{K} (A |y_{K}) = \frac{P_{K} (y_{K} |A) P_{K} (A)}{P_{K} (y_{K} |A) P_{K} (A) + P_{K} (y_{K} |N) P_{K} (N)} \end{matrix}

(29)

Then, the global anomaly index (GAI) is defined to fuse local statistical information to global state, which is expressed as

\begin{matrix} G A I = \sum_{K = 1}^{N} \frac{P_{K} (y_{K} |A) p_{K} (A |y_{K})}{\sum_{K = 1}^{N} P_{K} (y_{K} |A)} \end{matrix}

(30)

where N is the number of sub-blocks in testing samples. If a new testing sample

y_{n e w}

satisfies

G A I_{n e w} < α

, it regards as normal, otherwise it regards as anomaly.

2.3. Contribution Index Based Anomaly Isolation

For the detected abnormal samples, we need to further locate abnormal sources. The location of the anomaly is found by the method of locating the abnormal block based on counting time [27], that is, when the posteriori probability of the abnormal block exceeds the significance level, the block anomaly flag (BAF) is set to 1, and BAF is defined as

\begin{matrix} B A F_{K}^{h} = \{\begin{matrix} 1, \begin{matrix} P_{K} (A |y_{K}^{h}) \geq α \end{matrix} \\ 0, \begin{matrix} Otherwise \end{matrix} \end{matrix} \end{matrix}

(31)

where

y_{K}^{h}

represents the hth abnormal sample of the Kth block. To ensure the reliability of abnormal block isolation, block anomaly index (BAI) and block contribution index (BCI) are defined as follows:

\begin{matrix} B A I_{K} = \sum_{h = 1}^{H} B A F_{K}^{h} \end{matrix}

(32)

\begin{matrix} B C I_{K} = \frac{B A I_{K}}{\sum_{i = 1}^{N} B A I_{K}} \end{matrix}

(33)

where H is the number of abnormal samples.

In industrial process monitoring, contribution plot method [34] has become a common method for anomaly isolation. On the basis of locating the abnormal block, the contribution plot method is used to locate the abnormal variable to realize anomaly isolation accurately. Suppose that the synthetical dictionary and the analytical coding matrix of the abnormal sample

y_{K}^{h}

are defined as

D_{K}

and

A_{K}^{h}

, respectively, the abnormal sample

y_{K}^{h}

can be expressed as follows:

\begin{matrix} y_{K}^{h} = D_{K} A_{K}^{h} + f = [D_{K}, I] [\begin{matrix} A_{K}^{h} \\ f \end{matrix}] \end{matrix}

(34)

where

I \in R^{s \times s}

is an identity matrix, and s is the number of variables in the Kth sub-block. The non-zero terms of the vector f represent the position and size of the anomaly source. To more clearly represent the anomaly source, the augmented synthetical dictionary is defined as

{\bar{D}}_{K} = [D_{K}, I]

, so the new analytical coding matrix of the anomaly sample

y_{K}^{h}

under

{\bar{D}}_{K}

is calculated as follows:

\begin{matrix} A_{K, n e w}^{h} = arg min_{A_{K}^{h}} {∥y_{K}^{h} - {\bar{D}}_{K} A_{K}^{h}∥}_{F}^{2} \end{matrix}

(35)

Then, the abnormal sample is reformulated as

y_{K}^{h} = {\bar{D}}_{K} A_{K, n e w}^{h}

. In addition, the vector f can be replaced by

S A_{K, n e w}^{h}

, where

S = [O, I] \in R^{s \times (d + s)}

, and O is the zero matrix. The variable contribution (VC) of the jth variable in the Kth block is defined by contribution plot method, which can be calculated as follows:

\begin{matrix} V C_{K}^{j} = \sum_{h = 1}^{H} {[e_{j} S A_{K, n e w}^{h}]}^{2} \end{matrix}

(36)

And the corresponding variable contribution index (VCI) is expressed as

\begin{matrix} V C I_{K}^{j} = \frac{V C_{K}^{j}}{\sum_{i = 1}^{s} V C_{K}^{i}} \end{matrix}

(37)

where

e_{j} = [{\underset{︸}{0, \dots, 0}}_{j - 1}, 1 {\underset{︸}{0, \dots, 0}}_{s - j}]

is an identity matrix.

3. Experiments

To verify the effectiveness of the proposed method in industrial process monitoring, a numerical simulation experiment is designed and Tennessee Eastman (TE) benchmark tests are carried out, and then the proposed method is applied in a real-world aluminum electrolysis industrial process. Besides, the proposed method is compared with several common methods, including robust PCA (rPCA) [35], distributed PCA (DPCA) [36], KSVD [16], DDL [27] and DPL [18]. Meanwhile, Training Time, Testing Time, false alarm rate (FAR), and fault detection rate (FDR) are considered to quantitatively evaluate the performance of different process monitoring methods [24]. To ensure that the comparison is fair, we use the original codes of the comparison methods directly. The number of dictionary atoms is set to 50 for all dictionary learning methods. For the proposed PRDPL method, the optimal parameters of

α

,

β

and

λ

are selected from the candidate set

\{10^{- 6}, 10^{- 5}, 10^{- 4}, 10^{- 3}, 10^{- 2}, 10^{- 1}, 10^{0}, 10^{1}, 10^{2}\}

. With each group of parameters, twenty groups of samples are randomly collected to test. The parameter value corresponding to the highest average FDR is recorded as the parameter setting value. For all comparison methods, we obtain the optimal parameters from the original paper proposing the comparison methods, or adopt the same setting strategy of optimal parameters as our proposed method. Besides, when the comparison method adopts the optimal parameters from the original paper, we also randomly collect 20 groups of samples for the experiment, and take the average value as the experimental result. Meanwhile, the kernel density estimation method is used to obtain the control thresholds of all dictionary learning methods in this paper, where all parameters are set consistently. For rPCA and DPCA, we use the method provided in the original paper to obtain the control threshold. Moreover, the significance level is set to 0.05, which is common to all methods.

3.1. Numerical Simulation Experiment

Firstly, to verify the effectiveness of the proposed method, a linear system for generating high-dimensional data is introduced as follows [27]:

\begin{matrix} X = A s + e \end{matrix}

(38)

where

A \in R^{8 \times 2}

represents a random observation matrix, s is a state vector containing two independent variables, and e is a noise vector composed of eight independent Gaussian noises with zero mean and the standard deviation of 0.01. Four different state vectors are designed to simulate different operation units in the process, which is used as prior process knowledge to divide into four sub-blocks. Thus, the state vectors are expressed as follows:

\begin{matrix} Block 1 s_{1} : U (2, 3) s_{2} : N (7, 1) \\ Block 2 s_{1} : 2 cos (0.08 t) \cdot sin (0.006 t) s_{2} : N (2, 0.1) \\ Block 3 s_{1} : 2 cos (0.08 t) \cdot sin (0.006 t) s_{2} : U (- 1, 1) \\ Block 4 s_{1} : U (- 1, 1) s_{2} : N (2, 0.1) \end{matrix}

Then, the aforementioned system is used to generate 32-dimensional process data, and 2000 data are collected as training samples and 300 data are collected as normal testing samples. In addition, a bias fault of 2 is added to the third dimension of the first block, and 500 data are collected as abnormal testing samples. First, we divide the training samples in dimension according to the generation relation, that is, the 32-dimensional data are divided into four sub-blocks of 8-dimensional data. Then, RDPL sub-models are built for each sub-block to learn robust synthetical dictionary and robust analytical dictionary, see robust dictionary pair learning in Algorithm 1. Next, the reconstruction errors are calculated to obtain the control threshold. Finally, we implement anomaly detection and anomaly isolation by GAI defined in Equation (30), BCI defined in Equation (33), and VCI defined in Equation (37). For the diagram of the above method, refer to Figure 2.

The process monitoring results of all methods in the numerical simulation experiment are shown in Figure 5, and the quantitative monitoring results of each method are shown in Table 1. As can be seen from the experimental results, The FDRs of DDL method, DPL method, and DRDPL method all reached

100 %

. Meanwhile, DRDPL method performs best in process monitoring, and its FAR is as low as 0. The FDRs of

T^{2}

statistics in rPCA method and DPCA method are up to

100 %

, but their

S P E

statistics have a lower FDR in process monitoring. In addition, the training time of rPCA method, DPL method, and DRDPL method is obviously shorter. Although DDL method has better accuracy in process monitoring, it consumes the most computational time.

The anomaly isolation results of numerical simulation experiment are shown in Figure 6. Figure 6a shows the result of locating the abnormal block by BCI, which demonstrates that the anomaly occurred in the first block. On the basis of locating abnormal blocks, Figure 6b shows that the abnormal variable is further located by VCI. The location result demonstrates that the anomaly is most likely to occur in the third variable. Therefore, the results of anomaly location ate consistent with the anomaly we set.

3.2. TE Benchmark Test

Tennessee Eastman (TE) benchmark tests are often used to validate process monitoring methods. The structure diagram of TE process is shown in Figure 7. There are mainly 5 operation units, including 12 process control variables and 41 process measurement variables [37]. Notably, 22 process measurement variables and 9 process control variables are selected as 31 process variables in TE benchmark tests, see Table A1 in Appendix A. In addition, according to the technological process of TE process, 31 variables can be divided into four blocks, which are shown in Table 2 [25].

TE process contains 28 disturbances, see Table A2 in Appendix A. 3000 data are collected as training samples and 300 data are collected as normal testing samples. Moreover, 500 data are collected under each disturbance as abnormal testing samples. Similarly, for the schematic diagram of TE process monitoring based on DRDPL, refer to Figure 2. Table 3 shows that the FDRs of all methods in TE processes with various disturbances. The results of process monitoring show that the proposed method has high FDR for most disturbances.

To further compare the process monitoring performance of the proposed method with other methods, the results of TE processes under disturbance IDV (6) and IDV (14) are shown in Figure 8 and Figure 9, respectively, and the quantitative monitoring results are presented in Table 4 and Table 5, respectively. The experimental results show that the proposed DRDPL method performs better in process monitoring. That is, rPCA method, DPCA method, K-SVD method, DDL method, and the proposed DRDPL method have high FDRs, while the FAR of our DRDPL is the lowest. Meanwhile, the training time of DPCA method and DDL method are significantly more than that of rPCA method, K-SVD method, DPL method, and the proposed DRDPL method. Besides, DPL method has the excellent performance of process monitoring in TE process with IDV (6), but its FDR in TE process with IDV (14) is only

2.60 %

. Thus, the stability in process monitoring of DPL needs to be improved.

Figure 10 and Figure 11 show the results of anomaly isolation under the two disturbances, respectively. Figure 10a and Figure 11a show the results of locating abnormal blocks by BCI, and Figure 10b and Figure 11b show the results of locating abnormal variables by VCI in sub-blocks with the highest probability of anomalies. The location results show that the first dimension (

v_{1}

) of the first block is the most likely to have an anomaly in TE process monitoring with the disturbance of IDV (6), and the fifth dimension (

v_{30}

) of the second block is most likely to have an anomaly in TE process monitoring with the disturbance of IDV (14). In addition, for the results of variable isolation in other blocks of TE process, see Figure A1 in Appendix B. The results of anomaly location are basically consistent with the possible caused results of disturbances.

3.3. Aluminum Electrolysis Industrial Process Application

With the development of large-scale aluminum electrolytic cell, the complexity and uncertainty of aluminum electrolytic system become higher. Effective cell condition monitoring technology plays an important role in detecting and predicting abnormal cell condition in real time, adjusting control strategy in time, and ensuring efficient and high-quality operation of electrolytic cell [38]. In aluminum electrolysis industrial process, the dynamic behaviors of local anodes form the distributed cell state, and anode current can reflect localized cell states.

In the experiments, anode current data are obtained by the production data report of 400 kA series electrolytic cell from an aluminum electrolysis factory in Shandong Province. The structure diagram of an aluminum electrolysis cell is shown in Figure 12. The series of electrolytic cells have 12 anodes on each side, that is, aluminum electrolysis cells can produce 24-dimensional anode current data. Importantly, six adjacent anodes are divided into one node based on prior process knowledge of alumina concentration in aluminum electrolysis process. Therefore, the 24-dimensional anode current data are divided into four sub-blocks. Meanwhile, 3000 data are collected as training samples and 300 data are collected as normal testing samples. In addition, 500 data are collected under the anodic effect and anodic slip conditions as abnormal test data, respectively. Similarly, for the schematic diagram of aluminum electrolysis industrial process monitoring based on DRDPL, refer to Figure 2.

Figure 13 and Figure 14 show the monitoring results of aluminum electrolysis processes with the abnormal cell conditions of anode effect and anode slippage. Table 6 and Table 7 give corresponding quantitative monitoring results, respectively. The experimental results show that rPCA method, DPL method, and the proposed DRDPL method have outstanding advantages in computional time. In terms of process monitoring performance, the FDRs of all methods can reach

100 %

. And the FARs of KSVD method and our DRDPL method are lower than that of rPCA method, DPCA method, DDL method, and DPL method, which are only 0. Therefore, the performance of the proposed method is better than other methods in cell condition monitoring.

Figure 15 and Figure 16 show the anomaly isolation results of occurring anode effect and anode slippage, respectively. Figure 15a and Figure 16a show the results of locating abnormal blocks by BCI, and Figure 15b and Figure 16b show the results of locating abnormal variables by VCI in sub-blocks with the highest probability of anomalies. The location results show that the fourth dimension (

i_{13}

) of the first block is the most likely to have an anomaly in aluminum electrolysis processes with anode effect and anode slippage, which is consistent with the anomaly source location of the actual cell condition. In addition, for the results of variable isolation in other blocks of aluminum electrolysis industrial process, see Figure A2 and Figure A3 in Appendix B.

3.4. Fault Detection against Noisy Datasets

To test the robustness of the proposed DRDPL method, we add random Gaussian noise to datasets by

D a t a = D a t a + \sqrt{V a r i a n c e} \times r a n d n (s i z e (D a t a))

. For the numerical simulation experiment, TE process experiment with IDV (6), and aluminum electrolysis process experiment with anode effect, 20 groups of samples are randomly selected for testing, and average values are taken as the results of fault detection. The variance values are set in ranges of

{1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2.0}

,

{10, 20, 30, 40, 50, 60, 70, 80, 90, 100}

, and

{1.1 \times 10^{5}, 1.2 \times 10^{5}, 1.3 \times 10^{5}, 1.4 \times 10^{5}, 1.5 \times 10^{5}, 1.6 \times 10^{5}, 1.7 \times 10^{5}, 1.8 \times 10^{5}, 1.9 \times 10^{5}, 2.0 \times 10^{5}}

, respectively. Figure 17 shows the experimental results in the case of noise. We can find that the overall trend of FDR for each method decreases with the increase of variance. It is worth noting that our proposed DRDPL method provides higher FDR than other methods in most cases. That is, the DRDPL method is more robust to the interference of noise due to the adoption of a more reasonable mechanism.

4. Conclusions

In this article, a process monitoring method based on distributed robust dictionary pair learning (PRDPL) is proposed for anomaly detection and anomaly isolation. Firstly, the reliable prior knowledge of industrial processes is integrated into the data-driven model with block division, which is conducive to exposing small anomalies in high-dimensional data. Then, a robust dictionary pair learning (RDPL) method is proposed to build a local monitoring model for each sub-block to obtain robust dictionary pairs. Finally, Bayesian inference method is introduced to realize anomaly detection. To further find the anomaly sources, the block contribution index and variable contribution index are defined to locate abnormal blocks and abnormal variables, respectively. Thus, the applicability and reliability of the proposed method have been demonstrated in a numerical simulation, Tennessee Eastman processes and aluminum electrolysis processes. Particularly, our method performs well in anomaly detection, computation time, and robustness together. It is worth noted that the prior process knowledge used for division has not been systematized. Therefore, a more accurate block division method based on the fusion of knowledge and data is worth further study. In addition, how to obtain adaptive dictionary learning models by considering abnormal samples is a meaningful research direction.

Author Contributions

Methodology, software, writing—original draft, J.W., X.C. and Z.D.; writing—review and editing, Z.D.; formal analysis, supervision, funding acquisition, conceptualization, X.C. and H.Z.; data curation, validation, visualization, J.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the State Key Program of the National Natural Science of China under Grant 62133016, in part by the National Natural Science Foundation of China under Grants 51974373 and 51874365, and in part by the Yunnan Province Science and Technology Planning Project under Grants 202202AB080017 and 202102AB080062.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

The introduction of each variable in TE benchmark test is shown in Table A1. The descriptions of 28 kinds of disturbances in TE process are shown in Table A2.

Table A1. The process variables of TE benchmark test.

No.	Variables	No.	Variables
1	A feed (stream 1)	17	Stripper underflow (stream 11)
2	D feed (stream 2)	18	Stripper temperature
3	E feed (stream 3)	19	Stripper steam flow
4	A and C feed (stream 4)	20	Compressor work
5	Recycle flow (stream 8)	21	Reactor cooling water outlet temperature
6	Reactor feed rate (stream 6)	22	Separator cooling water outlet temperature
7	Reactor pressure	23	D feed flow (stream 2)
8	Reactor level	24	E feed flow (stream 3)
9	Reactor temperature	25	A feed flow (stream 1)
10	Purge rate (stream 9)	26	A and C feed flow (stream 4)
11	Product separator temperature	27	Purge valve (stream 9)
12	Product separator level	28	Separator pot liquid flow (stream 10)
13	Product separator pressure	29	Stripper liquid product flow (stream 11)
14	Product separator underflow (stream 10)	30	Reactor cooling water flow
15	Stripper level	31	Stripper liquid product flow (stream 11)
16	Stripper pressure

Table A2. The disturbances of TE process.

Number	Disturbance Description	Type
IDV (1)	A/C feed ratio, B composition constant (stream 4)	Step
IDV (2)	B composition, A/C ratio constant (stream 4)	Step
IDV (3)	D feed temperature (stream 2)	Step
IDV (4)	Reactor cooling water inlet temperature	Step
IDV (5)	Condenser cooling water inlet temperature	Step
IDV (6)	A feed loss (stream 1)	Step
IDV (7)	C header pressure loss-reduced availability (stream 4)	Step
IDV (8)	A, B, C feed composition (stream 4)	Random variation
IDV (9)	D feed temperature (stream 2)	D feed temperature
IDV (10)	C feed temperature (stream 4)	Random variation
IDV (11)	Reactor cooling water inlet temperature	Random variation
IDV (12)	Condenser cooling water inlet temperature	Random variation
IDV (13)	Reaction kinetics	Slow drift
IDV (14)	Reactor cooling water valve	Sticking
IDV (15)	Condenser cooling water valve	Sticking
IDV (16)	Unknown	Unknown
IDV (17)	Unknown	Unknown
IDV (18)	Unknown	Unknown
IDV (19)	Unknown	Unknown
IDV (20)	Unknown	Unknown
IDV (21)	A feed temperature (stream 1)	-
IDV (22)	E feed temperature (stream 3)	-
IDV (23)	A feed pressure (stream 1)	-
IDV (24)	D feed pressure (stream 2)	-
IDV (25)	E feed pressure (stream 3)	-
IDV (26)	A and C feed pressure (stream 4)	-
IDV (27)	Pressure fluctuation in the cooling water re-circulating unit of the reactor	-
IDV (28)	Pressure fluctuation in the cooling water re-circulating unit of the condenser	-

Appendix B

In the actual industrial production process, the purpose of anomaly isolation is to find the block with the highest probability of anomalies, so as to provide the basis for process control and decision. Therefore, the variable isolation results of TE process and aluminum electrolysis industrial process except for the most likely abnormal blocks are shown in Figure A1, Figure A2, and Figure A3, respectively.

Figure A1. The variable isolation results in TE process. (a) Block 2 with IDV (6); (b) Block 3 with IDV (6); (c) Block 4 with IDV (6); (d) Block 3 with IDV (14).

Figure A2. The variable isolation results in anode effect of aluminum electrolysis industrial process. (a) Block 2; (b) Block 3; (c) Block 4.

Figure A3. The variable isolation results in anode slippage of aluminum electrolysis industrial process. (a) Block 2; (b) Block 3; (c) Block 4.

References

Yang, F.; Cui, Y.; Wu, F.; Zhang, R. Fault Monitoring of Chemical Process Based on Sliding Window Wavelet DenoisingGLPP. Processes 2021, 9, 86. [Google Scholar] [CrossRef]
Wang, J.; Zhou, Z.; Li, Z.; Du, S. A Novel Fault Detection Scheme Based on Mutual k-Nearest Neighbor Method: Application on the Industrial Processes with Outliers. Processes 2022, 10, 497. [Google Scholar] [CrossRef]
Zhang, H.; Li, T.; Li, J.; Yang, S.; Zou, Z. Progress in aluminum electrolysis control and future direction for smart aluminum electrolysis plant. JOM 2017, 69, 292–300. [Google Scholar] [CrossRef]
Cheung, C.Y.; Menictas, C.; Bao, J.; Skyllas-Kazacos, M.; Welch, B.J. Characterization of Individual Anode Current Signals in Aluminum Reduction Cells. Ind. Eng. Chem. Res. 2013, 52, 9632–9644. [Google Scholar] [CrossRef]
Zhai, S.; Wang, W.; Ye, H. Fault diagnosis based on parameter estimation in closed-loop systems. IET Control Theory Appl. 2015, 9, 1146–1153. [Google Scholar] [CrossRef]
Chi, G.; Wang, D. Sensor Placement for Fault Isolability Based on Bond Graphs. IEEE Trans. Autom. Control 2015, 60, 3041–3046. [Google Scholar] [CrossRef]
Yue, W.; Gui, W.; Xie, Y. Experiential knowledge representation and reasoning based on linguistic Petri nets with application to aluminum electrolysis cell condition identification. Inf. Sci. 2020, 529, 141–165. [Google Scholar] [CrossRef]
Tessier, J.; Duchesne, C.; Tarcy, G.P.; Gauthier, C.; Dufour, G. Multivariate Analysis and Monitoring of the Performance of Aluminum Reduction Cells. Ind. Eng. Chem. Res. 2012, 51, 1311–1323. [Google Scholar] [CrossRef]
Huang, Z.; Yang, C.; Chen, X.; Zhou, X.; Chen, G.; Huang, T.; Gui, W. Functional deep echo state network improved by a bi-level optimization approach for multivariate time series classification. Appl. Soft Comput. 2021, 106, 107314. [Google Scholar] [CrossRef]
Huang, K.; Wu, Y.; Wang, C.; Xie, Y.; Yang, C.; Gui, W. A Projective and Discriminative Dictionary Learning for High-Dimensional Process Monitoring With Industrial Applications. IEEE Trans. Ind. Inform. 2021, 17, 558–568. [Google Scholar] [CrossRef]
Ji, C.; Sun, W. A Review on Data-Driven Process Monitoring Methods: Characterization and Mining of Industrial Data. Processes 2022, 10, 335. [Google Scholar] [CrossRef]
Huang, K.; Wei, K.; Zhou, L.; Li, Y.; Yang, C. Multimode Process Monitoring and Mode Identification Based on Multiple Dictionary Learning. IEEE Trans. Instrum. Meas. 2021, 70, 1–12. [Google Scholar] [CrossRef]
Deng, Z.; Chen, X.; Xie, S.; Xie, Y.; Zhang, H. Semi-Supervised Discriminative Projective Dictionary Pair Learning and Its Application for Industrial Process Monitoring. IEEE Trans. Ind. Inform. 2022. [Google Scholar] [CrossRef]
Zhang, H.; Chen, X.; Du, Z.; Yan, R. Kurtosis based weighted sparse model with convex optimization technique for bearing fault diagnosis. Mech. Syst. Signal Process. 2016, 80, 349–376. [Google Scholar] [CrossRef]
Du, Z.; Chen, X.; Zhang, H.; Yan, R. Sparse Feature Identification Based on Union of Redundant Dictionary for Wind Turbine Gearbox Fault Diagnosis. IEEE Trans. Ind. Electron. 2015, 62, 6594–6605. [Google Scholar] [CrossRef]
Aharon, M.; Elad, M.; Bruckstein, A. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. 2006, 54, 4311–4322. [Google Scholar] [CrossRef]
Jiang, Z.; Lin, Z.; Davis, L.S. Label Consistent K-SVD: Learning a Discriminative Dictionary for Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 2651–2664. [Google Scholar] [CrossRef]
Gu, S.; Zhang, L.; Zuo, W.; Feng, X. Projective dictionary pair learning for pattern classification. Adv. Neural Inf. Process. Syst. 2014, 27, 793–801. [Google Scholar]
Zhang, Z.; Sun, Y.; Wang, Y.; Zhang, Z.; Zhang, H.; Liu, G.; Wang, M. Twin-Incoherent Self-Expressive Locality-Adaptive Latent Dictionary Pair Learning for Classification. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 947–961. [Google Scholar] [CrossRef]
Sun, Y.; Zhang, Z.; Jiang, W.; Zhang, Z.; Zhang, L.; Yan, S.; Wang, M. Discriminative Local Sparse Representation by Robust Adaptive Dictionary Pair Learning. IEEE Trans. Neural Networks Learn. Syst. 2020, 31, 4303–4317. [Google Scholar] [CrossRef]
Li, N.; Guo, S.; Wang, Y. Weighted preliminary-summation-based principal component analysis for non-Gaussian processes. Control Eng. Pract. 2019, 87, 122–132. [Google Scholar] [CrossRef]
Liu, Q.; Qin, S.J.; Chai, T. Multiblock Concurrent PLS for Decentralized Monitoring of Continuous Annealing Processes. IEEE Trans. Ind. Electron. 2014, 61, 6429–6437. [Google Scholar] [CrossRef]
He, F.; Xu, J. A novel process monitoring and fault detection approach based on statistics locality preserving projections. J. Process Control 2016, 37, 46–57. [Google Scholar] [CrossRef]
Deng, Z.; Chen, X.; Xie, S.; Xie, Y.; Sun, Y. Distributed process monitoring based on joint mutual information and projective dictionary pair learning. J. Process Control 2021, 106, 130–141. [Google Scholar] [CrossRef]
Zhu, J.; Ge, Z.; Song, Z. Distributed Parallel PCA for Modeling and Monitoring of Large-Scale Plant-Wide Processes With Big Data. IEEE Trans. Ind. Inform. 2017, 13, 1877–1885. [Google Scholar] [CrossRef]
Xu, C.; Liu, F. Process monitoring based on distributed principal component analysis with angle-relevant variable selection. Int. J. Distrib. Sens. Netw. 2019, 15, 1–13. [Google Scholar] [CrossRef]
Huang, K.; Wu, Y.; Wen, H.; Liu, Y.; Yang, C.; Gui, W. Distributed dictionary learning for high-dimensional process monitoring. Control Eng. Pract. 2020, 98, 104386. [Google Scholar] [CrossRef]
Zhang, Z.; Jia, L.; Zhao, M.; Liu, G.; Wang, M.; Yan, S. Kernel-Induced Label Propagation by Mapping for Semi-Supervised Classification. IEEE Trans. Big Data 2019, 5, 148–165. [Google Scholar] [CrossRef]
Cai, D.; He, X.; Han, J.; Huang, T.S. Graph Regularized Nonnegative Matrix Factorization for Data Representation. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 1548–1560. [Google Scholar]
Wang, Y.; Du, H.; Zhang, Y.; Zhang, Y. Efficient and robust discriminant dictionary pair learning for pattern classification. Digit. Signal Process. 2021, 118, 103227. [Google Scholar] [CrossRef]
Mazumder, R.; Hastie, T.; Tibshirani, R. Spectral regularization algorithms for learning large incomplete matrices. J. Mach. Learn. Res. 2010, 11, 2287–2322. [Google Scholar] [PubMed]
Terrell, G.R.; Scott, D.W. Variable kernel density estimation. Ann. Stat. 1992, 20, 1236–1265. [Google Scholar] [CrossRef]
Xu, C.; Zhao, S.; Liu, F. Distributed plant-wide process monitoring based on PCA with minimal redundancy maximal relevance. Chemom. Intell. Lab. Syst. 2017, 169, 53–63. [Google Scholar] [CrossRef]
Jiang, Q.; Yan, X. Monitoring multi-mode plant-wide processes by using mutual information-based multi-block PCA, joint probability, and Bayesian inference. Chemom. Intell. Lab. Syst. 2014, 136, 121–137. [Google Scholar] [CrossRef]
Candès, E.J.; Li, X.; Ma, Y.; Wright, J. Robust principal component analysis? J. ACM 2011, 58, 1–37. [Google Scholar] [CrossRef]
Ge, Z.; Song, Z. Distributed PCA model for plant-wide process monitoring. Ind. Eng. Chem. Res. 2013, 52, 1947–1957. [Google Scholar] [CrossRef]
Bathelt, A.; Ricker, N.L.; Jelali, M. Revision of the Tennessee Eastman Process Model. IFAC-PapersOnLine 2015, 48, 309–314. [Google Scholar] [CrossRef]
Zeng, Z.; Gui, W.; Chen, X.; Xie, Y.; Zhang, H.; Sun, Y. A Cell Condition-Sensitive Frequency Segmentation Method Based on the Sub-Band Instantaneous Energy Spectrum of Aluminum Electrolysis Cell Voltage. Engineering 2021, 7, 1282–1292. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of the two process monitoring methods for high-dimensional data. (a) Projected methods; (b) Distributed learning methods.

Figure 2. Schematic diagram of the process monitoring method based on distributed robust dictionary pair learning.

Figure 3. Structure diagram of dictionary pair learning.

Figure 4. Illustration of the effects of noise and outliers in process monitoring. (a) is the training data without noise and outliers; (b) is the training data containing noise and outliers.

Figure 5. The process monitoring results of numerical simulation experiment. (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 5. The process monitoring results of numerical simulation experiment. (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 6. The anomaly isolation results in numerical simulation experiment. (a) BCI; (b) VCI.

Figure 7. Structural schematic diagram of TE process.

Figure 8. The process monitoring results of TE process with IDV (6). (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 8. The process monitoring results of TE process with IDV (6). (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 9. The process monitoring results of TE process with IDV (14). (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 9. The process monitoring results of TE process with IDV (14). (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 10. The anomaly isolation results in TE process with IDV (6). (a) BCI; (b) VCI-Block 1.

Figure 11. The anomaly isolation results in TE process with IDV (14). (a) BCI; (b) VCI-Block 2.

Figure 12. Structural schematic diagram of aluminum electrolysis cell.

Figure 13. The process monitoring results of anode effect in aluminum electrolysis process. (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 13. The process monitoring results of anode effect in aluminum electrolysis process. (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 14. The process monitoring results of anode slippage in aluminum electrolysis process. (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 14. The process monitoring results of anode slippage in aluminum electrolysis process. (a) rPCA-

r T^{2}

; (b) rPCA-

r S P E

; (c) DPCA-

D - T^{2}

; (d) DPCA-

D - S P E

; (e) KSVD-

R E S I

; (f) DDL-

G F I

; (g) DPL-

D R E

; (h) DRDPL-

G A I

.

Figure 15. The anomaly isolation results of anode effect in aluminum electrolysis process. (a) BCI; (b) VCI-Block 1.

Figure 16. The anomaly isolation results of anode slippage in aluminum electrolysis process. (a) BCI; (b) VCI-Block 1.

Figure 17. FDR of each method with varying variance. (a) Numerical simulation; (b) TE process with IDV (6); (c) Aluminum electrolysis process with anode effect.

Table 1. The comparison results of numerical simulation experiment.

Method	Training Time (s)	Testing Time (s)	FAR (%)	FDR (%)
rPCA ( $r T^{2}$ )	0.0114	0.0003	17.00	69.00
rPCA ( $r S P E$ )	0.0112	0.0002	2.33	100.00
DPCA ( $D - T^{2}$ )	2.3512	0.0811	1.00	50.00
DPCA ( $D - S P E$ )	2.7365	0.1406	1.00	100.00
K-SVD ( $R E S I$ )	2.6444	0.0039	0.33	43.40
DDL ( $G F I$ )	10.5511	0.0036	6.67	100.00
DPL ( $D R E$ )	0.0316	0.4884	4.00	100.00
DRDPL (GAI)	0.2702	0.0033	0.00	100.00

Table 2. The division of TE process variables.

Block	Variables	Principle of Division
1	$v_{1}, v_{2}, v_{3}, v_{5}, v_{6}, v_{23}, v_{24}, v_{25}$	Input
2	$v_{7}, v_{8}, v_{9}, v_{21}, v_{30}$	Reactor
3	$v_{10}, v_{11}, v_{12}, v_{13}, v_{14}, v_{20}, v_{22}, v_{27}, v_{28}, v_{31}$	Separator, Compressor and Condenser
4	$v_{4}, v_{15}, v_{16}, v_{17}, v_{18}, v_{19}, v_{26}, v_{29}$	Stripper

Table 3. The FDR of the proposed method and comparative methods for 28 kinds of disturbances in TE process.

Disturbance Number	rPCA $(r T^{2})$	rPCA $(rSPE)$	DPCA $(D - T^{2})$	DPCA $(D - SPE)$	K-SVD $(RESI)$	DDL $(GFI)$	DPL $(DRE)$	DRDPL $(GAI)$
1	0.9900	0.9900	1.0000	1.0000	0.9760	0.9880	0.9460	1.0000
2	0.9640	0.9660	1.0000	1.0000	0.9300	0.9560	0.9040	1.0000
3	0.9900	0.9880	1.0000	1.0000	0.9740	0.9840	0.9460	1.0000
4	0.9980	0.9980	1.0000	0.4260	0.9980	0.9980	0.0700	1.0000
5	0.9460	0.9820	1.0000	1.0000	0.9100	0.9660	0.8920	1.0000
6	0.9980	0.9980	0.9980	0.9980	0.9980	0.9980	0.9920	0.9980
7	0.9980	0.9980	1.0000	1.0000	0.9980	0.9980	0.9980	1.0000
8	0.8020	0.8320	1.0000	1.0000	0.7360	0.7960	0.7180	1.0000
9	0.9980	0.9980	1.0000	0.4000	0.9980	0.9980	0.1000	1.0000
10	0.7720	0.9260	0.9760	0.9660	0.5600	0.8700	0.0320	0.8000
11	0.9500	0.9420	1.0000	0.9760	0.9160	0.9180	0.3420	0.9880
12	0.5620	0.4420	0.9300	0.8420	0.2900	0.4340	0.0540	0.1820
13	0.9500	0.9460	1.0000	1.0000	0.9280	0.9360	0.9120	1.0000
14	0.9633	0.8000	1.0000	1.0000	0.9540	0.9680	0.0260	0.9940
15	0.9860	0.9880	1.0000	1.0000	0.9620	0.9600	0.0620	0.9880
16	0.9500	0.9460	1.0000	1.0000	0.9260	0.9340	0.9120	1.0000
17	0.9600	0.9500	1.0000	1.0000	0.9320	0.9320	0.4180	0.9320
18	0.7060	0.7720	0.9560	0.9860	0.5740	0.7520	0.2680	0.8120
19	0.9780	0.9540	1.0000	1.0000	0.8760	0.9620	0.0460	1.0000
20	0.8340	0.7940	1.0000	1.0000	0.7680	0.7880	0.7360	0.9020
21	0.8320	0.7920	1.0000	1.0000	0.7720	0.7860	0.7360	0.9300
22	0.9600	0.9360	1.0000	1.0000	0.8420	0.9460	0.0480	0.9540
23	0.7200	0.7840	0.9760	0.9960	0.5900	0.8600	0.2900	0.7360
24	0.8360	0.9160	1.0000	1.0000	0.7780	0.9300	0.4260	0.9220
25	0.7760	0.9360	0.9880	0.9800	0.5800	0.9180	0.2120	0.7040
26	0.9560	0.9600	1.0000	0.9960	0.9260	0.9600	0.2920	0.9500
27	0.9320	0.9580	0.9820	0.6300	0.8240	0.8900	0.0660	0.8280
28	0.8440	0.9100	0.9900	0.8120	0.6180	0.8260	0.0280	0.8220

Table 4. The comparison results of TE process with IDV (6).

Method	Training Time (s)	Testing Time (s)	FAR (%)	FDR (%)
rPCA ( $r T^{2}$ )	2.5695	0.0008	16.00	99.80
rPCA ( $r S P E$ )	1.0328	0.0007	7.33	99.80
DPCA ( $D - T^{2}$ )	14.9032	0.7141	22.67	99.80
DPCA ( $D - S P E$ )	14.8002	0.7019	17.00	99.80
K-SVD ( $R E S I$ )	4.7873	0.0028	2.33	99.80
DDL ( $G F I$ )	13.6185	0.0001	6.67	99.80
DPL ( $D R E$ )	0.0580	0.1415	4.00	99.20
DRDPL $(GAI)$	2.9189	0.0045	0.00	99.80

Table 5. The comparison results of TE process with IDV (14).

Method	Training Time (s)	Testing Time (s)	FAR (%)	FDR (%)
rPCA ( $r T^{2}$ )	1.2971	0.0002	19.00	99.00
rPCA ( $r S P E$ )	0.0094	0.0001	4.33	99.40
DPCA ( $D - T^{2}$ )	14.3749	0.6949	22.67	100.00
DPCA ( $D - S P E$ )	14.2242	0.6752	17.00	100.00
K-SVD ( $R E S I$ )	4.8615	0.0029	0.67	95.40
DDL ( $G F I$ )	13.8954	0.0001	6.00	96.80
DPL ( $D R E$ )	0.0518	0.1417	4.00	2.60
DRDPL $(GAI)$	1.3678	0.0011	0.00	99.40

Table 6. The comparison results of anode effect in aluminum electrolysis process.

Method	Training Time (s)	Testing Time (s)	FAR (%)	FDR (%)
rPCA ( $r T^{2}$ )	0.0152	0.0003	19.33	100.00
rPCA ( $r S P E$ )	0.0145	0.0001	7.67	100.00
DPCA ( $D - T^{2}$ )	13.3106	0.5665	22.67	100.00
DPCA ( $D - S P E$ )	13.2324	0.5647	1.00	100.00
K-SVD ( $R E S I$ )	3.6734	0.0049	0.00	100.00
DDL ( $G F I$ )	12.8322	0.0004	5.67	100.00
DPL ( $D R E$ )	0.0662	0.2882	1.33	100.00
DRDPL $(GAI)$	0.6024	0.0012	0.00	100.00

Table 7. The comparison results of anode slippage in aluminum electrolysis process.

Method	Training Time (s)	Testing Time (s)	FAR (%)	FDR (%)
rPCA ( $r T^{2}$ )	0.0200	0.0008	19.33	100.00
rPCA ( $r S P E$ )	0.0192	0.0007	7.67	100.00
DPCA ( $D - T^{2}$ )	13.2074	0.5386	22.67	100.00
DPCA ( $D - S P E$ )	13.8177	0.5718	1.00	100.00
K-SVD ( $R E S I$ )	4.0545	0.0053	0.00	100.00
DDL ( $G F I$ )	13.0848	0.0024	10.00	100.00
DPL ( $D R E$ )	0.0649	0.2841	1.33	100.00
DRDPL $(GAI)$	0.6090	0.0014	0.00	100.00

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, J.; Chen, X.; Deng, Z.; Zhang, H.; Zeng, J. Distributed Robust Dictionary Pair Learning and Its Application to Aluminum Electrolysis Industrial Process. Processes 2022, 10, 1850. https://doi.org/10.3390/pr10091850

AMA Style

Wang J, Chen X, Deng Z, Zhang H, Zeng J. Distributed Robust Dictionary Pair Learning and Its Application to Aluminum Electrolysis Industrial Process. Processes. 2022; 10(9):1850. https://doi.org/10.3390/pr10091850

Chicago/Turabian Style

Wang, Jingkun, Xiaofang Chen, Ziqing Deng, Hongliang Zhang, and Jing Zeng. 2022. "Distributed Robust Dictionary Pair Learning and Its Application to Aluminum Electrolysis Industrial Process" Processes 10, no. 9: 1850. https://doi.org/10.3390/pr10091850

APA Style

Wang, J., Chen, X., Deng, Z., Zhang, H., & Zeng, J. (2022). Distributed Robust Dictionary Pair Learning and Its Application to Aluminum Electrolysis Industrial Process. Processes, 10(9), 1850. https://doi.org/10.3390/pr10091850

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Distributed Robust Dictionary Pair Learning and Its Application to Aluminum Electrolysis Industrial Process

Abstract

1. Introduction

2. Methodology

2.1. Distributed Robust Dictionary Pair Learning

2.2. Bayesian Inference Based Anomaly Detection

2.3. Contribution Index Based Anomaly Isolation

3. Experiments

3.1. Numerical Simulation Experiment

3.2. TE Benchmark Test

3.3. Aluminum Electrolysis Industrial Process Application

3.4. Fault Detection against Noisy Datasets

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix B

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI