A Real-Time Configuration Approach for an Observer-Based Residual Generator of Fault Detection Systems

Zhao, Hao; Luo, Hao; Liu, Tianyu

doi:10.3390/pr10020276


by Hao Zhao ¹, Hao Luo ¹,* and Tianyu Liu ²

¹ Department of Control Science and Engineering, Harbin Institute of Technology, Harbin 150001, China

² Institute of Automatic Control and Complex Systems, University of Duisburg-Essen, 47057 Duisburg, Germany

* Author to whom correspondence should be addressed.

Processes 2022, 10(2), 276; https://doi.org/10.3390/pr10020276

Submission received: 20 December 2021 / Revised: 25 January 2022 / Accepted: 25 January 2022 / Published: 30 January 2022

(This article belongs to the Section Process Control and Monitoring)


Abstract:

This paper is concerned with the real-time configuration of fault detection systems by exploiting a gradient optimization scheme. Industrial processes often encounter uncertainties or changes of operating points and environment, which can lead to unsatisfactory fault detection results. To handle this problem, a real-time (or online) configuration strategy is introduced, which plays an important role in ensuring the efficiency of the fault detection method without a high industrial cost. A gradient-based iterative optimization scheme is adopted to implement the real-time configuration. By utilizing the gradient-based iterative algorithm to minimize the K-gap between the residual generator and the current system, the parameters of the residual generator can be configured from the online input/output data. In this way, real-time configuration of the residual generator parameters is achieved and, correspondingly, the fault detection performance is guaranteed. Finally, a three-tank system, which is relatively common and important in chemical industrial systems, is studied to verify the effectiveness and superiority of the proposed gradient optimization configuration strategy.

Keywords:

real-time configuration; observer-based residual generator; fault detection; gradient optimization

1. Introduction

With the rapid development of industrial technology, increasing attention has been paid to safety and reliability problems, prompting further research on fault detection. Within this research, model-based methods have been intensively studied, and considerable results have been reported during the past few decades [1,2,3,4,5,6,7,8,9]. To mention a few, by transforming the residual generator design problem into an optimization problem, an optimal periodic fault detection approach is obtained for linear discrete-time periodic systems in light of robustness and sensitivity [10]. The definitions of finite horizon

H_{\infty} / H_{\infty}

and

H_{-} / H_{\infty}

fault detection performance are first established for linear discrete time-varying systems, based on which the fault detection issue is dealt with by designing some observer gains [11]. The sensor stuck faults are considered for a class of stochastic systems, and a fault detection method is proposed in the stochastic framework to guarantee the effectiveness for arbitrary small sensor stuck faults [12]. Considering the linear system with elliptical uncertainties, a general parametrization is employed for less conservativeness and then a fault detection filter is designed to maximize the sensitivity of fault detection with a certain disturbance attenuation demand [13]. To make a balance between the sensitivity and robustness, a mixed

H_{-} / H_{\infty}

performance index is taken into account, and sufficient criteria are presented to achieve the fault detection observer design for a class of piecewise linear systems with weighted

H_{-} / H_{\infty}

performance [14]. Building on the study of linear systems, model-based fault detection schemes have been extended to nonlinear systems. For example, the observer-based fault detection design is achieved for general nonlinear systems by investigating the analysis and integrated design scheme [15]. By approximating a nonlinear system as a T-S fuzzy model, the observer-based fault detection for nonlinear systems with disturbances is investigated based on the

L_{2}

stability theory [16]. Via the logic-dynamic method, the fault detection issue for dynamic systems with non-differentiable nonlinearities is solved with the aid of the linear technique [17]. By considering the stochastic property of noises and process disturbance, a distributed fault detection and isolation scheme is proposed in a Plug-and-Play scenario [18]. A systematic study is carried out for the fault detection of nonlinear systems by designing linear residual generators, which is further utilized in some practical applications [19]. The model-based method plays an important role in the field of fault detection, but it unavoidably incurs a high cost in acquiring accurate model information. Therefore, data-driven fault detection has drawn increasing attention as an alternative in both academia and industry.

Recently, data-driven fault detection has been investigated thoroughly, owing to its advantages over the model-based method in avoiding the costly modeling process and making full use of process data. Over the last few years, many studies have addressed data-driven fault detection for industrial processes. For instance, the observer-based fault detection is constructed by exploiting the data-driven image and kernel representations [20]. By using the residual generators derived from the process data, a data-driven fault detection approach is devised for wind turbines with measurement noises, unknown disturbances, and nonlinearities [21]. Data-driven fault detection and isolation filters are constructed for sensor and actuator faults by taking advantage of available system data, and meanwhile an estimation approach is established and extended to an offline tuning strategy to compensate the estimation errors under the uncertainties and Markov parameters [22]. In the data-driven framework, a fault detection scheme is proposed to detect small sensor faults and a fault isolation algorithm is developed to distinguish different faults [23]. By identifying a data-driven SKR with the projecting technique, a robust residual generation is derived and, accordingly, the robust data-driven fault detection strategy is obtained for rolling mill processes with unknown eccentricity [24]. The quantitative diagnosability analysis is addressed for dynamic systems by virtue of the data-driven evaluation [25]. By employing a radically data-driven strategy, the fault detection and diagnosis are developed for wind turbines to enhance the reliability [26]. A q-step residual design approach is constructed for the data-driven fault detection of linear systems to ensure the stability and performance demand [27]. The distributed data-driven optimal fault detection is studied in large-scale systems by utilizing the average consensus algorithm [28]. 
By proposing a prediction model on the output of nonlinear dynamic systems, a detection method is devised according to the comparison between the measurement output and the prediction to determine a residual, and further an isolation scheme is constructed to clarify the fault location for the underlying system [29]. Considering the fact that incipient faults are not easy to discover in electrical drives because of their inapparent symptoms, a data-driven fault detection and diagnosis method is presented by applying the principal component analysis approach which improves the accuracy of fault detection for electrical drives without available system parameters or models [30].

Note that practical industrial processes inevitably undergo changes of process environment and operating conditions. In this case, the predesigned residual generators may not provide satisfactory fault detection performance. To guarantee the ability and efficiency of data-driven fault detection without increasing the industrial cost, online configuration or updating becomes an important technique. Existing approaches to the online configuration and updating of data-driven fault detection include adaptive algorithms and iterative optimization. The adaptive residual generator combined with the data-driven scheme is designed and implemented to recursively estimate the corresponding parameters and improve the robustness against the undesired changes for discrete linear systems [31]. By virtue of a data-driven subspace-based predictor, an adaptive updating strategy is proposed for the fault detection filter of solar power generation systems with uncertainties [32]. By adopting an autoregressive exogenous model to represent the dynamic process, an adaptive data-driven method is developed for fault detection of dynamic processes with process drift [33]. The adaptive algorithm provides an effective way for the online configuration of data-driven residual generators, but each new measurement is used for parameter estimation, so the parameters are updated at every sampling instant. Moreover, the configured system matrix of the adaptive method may be sensitive to small changes in the parameters. Compared with the adaptive configuration, the residual generator parameters based on the iterative optimization method remain unchanged between two iterations, which greatly reduces the number of iterations while guaranteeing the fault detection performance [34,35]. 
However, the existing results about the online optimization configuration for the observer-based residual generators are quite limited, which motivates the current study.

Based on the observations above, this paper aims to investigate the real-time configuration of fault detection systems via the gradient optimization method. Considering the unavoidable changes of the industrial process and operating environment, it is necessary to establish a real-time configuration method that properly configures the residual generator parameters to guarantee fault detection ability. To achieve the real-time configuration, a gradient-based iterative optimization strategy is proposed that minimizes the K-gap metric between the residual generator and the current system. In this way, the residual generator can be updated from the available input/output (I/O) data without the identification of system matrices. A novel optimization algorithm for the real-time configuration of residual generators is developed, by which the validity of the fault detection process is guaranteed. Furthermore, a three-tank system, which can be seen as a prototype for many industrial processes such as those in the chemical process industries, is used to illustrate the usefulness and advantages of the proposed approach.

The remainder of this paper is organized as follows. In Section 2, the system descriptions and necessary preliminaries are provided. Section 3 presents the gradient optimization to achieve the online configuration for fault detection systems. In Section 4, a simulation example is used to demonstrate the effectiveness of the obtained method. Finally, the conclusions of this work are drawn in Section 5.

Notation 1.

Throughout this paper, the notations are generally standard.

R^{n}

and

R^{m \times n}

, respectively, denote the n-dimensional Euclidean space and the set of all

m \times n

real matrices.

H_{\infty}

is the set of all stable transfer functions and

H_{2}

is the space of all signals with bounded energy that are equal to 0 for any

t < 0

.

{RH}_{\infty}

defines the set of all real-rational transfer functions of stable systems.

{∥ \cdot ∥}_{2}

and

{∥ \cdot ∥}_{\infty}

stand for the

L_{2}

-norm and the

H_{\infty}

-norm, respectively.

v e c (A)

indicates the vectorization of matrix A.

\bar{σ} (A)

and

\bar{e i g} (A)

represent the maximum singular value and the maximum eigenvalue of matrix A, respectively.

A^{+}

denotes the pseudoinverse of matrix A.

d i a g {\dots}

defines a diagonal matrix.

2. Preliminaries

2.1. System Descriptions

Consider the discrete-time linear time-invariant (LTI) system represented by

\begin{matrix} x (k + 1) = & A x (k) + B u (k), \end{matrix}

(1)

\begin{matrix} y (k) = & C x (k) + D u (k), \end{matrix}

(2)

where

x (k) \in R^{n}

is the state vector,

u (k) \in R^{l}

is the input signal, and

y (k) \in R^{m}

is the system output.

A, B, C, D

are system matrices with proper dimensions.

2.2. Stable Kernel Representation

For the nominal system (1) and (2), its transfer function representation is given by

\begin{matrix} y (z) = G (z) u (z) . \end{matrix}

(3)

Then, the stable kernel representation of

G (z)

is described as below.

Consider a proper real-rational transfer function matrix

G (z)

with the following left and right coprime factorizations:

\begin{matrix} G (z) = {\hat{M}}^{- 1} (z) \hat{N} (z) = N (z) M^{- 1} (z), \end{matrix}

(4)

where

\hat{M} (z) \in {RH}_{\infty}^{m \times m}, \hat{N} (z) \in {RH}_{\infty}^{m \times l}, M (z) \in {RH}_{\infty}^{l \times l}, N (z) \in {RH}_{\infty}^{m \times l}

, and

(\hat{M} (z), \hat{N} (z))

,

(M (z), N (z))

are, respectively, left and right coprime pairs over

{RH}_{\infty}

. It means that there exist

\hat{X} (z) \in {RH}_{\infty}^{m \times m}, \hat{Y} (z) \in {RH}_{\infty}^{l \times m}, X (z) \in {RH}_{\infty}^{l \times l}, Y (z) \in {RH}_{\infty}^{l \times m}

such that the following equations hold:

\begin{matrix} [\begin{matrix} \hat{M} (z) & \hat{N} (z) \end{matrix}] [\begin{matrix} \hat{X} (z) \\ \hat{Y} (z) \end{matrix}] = & I_{m \times m}, \\ [\begin{matrix} X (z) & Y (z) \end{matrix}] [\begin{matrix} M (z) \\ N (z) \end{matrix}] = & I_{l \times l} . \end{matrix}

Further, if

(\hat{M} (z), \hat{N} (z))

and

(M (z), N (z))

are satisfied with

\begin{matrix} [\begin{matrix} \hat{M} (z) & \hat{N} (z) \end{matrix}] {[\begin{matrix} \hat{M} (z) & \hat{N} (z) \end{matrix}]}^{T} = & I_{m \times m}, \\ {[\begin{matrix} M (z) \\ N (z) \end{matrix}]}^{T} [\begin{matrix} M (z) \\ N (z) \end{matrix}] = & I_{l \times l}, \end{matrix}

then they are called the normalized left and right coprime pairs, respectively.

Definition 1

([35]). Given a discrete-time LTI system

G (z)

in Equation (3), a stable linear system

K

is called the stable kernel representation (SKR) of

G (z)

, if for any

u (z)

and its response

y (z)

, the following equation holds:

\begin{matrix} K [\begin{matrix} u (z) \\ y (z) \end{matrix}] = 0 . \end{matrix}

Suppose that

r (z)

is the residual signal of the underlying system. According to the description of the left and right coprime factorizations, it is clear that

\hat{M} (z)

and

\hat{N} (z)

correspond to the transfer matrices from the output signal and the input signal, respectively, to the residual signal. Then, the following equation holds in the fault- and noise-free case:

\begin{matrix} r (z) = [\begin{matrix} - \hat{N} (z) & \hat{M} (z) \end{matrix}] [\begin{matrix} u (z) \\ y (z) \end{matrix}] . \end{matrix}

(5)

Accordingly, an SKR of system

G (z)

can be formed by the transfer matrices as below,

\begin{matrix} K = [\begin{matrix} - \hat{N} (z) & \hat{M} (z) \end{matrix}] . \end{matrix}
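As a hedged illustration of Definition 1 and Equation (5), consider a scalar first-order plant. The plant and the particular factors below are assumptions chosen for this example and are not taken from the paper. Both factors have their only pole at z = 0, so they are stable, and the Bezout identity is met with the constant choices \hat{X} = 1 and \hat{Y} = 0.5.

```latex
\begin{aligned}
G(z) &= \frac{1}{z - 0.5}, \qquad
\hat{M}(z) = \frac{z - 0.5}{z}, \qquad
\hat{N}(z) = \frac{1}{z}, \\
\hat{M}^{-1}(z) \hat{N}(z) &= \frac{z}{z - 0.5} \cdot \frac{1}{z} = G(z), \\
\hat{M}(z) \cdot 1 + \hat{N}(z) \cdot 0.5 &= \frac{(z - 0.5) + 0.5}{z} = 1, \\
r(z) &= - \hat{N}(z) u(z) + \hat{M}(z) y(z)
      = - \frac{1}{z} u(z) + \frac{z - 0.5}{z} \cdot \frac{1}{z - 0.5} u(z) = 0 .
\end{aligned}
```

The last line shows that the residual vanishes for every input and its fault-free response, which is exactly the kernel property required of an SKR.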

2.3. K-Gap Metric

To achieve the optimization objective, it is necessary to introduce a means to measure the distance between two kernel subspaces. As the K-gap metric has become a powerful tool in dealing with the measurement problems, this paper will adopt the K-gap metric technique for the optimization process. Before mentioning the K-gap metric, the gap metric concept is first restated for clarification. For this purpose, the graph definition is introduced and represented by

\begin{matrix} \mathcal{G} = \{ζ = [\begin{matrix} u \\ y \end{matrix}] = [\begin{matrix} M \\ N \end{matrix}] v, v \in H_{2}\} . \end{matrix}

Note that the graph

\mathcal{G}

is a subspace in

H_{2}

constructed by all the pairs

(u, y)

, and it is closed [36,37]. Denote

G_{1} = N_{1} M_{1}^{- 1}, G_{2} = N_{2} M_{2}^{- 1}

as the normalized right coprime factorizations of

G_{1}, G_{2}

, respectively. Let

\mathcal{G}_{1}, \mathcal{G}_{2}

be the corresponding graphs. The directed gap from

G_{1}

to

G_{2}

is defined as

\vec{δ} (G_{1}, G_{2})

, which is formulated by

\begin{matrix} \vec{δ} (G_{1}, G_{2}) = sup_{ζ_{1} \in \mathcal{G}_{1}} inf_{ζ_{2} \in \mathcal{G}_{2}} \frac{{∥ζ_{1} - ζ_{2}∥}_{2}}{{∥ζ_{1}∥}_{2}} . \end{matrix}

(6)

According to the work in [36], the directed gap (6) can be computed by

\begin{matrix} \vec{δ} (G_{1}, G_{2}) = inf_{Q \in H_{\infty}} {∥[\begin{matrix} M_{1} \\ N_{1} \end{matrix}] - [\begin{matrix} M_{2} \\ N_{2} \end{matrix}] Q∥}_{\infty} . \end{matrix}

Based on the above, the definition of gap metric between

G_{1}

and

G_{2}

is derived as

\begin{matrix} δ (G_{1}, G_{2}) = max \{\vec{δ} (G_{1}, G_{2}), \vec{δ} (G_{2}, G_{1})\} . \end{matrix}

Considering that the gap metric is based on the image subspace, the K-gap metric defined on the kernel subspace is further proposed. The corresponding graph is defined as

\begin{matrix} K = \{[\begin{matrix} u \\ y \end{matrix}] : [\begin{matrix} - \hat{N} & \hat{M} \end{matrix}] [\begin{matrix} u \\ y \end{matrix}] = 0, [\begin{matrix} u \\ y \end{matrix}] \in H_{2}\}, \end{matrix}

which indicates the kernel subspace and is a closed subspace in

H_{2}

. Similarly, the directed K-gap from graph

K_{1}

to graph

K_{2}

is expressed as follows.

Definition 2

([37]). Suppose that

({\hat{M}}_{1} (z), {\hat{N}}_{1} (z))

,

({\hat{M}}_{2} (z), {\hat{N}}_{2} (z))

are the left coprime factorizations of

G_{1} (z), G_{2} (z)

, respectively, and

\begin{matrix} K_{i} = \{ς_{i} = [\begin{matrix} u_{i} \\ y_{i} \end{matrix}] : [\begin{matrix} - {\hat{N}}_{i} & {\hat{M}}_{i} \end{matrix}] [\begin{matrix} u_{i} \\ y_{i} \end{matrix}] = 0, [\begin{matrix} u_{i} \\ y_{i} \end{matrix}] \in H_{2}\}, i = 1, 2 . \end{matrix}

The directed K-gap from

K_{1}

to

K_{2}

is defined by

\begin{matrix} {\vec{δ}}_{k} (K_{1}, K_{2}) = sup_{ς_{1} \in K_{1}} inf_{ς_{2} \in K_{2}} \frac{{∥ς_{1} - ς_{2}∥}_{2}}{{∥ς_{1}∥}_{2}} . \end{matrix}

Subsequently, the K-gap metric between

K_{1}

and

K_{2}

is given by

\begin{matrix} δ_{k} (K_{1}, K_{2}) = max \{{\vec{δ}}_{k} (K_{1}, K_{2}), {\vec{δ}}_{k} (K_{2}, K_{1})\} . \end{matrix}

Moreover, a computation strategy of the K-gap metric is also established in [37], which is recalled in the following lemma.

Lemma 1

([37]). Consider

K_{i}, i = 1, 2

defined in Definition 2 with normalized left coprime factorizations

({\hat{M}}_{i} (z), {\hat{N}}_{i} (z)), i = 1, 2

. The directed K-gap can be computed by

\begin{matrix} {\vec{δ}}_{k} (K_{1}, K_{2}) = inf_{Q \in H_{\infty}} {∥[\begin{matrix} - {\hat{N}}_{1} & {\hat{M}}_{1} \end{matrix}] - Q [\begin{matrix} - {\hat{N}}_{2} & {\hat{M}}_{2} \end{matrix}]∥}_{\infty} . \end{matrix}

It follows from Lemma 1 that

\begin{matrix} 0 \leq {\vec{δ}}_{k} (K_{1}, K_{2}) \leq 1, \end{matrix}

and when

δ_{k} (K_{1}, K_{2}) < 1

,

\begin{matrix} {\vec{δ}}_{k} (K_{1}, K_{2}) = {\vec{δ}}_{k} (K_{2}, K_{1}) = δ_{k} (K_{1}, K_{2}) . \end{matrix}

2.4. Data-Driven Framework

As the input and output data are crucial to the fault detection realization, the data model is introduced here for later development. Taking a data vector

λ (k) \in R^{κ}

into account, the related notations are defined as below.

\begin{matrix} λ_{s} (k) = & [\begin{matrix} λ (k - s) \\ λ (k - s + 1) \\ ⋮ \\ λ (k) \end{matrix}] \in R^{(s + 1) κ}, \\ Λ_{k} = & [\begin{matrix} λ (k) & \dots & λ (k + N - 1) \end{matrix}] \in R^{κ \times N}, \\ Λ_{k, s} = & [\begin{matrix} λ_{s} (k) & \dots & λ_{s} (k + N - 1) \end{matrix}] = [\begin{matrix} Λ_{k - s} \\ ⋮ \\ Λ_{k} \end{matrix}] \in R^{(s + 1) κ \times N}, \end{matrix}

where

s, N

are positive integers and

s + 1

is the length of the stacked data vector. Based on the data structure, the data-driven realization of the SKR is defined as follows.

Definition 3

([20]).

K_{d, s}

is called a data-driven realization of the SKR for system

G (z)

, if for all

k \geq 0

, the following equation is satisfied:

\begin{matrix} K_{d, s} [\begin{matrix} u_{s} (k) \\ y_{s} (k) \end{matrix}] = [\begin{matrix} K_{u, s} & K_{y, s} \end{matrix}] [\begin{matrix} u_{s} (k) \\ y_{s} (k) \end{matrix}] = 0 . \end{matrix}

Note that if the data-driven SKR satisfies

K_{d, s} K_{d, s}^{T} = I

, then it is called normalized [38]. By applying the singular value decomposition, it holds that

\begin{matrix} K_{d, s} = U_{s y s} [\begin{matrix} Σ_{s y s, 1} & 0 \end{matrix}] [\begin{matrix} V_{s y s, 1}^{T} \\ V_{s y s, 2}^{T} \end{matrix}] \end{matrix}

and thus, the normalized data-driven SKR for system

G (z)

is derived as

\begin{matrix} {\bar{K}}_{d, s} = V_{s y s, 1}^{T} . \end{matrix}

(7)
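The stacked data notation and the normalization in (7) can be sketched numerically as follows. This is a minimal illustration under stated assumptions: the helper names `stacked_data_matrix` and `normalize_skr`, the rank tolerance, and the random test data are all hypothetical and not part of the paper.

```python
import numpy as np

def stacked_data_matrix(signal, s):
    """Build Lambda_{k,s}: each column is lambda_s(k) = [lambda(k-s); ...; lambda(k)].

    `signal` has shape (T, kappa); one column is produced for every k = s, ..., T-1.
    """
    signal = np.asarray(signal, dtype=float)
    T = signal.shape[0]
    cols = [signal[k - s:k + 1].ravel() for k in range(s, T)]
    return np.column_stack(cols)

def normalize_skr(K, tol=1e-10):
    """Return V_1^T from the SVD K = U [Sigma_1 0] [V_1^T; V_2^T], cf. Eq. (7).

    The relative tolerance `tol` used to decide the rank is an assumption.
    """
    _, sing, Vt = np.linalg.svd(K, full_matrices=True)
    rank = int(np.sum(sing > tol * sing[0]))
    return Vt[:rank, :]

rng = np.random.default_rng(0)
lam = rng.standard_normal((10, 2))   # T = 10 samples of a 2-dimensional data vector
Lam = stacked_data_matrix(lam, s=3)  # shape ((s + 1) * kappa, T - s)

K = rng.standard_normal((2, 6))      # an arbitrary full-row-rank matrix standing in for K_{d,s}
K_bar = normalize_skr(K)             # orthonormal rows spanning the same row space as K
```

Because `K_bar` spans the same row space as `K`, any stacked vector annihilated by `K` is also annihilated by `K_bar`, so the normalization does not change the kernel that the SKR describes.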

Next, the data-driven realization of the K-gap metric is presented on the basis of the normalized data-driven SKR.

Lemma 2

([39]). Suppose that

{\bar{K}}_{1, d, s}, {\bar{K}}_{2, d, s}

are the normalized data-driven SKRs of SKRs

K_{1}, K_{2}

. The data-driven realization of the K-gap metric can be calculated by

\begin{matrix} δ_{k_{d, s}} (K_{1}, K_{2}) = \bar{σ} ([I - {\bar{K}}_{2, d, s}^{T} {\bar{K}}_{2, d, s}] {\bar{K}}_{1, d, s}^{T} {\bar{K}}_{1, d, s}) . \end{matrix}

(8)
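Formula (8) reduces to a single largest-singular-value computation. The sketch below evaluates it for small hand-built matrices with orthonormal rows; the function name `data_driven_k_gap` and the three test matrices are assumptions made for illustration.

```python
import numpy as np

def data_driven_k_gap(K1_bar, K2_bar):
    """Evaluate Eq. (8): sigma_max([I - K2^T K2] K1^T K1) for normalized SKRs."""
    P1 = K1_bar.T @ K1_bar          # orthogonal projector onto the row space of K1_bar
    P2 = K2_bar.T @ K2_bar          # orthogonal projector onto the row space of K2_bar
    I = np.eye(P1.shape[0])
    return np.linalg.norm((I - P2) @ P1, 2)   # ord=2 gives the largest singular value

angle = np.pi / 6
K_a = np.array([[1.0, 0.0, 0.0]])
K_b = np.array([[0.0, 1.0, 0.0]])                        # orthogonal to K_a
K_c = np.array([[np.cos(angle), np.sin(angle), 0.0]])    # rotated by 30 degrees from K_a

gap_same = data_driven_k_gap(K_a, K_a)
gap_orth = data_driven_k_gap(K_a, K_b)
gap_rot = data_driven_k_gap(K_a, K_c)
```

For these one-row examples the value equals the sine of the angle between the two rows, so it ranges from 0 for identical row spaces to 1 for orthogonal ones, consistent with the bounds stated after Lemma 1.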

3. Main Results

Because practical circumstances and the industrial environment may change, fault detection with an offline-designed residual generator may not satisfy complicated industrial demands. In this section, a novel real-time configuration scheme for residual generators is proposed, which is essentially an optimization algorithm based on the K-gap metric. Specifically, an observer-based residual generator is constructed and the gradient optimization algorithm is used to update its parameters. Thus, real-time configuration is achieved for observer-based residual generators; the framework is displayed in Figure 1.

3.1. The Observer-Based Residual Generator

For the discrete-time LTI system (1), a full-order state observer is constructed with the minimal state-space representation as

\begin{matrix} x_{o} (k + 1) = & A_{o} x_{o} (k) + B_{o} u (k) + L_{o} y (k), \end{matrix}

(9)

\begin{matrix} r (k) = & C_{o} x_{o} (k) + D_{o} u (k) + y (k), \end{matrix}

(10)

where

x_{o} (k) \in R^{n}

indicates the state of the full-order observer and

r (k) \in R^{m}

stands for the residual vector.

A_{o}, B_{o}, C_{o}, D_{o}, L_{o}

are observer matrices with proper dimensions.

For observer (9) and (10), a similarity transformation is performed by

x_{o} = T_{ν} x_{ν}

, which yields

\begin{matrix} x_{ν} (k + 1) = & A_{ν} x_{ν} (k) + B_{ν} u (k) + L_{ν} y (k), \end{matrix}

(11)

\begin{matrix} r (k) = & C_{ν} x_{ν} (k) + D_{ν} u (k) + y (k), \end{matrix}

(12)

where

A_{ν} = T_{ν}^{- 1} A_{o} T_{ν}, B_{ν} = T_{ν}^{- 1} B_{o}, L_{ν} = T_{ν}^{- 1} L_{o}, C_{ν} = C_{o} T_{ν}, D_{ν} = D_{o}

. According to the authors of [40], the controllability Gramian matrix of system (11) and (12) is equal to the identity matrix, i.e.,

\begin{matrix} A_{ν} A_{ν}^{T} + B_{ν} B_{ν}^{T} = I_{n}, \end{matrix}

(13)

which gives rise to a column-orthogonal matrix

[\begin{matrix} B_{ν}^{T} \\ A_{ν}^{T} \end{matrix}]

. Consequently, there exists an invertible matrix

Ψ

such that

\begin{matrix} [\begin{matrix} B_{ν}^{T} \\ A_{ν}^{T} \end{matrix}] = Ψ [\begin{matrix} 0 \\ I_{n} \end{matrix}] . \end{matrix}

Referring to the literature [41], the parameterization based on the input normal form can be described by

\begin{matrix} [\begin{matrix} B_{ν}^{T} (θ_{A B}) \\ A_{ν}^{T} (θ_{A B}) \end{matrix}] = Ψ_{1} (θ_{A B} (1)) \dots Ψ_{n l} (θ_{A B} (n l)) [\begin{matrix} 0 \\ I_{n} \end{matrix}], \end{matrix}

in which

θ_{A B} \in R^{n l}

and its entries take values in the range

(- 1, 1)

. By introducing the following form for each parameter

θ_{A B} (i), i = 1, \dots, n l

\begin{matrix} U (θ_{A B} (i)) = [\begin{matrix} - θ_{A B} (i) & \sqrt{1 - θ_{A B}^{2} (i)} \\ \sqrt{1 - θ_{A B}^{2} (i)} & θ_{A B} (i) \end{matrix}], \end{matrix}

the matrices

Ψ_{i} (θ_{A B} (i)), i = 1, \dots, n l

are represented as

\begin{matrix} Ψ_{1} (θ_{A B} (1)) = & [\begin{matrix} I_{n - 1} & 0 & 0 \\ 0 & U (θ_{A B} (1)) & 0 \\ 0 & 0 & I_{l - 1} \end{matrix}], \\ ⋮ \\ Ψ_{n l} (θ_{A B} (n l)) = & [\begin{matrix} I_{l - 1} & 0 & 0 \\ 0 & U (θ_{A B} (n l)) & 0 \\ 0 & 0 & I_{n - 1} \end{matrix}] . \end{matrix}

Because the parameterization of the input normal form is built on asymptotic stability, no additional restriction on the parameter space is needed, which is a significant advantage of this transformation.
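The parameterization can be checked numerically on a small case. The sketch below assumes n = 2 and l = 1, so that only the first and last factors shown above are needed; the helper names, the parameter values, and the `offsets` argument (the diagonal position of each 2-by-2 block) are assumptions for illustration, and the block positions of intermediate factors for larger n and l are not specified here.

```python
import numpy as np

def u_block(t):
    """2x2 orthogonal block U(theta_AB(i)) for a parameter t in (-1, 1)."""
    c = np.sqrt(1.0 - t ** 2)
    return np.array([[-t, c], [c, t]])

def psi(t, offset, size):
    """Identity matrix of the given size with U(t) embedded at diagonal position `offset`."""
    M = np.eye(size)
    M[offset:offset + 2, offset:offset + 2] = u_block(t)
    return M

def input_normal_pair(theta, offsets, n, l):
    """Form [B^T; A^T] = Psi_1 ... Psi_{nl} [0; I_n] and return (A, B)."""
    stacked = np.vstack([np.zeros((l, n)), np.eye(n)])
    # Apply the right-most factor first so that Psi_1 ends up left-most.
    for t, off in zip(reversed(theta), reversed(offsets)):
        stacked = psi(t, off, n + l) @ stacked
    B = stacked[:l, :].T    # n x l
    A = stacked[l:, :].T    # n x n
    return A, B

n, l = 2, 1
theta_AB = [0.3, -0.5]        # illustrative values inside (-1, 1)
offsets = [n - 1, l - 1]      # Psi_1 = diag(I_{n-1}, U, I_{l-1}), Psi_{nl} = diag(I_{l-1}, U, I_{n-1})
A_nu, B_nu = input_normal_pair(theta_AB, offsets, n, l)

gramian_residual = A_nu @ A_nu.T + B_nu @ B_nu.T - np.eye(n)
spectral_radius = np.max(np.abs(np.linalg.eigvals(A_nu)))
```

Since every factor is orthogonal, condition (13) holds for any parameter values in (-1, 1), which is why no extra constraint has to be imposed during optimization.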

Then, the procedure to obtain the SKR corresponding to the observer (11) is given as below. It is obvious from Formula (12) that

\begin{matrix} r (k - s) = C_{ν} x_{ν} (k - s) + D_{ν} u (k - s) + y (k - s) . \end{matrix}

Subsequently,

\begin{matrix} r (k - s + 1) = & C_{ν} x_{ν} (k - s + 1) + D_{ν} u (k - s + 1) + y (k - s + 1) \\ = & C_{ν} [A_{ν} x_{ν} (k - s) + B_{ν} u (k - s) + L_{ν} y (k - s)] + D_{ν} u (k - s + 1) + y (k - s + 1) \\ = & C_{ν} A_{ν} x_{ν} (k - s) + C_{ν} B_{ν} u (k - s) + D_{ν} u (k - s + 1) + C_{ν} L_{ν} y (k - s) + y (k - s + 1), \\ r (k - s + 2) = & C_{ν} x_{ν} (k - s + 2) + D_{ν} u (k - s + 2) + y (k - s + 2) \\ = & C_{ν} [A_{ν} x_{ν} (k - s + 1) + B_{ν} u (k - s + 1) + L_{ν} y (k - s + 1)] \\ + D_{ν} u (k - s + 2) + y (k - s + 2) \\ = & C_{ν} A_{ν} [A_{ν} x_{ν} (k - s) + B_{ν} u (k - s) + L_{ν} y (k - s)] + C_{ν} B_{ν} u (k - s + 1) \\ + C_{ν} L_{ν} y (k - s + 1) + D_{ν} u (k - s + 2) + y (k - s + 2) \\ = & C_{ν} A_{ν}^{2} x_{ν} (k - s) + C_{ν} A_{ν} B_{ν} u (k - s) + C_{ν} B_{ν} u (k - s + 1) + D_{ν} u (k - s + 2) \\ + C_{ν} A_{ν} L_{ν} y (k - s) + C_{ν} L_{ν} y (k - s + 1) + y (k - s + 2), \\ r (k - s + 3) = & C_{ν} x_{ν} (k - s + 3) + D_{ν} u (k - s + 3) + y (k - s + 3) \\ = & C_{ν} [A_{ν} x_{ν} (k - s + 2) + B_{ν} u (k - s + 2) + L_{ν} y (k - s + 2)] \\ + D_{ν} u (k - s + 3) + y (k - s + 3) \\ = & C_{ν} A_{ν} [A_{ν} x_{ν} (k - s + 1) + B_{ν} u (k - s + 1) + L_{ν} y (k - s + 1)] + C_{ν} B_{ν} u (k - s + 2) \\ + C_{ν} L_{ν} y (k - s + 2) + D_{ν} u (k - s + 3) + y (k - s + 3) \\ = & C_{ν} A_{ν}^{2} [A_{ν} x_{ν} (k - s) + B_{ν} u (k - s) + L_{ν} y (k - s)] + C_{ν} A_{ν} B_{ν} u (k - s + 1) \\ + C_{ν} A_{ν} L_{ν} y (k - s + 1) + C_{ν} B_{ν} u (k - s + 2) + C_{ν} L_{ν} y (k - s + 2) \\ + D_{ν} u (k - s + 3) + y (k - s + 3) \\ = & C_{ν} A_{ν}^{3} x_{ν} (k - s) + C_{ν} A_{ν}^{2} B_{ν} u (k - s) + C_{ν} A_{ν} B_{ν} u (k - s + 1) + C_{ν} B_{ν} u (k - s + 2) \\ + D_{ν} u (k - s + 3) + C_{ν} A_{ν}^{2} L_{ν} y (k - s) + C_{ν} A_{ν} L_{ν} y (k - s + 1) \\ + C_{ν} L_{ν} y (k - s + 2) + y (k - s + 3), \\ ⋮ \end{matrix}

which implies

\begin{matrix} r (k) = & C_{ν} A_{ν}^{s} x_{ν} (k - s) + C_{ν} A_{ν}^{s - 1} B_{ν} u (k - s) + \dots + C_{ν} B_{ν} u (k - 1) + D_{ν} u (k) \\ + C_{ν} A_{ν}^{s - 1} L_{ν} y (k - s) + \dots + C_{ν} L_{ν} y (k - 1) + y (k) . \end{matrix}

Then, it is easy to obtain that

\begin{matrix} r_{s} (k) = Γ_{s} x_{ν} (k - s) + H_{u, s} u_{s} (k) + H_{y, s} y_{s} (k), \end{matrix}

(14)

where

\begin{matrix} Γ_{s} = & [\begin{matrix} C_{ν} \\ C_{ν} A_{ν} \\ ⋮ \\ C_{ν} A_{ν}^{s} \end{matrix}], H_{u, s} = [\begin{matrix} D_{ν} & \dots & 0 & 0 \\ C_{ν} B_{ν} & \dots & 0 & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ \\ C_{ν} A_{ν}^{s - 1} B_{ν} & \dots & C_{ν} B_{ν} & D_{ν} \end{matrix}], \\ H_{y, s} = & [\begin{matrix} I & \dots & 0 & 0 \\ C_{ν} L_{ν} & \dots & 0 & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ \\ C_{ν} A_{ν}^{s - 1} L_{ν} & \dots & C_{ν} L_{ν} & I \end{matrix}] . \end{matrix}
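The block structure of these matrices can be verified against a direct simulation of the observer (11) and (12). The following sketch uses random matrices and signals purely as test data; the function name `observer_block_matrices` and all numerical values are assumptions for illustration.

```python
import numpy as np

def observer_block_matrices(A, B, C, D, L, s):
    """Build Gamma_s, H_{u,s}, H_{y,s} of Eq. (14) for the observer (11)-(12)."""
    m, l = D.shape
    CA = [C @ np.linalg.matrix_power(A, i) for i in range(s + 1)]   # C A^i, i = 0..s
    Gamma = np.vstack(CA)
    Hu = np.zeros(((s + 1) * m, (s + 1) * l))
    Hy = np.zeros(((s + 1) * m, (s + 1) * m))
    for row in range(s + 1):
        for col in range(row + 1):
            if row == col:
                blk_u, blk_y = D, np.eye(m)            # diagonal blocks: D and I
            else:
                blk_u = CA[row - col - 1] @ B          # C A^{row-col-1} B
                blk_y = CA[row - col - 1] @ L          # C A^{row-col-1} L
            Hu[row * m:(row + 1) * m, col * l:(col + 1) * l] = blk_u
            Hy[row * m:(row + 1) * m, col * m:(col + 1) * m] = blk_y
    return Gamma, Hu, Hy

rng = np.random.default_rng(0)
n, l, m, s = 3, 2, 2, 4
A = 0.5 * rng.standard_normal((n, n))
B = rng.standard_normal((n, l))
C = rng.standard_normal((m, n))
D = rng.standard_normal((m, l))
L = rng.standard_normal((n, m))
x0 = rng.standard_normal(n)              # plays the role of x_nu(k - s)
u = rng.standard_normal((s + 1, l))      # u(k - s), ..., u(k)
y = rng.standard_normal((s + 1, m))      # y(k - s), ..., y(k)

# Direct simulation of the observer recursion.
x, residuals = x0.copy(), []
for t in range(s + 1):
    residuals.append(C @ x + D @ u[t] + y[t])
    x = A @ x + B @ u[t] + L @ y[t]
r_simulated = np.concatenate(residuals)

# Closed-form stacked residual from Eq. (14).
Gamma, Hu, Hy = observer_block_matrices(A, B, C, D, L, s)
r_formula = Gamma @ x0 + Hu @ u.ravel() + Hy @ y.ravel()
max_error = np.max(np.abs(r_simulated - r_formula))
```

The two residual vectors agree to machine precision, which confirms the lower block-triangular Toeplitz layout of the input and output matrices.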

Then, the following equation is obtained:

\begin{matrix} R_{k, s} = Γ_{s} X_{ν, k - s} + H_{u, s} U_{k, s} + H_{y, s} Y_{k, s} . \end{matrix}

As

C_{ν} A_{ν}^{s} \to 0, s \to \infty

, according to Definition 3, the SKR corresponding to the state observer can be obtained by the equation

\begin{matrix} K_{o, s} = & [\begin{matrix} H_{u, s} & H_{y, s} \end{matrix}] \\ = & [\begin{matrix} D_{ν} & \dots & 0 & 0 & I & \dots & 0 & 0 \\ C_{ν} B_{ν} & \dots & 0 & 0 & C_{ν} L_{ν} & \dots & 0 & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ C_{ν} A_{ν}^{s - 1} B_{ν} & \dots & C_{ν} B_{ν} & D_{ν} & C_{ν} A_{ν}^{s - 1} L_{ν} & \dots & C_{ν} L_{ν} & I \end{matrix}] \end{matrix}

(15)

where a finite number of leading rows are removed to eliminate the influence of past data. By applying the singular value decomposition to

K_{o, s}

, it holds that

\begin{matrix} K_{o, s} = U [\begin{matrix} Σ_{1} & 0 \end{matrix}] [\begin{matrix} V_{1}^{T} \\ V_{2}^{T} \end{matrix}] \end{matrix}

and thus, the normalized SKR is

\begin{matrix} {\bar{K}}_{o, s} = V_{1}^{T} . \end{matrix}

(16)

3.2. Online Gradient Optimization

Based on the above observer-based residual generator, this subsection establishes a gradient optimization method for the real-time configuration by taking advantage of the K-gap metric between the observer and the system plant.

Define

\begin{matrix} θ = [\begin{matrix} θ_{A B} \\ θ_{C_{ν}} \\ θ_{D_{ν}} \\ θ_{L_{ν}} \end{matrix}] = [\begin{matrix} θ_{A B} \\ v e c (C_{ν}) \\ v e c (D_{ν}) \\ v e c (L_{ν}) \end{matrix}], \end{matrix}

and denote

θ_{i}

(

i = 1, 2, \dots, τ = n l + 2 m n + m l

) as the i-th term of

θ

.

The optimization problem is described by

\begin{matrix} \{\begin{matrix} minimize J = δ_{k_{d, s}} (K_{o, s}, K_{d, s}), \\ subject to θ \in S, \end{matrix} \end{matrix}

where

K_{d, s}, K_{o, s}

are, respectively, the SKRs of the system (1) and the observer (11), and

S

is the set of vectors whose entries all belong to

(- 1, 1)

. To minimize the cost function J, consider its Taylor expansion at the j-th iteration:

\begin{matrix} J_{θ^{(j)}}^{(j)} = & J_{θ^{(j - 1)}}^{(j)} + {(\frac{\partial J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)}} |_{θ^{(j - 1)}})}^{T} (θ^{(j)} - θ^{(j - 1)}) \\ + \frac{1}{2} {(θ^{(j)} - θ^{(j - 1)})}^{T} (\frac{\partial^{2} J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)} \partial {(θ^{(j)})}^{T}} |_{θ^{(j - 1)}}) (θ^{(j)} - θ^{(j - 1)}) + o^{3} (θ^{(j)} - θ^{(j - 1)}), \end{matrix}

where

\frac{\partial J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)}}

and

\frac{\partial^{2} J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)} \partial {(θ^{(j)})}^{T}}

denote the gradient and the Hessian matrix of cost function J, respectively.

o^{3} (\cdot)

stands for the infinitesimal of order higher than 3. Neglecting this higher-order infinitesimal, a necessary condition to minimize the function

J_{θ^{(j)}}^{(j)}

is to require

\begin{matrix} (\frac{\partial J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)}} |_{θ^{(j - 1)}}) + (\frac{\partial^{2} J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)} \partial {(θ^{(j)})}^{T}} |_{θ^{(j - 1)}}) (θ^{(j)} - θ^{(j - 1)}) = 0 . \end{matrix}

(17)

Rewriting the expression (17) gives the iteration procedure of updating the parameter

θ

as

\begin{matrix} θ^{(j)} = θ^{(j - 1)} - {(\frac{\partial^{2} J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)} \partial {(θ^{(j)})}^{T}} |_{θ^{(j - 1)}})}^{- 1} (\frac{\partial J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)}} |_{θ^{(j - 1)}}), \end{matrix}

(18)

which is the Newton iteration widely used in the literature. However, this method requires the invertibility of

(\frac{\partial^{2} J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)} \partial {(θ^{(j)})}^{T}} |_{θ^{(j - 1)}})

and may lead to a heavy computational burden. To improve its applicability in numerical computation, an iteration procedure known as the steepest-descent algorithm is introduced:

\begin{matrix} θ^{(j)} = θ^{(j - 1)} - Δ^{(j)} (\frac{\partial J_{θ^{(j)}}^{(j)}}{\partial θ^{(j)}} |_{θ^{(j - 1)}}), \end{matrix}

(19)

where

Δ^{(j)} > 0

is a positive-definite diagonal matrix containing the step lengths of the j-th iteration.
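As a rough illustration of the difference between the updates (18) and (19), the following sketch applies both to a small quadratic cost. The cost function, the matrices `Q` and `Delta`, and all function names are assumptions chosen for illustration only; they are not the K-gap cost considered in this paper.

```python
import numpy as np

# Hypothetical quadratic cost J(theta) = 0.5 * theta^T Q theta - b^T theta (illustration only).
Q = np.array([[3.0, 0.5], [0.5, 1.0]])
b = np.array([1.0, -2.0])

def gradient(theta):
    """Gradient of the quadratic cost."""
    return Q @ theta - b

def hessian(theta):
    """Hessian of the quadratic cost (constant here)."""
    return Q

def newton_step(theta):
    """Update of the form (18): needs a linear solve with the Hessian."""
    return theta - np.linalg.solve(hessian(theta), gradient(theta))

def steepest_descent_step(theta, Delta):
    """Update of the form (19): Delta is a positive diagonal step-length matrix."""
    return theta - Delta @ gradient(theta)

theta_star = np.linalg.solve(Q, b)      # exact minimizer, for comparison

theta_newton = newton_step(np.zeros(2))  # one step suffices for a quadratic cost

Delta = np.diag([0.2, 0.5])              # assumed step lengths
theta_sd = np.zeros(2)
for _ in range(200):
    theta_sd = steepest_descent_step(theta_sd, Delta)
```

In this toy case the Hessian-based step reaches the minimizer in one iteration but requires a linear solve, whereas the gradient step needs many cheap iterations; this is the trade-off motivating (19).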

Clearly, the key question of the optimization is to calculate the gradient

\frac{\partial J}{\partial θ} = \frac{\partial δ_{k_{d, s}} (K_{o, s}, K_{d, s})}{\partial θ}

.

It is obtained from Lemma 2 that

\begin{matrix} \frac{\partial δ_{k_{d, s}} (K_{o, s}, K_{d, s})}{\partial θ} \\ = & \frac{\partial \bar{σ} ([I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}] {\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s})}{\partial θ} \\ = & \frac{\partial \{\sqrt{\bar{e i g} [(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})]}\}}{\partial θ} \\ = & \frac{\partial \{\sqrt{\bar{e i g} [(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})]}\}}{\partial θ} \\ = & \frac{1}{2 δ_{k_{d, s}} (K_{o, s}, K_{d, s})} \frac{\partial \{\bar{e i g} [(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})]\}}{\partial θ} \\ = & \frac{1}{2 δ_{k_{d, s}} (K_{o, s}, K_{d, s})} [\begin{matrix} \frac{\partial \{\bar{e i g} [(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})]\}}{\partial θ_{1}} \\ \frac{\partial \{\bar{e i g} [(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})]\}}{\partial θ_{2}} \\ ⋮ \\ \frac{\partial \{\bar{e i g} [(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})]\}}{\partial θ_{i}} \\ ⋮ \\ \frac{\partial \{\bar{e i g} [(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})]\}}{\partial θ_{τ}} \end{matrix}] . \end{matrix}

Denote

ξ

as the unit-norm eigenvector corresponding to the maximum eigenvalue of the matrix

(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})

and consider the partial derivative with respect to the i-th variable, i.e.,

\begin{matrix} \frac{\partial \{\bar{e i g} [(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})]\}}{\partial θ_{i}} \\ = & \frac{ξ^{T} \partial \{(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})\} ξ}{\partial θ_{i}} \\ = & ξ^{T} \frac{\partial \{(I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s}) (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s})\}}{\partial θ_{i}} ξ \\ = & ξ^{T} (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) \frac{\partial ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s})}{\partial θ_{i}} (I - {\bar{K}}_{d, s}^{T} {\bar{K}}_{d, s}) ξ . \end{matrix}
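The eigenvalue-derivative identity used above, namely that the derivative of a simple maximum eigenvalue of a symmetric matrix equals the quadratic form of the matrix derivative in the unit-norm eigenvector, can be checked numerically. The sketch below uses a hypothetical parameter-dependent matrix `M_of(theta)`; the matrices `A0` and `A1` are arbitrary assumptions, not quantities from this paper.

```python
import numpy as np

# Hypothetical symmetric matrices; M(theta) = A0 + theta * A1 (illustration only).
A0 = np.array([[2.0, 0.3, 0.0], [0.3, 1.0, 0.2], [0.0, 0.2, 0.5]])
A1 = np.array([[0.5, 0.1, 0.0], [0.1, -0.3, 0.4], [0.0, 0.4, 0.2]])

def M_of(theta):
    return A0 + theta * A1

def max_eig_and_vec(M):
    """Largest eigenvalue and its unit-norm eigenvector of a symmetric matrix."""
    w, V = np.linalg.eigh(M)   # eigenvalues in ascending order
    return w[-1], V[:, -1]

theta0 = 0.7
lam, xi = max_eig_and_vec(M_of(theta0))

# Analytic derivative: xi^T (dM/dtheta) xi, valid when the maximum eigenvalue is simple.
analytic = xi @ A1 @ xi

# Central finite difference for comparison.
h = 1e-6
numeric = (max_eig_and_vec(M_of(theta0 + h))[0]
           - max_eig_and_vec(M_of(theta0 - h))[0]) / (2 * h)
```

The identity relies on the eigenvector being normalized and on the maximum eigenvalue being simple; with repeated eigenvalues the derivative may not exist in this form.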

Making use of the result of singular value decomposition in (16) gives

\begin{matrix} {\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s} = V_{1} V_{1}^{T} = v_{1} v_{1}^{T} + v_{2} v_{2}^{T} + \dots + v_{μ} v_{μ}^{T}, \end{matrix}

where

V_{1} = [\begin{matrix} v_{1} & v_{2} & \dots & v_{μ} \end{matrix}]

and

μ

denotes the number of rows of the matrix

{\bar{K}}_{o, s}

. Therefore,

\begin{matrix} \frac{\partial ({\bar{K}}_{o, s}^{T} {\bar{K}}_{o, s})}{\partial θ_{i}} = \frac{\partial (v_{1} v_{1}^{T} + v_{2} v_{2}^{T} + \dots + v_{μ} v_{μ}^{T})}{\partial θ_{i}} = \sum_{α = 1}^{μ} \frac{\partial (v_{α} v_{α}^{T})}{\partial θ_{i}}, \end{matrix}

in which

\begin{matrix} \frac{\partial (v_{α} v_{α}^{T})}{\partial θ_{i}} = & \frac{\partial v_{α}}{\partial θ_{i}} v_{α}^{T} + v_{α} \frac{\partial (v_{α}^{T})}{\partial θ_{i}} \\ = & \frac{\partial v_{α}}{\partial θ_{i}} v_{α}^{T} + v_{α} {(\frac{\partial v_{α}}{\partial θ_{i}})}^{T} . \end{matrix}

Notice that

\begin{matrix} K_{o, s}^{T} K_{o, s} = & [\begin{matrix} V_{1} & V_{2} \end{matrix}] [\begin{matrix} Σ_{1}^{T} \\ 0 \end{matrix}] U^{T} U [\begin{matrix} Σ_{1} & 0 \end{matrix}] [\begin{matrix} V_{1}^{T} \\ V_{2}^{T} \end{matrix}] \\ = & [\begin{matrix} V_{1} & V_{2} \end{matrix}] [\begin{matrix} Σ_{1}^{2} & 0 \\ 0 & 0 \end{matrix}] [\begin{matrix} V_{1}^{T} \\ V_{2}^{T} \end{matrix}], \end{matrix}

where

Σ_{1} = \mathrm{diag} \{σ_{1}, σ_{2}, \dots, σ_{μ}\}

. As a result,

v_{α}

is the eigenvector corresponding to the α-th eigenvalue $σ_{α}^{2}$ of matrix

K_{o, s}^{T} K_{o, s}

. For any

α = 1, 2, \dots, μ, i = 1, 2, \dots, τ

,

\begin{matrix} \frac{\partial v_{α}}{\partial θ_{i}} = & \frac{{(σ_{α}^{2} I - K_{o, s}^{T} K_{o, s})}^{+} \partial (K_{o, s}^{T} K_{o, s}) v_{α}}{\partial θ_{i}} \\ = & {(σ_{α}^{2} I - K_{o, s}^{T} K_{o, s})}^{+} \frac{\partial (K_{o, s}^{T} K_{o, s})}{\partial θ_{i}} v_{α} . \end{matrix}
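The eigenvector-derivative formula with the pseudo-inverse can likewise be verified on a small example. The following sketch assumes a hypothetical symmetric matrix `M(theta) = A0 + theta * A1` with a simple eigenvalue; the matrices and the tolerance `rcond=1e-10` are illustrative assumptions.

```python
import numpy as np

# Hypothetical symmetric matrices (illustration only).
A0 = np.array([[2.0, 0.3, 0.0], [0.3, 1.0, 0.2], [0.0, 0.2, 0.5]])
A1 = np.array([[0.5, 0.1, 0.0], [0.1, -0.3, 0.4], [0.0, 0.4, 0.2]])

def top_eigpair(theta):
    """Largest eigenvalue and unit-norm eigenvector of M(theta) = A0 + theta * A1."""
    w, V = np.linalg.eigh(A0 + theta * A1)
    return w[-1], V[:, -1]

theta0 = 0.3
lam, v = top_eigpair(theta0)
M = A0 + theta0 * A1

# Analytic derivative: (lam * I - M)^+ (dM/dtheta) v, for a simple eigenvalue lam.
# rcond is set explicitly so the numerically zero singular value is discarded.
dv_analytic = np.linalg.pinv(lam * np.eye(3) - M, rcond=1e-10) @ A1 @ v

# Central finite difference; eigenvector signs are aligned with v first.
h = 1e-6
v_plus = top_eigpair(theta0 + h)[1]
v_minus = top_eigpair(theta0 - h)[1]
v_plus = v_plus * np.sign(v_plus @ v)
v_minus = v_minus * np.sign(v_minus @ v)
dv_numeric = (v_plus - v_minus) / (2 * h)
```

Because eigenvectors are defined only up to sign, the finite-difference comparison needs the sign alignment shown; the analytic expression itself is orthogonal to the eigenvector, consistent with the unit-norm constraint.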

By the product rule,

\begin{matrix} \frac{\partial (K_{o, s}^{T} K_{o, s})}{\partial θ_{i}} = & \frac{\partial (K_{o, s}^{T})}{\partial θ_{i}} K_{o, s} + K_{o, s}^{T} \frac{\partial K_{o, s}}{\partial θ_{i}} \\ = & {(\frac{\partial K_{o, s}}{\partial θ_{i}})}^{T} K_{o, s} + K_{o, s}^{T} \frac{\partial K_{o, s}}{\partial θ_{i}} . \end{matrix}

Substituting (15) into the partial derivative leads to

\begin{matrix} \frac{\partial K_{o, s}}{\partial θ_{i}} = \frac{\partial [\begin{matrix} D_{ν} & \dots & 0 & 0 & I & \dots & 0 & 0 \\ C_{ν} B_{ν} & \dots & 0 & 0 & C_{ν} L_{ν} & \dots & 0 & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ C_{ν} A_{ν}^{s - 1} B_{ν} & \dots & C_{ν} B_{ν} & D_{ν} & C_{ν} A_{ν}^{s - 1} L_{ν} & \dots & C_{ν} L_{ν} & I \end{matrix}]}{\partial θ_{i}} . \end{matrix}

As

θ_{i}

may be an element of any of the vectors

θ_{A B}, θ_{C_{ν}}, θ_{D_{ν}}, θ_{L_{ν}}

, the derivation is divided into the following four cases.

First, when the parameter

θ_{i}

is one element of vector

θ_{A B}

,

\begin{matrix} \frac{\partial K_{o, s}}{\partial θ_{A B} (i)} = [\begin{matrix} \frac{\partial H_{u, s}}{\partial θ_{A B} (i)} & \frac{\partial H_{y, s}}{\partial θ_{A B} (i)} \end{matrix}], \end{matrix}

where

\begin{matrix} \frac{\partial H_{u, s}}{\partial θ_{A B} (i)} = & [\begin{matrix} 0 & 0 & \dots & 0 \\ C_{ν} \frac{\partial B_{ν}}{\partial θ_{A B} (i)} & 0 & \dots & 0 \\ C_{ν} (\frac{\partial A_{ν}}{\partial θ_{A B} (i)} B_{ν} + A_{ν} \frac{\partial B_{ν}}{\partial θ_{A B} (i)}) & C_{ν} \frac{\partial B_{ν}}{\partial θ_{A B} (i)} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ C_{ν} (\frac{\partial A_{ν}^{s - 1}}{\partial θ_{A B} (i)} B_{ν} + A_{ν}^{s - 1} \frac{\partial B_{ν}}{\partial θ_{A B} (i)}) & C_{ν} (\frac{\partial A_{ν}^{s - 2}}{\partial θ_{A B} (i)} B_{ν} + A_{ν}^{s - 2} \frac{\partial B_{ν}}{\partial θ_{A B} (i)}) & \dots & C_{ν} \frac{\partial B_{ν}}{\partial θ_{A B} (i)} \end{matrix}], \\ \frac{\partial H_{y, s}}{\partial θ_{A B} (i)} = & [\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ C_{ν} \frac{\partial A_{ν}}{\partial θ_{A B} (i)} L_{ν} & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ C_{ν} \frac{\partial A_{ν}^{s - 1}}{\partial θ_{A B} (i)} L_{ν} & C_{ν} \frac{\partial A_{ν}^{s - 2}}{\partial θ_{A B} (i)} L_{ν} & \dots & 0 \end{matrix}], \end{matrix}

and

\begin{matrix} \frac{\partial [\begin{matrix} B_{ν}^{T} (θ_{A B}) \\ A_{ν}^{T} (θ_{A B}) \end{matrix}]}{\partial θ_{A B} (i)} = & [\begin{matrix} \frac{\partial B_{ν}^{T} (θ_{A B})}{\partial θ_{A B} (i)} \\ \frac{\partial A_{ν}^{T} (θ_{A B})}{\partial θ_{A B} (i)} \end{matrix}] = Ψ_{1} (θ_{A B} (1)) \dots \frac{\partial Ψ_{i} (θ_{A B} (i))}{\partial θ_{A B} (i)} \dots Ψ_{n l} (θ_{A B} (n l)) [\begin{matrix} 0 \\ I_{n} \end{matrix}], \\ \frac{\partial Ψ_{i} (θ_{A B} (i))}{\partial θ_{A B} (i)} = & [\begin{matrix} 0 & 0 & 0 \\ 0 & \frac{\partial U (θ_{A B} (i))}{\partial θ_{A B} (i)} & 0 \\ 0 & 0 & 0 \end{matrix}], \\ \frac{\partial U (θ_{A B} (i))}{\partial θ_{A B} (i)} = & [\begin{matrix} - 1 & \frac{- θ_{A B} (i)}{\sqrt{1 - θ_{A B}^{2} (i)}} \\ \frac{- θ_{A B} (i)}{\sqrt{1 - θ_{A B}^{2} (i)}} & 1 \end{matrix}] . \end{matrix}

Second, when the parameter

θ_{i}

is one element of vector

θ_{C_{ν}}

, assume

θ_{i} = c_{p q}, p \in \{1, 2, \dots, m\}, q \in \{1, 2, \dots, n\}

, where

c_{p q}

corresponds to the p-th row and the q-th column of matrix

C_{ν}

,

\begin{matrix} \frac{\partial [\begin{matrix} D_{ν} & \dots & 0 & 0 & I & \dots & 0 & 0 \\ C_{ν} B_{ν} & \dots & 0 & 0 & C_{ν} L_{ν} & \dots & 0 & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ C_{ν} A_{ν}^{s - 1} B_{ν} & \dots & C_{ν} B_{ν} & D_{ν} & C_{ν} A_{ν}^{s - 1} L_{ν} & \dots & C_{ν} L_{ν} & I \end{matrix}]}{\partial c_{p q}} \\ = & [\begin{matrix} 0 & 0 & \dots & \dots & 0 & 0 & 0 & \dots & 0 & 0 \\ \frac{\partial C_{ν}}{\partial c_{p q}} B_{ν} & 0 & \dots & ⋮ & ⋮ & \frac{\partial C_{ν}}{\partial c_{p q}} L_{ν} & 0 & \dots & 0 & 0 \\ \frac{\partial C_{ν}}{\partial c_{p q}} A_{ν} B_{ν} & \frac{\partial C_{ν}}{\partial c_{p q}} B_{ν} & \dots & ⋮ & ⋮ & \frac{\partial C_{ν}}{\partial c_{p q}} A_{ν} L_{ν} & \frac{\partial C_{ν}}{\partial c_{p q}} L_{ν} & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋮ & \dots & ⋱ & ⋮ \\ \frac{\partial C_{ν}}{\partial c_{p q}} A_{ν}^{s - 1} B_{ν} & \frac{\partial C_{ν}}{\partial c_{p q}} A_{ν}^{s - 2} B_{ν} & \dots & \frac{\partial C_{ν}}{\partial c_{p q}} B_{ν} & 0 & \frac{\partial C_{ν}}{\partial c_{p q}} A_{ν}^{s - 1} L_{ν} & \frac{\partial C_{ν}}{\partial c_{p q}} A_{ν}^{s - 2} L_{ν} & \dots & \frac{\partial C_{ν}}{\partial c_{p q}} L_{ν} & 0 \end{matrix}] \end{matrix}

with

\begin{matrix} \frac{\partial C_{ν}}{\partial c_{p q}} = & \frac{\partial [\begin{matrix} c_{11} & \dots & c_{1 q} & \dots & c_{1 n} \\ ⋮ & ⋮ & ⋮ \\ c_{p 1} & \dots & c_{p q} & \dots & c_{p n} \\ ⋮ & ⋮ & ⋮ \\ c_{m 1} & \dots & c_{m q} & \dots & c_{m n} \end{matrix}]}{\partial c_{p q}} \\ = & [\begin{matrix} 0 & \dots & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋮ \\ 0 & \dots & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋮ \\ 0 & \dots & 0 & \dots & 0 \end{matrix}] . \end{matrix}
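The case of an element of the output matrix can be illustrated numerically. The sketch below assembles the block lower-triangular Toeplitz matrix in the layout displayed above (the exact definition is (15), which is not reproduced here) and compares the analytic derivative with respect to one entry of the output matrix against a finite difference. The helper names `build_K` and `dK_dc`, the dimensions, and the random test matrices are assumptions for illustration.

```python
import numpy as np

def build_K(A, B, C, D, L, s, unit_diag=True):
    """Assemble [H_u, H_y] with D (resp. I) on the block diagonal and
    C A^k B (resp. C A^k L) below it, following the layout shown in the text."""
    m, l = D.shape
    diag_y = np.eye(m) if unit_diag else np.zeros((m, m))
    Hu = np.zeros(((s + 1) * m, (s + 1) * l))
    Hy = np.zeros(((s + 1) * m, (s + 1) * m))
    for r in range(s + 1):
        for c in range(r + 1):
            if r == c:
                blk_u, blk_y = D, diag_y
            else:
                CAk = C @ np.linalg.matrix_power(A, r - c - 1)
                blk_u, blk_y = CAk @ B, CAk @ L
            Hu[r * m:(r + 1) * m, c * l:(c + 1) * l] = blk_u
            Hy[r * m:(r + 1) * m, c * m:(c + 1) * m] = blk_y
    return np.hstack([Hu, Hy])

def dK_dc(A, B, C, D, L, s, p, q):
    """Derivative with respect to C[p, q]: replace C by the unit matrix E_pq
    and drop the constant diagonal blocks (D and I)."""
    E = np.zeros_like(C)
    E[p, q] = 1.0
    return build_K(A, B, E, np.zeros_like(D), L, s, unit_diag=False)

rng = np.random.default_rng(0)
n, m, l, s = 3, 2, 2, 3                      # assumed dimensions
A = 0.5 * rng.standard_normal((n, n))
B = rng.standard_normal((n, l))
C = rng.standard_normal((m, n))
D = rng.standard_normal((m, l))
L = rng.standard_normal((n, m))

p, q = 1, 2                                   # zero-based indices of the chosen entry
dK = dK_dc(A, B, C, D, L, s, p, q)

h = 1e-6
E = np.zeros_like(C)
E[p, q] = 1.0
dK_fd = (build_K(A, B, C + h * E, D, L, s) - build_K(A, B, C - h * E, D, L, s)) / (2 * h)
```

Since the matrix is affine in the output matrix, the derivative is obtained simply by substituting the unit matrix for it, which is what the displayed expression states block by block.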

Third, when the parameter

θ_{i}

is one element of vector

θ_{D_{ν}}

, assume

θ_{i} = d_{p q}, p \in \{1, 2, \dots, m\}, q \in \{1, 2, \dots, l\}

, where

d_{p q}

corresponds to the p-th row and the q-th column of matrix

D_{ν}

,

\begin{matrix} \frac{\partial [\begin{matrix} D_{ν} & \dots & 0 & 0 & I & \dots & 0 & 0 \\ C_{ν} B_{ν} & \dots & 0 & 0 & C_{ν} L_{ν} & \dots & 0 & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ C_{ν} A_{ν}^{s - 1} B_{ν} & \dots & C_{ν} B_{ν} & D_{ν} & C_{ν} A_{ν}^{s - 1} L_{ν} & \dots & C_{ν} L_{ν} & I \end{matrix}]}{\partial d_{p q}} \\ = & [\begin{matrix} \frac{\partial D_{ν}}{\partial d_{p q}} & 0 & 0 & \dots & 0 & 0 & \dots & \dots & \dots & 0 \\ 0 & \frac{\partial D_{ν}}{\partial d_{p q}} & 0 & \dots & ⋮ & 0 & 0 & \dots & \dots & ⋮ \\ 0 & 0 & \frac{\partial D_{ν}}{\partial d_{p q}} & \dots & ⋮ & 0 & 0 & 0 & \dots & ⋮ \\ ⋮ & ⋮ & ⋮ & ⋱ & 0 & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & \dots & \dots & 0 & \frac{\partial D_{ν}}{\partial d_{p q}} & 0 & \dots & \dots & \dots & 0 \end{matrix}] \end{matrix}

with

\begin{matrix} \frac{\partial D_{ν}}{\partial d_{p q}} = & \frac{\partial [\begin{matrix} d_{11} & \dots & d_{1 q} & \dots & d_{1 l} \\ ⋮ & ⋮ & ⋮ \\ d_{p 1} & \dots & d_{p q} & \dots & d_{p l} \\ ⋮ & ⋮ & ⋮ \\ d_{m 1} & \dots & d_{m q} & \dots & d_{m l} \end{matrix}]}{\partial d_{p q}} \\ = & [\begin{matrix} 0 & \dots & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋮ \\ 0 & \dots & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋮ \\ 0 & \dots & 0 & \dots & 0 \end{matrix}] . \end{matrix}

Fourth, when the parameter

θ_{i}

is one element of vector

θ_{L_{ν}}

, assume

θ_{i} = l_{p q}, p \in \{1, 2, \dots, n\}, q \in \{1, 2, \dots, m\}

, where

l_{p q}

corresponds to the p-th row and the q-th column of matrix

L_{ν}

,

\begin{matrix} \frac{\partial [\begin{matrix} D_{ν} & \dots & 0 & 0 & I & \dots & 0 & 0 \\ C_{ν} B_{ν} & \dots & 0 & 0 & C_{ν} L_{ν} & \dots & 0 & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ C_{ν} A_{ν}^{s - 1} B_{ν} & \dots & C_{ν} B_{ν} & D_{ν} & C_{ν} A_{ν}^{s - 1} L_{ν} & \dots & C_{ν} L_{ν} & I \end{matrix}]}{\partial l_{p q}} \\ = & [\begin{matrix} 0 & 0 & \dots & 0 & 0 & \dots & \dots & \dots & 0 \\ 0 & 0 & \dots & ⋮ & C_{ν} \frac{\partial L_{ν}}{\partial l_{p q}} & 0 & \dots & \dots & ⋮ \\ 0 & 0 & \dots & ⋮ & C_{ν} A_{ν} \frac{\partial L_{ν}}{\partial l_{p q}} & C_{ν} \frac{\partial L_{ν}}{\partial l_{p q}} & 0 & \dots & ⋮ \\ ⋮ & ⋮ & ⋱ & 0 & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & \dots & 0 & 0 & C_{ν} A_{ν}^{s - 1} \frac{\partial L_{ν}}{\partial l_{p q}} & \dots & \dots & C_{ν} \frac{\partial L_{ν}}{\partial l_{p q}} & 0 \end{matrix}] \end{matrix}

with

\begin{matrix} \frac{\partial L_{ν}}{\partial l_{p q}} = & \frac{\partial [\begin{matrix} l_{11} & \dots & l_{1 q} & \dots & l_{1 m} \\ ⋮ & ⋮ & ⋮ \\ l_{p 1} & \dots & l_{p q} & \dots & l_{p m} \\ ⋮ & ⋮ & ⋮ \\ l_{n 1} & \dots & l_{n q} & \dots & l_{n m} \end{matrix}]}{\partial l_{p q}} \\ = & [\begin{matrix} 0 & \dots & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋮ \\ 0 & \dots & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋮ \\ 0 & \dots & 0 & \dots & 0 \end{matrix}] . \end{matrix}

At this point, all quantities required to evaluate the gradient of the cost function have been derived, so the parameter update (19) can be carried out. To summarize, this study aims to achieve the online configuration of an observer-based residual generator for fault detection systems. To this end, the observer is first transformed into the input normal form, which guarantees asymptotic stability. Based on the input normal form, the residual generator is established, and its parameters are to be configured online to cope with changes in the operating condition or uncertainties of the system. The configuration is then formulated as an optimization problem by means of the K-gap metric. Finally, the gradient descent method yields the parameter updates that realize the online configuration.

Remark 1.

Note that the optimization problem of this paper is established based on the K-gap metric concept. The purpose is to minimize the K-gap metric

δ_{k_{d, s}} (K_{o, s}, K_{d, s})

between the observer and the system plant, which characterizes the distance between two kernel subspaces. Following [37], a cluster is defined as follows. For

δ_{f} \in (0, 1)

, the set

\begin{matrix} C_{f} \subseteq \{K : δ_{k} (K, K_{f}) \leq δ_{f}\} \end{matrix}

is called

C_{f}

cluster with the cluster center

K_{f}

and cluster radius

δ_{f}

. From this point of view, by minimizing the K-gap metric

δ_{k_{d, s}} (K_{o, s}, K_{d, s})

, the obtained

K_{o, s}

falls into the cluster with the cluster center

K_{d, s}

and a certain cluster radius. The smaller the cluster radius, the closer

K_{o, s}

is to

K_{d, s}

. Therefore, the residual generator based on the optimized

K_{o, s}

can guarantee the reliability and efficiency of the fault detection system.

3.3. Online Configuration Realization of Fault Detection

This subsection presents an online algorithm that achieves the fault detection goal by employing the proposed gradient-based configuration scheme. The detailed procedure for configuring the observer-based residual generator online is described in Algorithm 1.

Algorithm 1 Online Configuration of Observer-Based Residual Generator
Step 1:	Collect the I/O data $u (k), y (k)$ at each k and select an iteration interval W
Step 2:	Execute Steps 3–6 for the j-th iteration every W
Step 3:	According to Definition 3, compute the SKR $K_{d, s}$ and its normalized form ${\bar{K}}_{d, s}$
Step 4:	Compute the SKR $K_{o, s}$ of the observer and its normalized form ${\bar{K}}_{o, s}$ by (15) and (16)
Step 5:	Apply Lemma 2 to obtain the K-gap metric $δ_{k_{d, s}} (K_{o, s}, K_{d, s})$ between $K_{o, s}$ and $K_{d, s}$
Step 6:	Given a scalar $ε$, if $δ_{k_{d, s}} (K_{o, s}, K_{d, s}) > ε$, compute the gradient, update the parameters of the residual generator according to (19), increase j by 1, and return to Step 2; otherwise, the configuration ends
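Steps 3–6 hinge on the normalized SKRs and the K-gap value. A minimal sketch of this computation, based on the expression obtained from Lemma 2 in Section 3.2 (largest singular value of the product of the two projectors), is given below. The function names `normalize_skr` and `k_gap`, the rank tolerance, and the test matrices are assumptions for illustration; the data-driven identification of the SKR (Definition 3) is not reproduced here.

```python
import numpy as np

def normalize_skr(K, tol=1e-10):
    """Return a matrix with orthonormal rows spanning the row space of K,
    obtained from the right singular vectors of K."""
    _, sv, Vt = np.linalg.svd(K, full_matrices=False)
    rank = int(np.sum(sv > tol * sv[0]))
    return Vt[:rank]

def k_gap(K_o, K_d):
    """K-gap value: largest singular value of (I - Kd_bar^T Kd_bar) Ko_bar^T Ko_bar."""
    Ko_bar = normalize_skr(K_o)
    Kd_bar = normalize_skr(K_d)
    P_o = Ko_bar.T @ Ko_bar          # projector onto the row space of K_o
    P_d = Kd_bar.T @ Kd_bar          # projector onto the row space of K_d
    I = np.eye(P_d.shape[0])
    return np.linalg.norm((I - P_d) @ P_o, 2)

# Step 6 style check with an assumed tolerance epsilon.
rng = np.random.default_rng(1)
K_d = rng.standard_normal((2, 5))
T = np.array([[2.0, 1.0], [0.0, 3.0]])      # invertible row transformation
gap_same = k_gap(T @ K_d, K_d)               # same row space
gap_orth = k_gap(np.array([[1.0, 0.0, 0.0]]), np.array([[0.0, 1.0, 0.0]]))
epsilon = 1e-3
needs_update = gap_orth > epsilon
```

The value is close to zero when both matrices span the same subspace and equals one for orthogonal subspaces, which matches the interpretation of the K-gap as a distance between kernel subspaces.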

For fault detection, the evaluation function is chosen as

J (k) = {∥ r (k) ∥}_{2}^{2}

with the threshold

J_{th}

chosen as below,

\begin{matrix} J_{th} = \sup_{f = 0, d, Δ} J (k), \end{matrix}

where d and

Δ

represent the disturbances and model uncertainties, respectively. Using the residual generator configured online by Algorithm 1, one computes the evaluation function

J (k)

. The decision logic is described by

\begin{matrix} \{\begin{matrix} J (k) > J_{th} \Rightarrow \text{faulty}, \\ J (k) \leq J_{th} \Rightarrow \text{fault-free} . \end{matrix} \end{matrix}

(20)
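The evaluation function and the decision logic (20) can be sketched as follows. The supremum in the threshold definition is replaced here by the maximum over a finite set of fault-free residual samples, which is an assumption made for illustration; the function names and the sample values are hypothetical.

```python
import numpy as np

def evaluation(r):
    """J(k) = ||r(k)||_2^2 for each row r(k) of the residual array
    (shape: samples x residual channels)."""
    r = np.atleast_2d(np.asarray(r, dtype=float))
    return np.sum(r ** 2, axis=1)

def threshold_from_fault_free(r_fault_free):
    """Empirical stand-in for the supremum of J(k) over fault-free data (assumption)."""
    return float(np.max(evaluation(r_fault_free)))

def detect(r, J_th):
    """Decision logic (20): True means 'faulty', False means 'fault-free'."""
    return evaluation(r) > J_th

# Hypothetical residual samples.
r_fault_free = np.array([[0.1, -0.2], [0.0, 0.3]])
J_th = threshold_from_fault_free(r_fault_free)

r_online = np.array([[0.1, 0.1], [0.5, 0.0]])
alarms = detect(r_online, J_th)
```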

4. Simulation Experiment

In this section, a simulation experiment is carried out on a three-tank system to show the effectiveness and advantages of the proposed real-time configuration scheme. Figure 2 shows the schematic diagram of the three-tank system, which is composed of three water tanks and several connecting pipes. In Figure 2, $h_{1}$, $h_{2}$, and $h_{3}$ refer to the water levels of the three tanks, which are measured by sensors and taken as the output signals. $Q_{1}$ and $Q_{2}$ stand for the incoming mass flow rates of Pump 1 and Pump 2 and are used as the input signals. In addition, PV$_{1}$, PV$_{2}$, PV$_{3}$, LV$_{1}$, LV$_{2}$, and LV$_{3}$ are adjustable ball valves that open and close the pipes.

The system plant can be represented by the nonlinear dynamics

\begin{matrix} \{\begin{matrix} A {\dot{h}}_{1} = Q_{1} - α_{1} s \, \mathrm{sgn} (h_{1} - h_{3}) \sqrt{2 g | h_{1} - h_{3} |}, \\ A {\dot{h}}_{2} = Q_{2} + α_{3} s \, \mathrm{sgn} (h_{3} - h_{2}) \sqrt{2 g | h_{3} - h_{2} |} - α_{2} s \sqrt{2 g h_{2}}, \\ A {\dot{h}}_{3} = α_{1} s \, \mathrm{sgn} (h_{1} - h_{3}) \sqrt{2 g | h_{1} - h_{3} |} - α_{3} s \, \mathrm{sgn} (h_{3} - h_{2}) \sqrt{2 g | h_{3} - h_{2} |}, \end{matrix} \end{matrix}

(21)

in which $A = 154$ cm$^{2}$ and $s = 0.5$ cm$^{2}$ denote the cross-sectional areas of the tanks and pipes, respectively, and $α_{1} = 0.46$, $α_{2} = 0.60$, $α_{3} = 0.45$ are the flow coefficients of the three pipes. In addition, the maximum height of the tanks is $H_{\max} = 62$ cm, and the maximum flow rates of Pumps 1 and 2 are $Q_{1 \max} = Q_{2 \max} = 100$ cm$^{3}$/s. For a given operating point, the nonlinear representation (21) can be reformulated into the LTI system (1) by linearization.
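The nonlinear model (21) and its linearization can be sketched numerically as follows. The gravitational acceleration `G = 981` cm/s² is an assumption (it is not stated in the text), as are the function names, the finite-difference step, and the pump flows used in the example; the Jacobian is only the state matrix of a continuous-time linearization, not the discrete-time LTI system (1).

```python
import numpy as np

A_TANK = 154.0             # tank cross-sectional area, cm^2
S_PIPE = 0.5               # pipe cross-sectional area, cm^2
ALPHA = (0.46, 0.60, 0.45) # flow coefficients alpha_1, alpha_2, alpha_3
G = 981.0                  # gravitational acceleration in cm/s^2 (assumed value)

def pipe_flow(alpha, dh):
    """Signed flow through a connecting pipe driven by the level difference dh."""
    return alpha * S_PIPE * np.sign(dh) * np.sqrt(2.0 * G * abs(dh))

def three_tank_rhs(h, Q):
    """Right-hand side of (21): time derivatives of the levels (h1, h2, h3)."""
    h1, h2, h3 = h
    Q1, Q2 = Q
    q13 = pipe_flow(ALPHA[0], h1 - h3)               # tank 1 -> tank 3
    q32 = pipe_flow(ALPHA[2], h3 - h2)               # tank 3 -> tank 2
    q20 = ALPHA[1] * S_PIPE * np.sqrt(2.0 * G * h2)  # outflow of tank 2
    return np.array([Q1 - q13, Q2 + q32 - q20, q13 - q32]) / A_TANK

def numerical_jacobian(h, Q, eps=1e-6):
    """Central-difference Jacobian of the right-hand side with respect to the levels."""
    h = np.asarray(h, dtype=float)
    J = np.zeros((3, 3))
    for i in range(3):
        dh = np.zeros(3)
        dh[i] = eps
        J[:, i] = (three_tank_rhs(h + dh, Q) - three_tank_rhs(h - dh, Q)) / (2 * eps)
    return J

h_op = np.array([45.0, 15.0, 30.0])   # first operating point from the text, in cm
Q_op = (30.0, 20.0)                   # hypothetical pump flows in cm^3/s
rhs_op = three_tank_rhs(h_op, Q_op)
A_lin = numerical_jacobian(h_op, Q_op)
```

A simple sanity check on such a model is volume conservation: the sum of the three level derivatives, multiplied by the tank area, equals the total inflow minus the outflow of tank 2.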

In this simulation, the operating time is set to 25,000 s and the sampling period is 1 s. Initially, the operating point of the three-tank system is $h_{1} = 45$ cm, $h_{2} = 15$ cm, and $h_{3} = 30$ cm, for which a residual generator is predesigned and applied to fault detection. To reflect practical industrial demands, the operating point is assumed to change to $h_{1} = 50$ cm, $h_{2} = 46$ cm, and $h_{3} = 48$ cm at 6000 s. The process data are collected and displayed in Figure 3.

Figure 4 displays the evolution of the residual signals and the evaluation function, and Figure 5 shows the K-gap metric between the system plant and the observer. Before the change of the operating point at 6000 s, the evaluation function stays below the threshold and the K-gap metric is 0.0007072, which means that the kernel subspaces of the system plant and the observer are sufficiently close and thus the predesigned residual generator is appropriate. From 6000 s, the evaluation function increases and clearly exceeds the threshold, which causes a false alarm although no fault is present. Meanwhile, the K-gap metric between the system plant and the observer increases markedly. Clearly, the predesigned residual generator is no longer applicable to the system at the changed operating point. To handle this problem and demonstrate the validity of the proposed method, the real-time configuration algorithm, i.e., Algorithm 1, is started at 9000 s. It can be observed from Figure 4 and Figure 5 that, from 9000 s, the K-gap metric begins to decline as the real-time configuration proceeds, and the residual signals and the evaluation function begin to decrease and gradually converge. At approximately 14,700 s, the K-gap metric settles at approximately 0.001, and the kernel subspace of the observer is adequately close to that of the system plant. In addition, the evaluation function falls below the threshold and the false alarm disappears, which indicates that the residual generator optimized in real time is effective for the current operating point. Therefore, the optimization goal is realized and the real-time configuration of the residual generator is achieved. The optimized parameters $θ_{A B}$ and $θ_{C}$ are given in Figure 6 and Figure 7, respectively, to show the parameter configuration process.

Now, the online-configured residual generator is applied to fault detection to verify its usefulness. Two types of fault are considered, each assumed to occur at 23,000 s in a separate test. First, a leakage fault with a 20% leakage level occurs in tank 2 at 23,000 s. Figure 8 shows the fault detection result obtained with the online-configured residual generator. The evaluation function exceeds the threshold at 23,005 s, i.e., the leakage fault is detected 5 s after its occurrence. Second, a drift fault with slope 0.01 occurring in the water-level sensor of tank 2 is considered at 23,000 s. Figure 9 shows the corresponding fault detection result, from which one can see that the drift fault is detected at 23,042 s, i.e., 42 s after its occurrence. These results show that the residual generator configured by the proposed method detects both faults reliably and in a timely manner. Consequently, the real-time configuration method proposed in this paper can effectively handle systems subject to operating point changes or uncertainties, and its effectiveness in fault detection is demonstrated.

5. Conclusions

Considering that changes of operating conditions and practical environment, as well as uncertainties, may occur in industrial processes, this paper is dedicated to the real-time configuration of fault detection systems. A gradient optimization scheme is adopted for the real-time configuration. A novel optimization algorithm is developed by virtue of the gradient-based technique, in which the K-gap metric between the residual generator and the current system is minimized. The residual generator parameters are then updated based on the I/O data, and the real-time configuration of the fault detection system is realized. Finally, the usefulness and merits of the proposed approach are demonstrated through a benchmark case on a three-tank model. One main advantage of this work is that, by virtue of the K-gap metric together with the gradient-based method, the online-configured residual generator parameters maintain the fault detection performance for industrial systems with changing operating points. Besides, the input/output data are fully exploited for fault detection, which avoids the difficulties of system identification in practice. As the real-time configuration method is carried out on process data, the computational load is the main concern in implementation, especially for large-scale industrial systems, which motivates further research in this field. Note that one of the main contributions of this paper is to introduce the K-gap idea into real-time configuration, which exploits the essential characteristic of fault detection systems. In future work, the K-gap-based optimization configuration approach will be extended to systems with nonlinearities and to fault-tolerant control.

Author Contributions

Conceptualization, H.Z.; Formal analysis, T.L.; Funding acquisition, H.L.; Methodology, H.Z.; Software, H.Z.; Supervision, H.L.; Writing—original draft, H.Z.; Writing—review & editing, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China under Grants U20A20186 and 62073104.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Fekih, A.; Xu, H.; Chowdhury, F. Neural networks based system identification techniques for model based fault detection of nonlinear systems. Int. J. Innov. Comput. Inf. Control 2007, 3, 1073–1085.
2. Wang, X.L.; Yang, G.H. Event-triggered fault detection for discrete-time T-S fuzzy systems. ISA Trans. 2018, 76, 18–30.
3. Son, J.; Du, Y. Model-based stochastic fault detection and diagnosis of Lithium-Ion batteries. Processes 2019, 7, 38.
4. Mazzoletti, M.A.; Bossio, G.R.; De Angelo, C.H.; Espinoza-Trejo, D.R. A Model-based strategy for interturn short-circuit fault diagnosis in PMSM. IEEE Trans. Ind. Electron. 2017, 64, 7218–7228.
5. Huang, S.P.; Xiang, Z.R.; Karimi, H.R. Mixed L₋/L₁ fault detection filter design for fuzzy positive linear systems with time-varying delays. IET Control Theory A 2014, 8, 1023–1031.
6. Poon, J.; Jain, P.; Konstantakopoulos, I.; Spanos, C.; Panda, S.; Sanders, S. Model-based fault detection and identification for switching power converters. IEEE Trans. Power Electron. 2017, 32, 1419–1430.
7. Yang, C.; Fang, H. A new nonlinear model-based fault detection method using Mann-Whitney test. IEEE Trans. Ind. Electron. 2020, 67, 10856–10864.
8. Gao, Y.; Xiao, F.; Liu, J.; Wang, R. Distributed soft fault detection for interval type-2 fuzzy-model-based stochastic systems with wireless sensor networks. IEEE Trans. Ind. Inform. 2019, 15, 334–347.
9. Zhou, S.; Bai, J.; Wu, F. Decentralized fault detection and fault-tolerant control for nonlinear interconnected systems. Processes 2021, 9, 591.
10. Zhang, P.; Ding, S.X.; Wang, G.Z.; Zhou, D.H. Fault detection of linear discrete-time periodic systems. IEEE Trans. Automat. Control 2005, 50, 239–244.
11. Zhong, M.; Ding, S.X.; Ding, E.L. Optimal fault detection for linear discrete time-varying systems. Automatica 2010, 46, 1395–1400.
12. Li, X.; Yang, G.H. Fault detection for linear stochastic systems with sensor stuck faults. Optim. Contr. Appl. Met. 2012, 33, 61–80.
13. Su, Q.Y.; Li, J. Fault detection for a class of uncertain linear systems. Math. Probl. Eng. 2013, 33, 856914.
14. Fan, C.; Lam, J.; Xie, X. Fault detection observer design for periodic piecewise linear systems. Int. J. Syst. Sci. 2020, 51, 1622–1636.
15. Li, L.; Ding, S.X.; Qiu, J.; Yang, Y.; Xu, D. Fuzzy observer-based fault detection design approach for nonlinear processes. IEEE Trans. Syst. Man Cybern. Syst. 2017, 47, 1941–1952.
16. Li, L.; Ding, S.X.; Yang, Y.; Zhang, Y. Robust fuzzy observer-based fault detection for nonlinear systems with disturbances. Neurocomputing 2016, 174, 767–772.
17. Zhirabok, A.; Shumsky, A.; Solyanik, S.; Suvorov, A. Fault detection in nonlinear systems via linear methods. Int. J. Appl. Math. Comput. Sci. 2017, 27, 261–272.
18. Boem, F.; Riverso, S.; Ferrari-Trecate, G.; Parisini, T. Plug-and-Play fault detection and isolation for large-scale nonlinear systems with stochastic uncertainties. IEEE Trans. Automat. Control 2019, 64, 4–19.
19. Venkateswaran, S.; Liu, Q.C.; Wilhite, B.A.; Kravaris, C. Design of linear residual generators for fault detection and isolation in nonlinear systems. Int. J. Control 2020, 1–17.
20. Ding, S.X.; Yang, Y.; Zhang, Y.; Li, L. Data-driven realizations of kernel and image representations and their application to fault detection and control system design. Automatica 2014, 50, 2615–2623.
21. Yin, S.; Wang, G.; Karimi, H.R. Data-driven design of robust fault detection system for wind turbines. Mechatronics 2014, 24, 298–306.
22. Naderi, E.; Khorasani, K. A data-driven approach to actuator and sensor fault detection, isolation and estimation in discrete-time linear systems. Automatica 2017, 85, 165–178.
23. Tariq, M.F.; Khan, A.Q.; Abid, M.; Mustafa, G. Data-driven robust fault detection and isolation of three-phase induction motor. IEEE Trans. Ind. Electron. 2019, 66, 4707–4715.
24. Luo, H.; Li, K.; Kaynak, O.; Yin, S.; Huo, M.; Zhao, H. A robust data-driven fault detection approach for rolling mills with unknown roll eccentricity. IEEE Trans. Control Syst. Technol. 2020, 28, 2641–2648.
25. Fu, F.; Wang, D.; Li, L.; Li, W.; Wu, Z. Data-driven method for the quantitative fault diagnosability analysis of dynamic systems. IET Control Theory A 2019, 13, 1197–1203.
26. Yu, D.; Chen, Z.M.; Xiahou, K.S.; Li, M.S.; Ji, T.Y.; Wu, Q.H. A radically data-driven method for fault detection and diagnosis in wind turbines. Int. J. Electr. Power Energy Syst. 2018, 99, 577–584.
27. Wang, X.; Yang, G.; Zhang, D. Data-driven fault detection for linear systems: A q-step residual iteration approach. Int. J. Robust Nonlinear 2020, 30, 5341–5355.
28. Li, L.; Ding, S.X.; Peng, X. Distributed data-driven optimal fault detection for large-scale systems. J. Process Control 2020, 96, 94–103.
29. Kallas, M.; Mourot, G.; Maquin, D.; Ragot, J. Data-driven approach for fault detection and isolation in nonlinear system. Int. J. Adapt. Control Signal Process. 2018, 32, 1569–1590.
30. Chen, H.T.; Jiang, B.; Chen, W.; Yi, H. Data-driven detection and diagnosis of incipient faults in electrical drives of high-speed trains. IEEE Trans. Ind. Electron. 2019, 66, 4716–4725.
31. Ding, S.X.; Yin, S.; Zhang, P.; Ding, E.L.; Naik, A. An approach to data-driven adaptive residual generator design and implementation. IFAC Proc. 2009, 42, 941–946.
32. Chen, J.M.; Yang, F.W. Data-driven subspace-based adaptive fault detection for solar power generation systems. IET Control Theory A 2013, 7, 1498–1508.
33. Chen, Z.; Peng, T.; Yang, C.; Li, F.; He, Z. An adaptive data-driven fault detection method for monitoring dynamic process. In Proceedings of the IECON 2018-44th Annual Conference of the IEEE Industrial Electronics Society, Washington DC, USA, 21–23 October 2018; pp. 5353–5358.
34. Luo, H. Plug-and-Play Monitoring and Performance Optimization for Industrial Automation Processes; Springer Vieweg: Wiesbaden, Germany, 2017.
35. Ding, S.X. Data-Driven Design of Fault Diagnosis and Fault-Tolerant Control Systems; Springer: New York, NY, USA, 2014.
36. Vinnicombe, G. Uncertainty and Feedback: Hinf Loop-Shaping and the V-Gap Metric; World Science: New York, NY, USA, 2000.
37. Li, L.; Ding, S.X. Gap metric techniques and their application to fault detection performance analysis and fault isolation schemes. Automatica 2020, 118, 109029.
38. Koenings, T.; Krueger, M.; Luo, H.; Ding, S.X. A data-driven computation method for the gap metric and the optimal stability margin. IEEE Trans. Automat. Control 2018, 63, 805–810.
39. Li, H.; Yang, Y.; Zhao, Z.; Zhou, J.; Liu, R. Fault detection via data-driven K-gap metric with application to ship propulsion systems. In Proceedings of the 37th Chinese Control Conference (CCC), Guangzhou, China, 27–30 July 2018; pp. 6023–6027.
40. Hanzon, B.; Olivi, M.; Peeters, R.L.M. Balanced realizations of discrete-time stable all-pass systems and the tangential Schur algorithm. Linear Algebra Its Appl. 2006, 418, 793–820.
41. Verhaegen, M.; Verdult, V. Filtering and System Identification: A Least Squares Approach; Cambridge University Press: Cambridge, UK, 2012.

Figure 1. Framework of the online configuration based on K-gap metric.

Figure 2. Schematic diagram of three-tank system.

Figure 3. Input and output data.

Figure 4. Residual signal and evaluation function under the optimization configuration.

Figure 5. K-gap metric under the online optimization configuration.

Figure 6. Optimized parameters of θ_{AB}.

Figure 7. Optimized parameters of θ_{C}.

Figure 8. Fault detection by the optimization configuration under leakage fault.

Figure 9. Fault detection by the optimization configuration under sensor drift fault.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

