Sensor Fault Diagnosis for Aero Engine Based on Online Sequential Extreme Learning Machine with Memory Principle

Lu, Junjie; Huang, Jinquan; Lu, Feng

doi:10.3390/en10010039

Open AccessArticle

Sensor Fault Diagnosis for Aero Engine Based on Online Sequential Extreme Learning Machine with Memory Principle

by

Junjie Lu

,

Jinquan Huang

^* and

Feng Lu

Jiangsu Province Key Laboratory of Aerospace Power Systems, College of Energy and Power Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China

^*

Author to whom correspondence should be addressed.

Energies 2017, 10(1), 39; https://doi.org/10.3390/en10010039

Submission received: 5 September 2016 / Revised: 18 December 2016 / Accepted: 20 December 2016 / Published: 1 January 2017

Download

Browse Figures

Versions Notes

Abstract

:

The on-board sensor fault detection and isolation (FDI) system is essential to guarantee the reliability and safety of an aero engine. In this paper, a novel online sequential extreme learning machine with memory principle (MOS-ELM) is proposed for detecting, isolating, and reconstructing the fault sensor signal of aero engines. In many practical online applications, the sequentially coming data chunk usually possesses a characteristic of timeliness, and the overdue training data may mislead the subsequent learning process. The proposed MOS-ELM can improve the training process by introducing the concept of memory principle into the online sequential extreme learning machine (OS-ELM) to tackle the timeliness of the data chunk. Simulations on some time series problems and some benchmark databases show that MOS-ELM performs better in generalization performance, stability, and prediction accuracy than OS-ELM. The experiment results of the MOS-ELM-based sensor fault diagnosis system also verify the excellent generalization performance of MOS-ELM and indicate the effectiveness and feasibility of the developed diagnosis system.

Keywords:

extreme learning machine (ELM); memory principle; online learning; aero engine; sensor fault diagnosis

1. Introduction

Aero engine health management (EHM) is a widely researched field due to its operational reliability and maintenance costs [1,2]. Accuracy of most health management systems relies on accuracy of measurements acquired from sensors [3]. Due to harsh operating conditions, such as strong vibration, high pressure and high temperature, most sensors are extremely vulnerable to breaking down, which may incur false alarms and increased engine downtime, thus resulting in lower reliability and higher operational costs [4,5]. Therefore, an on-board sensor fault detection and isolation (FDI) system is of great importance in enhancing the reliability and safety of an aero engine.

The field of sensor FDI systems for aero engines has been studied over the past few decades [6], and plenty of related works have been reported [7,8,9,10,11,12,13,14]. Wallhagen and Arpasi [7] utilized analytical redundancy to address serious sensor fault in one of the engine spool speeds and the compressor outlet static pressure signal, which has set the theoretical basis and given an excellent instance for using the analytical redundancy to diagnose the engine control system. Analytical redundancy technique estimates the sensor measurement according to numerical algorithms, and can reduce the weight and cost of the diagnosis system. With these excellent advantages, the analytical redundancy has attracted large amounts of interest, e.g., Bras designed an FDI system for the inertial and vector sensors of the navigation systems by taking advantage of existing hardware redundancy and exploiting the analytical redundancy [8]. Lu et al. [9] developed integrated architecture according to an adaptive performance model and a baseline model which are both real-time on-board models. The detection can be determined by the comparison between the baseline model outputs and the performance model outputs, but the baseline model is a piecewise linear model and the storage space cost is huge. Bahareh et al. [10] researched the hybrid Kalman filter bank in the detection and isolation of the sensor faults throughout the whole flight envelope. Escobar et al. [11] presented the sensor fault compensation technique by a pair of high-gain observers and model predictive control strategy. Unfortunately, the reliability of Bahareh’s and Escobar’s diagnosis systems also suffer from the modeling error. Mattern et al. [12] compared the sensor fault diagnosis performance by a functional approximation neural network with that by an auto-associative neural network (AANN). Sadough-Vanini et al. [13] provided an integrated solution to the sensor FDI problem based on the multi-model approach and a bank of AANNs. The diagnosis system based on AANN does not need the engine modeling knowledge and is also able to perform well for the diagnosis tasks. Torella [14] discussed diagnosis of the apparatus fault for turbine engines according to certain probabilistic expert systems. The expert system with knowledge bases is able to avoid the problem that the same symptoms may be due to different causes.

As shown above, there are three main techniques, the data-based techniques [15,16], the model-based techniques [17,18] and the hybrid techniques [19], used to address the sensor fault problems. Model-based techniques have the ability to diagnose new sensor faults even if there is no prior knowledge and experience, but it depends on the accuracy of the on-board adaptive engine model whose reliability is bound to decline if the nonlinear complexities and modeling uncertainties are increased [20]. On the other hand, the data-based method does not need any knowledge of the internal engine working principle and complex engine modeling skills and thus attracts lots of interest and concern. With the rapid development of intelligent computing methods, a good deal of data-based methods have arisen and been applied in sensor fault diagnosis for aero engines since the early 1990s. Shah et al. [21] applied an AANN to pre-processing measurement data, and fed the output of AANN to the EHM system. Ogaji et al. [22] modularly designed a diagnosing and quantifying system for the double sensor faults in aero engine by a bank of neural networks, and the diagnosis results are determined according to the comparison between the predicted value and the real measurement acquired from the sensors. Since the neural networks used by Shah and Ogaji are trained by gradient-based algorithms in an iterative way, the diagnosis systems suffer from time-consuming problems. Xu et al. [23] presented a least squares support vector machine (LS-SVM)-based sensor fault diagnosis system. Zhang and Li [24] proposed introducing the idea of fuzzy membership into an LS-SVM- based fault diagnosis system for yaw angular rate sensor. Although the LS-SVM-based diagnosis systems have high accuracy in offline cases, they are not suitable for online applications. For most of the data-based methods, the training process of the diagnosis system is implemented offline in which the dynamic characteristics of the system cannot be well dealt with and the samples used for the training process are only applicable in certain working conditions. In practical applications, the working operation often changes across a wide range, while the diagnosis system trained offline may not adapt to the dynamic changes. Therefore, research on an online learning algorithm is essential to enhance the adaptability of the sensor fault diagnosis system for aero engine.

Extreme learning machine (ELM), is a high-efficiency learning algorithm for single-hidden layer feedforward neural network (SLFN) [25]. As proven in [25], the hidden-layer parameters of the network can be assigned to random values and then the output weight should be analytically determined according to the pseudo-inverse of the hidden-layer output matrix. It has been shown that ELM has not only classification capacity but also universal approximation capacity [26]. Furthermore, as verified in [27], compared with traditional SVM and neural networks, ELM is able to learn much faster while obtaining similar or better generalization performance. Liang et al. [28] incorporated ELM with an online learning algorithm and proposed online sequential extreme learning machine (OS-ELM). If the training data is produced sequentially with the chunk size being constant or unfixed, OS-ELM performs better considering generalization performance and learning speed than other conventional sequential algorithms on a lot of benchmark problems.

However, in many real online applications, such as sensor fault diagnosis for aero engines, the data used for the training process are not only produced sequentially, but also usually have time-varying validity; in other words, the validity of the data chunk may decay along with the time passed. The overdue training data, whose validity decays as time goes on, should have lower weight than the new incoming training data, which is the idea behind the memory principle. Consequently, a novel online learning algorithm is presented in this paper by combining OS-ELM with the memory principle, referred to as MOS-ELM. On the one hand, the proposed MOS-ELM reserves the sequential advantages of OS-ELM by the sequential learning process. On the other hand, it deals with the property of timeliness well by decaying the validity of the data chunk as time goes on. In the circumstance of tested problems possessing timeliness in various databases and sensor fault diagnoses for aero engines, it turns out that the MOS-ELM algorithm performs better in generalization performance, stability and predictability than the OS-ELM algorithm.

This manuscript is organized as follows. In Section 2, the basic concepts and related works of the OS-ELM algorithm are reviewed briefly. The formula of the MOS-ELM algorithm is derived and the performance evaluation of MOS-ELM on some time series prediction problems and some real benchmark regression problems are given in Section 3. In Section 4, the sensor fault diagnosis method for aero engines and the experiment results are presented in detail. The conclusion is drawn in Section 5.

2. Review of Online Sequential Extreme Learning Machine (OS-ELM)

With the purpose of offering an introduction to the proposed MOS-ELM, a brief review of the primary concepts of OS-ELM is given in this section. Considering distinct input-output samples

(x_{i}, t_{i})

, where

x_{i} = {[x_{i 1}, x_{i 2}, \dots, x_{i n}]}^{T} \in ℜ^{n}

and

t_{i} = {[t_{i 1}, t_{i 2}, \dots, t_{i m}]}^{T} \in ℜ^{m}

, the SLFN model is briefly described in a unified way as:

f_{L} (x) = \sum_{i = 1}^{L} β_{i} G (ω_{i}, b_{i}, x), x \in ℜ^{n}

(1)

where

ω_{i} \in ℜ^{n}

,

b_{i} \in ℜ

and

β_{i} = {[β_{i 1}, β_{i 2}, \dots, β_{i m}]}^{T} \in ℜ^{m}

respectively denote the learning parameters and output weight in regard to the i-th hidden node,

L

represents the hidden nodes number, and

G (ω_{i}, b_{i}, x)

denotes the i-th hidden-layer output in regards to

x

. In the case of the hidden node being an additive function,

G (ω_{i}, b_{i}, x)

can be represented by:

G (ω_{i}, b_{i}, x) = g (ω_{i} \cdot x + b_{i})

(2)

In the case of the hidden node being a radial basis function,

G (ω_{i}, b_{i}, x)

can be represented by:

G (ω_{i}, b_{i}, x) = g (b_{i} ‖ ω_{i} - x ‖) .

(3)

We suppose that there are

N

batch-training samples used for the supervised learning process. For the finite distinct set of training samples

{(x_{i}, t_{i})}_{i = 1}^{N} \subset ℜ^{n} \times ℜ^{m}

, if SLFNs having

L

hidden nodes absolutely approximates the

N

training date, it indicates that

ω_{i}, b_{i}

, and

β_{i}

satisfy the following equation:

\sum_{i = 1}^{L} β_{i} G (ω_{i}, b_{i}, x) = t_{j}, j = 1, 2, \dots, N .

(4)

We can rewrite Equation (4) in a compact way as:

H β = T,

(5)

where,

H (ω_{1}, \dots, ω_{L}, b_{1}, \dots, b_{L}, x_{1}, \dots, x_{N}) = {[\begin{matrix} G (ω_{1}, b_{1}, x_{1}) & \dots & G (ω_{L}, b_{L}, x_{1}) \\ ⋮ & ⋱ & ⋮ \\ G (ω_{1}, b_{1}, x_{N}) & \dots & G (ω_{L}, b_{L}, x_{N}) \end{matrix}]}_{N \times L}

(6)

β = {[\begin{matrix} β_{1}^{T} \\ ⋮ \\ β_{L}^{T} \end{matrix}]}_{L \times m} and T = {[\begin{matrix} t_{1}^{T} \\ ⋮ \\ t_{N}^{T} \end{matrix}]}_{N \times m}

(7)

Here

H

is the hidden-layer output matrix.

Traditionally, for the purpose of training an SLFN, one needs to find specific

ω_{i}, b_{i}, β_{i}, i = 1, \dots, L

, such that

‖ H β - T ‖

takes minimum value. If

H

is unknown, the gradient-based learning algorithms are usually used to iteratively adjust the

ω_{i}, b_{i}, β_{i}

. However, for most applications, the gradient-based method is extremely time consuming and often stops at the local minimum. According to the theory of Huang, the hidden-layer learning parameters

ω_{i}

and

b_{i}

of SLFN can be assigned randomly and simply, and such SLFN with any nonzero activation function is able to universally approximate any continuous functions on any compact input sets [26]. If

L \leq N

, the

H

is of full-column rank with the probability one, and in real-world applications, it is easily satisfied that

L \leq N

. Hence, the output weights

β

can be analytically obtained as the least-squares solutions of Equation (5), yielding:

\hat{β} = H^{†} T

(8)

where

H^{†}

represents the pseudo-inverse of

H

. If

H^{T} H

is nonsingular, the pseudo-inverse can be calculated as

H^{†} = {(H^{T} H)}^{- 1} H^{T}

in several ways, such as iterative approach, and orthogonalization method [29]. Compared with traditional iterative implementations of SLFNs, ELM has similar generalization performance and dramatically increased running speed.

The batch ELM algorithm supposes that all the training samples are available for the learning process. Nevertheless, in many problems, the training sample may come chunk by chunk. The OS-ELM algorithm is proposed to handle the online sequential learning problems. We determinate the hidden output function

G (ω, b, x)

by choosing

g

and

L

and assume that the training data is produced to the learning process in the same or different chunk size. The k-th data chunk can be denoted as

ℵ_{k} = {(x_{i}, t_{i})}_{i = (\sum_{j = 0}^{k - 1} N_{j}) + 1}^{\sum_{j = 0}^{k} N_{j}}

, where

N_{j}

is the number of the j-th data chunk

ℵ_{j}, j = 0, 1, 2, \dots, k

. We use a small data chunk

ℵ_{0} = {(x_{i}, t_{i})}_{i = 1}^{N_{0}}

to carry out the initialization of the learning process, where

N_{0}

is the number of data chunk

ℵ_{0}

which is obtained from the sequential training data

ℵ = {(x_{i}, t_{i}) | x_{i} \in ℜ^{n}, t_{i} \in ℜ^{m}, i = 1, 2, \dots}

, and

N_{0}

is equal or greater than

L

. The values of learning parameters

(ω_{i}, b_{i}), i = 1, 2, \dots, L

are assigned randomly and the initial

H_{0}

may be computed as follows:

H_{0} : = H (ω_{1}, \dots, ω_{L}, b_{1}, \dots, b_{L}, x_{1}, \dots, x_{N_{0}})

(9)

And then, the initial output weight

β (0)

can be computed in accordance with ELM as follows:

β (0) = P_{0} H_{0}^{T} T_{0}

(10)

where

P_{0} = {(H_{0}^{T} H_{0})}^{- 1}

and

T_{0} = {[t_{1}, \dots, t_{N_{0}}]}^{T}

.

For the k-th chunk and its previous chunks, the output matrixes of the hidden layer and output layer are respectively defined as:

H_{k} : = H (ω_{1}, \dots, ω_{L}, b_{1}, \dots, b_{L}, x_{1}, \dots, x_{\sum_{j = 0}^{k} N_{j}}), T_{k} = [\begin{matrix} t_{1}^{T} \\ ⋮ \\ t_{\sum_{j = 0}^{k} N_{j}}^{T} \end{matrix}]

(11)

and then,

H_{k + 1} = [\begin{matrix} H_{k} \\ h (k + 1) \end{matrix}], T_{k + 1} = [\begin{matrix} T_{k} \\ t {(k + 1)}^{T} \end{matrix}]

(12)

Here

h (k + 1)

and

t (k + 1)

are respectively defined as:

h (k + 1) : = H (ω_{1}, \dots, ω_{L}, b_{1}, \dots, b_{L}, x_{(\sum_{j = 0}^{k} N_{j}) + 1}, \dots, x_{\sum_{j = 0}^{k + 1} N_{j}})

(13)

t (k + 1) : = {[t_{(\sum_{j = 0}^{k} N_{j}) + 1}, \dots, t_{\sum_{j = 0}^{k + 1} N_{j}}]}^{T}

(14)

Then the estimated values corresponding to the (k + 1)-th data chunk is the least squares solution of

H_{k + 1} β = T_{k + 1}

and it can be computed iteratively as follows:

β (k + 1) = β (k) + P_{k + 1} h {(k + 1)}^{T} (t {(k + 1)}^{T} - h (k + 1) β (k)),

(15)

P_{k + 1} = P_{k} - P_{k} h {(k + 1)}^{T} {(I + h (k + 1) P_{k} h {(k + 1)}^{T})}^{- 1} h (k + 1) P_{k} .

(16)

OS-ELM is composed of an initialization phase and sequential learning phase and needs not retain all the historic data. In the initialization phase,

H_{0}

,

β (0)

,

P_{0}

and

T_{0}

are initialized for use in the sequential learning phase. The samples number of the initialization chunk ought to be equal or greater than the hidden nodes number. In the sequential learning phase, the sequential training date is commenced iteratively. Once the learning procedure on the latest coming data chunk is completed, the historical data can be discarded and is no longer used. From the derivation of OS-ELM, it is easy to conclude that OS-ELM and ELM have similar generalization performance. In fact, ELM algorithm is a specific case of OS-ELM algorithm when all the training samples are used in the initialization phase.

Algorithm OS-ELM: Given hidden nodes number

L

and activation function

g : ℜ \to ℜ

(sigmoid or other function), we can summarize OS-ELM algorithm as the following steps.

(1): Assign learning parameters $(ω_{i}, b_{i}), i = 1, 2, \dots, L$ randomly, and set $k = 0$ ;
(2): Compute $H_{0}$ and $β (0)$ ;
(3): Compute $h (k + 1)$ corresponding to (k + 1)-th data chunk as Equation (13);
(4): Calculate $β (k + 1)$ iteratively as Equations (15) and (16);
(5): If the new data chunk comes, then let $k = k + 1$ and skip to the third step. Otherwise, let $β$ to be the last iteration value $β (k + 1)$ ;
(6): Compute $f_{L} (x) = \sum_{i = 1}^{L} β_{i} G (ω_{i}, b_{i}, x)$ .

3. Proposed Online Sequential Extreme Learning Machine with Memory Principle (MOS-ELM)

In lots of real cases, the sequentially produced training data usually has time variation, that is, the validity of an outdated chunk may decay as time goes on. For instance, in sensor fault diagnosis for an aero engine, since many factors that affect the measurements of the aero engine are usually time-varying, the validity of the previous training process should decay gradually. Hence, the overdue data chunk, whose validity is decaying with time, should have lower weight than the incoming data chunk in the subsequent learning process, which is the idea behind the memory principle. We can easily find that the timeliness of the training data cannot be handled well just with OS-ELM, and the overdue training data may mislead the subsequent learning process. In this section, we introduce the concept of memory principle into OS-ELM to gradually sink the overdue training data into oblivion and name this novel algorithm MOS-ELM.

3.1. Formula Derivation

Assume that the decay rate of each data chunk is

ρ

, where

0 < ρ < 1

, then

β^{'} (k + 1)

; the output matrix in regards to (k + 1)-th data chunk can be solved from the following equation in the sense of least squares,

[\begin{matrix} ρ^{k + 1} H_{0} \\ ρ^{k} h (1) \\ ⋮ \\ h (k + 1) \end{matrix}] β^{'} = [\begin{matrix} ρ^{k + 1} T_{0} \\ ρ^{k} t {(1)}^{T} \\ ⋮ \\ t {(k + 1)}^{T} \end{matrix}]

(17)

Let

H_{k + 1}^{'} : = [\begin{matrix} ρ H_{k}^{'} \\ h (k + 1) \end{matrix}]

,

H_{0}^{'} : = H_{0}

,

T_{k + 1}^{'} : = [\begin{matrix} ρ T_{k}^{'} \\ t {(k + 1)}^{T} \end{matrix}]

,

T_{0}^{'} : = T_{0}

, we can compactly describe Equation (17) as:

H_{k + 1}^{'} β^{'} = T_{k + 1}^{'} .

(18)

Theorem 1.

The least squares solution of Equation (18) can be computed iteratively as follows:

β^{'} (k + 1) = β^{'} (k) + K_{k + 1} (t (k + 1) - h (k + 1) β^{'} (k))

(19)

P_{k + 1}^{'} = \frac{1}{ρ^{2}} (I - K_{k + 1} h (k + 1)) P_{k}^{'}

(20)

K_{k + 1} = P_{k}^{'} h (k + 1) {(ρ^{2} I + h (k + 1) P_{k}^{'} h {(k + 1)}^{T})}^{- 1}

(21)

where

P_{k}^{'} : = {(H_{k}^{' T} H_{k}^{'})}^{- 1}

,

β^{'} (k) = P_{k}^{'} H_{k}^{' T} T_{k}^{'}

.

Proof.

From the definition of

P_{k}^{'}

, we can easily find that:

P_{k + 1}^{'} = {({[\begin{matrix} ρ H_{k}^{'} \\ h (k + 1) \end{matrix}]}^{T} [\begin{matrix} ρ H_{k}^{'} \\ h (k + 1) \end{matrix}])}^{- 1} = {(ρ^{2} H_{k}^{' T} H_{k}^{'} + h {(k + 1)}^{T} h (k + 1))}^{- 1}

(22)

According to the Sherman-Morrison-Woodbury formula [30], Equation (22) can be written as:

\begin{array}{l} P_{k + 1}^{'} = & \frac{P_{k}^{'}}{ρ^{2}} - \frac{P_{k}^{'}}{ρ^{2}} h {(k + 1)}^{T} {(I + h (k + 1) \frac{P_{k}^{'}}{ρ^{2}} h {(k + 1)}^{T})}^{- 1} h (k + 1) \frac{P_{k}^{'}}{ρ^{2}} \\ = \frac{1}{ρ^{2}} (I - P_{k}^{'} h {(k + 1)}^{T} {(ρ^{2} I + h (k + 1) P_{k}^{'} h {(k + 1)}^{T})}^{- 1} h (k + 1)) P_{k}^{'} \\ = \frac{1}{ρ^{2}} (I - K_{k + 1} h (k + 1)) P_{k}^{'} \end{array}

(23)

Substitute Equation (23) and the definitions of

H_{k + 1}^{'}

and

T_{k + 1}^{'}

into

β^{'} (k + 1) = P_{k + 1}^{'} H_{k + 1}^{' T} T_{k + 1}^{'}

, and then the output matrix at (k + 1)-th unit time can be determined by:

\begin{array}{l} β^{'} (k + 1) & = \frac{1}{ρ^{2}} (I - K_{k + 1} h (k + 1)) P_{k}^{'} {[\begin{matrix} ρ H_{k}^{'} \\ h (k + 1) \end{matrix}]}^{T} [\begin{matrix} ρ T_{k}^{'} \\ t {(k + 1)}^{T} \end{matrix}] \\ = \frac{1}{ρ^{2}} (I - K_{k + 1} h (k + 1)) P_{k}^{'} (ρ^{2} H_{k}^{' T} T_{k}^{'} + h {(k + 1)}^{T} t {(k + 1)}^{T}) \\ = P_{k}^{'} H_{k}^{' T} T_{k}^{'} + \frac{1}{ρ^{2}} P_{k}^{'} h {(k + 1)}^{T} t {(k + 1)}^{T} - K_{k + 1} h (k + 1) P_{k}^{'} H_{k}^{' T} T_{k}^{'} \\ - \frac{1}{ρ^{2}} K_{k + 1} h (k + 1) P_{k}^{'} h {(k + 1)}^{T} t {(k + 1)}^{T} \\ = β^{'} (k) - K_{k + 1} h (k + 1) β^{'} (k) \\ + \frac{1}{ρ^{2}} (P_{k}^{'} h {(k + 1)}^{T} - K_{k + 1} h (k + 1) P_{k}^{'} h {(k + 1)}^{T}) t {(k + 1)}^{T} \end{array}

(24)

The

P_{k}^{'} h {(k + 1)}^{T} - K_{k + 1} h (k + 1) P_{k}^{'} h {(k + 1)}^{T}

in Equation (24) can be simplified as:

\begin{array}{l} P_{k}^{'} h {(k + 1)}^{T} - K_{k + 1} h (k + 1) P_{k}^{'} h {(k + 1)}^{T} \\ = P_{k}^{'} h {(k + 1)}^{T} - K_{k + 1} h (k + 1) P_{k}^{'} h {(k + 1)}^{T} - ρ^{2} K_{k + 1} + ρ^{2} K_{k + 1} \\ = P_{k}^{'} h {(k + 1)}^{T} - K_{k + 1} (ρ^{2} I + h (k + 1) P_{k}^{'} h {(k + 1)}^{T}) + ρ^{2} K_{k + 1} \\ = ρ^{2} K_{k + 1} \end{array}

(25)

Substitute Equation (25) into Equation (24), and then the output matrix at (k + 1)-th unit time can be described compactly as:

β^{'} (k + 1) = β^{'} (k) + K_{k + 1} (t {(k + 1)}^{T} - h (k + 1) β^{'} (k)) .

(26)

☐

Proposed Algorithm MOS-ELM: Given hidden nodes number

L

and activation function

g : ℜ \to ℜ

(sigmoid or other function), we can outline the MOS-ELM algorithm with the following steps.

(1): Assign learning parameters $(ω_{i}, b_{i}), i = 1, 2, \dots, L$ randomly, and set $k = 0$ ;
(2): Compute $H_{0}$ and $β (0)$ as Equations (9) and (10);
(3): Compute $h (k + 1)$ corresponding to (k + 1)-th chunk as Equation (13);
(4): Calculate $β^{'} (k + 1)$ iteratively as Equations (19)–(21);
(5): If the new data chunk comes, then let $k = k + 1$ and skip to the third step. Otherwise, let $β^{'}$ be the last iteration value $β^{'} (k + 1)$ ;
(6): Compute $f_{L} (x) = \sum_{i = 1}^{L} β_{i}^{'} G (ω_{i}, b_{i}, x)$ .

Remark 1.

MOS-ELM is actually an OS-ELM with memory principle. As a newly incoming data chunk is presented to predict the datum of the next chunk, there is no need to repeat the process of ELM. Otherwise, the complex matrix computation would be dealt with as in Equations (9) and (10) and the known information which was learned before would be wasted. Only the chunk of training data which is newly arriving and the known information which was learned before are used to carry out the matrix computations for MOS-ELM, while all the training data are used to implement the matrix computations for ELM. Hence, for the sequential prediction problems, the training process by MOS-ELM is much better than ELM’s.

Remark 2.

In the learning process by MOS-ELM, since the validity of each data chunk decays with time, the SLFN will be trained as soon as the new training data arrives at the next unit time and the validity of the overdue data chunk is reduced. Therefore, the learning process can deal with the timeliness well.

Remark 3.

If each chunk of training data does not have the property of timeliness, that is,

ρ = 1

, and it is obvious that MOS-ELM is exactly the same as OS-ELM, it is implied that the OS-ELM algorithm is a specific case of MOS-ELM algorithm.

3.2. Evaluation Test

In this section, we make a comparison between MOS-ELM and OS-ELM on some time series prediction problems and some real benchmark regression problems. The time series prediction problems considered in this subsection include the Mackey-Glass series, Logistic chaotic series and Sunspot series. The Mackey-Glass series is produced by means of the differential equation described as follows [31]:

\frac{d x^{m g} (t)}{d t} = \frac{a (t - τ)}{1 + x^{m g} (t - τ)} - b x^{m g} (t)

(27)

where

τ = 17

,

a \in [0.2, 0.22]

,

b \in [0.1, 0.12]

and

x (0) = 1.2

, and the time series

{x_{k}^{m g} | k = 1, 2, 3, \dots}

are generated according to the Runge-Kutta method. The Logistic chaotic time series

{x_{k}^{l o} | k = 1, 2, 3, \dots}

is described according to the recursive equation as follows [32]:

x_{k + 1}^{l o} = λ x_{k}^{l o} (1 - x_{k}^{l o})

(28)

where

λ \in [3.5, 4]

. The Sunspot time series is monthly mean total sunspot number from January 1749 to December 2015 and is obtained from [33]. The benchmark regression databases considered here involve Auto-MPG, which has 338 training and 168 testing data, and Housing, with 338 training and 168 testing data. The software environment for all simulations is MATLAB 7.11 (MathWorks, Natick, MA, USA) and the hardware environment is a general PC with frequency 2.5 GHz frequency. A usual sigmoid function

g (x) = 1 / (1 + \exp (- x))

is used to be the activation function in all simulations and the chunk size is set as 10. There were 50 trials for each database carried out and the average results are illustrated by Table 1.

As observed from Table 1, the training time for MOS-ELM is close to that for OS-ELM in various database just as we expected. MOS-ELM has a lower standard deviation, implying superior stability. Owing to the memory principle, MOS-ELM performs better generalization performance than OS-ELM in the timeliness databases such as Sunspot, Mackey-Glass and Logistic. In the timeless databases such as Auto-MPG and Housing, MOS-ELM algorithm and OS-ELM algorithm have the close generalization performance. The excellent generalization performance and stability in timeliness databases have created good conditions for the use of MOS-ELM in the sensor fault diagnosis with the characteristic of timeliness.

4. Sensor Fault Diagnosis for Aero Engines

As sensors have shortcomings of easy fault, FDI of a sensor system plays a very important part in ensuring the reliability of an aero engine control system. If the failure of sensors takes place, the safety of the aero engine would be seriously affected. Accurate sensor fault diagnosis with fast response is essential to enhance the reliability and safety of aero engines.

4.1. Diagnosis Method

Figure 1 illustrates the structure of the fault diagnosis and reconstruction system composed of the prediction module and fault diagnosis logic. The vector of measurements

y_{k} = {[N_{L}, N_{H}, T_{22}, P_{22}, T_{3}]}_{k}^{T}

consists of the low pressure rotor speed

N_{L}

, the high pressure rotor speed

N_{H}

, the fan discharge temperature

T_{22}

, the fan discharge pressure

P_{22}

, and the compressor discharge temperature

T_{3}

. The input vector

u_{k} = {[W_{f b}, A_{8}]}_{k}^{T}

is composed of the fuel flow

W_{f b}

and the area of nozzle throat area

A_{8}

[34]. And

t_{k}

denotes the prediction of the measurement vector

y_{k}

.

Figure 2 illustrates the prediction module of the fault diagnosis system according to the proposed MOS-ELM algorithm. Each measurement is predicted by an independent MOS-ELM respectively and is able to be mathematically expressed as:

\begin{array}{l} t_{k}^{i} = f_{i} (x_{k}), i = 1, 2, \dots, 5 \\ x_{k} = {[{\bar{y}}_{k - p}^{i}^{T}, \dots, {\bar{y}}_{k - 1}^{i}^{T}, u_{k - p}^{T}, \dots, u_{k}^{T}]}^{T} \end{array}

(29)

where

t_{k}^{i}

denotes the i-th element of

t_{k}

,

f_{i} (\cdot)

represents the i-th MOS-ELM,

{\bar{y}}_{k - p}^{i}

is created through removing i-th measurement

y_{k - p}^{i}

form

y_{k - p}

, and

p

denotes the embedding dimension for the prediction process.

The prediction of the measurement

t_{k}

is used as an analytical channel for the diagnosis logic in Figure 1. If the discrepancy among the analytical channel

t_{k}

and the measured channel

y_{k}

exceeds a tolerance level, the fault diagnosis logic is able to determine the cause of the difference. For each measured parameter, the sensor fault indicator is introduced as the comparison of the analytical channel against the measured channel, and it is defined as follows:

r_{k}^{i} = | t_{k}^{i} - y_{k}^{i} |

(30)

where the analytical residual

r_{k}^{i}

is the absolute difference between

t_{k}^{i}

and

y_{k}^{i}

. Two typical kinds of sensor faults, drift fault and bias fault, are considered in this paper, and the thresholds for drift fault and bias fault are defined as

D C

and

F C

respectively. The analytical residual computed for each sensor is compared against the thresholds

D C

and

F C

, and the detection logic can determine the fault level. If an analytical residual exceeds the bias fault threshold, it implies the existence of a bias fault. If an analytical residual exceeds the drift threshold and does not exceed the drift fault, it implies the existence of a drift fault.

If the drift fault or bias fault occurs, a correction strategy is applied to reconstruct the measurement and isolate the fault sensor. The correction trick is able to be described as the following equation:

{\tilde{y}}_{k}^{i} = y_{k}^{i} + (t_{k}^{i} - y_{k}^{i}) {(\min (1, \frac{r_{k}^{i} - D C}{F C - D C}))}^{1 / m}

(31)

where

{\tilde{y}}_{k}^{i}

denotes the i-th reconstruction value of

y_{k}

and

m > 1

is a correction factor. If the bias fault is detected, the measured value

y_{k}^{i}

does not contain any effective information. Then the reconstruction value

{\tilde{y}}_{k}^{i}

is completely determined by the prediction value

t_{k}^{i}

, and the fault measured value

y_{k}^{i}

is isolated. In addition, if the drift fault is detected, we use the above correction strategy to properly utilize the information of the measurement and the prediction.

4.2. Diagnosis System for $N_{L}$ Sensor

In this subsection, we use a double-shaft turbofan component level model to be the research object and the measurement noise considered here is Gaussian with the standard deviation being 0.30% [35]. At

H = 0 km

and

M a = 0

, the acceleration and deceleration of an aero engine is simulated with the throttle lever angle in the interval

30^{\circ} - 70^{\circ}

. Because the main dynamic characteristics of the aero engine can be considered to be a second-order element, the embedding dimension is set as

p = 2

[36]. In order to avoid affecting the weight in the diagnosis system, the measurements acquired from sensors are normalized into

[- 1, 1]

. The thresholds

D C

and

F C

are determined according to the compromise between false alarm rate and corrected detection rate. We select the thresholds from the following domain:

\begin{array}{l} D C \in {x | x = 0.013 + 0.0005 k, k = 1, 2, 3, \dots, 8} \\ F C \in {x | x = 0.023 + 0.0005 k, k = 1, 2, 3, \dots, 8} \end{array}

(32)

The best combination of

D C

and

F C

is selected manually considering the false alarm rate and corrected detection rate. For the OS-ELM case, the thresholds are selected as

D C = 0.0145

and

F C = 0.0255

; and for the MOS-ELM case, the thresholds are selected as

D C = 0.0140

and

F C = 0.0245

; and the correction factor is set as

m = 2

. The magnitude of bias fault and drift simulated in this section is 3% and 0%–4%, respectively.

The diagnosis results based on OS-ELM and MOS-ELM algorithms for the

N_{L}

sensor are illustrated in Figure 3 and Figure 4, respectively. The fault level of Figure 3b and Figure 4b is defined as follows: 0, no fault; 1, drift fault; 2, bias fault. The drift fault takes place during the interval

5 - 9 s

, and the bias fault comes up during the interval

34 - 38 s

. We can easily find that the reconstruction value by MOS-ELM is more accurate than that by OS-ELM, and the lower accuracy of OS-ELM tends to lead to a false alarm. With the prediction value being the analytical redundancy, the reconstruction value

{\tilde{y}}_{k}^{i}

can effectively approximate the real value, ensuring the approximate validity of sensor signals even if the bias or drift fault happens. Hence, the right commands are able to be produced in accordance with the control law, and then the reliability and safety of aero engines is enhanced. Figure 5 illustrates the prediction bias of

N_{L}

sensor by OS-ELM and MOS-ELM. It is obvious that MOS-ELM tends to generate more accurate predicted bias than OS-ELM, owing to handling the timeliness properly.

4.3. Statistical Performance for Different Fault Mode

Five kinds of single fault modes,

{N_{L}}

,

{N_{H}}

,

{T_{22}}

,

{P_{22}}

and

{T_{3}}

, are considered in this subsection. The single fault mode

{N_{L}}

denotes that

N_{L}

breaks down alone, and the other four single fault modes follow suit. In addition, the case that more than one sensor are likely to break down at the same time is not overlooked. Two kinds of dual fault modes,

{N_{L}, P_{22}}

and

{N_{H}, T_{3}}

, are considered here. The dual fault mode

{N_{L}, P_{22}}

denotes that

N_{L}

and

P_{22}

break down at the same time, and

{N_{H}, T_{3}}

represents that

N_{H}

and

T_{3}

break down simultaneously. For the purpose of obtaining robust statistical results, 20 different trials are carried out for each instance. Furthermore, in order to measure the learning performance, the root-mean-square error (RMSE) is defined as follows [37]:

RMSE = \sqrt{\frac{\sum_{i = 1}^{5} \sum_{k = 1}^{# Testing} {(t_{k}^{i} - y_{k}^{i})}^{2}}{# Testing \times 5}}

(33)

where #Testing denotes the testing samples number. In general, a smaller RMSE implies a better predicting accuracy for a learning algorithm. The average prediction RMSE for drift sensor fault and bias sensor fault is given in Table 2. It is obvious that the proposed MOS-ELM has lower prediction RMSE and superior generalization performance than OS-ELM in sensor fault diagnosis for aero engines, in both bias fault and drift fault cases, which can be attributed to properly tackling the timeliness of the sensor fault diagnosis system. Just as is derived by theory, the proposed MOS-ELM has no advantage in the aspect of the training time. For each instance, the training time for five learning machines is less than 4 s through the proposed MOS-ELM algorithm or OS-ELM algorithm, which corresponds to 40 s simulation duration. Thus, the real-time performance demand for an aero engine control system is completely met.

In addition, in order to measure the performance of the sensor fault diagnosis system, two performance indexes—the correct detection rate and the false alarm rate—is considered here. The average correct detection rate for drift sensor fault and bias sensor fault is given in Table 3. Figure 6a,b illustrates the false alarm rate for drift sensor fault and bias sensor fault respectively. As observed from Table 3 and Figure 6, we can easily find that the MOS-ELM algorithm has a higher corrected detection rate and lower false alarm rate than OS-ELM algorithm. As a result of the coupling among the different sensors, the dual fault mode

{N_{L}, P_{22}}

cannot be detected by the OS-ELM algorithm, while it can be detected well by the MOS-ELM algorithm.

5. Conclusions

In many real online learning applications, the sequentially arrived data usually has the characteristic of timeliness. The OS-ELM trains the neural network chunk by chunk, but at the same time, it cannot deal with the timeliness of the data chunk. Based on the OS-ELM algorithm, we propose a novel algorithm, MOS-ELM, which introduces the concept of memory principle into OS-ELM to improve the learning process by declining the validity of the outdated data chunk which may mislead the subsequent learning process. Thus MOS-ELM is able to learn sequentially as does the OS-ELM algorithm, but at the same time deals with the timeliness of data chunk properly.

Compared with OS-ELM, simulations on benchmark databases exhibit that MOS-ELM performs better in generalization performance, stability, and prediction accuracy while the tested problems possess the characteristic of timeliness. On this basis, MOS-ELM is employed in detecting, isolating, and reconstructing the fault sensor signal of aero engines. The experiment results show that MOS-ELM has better predictability and generalization performance than OS-ELM in diagnosing a sensor fault. Furthermore, the feasibility and effectiveness of the MOS-ELM-based sensor fault diagnosis system imply that the diagnosis system is an approach with great promise for enhancing the reliability and safety of the aero engine control system.

Acknowledgments

This research was funded by the National Natural Science Foundation of China (under Grant 51276087).

Author Contributions

Jinquan Huang designed the main idea, Feng Lu carried out the simulations, and Junjie Lu interpreted the results and wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Litt, J.S.; Simon, D.L.; Garg, S.; Guo, T.H.; Mercer, C.; Millar, R.; Behbahani, A.; Bajwa, A.; Jensen, D.T. A survey of intelligent control and health management technologies for aircraft propulsion systems. J. Aerosp. Comput. Inf. Commun. 2005, 1, 543–563. [Google Scholar] [CrossRef]
Jaw, L. Recent advancements in aircraft engine health management (EHM) technologies and recommendations for the next step. In Proceedings of the ASME Turbo Expo 2005: Power for Land, Sea and Air, Reno, NV, USA, 6–9 June 2005.
Patton, R.J.; Chen, J. Robust fault detection of jet engine sensor systems using eigenstructure assignment. J. Guid. Control Dyn. 1992, 15, 1491–1497. [Google Scholar] [CrossRef]
Hwang, I.; Kim, S.; Kim, Y.; Seah, C.E. A survey of fault detection, isolation, and reconfiguration methods. IEEE Trans. Control Syst. Technol. 2010, 18, 636–653. [Google Scholar] [CrossRef]
Chow, E.Y.; Willsky, A.S. Analytical redundancy and the design of robust failure detection systems. IEEE Trans. Autom. Control 1984, 29, 603–614. [Google Scholar] [CrossRef]
Willsky, A.S. A survey of design methods for failure detection in dynamic systems. Automatica 1976, 12, 601–611. [Google Scholar] [CrossRef]
Wallhagen, R.E.; Arpasi, D.J. Self-Teaching Digital-Computer Program for Fail-Operational Control of a Turbojet Engine in a Sea-Level Test Stand; Report No. NASA/TM-X-3043; National Aeronautics and Space Administration, Glenn Research Center: Washington, DC, USA, 1974.
Bras, S.; Rosa, P.; Silvestre, C.; Oliveira, P. Fault detection and isolation in inertial measurement units based on bounding sets. IEEE Trans. Autom. Control 2015, 60, 1933–1938. [Google Scholar] [CrossRef]
Lu, F.; Chen, Y.; Huang, J.Q.; Zhang, D.; Liu, N. An integrated nonlinear model-based approach to gas turbine engine sensor fault diagnostics. Proc. Inst. Mech. Eng. G J. Aerosp. 2014, 228, 2007–2021. [Google Scholar] [CrossRef]
Bahareh, P.; Meskin, N.; Khorasani, K. Sensor fault detection, isolation, and identification using multiple-model-based hybrid Kalman filter for gas turbine engines. IEEE Trans. Control Syst. Technol. 2016, 24, 1184–1190. [Google Scholar]
Escobar, R.F.; Astorga-Zaragoza, C.M.; Hernandez, J.A.; Juarez-Romero, D.; Garcia-Beltran, C.D. Sensor fault compensation via software sensors: Application in a heat pump’s helical evaporator. Chem. Eng. Res. Des. 2015, 93, 473–482. [Google Scholar] [CrossRef]
Mattern, D.L.; Jaw, L.C.; Guo, T.H.; Graham, R.; McCoy, W. Using neural networks for sensor validation. In Proceedings of the 35th AIAA/ASME/SAE/ASEE Joint Propulsion Conference, Cleveland, OH, USA, 12–15 July 1998.
Sadough-Vanini, Z.N.; Meskin, N.; Khorasani, K. Multiple-model sensor and components fault diagnosis in gas turbine engines using autoassociative neural networks. J. Eng. Gas Turbines Power 2014, 136, 76–82. [Google Scholar] [CrossRef]
Torella, G.; Torella, R. Probabilistic expert systems for the diagnostics and trouble-shooting of gas turbine apparatuses. In Proceedings of the 35th AIAA/ASME/SAE/ASEE Joint Propulsion Conference, Los Angeles, CA, USA, 20–24 June 1999.
Garg, S.; Schadow, K.; Horn, W.; Pfoertner, H.; Stiharu, I. Sensor and actuator needs for more intelligent gas turbine engines. In Proceedings of the GT2005 ASME Turbo Expo 2010: Power for Land, Sea and Air, Glasgow, UK, 14–18 June 2010.
Bahareh, P.; Meskin, N.; Khorasani, K. Sensor fault detection and isolation using multiple robust filters for linear systems with time-varying parameter uncertainty and error variance constraints. In Proceedings of the 2014 IEEE Multi-Conference on Systems and Control, Antibes, France, 8–10 October 2014.
Kobayashi, T.; Simon, D.L. Evaluation of an enhanced bank of Kalman filters for in-flight aircraft engine sensor fault diagnostics. J. Eng. Gas Turbines Power 2014, 127, 635–645. [Google Scholar]
Saravanakumar, R.; Manimozhi, M.; Kothari, D.P.; Tejenosh, M. Simulation of sensor fault diagnosis for wind turbine generators DFIG and PMSM using Kalman filter. Energy Procedia 2014, 54, 494–505. [Google Scholar] [CrossRef]
Sadough-Vanini, Z.N.; Khorasani, K.; Meskin, N. Fault detection and isolation of a dual spool gas turbine engine using dynamic neural networks and multiple model approach. Inf. Sci. 2014, 259, 234–251. [Google Scholar] [CrossRef]
Tayarani-Bathaie, S.S.; Khorasani, K. Fault detection and isolation of gas turbine engines using a bank of neural networks. J. Process Control 2015, 36, 22–41. [Google Scholar] [CrossRef]
Shah, B.; Sarvajith, M.; Sankar, B.; Vijayendranath, V. Fault identification, isolation, estimation of sensor measurement using auto associative neural network for aero-engine. In Proceedings of the National Conference on Condition Monitoring, Chennai, India, 12–13 December 2014.
Ogaji, S.O.T.; Singh, R.; Probert, S.D. Multiple-sensor fault-diagnoses for a 2-shaft stationary gas-turbine. Appl. Energy 2002, 71, 321–339. [Google Scholar] [CrossRef]
Xu, L.; Cai, T.; Deng, F. Sensor fault diagnosis based on least squares support vector machine online prediction. In Proceedings of the 2011 IEEE 5th International Conference on Robotics, Automation and Mechatronics (RAM), Qingdao, China, 17–19 September 2011.
Zhang, S.; Li, Y. Simulation research on FLS_SVM in sensor fault diagnosis. In Proceedings of the 2011 International Conference on Electronic & Mechanical Engineering and Information Technology, Harbin, China, 12–14 August 2011.
Huang, G.B.; Zhu, Q.Y.; Siew, C.K. Extreme learning machine: A new learning scheme of feedforward neural networks. In Proceedings of the International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 25–29 July 2004.
Huang, G.B.; Chen, L.; Siew, C.K. Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Trans. Neural Netw. 2006, 17, 879–892. [Google Scholar] [CrossRef] [PubMed]
Huang, G.B.; Zhou, H.M.; Ding, X.J.; Zhang, R. Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. B 2012, 42, 513–529. [Google Scholar] [CrossRef] [PubMed]
Liang, N.Y.; Huang, G.B.; Saratchandran, P.; Sundararajan, N. A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans. Neural Netw. 2006, 17, 1411–1423. [Google Scholar] [CrossRef] [PubMed]
Wang, N.; Er, M.J.; Han, M. Parsimonious extreme learning machine using recursive orthogonal least squares. IEEE Trans. Neural Netw. Learn. 2014, 25, 1828–1841. [Google Scholar] [CrossRef] [PubMed]
Deng, C.Y. A generalization of the Sherman-Morrison-Woodbury formula. Appl. Math. Lett. 2011, 24, 1561–1564. [Google Scholar] [CrossRef]
EI-Sayed, A.M.; Salman, S.M.; Elabd, N.A. On a fractional-order delay Mackey-Glass equation. Adv. Differ. Equ. 2016, 2016, 1–11. [Google Scholar]
Berezowski, M.; Grabski, A. Chaotic and non-chaotic mixed oscillations in a logistic systems with delay. Chaos Solitons Fractals 2002, 14, 97–103. [Google Scholar] [CrossRef]
Sunspot Index and Long-Term Solar Observations. Available online: http://www.sidc.be/silso/DATA/SN_m_tot_V2.0.txt (accessed on 10 June 2016).
Lu, F.; Huang, J.Q.; Xing, Y.D. Fault Diagnostics for Turbo-Shaft Engine Sensors Based on a Simplified On-Board Model. Sensors 2012, 12, 11061–11076. [Google Scholar] [CrossRef] [PubMed]
Kobayashi, T.; Simon, D.; Litt, J.S. Application of a constant gain extended Kalman filter for in-flight estimation of aircraft engine performance parameters. In Proceedings of the GT2005 ASME Turbo Expo 2005: Power for Land, Sea and Air, Reno, NV, USA, 6–9 June 2005.
Zhou, J.; Liu, Y.; Zhang, T.H. Analytical redundancy design for aeroengine sensor fault diagnostics based on SROS-ELM. Math. Probl. Eng. 2016, 2016, 8153282. [Google Scholar] [CrossRef]
Azad, A.K.; Rasul, M.G.; Yusaf, T. Statistical diagnosis of the best weibull methods for wind power assessment for agricultural applications. Energies 2014, 7, 3056–3085. [Google Scholar] [CrossRef]

Figure 1. Structure of sensor fault diagnosis and reconstruction system for aero engines.

Figure 2. Diagram of the prediction module of the fault diagnosis system according to proposed online sequential extreme learning machine with memory principle (MOS-ELM) algorithm.

Figure 3. (a) Reconstruction of

N_{L}

sensor by OS-ELM; (b) Detected fault level of

N_{L}

sensor by OS-ELM.

Figure 3. (a) Reconstruction of

N_{L}

sensor by OS-ELM; (b) Detected fault level of

N_{L}

sensor by OS-ELM.

Figure 4. (a) Reconstruction of

N_{L}

sensor by MOS-ELM; (b) Detected fault level of

N_{L}

sensor by MOS-ELM.

Figure 4. (a) Reconstruction of

N_{L}

sensor by MOS-ELM; (b) Detected fault level of

N_{L}

sensor by MOS-ELM.

Figure 5. Predicted bias of

N_{L}

sensor by OS-ELM and MOS-ELM.

Figure 5. Predicted bias of

N_{L}

sensor by OS-ELM and MOS-ELM.

Figure 6. (a) Comparison of false alarm rate for drift fault; (b) Comparison of false alarm rate for bias fault. Note: the fault mode codes

a, b, c, d, e, f, g

correspond to

{N_{L}}

,

{N_{H}}

,

{T_{22}}

,

{P_{22}}

,

{T_{3}}

,

{N_{L}, P_{22}}

and

{N_{H}, T_{3}}

, respectively.

Figure 6. (a) Comparison of false alarm rate for drift fault; (b) Comparison of false alarm rate for bias fault. Note: the fault mode codes

a, b, c, d, e, f, g

correspond to

{N_{L}}

,

{N_{H}}

,

{T_{22}}

,

{P_{22}}

,

{T_{3}}

,

{N_{L}, P_{22}}

and

{N_{H}, T_{3}}

, respectively.

Table 1. Performance comparison between MOS-ELM and OS-ELM on benchmark databases.

**Table 1.** Performance comparison between MOS-ELM and OS-ELM on benchmark databases.
Databases	Algorithm	RMSE	SD	#Hidden Nodes	Training Time (s)
Sunspot	MOS-ELM ( $ρ^{2} = 0.99$ )	0.0839	0.0005	20	0.0409
Sunspot	OS-ELM	0.0863	0.0005	20	0.0399
Mackey-Glass	MOS-ELM ( $ρ^{2} = 0.99$ )	0.0120	0.0014	20	0.1298
Mackey-Glass	OS-ELM	0.0172	0.0024	20	0.1363
Logistic	MOS-ELM ( $ρ^{2} = 0.995$ )	0.0595	0.0303	40	0.0839
Logistic	OS-ELM	0.0767	0.0351	40	0.0883
Auto-MPG	MOS-ELM ( $ρ^{2} = 0.99$ )	0.0649	0.0013	25	0.0087
Auto-MPG	OS-ELM	0.0651	0.0013	25	0.0094
Housing	MOS-ELM ( $ρ^{2} = 0.99$ )	0.0992	0.0044	30	0.0081
Housing	OS-ELM	0.0998	0.0056	30	00075

Notes: RMSE: the root-mean-square-error; SD: standard deviation.

Table 2. Comparison of prediction RMSE via OS-ELM and MOS-ELM.

**Table 2.** Comparison of prediction RMSE via OS-ELM and MOS-ELM.
Fault Mode	Algorithms	Drift Fault			Bias Fault
Fault Mode	Algorithms	RMSE	SD	Training Time (s)	RMSE	SD	Training Time (s)
${N_{L}}$	OS-ELM	0.0293	0.0001	3.35	0.0284	0.0007	3.33
${N_{L}}$	MOS-ELM	0.0183	0.0008	3.32	0.0177	0.0010	3.27
${N_{H}}$	OS-ELM	0.0508	0.0067	3.34	0.0558	0.0078	3.28
${N_{H}}$	MOS-ELM	0.0221	0.0032	3.31	0.0261	0.0056	3.27
${T_{22}}$	OS-ELM	0.0432	0.0029	3.35	0.0456	0.0042	3.27
${T_{22}}$	MOS-ELM	0.0267	0.0033	3.32	0.0259	0.0027	3.32
${P_{22}}$	OS-ELM	0.0405	0.0041	3.35	0.0315	0.0016	3.30
${P_{22}}$	MOS-ELM	0.0347	0.0070	3.34	0.0217	0.0019	3.27
${T_{3}}$	OS-ELM	0.0471	0.0072	3.29	0.0464	0.0049	3.27
${T_{3}}$	MOS-ELM	0.0293	0.0042	3.36	0.0271	0.0045	3.31
${N_{L}, P_{22}}$	OS-ELM	0.0384	0.0029	3.33	0.0319	0.0014	3.31
${N_{L}, P_{22}}$	MOS-ELM	0.0367	0.0046	3.32	0.0239	0.0020	3.32
${N_{H}, T_{3}}$	OS-ELM	0.0696	0.0114	3.33	0.0800	0.0126	3.38
${N_{H}, T_{3}}$	MOS-ELM	0.0335	0.0063	3.36	0.0347	0.0050	3.30

Notes: RMSE: the root-mean-square-error; SD: standard deviation.

Table 3. Comparison of correct detection rate via OS-ELM and MOS-ELM.

**Table 3.** Comparison of correct detection rate via OS-ELM and MOS-ELM.
Fault Mode	Algorithms	Drift Fault		Bias Fault
Fault Mode	Algorithms	CDR (%)	SD (%)	CDR (%)	SD (%)
${N_{L}}$	OS-ELM	78.64	8.31	77.15	9.48
${N_{L}}$	MOS-ELM	90.31	5.04	96.13	2.42
${N_{H}}$	OS-ELM	97.80	0.89	98.35	0.99
${N_{H}}$	MOS-ELM	97.23	0.95	99.48	0.47
${T_{22}}$	OS-ELM	96.12	1.77	97.35	1.3
${T_{22}}$	MOS-ELM	99	0.83	99.48	0.41
${P_{22}}$	OS-ELM	92.24	2.81	79.78	7.74
${P_{22}}$	MOS-ELM	93.5	1.65	95.55	2
${T_{3}}$	OS-ELM	96.96	1.75	98.13	0.99
${T_{3}}$	MOS-ELM	96.92	1.3	99.45	0.46
${N_{L}, P_{22}}$	OS-ELM	-	-	-	-
${N_{L}, P_{22}}$	MOS-ELM	98.94	3.68	99.63	1.3
${N_{H}, T_{3}}$	OS-ELM	100	0	100	0
${N_{H}, T_{3}}$	MOS-ELM	99.94	0.14	99.84	0.73

Note: CDR: correct detection rate.

© 2017 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lu, J.; Huang, J.; Lu, F. Sensor Fault Diagnosis for Aero Engine Based on Online Sequential Extreme Learning Machine with Memory Principle. Energies 2017, 10, 39. https://doi.org/10.3390/en10010039

AMA Style

Lu J, Huang J, Lu F. Sensor Fault Diagnosis for Aero Engine Based on Online Sequential Extreme Learning Machine with Memory Principle. Energies. 2017; 10(1):39. https://doi.org/10.3390/en10010039

Chicago/Turabian Style

Lu, Junjie, Jinquan Huang, and Feng Lu. 2017. "Sensor Fault Diagnosis for Aero Engine Based on Online Sequential Extreme Learning Machine with Memory Principle" Energies 10, no. 1: 39. https://doi.org/10.3390/en10010039

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Sensor Fault Diagnosis for Aero Engine Based on Online Sequential Extreme Learning Machine with Memory Principle

Abstract

1. Introduction

2. Review of Online Sequential Extreme Learning Machine (OS-ELM)

3. Proposed Online Sequential Extreme Learning Machine with Memory Principle (MOS-ELM)

3.1. Formula Derivation

3.2. Evaluation Test

4. Sensor Fault Diagnosis for Aero Engines

4.1. Diagnosis Method

4.2. Diagnosis System for $N_{L}$ Sensor

4.3. Statistical Performance for Different Fault Mode

5. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Sensor Fault Diagnosis for Aero Engine Based on Online Sequential Extreme Learning Machine with Memory Principle

Abstract

1. Introduction

2. Review of Online Sequential Extreme Learning Machine (OS-ELM)

3. Proposed Online Sequential Extreme Learning Machine with Memory Principle (MOS-ELM)

3.1. Formula Derivation

3.2. Evaluation Test

4. Sensor Fault Diagnosis for Aero Engines

4.1. Diagnosis Method

4.2. Diagnosis System for N L Sensor

4.3. Statistical Performance for Different Fault Mode

5. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.2. Diagnosis System for $N_{L}$ Sensor