1. Introduction
Critical engineering systems, such as aircraft gas turbines, must run safely and economically for their entire lifetimes. The anomaly detection and failure prediction of the gas turbines is of great importance for ensuring the reliable operation of safety-critical systems, which requires an accurate assessment of gas turbine conditions. Gas turbine condition assessment tracks measurable parameters during flight to derive insights into its current health state and trends for effective operation and maintenance decision-making [
1].
Traditionally the condition of a gas turbine is assessed only based on discrete data points gathered during take-off and cruise, typically called engine performance snapshots [
2]. Gas turbine performance trend changes usually trigger diagnostics alerts, which determine if an engine’s performance is changing from its normal operating range. These discrete data points can effectively reduce the amount of data required for analysis, however, provide very little information to reflect the variation in the performance state of the engine throughout the entire flight segment. The data sparsity related to snapshot data leads to difficulties distinguishing between faults and random scatters. Depending on the faulty component and the severity of the fault, it may need multiple data points to detect, which may cause false alarms and missed alarms. The resulting latency in fault detection based on performance snapshots may increase the risk of secondary damage [
3,
4].
In the field of gas turbine conditions assessment, the general fault detection schemes use thermodynamic engine models for computing reference values representing the nominal performance of the aircraft engine. Fault detection performs a comparison between these reference values and in-flight measurements. Significant deviations between the measurements and model predictions indicate an underlying fault. Such gas turbine performance models are traditionally derived from physical principles by domain experts [
5]. The physics-based approach has the advantage of not requiring fault data to validate its performance, particularly in terms of explicability and ability to extrapolate. However, these models are costly to develop and usually are proprietary to the engine manufacturers, which are not available to the asset operators. An alternative paradigm is data-driven models developed directly from the operational data of an engine, where a flexible model structure is fitted to the system by training on historical data.
Currently, most airlines adopted Quick Access Recorders (QAR) for data acquisition, providing flight data continuously sampled at frequencies of 1 Hz and more, which is also referred to as full-flight data covering the whole flight. The availability of these data obtained from a large variety of sensors enables the introduction of new methodologies to assess engine condition, which offers the chance to detect engine faults within one flight more reliably to support more efficient in-service operations and maintenance decisions [
6,
7]. An approach for fault detection based on steady-state flight regimes of full-flight data is demonstrated in [
8,
9]. Weiss et al. proposed a steady-state fault detection framework with complete flight data using a one-class support vector machine, and high detection rates are demonstrated for various gas path component faults using synthesized datasets derived from full-flight data of commercially operated flights [
6]. Hartwell et al. propose a practical and computationally inexpensive method for in-flight real-time anomaly detection based on a convolutional neural network. The efficacy of the method has been demonstrated on both real-time-series and synthetic snapshot data [
7].
In recent years, the availability of big data from engineering systems has opened the era of digital twins, “defined as a virtual representation of a physical asset enabled through data and simulators for real-time prediction, monitoring, control and optimization of the asset for improved decision making through the life cycle of the asset and beyond” [
10]. In the gas turbine field, it is a commonly used method to establish a digital twin for both the whole engine level and unit level based on the high-fidelity physical model to support more efficient in-service operation and maintenance decisions. Kraf et al. adopted a top-down method to construct a multilevel digital twin model from the component level to the whole engine, which can be used to support engine life consumption prediction to improve maintenance decisions [
11]. Dawes et al. described a physics-based digital twin to support a through-life gas turbine service business model and they demonstrate how a digital geometry model can represent typical in-service component degradation and then support performance degradation prediction [
12]. These papers demonstrate the application of digital twins from an MRO perspective and that the high-fidelity physical-based simulation is a necessary step to ensure a high-precision digital twin.
Another important application of the DT for gas turbines is to construct a performance digital twin for real-time control, performance monitoring, and fault detection. In the context of performance monitoring and anomaly detection, a well-known solution is to build an anomaly detector in which an underlying digital twin is constructed based on an adaptive physics-based thermodynamic model of the engine provided by manufacturers. Zaccaria et al. adopted an adaptive physics-based model as a performance digital twin for aircraft engine fault detection and isolation [
13]. Panov et al. proposed a gas turbine PDT for real-time control and monitoring functionalities based on a physics-based performance model and real-time embedded computing [
14]. In general, these physics-based models are proprietary to manufacturers and usually unavailable to the asset operators, an alternative is to construct data-driven performance DT models on operating data from an in-service physical asset. However, constructing such a performance DT covering various stationary and non-stationary operating conditions requires massive amounts of historical flight data and complex deep models.
As an important enabling technology of data-driven Digital Twin, machine learning, especially deep learning, has recently gained attention due to its ability to learn fault patterns directly from raw sensor data and its capacity to handle non-linearity in complex temporal correlation [
15,
16]. In the context of anomaly detection, an anomaly detector is typically built on an underlying data-driven DT trained on real-time operating data from a physical asset to judge when it deviates from its normal behavior [
15]. Xu et al.. proposed a digital twin-based anomaly detection approach (ATTAIN: Anomaly deTection with digiTAl twIN)) which continuously and automatically builds a digital twin with live data obtained from a Cyber-Physical System that implements a Generative Adversarial Network to detect anomalies [
15]. Castellani et al. introduced a Digital Twin-based anomaly detection method that is tailored for weakly supervised settings with very few labeled data samples. The method is demonstrated on real-world use-case data, and the developed solutions outperform state-of-the-art anomaly detection approaches [
17]. It has shown high-grade performance because of its power to deal with unstructured and unlabeled data, which is of great significance for constructing aero-engine condition monitoring digital twins.
Currently, most airlines adopted Quick Access Recorders (QAR) for data acquisition, and the availability of these big data sets enables the introduction of new PDT-based approaches in engine condition monitoring. Data-driven Performance Digital Twin (PDT) is an important method to accomplish real-time condition monitoring covering the full flight of an engine under various operating and environmental conditions. However, the research on the PDT modeling of the aircraft engine covering the whole flight conditions is very limited. This study aims to develop a novel approach to anomaly detection based on the digital twin paradigm that is an accurate simulation of an individual ’as-operated’ gas turbine. The developed digital twin consists of two parts: the Digital Twin model and fault detection capability. The Digital Twin model is a digital representation of the expected behavior of a real-world gas turbine, named Performance Digital Twin (PDT), which capitalizes on multivariate time series data obtained from the physical asset in operation to enable gas turbine performance tracking. Depending on the context, a digital twin can provide various capabilities; in this article, we focus on the anomaly detection capability of a digital twin. The main contributions of this paper is to propose a novel data-driven performance Digital Twin with uncertainty quantification, denoted as uncertain performance digital twin (UPDT), just based on the real-world gas turbine operational data rather than the physics-based model. The proposed UPDT produces a probabilistic digital representation of the expected performance behavior of a real-world gas turbine. Then based on the UPDT, the fault detection capability is developed and a novel anomaly measure based on the first Wasserstein distance is proposed to characterize the full flight data to detect anomaly.
The remainder of the article is organized as follows.
Section 2 presents an overview of the proposed framework and
Section 3 describes the LSTM-AE-based scheme for performance digital twin. The PDT uncertainty quantification and fault detection measures are discussed in
Section 4 and
Section 5. A case study is carried out to demonstrate the developed method in
Section 6. Finally, a summary of the work and outlook are given in
Section 7.
2. Fault Detection Framework Based on UPDT
The developed fault detection framework based on the performance digital twin with uncertainty quantification is presented in
Figure 1. The developed framework comprises two parts: Uncertain performance digital twin (UPDT) and fault detection capability. UPDT aims to produce a probabilistic digital representation of the expected performance behavior of a real-world gas turbine operating under various conditions during a flight directly from raw sensor data. The replica can be used to simulate and predict the engine’s behavior at different ambient/operating conditions. Although simulation of the whole gas turbine is feasible, interesting subset signals, such as the operating conditions and gas path key performance parameters, are selected to form a performance digital twin based on engineering knowledge. The UPDT is first trained based on nominal historical data and then continuously learns from new data to improve its anomaly detection performance. The UPDT, a probabilistic simulation of an individual ’as-operated’ gas turbine, is then used to predict the performance parameters with confidence intervals for quantifying uncertainty in the models.
Fault detection capability is developed based on detecting UPDT outputs that have low probability under the training distribution, i.e., the Out-of-Distribution (OOD) detection mechanisms which have received remarkable attention in recent years [
18]. An anomaly detection model, such as a density model of normal representations [
19], a model of distances from some nominal samples [
20], or a model of reconstruction errors [
21], is created to compute an anomaly score. A threshold can be applied to this score in order to discriminate samples into fault and health. Density-based methods attempt to model the distribution of normal data with the assumption that the anomaly sample has a low likelihood. In contrast, the normal sample has a higher likelihood under the estimated density model. In this study, a density-based method is used to explicitly model the nominal historical data covering expected operating conditions with a multivariate Gaussian distribution and flag test data in low-density regions as anomaly samples based on their likelihoods.
3. Gas Turbine Performance Digital Twin
Traditionally, the gas turbine performance digital twins are based on an adaptive physics-based performance model provided by the engine manufacturers. Since these models are proprietary and usually unavailable to the asset operators, to circumvent the practical constraints of implementing physics-based PDT for airline companies, an alternative is given by data-driven models to develop data-driven PDT directly from the operational data of an engine. One of the key characteristics of a Digital Twin is the interconnection of information between the digital entity and the physical reality. In this paper, this is achieved through the use of sensors of the gas turbine which can directly measure the engine operating, control and performance parameters. The information exchange from the virtual representation to the physical reality is through informed decision-making, such as engine performance degradation assessment and fault detection to support operation and maintenance planning.
Constructing such a PDT covering various stationary and non-stationary operating conditions requires correctly assessing the temporal correlations in full-flight data. Currently, most airlines adopted Quick Access Recorders (QAR) for data acquisition, providing flight data continuously sampled at frequencies of 1 Hz and more during the whole flight, which is also referred to as full-flight data. The availability of these data sets enables the introduction of data-driven PDT–based approaches in engine condition monitoring.
The PDT aims to produce a digital representation of the expected performance behavior of a real-world gas turbine operating under various conditions during a flight. To correctly assess the transient performance of gas turbine engines requires the previous data points to be considered resulting in an auto-correlation. It is necessary to build models to capture time sequence information in the data. In the following, LSTM based deep learning network [
22,
23], which is well-suited for modeling sequential data with the temporal correlations in full-flight data, is implemented to model the steady-state and transient performance of gas turbines. LSTM is well-suited for sequence learning tasks and has been implemented for encoder and decoder networks for anomaly detection due to the capability of LSTM to model sequential data with temporal information [
24].
Autoencoder is an unsupervised learning algorithm that attempts to replicate its input to its output. The hidden layer
h inside the algorithm can describe a certain code for the input’s representation. It consists of an encoder and a decoder and is mainly applied for dimensionless feature extraction. The main function of the encoder is shown below:
where
f contains a linear change
W and a nonlinear activation
b. The decoder converts the hidden representation h to the initial input in a similar manner, as shown below:
Here the parameters
can minimize the cost function. For AE, the form of intermediate layers is noteworthy. The algorithm should be able to predict the target signal y, so the encoding
h should carry the interrelationship of different sensor parameters. The aim of the learning process is to minimize a loss function
L as far as possible. The specific loss function for the proposed LSTM-AE based scheme will be discussed in
Section 4.
The combined framework of LSTM-AE is suitable for constructing the data-driven PDT due to its advantage in processing time-series data. The scheme of the proposed LSTM-AE based PDT is shown in
Figure 2. The architecture allows the streaming of data from selected sensors of a gas turbine in real-time into the developed PDT. To account for the temporal correlation, a temporal feature extraction utilizing LSTM neural network is used as a preprocessing step. A fixed-size sliding window technique is applied to create the input samples of the PDT. Input data is treated as a 2D window with length
T and width
S, where
T and
S are time steps and the number of selected signals. Each input sample is recorded as
Xt, and
Xt is a matrix of
j × 1, while each column
j represents a signal at time
t. The matrix
Xt here can be regarded as a “window” that moves on data series, and the elements inside the window represent the condition of the aero-engine within a specific time interval. This technique allows the trend information to be preserved, which is suitable for processing dynamic and variable time series data.
4. PDT Uncertainty Quantification
The digital twin incorporates as-operated data of the physical product to assist in the predictive and decision-making process. The goal of the PDT training is to learn accurate reconstruction of the normal performance behavior of the engine under continuously varying flight conditions. However, the lack of knowledge about the uncertainty of data captured from the physical domain, and consequently of models created from them, has a great impact on how much a PDT conforms to its physical product. In general, uncertainty is classified as epistemic or aleatoric. Epistemic uncertainty relates to the lack of knowledge, caused by poor assumptions, poor models and missing data. On the other hand, the aleatoric uncertainty relates to the variability of physical processes, which is inherent to the non-deterministic nature of measurement processes [
25]. Calculation of the uncertainty is complex due to the large number of factors affecting it.
To improve the reliability and robustness of fault detection, the data-driven PDT should produce a probabilistic digital representation of the expected performance behavior of a real-world gas turbine. In this paper, a novel Uncertain Performance Digital Twin (UPDT)is proposed to take into account various uncertainty, such as operating condition disturbances, engine dynamics as well as measurement uncertainty. In the following, the performance prediction uncertainty quantification is taken into account in the deep neural networks to achieve a more reliable and robust fault detection.
For modeling the uncertainty, the gas turbine performance measurements are assumed to be sampled from a given probability density function
. The parameter
of the probability density function characterizing the performance prediction is then estimated by the UPDT based on input
Xt. The estimation of the distribution of the output data
is central to the following anomaly detection scheme. In this work, output distribution is described using a multivariate Gaussian with mean
and correlation matrix
given by
To reduce the complexity of the artificial neural network and, therefore the total number of parameters to be estimated, the performance measurements are considered to be sampled independently, leading to uncorrelated measurement noise and, therefore, negligible cross-correlations . This simplification collapses the correlation matrix into a diagonal matrix = diag (, …, ).
The UPDT model is trained through a mini-batch stochastic gradient descent approach to reconstruct the value of the target parameter
Yt at the current moment with uncertainty quantification. Optimization of the weights of a deep neural network requires an optimization target. The objective function is defined as maximizing the likelihood of observing the data
Yt underlying the chosen probability density function
, which is equivalent to the minimization of the negative log-likelihood:
The goal of the proposed UPDT in this work is to accurate reconstruction of the normal behavior of the engine with uncertainty estimation under continuously varying flight conditions. Various approaches to uncertainty estimation in deep neural networks are available, such as using dropout at run-time [
19], using ensembles as a prediction scatter and Bayesian neural network solutions [
26]. In this work, the explicit estimation approach is used, which allows us to use a specific probability density function to characterize the estimation uncertainty while retaining the flexibility of non-Bayesian neural networks.
5. Fault Detection Capability
Fault detection capability is developed based on detecting UPDT outputs that have low probability under the training distribution. A density-based anomaly detection model is created to compute an anomaly score. For some instances x, these methods then yield an outlier score
. A threshold
can be applied to this score in order to discriminate samples into anomaly and health [
18].
Density-based methods attempt to model the distribution of normal data, with an assumption that the anomaly sample has a low likelihood whereas the normal sample has a higher likelihood under the estimated density model. In this study, a density-based method is used to explicitly model the historical nominal data covering expected operating conditions with a multivariate Gaussian distribution, and flag test data in low-density regions as anomaly samples based on their likelihoods.
Data scatter will always be present in the full flight data because of measurement system accuracy, recording accuracy, and actual stability of the aircraft engine during the data acquisition. Since there will always be a certain number of statistical outliers during one flight, an anomaly score,
characterizing the full flight data is proposed based on the first Wasserstein distance [
27]:
where
is the set of (probability) distributions on
whose marginals are
and
on the first and second factors respectively. The first Wasserstein distance, also known as the earth mover’s distance, computed distance between two 1D distributions, where the input distributions can be empirical, therefore coming from samples whose values are effectively inputs of the function. In this study,
U comes from the training samples of the historical nominal data, and
V comes from the test sample of one flight. If the outlier score exceeds a predefined threshold
, the outliers are no longer considered statistical but systematic, indicating a fault.