1. Introduction
Industrial processes are becoming more complex and their hazards to the environment are receiving increasing attention. Process safety is a non-negligible component of industrial processes, which comprises several steps such as hazard identification and analysis [
1]. In particular, the identification and analysis of hazards is a key step in the prevention and mitigation of major process accidents.
In industrial processes, timely and accurately identifying abnormal operating conditions can prevent major accidents and improve operational efficiency, thus achieving compliance with environmental and safety regulations. Dynamic process monitoring for hazard/fault identification is already a trend in the future development of process safety and risk management [
2]. The real-time monitoring of process operations to ensure safety measures is an essential step in the modern process industry [
3].
Fault detection plays a pivotal role in guaranteeing operation safety and reducing downtime in complex industrial processes [
4,
5]. Broadly speaking, fault detection techniques can be categorized into three classes, model-based, knowledge-based, and data-driven based methods [
6]. Model-based methods rely on the mathematical model. However, the mathematical model is often difficult or time-consuming to establish for complex industrial processes such as the blast furnace ironmaking process. For knowledge-based methods, the model is built from expert knowledge or qualitative information, which limits its applications for complex industrial processes. Conversely, only the measured process variables are required for data-driven fault detection methods. Thus, data-driven techniques are more suitable and efficient for the fault detection of complex industrial processes [
7]. With the advance of the sensor, communication, and computing technologies, large amounts of data are collected in modern industrial processes. Under such circumstances, data-driven fault detection has gained an explosive amount of attention in recent years from academia and industry [
8].
For data-driven methods, traditional multivariate analysis (MVA) and deep learning methods have gained considerable attention in the field of fault detection. The main advantage of deep learning methods is that they can learn the features of process data without feature engineering from deep neural networks. Deep learning methods have the powerful capability to capture the nonlinearity of process data through hierarchical abstraction. Recently, deep learning methods have been widely employed in fault detection and diagnosis. Wang et al. proposed a Bidirectional Gated Recurrent Unit (Bi-GRU) model for the fault diagnosis of Modular Multilevel Converters High Voltage DC (MMC-HVDC) [
9]. Yu et al. proposed a graph-weighted reinforcement network (GWRNet) for the fault diagnosis of rotating machinery [
10]. Velasco et al. developed a real-time anomaly detection intelligent system (RADIS) based on long short-term memory (LSTM) and a variational autoencoder (VAE) for the fault diagnosis of rotating machinery [
11]. However, there exist some limitations in applying deep learning for fault detection and diagnosis such as large dataset requirements, computational resources, and poor interpretability.
To handle the highly correlated high-dimensional process data, multivariate analysis (MVA) has been widely employed in industrial processes [
12]. In MVA, the process behavior is modeled by transforming the high-dimensional data into a lower-dimensional space. The features are extracted for establishing monitoring statistics. Among the MVA-based fault detection methods, principal component analysis (PCA) has gained widespread popularity in process monitoring and fault diagnosis in recent decades [
13]. In PCA, process data are projected into a lower-dimensional space to preserve the significant variability information as much as possible. Due to its efficiency and simplicity, PCA has been successfully applied in a large number of industrial processes [
14]. Despite this, PCA is regarded as a kind of globality-based linear dimensionality reduction technique. However, the process data mostly lie on or close to a low-dimensional manifold. Compared to globality-based methods, manifold learning is an approach to nonlinear dimensionality reduction which operates by discovering the manifold structure of data. In manifold learning, the input data are assumed to be sampled from a low-dimensional manifold. Representative manifold learning methods include Isomap [
15], locally linear embedding (LLE) [
16], Laplacian eigenmaps (LE) [
17], local tangent space alignment (LTSA) [
18], locality-preserving projections (LPPs) [
19], neighborhood-preserving embedding (NPE) [
20], and Hessian eigenmaps (or called Hessian LLE) [
21].
In NPE, each data point is represented as a linear combination of the neighboring data points. Then, an optimal embedding is found to preserve the neighborhood structure in the dimensionality-reduced space [
20]. Chen et al. [
22] applied eigenvalue decomposition and generalized eigenvalue decomposition to solve the unstable problem caused by a singularity problem in NPE, and developed an NPE-based incipient fault detection method for small-scale cyber-physical systems. Since the NPE method can preserve the local manifold structure of different modes, Song et al. [
23] performed NPE on the time-lagged variables for multimode dynamic process monitoring. LPP is designed to find the optimal linear approximations to the eigenfunctions of the Laplace Beltrami operator on the manifold through the nearest neighbor search in the low-dimensional space [
19]. Duan et al. [
24] employed LPP to preserve the local structure of process data, and then adopted a least squares support vector machine to predict the key-performance indicator. Zhang et al. [
25] combined LPP and PCA to preserve both global and local structures of the dataset and developed a fault detection and identification method by utilizing the extracted features. LLE attempts to discover nonlinear structure in high-dimensional data by exploiting the local symmetries of linear reconstructions [
21]. Wu et al. learned structure information by LLE and incorporated the extracted local information into canonical correlation analysis (CCA) for quality-relevant nonlinear process monitoring [
26]. Li and Zhang implemented the supervised locally linear embedding projection method for bearing fault diagnosis and illustrated its validity using the experimental data [
27]. Different manifold learning methods focus on uncovering the manifold structure with different criteria. They rely on the knowledge and experience of experts for their own purposes. Therefore, only partial information from the underlying manifold is learned by each existing local manifold learning method. Although there are many other manifold learning methods, LE, LLE, and HLLE are easily fused to characterize the geometric information of the manifold from different perspectives under the framework of the local tangent coordinate system. To take advantage of different manifold learning methods to better uncover the underlying manifold structure, Xing et al. [
28] provided a common framework to synthesize the partial information extracted from different local manifold learning methods under local tangent coordinates.
Motivated by the above discussions, a novel data-driven fault detection based on FLML is proposed in this paper. In the proposed FLML, the partial information on the geometric structure of the underlying manifold is firstly extracted from LE, LLE, and Hessian locally linear embedding (HLLE) methods, respectively. A novel objective function is formulated to fuse the extracted partial information. On the basis of the optimization results, FLML can learn the geometric information from different local methods. The geometric structure of the underlying manifold is more thoroughly explored by the proposed FLML, compared to LE, LLE, and HLLE. In the proposed FLML method, the richer local information can be exploited by taking the data proximity, local linear relationships, and local Hessian structures into account simultaneously. On the other hand, the proposed FLML can manifest robustness due to the local information extracted from different views. Compared to the method developed in [
28], which requires the determination of fusion coefficients, only the global embedding coordinates are obtained in the proposed FLML. Thus, the proposed FLML method is simpler. Like the PCA-based fault detection method, two monitoring statistics including Hotelling’s
and
Q statistics are established. The effectiveness and advantages of the proposed FLML-based fault detection are illustrated by an industrial Tennessee Eastman process benchmark and a real blast furnace ironmaking process.
The rest of this paper is organized as follows.
Section 2 briefly introduces the ideas of LE, LLE, and HLLE.
Section 3 illustrates the proposed FLML method and its application in fault detection in detail. In
Section 4, the proposed FLML-based fault detection approach is verified through an industrial Tennessee Eastman (TE) process benchmark and a real blast furnace ironmaking process. Finally,
Section 5 provides the conclusion.