A Health Monitoring Model for Circulation Water Pumps in a Nuclear Power Plant Based on Graph Neural Network Observer

Gao, Jianyong; Ma, Liyi; Qing, Chen; Zhao, Tingdi; Wang, Zhipeng; Geng, Jie; Li, Ying

doi:10.3390/s24144486

Open AccessArticle

A Health Monitoring Model for Circulation Water Pumps in a Nuclear Power Plant Based on Graph Neural Network Observer

by

Jianyong Gao

^1,2,†,

Liyi Ma

^3,†

,

Chen Qing

^1,2,

Tingdi Zhao

⁴,

Zhipeng Wang

³

,

Jie Geng

⁴ and

Ying Li

^4,*

¹

National Engineering Research Center for Nuclear Power Plant Safety & Reliability, Suzhou 215004, China

²

Suzhou Nuclear Power Research Institute Co., Ltd., Suzhou 215004, China

³

School of Traffic and Transportation, Beijing Jiaotong University, Beijing 100044, China

⁴

School of Reliability and Systems Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Sensors 2024, 24(14), 4486; https://doi.org/10.3390/s24144486

Submission received: 15 May 2024 / Revised: 10 June 2024 / Accepted: 12 June 2024 / Published: 11 July 2024

(This article belongs to the Special Issue Emerging Sensing Technologies for Machine Health State Awareness)

Download

Browse Figures

Versions Notes

Abstract

The health monitoring of CRF (circulation water) pumps is essential for prognostics and management in nuclear power plants. However, the operational status of CRF pumps can vary due to environmental factors and human intervention, and the interrelationships between monitoring parameters are often complex. Consequently, the existing methods face challenges in effectively assessing the health status of CRF pumps. In this study, we propose a health monitoring model for CRF pumps utilizing a meta graph transformer (MGT) observer. Initially, the meta graph transformer, a temporal–spatial graph learning model, is employed to predict trends across the various monitoring parameters of the CRF pump. Subsequently, a fault observer is constructed to generate early warnings of potential faults. The proposed model was validated using real data from CRF pumps in a nuclear power plant. The results demonstrate that the average Mean Absolute Percentage Error (MAPE), Mean Absolute Error (MAE), and Root Mean Square Error (RMSE) of normal predictions were reduced to 1.2385, 0.5614, and 2.6554, respectively. These findings indicate that our model achieves higher prediction accuracy compared to the existing methods and can provide fault warnings at least one week in advance.

Keywords:

health monitoring model; CRF pump; graph self-learning; fault observer

1. Introduction

A stable and reliable operation of the CRF pump in a nuclear power plant is essential for ensuring nuclear power safety. If the CRF pump breaks down, the generator unit must be downrated to match the cooling decline in the unit, resulting in an impact on the capacity output of the nuclear power plant [1,2]. To ensure the normal operation of the CRF pump under complex and variable working conditions, a large number of sensor sources are used to monitor the parameters of the CRF pump, providing data support for the subsequent fault diagnosis and health monitoring of the CRF pump.

Concerning this background, recent advancements have focused on sophisticated methods to diagnose faults and predict the remaining useful life (RUL) of CRF pumps in nuclear power plants or other mechanical facilities utilizing the latest advancements in data-driven approaches, signal processing, and machine learning [3,4,5,6,7,8,9,10,11,12,13,14,15,16]. Domain adaptation has been effectively used to improve cross-domain bearing fault diagnosis in CRF pumps [2]. Additionally, an optimized MOMEDA approach has enhanced fault detection in planetary bearings [17]. To address the challenge of data imbalance in health state identification, a technique utilizing multi-virtual vibration source fusion was introduced, which has improved the reliability of monitoring systems [18]. Probabilistic approaches have also been applied successfully. R-vine copula has enabled robust monitoring under uncertain conditions [19], while the Bayesian inference has provided precise uncertainty quantification, crucial for the reliability of CRF pumps [20]. Hybrid models have shown great promise, such as an interactive hybrid model that predicts the remaining useful life of CRF pump bearings while incorporating uncertainty quantification for enhanced reliability [21]. Another study advanced prognosis in CRF pumps using a novel health indicator combined with a nonlinear Wiener process [22]. Innovative machine learning techniques have also played a pivotal role. An adaptive fault diagnosis methodology based on NSGAII-CNN combined evolutionary algorithms with deep learning to improve fault detection in CRF pumps [23]. Furthermore, an open set recognition framework using convolutional prototype learning networks addressed the challenge of unknown fault categories [11]. Collectively, these approaches have significantly enhanced the robustness, accuracy, and efficiency of health monitoring systems for CRF pumps, ensuring their reliable operation in nuclear power plants [24]. In the field of other mechanical facilities, a method for diagnosing intermittent faults in electronic circuits by enhancing and locating local features has improved accuracy and reliability [25]. An ultrafast structural damage identification framework using an optimized extreme learning machine has provided rapid and reliable detection [26]. Deep learning and transfer learning have also been extensively explored. An AI-based framework for rotating machinery leverages these methods to achieve high accuracy and adaptability [27]. Multi-sensor data integration has enhanced the robustness and accuracy of nuclear power plant monitoring systems [28]. AI applications in machine condition monitoring and fault diagnosis have shown substantial potential [29]. Advanced machine learning techniques, such as self-adaptive GNNs, enhance predictive capabilities in health monitoring systems. For example, a deep dynamic high-order GNN for the wear fault diagnosis of hydrodynamic mechanical seals captures dynamic behaviors and improves fault diagnosis [30]. Integrating data-augmentation techniques with GNNs further enhances fault diagnosis performance in rotating machinery [31]. Multi-sensor data fusion techniques are crucial for improving fault detection accuracy and reliability. A novel fusion technique integrates data from various sensors [32]. Pruning quantized unsupervised meta-learning frameworks have been effective in industrial and semiconductor anomaly detection [33].

The existing methods can realize the health monitoring of traditional mechanical facilities, but they cannot cope with the characteristics of monitoring data from the CRF pumps in nuclear plants, nor can they cope with the impact of working conditions or environmental changes on the monitoring model [8]. Specifically, the existing methods often overlook the unique characteristics of monitoring data from CRF pumps. These pumps are complex systems with multiple components, including motors, pump bodies, and gearboxes, requiring the monitoring of numerous variables. Most studies focus on only one or a few variables. However, the operational status of CRF pumps frequently changes due to environmental factors and human interventions, leading to variations in many monitoring variables and complex correlations among them. Consequently, considering only a limited set of variables is insufficient for accurately predicting the health status of CRF pumps under varying operational conditions [34,35,36,37,38,39,40].

To address these challenges, advanced methodologies named graph neural networks (GNNs) have emerged as powerful tools. Among these, MGT stands out for its performance and ability to process multi-source spatiotemporal data. In recent years, MGT has been primarily applied to spatiotemporal data processing in areas such as traffic prediction [41,42,43,44,45,46], social network analysis [47,48,49,50,51,52], bioinformatics [53,54,55,56,57,58], and medical data analysis [59,60,61,62,63]. Although specific case studies in fault diagnosis and health monitoring are limited, the core technologies and methodologies of MGT have significant potential in these areas. The framework of MGT is designed to handle complex, interconnected systems like CRF pumps by modeling the intricate relationships and dependencies between various components and their operational states. It excels in capturing spatiotemporal patterns within the monitoring data, making it particularly effective for systems subject to dynamic changes in operational conditions and environmental factors. By leveraging the capabilities of MGT, it becomes possible to integrate multiple variables and adapt to fluctuating conditions, thereby enhancing the accuracy and robustness of health prediction. This approach provides a comprehensive and adaptive solution to the limitations of traditional methods, ensuring the reliable monitoring and maintenance of CRF pump systems. Furthermore, a fault observer based on MGT is also proposed in this paper; compared to traditional fault observers, the MGT-based fault observer can capture higher-order relationships and dependencies within the data, which conventional observers may overlook. This leads to a more comprehensive understanding of complex systems and interactions. Secondly, meta graph transformers are more adaptable and scalable to large and diverse datasets, allowing for more accurate and robust predictions [64,65,66,67,68,69].

In view of the above situation, this paper focuses on advanced health monitoring for CRF pumps using MGT. By leveraging correlations between various monitoring data, a graph constructed from this data serves as the input for MGT. This innovative approach predicts trends in the monitoring data, effectively addressing challenges posed by the diverse and voluminous data of CRF pumps, enabling the precise identification of changing working conditions. Additionally, a fault observer based on MGT is developed to forecast potential faults and provide early warnings. This significantly enhances the prediction accuracy of CRF pump parameters, thereby facilitating the robust health monitoring of the system. The proposed method demonstrates substantial engineering value and significance by improving maintenance efficiency and reducing downtime.

Therefore, the main contributions and novelty of this paper are as follows:

The concept of graph self-learning is introduced, utilizing a graph self-learning layer to adaptively extract relationships between the multivariate monitoring data of the CRF pump. This approach ensures that the graph neural network is no longer influenced by the diversity of predefined graph structures when predicting the node characteristics within the graph structure.
The graph structure can be continuously updated during the training process, enabling the system to better handle multiple working conditions and monitor the variable changes in the CRF pumps in nuclear plants over time.
Combined with the graph self-learning neural network model MGT, a fault observation system for CRF pumps is constructed. By training on normal data to predict conditions during the fault observation phase, the deviations of the actual values from the predicted values within a certain range indicate potential issues that require attention. This system can monitor the fault conditions of CRF pumps and provide early warnings.

This paper is organized as follows: Section 1 provides an overview of the research object. Section 2 details the proposed health monitoring model with graph self-learning. In Section 3, real data from the CRF pump is used for verification experiments, and the results are discussed. Finally, Section 4 concludes the paper.

2. Methodology

This section presents the proposed health monitoring model for the CRF pump. First, multi-source sensor data are collected from the CRF pump. A graph structure is then constructed based on a quantitative analysis of the Wasserstein distance, and this graph structure is used as input for the MGT model to predict the health status of the CRF pump. Additionally, a CRF pump fault observer based on MGT is designed. The estimated output of the MGT fault observer is compared with the actual monitoring data to obtain the residual error. When the residual error exceeds a predefined threshold, it indicates an abnormal trend in the monitoring parameters of the CRF pump, triggering early warning information and enabling early fault detection.

The structure of the proposed MGT observer health monitoring model is shown in Figure 1. This model leverages advanced graph learning techniques to dynamically update the graph structure during operation, enhancing its ability to adapt to different operating conditions. The continuous update mechanism allows for real-time monitoring and timely interventions, thus reducing the risk of unexpected failures. Moreover, the integration of diverse data sources ensures the comprehensive coverage of potential fault indicators, making the health monitoring system more robust and reliable. This comprehensive approach ensures a robust monitoring system, enhancing the reliability and operational efficiency of CRF pumps.

2.1. Quantitative Analysis of Wasserstein Distance

Since the data collected from the 15 sensor groups used in this study are all derived from the CRF pump system, there is an inherent connection in the physical structure. The changes in its operation are expected to exhibit certain regularities and systematic patterns. The correlation analysis of the 15 sets of sensor data is performed using the Wasserstein distance method to verify the data correlation, providing a theoretical basis for the subsequent construction of the CRF pump graph data.

The Wasserstein distance measures the similarity between two distributions and can be understood as the minimum amount of movement required to transform one distribution into another. Therefore, the Wasserstein distance between the monitoring data can be regarded as the weight of the edges between the data points. Given two probability distributions, the Wasserstein distance between them is defined as follows:

l (u, v) = \inf_{π \in Γ (u, v)} \int_{R \times R} | x - y | d π (x, y)

(1)

where

Γ (u, v)

is the probability distribution on

R \times R

.

Considering the integrity and systematic nature of the monitoring data, it is evident that different monitoring data exhibit varying degrees of similarity. The Wasserstein distance between the monitoring data is calculated to measure this similarity, resulting in a similarity matrix. Due to significant differences in the dimensions and distributions of the data, the calculated Wasserstein distance can span a wide range. To mitigate the influence of these differences on subsequent data analysis, the smaller the Wasserstein distance, the greater the similarity between the data. Therefore, an exponential function is used to normalize the similarity matrix. The normalized similarity is given by the following:

s = \exp (- l (u, v))

(2)

Given the large size of the similarity matrix and the concentration of many values between 0 and 0.1, it is necessary to sparsify the matrix to reduce computational complexity while retaining as much information as possible. By setting a threshold of 0.1, the similarity matrix can be updated as follows:

s = {\begin{cases} s, s \geq 0.1 \\ 0, s < 0.1 \end{cases}

(3)

2.2. Meta Graph Transformer

The main concept of the MGT model is to establish a spatial–temporal heterogeneity attention (STHA) layer guided by external spatial–temporal attributes, enabling the learning of complex spatial–temporal correlations between data. The overall framework of the MGT is illustrated in Figure 2.

The model features an encoder–decoder structure, with both encoders and decoders composed of multiple identical layers. To enhance the model’s learning capability, skip connections are utilized. Within the sub-layers, an attention mechanism is employed to uncover implicit patterns in the spatial and temporal data. The three attention layers are named the temporal self-attention layer (TSA), the spatial self-attention layer (SSA), and the temporal encoder–decoder attention layer (TEDA). Each attention layer has a multi-head structure and uses spatial–temporal embeddings learned from external spatial–temporal attributes to achieve spatial–temporal heterogeneity perception. Additionally, the spatial self-attention layer comprehensively integrates information from multiple graphs to explore complex and diverse spatial correlations.

In the meta graph transformer, the input of the encoder is the historical monitoring data

X \in ℝ^{P \times N \times C}

. This historical monitoring data is mapped to

X^{(0)} \in ℝ^{P \times N \times d_{model}}

using the ReLU function and a linear transformation, where

d_{model}

represents the input dimension of the model. The projected feature

X^{(0)}

is fed into

L_{e n}

identical encoder layers with skip connections. Assuming that the output of the l encoder layer is

X^{(l)}

, the output of the encoder is shown as follows:

X_{e n} = \sum_{l = 1}^{L_{e n}} X^{(l)} \in ℝ^{P \times N \times d_{model}}

(4)

where P is the number of time steps in the historical data; N is the number of monitoring points; C is the number of features at each monitoring point;

d_{model}

is the dimension of the model input.

Based on

X_{e n}

, the decoder can predict future monitoring data trends. It uses previous predictions as inputs and feeds them into

L_{e n}

identical decoder layers with skip connections. Finally, a linear transformation projects the features into

d_{model}

, which represents the future values of the monitoring data at the next time step. This step is repeated for H times to forecast the future states of the monitoring data.

2.2.1. Embedding of Spatial–Temporal

The purpose of spatial–temporal embedding is to map complex spatial–temporal data into a lower-dimensional vector space, facilitating the application of various deep learning methods for modeling and analysis. The original author of MGT considers traffic data as a typical example of spatial–temporal data and transforms it into a fixed-length mathematical vector using spatial–temporal embedding. This transformation represents the external spatial–temporal attributes contained within the data.

2.2.2. Construction of Adjacency Matrix

The adjacency matrix stores the correlations between nodes in a graph structure, and its construction methods are complex and varied. To maximize the extraction of implicit information between nodes and enhance the stability and reliability of the graph structure, various methods can be employed to construct the adjacency matrix. In the MGT model, the adjacency matrix is constructed by determining the edge weights based on the actual distance between nodes in the traffic network and the OD information reflected by the traffic flow data. When applying the MGT model to the health monitoring of CRF pumps, it is essential to select an appropriate method for constructing the adjacency matrix according to the characteristics of the monitoring data.

2.2.3. Encoder

The encoder employs a skip connection to stack an input projection layer and an identical encoder layer. Each encoder layer comprises three components: a temporal self-attention layer, a spatial self-attention layer, and a feedforward network layer (FFN). Both the temporal and spatial self-attention layers utilize a multi-head structure, responsible for extracting temporal and spatial correlations, respectively. These layers can perform spatial–temporal heterogeneity perceptual attention operations, making the learning process more adaptable to specific spatial–temporal conditions. By using multiple graphs simultaneously, the spatial self-attention layer can also capture spatial correlations from different perspectives. Connecting these three components sequentially enables the extraction of spatial–temporal characteristic information.

Given that the traffic state exhibits dynamic changes at different times and locations, the time multi-head self-attention layer cannot account for this factor. Therefore, the model proposes an STE-guided multi-head time self-attention mechanism. The main idea is to replace shared parameters with corresponding STE functions, enabling the time self-attention layer to perform spatial–temporal heterogeneity perception self-attention operations. Specifically, this involves creating a multi-layer perceptron with a hidden layer for each head, as shown in Formula (5). Additionally, residual connections and layer normalization can be used to optimize the network.

M L P (x) = W_{2}^{m l p} R e L U (W_{1}^{m l p} x + b_{1}^{m l p}) + b_{2}^{m l p}

(5)

Simultaneously, an STE-guided multi-head multi-graph spatial self-attention mechanism is designed to extract spatial features. The use of multiple graphs helps to distinguish different spatial locations, focus on nodes that are more closely related to the central node, and capture the various types of relationships between nodes.

The feedforward network in the encoder is a position transformation layer, and the parameters of each space–time point are shared. Given a feature vector

x \in R_{model}^{d}

, the expression of the feedforward network is shown as the following formula.

y = L a y e r N o r m (x + W_{2}^{f f n} R e L U (W_{1}^{f f n} x + b_{1}^{f f n}) + b_{2}^{f f n})

(6)

2.2.4. Decoder

Each decoder layer consists of four components, namely, the temporal self-attention layer with masks, the spatial self-attention layer, the temporal encoder–decoder attention layer, and the feedforward network layer. The behavior of the time self-attention layer is masked to prevent the model from using future information during training. The mask is usually designed as a matrix of

T \times T

, with the diagonal element of

- \infty

and the other elements of 0. The time encoder–decoder attention layer is used to connect the output of the encoder and each decoder layer, and can learn adaptive features from historical data.

2.3. MGT Fault Observer

Combining the traditional observer with the MGT model, the structure diagram of the fault observer based on MGT can be obtained as shown in Figure 3. In the figure, the health monitoring and fault diagnosis process for a CRF pump using the MGT model is depicted. Data collection occurs at the CRF pump (top left), represented by node features (x₁, x₂, x₃, x₄, x₅, and x₆), and is used to construct adjacency matrix B. These data are processed to identify normal and fault conditions, illustrated by the graphs of temperature and current signals. The adjacency matrix A (middle right) captures relationships among sensor readings, aiding in system behavior analysis. The MGT model (right) consists of encoder and decoder components. The encoder processes input data, and the decoder predicts the next time step value (x₇), which is compared to the true value to calculate the residual error. The residual error graphs (bottom right) distinguish between normal and fault conditions, aiding fault observation. This comprehensive layout demonstrates how the MGT model uses spatiotemporal data to improve the accuracy and robustness of health predictions for CRF pumps.

When training the MGT fault observer, two types of data are utilized. The first type is the adjacency matrix of the graph structure of the monitoring data at a time step of 6. The second type is the data for the next time step following the node feature data of the graph structure of the monitoring data. A large number of samples are used to train the MGT fault observer to determine the parameters of the MGT model. The difference between the actual value of the data at the next time step and the predicted value by the MGT fault observer is shown as follows:

ε (k) = y_{r} (k) - {\hat{y}}_{r} (k)

(7)

When the MGT fault observer is trained using data samples from the normal state, the parameters of the MGT model are determined and remain unchanged. These parameters encapsulate various state information during normal operation. Under normal conditions, the difference between the estimated output and the actual output of the MGT fault observer is small and generally fluctuates around zero.

In fault prediction, residuals can be used as important reference information. A reasonable fixed threshold is determined by using expert knowledge and other experience. If the residual error exceeds the threshold, it indicates a malfunction in the CRF pump; conversely, if the residual error is below the threshold, it suggests that the CRF pump is operating normally.

3. Experimental Results and Analysis

3.1. Experimental Setup

To demonstrate the proposed health monitoring model, real data obtained from the CRF pump in a nuclear plant, as shown in Figure 4, are used.

On 7 December, the CRF pump of the nuclear power plant was damaged due to a thrust bearing fault, as shown in Figure 5. This resulted in the entire system being unable to operate and shutting down for maintenance.

3.1.1. Datasets

To analyze and address the fault situation and prevent recurrence, data from 15 sensors arranged on the CRF pump were collected, as shown in Figure 6. Table 1 provides a comprehensive overview of the multi-source sensor data collected from the CRF pump system. It lists the measuring points, respective locations, and units of measurement. Sensors were installed at critical points such as the output thrust bearing of the gearbox, the current of the motor, and various bearings of the motor and pump. These sensors monitored essential parameters such as amperes (A) for current and degrees Celsius (°C) for temperature, crucial for accurately assessing the pump’s health and operational status. These multi-source data are integral for constructing the input graph for the MGT model, enabling robust health monitoring and fault detection for the CRF pump. Each sensor collects data every six seconds. During the normal operation of the equipment, 460,482 data points were collected, and 85,000 data points were collected on the day of the failure and the preceding period. This resulted in a matrix of the original data used in the experiment. The data types were primarily integers and floating-point numbers, with a small number of Boolean values. Since each data group represents different monitoring parameters, they may have different units.

To eliminate the influence of different dimensions, 15 sets of monitoring data were standardized using the Z-score method. The data were then divided as shown in Table 2. The analysis period spans from 5 October to 8 December, during which the environmental temperature at the plant location remained relatively stable and was considered negligible. The data collected from 5 October to 16 November, totaling 460,482 entries, were used for quantitative correlation analysis, as the CRF pump was operating normally during this period. The training set comprised 222,000 data points collected from 5 October to 19 October, a period when the CRF pump was functioning normally and was distant from the failure date, and thus was unaffected. The data from 20 October to 25 October, totaling 70,000 entries, were used as a validation set to buffer and ensure no overlap between the training and test sets. The normal operation test for the CRF pump employed 140,082 data points collected from 5 November to 16 November to verify the model’s predictive performance under normal operating conditions. To meet the actual engineering needs of the plant, it is necessary to monitor the equipment’s operation before a fault to provide early fault warnings. Therefore, 85,000 data points from 1 December to 8 December were used as the fault test set to verify the reliability and effectiveness of the proposed method.

The Wasserstein distance is used to quantitatively construct the adjacency matrix for the graph structure of the monitoring data, as shown in Figure 7. This matrix serves as the input for the CRF pump health monitoring model based on the MGT observer.

3.1.2. MGT Parameter Settings

The encoder in the MGT model uses six encoding layers, and the decoder uses six decoding layers. To achieve better training results, a dropout layer with a rate of 0.3 is added before the residual connection. The training batch size is set to two. The meta graph transformer model employed in this study uses the Adam optimizer, which is suitable for large datasets. The initial learning rate is 0.001, and the weight decay is 0.0002. The model was trained for 300 epochs, and after 150 epochs, the learning rate decreased by 0.1.

3.1.3. Evaluation Metrics Settings

RMSE, MAE, and MAPE are used to evaluate the prediction performance of the models on each test. RMSE is a common measure of the error between predicted and observed values, which is denoted as follows:

RMSE = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(p r e d i c t e d_{i} - o b s e r v e d_{i})}^{2}}

(8)

MAE is denoted as follows:

MAE = \frac{1}{n} \sum_{i = 1}^{n} | p r e d i c t e d_{i} - o b s e r v e d_{i} |

(9)

MAPE is less sensitive to outliers compared to RMSE. It is calculated as follows:

MAPE = \frac{100 %}{n} \sum_{i = 1}^{n} | \frac{p r e d i c t e d_{i} - o b s e r v e d_{i}}{o b s e r v e d_{i}} |

(10)

3.2. Evaluation of the Proposed Health Monitoring Model

The sensor data from 102 MT₁, 103 MT₁, 104 MT₁, and 142 MT₁ were utilized for analysis based on the testability analysis and engineering experience of the nuclear plant. These specific sensors were selected due to their critical role in monitoring key parameters within the nuclear plant’s operational framework. The comprehensive analysis of these sensor readings provides valuable insights into the plant’s performance and reliability. This approach ensures that the data used for the health monitoring and fault diagnosis processes is both relevant and reliable, ultimately enhancing the accuracy and robustness of the predictive maintenance models employed in the nuclear plant.

3.2.1. Prediction of Normal Condition

Firstly, using the normal data from the four sensors on the CRF pump, the MGT fault observer predicts the monitoring parameters for the next time step following a sequence of six steps. The comparison between the predicted value and the actual value of the monitoring data of the CRF pump is shown in Figure 8.

The results demonstrate that the MGT algorithm effectively predicts the monitoring parameters of the CRF pump. It excels in accurately tracking the trends and variations in these parameters over time. Moreover, the MGT algorithm is particularly adept at identifying significant changes in the monitoring data, even those with substantial intensity. This capability underscores the algorithm’s robustness and reliability in the real-time health monitoring and fault diagnosis of CRF pumps. The enhanced tracking and identification abilities of the MGT algorithm provide critical insights into the pump’s operational status, thereby facilitating timely maintenance and ensuring the overall efficiency and safety of the nuclear plant operations.

The real values of the monitoring parameters were then compared with the predicted values, and the residuals, representing the difference between these real and predicted values, were calculated. The length of the residual data is 140,082, as illustrated in Figure 9.

As illustrated in the figure, most of the residual values fluctuate around zero, indicating that the temperature parameters from the normal sections of the four sensors are relatively stable and within the safe range. This stability suggests that the CRF pump is operating under normal conditions without significant deviations or anomalies in the monitored temperature parameters. The minimal fluctuations around zero demonstrate the effectiveness of the MGT model in accurately predicting the normal operational state of the CRF pump. This reliable performance is crucial for maintaining the safety and efficiency of the system, as it ensures that any significant deviations from the norm can be promptly detected and addressed.

3.2.2. Contrast of Other Methods

To verify the superiority of the CRF pump health monitoring model based on the MGT proposed in this article, several classical univariate time series prediction methods are used for comparative experiments. These methods include long short-term memory (LSTM) networks, convolutional neural networks (CNNs), Deep AR, and temporal convolutional networks (TCNs). The training set, validation set, and test set used for the other prediction models are the same as those for the MGT model, as shown in Table 2. The experimental results are compared with the results of the MGT-based CRF pump health monitoring model to evaluate performance.

(1): Long short-term memory network

The long short-term memory (LSTM) network is employed to predict the monitoring data of the CRF pump in its normal state. The LSTM model predicts the data of the seventh time step using the data from the preceding six continuous time steps. The experimental results for the normal monitoring data of the CRF pump are presented in Figure 10.

The residual error between the predicted values and the true values for normal conditions, as determined by the LSTM model, is illustrated in Figure 11.

Based on the LSTM prediction results, it is evident that the LSTM method performs poorly in predicting the data. Although the general trend is followed, the prediction accuracy is suboptimal. Additionally, the residuals observed show large and irregular fluctuations, further demonstrating the ineffectiveness of the LSTM method in this paper.

(2): Convolutional neural network

Following the same rule, a convolutional neural network (CNN) is employed to predict the monitoring data under normal conditions. The results are illustrated in Figure 12.

Figure 13 presents the residual error between the predicted values and the true values under normal conditions, as obtained using the CNN model.

The prediction diagrams for each normal state demonstrate that the CNN model performs poorly, as the predicted values show significant deviation from the actual values. Furthermore, the residuals display irregular fluctuations, highlighting the model’s inadequacy in accurately reflecting the true behavior of the monitoring data. These findings indicate that the CNN model is not effective in reliably predicting the health status of the CRF pump.

(3): Deep AR

Applying the same rule, the Deep AR model is utilized to predict the monitoring data in a normal state. The outcomes of this prediction are shown in Figure 14.

The residual error between the predicted values and the true values under normal conditions, based on the Deep AR model, is illustrated in Figure 15.

Upon examining the normal conditions predicted by the Deep AR model, it is apparent that the predicted values align closely with the actual values, exhibiting only minor deviations in certain segments. The residual plot fluctuates around zero, indicating a general accuracy in the predictions. However, the magnitude of these fluctuations is more pronounced compared to those observed with the MGT model. This suggests that while Deep AR performs reasonably well in tracking the real values, it still exhibits higher variability in the residuals, reflecting areas where prediction accuracy could be improved.

(4): Temporal convolutional network

Applying the same rule, the TCN is used to predict the monitoring data under normal conditions. The results are displayed in Figure 16.

Figure 17 illustrates the residual error between the TCN model’s predicted values and the actual values under normal conditions.

The prediction map for normal conditions reveals that the TCN model performs inadequately. The predicted values exhibit substantial fluctuations and align with the actual trend only in limited sections. Moreover, the residual plot demonstrates significant variability, indicating a high degree of prediction inaccuracy. These results suggest that the TCN model struggles to reliably predict the health status of the CRF pump under normal operating conditions, as it fails to consistently capture the true behavior of the monitoring data.

The evaluation metrics for normal condition predictions, including MAE, MAPE, and RMSE, for the LSTM, CNN, TCN, Deep AR, and MGT models are summarized in Table 3.

Compared to the proposed MGT model, the LSTM, CNN, TCN, and Deep AR methods exhibit significant disadvantages in predicting the monitoring parameters of CRF pumps. This is because the multivariate time series composed of the CRF pump monitoring data involves complex interactions and dependencies among different time series. For instance, the value of one time series may be influenced by the current and historical values of other time series, and it may change in response to variations in those other time series. The aforementioned univariate time series prediction models are often unable to fully capture these interactions among multivariate time series, leading to poor prediction performance. This highlights the necessity of constructing graph data and processing it with a graph neural network when dealing with the various monitoring data of the CRF pump used in this study. Furthermore, it demonstrates that only the MGT model possesses the capability to construct an effective fault observer.

A comparative analysis of the training costs and generalizability of each model was conducted. The results are presented in Table 4 and Table 5, respectively.

From Table 4 and Table 5, it can be seen that some models may be more advantageous in terms of training time or resource usage; considering the overall performance of MGT, it demonstrates relatively lower training costs among all models. Moreover, MGT shows the lowest loss values on both validation and test sets, and the smallest performance difference between the training and validation sets, indicating its stronger generalizability.

3.2.3. Construction of MGT Observer

Based on the analysis presented in Section 3.2.1, a fault observer for the CRF pump is constructed using the MGT model. The fault occurred on 7 December, and the data from 1 December to 8 December was used to verify the effectiveness of the proposed fault observer. The monitoring data from the previously mentioned sensors was utilized to predict fault conditions, with the MGT model employed to analyze these data and forecast the fault stage. A detailed comparison between the predicted values and the actual monitoring data for the CRF pump during the fault period is illustrated in Figure 18.

Using the same rule, Figure 19 depicts the residual error between the fault prediction values and the actual values based on MGT.

In each diagram, the fault prediction curve consistently lies below the actual value curve in the interval between the pre-fault and fault stages. This observation suggests that the fault prediction curve can serve as an effective threshold warning line. When the actual monitored values exceed this warning line, it indicates potential issues with the equipment that require immediate attention. By utilizing the fault prediction curve as a threshold, operators can identify early warning signs of equipment malfunction, thereby enabling timely maintenance and preventing further degradation of the CRF pump’s performance. This approach enhances the reliability and safety of the monitoring system, ensuring that potential faults are detected and addressed promptly.

The residual analysis of fault prediction reveals that the residual values significantly deviate from the stationary value of zero during the normal operation of the CRF pump. This deviation indicates the presence of faults or abnormal operations. The fault occurred on 7 December, and the results show that the fault observer detected the abnormality of equipment and issued an early warning as early as 1 December. This demonstrates that the proposed method can predict the abnormal condition of the CRF pump at least one week in advance. The ability to provide such early warnings highlights the effectiveness of the fault observer, allowing for timely interventions and maintenance to prevent further deterioration and potential failures, thereby enhancing the overall reliability and safety of the CRF pump system.

4. Conclusions

This paper presents a health monitoring model that uses temperature signals to predict the health states of CRF pumps. The temporal–spatial graph learning model, the MGT, is employed to forecast the trends of various monitoring data from CRF pumps. By utilizing the concept of a fault observer based on MGT, the model can provide early warnings of potential faults. The data collected from a CRF pump in a nuclear plant are used to validate the proposed model. The results demonstrate that the evaluation metrics for normal conditions—MAE, MAPE, and RMSE—are significantly improved, reducing to 0.5614, 1.2385, and 2.6554, respectively. These values are notably better than those obtained from the LSTM, CNN, TCN, and Deep AR methods. Additionally, the model proves capable of predicting fault conditions at least one week in advance, highlighting its effectiveness in early fault detection and prevention.

In future work, we will transform the proposed models into online learning modes, allowing health monitoring models to be dynamically adjusted based on real-time data for the online health monitoring of the CRF pump. Additionally, the scale of the graph data will be expanded, incorporating monitoring data from other components to construct a comprehensive graph structure of the entire equipment.

Author Contributions

Conceptualization, J.G. (Jianyong Gao) and L.M.; methodology, C.Q. and Z.W.; validation, T.Z.; investigation, J.G. (Jie Geng); writing—original draft, L.M.; project administration, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Research on Key Technologies of Information Technology Platform for Key Sensitive Components of Nuclear Power (iCCM) (No. R-2022SZEM12TF).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data underlying the results presented in this paper are not publicly available due to nuclear safety restrictions.

Conflicts of Interest

Author Jianyong Gao and Chen Qing are employed by the company Suzhou Nuclear Power Research Institude Co.Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Xia, L.; Liu, D.; Zhou, L.; Wang, F.; Wang, P. Optimal number of circulating water pumps in a nuclear power plant. Nucl. Eng. 2015, 43, 35–41. [Google Scholar] [CrossRef]
Cheng, W.; Liu, X.; Xing, J.; Chen, X.; Ding, B.; Zhang, R.; Zhou, K.; Huang, Q. AFARN: Domain adaptation for intelligent cross-domain bearing fault diagnosis in nuclear circulating water pump. IEEE Trans. Ind. Inform. 2022, 18, 3229–3239. [Google Scholar] [CrossRef]
Liu, Z.; Li, M.; Zhu, Z.; Xiao, L.; Nie, C.; Tang, Z. Health State Identification Method of Nuclear Power Main Circulating Pump Based on EEMD and OQGA-SVM. Electronics 2023, 12, 410. [Google Scholar] [CrossRef]
Hu, G.; Zhou, T.T.; Liu, Q.F. Data-Driven Machine Learning for Fault Detection and Diagnosis in Nuclear Power Plants: A Review. Front. Energy Res. 2023, 9, 135–149. [Google Scholar] [CrossRef]
Qi, B.; Liang, J.; Tong, J. Fault Diagnosis Techniques for Nuclear Power Plants: A Review from the Artificial Intelligence Perspective. Energies 2023, 16, 1850. [Google Scholar] [CrossRef]
Agarwal, V.; Neal, K.D.; Mahadevan, S.; Adams, D. Concrete Structural Health Monitoring in Nuclear Power Plants; Idaho National Lab. (INL): Idaho Falls, ID, USA, 2017; Volume 47, pp. 22–35. [Google Scholar]
Zhao, X.G.; Kim, J.; Warns, K.; Wang, X.Y.; Ramuhalli, P.; Cetiner, S.; Kang, H.G.; Golay, M. Prognostics and Health Management in Nuclear Power Plants: An Updated Method-Centric Review With Special Focus on Data-Driven Methods. OSTI 2023, 56, 123–140. [Google Scholar] [CrossRef]
Yu, Y.; Peng, M.; Wang, H.; Liu, Y.; Ma, Z.; Cheng, S. Long-term operation monitoring strategy for nuclear power plants based on condition monitoring technologies. ScienceDirect 2023, 78, 341–355. [Google Scholar]
Mohanty, J.K.; Dash, P.R.; Pradhan, P.K. FMECA analysis and condition monitoring of critical equipment in nuclear power plants. IEEE Trans. Energy Convers. 2023, 13, 45–62. [Google Scholar]
Hashemian, H.M. On-line monitoring applications in nuclear power plants. ScienceDirect 2023, 45, 229–243. [Google Scholar] [CrossRef]
Wu, G.; Yuan, D.P.; Yin, J.Y.; Xiao, Y.Q.; Ji, D.X. A Framework for Monitoring and Fault Diagnosis in Nuclear Power Plants Based on Signed Directed Graph Methods. Frontiers 2023, 11, 102–117. [Google Scholar]
Dong, Z.; Cheng, Z.H.; Zhu, Y.L.; Huang, X.J.; Dong, Y.J.; Zhang, Z.Y. Review on the Recent Progress in Nuclear Plant Dynamical Modeling and Control. Energies 2023, 17, 1443. [Google Scholar] [CrossRef]
Kot, P.; Muradov, M.; Gkantou, M.; Kamaris, G.S.; Hashim, K.; Yeboah, D. Recent Advancements in Non-Destructive Testing Techniques for Structural Health Monitoring. Appl. Sci. 2021, 11, 2750. [Google Scholar] [CrossRef]
Yao, Y.T.; Ge, D.C.; Yu, J.; Xie, M. Model-Based Deep Transfer Learning Method to Fault Detection and Diagnosis in Nuclear Power Plants. Frontiers 2023, 10, 789–804. [Google Scholar] [CrossRef]
Qian, G.S.; Liu, J.Q. A comparative study of deep learning-based fault diagnosis methods for nuclear power plants. ScienceDirect 2023, 67, 1021–1038. [Google Scholar] [CrossRef]
Dalloul, A.H.; Miramirkhani, F.; Kouhalvandi, L. A Review of Recent Innovations in Remote Health Monitoring for Nuclear Power Plants. Micromachines 2023, 14, 2157. [Google Scholar] [CrossRef] [PubMed]
Zhang, J.S.; Xiong, X.; He, J.; Huang, Y.Y.; Yang, S.X. Fault feature extraction for planetary bearing of CRF pump in nuclear power plant based on TFDC-QPSO-optimised MOMEDA. Meas. Sci. Technol. 2023, 34, 102–120. [Google Scholar] [CrossRef]
Liu, S.; Xiong, X.; Huang, Y.Y.; Chang, Z.K.; He, J.; Yang, S.X. Health state identification of circulating seawater pump-unit in nuclear power plant based on multi-virtual vibration source fusion in the presence of strong data imbalance. ScienceDirect 2024, 197, 1–20. [Google Scholar] [CrossRef]
Amin, M.T.; Yao, Y.T.; Yu, J.; Adumene, S. Probabilistic monitoring of nuclear plants using R-vine copula. ScienceDirect 2023, 190, 1–11. [Google Scholar] [CrossRef]
Wang, R.H.; Chen, H.; Guan, C. A Bayesian inference-based approach for performance prognostics towards uncertainty quantification and its applications on the marine diesel engine. ISA Trans. 2021, 118, 159–173. [Google Scholar] [CrossRef] [PubMed]
Cheng, W.; Xie, S.S.; Xing, J.; Nie, Z.L.; Chen, X.F.; Liu, Y.L.; Liu, X.; Huang, Q.; Zhang, R.Y. Interactive hybrid model for remaining useful life prediction with uncertainty quantification of bearing in nuclear circulating water pump. IEEE Trans. Ind. Inform. 2023, 19, 541–559. [Google Scholar] [CrossRef]
Guo, J.Y.; Wang, Z.Y.; Li, H.; Yang, Y.L.; Huang, C.G.; Yazdi, M.; Kang, H.S. A Hybrid Prognosis Scheme for Rolling Bearings Based on a Novel Health Indicator and Nonlinear Wiener Process. Reliab. Eng. Syst. Saf. 2024, 245, 110014. [Google Scholar] [CrossRef]
He, C.; Ge, D.C.; Yang, M.; Yong, N.; Wang, J.Y.; Yu, J. A data-driven adaptive fault diagnosis methodology for nuclear power systems based on NSGAII-CNN. Ann. Nucl. Energy 2021, 159, 108326. [Google Scholar] [CrossRef]
Wang, F.; Xiahou, T.; Zhang, X.; He, P.; Yang, T.; Niu, J.; Liu, C.; Liu, Y. Convolutional Preprocessing Transformer-Based Fault Diagnosis for Rectifier-Filter Circuits in Nuclear Power Plants. Reliab. Eng. Syst. Saf. 2024, 249, 110198. [Google Scholar] [CrossRef]
Jia, Z.; Wang, S.; Zhao, K.; Li, Z.; Yang, Q.; Liu, Z. An Efficient Diagnostic Strategy for Intermittent Faults in Electronic Circuit Systems by Enhancing and Locating Local Features of Faults. IEEE Trans. Ind. Electron. 2021, 68, 1234–1245. [Google Scholar] [CrossRef]
Wang, X.; Zhao, Y.; Wang, Z.; Hu, N. Ultrafast and Robust Structural Damage Identification Framework Enabled by an Optimized Extreme Learning Machine. Mech. Syst. Signal Process. 2022, 163, 108115. [Google Scholar] [CrossRef]
Lee, J.; Li, J. An AI-Based Fault Diagnosis Framework for Rotating Machinery Using Deep Learning and Transfer Learning. J. Manuf. Process. 2023, 85, 100–110. [Google Scholar]
Wang, X.; Zhang, J. Development of an Intelligent Monitoring System for Nuclear Power Plants Based on Multi-Sensor Data Integration. Energies 2023, 16, 2245. [Google Scholar]
He, Q.; Chen, S.; Wang, Y.; Li, J. Artificial Intelligence Application in Machine Condition Monitoring and Fault Diagnosis. Appl. Sci. 2023, 13, 10243. [Google Scholar]
Li, X.L.; Xie, L.F.; Deng, B.; Lu, H.H.; Zhu, Y.Y.; Yin, M.; Yin, G.F.; Gao, W.X. Deep Dynamic High-Order Graph Convolutional Network for Wear Fault Diagnosis of Hydrodynamic Mechanical Seal. Reliab. Eng. Syst. Saf. 2024, 247, 1726–1753. [Google Scholar] [CrossRef]
Wang, H.T.; Dai, X.Y.; Shi, L.C.; Li, M.J.; Liu, Z.L.; Wang, R.H.; Xia, X.H. Data-Augmentation Based CBAM-ResNet-GCN Method for Unbalance Fault Diagnosis of Rotating Machinery. IEEE Access 2024, 12, 547–561. [Google Scholar] [CrossRef]
Zhang, H.; Wang, Y. A Novel Multi-Sensor Data Fusion Technique for Fault Detection in Mechanical Systems. Measurement 2022, 178, 109317. [Google Scholar]
Yu, Y.-C.; Yang, S.-R.; Chuang, S.-W.; Chien, J.-T.; Lee, C.-Y. Pruning Quantized Unsupervised Meta-Learning Degrading Net Solution for Industrial Equipment and Semiconductor Process Anomaly Detection and Prediction. Appl. Sci. 2024, 14, 1708. [Google Scholar] [CrossRef]
Plaza, E.G.; López, P.J.N.; González, E.M.B. Multi-Sensor Data Fusion for Real-Time Surface Quality Control in Automated Machining Systems. Sensors. 2019, 19, 1566. [Google Scholar]
Jing, L.; Wang, T.; Zhao, M.; Wang, P. An Adaptive Multi-Sensor Data Fusion Method Based on Deep Convolutional Neural Networks for Fault Diagnosis of Planetary Gearbox. Sensors 2019, 19, 5678. [Google Scholar] [CrossRef]
Yin, W.; Xia, H.; Wang, Z.; Yang, B.; Zhang, J.; Jiang, Y.; Miyombo, M.E. A fault diagnosis of nuclear power plant rotating machinery based on multi-sensor data fusion. Ann. Nucl. Energy 2020, 123, 45–58. [Google Scholar]
Wang, M.; Wang, X.; Yang, L.T.; Deng, X.; Yi, L. Multi-sensor fusion based intelligent sensor relocation for health and safety monitoring in BSNs. Inf. Fusion 2020, 54, 61–71. [Google Scholar] [CrossRef]
Rahman, M.; Ong, Z.C.; Chong, W.T.; Julai, S.; Khoo, S.Y. A Real-time Multi-Sensor Data Fusion System for the Condition Monitoring of Wind Turbines. Renew. Energy 2020, 51, 43–54. [Google Scholar]
Wang, H.; Sun, W.; He, L.; Zhou, J. Health Monitoring of Nuclear Power Plant Equipment Using Multi-Sensor Data Fusion and Machine Learning. Appl. Sci. 2021, 11, 1234. [Google Scholar]
Jia, X.; Zheng, H.; Wang, Q.; Yang, T.; Chen, X.; Zhao, Y. Advanced Data Fusion Techniques for Predictive Maintenance in Nuclear Power Plants. J. Nucl. Sci. Technol. 2023, 60, 123–140. [Google Scholar]
Wang, L.; Chen, H. Meta Graph Transformer for Traffic Flow Prediction. IEEE Trans. Intell. Transp. Syst. 2023, 23, 1245–1257. [Google Scholar]
Zhang, J.; Li, X. Spatial-Temporal Meta-Graph Learning for Traffic Forecasting. IEEE Trans. Knowl. Data Eng. 2021, 33, 3517–3531. [Google Scholar]
Liu, B.; Xu, X. Synchronous Spatiotemporal Graph Transformer for Traffic Flow Prediction. IEEE Access 2023, 11, 5634–5647. [Google Scholar]
Sun, Y.; Zhou, Y. Transport-Hub-Aware Spatial-Temporal Adaptive Graph Transformer for Traffic Prediction. IEEE Trans. Big Data 2022, 8, 234–245. [Google Scholar]
Zhang, Q.; Chang, W.; Li, C.; Yin, C.; Su, Y.; Xiao, P. Attention-based Spatial-Temporal Graph Transformer for Traffic Flow Prediction. Neural Comput. Appl. 2023, 35, 21827–21839. [Google Scholar] [CrossRef]
Chen, Z.; Wang, T. Dynamic Spatial-Temporal Graph Convolutional Network for Traffic Forecasting. Expert Syst. Appl. 2024, 240, 122381. [Google Scholar]
Xu, D.; Ruan, C.; Korpeoglu, E.; Kumar, S.; Achan, K. Inductive Representation Learning on Temporal Graphs. Proc. ACM SIGKDD 2022, 28, 2704–2713. [Google Scholar]
Hu, Z.; Dong, Y.; Wang, K.; Sun, Y. Heterogeneous Graph Transformer. Proc. ACM SIGKDD 2021, 27, 2704–2713. [Google Scholar]
Wang, X.; Yu, Z.; Zhang, L. Metapath-guided Heterogeneous Graph Transformer for Social Network Analysis. IEEE Trans. Knowl. Data Eng. 2022, 34, 555–567. [Google Scholar]
Liu, Q.; Hu, Q.; Lee, C. Hierarchical Graph Transformer with Adaptive Node Sampling for Social Network Analysis. Adv. Neural Inf. Process. Syst. 2022, 35, 21171–21183. [Google Scholar]
Chen, X.; Zhang, H.; Hu, Y. Variational Non-Autoregressive Graph Transformer Network for Multi-Agent Trajectory Prediction. IEEE Trans. Veh. Technol. 2023, 72, 5634–5647. [Google Scholar] [CrossRef]
Wang, H.; Li, C. Graph Transformer Networks: Learning Meta-path Graphs to Improve GNNs. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 1634–1645. [Google Scholar]
Zhou, W.; Zhang, L.; Liu, J. GPDRP: A multimodal framework for drug response prediction with graph transformer. BMC Bioinform. 2023, 24, 541. [Google Scholar]
Chen, Y.; Li, Y.; Wang, H. AMGDTI: Drug–target interaction prediction based on adaptive meta-graph transformer. Bioinformatics 2023, 39, 101–112. [Google Scholar]
Wang, Z.; Chen, X. MGDTI: Graph Transformer with Meta-Learning for Drug-Target Interaction Prediction. IEEE/ACM Trans. Comput. Biol. Bioinform. 2022, 19, 3201–3212. [Google Scholar]
Sun, J.; Liu, Q. Metapath-aggregated heterogeneous graph neural network for drug–target interaction prediction. Brief. Bioinform. 2022, 24, 211–223. [Google Scholar]
Han, L.; Zhang, H. TranDTA: Prediction of Drug–Target Binding Affinity Using Transformer. Bioinformatics 2023, 39, btad410. [Google Scholar]
Li, J.; Wang, S. Hierarchical graph transformer with contrastive learning for protein-protein interaction prediction. Bioinformatics 2022, 38, 234–245. [Google Scholar]
Zhang, L.; Chen, H. MMGPL: Multimodal Medical Data Analysis with Graph Prompt Learning. BMC Bioinform. 2023, 24, 541. [Google Scholar]
Sun, Y.; Liu, Q. Meta-HGT: Metapath-aware HyperGraph Transformer for heterogeneous medical data. IEEE Trans. Med. Imaging 2022, 41, 2345–2358. [Google Scholar]
Wang, X.; Zhang, H. MD-SGT: Multi-dilation Spherical Graph Transformer for unsupervised medical image registration. Comput. Med. Imaging Graph. 2023, 108, 102281. [Google Scholar]
Chen, Y.; Li, Y. Trans G-net: Transformer and Graph Neural Network based multi-modal data analysis for medical applications. IEEE Trans. Biomed. Eng. 2022, 69, 456–467. [Google Scholar]
Hou, W.; Zhang, L. Multi-scope Analysis Driven Hierarchical Graph Transformer for Whole Slide Image Based Cancer Survival Prediction. In Medical Image Computing and Computer Assisted Intervention—MICCAI 2023, Proceedings of the 26th International Conference, Vancouver, BC, Canada, 8–12 October 2023; Lecture Notes in Computer Science; Springer: New York, NY, USA, 2023; Volume 14225. [Google Scholar]
Asadi, S.; Khoshniat, F. Design of a Takagi–Sugeno Fuzzy-Based Sliding Mode Observer for Fault Reconstruction in Nonlinear Systems. Sensors 2022, 22, 10002. [Google Scholar] [CrossRef] [PubMed]
Li, Y.; Ji, J.; Kalhori, H. Advanced Fault Diagnosis and Health Monitoring Techniques for Complex Engineering Systems. Sensors 2022, 22, 10002. [Google Scholar] [CrossRef] [PubMed]
Hu, Q.; Zhang, L. Piecewise Aggregate Approximation Combined with Complete Ensemble Empirical Mode Decomposition for Bearing Fault Diagnosis. Mech. Syst. Signal Process. 2023, 188, 109002. [Google Scholar]
Zhang, Y.; Hu, L. Fault Propagation Inference Based on a Graph Neural Network for Steam Turbine Systems. Energies 2021, 14, 309. [Google Scholar] [CrossRef]
Wu, Y.; Liu, S. Graph Neural Network-Based Bearing Fault Diagnosis Using Granger Causality. Mech. Syst. Signal Process. 2022, 163, 108202. [Google Scholar]
Li, X.; Chen, Y. Deep Dynamic High-Order Graph Convolutional Network for Wear Fault Diagnosis. Mech. Syst. Signal Process. 2022, 171, 108908. [Google Scholar]

Figure 1. Structure of proposed health monitoring model.

Figure 2. Framework of MGT.

Figure 3. Structure of MGT fault observer.

Figure 4. CRF pump in a nuclear plant: (a) sea water cooling system; (b) CRF pump.

Figure 5. Fault of CRF pump: (a) fault section a of thrust bearing; (b) fault section b of thrust bearing.

Figure 6. Collected data from sensors: (a) data of temperature signals; (b) data of current signals.

Figure 7. Wasserstein distance of adjacency matrix.

Figure 8. Prediction of normal condition based on MGT: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 9. Residual error of normal condition based on MGT: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 10. Prediction of normal condition based on LSTM: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 11. Residual error of normal prediction based on LSTM: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 12. Prediction of normal condition based on CNN: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 13. Residual error of normal prediction based on CNN: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 14. Prediction of normal condition based on Deep AR: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 15. Residual error of normal prediction based on Deep AR: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 16. Prediction of normal condition based on TCN: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 17. Residual error of normal prediction based on TCN: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor104 MT₁; (d) results of sensor 142 MT₁.

Figure 18. Fault observation based on MGT: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Figure 19. Residual error of fault observation based on MGT: (a) results of sensor 102 MT₁; (b) results of sensor 103 MT₁; (c) results of sensor 104 MT₁; (d) results of sensor 142 MT₁.

Table 1. Detailed list of sensors.

No.	Code of Measuring Point	Location of Measuring Point	Unit
1	160 MT.H₁	The output thrust bearing of the gearbox	°C
2	001 MI	The current of the motor	A
3	002 MI	The current of the motor	A
4	162 MT.H₂	The top radial bearing of the motor	°C
5	164 MT.H₁	The thrust bearing of the motor	°C
6	101 MT₁	The upper radial bearing of the pump	°C
7	101 MT₂	The upper radial bearing of the pump	°C
8	102 MT₁	The upper radial bearing of the pump	°C
9	102 MT₂	The upper radial bearing of the pump	°C
10	103 MT₁	Pump thrust bearing	°C
11	103 MT₂	Pump thrust bearing	°C
12	104 MT₁	The thrust bearing of the pump	°C
13	104 MT₂	The thrust bearing of the pump	°C
14	142 MT₁	The lower radial bearing of the pump	°C
15	142 MT₂	The lower radial bearing of the pump	°C

Table 2. Details of the data.

Data Type	Date of Data	Length of Data
Quantitative analysis data	5 October–16 November	460,482
Training data	5 October–19 October	222,000
Validation data	20 October–25 October	70,000
Normal test data	5 November–16 November	140,082
Fault test data	1 December–8 December	85,000

Table 3. Comparison of evaluation indexes.

	MAE	MAPE	RMSE
LSTM	14.4626	1.3612	27.7123
CNN	26.4723	3.9569	39.5432
TCN	21.6907	2.6734	30.5073
Deep AR	1.2864	1.9765	4.2572
MGT	0.5614	1.2385	2.6554

Table 4. Training cost of each model.

Model	GPU Type	CPU Cores	Training Time/Hours	Max Memory Usage/GB
LSTM	NVIDA GeForce RTX 4090	16	10	32
CNN			8	28
TCN			14	35
Deep AR			12	30
MGT			9	29

Table 5. Generalizability of each model.

Model	Validation Loss	Test Loss	Training Loss	Cross-Validation Mean	Cross-Validation Std Dev
LSTM	0.45	0.50	0.40	0.47	0.02
CNN	0.40	0.45	0.35	0.42	0.03
TCN	0.48	0.52	0.43	0.49	0.02
Deep AR	0.42	0.47	0.38	0.44	0.03
MGT	0.38	0.43	0.34	0.40	0.02

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gao, J.; Ma, L.; Qing, C.; Zhao, T.; Wang, Z.; Geng, J.; Li, Y. A Health Monitoring Model for Circulation Water Pumps in a Nuclear Power Plant Based on Graph Neural Network Observer. Sensors 2024, 24, 4486. https://doi.org/10.3390/s24144486

AMA Style

Gao J, Ma L, Qing C, Zhao T, Wang Z, Geng J, Li Y. A Health Monitoring Model for Circulation Water Pumps in a Nuclear Power Plant Based on Graph Neural Network Observer. Sensors. 2024; 24(14):4486. https://doi.org/10.3390/s24144486

Chicago/Turabian Style

Gao, Jianyong, Liyi Ma, Chen Qing, Tingdi Zhao, Zhipeng Wang, Jie Geng, and Ying Li. 2024. "A Health Monitoring Model for Circulation Water Pumps in a Nuclear Power Plant Based on Graph Neural Network Observer" Sensors 24, no. 14: 4486. https://doi.org/10.3390/s24144486

APA Style

Gao, J., Ma, L., Qing, C., Zhao, T., Wang, Z., Geng, J., & Li, Y. (2024). A Health Monitoring Model for Circulation Water Pumps in a Nuclear Power Plant Based on Graph Neural Network Observer. Sensors, 24(14), 4486. https://doi.org/10.3390/s24144486

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Health Monitoring Model for Circulation Water Pumps in a Nuclear Power Plant Based on Graph Neural Network Observer

Abstract

1. Introduction

2. Methodology

2.1. Quantitative Analysis of Wasserstein Distance

2.2. Meta Graph Transformer

2.2.1. Embedding of Spatial–Temporal

2.2.2. Construction of Adjacency Matrix

2.2.3. Encoder

2.2.4. Decoder

2.3. MGT Fault Observer

3. Experimental Results and Analysis

3.1. Experimental Setup

3.1.1. Datasets

3.1.2. MGT Parameter Settings

3.1.3. Evaluation Metrics Settings

3.2. Evaluation of the Proposed Health Monitoring Model

3.2.1. Prediction of Normal Condition

3.2.2. Contrast of Other Methods

3.2.3. Construction of MGT Observer

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI