Coupling Fault Diagnosis Based on Dynamic Vertex Interpretable Graph Neural Network

Wang, Shenglong; Jing, Bo; Pan, Jinxin; Meng, Xiangzhen; Huang, Yifeng; Jiao, Xiaoxuan

doi:10.3390/s24134356

by Shenglong Wang, Bo Jing, Jinxin Pan, Xiangzhen Meng, Yifeng Huang and Xiaoxuan Jiao *

Aeronautics Engineering College, Air Force Engineering University, Xi’an 710038, China

* Author to whom correspondence should be addressed.

Sensors 2024, 24(13), 4356; https://doi.org/10.3390/s24134356

Submission received: 29 April 2024 / Revised: 2 July 2024 / Accepted: 4 July 2024 / Published: 4 July 2024

(This article belongs to the Special Issue Fault Diagnosis and Vibration Signal Processing in Rotor Systems)

Abstract: Mechanical equipment is composed of several parts, and the interaction between parts persists throughout the whole life cycle, leading to the widespread phenomenon of fault coupling. The diagnosis of independent faults alone cannot meet the requirements of the health management of mechanical equipment under actual working conditions. In this paper, the dynamic vertex interpretable graph neural network (DIGNN) is proposed to solve the problem of coupling fault diagnosis, in which dynamic vertices are defined in the data topology. First, in the data preprocessing phase, wavelet transform is utilized to make input features interpretable and reduce the uncertainty of model training. In the fault topology, edge connections are made between nodes according to the fault coupling information, and edge connections are established between dynamic nodes and all other nodes. Second, the data topology with dynamic vertices is used in the training phase; in the testing phase, the time series data are fed only into dynamic vertices for classification and analysis, which makes it possible to realize coupling fault diagnosis in an industrial production environment. The features extracted in different layers of DIGNN interpret how the model works. The proposed method diagnoses the independent faults in the dataset with an accuracy of 100% and effectively judges the coupling mode of coupling faults with a comprehensive accuracy of 88.3%.

Keywords:

coupling fault diagnosis; graph neural networks; interpretability; dynamic vertex

1. Introduction

A mechanical structure is affected by temperature, vibration, humidity, electromagnetic fields, shock, and other external stresses during operation. The external stresses act on a large number of components at the same time and affect their reliability synchronously [1]. Meanwhile, during operation there are internal stresses such as friction and collision between components [2], so the reliability of each component is influenced by the others. Under the same working conditions, cases of coupled multi-fault development are widespread in mechanical equipment [3,4]; furthermore, the coupling of faults may accelerate their development. Therefore, the accurate identification of coupling faults makes it possible to diagnose faults and warn of them in advance, providing an important reference for maintenance decisions, which has important application value [5].

Recently, scholars have made important contributions to research on the coupling fault diagnosis of rotating machinery. Some scholars have studied the frequency domain characteristics of vibration signals and classified multiple faults based on these characteristics. In coupling fault diagnosis, fault classification is usually carried out by decoupling, and bearing fault features are extracted. Mishra et al. collected bearing vibration signals under uncertain speed conditions and proposed the STFNet network to extract vibration features and classify faults at different rotating speeds [6]. Tao et al. proposed a cross-domain diagnosis method based on time–frequency domain information fusion, which uses the joint distribution distance to cluster fault features in the time–frequency domain [7]. Zhang et al. selected three indices insensitive to defect position as feature vectors for bearing fault diagnosis, and used the KNN algorithm for fault classification [8]. Dezun et al. proposed iterative generalized demodulation (IGD) to extract the characteristic frequency of instantaneous faults and carried out feature extraction and fault diagnosis for bearings with multiple faults and variable rotating speed [9]. Neisi et al. used UKF to estimate the vibration characteristics of rotating machinery to achieve data fitting and fault identification under multi-fault conditions [10]. Wang et al. selected the data channel by the Pearson correlation coefficient, combined it with the GRU method for multiple fault diagnosis, and judged the severity of the fault in a timely manner [11].

Some other scholars have used data-driven neural network methods to classify the multiple faults of rotating machinery. By using the self-consistency of the convergence model [12], the neural network arranges the chaotic model parameters through gradient descent to make them show functional consistency on a macro level, and is widely used in the research of the fault diagnosis of mechanical equipment. However, the black box attribute of the neural network makes the model difficult to interpret. Although the neural network performs strongly, this opacity reduces the reliability of the model, which hinders the engineering application of the neural network to some extent. In the study of the fault diagnosis of rotating machinery, Zhang et al. diagnosed multiple bearing faults by improving the pooling layer and fully connected layer structure of the CNN [13]. Gong et al. used the finite element simulation method to expand the imbalanced data of multiple bearing faults and used transfer learning to diagnose the faults [14]. Deng et al. proposed the MgNet model to diagnose multi-bearing faults at the system level by collecting vibration signals of the auxiliary bearings [15]. Shi et al. proposed the WDCNN method to classify bearing faults by introducing the multi-scale large convolutional kernel and GRU network, and divided the bearing life cycle into the health stage and fault stage [16]. Hadi et al. determined the structure of the fault diagnosis neural network through reinforcement learning and other methods, which further obscured the explainability of the model [17].

On the cutting edge of machine learning, generative large models are widely used to produce knowledge, but their reliability falls short of the high accuracy required in fault diagnosis and other engineering fields [18]. In order to improve the reliability of the model, scholars have carried out research on the interpretability of the model. In the study of interpretable neural networks, Zhang et al. [19] classified interpretable models according to three dimensions: the type of engagement (passive and active interpretation methods), the type of explanation, and the focus (from local to global explainability). Li et al. used the U-Net network and integrated signal segmentation technology to mine interpretable fault characteristic information to realize fault diagnosis [20]. Che et al. used the multi-domain element transfer learning framework based on deep convolutional neural networks to reverse reconstruct the fault classification results through the fusion of hierarchical class activation mapping to obtain the mapping relationship between the input and output, which is a global ante hoc interpretable model with feature decomposition [21]. The DIGNN model proposed in this paper is a global ante hoc interpretable model with rule extraction.

The graph neural network was proposed in 2009 [22]. Due to its unique network topology, it has the ability to represent graph data and analyze related data. Currently, it is mainly used in natural language processing [23,24], traffic flow analysis [25,26], molecular structure modeling [27,28], and other fields. According to its structure, it can be divided into two types: spatial-based and spectral-based. The spectral-based method uses the graph convolution filter to denoise the node data and extract the main features of the signal. Michael et al. [29] extended CNN to the field of graph signal processing, and established spectrum-based convolution kernels to carry out convolution operations, reducing the operational complexity. Defferrard et al. [30] improved the efficiency of graph coarsening when CNN was generalized to GNN by using a balanced binary tree to record the graph coarsening. The spatial-based method aggregates the information of the node neighborhood. Gama et al. [31] established the input time series by means of graph aggregation to transplant CNN to unstructured graph data.

Li et al. [32] first applied the graph neural network to fault diagnosis research and established relevant research benchmarks. This method converted single-channel time series data into a graph structure to realize the fault diagnosis of rotating machinery. However, the topology of the graph data constructed by this method lacks physical meaning and has poor interpretability. Gao et al. used GNN to conduct one-shot model training and diagnosis for bearing fault types under varying working conditions [33]. Man et al. [34] took the data collected by multiple sensors as the nodes in the graph, and used GAT to diagnose the fault of the train steering gear. Zhang et al. proposed a graph neural network (GNN) method based on the Granger causality test, which decomposed vibration signals into noise signals and fault signals. The graph data were established with nodes based on fault types to classify faults [35]. The methods mentioned above fix the topology of the graph data, which makes it difficult to adapt to the uncertain topology caused by real-time input data of unknown fault type in engineering applications. These methods run in black box mode, but they still fully demonstrate the excellent performance of the graph neural network in the field of fault diagnosis. In this paper, an interpretable DIGNN method is proposed based on the graph neural network, which can effectively utilize the characteristics of node correlation to realize the coupling fault diagnosis of rotating machinery. The main contributions of this paper are as follows:

(a): A topology construction method of dynamic vertex data for graph neural networks is proposed, which is suitable for topology-based correlation analysis.
(b): An explainable coupling fault diagnosis method is proposed, which gives physical meaning to the data-driven method based on the graph neural network.
(c): In this paper, the bearing coupling fault was analyzed as an example, and the test results show that the method can realize coupling component analysis on the basis of coupling fault diagnosis.

The rest of this paper is organized as follows. Section 2 describes an interpretable DIGNN method based on the graph neural network with the characteristic of node correlation for coupling fault diagnosis. In Section 3, the overall process of the DIGNN algorithm is described, and the pre-processing scheme of wavelet transform and the network structure of DIGNN are specifically described, and the interpretability of the model is analyzed. In Section 4, the dataset used in this paper is first introduced in Section 4.1, then the data are pre-processed using wavelet transform, and the graph data containing the fault topology are constructed in Section 4.2. Then, the method proposed in this paper is used to diagnose the rotating mechanical coupling fault compared with other benchmark methods in Section 4.3. The classification of model coupling faults is explained and analyzed. Finally, conclusions are drawn in Section 5.

2. Interpretability of Graph Neural Networks

In the fault diagnosis of independent fault modes, only the feature extraction of the single fault mode needs to be considered. Under multiple-fault coupling, however, there are correlation features between the coupled fault modes, and the traditional CNN cannot extract the implicit correlation between multiple faults. Graph neural networks are able to propagate relevant information between nodes through edges, thus providing interpretability to coupled fault diagnosis. Some notations used in this paper are shown in Table 1:

In graph $𝒢 = (𝒱, ε)$, $𝒱$ represents the vertices in the graph; $ε$ is the set of edges connecting the vertices, $ε = \{e_{ij} \mid x_{i}, x_{j} \in 𝒱\}$, $|ε| \leq N^{2}$. Let $A$ represent the symmetric adjacency matrix, $D$ denote the degree matrix, and $X \in R^{N \times d}$ be the input data. $N$ and $d$ are the number of nodes and the feature dimension, respectively. $L = D - A$ is the symmetric Laplacian matrix, and the Fourier basis $U$ is obtained by the spectral decomposition of $L$:

$$L = U Λ U^{-1} = U \operatorname{diag}[λ_{1}, \dots, λ_{N}] U^{-1} \tag{1}$$
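The decomposition in Equation (1) can be checked numerically. The following is a minimal NumPy sketch on a hypothetical 4-node path graph (the graph itself is an assumption made for illustration, not data from the paper); it builds $A$, $D$ and $L = D - A$ and verifies that the eigenvector matrix $U$ is orthogonal and reconstructs $L$.

```python
import numpy as np

# Hypothetical 4-node undirected path graph: 0 - 1 - 2 - 3 (illustration only)
A = np.array([
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
], dtype=float)

D = np.diag(A.sum(axis=1))  # degree matrix
L = D - A                   # Laplacian, L = D - A

# eigh is meant for symmetric matrices: real eigenvalues in ascending order,
# orthonormal eigenvectors stored as the columns of U (the Fourier basis)
eigvals, U = np.linalg.eigh(L)

# Equation (1): L = U diag(lambda) U^{-1}; U^{-1} equals U^T because U is orthogonal
L_rebuilt = U @ np.diag(eigvals) @ U.T
```

Because the example graph is connected, the smallest eigenvalue is zero up to rounding, which is a quick sanity check on the construction.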

The matrix obtained by spectral decomposition is orthogonal, $U U^{T} = E$, where $E$ is the identity matrix, so $L = U Λ U^{T}$. The Fourier transform of $x$ is $\hat{x} = U^{T} x$, and its inverse transformation is $x = U \hat{x}$. The orthogonal basis on the graph thus serves as the Fourier basis for the graph convolution operation:

$$\begin{aligned} x *_{𝒢} g &= U ((U^{T} g) ⊙ (U^{T} x)) \\ &= U g_{θ} U^{T} x \end{aligned} \tag{2}$$
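As a small illustration of Equation (2), the sketch below applies the graph Fourier transform and checks that the Hadamard-product form and the diagonal-matrix form of the convolution agree. The 5-node ring graph, the random signals, and the function names `graph_fourier_basis` and `spectral_conv` are assumptions chosen for the example.

```python
import numpy as np

def graph_fourier_basis(A):
    """Return the eigenvector matrix U of the Laplacian L = D - A."""
    D = np.diag(A.sum(axis=1))
    _, U = np.linalg.eigh(D - A)
    return U

def spectral_conv(U, x, g):
    """Graph convolution x *_G g = U ((U^T g) ⊙ (U^T x))."""
    x_hat = U.T @ x              # graph Fourier transform of the signal
    g_hat = U.T @ g              # graph Fourier transform of the filter
    return U @ (g_hat * x_hat)   # Hadamard product, then inverse transform

# Hypothetical 5-node ring graph
n = 5
A = np.zeros((n, n))
for i in range(n):
    A[i, (i + 1) % n] = A[(i + 1) % n, i] = 1.0

rng = np.random.default_rng(0)
x, g = rng.normal(size=n), rng.normal(size=n)

U = graph_fourier_basis(A)
y_hadamard = spectral_conv(U, x, g)
# Second line of Equation (2): the filter written as a diagonal matrix
y_matrix = U @ np.diag(U.T @ g) @ U.T @ x
```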

where $g_{θ} = \operatorname{diag}(U^{T} g) = g_{θ}(Λ)$ denotes the diagonal matrix and $⊙$ is the Hadamard product. The Chebyshev polynomial $T_{k}(x) = 2 x T_{k-1}(x) - T_{k-2}(x)$, with $T_{1}(x) = x$ and $T_{0}(x) = 1$, is introduced to approximate the filter $g_{θ}(Λ)$. Then, the ChebyNet expression is obtained, and the nodes in the $K$-hop neighborhood are aggregated as follows:

$$x *_{𝒢} g \approx \sum_{k=0}^{K-1} θ_{k} T_{k}(\tilde{L}) x \tag{3}$$
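The Chebyshev recursion in Equation (3) avoids an explicit eigendecomposition, since each term only needs one more multiplication by the rescaled Laplacian. A minimal sketch follows, assuming the usual rescaling $\tilde{L} = 2L/λ_{\max} - I_{N}$; the path graph, the coefficients `theta`, and the function name `cheb_conv` are hypothetical.

```python
import numpy as np

def cheb_conv(L, x, theta):
    """Evaluate sum_k theta[k] * T_k(L_tilde) @ x with the Chebyshev recursion."""
    n = L.shape[0]
    lam_max = np.linalg.eigvalsh(L).max()
    L_tilde = 2.0 * L / lam_max - np.eye(n)   # eigenvalues rescaled into [-1, 1]
    t_prev = x                                # T_0(L_tilde) x = x
    out = theta[0] * t_prev
    if len(theta) > 1:
        t_curr = L_tilde @ x                  # T_1(L_tilde) x = L_tilde x
        out = out + theta[1] * t_curr
        for k in range(2, len(theta)):
            # T_k = 2 L_tilde T_{k-1} - T_{k-2}, applied directly to the vector
            t_next = 2.0 * (L_tilde @ t_curr) - t_prev
            out = out + theta[k] * t_next
            t_prev, t_curr = t_curr, t_next
    return out

# Hypothetical 5-node path graph with an impulse on node 0
A = np.diag(np.ones(4), 1)
A = A + A.T
L = np.diag(A.sum(axis=1)) - A
x = np.zeros(5)
x[0] = 1.0
theta = np.array([0.5, -0.3])   # two coefficients
y = cheb_conv(L, x, theta)
```

With two coefficients, the impulse on node 0 only reaches its direct neighbor, node 1; all other entries of `y` stay zero, which shows the localized nature of the filter.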

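where $\tilde{L} = \frac{2}{λ_{\max}} L - I_{N}$ is the rescaled Laplacian, whose eigenvalues lie in $[-1, 1]$, and $λ_{\max}$ is the largest eigenvalue of $L$. Taking $K = 2$ and $λ_{\max} \approx 2$, with $L$ the symmetric normalized Laplacian $L = I_{N} - D^{-\frac{1}{2}} A D^{-\frac{1}{2}}$, simplifies ChebyNet further:

$$\begin{aligned} x *_{𝒢} g &\approx θ_{0} x + θ_{1} (L - I_{N}) x \\ &= θ_{0} x - θ_{1} D^{-\frac{1}{2}} A D^{-\frac{1}{2}} x \end{aligned} \tag{4}$$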

Defining $θ = θ_{0} = -θ_{1}$, then $x *_{𝒢} g \approx θ (I_{N} + D^{-\frac{1}{2}} A D^{-\frac{1}{2}}) x$. Let $\tilde{A} = A + I_{N}$ and ${\tilde{D}}_{ii} = \sum_{j} {\tilde{A}}_{ij}$, and replace $I_{N} + D^{-\frac{1}{2}} A D^{-\frac{1}{2}}$ with ${\tilde{D}}^{-\frac{1}{2}} \tilde{A} {\tilde{D}}^{-\frac{1}{2}}$, so the GCN interlayer formula [36] is obtained as follows:

$$X^{l+1} = σ ({\tilde{D}}^{-\frac{1}{2}} \tilde{A} {\tilde{D}}^{-\frac{1}{2}} X^{l} W^{l}) \tag{5}$$
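A single GCN layer of the form in Equation (5) can be sketched in a few lines of NumPy. This is an illustrative sketch rather than the authors' implementation: it uses the self-loop adjacency $\tilde{A} = A + I_{N}$ with its degree matrix $\tilde{D}$, ReLU is assumed for $σ$, and the function name `gcn_layer`, the star graph, and the random weights are hypothetical.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def gcn_layer(A, X, W, activation=relu):
    """One GCN layer: activation(D~^{-1/2} A~ D~^{-1/2} X W), with A~ = A + I."""
    A_tilde = A + np.eye(A.shape[0])          # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(A_tilde.sum(axis=1))
    # Symmetric normalization: entry (i, j) is divided by sqrt(d_i * d_j)
    A_hat = d_inv_sqrt[:, None] * A_tilde * d_inv_sqrt[None, :]
    return activation(A_hat @ X @ W)

# Hypothetical star graph: node 0 is linked to nodes 1 and 2
A = np.array([[0, 1, 1],
              [1, 0, 0],
              [1, 0, 0]], dtype=float)

rng = np.random.default_rng(0)
X = rng.normal(size=(3, 4))    # 3 nodes, 4 input features
W = rng.normal(size=(4, 2))    # maps 4 features to 2
X_next = gcn_layer(A, X, W)    # shape (3, 2)
```

Passing identity features, identity weights, and an identity activation returns the normalized adjacency itself, which is a convenient way to inspect the aggregation weights.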

where $σ$ is the nonlinear mapping function, $W^{l}$ is the weight matrix of the $l$-th layer, and $X^{l}$ is the input variable of the $l$-th layer ($X^{0} = X$). $\tilde{A} = A + I_{N}$ is the adjacency matrix with node self-information added, and the aggregation of node information in the one-hop neighborhood is realized by the symmetric normalization of the adjacency matrix, ${\tilde{D}}^{-\frac{1}{2}} \tilde{A} {\tilde{D}}^{-\frac{1}{2}}$. Through $k$ layers of the GCN network, the information of vertices within the $k$-hop neighborhood is aggregated. The symmetric normalization governs how information is transmitted between nodes: it not only avoids the difference in data measurement scale caused by differences in node degree, but also accounts for the amount of information at both nodes connected by an edge [37].

The traditional GCN algorithm uses a data-driven approach to build data topology, which can be modified when information is transferred between layers, as shown in Figure 1.

3. Algorithm Flow

3.1. Data Preprocessing

In this paper, the coupling fault was represented by a node in the graph, and the coupling topology was constructed by connecting the coupling fault node with the single fault nodes. In order to enable the diagnosis and analysis of new data, this paper also took the fault to be classified as a node in the graph, called a dynamic vertex, and constructed edges between this node and the other nodes. Under actual conditions, bearings and gear structures often work at the same time, and their vibration signals affect each other, as illustrated in Figure 2. Comparing the bearing data doped with gear signals in the XJTU Gearbox dataset with the pure bearing vibration signals in the XJTU-SY Bearing datasets shows that the rotation of gears generates high-frequency signals, which contaminate the bearing characteristics. Therefore, this paper also took the gear faults as nodes to provide negative samples for the fault graph and reduce the impact of gear faults on the fault diagnosis results [38].

For rotating machinery, the vibration signal characteristics are mainly determined by the vibration direction, rotation frequency, and vibration amplitude. In order to improve the interpretability of the method and reduce the uncertainty of the data-driven training of graph neural networks, this paper adopted wavelet transform to preprocess the data [39]. The Morlet wavelet basis was used to transform the data; acting as a bandpass filter, it has good time–frequency localization characteristics and can accurately locate the frequency range. The expression is as follows:

$$ψ(t) = \exp(i ω_{0} t) \exp(-\frac{t^{2}}{2}) \tag{6}$$

By dilating and translating with different scales $a$ and displacements $b$, the wavelet family is obtained:

$$ψ_{a,b}(t) = {|a|}^{-\frac{1}{2}} ψ(\frac{t-b}{a}) = {|a|}^{-\frac{1}{2}} \exp(\frac{i ω_{0} (t-b)}{a}) \exp(-\frac{{(t-b)}^{2}}{2 a^{2}}) \tag{7}$$

where $a_{i} = 2 f_{ψ(t)} \cdot totalscale / i$, in which $f_{ψ(t)}$ is the center frequency of the wavelet $ψ(t)$ and $totalscale$ is the number of scales. Afterwards, the input signal $x$ is transformed into a two-dimensional signal ($t$, $a_{i}$, $value$), in which $value = \operatorname{abs}(WT(a, b))$. The formula of the continuous wavelet transform is:

$$\begin{aligned} WT(a, b) &= \int_{R} x(t) ψ_{a,b}(t) dt \\ &= \int_{R} x(t) {|a|}^{-\frac{1}{2}} ψ(\frac{t-b}{a}) dt \end{aligned} \tag{8}$$
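To make the transform concrete, the following sketch evaluates the integral in Equation (8) by direct summation with the Morlet wavelet of Equation (6). It is a slow but transparent illustration, not the preprocessing code used in the paper: the values $ω_{0} = 6$, the conventional `abs(a) ** -0.5` normalization, the 50 Hz test tone, and the function names are all assumptions of this example.

```python
import numpy as np

def morlet(t, omega0=6.0):
    """Morlet wavelet psi(t) = exp(i * omega0 * t) * exp(-t^2 / 2)."""
    return np.exp(1j * omega0 * t) * np.exp(-t**2 / 2.0)

def cwt_morlet(x, fs, scales, omega0=6.0):
    """Return |WT(a, b)| with shape (len(scales), len(x)) by direct summation."""
    n = len(x)
    t = np.arange(n) / fs
    dt = 1.0 / fs
    out = np.empty((len(scales), n))
    for i, a in enumerate(scales):
        # Rows index the shift b, columns index the integration variable t
        arg = (t[None, :] - t[:, None]) / a
        psi_ab = np.abs(a) ** -0.5 * morlet(arg, omega0)
        out[i] = np.abs((psi_ab @ x) * dt)   # Riemann sum of x(t) * psi_ab(t)
    return out

# Synthetic 50 Hz tone sampled at 1 kHz (illustration only)
fs = 1000.0
t = np.arange(512) / fs
x = np.sin(2.0 * np.pi * 50.0 * t)

# Scales whose Morlet center frequency omega0 / (2 * pi * a) is 25, 50 and 100 Hz
freqs = np.array([25.0, 50.0, 100.0])
scales = 6.0 / (2.0 * np.pi * freqs)
tf_map = cwt_morlet(x, fs, scales)           # shape (3, 512)
```

In the middle of the record, the row of `tf_map` that corresponds to 50 Hz carries the largest magnitude, which is the frequency localization the preprocessing step relies on. A library routine such as the continuous wavelet transform in PyWavelets would normally be preferred for long signals.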

The data topology is constructed according to prior knowledge: the fixed vertices are the standard wavelet-transform features of each type of fault, the dynamic vertices are the data to be measured, and together they form the graph data $X$.

3.2. Coupling Fault Diagnosis

The data were divided into the training set and the test set, and the partition ratio was 8:2. In this paper, the cross-entropy error was used as the error calculation method; the label $y_{label} \in R$ was converted into the one-hot code $y \in R^{N}$ and used to calculate the cross-entropy with the model output $\hat{y} \in R^{N}$:

$$C(y, \hat{y}) = -\frac{1}{n} \sum_{x} [y \ln \hat{y} + (1-y) \ln(1-\hat{y})] \tag{9}$$

The loss function consists of two parts, $Loss1$ and $Loss2$, where $Loss1$ is the classification error of the fault type to be diagnosed and $Loss2$ is the classification error of the fault types of the other nodes. The total loss function is as follows:

$$\begin{aligned} Loss &= Loss1 + α Loss2 \\ &= C(y_{N}, {\hat{y}}_{N}) + α \sum_{i=1}^{N-1} C(y_{i}, {\hat{y}}_{i}) \end{aligned} \tag{10}$$
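The weighted loss in Equations (9) and (10) can be sketched as follows. This is a minimal illustration under stated assumptions: the average in Equation (9) is taken over the entries of one node's one-hot vector, $α = 0.5$ is an arbitrary value below 1, the last row is treated as the dynamic vertex, and the names `cross_entropy` and `dignn_loss` are hypothetical.

```python
import numpy as np

def cross_entropy(y, y_hat, eps=1e-12):
    """Equation (9) for one node: mean binary cross-entropy over the class entries."""
    y_hat = np.clip(y_hat, eps, 1.0 - eps)   # avoid log(0)
    return -np.mean(y * np.log(y_hat) + (1.0 - y) * np.log(1.0 - y_hat))

def dignn_loss(Y, Y_hat, alpha=0.5):
    """Equation (10): dynamic-vertex loss plus alpha times the loss of the other nodes."""
    loss1 = cross_entropy(Y[-1], Y_hat[-1])   # last row = dynamic vertex
    loss2 = sum(cross_entropy(Y[i], Y_hat[i]) for i in range(len(Y) - 1))
    return loss1 + alpha * loss2

# Toy example with 3 nodes and 3 classes (synthetic numbers, not experimental data)
Y = np.eye(3)
good = 0.9 * np.eye(3) + 0.05 * (1.0 - np.eye(3))

wrong_dynamic = good.copy()
wrong_dynamic[2] = [0.9, 0.05, 0.05]   # dynamic vertex misclassified

wrong_fixed = good.copy()
wrong_fixed[0] = [0.05, 0.9, 0.05]     # a fixed vertex misclassified

loss_good = dignn_loss(Y, good)
loss_wrong_dynamic = dignn_loss(Y, wrong_dynamic)
loss_wrong_fixed = dignn_loss(Y, wrong_fixed)
```

With $α < 1$, the same mistake costs more on the dynamic vertex than on a fixed vertex, so `loss_wrong_dynamic` exceeds `loss_wrong_fixed`.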

Since the model focuses on the classification of unknown faults, we set $α < 1$ so that the classification accuracy of the dynamic vertex was ensured first.

3.3. Algorithm Flow

The flow of the algorithm proposed in this paper is shown in Figure 3.

(a): The input data $X^{0} = X = (x_{1}, x_{2}, \dots, x_{N})$, $X^{0} \in R^{N \times d}$, are composed of the node features $x_{n} \in R^{d}$ of $N$ types of faults, where $d$ is the feature dimension. The $N$-th node is the dynamic vertex (i.e., the fault node to be diagnosed).
(b): After $X^{0}$ is transformed by wavelet, $X^{1} = Wavelet(X^{0})$, $X^{1} \in R^{N \times f \times S}$, where $f$ is the number of frequency components and $S$ is the size of the wavelet scale. The wavelet transform raises the dimension of the one-dimensional data, endows the data with more intuitive features, and at the same time serves as data preprocessing, which is conducive to the subsequent feature extraction of the neural network.
(c): After the two-dimensional spectrum data are convolved with $f$ convolution kernels, $X^{2} \in R^{N \times f}$ is obtained. Through model training, the CNN extracts the feature value in $X^{2}$ at each frequency and standardizes it. In the CNN stage, each node is convolved independently; that is, no data information from other nodes is involved.
(d): Through the two-layer GCN network, $X^{3} = GCN^{1}(X^{2})$ and $X^{4} = GCN^{2}(X^{3})$ are obtained, where $X^{3} \in R^{N \times \frac{f}{2}}$ and $X^{4} \in R^{N \times \frac{f}{4}}$. The high-frequency and low-frequency features are further extracted by successive halving, which is similar to the wavelet packet decomposition technology [40]. At the same time, the specific features of each type of fault are extracted through model training.
(e): The fully connected MLP further reduces the dimensionality of the data, $X^{5} = MLP(X^{4})$, $X^{5} \in R^{N \times \frac{f}{8}}$. Finally, the fault classification $\hat{Y}$ is output through the Softmax layer.
(f): In the process of model training, the output value ${\hat{y}}_{N}$ of the last layer of node $x_{N}$ is compared with the training label to optimize the model parameters. After the model is established, node $x_{N}$ plays the same role as the other nodes in the model operation and is only output independently in the model output phase.
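The tensor shapes in steps (a) to (e) can be traced with a toy forward pass. The sketch below is only a shape walk-through under several assumptions: the sizes are far smaller than in the experiments, random arrays stand in for the wavelet features and trained weights, the per-node reduction over the scale axis is a simplified stand-in for the CNN, and the final projection `W_out` to nine classes is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
N, f, S, n_classes = 10, 16, 8, 9   # toy sizes, chosen for illustration

def normalized_adjacency(A):
    """Symmetric normalization of an adjacency matrix that already has self-loops."""
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
    return d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]

def relu(z):
    return np.maximum(z, 0.0)

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)   # shift for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

# Assumed topology: self-loops plus a dynamic vertex (last node) linked to every node
A = np.eye(N)
A[-1, :] = 1.0
A[:, -1] = 1.0
A_hat = normalized_adjacency(A)

# (b) stand-in for the wavelet features X1 with shape (N, f, S)
X1 = rng.normal(size=(N, f, S))

# (c) each node is reduced independently over the scale axis, then standardized
kernels = rng.normal(size=(f, S))
X2 = np.einsum("nfs,fs->nf", X1, kernels)
X2 = (X2 - X2.mean(axis=1, keepdims=True)) / X2.std(axis=1, keepdims=True)

# (d) two GCN layers halve the feature dimension each time
W1 = rng.normal(size=(f, f // 2))
W2 = rng.normal(size=(f // 2, f // 4))
X3 = relu(A_hat @ X2 @ W1)
X4 = relu(A_hat @ X3 @ W2)

# (e) MLP, then an assumed projection to class scores and Softmax
W3 = rng.normal(size=(f // 4, f // 8))
W_out = rng.normal(size=(f // 8, n_classes))
X5 = relu(X4 @ W3)
Y_hat = softmax(X5 @ W_out)

y_hat_dynamic = Y_hat[-1]   # output of the dynamic vertex, used for diagnosis
```

Only the two GCN steps mix information across nodes; every other step acts on each row independently, matching the remark in step (c).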

3.4. Interpretability Analysis

Explainability is defined as the ability to explain to people in plain language. A graph neural network inherently has distinct physical significance because of the topology of its nodes. The interpretability of the structure is made clear by assigning the input data to the nodes according to their relations. The DIGNN model proposed in this paper explores and optimizes the interpretability of the model in the specific application of coupling fault diagnosis, so this model is an interpretable model [41].

(a): Each node in $𝒢$ is a type of fault data, and the topological structure maintains the input structure from beginning to end. Each node has a clear physical meaning, and each node in the output data $\hat{Y}$ corresponds to the classification of various types of fault data.
(b): The vibration signal is converted into the time–frequency domain signal by wavelet transform, which provides the data with a clear physical meaning. Due to the introduction of data topology, the physical meaning of each vertex data $X^{1}, X^{2}, X^{3}, X^{4}, X^{5}$ remains stable, even if the data dimension is changed under the condition that the GCN network structure remains unchanged, and all of them are linear transformations of the vibration amplitude of this type of fault at a specific frequency.
(c): The similar characteristics of similar faults in coupling faults are enhanced by aggregation operation:

$$x_{n} = \operatorname{aggregation}(N\{x_{n}\}) \tag{11}$$

where $N\{x_{n}\} = \{x_{j} \mid e_{nj} \in ε\}$, and the fault characteristic information of related nodes is aggregated at the target vertex by the GCN. The nodes at both ends of an edge are jointly considered by the symmetric standardization operation ${\tilde{D}}^{-\frac{1}{2}} \tilde{A} {\tilde{D}}^{-\frac{1}{2}}$. The degree of the coupling fault node is relatively large because it is related to multiple fault nodes, and the symmetric standardization reduces the weight of such nodes in the aggregation, effectively avoiding the influence of unrelated fault type data on the aggregation of adjacent nodes.
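A short worked example shows how the symmetric standardization down-weights high-degree nodes. The degrees used here are hypothetical and assume a topology with self-loops in which a single bearing fault node is linked to the coupling fault node and the dynamic vertex (degree 3), the coupling fault node is linked to three single faults and the dynamic vertex (degree 5), and the dynamic vertex is linked to all ten nodes (degree 10). Writing $\tilde{d}_{n}$ for the degree of node $n$, the weight on the edge between nodes $n$ and $j$ is $1/\sqrt{\tilde{d}_{n} \tilde{d}_{j}}$:

```latex
\begin{aligned}
w_{\text{single},\,\text{single}}   &= \frac{1}{\sqrt{3 \cdot 3}}  = \frac{1}{3} \approx 0.333 \\
w_{\text{single},\,\text{coupling}} &= \frac{1}{\sqrt{3 \cdot 5}}  \approx 0.258 \\
w_{\text{single},\,\text{dynamic}}  &= \frac{1}{\sqrt{3 \cdot 10}} \approx 0.183
\end{aligned}
```

Under these assumed degrees, a single fault node keeps the largest weight on its own features, while the contribution of the highly connected dynamic vertex is the smallest of the three.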

4. Dataset Introduction and Data Processing

4.1. Dataset Introduction

In this paper, the data in the XJTU Gearbox dataset were used. Four planetary gear failure modes and four bearing failure modes were injected in the experiment. As shown in Figure 4, the injected gear failures included tooth wear, missing tooth, root cracks, and broken tooth. The injected bearing faults included ball faults, inner race faults, outer race faults, and coupling faults of the above three bearing faults. Together with the normal state, vibration signals were collected for a total of nine states. The fault relationship topology is shown in Figure 5.

In the experiment, transverse and longitudinal vibration sensors were installed to collect the state data. In this paper, radial and vertical vibration signals were selected for processing, and the sampling frequency was 20.48 kHz.

4.2. Data Preprocessing

The data were preprocessed by wavelet transform before being input into the neural network. The time–frequency domain signals of the bearings and gears were mainly concentrated below 8000 Hz, so the frequency band from 0 to 8000 Hz was selected in this paper; the time–frequency domain signals of the various faults are shown in Figure 6.

It can be seen from Figure 6 that fault coupling is not a simple feature superposition, but contains a complex physical mechanism. In this paper, the data-driven graph neural network method was used to simulate the physical mechanism and achieve multi-fault coupling analysis.

After analyzing the fault types, the fault data topology was established according to the coupling relationship between faults. The graph structure established in this paper was an undirected graph containing self-loops, as shown in Figure 7. The data topology was composed of 10 nodes, $𝒱 = \{x_{1}, x_{2}, \dots, x_{10}\}$. Fault data were randomly selected from the dataset and filled into the dynamic vertex, corresponding to the orange node in the figure, to form a training set; $x_{4}$ is the coupled fault data, and the other nodes are independent fault types. The data topology of the graph consisted of 34 edges $e_{i,j} \in ε$, and the dynamic vertex was assumed to be related to all vertices because its type and its correlation with the other vertices are unknown (i.e., $e_{1,10}, e_{2,10}, \dots, e_{10,10} = 1$).
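One adjacency matrix consistent with this description can be sketched as follows. The layout is an assumption, since the figure is not reproduced here: the three single bearing faults are placed at indices 0 to 2, the coupled fault $x_{4}$ at index 3, and the dynamic vertex $x_{10}$ at index 9. Under that assumption, counting each undirected edge in both directions plus the ten self-loops gives 34 nonzero entries, matching the stated edge count.

```python
import numpy as np

N = 10
BEARING_SINGLE = [0, 1, 2]   # assumed positions of the ball, inner race and outer race faults
COUPLED = 3                  # x_4 in the text (0-based index 3)
DYNAMIC = 9                  # x_10, the dynamic vertex

A = np.eye(N, dtype=int)     # self-loops on every node

# The coupled fault is linked to each of its component faults (undirected)
for i in BEARING_SINGLE:
    A[i, COUPLED] = A[COUPLED, i] = 1

# The dynamic vertex is linked to every node, since its fault type is unknown
A[DYNAMIC, :] = 1
A[:, DYNAMIC] = 1

n_edges = int(np.count_nonzero(A))   # directed entries, self-loops included
```

The gear fault nodes and the normal state are connected only to themselves and to the dynamic vertex in this sketch, so they act as isolated reference classes for the bearing faults.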

In the training set data topology, the data $X$, except for the dynamic vertex $x_{10}$, were randomly sampled from the data samples of the corresponding fault type. In order to reduce the difficulty of test set creation and application, this paper took the average value of each kind of known fault data after wavelet transform as the fixed vertices of the test set, $\{x_{1}, x_{2}, \dots, x_{9}\}$.
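Assembling one test graph from class averages and a new measurement can be sketched as below. The function name `build_test_graph`, the dictionary layout of the reference data, and the small random arrays are hypothetical; they stand in for the wavelet features of the nine known states.

```python
import numpy as np

def build_test_graph(class_features, sample):
    """Stack the per-class mean features (fixed vertices) and one sample (dynamic vertex).

    class_features: dict mapping class id -> array of shape (n_i, f, S)
    sample: array of shape (f, S), the data to be diagnosed
    Returns an array of shape (n_classes + 1, f, S); the last row is the dynamic vertex.
    """
    fixed = [class_features[c].mean(axis=0) for c in sorted(class_features)]
    return np.stack(fixed + [sample], axis=0)

# Synthetic stand-in data: 9 known states, 5 reference samples each, features of shape (4, 3)
rng = np.random.default_rng(0)
class_features = {c: rng.normal(loc=c, size=(5, 4, 3)) for c in range(9)}
new_sample = rng.normal(size=(4, 3))

X_graph = build_test_graph(class_features, new_sample)   # shape (10, 4, 3)
```

Because the fixed vertices are computed once, diagnosing a new measurement only requires replacing the last row of the graph.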

After wavelet transform, the data of each fault type were clearly separated in the data space, as shown in Figure 8, which helped to reduce the influence of the uncertainty brought by the graph neural network. In the fault space, the coupling fault lay relatively close to its component fault types, reflecting the coupling effect between faults. An obvious boundary between the bearing and gear faults was obtained by wavelet transform alone.

4.3. Coupling Fault Diagnosis

First, the model proposed in this paper was used for coupling fault diagnosis of the data. The training set contained 540 samples, the validation set contained 180 samples, and the test set contained 180 samples. In each sample, which can be called a graph cluster, $x_{10}$ was selected randomly from nine fault types. The model adopted the SGD optimization method with a learning rate of 0.01 and a momentum of 0.9. A convolutional neural network with a 100 × 1024 × 1024 filter was used in series with two GCN layers. The filter size in the first GCN layer was 1024 × 512, and the second GCN layer contained a 512 × 256 filter.

Batch normalization further standardized the time-domain information in the output of each layer of the network [42]. By adding the BN layer after the two GCN layers, the operational efficiency and generalization of the neural network improved: the network accuracy increased from 87.78% to 100%, the training time per step decreased from 87 s to 9.17 s, and the convergence speed of the algorithm improved greatly, as shown in Figure 9. Each step performed serial operations on seven graph data, and gradient descent was applied after each graph was classified.

Several models were used to diagnose coupling faults, including ChebyNet, GCN, DVChebyNet, DVGraphSAGE, DVHOGNN, and the DIGNN proposed in this article. Except for ChebyNet and GCN, the models used a wavelet transform operation and a 100 × 1024 × 1024 CNN layer to convert the time–frequency domain data matrix into a one-dimensional vector. These models used different graph neural network layers, while their network structure hyperparameters remained consistent. The accuracy of model diagnosis is shown in Figure 10. Because it convolves the input signal in the frequency domain, ChebyNet converged faster on the original data without wavelet transform. However, its accuracy fell well short of that of the models preprocessed with wavelet transform. The GCN algorithm is a spatial domain graph neural network algorithm that has strong model fitting ability, but the time series data need to be iterated many times to discover the information contained in them. The graph neural network with wavelet transform can optimize the model input, avoid the uncertainty caused by a purely data-driven approach, and integrate its powerful model fitting ability to improve the accuracy of fault diagnosis. The DIGNN algorithm proposed in this paper had the highest accuracy in the validation set and showed great robustness in the test set. Additionally, it converged fastest among the benchmark models, as shown in Table 2.

The graph neural network extracts the correlation information of the coupled fault components. The graph dataset is input into the trained model, and the fault features are obtained from the output of the first GCN layer and the output of the second GCN layer, respectively. The visualization effect is shown in Figure 11. In the output of the first GCN layer, as shown in Figure 11a, compared with Figure 8, the normal state is at the coordinate zero of the feature space. However, the correlation between the coupling fault and its fault components is unclear. The boundary between the bearing fault and gear fault and the fault coupling mode are more clearly visualized in the second layer GCN output, as illustrated in Figure 11b.

For bearings with coupling faults, 100 coupling fault samples were selected, and their output after passing through the Softmax layer of the DIGNN network is shown in Figure 12. The model first ensures the accuracy of the coupling fault classification; then, benefiting from the correlation analysis capability of the graph neural network, it mines the fault coupling information, and the fault coupling mode is revealed in the classification results. For each coupling fault sample, blue indicates that the coupling degree of that fault type is low, while other colors indicate that the fault type is highly coupled. The coupling fault is mainly reflected in the inner ring fault, followed by the ball and outer ring faults, and coupling with the gear faults is largely avoided.

Take the largest four items in the coupling fault classification vector $\hat{Y}$ among 100 fault coupling samples, and define the coupling fault diagnosis accuracy rate as $A_{n} = F_{n} / N_{X}$, where $F_{n}$ is the number of occurrences of each coupling fault, and $N_{X}$ is the number of samples (100 in this case). Then, except for the coupling fault $y_{3}$, the classification accuracy rate of the other three types of coupling components is shown in Table 3. The comprehensive diagnosis accuracy is defined as $\bar{A} = \sum A_{n} / n$, where $n$ is the number of coupling fault components, and the comprehensive diagnosis accuracy of coupling fault in the dataset proposed in this paper was calculated at 88.3%, which can provide a reference for the analysis of coupling fault components.
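The arithmetic behind the 88.3% figure can be checked with a short script using the values reported in Table 3. The variable and function names are assumptions for illustration, and each percentage is treated as a count because $N_{X} = 100$.

```python
# Occurrences per component as 1st, 2nd, and 3rd most obvious fault (Table 3).
rank_counts = {
    "inner": [100, 0, 0],
    "ball": [0, 94, 6],
    "outer": [0, 6, 59],
}
n_samples = 100  # N_X


def component_accuracy(counts, n_samples):
    """A_n = F_n / N_X, with F_n the occurrences summed over the ranks."""
    return sum(counts) / n_samples


accuracies = {
    name: component_accuracy(counts, n_samples)
    for name, counts in rank_counts.items()
}

# Comprehensive accuracy: mean of A_n over the n coupling fault components.
comprehensive = sum(accuracies.values()) / len(accuracies)

print(accuracies)
print(f"{100 * comprehensive:.1f}%")
```

The per-component values come out as 100%, 100%, and 65%, and their mean is 88.3%, matching the last two rows of Table 3.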

5. Discussion and Conclusions

This paper presented a coupling fault diagnosis method based on a graph neural network. Through wavelet transform, the one-dimensional vibration signal was transformed into a two-dimensional time–frequency domain signal, which reduces the uncertainty introduced by a purely data-driven neural network. A fault coupling topology was established, and the coupling fault was diagnosed with the graph neural network. The proposed method achieved 100% diagnosis accuracy on single faults and supports the mining of fault coupling information. It provides an interpretable data-driven method for fault coupling analysis, but the fault coupling dataset was limited, and more experiments are needed to verify the robustness of the proposed method.

Author Contributions

Conceptualization, S.W. and B.J.; Methodology, S.W.; Validation, X.J., Y.H. and X.M.; Investigation, J.P. and X.J.; Writing—original draft preparation, S.W.; Writing—review and editing, S.W.; Supervision, B.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the “14th Five-Year Plan” equipment pre-research common technology (grant number JZX7Y20220302001701).

Data Availability Statement

Data is unavailable due to privacy.

Conflicts of Interest

The authors declare no conflicts of interest.


Figure 1. Flowchart of a traditional GCN algorithm.

Figure 2. Vibration signal spectrum of XJTU Bearing datasets and XJTU Gearbox datasets.

Figure 3. Flowchart of the DIGNN algorithm with dynamic vertex. Each color represents a fault type.

Figure 4. Fault injection experiment platform. 1. Motor, 2. Controller, 3. Bearing, 4. Gearbox, 5. Loading, 6. Vertical accelerometer, 7. Horizontal accelerometer.

Figure 5. Topology of the coupling faults.

Figure 6. Time–frequency feature extraction of the fault data.

Figure 7. Topology and relationship of the coupling fault data with self-loops.

Figure 8. Fixed vertex feature in feature space.

Figure 9. Accuracy of fault diagnosis with 0 layers, 1 layer, 2 layers of BN, respectively.

Figure 10. Comparison of coupling fault diagnosis accuracy between ChebyNet, GCN, WLChebyNet, WLGraphSAGE, WLHOGNN, and DIGNN.

Figure 11. Visualization of the coupling fault characteristics. (a) Feature space after the first GCN layer; (b) feature space after the second GCN layer.

Figure 12. Main components of coupling faults based on the dynamic vertex output vector. Faults that may not be coupling factors are indicated in blue, and the three most possible coupling faults are highlighted in other colors.

Table 1. Commonly used notations.

Notations	Descriptions
$𝒢$	Graph data
$𝒱$	Vertex set
$ε$	Edge set
$A$	Adjacency matrix
$\tilde{A}$	Adjacency matrix with self-loops
$D$	Degree matrix
$*_{𝒢}$	Graph convolution operation
$⊙$	Hadamard product
$L$	Laplacian matrix
$U$	Fourier basis matrix of $L$
$Λ$	Eigenvalue matrix of $L$
$λ$	Eigenvalue, a diagonal entry of $Λ$
$X$	Feature matrix of a graph
$x$ , $g$	The feature vector of a graph
$y$	True label of $x$
$\hat{y}$	Predicted label of $x$
$g_{θ}$	Filter parameterized by $θ$
$T$	Chebyshev polynomial coefficients
$σ$	Nonlinear activation function
$θ$ , $W$	Learnable model parameters
$ψ$	Family of wavelets
$α$	Weight of the loss function $Loss2$
$A_{n}$	Diagnosis accuracy of coupled fault $n$
$\bar{A}$	Comprehensive diagnosis accuracy of coupling faults

Table 2. Comparison of the coupling fault diagnosis performance.

Models	Validation Accuracy	Test Accuracy	Steps to Convergence
ChebyNet	90.56%	82.78%	7
GCN	98.33%	98.89%	/
WLChebyNet	100%	96.67%	29
WLHOGNN	100%	99.45%	10
WLGraphSAGE	100%	97.8%	8
DIGNN	100%	100%	2

Table 3. Accuracy of the diagnosis of main components constituting coupling fault.

Fault Mode	Inner	Ball	Outer
1st obvious fault	100%	0	0
2nd obvious fault	0	94%	6%
3rd obvious fault	0	6%	59%
Coupling fault diagnosis accuracy	100%	100%	65%
Comprehensive accuracy	88.3%


© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
