Next Article in Journal
Assessment of the Smartpill, a Wireless Sensor, as a Measurement Tool for Intra-Abdominal Pressure (IAP)
Previous Article in Journal
Investigation of the Effect of Debris Position on the Detection Stability of a Magnetic Plug Sensor Based on Alternating Current Bridge
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Graph Multi-Scale Permutation Entropy for Bearing Fault Diagnosis

1
School of Mechanical Engineering, Sichuan University, Chengdu 610017, China
2
School of Engineering, University of Birmingham, Birmingham B152TT, UK
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(1), 56; https://doi.org/10.3390/s24010056
Submission received: 2 November 2023 / Revised: 6 December 2023 / Accepted: 12 December 2023 / Published: 21 December 2023
(This article belongs to the Section Fault Diagnosis & Sensors)

Abstract

:
Bearing faults are one kind of primary failure in rotatory machines. To avoid economic loss and casualties, it is important to diagnose bearing faults accurately. Vibration-based monitoring technology is widely used to detect bearing faults. Graph signal processing methods promising for the extraction of the fault features of bearings. In this work, graph multi-scale permutation entropy (MPEG) is proposed for bearing fault diagnosis. In the proposed method, the vibration signal is first transformed into a visibility graph. Secondly, a graph coarsening method is used to generate coarse graphs with different reduced sizes. Thirdly, the graph’s permutation entropy is calculated to obtain bearing fault features. Finally, a support vector machine (SVM) is applied for fault feature classification. To verify the effectiveness of the proposed method, open-source and laboratory data are used to compare conventional entropies and other graph entropies. Experimental results show that the proposed method has higher accuracy and better robustness and de-noising ability.

1. Introduction

Bearings play an important role in many machines, such as aero-engines, rail vehicle bogies, wind-driven generators, etc., for transmitting speed and carrying radical load. The working conditions of bearings are harsh, i.e., variable load and speed, which results in bearing failures occurring easily. Bearing faults may lead to shutdown of the whole machine, causing economic loss and even casualties. To avoid tragedies, many condition monitoring technologies, such as vibration-based detection, acoustic emission-based detection, etc., have been used to detect early faults in bearings. Vibration signals have rich physical features including fault information and hence vibration-based monitoring technology has been widely used in bearing fault diagnosis.
The main bearing fault diagnosis methods which use vibration signals can be divided into two types, i.e., signal processing-based and machine learning-based methods [1,2]. For signal processing-based methods, vibration signals are normally decomposed for de-noising or feature extraction and then the analyzed signal is processed using time-series or spectrum methods. Finally, bearing fault diagnosis can be achieved by calculating indices or observing fault features in the obtained spectrum. Some signal processing methods, such as variational mode decomposition [3], wavelet transform [4], sparse representation [5], and their variants, have been successfully applied for bearing fault diagnosis. Those methods have achieved good performance in many cases. However, the results of signal processing-based methods are highly reliant on expert experience.
For machine learning-based fault diagnosis methods, statistical models are built and trained using signal or feature samples with different health conditions. Intelligent fault diagnosis models can be completed after evaluation by testing samples. Deep learning is one kind of statistical model which has been widely applied to bearing fault diagnosis [6]. Deep learning is an “end-to-end” model which can be trained by feeding signal samples directly into the model. Many deep learning models, for example, convolution neural networks [7], stacked autoencoders [8], deep belief networks [9], and recurrent neural networks [10], have achieved good performance when using large volumes of data. However, there are few samples which include faulty conditions and many samples which demonstrate healthy conditions in real operating conditions. The problem of imbalanced sample data results in overfitting and low accuracy in deep learning models.
The bearing fault diagnosis methods which use conventional statistical models, such as k-nearest neighbor [11], Bayesian classifier [12], and support vector machines (SVM) [13], mitigate these challenges by relying on well-defined mathematical principles and feature spaces that are less susceptible to the effects of imbalanced data [2]. Unlike deep learning methods, these statistical models only require a small number of features extracted from signal samples for fault diagnosis. Feature extraction maps original signals onto the statistical parameters which convey information about the machine’s status. The selection of statistical parameters directly influences the results of fault diagnosis methods using statistical models. Statistical parameters can be divided into two different classes: physical parameters and virtual parameters [14]. Physical parameters are relevant to physics-based information about health status and are extracted from signals using signal processing methods or statistical methods, for example, root mean square, peak value, kurtosis, and entropy. Virtual parameters are obtained by fusing multiple physical parameters or multi-sensor signals by using reduction techniques such as principal component analysis, self-organizing maps, and hidden Markov models. Hence, virtual parameters lose physical meaning and just represent a virtual index related to health status.
Entropy, as one kind of physical parameter, is the measure of the disorder of a signal, and has been successfully applied in fault diagnosis as approximate entropy, sample entropy, fuzzy entropy, etc. [15]. However, the above entropies have limitations. For example, approximate entropy is restricted by the length of the signals; sample entropy heavily depends on the similarity tolerance threshold; and the boundary width of the fuzzy function in fuzzy entropy is difficult to define [15]. Based on spectrum graph theory, graph entropies can be used for analyzing data with topology structures which have rich structural information compared with entropies in Euclidean space [16]. Graph entropies have been used in different research areas, for instance, brain science [17] and web networks [18]. However, there are few research works about fault diagnosis using graph entropies.
Inspired by multi-scale permutation entropy and graph permutation entropy (PEG) [19], graph multi-scale permutation entropy (MPEG) is proposed for the fault diagnosis of bearings in this work. In the proposed method, vibration signals are first transformed into visible graphs. Secondly, the spectral graph coarsening method is introduced to obtain coarse-grained graphs with different scales. Thirdly, the PEG of each coarse-grained graph is calculated and, hence, MPEG can be obtained. Finally, a support vector machine (SVM) is used for classing MPEG and diagnosing bearing faults. The main contributions of this work are (1) graph multi-scale permutation entropy is proposed by combining the graph coarsening method and the graph permutation entropy; (2) graph entropy is introduced for the first time for fault diagnosis.

2. Methodology

2.1. Horizontal Visibility Graphs

An undirected weighted graph can be expressed as G = V , E , W , where V = v 1 , v 2 , v 3 , , v N , | V | = N is a set of vertices; E = e 1 , e 2 , e 3 , , e M , | E | = M is a set of edges connecting vertices; and W = w i j N × N is the weighted adjacency matrix, where w i j is the weight of the edge connecting v i and v j . The Laplacian matrix of graph G is given by L = D W , where D = d 1 , d 2 , d 3 , , d N is the degree diagonal matrix and where d i = w i j . L is a real symmetric matrix with L = U Λ U T , in which U = [ u 1 , u 2 , , u N ] refers to orthogonal eigenvectors and Λ is the diagonal matrix of a set of non-negative real eigenvalues λ = λ 1 , λ 2 , , λ N . In this work, the vibration signal is constructed into the horizontal visibility graph (HVG) [20]. For a given a time series x 1 , x 2 , , x n , if x i and x j satisfy Equation (1) in the time series, v i and v j are connected vertices in the HVG.
x i , x j > x n , i < n < j
A time series and its HVG are shown in Figure 1. Figure 1a shows the bar plot of a time series [0.2 0.5 1.0 0.6 0.8 1.0 0.4 0.7]. Figure 1b is the HVG converted from the time series. Each data value in the time series corresponds to each vertex in the HVG.
It is worth noting that the weight value of edges w i j directly influences the results of graph signal processing methods. The main weighting methods are described below.
(1)
The constant weighting method describes the connection of vertices but neglects the feature diversity of vertices. The weight value of edges w i j is defined as Equation (2)
w i j = 1
(2)
The Gaussian kernel weighting method utilizes the Gaussian function for calculating the weight value of edges w i j , as shown in Equation (3).
w i j = exp x i x j 2 / 2 σ 2
where σ is the width of the Gaussian kernels.
(3)
The Euclidean weighting method utilizes the Euclidean distance for calculating the weight value of edges w i j , as shown in Equation (4).
w i j = x i x j
(4)
The cosine weighting method is defined as Equation (5).
w i j = 1 x i , x j x i 2 x j 2

2.2. Graph Coarsening

Graph coarsening aims to generate a coarse-grained graph of reduced size while preserving the original graph properties. Let the coarse-grained graph G c = V c , E c , W c with n = V c be obtained from the original graph G . For graph coarsening, a set of non-overlapping graph partitions P = { S 1 , S 2 , , S n } V is firstly selected and each partition S p corresponds to a super-node s p . An example illustrating the graph coarsening process with five graph partitions is shown in Figure 2. Super-nodes ( s p , s q ) are connected by super-edges W c ( p , q ) whose weight values are equal to the accumulative edge weights between nodes in graph partitions ( S p , S q ), as shown in Equation (6).
W c p , q = w S p , S q = v i S p , v j S q W i , j
Suppose the partition matrix P R n × N contains columns which are indicator vectors for dividing nodes of the original graph into graph partitions of the coarse-grained graph. P is defined as Equation (7).
P p , i = 1 ,               i f   v i S p 0 ,           o t h e r w i s e
According to Equation (6), the weighted adjacency matrix of the coarse-grained graph W c can be obtained and defined as Equation (8).
W c = P W P T
In this work, the spectral graph coarsening (SGC) method is introduced to calculate the weighted adjacency matrix [21]. In SGC, an iteration loop is implemented to search for different possible combinations of eigenvectors and the k-means clustering algorithm is applied to select the graph coarsening option with the minimum cost. The k-means cost function is defined as Equation (9).
F U k 1 , P k 1 * = i = 1 N r i j S i r j S i 2  
where P k 1 * is graph partitions and U k 1 = U 1 : k 1 ; U k 2 + 1 : N , in which k 1 is defined as Equation (10) and k 2 = N n + k 1 in the iteration.
k 1 = n ,                                                                                                                                                                                         if   λ N 1   k 1 = a r g m i n k k : λ k 1 , k n , λ N n + k + 1 > 1 ,   otherwise
The partition matrix is obtained by minimizing the k-means cost in Equation (11).
P * = a r g m i n F U k 1 , P k 1 *
Substituting Equation (11) into Equation (8), the weighted adjacency matrix of the coarse-grained graph can be obtained.

2.3. Graph Permutation Entropy

Graph permutation entropy is a measurement of the regularity of graphs obtained by combining the vertices’ features with the topology of the graphs. There are four procedures for calculating graph permutation entropy P E G , as follows:
(1)
The embedding vector y k m , τ R m is firstly constructed as Equation (12).
Y k m , τ = y i k τ k = 0 m 1 = y i 0 , y i τ , , y i m 1 τ , i = 1 , 2 , , N
where m is the embedding dimension ( 2 m N ), τ is the delay time ( τ N ), and y i k τ is defined as Equation (13).
y i k τ = 1 H k τ i j H k τ i x j = 1 H k τ i W k τ X i
where H k τ ( i ) is the set of all vertices connected to the vertex i with a walk on k τ edges. It follows that y i 0 = x i and y i 1 = ( I Δ ) x i .
(2)
The elements in embedding vector y k m , τ are associated with integer numbers from 1 to m and then rearranged in increasing order. Hence, there are m ! permutations in which k distinct permutations exist ( k m ! ) .
(3)
The probability of distinct permutations is denoted by P r . According to Shannon entropy, the graph permutation entropy for the k distinct permutations can be defined as Equation (14).
P E G m , τ = r = 1 R P r l n P r
(4)
When P r = 1 / m !   , the probability values of distinct permutations are the same, and hence P E G has the highest value ln m ! . Finally, the normalized procedure is implemented as shown in Equation (15).
P E G m , τ = P E G m , τ l n m !

3. Graph Multi-Scale Permutation Entropy for Bearing Fault Diagnosis

In this work, graph multi-scale permutation entropy is proposed for bearing fault diagnosis. There are four main procedures in the proposed method, as follows:
(1)
Graph transform. The vibration signal samples from bearings are transformed into HVGs.
(2)
Graph coarsening with multiple scales. Multi-scale coarse-grained graphs with different numbers of vertices are obtained for each HVG using the SGC method.
(3)
Graph permutation entropy calculation. The graph permutation entropy values of all multi-scale coarse-grained graphs are calculated for constructing feature vectors of each sample.
(4)
Classifier training. Features are divided into training data and testing data. The support vector machine (SVM) is trained by feeding the training data and then the testing data are used to evaluate the classifier. Finally, the bearing fault classifier can be obtained.

4. Experiment

To verify the effectiveness of the graph multi-scale permutation entropy method, two cases, A and B, were conducted using open-source data and laboratory data, respectively.

4.1. Case A: Vibration Signals from the Public Dataset

Bearing data from the Bearing Data Center of Case Western Reserve University were used to select the parameters of MPEG in this case. In this work, the bearing data with a damage diameter of 7 mils were selected as test signals. The motor speed was 1797 rpm and the load was 0 Hp. The sampling frequency was 12 kHz and there were 1024 sampling points for each sample. Each health status includes 20 samples; there are 80 samples for four health statuses, including rolling ball fault (RF), inner race fault (IF), outer race fault (OF), and healthy (H) bearing. For each health status, 5 samples were randomly chosen for training and the remaining 15 samples were used for testing; thus, a total of 20 training samples and 60 testing samples were obtained.
To test the influence of different weighting methods, a comparison experiment was conducted using three different weighting methods, i.e., the constant weighting method, the Gaussian kernel weighting method, and the Euclidean weighting method. Figure 3 shows the graph multi-scale permutation entropy (MPEG) values of samples with RF, IF, and OF using different weighting methods. For each health status, 5 samples were weighted using the three weighting methods to calculate MPEG and represented in different color lines. It can be seen from Figure 3 that MPEG decreases with the increase in the scale factor s . For samples exhibiting equivalent health conditions, the utilization of Euclidean kernel weighting yields MPEG values that converge more closely across varying scale factors. This phenomenon is indicative of reduced intra-class variability. The variances in MPEG values using different weighting methods are calculated as shown in Table 1. Table 1 demonstrates that the variance in MPEG under various health conditions is minimized when employing the Euclidean distance weighting method. It suggests superior performance characterized by enhanced discriminatory capacity and classification precision. Hence, the Euclidean kernel weighting method is used to calculate MPEG in this work.
The selection of parameters, such as embedding dimension m , sample length N , and time delay τ , directly influences the value of MPEG. A comparison experiment was conducted using samples with different N = 256 ,   512 ,   1024 ,   2048 and m = 3 ,   4 ,   5 ,   6 ,   7 . In the experiment,   s ranged from 1 to 20 and τ was set to 1, respectively. Figure 4 shows MPEG with respect to s under different parameter settings. It can be seen from the figure that the MPEG tends to decrease as the value of s increases. Also, some MPEG values demonstrate substantial and irregular fluctuations when m is 3 or 4. In contrast, when m is 5, 6, or 7, there is a trend of gradually decreasing MPEG values, which tends to stabilize after s reaches 10. Hence, MPEG values with s = 1–20 were selected as features for classification in this work according to experience.
Furthermore, Figure 4 shows that the intra-class distances of MPEG values for the same s tend to decrease with increasing values of m . However, a larger value of m leads to an increase in computation time. The CPU model utilized in this experiment was the Intel(R) Core(TM) i7-7700K, sourced from Intel Corporation, headquartered in Santa Clara, CA, USA, featuring 32 GB of RAM. Additionally, the graphics card employed was the GeForce GTX 1660 Super, manufactured by NVIDIA Corporation, based in Santa Clara, CA, USA, equipped with 6 GB of video memory. Table 2 illustrates the runtime for computing MPEG values under different values of m when N is set to 1024. It is evident that when m is increased to 7, the computation time becomes three times that of m = 6, indicating a significant increase. Hence, m was set to 6 in this work, after taking all factors into consideration.
The value of N also significantly influences the computation time for calculating MPEG values. Table 3 is the runtime for computing MPEG values under different values of N. Table 3 shows the runtime increases with the increase in N. This effect is particularly pronounced when N is set to 2048, where the efficiency drops by approximately 85% compared to when N is set to 1024. However, selecting small values of N may lead to the omission of critical fault information due to insufficient data length. Hence, N was set to 1024 in this work.
Another experiment was conducted to test the effect of testing time delay τ values on the classification results. In this experiment, four health statuses (RF, IF, OF, H) were assigned the numerical labels 0, 1, 2, and 3, respectively. For each health status, five samples were randomly chosen as training samples, while fifteen samples were designated as testing samples. The sequence of the 60 testing samples is as follows: RF (1–15), IF (16–30), OF (31–45), and H (46–60). Figure 5 is the classification results of bearing faults with different values of τ. It can be seen from Figure 5 that the classification accuracy for τ values of 1, 2, 3, and 4 is 98.3%, 83.3%, 90%, and 83.3% respectively. The highest accuracy was achieved when τ was set to 1. Therefore, τ was selected as 1 in this work.
A comparison experiment was implemented to test the robustness of MPEG. Bearing data from South Ural State University [22] were used in this experiment. In this work, the linear acceleration in the bearing data was selected as the test signal. The motor speed was 1200 rpm and the sampling frequency was 31.175 kHz. Considering that there are 1024 sampling points for each sample, the bearing data were downsampled to 10 kHz to ensure that each sample contained information from two revolutions of the test bearing. Each health status includes 20 samples; there were thus 100 samples for five health statuses, including rolling ball fault (BF), inner race fault (IF), outer race fault (OF), combination fault (CF), and healthy (H) bearing. For each health status, 5 samples were randomly chosen for training and the remaining 15 samples were used for testing; thus, a total of 25 training samples and 75 testing samples were obtained. The other four entropies, i.e., permutation entropy (PE) [23], graph permutation entropy (PEG) [19], multi-scale permutation entropy (MPE) [24], and multi-scale approximate entropy (MAE) [25], were added to the experiment. Figure 6 shows the classification results when using five different entropies, in which different colors represent different operating conditions. It can be seen from Figure 6 that the classification accuracies for the four methods, PE, PEG, MPE, and MAE, are 48%, 50.7%, 78.7%, and 89.3%, respectively. The MPEG method has only two misclassified samples, achieving an accuracy of 97.3%, which is 49.3%, 46.6%, 18.6%, and 8% higher than the aforementioned four methods, respectively. These results demonstrate that the proposed method is superior to conventional methods for identifying different kinds of bearing faults.

4.2. Case B: Vibration Signals from the Laboratory

Vibration signals were collected from the test rig as shown in Figure 7a, undertaken in our laboratory. The test faulty bearings are shown in Figure 7b. In this experiment, an MMF Type KS76-100/1 instrumentation-standard piezoelectric accelerometer (Acc 1) was used to measure vibration and was mounted on the test bearing by means of a strong magnet. The accelerometer was calibrated with an MMF Type VC20 vibration calibrator. The parameters of the bearing are shown in Table 4. Data were collected under four kinds of health status, i.e., roller fault (RF), outer race fault (OF), cage fault (CF), and healthy (H), as described in Table 5. Data for each health status were collected at three different rotational speeds, with 7 samples collected at each speed, resulting in a total of 21 samples per health status. During the experiment, 6 samples were chosen from each health status dataset as the training set, while the remaining 15 samples were allocated as the testing set. In total, there were 24 training samples and 60 testing samples.
To further test the robustness and advantages of the proposed method, the rotational speeds were set to 300 rpm, 400 rpm, and 500 rpm. As mentioned in Case A, the point number N of each sample was 1024. In this case, each sample contains information from two revolutions of the test bearing. Hence, the sampling frequency was set to 2 kHz, 2.5 kHz, and 3 kHz, respectively. Also, a hydraulic jack was used to apply a constant vertical load of 800 N over the test bearing.
The four health statuses (RF, OF, CF, H) are labeled as 0, 1, 2, and 3, respectively. The sequence of the 60 testing samples is as follows: RF (1–15), OF (16–30), CF (31–45), and H (46–60). The other four entropies mentioned in Case 1 were used for comparison against the proposed method in this section as well. The comparison results obtained are illustrated in Figure 8. In this figure, black represents a sample with a rotational speed of 300 rpm, red represents a speed of 400 rpm, and green corresponds to a speed of 500 rpm. It can be seen from Figure 8 that the classification accuracies for the four methods, PE, PEG, MPE, and MAE, are 46.7%, 80%, 65%, 80%. The MPEG method has only one misclassified sample, achieving an accuracy of 96.7%, which is 50%, 16.7%, 31.7%, and 16.7% higher than the aforementioned four methods, respectively. The results further validate the proposed method’s accuracy and robustness when dealing with data at different rotational speeds.

5. Conclusions

A novel graph multi-scale permutation entropy (MPEG) method is proposed for bearing fault diagnosis in this work. In the proposed method, vibration signals are first transformed into horizontal visibility graphs. Secondly, the graphs are coarsened at multiple scales to obtain multiple coarse-grained graphs. Thirdly, the permutation entropy values of these coarse-grained graphs are then employed as a multi-dimensional feature input for training a support vector machine (SVM). Comparison experiments were implemented using two different datasets to test the proposed method’s performance. Based on the experimental results, the following conclusions can be drawn:
  • The results of the comparison experiments in Case A demonstrate that the MPEG method is superior to conventional methods (PE, PEG, MPE, and MAE) for identifying different kinds of bearing faults.
  • The proposed method demonstrates strong robustness when dealing with data from various operating conditions. It outperforms traditional methods (PE, PEG, MPE, and MAE) in fault diagnosis. In Case B, the accuracy of the proposed method is 16% to 50% higher when compared to other methods.
  • This work validates the effectiveness and advantages of applying the MPEG method to rolling bearing fault diagnosis. However, it has not yet been applied to diagnose other equipment faults in machinery. Therefore, the efficacy of the MPEG method in other fault diagnostics needs further exploration.

Author Contributions

Conceptualization, Q.F. and D.Z.; Data Curation, J.Y.; Formal Analysis, Q.F. and J.Y.; Funding Acquisition, D.Z.; Investigation, Q.F.; Methodology, Q.F. and D.Z.; Project Administration, D.Z.; Resources, J.Y.; Software, Y.L.; Supervision, D.Z.; Validation, Q.F. and Y.L.; Visualization, Y.L.; Writing—Original Draft, Y.L.; Writing—Review and Editing, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Research Grants Council of the National Nature Science Foundation of China under Project 52105115 and Project 62120106011, and in part by the Fundamental Research Funds for the Central Universities, Sichuan Science and Technology Program under Grant 2023YFSY0003 and Grant 2023NSFSC0862.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data cannot be made publicly available upon publication because they are owned by a third party and the terms of use prevent public distribution. The data that support the findings of this study are available upon reasonable request from the authors.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Rai, A.; Upadhyay, S.H. A review on signal processing techniques utilized in the fault diagnosis of rolling element bearings. Tribol. Int. 2016, 96, 289–306. [Google Scholar] [CrossRef]
  2. Liu, R.; Yang, B.; Zio, E.; Chen, X. Artificial intelligence for fault diagnosis of rotating machinery: A review. Mech. Syst. Signal Process. 2018, 108, 33–47. [Google Scholar] [CrossRef]
  3. Lv, Z.; Han, S.; Peng, L.; Yang, L.; Cao, Y. Weak Fault Feature Extraction of Rolling Bearings Based on Adaptive Variational Modal Decomposition and Multiscale Fuzzy Entropy. Sensors 2022, 22, 4504. [Google Scholar] [CrossRef] [PubMed]
  4. Chen, J.; Li, Z.; Pan, J.; Chen, G.; Zi, Y.; Yuan, J.; Chen, B.; He, Z. Wavelet transform based on inner product in fault diagnosis of rotating machinery: A review. Mech. Syst. Signal Process. 2016, 70–71, 1–35. [Google Scholar] [CrossRef]
  5. Song, Z.; Huang, W.; Liao, Y.; Mao, L.; Shi, J.; Wang, J.; Shen, C.; Zhu, Z. Sparse representation based on generalized smooth logarithm regularization for bearing fault diagnosis. Meas. Sci. Technol. 2021, 32, 105003. [Google Scholar] [CrossRef]
  6. Hoang, D.; Kang, H. A survey on Deep Learning based bearing fault diagnosis. Neurocomputing 2019, 335, 327–335. [Google Scholar] [CrossRef]
  7. Zhang, J.; Sun, Y.; Guo, L.; Gao, H.; Hong, X.; Song, H. A new bearing fault diagnosis method based on modified convolutional neural networks. Chin. J. Aeronaut. 2020, 33, 439–447. [Google Scholar] [CrossRef]
  8. Mao, W.; Feng, W.; Liu, Y.; Zhang, D.; Liang, X. A new deep auto-encoder method with fusing discriminant information for bearing fault diagnosis. Mech. Syst. Signal Process. 2021, 150, 107233. [Google Scholar] [CrossRef]
  9. He, J.; Yang, S.; Gan, C. Unsupervised Fault Diagnosis of a Gear Transmission Chain Using a Deep Belief Network. Sensors 2017, 17, 1564. [Google Scholar] [CrossRef]
  10. Zhang, Y.; Zhou, T.; Huang, X.; Cao, L.; Zhou, Q. Fault diagnosis of rotating machinery based on recurrent neural networks. Measurement 2021, 171, 108774. [Google Scholar] [CrossRef]
  11. Tian, J.; Morillo, C.; Azarian, M.H.; Pecht, M. Motor Bearing Fault Detection Using Spectral Kurtosis-Based Feature Extraction Coupled with K-Nearest Neighbor Distance Analysis. IEEE Trans. Ind. Electron. 2016, 63, 1793–1803. [Google Scholar] [CrossRef]
  12. Yu, J.; Ding, B.; He, Y. Rolling bearing fault diagnosis based on mean multigranulation decision-theoretic rough set and non-naive Bayesian classifier. J. Mech. Sci. Technol. 2018, 32, 5201–5211. [Google Scholar] [CrossRef]
  13. Lv, Z.; Peng, L.; Cao, Y.; Yang, L.; Li, L.; Zhou, C. Weak Fault Feature Extraction Method of Rolling Bearings Based on MVO-MOMEDA under Strong Noise Interference. IEEE Sens. J. 2023, 23, 15732–15740. [Google Scholar] [CrossRef]
  14. Lei, Y.; Li, N.; Guo, L.; Li, N.; Yan, T.; Lin, J. Machinery health prognostics: A systematic review from data acquisition to RUL prediction. Mech. Syst. Signal Process. 2018, 104, 799–834. [Google Scholar] [CrossRef]
  15. Liu, L.; Zhi, Z.; Zhang, H.; Guo, Q.; Peng, Y.; Liu, D. Related Entropy Theories Application in Condition Monitoring of Rotating Machineries. Entropy 2019, 21, 1061. [Google Scholar] [CrossRef]
  16. Dehmer, M.; Mowshowitz, A. A history of graph entropy measures. Inform. Sci. 2011, 181, 57–78. [Google Scholar] [CrossRef]
  17. Sen, B.; Parhi, K.K. Graph-Theoretic Properties of Sub-Graph Entropy. IEEE Signal Proc. Let. 2021, 28, 135–139. [Google Scholar] [CrossRef]
  18. Liu, X.; Fu, L.; Wang, X.; Zhou, C. On the Similarity Between von Neumann Graph Entropy and Structural Information: Interpretation, Computation, and Applications. IEEE Trans. Inf. Theory 2022, 68, 2182–2202. [Google Scholar] [CrossRef]
  19. Fabila-Carrasco, J.S.; Tan, C.; Escudero, J. Permutation Entropy for Graph Signals. IEEE Trans. Signal Inf. Process. Netw. 2022, 8, 288–300. [Google Scholar] [CrossRef]
  20. Elmadany, N.E.D.; He, Y.; Guan, L. Multimodal Learning for Human Action Recognition via Bimodal/Multimodal Hybrid Centroid Canonical Correlation Analysis. IEEE Trans. Multimed. 2019, 21, 1317–1331. [Google Scholar] [CrossRef]
  21. Jin, Y.; Loukas, A.; Jaja, J.F. Graph Coarsening with Preserved Spectral Properties. In Proceedings of the International Conference on Artificial Intelligence and Statistics, Online, 26–28 August 2020; Volume 108, pp. 4452–4461. [Google Scholar]
  22. Sinitsin, V.; Ibryaeva, O.; Sakovskaya, V.; Eremeeva, V. Intelligent bearing fault diagnosis method combining mixed input and hybrid CNN-MLP model. Mech. Syst. Signal Process. 2022, 180, 109454. [Google Scholar] [CrossRef]
  23. Yan, R.; Liu, Y.; Gao, R.X. Permutation entropy: A nonlinear statistical measure for status characterization of rotary machines. Mech. Syst. Signal Process. 2012, 29, 474–484. [Google Scholar] [CrossRef]
  24. Li, Y.; Xu, M.; Wei, Y.; Huang, W. A new rolling bearing fault diagnosis method based on multiscale permutation entropy and improved support vector machine based binary tree. Measurement 2016, 77, 80–94. [Google Scholar] [CrossRef]
  25. Yan, R.; Gao, R.X. Approximate Entropy as a diagnostic tool for machine health monitoring. Mech. Syst. Signal Process. 2007, 21, 824–839. [Google Scholar] [CrossRef]
Figure 1. An example of a horizontal visibility graph: (a) time series data value; (b) horizontal visibility graph.
Figure 1. An example of a horizontal visibility graph: (a) time series data value; (b) horizontal visibility graph.
Sensors 24 00056 g001
Figure 2. An example illustrating the graph coarsening procedure with five graph partitions.
Figure 2. An example illustrating the graph coarsening procedure with five graph partitions.
Sensors 24 00056 g002
Figure 3. Graph multi-scale permutation entropy values using three different weighting methods. (ac) using the constant weighting method; (df) using the Gaussian kernel weighting method; (gi) using the Euclidean weighting method.
Figure 3. Graph multi-scale permutation entropy values using three different weighting methods. (ac) using the constant weighting method; (df) using the Gaussian kernel weighting method; (gi) using the Euclidean weighting method.
Sensors 24 00056 g003
Figure 4. Graph multi-scale permutation entropy values with different N and m: (ae) rolling ball fault; (fj) inner race fault; (ko) outer race fault.
Figure 4. Graph multi-scale permutation entropy values with different N and m: (ae) rolling ball fault; (fj) inner race fault; (ko) outer race fault.
Sensors 24 00056 g004
Figure 5. Classification results of bearing faults with different values of τ .
Figure 5. Classification results of bearing faults with different values of τ .
Sensors 24 00056 g005
Figure 6. Classification results using five different entropies in Case A. (a) PE, (b) PEG, (c) MPE, (d) MAE, (e) MPEG.
Figure 6. Classification results using five different entropies in Case A. (a) PE, (b) PEG, (c) MPE, (d) MAE, (e) MPEG.
Sensors 24 00056 g006
Figure 7. The bearing test rig and faulty bearings in in our laboratory: (a) test rig, (b) roller fault, (c) outer race fault, (d) cage fault.
Figure 7. The bearing test rig and faulty bearings in in our laboratory: (a) test rig, (b) roller fault, (c) outer race fault, (d) cage fault.
Sensors 24 00056 g007
Figure 8. Classification results using five different entropies in Case 2: (a) PE, (b) PEG, (c) MPE, (d) MAE, (e) MPEG.
Figure 8. Classification results using five different entropies in Case 2: (a) PE, (b) PEG, (c) MPE, (d) MAE, (e) MPEG.
Sensors 24 00056 g008
Table 1. Variance among MPEG values using different weighting methods.
Table 1. Variance among MPEG values using different weighting methods.
Constant WeightingGaussian Kernel WeightingEuclidean Weighting
RF0.015340.011110.00951
IF0.012530.010500.00871
OF0.018200.012950.01152
Table 2. Runtime for computing MPEG values under different values of m.
Table 2. Runtime for computing MPEG values under different values of m.
m34567
Time0.07 s0.1 s0.18 s0.55 s2.9 s
Table 3. Runtime for computing MPEG values under different values of N.
Table 3. Runtime for computing MPEG values under different values of N.
N25651210242048
Time2 min8 min40 min9 h
Table 4. Test bearing parameters.
Table 4. Test bearing parameters.
TypeBearing No.Number of Rollers per CageRoller DiameterPitch
Diameter
Roller Angle
ParametersTAROL
130/230-U-TVP
2224 mm187 mm6.9°
Table 5. Health status of test bearings.
Table 5. Health status of test bearings.
Bearing IDDescription
RFA minor scratch along the roller, inflicted by an electrical discharge engraver
OFA minor scratch across the race (axial direction), inflicted by a combination of an engraver and small grinder
CFThe bearing cage cracked in one place, achieved by cutting and applying excess force with a screwdriver
HHealthy bearing
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fan, Q.; Liu, Y.; Yang, J.; Zhang, D. Graph Multi-Scale Permutation Entropy for Bearing Fault Diagnosis. Sensors 2024, 24, 56. https://doi.org/10.3390/s24010056

AMA Style

Fan Q, Liu Y, Yang J, Zhang D. Graph Multi-Scale Permutation Entropy for Bearing Fault Diagnosis. Sensors. 2024; 24(1):56. https://doi.org/10.3390/s24010056

Chicago/Turabian Style

Fan, Qingwen, Yuqi Liu, Jingyuan Yang, and Dingcheng Zhang. 2024. "Graph Multi-Scale Permutation Entropy for Bearing Fault Diagnosis" Sensors 24, no. 1: 56. https://doi.org/10.3390/s24010056

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop