Next Article in Journal
Offline Scaling of IoT Devices in IOTA Blockchain
Next Article in Special Issue
Online Surface Roughness Prediction for Assembly Interfaces of Vertical Tail Integrating Tool Wear under Variable Cutting Parameters
Previous Article in Journal
Usability of Functional Electrical Stimulation in Upper Limb Rehabilitation in Post-Stroke Patients: A Narrative Review
Previous Article in Special Issue
Multilevel Fine Fault Diagnosis Method for Motors Based on Feature Extraction of Fractional Fourier Transform
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Bearing Fault Diagnosis Method Based on Wavelet Packet Transform and Convolutional Neural Network Optimized by Simulated Annealing Algorithm

School of Computer Science, Yangtze University, Jingzhou 430023, China
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(4), 1410; https://doi.org/10.3390/s22041410
Submission received: 28 December 2021 / Revised: 7 February 2022 / Accepted: 8 February 2022 / Published: 12 February 2022

Abstract

:
Bearings are widely used in various electrical and mechanical equipment. As their core components, failures often have serious consequences. At present, most parameter adjustment methods are still manual adjustments of parameters. This adjustment method is easily affected by prior knowledge, easily falls into the local optimal solution, cannot obtain the global optimal solution, and requires a lot of resources. Therefore, this paper proposes a new method for bearing fault diagnosis based on wavelet packet transform and convolutional neural network optimized by a simulated annealing algorithm. Firstly, the original bearing vibration signal is extracted by wavelet packet transform to obtain the spectrogram, and then the obtained spectrogram is sent to the convolutional neural network for parameter adjustment, and finally the simulated annealing algorithm is used to adjust the parameters. To verify the effectiveness of the method, the bearing database of Case Western Reserve University is used for testing, and the traditional intelligent bearing fault diagnosis methods are compared. The results show that the new method for bearing fault diagnosis proposed in this paper has a better and more reliable diagnosis effect than the existing machine learning and deep learning methods.

1. Introduction

Bearings are a widely used, significant part of modern machinery and equipment. Bearing failure leads to significant failure time, increases maintenance costs, and may even reduce productivity [1]. Rolling element bearings are important mechanical parts of rotating machinery. They are also the main causes of basic industrial equipment failures, such as those of roll mills in steel mills, paper mills, and wind-turbine power plants, accounting for 51% of all failures [2]. Therefore, efficient and accurate bearing fault diagnosis methods play an important role in ensuring the function of the entire mechanical system, and the research on these methods is thus of great significance.
The study of bearing failure requires a large amount of original data, most of which are related to the state of motion. The state of motion can be reflected through vibration, sound, heat, electricity, etc. To check the health of a motor comprehensively, a state monitoring system is used to collect real-time data on the motor; after the motor has been running for a while, a large amount of data can be obtained [3]. However, these data are often processed with low efficiency in traditional bearing fault diagnosis methods, so a more efficient and reliable method is required.
Supported by a large amount of data, the traditional method of bearing fault diagnosis, which is based on manual experience, has gradually transitioned to data-driven, intelligent bearing fault diagnosis. Intelligent fault diagnosis can quickly and efficiently process the collected signals and provide accurate diagnoses. Thus, it is a promising tool in mechanical big-data processing. However, traditional intelligent diagnosis methods rely on prior knowledge and expertise for the manual extraction of features. These processes utilize human ingenuity, but they are time-consuming and labor-intensive [4].
One study [5] proposed a bearing fault diagnosis method combining the fuzzy c-means (FCM) method and an optimized k-nearest neighbor (KNN) model. Another [6] proposed a method to detect motor bearing faults. This method is based on spectral kurtosis (SK) and cross-correlation, extracts fault features representing different faults, and then uses principal component analysis (PCA) and semi-supervised KNN distance measure combines these features into a health index. Another study [7] proposed a sequential KNN classification method based on the affinity of distance and density to perform classification resulting in bearing fault diagnosis. In other study, a diagnosis method was proposed based on data-driven random fuzzy evidence collection and Dempster–Shafer evidence theory.
The literature [8] proposed a sequential KNN classification method based on the affinity of distance and density to perform classification for bearing fault diagnosis. The literature [9] proposed a diagnosis method based on data-driven random fuzzy evidence collection and Dempster–Shafer evidence theory. Then, by studying the bearing fault diagnosis method at this stage, it can be found that the bearing fault diagnosis method at this stage has obvious defects in feature extraction. In response to this problem, some scholars have proposed a new bearing fault diagnosis model, including feature extraction methods and fault classification methods.
Deep learning, which can discover features from raw data through multi-layer non-linear data-processing units, has become a promising tool for intelligent bearing fault diagnosis. Deep learning has a wide range of applications, such as pedestrian re-identification [10] and face alignment [11]. Indeed, deep learning has recently played a crucial role in the field of artificial intelligence [12]. Deep convolutional neural networks have strong data-mining and information integration capabilities, and they have been widely used in research on monitoring and diagnosing rotating machinery [13]. One study [14] proposed a method for feature extraction using continuous wavelet transform and rolling bearing fault diagnosis based on a convolutional neural network and support vector machine. Another study [15] proposed a bearing fault diagnosis method based on short-time Fourier transform and a convolutional neural network. Other studies have proposed bearing fault diagnosis methods based on the Stacked Inverted Residual Convolutional Neural Network (SIRCNN) and on the deep structure of the convolutional neural network [16,17]. Another novel fault diagnosis strategy was based on Synchronous Compression Transformation (SST) and deep convolutional neural network [16]. Someone proposed a bearing diagnosis method based on deep feature learning, in order to solve the fault identification of metal bearings, ceramic bearings, and hybrid bearings in electromechanical systems [18]. Some scholars have also proposed a bearing fault diagnosis method based on CNN. Compared with the traditional method, because CNN is based on the characteristics of weight sharing and sparse connection, it has a strong feature extraction ability, so it has a better effect in bearing fault diagnosis.
Although the bearing fault diagnosis based on convolutional neural network has achieved good results [19,20,21], two main problems remain. On the one hand, the training of convolutional neural networks is still a difficult problem. The training process often consumes a long time and requires more computing resources. In addition, it is easy to fall into the local optimal solution during the training process, resulting in a poor final recognition effect. Poor, based on the above problems, it will make the final industrial landing very difficult, and a lot of resources are spent in the adjustment process of parameters, but the setting of these parameters often has a decisive impact on the final result. On the other hand, when the parameters are adjusted, the convergence speed is too slow, and the local optimal solution is trapped, and it is difficult to find the global optimal solution. The above problems are factors that have a great influence on the bearing fault diagnosis effect, and need to be solved urgently.
Heuristic algorithms have a wide range of applications. In dealing with optimization problems, many excellent optimization algorithms have been proposed in recent years, such as flower pollination algorithms [22], generalized normal distribution algorithms based on local search [23], and improved genetic algorithms [24]. Some excellent heuristic algorithms have been proposed to solve optimization problems, which have achieved good results in various fields [25,26,27]. The heuristic algorithm mainly uses limited resources to obtain the optimal solution as much as possible in the case of limited resources, and in most cases, it can achieve good results. When training a convolutional neural network, time and computing resources are often limited. In the industry, when performing actual reasoning, it also needs to respond quickly, so it is suitable to use heuristic algorithms for optimization.
To address this problem, this study proposed a simulated annealing (SA) optimization method for bearing fault diagnosis based on a convolutional neural network and wavelet packet transform (WPT). Through WPT, the original signal was changed into a spectrogram. This spectrogram was then used as the input data of the convolutional neural network, which used feature-extracted samples for supervised training. Then, a SA algorithm was introduced to optimize the parameters of the convolutional neural network continually. The Case Western Reserve University bearing database was then used to verify this study’s proposed bearing fault diagnosis method.
This study also compared the proposed bearing fault diagnosis method with some traditional algorithms [28,29] including support vector machine, BP neural network, and WPT—convolutional neural network. Finally, the effectiveness of the proposed diagnosis method was verified through the comparison.
The innovation of this paper is as follows: a novel bearing fault diagnosis framework is proposed, based on a unique combination of the convolutional neural network, the wavelet packet transform and the simulated annealing algorithm.
This article is organized as follows: Section 2 introduces the data set used in the experiment, while Section 3 introduces the method proposed in this study, which included WPT, a convolutional neural network, and a SA algorithm. Section 4 describes the experiment and provides an analysis of the results. The Section 5 summarizes the article.

2. Data Set Description

The Case Western Reserve University bearing database was employed in this study because a large number of recent studies have been based on this data set and have achieved good results. The experimental equipment is shown in Figure 1. All bearings tested during the experiment were SKF bearings, which can be divided into three classifications: ordinary bearings, drive end (DE) bearings, and fan end (FE) bearings [30].
The accelerometer installed on the drive end of the induction motor was used for data collection. The sampling frequency was 12 KHZ, and each vibration signal was 10 s. The collected data included four bearing states: (1) normal state, (2) inner ring failure, (3) rolling element failure, and (4) outer ring failure. Figure 2 provides a physical drawing of the bearing. The sampling was completed at different speeds to capture each of these states: 1750 rpm, 1772 rpm, and 1797 rpm. Empirical mode decomposition was used to introduce a single point of failure. The fault point size was 0.007, 0.014, 0.021, and 0.028 inches. The experiment was also repeated under different loads, including 0, 1, 2, and 3 horsepower (hp) [31].
The number of sampling points often has a great impact on the experimental results. Generally, as the number of sampling points increases, the final bearing effect will be correspondingly better. However, for application to the actual industrial field and realization of real-time bearing fault diagnosis, too many sampling points cannot be selected. To evaluate the effectiveness and robustness of the proposed method, 400 sampling points were selected per group. At present, most studies choose 1024, 2048, or 4096 for a single sampling point. A large number of sampling points will greatly improve the final diagnostic accuracy, but the real-time performance will also be affected. In subsequent experiments, the bearings were divided into 12 categories to verify the method proposed. The classification is shown in Table 1.

3. The Proposed Bearing Fault Diagnosis Method

This section introduces the bearing fault diagnosis method (SWC) based on a SA-optimized convolutional neural network and WPT. The widely used bearing fault diagnosis algorithm is mainly composed of feature extraction and pattern recognition. First, the main features of the original diagnostic signal are extracted. Then, a pattern recognition algorithm is used, and the extracted features are used as a data set for training, thereby achieving the purpose of bearing fault diagnosis. This methodology has been proven relatively mature and has achieved good results.
The proposed bearing fault diagnosis method (SWC) based on a convolutional neural network optimized by SA and WPT also roughly followed this model. On this basis, the SA algorithm was introduced for automatic parameter adjustment, which became a kind of self-adjustment and thus adapted the bearing fault diagnosis method. Figure 3 depicts the workflow of the SWC algorithm First, WPT was used for feature extraction to obtain a spectrogram. This spectrogram was then sent to the convolutional neural network for training. Finally, the SA algorithm was used to optimize the convolutional neural network, and the global optimal solution was obtained. Figure 4 briefly describes this process.

3.1. Feature Extraction Based on Wavelet Packet Transform

Machine-learning algorithms based on vibration signal processing are a common method used in bearing fault diagnosis [32]. Feature extraction of bearing signals is key for fault diagnosis, as the excellent features of bearing vibration signals aid in the accurate diagnosis of bearing faults. When a fault occurs, the time domain and the frequency domain of the output signal are different. WPT analysis can accurately find this fault information and location. Indeed, since WPT appeared in the early 1980s, its popularity has grown; WPT has many advantages over Fourier transform [33]. WPT is an important mathematic tool for analyzing nonlinear and non-stationary signals. It can decompose the signal into multiple sub-signals with different frequency ranges. Further, it can decompose the detailed information of the high-frequency area signal. Therefore, feature extraction of bearing vibration signals based on WPT is one of the most commonly used methods. The mathematical description of the WPT algorithm is as follows:
W j , k n t = 2 j 2 W n 2 j t k .
where the integers j and k are the index scale and translation operations. The index n is an operation modulation parameter or oscillation parameter. In the process of WPT, the detected signal is divided by a pair of low-frequency and high-frequency filters into an approximate part and a detailed part, respectively. The approximate part is then further split into a secondary approximate part and a detailed part. Similarly, the first level of detail is split into a second level of approximation and detail. This process continues until the stopping criterion is reached. For n-level decomposition, the signal is decomposed into 2n narrowband signals. The recombined signal carries the same information as the original signal.
In SWC, WPT was used as the signal extraction method. WPT had the same bandwidth in each decomposition, as shown in the decomposition tree structure provided in Figure 5. In this decomposition mode, the original signal would not increase or decrease, and thus it was retained to the greatest extent possible; almost no loss of the original signal occurred during the process. This facilitated the processing of non-stationary signals, and good time-frequency analysis could be performed, regardless of the high-frequency or low-frequency parts of the original signal.
The Case Western Reserve University bearing database was used as a data set for analysis, and the approximate results are shown in Figure 6.
For timing signals, the characteristic frequency cannot be found directly Therefore, WPT was used to process the original timing signal and convert it to different frequencies, as shown in Figure 6. To improve the efficiency of data utilization, these pictures were spliced into one picture to use as the training data for the convolutional neural network, as shown in Figure 7.

3.2. Fault Classification Based on a Convolutional Neural Network

Deep learning has been widely used in various fields and has achieved relatively good results. Among deep learning algorithms, the development of convolutional neural network is the fastest [35]. With the continuous improvement of computer performance, the efficiency of convolutional neural networks in processing sound and video data sets far exceeds that of the traditional methods.
Similarly, deep learning is also widely used in the field of bearing fault diagnosis. Compared with traditional diagnosis methods, bearing diagnosis based on convolutional neural networks has stronger anti-noise ability, faster processing speed, and higher accuracy. The structure of a convolutional neural network in shown in Figure 8.

3.2.1. Convolutional Layer

The convolution layer of a convolutional neural network is mainly composed of several convolutional kernels, and the parameters of each kernel are optimized through a back propagation algorithm [36]. Convolution is a mathematical operation, and the completion of this operation in the convolution layer depends only on the convolution kernels. These kernels extract the features of the input data and perform learning.
The convolution layer can also complete local connection and weight sharing. Local connection means that the parameters of the convolution layer are only connectedwith the previous layer, and only local features are learned. This ensures that similar data will be input and similar answers will be obtained. Weight sharing reduces parameters and enables rapid convergence. Convolution can be described by the following mathematical formula:
a i , j = f m = 0 2 n = 0 2 w m , n x i + m , j + n + w b
Among them, f represents the activation function, usually the relu function, which is described in detail below. i and j represent the element in the i-th row and j-th column of the input data matrix. m and n represent the weight of the mth row and n column, and w b represents the bias term.

3.2.2. Activation Function

When the activation function is not used, each layer is a linear combination of the previous layer, so that fewer types of functions can be fitted. After introducing a non-linear activation function, the model can fit almost any non-linear function, namely. This activation function is introduced to improve the fitting ability of the model [14]. Activation functions are continuous and differentiable, which facilitates complex mathematical operations. Commonly used activation functions include sigmoid, relu, tanh, leaky relu, maxout, elu, etc.
The expression of the relu function is as follows:
f ( x ) = m a x ( 0 , x )
The image of the relu function is shown in Figure 9.

3.2.3. Pooling Layer

In simple terms, pooling is a non-linear down-sampling. The pooling layer completes two tasks. First, it reduces the input feature map to reduce the complexity of the model and optimize its calculation. Second, it compresses the features, extracts the main features, and reduces the phenomenon of over-fitting [37]. The process of pooling is roughly shown in Figure 10.
Common pooling includes maximum pooling and average pooling. The realization mechanism of maximum pooling is to divide the input matrix into multiple regions and select the maximum value for each subregion. Through this mechanism, the most important characteristics of the input data are extracted.
Some models no longer use the pooling layer, as pooling reduces the dimensionality of the data and simplifies the calculation. However, part of the original data is lost. This loss of the original data will cause part of the data to be lost, which is detrimental to the final effect of the model [38].

3.2.4. Fully Connected Layer

Under normal circumstances, the data must be sent to the fully connected layer after convolutions and pooling to complete the final operation of the model. In the convolution layer, the local feature extraction of the input data is completed. If only the local features are used to judge, the phenomenon of over-fitting is prone to appear [39]. Therefore, all features must be used. Once the process is complete, the weight matrix reassembles the previously extracted features into a complete feature map.

3.2.5. Fault Classification

In the proposed algorithm, the convolutional neural network algorithm is used for final pattern extraction to classify faults. Convolutional neural networks have many advantages in image recognition and classification. In this study’s comparative analysis, the classification effect of the convolutional neural network was compared with that of a support vector machine and BP neural network. The results showed that convolutional neural networks achieved better fault classification results.
The spectrogram was sent to the convolutional neural network for training. After training, the convolutional neural network calculated the fault type. As stated above, convolutional neural networks are composed of an input layer, convolution layer, pooling layer, and fully connected layer. The general structure is shown Figure.
The state of bearings is classified as normal, inner-ring failure, outer-ring failure, and rolling element failure. Figure 11 shows the frequency spectrum of these four bearing states.
The signal-processed data were sent to the convolutional neural network as input data for training. The convolutional neural network of the present study is shown in Figure 12.

3.3. Optimization Based on Simulated Annealing Algorithm

The earliest idea of SA was proposed by N. Metropolis et al. in 1953. Then, in 1983, S. Kirkpatrick and others successfully introduced the idea of annealing to the field of combinatorial optimization, which is a stochastic optimization algorithm based on the Monte-Carlo iterative solution strategy similar to that used in [40]. The solution obtained by the SA algorithm has been shown to converge to the global optimal solution according to the probability theory.
The core concept of SA can be roughly described as follows. As the initial annealing temperature drops, the optimal solution is constantly searched for within the feasible solution. Unlike the hill-climbing algorithm, the SA algorithm has a certain probability of accepting a solution that is worse than the original solution so that it can avoid getting trapped by local optimal solutions. The probability of accepting the poor solution continues to decrease until the SA finally approaches the global optimal solution. This process is just like that of metal annealing [41]. The general process is shown in Figure 13.
As a general random search algorithm, SA has been widely used in VLSI design, image recognition, and neural network computer research. However, this algorithm is not widely applied in the field of bearing fault diagnosis. Thus, the optimization potential of SA is obvious.
The parameter adjustment of the convolutional neural network is very important, as the quality of the parameter settings determines the performance of the final model. However, parameter adjustment is often difficult and requires a lot of manpower and computing resources. Additionally, because it is based on human experience, falling into the local optimal solution is easy, which renders obtaining the global optimal solution impossible. Therefore, the SA was introduced to improve the model. Due to the introduction of the SA algorithm, the final algorithm became an adaptive bearing fault diagnosis method. Compared with other diagnostic methods that require manual adjustment of parameters, this method had a better scope of application.
As an important optimization algorithm, the SA algorithm is mainly used to solve optimization problems in the engineering field. In this study, the SA algorithm was introduced to optimize the parameters of the convolutional neural network so that it could obtain a higher accuracy rate, reduce over-fitting, and accelerate the convergence speed. The setting of some parameters in the SA algorithm will affect the results of the final experiment. In the present study, the initial temperature was set to 1000 degrees Celsius, and the learning rate was 0.95.
In the previous section, the basic model architecture was described. WPT analysis was used as a signal processing method for feature extraction, and a convolutional neural network was used as a fault classification algorithm. Optimizing the convolutional neural network using a SA algorithm aimed to optimize the parameters of the convolutional neural network to help quickly find the most suitable parameters. This sped up the convergence of the convolutional neural network and obtained a higher accuracy rate.
The following steps were used to optimize the convolutional neural network with the SA algorithm:
Step 1: Specify the initial solution, set the initial temperature
Start by setting up an initial solution and use the initial default settings as the initial solution. The solution is treated as a set, including the required parameters for the learning rate and the number of iterations. Theoretically, the larger the initial temperature setting, the better, but since the higher the temperature, the time required for convergence will also increase accordingly, so it is necessary to make a compromise between the convergence time and the convergence accuracy, which is also the essence of the heuristic algorithm, where we choose 1000 degrees Celsius here, and choose 0.05 for the learning rate. With limited computational and time resources, such a setup can find the optimal solution faster.
Step 2: Given the objective function
As an optimization problem, a suitable optimization function needs to be given. Here, the accuracy of the final bearing fault diagnosis is selected as the objective function of the optimization problem. After many iterations of the simulated annealing algorithm, the value of the objective function is infinitely close to the global value. Optimal solution: The cost function is opposite to the objective function. Here, the cost function can be obtained by taking the inverse of the value of the objective function. The optimization goal here is to maximize the objective function and reduce the cost function as much as possible.
Step 3: Given the disturbance function
In order to help find the optimal solution quickly, a perturbation function needs to be given. The perturbation function jumps to 1–5% based on the original parameters. As a new solution: The perturbation function is the core of the modified algorithm. A new solution is generated by setting the perturbation function, and according to the new solution, the corresponding objective function value is generated. After the value is obtained, it is compared with the old objective function value.
Step 4: Given acceptance criteria
In this experiment, Metropolis criterion is used. For the new solution, it will be compared with the original solution first. If it is better than the original solution, it will choose to accept it. If the original solution is not superior, it will be accepted with a certain probability.
This idea of accepting inferior solutions will help jump out of the local optimal solution and provide the possibility to find the global optimal solution.

4. Analysis of Results

The proposed bearing fault diagnosis method based on WPT and a convolutional neural network optimized by the SA algorithm was implemented in MATLAB, and the method was completed on a machine with GTX1660ti GPU. The Case Western Reserve University bearing database was used for verification.
Data containing 12 different types of bearing failures were used as test data. In Figure 14, we visualize the test data.
Bearing fault diagnosis consists of two processes: feature extraction and fault classification. The main feature extraction methods include principal component analysis, empirical mode decomposition, WPT, and short-time Fourier transform. In bearing fault diagnosis, the commonly used fault classification algorithms are BP neural network and support vector machine.
To illustrate the effectiveness of the proposed method, it was compared with the traditional intelligent algorithms. Compared with these algorithms, the results of the new bearing fault diagnosis method showed a higher accuracy rate, thus supporting the effectiveness of the algorithm. The results of the comparison are shown in Table 2, while Figure 15 provides a comparison of the traditional intelligent algorithms. In Figure 16, we compare the auc of different bearing fault diagnosis methods. The results show that the method proposed in this paper has the best classification effect among all the compared methods.
Empirical mode decomposition is an adaptive method that can decompose any signal into an empirical mode. It is commonly used for feature extraction in traditional intelligent bearing fault diagnosis algorithms. The empirical patterns obtained by re-composition are different oscillation patterns extracted from the original signal. Thus, the empirical mode can be used to reconstruct the original signal [42].
After analysis, the reason for obtaining better results is that, first, we use the wavelet packet transform to extract the features of the original signal to obtain the spectrogram, and then use the spectrogram as the training data set. Convolutional neural network is a good feature extractor, and after the original data has been subjected to wavelet packet transformation, it can obtain better diagnostic results. However, this still cannot achieve good results. Therefore, it is necessary to use a parameter adjustment method for the convolutional neural network. If the method of manually adjusting the parameters is used, it will consume a lot of time and computing resources. Heuristic algorithms can achieve better results when dealing with resource-constrained optimization problems. Therefore, the simulated annealing algorithm is introduced to adjust the parameters to obtain a better bearing fault diagnosis effect.

5. Discussion

This study proposed a new bearing fault diagnosis method based on WPT and a convolutional neural network optimized by a SA algorithm. Compared with traditional methods, the model completed the task of bearing fault diagnosis more accurately and intelligently.
The method used WPT to preprocess the original vibration data and generates a spectrogram, which was used as the training data for the convolutional neural network. Then, the simulated degradation algorithm was used to find the optimal parameters, thereby replacing the process of manual parameter adjustment. Finally, the public data set from Case Western Reserve University was used to verify the model, which showed good results.
This method was also compared with support vector machines and BP neural networks, along with other models that have not been optimized by SA algorithms. The final results showed that the proposed method had simpler parameter setting and a higher accuracy rate.
The volume of the convolutional neural network was relatively small, which is convenient for transplantation in the industrial field. Compared with large-scale networks, it more easily achieves cross-platform transplantation while ensuring a certain accuracy rate.
In addition, through comparative experiments, the superiority of our proposed method compared with previous methods is demonstrated, and the heuristic algorithm is also effective in the field of bearing fault diagnosis.

6. Conclusions

This paper proposes a new bearing fault diagnosis algorithm-SWT. Firstly, the wavelet packet transform is used for simple signal processing, and then the convolutional neural network is used for feature extraction and fault identification. Finally, to improve the speed of model convergence and the accuracy of diagnosis, a simulated annealing algorithm is used to adjust parameters. SWT can directly process the original signal without any manual feature extraction, which reduces the waste of human resources. The effectiveness of the proposed method is further demonstrated by comparative experiments. In the academic world, some new methods have been proposed recently, such as compression and acceleration of neural networks, and multi-modal bearing fault diagnosis methods. From the literature, they have achieved good results in many fields, but in bearing faults the field of diagnosis is less used. Therefore, in future work, we will try to combine the acceleration and compression of neural networks, multi-modality, and other new methods, in order to provide new research ideas for the field of bearing fault diagnosis.

Author Contributions

Conceptualization, F.H. and Q.Y.; validation, F.H.; investigation, F.H. and Q.Y.; writing, F.H. and Q.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant No. 51974036 and No. 62006028.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hoang, D.T.; Kang, H.J. A survey on deep learning based bearing fault diagnosis. Neurocomputing 2019, 335, 327–335. [Google Scholar] [CrossRef]
  2. Islam, M.M.M.; Kim, J.M. Reliable multiple combined fault diagnosis of bearings using heterogeneous feature models and multiclass support vector machines. Reliab. Eng. Syst. Saf. 2019, 184, 55–66. [Google Scholar] [CrossRef]
  3. Gu, Y.K.; Zhou, X.Q.; Yu, D.P.; Shen, Y.J. Fault diagnosis method of rolling bearing using principal component analysis and support vector machine. J. Mech. Sci. Technol. 2018, 32, 5079–5088. [Google Scholar] [CrossRef]
  4. Dong, S.; Xu, X.; Chen, R. Application of fuzzy C-means method and classification model of optimized K-nearest neighbor for fault diagnosis of bearing. J. Braz. Soc. Mech. Sci. Eng. 2016, 38, 2255–2263. [Google Scholar] [CrossRef]
  5. Tian, J.; Morillo, C.; Azarian, M.H.; Pecht, M. Motor bearing fault detection using spectral kurtosis-based feature extraction coupled with K-nearest neighbor distance analysis. IEEE Trans. Ind. Electron. 2015, 63, 1793–1803. [Google Scholar] [CrossRef]
  6. Kang, M.; Ramaswami, G.K.; Hodkiewicz, M.; Cripps, E.; Kim, J.M.; Pecht, M. A sequential K-nearest neighbor classification approach for data-driven fault diagnosis using distance-and density-based affinity measures. In International Conference on Data Mining and Big Data; Springer: Cham, Switzerland, 2016; pp. 253–261. [Google Scholar]
  7. Sun, X.; Tan, J.; Wen, Y.; Feng, C. Rolling bearing fault diagnosis method based on data-driven random fuzzy evidence acquisition and Dempster–Shafer evidence theory. Adv. Mech. Eng. 2016, 8, 1687814015624834. [Google Scholar] [CrossRef]
  8. Shen, F.; Chen, C.; Yan, R.; Gao, R.X. Bearing fault diagnosis based on SVD feature extraction and transfer learning classification. In Proceedings of the 2015 Prognostics and System Health Management Conference (PHM), Beijing, China, 21–23 October 2015; pp. 1–6. [Google Scholar]
  9. Yuan, L.; Lian, D.; Kang, X.; Chen, Y.; Zhai, K. Rolling bearing fault diagnosis based on convolutional neural network and support vector machine. IEEE Access 2020, 8, 137395–137406. [Google Scholar] [CrossRef]
  10. Ning, X.; Gong, K.; Li, W.; Zhang, L. JWSAA: Joint weak saliency and attention aware for person re-identification. Neurocomputing 2021, 453, 801–811. [Google Scholar] [CrossRef]
  11. Ning, X.; Duan, P.; Li, W.; Zhang, S. Real-time 3D face alignment using an encoder-decoder network with an efficient deconvolution layer. IEEE Signal Process. Lett. 2020, 27, 1944–1948. [Google Scholar] [CrossRef]
  12. Wang, L.H.; Zhao, X.P.; Wu, J.X.; Xie, Y.Y.; Zhang, Y.H. Motor fault diagnosis based on short-time Fourier transform and convolutional neural network. Chin. J. Mech. Eng. 2017, 30, 1357–1368. [Google Scholar] [CrossRef]
  13. Lei, Y.; Jia, F.; Lin, J.; Xing, S.; Ding, S.X. An intelligent fault diagnosis method using unsupervised feature learning towards mechanical big data. IEEE Trans. Ind. Electron. 2016, 63, 3137–3147. [Google Scholar] [CrossRef]
  14. Zhang, Y.; Xing, K.; Bai, R.; Sun, D.; Meng, Z. An enhanced convolutional neural network for bearing fault diagnosis based on time–frequency image. Measurement 2020, 157, 107667. [Google Scholar] [CrossRef]
  15. Yao, D.; Liu, H.; Yang, J.; Li, X. A lightweight neural network with strong robustness for bearing fault diagnosis. Measurement 2020, 159, 107756. [Google Scholar] [CrossRef]
  16. Hoang, D.T.; Kang, H.J. Rolling element bearing fault diagnosis using convolutional neural network and vibration image. Cogn. Syst. Res. 2019, 53, 42–50. [Google Scholar] [CrossRef]
  17. Zhao, D.; Wang, T.; Chu, F. Deep convolutional neural network based planet bearing fault classification. Comput. Ind. 2019, 107, 59–66. [Google Scholar] [CrossRef]
  18. Saucedo-Dorantes, J.J.; Arellano-Espitia, F.; Delgado-Prieto, M.; Osornio-Rios, R.A. Diagnosis Methodology Based on Deep Feature Learning for Fault Identification in Metallic, Hybrid and Ceramic Bearings. Sensors 2021, 21, 5832. [Google Scholar] [CrossRef] [PubMed]
  19. Gelman, L.; Patel, T.H.; Persin, G.; Murray, B.; Thomson, A. Novel technology based on the spectral kurtosis and wavelet transform for rolling bearing diagnosis. Int. J. Progn. Health Manag. 2013, 4. [Google Scholar] [CrossRef]
  20. Gelman, L.; Kırlangıç, A.S. Novel vibration structural health monitoring technology for deep foundation piles by non-stationary higher order frequency response function. Struct. Control. Health Monit. 2020, 27, e2526. [Google Scholar] [CrossRef]
  21. Gelman, L.; Chandra, N.H.; Kurosz, R.; Pellicano, F.; Barbieri, M.; Zippo, A. Novel spectral kurtosis technology for adaptive vibration condition monitoring of multi-stage gearboxes. Insight-Non-Destr. Test. Cond. Monit. 2016, 58, 409–416. [Google Scholar] [CrossRef]
  22. Abdel-Basset, M.; Mohamed, R.; Saber, S.; Askar, S.S.; Abouhawwash, M. Modified Flower Pollination Algorithm for Global Optimization. Mathematics 2021, 9, 1661. [Google Scholar] [CrossRef]
  23. Abdel-Basset, M.; Mohamed, R.; Abouhawwash, M.; Chang, V.; Askar, S.S. A Local Search-Based Generalized Normal Distribution Algorithm for Permutation Flow Shop Scheduling. Appl. Sci. 2021, 11, 4837. [Google Scholar] [CrossRef]
  24. Abdel-Basset, M.; Mohamed, R.; Abouhawwash, M.; Chakrabortty, R.K.; Ryan, M.J. A Simple and Effective Approach for Tackling the Permutation Flow Shop Scheduling Problem. Mathematics 2021, 9, 270. [Google Scholar] [CrossRef]
  25. Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 8–10 June 2015; pp. 1–9. [Google Scholar]
  26. Zamani, H.; Nadimi-Shahraki, M.H.; Gandomi, A.H. QANA: Quantum-based avian navigation optimizer algorithm. Eng. Appl. Artif. Intell. 2021, 104, 104314. [Google Scholar] [CrossRef]
  27. Nadimi-Shahraki, M.H.; Fatahi, A.; Zamani, H.; Mirjalili, S.; Abualigah, L.; Abd Elaziz, M. Migration-Based Moth-Flame Optimization Algorithm. Processes 2021, 9, 2276. [Google Scholar] [CrossRef]
  28. Chu, W.; Ho, P.S.; Li, W. An adaptive machine learning method based on finite element analysis for ultra low-k chip package design. IEEE Trans. Compon. Packag. Manuf. Technol. 2021, 11, 1435–1441. [Google Scholar] [CrossRef]
  29. Wu, Z.; Chu, W. Sampling strategy analysis of machine learning models for energy consumption prediction. In Proceedings of the 2021 IEEE 9th International Conference on Smart Energy Grid Engineering (SEGE), Oshawa, ON, Canada, 11–13 August 2021; pp. 77–81. [Google Scholar]
  30. Smith, W.A.; Randall, R.B. Rolling element bearing diagnostics using the Case Western Reserve University data: A benchmark study. Mech. Syst. Signal Process. 2015, 64, 100–131. [Google Scholar] [CrossRef]
  31. Boudiaf, A.; Moussaoui, A.; Dahane, A.; Atoui, I. A comparative study of various methods of bearing faults diagnosis using the case Western Reserve University data. J. Fail. Anal. Prev. 2016, 16, 271–284. [Google Scholar] [CrossRef]
  32. Zarei, J.; Poshtan, J. Bearing fault detection using wavelet packet transform of induction motor stator current. Tribol. Int. 2007, 40, 763–769. [Google Scholar] [CrossRef]
  33. Wu, J.D.; Liu, C.H. An expert system for fault diagnosis in internal combustion engines using wavelet packet transform and neural network. Expert Syst. Appl. 2009, 36, 4278–4286. [Google Scholar] [CrossRef]
  34. Yen, G.G.; Lin, K.C. Wavelet packet feature extraction for vibration monitoring. IEEE Trans. Ind. Electron. 2000, 47, 650–667. [Google Scholar] [CrossRef] [Green Version]
  35. Acharya, U.R.; Oh, S.L.; Hagiwara, Y.; Tan, J.H.; Adam, M.; Gertych, A.; San Tan, R. A deep convolutional neural network model to classify heartbeats. Comput. Biol. Med. 2017, 89, 389–396. [Google Scholar] [CrossRef] [PubMed]
  36. Prosvirin, A.; Kim, J.Y.; Kim, J.M. Bearing fault diagnosis based on convolutional neural networks with kurtogram representation of acoustic emission signals. In Advances in Computer Science and Ubiquitous Computing; Springer: Singapore, 2017; pp. 21–26. [Google Scholar]
  37. Zhang, W.; Li, C.; Peng, G.; Chen, Y.; Zhang, Z. A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load. Mech. Syst. Signal Process. 2018, 100, 439–453. [Google Scholar] [CrossRef]
  38. Xu, G.; Liu, M.; Jiang, Z.; Söffker, D.; Shen, W. Bearing fault diagnosis method based on deep convolutional neural network and random forest ensemble learning. Sensors 2019, 19, 1088. [Google Scholar] [CrossRef] [Green Version]
  39. Zhu, J.; Chen, N.; Peng, W. Estimation of bearing remaining useful life based on multiscale convolutional neural network. IEEE Trans. Ind. Electron. 2018, 66, 3208–3216. [Google Scholar] [CrossRef]
  40. Xiong, J.; Liu, X.; Zhu, X.; Zhu, H.; Li, H.; Zhang, Q. Semi-supervised fuzzy c-means clustering optimized by simulated annealing and genetic algorithm for fault diagnosis of bearings. IEEE Access 2020, 8, 181976–181987. [Google Scholar] [CrossRef]
  41. Yong, W.; Zhou, J.; Jahed Armaghani, D.; Tahir, M.M.; Tarinejad, R.; Pham, B.T.; Van Huynh, V. A new hybrid simulated annealing-based genetic programming technique to predict the ultimate bearing capacity of piles. Eng. Comput. 2020, 37, 1–17. [Google Scholar] [CrossRef]
  42. Dybała, J.; Zimroz, R. Rolling bearing diagnosing method based on empirical mode decomposition of machine vibration signal. Appl. Acoust. 2014, 77, 195–203. [Google Scholar] [CrossRef]
Figure 1. The experimental equipment.
Figure 1. The experimental equipment.
Sensors 22 01410 g001
Figure 2. Bearing physical map.
Figure 2. Bearing physical map.
Sensors 22 01410 g002
Figure 3. SWC algorithm flow chart.
Figure 3. SWC algorithm flow chart.
Sensors 22 01410 g003
Figure 4. Schematic diagram of the method presented in this paper. First, input the original signal with a length of 400, perform wavelet packet transformation, and obtain a spectrogram. The obtained spectrogram is then used as a training data set and sent to the convolutional neural network. Finally, the simulated annealing algorithm is used to adjust the parameters of the convolutional neural network.
Figure 4. Schematic diagram of the method presented in this paper. First, input the original signal with a length of 400, perform wavelet packet transformation, and obtain a spectrogram. The obtained spectrogram is then used as a training data set and sent to the convolutional neural network. Finally, the simulated annealing algorithm is used to adjust the parameters of the convolutional neural network.
Sensors 22 01410 g004
Figure 5. WPT decomposition tree structure [34].
Figure 5. WPT decomposition tree structure [34].
Sensors 22 01410 g005
Figure 6. Schematic diagram of wavelet packet transform.
Figure 6. Schematic diagram of wavelet packet transform.
Sensors 22 01410 g006
Figure 7. Timing signal spectrogram.
Figure 7. Timing signal spectrogram.
Sensors 22 01410 g007
Figure 8. Convolutional neural network structure diagram.
Figure 8. Convolutional neural network structure diagram.
Sensors 22 01410 g008
Figure 9. The relu function graph.
Figure 9. The relu function graph.
Sensors 22 01410 g009
Figure 10. Schematic diagram of pooling process.
Figure 10. Schematic diagram of pooling process.
Sensors 22 01410 g010
Figure 11. Comparison chart of four state bearings.
Figure 11. Comparison chart of four state bearings.
Sensors 22 01410 g011
Figure 12. Convolutional neural network structure used in the experiment (partial) [25].
Figure 12. Convolutional neural network structure used in the experiment (partial) [25].
Sensors 22 01410 g012
Figure 13. Flow chart of simulated annealing algorithm.
Figure 13. Flow chart of simulated annealing algorithm.
Sensors 22 01410 g013
Figure 14. Visualization of test data.
Figure 14. Visualization of test data.
Sensors 22 01410 g014
Figure 15. Comparative analysis of accuracy.
Figure 15. Comparative analysis of accuracy.
Sensors 22 01410 g015
Figure 16. Comparison of AUC of different bearing fault diagnosis methods.
Figure 16. Comparison of AUC of different bearing fault diagnosis methods.
Sensors 22 01410 g016
Table 1. Diagram of the experimental equipment.
Table 1. Diagram of the experimental equipment.
ClassNameSamplesDistributionDescription
1normal_FE_184622.2%normal, 1 hp load
2normal_DE_284622.2%normal, 2 hp load
3Inner_FE_1_0072125.56%0.007’’, 1 hp load
4Inner_DE_1_0072125.56%0.007’’, 1 hp load
5Ball_FE_1_0072115.53%0.007’’, 1 hp load
6Ball_DE_1_0072115.53%0.007’’, 1 hp load
7Outer_FE_1_0072125.56%0.007’’, 1 hp load
8Outer_DE_1_0072125.56%0.007’’, 1 hp load
9Inner_FE_1_0142125.56%0.014’’, 1 hp load
10Inner_FE_2_0142125.56%0.014’’, 2 hp load
11Inner_DE_2_0282125.56%0.028’’, 2 hp load
12Inner_DE_3_0282125.56%0.028’’, 3 hp load
Table 2. Experimental omparison. The experimental results come from the implementation of the existing methods. On the machine equipped with GTX 1660TI, the matlab neural network toolbox is used to conduct multiple experiments, and the average value is obtained to obtain the result.
Table 2. Experimental omparison. The experimental results come from the implementation of the existing methods. On the machine equipped with GTX 1660TI, the matlab neural network toolbox is used to conduct multiple experiments, and the average value is obtained to obtain the result.
ClassMethodSample NumberAccuracy(%)
1SVM100016.67
2BP100081.31
3EMD+CNN100066.76
4EMD+SVM100051.32
5EMD+BP100087.53
6WPT+CNN100092.13
7WPT+CNN+SA100097.12
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

He, F.; Ye, Q. A Bearing Fault Diagnosis Method Based on Wavelet Packet Transform and Convolutional Neural Network Optimized by Simulated Annealing Algorithm. Sensors 2022, 22, 1410. https://doi.org/10.3390/s22041410

AMA Style

He F, Ye Q. A Bearing Fault Diagnosis Method Based on Wavelet Packet Transform and Convolutional Neural Network Optimized by Simulated Annealing Algorithm. Sensors. 2022; 22(4):1410. https://doi.org/10.3390/s22041410

Chicago/Turabian Style

He, Feng, and Qing Ye. 2022. "A Bearing Fault Diagnosis Method Based on Wavelet Packet Transform and Convolutional Neural Network Optimized by Simulated Annealing Algorithm" Sensors 22, no. 4: 1410. https://doi.org/10.3390/s22041410

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop