Next Article in Journal
BODIPY-Based Nanomaterials—Sensing and Biomedical Applications
Previous Article in Journal
Effects of Magnetizing Yoke Design on the Magnetic Properties of Nd2Fe14B Permanent Magnet for Electric Vehicle Motor Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Deep Residual Network Combined with Transfer Learning Based Fault Diagnosis for Rolling Bearing

1
Key Laboratory of Conveyance and Equipment, East China Jiaotong University, Ministry of Education, Nanchang 330013, China
2
State Key Laboratory of Rail Transit Infrastructure Performance Monitoring and Guarantee, Nanchang 330013, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2022, 12(15), 7810; https://doi.org/10.3390/app12157810
Submission received: 3 July 2022 / Revised: 26 July 2022 / Accepted: 1 August 2022 / Published: 3 August 2022

Abstract

:
Fault diagnosis of rolling bearings is significant for mechanical equipment operation and maintenance. Presently, the deep convolutional neural network (CNN) is increasingly used for fault diagnosis of rolling bearings, but CNN has challenges with incomplete training and lengthy training times. This paper proposes a residual network combined with the transfer learning (ResNet-TL) based diagnosis method for rolling bearing, which can preprocess the one-dimensional data of vibration signals into image data. Then, the transfer learning theory in parameter transfer is applied to the training of the network model, and the ResNet34 network is pre-trained and re-trained; the image data are selected to be the inputs of the fault diagnosis model. The experimental validation of the rolling bearing fault dataset collected from the practical bench and Case Western Reserve University shows the superiority of the ResNet34-TL model compared with other classification models.

1. Introduction

Rolling bearing is one of the most frequently used parts in mechanical equipment; it will affect the production efficiency of large machinery, thus affecting the benefits of companies and the safety of employees. It is significant to find out the degree and type of faults in rolling bearings timely and effectively.
Traditional bearing fault diagnosis methods based on machine learning mainly include feature extraction and fault classification. Feature extraction is an essential step in fault diagnosis. Xue et al. [1] proposed a novel feature extraction method based on hierarchical analysis. Guo et al. [2] proposed a noise reduction method based on ensemble empirical mode decomposition (EEMD). Hou et al. [3] obtained the optimal features of EEMD by using some feature selection algorithms. Yang et al. [4] based on EEMD designed an effective frequency cut-off criteria and extracted the unknown fault features. Traditional bearing fault diagnosis technology based on machine learning has many requirements for technicians, and there is the risk of information loss [5].
Among the various types of research for rolling bearing fault diagnosis, the most practical and mature method is mechanical signal analysis because it can identify bearing faults by focusing on the intrinsic frequencies with tiny vibrations. Numerous scholars have concluded from their analyses that it is feasible to use current motor frequencies to detect bearing faults [6,7]. However, there are two difficulties in using motor current signals for fault diagnosis: One is that there is no clear signal band to observe bearing faults effectively, and the other is that signal strengths’ temporal characteristics as the deterioration progresses are unclear. Kanemaru et al. [8] diagnosed bearing degradation by monitoring both the bearing’s natural frequency and its rotational frequency and demonstrated the feasibility of using motor current signals for fault diagnosis through relevant motor experiments. Hoang et al. [9] presented a current signal-based bearing fault diagnosis using a Deep Learning (DL) algorithm CNN and a novel decision-level IF technique. The proposed bearing fault diagnosis method has suitable practical applications with cost-effective and relatively high diagnostic accuracy. Singh et al. [10] evaluated the detection of load machine bearing outer ring faults by efficiently extracting features of faulty components from the stator current using continuous wavelet transform. Schoen et al. [11] reported a relationship between vibration and current frequencies caused by faults in the bearings while using vibration monitoring and motor current signature analysis (MCSA) techniques. The above papers found by summarizing that the small amplitude (i.e., low energy) of the fault frequencies extracted from the faulty bearing stator currents can easily be mistaken as noise coming from the system, resulting in the poor diagnosis of bearing faults based on current signals.
Regarding the research on rolling bearing fault diagnosis, more and more scholars have started to experiment with other signals in recent years, such as stray magnetic flux signals, thermal imaging images, acoustic emission signals, etc. Harlişca et al. [12] used statistical analysis of stray flux signals around the motor to conduct fault diagnosis of bearings. Zamudio-Ramirez et al. [13] used an advanced algorithm based on a combination of the MUSIC method and artificial neural networks (ANN), which enables the automatic calculation of fault severity indicators when using stray flux signals. Azeez et al. [14] used a thermal imager to acquire data on rolling bearings and used the bearing temperature to distinguish the type of failure. Although more economical than the traditional signal acquisition method, experiments showed that the approach was more applicable to the outer race of the bearing. Choudhary et al. [15] proposed an emerging two-dimensional discrete wavelet transform (2D-DWT) based infrared thermography method to diagnose different bearing faults in induction motors. Pham et al. [16] used acoustic emission (AE) signals containing bearing health information. They converted AE into 2D spectrograms by Constant Q-Transform (CQT) and then used CNN to infer the bearing status. Chen et al. [17] used Fast Fourier Transforms (FFT) to preprocess AE signals and constructed a low-speed rolling bearing fault diagnosis model based on AE signals and subspace embedded feature distribution alignment. They concluded experimentally that AE signals outperformed vibration signals when rolling bearings were operating at low speeds.
The diagnosis of rolling bearing defects is mainly studied using vibration signal features. Nowadays, the application of deep learning based on 2D-image data to diagnosis methods has gained more attention; it solves the uncertainty problem of artificial feature extraction and promotes the automation of fault diagnosis. Oh et al. [18] produced research based on the Deep Belief Networks (DBN) and using vibration images to diagnose faults of the rotor system. Hoang et al. [19] converted the vibration signal into a 2D form and analyzed the identified flaws of the bearing through Convolutional Neural Networks (CNN) based on vibration images. Ma et al. [20] combined transfer learning with CNN to use 2D time-frequency image sets for bearing fault diagnosis and confirmed RGB images contained more information than grayscale images. Wen et al. [21] proposed a 2-level hierarchical diagnosis network to classify rotating machinery faults with the time-frequency images. Verstraete et al. [22] generated Short-time Fourier, Wavelet Transform, and Hilbert-Huang transform maps and proved the Wavelet Time-frequency image has the highest accuracy. The results of previous research show that the Wavelet Time-frequency transform is more suitable for the time-frequency transform of RGB images. Besides, the Wavelet Time-frequency image can reflect the details of the signal, so it is ideal for fault classification [23].
When using deep learning for fault diagnosis of rolling bearings, CNN models have limitations because they require amounts of labeled data for training. Nevertheless, fault samples are sometimes challenging to meet this condition [24]. In this case, the classifier model is at risk of overfitting and poor generalization, which transfer learning (TL) can solve. Chen et al. [25] used TL to deal with the problems of data imbalance and different distributions. Zhang et al. [26] proposed a method that combines CNN and TL for fault diagnosis. Wu et al. [27] suggested a fault detection and diagnosis method based on TL to overcome the fault data rareness and no label issues. Li et al. [28] proposed a novel fault diagnosis method for deep convolutional domain adversarial TL. Many studies have shown that TL can effectively improve the situation of the model being limited by data and solve the problems of low accuracy of classifier caused by insufficient samples and changes in data distribution [29,30].
The contribution of this paper is to propose a fault diagnosis framework with a high diagnostic rate. The framework uses transfer learning to improve the accuracy of the diagnostic network and converts 1D vibration data to 2D RGB image data before inputting it into the classification model to enhance the stability of the features. After pre-training the ResNet34 network, the time-frequency image set is fed into the ResNet34 network to learn the specific features of the fault data. Comparative experiments are conducted on the CWRU dataset and a laboratory dataset to compare ResNet34-TL with four other models (ResNet34, MobileNet-TL, MobileNet, and CNN) by the loss function value of the training set, the accuracy of the test set, and the accuracy of the validation set to diagnose the ResNet34-TL framework for detecting rolling bearing faults. The performance is analyzed in a detailed comparison.
This paper is organized as follows: the basic theory is given in Section 2. Section 3 presents the rolling bearings fault diagnosis method of ResNet based on transfer learning. Section 4 verifies the experiments and results of the proposed fault diagnosis method in the rolling bearing simulation test bench. Conclusions and future work are provided in Section 5.

2. The Basic Theory of the Model

2.1. Continuous Wavelet Transform Time-Frequency Image Conversion

The continuous wavelet transform (CWT) time-frequency image is a time-frequency representation of the energy density of a signal obtained using the continuous wavelet transform, which decomposes the original signal into time scales, represented by scaling and conversion operations [31]. The Morlet wavelet is used as the mother wavelet because the shape of the Morlet wavelet resembles the features of an impulse occurring in a machine fault.
Assume the input signal is x ( t ) , CWT was obtained by scaling x ( t ) with Morlet wavelet as shown in Equation (1).
W ψ ( a , b ) = x ( t ) ψ a , b ( t ) d t
where a is the wavelet translation, and b is the wavelet scale. To extract more time-frequency information in a limited space, the mother wavelet was used to generate a child wavelet, which is defined by Equation (2).
ψ a , b ( t ) = 1 a ψ t b a
This paper uses the sliding window method to truncate the 1D vibration signal and convert it to a 2D image signal. Before converting the vibration signal to the image signal, the vibration signal is truncated in units of 256 data points by the sliding window function and then the market wavelet basis function is used to generate a time-frequency image. However, the RGB image contains more information than the gray image, and the continuous wavelet transform generates RGB time-frequency images in this paper. The schematic diagram of the sliding window to intercept the signal and convert it into a time-frequency image is shown in Figure 1.

2.2. Convolutional Neural Network

As a standard deep learning algorithm, CNN has a deep feature extraction structure and can dig deeper features of the input features. Its network structure usually includes convolution, activation, pooling, and fully connected layers [32,33]. The convolution calculation equation is
y l i , j = K i l x l r j = j = 0 c 1 K i l j x l j + j
where K i l j is the j’-th weight in the i-th convolution kernel of the l-th layer, with x l j + j as the j’-th perceived weight position in the j-th perceived local region of the l-th layer, and c is the width of the convolution kernel.
The activation layer is to make the neural network better solve the indivisible linear problem. The pooling layer reduces the parameters of the network by maximum pooling, which is defined by Equation (4).
p l ( i , j ) = max ( j 1 ) w + 1 t j w a l ( i , t )
where a l ( i , t ) is the i-th feature map in the t-th activation value of the l-th layer, and w is the pooling width. The fully connected layer uses softmax as the output function, which is defined by Equation (5).
Z j l = f k = i n w i l x i l + b j l
Where w i l is the weight of the l-th layer, with b j l as the offset term of the l-th layer, Z j l is the j-th neuron of the l-th layer, and f is the softmax function when l is the output layer, with f is the Sigmoid function when l is the hidden layer.
The objective function is to explain the consistency between the output value and the target value, which is defined by Equation (6).
E = 1 m k = 1 m j p k j log q k j
where m is the number of samples in small batches, p k j is the theoretical output of networks, q k j is the actual output of the network, and j is the dimension of the data.

2.3. Deep Residual Network

The deep residual network (ResNet) model is a generalized convolutional neural network feature extractor that cleverly addresses the problems of deep learning models in deep learning that may suffer from gradient disappearance or bursting [34]. Besides, good results can be obtained even if the network structure of the deep residual network has a profound number of layers.
In contrast to ordinary deep networks, ResNet networks add the short-circuiting mechanism of residual units to ensure the complexity of the network layers, which is residual learning. The article [35] illustrates that the ResNet network can improve the accuracy by increasing the depth, while ResNet50 and ResNet101 deepen the network depth; however, the training error in the experiments of this paper is not much improved compared to ResNet34, and its training time is also longer. Figure 2 and Table 1 concluded the results obtained using the Qianpeng dataset from Section 4.1 for different depths of the ResNet network. The results show that ResNet34 has a lower training loss function, a higher correct diagnosis rate for fault types in the validation set, and faster and more accurate test results in the test set.
Therefore, the 34-layer ResNet is chosen in this paper, and its specific residual network parameters are shown in Table 2.

2.4. Transfer Learning Network

Parameter transfer is a kind of transfer learning. In this paper, transfer learning is used to transfer the parameters of the classification network to the convolutional layer, and the classification network can extract general features. Then, secondary training is performed in the last layers of the network to learn specific features [36]. Transfer learning is generally defined as:
Given a source domain D s and a target domain D t , the learning tasks are T s and T t , and the purpose of transfer learning is to use the knowledge in D s and T s to improve the target prediction function f t ( ) in D t , where D s D t , T s T t . The source domain D s and target domain D t are defined as:
D s = X , P ( X ) D t = Y , f ( )
where X is the feature space, and P(X) is the marginal probability distribution. Therefore, D s = ( x s 1 , y s 1 ) , ( x s 2 , y s 2 ) , , ( x s n , y s n ) can be obtained, and where data samples satisfy x s i X s , the corresponding class label satisfies y s i Y s . Similarly, D t can be represented by data samples X t and label space Y t . Network migration based on this theory can effectively improve computational efficiency [37,38].

3. Diagnostic Model

Aiming at the incomplete training problem of CNN, this paper proposes to convert vibration data signals into 2D time-frequency image signals and applies TL to augment the data. The intelligent diagnosis method of deep residual network based on migration learning is shown in Figure 3, and the main steps are described as follows:
  • Manually process the faults of three rolling bearings of different sizes, and use acceleration sensors to collect the vibration signals of rolling bearings at different rotational speeds;
  • The vibration signals are segmented with equal spacing of signal overlap, and the original vibration signals are reconstructed into time-frequency signals using CWT to obtain the 2D time-frequency images of the reconstructed signals.;
  • Import the pre-training weights into the classification model. Reasonably divide the training set, validation set, and test set, and use the ResNet-TL model based on these images for training and testing to realize the classification of different health states of rolling bearings under minor sample conditions.
  • Calculate the fault diagnosis results and visualize the test set accuracy of the ResNet-TL model.

4. Experimental Study

In this section, experiments are conducted on two data sets: one is the QianPeng experimental dataset, and the other is the public dataset from Case Western Reserve University. The two sets of experiments use the initial ResNet34, MobileNet, MobileNet-TL, and CNN as comparison methods in the trained cross-entropy loss function values, training accuracy, validation accuracy, confusion matrix of the test set, and training time to verify the superiority of ResNet34-TL. Among them, the initial learning rate of each network is 0.0005, the Batch Size is 32, and the number of iterations is 20, where the optimization method of a deep convolutional neural network is the Adam optimization algorithm. The hardware and software environments in which the experiments were conducted are shown in Table 3.

4.1. Classification of Qianpeng Datasets

(1) Dataset: The experimental data set in this section is obtained from the vibration signal data set collected by the QPZZ-II rotating machinery fault simulation experiment bench. The experimental platform and the faulty bearing are shown in Figure 4. The experiment uses a piezoelectric acceleration sensor model CA-YD-187 to pick up the bearing vibration signal. The acquisition system is based on the NI USB-4431 multifunctional high-precision data acquisition module from National Instruments, USA. The acceleration signal is first converted to an electrical signal by the acceleration sensor, then converted to a voltage signal by the charge amplifier, and converted to a digital signal by the data acquisition card. Finally, the code written by LabView software is linked to the data acquisition card for data acquisition and storage.
The experimental bearing speed is 1188 r/min, the sampling frequency is 12 kHz, and the loading of the tested machine is zero. The bearing fault states in the experiment are divided into seven kinds, which are normal bearings, rolling element faults of different fault sizes, inner ring faults of different fault sizes, and outer ring faults of different fault sizes, as shown in Table 4. Examples of transformed images are shown in Figure 5, which shows that the light and dark distributions of the converted images differ for different fault levels on the same fault type.
(2) Testing Accuracy: The image dataset is input to the trained network, the loss function of the model and the accuracy of the validation set are obtained after 20 iterations of training. The results are shown in Figure 6. The loss error of the network training shows that the ResNet34-TL model based on migration learning pre-training can drop rapidly at the beginning of the iteration and eventually stabilize. However, the loss function of the CNN network has been kept at a shallow value, and the accuracy from the validation set of the ResNet34-TL model shows higher and more stable accuracy. The already trained ResNet34-TL model performs well on training loss error and accuracy tests.
The trained models were used for intelligent classification of the test set. All models were trained ten times to prevent random uncertainty in models, and the average of the ten tests was used as the final diagnostic result. The results are shown in Table 5; the ResNet34-TL model has the highest test accuracy of 96.29%. In this section, the ResNet34-TL model is sufficient to solve the accurate diagnosis of rolling bearing faults with fewer samples during network training and with only 20 iterations, which has application value for sizeable rotating machinery production work. It is also evident from the training time that the proposed network structure can substantially improve the recognition rate of fault types without affecting the speed of the actual network operation. In addition, the ResNet34-TL model has the lowest false alarm rate and missed alarm rate, which is 6.67% and 0.00%, respectively, and the ResNet34-TL model certainly has advantages.
For more clarity in showing the superiority of the ResNet34-TL model, visualization of the test set data using the confusion matrix shows the accurate recognition rate of the ResNet34-TL model with the best training results for each type of fault, as shown in Figure 7. The horizontal direction represents the true labels, and the vertical direction represents the predicted labels. The 7 × 7 matrix is the number of samples of each type, and the percentage includes each type’s prediction accuracy rate, false alarm rate and missing alarm rate. The confusion matrix for each method shows that ResNet34-TL has the highest correct fault diagnosis rate for each type.

4.2. Classification of CWRU Datasets

(1) Dataset: In this section, experiments are conducted using a publicly available dataset from the Bearing Data Center of Case Western Reserve University [39] to verify the robustness of the ResNet34-TL model in terms of diagnostic performance. Its sampling frequency is 12 kHz, the speed is 1797 r/min, no load is applied in the radial direction, and the data of standard bearings and bearings with different types of faults are collected. The fault depth of each kind of faulty bearing is 0.280 mm, and the fault width is 0.178 mm and 0.533 mm. This section selects two fault sizes of the rolling element, two fault sizes of the inner ring, two fault sizes of the outer ring, and the normal condition. The specific dataset description is shown in Table 6. An example of converting the vibration signal collected by the CWRU rolling bearing test bench into a wavelet time-frequency image by overlapping sampling interception is shown in Figure 8.
(2) Testing Accuracy: Importing the CWRU image dataset converted using CWT into each classification model, the results are shown in Figure 9. It can be seen from the loss error of the training set of the network that the ResNet34-TL model can drop quickly at the beginning of the iteration and eventually stabilize. Moreover, the loss function of the ResNet34-TL model has decreased more quickly. On the other hand, the ResNet34-TL model has achieved more than 90% accuracy since the first generation, and the accuracy has reached 99% in the fourth generation. The accuracy of the diagnostic networks that have been improved by transfer learning is higher than the original diagnostic networks, indicating that the application of transfer learning has led to optimizing the networks.
The trained models are used for intelligent classification on the test set, and the average of the 10 test accuracies and pre-training times is taken as the final diagnosis result. The results are shown in Table 7, the ResNet34-TL model achieved 99.86% accuracy on the test set, but the required training time was the longest. Although the MobileNet-TL model performed well in terms of time and accuracy, the ResNet34-TL model performed more consistently and robustly in the experiments with different datasets, proving that the ResNet34-TL model is more stable and robust.
In addition, Figure 10 shows a confusion matrix plot of the best test results for all models. The confusion matrix results show that through 20 generations of training, each deep learning model has been able to classify each type of fault on the test set with a relatively high rate of correct classification. In contrast, the ResNet34-TL model can achieve 100% accuracy in each class.

5. Conclusions

This paper studies a ResNet-TL fault diagnosis of rolling bearing, the ResNet34-TL model is proved to be superior in the classification problem by the experimental, and the following conclusions can be obtained:
(1) Before fault diagnosis, CWT converts the vibration signal into the image signal, which is used to solve the instability problem of manual feature extraction;
(2) Applying the theory of parameter transfer in transfer learning to the ResNet model allows the network to improve fault diagnosis accuracy without significantly increasing the required training time;
(3) The proposed ResNet-TL model solves the problem of incomplete model training. Experiments prove that the final fault identification rate of the deep residual network rolling bearing fault diagnosis model based on migration learning is the best.
This study provides a solution for intelligent fault diagnosis of rotating machinery and demonstrates the potential of combining migration learning with other deep learning models in fault diagnosis. In future research, we will target the direction of the classification model after fusing the transfer learning models so that the network can significantly reduce the time used while achieving high diagnostic rates.

Author Contributions

X.Y. wrote the paper. J.L. experimented. J.Z. conceived and designed the experiment. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 51865010; Science and Technology Project of Jiangxi Provincial Department of Education, grant number GJJ210639.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are available from the authors upon request.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

TLtransfer learning
ResNetresidual network
ResNet-TLdeep residual network based on transfer learning
MobileNet-TLmobile network based on transfer learning
EEMDensemble empirical mode decomposition
CNNconvolutional neural networks
DBNdeep belief networks
RGBred-green-blue
CWTcontinuous wavelet transform
DLdeep learning
MCSAmotor current signature analysis
ANNartificial neural networks
2D-DWTtwo-dimensional discrete wavelet transform
CQTconstant Q-transform
FFTfast fourier transforms

References

  1. Xue, Q.; Xu, B.Y.; He, C.B.; Liu, F.; Ju, B.; Lu, S.; Liu, Y. Feature Extraction Using Hierarchical Dispersion Entropy for Rolling Bearing Fault Diagnosis. IEEE Trans. Instrum. Meas. 2021, 70, 1–11. [Google Scholar] [CrossRef]
  2. Guo, J.C.; Zhen, D.; Li, H.Y.; Shi, Z.; Gu, F.; Ball, A. Fault feature extraction for rolling element bearing diagnosis based on a multi-stage noise reduction method. Measurement 2019, 139, 226–235. [Google Scholar] [CrossRef] [Green Version]
  3. Hou, J.B.; Wu, Y.X.; Ahmad, A.S.; Gong, H.; Liu, L. A Novel Rolling Bearing Fault Diagnosis Method Based on Adaptive Feature Selection and Clustering. IEEE Access 2021, 9, 99756–99767. [Google Scholar] [CrossRef]
  4. Yang, J.H.; Huang, D.W.; Zhou, D.J.; Liu, H. Optimal IMF selection and unknown fault feature extraction for rolling bearings with different defect modes. Measurement 2020, 157, 107660. [Google Scholar] [CrossRef]
  5. Jiao, W.D.; Ding, X.M.; Yan, T.Y.; Yan, Y.Y. Approach for Fault Diagnosis Based on Pattern Match of Characteristic Waveforms in Frequency Domain. J. East China Jiaotong Univ. 2021, 38, 73–81. [Google Scholar]
  6. Sabir, R.; Rosato, D.; Hartmann, S.; Guehmann, C. LSTM Based Bearing Fault Diagnosis of Electrical Machines using Motor Current Signal. In Proceedings of the IEEE International Conference On Machine Learning And Applications, Boca Raton, FL, USA, 16–19 December 2019. [Google Scholar]
  7. Jiang, Z.; Gao, S.; Kong, Y.; Pennacchi, P.; Chu, F.; Han, Q. Ultra-compact triboelectric bearing based on a ribbon cage with applications for fault diagnosis of rotating machinery. Nano Energy 2022, 99, 107263. [Google Scholar] [CrossRef]
  8. Kanemaru, M.; Tsukima, M.; Miyauchi, T.; Hayashi, K. Bearing fault detection in induction machine based on stator current spectrum monitoring. IEEJ J. Ind. Appl. 2018, 7, 282–288. [Google Scholar] [CrossRef] [Green Version]
  9. Hoang, D.T.; Kang, H.J. A motor current signal-based bearing fault diagnosis using deep learning and information fusion. IEEE Trans. Instrum. Meas. 2019, 69, 3325–3333. [Google Scholar]
  10. Singh, S.; Kumar, N. Detection of bearing faults in mechanical systems using stator current monitoring. IEEE Trans. Ind. Inform. 2016, 13, 1341–1349. [Google Scholar] [CrossRef]
  11. Schoen, R.R.; Lin, B.K.; Habetler, T.G.; Schlag, J.; Farag, S. An unsupervised, on-line system for induction motor fault detection using stator current monitoring. IEEE Trans. Ind. Appl. 1995, 31, 1280–1286. [Google Scholar] [CrossRef]
  12. Harlişca, C.; Szabó, L.; Frosini, L.; Albini, A. Diagnosis of rolling bearings faults in electric machines through stray magnetic flux monitoring. In Proceedings of the 2013 8TH International Symposium on Advanced Topics in Electrical Engineering, Bucharest, Romania, 23–25 May 2013. [Google Scholar]
  13. Zamudio-Ramirez, I.; Ramirez-Núñez, J.A.; Antonino-Daviu, J.; Osornio-Rios, R.A.; Quijano-Lopez, A.; Razik, H.; Romero-Troncoso, R.J. Automatic diagnosis of electromechanical faults in induction motors based on the transient analysis of the stray flux via MUSIC methods. IEEE Trans. Ind. Appl. 2020, 56, 3604–3613. [Google Scholar] [CrossRef]
  14. Azeez, A.A.; Alkhedher, M.; Gadala, M.S. Thermal imaging fault detection for rolling element bearings. In Proceedings of the 2020 Advances in Science and Engineering Technology International Conferences, Dubai, United Arab Emirates, 4 February–9 April 2020; IEEE: Piscataway, NJ, USA, 2020. [Google Scholar]
  15. Choudhary, A.; Goyal, D.; Letha, S.S. Infrared thermography based fault diagnosis of induction motor bearings using machine learning. IEEE Sens. J. 2020, 21, 1727–1734. [Google Scholar] [CrossRef]
  16. Pham, M.T.; Kim, J.M.; Kim, C.H. Efficient fault diagnosis of rolling bearings using neural network architecture search and sharing weights. IEEE Access 2021, 9, 98800–98811. [Google Scholar] [CrossRef]
  17. Chen, R.; Tang, L.; Hu, X.; Wu, H. Fault diagnosis method of low-speed rolling bearing based on acoustic emission signal and subspace embedded feature distribution alignment. IEEE Trans. Ind. Inform. 2020, 17, 5402–5410. [Google Scholar] [CrossRef]
  18. Oh, H.; Jung, J.H.; Jeon, B.C.; Youn, B.D. Scalable and unsupervised feature engineering using vibration-imaging and deep learning for rotor system diagnosis. IEEE Trans. Ind. Electron. 2017, 65, 3539–3549. [Google Scholar] [CrossRef]
  19. Hoang, D.T.; Kang, H.J. Rolling element bearing fault diagnosis using convolutional neural network and vibration image. Cogn. Syst. Res. 2018, 53, 42–50. [Google Scholar] [CrossRef]
  20. Ma, P.; Zhang, H.L.; Fan, W.H.; Wang, C.; Wen, G.; Zhang, X. A novel bearing fault diagnosis method based on 2D image representation and transfer learning-convolutional neural network. Meas. Sci. Technol. 2019, 30, 055402. [Google Scholar] [CrossRef]
  21. Wen, L.; Li, X.; Gao, L. A new two-level hierarchical diagnosis network based on convolutional neural network. IEEE Trans. Instrum. Meas. 2019, 69, 330–338. [Google Scholar] [CrossRef]
  22. Verstraete, D.; Ferrada, A.; Droguett, E.L.; Meruane, V.; Modarres, M. Deep Learning Enabled Fault Diagnosis Using Time-Frequency Image Analysis of Rolling Element Bearings. Shock. Vib. 2017, 2017, 5067651. [Google Scholar] [CrossRef]
  23. Hsueh, Y.M.; Ittangihal, V.R.; Wu, W.B.; Chang, H.-C.; Kuo, C.-C. Fault Diagnosis System for Induction Motors by CNN Using Empirical Wavelet Transform. Symmetry 2019, 11, 1212. [Google Scholar] [CrossRef] [Green Version]
  24. Hoang, D.T.; Tran, X.T.; Van, M.; Kang, H.J. A Deep Neural Network-Based Feature Fusion for Bearing Fault Diagnosis. Sensors 2021, 21, 244. [Google Scholar] [CrossRef] [PubMed]
  25. Chen, W.; Qiu, Y.; Feng, Y.; Li, Y.; Kusiak, A. Diagnosis of wind turbine faults with transfer learning algorithms. Renew. Energy 2021, 163, 2053–2067. [Google Scholar] [CrossRef]
  26. Zhang, D.; Zhou, T. Deep convolutional neural network using transfer learning for fault diagnosis. IEEE Access 2021, 9, 43889–43897. [Google Scholar] [CrossRef]
  27. Wu, H.; Zhao, J. Fault detection and diagnosis based on transfer learning for multimode chemical processes. Comput. Chem. Eng. 2020, 135, 106731. [Google Scholar] [CrossRef]
  28. Li, F.; Tang, T.; Tang, B.; He, Q. Deep convolution domain-adversarial transfer learning for fault diagnosis of rolling bearings. Measurement 2021, 169, 108339. [Google Scholar] [CrossRef]
  29. Wang, X.; Shen, C.; Xia, M.; Wang, D.; Zhu, J.; Zhu, Z. Multi-scale deep intra-class transfer learning for bearing fault diagnosis. Reliab. Eng. Syst. Saf. 2020, 202, 107050. [Google Scholar] [CrossRef]
  30. Shao, J.; Huang, Z.; Zhu, J. Transfer learning method based on adversarial domain adaption for bearing fault diagnosis. IEEE Access 2020, 8, 119421–119430. [Google Scholar] [CrossRef]
  31. Zhang, Y.; Xing, K.; Bai, R.; Sun, D.; Meng, Z. An enhanced convolutional neural network for bearing fault diagnosis based on time–frequency image. Measurement 2020, 157, 107667. [Google Scholar] [CrossRef]
  32. Huang, T.; Zhang, Q.; Tang, X.; Zhao, S.; Lu, X. A novel fault diagnosis method based on CNN and LSTM and its application in fault diagnosis for complex systems. Artif. Intell. Rev. 2022, 55, 1289–1315. [Google Scholar] [CrossRef]
  33. Zhong, S.; Fu, S.; Lin, L. A novel gas turbine fault diagnosis method based on transfer learning with CNN. Measurement 2019, 137, 435–453. [Google Scholar] [CrossRef]
  34. McNeely-White, D.; Beveridge, J.R.; Draper, B.A. Inception and ResNet features are (almost) equivalent. Cogn. Syst. Res. 2020, 59, 312–318. [Google Scholar] [CrossRef]
  35. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27 June–30 June 2016. [Google Scholar]
  36. Zhuang, F.; Qi, Z.; Duan, K.; Xi, D.; Zhu, Y.; Zhu, H.; Xiong, H.; He, Q. A comprehensive survey on transfer learning. Proc. IEEE 2020, 109, 43–76. [Google Scholar] [CrossRef]
  37. Dagher, I.; Barbara, D. Facial age estimation using pre-trained CNN and transfer learning. Multimed. Tools Appl. 2021, 80, 20369–20380. [Google Scholar] [CrossRef]
  38. Xu, W.; Wan, Y.; Zuo, T.Y.; Sha, X.-M. Transfer learning based data feature transfer for fault diagnosis. IEEE Access 2020, 8, 76120–76129. [Google Scholar] [CrossRef]
  39. Smith, W.A.; Randall, R.B. Rolling element bearing diagnostics using the Case Western Reserve University data: A benchmark study. Mech. Syst. Signal Processing 2015, 64, 100–131. [Google Scholar] [CrossRef]
Figure 1. Schematic for generating RGB time-frequency diagrams.
Figure 1. Schematic for generating RGB time-frequency diagrams.
Applsci 12 07810 g001
Figure 2. Experimental results for ResNet models: (a) training set loss; (b) validation set accuracy.
Figure 2. Experimental results for ResNet models: (a) training set loss; (b) validation set accuracy.
Applsci 12 07810 g002
Figure 3. Intelligent diagnosis flow chart of rolling bearings based on deep residual network based on transfer learning.
Figure 3. Intelligent diagnosis flow chart of rolling bearings based on deep residual network based on transfer learning.
Applsci 12 07810 g003
Figure 4. Experimental platform: (a) The test bench; (b) Photographs of defective components under study.
Figure 4. Experimental platform: (a) The test bench; (b) Photographs of defective components under study.
Applsci 12 07810 g004
Figure 5. Transformed results.
Figure 5. Transformed results.
Applsci 12 07810 g005
Figure 6. Experimental results for all models: (a) training set loss; (b) validation set accuracy.
Figure 6. Experimental results for all models: (a) training set loss; (b) validation set accuracy.
Applsci 12 07810 g006
Figure 7. Confusion matrix charts of fault diagnosis results obtained by different approaches: (a) ResNet34-TL; (b) MobileNet-TL; (c) ResNet34; (d) MobileNet; (e) CNN.
Figure 7. Confusion matrix charts of fault diagnosis results obtained by different approaches: (a) ResNet34-TL; (b) MobileNet-TL; (c) ResNet34; (d) MobileNet; (e) CNN.
Applsci 12 07810 g007aApplsci 12 07810 g007b
Figure 8. Transformed result.
Figure 8. Transformed result.
Applsci 12 07810 g008
Figure 9. Experimental results for all models: (a) training set loss; (b) validation set accuracy.
Figure 9. Experimental results for all models: (a) training set loss; (b) validation set accuracy.
Applsci 12 07810 g009
Figure 10. Confusion matrix charts of fault diagnosis results obtained by different approaches: (a) ResNet34-TL; (b) MobileNet-TL; (c) ResNet34; (d) MobileNet; (e) CNN.
Figure 10. Confusion matrix charts of fault diagnosis results obtained by different approaches: (a) ResNet34-TL; (b) MobileNet-TL; (c) ResNet34; (d) MobileNet; (e) CNN.
Applsci 12 07810 g010aApplsci 12 07810 g010b
Table 1. Results of Resnet network with different depths.
Table 1. Results of Resnet network with different depths.
Network ModelTest Accuracy of the Test DataTime of Training (s)
ResNet340.87871016.93
ResNet500.84761115.12
ResNet1010.80951291.68
Table 2. Parameter table of each layer of deep residual network.
Table 2. Parameter table of each layer of deep residual network.
Layer NameOutput Size34-Layer
conv1112 × 1127 × 7, 64, stride2
conv2_x56 × 563 × 3, max pool, stride2
3 × 3 , 64 3 × 3 , 64 × 3
conv3_x28 × 28 3 × 3 , 128 3 × 3 , 128 × 4
conv4_x14 × 14 3 × 3 , 256 3 × 3 , 256 × 6
conv5_x7 × 7 3 × 3 , 512 3 × 3 , 512 × 3
1 × 1average pool, 1000-d fc, softmax
FLOPs 3.6 × 10 9
Table 3. Experimental environment description.
Table 3. Experimental environment description.
PartConfigured Version
hardwareCPUIntel Core i9-10900
GPUNVIDIA Quadro RTX 4000
Memory32G
softwareOperating Systemwindows10 64 bit
Python3.7.11
Pytorch1.7.1
CUDA10.2.89
Table 4. Introduction to image data sets.
Table 4. Introduction to image data sets.
Fault TypeFault Size (mm)Total Sample QtySamples in the Training SetSamples in the Validation SetSamples in the Test SetLabel
ball wear0.053002106030B00050
0.453002106030B00450
inner race wear0.053002106030IR00050
1.503002106030IR01500
outer race wear0.053002106030OR00050
1.503002106030OR01500
normal bearing03002106030NORMAL
Table 5. Average results for each model.
Table 5. Average results for each model.
ModelTest Set AccuracyTime of Training (s)False Alarm RateMissing Alarm Rate
ResNet34-TL96.29%1044.996.67% (2/30)0.00% (0/180)
MobileNet-TL93.05%762.996.67% (2/30)0.55% (1/180)
ResNet3487.87%1016.9313.33% (4/30)0.55% (1/180)
MobileNet73.62%783.8926.67% (8/30)0.55% (1/180)
CNN90.23%1022.4613.33% (4/30)0.00% (0/180)
Table 6. Introduction to image data sets.
Table 6. Introduction to image data sets.
Fault TypeFault Size (mm)Total Sample QtySamples in the Training SetSamples in the Validation SetSamples in the Test SetLabel
ball wear0.1783002106030B007
0.5333002106030B021
inner race wear0.1783002106030IR007
0.5333002106030IR021
outer race wear0.1783002106030OR007
0.5333002106030OR021
normal bearing03002106030NORMAL
Table 7. Average results for each model.
Table 7. Average results for each model.
ModelTest Set AccuracyTime of Training (s)False Alarm RateMissing Alarm Rate
ResNet34-TL99.86%1054.880.00% (0/30)0.00% (0/180)
MobileNet-TL99.52%752.150.00% (0/30)0.00% (0/180)
ResNet3497.48%999.050.00% (0/30)0.00% (0/180)
MobileNet93.00%756.350.00% (0/30)0.00% (0/180)
CNN97.86%1037.300.00% (0/30)0.00% (0/180)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhou, J.; Yang, X.; Li, J. Deep Residual Network Combined with Transfer Learning Based Fault Diagnosis for Rolling Bearing. Appl. Sci. 2022, 12, 7810. https://doi.org/10.3390/app12157810

AMA Style

Zhou J, Yang X, Li J. Deep Residual Network Combined with Transfer Learning Based Fault Diagnosis for Rolling Bearing. Applied Sciences. 2022; 12(15):7810. https://doi.org/10.3390/app12157810

Chicago/Turabian Style

Zhou, Jianmin, Xiaotong Yang, and Jiahui Li. 2022. "Deep Residual Network Combined with Transfer Learning Based Fault Diagnosis for Rolling Bearing" Applied Sciences 12, no. 15: 7810. https://doi.org/10.3390/app12157810

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop