Bearing Fault Diagnosis for Time-Varying System Using Vibration–Speed Fusion Network Based on Self-Attention and Sparse Feature Extraction
Abstract
:1. Introduction
- A data embedding layer is designed according to the characteristics of bearing vibration signals under time-varying speed, and the multihead attention mechanism of a transformer is fully used to extract bearing fault features under time-varying speed, so as to achieve high-dimensional feature fusion with speed pulse signals. Through adjusting parameters, the effectiveness of the fusion method proposed in this paper for bearing fault diagnosis under time-varying speed is proved in comparison with other deep learning models.
- The degree of influence of the speed fluctuation on the model classification is quantitatively analyzed, and the effectiveness of the model for bearing fault classification under the actual variable speed is verified by experimental data.
- From the perspective of model interpretability, the mechanism and principle of each module of the fusion model in extracting bearing time-domain signal features are analyzed, and the potential application of the deep learning model in multisensor signal fusion is explored.
2. Vibration–Speed Data Fusion Network
2.1. One-Dimensional Sequence Embedding Layer
2.2. Encoder Block Based on Multihead Attention
2.3. Sparse Autoencoder
2.4. Vibration–Speed Data Fusion Network Training Process
- Step 1: Install the vibration sensor and speed sensor at the motor drive end; the collected bearing vibration amplitude signal is marked as signal 1 and the direct tacho signal (typical square wave) is marked as signal 2, where d is the number of samples. In dividing the data, both sensors are sampled at the same frequency, so the samples intercepted by using a fixed-length sliding window for both signal 1 and signal 2 are the working signals in the same state. Meanwhile, the same random number seed is set for both signal 1 and signal 2 when disrupting the samples to ensure that the same operating condition features are extracted during the network fusion training. To augment the dataset, the samples are intercepted in the sliding window so that there is partial overlapping between adjacent samples.
- Step 2: After dividing the data set for two channels, the tacho pulse signal samples from signal 2 are first fed into the SAE network for pretraining. The samples at this stage do not contain labels because the unsupervised learning capability of the SAE can automatically extract the deep representations of the tacho pulses. After completing a set number of iterations, the SAE network has learned the potential spatial representations of the tacho pulse signals during the compression-recovery of the data, and finally all the hidden layer parameters that can represent these potential spatial representations are frozen for subsequent network fusion training.
- Step 3: The vibration samples from signal 1 are fed into the transformer network constructed in this paper based on 1D sequential signals, while the tacho pulse signal samples from signal 2 are fed into the SAE network with the hidden layer parameters frozen. Note that the hidden layer here is only taken as the SAE encoder layer, because the compressed features obtained after the coding layer are more helpful for the network fusion training. After signal 1 and signal 2 have completed forward propagation in their respective channels, the high-dimensional feature outputs and are obtained in the last linear layer of the two networks, respectively, where B is the number of batches of data and d is the vector dimension. These two sets of vectors are spliced to output vector . Finally, the cross-entropy calculation is done in the classification layer and the backpropagation of errors is performed. In the iterative training of the network, only the parameters of the input layer of the SAE network are updated. At the end of the training, results are finally obtained by inputting the testing dataset.
3. Dataset Introduction
3.1. SQV Bearing Dataset from Xi’an Jiaotong University
3.2. Bearing Dataset from the University of Ottawa
3.3. Data Processing
4. Experimental Results and Discussion
4.1. Effect of Network Model Parameters on Diagnostic Results
4.2. Comparisons with Other Network Models
4.3. Analysis of the Influence of Rotational Speed Fluctuation
4.4. Visual Interpretation of the Model
5. Conclusions
- Under the effects of the VSF-ST model’s position coding and multihead attention mechanism, a small and suitable segmentation length could maximize the performance of the model for bearing vibration signals under variable speed conditions. Increasing the number of heads and the number of layers of coding blocks could also increase the performance of the model, but under the comprehensive consideration of the computational complexity and the number of parameters, four heads and six layers of coding blocks could achieve the requirement of an accurate classification under variable speed conditions, with an average classification accuracy of 95.18% in the SQV dataset and 99.85% in the University of Ottawa dataset.
- The advantages and effectiveness of the proposed model in fault diagnosis were verified by experiments under the bearing “start-run-brake” and bearing time-varying speed conditions.
- Without signal processing, the VSF-ST network could directly learn useful vibration shock fragments in the fault signal by a layer-by-layer feature extraction, so as to classify the fault bearings from the health bearings and the fault bearings from each other. The fusion of high-dimensional features of bearing speed signals while extracting features of bearing vibration signals could further enhance the adaptivity of the network under variable speed conditions.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
SAE | Sparse autoencoder |
FFT | Fast Fourier transform |
CNN | Convolutional neural network |
VSF-ST | Vibration–speed data fusion network based on SAE and transformer |
MSA | Multihead self-attention |
GeLU | Gaussian error linear unit |
KL | Scatter calculation |
ReLU | Rectified linear unit |
MLP | Multilayer perceptron |
SQV | Spectra Quest varying speed dataset |
A | Acceleration |
D | Deceleration |
A-D | Acceleration and then deceleration |
D-A | Deceleration and then acceleration |
DCNN | Deep convolutional neural network |
LSTM | Long short-term memory |
DNN | Deep neural network |
References
- Lei, Y.; Jia, F.; Lin, J.; Xing, S.; Ding, S.X. An intelligent fault diagnosis method using unsupervised feature learning towards mechanical big data. IEEE Trans. Ind. Electron. 2016, 63, 3137–3147. [Google Scholar] [CrossRef]
- Lei, Y.G.; Lin, J.; He, Z.J.; Zuo, M.J. A review on empirical mode decomposition in fault diagnosis of rotating machinery. Mech. Syst. Signal. Process. 2013, 35, 108–126. [Google Scholar] [CrossRef]
- Randall, R.B.; Antoni, J. Rolling element bearing diagnostics-A tutorial. Mech. Syst. Signal. Process. 2011, 25, 485–520. [Google Scholar] [CrossRef]
- Lee, J.; Wu, F.; Zhao, W.; Ghaffari, M.; Liao, L.; Siegel, D. Prognostics and health management design for rotary machinery systems—Reviews, methodology and applications. Mech. Syst. Signal. Process. 2014, 42, 314–334. [Google Scholar] [CrossRef]
- Grieves, M.; Vickers, J. Digital twin: Mitigating unpredictable, undesirable emergent behavior in complex systems. In Transdisciplinary Perspectives on Complex Systems; Springer: Berlin/Heidelberg, Germany, 2017; pp. 85–113. [Google Scholar]
- Chen, J.; Pan, J.; Li, Z.; Zi, Y.; Chen, X. Generator bearing fault diagnosis for wind turbine via empirical wavelet transform using measured vibration signals. Renew. Energy 2016, 89, 80–92. [Google Scholar] [CrossRef]
- Ade, P.A.R.; Aghanim, N.; Arnaud, M.; Ashdown, M.; Aumont, J.; Baccigalupi, C.; Banday, A.J.; Barreiro, R.B.; Bartlett, J.G.; Bartolo, N.; et al. Planck 2015 results XIII. Cosmological parameters. Astron. Astrophys. 2016, 594, A13. [Google Scholar] [CrossRef] [Green Version]
- Shao, H.D.; Xia, M.; Han, G.J.; Zhang, Y.; Wan, J.F. Intelligent fault diagnosis of rotor-bearing system under varying working conditions with modified transfer convolutional neural network and thermal images. IEEE Trans. Industr. Inform. 2021, 17, 3488–3496. [Google Scholar] [CrossRef]
- Lu, S.L.; Yan, R.Q.; Liu, Y.B.; Wang, Q.J. Tacholess Speed Estimation in Order Tracking: A Review With Application to Rotating Machine Fault Diagnosis. IEEE Trans. Instrum. Meas. 2019, 68, 2315–2332. [Google Scholar] [CrossRef]
- Antoni, J. The infogram: Entropic evidence of the signature of repetitive transients. Mech. Syst. Signal. Process. 2016, 74, 73–94. [Google Scholar] [CrossRef]
- Cheng, Y.W.; Lin, M.X.; Wu, J.; Zhu, H.P.; Shao, X.Y. Intelligent fault diagnosis of rotating machinery based on continuous wavelet transform-local binary convolutional neural network. Knowl. Based Syst. 2021, 216, 106796. [Google Scholar] [CrossRef]
- Jin, Y.R.; Qin, C.J.; Huang, Y.X.; Liu, C.L. Actual bearing compound fault diagnosis based on active learning and decoupling attentional residual network. Measurement 2021, 173, 108500. [Google Scholar] [CrossRef]
- Kiranyaz, S.; Avci, O.; Abdeljaber, O.; Ince, T.; Gabbouj, M.; Inman, D.J. 1D convolutional neural networks and applications: A survey. Mech. Syst. Signal. Process. 2021, 151, 107398. [Google Scholar] [CrossRef]
- San Martin, G.; Droguett, E.L.; Meruane, V.; Moura, M.D. Deep variational auto-encoders: A promising tool for dimensionality reduction and ball bearing elements fault diagnosis. Struct. Health. Monit. 2019, 18, 1092–1128. [Google Scholar] [CrossRef]
- Wang, X.; Mao, D.X.; Li, X.D. Bearing fault diagnosis based on vibro-acoustic data fusion and 1D-CNN network. Measurement 2021, 173, 108518. [Google Scholar] [CrossRef]
- Zhang, L.; Lv, Y.; Huang, W.; Yi, C. Bearing fault diagnosis under various operation conditions using synchrosqueezing transform and improved two-dimensional convolutional neural network. Meas. Sci. Technol. 2022, 33, 085002. [Google Scholar] [CrossRef]
- Hasan, M.J.; Islam, M.M.; Kim, J.M. Acoustic spectral imaging and transfer learning for reliable bearing fault diagnosis under variable speed conditions. Measurement 2019, 138, 620–631. [Google Scholar] [CrossRef]
- Zhao, W.; Wang, Z.; Cai, W.; Zhang, Q.; Wang, J.; Du, W.; Yang, N.; He, X. Multiscale inverted residual convolutional neural network for intelligent diagnosis of bearings under variable load condition. Measurement 2022, 188, 110511. [Google Scholar] [CrossRef]
- He, Z.; Shao, H.; Ding, Z.; Jiang, H.; Cheng, J. Modified deep autoencoder driven by multisource parameters for fault transfer prognosis of aeroengine. IEEE Trans. Ind. Electron. 2021, 69, 845–855. [Google Scholar]
- Han, T.; Liu, C.; Yang, W.G.; Jiang, D.X. Learning transferable features in deep convolutional neural networks for diagnosing unseen machine conditions. ISA Trans. 2019, 93, 341–353. [Google Scholar] [CrossRef]
- An, Z.H.; Li, S.M.; Wang, J.R.; Xin, Y.; Xu, K. Generalization of deep neural network for bearing fault diagnosis under different working conditions using multiple kernel method. Neurocomputing 2019, 352, 42–53. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is all you need. NIPS 2017, 30. [Google Scholar]
- Jin, Y.; Hou, L.; Chen, Y. A Time Series Transformer based method for the rotating machinery fault diagnosis. Neurocomputing 2022, 494, 379–395. [Google Scholar] [CrossRef]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Cordonnier, J.B.; Loukas, A.; Jaggi, M. On the relationship between self-attention and convolutional layers. arXiv 2019, arXiv:1911.03584. [Google Scholar]
- Dubey, S.R.; Singh, S.K.; Chaudhuri, B.B. A Comprehensive Survey and Performance Analysis of Activation Functions in Deep Learning. arXiv 2021, arXiv:2109.14545. [Google Scholar]
- Liu, S.; Chen, J.; He, S.; Shi, Z.; Zhou, Z. Subspace Network with Shared Representation learning for intelligent fault diagnosis of machine under speed transient conditions with few samples. ISA Trans. 2021, 128, 531–544. [Google Scholar] [CrossRef]
- Shi, Z.; Chen, J.; Zi, Y.; Zhou, Z. A novel multitask adversarial network via redundant lifting for multicomponent intelligent fault detection under sharp speed variation. IEEE Trans. Instrum. Meas. 2021, 70, 1–10. [Google Scholar] [CrossRef]
- Huang, H.; Baddour, N. Bearing vibration data collected under time-varying rotational speed conditions. Data Brief. 2018, 21, 1745–1749. [Google Scholar] [CrossRef]
Dataset | Pretraining Phase | Fusion Training Phase | Testing Phase |
---|---|---|---|
Vibration signal (signal 1) | ✕ | √ | √ |
Speed signal (signal 2) | √ | √ | √ |
Label | ✕ | √ | ✕ |
SAE Architecture | Network Structure Parameters | Input Size | Output Size |
---|---|---|---|
Encoder–decoder | (1, 2048) | (1, 128) | |
Transformer architecture | |||
Linear embedding | Class token (1, 64) | (1, 2048) | (33, 64) |
layer | Position code (33, 64) | ||
Encoder block × 8 | (33, 64) | (33, 64) | |
Linear | (1, 64) | (1, 128) | |
Classification layer | (1, 256) | (1, n) |
Data Label | Fault Location | Damage Level Label | Damage Area (mm) | Damage Depth (mm) |
---|---|---|---|---|
IF-1 | Inner race | 1 | 4 | 0.5 |
IF-2 | Inner race | 2 | 8 | 2 |
IF-3 | Inner race | 3 | 12 | 4 |
OF-1 | Outer race | 1 | 4 | 0.5 |
OF-2 | Outer race | 2 | 8 | 2 |
OF-3 | Outer race | 3 | 12 | 4 |
NC-1 | Health | None | None | None |
Operating Condition Label | Variation of Bearing Speed | Fault Label |
---|---|---|
A | Acceleration | NC & IF & OF |
D | Deceleration | NC & IF & OF |
A-D | Acceleration and then deceleration | NC & IF & OF |
D-A | Deceleration and then acceleration | NC & IF & OF |
Label | Hyperparameters | Accuracy | FLOPs | Parameters Number | |||
---|---|---|---|---|---|---|---|
N | L/N | h | Depth | ||||
Baseline | 256 | 8 | 4 | 6 | 95.18% | 406.93 M | 1.59 M |
c | 128 | 16 | 95.18% | 205.27 M | 1.59 M | ||
64 | 32 | 94.85% | 104.85 M | 1.59 M | |||
32 | 64 | 94.47% | 53.72 M | 1.61 M | |||
16 | 128 | 94.21% | 28.12 M | 1.62 M | |||
8 | 256 | 93.78% | 15.69 M | 1.64 M | |||
4 | 512 | 92.81% | 9.56 M | 1.73 M | |||
2 | 1024 | 92.30% | 5.56 M | 1.86 M | |||
1 | 2048 | 91.45% | 4.21 M | 1.94 M | |||
b | 1 | 92.74% | 152.61 M | 0.71 M | |||
2 | 93.52% | 207.41 M | 0.81 M | ||||
3 | 94.67% | 268.16 M | 1.48 M | ||||
8 | 95.14% | 812.36 M | 1.78 M | ||||
a | 1 | 91.53% | 69.37 M | 0.31 M | |||
2 | 92.24% | 139.61 M | 0.54 M | ||||
3 | 93.41% | 275.83 M | 0.72 M | ||||
8 | 95.19% | 561.27 M | 2.38 M |
Model | Model Hyperparameters |
---|---|
DCNN | Conv1d; kernel numbers: 16, 32, 64, 128; kernel size: 1 × 15, 1 × 3, 1 × 3, 1 × 3 2nd layer maxpooling (2, 2), 4th layer maxpooling (4); linear: 128 × 4, 256, 128 |
LSTM | [LSTM (64, 64, tanh), dropout (0.1)] × 5; linear: 128, 128 |
DNN | Linear: 1024, 256, 256, 256, 128, 128; dropout (0.1) |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chi, F.; Yang, X.; Shao, S.; Zhang, Q. Bearing Fault Diagnosis for Time-Varying System Using Vibration–Speed Fusion Network Based on Self-Attention and Sparse Feature Extraction. Machines 2022, 10, 948. https://doi.org/10.3390/machines10100948
Chi F, Yang X, Shao S, Zhang Q. Bearing Fault Diagnosis for Time-Varying System Using Vibration–Speed Fusion Network Based on Self-Attention and Sparse Feature Extraction. Machines. 2022; 10(10):948. https://doi.org/10.3390/machines10100948
Chicago/Turabian StyleChi, Fulin, Xinyu Yang, Siyu Shao, and Qiang Zhang. 2022. "Bearing Fault Diagnosis for Time-Varying System Using Vibration–Speed Fusion Network Based on Self-Attention and Sparse Feature Extraction" Machines 10, no. 10: 948. https://doi.org/10.3390/machines10100948
APA StyleChi, F., Yang, X., Shao, S., & Zhang, Q. (2022). Bearing Fault Diagnosis for Time-Varying System Using Vibration–Speed Fusion Network Based on Self-Attention and Sparse Feature Extraction. Machines, 10(10), 948. https://doi.org/10.3390/machines10100948