A Time-Distributed Spatiotemporal Feature Learning Method for Machine Health Monitoring with Multi-Sensor Time Series
Abstract
:1. Introduction
- The ConvLSTM is first applied to extract spatiotemporal features of multi-sensor time series for real-time machine health monitoring tasks. It can learn both the complex temporal dependency and spatial dependency of multi-sensor time series, enabling the ConvLSTM to discover more hidden information than CNN and LSTM.
- The time-distributed structure is proposed to learn both short-term and long-term features of time series. Therefore, it can make full use of information on different time scales.
- The proposed end-to-end TDConvLSTM model directly works on raw time series data of multi-sensor and can automatically extract optimal discriminative features without any handcrafted features or expert experience. The time-distributed spatiotemporal feature learning method is not limited to a specific machine type or a fault type. Therefore, TDConvLSTM has wide applicability in MHM systems.
- The proposed model is suitable for multisensory scenario and achieves better performance in both time series classification tasks and regression prediction tasks than some state-of-the-art models, which has been verified in the gearbox fault diagnosis experiment and the tool wear prediction experiment.
2. Related Work
2.1. Machine Health Monitoring Based on CNN
2.2. Machine Health Monitoring Based on LSTM
2.3. Hybrid Models Based on CNN and LSTM for Machine Health Monitoring
3. Introduction of ConvLSTM
3.1. Convolutional Operation
3.2. From LSTM to ConvLSTM
4. Methods
4.1. Notation
4.2. The Proposed TDConvLSTM Model
4.2.1. Data Normalization and Segmentation
4.2.2. Time-Distributed Local Spatiotemporal Feature Extraction
4.2.3. Holistic Spatiotemporal Feature Extraction
4.2.4. Supervised Learning Layer
4.2.5. Batch Normalization
5. Experiments and Discussion
5.1. Case Study 1: Gearbox Fault Diagnosis
5.1.1. Data Collection
5.1.2. Parameters of the Proposed TDConvLSTM
5.1.3. Results and Discussion
5.1.4. Feature Visualization
5.2. Case Study 2: Tool Wear Monitoring
5.2.1. Experiment Setup and Data Description
5.2.2. Model Settings
5.2.3. Results and Discussion
6. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Xu, X.; Hua, Q. Industrial Big Data Analysis in Smart Factory: Current Status and Research Strategies. IEEE Access 2017, 5, 17543–17551. [Google Scholar] [CrossRef]
- Liu, R.; Meng, G.; Yang, B. Dislocated Time Series Convolutional Neural Architecture: An Intelligent Fault Diagnosis Approach for Electric Machine. IEEE Trans. Ind. Inform. 2017, 13, 1310–1320. [Google Scholar] [CrossRef]
- Shatnawi, Y.; Al-Khassaweneh, M. Fault diagnosis in internal combustion engines using extension neural network. IEEE Trans. Ind. Electron. 2014, 61, 1434–1443. [Google Scholar] [CrossRef]
- Chen, Z.; Li, W. Multisensor Feature Fusion for Bearing Fault Diagnosis Using Sparse Autoencoder and Deep Belief Network. IEEE Trans. Instrum. Meas. 2017, 66, 1693–1702. [Google Scholar] [CrossRef]
- Lei, Y.; Han, D.; Lin, J.; He, Z. Planetary gearbox fault diagnosis using an adaptive stochastic resonance method. Mech. Syst. Signal Process. 2013, 38, 113–124. [Google Scholar] [CrossRef]
- Jing, L.; Wang, T.; Zhao, M. An Adaptive Multi-Sensor Data Fusion Method Based on Deep Convolutional Neural Networks for Fault Diagnosis of Planetary Gearbox. Sensors 2017, 17, 414. [Google Scholar] [CrossRef] [PubMed]
- Zhao, R.; Wang, D.; Yan, R. Machine Health Monitoring Using Local Feature-based Gated Recurrent Unit Networks. IEEE Trans. Ind. Electron. 2018, 65, 1539–1548. [Google Scholar] [CrossRef]
- Uekita, M.; Takaya, Y. Tool condition monitoring for form milling of large parts by combining spindle motor current and acoustic emission signals. Int. J. Adv. Manuf. Technol. 2017, 89, 65–75. [Google Scholar] [CrossRef]
- Hinton, G.; Salakhutdinov, R.R. Reducing the dimensionality of data with neural networks. Science 2006, 313, 504–507. [Google Scholar] [CrossRef] [PubMed]
- Bengio, Y.; Delalleau, O. On the expressive power of deep architectures. In Proceedings of the 14th International Conference on Discovery Science, Espoo, Finland, 5–7 October 2011; Springer: Berlin, Germany, 2011. [Google Scholar]
- Lee, H.; Pham, P.; Largman, Y. Unsupervised feature learning for audio classification using convolutional deep belief networks. In Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada, 7–10 December 2009. [Google Scholar]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. In Proceedings of the Advances in Neural Information Processing Systems, Carson, NV, USA, 3–8 December 2012; pp. 1097–1105. [Google Scholar]
- Le, Q.V.; Zou, W.Y.; Yeung, S.Y. Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In Proceedings of the Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA, 20–25 June 2011. [Google Scholar]
- Singh, S.P.; Kumar, A.; Darbari, H. Building Machine Learning System with Deep Neural Network for Text Processing. In Proceedings of the International Conference on Information and Communication Technology for Intelligent Systems, Ahmedabad, India, 25–26 March 2017; Springer: Cham, Switzerland, 2017. [Google Scholar]
- Zhao, R.; Yan, R.; Chen, Z. Deep Learning and Its Applications to Machine Health Monitoring: A Survey. arXiv, 2016; arXiv:1612.07460. [Google Scholar]
- Zhao, G.; Zhang, G.; Ge, Q. Research advances in fault diagnosis and prognostic based on deep learning. In Proceedings of the Prognostics and System Health Management Conference, Harbin, China, 9–12 July 2017. [Google Scholar]
- Ince, T.; Kiranyaz, S.; Eren, L. Real-time motor fault detection by 1-d convolutional neural networks. IEEE Trans. Ind. Electron. 2016, 63, 7067–7075. [Google Scholar] [CrossRef]
- Abdeljaber, O.; Avci, O.; Kiranyaz, S. Real-time vibration-based structural damage detection using one dimensional convolutional neural networks. J. Sound Vib. 2017, 388, 154–170. [Google Scholar] [CrossRef]
- Babu, G.S.; Zhao, P.; Li, X.L. Deep convolutional neural network based regression approach for estimation of remaining useful life. In Proceedings of the International Conference on Database Systems for Advanced Applications, Dallas, TX, USA, 16–19 April 2016. [Google Scholar]
- Wielgosz, M.; Skoczeń, A.; Mertik, M. Using LSTM recurrent neural networks for monitoring the LHC superconducting magnets. Nuclear Instrum. Methods Phys. Res. Sect. A 2017, 867, 40–50. [Google Scholar] [CrossRef] [Green Version]
- Gers, F.A.; Schmidhuber, J.; Cummins, F. Learning to forget: Continual prediction with lstm. Neural Comput. 2000, 12, 2451–2471. [Google Scholar] [CrossRef] [PubMed]
- Zhao, R.; Wang, J.; Yan, R.; Mao, K. Machine health monitoring with LSTM networks. In Proceedings of the 10th International Conference on Sensing Technology (ICST), Nanjing, China, 11–13 November 2016. [Google Scholar]
- Shi, X.; Chen, Z.; Wang, H. Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting. In Proceedings of the 29th Annual Conference on Neural Information Processing Systems, Montreal, QC, Canada, 7–12 December 2015. [Google Scholar]
- Luo, W.; Liu, W.; Gao, S. Remembering history with convolutional LSTM for anomaly detection. In Proceedings of the IEEE International Conference on Multimedia and Expo, Hong Kong, China, 10–14 July 2017. [Google Scholar]
- Palaz, D.; Collobert, R. Analysis of cnn-based speech recognition system using raw speech as input. In Proceedings of the 16th Annual Conference of the International-Speech-Communication-Association, Dresden, Germany, 6–10 September 2015. [Google Scholar]
- Tra, V.; Kim, J.; Khan, S.A. Bearing Fault Diagnosis under Variable Speed Using Convolutional Neural Networks and the Stochastic Diagonal Levenberg-Marquardt Algorithm. Sensors 2017, 17, 2834. [Google Scholar] [CrossRef] [PubMed]
- Ding, X.; He, Q. Energy-Fluctuated Multiscale Feature Learning with Deep ConvNet for Intelligent Spindle Bearing Fault Diagnosis. IEEE Trans. Instrum. Meas. 2017, 66, 1926–1935. [Google Scholar] [CrossRef]
- Zhang, W.; Peng, G.; Li, C. Rolling Element Bearings Fault Intelligent Diagnosis Based on Convolutional Neural Networks Using Raw Sensing Signal. Smart Innovation Systems and Technologies. In Proceedings of the 12th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), Kaohsiung, Taiwan, 21–23 November 2016. [Google Scholar]
- Lee, K.B.; Cheon, S.; Chang, O.K. A Convolutional Neural Network for Fault Classification and Diagnosis in Semiconductor Manufacturing Processes. IEEE Trans. Semicond. Manuf. 2017, 30, 135–142. [Google Scholar] [CrossRef]
- Malhotra, P.; Ramakrishnan, A.; Anand, G. LSTM-based encoder-decoder for multi-sensor anomaly detection. arXiv, 2016; arXiv:1607.00148. [Google Scholar]
- Malhotra, P.; Vishnu, T.V.; Ramakrishnan, A. Multi-Sensor Prognostics using an Unsupervised Health Index based on LSTM Encoder-Decoder. arXiv 2016, arXiv:1608.06154. [Google Scholar]
- Bruin, T.D.; Verbert, K.; Babuška, R. Railway Track Circuit Fault Diagnosis Using Recurrent Neural Networks. IEEE Trans. Neural Netw. Learn. Syst. 2017, 28, 523–533. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Cai, M.; Liu, J. Maxout neurons for deep convolutional and LSTM neural networks in speech recognition. Speech Commun. 2016, 77, 53–64. [Google Scholar] [CrossRef]
- Kahou, S.E.; Michalski, V.; Konda, K. Recurrent Neural Networks for Emotion Recognition in Video. In Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, New York, NY, USA, 9–13 November 2015. [Google Scholar]
- Tsironi, E.; Barros, P.; Weber, C. An Analysis of Convolutional Long-Short Term Memory Recurrent Neural Networks for Gesture Recognition. Neurocomputing 2017, 268, 76–86. [Google Scholar] [CrossRef]
- Rad, N.M.; Kia, S.M.; Zarbo, C.; Jurman, G.; Venuti, P.; Furlanello, C. Stereotypical Motor Movement Detection in Dynamic Feature Space. In Proceedings of the IEEE International Conference on Data Mining Workshops, New Orleans, LA, USA, 18–21 November 2017. [Google Scholar]
- Rad, N.M.; Kia, S.M.; Zarbo, C.; Laarhoven, T.V.; Jurman, G.; Venuti, P.; Marchiori, E.; Furlanello, C. Deep learning for automatic stereotypical motor movement detection using wearable sensors in autism spectrum disorders. Signal Process. 2018, 144, 180–191. [Google Scholar] [CrossRef] [Green Version]
- Zhao, R.; Yan, R.; Wang, J. Learning to Monitor Machine Health with Convolutional Bi-Directional LSTM Networks. Sensors 2017, 17, 273. [Google Scholar] [CrossRef] [PubMed]
- Xiong, F.; Shi, X.; Yeung, D.Y. Spatiotemporal Modeling for Crowd Counting in Videos. In Proceedings of the 16th IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017. [Google Scholar]
- Qiu, Z.; Yao, T.; Mei, T. Learning Deep Spatio-Temporal Dependence for Semantic Video Segmentation. IEEE Trans. Multimedia 2017, 20, 939–949. [Google Scholar] [CrossRef]
- Understanding LSTM Networks. Available online: http://colah. github.io/posts/2015-08-Understanding-LSTMs/ (accessed on 26 May 2018).
- Zhang, Y.; Chan, W.; Jaitly, N. Very Deep Convolutional Networks for End-to-End Speech Recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017. [Google Scholar]
- Ioffe, S.; Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv, 2015; arXiv:1502.03167. [Google Scholar]
- Maaten, L.; Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605. [Google Scholar]
- Li, X.; Lim, B.; Zhou, J.; Huang, S.; Phua, S.; Shaw, K.; Er, M. Fuzzy neural network modelling for tool wear estimation in dry milling operation. In Proceedings of the Annual Conference of the Prognostics and Health Management Society, San Diego, CA, USA, 27–30 September 2009. [Google Scholar]
- Dozat, T. Incorporating nesterov momentum into adam. In Proceedings of the International Conference on Learning Representations, Caribe Hilton, San Juan, Puerto Rico, 2–4 May 2016. [Google Scholar]
Label | Condition | Description | Speed (rpm) |
---|---|---|---|
0 | FT | A root fracture tooth in the big gear | 280, 860 and 1450 |
1 | CT | A root crack tooth in the big gear | 280, 860 and 1450 |
2 | CTFT | A root crack tooth in the big gear and a half fracture tooth in the small gear | 280, 860 and 1450 |
3 | CIB | A crack on the inner race of the bearing | 280, 860 and 1450 |
No. | Layer Type | Kernel | Stride | Channel | BN Axis | Activation |
---|---|---|---|---|---|---|
1 | Local Convolution | (4,1) | (4,1) | 4 | 4 | sigmoid |
2 | Local ConvLSTM | (1,4) | (1,1) | 4 | 5 | tanh |
3 | Holistic ConvLSTM | (2,2) | (1,1) | 4 | 4 | tanh |
4 | FC layer | 100 | - | 1 | −1 | sigmoid |
5 | Supervised learning layer | 4 | - | 1 | - | softmax |
Model | Constant Rotation Speed | Nonstationary Rotation Speed | ||
---|---|---|---|---|
D1 | D2 | D3 | D4 | |
TDConvLSTM | 100% | 100% | 100% | 97.56% |
TDConvLSTM Without BN | 100% | 99.5% | 99.78% | 93.11% |
CNN-LSTM | 98.67% | 97% | 98.33% | 91.89% |
CNN | 96.83% | 99.5% | 98.17% | 86.78% |
LSTM | 96.67% | 99.83% | 100% | 80.94% |
EMD-SVM | 90.67% | 89.67% | 91.33% | 75.67% |
No. | LAYER TYPE | Kernel | Stride | Channel | BN Axis | Activation |
---|---|---|---|---|---|---|
1 | Local Convolution | (10,3) | (5,3) | 4 | 4 | ReLu |
2 | Local ConvLSTM | (2,2) | (1,1) | 4 | 5 | tanh |
3 | Holistic ConvLSTM | (4,4) | (1,1) | 1 | 4 | tanh |
4 | FC layer | 10 | - | 1 | −1 | ReLu |
5 | Supervised learning layer | 1 | - | 1 | - | linear |
Model | MAE | RMSE | ||||
---|---|---|---|---|---|---|
C4,C6/C1 1 | C1,C6/C4 2 | C1,C4/C6 3 | C4,C6/C1 | C1,C6/C4 | C1,C4/C6 | |
TDConvLSTM | 6.99 | 6.96 | 7.50 | 8.33 | 8.39 | 10.22 |
CNN-LSTM | 11.18 | 9.39 | 11.34 | 13.77 | 11.85 | 14.33 |
CNN | 15.32 | 14.34 | 17.36 | 18.50 | 18.80 | 21.85 |
LSTM | 19.09 | 16.00 | 22.61 | 21.42 | 17.78 | 25.81 |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Qiao, H.; Wang, T.; Wang, P.; Qiao, S.; Zhang, L. A Time-Distributed Spatiotemporal Feature Learning Method for Machine Health Monitoring with Multi-Sensor Time Series. Sensors 2018, 18, 2932. https://doi.org/10.3390/s18092932
Qiao H, Wang T, Wang P, Qiao S, Zhang L. A Time-Distributed Spatiotemporal Feature Learning Method for Machine Health Monitoring with Multi-Sensor Time Series. Sensors. 2018; 18(9):2932. https://doi.org/10.3390/s18092932
Chicago/Turabian StyleQiao, Huihui, Taiyong Wang, Peng Wang, Shibin Qiao, and Lan Zhang. 2018. "A Time-Distributed Spatiotemporal Feature Learning Method for Machine Health Monitoring with Multi-Sensor Time Series" Sensors 18, no. 9: 2932. https://doi.org/10.3390/s18092932
APA StyleQiao, H., Wang, T., Wang, P., Qiao, S., & Zhang, L. (2018). A Time-Distributed Spatiotemporal Feature Learning Method for Machine Health Monitoring with Multi-Sensor Time Series. Sensors, 18(9), 2932. https://doi.org/10.3390/s18092932