Markov-CVAELabeller: A Deep Learning Approach for the Labelling of Fault Data
Abstract
:1. Introduction
2. Literature Review
2.1. Application of VAE in Fault Diagnosis
2.2. Application of Data-Driven Methodologies for the Labelling of Fault Data
- The introduction of a time series imaging approach based on the first-order Markov chain model to extract the main features of the time series being analysed.
- The application of a convolutional variational autoencoder to extract discriminative features from the time series images.
- The consideration of clustering analysis through the employment of k-means to label the unlabelled fault data.
- The performance of fault identification by employing supervised learning based on the labelling results obtained from the Markov-CVAELabeller and the consideration of a convolutional neural network.
3. Methodology
- 1. Data Preparation and 2. Image Encoding. These steps aim to address the challenges that data may present. Furthermore, time series are encoded into images through the application of the first-order Markov chain so that image classifiers for fault classification can be employed.
- 3. Latent Space Representation. A convolutional variational autoencoder is introduced to obtain the latent space representation of the generated images. Thus, it is expected that discriminative features can be extracted from these time series images.
- 4. Clustering Analysis. k-means is implemented in order to identify the distinct clusters presented in the obtained latent space representation. Thus, the distinct instances can be labelled based on the nature of the fault.
- 5. Classification Analysis. As the instances are labelled through the application of Markov-CVAELabeller, which refers to the preceding three phases, the fault classification task can then be implemented. In this instance, a convolutional neural network is considered.
3.1. Data Preparation and Image Encoding
Algorithm 1. Estimation of the transition matrix through the application of the first-order Markov chain. |
Input: A collection of occurrences, x, indexed by time. Number of states, n, that will define the dimensions of the transition matrix . Output: Transition matrix, . 1. Equidistant states are created by following the subsequent steps: The range of values, r, is defined. The interval between states, k, is created by following the next equation: States, s, are generated. Starting from the minimum value, xmin, the k is added until the maximum value, xmax. 2. Data are assigned to a particular s. 3. The transition matrix P is estimated by considering the distinct transition probabilities, where each entry is . The probability indicates the probability that the previous state i is followed by the current state j. Thus, and satisfies |
3.2. Latent Space Representation
3.3. Clustering Analysis
- i.
- Centroids initialisation. The centroids of the distinct clusters are initialised at random.
- ii.
- Cluster assignment. Each instance is assigned to the nearest cluster. To determine how near an instance is to the cluster, the Euclidean distance between such an instance and the k centroids is determined.
- iii.
- New centroids computation. The new centroids are computed based on the instance assignments performed in step ii. To perform this, the mean of all the instances that pertain to a cluster is determined. This is computed for the k clusters.
- iv.
- Convergence. Steps ii and iii are implemented until the algorithm converges. The algorithm converges when change is no longer perceived with regards to the cluster assignment.
3.4. Classification Analysis
4. Results
- By applying the first-order Markov chain, it is assumed that the time series follows the Markov property. To address such a limitation, other time series imaging methods, such as recurrence plots and Gramian Angular Field approaches, can be studied as part of future work. Additionally, further analysis will be needed for the extension to incipient fault diagnosis in nonlinear systems [46].
- Distinct clusters that refer to the different faults need to be clearly represented in the latent space. Otherwise, the clustering analysis cannot be implemented, and thus the fault labels cannot be generated. Accordingly, alternatives should be explored in future work to address such a disadvantage. For instance, the application of other deep clustering methods may need to be considered.
- The pseudo-labels obtained from the application of the k-means in the latent space may contain errors, which are then utilised for the training of the classification model. Thus, the error may propagate during the performance of the fault diagnosis task, which can yield bias results. Therefore, label refinement and updating strategies need to be considered in future work to minimise mislabels during the labelling stage.
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Tian, Y.; Zhao, X.; Huang, W. Meta-learning approaches for learning-to-learn in deep learning: A survey. Neurocomputing 2022, 494, 203–223. [Google Scholar] [CrossRef]
- Stefanidou-Voziki, P.; Sapountzoglou, N.; Raison, B.; Dominguez-Garcia, J. A review of fault location and classification methods in distribution grids. Electr. Power Syst. Res. 2022, 209, 108031. [Google Scholar] [CrossRef]
- Wang, J.-G.; Cai, X.-Z.; Yao, Y.; Zhao, C.; Yang, B.-H.; Ma, S.-W.; Wang, S. Statistical process fault isolation using robust nonnegative garrote. J. Taiwan Inst. Chem. Eng. 2020, 107, 24–34. [Google Scholar] [CrossRef]
- Prasad, A.; Edward, J.B.; Ravi, K. A review on fault classification methodologies in power transmission systems: Part—I. J. Electr. Syst. Inf. Technol. 2018, 5, 48–60. [Google Scholar] [CrossRef]
- Galar, D.; Kumar, U. eMaintenance; Elsevier: Amsterdam, The Netherlands, 2017; pp. 235–310. [Google Scholar] [CrossRef]
- Lei, Y.; Yang, B.; Jiang, X.; Jia, F.; Li, N.; Nandi, A.K. Applications of machine learning to machine fault diagnosis: A review and roadmap. Mech. Syst. Signal Process. 2020, 138, 106587. [Google Scholar] [CrossRef]
- Zhou, X.; Liu, H.; Pourpanah, F.; Zeng, T.; Wang, X. A survey on epistemic (model) uncertainty in supervised learning: Recent advances and applications. Neurocomputing 2022, 489, 449–465. [Google Scholar] [CrossRef]
- Gilanifar, M.; Wang, H.; Cordova, J.; Ozguven, E.E.; Strasser, T.I.; Arghandeh, R. Fault classification in power distribution systems based on limited labeled data using multi-task latent structure learning. Sustain. Cities Soc. 2021, 73, 103094. [Google Scholar] [CrossRef]
- Velasco-Gallego, C.; De Maya, B.N.; Molina, C.M.; Lazakis, I.; Mateo, N.C. Recent advancements in data-driven methodologies for the fault diagnosis and prognosis of marine systems: A systematic review. Ocean Eng. 2023, 284, 115277. [Google Scholar] [CrossRef]
- Dong, X.; Zhang, C.; Liu, H.; Wang, D.; Chen, Y.; Wang, T. A new cross-domain bearing fault diagnosis method with few samples under different working conditions. J. Manuf. Process. 2025, 135, 359–374. [Google Scholar] [CrossRef]
- El Ghaly, A. Hybrid ML Algorithm for Fault Classification in Transmission Lines Using Multi-Target Ensemble Classifier with Limited Data. Eng 2025, 6, 4. [Google Scholar] [CrossRef]
- Dong, S.; Meng, Y.; Yin, S.; Liu, X. Tool wear state recognition study based on an MTF and a vision transformer with a Kolmogorov-Arnold network. Mech. Syst. Signal Process. 2025, 228, 112473. [Google Scholar] [CrossRef]
- Abidi, A.; Ienco, D.; Ben Abbes, A.; Farah, I.R. Combining 2D encoding and convolutional neural network to enhance land cover mapping from Satellite Image Time Series. Eng. Appl. Artif. Intell. 2023, 122, 106152. [Google Scholar] [CrossRef]
- Ma, J.; Wang, H. Anomaly detection in sensor data via encoding time series into images. J. King Saud Univ. Comput. Inf. Sci. 2024, 36, 102232. [Google Scholar] [CrossRef]
- Yu, T.; Li, S.; Lu, J.; Gong, S. Order Track Variational Stacked Autoencoder Fault Diagnosis Model for Complex Working Conditions. In Proceedings of the 2022 Global Reliability and Prognostics and Health Management (PHM-Yantai), Yantai, China, 13–16 October 2022; pp. 1–8. [Google Scholar]
- Yan, X.; Xu, Y.; She, D.; Zhang, W. Reliable Fault Diagnosis of Bearings Using an Optimized Stacked Variational Denoising Auto-Encoder. Entropy 2021, 24, 36. [Google Scholar] [CrossRef]
- She, B.; Wang, X. A hidden feature label propagation method based on deep convolution variational autoencoder for fault diagnosis. Meas. Sci. Technol. 2022, 33, 055107. [Google Scholar] [CrossRef]
- Yan, X.; She, D.; Xu, Y. Deep order-wavelet convolutional variational autoencoder for fault identification of rolling bearing under fluctuating speed conditions. Expert Syst. Appl. 2022, 216, 119479. [Google Scholar] [CrossRef]
- Xiao, Y.; Feng, K.; Miao, D.; Zhang, P.; Yang, J. A State-Supervised Model and Novel Anomaly Index for Gas Turbines Blade Fault Detection Under Multi-Operating Conditions. IEEE Access 2025, 13, 14225–14238. [Google Scholar] [CrossRef]
- Luo, Q.; Chen, J.; Zi, Y.; Chang, Y.; Feng, Y. Multi-mode non-Gaussian variational autoencoder network with missing sources for anomaly detection of complex electromechanical equipment. ISA Trans. 2022, 134, 144–158. [Google Scholar] [CrossRef]
- Chang, S.J.; Kwon, G.-Y. Anomaly Detection for Shielded Cable Including Cable Joint Using a Deep Learning Approach. IEEE Trans. Instrum. Meas. 2023, 72, 4025. [Google Scholar] [CrossRef]
- Velasco-Gallego, C.; Lazakis, I. RADIS: A real-time anomaly detection intelligent system for fault diagnosis of marine machinery. Expert Syst. Appl. 2022, 204, 117634. [Google Scholar] [CrossRef]
- Ye, S.; Zhang, F. Unsupervised anomaly detection for multilevel converters based on wavelet transform and variational autoencoders. In Proceedings of the 2022 IEEE Energy Conversion Congress and Exposition (ECCE), Detroit, MI, USA, 9–13 October 2022; pp. 1–6. [Google Scholar]
- Temraz, M.; Keane, M.T. Solving the class imbalance problem using a counterfactual method for data augmentation. Mach. Learn. Appl. 2022, 9, 100375. [Google Scholar] [CrossRef]
- Rathore, M.S.; Harsha, S.P. Non-linear Vibration Response Analysis of Rolling Bearing for Data Augmentation and Characterization. J. Vib. Eng. Technol. 2022, 11, 2109–2131. [Google Scholar] [CrossRef]
- Vidal, J.F.; Castro, A.R.G. Diagnosing Faults in Power Transformers with Variational Autoencoder, Genetic Programming, and Neural Network. IEEE Access 2023, 11, 30529–30545. [Google Scholar] [CrossRef]
- Che, C.; Wang, H.; Lin, R.; Ni, X. Deep meta-learning and variational autoencoder for coupling fault diagnosis of rolling bearing under variable working conditions. Proc. Inst. Mech. Eng. C J. Mech. Eng. Sci. 2022, 236, 9900–9913. [Google Scholar] [CrossRef]
- Liu, Y.; Jiang, H.; Wang, Y.; Wu, Z.; Liu, S. A conditional variational autoencoding generative adversarial networks with self-modulation for rolling bearing fault diagnosis. Measurement 2022, 192, 110888. [Google Scholar] [CrossRef]
- Zhang, L.; Zhang, H.; Cai, G. The Multiclass Fault Diagnosis of Wind Turbine Bearing Based on Multisource Signal Fusion and Deep Learning Generative Model. IEEE Trans. Instrum. Meas. 2022, 71, 8483. [Google Scholar] [CrossRef]
- Li, T.; Fung, C.-H.; Wong, H.-T.; Chan, T.-L.; Hu, H. Functional Subspace Variational Autoencoder for Domain-Adaptive Fault Diagnosis. Mathematics 2023, 11, 2910. [Google Scholar] [CrossRef]
- Li, Z.; Jiang, H.; Wang, X. A novel reinforcement learning agent for rotating machinery fault diagnosis with data augmentation. Reliab. Eng. Syst. Saf. 2024, 253, 110570. [Google Scholar] [CrossRef]
- Yao, H.; Xu, Y.; Guo, Q.; Chen, S.; Lu, B.; Huang, Y. Study on transformer fault diagnosisbased on improved deep residual shrinkage network and optimized residual variational autoencoder. Energy Rep. 2025, 13, 1608–1619. [Google Scholar] [CrossRef]
- Wang, C.; Xin, C.; Xu, Z.; Qin, M.; He, M. Mix-VAEs: A novel multisensor information fusion model for intelligent fault diagnosis. Neurocomputing 2022, 492, 234–244. [Google Scholar] [CrossRef]
- Dewangan, G.; Maurya, S. Fault Diagnosis of Machines Using Deep Convolutional Beta-Variational Autoencoder. IEEE Trans. Artif. Intell. 2021, 3, 287–296. [Google Scholar] [CrossRef]
- Xiong, M.; Wu, Y.; Li, C.; Yang, Z. Rolling Bearing Fault Diagnosis Based on Multi-Modal Variational Autoencoders. In Proceedings of the 2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD), Harbin, China, 30 November–2 December 2022; pp. 1–5. [Google Scholar]
- Yang, X.; Cao, X.; Zhao, J.; Zhang, X.; Duan, Y.; Shi, L. Intelligent fault diagnosis method for rotating machinery based on sample multire presentation information fusion under limited labeled samples conditions. Measurement 2025, 250, 117164. [Google Scholar] [CrossRef]
- Zhao, Z.; Jiao, Y.; Xu, Y.; Chen, Z.; Zio, E. A fault diagnosis framework using unlabeled data based on automatic clustering with meta-learning. Eng. Appl. Artif. Intell. 2024, 139, 109584. [Google Scholar] [CrossRef]
- Fazli, A.; Poshtan, J. Wind turbine fault prognosis using SCADA measurements, pre-fault labeling, and KNN classifiers robust against data imbalance. Measurement 2024, 243, 116202. [Google Scholar] [CrossRef]
- Li, X.; Cheng, C.; Peng, Z. Label-guided contrastive learning with weighted pseudo-labeling: A novel mechanical fault diagnosis method with insufficient annotated data. Reliab. Eng. Syst. Saf. 2024, 254, 110597. [Google Scholar] [CrossRef]
- Zhao, Z.; Jiao, Y.; Xu, Y.; Chen, Z.; Zhao, R. Smeta-LU: A self-supervised meta-learning fault diagnosis method for rotating machinery based on label updating. Adv. Eng. Inform. 2024, 62, 102875. [Google Scholar] [CrossRef]
- Ibrahim, R.; Zemouri, R.; Kedjar, B.; Merkhouf, A.; Tahan, A.; Al-Haddad, K.; Lafleur, F. Non-invasive Detection of Rotor Inter-turn Short Circuit of a Hydrogenerator Using AI-Based Variational Autoencoder. IEEE Trans. Ind. Appl. 2023, 60, 28–37. [Google Scholar] [CrossRef]
- Han, P.; Ellefsen, A.L.; Li, G.; Holmeset, F.T.; Zhang, H. Fault Detection With LSTM-Based Variational Autoencoder for Maritime Components. IEEE Sens. J. 2021, 21, 21903–21912. [Google Scholar] [CrossRef]
- Helwig, N.; Pignanelli, E.; Schutze, A. D8. 1–Detecting and Compensating Sensor Faults in a Hydraulic Condition Monitoring System. In Proceedings of the AMA Conferences 2015—SENSOR 2015 and IRS 2015, Nuremberg, Germany, 19–21 May 2015; pp. 641–646. [Google Scholar]
- Helwig, N.; Pignanelli, E.; Schütze, A. Condition monitoring of a complex hydraulic system using multivariate statistics. In Proceedings of the 2015 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) Proceedings, Pisa, Italy, 11–14 May 2015; pp. 210–215. [Google Scholar] [CrossRef]
- Li, J.; Yao, L. GPLA-12: An Acoustic Signal Dataset of Gas Pipeline Leakage. arXiv 2021, arXiv:2106.10277. [Google Scholar]
- Safaeipour, H.; Forouzanfar, M.; Puig, V.; Birgani, P.T. Incipient fault diagnosis and trend prediction in nonlinear closed-loop systems with Gaussian and non-Gaussian noise. Comput. Chem. Eng. 2023, 177, 108348. [Google Scholar] [CrossRef]
Number of convolutional layers in encoder: | 1 |
Number and size of kernels: | 32 of size 3 × 3 |
Type of pooling layer: | Max. pooling layer |
Latent dimensions: | 2 |
Number of convolutional layers: | 2 |
Number and size of kernels: | 32, 64 of size 3 × 3 |
Type of pooling layer and dimensions: | Max. pooling layer; 2 × 2 dimensions |
Number of fully connected layers: | 3 |
Hidden units in each fully connected layer: | 32, 64, 120 |
Fully connected layer activation function | relu |
Markov-CVAELabeller | |
---|---|
Accuracy | 0.92 ± 0.01 |
Precision | 0.84 ± 0.03 |
Recall | 1.00 ± 0.00 |
Markov-CVAELabeller-CNN | CNN | |
---|---|---|
Accuracy | 0.97 ± 0.01 | 0.99 ± 0.02 |
Precision | 0.84 ± 0.05 | 1.00 ± 0.00 |
Recall | 1.00 ± 0.00 | 0.94 ± 0.08 |
Class | Gas Pressure | Microphone | Environmental Noise |
---|---|---|---|
0 | 0.2 MPa | 1 | No |
1 | 0.2 MPa | 1 | Yes |
2 | 0.4 MPa | 1 | Yes |
3 | 0.5 MPa | 1 | No |
4 | 0.5 MPa | 2 | Yes |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Velasco-Gallego, C.; Cubo-Mateo, N. Markov-CVAELabeller: A Deep Learning Approach for the Labelling of Fault Data. Informatics 2025, 12, 35. https://doi.org/10.3390/informatics12020035
Velasco-Gallego C, Cubo-Mateo N. Markov-CVAELabeller: A Deep Learning Approach for the Labelling of Fault Data. Informatics. 2025; 12(2):35. https://doi.org/10.3390/informatics12020035
Chicago/Turabian StyleVelasco-Gallego, Christian, and Nieves Cubo-Mateo. 2025. "Markov-CVAELabeller: A Deep Learning Approach for the Labelling of Fault Data" Informatics 12, no. 2: 35. https://doi.org/10.3390/informatics12020035
APA StyleVelasco-Gallego, C., & Cubo-Mateo, N. (2025). Markov-CVAELabeller: A Deep Learning Approach for the Labelling of Fault Data. Informatics, 12(2), 35. https://doi.org/10.3390/informatics12020035