A Hybrid Machine Learning Framework for Early Fault Detection in Power Transformers Using PSO and DMO Algorithms
Abstract
:1. Introduction
Research Contributions
- Hybrid Metaheuristic Optimization: The integration of PSO and DMO improves feature selection and classifier hyperparameter tuning, ensuring a robust classification.
- Computational Efficiency: The proposed DMO-based approach converges faster than conventional PSO, reducing computational overhead.
- Interpretability vs. Deep Learning: Unlike black-box CNN-based methods, our approach is interpretable and requires less computational power.
- Real-World Deployment: The proposed method is adaptable for SCADA-based fault detection systems, making it suitable for online monitoring.
2. Related Work
Comparison with Deep Learning-Based Methods
- A lower computational complexity than CNN-based methods.
- A better generalization for small to medium-sized datasets.
- Explainability in terms of selected features and classifier decisions.
3. Materials and Methods
3.1. Experimental Setup
- Data Acquisition (DAQ) System: Used for real-time measurement and data recording.
- Current Transformers (CTs): For fault current measurement.
- Switch Relays and Protection Resistance: Implemented for fault simulation.
- LabVIEW Interface: Used for real-time monitoring and data visualization.
- Transformer Specifications: A 5 kVA, 416 V/240 V three-phase transformer was used for fault simulations.
- Fault Simulation Unit: A controlled switching mechanism was employed to introduce faults, including line-to-ground, line-to-line, and turn-to-turn faults.
- Data Acquisition System (DAQ): A NI cDAQ-9178 system with an NI-9205 analog input module was used for real-time current and voltage measurement.
- Current and Voltage Sensors: High-precision LEM LA 55-P current sensors and LV 25-P voltage sensors were utilized for accurate signal acquisition.
- Software Integration: The setup was interfaced with LabVIEW for real-time monitoring, signal logging, and data processing.
3.2. Dataset Description
3.3. Fault Simulation and Data Collection
3.4. Data Processing and Feature Extraction
4. Research Methodology
4.1. Feature Extraction
- ○
- Time–frequency characteristics are derived from the acquired signals using Discrete Wavelet Transform (DWT) and Matching Pursuit (MP) techniques.
- ○
- These techniques enhance the detection of transient fault characteristics that are often missed by traditional feature extraction methods.
4.1.1. Discrete Wavelet Transform (DWT)
4.1.2. Multiresolution Analysis
- High-frequency components are referred to as detail coefficients (D).
- Low-frequency components are referred to as approximation coefficients (A), which provide a better frequency resolution.
4.1.3. Decomposition Using Discrete Wavelet Transform (DWT)
Justification for DWT Selection
- A lower computational cost than CWT.
- A more concise feature representation than SWT.
- Proven success in transformer signal processing tasks.
4.1.4. Matching Pursuit (MP)
4.1.5. Implementation of the Orthogonal Matching Pursuit Algorithm
4.2. Feature Selection Algorithms
- ○
- Particle Swarm Optimization (PSO) and Dwarf Mongoose Optimization (DMO) are employed for feature selection and hyperparameter tuning.
- ○
- The hybrid use of these two optimization methods ensures an efficient selection of the most relevant features while optimizing model parameters for an improved classification accuracy.
4.2.1. Particle Swarm Optimization (PSO)
- (a)
- Initialization begins with creating a population of particles, each assigned a random velocity. The fitness of each particle is then evaluated to determine solution quality. The particle with the best performance is designated as the global best, while each particle retains its own best-known position as the local best.
- (b)
- During each iteration, particles move through the search space based on an updated velocity. This velocity is influenced by two main components: the global best position, representing the swarm’s top-performing solution, and the particle’s own local best position. Together, these guide the particle’s trajectory toward more optimal regions of the solution space.
- (c)
- Position Update: After calculating the updated velocity, each particle’s position is adjusted accordingly, allowing it to move through the search space. The updated velocity, which governs this movement, is determined using a specific mathematical formulation (referenced as Equation (15) [28]).
- (d)
- The update on the global and local best values: If a particle achieves an improved performance based on the newly adjusted values, both its global and local best positions are updated. The process for determining and updating the local best for each particle follows the criteria outlined in Equation (16).
- (e)
- Termination Check: If the specified stopping criteria are met, the current global best position is accepted as the final optimal solution. If not, the algorithm returns to the velocity update phase to proceed with further optimization.
4.2.2. The DMOA Model
Population Initialization
Alpha Group
Scout Group
The Babysitters
4.3. Classification Techniques
- ○
- The optimized feature sets are classified using (DT), (RF), and (SVM).
- ○
- A 5-fold cross-validation strategy is used to validate performance, ensuring robust and generalizable results.
4.3.1. Decision Tree
4.3.2. Support Vector Machine (SVM)
4.4. Random Forest (RF)
4.5. Performance Evaluation Metrics
- True Positives (TP): Faulty cases correctly predicted as faulty.
- True Negatives (TN): Healthy cases correctly predicted as healthy.
- False Positives (FP): Healthy cases incorrectly predicted as faulty (Type I error).
- False Negatives (FN): Faulty cases incorrectly predicted as healthy (Type II error).
- Sensitivity (Recall): Measures how effectively the model identifies faulty transformers.
- Specificity: Indicates the model’s accuracy in correctly classifying healthy transformers.
- Precision: Denotes the proportion of true fault detections among all instances predicted as faulty.
- Negative Predictive Value (NPV): Represents the share of truly healthy cases among those predicted as non-faulty.
- Accuracy: Reflects the overall proportion of correctly classified samples, including both healthy and faulty categories.
- F1 Score: Calculates the harmonic mean of precision and recall, offering a balanced metric, which is especially useful for datasets with a class imbalance.
5. Results and Discussion
5.1. Performance Evaluation
5.2. Comparative Analysis of Optimization Techniques
5.3. Practical Implications
- An Integration with SCADA Systems: The model can be deployed in Supervisory Control and Data Acquisition (SCADA) systems to enable automated, real-time fault monitoring.
- The Handling of Sensor Constraints: Industrial transformers are exposed to sensor noise and environmental disturbances; robust pre-processing techniques (e.g., wavelet denoising) can enhance signal reliability.
- An Adaptability to Different Transformer Types: The model’s adaptive feature selection ensures that it generalizes well across different transformer sizes and configurations.
- The Reducing of False Alarms: By incorporating advanced metaheuristic optimization, the framework minimizes misclassification rates, ensuring a higher protection reliability.
5.4. Real-World Applicability and Industrial Integration
- Sensor Noise and Environmental Interference: Addressed using pre-processing and denoising techniques.
- Varying Load Conditions: The model is trained on diverse operating scenarios to improve generalization.
- Integration with Protection Systems: The classifier output can support real-time decision-making in SCADA environments.
- Latency and Computational Efficiency: DMO enables a faster convergence, supporting real-time deployment.
- Adaptability: The proposed framework can be adjusted for different transformer ratings and topologies.
6. Conclusions
- An effective hybrid metaheuristic approach integrating PSO and DMO for optimal feature selection.
- A robust experimental validation setup, ensuring a realistic fault simulation.
- A better computational efficiency than deep learning-based methods, making it practical for industrial deployment.
Limitations and Future Work
- The scalability for high-voltage transformers remains to be explored.
- Real-time implementation using embedded hardware (FPGA and IoT) should be tested.
- A hybridization with deep learning techniques could be considered for further improvements.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Alenezi, M.; Anayi, F.; Packianather, M.; Shouran, M. Enhancing Transformer Protection: A Machine Learning Framework for Early Fault Detection. Sustainability 2024, 16, 10759. [Google Scholar] [CrossRef]
- Maseko, N.S.; Thango, B.A.; Mabunda, N. Fault Detection in Power Transformers Using Frequency Response Analysis and Machine Learning Models. Appl. Sci. 2025, 15, 2406. [Google Scholar] [CrossRef]
- Abbasi, A.R. Fault detection and diagnosis in power transformers: A comprehensive review and classification of publications and methods. Electr. Power Syst. Res. 2022, 209, 107990. [Google Scholar] [CrossRef]
- Ahmadi, H.; Vahidi, B.; Nematollahi, A.F. A simple method to detect internal and external short-circuit faults, classify and locate different internal faults in transformers. Electr. Eng. 2021, 103, 825–836. [Google Scholar] [CrossRef]
- Et, V.K.S. A Review of Various Protection Schemes Of Power Transformers. Turk. J. Comput. Math. Educ. (TURCOMAT) 2021, 12, 3220–3228. [Google Scholar] [CrossRef]
- Ekojono; Prasojo, R.A.; Apriyani, M.E.; Rahmanto, A.N. Investigation on machine learning algorithms to support transformer dissolved gas analysis fault identification. Electr. Eng. 2022, 104, 3037–3047. [Google Scholar] [CrossRef]
- Guezzaz, A.; Asimi, Y.; Azrour, M.; Asimi, A. Mathematical validation of proposed machine learning classifier for heterogeneous traffic and anomaly detection. Big Data Min. Anal. 2021, 4, 18–24. [Google Scholar] [CrossRef]
- Charbuty, B.; Abdulazeez, A. Classification Based on Decision Tree Algorithm for Machine Learning. J. Appl. Sci. Technol. Trends 2021, 2, 20–28. [Google Scholar] [CrossRef]
- Khajavi, H.; Rastgoo, A. Predicting the carbon dioxide emission caused by road transport using a Random Forest (RF) model combined by Meta-Heuristic Algorithms. Sustain. Cities Soc. 2023, 93, 104503. [Google Scholar] [CrossRef]
- Dabiri, H.; Farhangi, V.; Moradi, M.J.; Zadehmohamad, M.; Karakouzian, M. Applications of Decision Tree and Random Forest as Tree-Based Machine Learning Techniques for Analyzing the Ultimate Strain of Spliced and Non-Spliced Reinforcement Bars. Appl. Sci. 2022, 12, 4851. [Google Scholar] [CrossRef]
- Ghosh, A.; Maiti, R. Soil erosion susceptibility assessment using logistic regression, decision tree and random forest: Study on the Mayurakshi river basin of Eastern India. Environ. Earth Sci. 2021, 80, 328. [Google Scholar] [CrossRef]
- Alhijawi, B.; Awajan, A. Genetic Algorithms: Theory, Genetic Operators, Solutions, and Applications; Springer Science and Business Media Deutschland GmbH: Berlin/Heidelberg, Germany, 2024. [Google Scholar] [CrossRef]
- Liu, Y.; As’arry, A.; Hassan, M.K.; Hairuddin, A.A.; Mohamad, H. Review of the grey wolf optimization algorithm: Variants and applications. Neural Comput. Appl. 2024, 36, 2713–2735. [Google Scholar] [CrossRef]
- Ahmad, M.F.; Isa, N.A.M.; Lim, W.H.; Ang, K.M. Differential evolution: A recent review based on state-of-the-art works. Alex. Eng. J. 2022, 61, 3831–3872. [Google Scholar] [CrossRef]
- Indrawati, A.; Wahyuni, I.N. Enhancing Machine Learning Models through Hyperparameter Optimization with Particle Swarm Optimization. In Proceedings of the 2023 10th International Conference on Computer, Control, Informatics and its Applications: Exploring the Power of Data: Leveraging Information to Drive Digital Innovation, IC3INA 2023, Bandung, Indonesia, 4–5 October 2023; pp. 244–249. [Google Scholar] [CrossRef]
- Mohamed, N.; Almutairi, R.L.; Abdelrahim, S.; Alharbi, R.; Alhomayani, F.M.; Elamin Elnaim, B.M.; Elhag, A.A.; Dhakal, R. Automated Laryngeal Cancer Detection and Classification Using Dwarf Mongoose Optimization Algorithm with Deep Learning. Cancers 2024, 16, 181. [Google Scholar] [CrossRef]
- Thomas, J.B.; Chaudhari, S.G.; Shihabudheen, K.V.; Verma, N.K. CNN-Based Transformer Model for Fault Detection in Power System Networks. IEEE Trans. Instrum. Meas. 2023, 72, 3238059. [Google Scholar] [CrossRef]
- Lu, S.; Gao, W.; Hong, C.; Sun, Y. A newly-designed fault diagnostic method for transformers via improved empirical wavelet transform and kernel extreme learning machine. Adv. Eng. Inform. 2021, 49, 101320. [Google Scholar] [CrossRef]
- Cao, H.; Zhou, C.; Meng, Y.; Shen, J.; Xie, X. Advancement in transformer fault diagnosis technology. Front. Energy Res. 2024, 12, 1437614. [Google Scholar] [CrossRef]
- Shang, H.; Liu, Z.; Wei, Y.; Zhang, S. A Novel Fault Diagnosis Method for a Power Transformer Based on Multi-Scale Approximate Entropy and Optimized Convolutional Networks. Entropy 2024, 26, 186. [Google Scholar] [CrossRef]
- Li, Q.; Luo, H.; Cheng, H.; Deng, Y.; Sun, W.; Li, W.; Liu, Z. Incipient Fault Detection in Power Distribution System: A Time-Frequency Embedded Deep Learning Based Approach. IEEE Trans. Instrum. Meas. 2023, 72, 2507914. [Google Scholar] [CrossRef]
- Cui, M.; Wang, Y. Fault detection of rotating machinery based on wavelet transform and improved deep neural network. In Proceedings of the 2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS), Liuzhou, China, 20–22 November 2020; pp. 449–454. [Google Scholar] [CrossRef]
- Li, G.; Wei, M.; Shao, H.; Liang, P.; Duan, C. Wavelet Knowledge-Driven Transformer for Intelligent Machinery Fault Detection with Zero-Fault Samples. IEEE Sens. J. 2024, 24, 35986–35996. [Google Scholar] [CrossRef]
- Ibrahim, A.; Anayi, F.; Packianather, M.; Alomari, O.A. New Hybrid Invasive Weed Optimization and Machine Learning Approach for Fault Detection. Energies 2022, 15, 1488. [Google Scholar] [CrossRef]
- Pati, Y.C.; Rezaiifar, R.; Krishnaprasad, P.S. Orthogonal matching pursuit: Recursive function approximation with applications to wavelet decomposition. In Proceedings of the 27th Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 1–3 November 1993; Volume 1, pp. 40–44. [Google Scholar] [CrossRef]
- Huang, L.; Asteris, P.G.; Koopialipoor, M.; Armaghani, D.J.; Tahir, M.M. Invasive weed optimization technique-based ANN to the prediction of rock tensile strength. Appl. Sci. 2019, 9, 5372. [Google Scholar] [CrossRef]
- Eberhart, R.; Kennedy, J. A new optimizer using particle swarm theory. In Proceedings of the MHS’95, Sixth International Symposium on Micro Machine and Human Science, Nagoya, Japan, 4–6 October 1995; pp. 39–43. [Google Scholar] [CrossRef]
- Kennedy, J.; Eberhart, R. Particle Swarm Optimization. Proceedings of ICNN’95—International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995. [Google Scholar] [CrossRef]
- Al Dawsari, S.; Anayi, F.; Packianather, M. Techno-economic analysis of hybrid renewable energy systems for cost reduction and reliability improvement using dwarf mongoose optimization algorithm. Energy 2024, 313, 133653. [Google Scholar] [CrossRef]
- Agushaka, J.O.; Ezugwu, A.E.; Abualigah, L. Dwarf Mongoose Optimization Algorithm. Comput. Methods Appl. Mech. Eng. 2022, 391, 114570. [Google Scholar] [CrossRef]
- Mitu, M.M.; Arefin, S.; Saurav, Z.; Hasan, M.A.; Farid, D.M. Pruning-Based Ensemble Tree for Multi-Class Classification. In Proceedings of the 6th International Conference on Electrical Engineering and Information and Communication Technology, ICEEICT 2024, Dhaka, Bangladesh, 2–4 May 2024; pp. 481–486. [Google Scholar] [CrossRef]
- Quinlan, J.R. Induction of Decision Trees. Mach Learn 1986, 1, 81–106. [Google Scholar] [CrossRef]
- Kotsiantis, S.B. Decision trees: A recent overview. Artif. Intell. Rev. 2013, 39, 261–283. [Google Scholar] [CrossRef]
- Cortes, C.; Vapnik, V.; Saitta, L. Support-Vector Networks Editor; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1995. [Google Scholar]
- Iqbal, G.M.; Rosenberger, J.; Rosenberger, M.; Ha, L.; Anoruo, E.; Gregory, S.; Mazzone, T. A Support Vector Machine based Logistic Regression Model Approach for Classification Problems. In Proceedings of the IISE Annual Conference and Expo 2023, New Orleans, LA, USA, 21–23 May 2023. [Google Scholar] [CrossRef]
- Chauhan, V.K.; Dahiya, K.; Sharma, A. Problem formulations and solvers in linear SVM: A review. Artif. Intell. Rev. 2019, 52, 803–855. [Google Scholar] [CrossRef]
- Chen, X.; Ji, N.; Qin, X.; Zhang, M.; Chen, X.; Jiang, C.; Tao, K. Transformer Fault Diagnosis Based on the Improved Sparrow Search Algorithm and Random Forest Feature Selection. In Proceedings of the 2024 3rd International Conference on Energy and Electrical Power Systems, ICEEPS 2024, Guangzhou, China, 14–16 July 2024; pp. 1086–1091. [Google Scholar] [CrossRef]
- Khatib, T.; Arar, G. Identification of Power Transformer Currents by Using Random Forest and Boosting Techniques. Math. Probl. Eng. 2020, 2020, 1269367. [Google Scholar] [CrossRef]
- Shah, A.M.; Bhalja, B.R. Fault discrimination scheme for power transformer using random forest technique. IET Gener. Transm. Distrib. 2016, 10, 1431–1439. [Google Scholar] [CrossRef]
Methodology | Feature Selection | Optimization Technique | Classifier Used | Performance | Ref. |
---|---|---|---|---|---|
CNN-Based | Manual Feature Extraction | None | CNN | 92.5% Accuracy | [17] |
Hybrid ML (DWT + ML) | PCA | GA | SVM | 94.1% Accuracy | [18] |
Bio-Inspired Approach | Wrapper-Based | GWO | RF | 96.3% Accuracy | [19] |
DWT + ML | PSO-DMO Hybrid Selection | PSO + DMO | RF + SVM + DT | 98.33% Accuracy | This Study |
Method | Feature Extraction | Optimization | Model Complexity | Accuracy | Ref. |
---|---|---|---|---|---|
CNN-Based | Raw Signal Processing | None | High | 96.5% | [20] |
LSTM-Based | Time-Series Analysis | None | Very High | 97.2% | [21] |
Authors’ Approach | DWT + MP | PSO + DMO | Medium | 98.33% | This Study |
Fault Type | Description |
---|---|
Single Line-to-Ground (SLG) Fault | A single phase contacts the ground, leading to an unbalanced current. |
Line-to-Line (LL) Fault | A short circuit between two phases, increasing current flow and heating. |
Turn-to-Ground (TG) Fault | A winding turn contacts the ground, affecting insulation integrity. |
Turn-to-Turn (TT) Fault | Short circuit within the same winding, leading to localized heating and damage. |
External Faults | Faults outside the transformer affecting secondary protection mechanisms. |
Dictionary Parameter | Description |
---|---|
sym4-lev5 | A five-level symmetric wavelet transform characterized by four vanishing moments was applied. |
wpsym4-lev5 | Wavelet packet transform using symmetric wavelets with five levels and four vanishing moments |
Dct | Discrete Cosine Transform |
Sin | Sine-based sub-dictionary |
Cos | Cosine-based sub-dictionary |
Metric | Formula |
---|---|
Sensitivity (Recall) | |
Specificity | |
Precision | |
F1 Score | |
Accuracy | |
Negative Predictive Value (NPV) |
Model | Accuracy | Precision | Sensitivity (Recall) | F1 Score | Specificity | Negative Predictive Value (NPV) |
---|---|---|---|---|---|---|
PSO-DT | 95.00% | 97.72% | 96.25% | 96.98% | 88.75% | 82.56% |
PSO-RF | 97.71% | 98.02% | 99.25% | 98.63% | 90.00% | 96.00% |
PSO-SVM | 94.58% | 95.61% | 98.00% | 96.79% | 77.50% | 88.57% |
Model | Accuracy | Precision | Sensitivity (Recall) | F1 Score | Specificity | Negative Predictive Value (NPV) |
---|---|---|---|---|---|---|
DMO-DT | 96.04% | 95.90% | 99.50% | 97.67% | 78.75% | 96.92% |
DMO-RF | 98.33% | 98.80% | 99.28% | 99.04% | 92.31% | 95.24% |
DMO-SVM | 93.75% | 95.57% | 97.00% | 96.28% | 77.50% | 83.78% |
Optimization Method | Best Accuracy (%) | Iterations to Optimal Solution |
---|---|---|
PSO | 97.71 | 85 |
DMO | 98.33 | 60 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alenezi, M.; Anayi, F.; Packianather, M.; Shouran, M. A Hybrid Machine Learning Framework for Early Fault Detection in Power Transformers Using PSO and DMO Algorithms. Energies 2025, 18, 2024. https://doi.org/10.3390/en18082024
Alenezi M, Anayi F, Packianather M, Shouran M. A Hybrid Machine Learning Framework for Early Fault Detection in Power Transformers Using PSO and DMO Algorithms. Energies. 2025; 18(8):2024. https://doi.org/10.3390/en18082024
Chicago/Turabian StyleAlenezi, Mohammed, Fatih Anayi, Michael Packianather, and Mokhtar Shouran. 2025. "A Hybrid Machine Learning Framework for Early Fault Detection in Power Transformers Using PSO and DMO Algorithms" Energies 18, no. 8: 2024. https://doi.org/10.3390/en18082024
APA StyleAlenezi, M., Anayi, F., Packianather, M., & Shouran, M. (2025). A Hybrid Machine Learning Framework for Early Fault Detection in Power Transformers Using PSO and DMO Algorithms. Energies, 18(8), 2024. https://doi.org/10.3390/en18082024