1.1. WT Technology Development
Wind and solar energy have demonstrated technological and cost competitiveness, leading to anticipation of a renewables revolution [
1,
2,
3]. To meet the rising demand for clean energy, the global installed wind power capacity needs to increase almost ten-fold in the next 30 years. According to
Figure 1, the global cumulative wind power installations (excluding grid connection) amounted to nearly 940 GW as of the end of 2022, which was primarily propelled by the anticipated rapid expansion of the wind sector in China [
4,
5]. A wind turbine (WT) system is a complete electromechanical system consisting of various elements that are designed to continuously extract significant quantities of kinetic energy from the wind [
6]. The primary objective of WTs is to convert this energy into clean, carbon-free electricity [
7]. The global aspiration for clean energy has been the driving force behind the rapidly growing wind power industry. In recent decades, WT manufacturing techniques have made significant progress [
8]. Contemporary WTs are larger than ever before, with some exceeding 200 m in size. During the mid-1990s, the rated power of WTs was less than 1000 kW. However, the market for modern WTs is now dominated by multi-megawatt machines, with some as powerful as 10 MW, that are designed primarily for offshore wind energy applications [
9].
Although WTs have seen rapid advancements, WT operation and maintenance (O&M) techniques still trail behind those used for steam, hydro, and gas turbines in conventional power plants. Due to the high maintenance costs [
11] of offshore wind turbines, the scarcity of suitable construction sites, and the challenges of handling complex wind loads, onshore power generation remains more prevalent than offshore generation. This places wind turbines in more challenging geographic environments [
12], such as remote locations with high humidity, salt spray, significant temperature fluctuations, and snow-covered terrains [
13]. Additionally, prolonged and large-scale load variations introduce significant uncertainties in the prognostics and health management of WTs. The primary challenge lies in the timely detection of incipient faults and the scheduling of maintenance to reduce the O&M costs of wind farms through condition monitoring and fault diagnosis (CMFD) techniques [
14,
15]. While CMFD technologies have made significant strides in signal processing and diagnostic methods over the past few decades, including advancements in the time, frequency, and time-frequency domains, intelligent diagnostics, image processing, data fusion, data mining, and expert systems [
16], the complexity of wind turbines necessitates the further exploration of CMFD technologies that are tailored specifically for these systems. Thus, CMFD technologies are becoming increasingly critical to maximizing economic efficiency and minimizing downtime in wind turbine operations.
As illustrated in
Figure 2, a WT is a sophisticated electromechanical system comprising multiple components and subsystems, including the rotor hub, blades, shaft, gearbox, and generator, among others. WT systems are highly complex and comprised of numerous components that operate at high power. As a result, they are prone to frequent failures that can cause long periods of downtime that last several days and result in significant financial losses. Therefore, effective fault detection is critical in maximizing the production capacity of WTs [
17].
1.2. ITSC Fault Diagnosis for WT Generators
The early fault diagnosis of WT generators is crucial to enhancing the service life of WTs and curtailing the O&M costs of WT systems. Generator failures can be categorized according to their internal and external causes [
7]. The internal causes of generator failures are the results of mechanical or electrical faults, including rotor bar breakage, shaft bending, air gap eccentricity, mass imbalance, and bearing failures for mechanical faults, and damaged stator/rotor insulation, electrical imbalance, faulty magnetic circuits, open circuits, and short circuits for electrical faults [
18]. Among these faults, inter-turn short-circuit (ITSC) faults are particularly challenging since they are difficult to detect using traditional protection schemes [
19]. ITSC faults involve a several-turn short circuit in the one-phase winding, which can lead to critical insulation breakdown due to heating in the shorted turns. If left unattended, ITSC faults may escalate to a phase-to-phase or phase-to-ground short-circuit, resulting in various adverse effects such as unbalanced line currents, vibrations and noises, decreased efficiency, and excessive heating.
In general, the early detection of ITSC faults in WT generators is crucial for preventing catastrophic failures, improving the reliability of WTs, reducing servicing costs, and extending the entire system lifecycle. ITSC fault diagnosis involves measuring various factors, such as generator voltages and currents, the moment, the vibration, the temperature, and otoacoustic emissions, for the purpose of fault analysis. This contributes to the assessment of the technical status of the machine by extracting damage symptoms from the detected diagnostic signals. It is imperative to develop effective and efficient methods for detecting ITSC faults in WT generators. Such methods would help mitigate the potential damage and expenses associated with generator failures. By accurately assessing the condition of the machine, operators can identify and fix faults before they lead to more severe problems.
Transducers are commonly deployed on generators to acquire important operating information, such as voltages, currents, and vibrations, during their operation [
20]. The signals from these sensors are then used to extract specific features that can be applied to differentiate between faulty and normal situations. The accurate extraction of fault symptoms is crucial for estimating the condition of the machine and indicating possible damage. The analysis is then applied to the sensor measurement signals to see how the signals change with the time, frequency, or time-frequency. However, efforts to fully automate traditional analysis methods often fail, and the diagnostic process can be cumbersome, requiring long fault diagnosis times and manual interpretation. Advancements in artificial intelligence technology have led to the development of data-driven and machine learning-based methods that hold great potential in fault detection [
21,
22]. To surmount the limitations of traditional methods, intelligent ITSC fault diagnosis systems can be developed. These systems generally require the application of data-driven methods, including deep neural network (DNN) models, support vector machine (SVM) [
23,
24,
25] models, random forests (RFs) [
21,
26,
27], naive Bayes models, k-nearest neighbor (k-NN) [
28] models, and expert-based systems [
7].
In recent years, deep learning methods have been developed and applied in the field of fault diagnosis [
29,
30,
31,
32]. Deep learning has shown promising results in extracting representative features from original signals by utilizing its in-depth architecture. Compared to shallow models, utilizing deep structures to learn feature representations is advantageous, as the highly semantic information learned through deep structures is more impressive than the scattered and poor representations learned through extensive shallow models. The deepness of the system allows for better feature learning, while the batch normalization (BN) technique encourages faster convergence, which can lead to improved results. Moreover, deep structures provide great domain adaptability, while the simple architecture of shallow structures is not conducive to domain adaptability, effective representation, or sensitivity to new data. Therefore, deep learning has been widely used in various fields, such as computer vision, speech recognition, biomedicine, and natural language processing, among others [
33,
34,
35].
Given the outstanding performance of deep learning [
36], particularly DNN models, in feature learning, researchers have become increasingly interested in exploring the application of deep learning models in WT fault diagnosis [
37]. Attallah et al. (2023) [
7] used infrared thermography for rotor internal short-circuit fault diagnosis and monitoring techniques for offshore WTs and discussed fault diagnosis methods based on deep convolutional network (DCNN) models. Lei, Liu, & Jiang, (2019) [
38] proposed a fault diagnosis framework for WTs that uses an end-to-end LSTM model to learn features directly from time-series data and capture long-term dependencies, and which outperforms traditional methods in fault classification and demonstrates robustness on small datasets. Cherif et al. (2020) [
39] developed an ANN-based method for the early detection of ITSC faults in the stator winding of induction motors, using a novel indicator based on the discrete wavelet energy ratio (DWER) of three stator currents. Kumar & Hati (2021) [
40] discussed the adoption of a deep convolutional neural network (DCNN) with an adaptive gradient optimizer for bearing and rotor fault detection in squirrel-cage induction motors (SCIMs), employing sensor data fusion in their model training and testing. Xue, Xiahou, Li, Ji, & Wu, (2019) [
41] introduced a data-driven fault diagnosis method, utilizing a long short-term memory network, to detect multiple open-circuit switch faults of the back-to-back converters in doubly fed induction generator-based WT systems, and demonstrated its effectiveness through simulations and experimental tests. Dhibi, Mansouri, Bouzrara, Nounou, & Nounou (2022) [
42] focused on developing and validating an effective neural network-based ensemble approach for fault detection and diagnosis in wind energy conversion systems, using techniques such as bagging, boosting, and random subspace combination. Bazan et al. (2017) [
43] presented a pattern recognition method using mutual information between phase current signals to detect stator winding short circuits in three-phase induction motors, achieving high classification accuracies even under varying load torque and power supply voltage conditions. Li et al. (2024) [
44] present a partial domain adaptation method using deep adversarial learning to address the challenge of limited target-domain data for making accurate cross-domain RUL predictions in machinery. Zhang et al. (2024) [
45] present a Swin transformer-based method for the accurate SOC prediction of lithium-ion batteries, addressing data noise to enhance electric aircraft safety. Chen et al. (2024) [
46] propose a dynamic vision-based spiking neural network for contactless cross-domain fault diagnosis that does not require target-domain data. In two studies, Gao et al. (2023–2024) [
47,
48] advanced the field by proposing a domain feature decoupling network (DFDN) that decomposes and adaptively fuses domain- and fault-related features to enable the zero-shot fault diagnosis of rotating machinery under unseen operating conditions and a complex convolutional self-attention autoencoder (CCSAE) that leverages self-attention-enhanced complex feature representations to achieve highly accurate early fault detection in analog circuits.
It can be deduced that, despite efforts to detect ITSC faults in wind turbines, this technology has not yet been fully developed. With the rapid expansion of wind power capacity, the identification of these faults remains a significant challenge for engineers, requiring ongoing efforts to address it. To date, there has been a lack of systematic analyses of the effectiveness of deep learning techniques, specifically deep convolutional networks, in comparison to traditional machine learning algorithms in detecting ITSC faults. This underscores the importance of continual efforts to simplify the ITSC fault detection process and achieve comprehensive fault detection. Simultaneously, enhancing the performance of fault diagnosis models is crucial for establishing an efficient, accurate, reliable, and robust fault detection and identification system. The proposed deep learning-based framework directly addresses these limitations by leveraging the powerful feature extraction capabilities of deep neural networks (DNNs). Unlike traditional methods, which may require extensive expert knowledge and manual intervention, our approach automates the fault detection process and can identify subtle fault signatures that would otherwise go unnoticed. The primary advantage of our method lies in its ability to process and analyze complex multi-sensor data, specifically current, vibration, and axial magnetic flux signals, enabling more accurate and timely fault detection. By systematically comparing the performance of our deep learning models to that of traditional machine learning algorithms, we demonstrate a significant improvement in detection accuracy and robustness. Our approach not only detects ITSC faults more reliably but also offers a transferable diagnostic framework that can be adapted to various scenarios of fault detection in wind turbine generators.
Thus, the objective of this study is to explore the potential of data-driven and deep learning techniques for diagnosing ITSC faults in wind turbines and to develop a deep learning-based approach that can accurately identify various categories of short-circuit faults in wind turbine generators without requiring extensive expert knowledge. The primary contributions and findings of this study are as follows:
This study proposes a transferable deep learning-centered approach for fault diagnosis in wind turbines, offering a novel pipeline encompassing dataset acquisition, model construction, training strategies, and comprehensive evaluation methods;
The FCNet-5 model achieves a state-of-the-art mAP score of 99.25%, demonstrating superior stability and precision, particularly in challenging fault scenarios;
The use of three-phase current signals is more effective than using vibration and electromagnetic signals in monitoring generator faults. Differentiating between HI-1 and LI-1 faults, which have small feature intervals, remains challenging. However, deep learning models demonstrate superior robustness in accurately diagnosing these challenging faults using both current and vibration/electromagnetic signals.
The remainder of this paper is organized as follows.
Section 2 systematically explains the principle and construction of the wind turbine test bench used herein, as well as the related dataset. Subsequently, various deep learning methodologies are studied, and different architectures of deep network models are designed to achieve state-of-the-art performance in
Section 3.
Section 4 and
Section 5 provide details on model evaluation methods and metrics, as well as the implementation of the deep learning model. In
Section 6, the fault recognition performances of the machine learning and deep learning models are thoroughly discussed and compared. Finally,
Section 7 presents conclusive summaries.