1. Introduction
Due to the insufficient bending strength, tensile strength and other parameters of wind turbine blade material, as well as their overly large dynamic load, blade cracking is a common cause of failure, accounting for approximately 30% of all downtime accidents [
1,
2,
3]. However, due to constantly changing speeds and transient impacts, modeling the failure mechanism of blade cracking proves very challenging. Furthermore, since blades account for roughly 15–20% of total equipment costs, blade cracking can lead to potentially enormous maintenance expenses and safety hazards. Given its importance and costs, timely and accurate approaches to detecting blade cracking are attracting widespread research attention [
4,
5,
6].
Data-based detection approaches work independently of the dynamics and kinematics of physical systems, and are thus effective for blade cracking detection. However, data are generally imbalanced, with very few cracking samples in real scenarios, leading to detection approaches that support normal samples and rarely identify cracking samples in the early stage (theoretical analysis in
Section 2.1). Aiming at such problems, current data-level solutions mainly include sampling and generative strategies.
Sampling strategies, e.g., SMOTE (synthetic minority oversampling technique) and ADASYN (adaptive synthetic), utilize prior knowledge about the space distribution of collected cracking samples to synthesize new samples. Ge [
7] adopted 29 sensor features as original input and used SMOTE to synthesize virtual cracking samples via linear interpolation between two random real cracking samples. Meanwhile, the density and concentration area are introduced to increase the confidence of virtual cracking samples. Jiang [
8] designed a synthetic and dependent wild-bootstrapped oversampling technique for wind turbine fault diagnosis which is a modification of SMOTE and mimics the temporal dependence of time-series data. In the data analysis part, two datasets collected from two wind farms of northeast and northwest China are used, with 70 attributes in total, including wind speed, x-axis nacelle vibration, yaw angle, etc. Cristian [
9] dealt with imbalanced wind turbine data using the random oversampling technique, which directly removed some random normal samples to achieve amount balance. To verify the proposed method, 19 real variables covering 19 different attributes were used, including generated power, R-phase voltage, wind speed, etc. Yi [
10] designed a minority clustering SMOTE approach for wind turbine fault detection, which achieved different clusters of minority samples and solved the problem of the uneven distribution of fault samples. Chen [
11] adopted ADASYN to alleviate the critical imbalance and convolutional LSTM-GRU to recognize blade icing status. The dataset for verification was collected over 341.88 h with 26 sensor values, including wind speed, yaw position, vertical acceleration, etc.
Generative strategies estimate the probability density of cracking samples and then sample new data points to supplement cracking samples. Chen [
12] designed a deep convolutional generative adversarial network (GAN) to produce a threshold for a condition monitoring scheme of wind turbines. The employed signal was the frequency spectrum transformed by fast Fourier transform. Liu [
13] used a generative adversarial network to transform normal data into rough fault data, and furthermore, a refiner developed using a GAN was adopted to make them much similar to real fault data. For verification, the input data contained 28 variables, such as pitch angle, hub angle and generator torque. Liu [
14] introduced sparse dictionary learning into an adversarial variational auto-encoder to generate virtual data and determine the posterior distribution from six sensor variables, including wind speed, active power, generator speed, three-phase current and voltage. Wang [
15] designed a least-square GAN to determine the distribution of health data from 15 selected sensor variables and realize the data augmentation. Yang [
16] used wavelet packet transform to generate time–frequency data of wind turbines and a GAN to compensate for the imbalance level. Ogaili [
17] published a wind turbine fault diagnosis dataset considering vibration under different wind speeds for fault diagnosis under both imbalanced and balanced conditions. Jiang [
18] took 28 sensor variables (e.g., yaw position, yaw speed) as input of a GAN to generate virtual blade icing samples. Zou [
19] combined a convolution neural network and GAN to detect wind turbine blade surface damage using a small number of damage images.
Although plenty of approaches have been developed, there are still two critical problems that deserve in-depth research based on above studies. (1) Both sampling strategies and generative strategies easily produce virtual cracking samples with a high confidence but low usage value. The reason for this is that sampling strategies more often pick samples in a dense area as a basis, leading to many synthesized samples lying in the middle area. Similarly, the generator in a GAN yields virtual cracking samples in the middle area (with high confidence) to easily cheat the discriminator. However, virtual samples near the decision border rather than the middle area may be more helpful for supporting good classification results. (2) Generative strategies focus on generating real, similar cracking samples but ignore the overall space distribution of real samples. Furthermore, if real samples are distributed over more than one cluster, with some clusters owning many data points, generative strategies may only produce samples for these clusters. However, these clusters with few data points may be more important for improving classification accuracy.
Aiming at the above key points, this work designs a novel blade-cracking detection solution. Considering the case that the amount of redundant features will bring a high number of weights to be optimized in GAN, we curtail the input dimension using an unsupervised auto-encoder. Further, a virtual cracking sample generative strategy-based roundtrip framework is designed to achieve bidirectional mapping between virtual and real samples. By inverse mapping, it catches the overall space distribution and avoids the generated virtual samples falling into a dense area. Through verification on the benchmark dataset and real wind turbine blade cracking, the results show the effectiveness.
5. Conclusions and Future Work
Aiming at wind turbine blade cracking recognition under imbalanced data, this paper designs a novel roundtrip auto-encoder method. Two generator networks and two discriminator networks are designed to ensure the generated samples well fit the distribution of historical cracking samples. Auto-encoder method is applied to reduce the dimension of historical samples and thus the complexity of generator and discriminator. From the reconstructed results, it is concluded that auto-encoder is effective for extracting low-dimensional intrinsic features. From the results under different imbalance levels, the detection performance shows significant improvement with the RTAE method. When the imbalance level is above 4:1, all the cracking samples cannot be identified without data augmentation, which shows the importance of considering the imbalance problem in real application.
Analysis of the real wind turbine blade cracking data is carried out, the recognition of cracking samples improves by 19.8%, 23.8% and 22.7% on precision, recall and F1-score. The cracking detection under imbalanced data and the comparisons show that: (1) compared to the popular data-level methods, RTAE is superior under the influence of highly imbalanced data; (2) through integrating auto-encoder and roundtrip model, the framework provides a possibility for solving imbalance problem with high dimension data.
As is known, a wind turbine blade contains many types of cracks in real industrial applications. Currently, subject to roundtrip, RTAE can only deal with the binary-class imbalanced data problems. Multi-class cracking problems are more complex as they may contain more than one minority classes. In the future work, we will extend the application of RTAE and apply it to multi-class problem. Meanwhile, we will explore more effective roundtrip structures and apply them to high-dimensional time-series data.