1. Introduction
As a leader in global wind energy development, China ranked first in the world in installed wind power capacity by 2021. In light of China's dual carbon targets, it is essential not only to increase the total installed capacity of wind turbines but also to address faults in the earlier installations. Fault detection of wind turbines is therefore particularly important against the background of the dual carbon goals, and this need in turn promotes the development of wind turbine fault detection technology [1]. At present, fault detection of wind turbine impeller blades has become a research focus in this field. As main components of the wind turbine [2], the impeller blades operate in harsh environments and are subjected to alternating loads for long periods, which leads to a number of failures. For example, when the impeller rotates, external alternating loads can easily loosen the blade bolts and, in serious cases, even fracture them [3]. The failure of blade bolts directly affects the aerodynamic performance of the impeller, causing additional and unbalanced loads on the wind turbine and ultimately reducing its overall service life. It is therefore vitally necessary to study fault detection methods for wind turbine blade bolts; such methods are also of great significance for reducing the maintenance cost of the wind turbine and increasing its power generation.
The fault detection methods for blade bolts mainly include vibration signal detection technology [4], the finite element analysis method [5], and machine-learning methods [6]. In recent years, machine learning has been widely used in bolt failure detection. Reference [7] used a multi-scale fuzzy entropy damage index to extract important bolt features and construct a dataset, which was then input into a least-squares support vector machine classifier to detect bolt faults. Reference [8] applied visual processing to collected bolt images and extracted relevant features to realize bolt-loosening detection; the feasibility of the method was verified in simulation experiments. Reference [9] combined deep learning with a mechanics-based manual-torque method for bolt fault detection, which effectively reduced the cost of manual inspection and improved the performance of the fault detection model.
When machine-learning methods are used to detect faults in wind turbine components, handling class imbalance in the data has also become a research hotspot. In machine learning, common approaches to class imbalance include oversampling, undersampling, cost-sensitive learning, and optimized classification algorithms [10]. The main approach studied in this paper is oversampling. Representative oversampling methods include random oversampling [11], SMOTE (Synthetic Minority Oversampling Technique) [12], Borderline-SMOTE [13], and K-means SMOTE [14]. Since the samples produced by random oversampling are duplicates of the raw samples, they contribute little to the training of the model, so researchers at home and abroad mostly use SMOTE to expand fault samples.
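The core idea of SMOTE, as opposed to random duplication, is to interpolate new minority samples between a sample and one of its k nearest minority neighbors. The sketch below illustrates only that interpolation step in NumPy; it omits the edge-case handling of the full algorithm, and in practice a library implementation such as imbalanced-learn's `SMOTE` class would be used.

```python
import numpy as np

def smote_sketch(X_min, n_new, k=5, rng=None):
    """Generate n_new synthetic minority samples by SMOTE-style interpolation.

    X_min : (n, d) array of minority-class samples.
    """
    rng = np.random.default_rng(rng)
    n = len(X_min)
    # pairwise squared distances between minority samples
    d2 = ((X_min[:, None, :] - X_min[None, :, :]) ** 2).sum(-1)
    np.fill_diagonal(d2, np.inf)               # exclude the sample itself
    k = min(k, n - 1)
    neighbors = np.argsort(d2, axis=1)[:, :k]  # k nearest minority neighbors
    synthetic = []
    for _ in range(n_new):
        i = rng.integers(n)                    # random minority sample
        j = neighbors[i, rng.integers(k)]      # one of its k neighbors
        gap = rng.random()                     # interpolation factor in [0, 1)
        synthetic.append(X_min[i] + gap * (X_min[j] - X_min[i]))
    return np.array(synthetic)
```

Because each synthetic point lies on a segment between two existing minority samples, SMOTE ignores the overall distribution of the minority class and can amplify noise samples, which is exactly the weakness the improved variants below try to address.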
Obtaining high-quality synthetic samples is sensitive to noise samples and to the distribution characteristics of the raw data. Directly applying SMOTE to expand the data can achieve a certain effect, but it easily causes data redundancy for datasets with extreme class imbalance. Improved variants of SMOTE are therefore generally used to obtain high-quality synthetic samples. For example, Li et al. [15] proposed a hybrid cluster-boundary fuzzification sampling method to address the low accuracy of detection models caused by class imbalance, which not only balanced the classes but also alleviated the data redundancy problem. However, this method does not fully consider the original distribution characteristics of the fault data. Aiming at class imbalance in intrusion detection data, Zhang et al. [16] combined traditional SMOTE with GMM-based random undersampling to balance the dataset. Yi et al. [17] proposed a SMOTE sampling method based on fault-class clustering to handle class imbalance in wind turbine SCADA data, which solved the problem that traditional SMOTE does not consider the distribution characteristics of the original fault samples when synthesizing new ones. However, the above methods remain susceptible to noise samples.
Cost-sensitive learning is also widely used to handle class imbalance. Aiming at class imbalance in the operational data of wind turbine gearboxes, Tang et al. [18] introduced a cost-sensitive matrix into the LightGBM algorithm, which mitigated the degradation of detection performance caused by the imbalance. To make cost-sensitive learning independent of a manually designed cost matrix, Reference [19] proposed a genetic-algorithm-based method to construct the cost-sensitive matrix automatically; the improved matrix was then used to adjust the attention weight of the associated classification algorithm on fault-class samples. Introducing cost-sensitive learning into classification algorithms has been well verified for class imbalance problems in various fields. However, when the class imbalance of the dataset is severe, merely using cost-sensitive learning to adjust the classifier's learning weight on fault-class samples has a limited effect.
LightGBM is widely used in wind turbine fault detection because of its strong classification performance and fast training speed [20]. For example, aiming at the frequent faults of wind turbine gearboxes, Tang et al. [21] proposed a fault diagnosis model based on adaptive LightGBM, which was well verified in practical cases. Wang et al. [22] constructed a cascaded framework of SAE-based abnormal-state detection and LightGBM-based abnormal-state classification, which solved the insensitivity of the wind turbine alarm system to early faults. However, when a LightGBM-based fault detection model is applied to class-imbalanced data, its detection performance is difficult to guarantee.
Therefore, to address the above problems, this paper proposes a fault detection method for wind turbine blade bolts based on GSG combined with CS-LightGBM. The method combines the new sampling method proposed in this paper with cost-sensitive learning to solve the class imbalance problem in the operating data of wind turbine blade bolts.
3. Wind Turbine Blade Bolt Fault Detection Model
The fault detection model of wind turbine blade bolts is shown in Figure 7. It consists of three main models: a data preprocessing model, a GSG oversampling model, and a CS-LightGBM training and evaluation model.
3.1. Data Preprocessing
The data preprocessing model performs the relevant processing operations on the original wind turbine blade bolt dataset, such as data cleaning and standardization, encoding the text labels of the dataset into numerical labels with a one-hot encoder, and using XGBoost to rank feature importance for feature selection.
Data cleaning mainly handles the abnormal and missing values in the wind turbine blade bolt dataset. Since the dataset contains far more normal-class samples than fault-class samples, the approach adopted in this paper is to directly remove normal-class samples with abnormal or missing values, while abnormal or missing values in fault-class samples are filled with the mean, thereby reducing the loss of fault-class samples.
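The asymmetric cleaning rule above can be sketched in a few lines of pandas. The column names here are illustrative placeholders, not the actual SCADA fields of the bolt dataset:

```python
import numpy as np
import pandas as pd

# Toy frame standing in for the bolt dataset; label == 1 marks fault samples.
# ("A_bolt_load" is a hypothetical feature name, not the paper's actual field.)
df = pd.DataFrame({
    "A_bolt_load": [1.0, np.nan, 3.0, 4.0, np.nan, 6.0],
    "label":       [0,   0,      0,   1,   1,      1],
})

normal = df[df["label"] == 0].dropna()   # drop incomplete normal samples
fault = df[df["label"] == 1].copy()
fault = fault.fillna(fault.mean())       # mean-fill incomplete fault samples

clean = pd.concat([normal, fault], ignore_index=True)
```

The design choice is deliberate: normal samples are abundant, so dropping a few costs little, whereas every fault sample carries scarce information and is worth imputing.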
One-hot encoding converts the text labels in the wind turbine blade bolt dataset into numerical labels for machine identification. Since the model solves a binary classification problem, the normal and fault labels in the blade-bolt operating data are defined as P = [0, 1].
When machine-learning algorithms learn from data, the best feature set is one that is sufficient and necessary. Too few features easily lead to underfitting of the model, while too many features not only cause overfitting but also consume excessive computing resources, so selecting an appropriate number of features is very important. In this paper, the XGBoost feature importance algorithm is used to rank the features of the wind turbine blade bolt dataset. The principle is as follows: for each single decision tree, the improvement in the split-point performance measure attributable to a feature is computed, where the performance measure is generally the Gini impurity of the selected split node; these improvement values are weighted by the number of observations each node is responsible for and accumulated, and a feature's importance is determined from the resulting weighted sum and count. The wind turbine blade bolt dataset in this study has a total of 48 features, and Table 1 lists the importance values of some of them, where the initial letter of a feature parameter denotes a particular blade of the same wind turbine. According to the feature importance values and expert experience, 11 features are selected as the feature training set.
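Once importance scores are available (e.g., from the `feature_importances_` attribute of a trained tree ensemble such as `xgboost.XGBClassifier`), the selection step itself reduces to taking the top-k scores. A minimal sketch with hypothetical feature names:

```python
import numpy as np

def select_top_k(feature_names, importances, k=11):
    """Return the k features with the largest importance scores, descending."""
    order = np.argsort(importances)[::-1][:k]
    return [feature_names[i] for i in order]

# Illustrative scores; the paper's real dataset has 48 features, of which 11
# are kept based on these scores plus expert experience.
names = ["a", "b", "c", "d"]
scores = np.array([0.10, 0.40, 0.30, 0.20])
selected = select_top_k(names, scores, k=2)
```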
Finally, the dataset after feature selection is standardized according to Equation (10), which converts it to a dataset with mean 0 and variance 1. This improves the training speed and accuracy of the wind turbine blade bolt fault detection model:

x′ = (x − μ)/δ,  (10)

where x′ is the standardized feature value, x is the original feature value, μ is the feature mean, and δ is the feature standard deviation.
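Column-wise z-score standardization per Equation (10) is a one-liner in NumPy:

```python
import numpy as np

def standardize(X):
    """Column-wise z-score: x' = (x - mu) / delta, per Equation (10)."""
    mu = X.mean(axis=0)
    delta = X.std(axis=0)
    return (X - mu) / delta

X = np.array([[1.0, 10.0],
              [2.0, 20.0],
              [3.0, 30.0]])
Xs = standardize(X)   # each column now has mean 0 and variance 1
```

In a real pipeline, μ and δ would be estimated on the training split only and reused on the test split, so that no test-set information leaks into training.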
3.2. GSG Oversampling
GSG is a new oversampling method proposed in this paper. Based on the fault-class samples in the wind turbine blade bolt training set, GSG expands the fault class to a given scale while preserving the distribution characteristics of the original fault samples as much as possible. The expanded fault-class samples are then concatenated with the normal-class samples to obtain a new training dataset, thereby alleviating the imbalance between normal-class and fault-class samples in the original training set.
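The distribution-preserving idea can be illustrated with a deliberately simplified sketch: fit a Gaussian model to the fault-class samples and draw new points from it, so that synthetic samples follow the estimated fault distribution rather than line segments between neighbors. GSG itself combines GMM with SMOTE-style steps (detailed elsewhere in the paper); a single-component Gaussian is used here only as an assumption-laden stand-in.

```python
import numpy as np

def gaussian_oversample(X_fault, n_new, rng=None):
    """Simplified distribution-preserving expansion: fit one Gaussian to the
    fault-class samples and sample new points from it.  (GSG proper uses a
    multi-component GMM; one component is a simplification for illustration.)"""
    rng = np.random.default_rng(rng)
    mu = X_fault.mean(axis=0)                    # estimated fault-class mean
    cov = np.cov(X_fault, rowvar=False)          # estimated covariance
    return rng.multivariate_normal(mu, cov, size=n_new)

# The rebalanced training set then concatenates normal samples, original
# fault samples, and the synthetic fault samples, e.g.:
# X_new = np.vstack([X_normal, X_fault, gaussian_oversample(X_fault, n_new)])
```

For a multi-modal fault distribution one would instead fit `sklearn.mixture.GaussianMixture` and sample per component, which is closer to what GSG does.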
3.3. CS-LightGBM Training and Evaluation Model
The CS-LightGBM training and evaluation model combines cost-sensitive learning with the LightGBM algorithm. It is trained on the new training set produced by GSG, and its classification performance is evaluated on the test set. Finally, the Bayesian optimization algorithm searches for the optimal hyperparameter combination to obtain an optimal wind turbine blade bolt fault detection model.
3.3.1. CS-LightGBM
Cost-Sensitive (CS) learning is a machine-learning approach to class-imbalanced problems that is mainly applied within classification algorithms. When a classifier misclassifies samples, the cost of confusing one class for another can be very serious. Cost-sensitive learning therefore assigns different costs to different types of misclassification, so as to minimize the number of high-cost errors and the total misclassification cost.
LightGBM is optimized on the basis of XGBoost, so its basic principle is similar: both use decision trees as base learners. LightGBM is mainly optimized for training speed, so its efficiency is generally higher than that of XGBoost. XGBoost splits the nodes of each layer indiscriminately, which wastes considerable computing and time resources, since the information gain contributed by some nodes is almost zero. To address this, LightGBM selects only the node with the largest splitting gain for splitting and proceeds recursively under a tree-depth limit. In addition, LightGBM parallelizes over both the data and the data features, which further accelerates training, so that the overall performance of the LightGBM model is better than that of the XGBoost model. LightGBM is therefore used as the classification algorithm of the detection model proposed in this paper.
CS-LightGBM, a method proposed in recent years to optimize the LightGBM classifier, introduces cost-sensitive learning into LightGBM. Specifically, a cost-sensitive function replaces the information gain in the weight function of traditional LightGBM, so that during iterative updates the fault-class samples receive larger attention weights, improving the classification of class-imbalanced samples.
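A common, simpler way to realize the same intent at training time is to translate a cost matrix into per-sample weights, which can then be passed to a classifier (e.g., via the `sample_weight` argument of `lightgbm.LGBMClassifier.fit`). The cost values below are hypothetical, chosen only to illustrate that missing a fault is treated as costlier than a false alarm; this weighting scheme is a sketch, not the paper's exact cost-sensitive function.

```python
import numpy as np

# Hypothetical cost matrix: rows = true class (0 normal, 1 fault),
# columns = predicted class; off-diagonal entries are misclassification costs.
cost = np.array([[0.0, 1.0],    # false alarm: cost 1
                 [5.0, 0.0]])   # missed fault: cost 5 (assumed)

def sample_weights(y, cost):
    """Weight each sample by the total cost of misclassifying its true class."""
    per_class = cost.sum(axis=1)
    return per_class[y]

y = np.array([0, 0, 0, 1])
w = sample_weights(y, cost)     # fault samples get weight 5.0
```

With such weights, the gradient contribution of each fault sample is scaled up during boosting, which is the practical effect the cost-sensitive weight function aims for.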
3.3.2. Bayesian Optimization Algorithm
Bayesian Optimization (BO) is a global optimization algorithm based on probability distributions. Given the hyperparameter search space of the model, BO constructs a probabilistic surrogate of the function to be optimized, f: χ → ℝ, uses the surrogate to select the next evaluation point, and iterates this cycle to obtain the optimal hyperparameters:

x* = arg max_{x∈χ} f(x),

where x* is the optimal hyperparameter combination, χ is the decision space, and f(x) is the objective function.
Different hyperparameter combinations affect the classification performance of the detection model, so optimizing the model's hyperparameters is very important. The LightGBM model has many hyperparameters, but some have little effect on classification performance. This study mainly optimizes four hyperparameters of the LightGBM model, as shown in Table 2.
3.3.3. Model Evaluation Index
The performance evaluation in this experiment not only uses the false alarm rate (FAR) and missing alarm rate (MAR) to evaluate the classification performance of the fault detection model, but also introduces the F1-score to better describe the performance of the LightGBM classifier on imbalanced data.
The expressions of MAR, FAR, and F1-score are as follows:

MAR = FN/(TP + FN),
FAR = FP/(FP + TN),
F1-score = 2PR/(P + R),

where P = TP/(TP + FP) is the precision rate and R = TP/(TP + FN) is the recall rate; TP, TN, FP, and FN are the entries of the confusion matrix, as shown in Table 3.
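The three indexes follow directly from the confusion-matrix counts, taking the fault class as positive; the counts below are made-up numbers for illustration:

```python
def evaluation_indexes(tp, tn, fp, fn):
    """MAR, FAR and F1-score from confusion-matrix counts (fault = positive)."""
    mar = fn / (tp + fn)          # missing alarm rate: faults not detected
    far = fp / (fp + tn)          # false alarm rate: normals flagged as faults
    p = tp / (tp + fp)            # precision
    r = tp / (tp + fn)            # recall (= 1 - MAR)
    f1 = 2 * p * r / (p + r)
    return mar, far, f1

mar, far, f1 = evaluation_indexes(tp=8, tn=90, fp=10, fn=2)
```

Note that recall and MAR are complementary, so a low MAR is equivalent to a high recall on the fault class.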
5. Conclusions
Wind turbine blade bolts operate under alternating loads for long periods, which often causes them to fail. When machine-learning algorithms are used for fault detection, detection performance is affected not only by external factors but also by class imbalance in the data. To solve this problem, this paper analyzed and compared the shortcomings of traditional sampling methods and algorithms, and proposed a fault detection method for wind turbine blade bolts based on GSG combined with CS-LightGBM. Its main contributions are as follows:
A new oversampling method, GSG, was proposed. GSG builds on the basic principles of GMM and SMOTE: it retains the distribution characteristics of the original data to the greatest extent when expanding the samples, avoiding the blindness of traditional SMOTE during oversampling. In addition, the method effectively alleviates the influence of noise samples during oversampling and reduces the generation of overlapping samples.
We combined GSG and CS-LightGBM for the fault detection of wind turbine blade bolts. The model addresses class imbalance in the wind turbine blade bolt data both by expanding the fault-class samples and by introducing cost-sensitive learning. Specifically, we used the proposed GSG to expand the fault-class samples in the training set to obtain a new training dataset; the new training set was then input into a LightGBM classifier with a cost-sensitive function for training, with the operational status of the blade bolt as the output value.
Both the proposed sampling method and the fault diagnosis model were experimentally verified. Analysis of GSG sampling on simulated datasets showed that its sampling effect was better than that of traditional SMOTE, verifying the effectiveness of GSG. In addition, compared with other models, the proposed fault diagnosis model achieved lower missing alarm and false alarm rates and a higher F1-score, verifying its feasibility and superiority.
When the dataset has a severe class imbalance problem, the proposed fault detection model shows good detection performance. However, when the imbalance is only slight, how to find an optimal sampling strategy for GSG requires further consideration and will be the focus of future research.