*Article* **Analysis and Detection of Erosion in Wind Turbine Blades**

**Josué Enríquez Zárate 1,2, María de los Ángeles Gómez López 3, Javier Alberto Carmona Troyo <sup>4</sup> and Leonardo Trujillo 4,\***


**Abstract:** This paper studies erosion at the tip of wind turbine blades by considering aerodynamic analysis, modal analysis and predictive machine learning modeling. Erosion can be caused by several factors and can affect different parts of the blade, reducing its dynamic performance and useful life. The ability to detect and quantify erosion on a blade is an important predictive maintenance task for wind turbines that can have broad repercussions in terms of avoiding serious damage, improving power efficiency and reducing downtimes. This study considers both sides of the leading edge of the blade (top and bottom), evaluating the mechanical imbalance caused by the material loss that induces variations of the power coefficient resulting in a loss in efficiency. The QBlade software is used in our analysis and load calculations are preformed by using blade element momentum theory. Numerical results show the performance of a blade based on the relationship between mechanical damage and aerodynamic behavior, which are then validated on a physical model. Moreover, two machine learning (ML) problems are posed to automatically detect the location of erosion (top of the edge, bottom or both) and to determine erosion levels (from 8% to 18%) present in the blade. The first problem is solved using classification models, while the second is solved using ML regression, achieving accurate results. ML pipelines are automatically designed by using an AutoML system with little human intervention, achieving highly accurate results. This work makes several contributions by developing ML models to both detect the presence and location of erosion on a blade, estimating its level and applying AutoML for the first time in this domain.

**Keywords:** wind energy; wind turbine blades; erosion; modal analysis; aerodynamic analysis; AutoML

#### **1. Introduction**

Wind energy offers an important supply of electricity without pollution problems presented by conventional forms of energy. There is a global interest for the development and use of alternative energy sources, including geothermal, photovoltaic, hydroelectric, tidal wave, biomass and others [1]. Unlike other technologies, wind farms have a very low impact on their environment, which has resulted in increased use worldwide, with some countries obtaining as much as 20 percent of their energy from the wind [2].

Wind turbines can be categorized, for example, based on their rotation axis, which can be vertical or horizontal [3]. Horizontal axis turbines are the most common and can be classified according to the rotation of the rotor with respect to the tower. These machines are composed of a foundation, a tower, a rotor, nacelle with power train and the blades. The blades are one of the most important components, if not the most, since they are in charge of collecting the energy from the wind, converting the linear movement of the wind into a rotary movement of the rotor. This energy is transmitted to the hub, from the hub it proceeds to a mechanical transmission system and from there it proceeds to the generator that transforms it into electrical energy.

**Citation:** Enríquez Zárate, J.; Gómez López, M.d.l.A.; Carmona Troyo, J.A.; Trujillo, L. Analysis and Detection of Erosion in Wind Turbine Blades. *Math. Comput. Appl.* **2022**, *27*, 5. https://doi.org/10.3390/ mca27010005

Academic Editor: Guillermo Valencia-Palomo

Received: 30 December 2021 Accepted: 11 January 2022 Published: 13 January 2022

**Publisher's Note:** MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

**Copyright:** © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/).

71

Blades can suffer different types of failures due to a variety of phenomena [4], such as the following: bending, twisting, cracks and erosion. Detecting these types of faults, particularly at early stages, is important for avoiding catastrophic failures, reducing down times and taking corrective actions in a timely manner [5]. In particular, automatically detecting erosion at the leading edge of the blade tip presents a challenge because it is not trivial to properly measure erosion without direct access to the blade [4]. Typically, fault detection in wind turbine blades has been carried out by visual means (https:// energy.sandia.gov/programs/renewable-energy/wind-power/, accessed on 29 December 2021), but it is also possible to use different sensor schemes implemented via a SCADA system [5,6]. Afterward, these data can be analyzed by computational models, such as those derived by means of machine learning (ML), which have been found to be of great utility in the detection of damages in a variety of complex scenarios [7–9]. One approach of note, for example, is the use of sound to detect such damage [10]. Moreover, to implement ML methods, sufficient data are required to model blades with and without erosion; however, extracting these data from the field or in controlled scenarios can be a complex task to perform [11]; thus, one possible alternative is to exploit simulation software such as QBlade [12].

The goal of this work is two fold. First, it presents a detailed analysis of the effects that erosion has on wind turbine blades, considering modal and numerical analysis, with respect to the physical stress caused by erosion on the blades and the power-generating capacity of a wind turbine. Second, this work presents a methodology to construct ML models that can detect the presence and location of erosion in a blade and measure the amount of erosion as well. The paper makes two important contributions. First, we show for the first time that AutoML can be successfully applied in this problem domain, which has not been considered before for this problem. Second, we show that it is possible to detect the presence of erosion on the blade, determine its location and predict the amount of damage caused by erosion.

The remainder of this paper is organized as follows. Section 2 presents wind turbine blades and the aerodynamic, modal and numerical analysis of the blades studied in this work. Section 3 deals with the detection of erosion with ML methods, including an overview of related works and our experimental approach and results. Finally, conclusions and future work are presented in Section 4.

#### **2. Wind Turbine Blades**

During regular operation of a wind turbine blade, air mixed with sand and water droplets will cause severe erosion, which can produce a loss of fatigue resistance of the blade's surface and cause heavy damage to the blade material, resulting in a loss in electrical potential [13]. Before analyzing the effect of erosion on a wind turbine blade, we first consider the aerodynamic and structural response of the blade caused by erosion.

#### *2.1. Aerodynamic Analysis*

Generally, a wind turbine blade is divided in three sections: tip, mid span and root (see Figure 1). These sections are exposed to potential structural damage during normal operation, such as cracks, wrinkles, delamination, debonding and erosion. In particular, this paper is focused on the erosion problem around to the leading edge of the blades. This section presents an analysis of erosion on the blades, identifying the negative effects that erosion has on the power-generating capacity based on power coefficient *Cp* as well as the power generated by the wind turbine *PG*.

**Figure 1.** Sections of a wind turbine blade.

In order to analyze erosion in a wind turbine blade it is necessary to identify the development of several dynamic parameters, such as the drag coefficient *CD*, lift coefficient *CL* and frequency *ωn*. For aerodynamic analysis, this study uses the Theory of Momentum (TM), Blade Element Momentum (BEM), Geometry of Blade (GB) and the evaluation of the performance of the wind rotor, considering a case study of a 1 Kw Wind Turbine Generator (WTG). In order to obtain a design as close to the optimum as possible, several aerodynamic surfaces were evaluated to identify suitable conditions of aerodynamic lift based on the pressure gradient of aerodynamics airfoils. The design parameters of the blade and the technical specification of the WTG are provided in Table 1.

**Table 1.** Design parameters of the 1 Kw WTG.


For this design, we consider a power coefficient *Cp* = 0.40 based on the Betz limit [14], which allows calculating the length (theoretical) of the blade, which results in *R* = 1.628 [m], and the relationship between the speed and the blade tip in nominal conditions resulting in a value of *λ*nom ≈ 8. In order to obtain the Reynolds number of the blade, we separate the blade into three sections. We consider an average airfoil in every section of the blade, resulting in the following Reynolds' numbers. For the root section, the average chord was 0.14 m and Reynolds was 147,000; the average chord of the body is 0.10 m and Reynolds was 198,000, while the average chord of the tip is 0.023 m and Reynolds was 78,000. Finally, BEM parameters used to simulate the blade design were obtained using the method of the *Viterna* extrapolation [15], which include the following: aspect ratio of 10, lift coefficient adjustment of 0.7 and the number of elements is seven.

Subsequently, from the previous parameters, it is necessary to evaluate several aerodynamics airfoils from databases such as NACA, NREL and WORTMANN. In this work, we evaluated 49 airfoils using the open source *QBlade* software with the goal of selecting the best and most stable design. The chosen airfoils were FX 63-137 and E216 based on their lift *CL* and drag *CD* coefficients, both of which influence power coefficient *Cp* of the turbine.

The designed blade, considering the position of the selected airfoils, chords, spin and Reynolds numbers calculated using QBlade, is show in Figure 2a. The evaluation of aerodynamic performance of the blade using QBlade was carried out using the following parameters: 1 kW of power, an input speed of 3 m/s, cutting speed of 15 m/s, rotor speed of 296 rpm and a density of 1.185 kg/m3. In particular, the work considers the wind conditions of *La Ventosa, Oaxaca, Mexico* in the Isthmus of Tehuantepec for the simulations, because this geographical zone is located between the Pacific Ocean and the Gulf of Mexico, which is a perfect location for a wind farm [16]. The wind farms are placed near the Pacific Ocean, within a range of 20 and 60 km. In this zone, wind conditions are considered as

good (class 4) and excellent (class 7). With a wind speed of 8 m/s, the power generated was 1.38 kW, and the power coefficient was *Cp* = 0.4197 with a simulation of 1.4 s.

#### *2.2. Modal Analysis*

Structural analysis characterizes the behavior of the blade in the presence of erosion. In particular, this section presents vibration analysis using simulated erosion in the leading edge of the blade tip to compare a clean model with an eroded blade. The material, or mass, loss is expected to cause a variation in the vibration response of the blade, which directly affects the wind rotor and other subsystems of the turbine, such as the gearbox, main shaft and the generator due to the coupling between them. Furthermore, structural analysis is carried out to analyze the response frequency of the blade through modal analysis using a finite element software. This is performed by considering both a clean blade and an eroded version. For structural analysis, the mechanical and physical proprieties of the blade materials are described in Table 2b. Furthermore, dynamic analysis is also necessary in order to obtain natural frequencies and modal shapes of the blade, which was carried out using the following mesh parameters: quadratic order element, uniform size function, medium center of relevance, fast transition, low smoothness, 732 elements and 1768 nodes.

Simulated erosion was carried out with QBlade considering the tip of the blade, which should effect drag and lift coefficients, as well as the power coefficient and the electric power generated by the turbine. Erosion analysis consist of modifying the geometry of the blade to represent damage on the surface of the tip. The damage due to erosion reduces the original mass of the blade, which is represented in terms of a percentage in profile reduction, as shown in Figure 2b. For this analysis, three different depths of erosion were considered for the bottom edge of the profile, denoted as percentages by *BI*<sup>1</sup> = 10%, *BI*<sup>2</sup> = 18% and *BI*<sup>3</sup> = 25%. In each case, these percentages represents a reduction in profile coordinates relative to the clean blade. This is shown in Figure 2c, where the black line represents the clean profile, and the blue line represents the eroded profile for the *BI*<sup>1</sup> case.

Modal analysis is performed to evaluate the effects of different levels of erosion on the mechanical structure of the blade through their frequency response. The complete results are summarized in Table 3, where four modal frequencies are evaluated for each case of erosion depth compared with the numerical response of the clean blade.


**Table 2.** (**a**) Wind turbine blade design. (**b**) Blades properties.



#### *2.3. Numerical Analysis*

The drag and lift coefficients, respectively, *CD* and *CL*, are compared for the three levels of erosion and the clean blade in Figure 3a. The change in the original design of the blade causes a variation in the power coefficient *CP* and the electric power generated *PG*.

In order to complete analysis, the lift coefficient *CL* is compared with respect to the leading edge of the blade, which is denoted by *α* using QBlade, as shown in Figure 3b. Parameter *α* represents the angle of inclination of the blade with respect to the direction of the wind force. This angle can change in a range between −20 to 20 degrees. In this study, the simulation tests were carried out considering *α* with values of 10 and 20 degrees. For the three erosion cases considered here, *BI*1, *BI*<sup>2</sup> and *BI*3, *α* = 10 degrees. Figure 3b shows the behavior of *CL* relative to our three erosion levels. The percentage of erosion on the leading edge of the blade impacts its aerodynamic response, particularly the lift and drag coefficients (*CL*) and (*CD*). Figure 3a,b show both coefficients reducing their performance when erosion is increased, this considering a particular angle of attack of 10 degrees. The power coefficient (*Cp*) of the wind rotor is affected by the presence of erosion as well, minimizing the electricity production potential (*PG*), shown in Figure 3c,d.

**Figure 3.** (**a**) Relationship between the drag coefficient (*CD*) and the lift coefficient (*CL*). (**b**) Lift coefficient *CL* Vs angle of attack *α*. (**c**) Power coefficient *Cp* for *BI*<sup>1</sup> and the clean wind turbine blade. (**d**) Power output *PG* for *BI*<sup>1</sup> and the clean blade.

Table 4 presents comparative results of the drag coefficient *Cd* and the lift coefficient *Cl* for the three levels of erosion, with respect to the clean blade. Both coefficients are reduced when the depth of the damage is increased on the surface of the blade, affecting aerodynamic performance.

**Table 4.** Comparison of the leading edge in three different erosion's depth for the drag coefficient *Cd* and the lift coefficient *Cl*. Last two columns show the percentage difference relative to the clean blade (*Cd*(%) and *Cl*(%)).


Finally, the behavior of the power coefficient *CP* considering *BI*<sup>1</sup> is shown in Figure 3c, where the blue line represents the clean blade and the red line represents the eroded blade. It is possible to observe the variation of *CP*, which decreases to *CP* = 0.39, while *CP* = 0.41 for the clean blade. The power output generated *PG* is also affected by erosion, which decreases to around *PG* = 1.30 kW with respect to *PG* = 1.36 kW for the clean blade, as shown in Figure 3d.

#### *2.4. Experimental Modal Analysis*

In order to validate our simulated model of erosion, the following tests were carried out using a physical model. The blade was manufactured with a mixture of fiberglass, catalyst and resin. To cause the different levels of erosion on the leading edge of the blade, a Dremel tool was used, which uses a disc to gently sand and abrade the area of interest. Finally, to measure the percentage of erosion, a Vernier was used, measuring depth and thickness at the worn area. The experimental test of the blade was carried out in the Laboratory of Vibration of the Department of Mechatronics Division in the CINVESTAV–IPN. To evaluate the levels of erosion discussed in the numerical results, it was necessary the use an electromagnetic shaker model ET-139 (manufactured by Labworks©) to excite the blade, which is mounted over a mechanical rail supported by a mechanism of ball bearing that allows limited horizontal displacement. The electromagnetic shaker is controlled via a linear power amplifier (manufactured by Labworks©) model PA-138. The response in terms of acceleration and force applied by the electromagnetic shaker to the blade is monitored by using an impedance head placed at the stinger of the shaker connected via a cable to the data acquisition system (manufactured by Klister LabAmp©). The frequency response of the system is fed back using accelerometers model 8640A (manufactured by Klister©) mounted at the tip section of the blade. The data acquisition system is connected via USB to an external graphical interface implemented in MATLAB/Simulink©for the analysis of experimental data captured with a Sensoray©card. A comparative study of the experimental and Finite Element Method (FEM) response (simulation) of a wind turbine blade of 1 Kw of electrical power of the first three vibration modes is summarized Table 5. Results indicate that our simulated model behaves consistently with the real-world tests. The variations in the frequency response of the blade is due to the loss of material on the leading edge of the tip section, which can compromise the stability and balance of the wind rotor.

**Table 5.** Modal analysis of the simulated (FEM) and physical (experimental) blade under different erosion conditions; values given in Hz.


#### **3. Erosion Detection with Machine Learning**

This section deals with the automatic detection of erosion in wind turbine blades. The goal is to detect where erosion has occurred using a classification model, considering the three different cases of where erosion can appear on the blade tip: on the bottom edge, the top edge or both. Moreover, we predict, using regression models, the amount of erosion on the blade. Both tasks are posed as supervised learning problems and solved using ML algorithms by performing feature extraction on the power and vibration response of the blade through QBlade simulation. However, before presenting our proposal, we briefly survey related works in this domain.

#### *3.1. Related Work*

Several works have applied ML towards the detection of different types of problems in wind turbines. For instance, static and dynamic regression models have previously been used to detect failures in wind turbines based on vibration analysis [5]. Another example is [17], which presents an approach to predict when preventive maintenance should be performed, focusing on the remaining useful life of a wind turbine before a failure occurs and diagnosing the type of failure. The proposal involves low implementation costs because it is based solely on information collected from the very common SCADA system. A recent example also includes forecasting of wind speed assessment using satellite data and ML [18], specifically a neural network.

In [11], the authors classify the occurrence of different types of failures in blades, using a piezoelectric accelerometer to measure the vibration of the blade. That work considered five types of damage to the top of the leading edge of the blades, namely bending, cracks, looseness, pitch, twist and erosion. To classify the signals time-domain, feature extraction is performed on the vibration signals, focusing on different types of summary statistics. In a more recent study by the same authors [19], they also use vibration signals, histogram features and ML to monitor the condition of wind turbine blades, in this case using *lazy* classifiers.

ML has also been applied to maintenance management of blades in [20]. The work is based on the detection of delamination, a common structural problem that can generate large costs. Continuous monitoring of turbines is the focus of [21], using real data from a SCADA system to predict damage to the structure and blades of wind turbines. The authors present two models for this: the first is the use of multilayer neural network, and the second is adaptive networks with a fuzzy inference system. The proposal is to monitor the power curve signal, achieving good precision. Sound analysis, a unique approach, has also been used for fault detection, extracting descriptive feature of acoustic waves and detecting damage using common ML methods [10]. A related work can be found in [22], where ML is used to estimate turbine energy yield losses due to erosion on the leading edge of the blade.

In general, few works deal specifically with erosion, and those that do not focus on a detailed analysis of this type of failure.

#### *3.2. Data Set*

The dataset was generated with QBlade and the procedure outlined in Section 2.2. A total of 100 blades were simulated for each type of erosion of the blade tip (bottom edge, top edge or both), producing a dataset with 300 samples, similar to [11]. This work assumes that all blades used in a real-world setting will have a certain amount of erosion in at least one of the edges of the tip. Therefore, we do not consider the case in which the blade is completely clean. It must be stated that, while we are relying on simulated data, it has been shown that simulated results of wind turbine blade performance are reliable predictors of on site behavior [22,23].

In order to simulate an eroded blade, each of the contour points of the blade profile was perturbed, adding displacements within the range of 8% to 18%. The same seven contour points on the tip of the blade were modified to model different levels of erosion. Half of the samples were generated using uniform grid sampling, while the other half of samples were generated with random values within the specified range. For example, for the lower edge cases, 8% of erosion was removed from the Y-axis coordinate value; subsequently, the percentage of erosion increased by 0.2% up to 18% to generate 50 samples. For the remaining 50 samples, the amount of erosion was determined randomly by using a uniform distribution *U*[8, 18]. Random samples were used to simulate a rugged surface on the blade, which can be caused by random events such as contact with insects or large sand particles. Our approach is justified since roughness on a blade is often simulated with a random surface [24,25].

Each blade was simulated with QBlade using the settings in Section 2.1. The acceleration response at the blade tip was obtained, using the QFEM tool for structural design and modal analysis of each blade. The NREL FAST tool [26] was used to carry out analysis of the dynamic response of wind turbines. The vibration of the acceleration signal at the tip of the blade is selected as output. The simulation parameters are as follows: time step of 0.1, 3 blades, a rotor speed of 296 rpm and air density of 1.225 k/m3. The wind fields are

specified in Table 6 by mostly using the same simulation values used to determine power output. The difference is simulation time and air density.


**Table 6.** Wind field and wind simulation parameters.

#### *3.3. Feature Extraction*

After obtaining the samples of both power and acceleration for the different blades, we proceeded to perform feature extraction. For this work, feature detection was carried out in the time domain given the success of such measures in similar work [11] and in the analysis of other complex signals [27,28].

Let *<sup>ξ</sup>*(*t*) <sup>∈</sup> <sup>R</sup>*<sup>T</sup>* denote the vector containing a time series from a single signal, and *<sup>T</sup>* denotes the number of samples in *ξ*. A feature of *ξ*(*t*) is denoted by *x*, while the matrix **X** = [**x**1, .., **x***F*] contains all features from all samples, **x***<sup>i</sup>* is the vector of a single feature, and *F* is the total number of features extracted.

The feature extracted from the signals include six statistical descriptors, namely mean, median, maximum, minimum, sum, standard deviation, variance and kurtosis. Moreover, we also extract the following:

• Power: *P<sup>ξ</sup>* = <sup>1</sup> *T*Σ<sup>∞</sup> <sup>−</sup>∞|*ξ*(*t*)| 2;

$$\bullet \quad \text{First difference: } \delta\_{\tilde{\xi}} = \frac{1}{T - 1} \Sigma\_{t - 1}^{T - 1} |\tilde{\xi}(t + 1) - \tilde{\xi}(t)|\!\!\!/\!\_{\tilde{\varepsilon}}$$

• Normalized first difference: ¯*δξ* <sup>=</sup> *δξ σξ* ;

\* 
$$\mathop{\rm Second}\limits\_{\ldots}\limits\_{\ldots}\textrm{Second difference: }\gamma\_{\xi} = \mathop{\frac{1}{T-2}\Sigma\_{t-1}^{T-2}}\nolimits\_{t-1}^{T-2}|\dot{\xi}(t+2)-\dot{\xi}(t)|\dot{\gamma}$$

• Normalized second difference: *<sup>γ</sup>*¯*<sup>ξ</sup>* <sup>=</sup> *γξ σξ* .

Power *Pξ* measures the strength of the signal or the energy consumed per unit of time. The first and second differences show the changes of a signal in time. The normalized first difference is also known as the Normalized Length Density and is used to quantify the self-similarities contained in a signal. Additionally, we also extract what is referred to as Hjorth features [29]. These include the following: Activity, Mobility and Complexity. The Activity feature represents the variance of the signal and is computed by the following.

$$A\_{\tilde{\xi}} = \frac{\Sigma\_{t=1}^T (\xi(t) - \mu\_{\tilde{\xi}})^2}{T} \dots$$

The Mobility feature is defined by the standard deviation of the slope of the EEG signal using as reference the standard deviation of the amplitude expressed as the following ratio by time unit.

$$M\_{\vec{\xi}} = \sqrt{\frac{var\left(\dot{\xi}^{\vec{\xi}}(t)\right)}{var\left(\dot{\xi}(t)\right)}}\ .$$

The Complexity feature measures the signal's variation using a smooth curve as reference provided by the following.

$$\mathcal{C}\_{\vec{\xi}} = \frac{M(\vec{\xi}(t))}{M(\vec{\xi}(t))} \ .$$

Another time domain feature is the Non-Stationary Index (NSI) [30]. Signal *ξ* is divided into segments, and their respective *μ<sup>i</sup>* is computed. NSI is defined as the standard deviation of the segments' *μi*. When NSI is high, the signal is considered to be "less stationary."

The last feature includes Higher Order Crossings (HOC) [31]. The feature describes the oscillatory nature of signal counting the number of sign changes over multiple variants of the signal. A total of 10 distinct featur HOC features were extracted.

In total, this work considers 27 time domain features to characterize the signals of interest extracted from the wind turbine blade.

#### Classification and Regression Problems

The above feature extraction process produces a total of 27 time domain features for each signal. These features are used to pose three classifications by using the following: (1) the features from the power signal; and (2) adding the features from the acceleration signal. An ML model will learn to use these features to determine what edge of the blade is affected by erosion.

Moreover, the same feature set will be used to generate a regression model to estimate the exact amount of erosion. In this scenario, the objective is to predict the level of erosion, which ranges from 8 to 18 percent. In this case, the location of the erosion (top, bottom or both) is not taken into account, and the percentage of erosion is the target of the learning process.

#### *3.4. Auto Machine Learning with H2O-DAI*

AutoML is an approach for automating the design, tuning, implementation and evaluation of complete ML pipelines. The goal is to simplify the manner in which ML models are tested and evaluated such that the process by which the models are generated provide a comprehensive evaluation of the best possible approach to solve a given problem. In this proposal, we use H2O-DAI, which stands for H2O Driverless AI, which offers a very simple user interface and a comprehensive set of tools to perform AutoML [32]. For instance, it makes a choice from a set of state-of-the-art models, such as XGBoost [33], Generalized Linear Models [34] and Deep Learning [35].

There are basically four tuning hyperparameters that are used to configure the AutoML process of H2O-DAI; these include the following. Accuracy refers to the amount of effort to find the best possible pipeline in the range (1–10); it is set to 7 in these experiments. Time controls the duration of the search process, it is set to 2 in our experiments. Interpretability controls the amount of feature engineering performed by the AutoML system. In this case, since a diverse set of features is already being used, it is set to 8. Moreover, to evaluate performance, 6-fold cross validation was used. All experiments were carried out on an IBMP Power 8 Server for High-Performance Computing with two Power 8 processors and two NVIDIA Tesla P100 GPUs.

#### 3.4.1. Classification Results

Summary of the results are presented as an average confusion matrix based on the classification achieved on the testing folds of the cross validation process. Results are shown in Table 7 when using the power output for feature extraction, where class labels are shown as Bottom, Top and Both for each of the three types of erosion. Other noteworthy classifier performance scores include (given as the average ± standard deviation over all the testing folds) the following: Area Under the Receiver Operating Characteristic Curve of 0.99 ± 0.001 and an F1-Score of 0.98 ± 0.008.


**Table 7.** Average confusion matrix using the power signal for feature extraction.

H2O DAI converged to an XGBoost model for classification [36], using a total of seven input features, four of which are raw features from the 27 time domain features and three automatically engineered features. In particular, for this version of the problem, H2O DAI focused on statistical features, such as the variance, mean and median, but it also used the power feature.

Extending the feature set, incorporating the features computed on the acceleration signal produced optimal results, as shown in the confusion matrix of Table 8. In this case, H2O DAI also converged to an XGBoost model, using a total of 10 input features, including five automatically engineered features. It is notable that all of the features used in this case are features extracted from the acceleration signal, including the first differential, the NSI and the power features.

**Table 8.** Average confusion matrix using both the power signal and the acceleration signal for feature extraction.


#### 3.4.2. Regression Results

The same configuration of H2O DAI is used, which was reported above for classification, with the exception that the scoring function is the root mean squared error (RMSE). Results are presented in Table 9, showing the average performance on the test sets of the 6-fold cross validation. H2O DAI was applied on three groups of features: power signal, acceleration signal and both, showing the mean absolute error (MAE), coefficient of determination *R*<sup>2</sup> and the root mean square percentage error loss (RMSPE). In all cases, H2O DAI converged to a Light Gradient Boosting Machine (Light GBM) [37]. Results show that using both signals for feature extraction produced a highly accurate model in terms of both *R*<sup>2</sup> and RMSPE.

**Table 9.** Regression results for H2O DAI estimating the percentage of erosion showing the average and standard deviation.


#### **4. Concluding Remarks**

This study presents an in-depth analysis of the aerodynamic and modal response of an eroded wind turbine blade. Efficiently and effectively detecting erosion on a blade can have substantial impacts in preventative and timely maintenance of wind turbines. Results show that it is possible to accurately determine where the erosion is present on the blade (top edge, bottom edge or both) and to estimate the level of erosion (between 8 and 18 percent). This is accomplished by analyzing the power signal of the wind turbine and the vibrations of the blade tip. A large set of time domain features was extracted, and the modeling process is carried out by using an AutoML system, namely H2o DAI. As such, this study represents the first contribution that tackles both the detection, localization and estimation of erosion level on the leading edge of a blade using ML. Moreover, this work is the first to apply AutoML in this domain. The process of designing the ML pipeline was carried out in an automatic fashion, without hampering performance and requiring very little human intervention in the design process. This could motivate further collaborative and multidisciplinary research between applied ML and wind energy maintenance and production.

The results presented in this work are consistent with those reported by [11,19], with the slight performance difference probably due to working with simulated data in our case, which is nonetheless a good predictor of real-world performance, as shown by [22,23] and partially validated by our experiments with a physical model.

Among both signals that were analyzed, the accelerometer readings seem to be more informative relative to the power signal, based both on the classification (erosion detection) and regression (erosion level estimation) problems, with small but consistent differences. Moreover, the best performance was achieved when both signals are used for feature extraction. It should be possible to use both models to automatically detect the presence and level of erosion in a properly instrumented wind turbine blade. Future work will focus on applying the same experimental procedure in a fully working prototype: first in a wind tunnel and then in the field.

**Author Contributions:** Conceptualization: J.E.Z. and L.T.; methodology: J.E.Z., M.d.l.Á.G.L. and J.A.C.T.; formal analysis and investigation: J.E.Z., M.d.l.Á.G.L. and J.A.C.T.; writing—original draft preparation: L.T. and J.A.C.T.; writing—review and editing: J.E.Z. and L.T.; resources: J.E.Z. and L.T.; supervision: J.E.Z. and L.T. All authors have read and agreed to the published version of the manuscript.

**Funding:** This research received no external funding.

**Acknowledgments:** The authors want to thank Daniel E. Hernández Morales and Luis A. Cárdenas Florido for their technical support in the development of this research, as well as AP ENGINEERING INNOVACIÓN TECNOLÓGICA EN ENERGÍAS S.A DE C.V. The authors also thank Gerardo Silva Navarro from the Laboratory of Vibration of the Department of Mechatronics Division in the CINVESTAV–IPN.

**Conflicts of Interest:** The authors declare that they have no conflicts of interest.

#### **References**

