Sensor Screening Methodology for Virtually Sensing Transmission Input Loads of a Wind Turbine Using Machine Learning Techniques and Drivetrain Simulations

Azzam, Baher; Schelenz, Ralf; Jacobs, Georg

doi:10.3390/s22103659

Open AccessArticle

Sensor Screening Methodology for Virtually Sensing Transmission Input Loads of a Wind Turbine Using Machine Learning Techniques and Drivetrain Simulations

by

Baher Azzam

^*

,

Ralf Schelenz

and

Georg Jacobs

Center for Wind Power Drives, RWTH Aachen University, 52074 Aachen, Germany

^*

Author to whom correspondence should be addressed.

Sensors 2022, 22(10), 3659; https://doi.org/10.3390/s22103659

Submission received: 31 March 2022 / Revised: 28 April 2022 / Accepted: 9 May 2022 / Published: 11 May 2022

(This article belongs to the Special Issue Intelligent Sensors and Machine Learning)

Download

Browse Figures

Versions Notes

Abstract

:

The ongoing trend of building larger wind turbines (WT) to reach greater economies of scale is contributing to the reduction in cost of wind energy, as well as the increase in WT drivetrain input loads into uncharted territories. The resulting intensification of the load situation within the WT gearbox motivates the need to monitor WT transmission input loads. However, due to the high costs of direct measurement solutions, more economical solutions, such as virtual sensing of transmission input loads using stationary sensors mounted on the gearbox housing or other drivetrain locations, are of interest. As the number, type, and location of sensors needed for a virtual sensing solutions can vary considerably in cost, in this investigation, we aimed to identify optimal sensor locations for virtually sensing WT 6-degree of freedom (6-DOF) transmission input loads. Random forest (RF) models were designed and applied to a dataset containing simulated operational data of a Vestas V52 WT multibody simulation model undergoing simulated wind fields. The dataset contained the 6-DOF transmission input loads and signals from potential sensor locations covering deformations, misalignments, and rotational speeds at various drivetrain locations. The RF models were used to identify the sensor locations with the highest impact on accuracy of virtual load sensing following a known statistical test in order to prioritize and reduce the number of needed input signals. The performance of the models was assessed before and after reducing the number of input signals required. By allowing for a screening of sensors prior to real-world tests, the results demonstrate the high promise of the proposed method for optimizing the cost of future virtual WT transmission load sensors.

Keywords:

machine learning; artificial intelligence; multivariate data analysis; feature importance; virtual sensing; drivetrain simulation; time-series simulation

1. Introduction

Several recent trends in the field of wind energy motivate the need for monitoring the load situation within the WT drivetrain during operation. The increase in WT-rated power and rotor diameter over time has been a prevalent trend in the industry in recent decades and has resulted in an increase in the loads experienced by WT drivetrain components [1,2,3]. In addition, space limitations for wind farms have resulted in the commissioning of more densely packed wind farms [4]. Due to turbine interactions, such as wake effects, a high location dependency for the load situations experienced by WTs in different areas of a given wind farm arises [4,5]. Particularly of interest is the load situation within the main gearbox (GB) of the WT, as overloading can cause long downtimes, as well as high repair costs in the case of its failure, as compared to other WT components [6,7]. Whereas currently deployed WT condition monitoring systems aim to detect early signs of failure, they are not capable of measuring the 6-DOF GB input loads during operation, mainly due to the high cost of available direct measurement solutions [8].

On the other hand, solutions that aim to indirectly or virtually sense transmission input loads face a different challenge of collecting sufficiently representative data during WT operation to develop and validate their solutions [8]. Azzam et al. addressed this need by providing a methodology for generating the necessary data for virtual load sensor development using multibody simulations (MBS) of a Vestas V52 WT model subjected to simulated wind fields according to IEC 61400-1 [9]. The authors demonstrated that MBS simulations can be used to significantly reduce the resources needed during development of WT virtual load sensors by providing a more flexible, less costly alternative to physical measurement campaigns as a data source for virtual sensor prototyping. In this paper, we build on this finding by providing an approach for screening candidate sensors needed for virtually sensing 6-DOF WT transmission input loads prior to real-world testing. As the location of a sensor is an important feature of its design, candidate sensors refer not only to the type of measurement, e.g., distance or rotational speed measurement, but also to the location of the envisaged sensor, e.g., left or right torque arm of the gearbox. By identifying the most influential candidate sensors with respect to the accuracy of the virtual load estimator, in this investigation, we aimed to reduce the number of sensors required and hence the field deployment costs of virtual sensor systems for WT drivetrain load monitoring.

Machine learning has been demonstrated to be a highly relevant tool for data analysis in a variety of domains where data are abundantly available, such as finance [10], astronomy [11], and speech and character recognition [12,13,14]. More recently, machine learning, with the help of synthetic data generation tools, has also been successfully applied in various domains. For example, several computer vision investigations were able to achieve tasks such as object detection [15], robot grasping [16,17], and assembly [18] using synthetic data. In a recent highly publicized example, synthetic data in the form of fake impersonation videos, also known as deepfakes, was able to deceive not only humans but also facial recognition software [19]. In the previously discussed applications, the generation of synthetic data was primarily motivated by scarcity of the desired data or, in the case of deepfakes, the non-existence of the desired data. In this investigation, a virtual load sensor is envisaged for indirectly measuring WT transmission input loads via a set of practical and cost-effective sensors aimed at capturing the deformations and misalignments resulting from the targeted input loads during operation. The input variable selection or, in this case, the design of this set of sensors is crucial for the performance of the envisaged virtual sensor system [20]. Once a sensor set is designed, the resource-intensive instrumentation and testing phases can commence in order to collect empirical measurements that can be used to develop and test the virtual sensor algorithms. As a source of data, real measurements provide more realistic data than what can be obtained from a simulation model. However, physically testing competing combinations of candidate sensors is challenging [20].

Two main obstacles hinder the possibility of iteration through competing sensor sets using real-world testing. Although theoretically possible, changing the designed sensor set during testing is practically difficult due to reduced accessibility, as well as the high cost and limited nature of testing time, would need to be reduced to allow for reinstrumentation. In addition, a sufficient variety of operating conditions should be covered during testing, as changes in process characteristics and operating conditions were highlighted in a 2009 survey of engineers to be a major challenge during development and maintenance of virtual sensors [20]. This further extends the requirements and, hence, the costs of the physical tests for the envisaged WT transmission virtual load sensor. It also further motivates the need to screen candidate sensors prior to the costly instrumentation and real-world testing phases. Although domain knowledge is useful for selecting sensors or input variables for virtual sensing algorithms, it is difficult to understand the relationship between every input and output variable; therefore, a systematic methodology is needed for screening candidate sensors [20]. In this investigation, we address this need by presenting a methodology for sensor screening during the data-scarce virtual sensor design phase prior to real-world testing. Using the methodology recently published by Azzam et al. [9], simulated wind fields covering a variety of operating conditions were applied to an MBS WT model to generate a time series of displacements, misalignments, and other parameters at various locations in the WT drivetrain. Using these simulated data, the methodology presented in this paper is intended to prioritize candidate sensors prior to real-world testing in order to (1) reduce the number of sensors required and (2) enhance the utility of the subsequent instrumentation and real-world testing phases, which were not repeatable in this project due to resource constraints.

In order to generate the time-series dataset of the 6-DOF GB input loads (hereafter referred to as “target variables”) and the resulting deformations, misalignments, and rotational speeds of drivetrain components (hereafter referred to as “predictor variables”) to be analyzed in this investigation, Azzam et al., assembled models of the various WT components in a virtual MBS environment [9]. The rotor blades, as well as the tower, were modally decomposed after an initial modelling in finite elements (FE) [21,22]. In addition, various drivetrain components, e.g., main frame and gearbox planet carrier, which contribute to the predictor variables, were modeled as flexible bodies in the MBS environment, whereas micro-level flaws, such as material imperfections, were not incorporated in the simulations [9]. In order to subject the WT model to simulated wind fields, TurbSim, together with the AERODYN force element and a SIMULINK PI controller, was used as part of a co-simulation with MATLAB [23,24]. In their paper on the topic, Azzam et al. provide a more in-depth explanation of the simulation model, as well as a discussion of its validity with respect to the task of virtually sensing WT transmission input loads [9].

With the help of machine learning techniques, simulated data provide a valuable opportunity to gain insights into the influence of WT transmission input loads on deformations, misalignments, and other phenomena in the various components of the WT drivetrain that can later be measured using candidate sensors. Real measurements are expected to diverge from the simulated time-series data due to the presence of noise and various imperfections in the real-world test setup that cannot be fully replicated in a simulation environment. Therefore, it is important to emphasize the intended scope of use of the presented methodology as a sensor screening tool in the data-scarce design phase of a sensor set to be instrumented for subsequent real-world testing. Despite expected differences between real measurements and simulated data, we expect the general correlations present in measured data to agree with the correlations present in the simulated data. This is not only based on the simulation methodology presented by Azzam et al. [9] but also on the findings of similar investigations involving WT MBS models subjected to simulated wind fields, which demonstrated high correlation between simulated and measured data of WT transmission loads and displacements [25,26]. The current investigation is focused on prioritizing candidate sensors based on the influence of the transmission input loads on the respective phenomena they are intended to measure. Therefore, our aim was to gain insights into the relationships between said loads and phenomena in order to select sensors that are likely to capture highly pronounced, practically measurable effects of the transmission input loads where they manifest in the drivetrain.

In order to capture such relationships, machine learning techniques based on the prolific random forest (RF) ensemble method were utilized. Ensemble methods aim to aggregate the results of several models into a combined model, the performance of which exceeds that of any of its constituent models [27,28,29]. Machine learning using ensemble learning, or ensemble learning, has been successfully applied in numerous investigations to solve both classification and regression problems [27,30]. Dietterich attributed the success of ensemble methods to their statistical, computational, and representational advantages [31]. Several theoretical studies also provided insights into the performance drivers of ensemble learning, such as stochastic discrimination [32], strength correlation [33], margin theory [34], and bias–variance–covariance decomposition [35]. Fawagreh et al. identified three widely used ensemble approaches: boosting, bagging, and stacking [36]. Due to its robustness against model overfitting and noise in the data, bagging was identified as the approach of choice for this investigation [36,37]. Overfitting occurs when a model performs significantly better on training data compared to test or new data [38]. Bagging describes bootstrap aggregating [39], which consists of two main steps. (1) Bootstrapping involves training each constituent model on a randomly drawn sample of observations from the available data with replacement. (2) Aggregating involves an averaging of the outcomes of the constituting models to reach the output of the ensemble model. Combining bagging and the random selection of features [40,41,42], Breiman introduced the RF method, which consists of multiple decision trees grown not only using bootstrapped samples but also with a random sample of features to grow the decision trees [33]. Additionally, Breiman utilized the classification and regression trees (CART) technique [43] for variable selection at each node of the RF trees in order to introduce more randomness in the construction of the trees [33]. Loh provided the context for both classification and regression trees in his paper on the history of these algorithms [44]. Since its conception by Breiman [33], the RF algorithm has been one of the most commonly used ensemble methods in various investigations in the field of wind energy, as well as many other fields [45,46,47,48,49]. Due to its aforementioned advantages, particularly its speed, relative insensitivity to overfitting, and its variable importance measurement, the RF algorithm was utilized in this investigation to gain insights into the importance of candidate sensors or predictors.

In this paper, RF models were first developed to capture the complex relationships between the target and predictor variables during simulated operation. After training and testing the RF models on the task of estimating the target variables using all predictor variables, further tests were performed on the models with the aim of identifying the most important predictor variables for model performance. The simulations performed by Azzam et al. generated the dataset analyzed in this investigation [9]. The dataset contains the target and predictor variables as generated from a simulated operation of a Vestas V52 WT MBS model undergoing time-series wind field simulations [9]. With this paper, we aim to present a methodology for reducing the number of needed sensors, or predictor variables, for a 6-DOF WT virtual transmission load sensor system prior to real-world testing. Because real-world testing is costly to set up and repeat, we aimed to enhance the utility of subsequent real-world tests by providing a tool for prioritizing sensors to be installed, the signals of which contain the most information on the WT transmission input loads. Because real measurements are needed for many virtual sensing investigations where experimental data might be scarce or unattainable at early design stages, we also aimed to support the efforts of a large audience in the field with the outcomes of this paper.

The rest of this paper is organized as follows. In Section 1, we introduce and motivate the methods used in this investigation. In Section 2.1, we present the simulated data to be analyzed. In Section 2.2, we provide an overview of the proposed methodology, followed by an explanation of the data preprocessing steps in Section 2.3 and an explanation of the error estimation approaches used to assess the performance of the developed models in Section 2.4. In Section 2.5, we elaborate on considerations and challenges in practical implementation of the proposed method. In Section 3, we present the results of the investigation, followed by a discussion of the results in Section 4. The conclusions are then outlined in Section 5.

2. Data and Methodology

2.1. Data Description

The available dataset consists of 28 variables sampled at 200 Hz for a total duration of 60 min. Table 1 lists the available variables, indicating the subject (e.g., GB torque arms, wind, or generator), nature (e.g., displacement, misalignment, or rotational speed), location, and axis of the measurement, as well as whether each variable belongs to the target or predictor variable group. All measurements are with respect to the machine carrier of the WT. Furthermore, directional location data in Table 1 (where specified), such as right or left torque arms, are from the point of view of an observer facing the rotor from the position of the generator. Hereafter, the variable ID in Table 1 will be used to refer to the respective variable. Figure 1 indicates the orientations of the x, y, and z axes of measurement with respect to the WT drivetrain during the simulation.

The dataset covers six wind field simulations of the Vestas V52 WT model, each resulting in 10 min of simulated data. Three nominal wind speeds corresponding to the cut-in, rated, and cut-out wind speeds (3, 12, and 25 m/s, respectively) were used to generate two of the six wind fields, with a different random seed for their respective turbulence models [9]. The three blade pitch angles in the current dataset are identical. Therefore, only one blade pitch angle was included in the analysis. For a more detailed explanation of how the dataset was generated, the reader is referred to the following publication [9].

2.2. Methodology Overview

The methodology for identifying highly influential predictor variables with respect to virtual sensing performance is based on an iterative training and testing of RF models in order to discover the patterns in the available data. Figure 2 provides an overview of the methodology. As mentioned earlier, in order to obtain the available data listed in Table 1, a model of the Vestas V52 WT was subjected to a simulated wind field in a virtual multibody simulation (MBS) environment [9]. The simulated wind fields were generated in accordance with design load cases from the IEC 61400 standard [50]. During simulated operation, time-series data of the 6-degree of freedom (6-DOF) transmission input loads were extracted, as well as signals from potential sensor locations covering deformations, misalignments, and rotational speeds at various drivetrain locations. These locations were selected based on an initial hypothesis of sensor locations suspected to be useful and practical for the envisaged transmission input load virtual sensor. However, as the computational cost of adding sensors to the WT model in the simulation environment is relatively low, additional locations could be investigated.

As shown in Figure 2, the process used to identify an optimal subset of sensor locations involves, first, the development of RF models that are trained and tested to estimate the desired target variables using all available predictor variables. Secondly, the methodology involves performing a feature importance analysis on the data in order to measure the influence of the respective predictor variables on the RF model accuracy relative to one another. At this stage, a prioritization of the available predictor variables can be achieved based on their influence or importance, and a subset containing highly important predictors can be selected. By training and testing RF models using only that subset of predictors and comparing the resulting performance to that of the RF models that had access to all available predictors in their development phase, a decision on whether to enlarge the chosen subset to contain more of the available predictors can be made. This decision would largely depend on the cost–accuracy tradeoff deemed appropriate by the development team. In Section 2.2.1, we provide a brief explanation of the RF algorithm [33], whereas in Section 2.2.2, we describe the feature importance analysis used in this investigation: the Boruta algorithm [51].

2.2.1. Random Forests

In this investigation, RF regression models [33] were developed in order to capture the relationships between the predictor and target variable in the available data. The goal was to estimate the 6-DOF transmission input loads using the simulated predictor sensor signals listed in Table 1 as input. The available data introduced in Section 2.1 were used to train and test the RF models.

The RF model [33] consists of an ensemble of regression decision trees that, collectively, are able to learn to approximate a function. In this investigation, six such functions were desired, each capable of producing an accurate estimate of one of the 6-DOF transmission input loads using the predictor variables as input variables.

Figure 3 illustrates an example regression decision tree aimed at estimating torque based on v7, v8, and v9 as input variables. As a reminder, Table 1 lists the parameter measured by v7 to be displacement of the right hand-side GB torque arm along the x-axis, and Figure 1 demonstrates the orientation of the x-axis relative to the WT drivetrain. Starting from the root of the tree at the top of Figure 3, a given data point follows either the right or left path depending on its value for v7. It then follows the appropriate path depending on its value for v8 or v9 to the so-called leaves of the decision tree, illustrated as dotted circles in Figure 3. After a number of data points make their way from the root of the tree to one of its four leaves depending on the values of their respective input variables, the average of the target variable (in this example, torque) of the group of data points that reached each leaf is shown in the figure. This process is referred to as training the decision tree. In the example tree shown in Figure 3, the average torque of the data points that reached the leftmost leaf is 10 kNm. If a new data point with an unknown torque reaches the leftmost leaf; then, the decision tree estimates that the torque for that data point is 10 kNm. A regression RF model contains several such regression decision trees with different numbers of branches and different splitting thresholds at each branch to, collectively, reach a more accurate estimation of the target variable as compared to the estimation reached by an individual decision tree. Breiman [33] provides more information about random forests.

In order to develop an RF model, the investigator must design its architecture by deciding on characteristics of the model, such as the number of decision trees to use, as well as other so-called hyperparameters that influence the performance of the model and the computational effort it will require during its lifecycle. In this investigation, multiple iterations of trial and testing led to a more favorable set of hyperparameters in a process called hyperparameter tuning. Experience and domain knowledge guide the process of hyperparameter tuning by helping to prioritize certain model hyperparameters. For example, two of the most influential RF hyperparameters are sample size and number of tried features at each decision tree split (hereafter referred to as “mtry” [52]. Sample size refers to the number of data points selected from the entire set of training data to be used for training each decision tree in the random forest. The parameter “mtry” refers to the number of candidate predictor variables assessed for each split in the decision tree or, in other words, each point in a given tree where a data point follows the right or left path, as shown in Figure 3, during the training phase of an RF model. Whereas decreasing either of these hyperparameters is likely to have a negative impact on decision tree accuracy and, in turn, the RF model accuracy, increasing them may result in model overfitting [52]. Overfitting describes a model that reaches significantly higher accuracies on data points used for training than other data points, for example, those used for testing the model. For some hyperparameters, there is not a clear consensus on the degree to which they influence model accuracy. For example, the number of decision trees in a random forest is one such hyperparameter that has been set to a computationally feasible large number in previous investigations [52,53,54]. In their paper on the topic, Probst et al. explain some best practices for RF hyperparameter tuning, as well as the potential effects of tuning the different hyperparameters on RF model performance [52].

2.2.2. Identification of Influential Predictor Variables Using the Boruta Algorithm

Whereas the RF models aimed to estimate the 6-DOF transmission input loads, in this section, we present the process by which the predictor variables with the biggest impact on model accuracy are identified. This task was achieved by following the Boruta algorithm [51]. A copy of each predictor variable was randomly shuffled, resulting in 18 distorted versions (so-called shadows) of the predictor variables. The shadows were then added to the set of original predictor variables, and a statistical test [51] was used to iteratively assess whether an original predictor variable will prove to be less important to target variable estimation than the random shadows. A given predictor variable is deemed unimportant if, on average over several iterations, it is found to be less important than the most important shadow variable. For the variables that are deemed important, the result of the analysis can also be used to identify highly important predictor variables that are most influential with respect to the estimation accuracy. Kursa and Rudnicki provide more details on the Boruta algorithm and the calculation of the importance value [51].

2.3. Data Preprocessing

Because the predictor variables in the available dataset vary in their scale of measurement, certain data preprocessing techniques could enhance the performance of the feature importance methodology. In their often-cited paper on the topic of bias in RF variable importance measures, Strobl et al. demonstrated that the existence of predictor variables with significantly varying scales of measurement can be a source of bias when identifying important predictors [55]. As a countermeasure, the predictor data used in this investigation were standardized. Standardization was performed on the data belonging to each predictor variable using the following formula:

Z = \frac{X_{(i)} - u}{σ}

(1)

where X, u, and σ refer to the sample of each predictor variable, its mean, and standard deviation, respectively. The available data were then split into 2 datasets: training and testing. The training set contained 50% of the data, whereas the testing set contained the remaining 50%. This step was performed in order to assess and monitor model overfitting if it takes place. As described in the Introduction, model overfitting takes place when the model performs significantly better on data used in training as compared to new data. Whereas the training set is used to train the models, the testing set would not be used in the training phase. This is not only to use the testing set to assess the performance of the model but also to ensure that overfitting has not taken place. Because, in the case of the testing set, performance is assessed on new data from the point of the view of the trained model, a comparison to the model performance using the training set would indicate model overfitting and, in turn, indicate the need for a revision of the model training procedure.

In addition, the aforementioned scaling was performed in such a way as to avoid data leakage or leakage of information from the testing set to the training set. The mean and standard deviations used to scale both the training and testing sets were obtained only from the training set. This prevented such information from the test set from influencing the training set and potentially causing model overfitting.

2.4. Error Estimation

In order to assess the performance of the proposed methodology, the error of the developed models with all predictors and with only the highly important subset of predictors was estimated using two approaches: the coefficient of determination, or R² score, [56] as well as the LRE score [9]. First, the R² score was calculated for each of the 6-DOF loads by applying Formula (2) to the target values from the test dataset and model estimations [56]:

R^{2} (y, \hat{y}) = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(2)

where

y_{i}

,

{\hat{y}}_{i}

, n, and

\bar{y}

represent the ith expected value, the ith estimated value, the quantity, and the mean of the expected values of the target variable, i.e.,

\bar{y} = \frac{1}{n} \sum_{i = 1}^{n} y_{i}

, respectively.

In addition, an approach recently developed by Azzam et al. [9] was utilized to estimate the error of the estimated loads by the RF models used in this publication with all predictors and with only the highly important subset of predictors. Following this approach, statistics covering the minimum, maximum, and average of each of the 6-DOF transmission input loads, listed in Table 2, were extracted from the simulated data at nominal wind speed. For example, the forces along the y-axis typically fluctuate around an average value close to zero [9]. Azzam et al. argue that if mean absolute error [57], for example, is calculated and compared to the average values of the respective loads, this would result in an inflated estimation of error for a load such as Fy. Alternatively, comparing the error to the maximum value of the respective load would result in an underestimation of error. By scaling the mean absolute error against the median of the three statistics (minimum, maximum, and mean), an error relative to a simple statistic representing the loads at nominal wind speed can be calculated, taking into account the problems mentioned above with comparing the error to only one statistic. Therefore, in addition to the commonly used R² score, the load relative error (LRE) was used in this investigation to assess the performance of the developed models, whereby the LRE of the ith estimated value for each target variable was calculated using Formula (3) [9]:

{LRE}_{i} = \frac{y_{i} - {\hat{y}}_{i}}{median (|y_{m a x}|, |y_{m i n}|, |\bar{y}|)}

(3)

where

y_{m a x}

and

y_{m i n}

are the maximum and minimum values of the target variable, respectively.

The simulated load statistics selected for the calculation of the LRE for each transmission input load are underlined in Table 2 to facilitate interpretation.

2.5. Considerations and Challenges in Practical Implementation

In this investigation, simulation data were used to screen candidate sensors prior to real-world testing following the methodology outlined in Figure 2. In order to assess the performance of the proposed method, RF models were trained and tested on the task of virtually sensing the 6-DOF transmission input loads before and after the application of the proposed method to identify a relatively important subset of simulated predictor variables. Therefore, the accuracy of the RF models is a function of the validity of the simulation model and its resulting data. Because the unconditional validity of a simulation models is refuted by most experts [58,59,60,61], the validity of a simulation model can only be defined within the bounds of a project and its intended application [9,62]. As mentioned earlier, the presented methodology is intended as a sensor screening tool in the data-scarce design phase of sensor sets to be instrumented for subsequent real-world testing to develop a virtual 6-DOF WT transmission input load sensor. Real measurements are expected to diverge from the respective simulated time-series data due to the presence of noise and various imperfections in the real-world test setup that cannot be fully replicated in a simulation environment. The performance of the RF virtual sensor models developed in this investigation provides insights into the possibility of learning the patterns between the predictor and target variables of the envisaged virtual sensor system using RF regression models. However, real-world measurements are still needed to assess whether this ability applies to real-world data. Therefore, the next step in this project was to instrument and test two drivetrain configurations of the Vestas V52 as part of a measurement campaign aimed at further developing the envisaged 6-DOF transmission input load virtual sensor. For that purpose, the proposed method uses simulated data to provide a valuable opportunity to screen different candidate sensors of the envisaged virtual sensor system prior to the resource-intensive and less flexible real-world test. In the remainder of this subsection, we identify some considerations for the practical implementation of this method.

Although feature scaling can have a considerable impact on the performance of certain ML algorithms, e.g., artificial neural networks (ANN) and support vector machines (SVM), it can have less of an impact on other algorithms [55,63,64]. Through experiments comparing the performance of ML algorithms on different datasets with and without data preprocessing, Misra et al. found random forests to be one of the top-performing algorithms without feature scaling [63]. Experiments by Ahsan et al. also showed the performance of RF to be relatively insensitive to feature scaling as compared to distance-based algorithms, such as SVM, and algorithms relying on stochastic gradient descent, such as ANN [64]. This is a logical finding, given that algorithms that rely on stochastic gradient descent, such as ANNs, use the same learning rate for every parameter [65]. On the other hand, the tree-based algorithm used in RF makes each splitting decision within a given tree based on only one variable at each node of the tree [43]. Therefore, the performance of RF models tends to be relatively insensitive to feature scaling [64,66]. However, the effect of data scaling may vary depending on the intended use of a given algorithm. A relevant example is the investigation by Strobl et al. [55], where experiments showed that using the RF algorithm described by Breiman [33] in variable importance investigations may lead to biased findings when predictor variables vary in their scale of measurement. Therefore, to mitigate against bias in the variable importance outcomes of this investigation, data scaling was used to limit variations in the scale of measurement of the predictor variables. The two most widely used data scaling algorithms are the min–max algorithm and the z-score algorithm, also known as the standard scaler algorithm [67]. Whereas the effect of scaling may vary with different ML models, investigations on the effect of different scaling methods on the performance of ML models aim to identify suitable scaling algorithms for the most widely used ML models [64]. For example, in their investigation of the influence of different methods of scaling on heart failure patient datasets, Balabaeva et al. achieved a higher RF performance using the standard scaler method as compared to several other methods, including the min–max algorithm [67]. In this investigation, standard scaler was used to scale the available data.

As z-score scaling is sensitive to outliers, it is important to assess the presence of outliers in the data before scaling [68]. In the case of outliers, several approaches can ensure robustness of the scaling operation, such as robust data scaling, where outliers are excluded from the calculation of the mean and standard deviation used in Equation (1) to scale each variable. Because outliers are data points not conforming to an expected behavior, an understanding of the expected behavior is needed to discern outliers. Whereas domain expertise can be used to detect outliers in the data, the process of identifying potential outliers can be automated to assist in investigations involving high-dimensional big data [69]. Thudumu et al. provide a comprehensive survey of anomaly detection techniques for high-dimensional big data applications [69]. Due to the relatively small amount of simulated data analyzed in this investigation, as well as the domain knowledge of the authors, an exploratory data analysis involving visualizations of the data was sufficient to rule out the presence of outliers. For applications involving large amounts of data, we recommend the use of automated outlier detection techniques prior to the application of the proposed method.

3. Results

Six RF regression models were trained to estimate the respective 6-DOF transmission input loads, taking the predictor variables as input. The available data from six simulated wind fields, each lasting 600 s, were used to train and test the RF models. The RF models demonstrated varying levels of skill in estimating the target 6-DOF loads, reaching R² scores [56] for moments about the x-, y-, and z-axes of 0.99, 0.95, and 0.99, respectively, and R² scores for forces along the x-, y-, and z-axes of 0.81, 0.75, and 0.82, respectively.

In order to identify the most influential predictor variables with respect to estimation accuracy, the Boruta algorithm [51] was used to iteratively remove predictors less important than the shadows in 6-DOF load estimation. This process was applied for each of the 6-DOF transmission input loads. The results indicate that for each RF model estimating a specific target variable, a subset of available predictors is of particularly high importance with respect to model accuracy. The Boruta algorithm was applied once for each of the six target variables, resulting in the six plots shown in Figure 4. Also shown in the figure, the minimum, mean, and maximum importance values of the shadows are indicated as S.Min, S.Mean, and S.Max, respectively. In Figure 4, it is clear that all predictors were found to be important, as none have an importance lower than S.Max. In other words, no shadow variable was found to be more important than any of the available predictor variables. However, the plot shows that a subset of the available predictor variables (shown encompassed within a rounded rectangle) was found to be highly important in comparison to the remaining predictors for estimating each of the 6-DOF transmission input loads.

In order to verify that these subsets indicated the optimal sensor locations, RF models were trained using only the predictor variables within the subsets. The resulting R² scores for moments about the x-, y-, and z-axes were 0.99, 0.83, and 0.99, respectively, and those for forces along the x-, y-, and z-axes were 0.71, 0.74, and 0.73, respectively.

For comparison between the performance of the RF models using all available predictor variables and the models that only used the subset of highly important variables, two R² scores, R²_all and R²_sub, are provided in the plots in Figure 4. The specific target variable of the respective plot in Figure 4 is also indicated in the top left corner within the plot for more clarity. Whereas the difference between R²_sub and R²_all varied for the different target variables, R²_sub remained within 13% of R²_all despite using between 50% and 55% of the available predictors to train and test the RF models responsible for R²_sub. To further demonstrate the performance of the developed models, Figure 5 and Figure 6 show plots of the loads, model estimates using all predictor variables, and estimates using only the respective important subset for torque and force along the y-axis, respectively. The figures show 10 s of data from the testing set.

Figure 4 also shows that some predictor variables are important for the estimation of all 6-DOF transmission input loads. In particular, torque arm displacement along the x-axis is a highly, if not the most, important predictor for all target variables.

In order to further demonstrate and assess the performance of the proposed methodology on the entirety of the test dataset, the LRE of the estimated values for each target variable using all predictors, as well as using only the highly important subset of predictors, was calculated. The distributions of the respective calculated LREs for each of the six target variables are shown in Figure 7.

4. Discussion

The results indicate a considerable capability of the RF models in virtually sensing transmission input moments around the x-, y-, and z-axes: v1, v2, and v3. This is demonstrated both by the high achieved R² scores of the RF models that used all available predictor variables, as well as those models that used only the respective highly important subsets for estimating the input moments. Although the model for moments around the y-axis suffered almost a 13% reduction in R²_sub score as compared to the R²_all score, its R²_all score was a high 0.95, indicating the capability of the developed model to capture the relationships between the available predictor variables and this target variable. The fact that the presented methodology achieved a 13% reduction in performance with a 45% reduction in used predictor variables also indicates its capability of identifying and prioritizing important predictors.

The relatively lower sensitivity of the developed RF models to input forces (force along the x-, y-, and z-axes) is likely be due to the drivetrain configuration of the Vestas V52 WT MBS model, which was used to generate the data. The WT drivetrain has a four-point bearing suspension configuration, which is designed to eliminate all GB input loads, apart from torque, using the main bearing configuration of the drivetrain. This would also explain the contrasting performance between the model estimates of torque and those of the force along the y-axis, as shown in Figure 5 and Figure 6, respectively. The availability of more data for training covering more test scenarios through a measurement campaign that is also planned as part of this project is likely to further improve model performance and generalizability. Overall, the results show high promise for the use of regression RF models in virtually sensing transmission input moments and especially torque.

The results of the feature importance analysis also demonstrate the promise of the presented methodology, shown in Figure 2, for the prioritization and reduction in the needed number of sensors or sensor locations for the envisaged 6-DOF GB virtual load sensor. Once again, the performance with respect to torque stands out, as the reduction in the predictor variables available to the RF model resulted in a negligible accuracy difference. Whereas the RF models of the remaining loads suffered in terms of achieved accuracy after reducing the number of available predictor variables, the differences were within 13% of the accuracies of the models with access to all predictors, as shown in Figure 4 and Figure 7. The results are already useful in designing the envisaged virtual sensor setup of the planned measurement campaign, as they enable the project team to prioritize the sensors to be installed prior to the campaign. For example, torque arm displacement along the x-axis is currently of high priority as a result of the analyses presented in this paper. As changing the setup during the campaign is prohibitively costly due to the lack of accessibility of certain locations and the loss of valuable testing time, the method proposed in this paper will be used to optimize the use of the available resources for the setup of the measurement campaign.

5. Conclusions

In this paper, we presented a methodology for screening candidate sensors at different locations of the drivetrain for virtually sensing WT gearbox input loads prior to real-world testing. This was achieved with the use of RF models trained and tested on simulated data from a multibody simulation of a WT model. The main conclusions are as follows:

The results demonstrate the possibility of learning patterns between the predictor and target variables of the envisaged virtual sensor system using RF regression models and simulated WT operational data prior to real-world tests.
The developed RF models demonstrated a higher accuracy in virtually sensing WT transmission input moments as compared to transmission input forces. The performance of the RF model developed for virtually sensing torque resulted in the highest R² score. Performance may vary when using real measurement data to train and test the models due to the presence of noise and other deviations. However, the results obtained using simulated data provide insights into the challenges ahead in virtually sensing the 6-DOF transmission input loads, given the design of the drivetrain under investigation.
The presented methodology is able to screen candidate sensors in the data-scarce design phase of sensor sets to be instrumented for subsequent real-world testing for virtually sensing 6-DOF transmission input loads.

We aim to utilize the methodology presented in this paper to identify high-potential sensor locations and types of an envisaged virtual sensor system for estimating WT 6-DOF transmission input loads.

Author Contributions

Conceptualization, B.A. and R.S.; data curation, B.A.; formal analysis, B.A.; funding acquisition, R.S. and G.J.; investigation, B.A. and R.S.; methodology, B.A.; project administration, B.A. and R.S.; resources, R.S. and G.J.; software, B.A.; supervision, R.S.; validation, B.A. and R.S.; visualization, B.A.; writing—original draft, B.A.; writing—review and editing, B.A. and R.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the German Federal Ministry for Economic Affairs and Climate Action (0324325).

Data Availability Statement

The data are not publicly available.

Acknowledgments

The authors would like to acknowledge the funding from the German Federal Ministry for Economic Affairs and Climate Action of the Research Project “V6LoadS—Virtual 6DOF Load Sensor for the Determination of the Transmission Input Loads and the Resulting Used Up Service Life”.

Conflicts of Interest

The authors declare no conflict of interest.

References

Fraunhofer IEE. Windenergie Report Deutschland 2018; Fraunhofer Verlag: Stuttgart, Germany, 2018. [Google Scholar]
Wiser, R.; Millstein, D.; Bolinger, M.; Jeong, S.; Mills, A. The hidden value of large-rotor, tall-tower wind turbines in the United States. Wind Eng. 2020, 45, 857–871. [Google Scholar] [CrossRef]
McKenna, R.; vd Leye, P.O.; Fichtner, W. Key challenges and prospects for large wind turbines. Renew. Sustain. Energy Rev. 2016, 53, 1212–1221. [Google Scholar] [CrossRef]
Roscher, B.; Werkmeister, A.; Jacobs, G.; Schelenz, R. Modelling of Wind Turbine Loads nearby a Wind Farm. J. Phys. Conf. Ser. 2017, 854, 12038. [Google Scholar] [CrossRef] [Green Version]
Cardaun, M.; Roscher, B.; Schelenz, R.; Jacobs, G. Analysis of Wind-Turbine Main Bearing Loads Due to Constant Yaw Misalignments over a 20 Years Timespan. Energies 2019, 12, 1768. [Google Scholar] [CrossRef] [Green Version]
Kotzalas, M.; Doll, G. Tribological advancements for reliable wind turbine performance. Philos. Trans. Math. Phys. Eng. Sci. 2010, 368, 4829–4850. [Google Scholar] [CrossRef] [Green Version]
Kim, K.; Parthasarathy, G.; Uluyol, O.; Foslien, W.; Sheng, S.; Fleming, P. Use of SCADA data for failure detection in wind turbines. In Proceedings of the ASME 2011 5th International Conference on Energy Sustainability, Washington, DC, USA, 7–10 August 2011. [Google Scholar]
Perišić, N.; Kirkegaard, P.; Pedersen, B. Cost-Effective Shaft Torque Observer for Condition Monitoring of Wind Turbines. Wind Energy 2015, 18, 1–19. [Google Scholar] [CrossRef]
Azzam, B.; Schelenz, R.; Roscher, B.; Baseer, A.; Jacobs, G. Development of a wind turbine gearbox virtual load sensor using multibody simulation and artificial neural networks. Forsch. Ing. 2021, 85, 241–250. [Google Scholar] [CrossRef]
Heaton, J.; Polson, N.; Witte, J. Deep learning for finance: Deep portfolios. Appl. Stoch. Models Bus. Ind. 2017, 33, 3–12. [Google Scholar] [CrossRef]
Banerji, M.; Lahav, O.; Lintott, C.; Abdalla, F.; Schawinski, K.; Bamford, S.; Andreescu, D.; Murray, P.; Raddick, M.; Slosar, A.; et al. Galaxy zoo: Reproducing galaxy morphologies via machine learning. Mon. Not. R. Astron. Soc. 2010, 406, 342–353. [Google Scholar] [CrossRef] [Green Version]
LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]
Deng, L.; Li, J.; Huang, J.; Yao, K.; Yu, D.; Seide, F.; Seltzer, M.; Zweig, G.; He, X.; Williams, J.; et al. Recent advances in deep learning for speech research at Microsoft. In Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, BC, Canada, 26–31 May 2013. [Google Scholar]
Dahl, G.; Yu, D.; Deng, L.; Acero, A. Large vocabulary continuous speech recognition with context-dependent DBN-HMMS. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 22–27 May 2011. [Google Scholar]
Andulkar, M.; Hodapp, J.; Reichling, T.; Reichenbach, M.; Berger, U. Training CNNs from Synthetic Data for Part Handling in Industrial Environments. In Proceedings of the IEEE 14th International Conference on Automation Science and Engineering (CASE), Munich, Germany, 20–24 August 2018. [Google Scholar]
Fang, K.; Bai, Y.; Hinterstoisser, S.; Savarese, S.; Kalakrishnan, M. Multi-Task Domain Adaptation for Deep Learning of Instance Grasping from Simulation. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, 21–26 May 2018. [Google Scholar]
Bousmalis, K.; Irpan, A.; Wohlhart, P.; Bai, Y.; Kelcey, M.; Kalakrishnan, M.; Downs, L.; Ibarz, J.; Pastor, P.; Konolige, K.; et al. Using Simulation and Domain Adaptation to Improve Efficiency of Deep Robotic Grasping. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, 21–25 May 2018. [Google Scholar]
Thomas, G.; Chien, M.; Tamar, A.; Ojea, J.; Abbeel, P. Learning Robotic Assembly from CAD. In Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, 21–25 May 2018. [Google Scholar]
Chesney, B.; Citron, D. Deep fakes: A looming challenge for privacy, democracy, and national security. Calif. Law Rev. 2019, 107, 1753. [Google Scholar] [CrossRef] [Green Version]
Kano, M.; Fujiwara, K. Virtual sensing technology in process industries: Trends and challenges revealed by recent industrial applications. J. Chem. Eng. Jpn. 2013, 46, 1–17. [Google Scholar] [CrossRef] [Green Version]
Berroth, J. Einfluss der Stelldynamik der Rotorblätter auf die Lasten der Blattverstellsysteme von Windenergieanlagen [Impact of Rotor Blades Pitch Dynamics on the Loads of Pitch Systems of Wind Turbines]; Verlagsgruppe Mainz GmbH: Aachen, Germany, 2017. [Google Scholar]
Craig, R.; Bampton, M. Coupling of Substructures for Dynamic Analysis. AIAA J. 1968, 6, 1313–1319. [Google Scholar] [CrossRef] [Green Version]
Jonkman, B. Turbsim User’s Guide: Version 1.50; Technical Report No. NREL/EL-500-38230; National Renewable Energy Laboratory: Golden, CO, USA, 2009.
Bi, L.; Schelenz, R.; Jacobs, G. Dynamic simulation of full-scale nacelle test rig with focus on drivetrain response under emulated loads. In Proceedings of the Conference for Wind Power Drives, Aachen, Germany, 3–4 March 2015. [Google Scholar]
Matzke, D.; Schelenz, R.; Reisch, S.; Roscher, B.; Jacobs, G.; Theling, J.; Schroers, M.; Löpenhaus, C.; Brecher, C. Validation of the gearbox load calculation of a wind turbine MBS model. J. Phys. Conf. Ser. 2018, 1037, 62025. [Google Scholar] [CrossRef] [Green Version]
Matzke, D.; Jacobs, G.; Schelenz, R. Validation of MBS modeling methods to calculate bearing and tooth loads in the planetary gear stage of a wind turbine. In Proceedings of the Conference for Wind Power Drives (CWD), Aachen, Germany, 12–13 March 2019. [Google Scholar]
Ren, Y.; Zhang, L.; Suganthan, P. Ensemble Classification and Regression-Recent Developments, Applications and Future Directions. IEEE Comput. Intell. Mag. 2016, 11, 41–53. [Google Scholar] [CrossRef]
Kuncheva, L.; Whitaker, C. Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach. Learn. 2003, 51, 181–207. [Google Scholar] [CrossRef]
Polikar, R. Ensemble based systems in decision making. IEEE Circuits Syst. Mag. 2006, 6, 21–45. [Google Scholar] [CrossRef]
Mendes-Moreira, J.; Soares, C.; Jorge, A.; de Sousa, J. Ensemble approaches for regression: A survey. ACM Comput. Surv. 2012, 45, 1–40. [Google Scholar] [CrossRef]
Dietterich, T. Ensemble methods in machine learning. In Proceedings of the Multiple Classifier Systems (MCS), Cagliari, Italy, 21–23 June 2000. [Google Scholar]
Kleinberg, E. Stochastic discrimination. Ann. Math. Artif. Intell. 1990, 1, 207–239. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
Bartlett, P.; Freund, Y.; Lee, W.; Schapire, R. Boosting the margin: A new explanation for the effectiveness of voting methods. Ann. Stat. 1998, 26, 1651–1686. [Google Scholar] [CrossRef]
Pisetta, V. New Insights into Decision Tree Ensembles. Ph.D. Dissertation, Université Lumière Lyon 2, Lyon, France, 2012. [Google Scholar]
Fawagreh, K.; Gaber, M.; Elyan, E. Random Forests: From Early Developments to Recent Advancements. Syst. Sci. Control Eng. 2014, 2, 602–609. [Google Scholar] [CrossRef] [Green Version]
Dietterich, T. An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization. Mach. Learn. 2000, 40, 139–157. [Google Scholar] [CrossRef]
Hawkins, D. The Problem of Overfitting. J. Chem. Inf. Comput. Sci. 2004, 44, 1–12. [Google Scholar] [CrossRef] [PubMed]
Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef] [Green Version]
Ho, T.K. Random decision forests. In Proceedings of the Third International Conference on Document Analysis and Recognition, Montreal, QC, Canada, 14–16 August 1995. [Google Scholar]
Ho, T.K. The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Mach. Intell. 1998, 20, 832–844. [Google Scholar]
Amit, Y.; Geman, D. Shape quantization and recognition with randomized trees. Neural Comput. 1997, 9, 1545–1588. [Google Scholar] [CrossRef] [Green Version]
Breiman, L.; Friedman, J.; Olshen, R.; Stone, C. Classification and Regression Trees; Routledge: New York, NY, USA, 1984. [Google Scholar]
Loh, W.-Y. Fifty years of classification and regression trees. Int. Stat. Rev. 2014, 82, 329–348. [Google Scholar] [CrossRef] [Green Version]
Iannace, G.; Ciaburro, G.; Trematerra, A. Wind Turbine Noise Prediction Using Random Forest Regression. Machines 2019, 7, 69. [Google Scholar] [CrossRef] [Green Version]
Fezai, R.; Dhibi, K.; Mansouri, M.; Trabelsi, M.; Hajji, M.; Bouzrara, K.; Nounou, H.; Nounou, M. Effective Random Forest-Based Fault Detection and Diagnosis for Wind Energy Conversion Systems. IEEE Sens. J. 2021, 21, 6914–6921. [Google Scholar] [CrossRef]
Lang, L.; Tiancai, L.; Shan, A.; Xiangyan, T. An improved random forest algorithm and its application to wind pressure prediction. Int. J. Intell. Syst. 2021, 36, 4016–4032. [Google Scholar] [CrossRef]
Zhang, L.; Liu, K.; Wang, Y.; Omariba, Z. Ice Detection Model of Wind Turbine Blades Based on Random Forest Classifier. Energies 2018, 11, 2548. [Google Scholar] [CrossRef] [Green Version]
Ren, Y.; Suganthan, P.; Srikanth, N. Ensemble methods for wind and solar power forecasting—A state-of-the-art review. Renew. Sustain. Energy Rev. 2015, 50, 82–91. [Google Scholar] [CrossRef]
IEC 61400-1; Wind Turbines—Part 1: Design Requirements. International Electrotechnical Commission: Geneva, Switzerland, 2019.
Kursa, M.B.; Rudnicki, W.R. Feature Selection with the Boruta Package. J. Stat. Softw. 2010, 1, 1–13. [Google Scholar]
Probst, P.; Wright, M.; Boulesteix, A.-L. Hyperparameters and Tuning Strategies for Random Forest. WIREs Data Min. Knowl. Discov. 2019, 9, e1301. [Google Scholar] [CrossRef] [Green Version]
Probst, P.; Boulesteix, A.-L. To tune or not to tune the number of trees in random forest? J. Mach. Learn. Res. 2018, 18, 1–18. [Google Scholar]
Azzam, B.; Harzendorf, F.; Schelenz, R.; Holweger, W.; Jacobs, G. Pattern Discovery in White Etching Crack Experimental Data using Machine Learning Techniques. Appl. Sci. 2019, 9, 5502. [Google Scholar] [CrossRef] [Green Version]
Strobl, C.; Boulesteix, A.L.; Zeileis, A.; Hothorn, T. Bias in random forest variable importance measures: Illustrations, sources and a solution. BMC Bioinform. 2007, 8, 25. [Google Scholar] [CrossRef] [Green Version]
Nagelkerke, N. A note on a general definition of the coefficient of determination. Biometrika 1991, 78, 691–692. [Google Scholar] [CrossRef]
Sammut, C.; Webb, G. Encyclopedia of Machine Learning; Springer: Boston, MA, USA, 2010. [Google Scholar]
Sargent, R. Verification and validation of simulation models. In Proceedings of the 2010 Winter Simulation Conference, Baltimore, MD, USA, 5–8 December 2010. [Google Scholar]
Babuska, I.; Oden, T. Verification and validation in computational engineering and science: Basic concepts. Comput. Methods Appl. Mech. Eng. 2004, 193, 4057–4066. [Google Scholar] [CrossRef]
Law, A. Simulation Modeling and Analysis; McGraw-Hill: New York, NY, USA, 2007. [Google Scholar]
Logan, R.; Nitta, C. Verification & Validation: Process and Levels Leading to Qualitative or Quantitative Validation Statements. SAE Trans. 2004, 113, 804–816. [Google Scholar]
Kutluay, K.; Winner, H. Validation of vehicle dynamics simulation models—A review. Veh. Syst. Dyn. 2014, 52, 186–200. [Google Scholar] [CrossRef]
Misra, P.; Yadav, A. Impact of Preprocessing Methods on Healthcare Predictions. In Proceedings of the 2nd International Conference on Advanced Computing and Software Engineering (ICACSE), Sultanpur, India, 8–9 February 2019. [Google Scholar]
Ahsan, M.M.; Mahmud, M.A.; Saha, P.K.; Gupta, K.D.; Siddique, Z. Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance. Technologies 2021, 9, 52. [Google Scholar] [CrossRef]
Elkan, C. Log-linear models and conditional random fields. Tutor. Notes CIKM 2008, 8, 1–12. [Google Scholar]
Krick, J.E.; Fraine, J.; Ingalls, J.; Deger, S. Random Forests Applied to High-precision Photometry Analysis with Spitzer IRAC. Astron. J. 2020, 160, 99–112. [Google Scholar]
Cao, X.H.; Stojkovic, I.; Obradovic, Z. A robust data scaling algorithm to improve classification accuracies in biomedical data. BMC Bioinform. 2016, 17, 359. [Google Scholar] [CrossRef] [Green Version]
Jain, A.; Nandakumar, K.; Ross, A. Score normalization in multimodal biometric systems. Pattern Recognit. 2005, 38, 2270–2285. [Google Scholar] [CrossRef]
Thudumu, S.; Branch, P.; Jin, J.; Singh, J.J. A comprehensive survey of anomaly detection techniques for high dimensional big data. J. Big Data 2020, 7, 42. [Google Scholar] [CrossRef]

Figure 1. Illustration of the orientation of the x, y, and z axes of measurement with respect to the WT drivetrain.

Figure 2. Overview of methodology.

Figure 3. Example regression decision tree.

Figure 4. Importance of predictor variables using random forests and the Boruta algorithm [51]. The panels show the variable importance results for each of the target variables listed in Table 1 (a–f), covering variables v1–v6, respectively.

Figure 5. Plot of torque, model estimations with all predictors, and with only the highly important subset of predictors.

Figure 6. Plot of force along the y-axis, model estimations with all predictors, and with only the highly important subset of predictors.

Figure 7. Load relative error [9] distributions for estimated transmission input moments with all predictors (black) and with only the highly important subset of predictors (grey). The panels show the LRE for each of the respective target variables listed in Table 1 (a–f), covering variables v1–v6, respectively.

Table 1. Available variables and their respective attributes.

ID	Subject(s)	Measurement	Axis of Measurement	Predictor/Target
v1	GB input load	Moment	x	T
v2			y
v3			z
v4	GB input load	Force	x	T
v5			y
v6			z
v7	GB torque arm (right)	Displacement	x	P
v8			y
v9			z
v10	GB torque arm (right)	Angular misalignment	x	P
v11			y
v12			z
v13	GB torque arm (left)	Displacement	x	P
v14			y
v15			z
v16	GB torque arm (left)	Angular misalignment	x	P
v17			y
v18			z
v19	Generator mount (left)	Displacement	z	P
v20	Generator mount (right)	Displacement	z	P
v21	Wind	Direction	Nacelle axis of rotation	P
v22	Wind	speed	x	P
v23	Generator	Rotational speed	x	P
v24	Blade ¹	Pitch angle	Blade axis of rotation	P

¹ Only one pitch angle is included because the three blade pitch angles are identical in the dataset.

Table 2. Simulated load statistics at rated wind speed.

Simulated Load Statistics at Rated Wind Speed (Units: N, Nm)
Min.	Max.	Avg.
−7898	1050	−3222
−17,972	15,556	−81
−53,850	−26,380	−41,074
−380,056	−41,669	−257,847
22,881	32,429	27,368
4555	45,827	30,124

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Azzam, B.; Schelenz, R.; Jacobs, G. Sensor Screening Methodology for Virtually Sensing Transmission Input Loads of a Wind Turbine Using Machine Learning Techniques and Drivetrain Simulations. Sensors 2022, 22, 3659. https://doi.org/10.3390/s22103659

AMA Style

Azzam B, Schelenz R, Jacobs G. Sensor Screening Methodology for Virtually Sensing Transmission Input Loads of a Wind Turbine Using Machine Learning Techniques and Drivetrain Simulations. Sensors. 2022; 22(10):3659. https://doi.org/10.3390/s22103659

Chicago/Turabian Style

Azzam, Baher, Ralf Schelenz, and Georg Jacobs. 2022. "Sensor Screening Methodology for Virtually Sensing Transmission Input Loads of a Wind Turbine Using Machine Learning Techniques and Drivetrain Simulations" Sensors 22, no. 10: 3659. https://doi.org/10.3390/s22103659

APA Style

Azzam, B., Schelenz, R., & Jacobs, G. (2022). Sensor Screening Methodology for Virtually Sensing Transmission Input Loads of a Wind Turbine Using Machine Learning Techniques and Drivetrain Simulations. Sensors, 22(10), 3659. https://doi.org/10.3390/s22103659

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Sensor Screening Methodology for Virtually Sensing Transmission Input Loads of a Wind Turbine Using Machine Learning Techniques and Drivetrain Simulations

Abstract

1. Introduction

2. Data and Methodology

2.1. Data Description

2.2. Methodology Overview

2.2.1. Random Forests

2.2.2. Identification of Influential Predictor Variables Using the Boruta Algorithm

2.3. Data Preprocessing

2.4. Error Estimation

2.5. Considerations and Challenges in Practical Implementation

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI