Wind Turbine Fault Detection through Principal Component Analysis and Statistical Hypothesis Testing

Pozo, Francesc; Vidal, Yolanda

doi:10.3390/en9010003

Open AccessEditor’s ChoiceArticle

Wind Turbine Fault Detection through Principal Component Analysis and Statistical Hypothesis Testing

by

Francesc Pozo

^*

and

Yolanda Vidal

Control, Dynamics and Applications (CoDAlab), Departament de Matemàtiques, Escola Universitària d’Enginyeria Tècnica Industrial de Barcelona (EUETIB), Universitat Politècnica de Catalunya (UPC), Comte d’Urgell, 187, Barcelona 08036, Spain

^*

Author to whom correspondence should be addressed.

Energies 2016, 9(1), 3; https://doi.org/10.3390/en9010003

Submission received: 17 November 2015 / Revised: 11 December 2015 / Accepted: 14 December 2015 / Published: 23 December 2015

Download

Browse Figures

Versions Notes

Abstract

:

This paper addresses the problem of online fault detection of an advanced wind turbine benchmark under actuators (pitch and torque) and sensors (pitch angle measurement) faults of different type: fixed value, gain factor, offset and changed dynamics. The fault detection scheme starts by computing the baseline principal component analysis (PCA) model from the healthy or undamaged wind turbine. Subsequently, when the structure is inspected or supervised, new measurements are obtained are projected into the baseline PCA model. When both sets of data—the baseline and the data from the current wind turbine—are compared, a statistical hypothesis testing is used to make a decision on whether or not the wind turbine presents some damage, fault or misbehavior. The effectiveness of the proposed fault-detection scheme is illustrated by numerical simulations on a well-known large offshore wind turbine in the presence of wind turbulence and realistic fault scenarios. The obtained results demonstrate that the proposed strategy provides and early fault identification, thereby giving the operators sufficient time to make more informed decisions regarding the maintenance of their machines.

Keywords:

wind turbine; fault detection; principal component analysis; statistical hypothesis testing; FAST (Fatigue, Aerodynamics, Structures and Turbulence)

1. Introduction

Wind energy is currently the fastest growing source of renewable energy in the world. As wind turbines (WT) increase in size, and their operating conditions become more extreme, a number of current and future challenges exist. A major issue with wind turbines, specially those located offshore, is the relatively high cost of maintenance [1]. Since the replacement of main components of a wind turbine is a difficult and costly affair, improved maintenance procedures can lead to essential cost reductions. Autonomous online fault detection algorithms allow early warnings of defects to prevent major component failures. Furthermore, side effects on other components can be reduced significantly. Many faults can be detected while the defective component is still operational. Thus necessary repair actions can be planned in time and need not to be taken immediately and this fact is specially important for off-shore turbines where bad weather conditions can prevent any repair actions. Therefore the implementation of fault detection (FD) systems is crucial.

The past few years have seen a rapid growth in interest in wind turbine fault detection [2] through the use of condition monitoring and structural health monitoring (SHM) [3,4]. The SHM techniques are based on the idea that the change in mechanical properties of the structure will be captured by a change in its dynamic characteristics [5]. Existing techniques for fault detection can be broadly classified into two major categories: model-based methods and signal processing-based methods. For model-based fault detection, the system model could be mathematical—or knowledge-based [6]. Faults are detected based on the residual generated by state variable or model parameter estimation [7,8,9,10,11]. For signal processing-based fault detection, mathematical or statistical operations are performed on the measurements (see, for example, [12,13]), or artificial intelligence techniques are applied to the measurements to extract the information about the faults (see [14,15]).

With respect to signal-processing-based fault detection, principal component analysis (PCA) is used in this framework as a way to condense and extract information from the collected signals. Following this structure, this paper is focused on the development of a wind turbine fault detection strategy that combines a data driven baseline model—reference pattern obtained from the healthy structure—based on PCA and hypothesis testing. A different approach in the frequency domain can be found in [16], where a Karhunen-Loeve basis is used.

Most industrial wind turbines are manufactured with an integrated system that can monitor various turbine parameters. These monitored data are collated and stored via a supervisory control and data acquisition (SCADA) system that archives the information in a convenient manner. These data quickly accumulates to create large and unmanageable volumes that can hinder attempts to deduce the health of a turbine’s components. It would prove beneficial if the data could be analyzed and interpreted automatically (online) to support the operators in planning cost-effective maintenance activities [17,18,19]. This paper describes a technique that can be used to identify incipient faults in the main components of a turbine through the analysis of this SCADA data. The SCADA data sets are already generated by the integrated monitoring system, and therefore, no new installation of specific sensors or diagnostic equipment is required. The strategy developed is based on principal component analysis and statistical hypothesis testing. The final section of the paper shows the performance of the proposed techniques using an enhanced benchmark challenge for wind turbine fault detection, see [2]. This benchmark proposes a set of realistic fault scenarios considered in an aeroelastic computer-aided engineering tool for horizontal axes wind turbines called FAST, see [20].

The structure of the paper is the following. In Section 2 the wind turbine benchmark is recalled as well as the fault scenarios studied in this work. Section 3 presents the design of the proposed fault detection strategy. The simulation results obtained with the proposed approach applied to the wind turbine benchmark are given in Section 4. Concluding remarks are given in Section 5.

2. Wind Turbine Benchmark Model

A complete description of the wind turbine benchmark model, as well as the used baseline torque and pitch controllers, can be found in [2]. In this benchmark challenge, a more sophisticated wind turbine model—a modern 5 MW turbine implemented in the FAST software—and updated fault scenarios are presented. These updates enhance the realism of the challenge and will therefore lead to solutions that are significantly more useful to the wind industry. Hereafter, a brief review of the reference wind turbine is given and the generator-converter actuator model and the pitch actuator model are recalled, as the studied faults affect those subsystems. A complete description of the tested fault scenarios is given.

2.1. Reference Wind Turbine

The numerical simulations use the onshore version of a large wind turbine that is representative of typical utility-scale land- and sea-based multimegawatt turbines described by [21]. This wind turbine is a conventional three-bladed upwind variable-speed variable blade-pitch-to-feather-controlled turbine of 5 MW. The wind turbine characteristics are given in Table 1. In this work we deal with the full load region of operation (also called region 3). That is, the proposed controllers main objective is that the electric power follows the rated power.

Table 1. Gross properties of the wind turbine.

**Table 1.** Gross properties of the wind turbine.
Reference Wind Turbine	Magnitude
Rated power	5 MW
Number of blades	3
Rotor/Hub diameter	126 m, 3 m
Hub Height	90 m
Cut-In, Rated, Cut-Out Wind Speed	3 m/s, $11.4$ m/s, 25 m/s
Rated generator speed ( $ω_{n g}$ )	$1173.7$ rpm
Gearbox ratio	97

In the simulations, new wind data sets with turbulence intensity set to

10 %

are generated with TurbSim [22]. It can be seen from Figure 1 that the wind speed covers the full load region, as its values range from

12.91

m/s up to

22.57

m/s.

Figure 1. Wind speed signal with turbulence intensity set to

10 %

.

Figure 1. Wind speed signal with turbulence intensity set to

10 %

.

2.2. Generator-Converter Model

The generator-converter system can be approximated by a first-order ordinary differential equation, see [2], which is given by:

\dot{τ_{r}} (t) + α_{g c} τ_{r} (t) = α_{g c} τ_{c} (t)

(1)

where

τ_{r}

and

τ_{c}

are the real generator torque and its reference (given by the controller), respectively. In the numerical simulations,

α_{g c} = 50

, see [21]. Moreover, the power produced by the generator,

P_{e} (t)

, is given by (see [2]):

P_{e} (t) = η_{g} ω_{g} (t) τ_{r} (t)

(2)

where

η_{g}

is the efficiency of the generator and

ω_{g}

is the generator speed. In the numerical experiments,

η_{g} = 0.98

is used, see [2].

2.3. Pitch Actuator Model

The hydraulic pitch system consists of three identical pitch actuators, which are modeled as a linear differential equation with time-dependent variables, pitch angle

β (t)

and its reference

β_{r} (t)

. In principle, it is a piston servo-system, which can be expressed as a second-order ordinary differential equation [2]:

\ddot{β} (t) + 2 ξ ω_{n} \dot{β} (t) + ω_{n}^{2} β (t) = ω_{n}^{2} β_{r} (t)

(3)

where

ω_{n}

and ξ are the natural frequency and the damping ratio, respectively. For the fault-free case, the parameters

ξ = 0.6

and

ω_{n} = 11.11

rad/s are used, see [2].

2.4. Fault Scenarios

Both actuator and sensor faults are considered. All the described faults originate from actual faults in wind turbines [2]. Table 2 summarizes all the considered fault scenarios.

Table 2. Fault scenarios.

**Table 2.** Fault scenarios.
Number	Fault	Type
F1	Pitch actuator	Change in dynamics: air content in oil
F2	Pitch actuator	Change in dynamics: pump wear
F3	Pitch actuator	Change in dynamics: hydraulic leakage
F4	Torque actuator	Offset
F5	Generator speed sensor	Scaling
F6	Pitch angle sensor	Stuck
F7	Pitch angle sensor	Scaling

2.4.1. Actuator Faults

Pitch actuator faults are studied as they are the actuators with highest failure rate in wind turbines. A fault may change the dynamics of the pitch system by varying the damping ratio

(ζ)

and natural frequencies

(ω_{n})

from their nominal values to their faulty values. The parameters for the pitch system under different conditions are given in Table 3. The normal air content in the hydraulic oil is 7%, whereas the high air content in oil fault (F1) corresponds to 15%. Pump wear (F2) represents the situation of 75% pressure in the pitch system while the parameters stated for hydraulic leakage (F3) correspond to a pressure of only 50%. The three faults are modeled by changing the parameters

ω_{n}

and ζ in the relevant pitch actuator model.

Table 3. Change in dynamics pitch actuator faults.

**Table 3.** Change in dynamics pitch actuator faults.
Faults	$ω_{n}$ (rad/s)	ζ
Fault Free(FF)	11.11	0.6
High air content in oil (F1)	5.73	0.45
Pump wear (F2)	7.27	0.75
Hydraulic leakage (F3)	3.42	0.9

For the test, the change in dynamics faults given in Table 3 are introduced only in the third pitch actuator (thus

β_{1}

and

β_{2}

are always fault-free).

A torque actuator fault (F4) is also studied. This fault is an offset on the generated torque, which can be due to an error in the initialization of the converter controller. This fault can occur since the converter torque is estimated based on the currents in the converter. If this estimate is initialized incorrectly it will result in an offset on the estimated converter torque, which leads to the offset on the generator torque. The offset value is 2000 Nm.

2.4.2. Sensor Faults

The generator speed measurement uses encoders and its elements are subject to electrical and mechanical failures, which can result in a changed gain factor on the measurement. The simulated fault, F5, is a gain factor on

ω_{g}

equal to 1.2.

Faults 6 and 7 result in blade 3 having a stuck pitch angle sensor, which holds a constant value of

5^{\circ}

(F6) and

10^{\circ}

(F7), respectively.

Finally, the fault of a gain factor on the measurement of the third pitch angle sensor is studied (F7). The measurement is scaled by a factor of 1.2.

Table 4. Assumed available measurements. These sensors are representative of the types of sensors that are available on a MW-scale commercial wind turbine.

**Table 4.** Assumed available measurements. These sensors are representative of the types of sensors that are available on a MW-scale commercial wind turbine.
Number	Sensor Type	Symbol	Units
1	Generated electrical power	$P_{e, m}$	kW
2	Rotor speed	$ω_{r, m}$	rad/s
3	Generator speed	$ω_{g, m}$	rad/s
4	Generator torque	$τ_{c, m}$	Nm
5	first pitch angle	$β_{1, m}$	deg
6	second pitch angle	$β_{2, m}$	deg
7	third pitch angle	$β_{3, m}$	deg
8	fore-aft acceleration at tower bottom	$a_{f a, m}^{b}$	m/s $^{2}$
9	side-to-side acceleration at tower bottom	$a_{s s, m}^{b}$	m/s $^{2}$
10	fore-aft acceleration at mid-tower	$a_{f a, m}^{m}$	m/s $^{2}$
11	side-to-side acceleration at mid-tower	$a_{s s, m}^{m}$	m/s $^{2}$
12	fore-aft acceleration at tower top	$a_{f a, m}^{t}$	m/s $^{2}$
13	side-to-side acceleration at tower top	$a_{s s, m}^{t}$	m/s $^{2}$

3. Fault Detection Strategy

The overall fault detection strategy is based on principal component analysis and statistical hypothesis testing. A baseline pattern or PCA model is created with the healthy state of the wind turbine in the presence of wind turbulence. When the current state of the wind turbine has to be diagnosed, the collected data is projected using the PCA model. The final diagnosis is performed using statistical hypothesis testing.

The main paradigm of vibration based structural health monitoring is based on the basic idea that a change in physical properties due to structural changes or damage will cause detectable changes in dynamical responses. This idea is illustrated in Figure 2, where the healthy structure is excited by a signal to create a pattern. Subsequently, the structure to be diagnosed is excited by the same signal and the dynamic response is compared with the pattern. The scheme in Figure 2 is also know as guided waves in structures for structural health monitoring [23].

However, in our application, the only available excitation of the wind turbines is the wind turbulence. Therefore, guided waves in wind turbines for SHM as in Figure 2 cannot be considered as a realistic scenario. In spite of that, the new paradigm described in Figure 3 is based on the fact that, even with a different wind turbulence, the fault detection strategy based on PCA and statistical hypothesis testing will be able to detect some damage, fault or misbehavior. More precisely, the key idea behind the detection strategy is the assumption that a change in the behavior of the overall system, even with a different excitation, has to be detected. The results presented in Section 4 confirm this hypothesis.

Figure 2. Guided waves in structures for structural health monitoring. The healthy structure is excited by a signal and the dynamic response is measured to create a baseline pattern. Then, the structure to diagnose is excited by the same signal and the dynamic response is also measured and compared with the baseline pattern. A significant difference in the pattern would imply the existence of a fault.

Figure 3. Even with a different wind turbulence, the fault detection strategy is able to detect some damage, fault or misbehavior.

3.1. Data Driven Baseline Modeling Based on PCA

Let us start the PCA modeling by measuring, from a healthy wind turbine, a sensor during

(n L - 1) Δ

seconds, where Δ is the sampling time and

n, L \in N

. The discretized measures of the sensor are a real vector

\begin{matrix} (\begin{matrix} x_{11} & x_{12} & \dots & x_{1 L} & x_{21} & x_{22} & \dots & x_{2 L} & \dots & x_{n 1} & x_{n 2} & \dots & x_{n L} \end{matrix}) \in R^{n L} \end{matrix}

(4)

where the real number

x_{i j}, i = 1, \dots, n, j = 1, \dots, L

corresponds to the measure of the sensor at time

((i - 1) L + (j - 1)) Δ

seconds. This collected data can be arranged in matrix form as follows:

\begin{matrix} (\begin{matrix} x_{11} & x_{12} & \dots & x_{1 L} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ x_{i 1} & x_{i 2} & \dots & x_{i L} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ x_{n 1} & x_{n 2} & \dots & x_{n L} \end{matrix}) \in M_{n \times L} (R) \end{matrix}

(5)

where

M_{n \times L} (R)

is the vector space of

n \times L

matrices over

R

. When the measures are obtained from

N \in N

sensors also during

(n L - 1) Δ

seconds, the collected data, for each sensor, can be arranged in a matrix as in Equation (5). Finally, all the collected data coming from the N sensors is disposed in a matrix

X \in M_{n \times (N \cdot L)}

as follows:

\begin{matrix} X & = (\begin{matrix} \begin{matrix} x_{11}^{1} & x_{12}^{1} & \dots & x_{1 L}^{1} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ x_{i 1}^{1} & x_{i 2}^{1} & \dots & x_{i L}^{1} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ x_{n 1}^{1} & x_{n 2}^{1} & \dots & x_{n L}^{1} \end{matrix} & \begin{matrix} x_{11}^{2} & \dots & x_{1 L}^{2} \\ ⋮ & ⋱ & ⋮ \\ x_{i 1}^{2} & \dots & x_{i L}^{2} \\ ⋮ & ⋱ & ⋮ \\ x_{n 1}^{2} & \dots & x_{n L}^{2} \end{matrix} & \begin{matrix} \dots \\ ⋱ \\ \dots \\ ⋱ \\ \dots \end{matrix} & \begin{matrix} x_{11}^{N} & \dots & x_{1 L}^{N} \\ ⋮ & ⋱ & ⋮ \\ x_{i 1}^{N} & \dots & x_{i L}^{N} \\ ⋮ & ⋱ & ⋮ \\ x_{n 1}^{N} & \dots & x_{n L}^{N} \end{matrix} \end{matrix}) \\ = (\begin{matrix} X^{1} & X^{2} & \dots & X^{N} \end{matrix}) \end{matrix}

(6)

where the superindex

k = 1, \dots, N

of each element

x_{i j}^{k}

in the matrix represents the number of sensor.

The objective of the principal component analysis is to find a linear transformation orthogonal matrix

P \in M_{(N \cdot L) \times (N \cdot L)} (R)

that will be used to transform or project the original data matrix

X

according to the subsequent matrix product:

\begin{matrix} T = XP \in M_{n \times (N \cdot L)} (R) \end{matrix}

(7)

where

T

is a matrix having a diagonal covariance matrix.

Group Scaling

Since the data in matrix

X

is affected by diverse wind turbulence, come from several sensors and could have different scales and magnitudes, it is required to apply a preprocessing step to rescale the data using the mean of all measurements of the sensor at the same column and the standard deviation of all measurements of the sensor [24].

More precisely, for

k = 1, 2, \dots, N

we define

\begin{array}{l} (8) & \begin{matrix} μ_{j}^{k} & = \frac{1}{n} \sum_{i = 1}^{n} x_{i j}^{k}, j = 1, \dots, L, \end{matrix} \\ (9) & \begin{matrix} μ^{k} & = \frac{1}{n L} \sum_{i = 1}^{n} \sum_{j = 1}^{L} x_{i j}^{k}, \end{matrix} \\ (10) & \begin{matrix} σ^{k} & = \sqrt{\frac{1}{n L} \sum_{i = 1}^{n} \sum_{j = 1}^{L} {(x_{i j}^{k} - μ^{k})}^{2}} \end{matrix} \end{array}

where

μ_{j}^{k}

is the mean of the measures placed at the same column, that is, the mean of the n measures of sensor k in matrix

X^{k}

at time instants

((i - 1) L + (j - 1)) Δ

seconds,

i = 1, \dots, n

;

μ^{k}

is the mean of all the elements in matrix

X^{k}

, that is, the mean of all the measures of sensor k; and

σ^{k}

is the standard deviation of all the measures of sensor k. Therefore, the elements

x_{i j}^{k}

of matrix

X

are scaled to define a new matrix

\overset{ˇ}{X}

as

\begin{matrix} {\overset{ˇ}{x}}_{i j}^{k} & : = \frac{x_{i j}^{k} - μ_{j}^{k}}{σ^{k}}, i = 1, \dots, n, j = 1, \dots, L, k = 1, \dots, N . \end{matrix}

(11)

When the data are normalized using Equation (11), the scaling procedure is called variable scaling or group scaling [25].

For the sake of clarity, and throughout the rest of the paper, the scaled matrix

\overset{ˇ}{X}

is renamed as simply

X

. The mean of each column vector in the scaled matrix

X

can be computed as

\begin{matrix} (12) & \begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} {\overset{ˇ}{x}}_{i j}^{k} & = \frac{1}{n} \sum_{i = 1}^{n} \frac{x_{i j}^{k} - μ_{j}^{k}}{σ^{k}} = \frac{1}{n σ^{k}} \sum_{i = 1}^{n} (x_{i j}^{k} - μ_{j}^{k}) \end{matrix} \\ (13) & \begin{matrix} = \frac{1}{n σ^{k}} [(\sum_{i = 1}^{n} x_{i j}^{k}) - n μ_{j}^{k}] \end{matrix} \\ (14) & \begin{matrix} = \frac{1}{n σ^{k}} (n μ_{j}^{k} - n μ_{j}^{k}) = 0 \end{matrix} \end{matrix}

Since the scaled matrix

X

is a mean-centered matrix, it is possible to calculate its covariance matrix as follows:

C_{X} = \frac{1}{n - 1} X^{T} X \in M_{(N \cdot L) \times (N \cdot L)} (R)

(15)

The covariance matrix

C_{X}

is a

(N \cdot L) \times (N \cdot L)

symmetric matrix that measures the degree of linear relationship within the data set between all possible pairs of columns. At this point it is worth noting that each column can be viewed as a virtual sensor and, therefore, each column vector

X (:, j) \in R^{n}, j = 1, \dots, N \cdot L,

represents a set of measurements from one virtual sensor.

The subspaces in PCA are defined by the eigenvectors and eigenvalues of the covariance matrix as follows:

C_{X} P = P Λ

(16)

where the columns of

P \in M_{(N \cdot L) \times (N \cdot L)} (R)

are the eigenvectors of

C_{X}

. The diagonal terms of matrix

Λ \in M_{(N \cdot L) \times (N \cdot L)} (R)

are the eigenvalues

λ_{i}, i = 1, \dots, N \cdot L,

of

C_{X}

whereas the off-diagonal terms are zero, that is,

\begin{array}{l} (17) & \begin{matrix} Λ_{i i} & = λ_{i}, i = 1, \dots, N \cdot L \end{matrix} \\ (18) & \begin{matrix} Λ_{i j} & = 0, i, j = 1, \dots, N \cdot L, i \neq j \end{matrix} \end{array}

The eigenvectors

p_{j}, j = 1, \dots, N \cdot L

, representing the columns of the transformation matrix

P

are classified according to the eigenvalues in descending order and they are called the principal components or the loading vectors of the data set. The eigenvector with the highest eigenvalue, called the first principal component, represents the most important pattern in the data with the largest quantity of information.

Matrix

P

is usually called the principal components of the data set or loading matrix and matrix

T

is the transformed or projected matrix to the principal component space, also called score matrix. Using all the

N \cdot L

principal components, that is, in the full dimensional case, the orthogonality of

P

implies

{PP}^{T} = I

, where

I

is the

(N \cdot L) \times (N \cdot L)

identity matrix. Therefore, the projection can be inverted to recover the original data as

\begin{matrix} X = {TP}^{T} \end{matrix}

(19)

However, the objective of PCA is, as said before, to reduce the dimensionality of the data set

X

by selecting only a limited number

ℓ < N \cdot L

of principal components, that is, only the eigenvectors related to the ℓ highest eigenvalues. Thus, given the reduced matrix

\begin{matrix} \hat{P} = (p_{1} | p_{2} | \dots | p_{ℓ}) \in M_{N \cdot L \times ℓ} (R) \end{matrix}

(20)

matrix

\hat{T}

is defined as

\begin{matrix} \hat{T} = X \hat{P} \in M_{n \times ℓ} (R) \end{matrix}

(21)

Note that opposite to

T

,

\hat{T}

is no longer invertible. Consequently, it is not possible to fully recover

X

although

\hat{T}

can be projected back onto the original

N \cdot L -

dimensional space to get a data matrix

\hat{X}

as follows:

\begin{matrix} \hat{X} = \hat{T} {\hat{P}}^{T} \in M_{n \times (N \cdot L)} (R) \end{matrix}

(22)

The difference between the original data matrix

X

and

\hat{X}

is defined as the residual error matrix

E

or

\tilde{X}

as follows:

\begin{matrix} E = X - \hat{X} \end{matrix}

(23)

or, equivalenty,

\begin{matrix} X & = \hat{X} + E = \hat{T} {\hat{P}}^{T} + E \end{matrix}

(24)

The residual error matrix

E

describes the variability not represented by the data matrix

\hat{X}

, and can also be expressed as

\begin{matrix} E = X (I - \hat{P} {\hat{P}}^{T}) \end{matrix}

(25)

Even though the real measures obtained from the sensors as a function of time represent physical magnitudes, when these measures are projected and the scores are obtained, these scores no longer represent any physical magnitude [26]. The key aspect in this approach is that the scores from different experiments can be compared with the reference pattern to try to detect a different behavior.

3.2. Fault Detection Based on Hypothesis Testing

The current wind turbine to diagnose is subjected to a wind turbulence as described in Section 2 and Section 3.1. When the measures are obtained from

N \in N

sensors during

(ν L - 1) Δ

seconds, a new data matrix

Y

is constructed as in Equation (6):

\begin{matrix} Y & = (\begin{matrix} \begin{matrix} y_{11}^{1} & y_{12}^{1} & \dots & y_{1 L}^{1} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ y_{i 1}^{1} & y_{i 2}^{1} & \dots & y_{i L}^{1} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ y_{ν 1}^{1} & y_{ν 2}^{1} & \dots & y_{ν L}^{1} \end{matrix} & \begin{matrix} y_{11}^{2} & \dots & y_{1 L}^{2} \\ ⋮ & ⋱ & ⋮ \\ y_{i 1}^{2} & \dots & y_{i L}^{2} \\ ⋮ & ⋱ & ⋮ \\ y_{ν 1}^{2} & \dots & y_{ν L}^{2} \end{matrix} & \begin{matrix} \dots \\ ⋱ \\ \dots \\ ⋱ \\ \dots \end{matrix} & \begin{matrix} y_{11}^{N} & \dots & y_{1 L}^{N} \\ ⋮ & ⋱ & ⋮ \\ y_{i 1}^{N} & \dots & y_{i L}^{N} \\ ⋮ & ⋱ & ⋮ \\ y_{ν 1}^{N} & \dots & y_{ν L}^{N} \end{matrix} \end{matrix}) \in M_{ν \times (N \cdot L)} (R) \end{matrix}

(26)

It is worth remarking that the natural number ν (the number of rows of matrix

Y

) is not necessarily equal to n (the number of rows of

X

), but the number of columns of

Y

must agree with that of

X

; that is, in both cases the number N of sensors and the number of samples per row must be equal.

Before the collected data arranged in matrix

Y

is projected into the new space spanned by the eigenvectors in matrix

P

in Equation (16), the matrix has to be scaled to define a new matrix

\overset{ˇ}{Y}

as in Equation (11):

\begin{matrix} {\overset{ˇ}{y}}_{i j}^{k} & : = \frac{y_{i j}^{k} - μ_{j}^{k}}{σ^{k}}, i = 1, \dots, ν, j = 1, \dots, L, k = 1, \dots, N, \end{matrix}

(27)

where

μ_{j}^{k}

and

σ^{k}

are defined in Equations (8) and (10), respectively.

The projection of each row vector

r^{i} = \overset{ˇ}{Y} (i, :) \in R^{N \cdot L}, i = 1, \dots, ν

of matrix

\overset{ˇ}{Y}

into the space spanned by the eigenvectors in

\hat{P}

is performed through the following vector to matrix multiplication:

\begin{matrix} t^{i} = r^{i} \cdot \hat{P} \in R^{ℓ} \end{matrix}

(28)

For each row vector

r^{i}, i = 1, \dots, ν

, the first component of vector

t^{i}

is called the first score or score 1; similarly, the second component of vector

t^{i}

is called the second score or score 2, and so on.

In a standard application of the principal component analysis strategy in the field of structural health monitoring, the scores allow a visual grouping or separation [27]. In some other cases, as in [28], two classical indices can be used for damage detection, such as the Q index (also known as SPE, square prediction error) and the Hotelling’s

T^{2}

index. The Q index of the ith row

y_{i}^{T}

of matrix

\overset{ˇ}{Y}

is defined as follows:

\begin{matrix} Q_{i} = y_{i}^{T} (I - \hat{P} {\hat{P}}^{T}) y_{i} \end{matrix}

(29)

The

T^{2}

index of the ith row

y_{i}^{T}

of matrix

\overset{ˇ}{Y}

is defined as follows:

\begin{matrix} T_{i}^{2} = y_{i}^{T} (\hat{P} Λ^{- 1} {\hat{P}}^{T}) y_{i} \end{matrix}

(30)

In this case, however, it can be observed in Figure 4—where the projection onto the two first principal components of samples coming from the healthy and faulty wind turbines are plotted—that a visual grouping, clustering or separation cannot be performed. A similar conclusion is deducted from Figure 5. In this case, the plot of the natural logarithm of indices Q and

T^{2}

—defined in Equations (29) and (30)—of samples coming from the healthy and faulty wind turbines does not allow any visual grouping. Therefore, a more powerful and reliable tool is needed to be able to detect a fault in the wind turbine.

Figure 4. Projection onto the two first principal components of samples coming from the healthy wind turbine (red, circle) and from the faulty wind turbine (blue, diamond).

Figure 5. Natural logarithm of indices Q and

T^{2}

of samples coming from the healthy wind turbine (red, circle) and from the faulty wind turbine (blue, diamond).

Figure 5. Natural logarithm of indices Q and

T^{2}

of samples coming from the healthy wind turbine (red, circle) and from the faulty wind turbine (blue, diamond).

3.2.1. The Random Nature of the Scores

Since the turbulent wind can be considered as a random process, the dynamic response of the wind turbine can be considered as a stochastic process and the measurements in

r^{i}

are also stochastic. Therefore, each component of

t^{i}

acquires this stochastic nature and it will be regarded as a random variable to construct the stochastic approach in this paper.

3.2.2. Test for the Equality of Means

The objective of the present work is to examine whether the current wind turbine is healthy or subjected to a fault as those described in Table 2. To achieve this end, we have a PCA model (matrix

\hat{P}

in Equation (20)) built as in Section 3.1.1 with data coming from a wind turbine in a full healthy state. For each principal component

j = 1, \dots, ℓ

, the baseline sample is defined as the set of n real numbers computed as the

j -

th component of the vector to matrix multiplication

X (i, :) \cdot \hat{P}

. Note that n is the number of rows of matrix

X

in Equation (6). That is, we define the baseline sample as the set of numbers

{τ_{j}^{i}}_{i = 1, \dots, n}

given by

\begin{matrix} τ_{j}^{i} : = (X (i, :) \cdot \hat{P}) (j) = X (i, :) \cdot \hat{P} \cdot e_{j}, i = 1, \dots, n, \end{matrix}

(31)

where

e_{j}

is the

j -

th vector of the canonical basis.

Similarly, and for each principal component

j = 1, \dots, ℓ

, the sample of the current wind turbine to diagnose is defined as the set of ν real numbers computed as the

j -

th component of the vector

t^{i}

in Equation (28). Note that ν is the number of rows of matrix

Y

in Equation (26). That is, we define the sample to diagnose as the set of numbers

{t_{j}^{i}}_{i = 1, \dots, ν}

given by

\begin{matrix} t_{j}^{i} : = t^{i} \cdot e_{j}, i = 1, \dots, ν . \end{matrix}

(32)

As said before, the goal of this paper is to obtain a fault detection method such that when the distribution of the current sample is related to the distribution of the baseline sample a healthy state is predicted and otherwise a fault is detected. To that end, a test for the equality of means will be performed. Let us consider that, for a given principal component, (a) the baseline sample is a random sample of a random variable having a normal distribution with unknown mean

μ_{X}

and unknown standard deviation

σ_{X}

; and (b) the random sample of the current wind turbine is also normally distributed with unknown mean

μ_{Y}

and unknown standard deviation

σ_{Y}

. Let us finally consider that the variances of these two samples are not necessarily equal. As said previously, the problem that we will consider is to determine whether these means are equal, that is,

μ_{X} = μ_{Y}

, or equivalently,

μ_{X} - μ_{Y} = 0

. This statement leads immediately to a test of the hypotheses

\begin{array}{l} (33) & \begin{matrix} H_{0} : & μ_{X} - μ_{Y} = 0 versus \end{matrix} \\ (34) & \begin{matrix} H_{1} : & μ_{X} - μ_{Y} \neq 0 \end{matrix} \end{array}

that is, the null hypothesis is “the sample of the wind turbine to be diagnosed is distributed as the baseline sample” and the alternative hypothesis is “the sample of the wind turbine to be diagnosed is not distributed as the baseline sample”. In other words, if the result of the test is that the null hypothesis is not rejected, the current wind turbine is categorized as healthy. Otherwise, if the null hypothesis is rejected in favor of the alternative, this would indicate the presence of some faults in the wind turbine.

The test is based on the Welch-Satterthwaite method [29], which is outlined below. When random samples of size n and ν, respectively, are taken from two normal distributions

N (μ_{X}, σ_{X})

and

N (μ_{Y}, σ_{Y})

and the population variances are unknown, the random variable

\begin{matrix} W = \frac{(\bar{X} - \bar{Y}) + (μ_{X} - μ_{Y})}{\sqrt{(\frac{S_{X}^{2}}{n} + \frac{S_{Y}^{2}}{ν})}} \end{matrix}

(35)

can be approximated with a t-distribution with ρ degrees of freedom, that is

\begin{matrix} W ↪ t_{ρ} \end{matrix}

(36)

where

\begin{matrix} ρ = ⌊\frac{{(\frac{s_{X}^{2}}{n} + \frac{s_{Y}^{2}}{ν})}^{2}}{\frac{{(s_{X}^{2} / n)}^{2}}{n - 1} + \frac{{(s_{Y}^{2} / ν)}^{2}}{ν - 1}}⌋ \end{matrix}

(37)

and where

\bar{X}, \bar{Y}

is the sample mean as a random variable;

S^{2}

is the sample variance as a random variable;

s^{2}

is the variance of a sample; and

⌊ \cdot ⌋

is the floor function.

The value of the standardized test statistic using this method is defined as

\begin{matrix} t_{obs} = \frac{\bar{x} - \bar{y}}{\sqrt{(\frac{s_{X}^{2}}{n} + \frac{s_{Y}^{2}}{ν})}} \end{matrix}

(38)

where

\bar{x}, \bar{y}

is the mean of a particular sample. The quantity

t_{obs}

is the fault indicator. We can then construct the following test:

\begin{array}{l} (39) & \begin{matrix} |t_{obs}| \leq t^{⋆} & ⟹ Fail to reject H_{0} \end{matrix} \\ (40) & \begin{matrix} |t_{obs}| > t^{⋆} & ⟹ Reject H_{0} \end{matrix} \end{array}

where

t^{⋆}

is such that

\begin{matrix} P (t_{ρ} < t^{⋆}) = 1 - \frac{α}{2} \end{matrix}

(41)

and α is the chosen risk (significance) level for the test. More precisely, the null hypothesis is rejected if

|t_{obs}| > t^{⋆}

(this would indicate the existence of a fault in the wind turbine). Otherwise, if

|t_{obs}| \leq t^{⋆}

there is no statistical evidence to suggest that both samples are normally distributed but with different means, thus indicating that no fault in the wind turbine has been found. This idea is represented in Figure 6.

Figure 6. Fault detection will be based on testing for significant changes in the distributions of the baseline sample and the sample coming from the wind turbine to diagnose.

4. Simulation Results

4.1. Type I and Type II errors

To validate the fault detection strategy presented in Section 3, we first consider a total of 24 samples of

ν = 50

elements each, according to the following distribution:

16 samples of a healthy wind turbine; and
8 samples of a faulty wind turbine with respect to each of the eight different fault scenarios described in Table 2.

In the numerical simulations in this Section, each sample of

ν = 50

elements is composed by the measures obtained from the

N = 13

sensors detailed in Table 4 during

(ν \cdot L - 1) Δ = 312.4875

seconds, where

L = 500

and the sampling time

Δ = 0.0125

seconds. The measures of each sample are then arranged in a

ν \times (N \cdot L)

matrix as in Equation (26).

For the first four principal components (score 1 to score 4), these 24 samples plus the baseline sample of

n = 50

elements are used to test for the equality of means, with a level of significance

α = 0.36

(the choice of this level of significance will be justified in Section 4.2). Each sample of

ν = 50

elements is categorized as follows: (i) number of samples from the healthy wind turbine (healthy sample) which were classified by the hypothesis test as “healthy” (fail to reject

H_{0}

); (ii) faulty sample classified by the test as “faulty” (reject

H_{0}

); (iii) samples from the faulty structure (faulty sample) classified as “healthy”; and (iv) faulty sample classified as “faulty”. The results for the first four principal components presented in Table 6 are organized according to the scheme in Table 5. It can be stressed from each principal component in Table 6 that the sum of the columns is constant: 16 samples in the first column (healthy wind turbine) and 8 more samples in the second column (faulty wind turbine).

Table 5. Scheme for the presentation of the results in Table 6.

**Table 5.** Scheme for the presentation of the results in Table 6.
	Undamaged Sample ( $H_{0}$ )	Damaged Sample ( $H_{1}$ )
Fail to reject $H_{0}$	Correct decision	Type II error (missing fault)
Reject $H_{0}$	Type I error (false alarm)	Correct decision

Table 6. Categorization of the samples with respect to the presence or absence of damage and the result of the test for each of the four scores when the size of the samples to diagnose is

ν = 50

.

**Table 6.** Categorization of the samples with respect to the presence or absence of damage and the result of the test for each of the four scores when the size of the samples to diagnose is $ν = 50$ .
	score 1		score 2		score 3		score 4
	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$
Fail to reject $H_{0}$	16	0	12	1	11	5	9	1
Reject $H_{0}$	0	8	4	7	5	3	7	7

In Table 6, it is worth noting that two kinds of misclassification are presented which are denoted as follows:

Type I error (false positive or false alarm), when the wind turbine is healthy but the null hypothesis is rejected and therefore classified as faulty. The probability of committing a type I error is α, the level of significance.
Type II error (false negative or missing fault), when the structure is faulty but the null hypothesis is not rejected and therefore classified as healthy. The probability of committing a type II error is called γ.

It can be observed from Table 6 that, in the numerical simulations, Type I errors (false alarms) and Type II errors (missing faults) appear only when scores 2, 3 or 4 are considered, while when the first score is used all the decisions are correct. The better performance of the first score is an expected result in the sense that the first principal component is the component that accounts for the largest possible variance.

4.2. Sensitivity and Specificity

Two more statistical measures can be selected here to study the performance of the test: the sensitivity and the specificity. The sensitivity, also called as the power of the test, is defined, in the context of this work, as the proportion of samples from the faulty wind turbine which are correctly identified as such. Thus, the sensitivity can be computed as

1 - γ

. The specificity of the test is defined, also in this context, as the proportion of samples from the healthy structure that are correctly identified and can be expressed as

1 - α

.

The sensitivity and the specificity of the test with respect to the 24 samples and for each of the first four principal components –organized as shown in Table 7– have been included in Table 8.

Table 7. Relationship between type I and type II errors.

**Table 7.** Relationship between type I and type II errors.
	Undamaged Sample ( $H_{0}$ )	Damaged Sample ( $H_{1}$ )
Fail to reject $H_{0}$	Specificity ( $1 - α$ )	False negative rate (γ)
Reject $H_{0}$	False positive rate (α)	Sensitivity ( $1 - γ$ )

Table 8. Sensitivity and specificity of the test for each of the four scores when the size of the samples to diagnose is

ν = 50

.

**Table 8.** Sensitivity and specificity of the test for each of the four scores when the size of the samples to diagnose is $ν = 50$ .
	score 1		score 2		score 3		score 4
	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$
Fail to reject $H_{0}$	1.00	0.00	0.75	0.13	0.69	0.62	0.56	0.13
Reject $H_{0}$	0.00	1.00	0.25	0.87	0.31	0.38	0.44	0.87

It is worth mentioning that type I errors are frequently considered to be more serious than type II errors. However, in this application, a type II error is related to a missing fault whereas a type I error is related to a false alarm. In consequence, type II errors should be reduced. Therefore a small level of significance of

1 %

,

5 %

or even

10 %

would lead to a reduced number of false alarms but to a higher rate of missing faults. That is the reason of the choice of a level of significance of

36 %

in the hypothesis test.

The results in Table 8 show that the sensitivity of the test

1 - γ

is close to

100 %

, as desired, with an average value of

78.00 %

. The sensitivity with respect to the first, second and fourth principal component is increased, in mean, to a

91.33 %

. The average value of the specificity is

75.00 %

, which is very close to the expected value of

1 - α = 64 %

.

4.3. Reliability of the Results

Using the scheme in Table 9, the results are computed and given in Table 10. This table is based on the Bayes’ theorem [30], where P(

H_{1} | accept H_{0}

) is the proportion of samples from the faulty wind turbine that have been incorrectly classified as healthy (true rate of false negatives) and P(

H_{0} | accept H_{1}

) is the proportion of samples from the healthy wind turbine that have been incorrectly classified as faulty (true rate of false positives).

Table 9. Relationship between the proportion of false negatives and false positives.

**Table 9.** Relationship between the proportion of false negatives and false positives.
	Undamaged Sample ( $H_{0}$ )	Damaged Sample ( $H_{1}$ )
Fail to reject $H_{0}$	P( $H_{0} \| accept H_{0}$ )	$\begin{matrix} true rate of false negatives \\ P (H_{1} \| accept H_{0}) \end{matrix}$
Reject $H_{0}$	$\begin{matrix} true rate of false positives \\ P (H_{0} \| accept H_{1}) \end{matrix}$	P( $H_{1} \| accept H_{1}$ )

Table 10. True rate of false positives and false negatives for each of the four scores when the size of the samples to diagnose is

ν = 50

.

**Table 10.** True rate of false positives and false negatives for each of the four scores when the size of the samples to diagnose is $ν = 50$ .
	score 1		score 2		score 3		score 4
	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$
Fail to reject $H_{0}$	1.00	0.00	0.92	0.08	0.69	0.31	0.90	0.10
Reject $H_{0}$	0.00	1.00	0.36	0.64	0.62	0.38	0.50	0.50

4.4. The Receiver Operating Curves (ROC)

An additional study has been developed based on the ROC curves to determine the overall accuracy of the proposed method. These curves represent the trade-off between the false positive rate and the sensitivity in Table 10 for different values of the level of significance that is used in the statistical hypothesis testing. Note that the false positive rate is defined as the complementary of the specificity, and therefore these curves can also be used to visualize the close relationship between specificity and sensitivity. It can also be remarked that the sensitivity is also called true positive rate or probability of detection [31]. More precisely, for each principal component and for a given level of significance the pair of numbers

\begin{matrix} (false positive rate, sensitivity) \in [0, 1] \times [0, 1] \subset R^{2} \end{matrix}

(42)

is plotted. We have considered 49 levels of significance within the range

[0.02, 0.98]

and with a difference of

0.02

. Therefore, for each of the first four principal components, 49 connected points are depicted, as can be seen in Figure 7.

Figure 7. The Receiver Operating Curves (ROCs) for the four scores when the size of the samples to diagnose is

ν = 50

.

Figure 7. The Receiver Operating Curves (ROCs) for the four scores when the size of the samples to diagnose is

ν = 50

.

The placement of these points can be interpreted as follows. Since we are interested in reducing the number of false positives while we increase the number of true positives, these points must be placed in the upper-left corner as much as possible. However, this is not always possible because there is also a relationship between the level of significance and the false positive rate. Therefore, a method can be considered acceptable if those points lie within the upper-left half-plane. In this sense, the results presented in Figure 7, particularly with respect to score 1, are quite remarkable. The overall behavior of scores 2 and 4 are also acceptable, while the results of score 3 cannot be considered, in this case, as satisfactory.

In Figure 8 and Figure 9 a further study is performed. While in Figure 7 we present the ROCs when the size of the samples to diagnose is

ν = 50

, in Figure 8 the reliability of the method is analyzed in terms of 48 samples of

ν = 25

elements each and in Figure 9 the reliability of the method is analyzed in terms of 120 samples of

ν = 10

elements each. The effect of reducing the number of elements in each sample is the reduction in the total time needed for a diagnostic. More precisely, if we keep

L = 500

, when the size of the samples is reduced from

ν = 50

to

ν = 25

and

ν = 10

, the total time needed for a diagnostic is reduced from about 312 s to 156 and 62 s, respectively. Another effect of the reduction in the number of elements in each sample is a slight deterioration of the overall accuracy of the detection method. However, the results of scores 1 and 2 in Figure 8 and Figure 9 are perfectly acceptable.

A very interesting alternative to keep a very good performance of the method without almost no degradation in its accuracy is by reducing L –the number of time instants per row per sensor— instead of reducing the number of elements per sample ν. This way, if we keep

ν = 50

, when L is reduced from 500 to 50, the total time needed for a diagnostic is reduced from about 312 s to 31 s.

We can finally say that the ROC curves provide a statistical assessment of the efficacy of a method and can be used to visualize and compare the performance of multiple scenarios.

Figure 8. The ROCs for the four scores when the size of the samples to diagnose is

ν = 25

.

Figure 8. The ROCs for the four scores when the size of the samples to diagnose is

ν = 25

.

Figure 9. The ROCs for the four scores when the size of the samples to diagnose is

ν = 10

.

Figure 9. The ROCs for the four scores when the size of the samples to diagnose is

ν = 10

.

5. Conclusions

The silver bullet for offshore operators is to eliminate unscheduled maintenance. Therefore, the implementation of fault detection systems is crucial. The main challenges of the wind turbine fault detection lie in its nonlinearity, unknown disturbances as well as significant measurement noise. In this work, numerical simulations (with a well-known benchmark wind turbine) show that the proposed PCA plus statistical hypothesis testing is a valuable tool in fault detection for wind turbines. It is noteworthy that, in the simulations, when the first score is used all the decisions are correct (there are no false alarms and no missing faults).

We believe that PCA plus statistical hypothesis testing has tremendous potential in decreasing maintenance costs. Therefore, we view the work described in this paper as only the beginning of a large project. For future work, we plan to develop a complete fault detection, isolation, and reconfiguration method (FDIR). That is, a reconfigurable control strategy in response to faults. In the near future, the next step is to focus our research into efficient fault feature extraction.

Acknowledgments

This work has been partially funded by the Spanish Ministry of Economy and Competitiveness through the research projects DPI2011-28033-C03-01, DPI2014-58427-C2-1-R, and by the Generalitat de Catalunya through the research project 2014 SGR 859.

Author Contributions

All authors contributed equally.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

β	Pitch angle
$β_{r}$	Pitch angle reference
$η_{g}$	Generator efficiency
$ω_{g}$	Generator speed
$P_{e}$	Electrical Power
$τ_{c}$	Reference generator torque
$τ_{r}$	Real generator torque
α	Significance level for the test (probability of committing a type I error)
γ	Probability of committing a type II error
L	Number of time instants per row per sensor
N	Number of sensors
ν	Size of the samples to diagnose
$P$	Principal components of the data set (loading matrix)
$T$	Transformed (or projected) matrix to the principal component space (score matrix)
$E$	Residual error matrix
$X$	Data matrix (original)
$Y$	Data matrix to diagnose
${τ_{j}^{i}}_{i = 1, \dots, n}$	Baseline sample
${t_{j}^{i}}_{i = 1, \dots, ν}$	Sample to diagnose

References

Sinha, Y.; Steel, J. A progressive study into offshore wind farm maintenance optimisation using risk based failure analysis. Renew. Sustain. Energy Rev. 2015, 42, 735–742. [Google Scholar] [CrossRef]
Odgaard, P.; Johnson, K. Wind turbine fault diagnosis and fault tolerant control—An enhanced benchmark challenge. In Proceedings of the 2013 American Control Conference (ACC), Washington, DC, USA, 17–19 June 2013; pp. 1–6.
Soman, R.N.; Malinowski, P.H.; Ostachowicz, W.M. Bi-axial neutral axis tracking for damage detection in wind-turbine towers. Wind Energy 2015. [Google Scholar] [CrossRef]
Adams, D.; White, J.; Rumsey, M.; Farrar, C. Structural health monitoring of wind turbines: Method and application to a HAWT. Wind Energy 2011, 14, 603–623. [Google Scholar] [CrossRef]
Griffith, D.T.; Yoder, N.C.; Resor, B.; White, J.; Paquette, J. Structural health and prognostics management for the enhancement of offshore wind turbine operations and maintenance strategies. Wind Energy 2014, 17, 1737–1751. [Google Scholar] [CrossRef]
Ding, S.X. Model-Based Fault Diagnosis Techniques: Design Schemes, Algorithms, and Tools; Springer Science & Business Media: London, UK, 2008. [Google Scholar]
Odgaard, P.F.; Stoustrup, J.; Nielsen, R.; Damgaard, C. Observer based detection of sensor faults in wind turbines. In Proceedings of the European Wind Energy Conference, Marseille, France, 16–19 March 2009; pp. 4421–4430.
Zhang, X.; Zhang, Q.; Zhao, S.; Ferrari, R.M.; Polycarpou, M.M.; Parisini, T. Fault detection and isolation of the wind turbine benchmark: An estimation-based approach. In Proceedings of the International Federation of Automatic Control (IFAC) World Congress, Milano, Italy, 28 August–2 September 2011; Volume 2, pp. 8295–8300.
Shaker, M.S.; Patton, R.J. Active sensor fault tolerant output feedback tracking control for wind turbine systems via T-S model. Eng. Appl. Artif. Intell. 2014, 34, 1–12. [Google Scholar] [CrossRef]
Shi, F.; Patton, R. An active fault tolerant control approach to an offshore wind turbine model. Renew. Energy 2015, 75, 788–798. [Google Scholar] [CrossRef]
Vidal, Y.; Tutiven, C.; Rodellar, J.; Acho, L. Fault diagnosis and fault-tolerant control of wind turbines via a discrete time controller with a disturbance compensator. Energies 2015, 8, 4300–4316. [Google Scholar] [CrossRef] [Green Version]
Dong, J.; Verhaegen, M. Data driven fault detection and isolation of a wind turbine benchmark. In Proceedings of the International Federation of Automatic Control (IFAC) World Congress, Milano, Italy, 28 August–2 September 2011; Volume 2, pp. 7086–7091.
Simani, S.; Castaldi, P.; Tilli, A. Data-driven approach for wind turbine actuator and sensor fault detection and isolation. In Proceedings of the International Federation of Automatic Control (IFAC) World Congress, Milano, Italy, 28 August–2 September 2011; pp. 8301–8306.
Laouti, N.; Sheibat-Othman, N.; Othman, S. Support vector machines for fault detection in wind turbines. In Proceedings of International Federation of Automatic Control (IFAC) World Congress, Milano, Italy, 28 August–2 September 2011; Volume 2, pp. 7067–7072.
Stoican, F.; Raduinea, C.F.; Olaru, S. Adaptation of set theoretic methods to the fault detection of wind turbine benchmark. In Proceedings of the International Federation of Automatic Control (IFAC) World Congress, Milano, Italy, 28 August–2 September 2011; pp. 8322–8327.
Odgaard, P.F.; Stoustrup, J. Gear-box fault detection using time-frequency based methods. Annu. Rev. Control 2015, 40, 50–58. [Google Scholar] [CrossRef]
Pang-Ning, T.; Steinbach, M.; Kumar, V. Introduction to Data Mining; Pearson Addison Wesley: Boston, MA, USA, 2005. [Google Scholar]
Zaher, A.; McArthur, S.; Infield, D.; Patel, Y. Online wind turbine fault detection through automated SCADA data analysis. Wind Energy 2009, 12, 574. [Google Scholar] [CrossRef]
Kusiak, A.; Li, W.; Song, Z. Dynamic control of wind turbines. Renew. Energy 2010, 35, 456–463. [Google Scholar] [CrossRef]
Jonkman, J. NWTC Information Portal (FAST). Available online: https://nwtc.nrel.gov/FAST (accessed on 18 December 2015).
Jonkman, J.M.; Butterfield, S.; Musial, W.; Scott, G. Definition of a 5-MW Reference Wind Turbine for Offshore System Development; National Renewable Energy Laboratory: Golden, CO, USA, 2009. [Google Scholar]
Kelley, N.; Jonkman, B. NWTC Information Portal (Turbsim). Available online: https://nwtc.nrel.gov/TurbSim (accessed on 18 December 2015).
Ostachowicz, W.; Kudela, P.; Krawczuk, M.; Zak, A. Guided Waves in Structures for SHM: The Time-Domain Spectral Element Method; John Wiley & Sons, Ltd: Hoboken, NJ, USA, 2012. [Google Scholar]
Anaya, M.; Tibaduiza, D.; Pozo, F. A bioinspired methodology based on an artificial immune system for damage detection in structural health monitoring. Shock Vibration 2015, 2015, 1–15. [Google Scholar] [CrossRef]
Anaya, M.; Tibaduiza, D.; Pozo, F. Detection and classification of structural changes using artificial immune systems and fuzzy clustering. Int. J. Bio-Inspired Comput. in press.
Mujica, L.E.; Ruiz, M.; Pozo, F.; Rodellar, J.; Güemes, A. A structural damage detection indicator based on principal component analysis and statistical hypothesis testing. Smart Mater. Struct. 2014, 23, 1–12. [Google Scholar] [CrossRef]
Mujica, L.E.; Rodellar, J.; Fernández, A.; Güemes, A. Q-statistic and T²-statistic PCA-based measures for damage assessment in structures. Struct. Health Monit. 2011, 10, 539–553. [Google Scholar] [CrossRef]
Odgaard, P.F.; Lin, B.; Jorgensen, S.B. Observer and data-driven-model-based fault detection in power plant coal mills. IEEE Trans. Energy Convers. 2008, 23, 659–668. [Google Scholar] [CrossRef]
Ugarte, M.D.; Militino, A.F.; Arnholt, A. Probability and Statistics with R; Chemical Rubber Company (CRC) Press: Boca Raton, FL, USA, 2008. [Google Scholar]
De Groot, M.H.; Schervish, M.J. Probability and Statistics; Pearson: Boston, MA, USA, 2012. [Google Scholar]
Yinghui, L.; Michaels, J.E. Feature extraction and sensor fusion for ultrasonic structural health monitoring under changing environmental conditions. IEEE Sens. J. 2009, 9, 1462–1471. [Google Scholar] [CrossRef]

© 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons by Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pozo, F.; Vidal, Y. Wind Turbine Fault Detection through Principal Component Analysis and Statistical Hypothesis Testing. Energies 2016, 9, 3. https://doi.org/10.3390/en9010003

AMA Style

Pozo F, Vidal Y. Wind Turbine Fault Detection through Principal Component Analysis and Statistical Hypothesis Testing. Energies. 2016; 9(1):3. https://doi.org/10.3390/en9010003

Chicago/Turabian Style

Pozo, Francesc, and Yolanda Vidal. 2016. "Wind Turbine Fault Detection through Principal Component Analysis and Statistical Hypothesis Testing" Energies 9, no. 1: 3. https://doi.org/10.3390/en9010003

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Wind Turbine Fault Detection through Principal Component Analysis and Statistical Hypothesis Testing

Abstract

1. Introduction

2. Wind Turbine Benchmark Model

2.1. Reference Wind Turbine

2.2. Generator-Converter Model

2.3. Pitch Actuator Model

2.4. Fault Scenarios

2.4.1. Actuator Faults

2.4.2. Sensor Faults

3. Fault Detection Strategy

3.1. Data Driven Baseline Modeling Based on PCA

Group Scaling

3.2. Fault Detection Based on Hypothesis Testing

3.2.1. The Random Nature of the Scores

3.2.2. Test for the Equality of Means

4. Simulation Results

4.1. Type I and Type II errors

4.2. Sensitivity and Specificity

4.3. Reliability of the Results

4.4. The Receiver Operating Curves (ROC)

5. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

Nomenclature

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI