Fault Diagnosis Method for Wind Turbine Gearbox Based on Ensemble-Refined Composite Multiscale Fluctuation-Based Reverse Dispersion Entropy

Wang, Xiang; Du, Yang

doi:10.3390/e26080705

Open AccessArticle

Fault Diagnosis Method for Wind Turbine Gearbox Based on Ensemble-Refined Composite Multiscale Fluctuation-Based Reverse Dispersion Entropy

by

Xiang Wang

^1,*

and

Yang Du

²

¹

School of Energy and Power Engineering, Nanjing Institute of Technology, Nanjing 211167, China

²

School of Electrical Engineering, Nanjing Institute of Technology, Nanjing 211167, China

^*

Author to whom correspondence should be addressed.

Entropy 2024, 26(8), 705; https://doi.org/10.3390/e26080705

Submission received: 22 July 2024 / Revised: 16 August 2024 / Accepted: 18 August 2024 / Published: 20 August 2024

(This article belongs to the Special Issue Entropy Applications in Condition Monitoring and Fault Diagnosis)

Download

Browse Figures

Versions Notes

Abstract

The diagnosis of faults in wind turbine gearboxes based on signal processing represents a significant area of research within the field of wind power generation. This paper presents an intelligent fault diagnosis method based on ensemble-refined composite multiscale fluctuation-based reverse dispersion entropy (ERCMFRDE) for a wind turbine gearbox vibration signal that is nonstationary and nonlinear and for noise problems. Firstly, improved complete ensemble empirical mode decomposition with adaptive noise (ICEEMDAN) and stationary wavelet transform (SWT) are adopted for signal decomposition, noise reduction, and restructuring of gearbox signals. Secondly, we extend the single coarse-graining processing method of refined composite multiscale fluctuation-based reverse dispersion entropy (RCMFRDE) to the multiorder moment coarse-grained processing method, extracting mixed fault feature sets for denoised signals. Finally, the diagnostic results are obtained based on the least squares support vector machine (LSSVM). The dataset collected during the gearbox fault simulation on the experimental platform is employed as the research object, and the experiments are conducted using the method proposed in this paper. The experimental results demonstrate that the proposed method is an effective and reliable approach for accurately diagnosing gearbox faults, exhibiting high diagnostic accuracy and a robust performance.

Keywords:

fault diagnosis; wind turbine gearbox; noise reduction; fluctuation-based reverse dispersion entropy; least squares support vector machine

1. Introduction

Wind power generation is increasingly vital in the field of renewable energy owing to the swift advancement of new energy technologies [1]. Their operation and maintenance costs show an exponential growth trend as the installed capacity of wind turbines continues to increase. Gearboxes, which are among the most precise and costly components of wind turbines, represent a significant portion of the investment [2]. In order to minimize the economic losses caused by gearbox failures leading to unit shutdowns, timely warnings and an accurate identification of fault types in fault diagnosis technology become particularly important [3]. Fault diagnosis technology is one of the key factors for the stable development of wind power [4]. The collection of operational data from various key positions of wind turbine units is achieved through the utilization of state detection technology, which is then subjected to intelligent signal processing techniques to evaluate the overall condition of the units. The accurate identification of fault types during abnormal operation of the units provides reliable support for maintenance work. Fault diagnosis of wind power units generally includes three steps: signal processing [5], feature extraction [6], and fault identification [7].

Vibration signals collected from gearboxes in harsh environments are frequently contaminated by a significant amount of noise, which complicates the extraction of signal features. Therefore, it is crucial to perform time–frequency domain analyses on the signals [8]. The most commonly employed signal processing techniques include empirical mode decomposition (EMD) [9], variational mode decomposition (VMD) [10], and related improved methods. ICEEMDAN was proposed in 2014 [11] and is an improved algorithm of the complementary ensemble EMD (CEEMD) method, which reduces a lot of unnecessary components and reduces the pseudo-modalities to a large extent. ICEEMDAN is extensively utilized in signal processing, data analysis, image processing, and other related applications with the advantage of sufficient decomposition [12,13,14]. In 2023, Su et al. [15] were inspired by the natural frost growth mechanism and proposed the rime optimization algorithm (RIME). The RIME algorithm simulates the movement of delicate ice crystals for the purpose of developing algorithms through the modeling of interaction patterns between hard frost agents, exhibiting a high global search capability, fast convergence, and wide adaptability [16]. In this study, RIME is employed to optimize the number of realizations (NR) and the noise standard deviation (Nsd) of the added noise during ICEEMDAN decomposition.

The issue of filtering noise-containing and noise-free signals represents a significant challenge for the intrinsic modal function (IMF) components obtained by ICEEMDAN decomposition. Effective signals typically display a certain degree of periodicity or regularity, whereas noise often manifests as random or irregular phenomena [17]. Permutation entropy (PE) is highly sensitive to the intricacy of vibration signal temporal data sequences [18], enabling the capture of regular changes in the signals and the effective distinction between signal and noise components. In this study, PE is selected as the criterion for evaluating the noise-containing and noise-free signals in the IMF components. The threshold value has been set to 0.6 [19] and will determine whether the IMF component requires noise reduction.

The conventional approach to noise cancellation entails the utilization of filters for the reduction of signal noise by applying the spectral distribution law of signal and noise [20]. The Fourier transform is initially employed to transform the signal containing noise into the frequency domain [21]. Subsequently, a low-pass filter is utilized for the purpose of denoising. Wavelet transform (WT) exhibits excellent time–frequency localization characteristics, which have been extensively investigated in the context of signal denoising, resulting in highly promising outcomes [22]. It has emerged as a pivotal approach in signal denoising. It has been proposed that an SWT should be employed for the purpose of noise reduction in order to efficiently identify the defect features present in the vibration signal [23]. The advantages of the SWT over the ordinary WT include the following: SWT provides a more stable frequency domain representation, it facilitates the examination of nonstationary signals, and it is more sensitive to the capture of local details in the signal [24]. Furthermore, the SWT is more appropriate for the study of characteristics such as long-term trends and abrupt changes and, therefore, has a wider application in signal processing and analysis.

The nonlinearity and instability of gearbox vibration signals make the application of linear feature extraction methods impractical. Consequently, entropy-based methodologies grounded in nonlinear dynamics theory have been extensively utilized in electromechanical fault identification. Entropy is an appropriate physical quantity for assessing the regularization procedure and intricacy of data sequences, especially while analyzing nonstationary and nonlinear vibration signals [25]. Reverse dispersion entropy (RDE) [26] and fluctuation dispersion entropy (FDE) [27] represent the improved algorithms of dispersion entropy [28]. RDE redefines the calculation of entropy values with distance information, making it more capable of detecting abrupt signals. Its advantages are more pronounced when dealing with highly impulsive vibration signals compared to traditional entropy algorithms. FDE is designed to accommodate the inherent volatility of the sequence, offering a robust and less susceptible approach to noise interference. Inspired by two improved entropy algorithms, Li et al. [29] proposed fluctuation-based reverse dispersion entropy (FRDE). In contrast to traditional entropy algorithms, FRDE is capable of accurately estimating the complexity of a signal by considering the disparity between neighboring elements within the dataset and by incorporating distance information following the removal of a trend from the data.

However, the aforementioned methods exclusively contemplate signals at a singular scale, which may overlook key time-related data. Costa et al. [30] introduced the concept of multiscale entropy in order to address this constraint. This algorithm effectively addresses the singularity problem of traditional entropy algorithms in expressing timeseries features by applying multiscale coarse-graining processing to the temporal sequence. In recent years, gearbox failure diagnosis technology utilizing multiscale entropy algorithms has seen an increased application. Zheng et al. [31] utilized multiscale fuzzy entropy to extract signal features for fault diagnosis input into support vector machines. Shao et al. [32] improved the entropy bias phenomenon in traditional multiscale entropy algorithms using reverse dispersion entropy, addressing the shortcomings of coarse-graining processing through time-shifted sequences. However, the length of the temporal sequence after multiscale processing is shortened as the scale factor grows, leading to reduced stability of the entropy values. The majority of extant methodologies address this issue through the exclusive utilization of first-order moment processing, thereby failing to consider the potential value of other information present within the timeseries. Therefore, this paper proposes an improved feature extraction method based on ERCMFRDE, extending the ensemble coarse-graining processing method from multiorder moment coarse-grained processing. The attributes of the temporal data can be more comprehensively articulated through the parallel processing of features extracted from multiple perspectives.

It is necessary to classify and identify fault characteristics after extracting them from the vibration signals of the gearbox. The application of neural networks [33] and support vector machines (SVM) [34] to small sample datasets is susceptible to overlearning and to a lack of robust generalization capabilities. The LSSVM [35] is a specific form of SVM that operates under a quadratic loss function. This approach not only simplifies the computational process, but also effectively avoids the characteristics of local minima [36]. Li et al. [37] presented a whole life cycle failure diagnostic technique for high-speed rolling bearings based on enhanced grey wolf optimization for LSSVM. Lu et al. [38] employed a sparse empirical wavelet transform in conjunction with an adaptive dynamic LSSVM for the purpose of failure diagnosis of gear pumps. The particle swarm optimization (PSO) algorithm has the potential to enhance the speed of optimization of LSSVM parameters and augment the overall computational capacity of the model [39]. This paper presents the construction of the PSO–LSSVM model for the purpose of gearbox fault diagnosis.

This paper presents an intelligent diagnostic method for the identification of defects in wind turbine gearboxes based on ICEEMDAN–SWT, ERCMFRDE, and LSSVM. Firstly, the wind turbine gearbox vibration signal is decomposed into noise-containing and noise-free signals by the ICEEMDAN method. The noise-containing components screened with the arrangement entropy as the threshold value are subjected to the SWT noise reduction, and the noise-containing components after noise reduction and the unprocessed noise-free components are reorganized. Secondly, the feature set of the denoised signals is extracted using the ERCMFRDE method, which is based on multiorder central moments. Then, the RFE feature selection is employed to retain the most useful features and the feature set is partitioned into training and testing sets, randomly. Finally, the LSSVM model is then utilized to train the model using the training set, and fault identification is performed on the testing set. The experimental results substantiate the efficacy, superiority, generalizability, and noise resistance of the proposed intelligent diagnosis method.

2. Basic Principles

2.1. ICEEMDAN–SWT

2.1.1. Rime Optimization Algorithm

RIME simulates the movement of delicate ice crystals for algorithmic search and develops algorithms by replicating the interaction patterns among agents experiencing severe frost conditions.

Phase 1. Soft frost search mechanism:

In breezy conditions, soft frost grows with strong randomness. This enables frost particles to freely cover the surface of the object, but with a gradual growth in a consistent direction. This study presents a novel approach called the soft frost search mechanism, which takes advantage of the high level of randomness and wide coverage provided by frost particles to rapidly cover the entire search space and avoid falling into a local optimal solution.

R_{i j}^{n e w} = R_{b e s t, j} + r_{1} \cdot \cos θ \cdot β \cdot (h \cdot (U b_{i j} - L b_{i j}) + L b_{i j}), r_{2} < E \frac{n!}{r! (n - r)!}

(1)

The parameters are expressed as:

θ = π \cdot \frac{t}{10 \cdot T}

(2)

β = 1 - [\frac{w \cdot t}{T}] / w

(3)

where

β

is mathematically modeled as a step function, the symbol “[ ]” indicates that rounding is being used, and w has a default value of 5 and is utilized to regulate the quantity of segments in the step function. The attachment coefficient, denoted as E, directly influences the likelihood of coagulation of the agent and rises as the number of repetitions increases.

E = \sqrt{(t / T)}

(4)

Phase 2. Hard Frost Perforation mechanism:

In the presence of strong gusty winds, hard frost tends to develop in a more predictable and consistent manner, whereas soft frost is more prone to developing in a more erratic and random manner. Hard frost agents snowball in a single direction, exhibiting a proclivity for crossing phenomena. Consequently, this study presents a hard frost piercing mechanism with the objective of enhancing the convergence of the algorithm and the capacity to escape from local optima.

R_{i j}^{n e w} = R_{b e s t, j}, r_{3} < F^{n o r m r} (S_{i})

(5)

Phase 3. Aggressive greedy selection mechanism:

A positive greedy selection mechanism for population updating in meta-heuristic optimization algorithms. The mechanism decides whether or not to replace an individual by comparing its updated fitness value with its preupdated value and replaces the solutions of both individuals simultaneously.

2.1.2. Improved Complete Ensemble EMD

ICEEMDAN is an enhanced algorithm derived from EMD. The specific steps for its decomposition are as follows:

Step 1. Apply white noise to the original signal x:

X_{1}^{(i)} = x + e_{1} E_{1} (w^{(i)}), i = 1, 2 \dots n

(6)

where

e_{1}

is the noise standard deviation of the first decomposed signal and

w^{(i)}

is a series of Gaussian white noises.

Step 2. The first residual:

r_{1} = 〈 X_{1}^{(i)} - E_{1} (X_{1}^{(i)}) 〉

(7)

where < > indicates averaging.

Step 3. The original signal x is subtracted from the first calculation to obtain the residual

r_{1}

and obtain the first component:

I M F 1 = x - r_{1}

(8)

Step 4. The second residual is estimated as the mean of a series of

r_{1} + e_{2} E_{2} (ω^{i})

and the second component is obtained:

I M F 2 = r_{1} - r_{2} = r_{1} - 〈 (r_{1} + e_{2} E_{2} (w^{i})) 〉

(9)

Step 5. The residuals of the

k^{t h}

order modes

r_{k}

are calculated:

r_{k} = 〈 X_{k}^{(i)} - E_{k} (X_{k}^{(i)}) 〉

(10)

Step 6. Calculate

I M F k

.

I M F k = r_{k - 1} - r_{k}

(11)

where k is the total number of IMF.

Step 7. Return to step 5 to calculate

r_{k}

.

2.1.3. Permutation Entropy

Assuming that the initial signal sequence is

X = {x_{i} | i = 1, 2, \dots, n}

, the time delay is

τ

and the embedding parameter is m, the steps for calculating the alignment entropy of X are as follows:

Step 1. A coarse-graining calculation is performed on the initial signal, and the coarse-graining process is illustrated in Figure 1 when

τ

= 2.

The sequence of signal reconstruction after coarse graining is represented as

Y = {y (j) | j = 1, 2, \dots, [N / τ]}

, where

[N / τ]

is the rounding parameter. The calculations are as follows:

y_{j}^{τ} = \frac{1}{τ} \sum_{i = (j - 1) τ + 1}^{j τ} x_{i}, 1 \leq j \leq [N / τ]

(12)

Step 2. The reconstructed sequence is mapped in the phase space under an embedding parameter of m on the basis of the signal reconstruction sequence. The equation is as follows:

[\begin{matrix} y (1) & y (2) & \dots & y (1 + m) \\ y (2) & y (3) & \dots & y (2 + m) \\ ⋮ & ⋮ & ⋮ \\ y (j) & y (j + 1) & \dots & y (j + m) \\ ⋮ & ⋮ & ⋮ \\ y (ν) & y (ν + τ) & \dots & y (ν + m) \end{matrix}]

(13)

where

ν = [\begin{matrix} N / τ \end{matrix}] - m

.

Step 3. The elements in each row vector in the phase space are organized in a decreasing order. The vector of indexed columns is defined as

s (l) = (z_{1}, z_{2}, \dots, z_{m})

and satisfies

l = 1, 2; \dots ν

in order to represent the number of columns in which the elements of each row are organized in a decreasing order. The corresponding descending row vector is denoted as:

y (j + z_{1}) \leq y (j + z_{2}) \leq \dots \leq y (j + z_{m})

(14)

There is a total of m! ways of arranging the elements of s(l), and the values of the probability of arranging each row of the phase space are

P_{1}, P_{2}, \dots, P_{ν}

, respectively. The value of the entropy of arranging PE is:

H_{P E} = - \sum_{j = 1}^{m!} P_{j} \ln P_{j}

(15)

when

P_{j} = 1 / m!

, the HPE takes the maximum value

\ln (m!)

. The HPE is generally normalized:

H_{P E} = H_{P E} / \ln (m!)

(16)

From Equation (16), the HPE belongs to 0~1. The HPE value of regular signals is very small, close to 0, while the entropy of the complex disordered arrangement has a large value, close to 1. Timeseries with PE values greater than 0.6 are typically regarded as noise, whereas timeseries with PE values less than or equal to 0.6 are deemed to be valid components that are strongly correlated with the signal [19].

2.1.4. Stationary Wavelet Transform

SWT exhibits a shift-invariant property in contrast to the WT. Upsampling is utilized in the transformation process rather than employing downsampling. This ensures that the transformed signal retains the same length as the original signal and that the transformed coefficients remain unaltered.

The original signal, designated as

f (t)

, can be expressed in the form of a continuous wavelet transform equation:

W_{f} (2^{j}, b) = 2^{\frac{j}{2}} \int_{- \infty}^{+ \infty} f (t) Ψ (\frac{t - b}{2^{j}}) d t

(17)

When

b = k

, the above equation represents the stationary wavelet transform.

The Mallat decomposition algorithm is:

{\begin{matrix} a_{j - 1, k} = \sum_{n} h_{n} a_{j, 2 k + n} \\ d_{j - 1, k} = \sum_{n} g_{n} a_{j, 2 k + n} \end{matrix}

(18)

The Atrous decomposition algorithm is:

{\begin{matrix} a_{j - 1, k} = \sum_{n} h_{n} a_{j, k + 2^{- j} n} \\ d_{j - 1, k} = \sum_{n} g_{n} a_{j, k + 2^{- j} n} \end{matrix}

(19)

where h and g are low-pass and high-pass filters, respectively.

a_{j - 1, k}

indicates the scale coefficients and

d_{j - 1, k}

indicates the wavelet coefficients, reflecting the approximation signal and wavelet coefficients, respectively.

The SWT decomposition process is shown in Figure 2.

2.2. ERCMFRDE

2.2.1. Fluctuation-Based Reverse Dispersion Entropy

FRDE is a nonlinear dynamic feature extraction method that accurately measures regularities and identifies mutations in vibration signals. For a one-dimensional signal of length N:

X = {x_{1}, x_{2}, \dots, x_{N}}

, with the FRDE value of X defined as follows [26]:

Step 1. The temporal sequence is normalized and standardized to a range of 0 to 1. We map X to

Y = {y_{1}, y_{2}, \dots, y_{N}}

by normal cumulative distribution function (NCDF).

y_{i} = \frac{1}{σ \sqrt{2 π}} \int_{- \infty}^{x_{i}} e^{\frac{- {(t - γ)}^{2}}{2 σ^{2}}} d t

(20)

where

y_{i} \in (0, 1)

and

σ

and

γ

represent the standard deviation and average value of the timeseries X, respectively.

Step 2. Mapping timeseries Y to c classes. We map Y to

Z^{c} = {z_{1}^{c}, z_{2}^{c}, \dots, z_{N}^{c}}

by using

r o u n d (c \cdot y_{i} + 0.5)

, where c represents the number of categories and

z_{i}

represents a positive integer ranging from 1 to c.

Step 3. Reconstruction of the phase space. We perform a reconstruction of Z into T by using embedding vectors with a time delay of d and an embedding dimension of m. The matrix comprising all embedding vectors can be written as follows:

[\begin{array}{l} {z_{1}^{c}, z_{1 + d}^{c}, \dots, z_{1 + (m - 1) d}^{c}} \\ ⋮ ⋮ \\ {z_{j}^{c}, z_{j + d}^{c}, \dots, z_{j + (m - 1) d}^{c}} \\ ⋮ ⋮ \\ {z_{T}^{c}, z_{T + d}^{c}, \dots, z_{T + (m - 1) d}^{c}} \end{array}]

(21)

where the number of embedding vectors T is equivalent to

N - (m - 1) d

.

Step 4. The mapping of each sequence

z_{j}

to form a pattern

π_{v_{o} v_{1} \dots v_{m - 1}}

is based on the values inherent to each sequence. The following proposition is, therefore, true:

z_{j}^{c} = v_{0}, z_{j + d}^{c} = v_{1}, z_{j + 2 d}^{c} = v_{2}, \dots, z_{j + (m - 1) d}^{c} = v_{m - 1}

(22)

The number of potential dispersion patterns that can be associated with each sequence is equal to

(2 c - 1)^{m - 1}

.

Step 5. Determining the proportionate occurrence rate of each dispersion pattern. The proportion of occurrences of the

i^{th}

dispersion pattern can be mathematically represented as follows:

p (π_{i}) = \frac{N u m b e r {π_{i}}}{N - (m - 1) d} (1 \leq i \leq c^{m})

(23)

where

p (π_{i})

denotes the ratio of the number of

i^{th}

dispersion patterns to the number of embedding vectors.

Step 6. Performing FRDE calculations. The expression can be formulated as follows:

FRDE (x, m, c, d) = \sum_{i = 1}^{{(2 c - 1)}^{m - 1}} {(p (π_{i}) - \frac{1}{{(2 c - 1)}^{m - 1}})}^{2}

(24)

2.2.2. Ensemble-Refined Composite Multiscale FRDE

In the ERCMFRDE framework, the multiple temporal sequence x is generated with distinct starting points for the coarse-graining process, using a scale factor τ. The

k^{t h}

coarse-grained timeseries

x_{k}^{(τ)} = {x_{k, 1}^{(τ)}, x_{k, 2}^{(τ)}, \dots, x_{k, j}^{(τ)}}

of u at the scale level τ can be calculated by employing a variety of distinct methodologies to generate multiple coarse-grained series.

The main coarse-grained treatments of the first-order moment method are calculated as follows:

x_{k, j}^{(τ)} |_{m e a n} = \frac{1}{τ} \sum_{b = k + τ (j - 1)}^{k + τ j - 1} u_{b}, 1 \leq j \leq N, 1 \leq k \leq τ

(25)

x_{k, j}^{(τ)} |_{\max} = \max_{k + τ (j - 1) \leq b \leq k + τ j - 1} (u_{b}), 1 \leq j \leq N, 1 \leq k \leq τ

(26)

x_{k, j}^{(τ)} |_{\min} = \min_{k + τ (j - 1) \leq b \leq k + τ j - 1} (u_{b}), 1 \leq j \leq N, 1 \leq k \leq τ

(27)

The main coarse-grained treatments of the second-order moment method are calculated as follows:

x_{k, j}^{(τ)} |_{var} = \frac{1}{τ} {\sum_{b = k + τ (j - 1)}^{k + τ j - 1} (u_{b} - {\bar{u}}_{b, j})}^{2}, 1 \leq j \leq N, 1 \leq k \leq τ

(28)

x_{k, j}^{(τ)} |_{r m s} = \sqrt{\frac{1}{τ} \sum_{b = k + τ (j - 1)}^{k + τ j - 1} {u_{b}}^{2}}, 1 \leq j \leq N, 1 \leq k \leq τ

(29)

where

{\bar{u}}_{b, j} = \frac{1}{τ} \sum_{b = k + τ (j - 1)}^{k + τ j - 1} u_{b}

.

The main coarse-grained treatment of the third-order moment method is calculated as follows:

x_{k, j}^{(τ)} |_{s k e w n e s s} = \frac{1}{τ} {\sum_{b = k + τ (j - 1)}^{k + τ j - 1} (u_{b} - {\bar{u}}_{b, j})}^{3}, 1 \leq j \leq N, 1 \leq k \leq τ

(30)

where

{\bar{u}}_{b, j} = \frac{1}{τ} \sum_{b = k + τ (j - 1)}^{k + τ j - 1} u_{b}

.

For different coarse-grained series, the following definitions apply to the RCMFRDE values of different coarse-grained processing:

RCMFRDE_t (x, c, m, d, τ) = \frac{1}{τ} \sum_{k = 1}^{τ} FRDE (x_{k, j}^{(τ)} |_{t}, c, m, d)

(31)

where t refers to the type of coarse graining, including mean, max, min, var, rms, and skewness.

The RCMFRDE of ensemble first-order moment coarsening (ERCMFRDE_1) is constituted of the RCMFRDE of mean processing (RCMFRDE_mean), the RCMFRDE of maximum coarse-grained processing (RCMFRDE_max), and the RCMFRDE of minimum coarse-grained processing (RCMFRDE_min):

ERCMFRDE_1 = [RCMFRDE_mean; RCMFRDE_\max; RCMFRDE_\min]

(32)

The RCMFRDE of ensemble second-order moment coarsening (ERCMFRDE_2) is constituted of the RCMFRDE of variance coarse-grained processing (RCMFRDE_var) and the RCMFRDE of root mean square coarse-grained processing (RCMFRDE_rms):

ERCMFRDE_2 = [RCMFRDE_var; RCMFRDE_rms]

(33)

As the third-order coarse-graining encompasses solely the skewness treatment, the RCMFRDE of the ensemble third-order moment coarsening (ERCMFRDE_3) is identical to the RCMFRDE of the skewness coarse-graining treatment (RCMFRDE_skewness).

ERCMFRDE_3 = RCMFRDE_skewness

(34)

The ERCMFRDE is constituted of ERCMFRDE_1, ERCMFRDE_2, and ERCMFRDE_3(RCMFRDE ensemble of six coarse-grained treatments):

ERCMFRDE = [ERCMFRDE_1; ERCMFRDE_2; ERCMFRDE_3]

(35)

2.2.3. Recursive Feature Elimination

The ERCMFRDET feature set contains a large number of features, many of which may be superfluous. These redundant features can lead to overfitting of the algorithm model, reducing its generalization ability. Feature selection is a common preprocessing step that has the effect of enhancing model performance, reducing computational costs, and accelerating the training process.

RFE [40] is a bottom-up method that starts with all features and iteratively construct models while it eliminates the least important features to choose the most advantageous set of features. In each iteration, RFE trains a model and uses feature importance scores (such as coefficient magnitude, information gain, etc.) to determine which features should be removed. This procedure is iterated until the necessary quantity of characteristics is attained or until a predetermined termination condition is satisfied. RFE can be used with different evaluation methods and models, making it highly flexible and applicable. Moreover, RFE demonstrates good stability and reliability in feature selection. Figure 3 shows the flowchart of the recursive feature elimination algorithm.

2.3. Least Squares Support Vector Machine

LSSVM represents a significant advancement in SVM theory, offering a sophisticated and comprehensive theoretical framework. It is capable of converting the solution to a quadratic optimization problem into the solution to a system of linear equations, thereby streamlining the problem-solving process. Consequently, it has been effectively implemented in a number of fields.

Assuming a training set

(x_{i}, y_{i})

, where

x_{i} = (x_{1}, x_{2}, \dots, x_{n})

is the d-dimensional input vector,

y_{i}

represents the corresponding output data and n represents the total number of training data points. The employed nonlinear function estimation is modeled in the following form:

f (x) = b + 〈 ϕ (x), w 〉

(36)

where w is the weight vector, b is the bias term, and the symbol < > refers to the inner product operation.

The assessment problem is described as an optimization problem based on the structured risk minimization principle:

\begin{matrix} \min J (w, e) = \min (\frac{1}{2} w^{2} + \frac{1}{2} γ \sum_{i = 1}^{N} e_{i}^{2}) \\ s . t . y_{i} = w, ϕ (x_{i}) + b + e_{i} i = 1, 2, \dots, N \\ γ > 0 \end{matrix}

(37)

The regularization parameter

γ

is used to identify the optimal balance between model complexity and accuracy. The regression error

e_{i}

represents the discrepancy between the observed and predicted values of the output.

The Lagrange function is formulated to represent the solution to resolve the aforementioned optimization issue.

L_{LSSVM} = \frac{1}{2} w^{2} + \frac{1}{2} γ \sum_{i = 1}^{N} e_{i}^{2} - \sum_{i = 1}^{N} α_{i} {w, ϕ (x_{i}) + b + e_{i} - y_{i}}

(38)

where

α_{i}

represents the Lagrange multiplier.

The optimal solution to the problem can be obtained by setting the derivatives of w, b,

e_{i}

, and

α_{i}

to zero.

\frac{\partial L_{LSSVM}}{\partial w} = 0 \to w = \sum_{i = 1}^{N} α_{i} ϕ (x_{i})

(39)

\frac{\partial L_{1 . SSVM}}{\partial b} = 0 \to \sum_{i = 1}^{N} α_{i} = 0

(40)

\frac{\partial L_{L . SSVM}}{\partial e_{i}} = 0 \to α = γ e_{i}

(41)

\frac{\partial L_{1 . SSVM}}{\partial α_{i}} = 0 \to < w, ϕ (x_{i}) > + b + e_{i} - y_{i} = 0

(42)

The four linear problems can be reduced to a single equation by eliminating w and

e_{i}

.

[\begin{matrix} 0 & E^{T} \\ E & Ω + \frac{1}{γ} E \end{matrix}] [\begin{matrix} b \\ α \end{matrix}] = [\begin{matrix} 0 \\ y \end{matrix}]

(43)

where

y = {(y_{1}, y_{2}, \dots, y_{n})}^{T}

,

α = {(α_{1}, α_{2}, \dots, α_{n})}^{T}

, E is the identity matrix and Ω is a symmetric matrix of N × N kernel functions:

Ω_{i j} = K (x_{i}, x_{j}) = ϕ {(x_{i})}^{T} ϕ (x_{j}) i, j = 1, 2, \dots, N

(44)

where

K (x_{i}, x_{j})

is the kernel function that satisfies Meser’s condition.

Then, the LSSVM model can be expressed as:

\begin{matrix} y (x) = w, ϕ (x) + b \\ = \sum_{i = 1}^{n} α_{i} ϕ (x_{i}) \cdot ϕ (x) + b \\ = \sum_{i = 1}^{n} α_{i} K (x_{i}, x) + b \end{matrix}

(45)

The radial basis function (RBF) kernel is a commonly employed function in various fields of study. The mathematical expression is as follows:

K (x_{i}, x_{j}) = \exp (- \frac{x_{i} - {x_{j}}^{2}}{2 σ^{2}}), σ > 0

(46)

where

σ

is the bandwidth of the kernel function.

The two hyperparameters,

γ

and

σ

, are crucial parameters that significantly influence the performance of the LSSVM model and require meticulous determination. As evidenced in the literature [39], the PSO algorithm has the capacity to accelerate the optimization of LSSVM parameters and enhance the overall computational efficacy of the model. Consequently, this study has selected the PSO algorithm to optimize the model parameters.

3. The Failure Diagnostic Flowchart

To address the issues of nonstationarity, nonlinearity, and noise in wind turbine gearbox vibration signals, this paper proposes a gear failure diagnostic method based on ERCMFRDE, combined with ICEEMDAN–SWT, RFE, and LSSVM. Figure 4 displays the failure diagnostic flowchart. The precise sequence of steps is as follows:

Step 1. The utilization of acceleration sensors enables the acquisition of vibration signals during the operational process of the gearbox.

Step 2. The RIME algorithm is employed for the purpose of optimizing two critical parameters of ICEEMDAN, namely Nsd and NR. The noise signal is decomposed using the ICEEMDAN model with optimized parameters to obtain a series of IMF components and a Res component.

Step 3. The PE value for each IMF component is calculated. SWT is employed to reduce noise in the case of the IMF components with a PE value exceeding 0.6 [19]. The remaining IMF components are retained in their original form. All the IMF components after noise reduction and those not involved in noise reduction are recombined to obtain the post-noise reduction signal.

Step 4. The RCMFRDE values of the signals are calculated as the feature set of the signals, resulting in a subset of fault features based on six different coarse-grained processes.

Step 5. The feature set integrated with six subsets of fault features is subjected to feature dimensionality reduction using the recursive feature elimination algorithm, selecting the top 20 feature values. These selected features form a new fault feature set.

Step 6. The fault feature set is partitioned into training and testing sets, randomly, based on a predefined ratio. The LSSVM model is then utilized to train the model using the training set. Subsequently, fault identification is performed on the testing set.

4. Experimental Verification

In this paper, the fault signals are extracted by simulating the gearbox operation state on the mechanical failure simulation (MFS) experimental platform, and the intelligent diagnosis process is carried out on the software MATLAB R2022b.

4.1. Signal Acquisition and Noise Reduction

The dataset pertaining to the gearbox was obtained through the utilization of an MFS experimental platform. The central part of the experimental system consisted of a driving motor, bearings, gearbox, and the data acquisition equipment, as shown in Figure 5. The motor speed could reach up to 3600 rpm. Additionally, a magnetic powder brake attached to the output terminal of the gearbox allowed for the manual adjustment of the corresponding load state by rotating it. The scale displayed values from 0 to 5, corresponding to torque values ranging from 0.056 Nm to 1.129 Nm. During the operation of the experimental setup, the accelerometers captured vibration data from both the gearbox and its base, with signals sampled at a frequency of 15 kHz. This experimental setup enabled the analysis of the vibrational properties of individual equipment faults and the investigation of the coupling effects between multiple faults by combining different fault modules during mechanical operation simulation. As illustrated in Figure 6, the gearbox fault diagnostic research toolkit employed in the experiments encompassed normal state (NOR) gears, broken tooth fault (BTF) gears, missing tooth fault (MTF) gears, and surface wear fault (SWF) gears.

The studies were carried out at a motor speed of 1750 rpm to evaluate the motor’s normal condition as well as three different fault situations. The initial sample for each state was divided into 100 subsamples, each containing 2048 sampling points. Each set of subsamples was further split into training and testing sets following a 3:2 ratio (See Table 1).

The acquired vibration signals of the four faulty gears were subjected to the ICEEMDAN–SWT denoising process, and the obtained vibration waveform are shown in Figure 7. It can be observed that the noise-canceled signal exhibited more pronounced vibration shocks compared to the original signal, indicating that the ICEEMDAN–SWT noise reduction method can improve the signal quality and retain useful fault characteristic information.

4.2. Feature Extraction

The RCMFRDE values with different coarse-grain processing of each signal in the sample set were computed under the following parameters: embedding dimension m = 2, scale factor s = 20, classification category c = 6, and time delay d = 1 [25]. This resulted in six fault feature matrices, shown in Table 2. Figure 8 shows the error bar plots of RCM–FRDE computed for the four gear states.

It can be observed that the RCMFRDE values yielded distinct fault features based on different coarse-graining theories from the error bar plots. By comparing the first three graphs in Figure 8 with the last three graphs, the error bars of (a), (b), and (c) appear relatively small, indicating a higher stability of the entropy values when the coarse-graining mode was a first-order moment. The comparison of (a), (b), and (c) in Figure 8 illustrates that the RCMFRDE_mean and RCMFRDE_max showed good discrimination on the vast majority of scales, but RCMFRDE_min was not practical for discriminating between NOR and SWF. As shown in Figure 8d,e, the trends of RCMFRDE after two different second-order moment coarsening treatments were also different. RCMFRDE_var exhibited higher discrimination at most scales, but distinguishing between NOR and SWF became challenging at more prominent scale factors. However, the characteristic curves of RCMFRDE_rms for different faults overlapped at more scales. In Figure 8f, clear distinctions in entropy curves for each fault state were observed when the scale factor was between 5 and 15.

In general, RCMFRDE from different coarse-grained processing was not consistently discriminated at different scales. However, ensemble processing enabled features to complement each other, thereby enhancing feature differentiation. ERCMFRDE-based feature extraction leveraged different characteristics to extract comprehensive feature information from timeseries data, providing a more comprehensive approach to fault feature extraction compared to single-fine-scale composite multiscale entropy extraction methods.

4.3. Feature Selection

As illustrated in Table 2, six distinct feature sets were obtained following the extraction of fault features for all types of faults using RCMFRDE. The implementation of four distinct ensemble processes yielded four ERCMFRDE feature sets, as illustrated in Table 3. Unsupervised feature selection using RFE was implemented for feature importance in ERCMFRDE_1, ERCMFRDE_2, and ERCMFRDE, respectively (given that ERCMFRDE_3 comprised solely RCMFRDE_skewness and possessed a limited number of features, it was unnecessary to apply feature selection.) Subsequently, the significance of RFE sorting of the features was investigated, with the different number of features after RFE sorting input into PSO–LSSVM for fault classification. Due to the large number of features, the author calculated the accuracy rate for the number of features within 40, with the results presented in Figure 9.

Figure 9 indicates that, when the number of features exceeded 20, the accuracy of ERCMFRDE_1, ERCMFRDE_2, and ERCMFRDE became saturated. Consequently, the top 20 most crucial features were selected as the set of features to be employed for fault classification. For the three ensemble feature sets after feature selection, the size of each feature set was 400 × 20. Furthermore, the ERCMFRDE method exhibited the lowest number of requisite features to achieve optimal accuracy and the highest classification accuracy at saturation, thereby demonstrating its superiority over other ensemble processing.

4.4. Intelligent Diagnosis

The ERCMFRDE feature set was then partitioned into training and testing samples, with the training samples used to train the parameters of PSO–LSSVM and the testing samples fed into the trained LSSVM for fault classification. The findings of the fault diagnosis are displayed in Figure 10, with a fault diagnosis accuracy of 100% and no misclassification samples.

We applied feature extraction algorithms using RCMFRDE with different coarse-graining treatments as well as RCMFRDE with different ensemble processing. Twenty repeated experiments were conducted using the same fault sample data and the aforementioned feature extraction methods to compare different feature extraction algorithms and verify the advantage of ERCMFRDE in comprehensively extracting signal features.

The fault diagnosis accuracy rates of nine feature extraction algorithms for 20 repeated experiments are shown in Figure 11, and the maximum, minimum, and average fault diagnosis accuracy rates are shown in Table 4. It can be observed that the ERCMFRDE algorithm, which employs a comprehensive signal feature extraction process, achieved the highest accuracy in fault diagnosis, with an average diagnosis accuracy that was superior to that of the other eight algorithms. The experiment demonstrated that the multiscale fluctuation-based reverse dispersion entropy algorithm, integrated with multiple coarse-graining processes, provide more practical information in feature extraction.

4.5. Noise Resistance Test

In practical engineering applications, the data collected by vibration acceleration sensors installed in wind turbine gearboxes often contain large amounts of noise. We conducted experiments on the fault diagnosis of gearboxes in a noisy background in order to explore the robustness of the model and verify the practicality and anti-interference ability of the model in a real environment.

Gaussian white noise with different levels of interference was added to the dataset to achieve different signal-to-noise ratios (SNR) of 0 dB, 1 dB, 2 dB, 5 dB, 10 dB, and 20 dB. We used five sets of noisy signals to validate the performance of the ICEEMDAN–SWT model proposed in this study regarding noise immunity, and selected VMD–SVD, CEEMD–WT, and no noise reduction as the control benchmark models for the comparison experiments. Each group of experiments was performed 20 times and the average value was calculated, and the results are presented in Figure 12.

In a noise environment with an SNR of 10 dB or 20 dB, the model was largely unaffected by noise in its judgement of faults. In a strong noise environment with an SNR of 0 dB, it still maintained high diagnostic classification accuracy. The diagnostic classification accuracy of the other network models decreased dramatically in the case of increasing noise. It can be concluded that the ICEEMDAN–SWT model proposed in this study exhibits superior anti-noise performance in comparison to other models.

4.6. Comparison Experiment

4.6.1. Comparison of Different Entropy Algorithms

We compared ERCMFRDE with widely used entropy-based feature extraction methods to demonstrate the superiority of the FRDE method. We compared it with the ensemble-refined composite multiscale DE (ERCMDE), ensemble-refined composite multiscale RDE (ERCMRDE), and ensemble-refined composite multiscale FDE (ERCMFDE). The parameters of the various entropy algorithms were chosen consistently and as shown below: l = 2048, s = 20, d = 1, m = 2, c = 6.

The above entropy methods were used to extract the feature set of the fault signal after noise reduction, and the feature matrix was fed into the LSSVM classification model to train and predict the classification results. The intelligent classification was repeated 20 times, and the average accuracy of the fault diagnosis, as well as the maximum and minimum accuracy values, are presented in Table 5. Figure 13 demonstrates the accuracy of the different entropy algorithms for each classification result.

As shown in Figure 13, the proposed ERCMFRDE for gearbox fault diagnosis yielded an average improvement of 1.41%, 2.85%, and 6.6% compared to the other three feature entropy algorithms. Furthermore, the maximum and minimum values were larger than those of the other feature entropy algorithms. The results indicate that the FRDE features were more effective at addressing timeseries problems and were more sensitive to gearbox state information than FDE, RDE, and DE.

4.6.2. Comparison of Different Diagnostic Methods

In the comparative experiments presented in this section, gearbox vibration datasets from two different sources were selected as raw data: Data1 and Data2. Data 1 came from the gearbox vibration fault data extracted from the MFS fault platform in Section 4.1.

Data 2 was derived from the gearbox failure dataset published by Prof. Jiong Tang’s team at the University of Connecticut, USA. The experimental platform is described in the literature [41]. The dataset tested gears in nine states, including healthy condition, missing tooth, root crack, spalling, and chipping tip with five different levels of severity. The dataset comprised 100 samples for each of the nine types of vibration signals, resulting in a total of 900 samples, each of which was 3600 samples in length.

In order to demonstrate the superiority of the proposed ERCMFRDE–LSSVM method, in this paper, we compared it with three classical gearbox fault diagnosis methods: WPD–CNN [42], EEMD–SVM [43], and VMD-DTL [44], based on two data sources. The models trained under different methods were subjected to 20 iterations of random testing of the prediction set. The mean and standard deviation of the 20 recognition accuracies were then calculated and used as the final fault diagnosis results, as illustrated in Figure 14.

As illustrated in Figure 14, the ERCMFRDE–LSSVM method demonstrated a markedly superior success rate in diagnosing gearbox faults across two distinct datasets, outperforming the other three classic methods. This highlights its notable efficiency and superiority. Furthermore, the proposed method demonstrated a low error rate and minimal fluctuations in 20 diagnostic results, indicating its capacity to adapt to diverse vibration signal characteristics and failure modes with relative stability. This observation reflects the method’s stability and reliability.

5. Conclusions

This paper proposes a novel intelligent failure diagnostic method for gearboxes based on ICEEMDAN–SWT, ERCMFRDE, and LSSVM, targeting the characteristics of wind turbine gearbox vibration signals and working environment. Initially, the vibration signal from the wind turbine gearbox was denoised by the ICEEMDAN–SWT noise reduction model. Subsequently, the ERCMFRDE values of the denoised signal were extracted and fed into the LSSVM model for training and identification. The conclusions obtained are as follows:

A comparison of the ERCMFRDE-based feature extraction method with other feature extraction methods revealed that the ERCMFRDE method had the highest accuracy rate, proving to be significantly better than the entropy algorithm with a single coarse-grained processing and the traditional kinds of entropy algorithms. This evidence serves to illustrate the superiority of the proposed ensemble entropy method for feature extraction and fault identification.
In a noisy environment, the fault diagnosis method based on ICEEMDAN–SWT and ERCMFRDE continued to perform well, indicating that the method is effective in eliminating interference components unrelated to fault characteristics. This indicates that the method displays both excellent generalization capabilities and robust performance.
The efficacy and reliability of the fault diagnosis methodology based on ERCMFRDE were verified using two gearbox datasets from different sources. The method could accurately identify the type of faults in gearboxes and has more significant advantages over the other three classic fault diagnosis methods.
The fault diagnosis method based on ERCMFRDE proposed in this paper is a practical approach for extracting the fault features of gearboxes and accurately identifying different types, with an accuracy of 98.88%. This demonstrates that the proposed intelligent diagnostic method offers a significant advantage in gearbox fault diagnosis.
The methods and models investigated in this paper are mainly data-driven and lack consideration of fault mechanisms and knowledge of the unit operating mechanisms. In future research, the complex operating mechanisms of wind turbine gearboxes will be considered and embedded in the data-driven model to guide feature and model learning.

Author Contributions

Conceptualization, X.W. and Y.D.; methodology, X.W. and Y.D.; validation, X.W. and Y.D.; investigation, X.W. and Y.D.; resources, X.W.; data curation, X.W. and Y.D.; writing—original draft, Y.D.; writing—review & editing, X.W.; visualization, Y.D.; supervision, X.W.; project administration, X.W.; funding acquisition, X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by an industry–university–research cooperation project of Jiangsu Province (grant number BY20230704) and by the Scientific Research Foundation of Nanjing Institute of Technology (grant number 3612403222440).

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

DE	Dispersion Entropy
RDE	Reverse Dispersion Entropy
FDE	Fluctuation Dispersion Entropy
FRDE	Fluctuation-based Reverse Dispersion Entropy
ERCMDE	Ensemble Refined Composite Multiscale DE
ERCMRDE	Ensemble Refined Composite Multiscale RDE
ERCMFDE	Ensemble Refined Composite Multiscale FDE
RCMFRDE	Refined Composite Multiscale Reverse Dispersion Entropy
ERCMFRDE	Ensemble RCMFRDE
EMD	Empirical Mode Decomposition
EEMD	Ensemble EMD
CEEMD	Complementary EEMD
ICEEMDAN	Improved CEEMD with Adaptive Noise
WT	Wavelet Transform
SWT	Stationary Wavelet Transform
PE	Permutation Entropy
IMF	Intrinsic Modal Function
SVM	Support Vector Machine
LSSVM	Least Squares Support Vector Machine
RIME	Rime Optimization Algorithm
Nsd	Noise standard deviation
NR	Number of Realizations
NOR	Normal State
MTF	Missing Tooth Fault
BTF	Broken Tooth Fault
SWF	Surface Wear Fault
PSO	Particle Swarm Optimization
rms	Root Mean Square
MFS	Mechanical Failure Simulation
SNR	Signal-to-Noise Ratio
DTL	Deep Transfer Learning
VMD	Variational Mode Decomposition
WPD	Wavelet Packet Decomposition
CNN	Convolutional Neural Network

References

Zhou, H.; Huang, X.; Wen, G.; Lei, Z.; Dong, S.; Zhang, P.; Chen, X. Construction of health indicators for condition monitoring of rotating machinery: A review of the research. Expert Syst. Appl. 2022, 203, 117297. [Google Scholar] [CrossRef]
Lee, J.; Kim, M.; Ko, J.U.; Ha Jung, J.; Sun, K.H.; Youn, B.D. Asymmetric inter-intra domain alignments (AIIDA) method for intelligent fault diagnosis of rotating machinery. Reliab. Eng. Syst. Saf. 2022, 218, 108186. [Google Scholar] [CrossRef]
Shuai, Y. Research on Fault Diagnosis Technology of Industrial Robot Operation Based on Deep Belief Network. Sci. Program. 2022, 2022, 9260992. [Google Scholar] [CrossRef]
Benbouzid, M.; Berghout, T.; Sarma, N.; Djurovic, S.; Wu, Y.; Ma, X. Intelligent Condition Monitoring of Wind Power Systems: State of the Art Review. Energies 2021, 14, 5967. [Google Scholar] [CrossRef]
Ehya, H.; Nysveen, A.; Skreien, T.N. Performance Evaluation of Signal Processing Tools Used for Fault Detection of Hydrogenerators Operating in Noisy Environments. IEEE Trans. Ind. Appl. 2021, 57, 3654–3665. [Google Scholar] [CrossRef]
Yang, J.; Zhou, C.; Li, X.; Pan, A.; Yang, T. A Fault Feature Extraction Method Based on Improved VMD Multi-Scale Dispersion Entropy and TVD-CYCBD. Entropy 2023, 25, 277. [Google Scholar] [CrossRef]
Yan, X.; Liu, Y.; Jia, M. Multiscale cascading deep belief network for fault identification of rotating machinery under various working conditions. Knowl.-Based Syst. 2020, 193, 105484. [Google Scholar] [CrossRef]
Chen, W.; Chen, H.; Yan, X.; Yang, Y.; Jin, S. Hybrid Entropy in the Time-Frequency Domain for Grading Electrode Sediment Identification. IEEE Access 2022, 10, 54751–54763. [Google Scholar] [CrossRef]
Zhong, J.-H.; Wong, P.K.; Yang, Z.-X. Fault diagnosis of rotating machinery based on multiple probabilistic classifiers. Mech. Syst. Signal Process. 2018, 108, 99–114. [Google Scholar] [CrossRef]
Lin, Y.; Xiao, M.; Liu, H.; Li, Z.; Zhou, S.; Xu, X.; Wang, D. Gear fault diagnosis based on CS-improved variational mode decomposition and probabilistic neural network. Measurement 2022, 192, 110913. [Google Scholar] [CrossRef]
Colominas, M.A.; Schlotthauer, G.; Torres, M.E. Improved complete ensemble EMD: A suitable tool for biomedical signal processing. Biomed. Signal Process. Control 2014, 14, 19–29. [Google Scholar] [CrossRef]
Zhang, J.; Chen, Q.; Ye, Q. Nondestructive Testing of Steel Wire Rope Based on Gagnetic Signal and Infrared Information. Russ. J. Nondestruct. Test. 2023, 59, 991–1004. [Google Scholar] [CrossRef]
Liu, J.; Li, T. Multi-step power forecasting for regional photovoltaic plants based on ITDE-GAT model. Energy 2024, 293, 130468. [Google Scholar] [CrossRef]
Tian, S.; Wang, C.; Gong, X.; Huang, D. Permutation fuzzy entropy based ICEEMDAN de-noising for inertial sensors. Meas. Sci. Technol. 2024, 35, 066304. [Google Scholar] [CrossRef]
Hang, S.; Dong, Z.; Asghar, H.A.; Lei, L.; Xiaoqin, Z.; Majdi, M.; Huiling, C. RIME: A physics-based optimization. Neurocomputing 2023, 532, 183–214. [Google Scholar]
Honghui, S.; Ning, C. Cavitation fault diagnosis of centrifugal pump based on RIME-SDAE. Vibroeng. Procedia 2024, 54, 46–52. [Google Scholar]
Zhang, Z.; Li, D. Multi-label algorithm based on rough set of fractal dimension attribute. J. Supercomput. 2020, 76, 1105–1115. [Google Scholar] [CrossRef]
Voltarelli, L.G.J.M.; Pessa, A.A.B.; Zunino, L.; Zola, R.S.; Lenzi, E.K.; Perc, M.; Ribeiro, H.V. Characterizing Unstructured Data with the Nearest Neighbor Permutation Entropy; Chaos: Woodbury, NY, USA, 2024; Volume 34. [Google Scholar]
Zhou, S.; Zhang, Z.X.; Luo, X.; Niu, S.; Jiang, N.; Yao, Y. Developing a hybrid CEEMDAN-PE-HE-SWT method to remove the noise of measured carbon dioxide blast wave. Measurement 2023, 223, 113797. [Google Scholar] [CrossRef]
Ariyanto, A.B.; Supratman, F.Y.; Bethaningtyas, H. Identification disorders of vocal cords through the public communications VoIP network-based IVR with dysphonia severity index (DSI). J. Phys. Conf. Ser. 2018, 1075, 012043. [Google Scholar] [CrossRef]
Johnny, R.; Cornelio, P.; Ernesto, Z. Alternative Method to Estimate the Fourier Expansions and Its Rate of Change. Mathematics 2022, 10, 3832. [Google Scholar] [CrossRef]
Liu, Z.; Li, J.; Zhang, T.; Chen, S.; Xin, D.; Liu, K.; Chen, K.; Liu, Y.C.; Sun, C.; Gao, G.; et al. Accurate Identification of Partial Discharge Signals in Cable Terminations of High-Speed Electric Multiple Unit Using Wavelet Transform and Deep Belief Network. Appl. Sci. 2024, 14, 4743. [Google Scholar] [CrossRef]
Fantini, D.G.; Silva, R.N.; Siqueira, M.B.B.; Pinto, M.S.S.; Guimarães, M.; Brasil, A.C.P. Wind speed short-term prediction using recurrent neural network GRU model and stationary wavelet transform GRU hybrid model. Energy Convers. Manag. 2024, 308, 118333. [Google Scholar] [CrossRef]
Serhat, S.; Erinc, K.; Kayran, A.H.; Tayfun, S. Stationary wavelet transform for fault detection in rotating machinery. Wavelet Appl. Ind. Process. V 2007, 6763, 73–80. [Google Scholar]
Wang, X.; Jiang, H. Gearbox Fault Diagnosis Based on Refined Time-Shift Multiscale Reverse Dispersion Entropy and Optimised Support Vector Machine. Machines 2023, 11, 646. [Google Scholar] [CrossRef]
Li, Y.; Gao, X.; Wang, L. Reverse Dispersion Entropy: A New Complexity Measure for Sensor Signal. Sensors 2019, 19, 5203. [Google Scholar] [CrossRef] [PubMed]
Azami, H.; Escudero, J. Amplitude- and Fluctuation-Based Dispersion Entropy. Entropy 2018, 20, 210. [Google Scholar] [CrossRef]
Rostaghi, M.; Azami, H. Dispersion Entropy: A Measure for Time-Series Analysis. IEEE Signal Process. Lett. 2016, 23, 610–614. [Google Scholar] [CrossRef]
Li, Y.; Jiao, S.; Geng, B.; Jiang, X. RCMFRDE: Refined Composite Multiscale Fluctuation-Based Reverse Dispersion Entropy for Feature Extraction of Ship-Radiated Noise. Math. Probl. Eng. 2021, 2021, 7150921. [Google Scholar] [CrossRef]
Costa, M.; Goldberger, A.L.; Peng, C.K. Multiscale entropy analysis of complex physiologic time series. Phys. Rev. Lett. 2002, 89, 068102. [Google Scholar] [CrossRef]
Zheng, D.; Han, B.; Liu, G.; Li, Y.; Yu, H. Cross-Domain Intelligent Fault Diagnosis Method of Rotating Machinery Using Multi-Scale Transfer Fuzzy Entropy. IEEE Access 2021, 9, 95481–95492. [Google Scholar] [CrossRef]
Shao, K.; Fu, W.; Tan, J.; Wang, K. Coordinated approach fusing time-shift multiscale dispersion entropy and vibrational Harris hawks optimization-based SVM for fault diagnosis of rolling bearing. Measurement 2021, 173, 108580. [Google Scholar] [CrossRef]
Gilbert Chandra, D.; Srinivasulu Reddy, U.; Uma, G.; Umapathy, M. Group normalization-based 2D-convolutional neural network for intelligent bearing fault diagnosis. J. Braz. Soc. Mech. Sci. Eng. 2023, 45, 584. [Google Scholar] [CrossRef]
Li, J.; Guo, Y.; Wall, J.; West, S. Support vector machine based fault detection and diagnosis for HVAC systems. Int. J. Intell. Syst. Technol. Appl. 2019, 18, 204–222. [Google Scholar] [CrossRef]
Fan, M.; Sharma, A. Design and implementation of construction cost prediction model based on SVM and LSSVM in industries 4.0. Int. J. Intell. Comput. Cybern. 2021, 14, 145–157. [Google Scholar] [CrossRef]
Vinayaka, K.U.; Puttaswamy, P.S. Estimation of Electric Arc Furnace Parameters Using Least-Square Support Vector Machine. SN Comput. Sci. 2023, 4, 276. [Google Scholar] [CrossRef]
Li, J.; Luo, W.; Bai, M.; Song, M. Fault diagnosis of high-speed rolling bearing in the whole life cycle based on improved grey wolf optimizer-least squares support vector machines. Digit. Signal Process. 2024, 145, 104345. [Google Scholar] [CrossRef]
Lu, Y.; Huang, Z. A new hybrid model of sparsity empirical wavelet transform and adaptive dynamic least squares support vector machine for fault diagnosis of gear pump. Adv. Mech. Eng. 2020, 12, 1687814020922047. [Google Scholar] [CrossRef]
Wang, H.; Zhang, C.; Zhou, B.; Xue, S.; Wang, F. Prediction of the Productivity Ratio of Perforated Wells Using Least Squares Support Vector Machine with Particle Swarm Optimization. Appl. Sci. 2023, 13, 12978. [Google Scholar] [CrossRef]
Saberi, A.N.; Belahcen, A.; Sobra, J.; Vaimann, T. LightGBM-Based Fault Diagnosis of Rotating Machinery Under Changing Working Conditions Using Modified Recursive Feature Elimination. IEEE Access 2022, 10, 81910–81925. [Google Scholar] [CrossRef]
Cao, P.; Zhang, S.; Tang, J. Preprocessing-Free Gear Fault Diagnosis Using Small Datasets With Deep Convolutional Neural Network-Based Transfer Learning. IEEE Access 2018, 6, 26241–26253. [Google Scholar] [CrossRef]
Huang, D.; Zhang, W.A.; Guo, F.; Liu, W.; Shi, X. Wavelet Packet Decomposition-Based Multiscale CNN for Fault Diagnosis of Wind Turbine Gearbox. IEEE Trans. Cybern. 2021, 53, 443–453. [Google Scholar] [CrossRef] [PubMed]
Sun, S.; Lu, R. Fault diagnosis of gearbox based on ant colony algorithm optimized support vector machine. J. Phys. Conf. Ser. 2021, 2137, 012068. [Google Scholar] [CrossRef]
Wu, Z.; Bai, H.; Yan, H.; Zhan, X.; Guo, C.; Jia, X. Intelligent Fault Diagnosis Method for Gearboxes Based on Deep Transfer Learning. Processes 2022, 11, 68. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of the initial signal coarse-graining process.

Figure 2. The SWT decomposition process.

Figure 3. The flowchart of the RFE algorithm.

Figure 4. The fault diagnosis flowchart.

Figure 5. MFS experimental platform.

Figure 6. Four states of the gear: (a) normal state; (b) broken tooth fault; (c) missing tooth fault; (d) surface wear fault.

Figure 7. Original signal and denoised signal. (a) Normal state, (b) broken tooth fault, (c) missing tooth fault, (d) surface wear fault.

Figure 8. The RCMFRDE value. (a) RCMRDE_mean, (b) RCMFRDE_max, (c) RCMRDE_min, (d) RCMFRDE_var, (e) RCMRDE_rms, and (f) RCMFRDE_skewness.

Figure 9. Relationship among the number of different features and accuracy.

Figure 10. The LSSVM classification results.

Figure 11. Classification results of RCMFRDE with different coarse-grained processing.

Figure 12. Classification results of different models with noise content.

Figure 13. Classification results of different entropy algorithms.

Figure 14. Classification results of different diagnosis methods.

Table 1. Description of the MFS gearbox dataset.

Fault Types	Motor Speed (r/min)	Number of Training Samples	Number of Testing Samples	Class Label
Normal state	1750	60	40	1
Broken tooth	1750	60	40	2
Missing tooth	1750	60	40	3
Surface wear	1750	60	40	4

Table 2. The results in six fault feature matrices.

Entropy Value	Moment Processing	Coarse Graining	Size
RCMFRDE_mean	First-order	Mean	400 × 20
RCMFRDE_max	First-order	Maximum	400 × 20
RCMFRDE_min	First-order	Minimum	400 × 20
RCMFRDE_var	Second-order	Variance	400 × 19
RCMFRDE_rms	Second-order	Root mean square	400 × 19
RCMFRDE_skewness	Third-order	Skewness	400 × 18

Table 3. The results in four fault feature matrices.

Ensemble Entropy Set	Entropy Value	Moment Processing	Coarse Graining	Size
ERCMFRDE_1	RCMFRDE_mean	First-order	Mean	400 × 60
	RCMFRDE_max		Maximum
	RCMFRDE_min		Maximum
ERCMFRDE_2	RCMFRDE_var	Second-order	Variance	400 × 38
ERCMFRDE_2	RCMFRDE_rms	Second-order	Root mean square	400 × 38
ERCMFRDE_3	RCMFRDE_skewness	Third-order	Skewness	400 × 18
ERCMFRDE	RCMFRDE_mean	First-order	Mean	400 × 116
	RCMFRDE_max		Maximum
	RCMFRDE_min		Maximum
	RCMFRDE_var	Second-order	Variance
	RCMFRDE_rms	Second-order	Root mean square
	RCMFRDE_skewness	Third-order	Skewness

Table 4. Diagnostic accuracy of nine kinds of algorithm.

Entropy Value	Accuracy (%)
Entropy Value	Maximum	Minimum	Average
RCMFRDE_mean	96.88	93.13	95.34
RCMFRDE_max	96.88	91.88	95.22
RCMFRDE_min	93.13	87.50	89.59
RCMFRDE_var	95.63	90.63	93.50
RCMFRDE_rms	95.63	86.25	89.72
RCMFRDE_skewness	96.25	89.38	92.34
ERCMFRDE_1	100.00	96.25	97.75
ERCMFRDE_2	96.88	93.75	95.25
ERCMFRDE	100.00	96.88	98.88

Table 5. Diagnostic accuracy of four kinds of entropy algorithm.

Entropy Value	Accuracy (%)
Entropy Value	Maximum	Minimum	Average
ERCMDE	94.38	90.00	92.28
ERCMRDE	98.13	94.38	96.03
ERCMFDE	100.00	95.63	97.47
ERCMFRDE	100.00	96.88	98.88

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, X.; Du, Y. Fault Diagnosis Method for Wind Turbine Gearbox Based on Ensemble-Refined Composite Multiscale Fluctuation-Based Reverse Dispersion Entropy. Entropy 2024, 26, 705. https://doi.org/10.3390/e26080705

AMA Style

Wang X, Du Y. Fault Diagnosis Method for Wind Turbine Gearbox Based on Ensemble-Refined Composite Multiscale Fluctuation-Based Reverse Dispersion Entropy. Entropy. 2024; 26(8):705. https://doi.org/10.3390/e26080705

Chicago/Turabian Style

Wang, Xiang, and Yang Du. 2024. "Fault Diagnosis Method for Wind Turbine Gearbox Based on Ensemble-Refined Composite Multiscale Fluctuation-Based Reverse Dispersion Entropy" Entropy 26, no. 8: 705. https://doi.org/10.3390/e26080705

APA Style

Wang, X., & Du, Y. (2024). Fault Diagnosis Method for Wind Turbine Gearbox Based on Ensemble-Refined Composite Multiscale Fluctuation-Based Reverse Dispersion Entropy. Entropy, 26(8), 705. https://doi.org/10.3390/e26080705

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Fault Diagnosis Method for Wind Turbine Gearbox Based on Ensemble-Refined Composite Multiscale Fluctuation-Based Reverse Dispersion Entropy

Abstract

1. Introduction

2. Basic Principles

2.1. ICEEMDAN–SWT

2.1.1. Rime Optimization Algorithm

2.1.2. Improved Complete Ensemble EMD

2.1.3. Permutation Entropy

2.1.4. Stationary Wavelet Transform

2.2. ERCMFRDE

2.2.1. Fluctuation-Based Reverse Dispersion Entropy

2.2.2. Ensemble-Refined Composite Multiscale FRDE

2.2.3. Recursive Feature Elimination

2.3. Least Squares Support Vector Machine

3. The Failure Diagnostic Flowchart

4. Experimental Verification

4.1. Signal Acquisition and Noise Reduction

4.2. Feature Extraction

4.3. Feature Selection

4.4. Intelligent Diagnosis

4.5. Noise Resistance Test

4.6. Comparison Experiment

4.6.1. Comparison of Different Entropy Algorithms

4.6.2. Comparison of Different Diagnostic Methods

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI