A Time Series Prediction-Based Method for Rotating Machinery Detection and Severity Assessment

Zhang, Weirui; Sun, Zeru; Lv, Dongxu; Zuo, Yanfei; Wang, Haihui; Zhang, Rui

doi:10.3390/aerospace11070537

Open AccessArticle

A Time Series Prediction-Based Method for Rotating Machinery Detection and Severity Assessment

by

Weirui Zhang

¹

,

Zeru Sun

²,

Dongxu Lv

³,

Yanfei Zuo

³,

Haihui Wang

^1,*

and

Rui Zhang

^2,*

¹

School of Mathematical Sciences, Beihang University, Beijing 102206, China

²

Aero Engine Academy of China, Beijing 101300, China

³

College of Mechanical and Electrical Engineering, Beijing University of Chemical Technology, Beijing 100029, China

^*

Authors to whom correspondence should be addressed.

Aerospace 2024, 11(7), 537; https://doi.org/10.3390/aerospace11070537

Submission received: 6 May 2024 / Revised: 16 June 2024 / Accepted: 27 June 2024 / Published: 1 July 2024

Download

Browse Figures

Versions Notes

Abstract

:

Monitoring the condition of rotating machinery is critical in aerospace applications like aircraft engines and helicopter rotors. Faults in these components can lead to catastrophic outcomes, making early detection essential. This paper proposes a novel approach using vibration signals and time series prediction methods for fault detection in rotating aerospace machinery. By extracting relevant features from vibration signals and using prediction models, fault severity can be effectively quantified. Our experimental results show that the proposed method has potential in early fault detection and is applicable to various types of bearing faults and the different statuses of these faults under complex running conditions, achieving very good generalization ability.

Keywords:

fault detection; fault severity estimation; time series prediction; vibration signal; aircraft engine

1. Introduction

Monitoring the health of rotating machinery is crucial in ensuring the reliability of aerospace systems. Among the components of rotating machinery, rolling bearings are particularly critical, especially in aerospace applications like aircraft engines and helicopter rotors. Effective diagnosis of bearing faults is vital in reducing maintenance costs and enhancing safety within aerospace systems.

Modern aerospace machinery operates under extreme conditions for long periods, making it susceptible to component failure, which poses significant safety risks and can lead to catastrophic outcomes. Vigilance over the health of rotating equipment in aerospace is therefore essential. In recent years, fault diagnosis technology for rotating machinery has developed rapidly. Researchers have conducted many investigations on the condition assessment and diagnosis of rotating machinery faults based on vibration signals [1,2,3,4].

Based on previous research, fault severity assessment techniques can be broadly categorized into signal processing-based methods and learning-based methods [5]. Traditional signal processing techniques involve extracting features from signals obtained from various sources that are related to fault severity estimation and bearing wear assessment. This is achieved by analyzing specific frequencies or calculating indicators that characterize the faults. Well-known methods in this category include Fourier transform, Hilbert transform [6], wavelet transform [7], empirical mode decomposition [8], and related methods.

Since its inception, deep learning has achieved remarkable success in numerous domains and has found extensive applications in engine fault diagnosis. Deep learning methods for fault diagnosis in rotating machinery can be broadly classified into three categories.

The first category focuses on the detection of defect locations or types, with defect classification as the primary objective. This area has produced very promising results. Notable achievements include CNN-LSTM (Convolutional Neural Network-Long Short-Term Memory) with 99.6% accuracy [9], SDAE (Stacked Denoising Auto-encoder) with 99.83% accuracy [10], and EDAE (Ensemble Deep Autoencoder) with 99.15% accuracy [11], among others.

The second category involves the computation of a degradation indicator that quantifies the health of the machine and facilitates the assessment of fault severity. Regression models are often used for this purpose. For example, Shen et al. [12] used a support vector regression (SVR) model to quantitatively estimate fault sizes. It is also possible to treat different levels of defect severity as distinct categories, using classifiers to accomplish the task. Lei et al. [13] used Wavelet Neural Networks (WNNs) as a severity classifier, with input features selected from the most sensitive Intrinsic Mode Function (IMF) obtained via Empirical Mode Decomposition (EMD). The selection criteria were based on the mean and standard deviation of the kurtosis values for data samples of each IMF.

The third category is a hybrid method, and the above two categories of methods were used together. For example, Smith et al. [14] used a correlation-based algorithm to cluster data into separate groups, upon which Principal Component Analysis (PCA) was applied. Faults are detected by calculating Hotelling’s T² statistic, and potential sources of faults are localized by analyzing PCA loadings, which trace the variables contributing most significantly to the T² values. Chang et al. [15] proposed a multi-fault diagnosis method using the Extreme Gradient Boosting (XGBoost) classifier. It identifies up to 20 different fault types based on vibration signals from the Flight Data Recorder (FDR). Additionally, they utilized a health indicator (HI) assessment model, combined with hyperparameter optimization and random search algorithms, to accurately predict the Remaining Useful Life (RUL) of the landing gear.

This study argues that in real-world operational contexts, the severity of mechanical faults inherently exhibits continuous variation, ranging from mild to severe. A significant portion of the existing research in this area has mainly employed methods dedicated to classification. Such classification methods may have certain limitations when deployed in practice. Therefore, this paper attempts to propose an AI-driven approach to establish a degradation metric for rotating machinery failure, with a particular focus on engine vibration signals. The method has good generalizability while maintaining continuity, making it more suitable for practical applications.

This paper is structured as follows: Section 1 introduces the context and current research landscape related to fault detection and estimation in rotating machines. Section 2 elucidates fundamental concepts, including time and frequency domain features, LSTM networks, and the Dynamic Time Warping algorithm. In Section 3, a novel fault detection approach is proposed. Section 4 presents the experimental validation of the proposed method using simulated vibration signals, coupled with a comprehensive analysis of the results. Finally, in Section 5, conclusions are drawn based on the insights gained from the results of the study.

2. Theoretical Fundamental

This section provides an insight into the techniques used in the proposed diagnostic approach.

2.1. Feature Extraction

As bearing failures progress, the mechanical system experiences an increase in vibration intensity, which is manifested in vibration signals. To prevent machine downtime and ensure optimum production efficiency, the challenge is to monitor the severity of the fault in real time to enable timely intervention based on fault trends. It is therefore essential to extract sensitive features that accurately reflect the current condition of the bearing. This step plays a crucial role in subsequent procedures.

In fault diagnosis, it is crucial to utilize the complete information available from vibration signals and filter out the optimal features during the subsequent information processing steps. Therefore, when conducting feature extraction, the extracted features should encompass all commonly used characteristics in signal analysis. In this paper, a range of common features across time, frequency, and statistical domains, as listed in Table 1, were utilized. These features are defined in references [12,16]. Specific features include the root mean square (RMS), square root of amplitude (SRA), kurtosis value (KV), skewness value (SV), peak-to-peak value (PPV), crest factor (CF), impulse factor (IF), margin factor (MF), shape factor (SF), kurtosis factor (KF), frequency center (FC), RMS frequency (RMSF), root variance frequency (RVF), mean, variance, standard deviation, maximum, and minimum.

2.2. Dynamic Time Warping

Dynamic Time Warping (DTW) [17] is a non-linear warping technique that uses dynamic programming to perform time warping and compute distance measures. By establishing a matching path between data points within two correlated time series of arbitrary length, the algorithm assesses the similarity of these sequences. DTW excels at handling time shifts between sequences and demonstrates notable fault tolerance and robustness. Therefore, it has been used as an application in the feature selection phase of rotating bearing fault detection. Experimental results indicate that DTW can improve the accuracy of fault diagnosis while minimizing the number of features, thereby improving the efficiency and effectiveness of fault diagnosis [18].

Assume that the corresponding lengths of the two time series

X = {x_{1}, x_{2}, \dots, x_{n}}

and

Y = {y_{1}, y_{2}, \dots, y_{m}}

are

n

and

m

, respectively. The core principle of DTW revolves around finding an optimal alignment path between

X

and

Y

. DTW aims to discover an alignment path

P

that minimizes the total distance between corresponding elements in

X

and

Y

. To construct

P

, we define a set of conditions: the path starts at the first elements of both sequences

(i_{1}, j_{1}) = (1, 1)

, ends at the last elements

(i_{k}, j_{k}) = (n, m)

, and adheres to monotonically increasing indices, where

i_{1} \leq i_{2} \leq \dots \leq i_{k}

and

j_{1} \leq j_{2} \leq \dots \leq j_{k}

. The objective is to find the alignment path that satisfies these conditions while minimizing the accumulated cost along the path.

To compute the DTW distance between sequences X and Y, the algorithm employs a dynamic programming approach. It constructs a matrix

D

with dimensions

n \times m

, initialized with infinity values. The elements

D [i] [j]

represent the cumulatived cost up to the

(i, j)

element in the alignment path. The computation involves pairwise comparison elements

(x_{i}, y_{j})

in the sequences and updating the matrix

D

based on the following formula:

D [i] [j] = \cos t (x_{i}, y_{j}) + \min {D [i - 1] [j], D [i] [j - 1], D [i - 1] [j - 1]}

(1)

Once the matrix is complete, the DTW distance is calculated as

D [n] [m]

, which is the optimal cumulative cost of the alignment path.

2.3. Long Short-Term Memory

Long Short-Term Memory (LSTM) has demonstrated significant success in the field of speech and images. Engineers extract the acoustic features (mel-frequency cepstral coefficients and log-filterbank energy) of speech and feed them into the LSTM network for natural language modeling [19]. Given this, we fed a specific order of multi-domain features into the LSTM model instead of raw signals for fault diagnosis.

This architecture was introduced by Hochreiter and Schmidhuber in 1997 [20]. The LSTM has demonstrated superior classification and regression capabilities over conventional RNNs on time series datasets, including speech and natural language processing datasets. LSTM contains a memory cell that replaces the role of hidden RNN units. This memory cell contains four gates: the forgetting gate (which determines the extent to which information from the previous cell state is retained), the input gate (which determines whether new information is incorporated into the cell), the input modulation gate (which regulates the amount of information to be written into the cell), and the output gate (which manages the amount of information to be extracted from the cell) [21]. LSTM skillfully maintains error propagation during back-propagation across time and layers

Figure 1 shows a typical LSTM cell, where σ (sigmoid) represents the gate activation function and φ (tanh) represents the input or output node activation. The LSTM model shown in Figure 1 is described by Equation (2).

\{\begin{array}{l} f_{t} = σ (w_{f x} x_{t} + w_{f h} h_{t - 1} + b_{f}) \\ i_{t} = σ (w_{i x} x_{t} + w_{i h} h_{t - 1} + b_{i}) \\ g_{t} = ϕ (w_{g x} x_{t} + w_{g h} h_{t - 1} + b_{g}) \\ o_{t} = σ (w_{o x} x_{t} + w_{o h} h_{t - 1} + b_{o}) \\ s_{t} = g_{t} i_{t} + s_{t - 1} f_{t} \\ h_{t} = ϕ (s_{t}) o_{t} \end{array}

(2)

where,

w_{g x}, w_{i x}, w_{f x}

and

w_{o x}

are the weights at time

t

between the input layer and the hidden layer,

w_{g h}, w_{i h}, w_{f h}

and

w_{o h}

are the weights at time

t

and

t - 1

between the hidden layers,

b_{g}, b_{i}, b_{f}

and

b_{o}

are the biases of the gates,

h_{t - 1}

is the value of the hidden layer at time

t - 1

,

f_{t}, g_{t}, i_{t}

and

o_{t}

are the output values of the forget gate, input modulation gate, input gate, output gate, respectively, and

s_{t}

and

s_{t - 1}

are the current state at time

t

and

t - 1

, respectively.

2.4. Modified Z-Score

In the study conducted on the application of neural network methods for signal analysis, the imperative of outlier identification and rectification was paramount to ensure analytical robustness and the integrity of the results. The adoption of the modified Z-score method, as explained in [22], provided a rigorous framework for this purpose. Departing from conventional norms, this method uses the median and the mean absolute deviation (MAD) as its basic estimators, which makes it more suitable for outlier detection in small samples with pseudo-normal distribution data. The formulation of the modified Z-score is mathematically expressed as:

M_{i} = \frac{0.6745 (x_{i} - \tilde{x})}{M A D}

(3)

where

\tilde{x}

denotes the median of the dataset and

M A D = m e d i a n {| x_{i} - \tilde{x} |}

. Iglewicz and Hoaglin (1993) suggested that observations are labeled outliers when

M_{i} > 3.5

through the simulation based on pseudo-normal observations for sample sizes of 10, 20, and 40 [22].

3. Proposed Method

The fault detection method proposed in this paper is rooted in a core concept: the vibration signals of normally operating machines inherently possess a regular pattern. Based on this characteristic, we can establish a time series prediction model based on signals under normal conditions. When unknown signals align with the predictions of the model, we have reason to believe that these signals also originate from the normal operating state of the machine. Conversely, if the unknown signals significantly deviate from the model’s predictions, this serves as a clear indicator that the machine may have encountered some kind of fault.

The delineation of the proposed bearing fault severity detection method is illustrated in Figure 2.

We wanted to build a time series prediction model to learn the regularity of normal vibration signals. In principle, this model could be any regression model. Due to the powerful multi-input and multi-output capabilities of LSTM, we chose LSTM as our prediction model in this study. The main hypothesis of this work is that higher discriminative power of the fault detection system can be achieved if as much information as possible is extracted from the signal and a subsequent feature selection reduces the final number of features to a reasonable amount and uses these feature sequences for training.

The input signals to the algorithm are denoised, and the exact method of denoising does not matter for the purposes of this paper. All input signals mentioned later will be denoised and will not be specified.

To construct the feature time series, the signal is first converted into multiple samples using a sliding window. Let the original signal be represented by

{x_{1}, x_{2}, . . ., x_{N}}

, where

x_{i}

is a vector of dimension

c h a n n e l

. This is represented as a 2D array of shape

(N, c h a n n e l)

in the program. The goal is to transform it into multiple signal segments, with each segment calculating a feature vector.

A sliding window of length

L

is used to sequentially extract segments from the original signal. To obtain as many samples as possible, the step size of the window is set to 1. Thus, a maximum of

N - L + 1

signal segments can be extracted from the original signal, and the number of segments to be extracted is marked as

p i e c e

. Each signal segment is of length

L

, with the kth sample (signal segment) represented as

{x_{k}, x_{k + 1}, . . ., x_{k + L - 1}}

, which in the program is a 2D array of size

(L, c h a n n e l)

. All signal segments together form a 3D array of size

(p i e c e, L, c h a n n e l)

.

For the multiple signal segments obtained, each being a matrix with length

L

and dimension

c h a n n e l

(

L

rows by

c h a n n e l

columns), a feature value can be calculated from each column of each signal segment so that a signal segment is transformed into a feature vector. Multiple signal segments are then transformed into multiple feature vectors, which are sequentially concatenated to form a high-dimensional feature time series.

As mentioned in Section 2.1, we consider 18 features (listed in Table 1) from the field of signal analysis. It is important to note that the feature calculation formulas mentioned in the paper are only applicable to one-dimensional signals. However, real signal acquisition involves multiple measurement points, with the data from each measurement point constituting a signal channel. Therefore, in practical processing, features should be extracted separately for each channel of the signal. The final feature time series will have dimensions equal to the product of the number of original signal channels and the number of features.

For the input signal, after it is converted into multiple signal segments using a sliding window, each channel of each signal segment can calculate

18 \times c h a n n e l

features, i.e., each signal segment matrix produces a feature vector with dimensions of

18 \times c h a n n e l

. Thus, each original signal (

N

rows by

c h a n n e l

columns) is transformed into a feature time series (

p i e c e

rows by

18 \times c h a n n e l

columns).

If all

18

dimensions of the feature time series are considered, the dimensionality of the data becomes too high, making it difficult for the model to converge. Therefore, it is reasonable to select a subset of features for use. Our subsequent experiments also showed that the use of filtered features accelerated the convergence speed of model training. Since the goal is to reflect the presence of faults through prediction accuracy, the appropriate features should show significant differences between the feature time series generated by normal signals and those generated by fault signals. Assuming that we have normal signals and some known “reference fault signals” from faulty machines, we extracted all possible features from both normal and fault signals and calculated the similarity for each corresponding feature one by one. The feature with the lowest similarity is the appropriate feature.

Similarity selection uses the Dynamic Time Warping (DTW) algorithm to calculate the DTW score for each column between two feature matrices. It is worth noting that for a given feature (such as the mean), since a sequence is calculated on each channel (assuming there are three channels, there would be chn1_Mean, chn2_Mean, and chn3_Mean), each channel will have a corresponding DTW score. We take the sum of the scores from each channel as the final score for that feature.

Sorting the features in descending order of score gives the order in which the features should be used. In general, selecting the first feature is sufficient to discriminate between normal and erroneous signals.

It should be pointed out that using the DTW score to measure similarity and determine the most suitable features is a relative standard, not an absolute one. It identifies which features are more or less suitable compared to others. If multiple features exhibit high DTW scores, they are all considered appropriate, and the choice should then be based on which feature contributes to better model convergence. Conversely, if all features show low DTW scores, this indicates that the normal and fault signals are very similar in their data representations, suggesting that our method may not be applicable in such scenarios. However, such cases are exceptionally rare.

Assuming that only one feature that has the highest DTW score is selected, the next step is to construct a feature sequence sample of normal signals for the sequence prediction model to learn the temporal information of this feature sample. Here, LSTM is used as the sequence prediction model.

Again, a sliding window is applied to extract feature sequence sample matrices from the feature matrix of normal signals to train the LSTM model. The LSTM model aims to predict the values of the next

p

points (feature vectors) based on the previous

m

points (feature vectors) in the sequence. Therefore, the length of the sliding window, marked as

f L

, is set to

f L = m + p

. The sliding step size remains at 1. The training dataset must contain at least

f p i e c e

samples; therefore, the length of the feature time series,

p i e c e

, must be greater than or equal to

f L + f p i e c e - 1

.

Each feature sequence sample is a matrix of

f L

rows by

c h a n n e l

columns (if two features are selected, it would be

c h a n n e l \times 2

columns), with the first

m

rows by

c h a n n e l

columns as input and the subsequent

p

rows by

c h a n n e l

columns as output for that sample. Training is performed with a total of

f p i e c e

samples.

After training is complete, the feature sequence samples of normal signals used for training are fed into the model, yielding a set of normal MAEs that represent the distribution of MAE generated by normal signals after prediction. The same method is applied to other normal signals and the MAEs obtained should conform to this distribution. The MAEs generated by abnormal signals do not conform to this distribution, i.e., the MAEs of abnormal signals are considered outliers compared to the normal MAEs.

The modified Z-score is widely regarded as an efficient method for outlier detection, particularly when it comes to handling non-normal data distributions [22]. Experiments indicate that the normal MAEs follow a non-normal distribution, making the modified Z-score suitable for our method. However, our goal is not to identify outliers within a data set but rather to determine whether a set of unknown MAEs fits the distribution of another set of MAEs. Therefore, we need to adapt the modified Z-score method by calculating a threshold determined by the normal MAEs. By comparing the unknown MAEs to this threshold, we can determine whether they are outliers.

The original definition of the modified Z-score has already been described in Section 2.4. A feasible criterion for being classified as an outlier is

|M_{i}| = |\frac{0.6745 (x_{i} - \tilde{x})}{M A D}| > 3.5

, which is equivalent to

|x_{i} - \tilde{x}| > \frac{3.5 M A D}{0.6745}

, where

x_{i}

is the unknown MAE to be judged and

\tilde{x}

is the median of the normal MAEs. Considering that only an excessively large unknown MAE is deemed a fault, it can be asserted that

x_{i} > \tilde{x}

. Therefore, the threshold given by the modified Z-score is:

t h r e s h o l d = \tilde{x} + \frac{3.5 M A D}{0.6745}

where

\tilde{x}

is the median of the normal MAEs and MAD is the median absolute deviation of the normal MAEs, calculated as

M A D = \underset{x_{i} \in normal MAE}{median} {| x_{i} - \tilde{x} |}

.

For the unknown signals to be tested, feature sequences are extracted in the same way as in the training phase and fed into the time series prediction model to obtain the corresponding MAE. In effect, each fault signal is divided into several samples, resulting in a set of MAEs. This set of MAEs is compared to the threshold provided by the modified Z-score. If the average of these MAEs exceeds the threshold, or if the proportion of MAEs exceeding the threshold reaches a certain level (e.g., 50%), then the unknown signal is said to be abnormal. Otherwise, it is considered normal.

For the analysis of run-to-failure data collected from continuously operating machinery, the initial sampling serves as the reference normal signals. The reference fault signal can be filled with a known fault signal under similar operating conditions. Each sampled signal is diagnosed in the aforementioned manner, and the first sample identified as abnormal is considered the point of failure occurrence, thus determining the result of fault detection.

4. Experiment Validation

This section presents the dataset used in the experiments and validates the effectiveness of the proposed method through experimental evaluations.

4.1. Case Western Reserve University Bearing Data (CWRU)

4.1.1. Description

The bearing fault dataset provided by the Case Western Reserve University (CWRU) Bearing Data Center [23] focuses on the fault diagnosis of engine bearings and provides rich laboratory-level fault diagnosis data by simulating various types of bearing defects, such as inner race defects, outer race defects, and rolling element defects, serving as an important resource in the field of bearing fault analysis.

The data used in this research came from two bearings installed in an engine-driven mechanical system, one at the drive end of the engine and the other at the fan end. In both bearings, three types of defects (outer race, inner race, and ball defects) were introduced using electro-discharge machining with different defect diameters. In the case of the outer ring defects, tests were carried out on both fan and drive end bearings with outer ring defects located at 3 o’clock (directly in the load zone), 6 o’clock (perpendicular to the load zone), and 12 o’clock. Each bearing was tested at four different loads, 0, 1, 2, and 3 hp.

After collecting data with a 16-channel DAT recorder, they were processed in the MATLAB environment, and all data files were saved in MATLAB (.mat) format. Each file includes one or more recordings of DE (Drive-end), FE (Fan-end), and BA (Normal-baseline) acceleration data, collected at sampling frequencies of 12 kHz and 48 kHz. In our experiments, only DE and FE data were used, with them being treated as two channels of a test dataset.

A more detailed description of the experimental setup and the equipment involved can be found on the Case Western Reserve University website [23].

4.1.2. Result

This dataset provides signals under different operating conditions and fault situations, with each signal measured at a constant fault level. In order to verify the effectiveness of the method proposed in this paper in diagnosing faults, the normal signal with a motor load of 0 (approx. motor speed 1797 rpm) numbered 97 and signals numbered 109–262 in different fault situations from the drive side with the same motor load and sampling frequency of 48 k were used, as listed in Table 2.

The important hyperparameter settings in the algorithm are shown in Table 3.

Taking signal 97 as the normal signal and 226 as the reference fault signal, feature sequences are extracted separately. The DTW scores are calculated for each column of the feature sequences and ranked in descending order. The higher the ranking, the greater the difference between the feature in the normal and fault signals, making it suitable for further processing. It is important to note that these scores are only indicative and that other factors, such as the convergence of the prediction model, should also be considered when determining the actual features to be used.

The top 10 features are shown in Table 4. Ultimately, the mean value (Mean) would be selected as the feature used in the prediction model. Since 226 represents the highest level of ball failure, this score ranking implies that “Mean” is suitable for use in this method to detect different levels of ball failure, with this score ranking implying that the mean value feature is suitable for detecting different degrees of faults in the algorithm proposed in this paper.

We used the Mean feature sequence extracted from normal signals to train an LSTM as a time series prediction model. After obtaining the model, for each type of fault signal, we also extracted the Mean feature sequence, fed it into the LSTM to obtain the corresponding MAE (Mean Absolute Error), and compared it with the MAE distribution of normal signals and the calculated threshold, plotting the comparison in a histogram as in Figure 3.

In Figure 3, the blue and green sections represent the distributions of the MAE (Mean Absolute Error) for normal and test signals, respectively, as determined by the algorithm. The red dashed line indicates the threshold calculated on the basis of the MAE of normal signals, where MAEs exceeding this threshold are considered anomalies, indicating that the signals from which these MAEs arise are abnormal compared to normal signals. The black dashed line represents the average MAE of the test signals, with values above the threshold used as the criterion for declaring an error.

It can be seen from the graph that the MAE distribution of the signals from each fault level is generally much higher than that of the normal signals and, of course, exceeds the threshold set based on the Z-score. Although only the highest fault level, signal 226, was used during training, signals 122 and 189 are completely unknown to the algorithm, yet it can still accurately detect faults.

The same approach was used for inner ring and outer ring defects in three different orientations, with the results shown in Table 5 and Figure 4.

Based on the results of the above experiments, it can be seen that our algorithm can effectively distinguish between normal signals and signals with different degrees of faults. However, it is worth noting that approximately 6% of the Mean Absolute Error (MAE) for the 250th fault signal falls below the threshold. However, considering that the average MAE is above the threshold, it can still be said that the algorithm has detected the fault in this signal.

4.2. Dataset of Intelligent Maintenance Systems (IMS), University of Cincinnati

4.2.1. Description

The bearing fault dataset from the Intelligent Maintenance Systems (IMS) Center at the University of Cincinnati, was collected on an endurance test rig of the University of Cincinnati and released in 2014 [24]. It collects comprehensive data on bearing performance under various operating conditions, which is widely used in bearing fault prediction and health management research. The test rig (shown in Figure 5) has the following characteristics:

4 double-row bearings;
2000 rpm stationary speed;
6000 lbs load applied onto the shaft and bearing by a spring mechanism;
high-sensitivity Quart ICP accelerometers.

An AC motor, coupled with a rub belt, keeps the rotation speed constant. The four bearings are in the same shaft and are forced lubricated by a circulation system that regulates the flow and temperature.

Three datasets are provided on the downloaded file, composed of numerous files of one second each. Each file is made of 20,480 samples. Although it is mentioned that the sampling frequency is 20 kHz, it is thus believed that it was actually 20.48 kHz. A one-second acquisition is made every 5 (for the first 54 acquisitions) or 10 min, but it is sometimes subjected to a series of interruptions that make the time history not continuous (Figure 6).

The trial was stopped in the conventional way when the build-up of debris on a magnetic plug exceeded a certain level, indicating the possibility of imminent failure. The resulting life was 49,680 min (i.e., 34 days and 12 h), exceeding the bearing’s design life of more than 100 million revolutions. It should be emphasized that this data set is particularly valuable because bearing degradation was allowed to develop naturally and was not artificially induced, as is often carried out to speed up the experimental test.

4.2.2. Results

IMS is a full lifecycle experiment dataset with signals collected from machines operating normally until they stop due to a fault, during which time the fault level gradually increases. With signals from a full lifecycle experiment, it is not possible to make a binary diagnosis of normal or fault as with the CWRU data. Instead, we hoped to calculate the critical point at which the mechanical state deteriorates from normal to fault. The implication is that in real industrial processes when this critical point is reached, the machinery should be shut down for maintenance to prevent more serious damage.

In the IMS dataset experiment, several signal samples are taken during a test, resulting in several segments of vibration signals. The above algorithm is applied to each signal segment in turn, resulting in a corresponding set of MAEs. When the proportion of MAEs above the threshold reaches a certain level, for example, 50% of the sample MAEs above the threshold, the signal segment is considered abnormal. The acquisition time of this signal segment is then identified as the critical point of failure. To demonstrate the reliability of our algorithm, our experimental results will be compared with existing studies on the IMS dataset.

Gousseau et al. [25] utilized the dataset provided by the Center for Intelligent Maintenance Systems (IMS) at the University of Cincinnati, using a comprehensive suite of signal processing techniques such as time-domain analysis, spectral analysis, blind deconvolution, spectral coherence, and envelope spectrum for in-depth diagnostics and prognosis of vibrations in rolling element bearings. Their analysis successfully diagnosed inner race damage in bearing 3 and ball damage in bearing 4 within dataset 1, as well as outer race damage in bearing 1 in dataset 2, while bearing 3 failed to reveal the expected fault characteristics. For dataset 1’s bearing 3, time–frequency analysis detected damage from 33.8 days, envelope spectrum analysis detected it slightly earlier at 32 days and spectral coherence analysis detected it earliest at 29.2 days. For dataset 1’s bearing 4, time–frequency analysis detected damage at 18 days, envelope spectrum analysis showed signs of damage at 25 days, and spectral correlation analysis confirmed the presence of damage at 23 days. In test 2, all methods (time–frequency analysis, envelope spectrum, and spectral correlation) accurately diagnosed damage to the outer ring of bearing 1 at 3.5 days.

Research by Sacerdoti and colleagues [26] involved the use of the first test data from the IMS dataset to evaluate various signal-processing techniques for detecting and locating bearing faults. By comparing the kurtosis values of the diagnostic techniques, they found that Cepstrum Pre-Whitening (CPW) and Improved Envelope Spectrum (IES) performed best in locating and identifying faults. Fault detection focused on bearings 3 and 4, with an inner race fault in bearing 3 detected between the 33rd and 34th day of the experiment while rolling element and outer race faults in bearing 4 were detected from the 26th day.

In order to compare the results with the two aforementioned studies, our experiment uses only dataset 1, excluding bearings 1 and 2, which had no faults, and focusing only on the data from the four channels corresponding to bearings 3 and 4 (as perforemed in [22]). Table 6 presents a comparison of the fault detection results for bearings 3 and 4 from the literature [25,26].

Considering that the machine is certainly normal at the start of operation and that the fault level increases during operation, the signals obtained from the first few samples are considered normal signals, and the signal from the last sample is used as a reference fault signal for validation. Given the long duration of the experiment and the large amount of data, considering only the first normal signal could lead to overfitting. Therefore, during feature selection, the first four signal segments are all considered normal signals, with DTW scores calculated and summed for each. The hyperparameter

f p i e c e

is set to 100, with other settings consistent with Section 4.2.1 of the CWRU data experiment.

The results of the experiment are shown in Figure 7.

When more than 50% of the MAE calculated for a signal segment exceeds the threshold based on the Z-score, it is considered that a fault has occurred. Using the method proposed in this paper, the fault occurrence date for bearing 3 was determined to be 22 November, corresponding to the 32nd day after the start of the experiment, and the fault occurrence date for bearing 4 was determined to be 14 November, corresponding to the 24th day after the start of the experiment. These results are 1 to 2 days earlier than those reported in the literature [26] and shorter than a day different from the envelope spectrum analysis results in [25], demonstrating that the method proposed in this paper can indeed effectively detect faults.

4.3. Xi’an Jiaotong University Bearing Fault Dataset (XJTU-SY)

4.3.1. Description

The XJTU-SY dataset, provided by the Institute of Design Science and Basic Component of Xi’an Jiaotong University (XJTU) and Changxing Sumyoung Technology Co., Ltd. (SY), Changxing, China [27], is a bearing fault dataset containing complete run-to-failure data of 15 rolling element bearings. The XJTU-SY bearing dataset was obtained by conducting many accelerated degradation experiments, which is designed to provide more detailed and comprehensive fault mode analysis data to advance fault diagnosis technologies [28].

The bearing testbed includes several key components as shown in Figure 8. This test rig is designed to perform the accelerated degradation tests of rolling element bearings under different operating conditions (i.e., different radial forces and speeds). The radial force is generated by the hydraulic loading system and applied to the housing of the tested bearings, and the speed is set and maintained by the speed controller of the AC induction motor.

The dataset used in this study was based on bearings. The relevant parameters of these bearings are given in Table 7. To comprehensively evaluate their performance, the experiments were structured around three different operating conditions, as shown in Table 8. For each operating condition, several sets of vibration signals were meticulously collected throughout the life of the bearings. Throughout the experiments, two unidirectional accelerometers were attached to the test bearings using magnetic mounts to ensure secure attachment in both horizontal and vertical orientations. A portable dynamic signal acquisition system was used to record the vibration signals. The sampling frequency was carefully set at 25.6 kHz with a 1 min sampling interval. Each individual sampling session lasted 1.28 s. A detailed interpretation of the dataset can be found in [27].

To obtain a complete time series, the data from each sample are concatenated, resulting in run-to-failure vibration signals. The first two signals are shown in Figure 9. The label Bearing 1_2 means that the data are from the second test in working condition 1, and so on.

4.3.2. Results

XJTU-SY is another open-source dataset that covers the entire life of the experimental data. The experimental methodology used for the XJTU-SY dataset is exactly the same as that used for the IMS dataset. Three experiments are considered:

(1) Using the data from the first three experiments under operating condition 1 and given that these experiments had the same type of failure, the data from the third experiment (Bearing1_3) are used for feature selection, while the other two experiments are used for testing.

After computation, the most suitable feature was found to be Min with a DTW score of 7.98. The experimental results for Bearing1_1 and Bearing1_2 are shown in Figure 10.

From the graph, it can be seen that Bearing1_1 showed abnormalities as early as the 9th sampling, with the severity of the abnormalities fluctuating initially. By the 73rd sampling, the proportion of abnormal MAE reached 1, at which point it can be considered a complete failure. Bearing1_2 showed abnormalities by the 14th sampling and quickly progressed to complete failure.

(2) Using the data from three outer ring failure experiments under Operating Condition 2, namely Bearing2_2, Bearing2_4, and Bearing2_5, feature selection was performed on Bearing2_5, while the other two were used for testing.

After calculation, the most suitable feature was found to be Meanf with a DTW value of 8.33. The experimental results for Bearing2_2 and Bearing2_4 are shown in Figure 11.

The graph shows that Bearing2_2 showed abnormalities by the 48th sampling, whereas Bearing2_4 showed abnormalities by the 15th sampling and quickly progressed to complete failure.

(3) We used Bearing1_3 (with an outer ring defect) for feature selection and Bearing1_5 (with both inner and outer ring defects) for testing and used Bearing3_3 (with an inner ring defect) for feature selection and Bearing3_2 (with inner ring, rolling element, cage and outer ring defects) for testing. The purpose of this experiment is to demonstrate that the algorithm proposed in this paper remains effective for data with mixed defect types.

The selected features and the results of anomaly detection on the test data are shown in Figure 12.

Bearing1_5 is considered to have developed a fault at the 34th sampling; Bearing3_2 first showed an abnormal MAE ratio near 50% at the 497th sampling, which could be considered a false alarm from the perspective of the whole experiment. This is because the abnormal MAE ratio remained relatively low for a long time after the 1120th sampling and did not rise to 50% until the 1961st sampling.

4.4. Results Analysis

Validation using three different datasets shows that the algorithm proposed in this paper can effectively detect anomalies in vibration signals and has the following advantages:

(1): Experiments on the CWRU dataset show that the method can perform fault detection tasks with a small number of samples. Although it uses only 0.021 inch diameter defect data for training, it can detect 0.007 and 0.014 inch diameter defects. This feature is particularly suited to real-world industrial environments. In many practical application scenarios, the acquisition of a large amount of labeled data is often costly and time-consuming. Therefore, an algorithm that can effectively use a limited number of samples for accurate detection is of significant practical value. In addition, this method is versatile and can detect a variety of defects, including inner ring defects, outer ring defects, and rolling element defects, all using the same approach.
(2): Experiments on the IMS dataset demonstrate that this method can be used to monitor the health of rotating machinery and provide timely warnings when the machine exhibits abnormalities. Compared to existing work, our method does not use complex signal analysis techniques, nor does it require manual identification of characteristic lines in spectrograms, yet it detects faults at times similar to or earlier than existing methods. This suggests that the method is more suitable for unsupervised scenarios.
(3): The experiments conducted on the XJTU-SY dataset confirm the conclusions of the previous two experiments. In experiments (1) and (2) on the XJTU-SY dataset, the data used for testing and the data used for feature selection were not the same, which once again demonstrates that this method can perform defect detection tasks with a small number of samples. Furthermore, despite the different operating conditions in (1) and (2), the method proposed in this paper was able to detect the occurrence of faults, demonstrating its universality. Experiment three (3) shows that this method remains effective even when the fault mode of the unknown signal is a mixture of several faults.

5. Conclusions

This paper presents a novel method for fault detection based on engine vibration signals. The method involves extracting time and frequency domain features from normal signals to build an LSTM time series prediction model. Prediction accuracy is used as a continuous indicator for unknown signals to indicate the presence of faults and the effectiveness of the method is validated by experiment.

The method for rotating machinery fault detection and severity assessment presented in this study offers significant implications for aerospace applications. By providing sensitive and early detection of faults, this method can be effectively applied to rotating machines such as aircraft engines and helicopter rotors. Its robustness and early-warning capabilities will enhance aerospace equipment’s reliability, particularly given the challenging environments.

The advantages of this method are as follows:

(1): Overcoming the limitations of traditional strategies that categorize fault severity in a discrete manner, this method uses a continuous indicator to indicate fault severity. As a result, it provides greater flexibility in setting thresholds to differentiate defects.
(2): Thanks to a well-structured algorithmic framework, this method is highly scalable and adaptable. In the feature selection phase, in addition to the 18 features preset in this paper, other new features can be included. In addition to the DTW algorithm used in this paper, other similarity measurement algorithms can be used. During the training phase, other time series prediction models can be used instead of the LSTM network used in this paper. Furthermore, algorithms other than Z-score can be used for outlier detection.
(3): Only a small amount of fault data were used during training (particularly in the feature selection process), yet the method can still effectively detect faults in unknown signals. Given that fault data collection in real-world engineering environments can be challenging, and typically only a small amount of fault data can be obtained, this method is expected to perform well in real-world production scenarios with limited samples.

The limitations of this work include:

(1): Training the deep learning model requires a large number of feature time series samples, and obtaining these feature samples requires slicing of the original signals. Therefore, implementing this method requires a sufficient amount of original data, and it may consume significant memory resources during computation.
(2): Manual feature selection is an insufficient approach for many applications. If features could be extracted adaptively using neural network methods, it would save time and resources while increasing generality.
(3): The process of feature selection and thresholding lacks standardized criteria and relies on the subjective decisions of researchers. There is a need for more general and robust methods for adaptive decision-making.

Author Contributions

Conceptualization, W.Z., Z.S. and H.W.; Methodology, W.Z. and H.W.; Software, W.Z.; Validation, W.Z.; Resources, Z.S. and H.W.; Data curation, W.Z., D.L. and Y.Z.; Writing—original draft, W.Z.; Writing—review & editing, H.W.; Visualization, W.Z.; Supervision, H.W.; Project administration, Z.S.; Funding acquisition, H.W. and R.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Science and Technology Major Project, China, under Grant J2019-I-0019-0001 and Grant J2019-I-0001-0018.

Data Availability Statement

The original data presented in the study are openly available. The CWRU dataset is available on the Case Western Reserve University Bearing Data Center Website at https://engineering.case.edu/bearingdatacenter (accessed on 1 May 2024). The IMS dataset is available as part of the NASA Bearing Dataset at https://www.kaggle.com/datasets/vinayak123tyagi/bearing-dataset (accessed on 1 May 2024). The XJTU-SY dataset is available as part of the XJTU-SY Bearing Datasets at https://biaowang.tech/xjtu-sy-bearing-datasets (accessed on 1 May 2024).

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to have influenced the work reported in this paper.

References

Glowacz, A. Fault diagnosis of single-phase induction motor based on acoustic signals. Mech. Syst. Signal Process. 2019, 117, 65–80. [Google Scholar] [CrossRef]
Królczyk, G.; Królczyk, J.; Legutko, S.; Hunjet, A. Effect of the Disc Processing Technology on the Vibration Level of the Chipper during Operations. Tehnicki Vjesnik-Technical Gazette. April 2014. Available online: https://www.semanticscholar.org/paper/Effect-of-the-disc-processing-technology-on-the-of-Kr%C3%B3lczyk-Kr%C3%B3lczyk/bb3100ef1c5c6d4ddc99f5927f399de1b5f65c0e (accessed on 16 June 2024).
Montalvo, C.; Gavilán-Moreno, C.; García-Berrocal, A. Cofrentes nuclear power plant instability analysis using ensemble empirical mode decomposition (EEMD). Ann. Nucl. Energy 2017, 101, 390–396. [Google Scholar] [CrossRef]
Li, Z.X.; Jiang, Y.; Hu, C.; Peng, Z. Recent progress on decoupling diagnosis of hybrid failures in gear transmission systems using vibration sensor signal: A review. Measurement 2016, 90, 4–19. [Google Scholar] [CrossRef]
Cerrada, M.; Sánchez, R.-V.; Li, C.; Pacheco, F.; Cabrera, D.; de Oliveira, J.V.; Vásquez, R.E. A review on data-driven fault severity assessment in rolling bearings. Mech. Syst. Signal Process. 2018, 99, 169–196. [Google Scholar] [CrossRef]
Bujoreanu, C.; Monoranu, R.; Olaru, N.D. Study on the Defects Size of Ball Bearings Elements Using Vibration Analysis. Appl. Mech. Mater. 2014, 658, 289–294. [Google Scholar] [CrossRef]
Singh, M.; Kumar, R. Thrust bearing groove race defect measurement by wavelet decomposition of pre-processed vibration signal. Measurement 2013, 46, 3508–3515. [Google Scholar] [CrossRef]
Zhao, S.; Liang, L.; Xu, G.; Wang, J.; Zhang, W. Quantitative diagnosis of a spall-like fault of a rolling element bearing by empirical mode decomposition and the approximate entropy method. Mech. Syst. Signal Process. 2013, 40, 154–177. [Google Scholar] [CrossRef]
Pan, H.; He, X.; Tang, S.; Meng, F. An Improved Bearing Fault Diagnosis Method using One-Dimensional CNN and LSTM. Stroj. Vestn.—J. Mech. Eng. 2018, 64, 443–452. [Google Scholar] [CrossRef]
Guo, X.; Shen, C.; Chen, L. Deep Fault Recognizer: An Integrated Model to Denoise and Extract Features for Fault Diagnosis in Rotating Machinery. Appl. Sci. 2017, 7, 41. [Google Scholar] [CrossRef]
Shao, H.; Jiang, H.; Lin, Y.; Li, X. A novel method for intelligent fault diagnosis of rolling bearings using ensemble deep auto-encoders. Mech. Syst. Signal Process. 2018, 102, 278–297. [Google Scholar] [CrossRef]
Shen, C.; Hu, F.; Liu, F.; Zhang, A.; Kong, F. Quantitative recognition of rolling element bearing fault through an intelligent model based on support vector regression. In Proceedings of the 2013 Fourth International Conference on Intelligent Control and Information Processing (ICICIP), Beijing, China, 9–11 June 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 842–847. [Google Scholar] [CrossRef]
Lei, Y.; He, Z.; Zi, Y. EEMD method and WNN for fault diagnosis of locomotive roller bearings. Expert Syst. Appl. 2011, 38, 7334–7341. [Google Scholar] [CrossRef]
Smith, A.J.; Powell, K.M. Fault Detection on Big Data: A Novel Algorithm for Clustering Big Data to Detect and Diagnose Faults. IFAC-PapersOnLine 2019, 52, 328–333. [Google Scholar] [CrossRef]
Chang, Y.-J.; Hsu, H.-K.; Hsu, T.-H.; Chen, T.-T.; Hwang, P.-W. The Optimization of a Model for Predicting the Remaining Useful Life and Fault Diagnosis of Landing Gear. Aerospace 2023, 10, 963. [Google Scholar] [CrossRef]
Rauber, T.W.; Boldt, F.d.A.; Varejao, F.M. Heterogeneous Feature Models and Feature Selection Applied to Bearing Fault Diagnosis. IEEE Trans. Ind. Electron. 2015, 62, 637–646. [Google Scholar] [CrossRef]
Keogh, E.; Ratanamahatana, C.A. Exact indexing of dynamic time warping. Knowl. Inf. Syst. 2005, 7, 358–386. [Google Scholar] [CrossRef]
Wang, G.; Wang, T.; Chen, J.; Zhao, S. Bearing fault feature selection method based on dynamic time warped related searches. J. Vibroeng. 2023, 25, 311–324. [Google Scholar] [CrossRef]
Chen, J.; Wang, D. Long short-term memory for speaker generalization in supervised speech separation. J. Acoust. Soc. Am. 2017, 141, 4705–4714. [Google Scholar] [CrossRef] [PubMed]
Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
Sabir, R.; Rosato, D.; Hartmann, S.; Guehmann, C. LSTM Based Bearing Fault Diagnosis of Electrical Machines using Motor Current Signal. In Proceedings of the 2019 18th IEEE International Conference on Machine Learning And Applications (ICMLA), Boca Raton, FL, USA, 16–19 December 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 613–618. [Google Scholar] [CrossRef]
Kaliyaperumal, S.M.K.; Arumugam, S. Labeling Methods for Identifying Outliers. Int. J. Stat. Syst. 2015, 10, 231–238. [Google Scholar]
Welcome to the Case Western Reserve University Bearing Data Center Website|Case School of Engineering|Case Western Reserve University. Case School of Engineering. Available online: https://engineering.case.edu/bearingdatacenter/welcome (accessed on 12 March 2024).
NASA Bearing Dataset. Available online: https://www.kaggle.com/datasets/vinayak123tyagi/bearing-dataset (accessed on 12 March 2024).
Lei, Y.; Han, T.; Wang, B.; Li, N.; Yan, T.; Yang, J. XJTU-SY Rolling Element Bearing Accelerated Life Test Datasets: A Tutorial. J. Mech. Eng. 2019, 55, 1. [Google Scholar] [CrossRef]
Gousseau, W.; Antoni, J.; Girardin, F.; Griffaton, J. Analysis of the Rolling Element Bearing Data Set of the Center for Intelligent Maintenance Systems of the University of Cincinnati. CM2016, Charenton, France. October 2016. Available online: https://hal.science/hal-01715193 (accessed on 11 May 2024).
Wang, B.; Lei, Y.; Li, N.; Li, N. A Hybrid Prognostics Approach for Estimating Remaining Useful Life of Rolling Element Bearings. IEEE Trans. Reliab. 2020, 69, 401–412. [Google Scholar] [CrossRef]
Sacerdoti, D.; Strozzi, M.; Secchi, C. A Comparison of Signal Analysis Techniques for the Diagnostics of the IMS Rolling Element Bearing Dataset. Appl. Sci. 2023, 13, 5977. [Google Scholar] [CrossRef]

Figure 1. LSTM network structure based on [21].

Figure 2. Flowchart of the proposed method.

Figure 3. Histograms of MAE distributions of ball defect signals. (a) MAE distribution of signal 122, where the defect is on the 0.007″ diameter ball; (b) MAE distribution of signal 189, where the defect is on the 0.014″ diameter ball; (c) MAE distribution of signal 226, where the defect is on the 0.021″ diameter ball.

Figure 4. MAE distribution histograms of signals in inner race faults and outer race faults: (a) signal 109, inner race fault with a diameter of 0.007″; (b) signal 213, inner race fault with a diameter of 0.021″; (c) signal 135, outer race (position 6:00) fault with a diameter of 0.007″; (d) signal 201, outer race (position 6:00) fault with a diameter of 0.014″; (e) signal 238, outer race (position 6:00) fault with a diameter of 0.021″; (f) signal 148, outer race (position 9:00) fault with a diameter of 0.007″; (g) signal 250, outer race (position 9:00) fault with a diameter of 0.021″; (h) signal 161, outer race (position 12:00) fault with a diameter of 0.007″; (i) signal 262, outer race (position 12:00) fault with a diameter of 0.021″.

Figure 5. Bearing test rig and sensor placement illustration [25].

Figure 6. Raw signal and its actual time history for IMS dataset 1, the first channel of bearing 3.

Figure 7. Line charts of the proportion of MAE exceeding the threshold for IMS dataset 1st_test. (a) Line chart for Bearing 3, with feature SV, indicating a fault detected at 11/22 23:26 (32nd day) (b) Line chart for Bearing 4, with feature MF, indicating a fault detected at 11/14 15:32 (24th day).

Figure 8. Bearing accelerated life test bench.

Figure 9. The run-to-failure signals for the first two trials in test condition 1. (a) Description of the signal in the first trial in working condition 1; (b) description of the signal in the second trial in working condition 1 [27].

Figure 10. Line charts of the proportion of MAE exceeding the threshold for XJTU-SY dataset Bearing1. (a) Line chart for Bearing1_1, indicating a fault detected in the 9th sampling. (b) Line chart for Bearing1_2, indicating a fault detected in the 34th sampling.

Figure 11. Line charts of the proportion of MAE exceeding the threshold for XJTU-SY dataset Bearing2. (a) Line chart for Bearing2_2, indicating a fault detected in the 48th sampling. (b) Line chart for Bearing2_4, indicating a fault detected in the 15th sampling.

Figure 12. Line charts of the proportion of MAE exceeding the threshold for XJTU-SY dataset Bearing1_5 and Bearing3_2. (a) Line chart for Bearing1_5 with feature Min, indicating a fault detected in the 34th sampling. (b) Line chart for Bearing2_4 with feature PPV, indicating a fault detected in the 1961st sampling.

Table 1. Time domain features [12,16].

Feature	Definition	Domain	Feature	Definition	Domain
RMS	$X_{r m s} = {(\frac{1}{N} \sum_{i = 1}^{N} x_{i}^{2})}^{\frac{1}{2}}$	Time	CF	$X_{c f} = \max (\| x_{i} \|) / X_{r m s}$	Time
SRA	$X_{s r a} = {(\frac{1}{N} \sum_{i = 1}^{N} \sqrt{\| x_{i} \|})}^{2}$	Time	IF	$X_{i f} = \frac{\max (\| x_{i} \|)}{\frac{1}{N} \sum_{i = 1}^{N} \| x_{i} \|}$	Time
KV	$X_{k v} = \frac{1}{N} {(\sum_{i = 1}^{N} \frac{x_{i} - \bar{x}}{σ})}^{4}$	Time	MF	$X_{m f} = \max (\| x_{i} \|) / X_{s r a}$	Time
SV	$X_{s v} = \frac{1}{N} {(\sum_{i = 1}^{N} \frac{x_{i} - \bar{x}}{σ})}^{3}$	Time	SF	$X_{s f} = \frac{X_{r m s}}{\frac{1}{N} \sum_{i = 1}^{N} \| x_{i} \|}$	Time
PPV	$X_{p p v} = \max (x_{i}) - \min (x_{i})$	Time	KF	$X_{k f} = \frac{X_{k v}}{{(\frac{1}{N} \sum_{i = 1}^{N} x_{i}^{2})}^{2}}$	Time
FC	$X_{f c} = \frac{1}{N} \sum_{i = 1}^{N} f_{i}$	Frequency	RVF	$X_{r v f} = {(\frac{1}{N} \sum_{i = 1}^{N} {(f_{i} - X_{f c})}^{2})}^{\frac{1}{2}}$	Frequency
RMSF	$X_{r m s f} = {(\frac{1}{N} \sum_{i = 1}^{N} f_{i}^{2})}^{\frac{1}{2}}$	Frequency	Mean	$\bar{x} = \frac{1}{N} \sum_{i = 1}^{N} x_{i}$	Statistical
Var	$S = \frac{1}{N} \sum_{i = 1}^{N} {(x_{i} - \bar{x})}^{2}$	Statistical	Std	$σ = \sqrt{S}$	Statistical
Max	$\max (x_{i})$	Statistical	Min	$\min (x_{i})$	Statistical

Table 2. 48 k drive end bearing fault data.

Fault Diameter	Inner Race	Ball	Outer Race (Position 6:00)	Outer Race (Position 9:00)	Outer Race (Position 12:00)
0.007″	109	122	135	148	161
0.014″	*	189	201	*	*
0.021″	213	226	238	250	262

* No data available for this case

Table 3. Hyperparameter setting.

Parameter	Value	Parameter	Value
L	2048	m	100
piece	22,000	p	10
fL	110	fpiece	200

Table 4. DTW scores of different features.

Feature	DTW Score	Feature	DTW Score
Mean	29.77	Var	17.63
RVF	19.87	KF	17.51
RMSF	19.70	Min	17.31
SRA	18.11	FC	17.09
RMS	17.85	MF	16.99

Table 5. CWRU failure signals test result.

Fault Type	Inner Race	Outer Race (Position 6:00)	Outer Race (Position 9:00)	Outer Race (Position 12:00)
Reference fault signal	213	238	250	262
Best feature	KV	Min	Mean	Mean
DTW score of the best feature	17.67	18.50	17.07	16.11

Table 6. Comparison of bearing fault detection methods.

Study	Bearing 3	Bearing 4
[25]	Time–frequency analysis: 33.8 days (11/23) Envelope spectrum analysis: 32 days (11/22) Spectral coherence analysis: 29.2 days (11/19)	Time–frequency analysis: 18 days (11/8) Envelope spectrum analysis: 25 days (11/15) Spectral coherence analysis: 23 days (11/13)
[26]	Last part of the 33rd day (11/23) and the 34th day (11/24)	From the 26th (11/16) day until the end

Table 7. LDK UER204 bearing parameters [27].

Parameter	Value	Parameter	Value
Inner race diameter (mm)	29.30	Ball diameter (mm)	7.92
Outer race diameter (mm)	39.80	Number of balls	8
Bearing median diameter (mm)	34.55	Contact angle (°)	0
Basic rated dynamic load (N)	12,820	Basic rated static load (N)	6.65

Table 8. Bearing accelerated life test conditions [27].

Number	1	2	3
Rotational Speed	2100	2250	2400
Radial Force	12	11	10

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, W.; Sun, Z.; Lv, D.; Zuo, Y.; Wang, H.; Zhang, R. A Time Series Prediction-Based Method for Rotating Machinery Detection and Severity Assessment. Aerospace 2024, 11, 537. https://doi.org/10.3390/aerospace11070537

AMA Style

Zhang W, Sun Z, Lv D, Zuo Y, Wang H, Zhang R. A Time Series Prediction-Based Method for Rotating Machinery Detection and Severity Assessment. Aerospace. 2024; 11(7):537. https://doi.org/10.3390/aerospace11070537

Chicago/Turabian Style

Zhang, Weirui, Zeru Sun, Dongxu Lv, Yanfei Zuo, Haihui Wang, and Rui Zhang. 2024. "A Time Series Prediction-Based Method for Rotating Machinery Detection and Severity Assessment" Aerospace 11, no. 7: 537. https://doi.org/10.3390/aerospace11070537

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Time Series Prediction-Based Method for Rotating Machinery Detection and Severity Assessment

Abstract

1. Introduction

2. Theoretical Fundamental

2.1. Feature Extraction

2.2. Dynamic Time Warping

2.3. Long Short-Term Memory

2.4. Modified Z-Score

3. Proposed Method

4. Experiment Validation

4.1. Case Western Reserve University Bearing Data (CWRU)

4.1.1. Description

4.1.2. Result

4.2. Dataset of Intelligent Maintenance Systems (IMS), University of Cincinnati

4.2.1. Description

4.2.2. Results

4.3. Xi’an Jiaotong University Bearing Fault Dataset (XJTU-SY)

4.3.1. Description

4.3.2. Results

4.4. Results Analysis

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI