Study on Life Prediction Method of Ball Screw Base on Constructed Degradation Feature and IGWO-BiLSTM

Wu, Qin; Niu, Jun; Wang, Xinglian

doi:10.3390/act12060236

Open AccessArticle

Study on Life Prediction Method of Ball Screw Base on Constructed Degradation Feature and IGWO-BiLSTM

by

Qin Wu

^1,2,*,

Jun Niu

¹ and

Xinglian Wang

³

¹

College of Mechanical and Electrical Engineering, Lanzhou University of Technology, Lanzhou 730050, China

²

Centre for Mechanical Efficiency and Performance Engineering, University of Huddersfield, Huddersfield HD1 3DH, UK

³

Electromechanical Instrument Operation and Maintenance Center, Lanzhou Petrochemical Company, Lanzhou 730060, China

^*

Author to whom correspondence should be addressed.

Actuators 2023, 12(6), 236; https://doi.org/10.3390/act12060236

Submission received: 9 May 2023 / Revised: 4 June 2023 / Accepted: 5 June 2023 / Published: 7 June 2023

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Regarding the problem of reduced remaining useful life (RUL) due to wear of the ball screw in the feed system of CNC (computer numerical control) machine tools, a prediction method based on constructing the degradation feature vector of the signal data and the improved gray-wolf optimization with bidirectional long short-term memory (IGWO-BiLSTM) neural network regression model is proposed. Firstly, a time-domain analysis and the complete ensemble empirical mode decomposition with adaptive noise analysis (CEEMDAN) were carried out based on the collected life cycle signal data of a ball screw. The time-domain feature vector and the energy feature vector of each IMF (intrinsic mode function) component after CEEMDAN decomposition were constructed. The Pearson correlation coefficient was used to filter feature vectors and construct the multivariate feature vector. Secondly, this paper improves the traditional gray wolf optimization algorithm, adds a search strategy based on dimension learning, and combines the improved algorithm with the BiLSTM model, based on the IGWO-BiLSTM theory. A regression model between feature vectors and the remaining life of a ball-screw system was established. Finally, the prediction model was established according to the proposed method and compared with the other five neural network models: LSTM, BiLSTM, BO-LSTM (Bayesian optimization of LSTM), BO-BiLSTM, and IGWO-LSTM. The results indicate that this method has high accuracy and good generalization ability for predicting the remaining life of a ball-screw system.

Keywords:

ball screw; CEEMDAN; IGWO; BiLSTM; life prediction

1. Introduction

CNC machine tools are widely used in modern industrial production, such as petrochemical, automotive, mold manufacturing, aerospace, and other fields. The development level of CNC machine tools reflects the overall advanced manufacturing technology level of a country. As an integral mechanical transmission component of the CNC machine tool feed system, the ball screw demands high precision, large bearing capacity, and reliability. The service life of ball screws may lead to mechanical failures and a decrease in production efficiency, indirectly affecting the machining accuracy of CNC machine tools and, thereby, affecting the economic benefits and image of the enterprise. In summary, it is necessary to establish an effective life-prediction model for ball screws to ensure the reliability of CNC machine tools [1,2].

In recent years, with the rapid development of Internet of Things technology, information technology, and artificial intelligence, predictive maintenance has gradually become a research hotspot [3,4,5]. Predictive maintenance mainly uses remaining useful life (RUL) prediction information to select the lowest-cost maintenance strategy and production schedule in the maintenance-opportunity window to reduce costs, improve efficiency, and maximize production profits. In the context of predictive maintenance, researchers have carried out a series of life-cycle-operation tests of mechanical equipment. The Center for Intelligent Maintenance Systems at the University of Cincinnati has designed three sets of accelerated life-cycle fatigue tests of rolling bearings. The failure time of the bearing is determined by monitoring the wear debris in the oil. The data set clearly indicates the fault form after each bearing failure, so it can be used not only for residual life prediction research [6,7,8] but also for fault diagnosis research. The accelerated life test of rolling bearings was carried out at Xi’an Jiaotong University in China. The life cycle vibration signals of 15 rolling bearings under three working conditions were collected, and the failure parts of each bearing were clearly marked, which provided data support for the research in the field of health-status assessment [9,10]. The intelligent-learning model can independently learn the performance-degradation mode of mechanical equipment from the monitoring data through intelligent algorithms, and predict the remaining life. It does not need to construct a physical model or a statistical model in advance, so it has gradually become a research hotspot. At present, in the field of residual life prediction, the commonly used intelligent models mainly include the artificial neural network model, support vector machine model, and correlation vector machine model. The artificial neural network simulates the working process of the human brain through a large number of connection nodes in a complex hierarchical structure, which can automatically extract features from monitoring data and predict the remaining life of mechanical equipment. The support vector machine model is an intelligent model based on statistical learning theory, which can effectively deal with the residual life prediction problem of small sample data. The support vector regression machine model is also a common support vector machine model used in the field of residual life prediction. At present, some studies [11,12] have applied relevance vector machines to the field of residual life prediction. However, the prediction performance of relevance vector machines depends largely on the choice of the kernel function, and the optimization of model parameters is still a problem to be solved.

With the development of sensor technology and signal processing technology, the residual life prediction method based on deep learning of signal feature extraction has gradually become mainstream. Guo et al. [13] proposed the generalized variational mode decomposition (GVMD) algorithm to extract the weak features of rolling bearing faults. The GVMD algorithm can make full use of the bearing-fault frequency information and bandwidth information to accurately extract the weak feature components of bearing faults on demand. Zhang et al. [14] proposed a parallel variance constrained-convolutional auto-encoder (PVC-CAE) model for bearing degradation feature extraction. The PVC-CAE model is used to extract features in the frequency domain signal, and the LSTM network is used for prediction. In order to effectively evaluate the degradation trend of bearings, Qu et al. [15] proposed a prediction method of bearing RUL based on the beetle antennae BP (back propagation) neural network model optimized by the beetle antennae search algorithm. By extracting the time domain and frequency domain features of 18 kinds of bearing life-cycle vibration signals, the degradation features are constructed. Zhu et al. [16] proposed a rolling bearing fault feature enhancement extraction method based on the instantaneous angular speed (IAS) signal of the rotary encoder. The de-phasing algorithm (DPA) is used to suppress the strict periodic components, such as rotating frequency and its harmonics, and multi-point optimization minimum entropy deconvolution adjusted (MOMEDA) is used to enhance the rolling bearing fault impulse component. The spectrum analysis of the enhanced signal is carried out to extract the bearing fault impulse characteristics. Zhu et al. [17] proposed a new deep feature-learning method for RUL prediction through the representation of time-frequency (RTF) and a multiscale convolutional neural network (MSCNN), which improved the prediction accuracy. As a variant of the recurrent neural network (RNN), the long short-term memory (LSTM) network can effectively solve the problems of gradient disappearance and gradient explosion in RNN. Guo et al. [18] proposed a method based on empirical modal decomposition (EMD) and an LSTM network to predict the RUL of rolling bearings. However, LSTMs can only be passed from the present moment to the future, which leads to ignoring the impact of the future on the present moment. At present, the research on the life prediction of ball screws based on data is few. Therefore, this paper chooses the BiLSTM model, which can consider both past and future information, to predict the RUL and improves the conventional parameter-seeking algorithm to highlight the advantages of the model.

Based on the above analysis, in order to accurately predict the remaining life of the ball screw, the authors improve the traditional gray wolf algorithm and add a search strategy based on dimension learning to enhance the balance between local search and global search. The improved algorithm is combined with a BiLSTM neural network to construct a new life-prediction regression model. Compared with the optimization results of five other regression models, such as LSTM, BiLSTM, etc., the method proposed in this paper performs the best in prediction accuracy and generalization ability through simulation data testing and verification.

2. Energy Characteristics of Signal IMF Component

2.1. CEEMDAN Algorithm

Torres [19] and colleagues proposed the CEEMDAN algorithm to solve the problem of modal aliasing in EMD (empirical mode decomposition). The EEMD (ensemble empirical mode decomposition) and CEEMD (complete ensemble empirical mode decomposition) algorithms, including Gaussian white noise, can improve mode aliasing, but they cannot effectively separate residual noise, and the added white noise is unevenly distributed in the high-frequency and low-frequency regions, leaving some white noise signals in the modal intrinsic components. To solve this issue, CEEMDAN adds adaptive white noise during each empirical mode decomposition, effectively addressing the problem of residual noise and incomplete decomposition that exist in EEMD and CEEMD [20,21,22,23].

The steps of the CEEMDAN algorithm are as follows:

Let E_i(·) be the i-th intrinsic mode function obtained by EMD decomposition, IMF_k be the k-th intrinsic mode component obtained after CEEMDAN decomposition, ε be the constant coefficient, and ω be the added random noise.

(1): The Gaussian white noise is added to the decomposed signal x(t) to obtain a new signal $x_{i} (t) = x (t) + ε_{0} E_{0} (ω_{i})$ (i =1, 2, ..., N). The new signal is decomposed by the EMD algorithm to obtain the first-order intrinsic mode component IMF₁:

${IMF}_{1} (t) = \frac{1}{N} \sum_{i = 1}^{N} {IMF}_{1}^{i} {(t)}_{i}$

(1)
: Calculate the first margin:

$r_{1} (t) = x (t) - {IMF}_{1} (t)$

(2)
(2): A new signal $\begin{matrix} r_{1} (t) + ε_{1} E_{1} (ω_{i}) \end{matrix}$ (i = 1,2, ..., N) is obtained by adding noise to the residual component r₁(t), and the second component IMF₂ is obtained by EMD decomposition:

${IMF}_{2} (t) = \frac{1}{N} \sum_{i = 1}^{N} E_{1} (r_{1} (t) + ε_{1} E_{1} (ω_{i}))$

(3)
(3): Calculate the margin:

$r_{k} (t) = r_{(k - 1)} (t) - {IMF}_{k} (t) (k = 2, 3, ..., K)$

(4)
(4): For the signal $r_{k} (t) + ε_{k} E_{k} (ω_{i})$ (i = 1, 2, ..., N), the k + 1th IMF is obtained by EMD decomposition:

${IMF}_{(k + 1)} (t) = \frac{1}{N} E_{k} (r_{k} (t) + ε_{k} E_{k} (ω_{i}))$

(5)
(5): Repeat (3) and (4) until the margin is a monotonic function and cannot be further decomposed. The final residual sequence is $r_{k} (t) = x (t) - \sum_{k = 1}^{K} {IMF}_{k} (t)$ , then the original signal is decomposed into $x (t) = \sum_{k = 1}^{K} {IMF}_{k} (t) + r_{k} (t)$ .

The schematic diagram of CEEMDAN decomposition is shown in Figure 1:

2.2. Signal IMF Component Energy Feature Construction

When the ball screw is worn inside, the energy information of the IMF component decomposed by CEEMDAN will change accordingly. Here, the energy information of the IMF is used as the characteristics value to indirectly reflect the change of the RUL of the component [24].

The CEEMDAN method is used to decompose the signal x(t) to obtain n intrinsic mode functions (IMF components) and a residual r. The total energy of each IMF component is calculated, as shown in Formula (6):

E_{i} = \int_{- \infty}^{+ \infty} ∣ c_{i} (\begin{matrix} t \end{matrix}) ∣^{2} dt = \sum_{i = 1}^{n} ∣ C_{i} ∣^{2}

(6)

In the formula, C_i(t) is the i-th IMF component, C_i is the amplitude of discrete points, and n is the number of sampling points. The total energy E_s of m IMF components is:

E_{s} = {(\sum_{i = 1}^{m} ∣ E_{i} ∣^{2})}^{\frac{1}{2}}

(7)

Constructing the energy feature vector

E ’

are

E ’ =  [E_{1} E_{2} \dots E_{m}] / E_{s}

3. IGWO-BiLSTM Regression Model

3.1. BiLSTM Neural Network

LSTM is a type of time-recurrent neural network that has evolved from the RNN (recurrent neural network). It was created to address the issue of “gradient disappearance” in the RNN structure of the recurrent neural network. The LSTM network improves upon the structure of the input layer, hidden layer, and output layer of the traditional RNN neural network. It includes a gate mechanism to control the path of information transmission, enabling selective memory or deletion of information that passes through the network.

Figure 2 shows a typical LSTM memory block structure. Each memory block has three ‘gating’ structures: the input gate, output gate, and forgetting gate.

(1): The first step to calculating the forgetting gate f_t is to determine the information discarded from the cell. The decision is implemented by the sigmoid layer of the forgetting gate. It looks at the previous output ht-1 as well as the current input x_t and outputs a number between 0 and 1 for each number in a state C_t−1 on the cell, representing complete deletion and complete retention, respectively.

$f_{t} = σ (W_{f} x_{t} + U_{f} h_{t - 1} + b_{f})$

(8)
(2): The second step, the input gate i_t determines what information is stored in the cell state next. The sigmoid layer of the input gate determines which values we will update. Next, the tanh layer creates a candidate vector ${\tilde{C}}_{t}$ , which will be added to the cell state. Combine these two vectors to create an updated value.

$i_{t} = σ (W_{i} x_{t} + U_{i} h_{t - 1} + b_{i})$

(9)

${\tilde{C}}_{t} = \tan h (W_{c} x_{t} + U_{c} h_{t - 1} + b_{c})$

(10)
(3): The third step to updating the previous state value, and update the previous state value C_t−1 to C_t.

$C_{t} = f_{t} \otimes C_{t - 1} + i_{t} \otimes {\tilde{C}}_{t}$

(11)
(4): The last step, the output gate o_t needs to decide what to output. First, a sigmoid layer is run, which determines the part of the cell state to be output. Then the cell state is passed through tanh and multiplied by the output of the sigmoid gate. The output result h_t is the output of LSTM and the hidden state of the next LSTM.

o_{t} = (W_{o} x_{t} + U_{o} h_{t - 1} + b_{o})

(12)

h_{t} = o_{t} \otimes \tan h (C_{t})

(13)

where W_i, W_f, W_o, and W_c represent the weight matrix from the input gate, forgetting gate, output gate, and candidate memory cells to the next input gate; U_i, U_f, U_o, and U_c represent the weight matrix of the hidden layer; b_i, b_f, b_o, and b_c represent the bias matrix of each gate structure, respectively. σ is the sigmoid activation function.

BiLSTM (bidirectional long short-term memory) is an improvement of LSTM, and its structure diagram is shown in Figure 3. The BiLSTM neural network structure model is divided into two independent LSTM. The input sequence is input into the two LSTM neural networks in positive and reverse order, respectively, for feature extraction. The two output vectors, namely the extracted feature vectors, are spliced to form the final feature expression. In this model, the response signal of the ball screw inputs the information into the BiLSTM network layer through the input layer. The input sample signal outputs

{\vec{h}}_{t}

through the forward LSTM layer and outputs

{\overset{\leftarrow}{h}}_{t}

through the backward LSTM layer to jointly determine the value of the incoming hidden layer and obtain the output y_t of BiLSTM. The update formula is as follows:

{\vec{h}}_{t} = LSTM (x_{t} {, \vec{h}}_{t - 1})

(14)

{\overset{\leftarrow}{h}}_{t} = LSTM (x_{t} {, \overset{\leftarrow}{h}}_{t - 1})

(15)

y_{t} = \vec{W} {\vec{h}}_{t} + \overset{\leftarrow}{W} {\overset{\leftarrow}{h}}_{t} + b_{y}

(16)

where

\vec{W}

is the weight matrix from the forward LSTM to the output layer;

\overset{\leftarrow}{W}

is the weight matrix from the inverse LSTM to the output layer; and b_y represents the bias matrix of the output layer [25,26,27,28].

3.2. Gray Wolf Optimization Algorithm

Gray wolf optimization (GWO) is a swarm intelligence optimization algorithm with a global optimal search mechanism. The inspiration comes from the predation behavior of the gray wolf group. There is a strict hierarchy in the gray wolf group, and a small number of gray wolves with absolute discourse power lead a group of gray wolves to prey. Gray wolves are generally divided into four levels: α wolves, β wolves, δ wolves, and ω wolves. The rights are from large to small in order to simulate the leadership class, as shown in Figure 4. Collective hunting is a social behavior of gray wolves. Social hierarchy plays an important role in the process of collective hunting, and the process of predation is completed under the leadership of a wolf. It mainly includes three steps: (1) tracking, approaching, and harassing the prey; (2) hunting and encircling the prey until it stops moving; and (3) attacking the prey.

To begin, construct a gray wolf social hierarchy model and mathematically model the social hierarchy of the gray wolf. The α wolf is used as the optimal solution, that is, the individual’s fitness is optimal, the suboptimal solution is β wolf, and the best solution is δ wolf, which can be the global optimal solution or local optimal solution of the objective function, with the minimum objective function value or the maximum objective function value. The remaining candidate solution is named ω wolf. The hunting process is guided by the three wolves α, β, and δ, and the ω wolf follows the three wolves. That is, first find the three best solutions, and then search around the region, the purpose is to find a better solution, and then update the α, β, and δ wolves. The behavior of gray wolves hunting prey is defined as follows:

The distance formula between individual and prey:

D = ∣ C \cdot X_{p} (t) - X (t) ∣

(17)

Gray wolf position update formula:

X (t + 1) = X_{p} (t) - A \cdot D

(18)

Coefficient vectors:

A = 2 α \cdot r_{1} - α

(19)

C = 2 \cdot r_{2}

(20)

where t is the number of iterations, D is the distance vector between the individual and the hunt, X_p is the position vector of the prey, X is the position vector of the gray wolf, a is the convergence factor (linearly decreases from 2 to 0 with the number of iterations), and r₁ and r₂ are random vectors, with modulo random numbers between 0–1.

Gray wolves can identify the position of prey and surround them. When the gray wolf recognizes the position of the prey, it guides the wolf group to surround the prey under the guidance of α, β, and δ. The mathematical model of gray wolf individual tracking prey position is described as follows:

\begin{matrix} D_{α} = ∣ C_{1} \cdot X_{α} - X ∣ \\ D_{β} = ∣ C_{2} \cdot X_{β} - X ∣ \\ D_{δ} = ∣ C_{3} \cdot X_{δ} - X ∣ \end{matrix}

(21)

Among them, D_α, D_β, and D_δ represent α and β, respectively, and the distance from other individuals; X_α, X_β, and X_δ represent the current position of α, β, and δ, respectively. C₁, C₂, and C₃ are random vectors, and X is the current position of the gray wolf.

\begin{matrix} X_{1} = X_{α} - A_{1} \cdot (D_{α}) \\ X_{2} = X_{β} - A_{2} \cdot (D_{β}) \\ X_{3} = X_{s} - A_{3} \cdot (D_{δ}) \end{matrix}

(22)

X_{t + 1} = \frac{X_{1} + X_{2} + X_{3}}{3}

(23)

Formula (22) defines the step length and direction of ω individuals in the wolf pack towards α and β, and δ, respectively, and Formula (23) defines the final position of ω.

When the prey stops moving, the gray wolf completes the hunting process by attacking. In order to simulate approaching prey, the value of a is gradually reduced, so the fluctuation range of A is also reduced. In other words, in the iterative process, when the value of a decreases linearly from 2 to 0, the corresponding value of A also changes in the interval [−a, a].

3.3. Improving Gray Wolf Optimization Algorithm

The classical GWO algorithm will cause the diversity loss of the wolf pack to converge prematurely so that the global optimal solution cannot be accurately obtained. The IGWO algorithm adds a dimension learning-based hunting (DLH) search strategy. The role of DLH is to enable each wolf in the group to learn from the surrounding wolves and exchange information, ensuring a fair selection of the best candidate wolves from the population, thereby enhancing the balance between local search and global search, ensuring the diversity of individuals and better searching of the entire space. It solves the shortcomings of the algorithm’s poor optimization effect on complex problems. In the IGWO algorithm, three different steps are redefined, including wolf initialization, wolf movement, wolf selection, and update [29].

(1): Initialization stage. N wolves are randomly distributed and searched with $[\begin{matrix} l_{i}, u_{j} \end{matrix}]$ in a given space.

$X_{ij} = l_{j} + {rand}_{j} [0, 1] (u_{j} - l_{j}), i \in [1, N], j \in {[1, D]}_{\circ}$

(24)
(2): Movement stage. The hunting strategy in IGWO is a combination of X_i-GWO(t + 1) and X_i-DLH(t + 1), that is, a combination of group-based hunting and dimension-based learning hunting (DLH). The Euclidean distance between the current position X_i(t) and the updated X_i-GWO(t + 1) is calculated. Equation (25) is used to construct the neighborhood N_i(t) of each wolf.

\begin{matrix} R_{i} (t) = ∥ X_{i} (t) - X_{(i - GWO)} (t + 1) ∥, \\ N_{i} (t) = \{X_{j} (t) ∣ D_{i} (X_{i} (t), X_{j} (t)) \leq R_{i} (t), X_{j} (t) \in p\} \end{matrix}

(25)

where the position of X_j(t) is adjacent to the current iteration position X_i(t); D_i is Euclidean distance; and p is the population of gray wolves. The position of the candidate wolf is calculated by Equation (26), and the recommended update position is established.

X_{i - GWO} (t + 1) = X_{i, d} (t) + rand {(X_{n, d} (t + 1) - X_{r, d} (t))}_{\circ}

(26)

(3): Selection and update stage. By comparing the fitness values of candidate wolves X_i-GWO(t + 1) and X_i-DLH(t + 1), the best candidate wolves are selected, which can be described by Formula (27).

$X_{i} (t + 1) = \{\begin{matrix} X_{i - DLH} (t + 1), f (X_{i - GWO}) < f (X_{i - DLH}) \\ X_{i - GWO} (t + 1), f (X_{i - GWO}) \geq f (X_{i - DLH}) \end{matrix}$

(27)

3.4. IGWO-BiLSTM Regression Model

The parameter determination of BiLSTM models is usually based on manual experience, which leads to a significant amount of time required for model parameter adjustment and a tendency to converge to locally optimal solutions [30]. The improved gray wolf optimization (IGWO) algorithm has the advantages of strong global search ability, fast convergence, and easy implementation. In order to enhance the accuracy of the residual useful life prediction model of a ball screw, the parameters of the BiLSTM model are optimized and adjusted using the IGWO. Specifically, the number of hidden layer units, learning rate, and iteration times in the BiLSTM network are employed as the positions of wolves. By calculating the fitness function and updating the position of wolves, an optimal solution for the LSTM network model parameters can be obtained, and the residual life prediction model of the ball screw can be constructed using the optimal model parameters. The specific steps involved in constructing the life-prediction model LSTM are as follows:

Step 1: Divide the preprocessed sample data into a training set and a test set in the appropriate proportion.

Step 2: Specify the initial data for IGWO, including the number of gray wolf populations, initial coordinates, and iterations. Transform the number of hidden layer units, learning rate, and iteration times of the BiLSTM network into the position coordinates of wolves, then select the training sample set to train the BiLSTM model.

Step 3: Calculate each wolf’s individual fitness value within the wolf group, and update their individual positions based on the fitness value calculated by the fitness function. Once either the maximum number of iterations is reached or the global optimal position satisfies the minimum boundary, the optimal solutions for the three model parameters of the hidden layer unit number, learning rate, and iteration number of the BiLSTM network will be obtained.

Step 4: Select the test sample set, and test the BiLSTM network using the above-optimized parameters to obtain the optimum BiLSTM network model.

4. Life Prediction Methods

The block diagram depicting the ball-screw life-prediction method based on CEEMDAN decomposition and the IGWO-BiLSTM model is illustrated by MATLAB in Figure 5.

(1): Extract time-domain feature vectors. Extracting the time-domain characteristic values from the collected ball-screw drive-motor current signal, we have identified 16 types of time-domain characteristics. Their respective calculation formulas are provided in Table 1 [31].

Although the 16 time-domain features mentioned above can reflect various aspects of the performance status of parts, the accuracy of the reflection varies greatly. Certain features are closely linked to changes in the degradation state of parts, while others cannot accurately indicate trends in the change of parts’ states. Therefore, it is necessary to select features from the set that can provide accurate reflections of performance state changes so as to avoid excessive dimension and ensure accuracy in predictions. In this case, the Pearson correlation coefficient ρ (Equation (28)) was used to represent the correlation between features and the component’s performance, with a value range between [−1, 1].

ρ = \frac{\sum x_{i} y_{i} - n \bar{x} \bar{y}}{(n - 1) s_{x} s_{y}} = \frac{n \sum x_{i} y_{i} - \sum x_{i} \sum y_{i}}{\sqrt{n \sum x_{i}^{2} - {(\sum x_{i})}^{2}} \sqrt{n \sum y_{i}^{2} - {(\sum y_{i})}^{2}}}

(28)

By applying Equation (28), the correlation coefficient between the 16 time-domain features listed in Table 1 and the RUL can be calculated by selecting the most relevant m eigenvalues to the residual life and constructing a time-domain feature vector

T =  [T_{1} T_{2} \dots T_{m}]

as the input feature for the model.

(2): Extract energy characteristic values by decomposing the signal with CEEMDAN into several IMF components. Utilize Formula (28) to calculate the correlation coefficient between each IMF component and the original signal to obtain n components with a strong correlation with the original signal, which contains the most crucial information. Following the approach in Section 2, use the energy information feature vector $E' = [{E'}_{1} {E'}_{2} \dots {E'}_{n}]$ of these IMF components as the energy input feature for the model. Combine the obtained time domain feature vector with the energy feature vector to create a new feature vector, which serves as the input feature for the IGWO-BiLSTM model.

$X = [T_{1} T_{2} {\dots T}_{m} E^{'}_{1} {E'}_{2} \dots {E'}_{n}] = [X_{1} X_{2} {\dots X}_{k}]$

(29)

where k = m + n.
(3): IGWO-BiLSTM model is established. In order to ensure that the model can train better results, assigning a smaller training set may lead to under-fitting the model, and assigning a larger training set may lead to over-fitting the model. The selected feature vectors are divided into training sets and test sets according to the ratio of 7:3. All data is normalized using the range of [−1, 1] to avoid large differences between samples and improve the convergence speed of the model. The weight matrix and bias vector in the model are determined using the IGWO algorithm and substituted into the model for training. The trained lifespan prediction model is then tested with the test set, and the obtained prediction results, along with the remaining life data of the corresponding test set samples, are used to calculate the root mean square error (RMSE) and coefficient of determination (R2). R2 indicates the percentage of the model prediction array reaching the data itself, with a higher value indicating better regression accuracy. Evaluate the performance of the model using RMSE and R2 indicators. If the requirements are not met, the regression model can be reconstructed by modifying the parameters until the requirements are met. If the requirements are met, determine the IGWO-BiLSTM prediction model.

$RMS = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {({\hat{y}}_{i} - y_{i})}^{2}}$

(30)

$R^{2} = \frac{{(N \sum_{i = 1}^{N} {\hat{y}}_{i} y_{i} - \sum_{i = 1}^{N} {\hat{y}}_{i} \sum_{i = 1}^{N} y_{i})}^{2}}{[N \sum_{i = 1}^{N} {\hat{y}}_{i}^{2} - (\sum_{i = 1}^{N} {\hat{y}}_{i})^{2})] [N \sum_{i = 1}^{N} y_{i}^{2} - {(\sum_{i = 1}^{N} y_{i})}^{2}]}$

(31)
(4): For signal data collected in real time, feature extraction is performed according to steps (1) and (2) before substituting the extracted feature vectors into the IGWO-BiLSTM prediction model established in step (3) to predict RUL in real time.

5. Life Prediction by Experimental Data

There are many failure modes of the ball screw. In order to compare and verify the model better, this paper analyzes and processes the life cycle signal of ball screw raceway wear failure. Because the wear failure of the inner ring and outer ring of the bearing has something in common with the wear failure of the ball screw raceway, the life-cycle signal of the bearing’s outer ring failure is selected for simulation verification. This paper selected bearing life-cycle data sets provided by the University of Cincinnati. The experimental platform and structural diagram are shown in Figure 6:

On a single shaft, the bearing test equipment has four test bearings. An AC (alternating current) motor drives the shaft, which is connected by rubber belts. The rotation was maintained at a steady 2000 rpm. A spring mechanism adds a radial load of 6000 lbs. to the shaft and bearing. All of the bearings are greased by force. Both the flow and the temperature of the lubricant are controlled by an oil circulation system. Debris from the oil is collected by a magnetic plug that was inserted in the oil feedback channel as proof of bearing deterioration. When the magnetic plug’s collected trash reaches a certain level and closes an electrical switch, the test will end. Four Rexnord ZA-2115 double-row bearings were installed on one shaft. Each row of bearings consists of 16 rollers with a pitch diameter of 2.815 inches, a roller diameter of 0.331 inches, and a conical contact angle of 15.171 inches. A PCB 353B33 high-sensitivity quartz ICPs (Integrated Circuit Piezoelectric) accelerometer was installed on each bearing box. Each bearing’s outer race is equipped with four thermocouples for checking lubrication by the temperature reading of the bearing. A National Instruments DAQCard-6062E data acquisition card was used to record vibration data every 20 min. The data length is 20,480 points, and the data sample rate is 20 kHz. LabVIEW software from National Instruments was used to collect the data.

In this paper, the whole life-cycle vibration signal data of outer ring damage, inner ring damage, and roller damage were selected respectively. The time-domain image of the signal is shown in Figure 7. Due to the long-life cycle of the bearing, the signal fluctuation is relatively stable for a long time in the early stage, which is the healthy stage. In the later stage, due to the long-term wear of the bearing, the remaining life of the bearing is reduced, which causes the fluctuation amplitude of the vibration signal to change until the final failure.

This paper intercepted some signals from the entire life cycle as simulation data. The CEEMDAN method was utilized to decompose the residual life simulation signals at varying residual life levels, producing several IMF components. Figure 8 displays the original signal of the simulation data and the first 7 IMF components after decomposition at a residual life of 3200 h.

Based on Equation (6), the energy value of each component can be calculated, while substitution into Equation (28) allows for the calculation of the correlation coefficient between each component and the original signal. The six components with the highest correlation coefficient are selected as energy features in the degradation feature vector.

The time-domain feature T and the energy feature E′ are combined into a new degradation feature vector X (as shown in Table 2) as the model input features:

IGWO was used to train the BiLSTM model to obtain the optimal model parameters. Using the optimal model parameters, the five models of standard LSTM, BiLSTM, BO-LSTM, BO-BiLSTM, and IGWO-LSTM were used to predict the RUL. The prediction results of each model are shown in Figure 9. The predicted points of the model are distributed on both sides of the real-value curve. The closer the scatter distance curve is, the more accurate the prediction is. Figure 9 shows the life prediction of the failure of the outer ring raceway by using six models. It can be seen from the diagram that the predicted scatter points of the IGWO-BiLSTM model are closer to the true value than the other five models.

After using the full life-cycle vibration signal data as simulation data to predict the failure of the outer ring, in order to prove that the model is also applicable to inner ring raceway failure and roller failure, the study used the six models mentioned above to predict the life of the faults caused by these two types of failures. The predicted results are shown in Figure 10 and Figure 11:

In order to further verify the prediction accuracy and prediction ability of the model for the six different models, LSTM, BiLSTM, BO-LSTM, BO-BiLSTM, IGWO-LSTM and IGWO-BiLSTM, the two evaluation indicators, root mean square error (RMSE) and determination coefficient (R2), are selected to measure the prediction performance of the six models. RMSE and R2 are common indicators to measure the prediction performance of the model. RMSE is the square root of the average of the sum of squares of deviations between the predicted value and the actual value of the model. R2 refers to the proportion of the model prediction value that can explain the degree of variation of the actual value. Both RMSE and R2 are indicators for quantitative evaluation of model prediction accuracy, but they have different concerns. RMSE focuses on the difference between the predicted value and the actual value. The smaller the value, the more accurate the model prediction. The coefficient of determination focuses on the ability of the model to describe the actual data. The closer the value is to 1, the better the model can explain the data. Therefore, when evaluating the prediction performance of the model, the above two indicators can be considered comprehensively to ensure that the prediction accuracy and interpretation ability of the model are guaranteed at the same time. Figure 12 shows the prediction-accuracy analysis values of the RUL prediction results for different prediction models.

It can be clearly seen from the diagram that the root mean square error of the IGWO-BiLSTM regression model is smaller than that of the other five models when used to predict the remaining life of the screw. The coefficient of determination R2 predicted by the model is also the closest to 1. The minimum root mean square error indicates that the difference between the predicted value and the actual value of the method used in this paper is the smallest. When the coefficient of determination is 1, the predicted value of the model is the same as the actual value. The coefficient of determination of the predicted result of the IGWO-BiLSTM regression model is the closest to 1, the predicted value is the closest to the real value, and the model prediction is the most accurate. The results show that the IGWO-BiLSTM regression model proposed in this paper can accurately predict the remaining service life of ball screws.

For the real-time collected current signal data that performs the same process, the degradation feature vector is extracted according to the method described in this paper, and the remaining life of the screw corresponding to the current signal can be predicted by inputting the established IGWO-BiLSTM regression model.

6. Conclusions

For the data-driven ball screw remaining-life prediction problem, this paper mainly improves the traditional gray wolf optimization algorithm, improves the effect of algorithm optimization, and combines the improved algorithm with the BiLSTM neural network model to establish a new life-prediction regression model. Using the collected life-cycle signal data, the multivariate life-degradation feature vector is constructed. The regression model was validated and compared based on the actual remaining life data of a ball screw. The simulation results show that this method has better prediction accuracy and generalization ability compared to other models. This method is suitable for CNC machine tools to monitor and collect ball screw signals while predicting and evaluating the remaining life of the ball screw pair in real time based on the measured signals.

Author Contributions

Conceptualization, J.N. and Q.W.; methodology, J.N.; software, J.N. and Q.W.; validation, J.N., Q.W. and X.W.; formal analysis, J.N. and Q.W.; investigation, X.W.; resources, Q.W. and X.W.; data curation, Q.W. and X.W.; writing—original draft preparation, J.N.; writing—review and editing, J.N. and Q.W.; visualization, J.N. and X.W.; supervision, Q.W. and X.W.; project administration, Q.W.; funding acquisition, X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ma, J. Progress and existing problems of CNC machine tool machining accuracy improvement technology. Intern. Combust. Engine Accessories 2021, 17, 81–82. [Google Scholar]
Shang, S. Life Prediction Modeling and Influencing Factors Analysis of Ball Screw Pair of CNC Machine Tools; Nanjing University of Aeronautics and Astronautics: Nanjing, China, 2019. [Google Scholar]
Ning-Yun, L.; Chuang, C.; Bin, J.; Yin, X. Latest research progress of complex system maintenance strategy: From condition-based maintenance to predictive maintenance. J. Autom. 2021, 47, 1–17. [Google Scholar]
Yan, J.; Meng, Y.; Lu, L.; Li, L. Industrial big data in an industry 4. 0 environment: Challenges, scheme, and applications for predictive maintenance. IEEE Access 2017, 5, 23484–23491. [Google Scholar] [CrossRef]
Yan, J.; Meng, Y.; Lu, L.; Li, L. Manufacturing system maintenance based on dynamic programming model with prognostics information. J. Intell. Manuf. 2019, 30, 1155–1173. [Google Scholar]
Huang, Z.; Xu, Z.; Ke, X.; Wang, W.; Sun, Y. Remaining useful life prediction for an adaptive skew-Wiener process model. Mech. Syst. Signal Process. 2017, 87, 294–306. [Google Scholar] [CrossRef]
Cui, L.L.; Wang, X.; Xu, Y.G. A novel switching unscented Kalman filter method for remaining useful life prediction of rolling bearing. Measurement 2019, 135, 678–684. [Google Scholar] [CrossRef]
Chen, Y.; Peng, G.; Zhu, Z.; Li, S. A novel deep learning method based on attention mechanism for bearing remaining useful life prediction. Appl. Soft Comput. 2020, 86, 105919–105925. [Google Scholar] [CrossRef]
Wang, B.; Lei, Y.G. A hybrid prognostics approach for estimating remaining useful life of rolling element bearings. IEEE Trans. Reliab. 2018, 6, 401–412. [Google Scholar] [CrossRef]
Tian, Q.P.; Wang, H.L. An ensemble learning and RUL prediction method based on bearings degradation indicator construction. Appl. Sci. 2020, 10, 346–368. [Google Scholar] [CrossRef] [Green Version]
Chang, Y.; Fang, H. A hybrid method for system degradation based on particle filter and relevance vector machine. Re-Liabil. Eng. Syst. Saf. 2019, 186, 51–63. [Google Scholar] [CrossRef]
Wang, X.L.; Jiang, B.; Lu, N.Y. Adaptive relevant vector machine-based RUL prediction under uncertain conditions. ISA Trans. 2019, 87, 217–224. [Google Scholar] [CrossRef] [PubMed]
Guo, Y.; Chen, G.; Wang, Q. Weak feature extraction of rolling bearing fault based on generalized variational mode decomposition. Mech. Transm. 2023, 47, 150–157. [Google Scholar]
Zhang, Y.; Li, H. Prediction method of bearing remaining life based on PVC-CAE. Comput. Appl. Res. 2023, 1–8. [Google Scholar] [CrossRef]
Qu, J.; Ma, X.; Liang, P. Prediction of bearing remaining useful life based on BAS-BP model. Mach. Hydraul. 2022, 50, 172–175. [Google Scholar]
Zhu, Y.; Guo, Y.; Zou, X.; Tian, T.; Xu, W.T. Rolling bearing fault feature enhancement extraction based on rotary encoder signal. Vib. Shock. 2023, 42, 119–125. [Google Scholar]
Zhu, J.; Chen, N.; Peng, W.W. Estimation of bearing remaining useful life based on multiscale convolutional neural network. IEEE Trans. Ind. Electron. 2018, 66, 3208–3216. [Google Scholar] [CrossRef]
Guo, R.X.; Wang, Y.; Zhang, H.C.; Zhang, G. Remaining useful life prediction for rolling bearings using EMD-RISI-LSTM. IEEE Trans. Instrum. Meas. 2021, 70, 1–12. [Google Scholar] [CrossRef]
Torres, M.E.; Colominas, M.A.; Schlotthauer, G.; Flandrin, P. A complete ensemble empirical mode decomposition with adaptive noise. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 22–27 May 2011; pp. 4144–4147. [Google Scholar]
Badi; Zhang, H.; Zhang, W. Analysis of rolling bearing fault diagnosis based on EEMD and SVM. Agric. Mach. Use Maint. 2021, 300, 14–15. [Google Scholar]
Gu, Y.; Su, Y.; Shen, X.; Luo, J.F.; Wang, T. Fault diagnosis of rolling bearings based on improved CEEMDAN alignment entropy and GWO-SVM. Comb. Mach. Tools Autom. Mach. Technol. 2022, 582, 62–66. [Google Scholar]
Jin, C.G. Gearbox bearing fault diagnosis method based on CEEMDAN energy entropy and Mahalanobis distance. Mach. Tool Hydraul. 2020, 48, 218–223. [Google Scholar]
Wu, Z.H.; Huang, N.E. Ensemble empirical mode decomposition: Anoise-assisted data analysis method. Adv. Adapt. Data Anal. 2009, 1, 1–41. [Google Scholar] [CrossRef]
Wu, Y.; Shu, Q. Residual life analysis of parts based on time-frequency characteristics and PSO-SVR model. Comb. Mach. Tools Autom. Mach. Technol. 2021, 5, 5–9. [Google Scholar]
Zhao, G.; Jiang, P.; Lin, T. Intelligent rolling bearing remaining life prediction method based on CNN-BiLSTM network and attention mechanism. Mechatron. Eng. 2021, 38, 1253–1260. [Google Scholar]
Zhao, Z.; Zhao, J.; Wei, Z. Research on rolling bearing fault diagnosis based on BiLSTM. Vib. Shock. 2021, 40, 95–101. [Google Scholar]
Nacer, S.M.; Nadia, B.; Abdelghani, R.; Mohamed, B. A novel method for bearing fault diagnosis based on BiLSTM neural networks. Int. J. Adv. Manuf. Technol. 2023, 125, 1477–1492. [Google Scholar] [CrossRef]
Zhan, Y.; Sun, S.; Li, X.; Wang, F. Combined Remaining Life Prediction of Multiple Bearings Based on EEMD-BILSTM. Symmetry 2022, 14, 251. [Google Scholar] [CrossRef]
Li, G.Y.; Liu, S.Y.; Han, Z.L.; Lv, H.T.; Yi, J.Q. Research on IGWO-SVR based transmission line ice coverage prediction model. J. Hubei Univ. Natl. Nat. Sci. Ed. 2023, 41, 79–84. [Google Scholar]
Shi, M. Research on Improved LSTM Networks for Remaining Life Prediction of Lithium-Ion Batteries; Shaanxi University of Science and Technology: Xi’an, China, 2021. [Google Scholar]
Ye, H.; Liu, Y. Eigenvalue Calculation of Large-Scale Time-Delay Power System; Science Press: New York, NY, USA, 2018. [Google Scholar]

Figure 1. Schematic diagram of CEEMDAN decomposition.

Figure 2. Schematic diagram of LSTM structure.

Figure 3. Schematic diagram of BiLSTM structure.

Figure 4. Gray wolf distribution level pyramid.

Figure 5. Block diagram of IGWO-BiLSTM ball-screw life-prediction method.

Figure 6. IMS experimental platform and structure sketch.

Figure 7. Whole life-cycle vibration signals under different failure modes of bearings.

Figure 8. CEEMDAN decomposition of the first 7 components.

Figure 9. Outer ring raceway failure life prediction.

Figure 10. Inner ring raceway failure life prediction.

Figure 11. Roller failure life prediction.

Figure 12. Comparison of model evaluation indicators.

Table 1. Equation for calculating time domain eigenvalues.

Name	Formula
Maximum	$\max (x (n))$
Peak	$\max ∣ x (n) ∣$
Mean	$\frac{1}{N} \sum_{n = 1}^{N} x (n)$
Root amplitude	${(\frac{1}{N} \sum_{n = 1}^{N} \sqrt{∣ x (n) ∣})}^{2}$
Standard deviation	$\sqrt{\frac{1}{N - 1} \sum_{n = 1}^{N} {[x (n) - \bar{x}]}^{2}}$
Kurtosis	$\frac{\sum_{n = 1}^{N} {[x (n) - \bar{x}]}^{4}}{(N - 1) σ_{x}^{4}}$
Waveform factor	$\frac{\sqrt{\frac{1}{N} \sum_{n = 1}^{N} x^{2} (n)}}{\frac{1}{N} \sum_{n = 1}^{N} \|x (n)\|}$
Pulse factor	$\frac{\max ∣ x (n) ∣}{\frac{1}{N} \sum_{n = 1}^{N} x (n)}$
Minimum	$\min (x (n))$
Peak value	$x_{\max} - x_{\min}$
Absolute average	$\frac{1}{N} \sum_{n = 1}^{N} ∣ x (n) ∣$
Variance	$\frac{1}{N} \sum_{n = 1}^{N} x^{2} (n)$
Root mean square	$\sqrt{\frac{1}{N} \sum_{n = 1}^{N} x^{2} (n)}$
Skewness	$\frac{\sum_{n = 1}^{N} {[x (n) - \bar{x}]}^{3}}{(N - 1) σ_{x}^{3}}$
Peak factor	$\frac{x_{\max}}{\sqrt{\frac{1}{N} \sum_{n = 1}^{N} x^{2} (n)}}$
Margin factor	$\frac{\max ∣ x (n) ∣}{{(\frac{1}{N} \sum_{n = 1}^{N} \sqrt{∣ x (n) ∣})}^{2}}$

Table 2. Signal time domain, energy eigenvalues, and actual remaining life data.

Serial Number	1	2	...	301	302
Average amplitude	0.080	0.080	...	0.002	0.001
Square root amplitude	0.066	0.066	...	0.002	0.001
Standard deviation	0.105	0.104	...	0.001	0.001
Root mean square	0.145	0.104	...	0.002	0.002
E_IFM1	0.993	0.989	...	0.993	0.988
E_IFM2	0.005	0.006	...	0.033	0.020
E_IFM3	0.045	0.049	...	0.078	0.131
E_IFM4	0.094	0.114	...	0.055	0.060
E_IFM5	0.054	0.068	...	0.047	0.038
E_IFM6	0.021	0.020	...	0.034	0.022
Remaining life	3200	3100	...	20	10

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wu, Q.; Niu, J.; Wang, X. Study on Life Prediction Method of Ball Screw Base on Constructed Degradation Feature and IGWO-BiLSTM. Actuators 2023, 12, 236. https://doi.org/10.3390/act12060236

AMA Style

Wu Q, Niu J, Wang X. Study on Life Prediction Method of Ball Screw Base on Constructed Degradation Feature and IGWO-BiLSTM. Actuators. 2023; 12(6):236. https://doi.org/10.3390/act12060236

Chicago/Turabian Style

Wu, Qin, Jun Niu, and Xinglian Wang. 2023. "Study on Life Prediction Method of Ball Screw Base on Constructed Degradation Feature and IGWO-BiLSTM" Actuators 12, no. 6: 236. https://doi.org/10.3390/act12060236

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Study on Life Prediction Method of Ball Screw Base on Constructed Degradation Feature and IGWO-BiLSTM

Abstract

1. Introduction

2. Energy Characteristics of Signal IMF Component

2.1. CEEMDAN Algorithm

2.2. Signal IMF Component Energy Feature Construction

3. IGWO-BiLSTM Regression Model

3.1. BiLSTM Neural Network

3.2. Gray Wolf Optimization Algorithm

3.3. Improving Gray Wolf Optimization Algorithm

3.4. IGWO-BiLSTM Regression Model

4. Life Prediction Methods

5. Life Prediction by Experimental Data

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI