1. Introduction
Nowadays, micro electro mechanical systems (MEMS) devices are used in various fields, such as automotive, biomedical, aerospace, and communication technologies [
1]. They play an indispensable role in functioning and protection of the entire system [
2]. As one of the components, micro-switches are affected by different working cycles and unavoidable external factors, such as changes in temperature and humidity, resulting in different degrees of residual life [
3].
However, micro-switches reliability has attracted little attention, whose failure may cause significant downtime, as well as safety implications. Specifically, they are important parts of rail vehicle systems, and whether they are damaged is related to the operation of the entire system or even the safety of the passengers. Due to the significance of such aspect, several research works dealing with the reliability of micro-switches and other electronic components have been published, such as references [
4,
5,
6,
7,
8,
9]. Nevertheless, traditional approaches to estimate remaining useful life (RUL) have failed because of the reliance on average accumulated historical field data [
10]. Reliability is estimated without taking into account the specific utilization of each component, such as working environment and using frequency. However, in practice, the lifetime should be different from one to another depending on how and where it is used. As a result, test duration and cost have become a huge challenge for traditional approaches. Real-time monitoring of the RUL of micro-switches and provide a convenient for timely maintenance decisions, is one of the important ways to improve its reliability [
11,
12].
The topic of the real-time prediction of RUL for electronic devices is one of the most active areas in prognostics and health management (PHM) research today. Considered with the stochastic characteristics of RUL in stochastic dynamic processes under actual working conditions, the data-driven RUL prediction was studied in the early 1980s, Derman et al. [
13] confirmed the importance of life distribution in extending the life of equipment. This type of method is the most typical traditional life prediction method. The statistical analysis of life data determines the probability distribution of equipment life. However, the equipment such as micro-switches, owned high reliability and long-life features and it failed to obtain sufficient time to failure data in the short term, which made it difficult to obtain satisfactory prediction results for traditional life analysis methods based on life data. In recent years, there has been an increasing interest in the establishment of the real-time life prediction model by using monitoring data and calculating the probability distribution in the use of statistical methods. A large amount of literature has been delivered, Wang et al. [
14] summarized the commonly used assumptions applying a random coefficient regression model and proposed a method for determining the failure threshold by optimization. Furthermore, Gebraeel et al. [
15] proposed a logarithmic linearized exponential-like random coefficient regression model. The model utilized historical degradation data from similar devices and incorporated real-time monitoring data from historical data of service equipment through the Bayesian updating mechanism to update the remaining life distribution. Si et al. [
16] summarized the experience of predecessors and provided an effective theory and method for establishing stochastic degradation models and studying RUL prediction problems. Especially, the algorithm put forward by Wang et al. [
17] is widely used but there still exists some unsolved problems, such as, it is not sensitive to the initial real-time monitoring degradation data due to the objects of micro-switches.
In this paper, a linear degradation model based on an inverse Kalman filter to imitate the stochastic deterioration process was proposed. In addition, it referred to others about state monitoring methods, such as the extended Kalman filter (EKF) applied in the estimation of the position of the intake valve of the engine, and a theoretical basis, which was built up for the algorithm proposed in this paper [
18,
19]. Although Wang et al. [
17] benefits were well proved, the algorithm was not sensitive to the initial real-time monitoring degradation data in solving the RUL estimation. Based on the Kalman filter, it was the important measure to the real-time condition monitoring of RUL, this paper proposed an inverse Kalman filter. Furthermore, Bayesian updating method and expectation maximization (EM) algorithm were used to estimate the RUL. Finally, the strong tracking filtering (STF) method was used to enhance the robustness. In order to verify the validity of the method, the S826 rail vehicle micro-switches were chosen as the research object. Thus, the real-time prediction of RUL for micro-switches were necessary. The data-driven method to solve the prediction of this kind of system could provide a feasible way for optimization of problems [
20].
The remainder of this paper is organized as follows.
Section 2 constructs a general stochastic process-based degradation model and then presents a degradation path-dependent approach for adaptive RUL estimation via real-time condition monitoring data. It discusses how to estimate initial parameters by using an inverse Kalman filter and illustrates a Bayes technique improved by the STF method, which can update the system parameters more accurately in real time. In
Section 3, the test rig is designed to obtain performance degradation data for micro-switches.
Section 4 provides several simulations and a case study to illustrate the application and usefulness of the developed approach.
Section 5 concludes the paper.
Notations used in this paper.
Notations | Explanations |
| The degradation detection data . |
| The drift parameter reflects the degradation rate of the equipment. |
, | The updated hyper-parameters by the Bayesian posterior estimation. |
, | stands for the unknown parameters set are not updated by Bayesian estimation, , and denotes the updated results by the EM algorithm, . |
| The fading factor. |
| The updated estimation variance by the STF technique. |
| is flipped as , where . |
2. Prognostic Approach
Under normal circumstances, most studies require multiple similar historical monitoring data to estimate parameters when they estimate the remaining useful life [
21,
22]. However, the micro-switches of the rail vehicle have high reliability and a long service life. For this type of component, even the accelerated life test requires considerable time and cost. Therefore, it is not feasible to estimate parameters with a large amount of historical data. Moreover, the running environment of each railway vehicle is different, and it is inaccurate to estimate the residual life with fixed parameters.
In this section, considering individual differences in micro-switches, we use real-time monitoring data to estimate individual systems. As the monitoring data is acquired, the system parameters will be updated adaptively, and an accurate prediction result can be obtained without similar historical data. The fundamental principle is that enough online monitoring data is used to complement the lack of similar historical data. A linear stochastic degenerate system is described in
Figure 1.
2.1. Modeling of Linear Stochastic Degenerate Systems
The linear degradation model is typically used for modelling degradation processes where the degradation rate is approximately a constant [
17,
23]. Moreover, the linear model with adaptive updating algorithm used in the literature [
17] has a good estimate of the exponential type of data. In this paper, we consider the same linear degradation model based on a Wiener process as follows:
where
is the degradation detection data at time
. The initial state is shown as
To be consistent with existing studies, the initial degradation was 0, which can be obtained by translation transformation of the data. And
was the drift parameter, which means the degradation rate of the system,
is the diffusion parameter.
denotes the standard Brownian motion, which represents the stochastic dynamics of the degradation process. The degradation detection data is described as
.
The reason for using linear system equations is not only its universality, but also it is the convenience for calculation. Another important reason is that the life prediction problem can be understood as predicting the future trend of a random curve. According to the Euler method, given a starting point, if we can predict the tangent curve at any point, from the starting point, we can calculate the predictive value step by step, the approximate curve can be obtained. In this paper, the drift parameter
denotes the tangent curve and we use a reasonable method to adjust
in real time. From the Equation (1), each step of the degradation data can be expressed as:
where
.
It can also be described in a nonlinear model, which will greatly increase computation time and not improve performance significantly. In addition, we apply Bayesian updating and the EM algorithm to update the system parameters, and the STF is used to improve the robustness of the model parameter mismatch. The above methods can guarantee the accuracy of parameters. The detailed description will be given below.
Assumption: In the model of , is assumed to be a random parameter, indicating individual differences. is assumed to be the deterministic parameter as a constant.
2.2. Bayesian Posterior Estimation of Stochastic Parameters
It is noticed that the key parameter for determining the degradation state is
in Equation (1).
is a random parameter and will be updated with the data obtained at the current moment
.
is distributed as
, such parameter distributions are consistent with existing methods [
17,
23].
In order to estimate hyper-parameters in the random parameter , the Bayesian posterior estimation is used in this paper.
Firstly, it is assumed that the prior distribution of
is
. Then the chain recursion was incorporated into the calculation. The posterior distribution of
can be expressed as [
17]:
Where
It can be found in Equation (4) that the posterior estimate of the random parameter can be updated after the new monitoring data.
2.3. Estimation of Unknown Parameters Based on EM Algorithm
As seen in the previous section, unknown parameters
are not updated by the Bayesian estimation. The reason for using the EM algorithm instead of the maximum likelihood estimation algorithm is that the unknown parameter
contains the hidden variable
, which cannot be directly estimated by the maximum likelihood estimation. We need to approximate the maximum likelihood estimation of parameters by maximizing the joint likelihood function
. In order to reflect the updated characteristics of
over time, we use the EM algorithm to estimate
through monitoring data
, its update results are expressed as
[
17].
First, complete logarithm likelihood function
is calculated as:
E-step: Calculate the expected value
of
which is about
M-step: Fixed parameter
, and take the maximum value of
.
,
can be expressed as follows:
Moreover, the updated results in Equation (7) required only one step to compute the maximum value, which have been given proof by the literature [
17]. One step to solve the maximum value greatly reduces the computing time and has a strong practical value.
From the results of Equation (7), the main updated parameter of the EM is . The other two parameters are also updated. The initial parameters in the Bayesian estimation are improved after Bayesian updating in the next step.
2.4. Adding Fading Factor Based on the STF to Enhance Robustness
In this section, we discuss how to adjust parameters in time and guarantee the accuracy of estimation when the model parameters and real-time data are mismatched.
It can be proved from Equations (4) and (7):
This means that the value of the parameter
will gradually decrease as the algorithm is updated. That is, the uncertainty of the real value is decreasing. However, when enough data is available, Equation (8) can be expressed as:
It is easy to prove that
will approach to 0 when
. And because
in the Equation (9) is also monotonically decreasing, hyper-parameter
will decay faster. When
, it can be seen that:
will not change as new data is acquired from Equation (10), that is, the stochastic degradation parameter
will not change with the acquisition of new data. Wang et al. [
17] directly uses Bayesian updating and EM algorithm to estimate unknown parameters
. When the degraded data is stationary, it can be well estimated. However, when parameters are convergent, it will not obtain good estimation results if the newly acquired data is different from the model parameter.
The reason for the parameter no longer being updated is
. Considering that the Kalman filter algorithm is obtained in the Bayesian framework, it has some similarities with the Bayesian updating. The Kalman filter results in the fact that the
K value tends to the minimum, thus that it is no longer sensitive to the prediction error. The STF solves the problem of mutational degradation data on the basis of the Kalman filter [
24]. It is not practical that
is rapidly approaching 0, inspired by reference [
24], we added the prediction error
in each step update to ensure the update capability of the algorithm, then we added a fading factor to adjust the prediction variance in real time, thus that
could be sensitive to the prediction error. The specific calculation steps are as follows:
Step 1: Establish system equation
Step 2: Set initial parameters
Step 3: Calculated fading factor
Model parameter updating:
Estimation variance updating:
Usually, the forget factor is set to , the softening coefficient is , and the prediction error is .
2.5. Estimating Initial Parameters by an Inverse Kalman Filter
Although the algorithm by Wang et al. [
17] has self-adaption update capability, even if the initial parameters are not set correctly, it will approach the accurate value as the new data is acquired. However, a practical problem is that the initial parameters are difficult to determine without a large amount of historical information, and it is also unknown where monitoring data starts from the whole life cycle. A progressively updating algorithm of parameter
is proposed by Wang et al. [
17]. If the initial parameter is set to be more inconsistent with the actual, such errors will affect the subsequent estimation process, and its convergence rate will be slow, resulting in that a large amount of monitoring data is needed to complete the parameter convergence. Even more, the algorithm has not completed convergence, experimental object has been damaged and the inaccurate life prediction results are obtained in its life cycle. To this end, this section will discuss an inverse Kalman filter to update the initial parameters in real time. The initial parameters are updated at the source, thus that the convergence speed of the estimation is accelerated.
The reason for this application of an inverse Kalman filter is that the drift parameter
is unobservable, where the nonlinear form adopted is an unknown problem. Because the degradation is more stable in the early phase, when the sampling interval is not long, it is reasonable for
to obey stochastic Gauss distribution around
based on large sample statistical theory [
25].
At the same time, nonlinear modeling methods (such as exponential models) can be used, and model parameters can be updated with the Bayesian updating and EM. After model parameters are determined, the initial parameters will be estimated by an inverse Kalman filter. In this way, the algorithm will be very complex and computation time will be greatly increased. More importantly, although the initial phase of degradation is relatively stable, it does not necessarily satisfy the overall degradation model. For example, the initial degradation process data in this paper does not agree with the overall degradation trend. That is, the model that conforms to the overall data may not necessarily satisfy the initial degradation process, it is related to the time point of when the monitoring begins. Therefore, it is difficult to estimate the exact initial parameters by using the determined nonlinear model.
It is often the case with actual degradation data: The initial degradation phase is stable, and then the volatility of data becomes more pronounced until the system fails, such as the data from this paper and reference [
26]. Therefore, it is more and more difficult to update the initial parameters by normal methods. Initial parameters are difficult to converge, even if the initial parameters are convergent, they are not the exact values. If the initial parameters are updated via recursively in reverse, it will be found that degradation data will be smooth gradually, updating the true value of the initial parameters
that are expected to be improved, and converge to the exact value.
For the actual operation of the system, the monitoring data corresponding to the current time is , and the observation data are . In order to show the Kalman updated process is more intuitive, the order of the elements are flipped as , where . has been rewritten as by the same method.
According to the conventional Kalman filter [
25,
27], all monitoring data of the system is recursively incorporated into parameter
by an inverse Kalman filter technology. The specific calculation process is as follows:
Step 1: Establish system equation
where
,
,
is the forecast variance.
Step 2: Time updating
where
,
.
Step 3: Status updating
where
is the system error,
.
Step 4: Update results of the initial parameter
The parameters of an inverse Kalman filter are set as , which can make the algorithm more dependent on system measurements.
Remark 1. For an inverse Kalman filter, the difference from the conventional Kalman filter [25,27] is that the order of the elements are flipped as ,it is accounted for , is the last monitoring data at the current time , is the initial monitoring data at the current time . The flipped data as guarantees the iteratively updated forward of the estimation from the last monitoring point when it starts to filter. Remark 2. The consequence of applied an inverse Kalman filter reveals that it can be obtained by the optimal estimation .
Remark 3. It is obvious that the steps according to an inverse Kalman filter are similar with the conventional Kalman filter. However, a practical problem is that the initial parameters are difficult to determine without a large amount of historical information, thus they are indeterminate and often set with errors. If we take advantage of the conventional Kalman filter to solve the errors belonging to the initial parameters, it will not be sensitive to them. In contrast, the last parameters are fixed and reliable relatively accounts for a number of historical information. The advantage of an inverse Kalman filter is that, when we start to filter from the last monitoring data , the accuracy of the filtering is improved for the initial monitoring data especially.
2.6. Expression of Remaining Useful Life
Based on the concept of stochastic process lead time, when the failure threshold
is reached for the first time, the system life is considered to be terminated. Based on the observed data
, the RUL
of the system at the moment
is defined as:
After getting the new data and corresponding updated parameters
, According to the literature [
17], the remaining life of PDF (probability density function) and CDF (cumulative distribution function) can be obtained respectively:
Here, the life prediction method under the linear stochastic deterioration model has been completed. The following steps are concluded to estimate the RUL of the micro-switches, which belongs to the S826 rail vehicle:
Step 1: A linear degradation model based on a Wiener process is proposed: , the degradation detection data are described as .
Step 2: is a random parameter and will be updated with the data obtained at the current moment . is distributed as , in order to estimate the hyper-parameters in the random parameter , the Bayesian posterior estimation is used in this paper. Finally, calculate the expression of the hyper-parameters .
Step 3: Unfortunately, the unknown parameters are not updated by Bayesian estimation. In order to reflect the updated characteristics of over time, we use the EM algorithm to estimate through the monitoring data , and its update results are expressed as . The main updated parameter of the EM is , other two parameters are also updated. Initial parameters-based Bayesian estimation are improved after Bayesian updating in the next step.
Step 4: When the parameters are convergent, it will not obtain good estimation results if the newly acquired data is different from the model parameter. The STF method solves the problem of mutational degradation data on the basis of the Kalman filter. We add prediction error in each step update to ensure the update capability of the algorithm, then we add the fading factor to adjust the prediction variance in real time, thus that can be sensitive to the prediction error.
Step 5: The drift parameter is unobservable, and the nonlinear form is adopted to an unknown problem. The initial parameter is set to be more inconsistent with the actual parameter, and such errors will affect the subsequent estimation process, and its convergence rate will be slower, resulting in a large amount of monitoring data being needed to complete the parameter convergence. As considered above, an inverse Kalman filter is proposed to update the initial parameters in real time, as well as updating at source, thus that the convergence speed of the estimation is accelerated.
4. Comparative Studies
In this section, we used the test data of the S826 micro-switches to illustrate the practicability of this research result, comparing it with the approaches, the Kalman filter instead of an inverse Kalman filter, and the algorithm of Wang et al. [
17] in order to verify the superiority of our method.
For the last 80 sampling points, the method with the Kalman filter instead of an inverse Kalman filter was similar with our approach, thus
Figure 7 compared the updated parameters obtained by Wang et al. [
17] with the method proposed in this paper. For these two approaches, the unknown parameters were obtained by the combination of the Bayesian updating and the EM algorithm. The difference is that the fading factor was added in this paper, thus that the drift coefficient was more sensitive to the change of data. As can be seen from the diagram, the prediction error approaches zero in the approach by Wang et al. [
17], resulting in
being not sensitive to new data, when the micro-switch is about to fail, the degradation rate was obviously accelerated, and our approach could be adjusted better.
The literature [
17] requires higher precision of initial parameter selection, and it is difficult to set an accurate initial parameter in practical applications, because there does not exist a large number of similar historical information and the accurate time to start data monitoring is unknown. In this paper, we selected a set of relatively inaccurate initial parameters in two methods
to verify the ability of our algorithm about updating the initial parameters. It can be seen from the comparison chart (
Figure 8) that when the parameters are improperly set, the convergence speed of Wang et al. [
17] is slow. Furthermore, the method proposed in this paper has faster convergence speed in the initial few sampling points than the method with Kalman filter instead of an inverse Kalman filter merely.
Figure 9 reflects the mean square error (MSE) values at different monitoring time points. In the initial phase, degradation data are less, the fluctuation of the method proposed by Wang et al. [
17] is the largest and with the smallest fluctuation is our method. This means that the remaining life of the PDF of another two predictive models are sensitive to small changes, and if it is applied for a maintenance decision, it may result in two different monitoring points, which are completely different to the maintenance decisions which increases protection and maintenance costs in turn. As a conclusion, our approach has a higher prediction accuracy. Notice that
Figure 9b shows an upward trend of MSE, it is mainly because when the life is about to terminate, the data have fluctuated greatly, and the error has been raised slightly.
Figure 10 illustrates the approach proposed by Wang et al. [
17], which compares with the one proposed by us regarding the estimation RUL at the last four sampling points. PDF becomes gently sharper and closer to the Z-axis by applying our approach. This means when more data are used to estimate parameters, the uncertainty of the remaining life is decreasing, which agrees with the facts.
Remark 4. It can’t be ignored to illustrate a superior method between the algorithm proposed by Zhang et al. [30] and us. Figure 11 reflects that our method fits the actual degradation path better than Zhang et al. [30], especially in the initial several sampling points. It is more obvious in Figure 12, where our PDF is closer to the z-axis than shown by Zhang et al. [30]. This means our approach is still very sensitive to the monitoring data in the final phase. In conclusion, our approach is more sensitive and adjustable to degradation data of micro-switches than the one proposed by Zhang et al. [
30].
Discussion
The most remarkable result that emerged from the data were our approach with an inverse Kalman filter that fitted the real degradation path excellently. Our results shared a number of similarities with findings according to the literature [
17,
25]. However, different from earlier findings, our approach dealt better with errors in the initial degradation phase. We put forward another two methods to compare and demonstrate our view. In addition, we also compared this with Zhang et al. [
30] and got a satisfied result. It is easily seen in the previous study; our approach is the most sensitive to the actual sampling points. The results we have obtained will provide strong technical support for PHM, including micro-switches and even other electronic components of the rail vehicles. And it will be a solid basic study about nonlinear degradation path based on the electronic components in the future.