Track Irregularity Identification Method of High-Speed Railway Based on CNN-Bi-LSTM

Yang, Jinsong; Liu, Jinzhao; Guo, Jianfeng; Tao, Kai

doi:10.3390/s24092861

Infrastructure Inspection Research Institute, China Academy of Railway Sciences, Beijing 100081, China

* Author to whom correspondence should be addressed.

Sensors 2024, 24(9), 2861; https://doi.org/10.3390/s24092861

Submission received: 3 April 2024 / Revised: 26 April 2024 / Accepted: 29 April 2024 / Published: 30 April 2024

(This article belongs to the Special Issue Artificial Intelligence Enhanced Health Monitoring and Diagnostics: 2nd Edition)

Abstract

Track smoothness has become an important factor in the safe operation of high-speed trains, and methods for detecting it are being improved continuously. This paper presents a track irregularity identification method based on CNN-Bi-LSTM that predicts track irregularity from car body acceleration. Because car body acceleration is easy to collect and can be obtained on passenger trains, the proposed model offers a basis for developing track irregularity identification methods that rely on conventional vehicles. The first step is the construction of the data set required for model training. The model input is the car body acceleration detection sequence, and the output is the irregularity sequence of the same length. The fluctuation trend of the irregularity data is extracted by the Hodrick-Prescott (HP) filtering algorithm and used as the prediction target. The second step is a prediction model based on the CNN-Bi-LSTM network, which extracts features from the car body acceleration data and predicts irregularities point by point. In addition, this paper proposes an exponential weighted mean square error with priority inner fitting (EIF-MSE) as the loss function, which improves the prediction accuracy for big values and reduces the risk of false alarms. Finally, the model is verified on simulation data and on real data measured by the high-speed railway comprehensive inspection train.

Keywords:

track irregularity; body vibration acceleration; Bi-LSTM; high-speed railway

1. Introduction

The smoothness of the track directly determines the safety of the track system and passenger riding comfort [1]. At present, many countries have reformed their track maintenance systems. To improve maintenance efficiency and reduce maintenance costs, fault prediction and health management methods are increasingly applied in track maintenance systems, and replacing the traditional periodic maintenance mode with condition-based maintenance has become the development trend. The core of condition-based maintenance is to use a large amount of detection data to judge the smoothness of a track scientifically and reasonably. This places higher requirements on track detection frequency and accuracy.

With the continuous growth of high-speed railway traffic volume, inspection frequency needs to be increased accordingly to ensure the safety of railway operations. Therefore, more convenient track smoothness detection methods have become a key research topic.

At present, research on track irregularities mainly covers two aspects: maintenance technology for track irregularities, and detection technology and recognition algorithms. In terms of maintenance technology, Zhang et al. [2] proposed a curve radius optimization method to solve the problem of vehicle vertical acceleration exceeding the limit on vertical curves. In terms of identification, track irregularities can be detected directly with detection equipment, and track smoothness can then be judged through statistical analysis of the detection data [3]. They can also be evaluated indirectly through detection data such as vehicle dynamic response and wheel vibration noise [4]. There are three main kinds of track irregularity detection methods: the first is based on professional track detection vehicles, the second on the lightweight track survey trolley, and the third on track detection systems carried on in-service railway vehicles.

The professional track detection vehicle technology is relatively mature. It is generally equipped with a complex track detection system, which can measure the track geometry from different dimensions. For example, the HSR-350x track irregularity detection system in South Korea is composed of laser, camera, and inertial tools, and its maximum measurement speed is 320 km/h [5]. Tsubokawa et al. [6] proposed an inertial mid-chord offset method to detect track irregularity. This method combines the characteristics of optical and inertial methods and reduces the integration error caused by inertial measurement.

Although the detection method based on a professional detection vehicle has high detection efficiency, the detection accuracy and positioning accuracy of a detection vehicle will not meet the use requirements in the case of construction or precise track adjustment. Therefore, the lightweight track survey trolley is also an important detection tool for track irregularity. At present, the main principle of the track survey trolley is to combine the inertial navigation system with the total station or global positioning system to realize the high-precision measurement of track geometry [7]. Chen et al. [8] designed a track geometry measuring trolley system by integrating an inertial navigation system (INS) with geodetic instruments, which effectively measures track irregularity. Zhu et al. [9] proposed an attitude variation method based on double difference GNSS (DGNSS) and INS integration for railway track irregularity detection.

The first two kinds of detection equipment are equipped with a large number of optical and inertial sensors and have high measurement accuracy. However, such inspections are expensive and can only be performed periodically, which means that track failures can occur between inspections [10]. Therefore, predicting track irregularity with simple inertial sensors carried on in-service trains has become a research hotspot in recent years [11].

Xiaozhou et al. demonstrated the correlation between track irregularity and vehicle acceleration through fractal analysis and pointed out that the correlation coefficient exceeds 0.7 when the wavelength is greater than 30 m. Methods of predicting track irregularity from vehicle dynamic response can be divided into methods based on the vehicle dynamics model and methods based on the filter model [12]. Track irregularity prediction can be regarded as an inverse dynamic analysis problem in which track irregularity is an unknown input to be identified from axle box or vehicle body acceleration measurements [13]. Czop et al. [14] proposed a track irregularity detection method based on axle box acceleration measured during vehicle operation, using the vehicle dynamics model. Compared with axle box acceleration, vehicle body acceleration is simpler to measure, and the measuring equipment can be made into a portable device. Tsunashima et al. [15] used the Kalman filter (KF) to predict the track geometry irregularity of the Shinkansen in Japan from the acceleration of the vehicle body. Odashima [16] demonstrated the possibility of estimating the irregularity of a conventional railway track using only the acceleration of the car body and used a Kalman filter to invert the track irregularities. However, current prediction methods often perform better under certain fixed line conditions [17] and speed ranges, and their prediction accuracy decreases as the prediction distance increases. Therefore, it is of great significance to find a track irregularity prediction method that has a wide range of applications and can achieve long-distance prediction.

In recent years, as an artificial intelligence technology involving deep network structure, deep learning provides a new idea for processing time series signals such as vehicle acceleration [18].

The recurrent neural network (RNN) proposed by Hopfield [19] in 1982 is a deep neural network that can account for the temporal correlation in time series. The RNN places no limit on the length of the time series and is widely used in natural language processing. However, the training process of an RNN suffers from exploding and vanishing gradients. To overcome these problems, Hochreiter [20] proposed long short-term memory networks (LSTMs) in 1997. At present, LSTMs are widely used by researchers in the fields of natural language translation [21,22,23], speech recognition [24], finance [25], and signal processing [26]. LSTMs are also commonly used in the field of industrial equipment health management. Wanqing et al. [27] compared LSTMs with recursive neural networks and generalized Cauchy methods and summarized their respective characteristics.

In order to improve the effect of feature extraction, the LSTM neural network is often combined with the convolutional neural network (CNN) [28]. Wu et al. [29] proposed a new framework structure according to the characteristics of financial time series and the task of price prediction, which combines the convolutional neural network and LSTM neural network to realize a more accurate prediction of stock price.

In order to make full use of the information both before and after each point in a time series, bidirectional LSTM has attracted more attention in recent years. Zou et al. [30] proposed a method combining multi-scale weighted entropy morphological filtering (MWMF) signal processing with bidirectional long short-term memory neural networks (Bi-LSTMs). The method was applied to rolling bearing fault diagnosis and was verified to have high classification accuracy. Xia et al. [31] proposed an integrated framework based on convolution multi-time window Bi-LSTM, which is used to accurately predict the maintenance rules of mechanical equipment when the length of condition monitoring data is highly inconsistent.

In existing track irregularity research, relatively few studies use deep learning methods to realize point-by-point prediction of track irregularity from vehicle body acceleration. Previous studies have shown that there is a correlation between track irregularity and car body acceleration [32,33,34], which provides a mechanistic basis for identifying track smoothness from car body acceleration data.

The smoothness of the track is the foundation for ensuring the safe operation of high-speed trains. Based on the correlation between track irregularity and car body acceleration, this paper constructs a track irregularity prediction model for high-speed railways based on Bi-LSTM and predicts the track irregularity value from car body vertical acceleration detection data. Because car body acceleration detection is simple and does not need professional inspection vehicles, this method can predict track smoothness by collecting the car body acceleration data of operating trains. Predicting track irregularities in this way can increase detection frequency at a lower cost and enable more timely detection of track irregularities. Building on the prediction model based on the CNN-Bi-LSTM neural network, and taking into consideration the characteristics of track irregularity prediction, this paper proposes an exponential weighted mean square error with priority inner fitting (EIF-MSE) as the loss function. This improves the prediction accuracy of high-risk values and reduces the possibility of false alarms caused by overestimated predictions.

The structure of the rest of this paper is as follows: Section 2 introduces the methods of data preprocessing and data set construction. Section 3 introduces the process of the track irregularity prediction algorithm based on the CNN-Bi-LSTM model. Section 4 introduces the exponential weighted mean square error with priority inner fitting proposed in this paper. In Section 5, the model is verified by using the simulation data and the real data detected by the comprehensive inspection train.

2. Materials and Methods

2.1. Trend Extraction Based on HP Filtering Algorithm

During the operation of the high-speed comprehensive inspection vehicle equipped with the track geometry inspection system, measurements are inevitably disturbed by vehicle vibration and by weather and temperature changes, which affects the accuracy of the detection results. External sunlight reflection, sensor and data transmission errors, laser deviation from the normal detection point at turnouts, image interference, and other factors lead to high-frequency noise in track geometry irregularity detection data. When predicting track irregularity, we pay more attention to the changing trend and overall amplitude of the irregularity. Therefore, when building the data set, we first apply the HP filtering algorithm to the irregularity data to extract the change trend and filter out the high-frequency fluctuations.

HP filtering was proposed by Hodrick and Prescott in 1981 to address trend analysis problems in the financial field and is widely used in economic analysis. The method extracts the trend component from the raw data and reduces the interference of high-frequency components [35]. In this article, HP filtering is used to extract trend components from track irregularity detection data for model training and testing. The track irregularity detection data are processed with the HP filtering algorithm as follows:

The sequence of track irregularity detection data is represented as $y (t) = \{y_{1}, y_{2}, \dots, y_{m}\}$. If the trend component is represented as $q (t) = \{q_{1}, q_{2}, \dots, q_{m}\}$ and the fluctuation component is represented as $g (t) = \{g_{1}, g_{2}, \dots, g_{m}\}$, then $y (t) = q (t) + g (t)$.

In the HP filtering method, the loss function $Z$ is set to:

Z = \sum_{i = 1}^{m} {(y_{i} - q_{i})}^{2} + β \sum_{i = 1}^{m - 2} {(q_{i} - 2 q_{i + 1} + q_{i + 2})}^{2}

(1)

where β is a penalty factor that controls the degree of smoothness. According to Ref. [36], the determination of β mainly depends on the detection frequency of monitoring data. Based on the characteristics of irregularity detection data, β is selected as 30. The problem of solving the trend component q(t) is transformed into the problem of solving the minimum value of Z, which can be simplified as:

Z = {‖y - q‖}^{2} + β {‖\nabla^{2} q‖}^{2}

(2)

where, if

\nabla q = {\begin{bmatrix} - 1 & 1 & & & \\ & - 1 & 1 & & \\ & & \ddots & \ddots & \\ & & & - 1 & 1 \end{bmatrix}}_{(m - 1) \times m} q,

(3)

then we can get:

\nabla^{2} q = {\begin{bmatrix} - 1 & 1 & & & \\ & - 1 & 1 & & \\ & & \ddots & \ddots & \\ & & & - 1 & 1 \end{bmatrix}}_{(m - 2) \times (m - 1)} \nabla q

(4)

If the $(m - 2) \times m$ matrix $D$ represents $\nabla^{2}$, this can be abbreviated as $\nabla^{2} q = D q$, and the loss function can be transformed into:

\begin{aligned} Z &= {‖y - q‖}^{2} + β {‖D q‖}^{2} \\ &= (y - q)^{T} (y - q) + β {(D q)}^{T} (D q) \\ &= q^{T} (I + β D^{T} D) q - 2 y^{T} q + y^{T} y \end{aligned}

(5)

Setting the gradient of $Z$ with respect to $q$, which is $2 (I + β D^{T} D) q - 2 y$, to zero, we obtain:

y = (I + β D^{T} D) q

(6)

which can be solved as follows:

q = {(I + β D^{T} D)}^{- 1} y,

(7)

The fluctuation trend of the time series can be obtained by Equation (7).

The track irregularity trend data obtained by the HP filtering algorithm will be used for the following model training and prediction.
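The closed-form solution in Equation (7) can be sketched in a few lines of pure Python. This is only an illustrative sketch, not the authors' implementation: the function names (`second_difference_matrix`, `solve_linear_system`, `hp_filter`) are hypothetical, the second-difference matrix is assumed to have rows of the form [1, -2, 1], and a dense Gaussian elimination is used purely to keep the example self-contained. For windows of several hundred points, a banded or sparse solver would be the more practical choice.

```python
def second_difference_matrix(m):
    """Return the (m-2) x m matrix D with (D q)[i] = q[i] - 2*q[i+1] + q[i+2]."""
    D = [[0.0] * m for _ in range(m - 2)]
    for i in range(m - 2):
        D[i][i], D[i][i + 1], D[i][i + 2] = 1.0, -2.0, 1.0
    return D


def solve_linear_system(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting (dense)."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def hp_filter(y, beta=30.0):
    """Return (trend q, fluctuation g), where q solves (I + beta * D^T D) q = y."""
    m = len(y)
    if m < 3:
        raise ValueError("HP filtering needs at least 3 samples")
    D = second_difference_matrix(m)
    # Build A = I + beta * D^T D by accumulating the outer product of each row of D.
    A = [[1.0 if i == j else 0.0 for j in range(m)] for i in range(m)]
    for row in D:
        nonzero = [k for k, v in enumerate(row) if v != 0.0]
        for i in nonzero:
            for j in nonzero:
                A[i][j] += beta * row[i] * row[j]
    q = solve_linear_system(A, [float(v) for v in y])
    g = [yi - qi for yi, qi in zip(y, q)]
    return q, g
```

A straight line has zero second difference, so `hp_filter` returns it unchanged as its own trend; for noisy input, the trend is smoother than the data, and larger values of `beta` smooth it further.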

2.2. Data Set Construction Based on Moving Sliding Window

The irregularity data and acceleration data collected by the inspection train are sampled at equal intervals with 4 sampling points per meter. The acceleration detection data corresponds to the irregularity detection data one by one. In order to facilitate model training, in this paper, a moving window with a length of 500 sampling points is used to intercept the detection data, the acceleration sequence composed of 500 points is used as the model input, and the corresponding irregularity sequence is used as the target of the model, as shown in Figure 1. Thus, a data set for model training and testing is formed.
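The windowing step can be illustrated with a short Python sketch. The window length of 500 points comes from the text; the stride between consecutive windows is not stated in the paper, so the `stride` parameter and its default are assumptions, as is the function name `build_windows`.

```python
def build_windows(acceleration, irregularity, window=500, stride=500):
    """Cut two aligned sequences into (input, target) pairs of equal length.

    `stride` is an assumed parameter: stride == window gives non-overlapping
    windows, while a smaller stride gives overlapping windows and more samples.
    """
    if len(acceleration) != len(irregularity):
        raise ValueError("acceleration and irregularity must be aligned point by point")
    inputs, targets = [], []
    for start in range(0, len(acceleration) - window + 1, stride):
        inputs.append(acceleration[start:start + window])   # model input
        targets.append(irregularity[start:start + window])  # prediction target
    return inputs, targets
```

At 4 sampling points per meter, each 500-point window covers 125 m of track.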

3. Track Irregularity Prediction Model Based on CNN-Bi-LSTM

In this paper, a fusion network model combining CNN and Bi-LSTM is proposed. The model exploits the ability of deep learning to extract features automatically in order to learn the deep features contained in the acceleration sequence. Figure 2 shows the proposed CNN-Bi-LSTM network learning framework, and Table 1 lists the network parameter configuration.

In this paper, one-dimensional convolution is used for the feature extraction of car body vertical acceleration data. The one-dimensional convolutional network consists of a convolutional layer and an activation function. Since the car body acceleration takes both positive and negative values, the ‘tanh’ function is selected as the activation function. The convolution kernel size is set to 10, the stride is 1, and the number of convolution kernels is 100. To ensure that the acceleration data passing through the convolution layer still correspond one-to-one to the track irregularity data, padding is used to keep the data length the same before and after convolution.
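To illustrate how padding keeps the sequence length unchanged, the following sketch applies a single one-dimensional filter with stride 1 and a tanh activation. It is a simplified illustration rather than the paper's network: only one filter is shown instead of 100, the function name `conv1d_same` is hypothetical, and the split of the padding (4 zeros on the left, 5 on the right for a kernel of size 10) is one common convention that the paper does not specify.

```python
import math


def conv1d_same(signal, kernel, activation=math.tanh):
    """Stride-1 1-D convolution (cross-correlation) with zero 'same' padding."""
    k = len(kernel)
    total_pad = k - 1
    left = total_pad // 2        # 4 zeros for a kernel of size 10 (assumed split)
    right = total_pad - left     # 5 zeros for a kernel of size 10
    padded = [0.0] * left + list(signal) + [0.0] * right
    output = []
    for i in range(len(signal)):
        # Weighted sum over the k samples covered by the kernel at position i.
        s = sum(kernel[j] * padded[i + j] for j in range(k))
        output.append(activation(s))
    return output
```

With an input window of 500 points and a kernel of size 10, the output again has 500 points, so each output feature stays aligned with one irregularity sample.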

Because the response of car body acceleration to track irregularity has a certain delay, when predicting track irregularity through car body acceleration data, we should consider not only the acceleration data before the current point, but also the information after the current point. Based on this feature, this paper uses the Bi-LSTM neural network to extract the features of the time dimension.

In the whole model, the convolution network is first used to extract the deep features in the acceleration detection sequence. Then, the Bi-LSTM network is used to extract time series features: the features obtained by the CNN are propagated in both the forward and backward directions. After the Bi-LSTM network, each node obtains a 100 × 1 feature sequence. Finally, the feature sequence output by each node is input into the fully connected network for regression. After the three-layer fully connected neural network, the predicted track irregularity value corresponding to that point is obtained.

4. Exponential Weighted Mean Square Error with Priority Inner Fitting

In regression prediction problems, the mean square error is a common loss function. However, track irregularity prediction has its own characteristics, which are mainly reflected in two aspects. First, more attention is paid to the accuracy of big value prediction. We call a value with a big deviation from the standard value a “big value”; it often represents a high safety risk and a high possibility of defects. Second, the predicted value should, as far as possible, not be higher than the real value. With the ordinary mean square error, a given loss value may result from the predicted value being either greater than or less than the real value. However, when the predicted value is greater than the real value, it may cause a false early warning and, in serious cases, suspend train operation and affect transportation efficiency. Therefore, we hope that, when the prediction error meets the use requirements, the absolute value of the prediction is less than that of the real value; that is, priority is given to inner fitting.

These two characteristics apply not only to the prediction of track irregularity but also to other tasks such as fault diagnosis and state prediction. Based on these two requirements, this paper proposes an exponential weighted mean square error with priority inner fitting (EIF-MSE).

The traditional mean square error can be expressed as:

\mathrm{MSE} = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2},

(8)

where $y_{i}$ represents the i-th element of the real value sequence $Y = [y_{1}, y_{2}, \dots, y_{n}]$ and ${\hat{y}}_{i}$ represents the i-th element of the corresponding prediction sequence $\hat{Y} = [{\hat{y}}_{1}, {\hat{y}}_{2}, \dots, {\hat{y}}_{n}]$.

The EIF-MSE proposed in this paper can be expressed as:

\mathrm{EIF\text{-}MSE} = \frac{1}{n} \sum_{i = 1}^{n} \left[ {(y_{i} - {\hat{y}}_{i})}^{2} \cdot e^{λ \cdot |y_{i} - \tilde{y}|} + γ \cdot \mathrm{relu} (|{\hat{y}}_{i}| - |y_{i}|) \right],

(9)

where $e^{λ \cdot |y_{i} - \tilde{y}|}$ is the risk coefficient and $\tilde{y}$ is the standard value of the predicted quantity $y$, which is 0 for track irregularity.

The greater the value of $|y_{i} - \tilde{y}|$, the greater the deviation of the real value from the standard value, and the greater the potential risk and hazard. To ensure the accuracy of large-value predictions, the loss here should be greater.

The term $γ \cdot \mathrm{relu} (|{\hat{y}}_{i}| - |y_{i}|)$ is called the inner coefficient, where:

\mathrm{relu} (x) = \begin{cases} x, & x > 0 \\ 0, & x \leq 0 \end{cases}

When the absolute value $|{\hat{y}}_{i}|$ of the predicted value is less than the absolute value $|y_{i}|$ of the real value, the inner coefficient is 0, so no penalty is added to the loss function. When $|{\hat{y}}_{i}|$ is greater than $|y_{i}|$, the inner coefficient increases with $|{\hat{y}}_{i}|$, and the loss function is penalized so that the predicted value preferentially lies inside the real value.
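The behavior of the loss in Equation (9) can be checked with a small pure-Python sketch. It rests on two assumptions: the relu term is read as lying inside the sum over i (it depends on the index i), and the default values `lam=0.2` and `gamma=0.3` are simply values examined in the experiments of Section 5.2, not prescribed settings. The function names are hypothetical, and a training implementation would be written in the tensor operations of the chosen deep learning framework instead.

```python
import math


def mse(y_true, y_pred):
    """Ordinary mean square error, Equation (8)."""
    return sum((y - p) ** 2 for y, p in zip(y_true, y_pred)) / len(y_true)


def eif_mse(y_true, y_pred, lam=0.2, gamma=0.3, standard=0.0):
    """EIF-MSE as read from Equation (9); `standard` is the standard value (0 here)."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("sequences must be non-empty and of equal length")
    total = 0.0
    for y, p in zip(y_true, y_pred):
        risk = math.exp(lam * abs(y - standard))        # risk coefficient
        inner = gamma * max(abs(p) - abs(y), 0.0)       # inner coefficient (relu)
        total += (y - p) ** 2 * risk + inner
    return total / len(y_true)
```

With `lam=0` and `gamma=0` the function reduces to the ordinary MSE. For the same absolute error, an overestimate such as predicting 2.5 for a true value of 2.0 costs more than an underestimate of 1.5, and the same error costs more at a true value of 4.0 than at 1.0.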

5. Model Validation

5.1. Simulation Model Verification

To verify the effect of the track irregularity prediction model and the EIF-MSE loss function proposed in this paper, the model is first verified on simulation data.

The vehicle-track coupling vibration model of a high-speed EMU is established using ANSYS 19.0 and SIMPACK 2018. Taking the line deformation in the 32 m simply supported beam section as the irregularity input, the simulated car body acceleration data are obtained. Then, the data set is constructed from the simulated car body acceleration data and track irregularity data, and MSE and EIF-MSE are each used as the loss function to train the model. The test set data are used to verify the prediction effect of the model, and the results are shown in Figure 3.

The comparison shows that the EIF-MSE loss function proposed in this paper gives a better prediction effect at peak points such as points A and B.

To quantify the effect of the EIF-MSE loss function proposed in this paper, two indexes are constructed: the peak error and the outside error. The peak error is defined as:

\mathrm{PE} = \frac{1}{n} \sum_{m = 1}^{n} \sqrt{{(y_{m} - {\hat{y}}_{m})}^{2}},

(10)

where $n$ is the number of peaks in the test data, and $y_{m}$ and ${\hat{y}}_{m}$ are the real and predicted values at the m-th peak. The peak error evaluates the accuracy of the model for high value prediction.

The outside error is defined as the percentage of cases in which the absolute value of the predicted value exceeds the absolute value of the real value by more than 20% of the latter, as in Formula (11). This percentage reflects the role of the inner coefficient in the EIF-MSE loss function.

\mathrm{OE} = \frac{\mathrm{num} ((|{\hat{y}}_{m}| - |y_{m}|) > 0.2 |y_{m}|)}{n},

(11)
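The two indexes can be sketched in Python as follows. The sketch reads Equation (10) as an average over the n peaks of sqrt((y_m - ŷ_m)²), which equals |y_m - ŷ_m|, and it takes the peak positions as a given list of indices, since the paper does not describe how peaks are located. The names `peak_error`, `outside_error`, `peak_indices`, and `margin` are hypothetical.

```python
def peak_error(y_true, y_pred, peak_indices):
    """PE: mean absolute error over the given peak positions (Equation (10))."""
    n = len(peak_indices)
    return sum(abs(y_true[m] - y_pred[m]) for m in peak_indices) / n


def outside_error(y_true, y_pred, peak_indices, margin=0.2):
    """OE: share of peaks where |prediction| exceeds |truth| by more than margin * |truth|."""
    n = len(peak_indices)
    count = sum(
        1
        for m in peak_indices
        if abs(y_pred[m]) - abs(y_true[m]) > margin * abs(y_true[m])
    )
    return count / n
```

A lower PE indicates better accuracy at the peaks, and a lower OE indicates fewer predictions lying well outside the real value.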

Bi-LSTM, LSTM, and RNN are used for comparison and verification. Each network is trained with the MSE loss function and with the EIF-MSE loss function, and each trained model is then evaluated with PE and OE. The results are shown in Table 2.

The data in the table show that the combination of the Bi-LSTM network and the EIF-MSE loss function gives the best results. When EIF-MSE is used as the loss function, the peak error is smaller than with MSE.

5.2. Verification of Measured Data

To verify the prediction effect of the model under actual working conditions, the real track irregularity and car body acceleration data collected by the comprehensive inspection train of China’s high-speed railway (as shown in Figure 4) are used for verification. During data collection, the train ran at 300 km/h, and the detectable wavelength range of track irregularities was 1.5 to 120 m. The track irregularity detection system and the car body acceleration detection system are deployed on the same carriage, ensuring the synchronization of data collection. The track irregularity detection system uses the inertial reference method and measures through gyroscopes and displacement sensors. Car body acceleration is collected by an acceleration sensor installed on the bottom plate of the vehicle, located on the same cross-section as the track irregularity detection system. The track irregularity and car body acceleration are both sampled at equal intervals of 0.25 m.
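As a quick consistency check, the figures quoted above can be related by simple arithmetic. The following is not stated in the paper; it only combines the 300 km/h speed, the 0.25 m sampling interval, the 500-point window of Section 2.2, and the stated wavelength range, with $\ell$ denoting wavelength and $\Delta x$ the sampling interval (both symbols introduced here for illustration).

```latex
v = 300\ \mathrm{km/h} = \frac{300}{3.6}\ \mathrm{m/s} \approx 83.3\ \mathrm{m/s},
\qquad
f_{s} = \frac{v}{\Delta x} = \frac{83.3\ \mathrm{m/s}}{0.25\ \mathrm{m}} \approx 333\ \mathrm{Hz}

L_{\mathrm{window}} = 500 \times 0.25\ \mathrm{m} = 125\ \mathrm{m}

f = \frac{v}{\ell}: \qquad
\ell = 1.5\ \mathrm{m} \Rightarrow f \approx 55.6\ \mathrm{Hz},
\qquad
\ell = 120\ \mathrm{m} \Rightarrow f \approx 0.69\ \mathrm{Hz}
```

At this speed, one 500-point window therefore corresponds to 125 m of track and about 1.5 s of running time, and the detectable irregularity wavelengths excite the vehicle at roughly 0.7 to 56 Hz.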

In order to construct the data set required for model training, the sliding window method introduced in Section 2.2 is used to truncate the acceleration detection data and irregularity data at the same time. See Figure 5.

The track irregularity data are filtered with the HP filtering algorithm introduced in Section 2.1, and the fluctuation trend of the track irregularity data is extracted as the prediction target. The filtering effect on 50 m of irregularity detection data is shown in Figure 6.

The figure shows that the sequence obtained after HP filtering retains the fluctuation trend of the original sequence well, while the small fluctuations, which need less attention, have been effectively filtered out.

To verify the effect of the Bi-LSTM model on track irregularity prediction, the Bi-LSTM model is compared with the LSTM and RNN models using the same data set and loss function. The features extracted by the CNN are input into these three networks, respectively, and the same fully connected neural network is then used for regression prediction. The loss during the training of these three models is shown in Figure 7. The figure shows that the prediction model based on Bi-LSTM has higher prediction accuracy.

The prediction effect of the three models on a 125 m-long section is shown in Figure 8. The red line is the real irregularity fluctuation of the section, and the blue line is the prediction of each model. The prediction effect of the Bi-LSTM model is clearly better than that of the other two models.

In order to verify the effect of EIF-MSE proposed in this paper, the effects of the risk coefficient and the inner coefficient on the model are verified, respectively.

To verify the effect of the inner coefficient, γ is set to different values with all other conditions unchanged, and the model is trained and tested. The results are shown in Figure 9. At points A and B, the inner fitting effect of the model is better with γ = 0.3 than with γ = 0. Owing to the inner coefficient, there are fewer cases where the predicted value is greater than the true value. Table 3 also shows that, as γ increases, the cases where the predicted value is greater than the real value gradually decrease.

Similarly, to verify the role of the risk coefficient, λ is set to different values with all other conditions unchanged. The results are shown in Figure 10. At the five big value points A, B, C, D, and E, the fitting effect of the model is better with λ = 0.2 than in the other cases. The addition of the risk coefficient thus enables the model to approach the true value more closely at big values. Table 4 also shows that the peak error of the model is smallest when λ = 0.2.

The above experimental results show that the CNN-Bi-LSTM model constructed in this paper gives better prediction results than the CNN-LSTM and CNN-RNN models. The comparison also shows that using EIF-MSE as the loss function predicts big values better than the traditional MSE and reduces the possibility of false positives through the preferential inner fitting.

6. Conclusions

To find a more convenient detection method for track irregularity, this paper proposes a method based on car body acceleration. A prediction algorithm is constructed on the CNN-Bi-LSTM model: it takes the car body acceleration as the model input, extracts features through the CNN and Bi-LSTM networks, and finally obtains point-by-point predictions of track irregularity with a fully connected neural network. To ensure the prediction accuracy of big values and reduce the probability of false positives, this paper proposes an exponential weighted mean square error with priority inner fitting. Experiments show that more accurate prediction results are obtained with the EIF-MSE loss function than with the traditional MSE loss function. The EIF-MSE loss function can also be applied to other prediction tasks that pay more attention to the prediction accuracy of big values.

Author Contributions

Conceptualization, J.Y.; Methodology, J.Y.; Software, J.Y. and J.G.; Investigation, J.Y. and J.G.; Formal Analysis, J.Y.; Writing—Original Draft, J.Y.; Data Curation, J.L. and K.T.; Writing—Review and Editing J.L.; Supervision, K.T.; Funding Acquisition, K.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by the China Academy of Railway Sciences, project number: 2023YJ023.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

Bi-LSTM	Bidirectional Long Short-Term Memory
CNN	Convolutional Neural Networks
CNN-Bi-LSTM	Convolutional Neural Networks-Bidirectional Long Short-Term Memory
EIF-MSE	Exponential Weighted Mean Square Error with Priority Inner Fitting
EMU	Electric Multiple Units
HP filtering	Hodrick Prescott Filter
INS	Inertial Navigation System
KF	Kalman Filter
LSTM	Long Short-Term Memory
MSE	Mean-Square Error
OE	Outside Error
PE	Peak Error
RNN	Recurrent Neural Network

Figure 1. Data set construction method based on sliding window.

Figure 2. Model structure diagram.

Figure 3. Model prediction effect based on simulation data (The positions A and B in the figure represent the peak areas of track irregularity).

Figure 4. High-speed comprehensive inspection train.

Figure 5. Data truncation method based on sliding window.

Figure 6. HP filtering effect.

Figure 7. Comparison of training processes of different models.

Figure 8. Comparison of prediction effects of different models: (a) Comparison of performance with the RNN network; (b) Comparison of performance with the LSTM network; (c) Comparison of performance with the Bi-LSTM network.

Figure 9. Verification of the inner fitting effect (The positions A and B in the figure represent the peak areas of track irregularity).

Figure 10. Verification of the risk coefficient effect (A to E mark five positions with large values).

Table 1. Parameters of models.

Model Level	Parameter
CNN	Conv1D(filters=100, kernel_size=10, activation='tanh')
Bi-LSTM	Bidirectional(LSTM(100, activation='tanh'))
Dense	Dense(60, activation='tanh'); Dense(30, activation='linear'); Dense(1, activation='linear')
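
To give a feel for the size of the network in Table 1, the following minimal sketch counts its trainable parameters in plain Python. It is not the authors' code. It assumes that the layers are stacked in the order listed, that the input has a single acceleration channel (`in_channels=1`, a hypothetical value not stated in this table), that every layer uses a bias, and that the forward and backward LSTM outputs are concatenated, which is the Keras default for a bidirectional wrapper. The helper names are illustrative only.

```python
def conv1d_params(in_channels, filters, kernel_size):
    """Kernel weights plus one bias per filter."""
    return kernel_size * in_channels * filters + filters


def lstm_params(input_dim, units):
    """Four gates, each with input weights, recurrent weights and a bias."""
    return 4 * (units * (input_dim + units) + units)


def dense_params(input_dim, units):
    """Weight matrix plus one bias per output unit."""
    return input_dim * units + units


def count_parameters(in_channels=1):
    """Per-layer trainable parameter counts for the Table 1 stack.

    in_channels is an assumption: the table does not state how many
    acceleration channels are fed to the Conv1D layer.
    """
    counts = {
        "conv1d": conv1d_params(in_channels, filters=100, kernel_size=10),
        # Two LSTMs (forward and backward), each reading the 100 Conv1D features.
        "bi_lstm": 2 * lstm_params(input_dim=100, units=100),
        # Concatenated Bi-LSTM output has 2 * 100 = 200 features (assumed merge mode).
        "dense_60": dense_params(200, 60),
        "dense_30": dense_params(60, 30),
        "dense_1": dense_params(30, 1),
    }
    counts["total"] = sum(counts.values())
    return counts


if __name__ == "__main__":
    for name, n in count_parameters().items():
        print(f"{name:>9}: {n}")
```

Under these assumptions the Bi-LSTM layer holds 160,800 of the 175,821 parameters, so the recurrent part dominates the model size. Changing `in_channels` affects only the Conv1D count.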

Table 2. Comparison of algorithm results.

Metric	RNN (MSE)	RNN (EIF-MSE)	LSTM (MSE)	LSTM (EIF-MSE)	Bi-LSTM (MSE)	Bi-LSTM (EIF-MSE)
PE	0.212	0.075	0.194	0.038	0.173	0.013
OE	16.4%	15%	15.6%	13.8%	12.8%	12.2%
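
The effect of the loss function in Table 2 is easier to read as a change per architecture. The short sketch below recomputes, from the tabulated values only, the relative reduction in PE and the drop in OE (in percentage points) when MSE is replaced by EIF-MSE. The dictionary layout and function name are illustrative, and the LSTM/EIF-MSE OE entry is assumed to be 13.8%, in line with the other OE values.

```python
# Values copied from Table 2 as (PE, OE in percent).
# Assumption: the LSTM / EIF-MSE OE entry is a percentage like the others.
RESULTS = {
    "RNN": {"MSE": (0.212, 16.4), "EIF-MSE": (0.075, 15.0)},
    "LSTM": {"MSE": (0.194, 15.6), "EIF-MSE": (0.038, 13.8)},
    "Bi-LSTM": {"MSE": (0.173, 12.8), "EIF-MSE": (0.013, 12.2)},
}


def compare_losses(results):
    """Return the PE reduction (%) and OE drop (points) per architecture."""
    summary = {}
    for model, by_loss in results.items():
        pe_mse, oe_mse = by_loss["MSE"]
        pe_eif, oe_eif = by_loss["EIF-MSE"]
        summary[model] = {
            "pe_reduction_pct": round(100 * (pe_mse - pe_eif) / pe_mse, 1),
            "oe_drop_points": round(oe_mse - oe_eif, 1),
        }
    return summary


if __name__ == "__main__":
    for model, row in compare_losses(RESULTS).items():
        print(model, row)
```

With the tabulated numbers, the PE reduction is about 64.6% for RNN, 80.4% for LSTM and 92.5% for Bi-LSTM, while OE falls by 1.4, 1.8 and 0.6 percentage points respectively.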

Table 3. Comparison of internal verification.

Parameter	γ = 0	γ = 0.3	γ = 1
OE	36%	30.6%	24.6%

Table 4. Comparison of risk verification.

Parameter	λ = 0	λ = 0.2	λ = 0.15
PE	0.475	0.171	0.276

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yang, J.; Liu, J.; Guo, J.; Tao, K. Track Irregularity Identification Method of High-Speed Railway Based on CNN-Bi-LSTM. Sensors 2024, 24, 2861. https://doi.org/10.3390/s24092861

