Intelligent System for Railway Wheelset Press-Fit Inspection Using Deep Learning

Jwo, Jung-Sing; Lin, Ching-Sheng; Lee, Cheng-Hsiung; Zhang, Li; Huang, Sin-Ming

doi:10.3390/app11178243

Open AccessArticle

Intelligent System for Railway Wheelset Press-Fit Inspection Using Deep Learning

by

Jung-Sing Jwo

^1,2,

Ching-Sheng Lin

¹,

Cheng-Hsiung Lee

^1,*

,

Li Zhang

³ and

Sin-Ming Huang

⁴

¹

Master Program of Digital Innovation, Tunghai University, Taichung 40704, Taiwan

²

Department of Computer Science, Tunghai University, Taichung 40704, Taiwan

³

ZhiQi Railway Equipment Co., Ltd., Taiyuan 030032, China

⁴

Digiwin Software Co., Ltd., Taichung 412031, Taiwan

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2021, 11(17), 8243; https://doi.org/10.3390/app11178243

Submission received: 2 August 2021 / Revised: 1 September 2021 / Accepted: 2 September 2021 / Published: 6 September 2021

(This article belongs to the Special Issue Advanced Digital Technologies for the Integration of Production and Maintenance)

Download

Browse Figures

Versions Notes

Abstract

:

Railway wheelsets are the key to ensuring the safe operation of trains. To achieve zero-defect production, railway equipment manufacturers must strictly control every link in the wheelset production process. The press-fit curve output by the wheelset assembly machine is an essential indicator of the wheelset’s assembly quality. The operators will still need to manually and individually recheck press-fit curves in our practical case. However, there are many uncertainties in the manual inspection. For example, subjective judgment can easily cause inconsistent judgment results between different inspectors, or the probability of human misinterpretation can increase as the working hours increase. Therefore, this study proposes an intelligent railway wheelset inspection system based on deep learning, which improves the reliability and efficiency of manual inspection of wheelset assembly quality. To solve the severe imbalance in the number of collected images, this study establishes a predicted model of press-fit quality based on a deep Siamese network. Our experimental results show that the precision measurement is outstanding for the testing dataset contained 3863 qualified images and 28 unqualified images of press-fit curves. The proposed system will serve as a successful case of a paradigm shift from traditional manufacturing to digital manufacturing.

Keywords:

artificial intelligence; deep learning; intelligent system; industry 4.0; machine learning; railway wheelsets

1. Introduction

In recent years, many leading industrial countries have invested in national plans to support the domestic manufacturing industry’s move towards smart manufacturing to achieve Industry 4.0’s vision. Industry 4.0 includes many technologies and related paradigms, such as the industrial internet of things, cloud-based design, digital technologies, and innovative applications of artificial intelligence (AI) [1]. It has been recognized that AI technologies are impressive and bring many benefits to the enterprise.

The transportation sector, especially the railway sector, has adopted Industry 4.0 to a large extent to improve service quality, reduce cost, and increase resource utilization [2]. For railway transportation, the routine inspection and maintenance of train components and rails are crucial, because neglecting any link is likely to cause major harm. With the development of computer vision techniques, it has become possible for railway systems to use visual inspection technology to replace manual inspection. Liu et al. [3] provided a full review of visual inspection applications based on image processing in the railway industry for rail surface, track component wear, rail component identification, train body components, etc.

Wheelsets are key to ensuring the safe operation of trains because the failure of an axle, a wheel, or one of its bearings will inevitably lead to accidental derailment [4,5,6]. The primary source of damage to railway facilities and vehicles is wheel defects. Possible causes of damage include shaft misalignment, contaminated areas, excessive load, overheating, lack of lubrication, electrical damage, wheel damage, and manufacturing defects, etc. [6]. To develop wayside measurement systems for wheel defect recognition, Zhang et al. [7] utilized the optoelectronic measuring technique to develop a non-contact measurement system capable of measuring the geometric parameters of wheelsets online. Krummenacher et al. [8] proposed two methods to automatically detect wheel defects based on the wheel vertical force, measured by a permanently installed sensor system on the railway network. One method is based on novel wavelet features to classify time series data by support vector machine (SVM); the other method trains the convolutional neural networks (CNN) for different defect types to predict if a wheel has a defect during regular operation. Mosleh et al. [9] established a 3D numerical dynamic model of a vehicle–track coupling system and analyzed the sensitivity and reliability of different sensors and setups for wheel flat detection. In their method, the wheel flat was identified by the envelope spectrum approach with spectral kurtosis analysis by considering the evaluated shear and accelerations in 19 positions as inputs. Gao et al. [10] used a railway wheel flat measurement method based on a parallelogram mechanism to detect wheel flats dynamically and quantitatively. In addition, they established a three-dimensional simulation model based on the rigid–flexible coupled multibody dynamics theory to improve the speed threshold of the measuring mechanism vibration under wheel impact. Zhou et al. [11] proposed a new, long-term monitoring method for wheel flats based on multi-sensor arrays. In their method, the dynamic strain responses of rails are captured by sensor arrays mounted on the rail web to ensure that all the wheels are assessed during the train passage. The above research is mainly focused on detecting possible defects in train wheels due to long-term use or gradual deterioration.

The wheelset is a critical component of the traveling mechanism and must provide precise geometric and mechanical characteristics to minimize dynamic action and avoid derailment [12]. Therefore, whether the wheel and axle can be closely integrated is an essential key to safety inspection in the manufacturing procedure of wheelsets. In railway wheelset manufacturing and maintenance, numerical control (NC) wheelset assembly machines are commonly used to press-fit wheels, and continuously monitor and record pressure changes through the generated force–time or force-displacement curves [13]. The force-displacement curve, also known as the press-fit curve, is an important indicator of the quality of wheelset press-fitting. As insufficient or excessive press-fitting force will lead to safety risks, operators need to monitor and evaluate the assembly quality based on the characteristics of the press-fit curve change at any time [14]. In general, the press-fit quality can meet the standard by using NC wheelset assembly machines, but this is far from enough to achieve the goal of zero-defect production in railway equipment manufacturing. For safety reasons, operators need to recheck press-fit curves manually and individually, instead of judging by press-fit records, as in our practical case.

However, there are many uncertainties in manual inspection because the specific measurement criteria of curve judgment are not suitable for quantitative representation, such as the judgement rules of “flat part” or “evenly rise” [15]. Consequently, there are often ambiguous results in press-fit curve judgment, which lead to different interpretations of the results between inspectors. Figure 1 shows some of the press-fit curves used in this study. It can be seen that there is little difference between the qualified and unqualified images of press-fit curves. In addition, human misinterpretation errors are more likely to happen, due to the long working hours required. Therefore, an intelligent automatic system that can assist the inspector in judging the press-fit curves is urgently needed to improve the reliability and efficiency of wheelset press-fit inspection.

Compared with traditional machine learning technologies, deep learning has recently become a trendy research topic in AI [16,17,18]. This solves the central problems in representation learning by expressing more straightforward representations to enable computers to build complex patterns out of simpler concepts [19]. However, AI technologies still have limitations. Data are key to successful machine-learning algorithms. Insufficient data will lead to failed classification results. However, in the practical cases of the manufacturing industry, it is not easy to collect enough representative abnormal data because normally operating factories are unlikely to experience abnormal conditions frequently. For this reason, the number of abnormal samples that can be collected in this experiment is obviously much lower than for normal samples. The number of unqualified images of the press-fit curve only accounts for 2.3% of all images (109/4754). A description of the number of collected images can be found in Section 4.

At present, many scholars have successfully applied the Siamese network methods in various application fields to suppress the impact of class imbalance on classification performance [20,21,22]. Therefore, this study applies a deep Siamese network to establish the prediction model of press-fit quality, to solve the severe imbalance between positive and negative samples. The main contribution of this paper is a demonstration of the successful application of the deep Siamese network in manufacturing and proposal of an intelligent railway wheelset inspection system, suitable for railway equipment manufacturers. The remainder of this paper is organized as follows. The review of deep Siamese neural network is described in Section 2. Section 3 presents the system architecture of the intelligent railway wheelset inspection system and the deep learning technique used. Section 4 describes experimental results and analysis, and the conclusions are presented in Section 5.

2. Deep Siamese Neural Networks

Deep learning has recently become a trendy research topic in the AI field; it solves the core problems in representation learning by expressing simpler representations [19]. However, when there are almost no available data or a relatively small amount of data, these algorithms often fail to predict accurate results. Under this restriction, many researchers proposed various one-shot learning algorithms, which enable us to make the correct prediction using only one or a few training examples in each class [23,24,25]. At present, theoretical studies and technologies based on the Siamese neural networks (SNN) have been mature and were successfully applied in various fields [26], such as audio and speech signal processing [27,28], remote-sensing scenes [29], biology [30], medicine and health [31,32,33], robotics [34], smart surveillance [35], and text mining [36].

In the manufacturing industry fields, Jalonen et al. developed a visual product tracking system by using the Siamese network method to match the product images at both ends of the tracked process [37]. For tool wear recognition, Kurek et al. applied the Siamese network technique to classify the drill wear states based on images of drilled holes. The proposed automated solution can reduce the time required to manually evaluate the drill state [38]. It is necessary to check whether printed outputs or carved wares are missing or etched by comparing the drawings in the printing and carving industries. To reduce the workforce cost and working hours, Wang et al. presented an effective method of character verification to automatically compare the similarities between the drawing characters and scanned physical characters based on the Siamese network [39].

Koch et al. first applied deep learning based on a convolutional neural network to develop SNN for one-shot classification [24]. The general SNN architecture is shown in Figure 2. An SNN consists of twin networks that accept a pair of images as input and share the same weights. The weights guarantee that their respective networks could not map two extremely similar images to very different locations in feature space, because each network computes under the same function [24]. In this deep SNN architecture with L hidden layers and N_l units,

h_{1}^{l}

represents the hidden vector in layer l for the first twin, and

h_{2}^{l}

denotes the same for the second twin. The notations

x_{1, i}

and

x_{2, i}

represent specific vector elements in two input images. In the distance layer, the difference is calculated between the twin feature vectors

h_{1}^{L}

and

h_{2}^{L}

in the last layer of hidden layers by distance formulas, such as Euclidean distance (

d_{1} = || h_{1, 1}^{L} - h_{2, 1}^{L} ||

₂). After the distance layer, we adopt a fully connected method with a sigmoid activation function to predict the similarity of two input images. Rgardinge the choice of hidden layer network architecture, this study adopts deep residual networks (ResNets) because they have the advantage of efficiently and easily training substantially deeper networks [40].

To evaluate loss function, let S represent the training set size. Let

y (x_{i}, x_{j})

be a length-S vector, which contains the binary labels for the training set, where

y (x_{i}, x_{j}) = 1

if the samples

x_{i}

and

x_{j}

are from the same class, and zero otherwise. The cross-entropy loss function for binary classification is formulated as follows [24]:

ℒ = - (y (x_{i}, x_{j}) \log (p (x_{i}, x_{j})) + (1 - y (x_{i}, x_{j})) \log (1 - p (x_{i}, x_{j}))),

(1)

where

p (x_{i}, x_{j})

is a length-S vector, which contains the probabilities of predicting similarity for any pair of input samples,

x_{i}

and

x_{j}

in the training set.

3. Intelligent Railway Wheelset Inspection System

3.1. System Overview and Description

Humans are the most valuable asset in the manufacturing industry. When developing the system, we should need to account for the human-in-the-loop in interaction [41]. To fully describe the main processes of the system, we roughly divide the system architecture into two parts: cyberspace and physical space. The proposed system architecture is presented in Figure 3.

In cyberspace, the main task is to establish and evaluate the prediction model of press-fit quality. The purpose of the image-preprocessing stage is to automatically segment the press-fit curve region and remove unrelated areas from the original image of the recording press-fit information output using the NC wheelset assembly machine. Figure 4 shows the image preprocessing steps. Figure 4a is an original image, containing the wheelset and press-fit information. Then, we crop the region of interest (ROI) for the press-fit curve at a fixed position in the original image to obtain Figure 4b. The image size of the ROI of the press-fit curve is

400 \times 602

pixels. To convert the image of the press-fit curve into a binary image for subsequent modeling, we first convert the color press-fit image to grayscale and apply a Gaussian low-pass filter to image smoothing to suppress the high-frequency parts of the image. Then, we use grayscale 127 as the threshold value to binarize the smoothed image based on experimental experiences. Pixels with grayscale values below 127 in the image are regarded as candidate objects for the press-fit curve. We take half of the highest 255 grayscales as the binary threshold because, if the threshold value is set too high, over-segmentation will occur. This will allow more false candidate objects to be misjudged as the press-fit curve. However, if the threshold value is set too low, it will cause under-segmentation, which leads to some parts of the press-fit curve being discarded. The initial result after binarization is shown in Figure 4c. In the postprocessing stage, if the area of these objects is too small or the position of an object is obviously not the press-fit curve on image space, such as the straight line above the curve, we will treat these as noise and delete them. The final result is shown in Figure 4d. For a description of the deep Siamese modeling, please refer to the following subsection. Once evaluation and verification of the predictive model are completed, it will be deployed to the enterprise private cloud for remote access by field operators.

In the physical space, the primary focus is on developing technologies and system operational interfaces. The proposed system was implemented by the following techniques. The web front end of this system was developed by AngularJS and other standard technologies, such as HTML, CSS, and JavaScript, to provide an operational interface for on-site operators. Node.js was chosen to build a web server, and RESTful APIs were created to connect the front-end applications with the backend services. Figure 5 shows the query page of the inspection results for the proposed intelligent system. Through this system, operators can upload the original image generated by the NC wheelset assembly machine. Then, the inspection result determined by the prediction model will be sent to the front-end webpage to assist on-site operators in decision-making.

3.2. Deep Learning Architecture Used in This Study

The layout of the used SNN architecture, which is mainly based on the well-known ResNet-50 [40], is shown in Figure 6. It receives a pair of images with an image size of

400 \times 602

pixels as input in the first layer. Each image is then processed through ResNet-50 network architecture. While referring to the network architecture of ResNet-50, we will extract 2048 feature maps after applying 2048 filters at the last convolutional layer. To obtain the length-2048 difference vector, we first obtain 2048 × 1 feature vectors from each twin by applying the global-max pooling operation to the output feature maps from the previous layer, and then calculate Euclidian distance between the twin difference feature vectors. Finally, the neurons in the distance layer are fully connected with one unit and passed to the sigmoid function to measure the degree of similarity between the two input images of press-fit curves.

In general, the loss function may suffer from receiving lots of easily classified samples during the training stage. Whether positive or negative, a sample is called an easy sample when the model distinguishes it as successfully dominated by its high prior probability. This means that the model tends to be dominated by easy samples that contribute little to gradient computing, leading to the model finally being failed by the imbalance [42]. To suppress the impact of the above problem, Lin et al. [43] presented focal loss (FL). FS has proved effective in related research; therefore, this study adopted FS as a loss evaluation function.

To address the class imbalance, a modulating factor

{(1 - p (x_{i}, x_{j}))}^{γ}

was added to the cross-entropy loss function, as defined in Formula (1), with a tunable focusing parameter

γ \geq 0

. Easily classified samples will be down-weighted due to the low values of the modulating factor [42]. The FL can be defined as [43]:

FL = - α {(1 - p (x_{i}, x_{j}))}^{γ} (y (x_{i}, x_{j}) \log (p (x_{i}, x_{j})) + (1 - y (x_{i}, x_{j})) \log (1 - p (x_{i}, x_{j}))),

(2)

where

α

\in [0, 1]

is a weighting factor. To achieve the best experimental results, we set the parameters

α

and

γ

to 0.25 and 2, respectively. In addition, we set the learning rate and batch size to 0.00006 and 8, respectively, in the hyperparameter tuning setting during the training stage.

4. Experimental Results and Analysis

In this experiment, this study collected 4754 images of the press-fit curves; among them, the proportion of unqualified images accounts for 2.3% of the total. All the images in this experiment are divided into two categories, qualified or unqualified, by senior on-site inspectors based on their experience. Table 1 shows the number of qualified and unqualified images for the press-fit curve used in training, validating, and testing, respectively.

To quantitatively evaluate the overall performance of the proposed intelligent system to recognize press-fit curves in railway wheelset press-fit assembly, the following three measurements are adopted: accuracy, precision, and recall. Let TQ, TU, FQ, and FU represent “true qualified”, “true unqualified”, “false qualified”, and “false unqualified”, respectively, in the confusion matrix, as shown in Table 2.

The test result of a qualified condition is either qualified (TQ) or unqualified (FQ), while the test result of an unqualified condition is either qualified (FU) or unqualified (TU). The accuracy is the proportion of both true qualified and true unqualified for press-fit curves in all test results. It is the overall correct classification rate of all test results. Precision, also known as precision rate, is the proportion of all qualified test results that are truly qualified press-fit curves. The recall represents the probability of classifying the press-fit curve as a qualified condition if it is truly qualified. The definitions for the above measurements are listed below [44]:

Accuracy = (TQ + TU)/(TQ + TU + FQ + FU),

(3)

Precision = TQ/(TQ + FQ),

(4)

Recall = TQ/(TQ + FU).

(5)

For the classification results of the 3891 press-fit curves in the testing dataset, the TQ, TU, FQ, and FU are 3255, 28, 0, and 608, respectively. The accuracy is 84.37% ((3255 + 28)/3891), and the results of precision and recall are 100% (3255/(3255 + 0)) and 84.26% (3255/(3255 + 608)), respectively. Among the three efficiency evaluation indicators, we can see that the precision performance is quite outstanding. The proposed intelligent system can successfully detect all unqualified images. This is a significant indicator of railway equipment manufacturers that strictly control abnormal events during the wheelset production process. The proposed method is rigorous in identifying whether the press-fit curve is unqualified, to avoid the possibility of false qualified (FQ). However, this will increase the probability of false unqualified (FU) results, leading to decreased accuracy and recall. In addition, for safety reasons, if operators want to recheck press-fit images manually, they only need to check the images classified as unqualified by the proposed system, thereby reducing the effort of manual inspection.

5. Conclusions

The press-fit curve is an important indicator of the quality of the railway wheelset press-fitting. However, effectively improving human misinterpretation errors during manual inspection has always been a problem, which railway equipment manufacturers urgently need to solve. To this end, this study developed an intelligent railway wheelset inspection system to assist the operators in objectively and effectively judging the press-fit curves.

In practice, the press-fit quality of most wheelsets in the wheelset production process can meet the standard using NC wheelset assembly machines. Abnormal events account for a minimal number of the total. Although the number of unqualified abnormal events is rare, they are likely to cause major traffic accidents when they occur. Therefore, the proposed system must be robust in detecting all unqualified samples. As abnormal samples is rare, the number of unqualified images of the press-fit curve that can be collected in this experiment is much lower than the number of qualified images. In order to suppress the impact of class imbalance on classification performance, this study applied a deep Siamese network with focal loss to establish a prediction model of press-fit quality. The experimental results show that the precision measurement of the proposed system can reach 100% for the testing dataset, which contains 3863 qualified and 28 unqualified images of the press-fit curves. The proposed intelligent system can successfully detect all unqualified cases.

The currently proposed system was gradually launched and tested in the manufacturing site, which is sufficient to prove that the system architecture, method, and technologies used in the implementation process proposed by this study can be provided as an essential reference for relevant researches and applications. Our results can also provide a successful case of paradigm shift from traditional manufacturing to digital manufacturing. In future work, we will continue to collect more images of press-fit curves and improve the two performance indicators of accuracy and recall through more training and testing.

Author Contributions

Conceptualization, J.-S.J.; Investigation, J.-S.J., C.-S.L. and C.-H.L.; Methodology, J.-S.J., C.-S.L. and C.-H.L.; Project administration, J.-S.J. and L.Z.; Software, S.-M.H.; Writing—original draft, C.-S.L. and C.-H.L.; Writing—review and editing, C.-H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Thames, L.; Schaefer, D. Software-defined Cloud Manufacturing for Industry 4.0. Procedia CIRP 2016, 52, 12–17. [Google Scholar] [CrossRef] [Green Version]
Sahal, R.; Breslin, J.G.; Ali, M.I. Big data and stream processing platforms for Industry 4.0 requirements mapping for a predictive maintenance use case. J. Manuf. Syst. 2020, 54, 138–151. [Google Scholar] [CrossRef]
Liu, S.; Wang, Q.; Luo, Y. A review of applications of visual inspection technology based on image processing in the railway industry. Transp. Saf. Environ. 2019, 1, 185–204. [Google Scholar] [CrossRef] [Green Version]
Bracciali, A. Railway Wheelsets: History, Research and Developments. Int. J. Railw. Technol. 2016, 5, 23–52. [Google Scholar] [CrossRef]
Lu, J.; Xiao, J.; Gao, D.-J.; Zong, S.-Y.; Li, Z. Research on Standard and Automatic Judgment of Press-fit Curve of Locomotive Wheel-set Based on AAR Standard. IOP Conf. Ser. Mater. Sci. Eng. 2018, 326, 012010. [Google Scholar] [CrossRef] [Green Version]
Entezami, M.; Roberts, C.; Weston, P.; Stewart, E.; Amini, A.; Papaelias, M. Perspectives on railway axle bearing condition monitoring. Proc. Inst. Mech. Eng. Part F J. Rail Rapid Transit 2019, 234, 17–31. [Google Scholar] [CrossRef]
Zhang, Z.-F.; Gao, Z.; Liu, Y.-Y.; Jiang, F.-C.; Yang, Y.-L.; Ren, Y.-F.; Yang, H.-J.; Yang, K.; Zhang, X.-D. Computer Vision Based Method and System for Online Measurement of Geometric Parameters of Train Wheel Sets. Sensors 2011, 12, 334–346. [Google Scholar] [CrossRef] [Green Version]
Krummenacher, G.; Ong, C.S.; Koller, S.; Kobayashi, S.; Buhmann, J.M. Wheel Defect Detection With Machine Learning. IEEE Trans. Intell. Transp. Syst. 2017, 19, 1176–1187. [Google Scholar] [CrossRef]
Mosleh, A.; Montenegro, P.; Costa, P.; Calçada, R. Railway Vehicle Wheel Flat Detection with Multiple Records Using Spectral Kurtosis Analysis. Appl. Sci. 2021, 11, 4002. [Google Scholar] [CrossRef]
Gao, R.; He, Q.; Feng, Q. Railway Wheel Flat Detection System Based on a Parallelogram Mechanism. Sensors 2019, 19, 3614. [Google Scholar] [CrossRef] [Green Version]
Zhou, C.; Gao, L.; Xiao, H.; Hou, B. Railway Wheel Flat Recognition and Precise Positioning Method Based on Multisensor Arrays. Appl. Sci. 2020, 10, 1297. [Google Scholar] [CrossRef] [Green Version]
Spiryagin, M.; Wolfs, P.; Cole, C.; Spiryagin, V.; Sun, Y.Q.; McSweeney, T. Design and Simulation of Heavy Haul Locomotives and Trains; CRC Press: Boca Raton, FL, USA, 2016. [Google Scholar]
You, B.; Lou, Z.; Luo, Y.; Xu, Y.; Wang, X. Prediction of Pressing Quality for Press-Fit Assembly Based on Press-Fit Curve and Maximum Press-Mounting Force. Int. J. Aerosp. Eng. 2015, 2015, 1–10. [Google Scholar] [CrossRef]
Wang, X.; Lou, Z.; Wang, X.; Xu, C. A new analytical method for press-fit curve prediction of interference fitting parts. J. Mater. Process. Tech. 2017, 250, 16–24. [Google Scholar] [CrossRef]
Xiao, J.; Han, J.-B.; Cheng, X.; Fang, R. Research on Automatic Judgement of Wheelset Press-Fit Curve. Appl. Mech. Mater. 2012, 236-237, 1321–1326. [Google Scholar] [CrossRef]
Lee, C.-H.; Jwo, J.-S.; Hsieh, H.-Y.; Lin, C.-S. An Intelligent System for Grinding Wheel Condition Monitoring Based on Machining Sound and Deep Learning. IEEE Access 2020, 8, 58279–58289. [Google Scholar] [CrossRef]
Lee, C.-H.; Lai, T.-S. An Intelligent System for Improving Electric Discharge Machining Efficiency Using Artificial Neural Network and Adaptive Control of Debris Removal Operations. IEEE Access 2021, 9, 75302–75312. [Google Scholar] [CrossRef]
Jwo, J.-S.; Lin, C.-S.; Lee, C.-H. Smart technology—Driven aspects for human-in-the-loop smart manufacturing. Int. J. Adv. Manuf. Technol. 2021, 114, 1741–1752. [Google Scholar] [CrossRef]
Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
Bedi, P.; Gupta, N.; Jindal, V. Siam-IDS: Handling class imbalance problem in Intrusion Detection Systems using Siamese Neural Network. Procedia Comput. Sci. 2020, 171, 780–789. [Google Scholar] [CrossRef]
Mac, B.; Moody, A.R.; Khademi, A. Siamese Content Loss Networks for Highly Imbalanced Medical Image Segmentation. In Medical Imaging with Deep Learning; PMLR: New York, NY, USA, 2020; pp. 503–514. [Google Scholar]
Wu, S.; Wu, Y.; Cao, D.; Zheng, C. A fast button surface defect detection method based on Siamese network with imbalanced samples. Multimed. Tools Appl. 2019, 78, 34627–34648. [Google Scholar] [CrossRef]
Li, F.-F.; Fergus, R.; Perona, P. One-shot learning of object categories. IEEE Trans. Pattern Anal. Mach. Intell. 2006, 28, 594–611. [Google Scholar] [CrossRef] [Green Version]
Koch, G.; Zemel, R.; Salakhutdinov, R. Siamese neural networks for one-shot image recognition. In Proceedings of the Deep Learning Workshop, ICML’15, Paris, France, 10–11 July 2015; Volume 2. Available online: https://sites.google.com/site/deeplearning2015/ (accessed on 1 September 2021).
Rao, S.-J.; Wang, Y.; Cottrell, G.W. A Deep Siamese Neural Network Learns the Human-Perceived Similarity Structure of Facial Expressions Without Explicit Categories. CogSci 2016. Available online: https://cogsci.mindmodeling.org/2016/papers/0050/paper0050.pdf (accessed on 1 September 2021).
Chicco, D. Siamese Neural Networks: An Overview. In Artificial Neural Networks. Methods in Molecular Biology; Cartwright, H., Ed.; Humana: New York, NY, USA, 2021. [Google Scholar]
Zhang, Y.; Pardo, B.; Duan, Z. Siamese Style Convolutional Neural Networks for Sound Search by Vocal Imitation. IEEE/ACM Trans. Audio Speech Lang. Process. 2018, 27, 429–441. [Google Scholar] [CrossRef]
Manocha, P.; Badlani, R.; Kumar, A.; Shah, A.P.; Elizalde, B.; Raj, B. Content-Based Representations of Audio Using Siamese Neural Networks. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 15–20 April 2018; pp. 3136–3140. [Google Scholar]
Liu, X.; Zhou, Y.; Zhao, J.; Yao, R.; Liu, B.; Zheng, Y. Siamese Convolutional Neural Networks for Remote Sensing Scene Classification. IEEE Geosci. Remote. Sens. Lett. 2019, 16, 1200–1204. [Google Scholar] [CrossRef]
Zheng, W.; Yang, L.; Genco, R.J.; Wactawski-Wende, J.; Buck, M.; Sun, Y. SENSE: Siamese neural network for sequence embedding and alignment-free comparison. Bioinformatics 2019, 35, 1820–1828. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Zeng, X.; Chen, H.; Luo, Y.; Ye, W. Automated Diabetic Retinopathy Detection Based on Binocular Siamese-Like Convolutional Neural Network. IEEE Access 2019, 7, 30744–30753. [Google Scholar] [CrossRef]
Shorfuzzaman, M.; Hossain, M.S. MetaCOVID: A Siamese neural network framework with contrastive loss for n-shot diagnosis of COVID-19 patients. Pattern Recognit. 2021, 113, 107700. [Google Scholar] [CrossRef] [PubMed]
Li, M.-D.; Chang, K.; Bearce, B.; Chang, C.-Y.; Huang, A.-J.; Campbell, J.P.; Brown, J.M.; Singh, P.; Hoebel, K.V.; Erdoğmuş, D.; et al. Siamese neural networks for continuous disease severity evaluation and change detection in medical imaging. NPJ Digit. Med. 2020, 3, 1–9. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Utkin, L.V.; Zaborovsky, V.S.; Popov, S.G. Siamese neural network for intelligent information security control in multi-robot systems. Autom. Control. Comput. Sci. 2017, 51, 881–887. [Google Scholar] [CrossRef]
Ullah, A.; Muhammad, K.; Haydarov, K.; Haq, I.U.; Lee, M.; Baik, S.W. One-Shot Learning for Surveillance Anomaly Recognition using Siamese 3D CNN. In Proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, UK, 19–24 July 2020; pp. 1–8. [Google Scholar]
Zhu, W.; Yao, T.; Ni, J.; Wei, B.; Lu, Z. Dependency-based Siamese long short-term memory network for learning sentence representations. PLoS ONE 2018, 13, e0193919. [Google Scholar] [CrossRef] [Green Version]
Jalonen, T.; Laakom, F.; Gabbouj, M.; Puoskari, T. Visual Product Tracking System Using Siamese Neural Networks. IEEE Access 2021, 9, 76796–76805. [Google Scholar] [CrossRef]
Kurek, J.; Antoniuk, I.; Świderski, B.; Jegorowa, A.; Bukowski, M. Application of Siamese Networks to the Recognition of the Drill Wear State Based on Images of Drilled Holes. Sensors 2020, 20, 6978. [Google Scholar] [CrossRef] [PubMed]
Wang, S.; Lv, X.; Li, R.; Yu, C.; Dong, J. Characters Verification via Siamese Convolutional Neural Network. In Proceedings of the 2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC), Jinan, China, 14–17 December 2018; pp. 417–420. [Google Scholar]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Jwo, J.-S.; Lin, C.-S.; Lee, C.-H. An Interactive Dashboard Using a Virtual Assistant for Visualizing Smart Manufacturing. Mob. Inf. Syst. 2021, 2021, 1–9. [Google Scholar] [CrossRef]
Zhao, Y.; Jiang, M.; Kong, J.; Li, S. Paralleled attention modules and adaptive focal loss for Siamese visual tracking. IET Image Process. 2021, 15, 1345–1358. [Google Scholar] [CrossRef]
Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal loss for dense object detection. In Proceedings of the 2017 IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2980–2988. [Google Scholar]
Lin, P.-L.; Huang, P.-W.; Lee, C.-H.; Wu, M.-T. Automatic classification for solitary pulmonary nodule in CT image by fractal analysis based on fractional Brownian motion model. Pattern Recognit. 2013, 46, 3279–3287. [Google Scholar] [CrossRef]

Figure 1. Some examples of force-displacement (press-fit) curves. Figures (a–c) are the qualified images of press-fit curves, and Figures (d–f) are the unqualified examples. In the figures, the y-axis is the press-mounting force, and the x-axis is its corresponding displacement.

Figure 2. An architecture of simple L hidden layers of Siamese network for binary classification. In the twin networks, the weight matrices are shared at each layer [24].

Figure 3. The system architecture of the proposed intelligent railway wheelset inspection system.

Figure 4. An output diagram of each stage in the image-preprocessing process. (a) The original image of the recording press-fit information output by NC wheelset assembly machines; (b) The ROI of the press-fit curve was cropped from the original image; (c) The initial binarization result of (b); (d) The final result after postprocessing.

Figure 5. The query page of inspection results for the proposed system.

Figure 6. The architecture of the SNN is used in this study. The two images from the same class are called a positive pair, and those from the different classes are called a negative pair. This network is mainly based on the well-known ResNet-50.

Table 1. Description of experimental datasets.

Dataset	Training Dataset	Validating Dataset	Testing Dataset	Total
Qualified images	373	409	3863	4645
Unqualified image	26	55	28	109
Total	399	464	3891	4754

Table 2. Cross-relations between test and actual results.

Confusion Matrix (Q: Qualified; U: Unqualified)		Actual Results
Confusion Matrix (Q: Qualified; U: Unqualified)		Q	U
Test results	Q	TQ (3255)	FQ (0)
Test results	U	FU (608)	TU (28)

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jwo, J.-S.; Lin, C.-S.; Lee, C.-H.; Zhang, L.; Huang, S.-M. Intelligent System for Railway Wheelset Press-Fit Inspection Using Deep Learning. Appl. Sci. 2021, 11, 8243. https://doi.org/10.3390/app11178243

AMA Style

Jwo J-S, Lin C-S, Lee C-H, Zhang L, Huang S-M. Intelligent System for Railway Wheelset Press-Fit Inspection Using Deep Learning. Applied Sciences. 2021; 11(17):8243. https://doi.org/10.3390/app11178243

Chicago/Turabian Style

Jwo, Jung-Sing, Ching-Sheng Lin, Cheng-Hsiung Lee, Li Zhang, and Sin-Ming Huang. 2021. "Intelligent System for Railway Wheelset Press-Fit Inspection Using Deep Learning" Applied Sciences 11, no. 17: 8243. https://doi.org/10.3390/app11178243

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Intelligent System for Railway Wheelset Press-Fit Inspection Using Deep Learning

Abstract

1. Introduction

2. Deep Siamese Neural Networks

3. Intelligent Railway Wheelset Inspection System

3.1. System Overview and Description

3.2. Deep Learning Architecture Used in This Study

4. Experimental Results and Analysis

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI