Development of Intelligent Fault Diagnosis Technique of Rotary Machine Element Bearing: A Machine Learning Approach

Saha, Dip Kumar; Hoque, Md. Emdadul; Badihi, Hamed

doi:10.3390/s22031073

Open AccessArticle

Development of Intelligent Fault Diagnosis Technique of Rotary Machine Element Bearing: A Machine Learning Approach

by

Dip Kumar Saha

¹

,

Md. Emdadul Hoque

²

and

Hamed Badihi

^3,*

¹

Department of Mechatronics Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh

²

Department of Mechanical Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh

³

College of Automation Engineering, Nanjing University of Aeronautics and Astronautics (NUAA), Nanjing 211106, China

^*

Author to whom correspondence should be addressed.

Sensors 2022, 22(3), 1073; https://doi.org/10.3390/s22031073

Submission received: 26 December 2021 / Revised: 17 January 2022 / Accepted: 27 January 2022 / Published: 29 January 2022

(This article belongs to the Collection Recent Advances in Fault Diagnostics, Prognostics, and Intelligent Condition-Based Maintenance)

Download

Browse Figures

Versions Notes

Abstract

:

The bearing is an essential component of a rotating machine. Sudden failure of the bearing may cause an unwanted breakdown of the manufacturing plant. In this paper, an intelligent fault diagnosis technique was developed to diagnose various faults that occur in a deep groove ball bearing. An experimental setup was designed and developed to generate faulty data in various conditions, such as inner race fault, outer race fault, and cage fault, along with the healthy condition. The time waveform of raw vibration data generated from the system was transformed into a frequency spectrum using the fast Fourier transform (FFT) method. These FFT signals were analyzed to detect the defective bearing. Another significant contribution of this paper is the application of a machine learning (ML) algorithm to diagnose bearing faults. The support vector machine (SVM) was used as the primary algorithm. As the efficiency of SVM heavily depends on hyperparameter tuning and optimum feature selection, the particle swarm optimization (PSO) technique was used to improve the model performance. The classification accuracy obtained using SVM with a traditional grid search cross-validation (CV) optimizer was 92%, whereas the improved accuracy using the PSO-based SVM was found to be 93.9%. The developed model was also compared with other traditional ML techniques such as k-nearest neighbor (KNN), decision tree (DT), and linear discriminant analysis (LDA). In every case, the proposed model outperformed the existing algorithms.

Keywords:

support vector machine (SVM); particle swarm optimization (PSO); fault diagnosis; ball bearing; machine learning (ML)

1. Introduction

Machines are the heart of any industrial unit or manufacturing plant. Numerous types of machinery are available in the industry. The profit of any production plant is highly dependent on the available runtime of machines. A reduction in downtime is essential to increase the company’s profit margin because maintenance costs carry about 15–20% of total production costs [1]. However, it is a reality that almost 30% of maintenance costs are simply wasted due to improper maintenance strategy adoption and failure to perform maintenance at appropriate times. The sudden collapse of machine components may lead to substantial production losses. Proper condition monitoring of these components is essential to ensure the uninterrupted operation of industries. Condition monitoring deals with both present and past aspects of the machines. Various forms of information such as vibration, noise, temperature, current drawn by the motor, and lubricating oil conditions are obtained from the machines during this process. Obviously, this information can play a significant role in developing a suitable maintenance strategy.

Although every machine is essential to keep the plant in operation, only 5–10% of machinery is considered most critical. Compressors, turbines, boilers, generators, and motors are some of the examples of essential components. Statistics show that electrical motors are used as prime movers in more than 90% of the mechanical drives such as gearboxes, compressors, and pumps [2]. The failure in these motors typically falls in the category of bearing-related faults, stator winding associated faults, and rotor-related faults. Among these faults, most of them, almost 40%, occur due to bearing-related issues [3].

Many diagnosis techniques have been developed on the basis of vibration analysis [4], temperature analysis [5], wear debris analysis, motor current signature analysis [6], and acoustic emission analysis [7]. Among these techniques, vibration analysis is considered the most effective, as it can provide significant information about anomalies. A wide-ranging literature investigation on condition monitoring systems (CMSs) thorough vibration analysis was carried out to find possible research gaps. In 2019, Leao et al. developed a fault detection method using the model state observer approach [8]. This method was used for detecting transversal cracks in a horizontal rotating shaft. The authors extracted time-domain features from a raw vibration signal. However, the main problem of this method is that it is not suitable for diagnosing faults when multiple faults occur at the same frequency. Zarei et al. used an intelligent filter for the classification of bearing faults. Three types of faults, inner race fault, outer race fault, and double hole in the outer race, were detected using this technique. The external noise did not have any impact on the developed model performance, which was a vital achievement of this work. having said that, the mandatory requirement of domain knowledge was the drawback of this model [9]. In 2017, Marins and his team developed a classification model based on a similarity-based approach. A publicly available dataset from CWRU (Case Western Reserve University) was used to validate this model. This dataset consists of bearing fault data at different sampling frequencies. In this case, domain knowledge is important [10]. The majorization–minimization-based compound fault diagnosis technique was developed by Hao et al. in 2019 [11]. This method became popular due to its capability of detecting compound faults. However, its runtime was pretty high. A decision tree-based bearing fault classification method was developed by Tahi et al. in [12] to detect misalignment, bearing defects, and unbalancing issues. The genetic wrapper was used in this model for feature selection purposes, where kurtosis and crest factor were considered as the main features. Wind turbine gearbox faults were diagnosed in [13] by Inturi et al., where discrete wavelet transform (DWT) was used as the main feature. A decision tree algorithm was used to build the classification model [13]. However, their classification work was limited to the identification of inner and outer race faults, and other types of faults were not considered in this work. Features are the main resource of the machine learning model. However, excessive features sometimes create complexity in model development. On that note, feature reduction can play a vital role in improving the accuracy of classification. Zhao et al. developed a model through a feature reduction technique with global–local margin Fisher analysis [14]. Here, a Euclidean weighted K-nearest neighbor algorithm was mainly used as a classification algorithm. In 2015, Gowid et al. [15] developed a novel condition monitoring technique using acoustic emission. This model was applied to the industrial blower dataset to identify the fault. In this model, spectral features were extracted from raw acoustic emission signals. Rauber et al. [16] used envelop spectrum with statistical time- and frequency-domain features to diagnose the fault in bearing. However, this approach produces a large number of heterogeneous features and, at the same time, creates irrelevant and redundant information that generates complexity in the model and ultimately increases the computational cost [16]. Frequency spectrum [17] and pattern recognition [18] are the two most critical analyzing features for identifying bearing faults. However, the superiority of pattern recognition over the spectrum approach was shown in [19]. In machine learning model development, feature selection is considered as one of the most critical factors. Sometimes, irrelevant features create complexity in model development. To extract the optimum feature subset from raw datasets, the minimum redundancy maximum relevance (nRMR) method was developed in [20]. Due to its effectiveness in real-time analysis and excellent generalization performance, the support vector machine (SVM) was successfully applied by He et al. for signal processing purposes [21] and by Chen et al. for pattern recognition [22]. In SVM, kernel function selection is an important task [23]. Various types of kernels are available to transform the low-dimensional space into a high-dimensional space. Having said that, due to its capability in approximating nonlinear functions, the radial basis function (RBF) is considered the most effective. During model development, another important issue is the so-called hyperparameter tuning. The most convenient approach that researchers usually follow is the trial-and-error method. The main drawback, however, is that it involves a manual iteration process that ultimately increases the computational time. Another approach often followed by researchers is using a grid search CV; however, it suffers from a low operating speed and lower accuracy. This is due to the handling of a large number of parameter combinations. Another technique is using evolutionary algorithms that belong to the metaheuristic family [24]. Huang et al. [25] proposed an optimization method based on genetic algorithm (GA) to optimize the SVM parameter and feature selection process. Lee et al. [26] developed a bearing fault diagnosis method based on ensemble empirical mode decomposition and principle component analysis. In this paper, a particle swarm optimization technique was used to improve the SVM model’s computational efficiency, which is one of the main goals of this research. The PSO has a quick convergence capability, parallel computation ability, and very fey parameters to handle [27]. We are now passing into the era of big data and the Internet of things (IoT). To handle these huge data is not easy, especially for industrial applications, where there is a wide variation in data characteristics. Data fusion is one of the prominent processes that can minimize the problem to some extent [28]. Luwei et al. developed an integrated fault detection framework using the data fusion process of frequency-domain data [29]. In 2020, Wei et al. developed a fault diagnosis of a complex system using the data fusion technique [30]. In this paper, the faults were identified at the earliest time when the defect occurred. In the same year, a composite spectrum-based multisensor data fusion technique was adopted by Akilu and Ruifeng to detect faults on rotating machinery [31]. Banerjee et al. developed a motor fault detection technique using SVM as the primary algorithm with a multisensory data fusion technique [32]. Although it was published in 2012, its contribution is still considered a remarkable one. Cao et al. developed a gear fault detection model with data fusion in 2021 [33]. Carlos et al. proposed a vibration-based fault diagnosis technique for induction motors using the orthogonal matching pursuit algorithm [34].

One main challenge in developing an effective condition monitoring technique is the selection of a proper approach. The maximum model-based approach provides lower accuracy when the machine is complex, whereas the data-driven machine learning approach is promising. A suitable dataset is the main requirement of building a classification algorithm. There are a few popular datasets such as the “NASA repository” and the “PROGNOSTIA” experimental platform. However, in this research, a fresh dataset was used. This setup created opportunities to extract data in various operating conditions. This unit system consisted of a three-phase induction motor with variable-speed operating capability, rotary machine elements bearing, and a power transmission system. One of the system’s major advantages is its very low or no self-vibration feature. The bearing was mounted in the system in such a way that it could be easily replaced on a plug-and-play basis. There was also scope to change the balancing mass of the system. Although only bearing fault diagnosis was considered in this work, it is possible to generate gear pitting fault data using this system. This paper also uses some time-domain features which were not investigated previously.

The main contributions of the paper can be outlined as follows:

An experimental setup is designed and developed for real-time data generation purposes based on which the proposed ML model is developed.
A novel hybrid PSO–SVM model is proposed for improving the generalization capability of the SVM algorithm through optimization of hyperparameters (C and γ) of the radial basis function (rbf).
The PSO optimization technique is used to improve the classification accuracy of different bearing faults.
A comparative analysis is shown with popular machine learning algorithms to validate the proposed model. Results show a significant improvement in model performance due to the introduction of PSO.

2. Experimental Setup

The development of the experimental setup was one of the major challenges of this research work. It was essential to ensure the reliability of various components used for the development of this setup. It has always been maintained to use quality components because a quality system can only produce quality data. The efficiency of the machine learning approach is heavily dependent on data quality. A photograph of the real experimental setup is shown in Figure 1. A three-phase induction motor with 0.5 Hp rated power and up to 6000 RPM rotating speed was used as the driving source of the experimental design. A variable frequency drive (VFD) was installed with a rated frequency of 0 to 599 Hz, just before the induction motor, to run the machine in different operating conditions. The power supply from the primary power source was provided to the motor through this VFD. The shaft was connected with the motor through a coupling. Four bearings carried both the static and dynamic load of the shaft. The bottom structure was constructed with stainless steel to increase the load-carrying capacity and avoid rust problems. Flexible coupling and mounting systems were adopted for both the bearing and shaft to address easy replacement purposes. Although it was possible to create faults in all six bearings, only one bearing was considered to maintain a benchmark and avoid complexity in data generation. A vibration analyzer was used with a measuring range of ±5 V AC, a frequency range of 2 to 10 kHz, and a maximum sampling frequency of 25.6 kHz. The acceleration data were measured with the help of a Ronds accelerometer sensor with a measuring range of ±80 g, frequency range of 0.7 to 10,000 Hz, and resonance frequency of about 30 kHz.

3. Methodology

The overall methodology of the paper can be segregated in several stages as shown in Figure 2. All stages are explained below.

(a): Data generation: This is the most essential part of this methodology section. As already mentioned, in order to avoid complexity in data generation, only one bearing was considered as the testing bearing. The common bearing faults that occurred in deep groove ball bearing are “ball fault” (BF), “outer race fault” (OR), “inner race fault” (IF), and “cage fault” (CF), as shown in Figure 3. All the healthy and faulty bearing system data were collected according to the predefined data generation plan, as shown in Table 1. Here, Outer_Race_10 (OR-10) denotes the dataset of outer race faults collected at 10 Hz rotational frequency. The generated vibration data were collected using an accelerometer sensor mounted to the bearing housing. All faults were seeded manually in our workshop.
(b): Signal conditioning: It is not always possible to have experimental data in the desired format. Usually, the data accusation system maintains the data in a time waveform. This time waveform signal is then converted into “.txt” and ultimately into “.csv” formats using MOS 3000 software (provided by Anhui Ronds Science and Technology Inc., Anhui, China).
(c): Feature extraction: This is one of the most critical steps when developing any ML model, as the model’s performance primarily depends on the selection of features. Various time-domain features such as kurtosis, crest factor, and form factors, are extracted from raw time-domain signals using their corresponding governing equations [35]. A feature matrix was prepared with extracted time-domain features.
(d): Model development: This is the core part of this methodology section. A section dedicated to the machine learning approach for fault detection is introduced in Section 4 to give a detailed idea about the mathematical model.
(e): Model prediction: Lastly, the model was used to predict the classification results in terms of accuracy, precision, recall value, and F1 score.

4. Machine Learning Approach for Fault Detection

4.1. Support Vector Machine (SVM)

In this paper, a multiclass support vector machine (SVM) was used as the classification algorithm. SVM works on the principle of structural risk management (SRM) [36]. It is best suited for classification tasks when the sample size is not too big. The primary concept is to find the optimal hyperplane that differentiates the training data into two binary classes by maximizing the margin. In a multiclass classification problem, the datasets other than the target one are combined to create a binary classification situation. The data points located near imaginary lines are known as support vectors. For any given sets of data (u₁, v₁), (u₂, v₂),…, (u_n, v_n),

u_{i} \in R^{n}

are considered as inputs, where u and v represent respective data points of support vectors, and

v_{i} \in (- 1, + 1)

are the outputs for each u_i. The equation of the hyperplane can be represented as follows:

(w \cdot u) + b = 0,

(1)

where w is a vector with dimension N, and b is a scalar quantity. In more generalized forms, the equations of the hyperplane for v_i = 1 and v_i = −1 can be represented, respectively, as follows:

(w \cdot u_{i}) + b \geq 1,

(2)

(w \cdot u_{i}) + b \leq 1 .

(3)

For any positive margin,

(w^{T} \cdot u_{2}) + b = + 1 .

(4)

For any negative margin,

(w^{T} \cdot u_{1}) + b = - 1 .

(5)

By subtracting Equation (5) from Equation (4), one gets the maximum margin as follows:

w^{T} (u_{2} - u_{1}) = 2,

(6)

\frac{w^{T}}{‖ w ‖} \cdot (u_{2} - u_{1}) = \frac{2}{‖ w ‖} .

(7)

This is the optimized function that needs to be maximized.

Error optimization: For any new testing data v_i, one gets

v_{i} * (w^{T} \cdot u_{i}) + b_{i} \geq 1 .

(8)

If the above condition is not satisfied, it indicates a case of misclassification. In that case, it is necessary to include the error term to calculate w and b.

(w^{*}, b^{*}) = \min \frac{‖ w ‖}{2} + C_{i} \sum_{i = 1}^{n} ε_{i},

(9)

where C is the error penalty, and

ε

is the slack variable.

4.2. Particle Swarm Optimization (PSO)

The concept of particle swarm optimization (PSO) came from analyzing the social behavior of animal groups. Research has shown that some animals such as birds and fishes can share information with their group members while traveling. This finding inspired Kennedy and Eberhart to develop PSO, a metaheuristic algorithm that can optimize nonlinear continuous functions [37]. The idea of swarm intelligence is typically observed in flocks and shoals, i.e., groups of animals.

PSO aims to determine a variable denoted by

X = [x 1, x 2, x 3, \dots \dots x n]

that optimizes the parameters on the basis of function

f (x)

. Here,

f (x)

is a fitness function or objective function; X represents an n-dimensional position vector, where n is the number of variables. The position of each particle is assessed on the basis of the value of the fitness function. The PSO is conducted using the following equations:

The particle velocity is updated by

v_{i j}^{*} (t + 1) = w^{*} v_{i j}^{*} (t) + c_{1} r_{1} (p b s t_{i j} - x_{i j}^{*} (t)) + c_{2} r_{2} (g b s t_{i j} (t) - x_{i j}^{*} (t)) .

(10)

The particle position is updated by

x_{i j}^{*} (t + 1) = x_{i j}^{*} (t) + v_{i j}^{*} (t + 1) .

(11)

The fitness function, which indicates the average classification of the model, is calculated by

f (x) = \frac{1}{n} \sum_{n = 1}^{n = n} \frac{TP + TN}{TP + TN + FP + FN},

(12)

where

v_{i j}^{*} (t)

is the i-th particle’s velocity at the t-th iteration,

p b s t_{i j} (t)

indicates the best position of the particle, and

g b s t_{i j} (t)

indicates the particle’s best position among all. TP means “true positive”, TN means “true Negative”, FP means “false positive”, and FN means “false negative”.

4.3. PSO–SVM Model Construction

In this paper, a particle swarm optimizer was used for SVM hyperparameter optimization purposes. The initialization of the PSO parameters is important for faster optimization, and it was conducted as outlined in Table 2.

The step-by-step process of the PSO–SVM model is described as follows:

Step 1: Preprocess the raw vibration data to have them in usable form.

Step 2: Extract time-domain features and split the data into training and testing datasets. Perform cross-validation to generate multiple feature subsets.

Step 3: Initialize the PSO parameters as in Table 2.

Step 4: Generate the initial position and velocity of the swarm particle using Equations (10) and (11).

Step 5: For the first iteration t = 1, calculate the fitness function using Equation (12) with a 10-fold cross-validated training dataset. A high fitness value denotes low error in classification problems.

Step 6: Update the velocity and position of each particle. Terminate the process when t = tmax. Select the best value of γ and C. Train the model on the basis of this value.

Step 7: Test the model with test datasets. Find the test accuracy of the model along with the classification report.

The flowchart of the PSO–SVM model is shown in Figure 4.

4.4. Confusion Matrix

A confusion matrix is a table capable of describing the performance of a machine learning model for classification purposes for a particular set of data where true values are known. It can be used for both binary and multiclass classification purposes, as shown in Figure 5.

4.4.1. Accuracy

Accuracy is one of the essential performance parameters for classification algorithms. It can be defined as the ratio of the number of correctly predicted data to all predictions made. It may be calculated using the following formula:

Accuracy = \frac{TP + TN}{TP + TN + FP + FN} .

(13)

4.4.2. Precision

Precision may be defined as the ratio of actually predicted true values to the total number of values predicted as true.

Precision = \frac{TP}{TP + FP} .

(14)

4.4.3. Recall

Recall may be defined as the ratio of true positive values to the sum of true positive and false negative values. It calculates how many of the actual positives the model captures by labeling them as true positives. It can be calculated using the following formula:

Recall = \frac{TP}{TP + FN} .

(15)

4.4.4. F1 Score

When a comparison is made between two models where they have high recall value but low precision or vice versa, it becomes difficult to evaluate which one is best. In this case, a third parameter, namely, the F1 score, makes them comparable.

F 1 score = 2 * \frac{Precision * Recall}{Precision + Recall} .

(16)

5. Results and Discussion

5.1. Experimental Data

Some sample experimental data are shown in Figure 6 and Figure 7. Figure 6 shows the frequency spectrum of a healthy bearing, and the faulty condition is shown in Figure 7. For the healthy bearing, the maximum amplitude was found to be

0.0035 m / s^{2}

, whereas, for the faulty bearing, the amplitude went up to

0.006 m / s^{2}

. Fast Fourier transform (FFT) plays a vital role in identifying faults in machinery. However, it loses its efficiency when multiple faults occur at the same frequency.

5.2. Data Preprocessing

Data preprocessing involves filtering, handling missing values, feature extraction, signal processing, data normalization, data scaling, and selecting the optimum feature subset. Figure 8 illustrates the correlation matrix of various features extracted from raw time-domain signals. Here, it can be observed that the RMS and standard deviation were closely related to each other. The crest factor and skewness also showed a good correlation. The mean and RMS offered very low correlation, with close to no relation. The optimum parameters were selected on the basis of this correlation matrix, which was essential to create optimum decision boundaries.

5.3. Model Performance

Figure 9 and Figure 10 show the confusion matrices obtained from the proposed method for training and testing data. A description of data labels and supportive information is given in Table 1. In total, 230 feature data were segmented into two sets, 155 data points for the training set and 75 data points for the testing set. The horizontal axes were assigned as predicted classes, and vertical axes were used as true classes. Figure 9a shows the confusion matrix in terms of sample data of the training set, and Figure 9b expresses the classification results in terms of percentage. It is observed that, out of 10 classes, six classes of faults (IR-10, IR-20, IR-30, OR-10, OR-20, N-20) were identified perfectly without any misclassifications. In the case of cage faults (CF-10 and CF-30), very few misclassifications were observed. Figure 9 shows the confusion matrix of the testing set. Training accuracy also showed promising characteristics by classifying three faults perfectly (IR-10, IR-20, and OR-10) without any misclassification, as shown in Figure 10a. From Figure 10b, it can be noted the maximum number of misclassifications (total 15) was found for CF-20.

The classification report is shown in Table 3. Such a classification report can be used to assess the accuracy of a classification algorithm’s prediction performance. The precision value indicates the ratio of the actual predicted true value to the total predicted true value. When the value is close to 1.0, the model is described as a highly precise model, whereas, with low precision, its value becomes close to 0.0. The proposed model showed higher precision for inner race faults with both 10 and 20 rotational speeds. Furthermore, the outer race fault with a 10 Hz frequency was precisely identified. Similar data were also obtained for recall value.

A performance comparison of the proposed model with existing models applied to the bearing datasets is shown in Table 4. It can be observed from the results that the SVM with grid search CV produced 92% testing accuracy, whereas an almost 2% improvement in accuracy was obtained using SVM with PSO, which is far better than other models. The decision tree and KNN attained good accuracy; however, their range of accuracy was 84–85%. The linear discriminant analysis (LDA) result was not satisfactory, as its accuracy was only 73.7%. Due to considerable robustness to noisy training data and less vulnerability to overfitting issues, SVM was superior to other ML algorithms.

6. Conclusions

In this paper, a data-driven intelligent fault diagnosis technique was developed for early fault detection of the deep groove ball bearing. An experimental setup was fabricated considering all the design criteria to produce data similar to real-life industry data. The experimental data were analyzed to separate the faulty bearing from the healthy one as a function of the fault magnitude. During experimentation, data were collected in a silent condition to avoid external noise, and the base was properly mounted to avoid any self-vibrations. The time wave data were captured in image format. These image data were converted into numeric data using the default computational tool of the vibration analyzer. The raw data were then passed through various signal conditioning processes, i.e., data filtering, missing data handling, data scaling, etc. Time-domain features were then extracted, and optimum features were selected on the basis of the relations in the correlation matrix. The correlation matrix showed that the RMS and standard deviation had a greater correlation than other pairs with an absolute value of 1. The relation between crest factor and skewness also looked promising with a correlation value of 0.83. The feature matrix was then fed into the PSO–SVM model. The classification accuracy obtained by PSO-based SVM was 93.9%, almost 2% greater than the accuracy obtained by SVM using the traditional grid search CV method. This improved performance of the proposed hybridized method was achieved due to the extreme robustness to noisy training data and less vulnerability to overfitting issues of the support vector machine algorithm. To validate the efficiency of the PSO–SVM algorithm, the model’s performance was compared with KNN, DT, and LDA using the same bearing dataset. The comparative analysis shows that the proposed model outperformed all other traditional algorithms by a large margin, with almost 10–20% more accuracy. Although this work obtained significant accuracy in bearing fault classification, there is still an opportunity to improve its performance. The application of deep learning or transfer learning methods represents an option. Due to manufacturing difficulty, the ball fault was not considered in this research; thus, it may be included in future work. The experimental setup developed for data generation of bearing faults can also be used for gear pitting fault data preparation purposes. Therefore, there is also scope to work on gear fault classification in future research.

Author Contributions

All authors of this research contributed significantly to the work submitted. Conceptualization, M.E.H., H.B. and D.K.S.; experimentation and methodology, D.K.S.; data analysis, M.E.H., H.B. and D.K.S.; original draft writing and preparation, D.K.S.; review and editing, M.E.H. and H.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by Rajshahi University of Engineering & Technology under Grant No. DRE-401 and in part by the National Natural Science Foundation of China under Grant No. 62003166.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data are available in the paper. However, any additional data are available to the readers upon reasonable request to the corresponding authors.

Acknowledgments

The authors would like to acknowledge the infrastructural support provided by the Department of Mechanical Engineering, Rajshahi University of Engineering & Technology, Bangladesh.

Conflicts of Interest

The authors declare no conflict of interest.

References

Prainetr, S.; Tunyasrirut, S.; Wangnipparnto, S. Testing and Analysis Fault of Induction Motor for Case Study Misalignment Installation Using Current Signal with Energy Coefficient. World Electr. Veh. J. 2021, 12, 37. [Google Scholar] [CrossRef]
Wang, H.; Wang, P.; Song, L.; Ren, B.; Cui, L. A Novel Feature Enhancement Method Based on Improved Constraint Model of Online Dictionary Learning. IEEE Access 2019, 7, 17599–17607. [Google Scholar] [CrossRef]
Mohanty, A.R. Machinery Condition Monitoring: Principle and Practices, 1st ed.; CRC Press: Boca Raton, FL, USA, 2015. [Google Scholar]
Wang, Y.; Xiang, J.; Markert, R.; Liang, M. Spectral kurtosis for fault detection, diagnosis and prognostics of rotating machines: A review with applications. Mech. Syst. Signal Process. 2016, 66, 679–698. [Google Scholar] [CrossRef]
Lei, Y.; Yang, B.; Jiang, X.; Jia, F.; Li, N.; Nandi, A.K. Applications of machine learning to machine fault diagnosis: A review and roadmap. Mech. Syst. Signal Process. 2020, 138, 106587. [Google Scholar] [CrossRef]
Mehala, N. Condition Monitoring and Fault Diagnosis of Induction Motor Using Motor Current Signature Analysis. Electr. Eng. 2010, 2, 175. [Google Scholar]
Du, K.; Li, X.; Tao, M.; Wang, S. Experimental study on acoustic emission (AE) characteristics and crack classification during rock fracture in several basic lab tests. Int. J. Rock Mech. Min. Sci. 2020, 133, 104411. [Google Scholar] [CrossRef]
Leão, L.; Cavalini, A.; Morais, T.; Melo, G.; Steffen, V. Fault detection in rotating machinery by using the modal state observer approach. J. Sound Vib. 2019, 458, 123–142. [Google Scholar] [CrossRef]
Zarei, J.; Tajeddini, M.A.; Karimi, H.R. Vibration analysis for bearing fault detection and classification using an intelligent filter. Mechatronics 2014, 24, 151–157. [Google Scholar] [CrossRef]
Marins, M.A.; Ribeiro, F.M.; Netto, S.L.; da Silva, E.A. Improved similarity-based modeling for the classification of rotating-machine failures. J. Frankl. Inst. 2018, 355, 1913–1930. [Google Scholar] [CrossRef]
Hao, Y.; Song, L.; Ren, B.; Wang, H.; Cui, L. Step-by-Step Compound Faults Diagnosis Method for Equipment Based on Majorization-Minimization and Constraint SCA. IEEE/ASME Trans. Mechatron. 2019, 24, 2477–2487. [Google Scholar] [CrossRef]
Tahi, M.; Miloudi, A.; Dron, J.; Bouzouane, B. Decision tree and feature selection by using genetic wrapper for fault diagnosis of rotating machinery. Aust. J. Mech. Eng. 2018, 18, 496–504. [Google Scholar] [CrossRef]
Inturi, V.; Sabareesh, G.; Supradeepan, K.; Penumakala, P. Integrated condition monitoring scheme for bearing fault diagnosis of a wind turbine gearbox. J. Vib. Control. 2019, 25, 1852–1865. [Google Scholar] [CrossRef]
Zhao, X.; Jia, M. Fault diagnosis of rolling bearing based on feature reduction with global-local margin Fisher analysis. Neurocomputing 2018, 315, 447–464. [Google Scholar] [CrossRef]
Gowid, S.; Dixon, R.; Ghani, S. A novel robust automated FFT-based segmentation and features selection algorithm for acoustic emission condition based monitoring systems. Appl. Acoust. 2015, 88, 66–74. [Google Scholar] [CrossRef]
Rauber, T.W.; Boldt, F.D.A.; Varejão, F. Heterogeneous Feature Models and Feature Selection Applied to Bearing Fault Diagnosis. IEEE Trans. Ind. Electron. 2014, 62, 637–646. [Google Scholar] [CrossRef]
Huang, D.; Yang, J.; Zhou, D.; Litak, G. Novel Adaptive Search Method for Bearing Fault Frequency Using Stochastic Resonance Quantified by Amplitude-Domain Index. IEEE Trans. Instrum. Meas. 2019, 69, 109–121. [Google Scholar] [CrossRef]
Hoang, D.T.; Kang, H.J. A Motor Current Signal-Based Bearing Fault Diagnosis Using Deep Learning and Information Fusion. IEEE Trans. Instrum. Meas. 2019, 69, 3325–3333. [Google Scholar] [CrossRef]
Van, M.; Hee-Jun, K. Bearing Defect Classification based on Individual Wavelet Local Fisher Discriminant Analysis with Particle Swarm Optimization. IEEE Trans. Ind. Inform. 2015, 12, 124–135. [Google Scholar] [CrossRef]
Radovic, M.D.; Ghalwash, M.F.; Filipovic, N.; Obradovic, Z. Minimum redundancy maximum relevance feature selection approach for temporal gene expression data. BMC Bioinform. 2017, 18, 1–14. [Google Scholar] [CrossRef] [Green Version]
He, J.; Song, C.; Luo, Q.; Lan, L.; Yang, C.; Gui, W. Noise-robust self-adaptive support vector machine for residual oxygen concentration measurement. IEEE Trans. Instrum. Meas. 2020, 69, 8474–8485. [Google Scholar] [CrossRef]
Chen, D.; Tian, Y.; Liu, X. Structural nonparallel support vector machine for pattern recognition. Pattern Recognit. 2016, 60, 296–305. [Google Scholar] [CrossRef]
Smola, A.J.; Schölkopf, B.; Mueller, K.-R. The connection between regularization operators and support vector kernels. Neural Netw. 1998, 11, 637–649. [Google Scholar] [CrossRef] [Green Version]
Aljarah, I.; Faris, H.; Mirjalili, S.; Al-Madi, N. Training radial basis function networks using biogeography-based optimizer. Neural Comput. Appl. 2016, 29, 529–553. [Google Scholar] [CrossRef]
Huang, C.-L.; Wang, C.-J. A GA-based feature selection and parameters optimizationfor support vector machines. Expert Syst. Appl. 2006, 31, 231–240. [Google Scholar] [CrossRef]
Lee, D.-H.; Ahn, J.-H.; Koh, B.-H. Fault Detection of Bearing Systems through EEMD and Optimization Algorithm. Sensors 2017, 17, 2477. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Van, M.; Hoang, D.T.; Kang, H.J. Bearing Fault Diagnosis Using a Particle Swarm Optimization-Least Squares Wavelet Support Vector Machine Classifier. Sensors 2020, 20, 3422. [Google Scholar] [CrossRef]
Hu, J.; Deng, S. Rolling bearing fault diagnosis based on wireless sensor network data fusion. Comput. Commun. 2021, 181, 404–411. [Google Scholar] [CrossRef]
Luwei, K.C.; Yunusa-Kaltungo, A.; Sha’Aban, Y.A. Integrated Fault Detection Framework for Classifying Rotating Machine Faults Using Frequency Domain Data Fusion and Artificial Neural Networks. Machines 2018, 6, 59. [Google Scholar] [CrossRef] [Green Version]
Wei, Y.; Wu, D.; Terpenny, J. Robust Incipient Fault Detection of Complex Systems Using Data Fusion. IEEE Trans. Instrum. Meas. 2020, 69, 9526–9534. [Google Scholar] [CrossRef]
Yunusa-Kaltungo, A.; Cao, R. Towards Developing an Automated Faults. Energies 2020, 13, 20. [Google Scholar]
Banerjee, T.P.; Das, S. Multi-sensor data fusion using support vector machine for motor fault detection. Inf. Sci. 2012, 217, 96–107. [Google Scholar] [CrossRef]
Cao, R.; Yunusa-Kaltungo, A. An Automated Data Fusion-Based Gear Faults Classification Framework in Rotating Machines. Sensors 2021, 21, 2957. [Google Scholar] [CrossRef] [PubMed]
Morales-Perez, C.; Rangel-Magdaleno, J.; Peregrina-Barreto, H.; Amezquita-Sanchez, J.P.; Valtierra-Rodriguez, M. Incipient Broken Rotor Bar Detection in Induction Motors Using Vibration Signals and the Orthogonal Matching Pursuit Algorithm. IEEE Trans. Instrum. Meas. 2018, 67, 2058–2068. [Google Scholar] [CrossRef]
Saha, D.K.; Ahmed, S.; Shaurov, S. Different Machine Maintenance Techniques of Rotary Machine and Their Future Scopes: A Review. In Proceedings of the 2019 4th International Conference on Electrical Information and Communication Technology (EICT), Khulna, Bangladesh, 20–22 December 2019; pp. 1–6. [Google Scholar]
Jiang, Q.; Chang, F. A novel rolling-element bearing faults classification method combines lower-order moment spectra and support vector machine. J. Mech. Sci. Technol. 2019, 33, 1535–1543. [Google Scholar] [CrossRef]
Poli, R.; Kennedy, J.; Blackwell, T. Particle swarm optimization. Swarm Intell. 2007, 1, 33–57. [Google Scholar] [CrossRef]

Figure 1. Photograph of the experimental setup.

Figure 2. Step-by-step approach from data generation to fault diagnosis.

Figure 3. Typical bearing faults.

Figure 4. Flow diagram of PSO–SVM algorithm.

Figure 5. Confusion matrix.

Figure 6. Frequency spectrum of a healthy bearing.

Figure 7. Frequency spectrum of a faulty bearing.

Figure 8. Correlation matrix of extracted time−domain features.

Figure 9. Confusion matrix of training data: (a) in terms of number of training data points; (b) in terms of percentage (%).

Figure 10. Confusion matrix of testing data: (a) in terms of number of testing data points; (b) in terms of percentage (%).

Table 1. Data generation plan.

SN	Bearing Conditions	Parameter	Motor Speed (Hz)
01	Normal condition	Acceleration	20
02	Outer-race fault	Acceleration	10, 20, 30
03	Inner-race fault	Acceleration	10, 20, 30
04	Cage fault	Acceleration	10, 20, 30

Table 2. Initialization parameters of PSO–SVM model.

SN	Bearing Conditions	Symbol	Initial Value
01	Population size	p	25
02	Inertia weight	w*	1
03	Acceleration coefficient 1	c₁	1
04	Acceleration coefficient 2	c₂	2
05	Maximum iterations	tmax	150
06	Cross-validation number	k	10
07	Search range of error penalty	C	0.1 to 100
08	Search range of kernel parameter	γ	0.001 to 6

Table 3. Classification report of PSO-SVM classifier.

SN	Fault Types	Precision	Recall	F1-Score
01	Cage_Fault_10	0.9358974359	0.9733333333	0.9542483660
02	Cage_Fault_20	0.8955223881	0.8000000000	0.8450704225
03	Cage_Fault_30	0.8461538462	0.8800000000	0.8627450980
04	Inner_Race_10	1.0000000000	1.0000000000	1.0000000000
05	Inner_Race_20	1.0000000000	1.0000000000	1.0000000000
06	Inner_Race_30	0.9600000000	0.9600000000	0.9600000000
07	Normal_20	0.9740259740	1.0000000000	0.9868421053
08	Outer_Race_10	1.0000000000	0.9866666667	0.9932885906
09	Outer_Race_20	0.8266666667	0.8266666667	0.8266666667
10	Outer_Race_30	0.9473684211	0.9600000000	0.9536423841
	Accuracy	0.9386666667
	Macro average	0.9385634732	0.9386666667	0.9382503633
	Weighted average	0.9385634732	0.9386666667	0.9382503633

Table 4. Comparative analysis of the proposed model with other algorithms.

SN	Name of Machine Learning Model	Classification Accuracy (Testing) in %
01	K-nearest neighbor (KNN)	84.3
02	Decision tree (DT)	85.3
03	Linear discriminant analysis (LDA)	73.7
04	SVM with grid search CV	92
05	SVM with PSO (proposed model)	93.9

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Saha, D.K.; Hoque, M.E.; Badihi, H. Development of Intelligent Fault Diagnosis Technique of Rotary Machine Element Bearing: A Machine Learning Approach. Sensors 2022, 22, 1073. https://doi.org/10.3390/s22031073

AMA Style

Saha DK, Hoque ME, Badihi H. Development of Intelligent Fault Diagnosis Technique of Rotary Machine Element Bearing: A Machine Learning Approach. Sensors. 2022; 22(3):1073. https://doi.org/10.3390/s22031073

Chicago/Turabian Style

Saha, Dip Kumar, Md. Emdadul Hoque, and Hamed Badihi. 2022. "Development of Intelligent Fault Diagnosis Technique of Rotary Machine Element Bearing: A Machine Learning Approach" Sensors 22, no. 3: 1073. https://doi.org/10.3390/s22031073

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Development of Intelligent Fault Diagnosis Technique of Rotary Machine Element Bearing: A Machine Learning Approach

Abstract

1. Introduction

2. Experimental Setup

3. Methodology

4. Machine Learning Approach for Fault Detection

4.1. Support Vector Machine (SVM)

4.2. Particle Swarm Optimization (PSO)

4.3. PSO–SVM Model Construction

4.4. Confusion Matrix

4.4.1. Accuracy

4.4.2. Precision

4.4.3. Recall

4.4.4. F1 Score

5. Results and Discussion

5.1. Experimental Data

5.2. Data Preprocessing

5.3. Model Performance

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI