Next Article in Journal
Adaptive Truncation Threshold Determination for Multimode Fiber Single-Pixel Imaging
Previous Article in Journal
Blockchain-Based Model for Incentivized Cyber Threat Intelligence Sharing
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Fault Diagnosis of Planetary Gear Train Crack Based on DC-DRSN

College of Mechanical Engineering, Chongqing University of Technology, Chongqing 400054, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(16), 6873; https://doi.org/10.3390/app14166873
Submission received: 3 July 2024 / Revised: 31 July 2024 / Accepted: 3 August 2024 / Published: 6 August 2024

Abstract

:
To solve the problem that the existing planetary gear train fault diagnosis methods have, namely their low diagnostic accuracy under low signal-to-noise ratio (SNR), a fault diagnosis method based on a double channel-deep residual shrinkage network (DC-DRSN) is proposed. The short-time Fourier transform (STFT) is used to convert the original vibration signal into a two-dimensional time-frequency graph, which effectively enhances the ability to express information. A DC-DRSN model is constructed, and the optimal number of residual shrinkage modules is determined by combining the diagnostic characteristics with different noises, which effectively improves the accuracy and anti-noise ability of fault diagnosis. The results of bearing and planetary gear train crack fault diagnosis show that the diagnosis method based on DC-DRSN has higher diagnostic accuracy while realizing fault diagnosis, which is better than other deep learning diagnosis methods. At the same time, the method can adapt to fault diagnosis in different noise environments, and has good expression ability and generalization ability.

1. Introduction

The safe development of nuclear power is important. As one of the key components of a nuclear power unit, the main role of the planetary gearbox is to transmit the rotational power of the motor to the impeller of the pump. The gears in the nuclear power planetary gearbox are prone to cracks, pitting corrosion, and other faults [1]. Moreover, current research on the fault diagnosis and health monitoring of the planetary gearbox is relatively weak. Therefore, it is of great significance to establish a fault diagnosis method for the planetary gear train of the nuclear power planetary gearbox.
The fault diagnosis methods are divided into three types [2]. (1) The mechanistic model-based approach. This method provides an in-depth system understanding with good epitaxy and reliability, but the mechanistic model needs sufficient knowledge and data support. With the increasing complexity of the system structure, some mechanism knowledge is difficult to obtain. Even if the mechanistic model is constructed, it is not possible to obtain all the information of the physical system. Especially when the system is highly coupled and has complex nonlinearities, the accuracy of the obtained mechanistic model is limited, which limits the application of this method. (2) The knowledge-based approach. This method is based on the integration of expert experience and prior knowledge, to explore the relationship between system faults, phenomena as well as causes, and to build a system diagnosis model according to certain rules. It does not require precise mathematical expressions and is suitable for many complex systems that are difficult to model. However, this method is poorly adaptable. An important prerequisite for obtaining accurate results is to have a knowledge base with a large amount of rich empirical knowledge. For unknown faults without any prior knowledge, not only there will be misdiagnoses, but also it is impossible to give accurate diagnostic information in the event of a fault. (3) The data-driven approach. This method aims to model and predict the physical system by processing and analyzing the input and output data of the physical system, mining the hidden rules and patterns in the data. The main advantage of this method is that it does not require any knowledge of the internal mechanism, and it can use a large amount of actual data to improve the accuracy and generalization ability of the model, which is outstanding in the absence of accurate mathematical/physical models. The vibration signal contains the essential information about the operating status of the monitored object. Therefore, it is a commonly used signal in data-driven diagnostic methods. The empirical mode decomposition [3] and wavelet transform [4] have been proven to be effective diagnostic methods. Shi [5] proposed an adaptive optimization algorithm for VMD parameters based on a differential search, combined with the relevant kurtosis index, to achieve the adaptive extraction of bearing fault features. The effectiveness of the method was verified by experiments. Chen et al. [6] proposed a fault diagnosis method for rolling bearings based on singular spectrum decomposition and independent component analysis, which solved the problem that the early failure of bearings was overwhelmed by noise due to low energy. Mauricio et al. [7] proposed a multi-band envelope extraction method based on cyclic bivariate mapping, which solved the problem of the difficulty in detecting faulty bearing noise signals using single-pass filtering. Wang et al. [8] proposed a fault diagnosis method based on parameter optimization variational mode decomposition and multi-domain manifold learning to solve the problem of difficult feature extraction of planetary gearbox features, and verified that the multi-domain feature extraction method is superior to the single-domain feature extraction methods in the time domain and frequency domain. Xu et al. [9] proposed a mixed-domain fault diagnosis method for chaotic particle swarm with improved cross-mutation, which effectively improved the accuracy of bearing fault classification based on a mixed eigendomain. Wang et al. [10] proposed a fault diagnosis method based on the combination of integrated empirical mode decomposition and a support vector machine for diesel engine vibration signal analysis and feature extraction, and verified that the method can accurately and effectively identify multiple sources of fault information of a gearbox in different states through comparative experiments. However, traditional machine learning methods are limited to shallow feature representation and struggle to deal with complex nonlinear relationships, which leads to low accuracy in fault diagnosis. In recent years, with the development of artificial intelligence and big data technologies, the methods such as deep learning as well as transfer learning have become widely used in the field of fault diagnosis [11,12,13]. Chen et al. [14] proposed an adaptive neural network model based on instantaneous speed information to realize the fault detection of a planetary gearbox under different working conditions. Zhang et al. [15] proposed a diagnostic method of a parallel multi-scale pooled branch convolutional neural network, which solved the fault diagnosis problem of mechanical vibration signals under multiple working conditions with less computational cost. However, with the increase in network layers and training data, problems such as network degradation and overfitting have become increasingly prominent, which greatly affects the accuracy of fault diagnosis. Therefore, He et al. [16] proposed to use a “shortcut connection” deep residual network (ResNet) to solve the degradation phenomenon and training problem of the deep network, which optimized the convergence speed and achieved the improvement of accuracy by solving the degradation problem during deep network processing. Zhao et al. [17] proposed a deep residual shrinkage network (DRSN) based on the residual network, which was used to improve the feature learning ability of strong noise vibration signals. Zhang et al. [18] combined variational mode Gaussian distortion with DRSN to improve the fault diagnosis performance. Cao [19] proposed a state-of-health identification method for rotating machinery based on Gramian angle field (GAF) and DRSN, which was used to identify faults in rotating machinery under the condition of nonlinear transformation speed and load. Sun et al. [20] converted the one-dimensional logging parameters into two-dimensional images by using GAF, input them into a DRSN model, and experimentally proved that the diagnostic accuracy of the proposed method was better than that of the traditional DRSN and convolutional neural network (CNN). Pei et al. [21] proposed a DRSN method based on characteristic noise energy ratio (CNER), which showed that the average recognition rate could reach 98.70% in the environment with 10 dB SNR. Yu et al. [22] proposed the Pareto-optimal adaptive loss residual shrinkage network (PALRSN), which effectively solves the problem of mechanical fault diagnosis when the fault sample data are insufficient.
In summary, the existing fault diagnosis methods have achieved certain results in low-noise environments, and the diagnostic accuracy is high. However, in strong-noise environments, the diagnostic performance and accuracy of the models need to be improved. Considering that the gearbox of a nuclear power unit works in the environment of strong noise and low SNR, a fault diagnosis method based on DC-DRSN is proposed to improve the fault diagnosis performance of the model. The main contributions of this paper are as follows: (1) The structure of one-dimensional and two-dimensional parallel residual shrinkage networks is constructed to enhance the feature extraction capability. (2) According to the diagnostic performance under different noise levels, the number of optimal residual shrinkage modules is determined. (3) Combined with the public dataset and the experimental dataset, the effectiveness and robustness of the proposed method are verified.

2. Theory and Method

2.1. Image Conversion Method

The visual representation of the vibration signal helps identify the failure mode of the gear. Commonly used image conversion methods for vibration signals include short-time Fourier transform (STFT), wavelet transform (WT), Gramian angular summation field (GASF), generalized S-transform (GST), and so on.
STFT is a common method in time-domain and frequency-domain analysis. In this method, the signal is divided into short periods, and the Fourier transform is applied to each short period to obtain the spectral information of the signal at different times and frequencies. STFT is able to capture the instantaneous frequency change of the signal in time, and the instantaneous frequency change can reflect abnormal patterns in the frequency domain, which helps identify the fault state of the system. Assuming that the vibration signal is x( τ ), the STFT is calculated as [23]:
S T F T ( t , ω ) = x ( τ ) w ( τ t ) e j ω t d τ
where w ( τ t ) is the window function; e j ω τ is the complex exponential term that describes the components of the frequency domain, ω is the frequency; S T F T ( t , ω ) is the result of the short-time Fourier transform at moments and frequencies.
In this paper, to reduce spectrum leakage and maintain computational efficiency, a Hanning window with 256 length and 50% overlap is used.
WT provides a time-frequency representation of a signal by convoluting it with a set of translational and scaled wavelet functions. This method inherits the localization idea of STFT and overcomes the defect that the size of the time window is unchanged, which makes the wavelet transform more advantageous in capturing the local information of the signal. The principle formula of WT can be described as [24]:
w ( a , b ) = f ( t ) ψ [ ( t b ) / a ] d t
where w ( a , b ) , f ( t ) and ψ ( t ) denote the wavelet coefficient, original signal, and wavelet basis function, respectively; a and b represent the scale factor and the translation factor, respectively.
In this paper, to extract enough feature information from original the vibration signal and accurately distinguish the subtle changes of features, the Morlet wavelet (cmor3-3) is used as the mother wavelet function with a total scale of 256.
GASF is the mapping of time series data to a polar coordinate system and the construction of images by calculating cosine similarity between angles. By adjusting the size and resolution of the image, the features of the signal can be extracted at different scales, and the results are as follows [25]:
x ˜ i = ( x i max ( X ) + ( x i min ( X ) ) max ( X ) min ( X )
ϕ i = arccos ( x ˜ i )
r ( i ) = x i n
G A S F = [ cos ( ϕ 1 + ϕ 1 ) cos ( ϕ 1 + ϕ n ) cos ( ϕ n + ϕ 1 ) cos ( ϕ n + ϕ n ) ]
where x ˜ i X ˜ , then encode the x ˜ i value of X ˜ as the angular cosine, time as the radius r ( i ) , and ϕ i as the angular cosine value in polar coordinates, and for the time series in the interval [−1, 1], the value range is limited to the interval [0, π] by using the inverse trigonometric function under the polar mapping. In a polar coordinate system, the cosine and angle values for each polar coordinate are calculated.
In this paper, the data length of the GASF transformation is 1024, and the window size is 256.
GST decomposes a signal into a series of rectangular boxes on the time-frequency plane, and the values in each rectangular box represent the energy of the signal at the corresponding time and frequency, which is suitable for processing nonstationary signals, providing higher resolution and flexibility in time and frequency. According to the standard deviation of the generalized Gaussian window function σ m ( f ) = 1 / f m , the width of the Gaussian window can be controlled by adjusting the parameters, so as to optimize the time-frequency resolution of GST. For an arbitrary squared integrable signal x ( t ) , GST is defined as [26]:
S ( τ , f ) = x ( t ) w ( t τ , f ) e j 2 π f t d t
where τ is the time variable; f is the frequency variable; w ( t τ , f ) is the window function of GST, and e j 2 π f t is the complex exponential function that converts the signal from the time domain to the frequency domain.
In this paper, to balance the time and frequency resolution, and ensure the integrity and smoothness of signal features, a Gaussian window function with 1 window width, 256 scale and 50% overlap is selected.

2.2. Neural Network Model

DRSN is essentially a network integrated by ResNet with an attention mechanism and a soft threshold function. The core of ResNet is the residual block unit (RBU), as shown in Figure 1, which uses multiple parametric layers to learn the residual representation between the input and output. The input feature x is first stored in the variable, and then the residual part F(x) is learned through a series of nonlinear layers, and finally the residual is calculated. Through cross-layer identity mapping, RBU transforms the learning objective from the mapping between input and output to learning residuals, which reduces the training difficulty and accelerates the network convergence speed. It is expressed by the formula [16]:
H ( x ) = F ( x ) + x
The residual shrinkage block unit (RSBU) is the core of the DRSN network, and the structure is shown in Figure 2, which introduces the attention mechanism and soft thresholding on the basis of RBU. The attention mechanism uses the squeeze-and-excitation network (SENet) proposed by HU [27], which learns the weights of each channel through the small network “global pooling → fully connected layer → ReLu activation function → fully connected layer → Sigmoid function”, and applies it to the original input feature map, and weights the eigenvalues of each channel through multiplication operations to adjust the feature size of each channel. In this way, the attention of the network to important features is enhanced so as to improve the expressiveness, generalization ability, and robustness of the model. In this paper, the soft thresholding of different thresholds of channels is adopted, which can automatically calculate different scales and thresholds for each channel, so that the features of different channels are subject to different constraints in the sparsity process, allowing the network to process the features more flexibly. If αc is the threshold of the cth channel of the feature map, and a v e r a g e i , j | x i , j , c | is the average value of the absolute value of each channel of the input feature x, the threshold is calculated as follows [17]:
τ c = α c a v e r a g e i , j | x i , j , c |
Soft thresholding is the core step of signal denoising, which is combined with deep learning to decompose the input signal through the convolutional layer, then filter all the signals after decomposition within the threshold, and then reconstruct the filtered signal to achieve the purpose of denoising. The soft threshold function is as follows. After deriving the soft threshold function, the gradient is only 0 or 1, which avoids gradient explosion and gradient disappearance.
y = { x τ x > τ 0 τ x τ x + τ x < τ
where x, y, and τ are the input, output, and threshold values, respectively.

3. Fault Diagnosis Method Based on DC-DRSN and Verification

3.1. Fault Diagnosis Method Based on DC-DRSN

Although DRSN implements adaptive learning thresholds and achieves good results in fault diagnosis, it is often used to deal with the spatial features contained in two-dimensional images and it fails to fully explore the dynamic changes on one-dimensional time series. Therefore, a fault diagnosis method based on a double channel-deep residual shrinkage network (DC-DRSN) is proposed in this paper.
The fault diagnosis process based on DC-DRSN is shown in Figure 3.
Step 1: For the one-dimensional signal and two-dimensional time-frequency diagram with different levels of noise, the training samples, verification samples, and test samples are randomly selected.
Step 2: Construct the DC-DRSN model, initialize the network weights, and set parameters such as the number of iterations and the learning rate.
The network structure of DC-DRSN is shown in Figure 3, which is composed of a two-channel neural network. The upper channel uses a two-dimensional deep residual contraction network, and its input is a two-dimensional time-frequency graph after the transformation of the original vibration signal to reflect the weak local characteristics of the signal at different angles and improve the feature learning ability of the signal. The lower channel uses a one-dimensional residual contraction network, which takes the original vibration signal as the input, retains the timing and correlation of the signal, and concisely and effectively expresses the global dynamic time domain characteristics. On this basis, the upper and lower channels are fused in the fusion layer, and the extracted time-frequency features are classified and identified by the SoftMax activation function.
The DRSN hyperparameters are shown in Table 1. They consist of a convolutional layer, a batch normalization layer, and a max pooling layer.
Step 3: The one-dimensional signal and the two-dimensional time-frequency graph training samples are input to the upper and lower channels for model training at the same time.
Step 4: The model parameters are continuously optimized through training samples and validation samples until the network accuracy is satisfied and the number of iterations is reached to obtain a trained diagnostic model.
Step 5: Input test samples to the model and obtain diagnostic results through confusion matrix and T-SNE visualization.

3.2. Method Validation

3.2.1. Introduction of Dataset

At present, there is no authoritative report on the public dataset of planetary gearboxes, so the bearing public dataset of Case Western Reserve University (CWRU) in the United States is used to verify the fault diagnosis method proposed in this paper.
The rolling bearing test bench of CWRU is shown in Figure 4, which is mainly composed of a motor, a torque sensor, a power tester, and an electronic controller. During the test, the motor speed is 1720 r/min~1797 r/min. The test bearings are 6205-2RS JEM SKF deep groove ball bearings with single-point damage machined by EDM, which is made in Sweden. The load of the bearings has 1 hp and 3 hp loads two types, and the sampling frequency of the acceleration sensor is 48 kHz. The bearing conditions involved in the dataset include four types: normal, inner ring damage, outer ring damage, and rolling element damage. Each type is available in 0.007” (0.1778 mm), and 0.021” (0.5334 mm) damage diameters.
Table 2 shows the dataset used in this paper, which contains nine fault states. The number of samples in each category is 900, and the number of data points in a single sample is 1024, for a total of 8100 samples. The dataset is randomly divided into training sets, validation sets and test sets in the ratio of 3:1:1. Thus, there are 540 training sets, 180 validation sets and 180 test sets for each fault type. The Fourier time-frequency diagram of size 112 × 112 is input in the upper channel, so the structure of the training set, the verification set and the test set of each fault type are 540 × 112 × 112, 180 × 112 × 112 and 180 × 112 × 112, respectively. While the structure of the training set, the verification set and the test set of each fault type in the lower channel are 540 × 1024, 180 × 1024 and 180 × 1024, respectively. In order to accelerate the convergence of the model and improve the stability and robustness of the model, all samples are normalized and preprocessed.
In this paper, TensorFlow and Keras frameworks are used to build the DC-DRSN network, with a learning rate of 0.001, a batch size of 64, and a training times of 150, in which the convolutional layer adopts the normal distribution initialization method and L2 regularization proposed in Ref. [10]. Table 3 shows the images obtained by using STFT, WT, GASF, and GST to convert the vibration signals in the nine different states of the common dataset. It can be seen that the time-frequency diagram converted by GASF and GST cannot clearly show the characteristics of the signal. The time-frequency diagram of STFT is clearer than that of WT and can clearly separate and analyze different frequency components. Therefore, in this paper, STFT is selected for time-frequency analysis of the fault signal to be diagnosed.

3.2.2. Experimental Results and Discussion

(1)
Comparison of different numbers of residual shrinkage modules
The signal-to-noise ratio (SNR )is a measure of the comparison between the intensity of a signal and noise and is usually used to evaluate the relative strength of a signal and noise. The formula of SNR is expressed as [28]:
S N R = 10 log 10 ( P s i g n a l P n o i s e ) ( d B )
where Psignal is the original signal power and Pnoise is the power that contains noise in the signal. The higher the SNR value is, the stronger the signal relative to the noise is, which is easier to identify and analyze.
In order to test the noise immunity of a different number of RSBUs, the number of RSBUs is adjusted without changing the preprocessing and other layer hyperparameters. Five different SNRs are added to the original vibration signal to compare the effect of the number of RSBUs on diagnostic performance. Figure 5 shows the average of the network after 150 iterations of training and five tests.
When the number of RSBUs is one or two, the network depth is insufficient, which limits the expression and learning ability of the model, resulting in the performance of the model not reaching the optimum. When the number of RSBUs is four, the number of model parameters is too large, resulting in network degradation and decreased diagnostic capabilities. When the number of RSBUs is three, the model has the highest diagnostic accuracy, and the accuracy of the test set reaches 99.56% when the signal-to-noise ratio is 4 dB. In order to better simulate the noise in actual operation, taking the signal-to-noise ratio equal to 4 dB as an example, the accuracy curve of the model is shown in Figure 6. The convergence curves of training and testing have achieved a good fitting effect, and the visual classification of the test set in the confusion matrix and T-SNE is shown in Figure 7 and Figure 8. It can be clearly found that the same type of fault has a good aggregation and different types of features can be effectively distinguished. The proposed method can be used to reshape and classify them in the high-dimensional space.
(2)
Comparison of different image conversion methods
STFT, WT, GASF and GST are used to convert the two-dimensional image of the vibration signal, and the converted image is used as the input of the upper channel of the DC-DRSN model. The accuracy of fault diagnosis when a 4 dB signal-to-noise ratio noise is added to the original signal is shown in Table 4. It can be seen that the GST-DCDRSN method has the lowest fault diagnosis accuracy of 98.31%. This is due to the high-resolution characteristics of GST and data redundancy, which allows the model to process irrelevant details that distract from key features. The STFT transform can accurately locate the time point of fault occurrence and track the dynamic changes in the system, with a fault diagnosis accuracy of up to 99.56%, which is 1.25% higher than that of the GST transform. At the same time, it is also proved that the 2D graph transformed by STFT can be used as the input of the channel on the DC-DRSN, which achieves higher diagnostic accuracy.
(3)
Comparison of DC-DRSN with other deep learning models
The proposed method, DC-DRSN, is compared with DRSN, VGG-16, GoogleNet, and Alexnet-18, where 4 dB noise is added to the original vibration signal, and the time-frequency image is input into each model to test the fault diagnosis performance of each model. The results are shown in Table 5. With the increase in model complexity, the diagnostic effect of DRSN is better than that of VGG-16, GoogleNet, and Alexnet-18. However, STFT-DCDRSN allows the network to process image and one-dimensional signal data separately, which allows each channel to focus on extracting its own features. So STFT-DCDRSN has the best diagnostic effect, with a diagnostic accuracy of 99.56%, which is 0.49% better than DRSN.

4. Fault Diagnosis of Planetary Gear Train Crack Based on DC-DRSN

4.1. Introduction of Test Bench

Combined with the actual structure size of the nuclear power planetary gearbox, a proportionally reduced nuclear power planetary gearbox test bench is designed, as shown in Figure 9. The test bench is mainly composed of the control system, transmission device, power system, loading system, and lubrication system. The transmission device transmits power from the vertical motor to the horizontal load motor and realizes the effective connection between them. The lubrication system lubricates the transmission and filters, as well as cools the oil. The control system is responsible for changing the input speed and load of the equipment. During the test, the BK sensor and its supporting Pulse system were used for data acquisition, and the acquisition card used BK’s Type3053-B-120 made in Denmark. Combined with the test requirements, the transmission route of the planetary gear train is analyzed [29], and in order to ensure the quality of vibration signal acquisition, the sensor is installed on the box wall closest to the planetary gear.

4.2. Crack Failure Test of Planetary Gear Train

In this study, the fault tests of planetary gear train cracks under different working conditions are carried out. During the test, the control system controls the input motor speed, when the input motor speed is 750 r/min, the voltage of the electronic control system is 380 V, and the frequency is 50 Hz. Under different loads, the operating state and vibration signal characteristics of the planetary gear train are different, and the vibration amplitude and frequency components of the gear train meshing under high loads are more complex. In order to avoid aliasing the characteristic signals under different loads and improve the applicability and robustness of the diagnostic system, two load conditions of 50% medium load and 100% heavy load are set by the load motor. A series of tests are carried out for normal, planetary gear cracks and sun gear cracks under each working condition, and the test environment is shown in Figure 10. The crack is located at the root of the gear, and the crack depth is about 1 mm, as shown in Figure 11. In the process of data acquisition, the sampling frequency is 25.6 kHz, and the continuous sampling time is 1 min each time. The vibration data samples in five states of the planetary gear train are finally obtained, which are labeled as 0, 1, 2, 3, and 4, respectively.
Figure 12 shows the original vibration signal of a planetary gear system randomly selected for five states and their image (112 × 112 pixels) after STFT. Table 6 shows the composition of the planetary gear train crack fault dataset obtained in this paper. A sliding window containing 1024 data points is used to sample the vibration signals of the planetary gear train in five states, with a window step size of 300, and each state contains 900 samples. Similarly, the training, validation, and test sets are randomly divided in the ratio of 3:1:1, and the final dataset contains 2700 training set samples, 900 validation sets, and 900 test set samples each. The structure of the training set, validation set, and test set of the upper and lower channels, as well as the data preprocessing steps and hyperparameter settings, are consistent with the preceding ones, and will not be repeated here.

4.3. Test Results and Analysis

(1)
Accuracy at different levels of noise
In order to simulate various noise interferences in the actual operation of the nuclear power gearbox and test the diagnostic accuracy of the model under different levels of noise, five kinds of noise with different SNR are set, and the accuracy of the test set under different SNRs and different RSBUs is shown in Figure 13. When RSBUs is equal to three, the fault diagnosis accuracy is the highest, which is consistent with the previous results. Therefore, RSBUs equal to three is determined as the fixed parameter of DC-DRSN. It is not difficult to see that with the increase in the SNR, the accuracy of diagnosis also increases, and the diagnostic accuracy of the test set is still as high as 97.22% when the SNR is 4 dB. In view of the fact that the SNR in gearbox fault diagnosis studies is often between 0 and 10 dB to simulate the noise level in the actual industrial environment [30], and 4 dB is in the middle of this range, which is a reasonable and representative value, and is helpful to test the robustness and reliability of the fault diagnosis method, therefore, under the SNR 4 dB, DC-DRSN is used to train and classify the nuclear power planetary gearbox dataset, and the accuracy changes in the obtained training set and the validation set are shown in Figure 14. It can be seen that when the number of iterations is less than 50, the accuracy of the training set and the validation set increases but there is oscillation, and the diagnostic performance of the model is improving but the stability needs to be improved. When the number of iterations is more than 50, the training set gradually stabilizes, and the accuracy is close to 100%, and the oscillation of the validation set is greatly reduced compared with before, and the accuracy reaches 99%. After about 150 iterations, the model has good diagnostic performance and stable accuracy.
The confusion matrix and the T-SNE visualization representation of the classification results are shown in Figure 15 and Figure 16, respectively. It is not difficult to find that label 2 has the best sample classification effect, with an accuracy rate of 100%. Label 0 and label 4 have six samples misclassified with each other, five samples from label 3 are classified into label 1, and eight samples from label 1 are misclassified. The results show that the change of gear crack type and load has a great influence on the classification accuracy, and most of the samples are correctly classified. Although there are a few sample errors, the minimum accuracy is still 95.56%, which verifies the effectiveness of DC-DRSN classification for different working conditions. The T-SNE visualization also shows that the distribution interval of each type of fault is regular and orderly, and the spacing of similar faults is very small, which further verifies the superiority of the proposed method.
(2)
Comparison of different image conversion methods
The collected planetary gearbox vibration signals are converted into two-dimensional time-frequency maps of 112 × 112 pixels by STFT, WT, GASF, and GST, and then the time-frequency maps are input into the upper channel of the DC-DRSN model for diagnosis. When the SNR is equal to 4 dB, the fault diagnosis accuracy of each method is shown in Table 7. It can be seen that the GST image conversion method has the lowest diagnostic accuracy of 95.74%, while the STFT is as high as 97.22%. This further verifies that the STFT conversion image is used as the input of the DC-DRSN upper channel, and the time and frequency information of the signal are retained through local analysis of the signal in time and frequency. Therefore, the network can better capture the transient changes, periodicity, and frequency changes of the signal, so as to improve the accuracy of model diagnosis.
(3)
Comparison of DC-DRSN with other deep learning models
Similarly, the proposed method DC-DRSN is compared with DRSN, VGG-16, GoogleNet, and Alexnet-18. The time-frequency diagram after short-time Fourier conversion is input into each model, and the vibration signal is input into the lower channel of DC-DRSN, the number of iterations is set to 150, and the diagnostic accuracy of each model is shown in Table 8 when the SNR is 4 dB. It can be seen that VGG-16 has the lowest classification and recognition accuracy of the test set, which is only 95.49%. Comparing the two classical networks, GoogleNet and Alexnet-18, it can be seen that the complexity of the model increases, and the accuracy of the test set is also improved accordingly. In comparison, the recognition accuracy of DC-DRSN is as high as 97.22% under the same situation, which is at least 0.98% higher than that of other deep neural networks. This further verifies that DC-DRSN has effective noise cancellation ability and strong feature extraction ability, so it has a higher fault diagnosis accuracy.

5. Conclusions

In order to solve the problems of weak feature extraction ability and low fault diagnosis accuracy of existing planetary gearboxes in noisy environments, a fault diagnosis method based on DC-DRSN is proposed to realize the fault monitoring of planetary gear train gears. With the help of the bearing public dataset of CWRU, the proposed method is compared with the traditional DRSN, and the diagnostic accuracy of the proposed method is higher than that of the traditional DRSN. On this basis, the method is applied to the proportionally reduced planetary gear train crack fault test data of a nuclear power gearbox, and the planetary gear train crack fault diagnosis is realized, with an accuracy of 97.22%. The results of this paper are as follows:
(1)
A one-dimensional and two-dimensional parallel residual contraction network structure is constructed, which not only retains the temporal sequence and correlation of one-dimensional signals but also includes the information expression ability contained in two-dimensional images. In this way, more feature information is extracted, and the diagnostic performance of the model is improved.
(2)
Combined with the diagnostic characteristics under different noises, the number of residual shrinkage modules is reasonably determined. Thanks to the soft thresholding in the module, each sample has its own unique set of thresholds, which can be adapted to fault diagnosis in different noise environments.
(3)
The accuracy of the proposed method in diagnosing crack faults under different loads of planetary gear train gearboxes is tested. The results show that only a few samples are misclassified, which verifies that the proposed method has good expression and generalization ability.

Author Contributions

Y.L. conceived this research; L.L. performed the experiments and data processing, and wrote the original draft preparation; editing and review on this article was completed by Y.L.; Y.L. is responsible for overseeing the progress and visualization of the article. All authors have read and agreed to the published version of the manuscript.

Funding

This research was founded by the National Natural Science Foundation of China (No.52075062), Chongqing Municipal Science and Technology Bureau Innovation and Development Joint Fund Project (CSTB2022NSCQ-LZX0024), special key project of technological innovation and application development in Chongqing (cstc2020jscx-dxwtBX0048) and Graduate Innovation Project of Chongqing University of Technology (No. gzlcx20243005).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Publicly available bearing datasets were analyzed in this study. This data can be found here: https://engineering.case.edu/bearingdatacenter/download-data-file. The raw data of gear supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Barszcz, T.; Randall, R.B. Application of spectral kurtosis for detection of a tooth crack in the planetary gear of a wind turbine. Mech. Syst. Signal Process. 2009, 23, 1352–1365. [Google Scholar] [CrossRef]
  2. Frank, P.M. Fault diagnosis in dynamic systems using analytical and knowledge based redundancy. Automatica 1990, 26, 459–474. [Google Scholar] [CrossRef]
  3. Li, H.; Liu, T.; Wu, X.; Li, S. Research on test bench bearing fault diagnosis of improved EEMD based on improved adaptive resonance technology. Measurement 2021, 185, 109986. [Google Scholar] [CrossRef]
  4. Teng, W.; Ding, X.; Cheng, H.; Han, C.; Liu, Y.; Mu, H. Compound faults diagnosis and analysis for a wind turbine gearbox via a novel vibration model and empirical wavelet transform. Renew. Energy 2019, 136, 393–402. [Google Scholar] [CrossRef]
  5. Shi, W.J.; Huang, X.; Wen, G.R.; Zhang, Z.F. Rolling Bearing Fault Diagnosis Based on DS-VMD and Correlated Kurtosis. J. Vib. Meas. Diagn. 2021, 41, 133–141. [Google Scholar]
  6. Chen, S.; Zhang, H.; Zhen, D.; Shi, Z.Q. The Fault Diagnosis of Rolling Bearing Based on SSD and ICA Noise Reduction. Modul. Mach. Tool Autom. Manuf. Tech. 2021, 8, 94–98. [Google Scholar]
  7. Mauricio, A.; Gryllias, K. Cyclostationary-based multiband envelope spectra extraction for bearing diagnostics: The combined improved envelope spectrum. Mech. Syst. Signal Process. 2021, 149, 107150. [Google Scholar] [CrossRef]
  8. Wang, Z.Y.; Yao, L.G.; Qi, X.L.; Zhang, J.; Zheng, J.D. Fault diagnosis of planetary gearbox based on parameter optimized VMD and multi-domain manifold learning. J. Vib. Shock 2021, 40, 110–118+126. [Google Scholar] [CrossRef]
  9. Xu, T.; Ji, J.; Kong, X.; Zou, F.; Wang, W. Bearing fault diagnosis in the mixed domain based on crossover-mutation chaotic particle swarm. Complexity 2021, 2021, 6632187. [Google Scholar] [CrossRef]
  10. Wang, A.; Li, Y.; Du, X.; Zhong, C. Diesel engine gearbox fault diagnosis based on multi-features extracted from vibration signals. IFAC-Pap. 2021, 54, 33–38. [Google Scholar] [CrossRef]
  11. Chen, C.; Shen, F.; Xu, J.; Yan, R. Probabilistic Latent Semantic Analysis-Based Gear Fault Diagnosis under Variable Working Conditions. IEEE Trans. Instrum. Meas. 2020, 69, 2845–2857. [Google Scholar] [CrossRef]
  12. Wang, M.; Ma, X.; Hu, Y.; Wang, Y. Gear Fault Diagnosis Based on Variational Modal Decomposition and Wide+Narrow Visual Field Neural Networks. IEEE Trans. Autom. Sci. Eng. 2022, 19, 3288–3299. [Google Scholar] [CrossRef]
  13. Zeng, Q.; Feng, G.; Shao, Y.; Gu, F.; Ball, A.D. Planetary Gear Fault Diagnosis Based on An Instantaneous Angular Speed Measurement System with a Dual Detector Setup. IEEE Access 2020, 8, 66228–66242. [Google Scholar] [CrossRef]
  14. Chen, P.; Li, Y.; Wang, K.; Zuo, M.J. An automatic speed adaption neural network model for planetary gearbox fault diagnosis. Measurement 2021, 171, 108784. [Google Scholar] [CrossRef]
  15. Zhang, Y.; Cheng, G.; He, L. Convolutional Neural networks based on parallel multi-scale pooling branch: A transfer diagnosis method for mechanical vibrational signal with less computational cost. Measurement 2022, 192, 110905. [Google Scholar] [CrossRef]
  16. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  17. Zhao, M.; Zhong, S.; Fu, X.; Tang, B.; Pecht, M. Deep Residual Shrinkage Networks for Fault Diagnosis. IEEE Trans. Ind. Inform. 2020, 16, 4681–4690. [Google Scholar] [CrossRef]
  18. Zhang, Z.; Zhang, C.; Li, H. Highly Imbalanced Fault Diagnosis of Rolling Bearings Based on Variational Mode Gaussian Distortion and Deep Residual Shrinkage Networks. IEEE Trans. Instrum. Meas. 2023, 72, 3527911. [Google Scholar] [CrossRef]
  19. Cao, X.; Xu, X.; Duan, Y.; Yang, X. Health Status Recognition of Rotating Machinery Based on Deep Residual Shrinkage Network under Time-Varying Conditions. IEEE Sens. J. 2022, 22, 18332–18348. [Google Scholar] [CrossRef]
  20. Sun, Y.; Pang, S.; Zhang, J.; Zhang, Y. DRSN-GAF: Deep Residual Shrinkage Network (DRSN) for Lithology Classification through Well Logging Data Transformed by Gram Angle Field. IEEE Geosci. Remote Sens. Lett. 2024, 21, 7500905. [Google Scholar] [CrossRef]
  21. Pei, X.; Dong, S.; Tang, B.; Pan, X. Bearing Running State Recognition Method Based on Feature-to-Noise Energy Ratio and Improved Deep Residual Shrinkage Network. IEEE/ASME Trans. Mechatron. 2022, 27, 3660–3671. [Google Scholar] [CrossRef]
  22. Yu, Y.; Guo, L.; Gao, H.; Liu, Y.; Feng, T. Pareto-Optimal Adaptive Loss Residual Shrinkage Network for Imbalanced Fault Diagnostics of Machines. IEEE Trans. Ind. Inform. 2022, 18, 2233–2243. [Google Scholar] [CrossRef]
  23. Griffin, D.; Lim, J. Signal estimation from modified short-time Fourier transform. IEEE Trans. Acoust. Speech Signal Process. 1984, 32, 236–243. [Google Scholar] [CrossRef]
  24. Burrus, C.S.; Gopinath, R.A.; Guo, H. Wavelets and Wavelet Transforms; Rice University: Houston, TX, USA, 1998. [Google Scholar]
  25. Jones, S.H.; Thornicroft, G.; Coffey, M.; Dunn, G. A brief mental health outcome scale: Reliability and validity of the Global Assessment of Functioning (GAF). Br. J. Psychiatry 1995, 166, 654–659. [Google Scholar] [CrossRef] [PubMed]
  26. Adams, M.D.; Kossentini, F.; Ward, R.K. Generalized S transform. IEEE Trans. Signal Process. 2002, 50, 2831–2842. [Google Scholar] [CrossRef]
  27. Jin, X.; Xie, Y.; Wei, X.-S.; Zhao, B.-R.; Chen, Z.-M.; Tan, X. Delving deep into spatial pooling for squeeze-and-excitation networks. Pattern Recognit. 2022, 121, 108159. [Google Scholar] [CrossRef]
  28. Tandra, R.; Sahai, A. SNR walls for signal detection. IEEE J. Sel. Top. Signal Process. 2008, 2, 4–17. [Google Scholar] [CrossRef]
  29. Lei, Y.G.; Tang, W.; Kong, D.T.; Lin, J. Vibration Signal Simulation and Fault Diagnosis of Planetary Gearboxes Based on Transmission Mechanism Analysis. J. Mech. Eng. 2014, 50, 61–68. [Google Scholar] [CrossRef]
  30. Yin, S.; Li, X.; Gao, H.; Kaynak, O. Data-based techniques focused on modern industry: An overview. IEEE Trans. Ind. Electron. 2014, 62, 657–667. [Google Scholar] [CrossRef]
Figure 1. The residual module of the deep residual network.
Figure 1. The residual module of the deep residual network.
Applsci 14 06873 g001
Figure 2. The RSBU with different thresholds of channels.
Figure 2. The RSBU with different thresholds of channels.
Applsci 14 06873 g002
Figure 3. Fault diagnosis process based on DC-DRSN.
Figure 3. Fault diagnosis process based on DC-DRSN.
Applsci 14 06873 g003
Figure 4. The rolling bearing test bench (CWRU). 1. Fan end bearings; 2. Motor; 3. Drive end bearing; 4. Torque sensors and encoders; 5. Dynamometers.
Figure 4. The rolling bearing test bench (CWRU). 1. Fan end bearings; 2. Motor; 3. Drive end bearing; 4. Torque sensors and encoders; 5. Dynamometers.
Applsci 14 06873 g004
Figure 5. The accuracy of test sets at different SNR and RSBUs.
Figure 5. The accuracy of test sets at different SNR and RSBUs.
Applsci 14 06873 g005
Figure 6. Training process of CWRU bearing datasets.
Figure 6. Training process of CWRU bearing datasets.
Applsci 14 06873 g006
Figure 7. Classification of confusion matrices.
Figure 7. Classification of confusion matrices.
Applsci 14 06873 g007
Figure 8. T-SNE visualization.
Figure 8. T-SNE visualization.
Applsci 14 06873 g008
Figure 9. Scaled-down nuclear planetary gearbox test bench. 1. Input motor; 2. Planetary gearboxes; 3. Power distribution pump; 4. Lubrication system oil pumps; 5. Lubrication system motors.
Figure 9. Scaled-down nuclear planetary gearbox test bench. 1. Input motor; 2. Planetary gearboxes; 3. Power distribution pump; 4. Lubrication system oil pumps; 5. Lubrication system motors.
Applsci 14 06873 g009
Figure 10. Test environment of planetary gear train crack.
Figure 10. Test environment of planetary gear train crack.
Applsci 14 06873 g010
Figure 11. Faulty gear.
Figure 11. Faulty gear.
Applsci 14 06873 g011
Figure 12. The original vibration signals and STFT transformed images for different labels.
Figure 12. The original vibration signals and STFT transformed images for different labels.
Applsci 14 06873 g012
Figure 13. The accuracy of test sets at different SNR and different RSBUs.
Figure 13. The accuracy of test sets at different SNR and different RSBUs.
Applsci 14 06873 g013
Figure 14. DB-DRSN training process with SNR 4 dB.
Figure 14. DB-DRSN training process with SNR 4 dB.
Applsci 14 06873 g014
Figure 15. Confusion matrix categorization of the experiment.
Figure 15. Confusion matrix categorization of the experiment.
Applsci 14 06873 g015
Figure 16. T-SNE visualization of the experiment.
Figure 16. T-SNE visualization of the experiment.
Applsci 14 06873 g016
Table 1. DRSN hyperparameter settings.
Table 1. DRSN hyperparameter settings.
No. of LayerLayerHyper-Parameters
Conv1_1Convolution layer1_132 convolution kernels with the size of [3,3].
Stride: [1,1]. Padding: same.
BNBatch Normalization layerThe scale is 32.
Conv2_1Convolution layer2_132 convolution kernels with the size of [3,3].
Stride: [1,1]. Padding: same.
BNBatch Normalization layerThe scale is 32.
Average PoolingAverage Pooling layerPooling size: [1,1]. Stride: [2,2]. Padding: [0,0,0,0].
Table 2. Bearing public datasets.
Table 2. Bearing public datasets.
Fault LocationFault
Diameter
/mm
Load
/hp
LabelsDataset (Training/
Validation/Test)
Normal030540/180/180
Inner Race0.177811540/180/180
0.177832540/180/180
Ball0.177813540/180/180
0.177834540/180/180
Outer race (Centered@6:00)0.177815540/180/180
0.177836540/180/180
Outer race (Orthogonal@3:00)0.177817540/180/180
0.177838540/180/180
Table 3. Images of vibration signal conversion of the bearing datasets.
Table 3. Images of vibration signal conversion of the bearing datasets.
LabelVibration SignalImage Conversion Method
STFTWTGASFGST
0Applsci 14 06873 i001Applsci 14 06873 i002Applsci 14 06873 i003Applsci 14 06873 i004Applsci 14 06873 i005
1Applsci 14 06873 i006Applsci 14 06873 i007Applsci 14 06873 i008Applsci 14 06873 i009Applsci 14 06873 i010
2Applsci 14 06873 i011Applsci 14 06873 i012Applsci 14 06873 i013Applsci 14 06873 i014Applsci 14 06873 i015
3Applsci 14 06873 i016Applsci 14 06873 i017Applsci 14 06873 i018Applsci 14 06873 i019Applsci 14 06873 i020
4Applsci 14 06873 i021Applsci 14 06873 i022Applsci 14 06873 i023Applsci 14 06873 i024Applsci 14 06873 i025
5Applsci 14 06873 i026Applsci 14 06873 i027Applsci 14 06873 i028Applsci 14 06873 i029Applsci 14 06873 i030
6Applsci 14 06873 i031Applsci 14 06873 i032Applsci 14 06873 i033Applsci 14 06873 i034Applsci 14 06873 i035
7Applsci 14 06873 i036Applsci 14 06873 i037Applsci 14 06873 i038Applsci 14 06873 i039Applsci 14 06873 i040
8Applsci 14 06873 i041Applsci 14 06873 i042Applsci 14 06873 i043Applsci 14 06873 i044Applsci 14 06873 i045
Table 4. The accuracy of the bearing test set under different image conversion methods.
Table 4. The accuracy of the bearing test set under different image conversion methods.
MethodsTesting Accuracy
STFT-DCDRSN99.56%
WT-DCDRSN99.06%
GASF-DCDRSN98.67%
GST-DCDRSN98.31%
Table 5. The accuracy of the bearing test set under different deep learning models.
Table 5. The accuracy of the bearing test set under different deep learning models.
MethodsTesting Accuracy
DC-DRSN99.56%
DRSN99.07%
VGG-1698.88%
GoogleNet98.94%
Alexnet-1899.06%
Table 6. Types of nuclear gearbox failures.
Table 6. Types of nuclear gearbox failures.
Fault LocationStatusSpeed
(r/min)
LoadLabelsDataset (Training/
Validation/Test)
NormalNormal750100%0540/180/180
Planet gearCracks750
750
50%
100%
1
2
540/180/180
540/180/180
Sun gearCracks750
750
50%
100%
3
4
540/180/180
540/180/180
Table 7. The accuracy of the gear test set under different image conversion methods.
Table 7. The accuracy of the gear test set under different image conversion methods.
MethodsTesting Accuracy
STFT-DCDRSN97.22%
WT-DCDRSN96.87%
GASF-DCDRSN96.46%
GST-DCDRSN95.74%
Table 8. The accuracy of the gear test set under different deep learning models.
Table 8. The accuracy of the gear test set under different deep learning models.
ModelsTesting Accuracy
DC-DRSN
DRSN
97.22%
96.67%
VGG-1695.49%
GoogleNet
Alexnet-18
96.10%
96.24%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Luo, L.; Liu, Y. Fault Diagnosis of Planetary Gear Train Crack Based on DC-DRSN. Appl. Sci. 2024, 14, 6873. https://doi.org/10.3390/app14166873

AMA Style

Luo L, Liu Y. Fault Diagnosis of Planetary Gear Train Crack Based on DC-DRSN. Applied Sciences. 2024; 14(16):6873. https://doi.org/10.3390/app14166873

Chicago/Turabian Style

Luo, Le, and Yu Liu. 2024. "Fault Diagnosis of Planetary Gear Train Crack Based on DC-DRSN" Applied Sciences 14, no. 16: 6873. https://doi.org/10.3390/app14166873

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop