Next Article in Journal
A Federated Reinforcement Learning Framework via a Committee Mechanism for Resource Management in 5G Networks
Previous Article in Journal
An Improved Vital Signal Extraction Method Based on Laser Doppler Effect
Previous Article in Special Issue
Multi-Sensory Tool Holder for Process Force Monitoring and Chatter Detection in Milling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Intelligent Fault Diagnosis of Planetary Gearbox Across Conditions Based on Subdomain Distribution Adversarial Adaptation

1
School of Mechanical Engineering, University of Science and Technology Beijing, Beijing 100083, China
2
China North Vehicle Research Institute, Beijing 100072, China
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(21), 7017; https://doi.org/10.3390/s24217017
Submission received: 19 July 2024 / Revised: 18 September 2024 / Accepted: 29 September 2024 / Published: 31 October 2024
(This article belongs to the Special Issue Emerging Sensing Technologies for Machine Health State Awareness)

Abstract

:
Sensory data are the basis for the intelligent health state awareness of planetary gearboxes, which are the critical components of electromechanical systems. Despite the advantages of intelligent diagnostic techniques for detecting intricate fault patterns and improving diagnostic speed, challenges still persist, which include the limited availability of fault data, the lack of labeling information and the discrepancies in features across different signals. Targeting this issue, a subdomain distribution adversarial adaptation diagnosis method (SDAA) is proposed for faults diagnosis of planetary gearboxes across different conditions. Firstly, nonstationary vibration signals are converted into a two-dimensional time–frequency representation to extract intrinsic information and avoid frequency overlapping. Secondly, an adversarial training mechanism is designed to evaluate subclass feature distribution differences between the source and target domain. A conditional distribution adaptation is employed to account for correlations among data from different subclasses. Finally, the proposed method is validated through experiments on planetary gearboxes, and the results demonstrate that SDAA can effectively diagnose faults under crossing conditions with an accuracy of 96.7% in diagnosing gear faults and 95.2% in diagnosing planet bearing faults. It outperforms other methods in both accuracy and model robustness. This confirms that this approach can refine domain-invariant information for transfer learning with less information loss from the sub-class level of fault data instead of the overall class level.

1. Introduction

Planetary gearboxes are vital components of electromechanical systems, e.g., vehicles and wind turbines. A malfunction of the gearbox can result in system breakdowns and potentially serious safety incidents. Thus, accurately diagnosing faults in planetary gearboxes is crucial for maintaining the safe and reliable operation of this equipment [1,2,3].
In decades, researchers have extensively studied the fault diagnosis of planetary gearboxes. For example, Zhao et al. [4] developed a gear mesh stiffness model that integrates friction and fractal contact theory, aiming to analyze the relationship between friction factors and fractal parameters. This model provides insights into how these factors interact and influence gear performance. Xiang et al. [5] introduced a lateral torsion dynamics model tailored for multi-stage gear systems. This model reveals the mechanisms behind spalling and the progression of faults within these systems. Their comprehensive study also explores the mapping relationship between various gear fault modes and their dynamic characteristics, providing a significant reference point for future fault diagnosis research. This exploration helps in understanding how different fault modes impact the dynamic behavior of gear systems, thereby aiding in the development of diagnostic techniques. Feng et al. [6,7] developed AM-FM analysis models for vibration signals at gear meshing points. They derive the characteristic frequency formula for gear faults on the basis of Fourier transform and identify the spectral characteristics of typical faults. Jiang et al. [8] proposed a novel decomposition scheme for variational mode decomposition. This method resolved the problem of uncertain decomposition modes and balance parameter selection. Traditional fault diagnosis methods require the assistance of expertise knowledge for feature extraction, which limits their applications greatly. To overcome these problems, Zhao et al. [9] introduced a novel fault diagnosis method for rolling bearing. This method focuses on reducing the dimensionality of the features. It can capture more intrinsic fault information, which allows for a more precise identification of faults by isolating the most relevant information from potentially noisy or irrelevant data. Yu et al. [10] developed an innovative data-driven multigranulation model for fault diagnosis of planetary gearboxes. By leveraging multiple granular perspectives, this model allows for a detailed and nuanced analysis. To minimize dependence on labeled data, Wu et al. [11] proposed a hybrid classification model based on an autoencoder. The model can be trained unsupervised with unlabeled data. Zhang et al. [12] presented an innovative adaptive loss-weighted meta-residual network specifically designed to address the challenge of label noise interference in fault diagnosis. This advanced network dynamically adjusts the weights assigned to faulty data during the training process, allowing the model to better distinguish between accurate and noisy labels. Yin et al. [13] embedded cosine loss optimization into an LSTM network for gear fault diagnosis in wind turbines, enhancing the diagnostic accuracy over traditional LSTM networks. Deep learning-based diagnostic methods construct end-to-end models from raw data to equipment health status, automatically extracting useful features from complex data to accurately identify equipment health. These methods reduce dependence on expert knowledge and provide robust technical support for intelligent fault diagnosis.
Traditional data-driven methods for intelligent fault diagnosis often operate under the assumption that training and test data (source-domain and target-domain data) are independently and identically distributed. However, in real application, the operation of a planetary gearbox exhibits time-varying characteristics under varying speeds or loads. These fluctuations result in an uneven distribution of monitoring data, potentially indicating different health states even when the actual health condition remains the same. This cause discrepancies between source-domain and target-domain data, thus requiring diagnostic models to be highly adaptable and capable of accurately interpreting and analyzing signals that may change in unpredictable ways, while overlooking the discrepancies could undermine the performance of the proposed method. This complexity underscores the need for transfer learning techniques.
Transfer learning facilitates the application of knowledge gained from one domain (source-domain) to improve diagnostic performance in a different, but related, domain (target-domain) [14]. The source domain consists of labeled training data D S = x i S , y i S , which provides necessary information for initial model training. In contrast, the target domain contains unlabeled monitoring data D T = x j T . The model must analyze and interpret without predefined labels. To bridge the gap between domains, domain adaptation has been proposed as a crucial technique within transfer learning. Li et al. [15] and Zhu et al. [16] proposed a range of metric learning algorithms to evaluate and quantify the differences of features between source and target domains. These approaches focused on minimizing the distance between variations within the same class (intra-class variation) while maximizing the distance between variations of different classes (inter-class variation). This strategy significantly enhanced the robustness and reliability of diagnostic models by ensuring that the features extracted were more distinct and discriminative. Deng et al. [17] incorporated a range of attention mechanisms into adversarial networks, allowing the networks to focus on the most relevant features across different domains. This integration ensures that the learned features were applicable regardless of domain discrepancies. To make an alignment of data distribution across domains, joint distribution adaptation (JDA) [18,19,20] was proposed to evaluate the joint distribution discrepancy between source and target tasks for cross-domain monitoring data in fault diagnosis. Current methods enable the model to adapt the knowledge gained from the labeled source domain to the unlabeled target domain, thus can more effectively generalize across different operational conditions and datasets, leading to more robust and accurate fault detection.
With unlabeled data in the target domain, E. Tzeng et al. [21] proposed an adaptation layer and an additional domain confusion loss for convolutional neural networks to learn a representation that is both semantically meaningful and domain invariant. Ganin et al. [22] proposed a domain adaptation method by augmenting a network with few standard layers and a simple new gradient reversal layer. The approach can effectively learn deep features that are invariant with respect to the feature shift between the domains. Yu et al. [23] proposed a dynamic adversarial adaptation network for transfer learning. These methods effectively learn features with unlabeled data. However, they only align overall distributions of domain data by adversarial training or metrics like maximum mean discrepancy (MMD), and fail to capture significant variabilities of individual class of fault data. By only focusing on the overall distribution, as illustrated in Figure 1a, traditional domain adaptation methods run the risk of neglecting fault-specific information that is critical for accurate diagnosis. Certain features that are highly relevant to identifying a specific type of fault might be averaged out when aligning the global data distributions. This lack of focus on subclass-level variations can reduce the model’s ability to precisely distinguish between different fault types and lead to suboptimal performance in fault diagnosis tasks.
In real-world applications, different types of planetary gearbox faults can exhibit distinct characteristics, leading to variations in their data distributions. As shown in Figure 1b, the subclass fault data of a planetary gearbox does not follow a single, unified distribution but instead adheres to different data distributions for each fault type. While traditional domain adaptation approaches can transfer general knowledge from related domains, they do not effectively account for the nuances of subdomain-specific data distributions. This limitation highlights the need for methods that not only align global distributions but also consider the intra-domain variations and the conditional distributions of different fault types, ensuring that the model retains the fault-specific information necessary for accurate and reliable diagnosis. To tackle the aforementioned challenges, this paper introduces a subdomain distribution adversarial adaptation approach (SDAA) for identifying faults of planetary gearboxes under cross conditions. This method evaluates the subclass data discrepancy between the source domain and target domain. By assessing feature differences between various subclass faults, SDAA seeks to align the feature distribution spaces between source and target tasks more closely. It individually aligns each type of fault data before making joint adjustments. The shared structures or patterns are identified for transfer from the source domain to the target domain. This method ensures that the subclass-level alignment is more accurate by considering the relationships between features within each fault subclass. From a network structure perspective, the SDAA method employs deep residual networks to construct a dual-stream feature extraction architecture. This architecture simultaneously learns feature information from both source and target tasks. Local difference adversarial evaluation is used to measure feature differences among relevant subclass faults in different tasks, thereby mining subclass fault feature information and reducing model bias. Furthermore, the method uncovers the correlation of transferrable information by assessing conditional distribution differences between source and target subclass fault data. It identifies and leverages the relationships and similarities between different types of fault data. This detailed analysis allows the method to transfer relevant information more accurately from the source domain to the target domain. Through this process, the method ensures that the diagnostic model is well-equipped to handle variations and discrepancies in fault data across different operational conditions with the enhanced robustness and accuracy.
In real application, the signals of planetary gearbox tend to exhibit time-varying characteristics. Using the time-domain signal as the input of SDAA for fault diagnosis could overlook the intrinsic and dynamic characteristics of signal. To solve this problem, the proposed method uses time–frequency analysis as the input of deep learning. The short-time Fourier transform (STFT) is used to extract time–frequency features from vibration signals. It can characterize the non-stationary condition effectively. STFT captures the time-varying nature of frequency components and amplitudes, thereby mitigating the challenges of frequency overlapping and enhance the distinguishability of features. The time–frequency feature can provide more intrinsic information on health state. Thus, the SDAA adapts the framework of network with the time–frequency representation as the input.
The main contributions of this paper are summarized as follows.
(1)
A novel framework incorporating the time–frequency representation construction and subdomain distribution adversarial adaptation diagnosis method is proposed for the fault diagnosis of planetary gearboxes. It allows the model to generalize better across a wide range of operating conditions by aligning the subclass and overall class of data simultaneously, which enhances the fault diagnosis performance.
(2)
Time–frequency representation is adopted as the input of deep learning to provide intrinsic information of health state. The STFT is applied to extract time–frequency representation from vibration signals. It can reflect the variation of frequency components and amplitudes over time, which can characterize the non-stationary condition of planetary gearbox more confidently.
(3)
Local difference adversarial evaluation is used to discover the correlation of faults information by assessing conditional distribution differences between source and target subclass fault data. It can handle variations and discrepancies in fault data, thus revealing more in-depth information for transfer learning.
(4)
An experiment is designed to validate the proposed method. Data of planetary gear and bearing were collected on the test rig across different operational conditions for validation. The result demonstrates that the method is effective and superior to other methods for the fault diagnosis of planetary gearboxes.

2. Theoretical Background

2.1. Representation of Nonstationary Operating Condition

In practical industrial applications, the operation of planetary gear systems is subject to variations in load and speed, leading to changes in the frequency components and amplitudes of vibration signals. This dynamic, nonstationary behavior makes it challenging to extract confident features from monitoring signals for fault diagnosis. To address this issue, time–frequency analysis methods are used to extract informative features from joint domain of time and frequency. Among these methods, the short-time Fourier transform (STFT) stands out for its computational efficiency and its ability to prevent cross-interference among signal components [24,25]. STFT divides time-sequential signals into segments using a window function of fixed length. Each segment is then transformed using the fast Fourier transform (FFT) to obtain a localized Fourier spectrum, retaining both time and frequency information. This method is particularly effective for analyzing nonstationary signals. It provides a clear representation of how frequency constitution evolves over time. The mathematical expression is given by
x STFT ( t , ω ) = f ( t ) g ( t τ ) exp ( j ω t ) d t ,
where f(t) represents the time-series signal; g(tτ) is a window function concentrated at a time instance τ. The STFT is a Fourier transform that is associated with the signal f(t) and the window function g(tτ).

2.2. Feature Learning Based on Residual Algorithms

Features extracted through time–frequency methods are fed into deep convolutional networks to learn intrinsic time–frequency patterns. As the time–frequency representation is high-dimensional and the available data are often limited, they pose challenges to modeling for the fault diagnosis of planetary gearboxes [26]. While increasing the number of convolutional layers can theoretically enhance the ability of feature extraction, it also increases the risk of overfitting. This can result in an initial improvement in diagnostic performance, followed by a sharp decline, as illustrated in Figure 2. Additionally, deeper networks may face training issues, such as gradient vanishing or explosion. They affect stability and model performance.
ResNets are particularly suitable to extract deep features from small sample data of planetary gearbox due to its unique network architecture and its ability to address the vanishing gradient problem through residual connections [27]. Figure 3 illustrates the core structure of ResNets, namely the residual block. They allow deep networks to be effectively trained by learning identity mappings, ensuring that original input information is preserved and complex patterns can be captured. The inclusion of shortcut connections helps maintain the gradient flow, enabling deeper architectures without degradation. This improves the extraction of fine-grained fault features from non-stationary vibration signals of planetary gearbox under varying loads and speeds.
In ResNets, the final output, G(x) is obtained by applying the activation function to the sum of the input x and the residual mapping F(x). Convolutional kernels act as filters, converting time–frequency representations into feature matrices. ReLU activation is then applied for nonlinear transformation. To prevent redundant computations, pooling layers are used for down-sampling. This process refines the high-level features, enhancing the ability to diagnose faults in planetary gearbox systems. The mathematical operation within the residual block is expressed as
x j l = max ( 0 ,   i w i j l x i l 1 + b j l ) ,
x j l ( h max ) = max ( i 1 ) W + 1 h i W x j l ( h ) ,
where x denotes the value of a node in the convolutional layers; w represents the weight matrix; b refers to the bias; max( ) signifies the ReLU activation function; W denotes the pooling operation; and h encompasses all the feature values within the local range.

2.3. Domain Adversarial Training

Traditional adaptation methods typically rely on a fixed feature projection between the source and target domains, aiming to create a shared feature space for effective transfer learning. However, this approach often struggles to account for significant differences in feature distributions between two domains. To better handle this issues, domain adversarial training [22] is employed. This method encourages the model to generate features that are indistinguishable between the source and target domains, reducing discrepancies and improving cross-condition fault diagnosis. A domain discriminator T d distinguishes between source and target tasks, while a feature extractor T f works to confuse the discriminator by making it harder to differentiate between domain features. As shown in Figure 4, the parameters of the domain discriminator θ d are optimized by minimizing discriminator loss, while a gradient reversal layer (GRL) inverts the loss gradient to optimize the feature extractor’s parameters θ f by maximizing the discriminator loss. This adversarial training process continuously adjusts the model using the domain label loss Ld and integrates both the discriminator and label losses in the overall objective function.
L ( θ f , θ d ) = E x i ( D S D T ) [ L d ( T d ( T f ( x i , θ f ) , θ d ) , d i ) ] ,
where di denotes the domain label for sample xi, and D S and D T denote the source and target tasks, respectively.

2.4. Domain Confusion Training

In the fault diagnosis of planetary gearboxes, correlations between vibration signals of different fault categories exist. Ignoring these correlations can lead to migration bias during model training. Thus, this paper adopts domain confusion training to adaptively assess and minimizes the feature distribution distance between source and target tasks. This ensures that the model remains responsive to variations in the target domain and improves its generalization across different operating conditions. A distribution distance metric, maximum mean discrepancy (MMD) [21], is used to calculate adaptation loss between the source and target domains ( D S and D T ), as illustrated in Figure 5. MMD quantifies the difference of feature distributions, which is computed as
d ^ ( X S ,   X T ) = 1 n S i = 1 n S ϕ ( x i ) 1 m T j = 1 m T ϕ ( x j ) H 2 ,
where H denotes the Hilbert space used to calculate the distribution distance, ϕ · denotes the feature value of sample x, n S and m T denote the number of samples in the source and target tasks, respectively.

3. Subdomain Distribution Adversarial Adaptation Framework

3.1. Transfer Framework Design

The fault data of planetary gearboxes often involves missing label information. Without sufficient labels, the model struggles to accurately map feature information to fault categories. To address this issue, pseudo-labeling techniques is adopted. The method iteratively refines its feature by gradually aligning the feature distributions more closely. This technique can leverage the target domain data more effectively, even in the absence of true labels. Additionally, the complexity of measuring feature differences increases in multi-category fault diagnosis tasks. When there are significant differences in data distributions among subclass faults between source and target tasks, assessing the marginal distribution differences becomes critical. Conversely, when marginal distributions are similar, it becomes important to evaluate and extract fault correlation transfer information. As illustrated in Figure 6, a novel diagnostic framework incorporating subdomain distribution adversarial adaptation is developed to identify the health states of planetary gearboxes. The framework adopts STFT to convert time-series vibration signals into time–frequency matrices. The ResNet18 is employed to extract deep features from time–frequency representations of signal. The framework then transforms the extracted time–frequency features from both source and target tasks into fault-related feature vectors. These vectors are used in an adaptation layer to adversarially evaluate subclass fault feature differences, rather than bridging these differences by retraining. Furthermore, conditional distribution adaptation is utilized to evaluate subclass fault-related information across tasks, thereby improving the precision of fault-related transfer information and avoiding feature distribution confusion between different faults.

3.2. Design of Loss Function

The proposed framework processes vibration signal data through several iterative operations involving convolutional and pooling layers. These layers operate together to extract features from signals with a collection of matrix transformation, which can be mainly denoted by Equations (2) and (3), Then the softmax function can be connected to the last layer of network for fault identification with a probability distribution. It can be defined as follows:
p 1 ( x ) p 2 ( x ) p c ( x ) = exp ( x c ) i = 1 C exp ( x i ) ,
where p( ) denotes the probability distribution of category c in the multi-category diagnostic task, and C denotes the total number of fault categories. The cross-entropy loss function is utilized to optimize the network. By minimizing this loss between the predicted and actual labels of fault samples, the parameter of the network can be calculated for fault diagnosis. The function can be mathematically expressed as
L o s s x , y = 1 N i = 1 N c = 1 C y j c log y ^ j c ,
where y and y ^ indicates the true and predicted label of sample x, N denotes the number of samples.
The backpropagation algorithm is used to for model optimization. The model optimization in the source domain can be given by
arg min w i l L ( y i s , f ( x i s , w l s ) ) ,
where w is the weight parameter of the l-th layer.
A term of loss function that measures the differences between sub-class faults within the source and target tasks are designed. In the adaptation layer, the mean loss of the local domain is calculated as the domain adversarial loss Ladv and expressed as
L adv = 1 n s + m t c C x i D S D T L d c ( T d c ( p i c ( T f ( x i ) ) ) , d i ) ,
It quantifies the global correlation of transferred information from different tasks. In theory, the data distributions of identical faults across different tasks should be identical. To achieve this alignment, conditional distribution adaptation is utilized. Conditional maximum mean discrepancy (CMMD) [28] is employed to minimize this distribution distance across domains. It learns the shared structures or patterns that can be transferred from the source-domain data to the target-domain data of planet gearbox, thereby improving the feature alignment process. Features of different faults can be aligned more accurately in subclass level by considering the relationships between features within each fault in sub-class level. The conditional distribution adaptation loss LCMMD involves using Equation (5) to determine the degree of alignment between distributions for each fault category.
L CMMD ( X S , X T ) = 1 C c 1 n S c x i c D S n S c ϕ ( x i ) 1 m T c x j c D T m T c ϕ ( x j ) H 2 ,
The overall objective is designed in terms of Ladv and LCMMD for parameter optimization, and it is denoted as
arg min θ f , θ d ( L c D S ( y ^ ( x i c , θ f ) , y i c ) + λ L joi D S D T ( L CMMD ( x i c , θ f ) , L adv ( x i c , θ f , θ d ) ) ) ,
The detailed training process of the SDAA diagnosis method is listed in Table 1.

4. Experimental Validation and Result Analysis

4.1. Experimental Setup

A single-stage planetary gear system experimental platform, as shown in Figure 7a, was constructed to collect data for method validation. This test rig uses a PLX120 planetary gearbox, with a ratio of 4, to simulate wind turbine operations. The gearbox acts as a speed increaser. A three-phase induction motor provides power to drive the planetary carrier shaft. The output shaft of gearbox is linked to an AC permanent magnet generator, which functions as a load to dissipate electrical energy through a resistance box during operation. The specific parameters of the motor and the generator are given in Table 2. A rotary encoder and a torque sensor are positioned between the gearbox and the generator to measure rotational speed and torque, respectively. An accelerometer was positioned on the upper surface (acquisition point “a” in Figure 7a of the gearbox housing to accurately capture vibration signals for monitoring and fault diagnosis. The brand and model of accelerometer are PCB and 623C01, respectively. The frequency range is 0.8–15,000 Hz. A structural diagram of the PLX120 planetary gearbox is depicted in Figure 7b, with the detailed configuration of the gears and bearings listed in Table 3 and Table 4. Fault was simulated by chipping on the gear and bearing, and the details are depicted in Figure 8. The fault information is provided in Table 5.

4.2. Experimental Dataset

Signals were sampled by the accelerometer at 20,480 Hz for 60 s, then resampled to 8192 Hz to reduce high-frequency interference and computational load. The resampled vibration signal was divided into overlapping slices, creating subsamples with 4096 data points each, covering a 0.5 s time span (Figure 9). 394 samples were generated for each data category as the source and target domain datasets. All 394 source domain samples and 78 target domain samples were used for training, while the remaining target domain samples were reserved for testing.
The first dataset used for method validation comprises four distinct gear health states: normal condition (NC), planetary gear fault (PGF), sun gear fault (SGF), and ring gear fault (RGF). For each of these gear states, vibration signals were collected under two different motor speed conditions, with detailed configurations shown in Table 6. The first motor speed mode linearly accelerates from 0 rpm to 720 rpm (Figure 10a), while the second mode exhibits sinusoidal periodic variations (Figure 10b). This sinusoidal pattern effectively includes both nonlinear time-varying speed conditions, characterized by small time-scale acceleration and deceleration, and periodic time-varying speed conditions, characterized by large time-scale speed variations. When the motor speed is held constant, the generator torque remains steady. When the motor speed varies in a sinusoidal manner, the generator torque responds by adjusting inversely to the speed changes. This inverse adjustment ensures that the load torque is regulated according to the varying motor speed, maintaining the balance and performance of the system. Based on different motor speeds, nine transfer diagnostic tasks were designed for gear fault diagnosis, as detailed in Table 7.
The second dataset used for method validation comprises four distinct planetary bearing faults: normal (NC), inner race fault (IRF), outer race fault (ORF), and rolling element fault (REF). Likewise, experiments on the planet bearings for each fault type were conducted under two distinct time-varying motor speeds. The detailed speed configurations are given in Table 8. Based on the different motor speeds, a total of nine transfer diagnostic tasks were designed for this dataset. The specific details of these transfer diagnostic tasks are outlined in Table 8.

4.3. Method Validation and Diagnostic Results

To validate the reliability of the proposed method, the state-of-the-art methods were compared to SDAA for fault diagnosis under cross conditions, which includes ResNet18 [27], deep adaptation network (DAN) [21], deep domain adversarial network (DDAN) [22], and dynamic adversarial adaptation network (DAAN) [23]. All experimental analyses utilized ResNet18 as the base feature extractor within the PyTorch 1.3 framework. For each method, ten trials were conducted for each task to reduce randomness. The calculation was completed within 3.6 s for each trial. The performance of method was assessed by the metrics of mean and standard deviation.
(1)
Fault Diagnosis of Planetary Gear
Table 9 presents the results of different methods for fault diagnosis of gear under cross conditions. SDAA demonstrates superior performance with an average diagnostic accuracy of 96.7% and a standard deviation of 0.117. By contrast, ResNet18 exhibits a lower average accuracy of 48.5% and a higher standard deviation of 0.345. The comparison underscores the effectiveness of SDAA in achieving reliable and consistent diagnostic results under cross condition in fault diagnosis. As illustrated in Figure 11a, adaptation methods improve diagnostic results, with SDAA obtaining the highest improvement (48.2%) and the lowest standard deviation, indicating consistent performance across multiple analyses. Additionally, Figure 11b demonstrates stability of different methods across various tasks. Compared to other methods, SDAA exhibits a variability of only 2.2% in diagnostic results between tasks. SDAA focuses not only on independently evaluating the marginal distribution differences between source and target tasks, but also the correlation among different subclass fault data. This results in a decrease in feature bias. The method can capture the nuanced relationships between subclass faults successfully. Consequently, SDAA enhances the overall performance of fault diagnosis tasks.
Using the Vin-V3 task as an example, Figure 12 illustrates the training process for various methods. In Figure 12a, all four methods show convergence during training. However, the losses for DAN, DAAN, and SDAA are higher than those for ResNet18, reflecting the greater impact of adaptation loss between the source and target tasks on model training. Figure 12b,c demonstrate that SDAA achieves faster test loss convergence, lower overall test loss, and higher diagnostic accuracy compared to the other methods. Additionally, SDAA maintains stability throughout training, with no significant fluctuations, indicating its effectiveness in evaluating data differences between the Vin and V3 conditions. This enables SDAA to extract more in-depth feature information from the source task. The confusion matrix in Figure 13 highlights the diagnostic performance of various methods in classifying data across different health states. From the matrices, it is evident that SDAA outperforms the other methods in classification accuracy. This comparison underscores the superiority of SDAA in the fault diagnosis of planetary gearboxes.
(2)
Fault Diagnosis Results of Planetary Bearing
Table 10 presents the results of different methods for fault diagnosis of planetary bearings under cross conditions. As depicted in Figure 14, SDAA achieves the highest average result (95.2%) and the lowest standard deviation (0.069), indicating superior diagnostic performance in this task. This highlights the importance of effectively evaluating data discrepancies of sub-class data under cross condition. Figure 15 illustrates the test loss and diagnostic accuracy of three adversarial adaptation methods during the training of the diagnostic task Bin-B1. In Figure 15a, the test losses for DDAN and DAAN fluctuate significantly and are consistently higher than those for SDAA. Likewise, Figure 15b reveals that the diagnostic accuracies of DDAN and DAAN are unstable during training. In contrast, SDAA exhibits the most stable performance throughout the entire training process.
In the fault diagnosis of planetary gearboxes, the proposed method excels other methods, as it integrates the adversarial training mechanism that specifically evaluates and aligns the feature distributions of the same fault types between the source and target domains. Moreover, the method incorporates conditional distribution adaptation, which enhances the model’s ability to correlate different sub-class faults within each domain. In planetary gearboxes, various sub-class faults exhibit subtle differences in their signal patterns. Conditional adaptation helps the model not only recognize these sub-classes but also establish relationships between them, even if they appear differently in the source and target domains. By addressing both the alignment of specific fault types and the correlations between sub-class faults, the proposed method improves the model’s ability to generalize across different conditions. This leads to more reliable fault diagnosis in planetary gearboxes, ensuring that the model can accurately identify faults even in varying operational environments, ultimately improving its performance in real-world applications.
Using the B3-B1 task as an example, Figure 16 visualizes the clustering of features using the t-SNE technique. In Figure 16a, samples from IRF, ORF, and REF fail to be accurately clustered by ResNet18, indicating its inadequacy for fault diagnosis of planetary bearings under cross-condition scenarios. In Figure 16b–d, features from different fault categories appear mixed, demonstrating that evaluating only the marginal distribution differences between source and target domain data are insufficient for accurate fault diagnosis. In contrast, Figure 16e shows that SDAA successfully clusters samples of the same fault together in the target domain. By bridging the gap between domains, SDAA offers a more reliable and robust approach to cross-condition fault diagnosis, outperforming other methods by focusing on both domain and sub-class data differences.

5. Conclusions

In practical engineering applications, correlations exist among different types of fault data in planetary gearboxes. Traditional methods often assume these fault data are independent, which is not the case. Additionally, planetary gearbox vibration signals contain complex frequency components that change over time. Relying solely on time-domain data fails to capture the inherent features of these signals. Both of these factors can significantly undermine the performance of fault diagnosis models, leading to less accurate or reliable results.
To address these issues, this paper proposes the SDAA method for planetary gearbox health state identification. The approach incorporates the STFT to compute the time–frequency representation of raw signals, which effectively captures intrinsic features without frequency overlap. Following this, a dual-stream network structure is employed to extract features from the time–frequency representations of both source and target tasks.
The method adopts adversarial training mechanism to evaluate the feature distribution differences between corresponding faults in the source and target domains. At the same time, conditional distribution adaptation strengthens the correlations between different subclass faults, improving the model’s ability to generalize to the target task’s data distribution.
Finally, the proposed method is validated through multiple cross-condition fault diagnosis scenarios on experimental data of planetary gearboxes. Comparative studies show that SDAA significantly enhances diagnostic performance, even with limited data samples for model training. This proves SDAA a practical and effective solution for planetary gearbox fault identification under varying operational conditions.

Author Contributions

Conceptualization, S.H. and Y.Z.; methodology, S.H. and Z.F.; software, S.H.; validation, S.H. and Z.F.; formal analysis, S.H.; investigation, M.D. and Y.Y.; resources, M.D. and Y.Y.; data curation, Z.F.; writing—original draft preparation, S.H., Y.Z. and Z.F.; writing—review and editing, Z.F., M.D. and Y.Y.; visualization, Y.Z.; supervision, Y.Z. and Z.F.; project administration, Z.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zhao, C.; Feng, Z. Application of multi-domain sparse features for fault identification of planetary gearbox. Measurement 2017, 104, 169–179. [Google Scholar] [CrossRef]
  2. Liang, X.; Zuo, M.J.; Feng, Z. Dynamic modeling of gearbox faults: A review. Mech. Syst. Signal Process. 2018, 98, 852–876. [Google Scholar] [CrossRef]
  3. Lei, Y.; Li, N.; Li, X. Big Data-Driven Intelligent Fault Diagnosis and Prognosis for Mechanical Systems; Springer: Berlin/Heidelberg, Germany, 2023. [Google Scholar]
  4. Zhao, Z.; Han, H.; Wang, P.; Ma, H.; Zhang, S.; Yang, Y. An improved model for meshing characteristics analysis of spur gears considering fractal surface contact and friction. Mech. Mach. Theory 2021, 158, 104219. [Google Scholar] [CrossRef]
  5. Xiang, L.; An, C.; Zhang, Y.; Hu, A. Failure dynamic modelling and analysis of planetary gearbox considering gear tooth spalling. Eng. Fail. Anal. 2021, 125, 105444. [Google Scholar] [CrossRef]
  6. Feng, Z.; Zuo, M.J. Vibration signal models for fault diagnosis of planetary gearboxes. J. Sound Vib. 2012, 331, 4919–4939. [Google Scholar] [CrossRef]
  7. Feng, Z.; Zhang, D.; Zuo, M.J. Planetary gearbox fault diagnosis via joint amplitude and frequency demodulation analysis based on variational mode decomposition. Appl. Sci. 2017, 7, 775. [Google Scholar] [CrossRef]
  8. Jiang, X.; Wang, J.; Shi, J.; Shen, C.; Huang, W.; Zhu, Z. A coarse-to-fine decomposing strategy of VMD for extraction of weak repetitive transients in fault diagnosis of rotating machines. Mech. Syst. Signal Process. 2019, 116, 668–692. [Google Scholar] [CrossRef]
  9. Zhao, X.; Jia, M. Fault diagnosis of rolling bearing based on feature reduction with global-local margin Fisher analysis. Neurocomputing 2018, 315, 447–464. [Google Scholar] [CrossRef]
  10. Yu, J.; He, Y. Planetary gearbox fault diagnosis based on data-driven valued characteristic multigranulation model with incomplete diagnostic information. J. Sound Vib. 2018, 429, 63–77. [Google Scholar] [CrossRef]
  11. Wu, X.; Zhang, Y.; Cheng, C.; Peng, Z. A hybrid classification autoencoder for semi-supervised fault diagnosis in rotating machinery. Mech. Syst. Signal Process. 2021, 149, 107327. [Google Scholar] [CrossRef]
  12. Zhang, K.; Tang, B.; Deng, L.; Tan, Q.; Yu, H. A fault diagnosis method for wind turbines gearbox based on adaptive loss weighted meta-ResNet under noisy labels. Mech. Syst. Signal Process. 2021, 161, 107963. [Google Scholar] [CrossRef]
  13. Yin, A.; Yan, Y.; Zhang, Z.; Li, C.; Sánchez, R.-V. Fault diagnosis of wind turbine gearbox based on the optimized LSTM neural network with cosine loss. Sensors 2020, 20, 2339. [Google Scholar] [CrossRef]
  14. Ullah, F.; Turab, A.; Ullah, S.; Cacciagrano, D.; Zhao, Y. Enhanced Network Intrusion Detection System for Internet of Things Security Using Multimodal Big Data Representation with Transfer Learning and Game Theory. Sensors 2024, 24, 4152. [Google Scholar] [CrossRef] [PubMed]
  15. Li, X.; Zhang, W.; Ding, Q. A robust intelligent fault diagnosis method for rolling element bearings based on deep distance metric learning. Neurocomputing 2018, 310, 77–95. [Google Scholar] [CrossRef]
  16. Zhu, J.; Chen, N.; Shen, C. A new deep transfer learning method for bearing fault diagnosis under different working conditions. IEEE Sens. J. 2019, 20, 8394–8402. [Google Scholar] [CrossRef]
  17. Deng, Y.; Huang, D.; Du, S.; Li, G.; Zhao, C.; Lv, J. A double-layer attention based adversarial network for partial transfer learning in machinery fault diagnosis. Comput. Ind. 2021, 127, 103399. [Google Scholar] [CrossRef]
  18. Zhao, K.; Jiang, H.; Wang, K.; Pei, Z. Joint distribution adaptation network with adversarial learning for rolling bearing fault diagnosis. Knowl.-Based Syst. 2021, 222, 106974. [Google Scholar] [CrossRef]
  19. Han, T.; Liu, C.; Yang, W.; Jiang, D. Deep transfer network with joint distribution adaptation: A new intelligent fault diagnosis framework for industry application. ISA Trans. 2020, 97, 269–281. [Google Scholar] [CrossRef]
  20. Tan, Y.; Guo, L.; Gao, H.; Zhang, L. Deep coupled joint distribution adaptation network: A method for intelligent fault diagnosis between artificial and real damages. IEEE Trans. Instrum. Meas. 2020, 70, 3507212. [Google Scholar] [CrossRef]
  21. Tzeng, E.; Hoffman, J.; Zhang, N.; Saenko, K.; Darrell, T. Deep domain confusion: Maximizing for domain invariance. arXiv 2014, arXiv:1412.3474. [Google Scholar]
  22. Ganin, Y.; Lempitsky, V. Unsupervised domain adaptation by backpropagation. In Proceedings of the International Conference on Machine Learning, Lille, France, 6–11 July 2015; pp. 1180–1189. [Google Scholar]
  23. Yu, C.; Wang, J.; Chen, Y.; Huang, M. Transfer learning with dynamic adversarial adaptation network. In Proceedings of the 2019 IEEE International Conference on Data Mining (ICDM), Beijing, China, 8–11 November 2019; pp. 778–786. [Google Scholar]
  24. Feng, Z.; Liang, M.; Chu, F. Recent advances in time–frequency analysis methods for machinery fault diagnosis: A review with application examples. Mech. Syst. Signal Process. 2013, 38, 165–205. [Google Scholar] [CrossRef]
  25. Song, L.; Tan, J. Rotation Error Prediction of CNC Spindle Based on Short-Time Fourier Transform of Vibration Sensor Signals and Improved Weighted Residual Network. Sensors 2024, 24, 4244. [Google Scholar] [CrossRef] [PubMed]
  26. Li, X.; Hu, Y.; Li, M.; Zheng, J. Fault diagnostics between different type of components: A transfer learning approach. Appl. Soft Comput. 2020, 86, 105950. [Google Scholar] [CrossRef]
  27. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  28. Satpal, S.; Sarawagi, S. Domain adaptation of conditional probability models via feature subsetting. In Knowledge Discovery in Databases: PKDD 2007, Proceedings of the 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, Warsaw, Poland, 17–21 September 2007; Springer: Berlin/Heidelberg, Germany, 2007; pp. 224–235. [Google Scholar]
Figure 1. Diagram of data distribution adaptation among different fault classes: (a) averaged distribution of faults; (b) data distribution of different faults.
Figure 1. Diagram of data distribution adaptation among different fault classes: (a) averaged distribution of faults; (b) data distribution of different faults.
Sensors 24 07017 g001
Figure 2. Overfitting during deep network training.
Figure 2. Overfitting during deep network training.
Sensors 24 07017 g002
Figure 3. Schematic structure of residual block.
Figure 3. Schematic structure of residual block.
Sensors 24 07017 g003
Figure 4. Domain adversarial training process.
Figure 4. Domain adversarial training process.
Sensors 24 07017 g004
Figure 5. Domain confusion training process.
Figure 5. Domain confusion training process.
Sensors 24 07017 g005
Figure 6. Transfer diagnostic framework for SDAA.
Figure 6. Transfer diagnostic framework for SDAA.
Sensors 24 07017 g006
Figure 7. Test rig of one-staged planetary gearbox. (a) Experimental test rig; (b) diagram of the gearbox structure.
Figure 7. Test rig of one-staged planetary gearbox. (a) Experimental test rig; (b) diagram of the gearbox structure.
Sensors 24 07017 g007
Figure 8. Damage parts in planetary gearboxes. (a) Planet gear fault; (b) sun gear fault; (c) ring gear fault; (d) inner race fault; (e) outer race fault; (f) rolling element fault.
Figure 8. Damage parts in planetary gearboxes. (a) Planet gear fault; (b) sun gear fault; (c) ring gear fault; (d) inner race fault; (e) outer race fault; (f) rolling element fault.
Sensors 24 07017 g008
Figure 9. Sample division diagram in vibration signal.
Figure 9. Sample division diagram in vibration signal.
Sensors 24 07017 g009
Figure 10. Motor speed curve under two time-varying mode. (a) Linearity; (b) sinusoidal.
Figure 10. Motor speed curve under two time-varying mode. (a) Linearity; (b) sinusoidal.
Sensors 24 07017 g010
Figure 11. Comparison of diagnostic results for different methods. (a) Method Performance; (b) accuracy variation.
Figure 11. Comparison of diagnostic results for different methods. (a) Method Performance; (b) accuracy variation.
Sensors 24 07017 g011
Figure 12. The convergence of training on the Vin-V3 task. (a) Training loss; (b) test loss; (c) accuracy.
Figure 12. The convergence of training on the Vin-V3 task. (a) Training loss; (b) test loss; (c) accuracy.
Sensors 24 07017 g012
Figure 13. Fault diagnosis performance on the Vin-V3 task. (a) ResNet18; (b) DAN; (c) DDAN; (d) DAAN; (e) SDAA.
Figure 13. Fault diagnosis performance on the Vin-V3 task. (a) ResNet18; (b) DAN; (c) DDAN; (d) DAAN; (e) SDAA.
Sensors 24 07017 g013
Figure 14. Comparison of diagnostic results for different methods.
Figure 14. Comparison of diagnostic results for different methods.
Sensors 24 07017 g014
Figure 15. The convergence of adversarial adaptation methods on the Bin-B1 task. (a) Test loss; (b) accuracy.
Figure 15. The convergence of adversarial adaptation methods on the Bin-B1 task. (a) Test loss; (b) accuracy.
Sensors 24 07017 g015
Figure 16. Feature visualization of different methods in task B3-B1. (a) ResNet18; (b) DAN; (c) DDAN; (d) DAAN; (e) SDAA.
Figure 16. Feature visualization of different methods in task B3-B1. (a) ResNet18; (b) DAN; (c) DDAN; (d) DAAN; (e) SDAA.
Sensors 24 07017 g016
Table 1. Training steps for SDAA.
Table 1. Training steps for SDAA.
Input: Given source domain data pairs D S = { x i S , y i S } i = 1 n and target domain data D T = { x j T } j = 1 m .
Output: The well-trained model.
Begin
 1: Configure the adaptation layer in the diagnostic framework.
 2: Randomly initialize the ResNet18 parameters.
Training
for epoch in epochs do
 3: Simultaneously extract the feature information from the source domain data D S and target domain data D T .
 4: Computing the classification loss in the source task D S , L c D S .
 5: Calculate the adversarial loss L adv between D S and D T .
 6: Calculate the conditional adaptation loss L CMMD between D S and D T .
 7: Fine-tuning the classification loss L c D S by adversarial loss L adv and adaptation loss L CMMD .
 8: Updating network weight parameters by back propagation with updated loss
end for
Until The loss in the target task converges or the training epochs reach.
Table 2. Parameters of induction motor and permanent magnet generator.
Table 2. Parameters of induction motor and permanent magnet generator.
Induction MotorGenerator
Rated power4 kW200 W
Rated voltage380 V24 V
No. of slots3639
No. of pole pairs26
Table 3. Gear configuration parameters.
Table 3. Gear configuration parameters.
Gear TypePlanetSunRing
No. of gear teeth35 (3)36108
Note: no. of planet gear in brackets.
Table 4. Planetary bearing configuration parameters.
Table 4. Planetary bearing configuration parameters.
Diameter of Pitch Circle (mm)Diameter of Rollers (mm)No. of RollersContact (°)
19.53.5100
Table 5. Fault sizes of gears and planetary bearings.
Table 5. Fault sizes of gears and planetary bearings.
Length (mm)Width (mm)Depth (mm)
Sun gear16.30.20.8
Planet gear170.20.8
Ring gear17.50.20.8
Inner race (planet pin)1810.5
Outer race (planet bore)1710.5
Rolling element11.510.5
Table 6. Motor speed configuration in gear fault experiments.
Table 6. Motor speed configuration in gear fault experiments.
Time-Varying ModeRange of Motor Speed(rpm)Symbol
Linearity0~720Vin/Bin
Sinusoidal150~330V1/B1
390~570V2/B2
510~690V3/B3
Table 7. Transfer diagnosis task of gear faults.
Table 7. Transfer diagnosis task of gear faults.
Motor Speed (rpm)Transfer TasksHealth Status
Source DomainTarget Domain
10~720150~330 Vin–V1NC
PGF
RGF
SGF
2390~570 Vin–V2
3510~690 Vin–V3
4150~330 390~570 V1–V2
5390~570 150~330 V2–V1
6150~330 510~690 V1–V3
7510~690 150~330 V3–V1
8390~570 510~690 V2–V3
9510~690 390~570 V3–V2
Table 8. Transfer diagnosis task of planet bearing faults.
Table 8. Transfer diagnosis task of planet bearing faults.
Motor Speed (rpm)Transfer TasksHealth Status
Source DomainTarget Domain
10~720150~330Bin-B1NC
IRF
ORF
REF
2390~570Bin-B2
3510~690Bin-B3
4150~330 390~570 B1-B2
5390~570 150~330 B2-B1
6150~330 510~690 B1-B3
7510~690 150~330 B3-B1
8390~570 510~690 B2-B3
9510~690 390~570 B3-B2
Table 9. Diagnostic results of different methods in gear fault diagnosis.
Table 9. Diagnostic results of different methods in gear fault diagnosis.
ResNet18DANDDANDAANSDAA
Vin-V159.5 ± 0.39276.2 ± 0.23270.4 ± 0.34983.7 ± 0.29595.3 ± 0.126
Vin-V257.5 ± 0.32977.3 ± 0.21868.5 ± 0.32786.7 ± 0.23696.5 ± 0.096
Vin-V356.4 ± 0.35275.9 ± 0.18777.2 ± 0.28382.6 ± 0.22895.9 ± 0.118
V1-V243.3 ± 0.28576.2 ± 0.26274.7 ± 0.27483.4 ± 0.32497.2 ± 0.112
V2-V144.9 ± 0.31475.8 ± 0.27873.5 ± 0.26183.6 ± 0.33296.8 ± 0.106
V1-V339.8 ± 0.40272.7 ± 0.36571.3 ± 0.28285.8 ± 0.34397.5 ± 0.130
V3-V140.3 ± 0.33671.6 ± 0.32270.5 ± 0.24384.9 ± 0.34997.1 ± 0.112
V2-V346.7 ± 0.34775.4 ± 0.34983.6 ± 0.35886.3 ± 0.27396.9 ± 0.121
V3-V248.3 ± 0.35176.5 ± 0.31882.3 ± 0.30287.1 ± 0.28597.3 ± 0.132
Average48.5 ± 0.34575.3 ± 0.28174.7 ± 0.29884.9 ± 0.29696.7 ± 0.117
Table 10. Method comparison for fault diagnosis of planet bearing.
Table 10. Method comparison for fault diagnosis of planet bearing.
ResNet18DANDDANDAANSDAA
Bin-B153.3 ± 0.43574.2 ± 0.34165.4 ± 0.25287.6 ± 0.28493.6 ± 0.106
Bin-B251.9 ± 0.54275.3 ± 0.35763.5 ± 0.23183.5 ± 0.23195.8 ± 0.083
Bin-B349.8 ± 0.53675.9 ± 0.22668.2 ± 0.25782.8 ± 0.19894.7 ± 0.035
B1-B242.4 ± 0.52875.1 ± 0.21673.7 ± 0.16282.1 ± 0.30295.9 ± 0.052
B2-B143.5 ± 0.48474.2 ± 0.25274.3 ± 0.15181.7 ± 0.29494.6 ± 0.084
B1-B339.6 ± 0.45570.8 ± 0.25667.4 ± 0.20185.5 ± 0.13294.8 ± 0.108
B3-B140.2 ± 0.52169.9 ± 0.21868.8 ± 0.44284.3 ± 0.11895.7 ± 0.043
B2-B345.3 ± 0.44277.3 ± 0.35184.2 ± 0.29486.3 ± 0.15496.7 ± 0.057
B3-B246.5 ± 0.45878.8 ± 0.38684.7 ± 0.25986.4 ± 0.13495.2 ± 0.052
Average45.8 ± 0.48974.6 ± 0.28972.2 ± 0.24984.5 ± 0.20595.2 ± 0.069
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Han, S.; Feng, Z.; Zhang, Y.; Du, M.; Yang, Y. Intelligent Fault Diagnosis of Planetary Gearbox Across Conditions Based on Subdomain Distribution Adversarial Adaptation. Sensors 2024, 24, 7017. https://doi.org/10.3390/s24217017

AMA Style

Han S, Feng Z, Zhang Y, Du M, Yang Y. Intelligent Fault Diagnosis of Planetary Gearbox Across Conditions Based on Subdomain Distribution Adversarial Adaptation. Sensors. 2024; 24(21):7017. https://doi.org/10.3390/s24217017

Chicago/Turabian Style

Han, Songjun, Zhipeng Feng, Ying Zhang, Minggang Du, and Yang Yang. 2024. "Intelligent Fault Diagnosis of Planetary Gearbox Across Conditions Based on Subdomain Distribution Adversarial Adaptation" Sensors 24, no. 21: 7017. https://doi.org/10.3390/s24217017

APA Style

Han, S., Feng, Z., Zhang, Y., Du, M., & Yang, Y. (2024). Intelligent Fault Diagnosis of Planetary Gearbox Across Conditions Based on Subdomain Distribution Adversarial Adaptation. Sensors, 24(21), 7017. https://doi.org/10.3390/s24217017

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop