Rolling Bearing Dynamics Simulation Information-Assisted Fault Diagnosis with Multi-Adversarial Domain Transfer Learning

Li, Zhe; Zhong, Zhidan; Zhang, Zhihui; Mao, Wentao; Zhang, Weiqi

doi:10.3390/lubricants13030116

Open AccessArticle

Rolling Bearing Dynamics Simulation Information-Assisted Fault Diagnosis with Multi-Adversarial Domain Transfer Learning

by

Zhe Li

¹

,

Zhidan Zhong

^1,2,*

,

Zhihui Zhang

¹,

Wentao Mao

³

and

Weiqi Zhang

¹

School of Mechanical and Electrical Engineering, Henan University of Science and Technology, Luoyang 471003, China

²

Henan Collaborative Innovation Center for High-End Bearings, Henan University of Science and Technology, Luoyang 471000, China

³

School of Computer and Information Engineering, Henan Normal University, Xinxiang 453000, China

^*

Author to whom correspondence should be addressed.

Lubricants 2025, 13(3), 116; https://doi.org/10.3390/lubricants13030116

Submission received: 19 January 2025 / Revised: 4 March 2025 / Accepted: 5 March 2025 / Published: 7 March 2025

(This article belongs to the Special Issue New Horizons in Machine Learning Applications for Tribology)

Download

Browse Figures

Versions Notes

Abstract

:

To address the issues of negative transfer and reduced stability in transfer learning models for rolling bearing fault diagnosis under variable working conditions, an unsupervised multi-adversarial transfer learning fault diagnosis algorithm based on bearing dynamics simulation data is proposed. Firstly, the algorithm constructs both a global domain classifier and a subdomain classifier. In the subdomain classifier, the simulated vibration signal, which contains rich bearing fault label information, is generated by constructing dynamic equations to replace the label prediction of target domain data, thereby achieving alignment of marginal and conditional distributions. Simultaneously, an improved loss function with embedded maximum mean discrepancy is designed to reduce the feature distribution gap between source and target domain data. Finally, a weight allocation mechanism for source domain and simulation domain samples is developed to promote positive transfer and suppress negative transfer. Experiments were conducted using the Paderborn University dataset and the Huazhong University of Science and Technology dataset, achieving accuracy rates of 89.457% and 96.436%, respectively. The results show that, in comparison with existing unsupervised cross-domain fault diagnosis methods, the proposed method demonstrates significant improvements in diagnostic accuracy and stability, demonstrating its superiority in rolling bearing fault diagnosis under variable operational conditions.

Keywords:

simulation signal; transfer learning; unsupervised fault diagnosis; weight allocation mechanism

1. Introduction

Bearings are key components of rotating machinery, and their performance deterioration or failure directly impacts the equipment’s ability to operate effectively. Therefore, the safe operation of a mechanical system depends largely on the smooth operation of the bearings [1,2,3,4].

Driven by significant breakthroughs in artificial intelligence technology, the successful application of fault diagnosis based on deep learning has garnered widespread attention from scholars worldwide [5,6,7,8].

However, in real-world engineering, obtaining large volumes of richly labeled fault data is highly challenging due to cost and safety constraints, particularly for rare fault types, where the available field data are often severely limited. Additionally, machinery and equipment are typically operated under variable conditions, resulting in differences in the distribution of measured samples. Consequently, it is challenging to directly apply a model trained on one operating condition to samples from other operating conditions [9]. Building on this, transfer learning methods are introduced in this context, and the study of unsupervised cross-domain bearing transfer learning for fault diagnosis holds significant practical importance [10,11]. The fault diagnosis performance in the target domain can be improved by transferring essential features from the richly labeled data of the source domain to the unlabeled or sparsely labeled data of the target domain [12]. Therefore, unsupervised domain-adaptive fault diagnosis offers an effective solution to address the issues of data distribution discrepancy and label scarcity.

As a crucial transfer learning strategy, unsupervised domain adaptation effectively addresses various challenges, such as distributional inconsistencies between the source and target domains. In recent years, these topics have garnered significant and sustained attention from researchers. Pei et al. [13] proposed a method that employs a multi-domain discriminator to achieve the alignment of various data distributions by capturing multimodal structures. Long et al. [14] proposed a joint adaptation network to align the joint distributions of various domain-specific layers across domains, thereby facilitating the learning of a transfer network. Kang et al. [15] proposed a adaptive network that explicitly represents both intra-class and inter-class domain differences to generate more discriminative features, addressing the previous issue of class information neglect, which led to feature misalignment and poor transferability. By adding an extra classifier for the target data, Liang et al. [16] presented an innovative pseudo-labeling framework that lowers classifier bias while improving pseudo-label quality and performance. Nevertheless, when there is a large distribution discrepancy between two domains, conventional transfer learning methods find it difficult to accomplish fine-grained hierarchical alignment, which results in negative transfer. Therefore, it is essential to incorporate simulation data based on fault mechanisms to facilitate more fine-grained domain adaptation for transfer learning.

The fault simulation signals, generated based on the bearing damage mechanism, encompass damage characteristics under various operating conditions and serve as an ideal source of supervisory information. Several studies have utilized fault simulation signals to assist in transfer fault diagnosis. Hou et al. [17] further modeled faulty vibration signals by combining constructed faulty pulses with measured normal baseline data. Qin et al. [18] proposed an innovative dynamic model for rolling bearings exhibiting defects. Li et al. [19] proposed a mathematical model for multi-DOF angular contact ball bearings employing an enhanced iterative method grounded in internal raceway control theory and nonlinear elastic Hertz contact theory. By constructing simulation domains to guide domain-adversarial transfer learning, diagnostic accuracy and model generalization can be improved. Simulation domains help models better learn different features and patterns during training, enabling fine-grained alignment between multiple domains through an effective transfer learning mechanism.

The method proposed in the aforementioned literature demonstrates promising transfer results in fault diagnosis based on simulation data, offering valuable insights and references for research in this field. However, several existing challenges must be addressed to a certain extent in order to further enhance the performance of bearing fault diagnosis.

(1) In most recent research, bearing source-domain fault data are typically real data that are difficult to adapt to the changing demands of fault data under varying operating conditions. However, it is straightforward to generate a large volume of simulation data with comprehensive fault annotations using numerical simulation technologies [20,21], thereby reducing reliance on experimental platform data. (2) The alignment of conditional distributions is often overlooked in favor of focusing solely on aligning the marginal distributions between two domains. This oversight may lead to the misclassification of samples near the category boundaries in the target domain [22]. (3) In domain adaptation, samples are typically assigned equal weights. Even if the source or simulation domain samples differ substantially from the target domain, they are assigned equal weight, which may lead to negative transfer [23].

This work proposes a multi-adversarial domain transfer learning fault detection algorithm that utilizes bearing dynamics simulation data to address the aforementioned challenges and the current state of the field. The following are the innovations of this study:

Simulated vibration signals representing bearing faults are generated using bearing dynamics equations, and a domain adversarial transfer learning network that integrates bearing simulation data is developed. A loss function embedded with the maximum mean discrepancy metric is formulated, and simulation data are integrated into the design of subdomain classifiers, facilitating fine-grained alignment from the source domain to the target domain, as well as simultaneous alignment of both marginal and conditional distributions in the context of unsupervised fault diagnosis. A domain similarity-guided weight assignment mechanism is proposed to suppress negative transfer by assigning varying weights to each source domain and simulation domain sample, based on their similarity to the target domain sample.

2. Theoretical Basis

2.1. Unsupervised Domain Adversarial Neural Networks

Domain Adversarial Neural Networks (DANNs) [24] integrate deep feature learning with domain adaptation in a unified training process to address the challenge of data distribution mismatch.

Figure 1 depicts the DANN’s structure. The commonly used loss function in a DANN consists of the label predictor loss

L_{y}

and the domain classifier loss

L_{d}

, as defined below:

L_{d} (θ_{f}, θ_{d}) = - \frac{1}{n_{s}} \sum_{i = 1}^{n_{s}} log (G_{d} (G_{f} (x_{i}^{s}))) - \frac{1}{n_{t}} \sum_{j = 1}^{n_{t}} log (1 - G_{d} (G_{f} (x_{j}^{t})))

(1)

L_{y} (θ_{f}, θ_{y}) = - \frac{1}{n_{s}} \sum_{i = 1}^{n_{5} K_{s} - 1} \sum_{k = 0} I [y_{i}^{s} = k] log (G_{y}^{k} (G_{f} (x_{i}^{s})))

(2)

Through the integration of domain classifier and label predictor with neural network training, the following optimizations are required:

min_{W, b, V, c} [\frac{1}{n} \sum_{i = 1}^{n} L_{y}^{i} (G_{y} (G_{f} (x_{i}; W, b); V, c), y_{i}) + λ \cdot R (W, b)]

(3)

In this equation,

L_{y}^{i} (G_{y} (G_{f} (x_{i}; W, b); V, c), y_{i})

represents the predicted loss of the ith example in shorthand notation.

R (W, b) = max_{u, z} [- \frac{1}{n} \sum_{i = 1}^{n} L_{d}^{i} (W, b, u, z) - \frac{1}{n^{'}} \sum_{i = n + 1}^{N} L_{d}^{i} (W, b, u, z)]

(4)

The overall objective function of the DANN is given by the following formula:

\begin{matrix} E (W, V, b, c, u, z) = \frac{1}{n} \sum_{i = 1}^{n} L_{y}^{i} (W, b, V, c) \\ - l (\frac{1}{n} \sum_{i = 1}^{n} L_{d}^{i} (W, b, u, z) + \frac{1}{n_{i}} \sum_{i = n + 1}^{N} L_{d}^{i} (W, b, u, z)) \end{matrix}

(5)

The objective function of the DANN demonstrates that it not only optimizes the performance of the classification task but also reduces the two disparity domains through adversarial training, thereby enabling the model to achieve superior domain adaptation capabilities.

2.2. Rolling Bearing Failure Mechanism

Rolling bearings are subjected to continuous alternating stress during operation. The contact between the inner and outer raceways and the rolling elements leads to contact fatigue, causing the contact surface to gradually peel off and eventually result in fatigue spalling. When the rolling element passes through the spalling area, particularly during the entry and exit stages, as shown in Figure 2, the contact mode between the rolling element and the raceway undergoes a sudden change, leading to a process of “stress relief” and “stress recovery”. Finally, the vibration signal is characterized by a dual-impulse behavior phenomenon [25]. This paper adopts the dual-impulse behavior characteristic dynamic modeling of ball-bearing raceway spalling damage based on time-varying contact stiffness to generate vibration signals to form a simulation domain data set.

According to Hertzian contact theory, the dynamic equation of a four-degree-of-freedom rolling bearing can be established [26,27]:

\begin{matrix} m_{i} {\ddot{x}}_{i} + c_{i x} {\dot{x}}_{i} + k_{i x} x_{i} = - F_{i x} + m_{i} e ω^{2} cos (ω t) + W_{x} \\ m_{i} {\ddot{y}}_{i} + c_{i j} {\dot{y}}_{i} + k_{i j} y_{i} = - F_{i y} + m_{i} e ω^{2} sin (ω t) - m_{i} g + W_{y} \\ m_{o} {\ddot{x}}_{o} + c_{o x} {\dot{x}}_{o} + k_{o x} x_{o} = F_{o x} \\ m_{o} {\ddot{y}}_{o} + c_{o y} {\dot{y}}_{o} + k_{o y} y_{o} = F_{o y} - m_{o} g \end{matrix}

(6)

m_{i}

and

m_{o}

are inner and outer ring masses, while

c_{i}

and

c_{o}

are inner and outer ring support damping.

W_{x}

and

W_{y}

are radial forces. e is the eccentricity, and

c_{r}

is the bearing clearance.

The bearing contact stiffness is

k_{b}

.

k_{b} = {[\frac{1}{{(1 / k_{b i})}^{2 / 3}} + \frac{1}{{(1 / k_{b 0})}^{2 / 3}}]}^{\frac{3}{2}}

(7)

The bearing contact deformation is denoted by

δ_{j}

, as defined below:

δ_{j} = (x_{i} - x_{o}) cos θ_{j} + (y_{i} - y_{o}) sin θ_{j} - c_{d} - h

(8)

x_{0}

and

y_{0}

represent the displacement in the initial state.

3. Proposed Method

3.1. Simulation Domain Constructed Based on Simulation Data

Based on the definitions of the source and target domains, this paper constructs the simulation domain using the faulty bearing sample set derived from Equation (6) in Section 2.2. The simulation data retains the primary fault signal characteristics, serving as a reasonable simplification of the bearing system.

This paper models the ER-16K rolling bearings manufactured by Timken (North Canton, OH, USA) and 6203 rolling bearings manufactured by SKF (Gothenburg, Sweden), which are consistent with those used in the subsequent fault diagnosis experiments. Specific dimensional parameters are provided in Table 1.

At the same time, the method of generating simulation signals based on dynamic equations is used to illustrate the auxiliary guiding role of bearing simulation characteristics in fault diagnosis. The area formed by these simulation signals is called the simulation domain.

Combining the representation method of source domain

D_{s}

and target domain

D_{t}

, the simulation domain is represented by

D_{m}

. Combining the label predictor

G_{y}

and the global domain classifier

G_{d}

, a subdomain classifier

G_{m}

is proposed as a representation. Combining the representation of the label predictor loss function

L_{y}

and the domain classifier

L_{d}

, the simulation domain loss is represented by

L_{m}

.

3.2. Improved Loss Function Design with Embedded Simulation Domain

In the DANN unsupervised algorithm, the features extracted by the feature extractor are indistinguishable from those of the domain classifiers through “reverse gradient”, which helps reduce the distributional discrepancy between two domains in the feature space. However, this alignment mechanism relies on adversarial training and does not explicitly optimize the distributional discrepancy between two domains. Therefore, in this paper, we use maximum mean discrepancy (MMD [28]) to explicitly reduce the feature discrepancy between two domain data.

In this paper, the MMD method is introduced to optimize the loss function of the DANN. By embedding this formula into the loss function of the classical DANN for optimization, it effectively promotes the domain adaptation of the DANN in unsupervised transfer learning. The MMD formula is as follows:

\hat{D} (θ_{f}, θ_{y}) = \frac{1}{n^{2}} \sum_{i = 1}^{n} \sum_{j = 1}^{n} k (x_{s_{i}}, x_{s_{j}}) - \frac{2}{m n} \sum_{i = 1}^{n} \sum_{j = 1}^{m} k (x_{s_{i}}, x_{t_{j}}) + \frac{1}{m^{2}} \sum_{i = 1}^{m} \sum_{j = 1}^{m} k (x_{t_{i}}, x_{t_{j}})

(9)

Additionally, to further avoid negative transfer while achieving fine alignment, this paper builds subdomain classifier

G_{m}

by comparing with global classifier

G_{d}

. The global domain classifier evaluates the similarity between two domains, whereas the subdomain classifier measures the similarity between real and simulated data. The simulated data are generated with the same operational parameters as those of the target domain. Therefore, these data can serve as a subdomain to aid in achieving finer alignment during domain adaptation. The loss function for the global domain classifier

G_{d}

is defined as follows:

\begin{matrix} L_{d} (θ_{f}, θ_{d}) = - \frac{1}{n_{s}} \sum_{i}^{n_{s}} log (G_{d} (G_{f} (x_{i}^{s}))) - \frac{1}{n_{t}} \sum_{j}^{n_{t}} log (1 - G_{d} (G_{f} (x_{j}^{t}))) \end{matrix}

(10)

The loss function of the subdomain classifier

G_{m}

is defined as follows:

\begin{matrix} L_{m} (θ_{f}, θ_{m}) = - \frac{1}{n_{m}} \sum_{i}^{n_{m}} log (G_{m} (G_{f} (x_{i}^{m}))) - \frac{1}{n_{t}} \sum_{j}^{n_{t}} log (1 - G_{m} (G_{f} (x_{j}^{t}))) \end{matrix}

(11)

In this equation,

θ_{f}

and

θ_{m}

are the parameters of

G_{f}

and

G_{m}

, respectively, and the subdomain classifiers are aligned with the conditional distribution, while the global domain classifiers are aligned with the marginal distribution. Equation (4) is updated with the addition of the subdomain classifier

G_{m}

to the following regularized equation:

R^{'} (W, b) = max [\begin{matrix} - \frac{1}{n} \sum_{i = 1}^{n} L_{d}^{i} (W, b, u, z) - \\ \frac{1}{n^{'}} \sum_{i = n + 1}^{N} L_{d}^{i} (W, b, u, z) \\ - \frac{1}{n} \sum_{i = 1}^{n} L_{m}^{i} (W, b, u, z) - \\ \frac{1}{n^{'}} \sum_{i = n + 1}^{N} L_{m}^{i} (W, b, u, z) \end{matrix}]

(12)

Building on the previous discussion, the optimization goal of the domain classifier in this paper is to achieve domain fitness alignment by integrating the subdomain classifier, the global domain classifier, and the maximum mean discrepancy. This effectively suppresses the negative transfer problem. The complete loss function after integration is as follows:

L (θ_{f}, θ_{y}, θ_{d}, θ_{m}) = L_{y} (θ_{f}, θ_{y}) - λ L_{d} (θ_{f}, θ_{d}) - λ_{m} L_{m} (θ_{f}, θ_{m}) + μ \hat{D} (θ_{f}, θ_{y})

(13)

In the equation, hyperparameters are employed to adjust the weights of the maximum mean discrepancy, subdomain classifier, and global domain classifier. This optimization function seeks to enhance the alignment between two domains by minimizing both the domain classification loss and the model alignment loss.

3.3. Development of Sample Weight Allocation Mechanisms

To adapt to the target domain, traditional unsupervised domain adversarial adaptation techniques often assign the same weight to each sample from the source and simulation domains. However, samples from the simulation and source domains can differ significantly from those in the target domain. The transfer learning fault diagnostic model may lead to negative transfer if the equal weight allocation approach is maintained.

Given that domain classifiers find it more challenging to distinguish samples with high similarity between different domains, while samples with large differences are more easily distinguished by domain classifiers, their weight allocation mechanism can be computed based on the domain prediction errors of the source domain [29] and simulation domain samples. The specific weight

{\tilde{ω}}_{i}^{s}

of the ith sample in the source domain is as follows:

ω_{i}^{s} = - log (G_{d} (G_{f} (x_{i}^{s})))

(14)

The simulation domain sample weight allocation mechanism is defined as follows. The specific weight

{\tilde{ω}}_{i}^{m}

of the ith sample in the simulation domain is as follows:

ω_{i}^{m} = - log (G_{d} (G_{f} (x_{i}^{m})))

(15)

After applying min–max normalization, the normalized weight

{\tilde{ω}}_{i}^{s}

of

ω_{i}^{s}

is as follows:

{\tilde{ω}}_{i}^{s} = \frac{ω_{i}^{s} - ω_{i, min}^{s}}{ω_{i, max}^{s} - ω_{i, min}^{s}}

(16)

In this equation, where

ω_{i, max}^{s} = max (ω_{i}^{s})

and

ω_{i, \min}^{s} = min (ω_{i}^{s})

. After applying min–max normalization, the normalized weight

{\tilde{ω}}_{i}^{m}

of

ω_{i}^{m}

is as follows:

{\tilde{ω}}_{i}^{m} = \frac{ω_{i}^{m} - ω_{i, min}^{m}}{ω_{i, max}^{m} - ω_{i, min}^{m}}

(17)

In this equation, where

ω_{i, max}^{m} = max (ω_{i}^{m})

and

ω_{i, \min}^{m} = min (ω_{i}^{m})

. By substituting the normalized weight

{\tilde{ω}}_{i}^{s}

into Equation (10), the new global domain classifier cross-entropy loss

{\tilde{L}}_{d} (θ_{f}, θ_{d})

is redefined follows:

\begin{matrix} {\tilde{L}}_{d} (θ_{f}, θ_{d}) = - \frac{1}{n_{s}} \sum_{i}^{n_{s}} {\tilde{ω}}_{i}^{s} log (G_{d} (G_{f} (x_{i}^{s}))) - \frac{1}{n_{t}} \sum_{j}^{n_{t}} log (1 - G_{d} (G_{f} (x_{j}^{t}))) \end{matrix}

(18)

By substituting the normalized weight

{\tilde{ω}}_{i}^{m}

into Equation (11), the new subdomain classifier cross-entropy loss

{\tilde{L}}_{m} (θ_{f}, θ_{d})

is redefined as follows:

\begin{matrix} {\tilde{L}}_{m} (θ_{f}, θ_{m}) = - \frac{1}{n_{m}} \sum_{i}^{n_{m}} {\tilde{ω}}_{i}^{m} log (G_{m} (G_{f} (x_{i}^{m}))) - \frac{1}{n_{t}} \sum_{j}^{n_{t}} log (1 - G_{m} (G_{f} (x_{j}^{t}))) \end{matrix}

(19)

By substituting Formula (18) and Formula (19) into Formula (13), the final improved loss function

L (θ_{f}, θ_{y}, θ_{d}, θ_{m})

can be obtained as follows:

L (θ_{f}, θ_{y}, θ_{d}, θ_{m}) = L_{y} (θ_{f}, θ_{y}) - λ {\tilde{L}}_{d} (θ_{f}, θ_{d}) - λ_{m} {\tilde{L}}_{m} (θ_{f}, θ_{m}) + μ \hat{D} (θ_{f}, θ_{y})

(20)

To some extent, the weight allocation process for simulation and source domain samples can promote positive transfer while mitigating negative transfer.

3.4. Model Architecture and Optimization Methods

Figure 3 provides a detailed description of the model architecture and workflow proposed in this study. The feature extractor

G_{f}

extracts features from the input data. Subsequently, the losses for the global domain classifier

L_{d}

, the subdomain classifier

L_{m}

, and the label predictor

L_{y}

are computed. Concurrently, the optimization objective is to maximize the losses

L_{d}

and

L_{m}

of the domain classifier while minimizing the loss

L_{y}

of the label predictor to facilitate the extraction of domain-invariant features.

The optimization problem is to determine the parameters

{\hat{θ}}_{f}

,

{\hat{θ}}_{y}

,

{\hat{θ}}_{d}

, and

{\hat{θ}}_{m}

that satisfy the given conditions.

({\hat{θ}}_{f}, {\hat{θ}}_{y}) = \underset{θ_{f}, θ_{y}}{argmin} E (θ_{f}, θ_{y}, {\hat{θ}}_{d})

(21)

{\hat{θ}}_{d} = \underset{θ_{d}}{argmax} E ({\hat{θ}}_{f}, {\hat{θ}}_{y}, θ_{d}, {\hat{θ}}_{m})

(22)

{\hat{θ}}_{m} = \underset{θ_{m}}{argmax} E ({\hat{θ}}_{f}, {\hat{θ}}_{y}, {\hat{θ}}_{d}, θ_{m})

(23)

The model employs the adversarial loss from the global domain classifier for adversarial training. On the other hand, misalignment of the bearing health status features across different classes in the feature space may arise due to global adversarial alignment. To address this issue, the model further introduces subdomain classifier adversarial loss to ensure the alignment of distributions for same-category samples from different domains and reduce category misalignment. Additionally, the maximum mean discrepancy is embedded to optimize the loss function. Finally, a mechanism for allocating weights to source and simulation domain samples is introduced to suppress negative transfer and promote positive transfer.

To handle large-scale data and reduce computational costs, thereby accelerating the model’s convergence, the SGD algorithm is used to update the model parameters

θ_{f}

,

θ_{y}

,

θ_{d}

, and

θ_{m}

.

\begin{matrix} θ_{f} \leftarrow θ_{f} - ε (\frac{\partial L_{y}}{\partial θ_{f}} - λ \frac{\partial L_{d}}{\partial θ_{f}} - λ_{m} \frac{\partial L_{m}}{\partial θ_{f}} + μ \frac{\partial D}{\partial θ_{f}}) \\ θ_{y} \leftarrow θ_{y} - ε (\frac{\partial L_{y}}{\partial θ_{y}} + μ \frac{\partial D}{\partial θ_{y}}) \\ θ_{d} \leftarrow θ_{d} - ε \frac{\partial L_{d}}{\partial θ_{d}} \\ θ_{m} \leftarrow θ_{m} - ε \frac{\partial L_{m}}{\partial θ} \end{matrix}

(24)

The parameters of the feature extractor, label predictor, global domain classifier, and subdomain classifier are denoted as

θ_{f}

,

θ_{y}

,

θ_{d}

, and

θ_{m}

, respectively, where

ε

denotes the learning rate. These parameters are further updated by categorization loss

L_{y}

, global domain adversarial loss

L_{d}

, subdomain adversarial loss

L_{m}

, and MMD distance

\hat{D} (θ_{f}, θ_{y})

.

4. Experiments

4.1. Simulation Dataset Description

As an example, three distinct types of bearing faults are considered from the bearing dataset at Paderborn University. Based on Formula (6) in Section 2.2, ODE45 was employed for numerical simulation to generate fault data representing the healthy state, inner race fault, and outer race fault. A simulation domain dataset was constructed by collecting simulated vibration acceleration signals.

Figure 4 presents an example of the simulation signal for the 6203 bearing. As an example, the simulation data of the 6203 bearing with an outer race fault highlight its characteristic frequency as follows:

f_{B P F O} = N_{b} n / (2 \times 60) \times (1 - D_{b} / D_{m}) = 76.36 Hz

(25)

In the frequency domain representation of the simulation data, the outer race fault exhibits a peak at 76.36 Hz, which aligns closely with the theoretical calculation results. It shows that the simulation data can well summarize the kinematic characteristics of rolling bearings.

4.2. Introduction to the Dataset

Relevant experiments were performed in this study on two publicly available datasets. One of these cases (Case 1) utilizes a publicly available, experimentally validated bearing fault diagnosis dataset verified and supplied by Huazhong University of Science and Technology (HUST) in Wuhan, China [30]. Figure 5 shows the experimental setup employed for the dataset. In this case, the ER-16K bearing was chosen for experimental analysis. The fault conditions in this experiment were artificially preset, with the source domain rotation speed being 20 Hz and the target domain rotation speed being 30 Hz. The health status of a bearing includes seven conditions: medium inner race fault, medium ball fault, medium outer race fault, severe inner race fault, severe ball fault, severe outer race fault, and normal. A detailed description of this case, regarding its source domain, target domain, and simulation domain, is provided in Table 2.

Case 2 utilizes a failure dataset from a bearing test bench provided by Paderborn University (PU) in Paderborn, Germany [31]. Figure 6 shows the experimental setup used in the Paderborn bearing test bench. The bearing utilized is a rolling bearing of type 6203 with a sampling frequency of 64 kHz, and the failure data are obtained by an accelerated life test to reflect the real damage situation without distinguishing the size of the failure, which covers three different states: normal, inner race fault, and outer race fault. A detailed description of this case, regarding its source domain, target domain, and simulation domain, is provided in Table 3.

4.3. Introduction to the Experimental Setup and Comparison Methods

This research compared the proposed method to five different methods to verify its superiority. To train the model, the SGD optimization algorithm was employed. The experiment was conducted ten times under each condition. In the two scenarios, the proposed method’s primary parameters were as follows: the learning rate was 0.001, the batch size was 32, and the number of iterations N was 120.

(1) In order to verify the necessity of using transfer learning for fault diagnosis, the proposed method was compared with the convolutional neural network (CNN).

(2) To evaluate the performance of the proposed model, a series of comparisons were made with traditional transfer learning fault diagnosis methods. These include JAN, CDAN, MADA, and FMIA [32] which are denoted as Method 3 to Method 6 in turn.

4.4. Analysis of the Experimental Results

In this study, we employed average accuracy (Table 4), iteration accuracy (Figure 7 and Figure 8), confusion matrices (Figure 9 and Figure 10), and F1 scores (Figure 11) to evaluate the diagnostic validity of each approach on the target domain test set. To further evaluate this method, using t-SNE distribution (Figure 12 and Figure 13), the adaptation performance of different methods in the feature space is intuitively demonstrated.

From the above conclusions, we can draw several meaningful conclusions:

(1) A comparative analysis of Method 1 with Methods 2–6 reveals that Method 1 (the proposed method) achieves higher average accuracy and a lower standard error of the mean (SEM). Method 1 demonstrates superior generalization ability in the unsupervised setting, indicating that Method 1 has an advantage in mitigating negative transfer, while the lower SEM value also shows the superiority of Method 1 in terms of stability. The average accuracy of Method 1 in unsupervised scenarios was 96.436% and 89.457%, respectively, while the other methods had average accuracies of up to 92.635% and 79.855% only. This result further verifies the effectiveness and stability of Method 1 in negative transfer suppression and cross-domain transfer.

(2) Method 1 demonstrates superior diagnostic accuracy and greater stability after convergence when comparing iterative accuracies of the six methods in two cases, as shown in Table 4 and Figure 7, Figure 8, Figure 9, Figure 10 and Figure 11.

(3) Comparing the confusion matrices of the fourth experiment of the six methods in Cases 1 and 2, as shown in Figure 10, Method 1 improves the diagnostic accuracies of the healthy, inner, and outer circles by 22%, 2%, and 4%, respectively, compared to MADA (Method 5), reflecting the validity of the proposed method.

This study employed the t-distribution algorithm for visualization purposes. In the visualization of t-SNE, for Case 1, the defective bearings are categorized into two groups based on the severity of the fault: 1, and 2. Specifically, they are labeled as Normal, Inner-1, Inner-2, Ball-1, Ball-2, Outer-1, and Outer-2, respectively, in the t-SNE visualization indicating seven cases of different fault types and fault levels. In Case 2, the source domain is denoted by the label suffix “src”, while the target domain is represented by the prefix “tra”. “Inner” represents the bearing inner race fault, “Norm” indicates the normal state, and “Outer” corresponds to the outer race fault. The results of the six methods are shown in Figure 12 and Figure 13.

According to the results of t-SNE compared with other methods, the feature distribution of the proposed method exhibits a clearer clustering structure, and the boundaries between different fault types are more obvious, with less overlapping between features. This shows that the proposed method can better distinguish various classes of fault features while maintaining intra-class feature tightness, which enhances the effectiveness of feature adaptation. This is because the proposed method involves a global classifier and subdomain classifier, which can realize fine-grained alignment of the marginal and conditional distributions. Additionally, the adaptive weight allocation mechanism can effectively suppress negative transfer.

4.5. Ablation Experiment

To further analyze the contribution of the MMD module, the subdomain classifier module comprising simulation data and the weighting allocation mechanism module of the proposed method to the transfer learning fault diagnosis, ablation experiments were conducted to verify its effectiveness.

Network A: Removal of the weighting mechanism module, subdomain classifier module, and MMD module; Network B: Removal of the weighting mechanism module and subdomain classifier module; Network C: Removal of the weighting allocation mechanism module; Network D: Proposed method, as shown in Table 5.

Initially, Network A was used as a base model for transfer learning bearing fault diagnosis. Subsequently, additional modules were progressively introduced to assess their performance improvements. Network B incorporated the MMD module into Network A. The improved loss function designed to embed the MMD helps to reduce the difference in feature distribution between the source and target domain data. Network C constitutes a subdomain classifier module based on Network B by adding simulation domain data, which serve as effective supervisory data that guarantee the lower limit of the information transfer effect during the confrontation process. To promote positive transfer and suppress negative transfer, Network D incorporated a weight allocation mechanism module into Network C.

Based on the above experiments, conclusions can be drawn:

(1) As shown in Table 6, the average diagnostic accuracy of the proposed method (Network D) shows an improvement of 11% and 16% in two cases when compared to the original DANN (Network A), reflecting the effectiveness of the proposed method. The DANN method with the weighting allocation mechanism module, MMD module, and subdomain classifier module in Network D outperforms the Network A method in both cases. The results show that the proposed method has a good effect in bearing transfer learning.

(2) As shown in the comparison of the ablation experiments of the three network structures of Network A, Network B, and Network C in Cases 1 and 2, as shown in Figure 14, Figure 15 and Figure 16, the combination of subdomain classifier and global domain classifier enhances the cross-domain adaptability of the model through fine-grained alignment. Meanwhile, the designed improved loss function embedded in the MMD helps to reduce the feature distribution discrepancy between domains, thereby improving the overall diagnostic accuracy for each category. As shown in Figure 17, the accuracy of Network B in the normal state is 3% higher than that of Network A, while the accuracy of Network C in the normal state is further improved by 15% compared to Network B.

(3) By comparing the performance of four indexes, namely, accuracy, F1 score, recall rate, and precision rate, of the two network structures, Network C and Network D in Cases 1 and 2, as shown in Figure 18 and Figure 19, it can be seen that the designed sample weight allocation mechanism significantly enhances the stability of the model while improving the recognition performance of the diagnostic model. This mechanism can adaptively assign weights to the source domain samples and simulation domain samples, thus effectively suppressing negative transfer and promoting positive transfer.

The results show that the MMD module, simulation domain, and weight allocation mechanism in the proposed method not only effectively suppress negative transfer but also improve the effectiveness and stability of unsupervised cross-domain transfer.

4.6. Experimental Results of Noise Immunity

Vibration and friction between bearing components can generate considerable noise in real-world working environments. These noises interfere with the collection of vibration signals by sensors, thus masking fault information within the signal. To evaluate the robustness and effectiveness of the proposed model in a noisy environment that more closely reflects real-world conditions, Gaussian noise of varying intensities is introduced into the test signal to assess the model’s anti-noise capabilities. The signal-to-noise ratio (SNR) is a key indicator used to measure the relationship between signal strength and noise strength and is frequently employed to assess signal quality under noise interference conditions. It is the ratio of signal power to noise power, as shown below:

S N R = 10 {log}_{10} (\frac{p_{signal}}{P_{noise}})

(26)

where

p_{s i g n a l}

is the effective power of the signal and

p_{n o i s e}

is the effective power of the noise. In this study, four types of Gaussian noise intensities were included, with signal-to-noise ratios of 6 db, 4 db, 0 db, and −2 db, respectively. These noise intensities increase from mild to severe noise levels to simulate various working conditions, ranging from slight interference to severe signal pollution in bearing fault diagnosis.

Through the comparison of the above results, the following conclusions can be drawn:

(1) As illustrated in Figure 20 and Figure 21, with the introduction of noise, the accuracy of the six network models decreases to some extent on the two datasets, and the difficulty in characterizing model diagnosis gradually increases. Simultaneously, as the noise intensity increases further, diagnostic accuracy continues to decline. This is due to noise interfering with the model’s feature extraction, making it difficult to distinguish between fault and normal state features. Furthermore, noise weakens the model’s ability to consistently identify feature patterns, thereby impairing the generalization performance of the diagnostic model. Among the six model algorithms, our method demonstrates superior noise resistance. After introducing 4 dB noise, the accuracy of our method in the two cases is 92.62% and 83.56%, respectively, which exceeds the diagnostic accuracy of the other five methods. This further confirms that our method exhibits robustness and performance stability under noise interference conditions and can maintain a certain level of diagnostic accuracy under complex working conditions.

(2) A comparison of the model accuracy of the six methods in two cases, as shown in Figure 20 and Figure 21, indicates that our method exhibits better noise resistance performance than the other five methods when high noise is added, compared to low noise. Following the introduction of 0 dB noise, the accuracy of our method decreased by 8.08% and 9.78% in the two cases, respectively. Under high noise conditions, the accuracy decrease was less than that observed for the other five methods.

(3) Figure 22 and Figure 23 compare the confusion matrices of all methods under two different cases in the noise-resistant experiment. In Figure 22, compared with MADA, the diagnostic accuracy of our method for medium ball bearing fault, medium outer race fault, severe inner race fault, severe ball bearing fault, and normal is improved by 11%, 15%, 5%, 6%, and 6%, respectively, which reflects the effectiveness and stability of the proposed method.

The results demonstrate that the MMD module, simulation domain, and weight distribution mechanism in our method not only effectively suppress negative transfer but also enhance the model’s robustness against noise to some extent.

5. Conclusions

This study presents an information-assisted multi-adversarial domain transfer learning method for fault diagnosis in rolling bearing dynamics simulation. The method aims to diagnose rolling bearing faults under various operating conditions. The method suppresses negative transfers and promotes positive transfers to improve model performance. Initially, a subdomain classifier and a global domain classifier are constructed. In the subdomain classifier, a dynamic equation is constructed to generate simulated vibration data containing extensive bearing fault label information. These data substitute the label prediction for the target domain and facilitate the alignment of both the marginal and conditional distributions. Simultaneously, the enhanced MMD loss function aims to reduce the differences between the feature distributions of the source and target data. Finally, adding a sample weight allocation mechanism can effectively suppress negative transfer.

In the context of unsupervised fault diagnosis, the proposed model effectively learns more generalized data features. By introducing simulated signals generated by the bearing dynamics equation, the algorithm’s lower bound is enhanced, effectively suppressing negative transfer and significantly improving the model’s stability. The results indicate that the proposed method achieves higher accuracy and lower SEM (standard error) on the two datasets with values of 89.457 ± 1.385 and 96.436 ± 1.264, respectively. After multiple rounds of training, the accuracy is improved to a certain extent compared with other methods. More importantly, this method shows better stability in indicators such as accuracy, F1 score, and recall rate, and the SEM is lower. At the same time, the model has better noise resistance and shows significant advantages in suppressing negative transfer.

This work investigated an innovative cross-condition transfer learning method by generating bearing simulation fault data. In the future, the authors will further explore methods for generating simulation fault data for other critical mechanical components, such as bearings, and related transfer learning fault diagnosis algorithms.

Author Contributions

Writing—original draft, Z.L. and Z.Z. (Zhidan Zhong); Writing—review and editing, Z.Z. (Zhidan Zhong); Visualization, Z.Z. (Zhihui Zhang); Project administration, W.Z.; Data curation, W.M.; Funding acquisition, Z.Z. (Zhidan Zhong). All authors have read and agreed to the published version of the manuscript.

Funding

This project received funding from the Henan Provincial Science and Technology Research and Development Joint Fund (Grant No. 225101610001) and the Henan Provincial Key Research and Development Project (Grant No. 231111222900).

Data Availability Statement

Experimental data can be downloaded from https://mb.uni-paderborn.de/kat/forschung/kat-datacenter/bearing-datacenter/data-sets-and-download (accessed on 15 January 2025) and https://github.com/CHAOZHAO-1/HUSTbearing-dataset (accessed on 15 January 2025).

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as potential conflicts of interest.

References

Liu, R.; Yang, B.; Zio, E.; Chen, X. Artificial intelligence for fault diagnosis of rotating machinery: A review. Mech. Syst. Signal Process. 2018, 108, 33–47. [Google Scholar] [CrossRef]
Hoang, D.T.; Kang, H.J. A survey on deep learning based bearing fault diagnosis. Neurocomputing 2019, 335, 327–335. [Google Scholar] [CrossRef]
Heda, Z. Fault diagnosis and life prediction of mechanical equipment based on artificial intelligence. J. Intell. Fuzzy Syst. 2019, 37, 3535–3544. [Google Scholar] [CrossRef]
Kchaou, M. A data-driven approach for studying tribology based on experimentation and artificial intelligence coupling tools. Sustain. Eng. Innov. 2024, 6, 25–36. [Google Scholar] [CrossRef]
Chen, S.; Cheng, G.; Guo, F.; Jia, X.; Wen, X. Integrating Friction Noise for In Situ Monitoring of Polymer Wear Performance: A Machine Learning Approach in Tribology. J. Tribol. 2025, 147. [Google Scholar] [CrossRef]
Li, Z.; Li, J.; An, B.; Li, R. The design method for surface texture of sliding friction pairs based on machine learning under mixed lubrication. Tribol. Int. 2024, 109563. [Google Scholar] [CrossRef]
Kim, M.; Ko, J.U.; Lee, J.; Youn, B.D.; Jung, J.H.; Sun, K.H. A Domain Adaptation with Semantic Clustering (DASC) method for fault diagnosis of rotating machinery. ISA Trans. 2022, 120, 372–382. [Google Scholar] [CrossRef]
Marian, M.; Tremmel, S. Physics-informed machine learning—An emerging trend in tribology. Lubricants 2023, 11, 463. [Google Scholar] [CrossRef]
Zhang, Z.; Chen, H.; Li, S.; An, Z. Unsupervised domain adaptation via enhanced transfer joint matching for bearing fault diagnosis. Measurement 2020, 165, 108071. [Google Scholar] [CrossRef]
Wang, H.; Bai, X.; Tan, J.; Yang, J. Deep prototypical networks based domain adaptation for fault diagnosis. J. Intell. Manuf. 2022, 33, 973–983. [Google Scholar] [CrossRef]
Zhao, H.; Jiming, E.; Chen, S.; Cheng, G.; Guo, F. Prediction of friction coefficient of polymer surface using variational mode decomposition and machine learning algorithm based on noise features. Tribol. Int. 2024, 191, 109184. [Google Scholar] [CrossRef]
Zhu, J.; Chen, N.; Shen, C. A new deep transfer learning method for bearing fault diagnosis under different working conditions. IEEE Sens. J. 2019, 20, 8394–8402. [Google Scholar] [CrossRef]
Pei, Z.; Cao, Z.; Long, M.; Wang, J. Multi-adversarial domain adaptation. In Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; Volume 32. [Google Scholar]
Long, M.; Zhu, H.; Wang, J.; Jordan, M.I. Deep transfer learning with joint adaptation networks. In Proceedings of the International Conference on Machine Learning, PMLR, Sydney, Australia, 6–11 August 2017; pp. 2208–2217. [Google Scholar]
Kang, G.; Jiang, L.; Yang, Y.; Hauptmann, A.G. Contrastive adaptation network for unsupervised domain adaptation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 4893–4902. [Google Scholar]
Liang, J.; Hu, D.; Feng, J. Domain adaptation with auxiliary target domain-oriented classifier. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 16632–16642. [Google Scholar]
Hou, W.; Zhang, C.; Jiang, Y.; Cai, K.; Wang, Y.; Li, N. A new bearing fault diagnosis method via simulation data driving transfer learning without target fault data. Measurement 2023, 215, 112879. [Google Scholar] [CrossRef]
Qin, Y.; Li, C.; Wu, X.; Wang, Y.; Chen, H. Multiple-degree-of-freedom dynamic model of rolling bearing with a localized surface defect. Mech. Mach. Theory 2020, 154, 104047. [Google Scholar] [CrossRef]
Li, Z.; Wang, Q.; Wang, R.; Qin, B.; Shao, W. Nonlinear dynamic behaviors of angular contact ball bearing with waviness based on a synthetical multi-degree-of-freedom mathematical modelling. J. Low Freq. Noise Vib. Act. Control 2024, 43, 41–74. [Google Scholar] [CrossRef]
Xu, K.; Kong, X.; Wang, Q.; Han, B.; Sun, L. Intelligent fault diagnosis of bearings under small samples: A mechanism-data fusion approach. Eng. Appl. Artif. Intell. 2023, 126, 107063. [Google Scholar] [CrossRef]
Wang, H.; Zheng, J.; Xiang, J. Online bearing fault diagnosis using numerical simulation models and machine learning classifications. Reliab. Eng. Syst. Saf. 2023, 234, 109142. [Google Scholar] [CrossRef]
Deng, W.; Liao, Q.; Zhao, L.; Guo, D.; Kuang, G.; Hu, D.; Liu, L. Joint clustering and discriminative feature alignment for unsupervised domain adaptation. IEEE Trans. Image Process. 2021, 30, 7842–7855. [Google Scholar] [CrossRef]
Zhang, R.; Tao, H.; Wu, L.; Guan, Y. Transfer learning with neural networks for bearing fault diagnosis in changing working conditions. IEEE Access 2017, 5, 14347–14357. [Google Scholar] [CrossRef]
Ganin, Y.; Ustinova, E.; Ajakan, H.; Germain, P.; Larochelle, H.; Laviolette, F.; March, M.; Lempitsky, V. Domain-adversarial training of neural networks. J. Mach. Learn. Res. 2016, 17, 1–35. [Google Scholar]
Liu, Q.; Guo, Y. Dynamic model of faulty rolling element bearing on double impact phenomenon. In Proceedings of the 2015 IEEE International Conference on Information and Automation, Lijiang, China, 8–10 August 2015; pp. 2017–2021. [Google Scholar]
Luo, M.; Guo, Y.; Andre, H.; Wu, X.; Na, J. Dynamic modeling and quantitative diagnosis for dual-impulse behavior of rolling element bearing with a spall on inner race. Mech. Syst. Signal Process. 2021, 158, 107711. [Google Scholar] [CrossRef]
Luo, M.; Guo, Y.; Wu, X.; Na, J. An analytical model for estimating spalled zone size of rolling element bearing based on dual-impulse time separation. J. Sound Vib. 2019, 453, 87–102. [Google Scholar] [CrossRef]
Borgwardt, K.M.; Gretton, A.; Rasch, M.J.; Kriegel, H.P.; Schölkopf, B.; Smola, A.J. Integrating structured biological data by kernel maximum mean discrepancy. Bioinformatics 2006, 22, e49–e57. [Google Scholar] [CrossRef]
Xiao, Y.; Shao, H.; Han, S.; Huo, Z.; Wan, J. Novel joint transfer network for unsupervised bearing fault diagnosis from simulation domain to experimental domain. IEEE/ASME Trans. Mechatron. 2022, 27, 5254–5263. [Google Scholar] [CrossRef]
Zhao, C.; Zio, E.; Shen, W. Domain generalization for cross-domain fault diagnosis: An application-oriented perspective and a benchmark study. Reliabil. Eng. Syst. Saf. 2024, 245, 109964. [Google Scholar] [CrossRef]
Lessmeier, C.; Kimotho, J.K.; Zimmer, D.; Sextro, W. Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: A benchmark data set for data-driven classification. In Proceedings of the PHM Society European Conference, Bilbao, Spain, 5–8 July 2016; Volume 3. [Google Scholar]
Zhong, Z.; Zhang, Z.; Cui, Y.; Xie, X.; Hao, W. Failure Mechanism Information-Assisted Multi-Domain Adversarial Transfer Fault Diagnosis Model for Rolling Bearings under Variable Operating Conditions. Electronics 2024, 13, 2133. [Google Scholar] [CrossRef]

Figure 1. DANN basic architecture.

Figure 2. Schematic diagram of rolling element passing through the peeling area.

Figure 3. Architecture of the proposed method.

Figure 4. 6203 bearing simulation signal.

Figure 5. HUST experimental platform.

Figure 6. Paderborn University dataset test bench.

Figure 7. Iterative accuracy of all methods in Case 1.

Figure 8. Iterative accuracy of all methods in Case 2.

Figure 9. Confusion matrix of the fourth experiment for each method in Case 1. (a) Proposed; (b) CNN; (c) JAN; (d) CDAN; (e) MADA; (f) FMIA.

Figure 10. Confusion matrix of the fourth experiment for each method in Case 2. (a) Proposed; (b) CNN; (c) JAN; (d) CDAN; (e) MADA; (f) FMIA.

Figure 11. Average F1 scores of all methods for two cases: (a) Case 1; (b) Case 2.

Figure 12. The results of each method’s visual feature adaptation using t-SNE in Case 1. (a) Proposed; (b) CNN; (c) JAN; (d) CDAN; (e) MADA; (f) FMIA.

Figure 13. The results of each method’s visual feature adaptation using t-SNE in Case 2. (a) Proposed; (b) CNN; (c) JAN; (d) CDAN; (e) MADA; (f) FMIA.

Figure 14. Accuracy of the four networks in Case 1 over 10 repetitions.

Figure 15. Accuracy of the four networks in Case 2 over 10 repetitions.

Figure 16. Confusion matrix of the fourth experiment for each network in Case 1 in the ablation experiment. (a) Network 1. (b) Network 2. (c) Network 3. (d) Network 4.

Figure 17. Confusion matrix of the fourth experiment for each network in Case 2 in the ablation experiment. (a) Network 1. (b) Network 2. (c) Network 3. (d) Network 4.

Figure 18. Indicator performance of different networks in Case 1.

Figure 19. Indicator performance of different networks in Case 2.

Figure 20. The average cross-domain diagnostic accuracy of six networks across varying signal-to-noise ratio conditions over ten repeated experiments in Case 1.

Figure 21. The average cross-domain diagnostic accuracy of six networks across varying signal-to-noise ratio conditions over ten repeated experiments in Case 2.

Figure 22. Confusion matrix for each method in the fourth experiment under 4 dB SNR conditions in Case 1. (a) Proposed; (b) CNN; (c) JAN; (d) CDAN; (e) MADA; (f) FMIA.

Figure 23. Confusion matrix for each method in the fourth experiment under 4 dB SNR conditions in Case 2. (a) Proposed; (b) CNN; (c) JAN; (d) CDAN; (e) MADA; (f) FMIA.

Table 1. 6203 and ER-16K bearing parts’ geometry parameters.

Bearing Type	Parameter	Size (mm)
ER-16K	Outside diameter	52
	Bore diameter	25.4
	Width	34.13
	Ball diameter	7.94
	Number of bearing balls	9
6203	Outside diameter	40
	Bore diameter	17
	Width	12
	Ball diameter	6.75
	Number of bearing balls	8

Table 2. Case 1: Detailed description of the source domain, target domain, and simulation domain.

Dataset	Bearing Type	Bearing Condition	Speed (Hz)
Source domain	ER-16K	Normal	20
		Inner race fault (medium, severe)
		Rolling body fault (medium, severe)
		Outer race fault (medium, severe)
Target domain	ER-16K	Normal	30
		Inner race fault (medium, severe)
		Rolling body fault (medium, severe)
		Outer race fault (medium, severe)
Simulation domain	ER-16K	Normal	30
		Inner race fault (medium, severe)
		Rolling body fault (medium, severe)
		Outer race fault (medium, severe)

Table 3. Case 2: Detailed description of the source domain, target domain, and simulation domain.

Dataset	Bearing Type	Bearing Condition	Load (hp)
Source domain	6203	Normal	$w_{0}$
		Inner race fault
		Outer race fault
Target domain	6203	Normal	$w_{3}$
		Inner race fault
		Outer race fault
Simulation domain	6203	Normal	$w_{3}$
		Inner race fault
		Outer race fault

Table 4. A comparison of the diagnostic results from six methods in two cases.

NO.	Method	Case 1 (%)	Case 2 (%)
1	Proposed Method	96.436 ± 1.264	89.457 ± 1.385
2	CNN	73.395 ± 8.628	68.316 ± 9.875
3	JAN	87.326 ± 5.533	75.354 ± 7.587
4	CDAN	86.354 ± 5.268	76.948 ± 6.569
5	MADA	92.635 ± 3.262	79.855 ± 4.316
6	FMIA	94.591 ± 1.332	87.131 ± 1.416

Table 5. Ablation study setup.

Network	Global Domain	MMD	Subdomain	Weighting
A	√	×	×	×
B	√	√	×	×
C	√	√	√	×
D	√	√	√	√

Table 6. Comparison of four unsupervised diagnosis networks in two cases in ablation experiments.

Network	Case 1 (%)	Case 2 (%)
A	85.483 ± 5.368	72.891 ± 6.354
B	88.469 ± 3.165	76.469 ± 4.346
C	91.565 ± 1.953	82.456 ± 1.965
D	96.436 ± 1.264	89.457 ± 1.385

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, Z.; Zhong, Z.; Zhang, Z.; Mao, W.; Zhang, W. Rolling Bearing Dynamics Simulation Information-Assisted Fault Diagnosis with Multi-Adversarial Domain Transfer Learning. Lubricants 2025, 13, 116. https://doi.org/10.3390/lubricants13030116

AMA Style

Li Z, Zhong Z, Zhang Z, Mao W, Zhang W. Rolling Bearing Dynamics Simulation Information-Assisted Fault Diagnosis with Multi-Adversarial Domain Transfer Learning. Lubricants. 2025; 13(3):116. https://doi.org/10.3390/lubricants13030116

Chicago/Turabian Style

Li, Zhe, Zhidan Zhong, Zhihui Zhang, Wentao Mao, and Weiqi Zhang. 2025. "Rolling Bearing Dynamics Simulation Information-Assisted Fault Diagnosis with Multi-Adversarial Domain Transfer Learning" Lubricants 13, no. 3: 116. https://doi.org/10.3390/lubricants13030116

APA Style

Li, Z., Zhong, Z., Zhang, Z., Mao, W., & Zhang, W. (2025). Rolling Bearing Dynamics Simulation Information-Assisted Fault Diagnosis with Multi-Adversarial Domain Transfer Learning. Lubricants, 13(3), 116. https://doi.org/10.3390/lubricants13030116

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Rolling Bearing Dynamics Simulation Information-Assisted Fault Diagnosis with Multi-Adversarial Domain Transfer Learning

Abstract

1. Introduction

2. Theoretical Basis

2.1. Unsupervised Domain Adversarial Neural Networks

2.2. Rolling Bearing Failure Mechanism

3. Proposed Method

3.1. Simulation Domain Constructed Based on Simulation Data

3.2. Improved Loss Function Design with Embedded Simulation Domain

3.3. Development of Sample Weight Allocation Mechanisms

3.4. Model Architecture and Optimization Methods

4. Experiments

4.1. Simulation Dataset Description

4.2. Introduction to the Dataset

4.3. Introduction to the Experimental Setup and Comparison Methods

4.4. Analysis of the Experimental Results

4.5. Ablation Experiment

4.6. Experimental Results of Noise Immunity

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI