1. Introduction
As one of the key supporting components in rotating machinery, rolling bearings are widely used in various types of mechanical equipment [
1,
2,
3]. However, to meet the demands of industrial production, rolling bearings must operate for extended periods under complex and variable working conditions, which inevitably leads to faults. Once a rolling bearing fails, it can potentially cause damage to machinery and even pose a threat to human safety [
4]. Therefore, fault diagnosis for rolling bearings is of great significance to ensure the operational reliability of production systems.
With advancements in high-performance computing hardware, deep learning now enables efficient training on large-scale datasets and precise inference for complex models. Leveraging its powerful feature extraction capabilities and proficiency in modeling intricate nonlinear relationships, deep learning has driven a transition in fault diagnosis from traditional handcrafted feature engineering to intelligent diagnostic techniques [
5,
6]. Compared to traditional methods [
7,
8,
9,
10], deep learning can automatically extract complex fault features from massive datasets, significantly improving diagnostic accuracy while reducing reliance on expert knowledge. Yu et al. [
11] represented CNN-extracted feature vectors as graph structures and further extracted deep feature representations using GNN; Fu et al. [
12] constructed a parallel encoder architecture to extract key information from both time-domain and time-frequency graphs, enhancing diagnostic precision; Song et al. [
13] optimized model hyperparameters using a particle swarm optimization algorithm, improving diagnostic performance with limited training samples; He et al. [
14] synthesized multi-sensor data at the data layer via pixel matrix fusion and built a multi-scale structure to capture information across different scales; Zheng et al. [
15] proposed a two-stage fault diagnosis framework to address the issue of data imbalance; and Wu et al. [
16] employed a U-Net for early fault diagnosis for rolling bearings. To ensure that models accurately capture deep feature representations in the data and achieve high diagnostic precision, input data must have low noise levels and similar distribution characteristics. However, in practical engineering applications, rolling bearings operate under complex and variable conditions, resulting in significant differences in the data distribution of gearbox fault samples across different working conditions. Moreover, factors such as vibrations and wear in the operating environment introduce additional noise, exacerbating signal aliasing and blurring, thereby increasing the difficulty and uncertainty of fault diagnosis.
Transfer learning extends models trained on source domain data to target domain data via transfer strategies, demonstrating notable advantages in fault diagnosis tasks under varying working conditions [
17,
18,
19,
20]. To address the impact of noise and varying working conditions on the fault diagnosis of rolling bearings, Chen et al. [
21] employed convolutional neural networks with two different kernel sizes to automatically extract multi-scale signal features from raw data, demonstrating strong performance under noisy conditions. Ghorvei M. et al. [
22] proposed the Deep Subdomain Adaptive Convolutional Neural Network (DSACNN), which exhibited good results in noise resistance and reducing domain distribution discrepancies. Peng et al. [
23] combined multi-scale concepts by integrating information from multiple components and time scales of vibration signals, improving the model’s noise resistance and domain adaptability. Su et al. [
24] developed a hierarchical convolutional neural network that utilizes multiple output layers to predict the hierarchical structure of bearing faults, showing robust performance in noise-affected and variable-working-condition fault diagnosis tasks. Huang et al. [
25] enhanced the representation of multi-scale features through a deep convolutional neural network based on multi-scale features and mutual information, further improving the model’s generalization ability under complex working conditions. Qian et al. [
26] reduced the negative impact of noise on the model by reconstructing and learning features from the input signal via a convolutional autoencoder, while introducing a new domain adaptation loss based on Coral loss and domain classification loss, effectively enhancing diagnostic performance under complex working conditions. Yu et al. [
27] used wavelet packet transform in the data preprocessing stage to reduce noise interference and utilized MK-MMD to minimize the feature distribution discrepancy between the source and target domains, achieving excellent fault diagnosis performance and noise suppression across different working conditions.
While the aforementioned methods provide valuable insights into rolling bearing fault diagnosis, they face limitations under complex working conditions due to the ambiguity of fault signals. Current transfer learning approaches struggle to explicitly model inter-domain feature discrepancies, often leading models to focus on non-discriminative information, thereby reducing diagnostic accuracy and generalization capability. To address these limitations, this paper proposes a domain-conditioned feature correction method for fault diagnosis for rolling bearings in complex operating environments. By incorporating a domain-conditioned adaptation strategy into a multi-scale self-calibrating convolutional neural network, we develop a noise-resistant and adaptive transfer learning model capable of end-to-end fault diagnosis under challenging conditions. The primary contributions of this study are summarized as follows:
- (1)
An end-to-end fault diagnosis method for rolling bearings is proposed, which effectively enhances the model’s adaptability to noise interference and varying working conditions;
- (2)
A multi-scale self-calibrating convolutional neural network is constructed, which significantly expands the receptive field at each spatial location and aggregates features across different scales, thereby improving the network’s nonlinear expressive capability;
- (3)
A domain-conditioned adaptive strategy is proposed, which activates the convolutional channels of the source and target domains differently. This allows the model to recalibrate features within the convolutional layers for each domain, capturing more domain-specific information.
4. Conclusions
To address the issue of low fault diagnosis accuracy caused by noise interference and varying rotational speeds in rolling bearings, this study proposes a domain-conditioned adaptation-based fault diagnosis method for rolling bearings under complex working conditions, integrating noise reduction and adaptation within a unified framework. A multi-scale self-calibrating convolutional neural network was constructed to aggregate input signals at different scales, mitigating the negative impact of noise on the model’s ability to extract discriminative features. A domain-conditioned adaptation strategy was introduced to recalibrate the features of the source and target domains and generate correction terms for the target domain features. By aligning the source and target domain features through minimizing inter-domain feature distribution discrepancies, the method explicitly reduces distribution differences caused by changes in working conditions. Comparative and ablation experiments using rolling bearing fault datasets showed that the proposed method achieves an accuracy exceeding 95%, with improvements of 3.25% to 15.99% over other frameworks. A further analysis revealed a 21.42% accuracy improvement over the initial model, demonstrating strong robustness. As working conditions become more complex, the cost of obtaining labeled samples in the source domain increases, making diagnostic performance more susceptible to label scarcity in cross-domain diagnosis. Future work could explore integrating the proposed method with semi-supervised learning to leverage the potential feature information and structural patterns from large amounts of unlabeled data, further enhancing the model’s diagnostic performance and generalization capability.