1. Introduction
In recent years, electric vehicles have rapidly developed in terms of their zero emissions, low noise, and high efficiency [
1]. As a primary tool and important carrier of road transportation, their stable operation and safety are of significant importance. Rolling bearings, as critical components of drive motors, play an essential role in the safety of electric vehicles. The timely monitoring of bearing conditions, detection of potential faults, and providing scientific decisions for maintenance are crucial scientific means to ensure smooth vehicle operation [
2]. Bearing fault diagnosis methods have been widely researched over the past few decades [
3,
4]. Typically, signals are acquired from sensors, and sensitive features related to bearing faults are mined and extracted from these signals. Subsequently, feature selection and fusion are performed, followed by an evaluation of the bearing’s health status through a decision-making system [
5].
The fault diagnosis techniques for rolling bearings mainly include methods based on vibration signals, acoustic signals, electromagnetic signals, and ultrasonic signals. Among these, vibration-based methods are more prevalent and can achieve early fault detection. In the field of bearing fault diagnosis, data-driven machine learning methods have gradually replaced the research approach based on mathematical and physical models [
6]. In traditional machine learning-based diagnostic methods, fault features need to undergo two steps including manual feature extraction and feature selection. After collecting features such as the frequency domain, time domain, and time–frequency domain from the data, they are inputted into traditional machine learning algorithms for fault prediction and classification. For instance, Song D D et al. analyzed and extracted typical features from bearing raw signals in the time domain, followed by dimensionality reduction using PCA (Principal Component Analysis), and achieved high-precision diagnosis using the SVM (Support Vector Machine) algorithm [
7]. Zhenhao Tang et al. utilized complete ensemble empirical mode decomposition to extract time-domain features of bearing fault signals, followed by two fast Fourier transforms to extract deep frequency domain information for training the diagnostic model [
8].
With the advancement of science and technology, modern bearing fault data exhibit characteristics of large data volume and high data dimensionality. Traditional machine learning methods, because of their shallow network structures, suffer from poor nonlinear fitting capabilities, making it challenging to meet the diagnostic requirements of complex mechanical systems [
9]. Consequently, bearing fault diagnosis methods based on deep learning have attracted researchers’ attention. In comparison with traditional machine learning diagnostic methods, deep learning diagnostic models have multi-layer network structures and powerful capabilities in adaptive feature learning and representation [
10]. Therefore, they can typically achieve precise diagnosis of more complex bearing faults.
Convolutional neural networks (CNNs), as representative methods in the field of deep learning, have garnered significant attention from researchers. CNN-based diagnostic methods are generally categorized into two types as follows: 2D CNNs, which take images as input, and 1D CNNs, which take data as input.
In the realm of 2D CNNs, Ke Zhang et al. proposed a Multi-Modal Convolutional Neural Network (MMCNN), which effectively extracted rich and complementary fault features using multiple parallel convolutional layers. They then converted these features into two-dimensional grayscale images through Continuous Wavelet Transform (CWT), serving as input to MMCNN, thus enhancing the diagnostic accuracy of bearing faults under varying conditions [
11]. Y Y Long et al. improved a Stacked Autoencoder (SAE) with a softmax classifier of a Backpropagation Neural Network (BPNN) to enhance one-dimensional vibration signals, which were subsequently transformed into two-dimensional images as training data for CNNs to achieve bearing fault diagnosis [
12]. Tong Yu et al. encoded one-dimensional signals using the Gram Angle Difference Domain, serving as input data for 2D CNNs, and ultimately achieved a precise diagnosis of bearing faults [
13]. While these methods convert fault data into two-dimensional feature images for training CNN models, thus leveraging the computational power of convolutional neural networks, the conversion from one-dimensional to two-dimensional data may lead to feature loss, resulting in lower robustness.
In the realm of 1D CNN diagnostic model research, Zhen Y Z et al. utilized convolutional neural networks for adaptive feature extraction and feature dimensionality reduction of bearing vibration signals. They replaced fully connected layers with global average pooling, achieving higher diagnostic accuracy [
14]. Min Xia et al. inputted time and spatial information of raw signals from multiple sensors into CNNs, enabling the model to extract representative features from the data automatically, and empirically demonstrated the effectiveness of this method [
15]. Duan Hao et al. introduced dilated convolutions to increase the receptive field of convolutional layers and incorporated attention mechanisms into the model to enhance its ability to extract key feature information. They proposed an end-to-end bearing fault diagnosis method with strong adaptability [
16].
All these methods utilize raw data from bearing vibration signals as input for diagnostic models. However, because of factors such as measurement errors, the original data often contain noise, making it difficult for diagnostic models to extract effective fault features and resulting in lower diagnostic accuracy and poor robustness.
To achieve high-precision diagnosis of bearing faults while maintaining high robustness, this paper constructs a bearing fault diagnosis model based on 1D convolutional neural networks. We design a feature parameterization weighting module to extract fault features, thus enhancing the robustness of the diagnostic model. The original fault data undergo joint feature extraction using two different algorithms, serving as input to the convolutional neural network. The original bearing fault data are extracted by variational mode decomposition and empirical mode decomposition, and the extracted feature data are combined as the input of the diagnosis model. After the feature data are input into the diagnostic model, different operations are applied to the feature tensor by obtaining the mean value of its tensor weights, and according to the positive and negative values of the mean value, so as to realize the prominence of important features. The minor features are removed by linear activation function to reduce their impact on important features, so as to achieve accurate diagnosis and high robustness of the diagnostic model.
2. Basic Principles of Convolutional Neural Networks
CNNs are deep feedforward neural networks that perform repeated convolution and pooling operations on input signals using multiple layers of filters to extract features from faulty data automatically. Within the network, operations between adjacent layers are performed using local connections and weight sharing.
2.1. Convolutional Layer
The convolution operation is completed by the combination of convolutional kernels and strides. When input features pass through convolutional layers, the kernels extract features from a portion of the input features. Subsequently, the kernels move according to a certain stride until the entire input feature is traversed, generating corresponding feature maps. The characteristic of weight sharing in convolutional kernels determines that one kernel corresponds to only one output feature map. The depth of the output feature map is determined by the number of convolutional kernels.
In Equation (1), represents the output of the i-th channel in the k − 1-th layer. represents the c-th channel in the k − 1-th layer; denotes the output of the ‘k-th’ layer; represents the weight matrix of the convolutional kernel in the k-th layer; denotes the bias term; and * denotes the convolution operation.
2.2. Pooling Layer
In convolutional neural network models, the pooling layer primarily compresses feature dimensions to reduce the computational load on target features. During the forward propagation process, it performs downsampling operations only on target regions. By refining the features of input feature vectors, it effectively prevents overfitting. Typically, there are two types of pooling layers as follows: average pooling and max pooling, corresponding to the following formulas, respectively.
In Equations (2) and (3), represents the output value of the -th neuron in the -th channel of layer , denotes the width of the pooling kernel, and represents the input value of the t-th neuron in the i-th channel of layer .
2.3. Fully Connected Layer
The main role of the fully connected layer is to refine the feature tensor extracted through feature extraction. Before being input into the fully connected layer, features are typically flattened into a one-dimensional vector, processed by the fully connected layer, and then input into the classification layer to accomplish the intended classification task. The corresponding formula is as follows.
In Equation (4), represents the output vector of the -th fully connected layer, denotes the number of neurons in the -th fully connected layer, represents the weight of the connection from the -th neuron in the -th layer to the -th neuron in the k-th layer, represents the output value of the -th neuron in the -th layer, and denotes the bias term of the -th neuron in the -th layer.
2.4. Dropout Layer
The primary purpose of the Dropout layer is to prevent overfitting. Depending on the target object, it can be classified into Weight Dropout and Neuron Dropout. Weight Dropout primarily involves deactivating certain weight channels within the neural layer weight matrix, thus rendering them inactive. Neuron Dropout, on the other hand, deactivates certain neurons. During each iteration, certain neurons or weights cease to function with a certain probability and are temporarily removed from the network. This reduces the model’s dependency on local features, thereby enhancing generalization, A schematic diagram of the Dropout operation is shown in
Figure 1.
4. Feature Parameter Weighting Module (FPWM)
Traditional convolutional neural network-based bearing fault diagnosis models usually rely on repetitive feature extraction by convolutional layers without distinguishing among different features. This leads to difficulty in effectively reflecting fault types, resulting in lower diagnostic accuracy of the model. To address this issue, an attention mechanism is introduced into the diagnostic model to highlight important features for effective feature extraction and to improve diagnostic accuracy. However, existing attention mechanisms only focus on important features and neglect minor features, resulting in feature loss and lower model robustness.
This paper proposes a feature processing method based on feature-parameterized weighting. A flowchart of the method is shown in
Figure 2. When the feature tensor enters the module, it undergoes preliminary feature extraction and parameter reduction by a convolutional layer to reduce the parameter volume. Then, the features are compressed to [−1,1] using the Sigmoid function. The compressed feature data are then averaged, and based on the relationship with 0, they are divided into two categories as follows: when the mean is greater than 0, the feature data are added to the mean at that time; when the mean is less than or equal to 0, the feature data are subtracted by the mean multiplied by the parameter
. Finally, the output is passed through the relu activation function to remove features that are still less than 0. The purpose of the above operations is to highlight important features while also including minor features in subsequent feature extraction and clearing redundant information.
The processed feature tensor is multiplied by the feature ratio tensor and added to the input feature tensor. However, the original input feature tensor is multiplied by the parameter as a weighting parameter for both tensors.
7. Conclusions
In the context of rolling bearing fault diagnosis, where different levels of fault features require distinct treatment, a feature-parameterized weighted diagnostic model is proposed. This model achieves high-precision diagnosis of rolling bearings under variable operating conditions while maintaining robustness in noisy environments. The key conclusions are as follows:
(1) Feature extraction from fault data is performed using variational mode decomposition (VMD) and empirical mode decomposition (EMD). The extracted features from both methods are then combined and used as input data for subsequent diagnostic models, enhancing data dimensionality while reducing noise.
(2) Parameter selection experiments conducted on datasets from different operating conditions improve the model’s generalization ability.
(3) A feature-parameterized weighting module is constructed, wherein features are divided into two categories based on the mean of feature tensors. Different operations are applied to these categories, allowing the diagnostic model to focus more on features of higher importance. This enhances both the diagnostic accuracy and robustness of the network model.
Starting from the importance of features, we design a feature processing module FPWM, which can carry out different operations on fault features according to the positive and negative parts. In this module, the processed feature data and the original feature data are weighted with reference to the residual idea, but not simply added in a 1:1 way. By setting an adjustable parameter, the fault features can be processed in the same way. The effective fusion of the two features is realized to improve the diagnostic accuracy and robustness of the model. When the effective features in the fault data are difficult to discover and the data dimension is not enough to satisfy the training of the convolutional neural network, the proposed method can realize the high-precision diagnosis of multi-working condition bearings and has certain robustness.