Fault Diagnosis Method for Vacuum Contactor Based on Time-Frequency Graph Optimization Technique and ShuffleNetV2

Li, Haiying; Wang, Qinyang; Song, Jiancheng

doi:10.3390/s24196274


by Haiying Li ¹, Qinyang Wang ¹,* and Jiancheng Song ²

¹ School of Mechanical Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
² Shanxi Key Laboratory of Mining Electrical Equipment and Intelligent Control, Taiyuan University of Technology, Taiyuan 030024, China
* Author to whom correspondence should be addressed.

Sensors 2024, 24(19), 6274; https://doi.org/10.3390/s24196274

Submission received: 1 August 2024 / Revised: 20 September 2024 / Accepted: 26 September 2024 / Published: 27 September 2024

(This article belongs to the Special Issue Reliability Verification and Diagnosis Methods for Mechanical Equipment)


Abstract

This paper presents a fault diagnosis method for a vacuum contactor using the generalized Stockwell transform (GST) of vibration signals. The objective is to address the low diagnostic efficiency caused by inadequate feature extraction capability and redundant pixels in the graph background. The proposed method is based on the time-frequency graph optimization technique and the ShuffleNetV2 network. Firstly, vibration signals in different states are collected and converted into GST time-frequency graphs. Secondly, multi-resolution GST time-frequency graphs are generated to cover signal characteristics in all frequency bands by adjusting the GST Gaussian window width factor λ. The OTSU algorithm is then used to crop the energy concentration area, and the size of these time-frequency graphs is reduced by 68.86%. Finally, considering the advantages of the channel split and channel shuffle methods, the ShuffleNetV2 network is adopted to improve the feature learning ability and identify fault categories. In this paper, the CKJ5-400/1140 vacuum contactor is taken as the test object. The fault recognition accuracy reaches 99.74%, and the single iteration time of model training is reduced by 19.42%.

Keywords:

vacuum contactor; vibration signal; time-frequency graph optimization; ShuffleNetV2 network; fault diagnosis

1. Introduction

As transparency requirements for distribution networks grow, the intelligent upgrading of distribution terminals is accelerating [1,2]. The vacuum contactor, as one type of critical transparent management equipment in distribution networks, is used to frequently start and stop high-power load devices [3]. Its operational state is directly related to the reliability of the power grid. Vibration signals during opening and closing are generated by severe collisions among components and contain rich information about the equipment health state. Therefore, these signals can be used to identify various mechanical faults, such as spring fatigue and iron core rusting [4].

Fault diagnosis based on vibration signals is divided into two parts: feature extraction and fault identification. A summary of the latest methods is shown in Table 1. In the feature extraction stage, vibration signal data are commonly processed using time-domain, frequency-domain, and time-frequency domain methods [5]. Time-domain analysis utilizes multiple parameter indexes to characterize the dynamic information. Typical time-domain features include skewness, impulse factor, shape factor, and various characteristic entropies [6,7,8]. Because time-domain features are not apparent during the early stages of equipment failure, fault identification is challenging [9]. For this reason, frequency-domain analysis is adopted to convert time-series signals into an intuitive frequency spectrum. By extracting critical statistical features such as the envelope spectrum [10], power spectrum [11], and cepstrum [12], the fault characteristics of vibration signals are enhanced or separated. However, frequency-domain analysis based on the Fourier transform fails to capture the characteristics of transient vibration signals [13].

Unlike time-domain or frequency-domain analyses that extract partial features of the signal, time-frequency domain analysis converts one-dimensional vibration signals into two-dimensional time-frequency graphs, fully reflecting the distribution of vibration signal characteristics [14]. Common time-frequency domain analysis methods include short-time Fourier transform (STFT), continuous wavelet transform (CWT), and Stockwell transform (ST). Ref. [15] employs STFT to convert vibration signals into time-frequency matrices. After normalization, the STFT time-frequency graphs are generated with comprehensive fault features. Refs. [16,17] convert vibration signals of the circuit breaker into CWT time-frequency graphs according to the selected scale and wavelet basis function, extracting the features in different states. Both of these time-frequency analysis methods can extract rich fault feature components. However, STFT uses a fixed window function, resulting in uneven time-frequency resolution and an inability to capture instantaneous frequency variations [18]. CWT, on the other hand, relies on fixed wavelet bases and suffers from issues such as frequency band energy leakage [19], leading to poor performance in high-frequency abrupt change scenarios. ST overcomes the above shortcomings by maintaining the phase information of the vibration signals and using different Gaussian window functions for each frequency band, providing good time-frequency resolution [20,21]. To obtain datasets at more time-frequency resolutions, the GST adjusts the size of the Gaussian window function by introducing a window width adjustment factor. Transient changes are easily captured, and information across different frequency bands is represented [22]. Therefore, GST is more suitable for feature extraction from various transient signals.

Table 1. A summary of the latest methods in fault diagnosis.

Field	Category	Reference	Method	Limitation
Feature extraction	Time-domain	[6]	Skewness, impulse factor	Not apparent during the early stages of equipment failure
		[7]	Shape factor
		[8]	Characteristic entropy
	Frequency-domain	[10]	Envelope spectrum	Fail to capture the features of transient signals through Fourier transform
		[11]	Power spectrum
		[12]	Cepstrum
	Time-frequency domain	[15]	STFT	Feature extraction with fixed window functions or wavelet bases
		[16,17]	CWT
		[20,21]	ST
Fault identification	Machine learning	[23,24]	BPNN	Poor-fitting performance for complex data
		[25]	SVM
		[26,27]	RF
	Deep learning	[28]	AlexNet	Constrained feature learning capability due to the limited convolutional and pooling layers
		[29]	ResNet50	The number of parameters grows as the layers deepen
		[30]	ResNeXt50	More computational resources are occupied due to the excessive group convolution

For fault identification, the traditional diagnostic algorithms include back propagation neural network (BPNN) [23,24], support vector machine (SVM) [25], and random forest (RF) [26,27]. Compared with these machine learning algorithms, deep learning networks such as AlexNet [28] can automatically extract features from target data without expert knowledge, showing significant advantages in image processing tasks. To extract features more fully, the more advanced ResNet50 [29] and ResNeXt50 [30] networks incorporate residual blocks so that deep networks remain trainable. For higher accuracy, ResNeXt50 introduces group convolution, dividing each residual block into multiple branches. However, the massive number of parameters in deep networks and excessive group convolutions occupy more computational resources during the training of ResNet50 and ResNeXt50 models [31,32]. ShuffleNetV2, combined with channel split and channel shuffle operations, is designed with full consideration of memory access cost, greatly enhancing computational efficiency [33]. It balances computational complexity and recognition accuracy. In recent years, ShuffleNetV2 has begun to draw attention in equipment fault identification [34].

Fault diagnosis at present still shows some limitations. For instance, the features in the dataset are insufficient, and the redundant background pixels of time-frequency graphs occupy computational resources. Moreover, complex neural networks generally have stronger fitting ability, but deep learning models may fail to train because of exploding or vanishing gradients.

In response to the above issues, we propose a fault diagnosis method for vacuum contactors integrating the time-frequency graph optimization technique with ShuffleNetV2. The main contributions of this paper are summarized as follows:

(1): A data augmentation technique using multiple-resolution GST time-frequency graphs can fully extract the signal features of each frequency band.
(2): The OTSU algorithm is combined to crop key feature areas of the time-frequency graphs, removing redundant background pixels to improve training efficiency.
(3): The ShuffleNetV2 network is employed to construct the fault diagnosis model. The recognition accuracy and the training time are improved due to its lightweight network architecture.

The rest of this paper is organized as follows: Section 2 introduces the time-frequency graph of vibration signals. Section 3 presents the graph optimization technique in detail. Section 4 describes the principles of the ShuffleNetV2 fault diagnosis model. Section 5 discusses the optimization and diagnosis results. The research conclusions and directions for future work are provided in Section 6.

Considering the convenience of readers, the Abbreviations table lists the abbreviations used in this paper and their meanings.

2. Time-Frequency Graph of Vibration Signal

2.1. Vibration Signal Acquisition

The vacuum contactor vibration signal acquisition system consists of an acceleration sensor, a constant current power module, and a data acquisition card. Vibration signals are collected using a CT1000L piezoelectric acceleration sensor, with a measurement range from 0 to 1000 g and a sensitivity of 5 mV/g. This sensor is characterized by high sensitivity and excellent anti-interference performance. To enhance the reliability of the vibration signal, the acceleration sensor is fixed near the moving contacts, with its installation axis aligned with the vibration direction, as shown in Figure 1.

The opening and closing actions of the vacuum contactor are accompanied by vibration signals. Compared with the opening action, the closing action, with more vibration sources and longer vibration durations, is more suitable as an indicator of the health state. Therefore, closing vibration signals are collected to identify various mechanical faults.

This paper focuses on the CKJ5-400/1140 vacuum contactor, which can perform up to 2000 operations per hour and withstand transient current surges up to 10 times its rated operating conditions. It operates year-round in humid environments with an air humidity of approximately 90% [35]. The harsh working environment and frequent overcurrent surges often lead to various gradual failures.

Based on the working environment and typical fault types, three fault states are simulated by referencing the literature [36]. Closing vibration signals are collected at different times to increase the diversity of sample data. The specific fault simulation scheme is shown in Table 2.

To improve the signal-to-noise ratio by limiting high-frequency noise, the sampling frequency is set to 100 kHz and each recording lasts 40 ms. All vibration signals are recorded from the moment the closing instruction is issued. A set of closing vibration signals is randomly selected for each operating state, and the corresponding waveforms are shown in Figure 2.

As shown in Figure 2, the vibration signals exhibit transient and non-stationary characteristics in different states with the following features:

(1): Normal state: The high-energy vibration is induced by the energized core and the closing main contacts, resulting in the primary and secondary peak. The signal energy gradually diminishes 20 ms later.
(2): Iron core rusting fault: When this fault occurs, rust and debris increase resistance to movement, weakening the vibration signal energy. The amplitudes in the primary peak and secondary peak are slightly lower than those in the normal state.
(3): Closing spring fatigue fault: When this fault occurs, the mechanical properties of the spring degrade and its elasticity declines. Compared with the iron core rusting fault and the normal state, the closing action reaches a stable state more easily. Therefore, the vibration occurs earlier, and the primary peak is short.
(4): Base screw loosening fault: The system's damping diminishes in this case, which increases the primary and secondary peaks and lengthens the time the signal takes to decay.

2.2. GST Time-Frequency Graph

The closing vibration signals undergo rapid changes within a very short period. It is difficult to characterize their transient frequency-domain characteristics with traditional time-frequency analysis methods. The Stockwell transform introduces the Gaussian window function, which converts one-dimensional time-series signals into two-dimensional time-frequency matrices. This method provides all local features in the time-frequency graph. The Stockwell transform of the closing vibration signal x(t) of a vacuum contactor is given as follows:

S (τ, f) = \int_{- \infty}^{\infty} x (t) ω (τ - t, f) e^{- i 2 π f t} d t

(1)

where t is time, τ is the time-shift factor, f is continuous frequency, i is the imaginary unit, and ω(τ − t, f) is the Gaussian window function, defined as follows:

ω (τ - t, f) = \frac{|f|}{\sqrt{2 π}} e^{- \frac{{(t - τ)}^{2} f^{2}}{2}}

(2)

From Formula (2), it is evident that the size of the Gaussian window function changes at a fixed rate with frequency. When the signal frequency changes rapidly within a certain period, the Stockwell transform may fail to fully capture the detailed characteristics of the signal in that period. To improve time-frequency resolution, the GST introduces an adjustment factor λ into the Gaussian window function, allowing flexible adjustment of the window size. This enables the local characteristics of vibration signals in different states to be fully reflected in the time-frequency graph. The GST of the closing vibration signal x(t) is given as follows:

S_{G S T} (τ, f) = \int_{- \infty}^{\infty} x (t) \frac{|λ| |f|}{\sqrt{2 π}} e^{- \frac{λ^{2} {(t - τ)}^{2} f^{2}}{2}} e^{- i 2 π f t} d t

(3)

When the Gaussian window width adjustment factor λ = 1, Formula (3) becomes the standard Stockwell transform. The adjustment factor λ will be further discussed in Section 3.1; for now, λ is set to 0.6. A closing vibration signal is randomly selected for each state, and the complex time-frequency matrix of the vibration signal is obtained using the discrete expression of the GST. After taking the modulus and normalizing, the corresponding GST time-frequency graphs of the four states are obtained, as shown in Figure 3.
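
As a rough illustration of the discrete GST, the sketch below evaluates Formula (3) in the frequency domain with NumPy, using the fact that the Fourier transform of the Gaussian window is again a Gaussian. The function names `gst` and `to_graph`, the one-sided frequency grid, the handling of the zero-frequency row, and the toy input are assumptions made for this example; they are not taken from the authors' implementation.

```python
import numpy as np


def gst(x, lam=1.0):
    """Discrete generalized Stockwell transform of a real 1-D signal.

    Returns a complex matrix of shape (N // 2 + 1, N): rows are frequency
    bins 0..N/2, columns are time samples.
    """
    x = np.asarray(x, dtype=float)
    n_samples = x.size
    spectrum = np.fft.fft(x)
    # Signed integer frequency indices m = 0, 1, ..., -2, -1
    m = np.fft.fftfreq(n_samples, d=1.0 / n_samples)
    n_rows = n_samples // 2 + 1
    s = np.empty((n_rows, n_samples), dtype=complex)
    s[0, :] = x.mean()  # zero-frequency row: the window is undefined at f = 0
    for n in range(1, n_rows):
        # Fourier transform of the Gaussian window with width factor lam
        gauss = np.exp(-2.0 * np.pi**2 * m**2 / (lam**2 * n**2))
        # np.roll(spectrum, -n)[k] equals X[k + n], i.e. the shifted spectrum
        s[n, :] = np.fft.ifft(np.roll(spectrum, -n) * gauss)
    return s


def to_graph(s):
    """Modulus of the GST matrix, normalized to the range [0, 1]."""
    magnitude = np.abs(s)
    return magnitude / magnitude.max()


# Toy signal: a tone at frequency bin 10 (not a measured vibration signal)
t = np.arange(128)
tone = np.cos(2.0 * np.pi * 10 * t / 128)
graph = to_graph(gst(tone, lam=0.6))
```

For this toy tone, the energy in `graph` concentrates around row 10, which is the behaviour expected of a time-frequency representation.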

3. Time-Frequency Graph Optimization Technique

3.1. Data Augmentation of Time-Frequency Graph

The GST adjustment factor λ affects the resolution of vibration signal time-frequency graphs. To extract detailed characteristics in various frequency bands and enhance the diversity of the time-frequency graph dataset, data augmentation optimization is performed by configuring multiple sets of adjustment factors.

According to the characteristics of the adjustment factor λ, when 0 < λ < 1, the width of the Gaussian window decreases inversely with frequency at a decelerating rate, facilitating the extraction of features in the low-frequency range. Conversely, when λ > 1, the window width decreases inversely with frequency at an accelerating rate, making it suitable for extracting features in the high-frequency range [37,38]. A series of λ parameters are set to generate multiple GST time-frequency graphs with different resolutions from the same signal.
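
Comparing the exponent in Formula (3) with a Gaussian of standard deviation σ gives σ = 1/(λ|f|), which makes the role of λ easy to check numerically. The short sketch below tabulates this width; the function name `window_sigma` and the sample frequencies are chosen for illustration only.

```python
def window_sigma(freq_hz, lam):
    """Standard deviation (in seconds) of the GST Gaussian window at freq_hz."""
    return 1.0 / (lam * abs(freq_hz))


for lam in (0.5, 1.0, 2.0):
    at_1k = window_sigma(1_000.0, lam)
    at_10k = window_sigma(10_000.0, lam)
    print(f"lambda={lam}: sigma(1 kHz)={at_1k:.2e} s, sigma(10 kHz)={at_10k:.2e} s")
```

At a given frequency, a smaller λ widens the time window (finer frequency resolution), while a larger λ narrows it (finer time resolution).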

The time-frequency graph of the vibration signal corresponding to λ = 1 serves as the original data. A series of adjustment factors λ, denoted as λ₁, λ₂, …, λ_n₋₁, and λ_n, are evenly set. Each λ value is applied in the GST expression to generate multi-resolution GST time-frequency graphs, thereby realizing data augmentation, as shown in Figure 4.

From Figure 4, it is observed that after the vibration signal undergoes time-frequency transformations at different scales, the number of samples is expanded to n times that of the original data. This augmentation not only enriches the diversity of time-frequency graph samples but also characterizes the time-frequency distribution characteristics in both high- and low-frequency ranges.

3.2. Cropping Optimization of Time-Frequency Graph

The data augmentation technique enriches the time-frequency graph samples. However, non-peak regions in the time-frequency graphs lack effective characteristics, and the redundant pixels in the background occupy more computational resources. To improve model training and fault diagnosis efficiency, the OTSU algorithm is employed to crop energy-concentrated regions in the time-frequency graphs [39].

The OTSU algorithm, also known as the maximum between-class variance method, partitions an image into foreground and background using an optimal threshold determined from the grayscale levels. The threshold is chosen so that the between-class variance between foreground and background is maximized. Assume the GST time-frequency graph contains k grayscale levels [0, 1, …, k − 1] and that a threshold r divides its pixels into two sets: r₁, the pixels with grayscale values no greater than r, representing the background, and r₂, the pixels with grayscale values greater than r, representing the foreground. The probabilities of occurrence of r₁ and r₂, denoted as P₁ and P₂, respectively, are as follows:

\begin{cases} P_{1} = \sum_{j = 0}^{r} \frac{n_{j}}{N} = \sum_{j = 0}^{r} p_{j} \\ P_{2} = \sum_{j = r + 1}^{k - 1} \frac{n_{j}}{N} = \sum_{j = r + 1}^{k - 1} p_{j} = 1 - P_{1} \end{cases}

(4)

where n_j is the number of pixels with grayscale j, N is the total number of pixels in the graph, and p_j is the probability of pixels with grayscale j appearing in the graph.

The grayscale mean of the background, the grayscale mean of the foreground, and the entire image are denoted as μ₁, μ₂, and μ. The relationship among them is as follows:

μ = P_{1} μ_{1} + P_{2} μ_{2}

(5)

According to the definition of variance, the expression for the between-class variance σ² is

σ^{2} = P_{1} {(μ_{1} - μ)}^{2} + P_{2} {(μ_{2} - μ)}^{2}

(6)

Substituting Formulas (4) and (5) into Formula (6), the variational equation to maximize the between-class variance σ² is constructed as follows:

\begin{cases} \max_{r} σ^{2} = \sum_{j = 0}^{r} p_{j} \sum_{j = r + 1}^{k - 1} p_{j} \cdot {(μ_{1} - μ_{2})}^{2} \\ s.t. \sum_{j = 0}^{k - 1} p_{j} = 1 \end{cases}

(7)
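
The step from Formula (6) to the objective in Formula (7) can be spelled out as follows, using only Formulas (4) and (5) and the identity P₁ + P₂ = 1:

```latex
\begin{aligned}
μ_{1} - μ &= μ_{1} - (P_{1} μ_{1} + P_{2} μ_{2}) = P_{2} (μ_{1} - μ_{2}) \\
μ_{2} - μ &= μ_{2} - (P_{1} μ_{1} + P_{2} μ_{2}) = - P_{1} (μ_{1} - μ_{2}) \\
σ^{2} &= P_{1} P_{2}^{2} {(μ_{1} - μ_{2})}^{2} + P_{2} P_{1}^{2} {(μ_{1} - μ_{2})}^{2} = P_{1} P_{2} {(μ_{1} - μ_{2})}^{2}
\end{aligned}
```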

By traversing grayscale levels, the optimal threshold r_op is determined by Formula (7). Pixels in the GST time-frequency graph below r_op are set to zero, while those above r_op are retained. The energy-concentrated portions are cropped based on the boundaries of non-zero pixels.
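
The thresholding and cropping steps can be sketched in NumPy as follows. This is a minimal illustration, assuming the normalized graph has already been quantized to 8-bit grayscale; the function names `otsu_threshold` and `crop_bounds` and the synthetic test image are hypothetical and do not come from the paper.

```python
import numpy as np


def otsu_threshold(gray, levels=256):
    """Return the threshold r that maximizes the between-class variance."""
    hist = np.bincount(gray.ravel(), minlength=levels).astype(float)
    p = hist / hist.sum()                    # p_j in Formula (4)
    p1 = np.cumsum(p)                        # P1 for every candidate r
    p2 = 1.0 - p1                            # P2 = 1 - P1
    moment = np.cumsum(np.arange(levels) * p)
    total_mean = moment[-1]
    with np.errstate(divide="ignore", invalid="ignore"):
        mu1 = moment / p1                    # background mean
        mu2 = (total_mean - moment) / p2     # foreground mean
        sigma2 = p1 * p2 * (mu1 - mu2) ** 2  # objective of Formula (7)
    # Thresholds that leave one class empty produce NaN or inf; ignore them
    sigma2 = np.nan_to_num(sigma2, nan=0.0, posinf=0.0, neginf=0.0)
    return int(np.argmax(sigma2))


def crop_bounds(gray, threshold):
    """Bounding box (row_min, row_max, col_min, col_max) of pixels above threshold."""
    mask = gray > threshold
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    return int(rows[0]), int(rows[-1]), int(cols[0]), int(cols[-1])


# Synthetic 8-bit image: dark background with one bright block
image = np.full((20, 20), 10, dtype=np.uint8)
image[5:9, 3:12] = 200
r_op = otsu_threshold(image)
r0, r1, c0, c1 = crop_bounds(image, r_op)
cropped = image[r0:r1 + 1, c0:c1 + 1]
```

The cumulative sums evaluate every candidate threshold in one pass, which matches the traversal of grayscale levels described above.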

An arbitrary sample i from the GST time-frequency graph dataset is selected. Following the above steps of the OTSU algorithm, the time and frequency values corresponding to the non-zero pixel boundary points are denoted as t_{\min}^{i}, t_{\max}^{i}, f_{\min}^{i}, and f_{\max}^{i}, respectively. The energy concentration regions are located in the time interval [t_{\min}^{i}, t_{\max}^{i}] and frequency range [f_{\min}^{i}, f_{\max}^{i}].

All the multi-resolution GST time-frequency graphs are traversed. The minimum values t_min and f_min are obtained as the minima of t_{\min}^{i} and f_{\min}^{i} over all samples i, and the maximum values t_max and f_max are obtained as the maxima of t_{\max}^{i} and f_{\max}^{i} over all samples i. The GST time-frequency graphs are uniformly cropped to the time interval [t_min, t_max] and frequency range [f_min, f_max] to extract concentrated regions of feature distributions for various closing vibration signals, as depicted in Figure 5.
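
The uniform crop region amounts to taking the union of the per-sample bounding boxes. A minimal sketch, assuming each box is stored as a tuple (t_min, t_max, f_min, f_max); the helper name `global_bounds` and the numeric boxes are made up for illustration:

```python
def global_bounds(per_sample_bounds):
    """Union of per-sample boxes given as (t_min, t_max, f_min, f_max)."""
    t_min = min(b[0] for b in per_sample_bounds)
    t_max = max(b[1] for b in per_sample_bounds)
    f_min = min(b[2] for b in per_sample_bounds)
    f_max = max(b[3] for b in per_sample_bounds)
    return t_min, t_max, f_min, f_max


# Hypothetical pixel-index boxes for three graphs (illustrative values only)
boxes = [(30, 120, 15, 90), (25, 110, 20, 95), (35, 125, 10, 85)]
print(global_bounds(boxes))  # prints (25, 125, 10, 95)
```

Using one shared region keeps every cropped graph the same size, so the graphs can be batched for network training without further resizing.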

Figure 5 illustrates that the cropping optimization technique, combined with the OTSU algorithm, can remove redundant pixels while preserving the effective feature information of the critical frequency bands. It reduces the dimension of the GST time-frequency graph and retains the concentrated energy distribution regions of GST time-frequency graphs for different states.

4. ShuffleNetV2 Fault Diagnosis

4.1. Principles of ShuffleNetV2 Network

ShuffleNet is a lightweight convolutional neural network suitable for online fault diagnosis in edge computing scenarios. ShuffleNetV1 reduces the number of network parameters by introducing group convolutions. However, when the number of groups increases, the speed of inference decreases, restricting its applications in practice [40]. To enhance parallel computing efficiency and reduce memory access costs, ShuffleNetV2 utilizes channel split and channel shuffle operations to shorten network runtime and improve feature learning capabilities [41]. The architecture of ShuffleNetV2 is illustrated in Figure 6.

From Figure 6, it is evident that the ShuffleNetV2 network consists of two Convolutional (Conv) modules and three Stages. The Conv module performs dimension reduction using convolution, batch normalization (BN), ReLU activation, and pooling layers. The reduced-dimensional features are sequentially fed into the three Stages to extract vibration signal characteristics further. Each Stage consists of two units called the basic unit and the down-sampling unit, as illustrated in Figure 7.

In the ShuffleNetV2 basic unit, the input feature channels extracted from the optimized time-frequency graphs are split into two branches. One branch preserves the original feature information. The other branch undergoes sequential operations of 1 × 1 regular convolution, 3 × 3 depth-wise convolution (DWConv), and another 1 × 1 regular convolution, all with a stride of 1 and the same number of input and output channels, ensuring the dimensions are unchanged. After convolution, the outputs of the two branches are concatenated and subjected to channel shuffling, effectively integrating vibration signal features across different channels.

In the down-sampling unit of ShuffleNetV2, the input features are fed directly into two branches without a channel split operation. Each branch includes a DWConv with a stride of 2, so the unit doubles the number of output channels and halves the spatial dimensions of the input. This enhances the computational efficiency of the network model while preserving essential vibration signal features.
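
The channel split and channel shuffle operations themselves are simple tensor rearrangements. The sketch below reproduces them with NumPy arrays in (batch, channels, height, width) layout; it omits the convolutions entirely, and the function names and the toy feature map are assumptions for illustration rather than the authors' PyTorch code.

```python
import numpy as np


def channel_split(x):
    """Split a (batch, channels, height, width) array into two channel halves."""
    half = x.shape[1] // 2
    return x[:, :half], x[:, half:]


def channel_shuffle(x, groups=2):
    """Interleave the channels of the given number of groups."""
    b, c, h, w = x.shape
    x = x.reshape(b, groups, c // groups, h, w)
    x = x.transpose(0, 2, 1, 3, 4)
    return x.reshape(b, c, h, w)


# Feature map whose channel k is filled with the value k
features = np.arange(6, dtype=float).reshape(1, 6, 1, 1) * np.ones((1, 6, 2, 2))
left, right = channel_split(features)
# In the real unit, `right` would pass through the 1x1, 3x3 DW, 1x1 convolutions here
merged = np.concatenate([left, right], axis=1)
shuffled = channel_shuffle(merged, groups=2)
print(shuffled[0, :, 0, 0])  # prints [0. 3. 1. 4. 2. 5.]
```

After shuffling, channels from the two branches alternate, so the next unit's split sends a mix of both branches down each path.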

4.2. ShuffleNetV2 Fault Diagnosis Framework

Figure 8 depicts the fault diagnosis framework for vacuum contactors, which consists of four parts: signal acquisition, feature extraction, model training, and fault diagnosis.

(1): Signal acquisition: A signal acquisition system is constructed according to the vacuum contactor fault simulation plan. Acceleration sensors are used to capture vibration signals during different states when the vacuum contactors close.
(2): Feature extraction: A series of multi-resolution GST time-frequency graphs of vibration signals are generated by combining Gaussian window width adjustment factors. The OTSU algorithm is employed to crop energy-concentrated regions from the GST time-frequency graphs.
(3): Model training: The optimized GST time-frequency graphs are partitioned into training, validation, and test sets. Model parameters are optimized, and the optimal ShuffleNetV2 model is obtained.
(4): Fault diagnosis: the optimal ShuffleNetV2 model is utilized to classify fault categories and output diagnostic outcomes.

5. Example Analysis

5.1. Time-Frequency Graph Dataset

In this study, GST time-frequency graphs of the closing vibration signals in four states are employed as signal features. The vibration signals in each state are augmented, and each state’s samples are expanded to 960. The dataset is randomly partitioned into training, validation, and test sets in a ratio of 6:2:2 for the same state, as detailed in Table 3.
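
A per-state 6:2:2 partition of 960 samples can be sketched with the standard library as follows. The function name `split_indices`, the seeds, and the use of integer indices in place of actual graphs are assumptions for this example.

```python
import random


def split_indices(n_samples, ratios=(6, 2, 2), seed=0):
    """Randomly partition sample indices into training, validation, and test sets."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    total = sum(ratios)
    n_train = n_samples * ratios[0] // total
    n_val = n_samples * ratios[1] // total
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]
    return train, val, test


states = ["normal", "iron core rusting", "closing spring fatigue", "base screw loosening"]
splits = {state: split_indices(960, seed=k) for k, state in enumerate(states)}
train, val, test = splits["normal"]
print(len(train), len(val), len(test))  # prints 576 192 192
```

Splitting each state separately keeps the four classes balanced in all three subsets.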

5.2. Optimization Results of Time-Frequency Graph

A sample from the spring fatigue is selected. Eight sets of adjustment factors are applied to the GST expression to generate multi-resolution GST time-frequency graphs, as shown in Figure 9.

From Figure 9, it is observed that as the adjustment factor λ increases, the frequency resolution of the time-frequency graph is gradually improved from low-frequency bands to high-frequency bands. The characteristics of vibration signals are adequately represented across all frequency bands.

After data augmentation of the time-frequency graphs, the effect is evaluated using the structural similarity index measure (SSIM) [42]. SSIM is utilized to assess the similarity among images, with a range from −1 to 1. A higher SSIM value indicates greater similarity between the augmented time-frequency graphs and the original data. The SSIM calculation results are illustrated in Figure 10.
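
For intuition, the sketch below computes a simplified SSIM from whole-image means, variances, and covariance. It is an assumption-laden stand-in: the standard SSIM averages the same expression over local sliding windows (as done, for example, by `skimage.metrics.structural_similarity`), and the random test images here are not time-frequency graphs from the paper.

```python
import numpy as np


def global_ssim(a, b, data_range=1.0):
    """SSIM computed from whole-image statistics (no sliding window)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    c1 = (0.01 * data_range) ** 2  # stabilizing constants of the SSIM definition
    c2 = (0.03 * data_range) ** 2
    mu_a, mu_b = a.mean(), b.mean()
    var_a, var_b = a.var(), b.var()
    cov_ab = ((a - mu_a) * (b - mu_b)).mean()
    numerator = (2 * mu_a * mu_b + c1) * (2 * cov_ab + c2)
    denominator = (mu_a**2 + mu_b**2 + c1) * (var_a + var_b + c2)
    return numerator / denominator


rng = np.random.default_rng(0)
original = rng.random((64, 64))
augmented = np.clip(original + 0.01 * rng.standard_normal((64, 64)), 0.0, 1.0)
same = global_ssim(original, original)     # identical images
close = global_ssim(original, augmented)   # slightly perturbed copy
```

Identical images give a value of 1, and the value drops as the luminance, contrast, or structure of the two images diverge.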

From Figure 10, it is evident that the SSIM values are all higher than 0.9. On one hand, this indicates a high level of consistency between the augmented time-frequency graphs and the original data, preserving important feature information of the original graphs. On the other hand, the augmentation of time-frequency graphs improves the diversity of the dataset.

The initial size of the GST time-frequency graphs is 224 × 224 pixels. After data augmentation and the crop optimization rules given in Section 3.2, the size of the GST time-frequency graphs for each state is determined to be 125 × 125 pixels. Part of the crop optimization results are illustrated in Figure 11. The fault characteristics are highlighted, and the proportion of redundant pixels is significantly reduced, which benefits both fault diagnosis accuracy and computational speed.
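
The 68.86% size reduction quoted in the abstract follows directly from these two image sizes, as the following quick check shows (the variable names are illustrative):

```python
original_side = 224
cropped_side = 125
# Fraction of pixels removed when going from 224 x 224 to 125 x 125
reduction = 1.0 - (cropped_side**2) / (original_side**2)
print(f"{reduction:.2%}")  # prints 68.86%
```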

5.3. Performance Analysis of the Proposed Method

Based on the optimization results of the GST time-frequency graphs, the ShuffleNetV2 network is constructed on the Python 3.9 platform in the PyTorch environment. The specific network architecture is detailed in Table 4.

To enhance model performance, this study sets the initial learning rate of ShuffleNetV2 to 5 × 10⁻⁴ and the batch size to 32. Transfer learning is employed to load pre-trained models so as to accelerate network convergence. Because high diagnostic accuracy is then reached within relatively few iterations, the iteration count is set to 100.

The training sets of GST time-frequency graphs without and with cropping optimization are input into the ShuffleNetV2 networks to train the model. Then, the well-trained ShuffleNetV2 models are used to classify the test set, which contains 192 samples for each state. The confusion matrices for state recognition of the test sets are shown in Figure 12a,b.

From Figure 12a, it is observed that for the GST time-frequency graphs without cropping optimization, the overall accuracy is 99.22%. The recognition accuracy for the closing spring fatigue fault and the iron core rusting fault reaches 100%. There is one misclassified sample for the base screw loosening fault, and in the normal state test set, five samples are misclassified as closing spring fatigue fault.

From Figure 12b, it is noted that in the GST time-frequency graphs with cropping optimization, the recognition accuracy of the test set reaches 99.74%. There is only one misclassified sample each for the normal state and closing spring fatigue fault. The accuracy is further improved.
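
Both overall accuracies can be reproduced from the misclassification counts stated above, given that a 6:2:2 split of 960 samples per state leaves 192 test samples per state. The helper name `accuracy` is illustrative:

```python
def accuracy(n_total, n_misclassified):
    """Overall accuracy as the fraction of correctly classified samples."""
    return (n_total - n_misclassified) / n_total


n_test = 4 * 192  # four states, 192 test samples each
print(f"{accuracy(n_test, 1 + 5):.2%}")  # without cropping: prints 99.22%
print(f"{accuracy(n_test, 1 + 1):.2%}")  # with cropping: prints 99.74%
```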

The training and recognition times are shown in Table 5. Table 5 indicates that removing the large number of redundant background pixels during graph optimization improves computational efficiency. The total training time is reduced by 9.24 min, and the total recognition time on the test set is decreased by 2.56 s. The cropping optimization technique combined with the ShuffleNetV2 model therefore maintains high diagnostic accuracy with limited computational resources.

5.4. Comparison of Different Network Structures

To validate the performance of the ShuffleNetV2 fault diagnosis model, the optimized vibration signal GST time-frequency graphs are input to the AlexNet, ResNet50, and ResNeXt50 networks to train. The training is set to 100 iterations, with a learning rate of 5 × 10⁻⁴ and a batch size of 32.

All networks load pre-trained models to achieve optimal training results. The experiment is repeated 20 times, and the recognition accuracy of each network model is recorded, as shown in Figure 13.

According to Figure 13, the average test accuracy of the ShuffleNetV2 network reaches 99.61%, which is 2.36%, 1.05%, and 0.67% higher than those of the AlexNet, ResNet50, and ResNeXt50 networks, respectively. Moreover, the accuracy remains above 99% with minor fluctuations across the 20 experiments, indicating stable performance. Benefiting from their deeper architectures, the ResNet50 and ResNeXt50 networks also maintain high accuracy. However, due to its limited convolutional and pooling layers, the feature learning capability of AlexNet is constrained, which impacts its accuracy.

The average time for each iteration is shown in Table 6. With its channel split and channel shuffle structures, ShuffleNetV2 has the shortest single iteration time of model training. Compared with AlexNet, ResNet50, and ResNeXt50, ShuffleNetV2 can reduce running time by up to 14%. AlexNet also runs faster than ResNet50 and ResNeXt50 because it has fewer layers. In addition, cropping optimization clearly reduces the time consumption, especially for ResNet50 and ResNeXt50, whose time consumption falls by 21.04% and 25.35%, respectively, compared with 19.42% for ShuffleNetV2. This is because their networks are deeper and more sensitive to the scale of the input data. ShuffleNetV2 demonstrates significant advantages in both speed and accuracy for processing time-frequency graphs.

6. Conclusions

In this paper, the vacuum contactor closing vibration signals are transformed from one-dimensional time series into two-dimensional time-frequency graphs using GST and are combined with graph optimization techniques and the ShuffleNetV2 network to assess the health state of the vacuum contactor. The following conclusions are drawn:

(1): GST introduces the Gaussian window width adjustment factor to generate multiple-resolution GST time-frequency graphs. This data augmentation technique ensures the extraction of overall time-frequency features of vibration signals, increases the diversity of the training dataset, and mitigates overfitting issues.
(2): The OTSU algorithm crops the energy concentration area of the GST time-frequency graphs. This process reduces 68.86% of redundant background pixels in these graphs. Therefore, the effective feature information of the critical frequency bands is kept, and size optimization is achieved simultaneously.
(3): A comparison is made between the AlexNet, ResNet50, ResNeXt50, and ShuffleNetV2 networks. ShuffleNetV2 achieves the highest accuracy, reaching 99.74% on the test set with an average of 99.61% over 20 repeated experiments, and the single iteration time of model training is reduced by 19.42%.

In the future, we intend to explore regular updates for this fault diagnosis model to maintain high accuracy with more available operation data. We also aim to test the validity of the proposed model in other fields.

Author Contributions

Conceptualization, H.L.; methodology, H.L.; software, Q.W.; validation, H.L. and Q.W.; formal analysis, Q.W.; investigation, H.L.; resources, H.L.; data curation, Q.W.; writing—original draft preparation, Q.W.; writing—review and editing, H.L.; visualization, Q.W.; supervision, J.S.; project administration, J.S.; funding acquisition, J.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Research and Development Program of Shanxi Province, grant number 202003D111008.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available upon reasonable request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

BN	Batch Normalization
BPNN	Back Propagation Neural Network
Conv modules	Convolutional modules
CWT	Continuous Wavelet Transform
DWConv	Depth-Wise Convolution
FC layer	Fully Connected layer
GST	Generalized Stockwell Transform
RF	Random Forest
SSIM	Structural Similarity Index Measure
ST	Stockwell Transform
STFT	Short-Time Fourier Transform
SVM	Support Vector Machine

References

1. Aghahadi, M.; Bosisio, A.; Merlo, M.; Berizzi, A.; Pegoiani, A.; Forciniti, S. Digitalization Processes in Distribution Grids: A Comprehensive Review of Strategies and Challenges. Appl. Sci. 2024, 14, 4528.
2. Liu, B.; Tan, Z.; Lan, C. Key Concepts and Framework of Power Distribution and Utilization of Transparent Power Grids. Front. Energy Res. 2022, 10, 890.
3. Wu, Z.; Fang, C.; Wu, G.; Lin, Z.; Chen, W. A CNN-Regression-Based Contact Erosion Measurement Method for AC Contactors. IEEE Trans. Instrum. Meas. 2022, 71, 3518410.
4. Chen, H.; Han, C.; Zhang, Y.; Ma, Z.; Zhang, H.; Yuan, Z. Investigation on the Fault Monitoring of High-Voltage Circuit Breaker Using Improved Deep Learning. PLoS ONE 2023, 18, e0295278.
5. Altaf, M.; Akram, T.; Khan, M.A.; Iqbal, M.; Ch, M.M.I.; Hsu, C.-H. A New Statistical Features Based Approach for Bearing Fault Diagnosis Using Vibration Signals. Sensors 2022, 22, 2012.
6. Prieto, M.D.; Cirrincione, G.; Espinosa, A.G.; Ortega, J.A.; Henao, H. Bearing Fault Detection by a Novel Condition-Monitoring Scheme Based on Statistical-Time Features and Neural Networks. IEEE Trans. Ind. Electron. 2013, 60, 3398–3407.
7. Jiang, L.; Yin, H.; Li, X.; Tang, S. Fault Diagnosis of Rotating Machinery Based on Multisensor Information Fusion Using SVM and Time-Domain Features. Shock Vib. 2014, 2014, 418178.
8. Qi, J.; Gao, X.; Huang, N. Mechanical Fault Diagnosis of a High Voltage Circuit Breaker Based on High-Efficiency Time-Domain Feature Extraction with Entropy Features. Entropy 2020, 22, 478.
9. Chen, F.; Cheng, M.; Tang, B.; Xiao, W.; Chen, B.; Shi, X. A Novel Optimized Multi-Kernel Relevance Vector Machine with Selected Sensitive Features and Its Application in Early Fault Diagnosis for Rolling Bearings. Measurement 2020, 156, 107583.
10. Bao, W.; Tu, X.; Hu, Y.; Li, F. Envelope Spectrum L-Kurtosis and Its Application for Fault Detection of Rolling Element Bearings. IEEE Trans. Instrum. Meas. 2020, 69, 1993–2002.
11. Gong, W.; Li, A.; Wu, Z.; Qin, F. Nonlinear Vibration Feature Extraction Based on Power Spectrum Envelope Adaptive Empirical Fourier Decomposition. ISA Trans. 2023, 139, 660–674.
12. Jiang, F.; Ding, K.; He, G.; Du, C. Sparse Dictionary Design Based on Edited Cepstrum and Its Application in Rolling Bearing Fault Diagnosis. J. Sound Vib. 2021, 490, 115704.
13. Vamsi, I.; Sabareesh, G.R.; Penumakala, P.K. Comparison of Condition Monitoring Techniques in Assessing Fault Severity for a Wind Turbine Gearbox under Non-Stationary Loading. Mech. Syst. Signal Process. 2019, 124, 1–20.
14. Liang, G.; Song, X.; Liao, Z.; Jia, B. Optimal Time Frequency Fusion Symmetric Dot Pattern Bearing Fault Feature Enhancement and Diagnosis. Sensors 2024, 24, 4186.
15. Tao, H.; Wang, P.; Chen, Y.; Stojanovic, V.; Yang, H. An Unsupervised Fault Diagnosis Method for Rolling Bearing Using STFT and Generative Neural Networks. J. Frankl. Inst. 2020, 357, 7286–7307.
16. Sun, S.; Zhang, T.; Li, Q.; Wang, J.; Zhang, W.; Wen, Z.; Tang, Y. Fault Diagnosis of Conventional Circuit Breaker Contact System Based on Time–Frequency Analysis and Improved AlexNet. IEEE Trans. Instrum. Meas. 2021, 70, 3508512.
17. Yan, R.; Lin, C.; Gao, S.; Luo, J.; Li, T.; Xia, Z. Fault Diagnosis and Analysis of Circuit Breaker Based on Wavelet Time-Frequency Representations and Convolution Neural Network. J. Vib. Shock 2020, 39, 198–205.
18. Li, L.; Xiao, J.; Wu, B.; Zhou, M.; Wang, Q. Online Monitoring and Diagnosis of High Voltage Circuit Breaker Faults: Feature Extraction Analysis of Vibration Signals. Int. J. Metrol. Qual. Eng. 2019, 10, 13.
19. Zhao, S.; Ma, L.; Zhu, J.; Li, J.; Zhao, H. Mechanical Fault Diagnosis of High Voltage Circuit Breaker Based on CEEMDAN Sample Entropy and FWA-SVM. Electr. Power Autom. Equip. 2020, 40, 181–186.
20. Esam El-Dine Atta, M.; Ibrahim, D.K.; Gilany, M.I. Broken Bar Faults Detection Under Induction Motor Starting Conditions Using the Optimized Stockwell Transform and Adaptive Time–Frequency Filter. IEEE Trans. Instrum. Meas. 2021, 70, 3518110.
21. Zaman, W.; Ahmad, Z.; Siddique, M.F.; Ullah, N.; Kim, J.-M. Centrifugal Pump Fault Diagnosis Based on a Novel SobelEdge Scalogram and CNN. Sensors 2023, 23, 5255.
22. Yuan, P.; Zhang, J.; Feng, J.; Wang, H.; Ren, W.; Wang, C. An Improved Time-Frequency Analysis Method for Structural Instantaneous Frequency Identification Based on Generalized S-Transform and Synchroextracting Transform. Eng. Struct. 2022, 252, 113657.
23. Zhang, J.; Sun, H.; Sun, Z.; Dong, W.; Dong, Y. Fault Diagnosis of Wind Turbine Power Converter Considering Wavelet Transform, Feature Analysis, Judgment and BP Neural Network. IEEE Access 2019, 7, 179799–179809.
24. Du, C.; Gao, S.; Jia, N.; Kong, D.; Jiang, J.; Tian, G.; Su, Y.; Wang, Q.; Li, C. A High-Accuracy Least-Time-Domain Mixture Features Machine-Fault Diagnosis Based on Wireless Sensor Network. IEEE Syst. J. 2020, 14, 4101–4109.
25. Miao, D. Research on Fault Diagnosis of High-Voltage Circuit Breaker Based on Support Vector Machine. Int. J. Pattern Recognit. Artif. Intell. 2019, 33, 1959019.
26. Hu, Q.; Si, X.; Zhang, Q.; Qin, A. A Rotating Machinery Fault Diagnosis Method Based on Multi-Scale Dimensionless Indicators and Random Forests. Mech. Syst. Signal Process. 2020, 139, 106609.
27. Liu, A.; Yang, Z.; Li, H.; Wang, C.; Liu, X. Intelligent Diagnosis of Rolling Element Bearing Based on Refined Composite Multiscale Reverse Dispersion Entropy and Random Forest. Sensors 2022, 22, 2046.
28. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. Adv. Neural Inf. Process. Syst. 2012, 25, 1097–1105.
29. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778.
30. Xie, S.; Girshick, R.; Dollár, P.; Tu, Z.; He, K. Aggregated Residual Transformations for Deep Neural Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 1492–1500.
31. Khan, M.M.; Uddin, M.S.; Parvez, M.Z.; Nahar, L. A Squeeze and Excitation ResNeXt-Based Deep Learning Model for Bangla Handwritten Compound Character Recognition. J. King Saud Univ. Comput. Inf. Sci. 2022, 34, 3356–3364.
32. Chen, L.; Li, S.; Bai, Q.; Yang, J.; Jiang, S.; Miao, Y. Review of Image Classification Algorithms Based on Convolutional Neural Networks. Remote Sens. 2021, 13, 4712.
33. Ma, N.; Zhang, X.; Zheng, H.-T.; Sun, J. ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design. In Proceedings of Computer Vision – ECCV 2018, Part XIV, Munich, Germany, 8–14 September 2018; Springer International Publishing AG: Cham, Switzerland, 2018; Volume 11218, pp. 122–138.
34. Wang, W.; Guo, S.; Zhao, S.; Lu, Z.; Xing, Z.; Jing, Z.; Wei, Z.; Wang, Y. Intelligent Fault Diagnosis Method Based on VMD-Hilbert Spectrum and ShuffleNet-V2: Application to the Gears in a Mine Scraper Conveyor Gearbox. Sensors 2023, 23, 4951.
35. CKJ5-400/1.14 AC Vacuum Contactor. Available online: https://dc-components.com/product/glvac-ckj5-400-1-14kv-ac-vacuum-contactor (accessed on 16 September 2024).
36. Chen, X.; Feng, D.; Lin, S. Mechanical Fault Diagnosis Method of High Voltage Circuit Breaker Operating Mechanism Based on Deep Auto-Encoder Network. High Volt. Eng. 2020, 46, 3080–3088.
37. Cui, R.; Tong, D.; Li, Z. Aviation Arc Fault Detection Based on Generalized S Transform. Proc. Chin. Soc. Electr. Eng. 2021, 41, 8241–8249.
38. Peng, Y.; Ma, X. A Fault Diagnosis Method of Rolling Bearings Based on Parameter Optimization and Adaptive Generalized S-Transform. Machines 2022, 10, 207.
39. Lopez-Ramirez, M.; Ledesma-Carrillo, L.M.; Garcia-Guevara, F.M.; Munoz-Minjares, J.; Cabal-Yepez, E.; Villalobos-Pina, F.J. Automatic Early Broken-Rotor-Bar Detection and Classification Using Otsu Segmentation. IEEE Access 2020, 8, 112624–112632.
40. Wang, Y.; Yan, J.; Sun, Q.; Zhao, Y.; Liu, T. ShuffleNet-Based Comprehensive Diagnosis for Insulation and Mechanical Faults of Power Equipment. High Volt. 2021, 6, 861–872.
41. Yang, H.; Liu, J.; Mei, G.; Yang, D.; Deng, X.; Duan, C. Research on Real-Time Detection Method of Rail Corrugation Based on Improved ShuffleNet V2. Eng. Appl. Artif. Intell. 2023, 126, 106825.
42. Zhou, Y.; Cao, R.; Zhang, A.; Li, P. An Interference Mitigation Method for FMCW Radar Based on Time–Frequency Distribution and Dual-Domain Fusion Filtering. Sensors 2024, 24, 3288.

Figure 1. Acceleration sensor mounting position diagram.

Figure 2. Closing vibration signal waveforms in different states.

Figure 3. GST time-frequency graphs of vibration signals in different states: (a) normal; (b) iron core rusting; (c) closing spring fatigue; (d) base screw loosening.

Figure 4. Vibration signal GST time-frequency graph data augmentation.

Figure 5. Cropping optimization of the GST time-frequency graph.

Figure 6. ShuffleNetV2 network architecture.

Figure 7. Two basic modules for ShuffleNetV2: (a) basic unit; (b) down-sampling unit.
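The two operations that the basic unit in Figure 7 relies on, channel split and channel shuffle, can be sketched with plain array reshapes. This is an illustrative NumPy sketch only: the channel-first (C, H, W) layout, the group count of 2, and the function names are assumptions, and the convolutions applied to one branch inside the real unit are omitted.

```python
import numpy as np


def channel_split(x):
    """Split a (C, H, W) feature map into two halves along the channel axis."""
    half = x.shape[0] // 2
    return x[:half], x[half:]


def channel_shuffle(x, groups=2):
    """Interleave channels across groups so information mixes between branches."""
    c, h, w = x.shape
    if c % groups:
        raise ValueError("channel count must be divisible by groups")
    return x.reshape(groups, c // groups, h, w).transpose(1, 0, 2, 3).reshape(c, h, w)


# Four channels, each filled with its own index, so the ordering is easy to read.
features = np.arange(4).reshape(4, 1, 1) * np.ones((4, 2, 2))

identity_branch, conv_branch = channel_split(features)
# The real unit would apply convolutions to conv_branch here (omitted).
merged = np.concatenate([identity_branch, conv_branch], axis=0)
order = channel_shuffle(merged)[:, 0, 0].astype(int).tolist()
print(order)
```

After the shuffle, channels from the two branches alternate, so the next unit's split sends a mix of both branches down each path.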

Figure 8. The framework of vacuum contactor fault diagnosis.

Figure 9. Multi-resolution GST time-frequency graphs.

Figure 10. The values of SSIM between time-frequency graph augmented data and original data.

Figure 11. Optimization results of cropped GST time-frequency graphs: (a) normal; (b) iron core rusting; (c) closing spring fatigue; (d) base screw loosening.

Figure 12. State recognition confusion matrices for GST time-frequency graphs: (a) without cropping optimization; (b) with cropping optimization.

Figure 13. The accuracy of the test set for GST time-frequency graphs in four networks. AlexNet [28], ResNet50 [29], ResNeXt50 [30], ShuffleNetV2 [33].

Table 2. Vacuum contactor fault simulation scheme.

State	Fault Simulation Method	Collection Period	Number of Acquisitions
Normal	None	After industrial trials	50
		The first overhaul	40
		The second overhaul	30
Iron core rusting	A few iron filings added inside the core	After industrial trials	50
		The first overhaul	40
		The second overhaul	30
Closing spring fatigue	Spring pre-compression reduced by 3 mm	After industrial trials	50
		The first overhaul	40
		The second overhaul	30
Base screw loosening	Base screws screwed outwards 4 mm	After industrial trials	50
		The first overhaul	40
		The second overhaul	30

Table 3. Sample division of GST time-frequency graphs.

State	Label	Training Set Samples	Validation Set Samples	Test Set Samples
Normal	1	576	192	192
Iron core rusting	2	576	192	192
Closing spring fatigue	3	576	192	192
Base screw loosening	4	576	192	192
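The counts in Tables 2 and 3 can be cross-checked with a few lines of arithmetic. The sketch below is illustrative; in particular, the augmentation factor and the 60/20/20 split ratio are inferred from the table values and are not stated in this section.

```python
# Acquisitions per state over the three collection periods (Table 2).
ACQUISITIONS = (50, 40, 30)
# Graphs per state after augmentation: training + validation + test (Table 3).
GRAPHS_PER_STATE = 576 + 192 + 192

raw_per_state = sum(ACQUISITIONS)
# Inferred number of time-frequency graphs generated per raw signal.
augmentation_factor = GRAPHS_PER_STATE // raw_per_state


def split_counts(total, ratios=(6, 2, 2)):
    """Divide `total` samples into train/validation/test by integer ratios."""
    parts = sum(ratios)
    train = total * ratios[0] // parts
    val = total * ratios[1] // parts
    return train, val, total - train - val


print(raw_per_state, augmentation_factor, split_counts(GRAPHS_PER_STATE))
```

With four states, the same numbers imply 3840 graphs in total, of which 768 form the test set.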

Table 4. Structure of the ShuffleNetV2 network model.

Model	Layer	Output Size	Kernel Size
ShuffleNetV2 fault diagnosis model	Input	125 × 125 × 3
	Conv1	63 × 63 × 24	3 × 3
	MaxPool	32 × 32 × 24	3 × 3
	Stage2	16 × 16 × 116
	Stage3	8 × 8 × 232
	Stage4	4 × 4 × 464
	Conv5	4 × 4 × 1024	1 × 1
	GlobalPool	1024
	FC	4
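The spatial sizes in Table 4 are consistent with each of the first five layers halving the resolution. The sketch below traces them under assumed settings that are common for ShuffleNetV2 but not stated in the table: a 3 × 3 kernel, stride 2, and padding 1 in Conv1, MaxPool, and the first unit of each stage.

```python
def conv_out(size, kernel=3, stride=2, padding=1):
    """Output length of a convolution or pooling layer along one spatial axis."""
    return (size + 2 * padding - kernel) // stride + 1


size = 125  # input height and width from Table 4
trace = []
for layer in ("Conv1", "MaxPool", "Stage2", "Stage3", "Stage4"):
    size = conv_out(size)
    trace.append((layer, size))

for layer, out in trace:
    print(f"{layer}: {out} x {out}")
```

Conv5 uses a 1 × 1 kernel and keeps the 4 × 4 resolution, after which global pooling collapses each of the 1024 channels to a single value for the four-class FC layer.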

Table 5. Training and recognition time for GST time-frequency graph model without and with cropping optimization.

Metric	Without Cropping Optimization	With Cropping Optimization
Model training time (min)	47.65	38.41
Total recognition time of the test set (s)	6.79	4.23

Table 6. The average time required for each iteration with different networks.

Network	Without Cropping Optimization	With Cropping Optimization
AlexNet [28]	33.26 s	27.32 s
ResNet50 [29]	39.49 s	31.18 s
ResNeXt50 [30]	43.99 s	32.84 s
ShuffleNetV2 [33]	28.43 s	22.91 s

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
