Fault Diagnosis of Permanent Magnet Synchronous Motor Based on Wavelet Packet Transform and Genetic Algorithm-Optimized Back Propagation Neural Network

Ye, Ming; Gong, Run; Wu, Wanjun; Peng, Zhiyuan; Jia, Kelin

doi:10.3390/wevj16040238

Open AccessArticle

Fault Diagnosis of Permanent Magnet Synchronous Motor Based on Wavelet Packet Transform and Genetic Algorithm-Optimized Back Propagation Neural Network

by

Ming Ye

¹,

Run Gong

^1,*,

Wanjun Wu

¹,

Zhiyuan Peng

² and

Kelin Jia

¹

College of Vehicle Engineering, Chongqing University of Technology, Chongqing 400054, China

²

Chongqing Tsingshan Industrial Co., Ltd., Chongqing 402761, China

^*

Author to whom correspondence should be addressed.

World Electr. Veh. J. 2025, 16(4), 238; https://doi.org/10.3390/wevj16040238

Submission received: 20 February 2025 / Revised: 2 April 2025 / Accepted: 15 April 2025 / Published: 18 April 2025

(This article belongs to the Special Issue Permanent Magnet Motors and Driving Control for Electric Vehicles)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, a fault diagnosis method for permanent magnet synchronous motors is proposed, combining wavelet packet transform (WPT) energy feature extraction and a genetic algorithm (GA)-optimized back propagation (BP) neural network. Firstly, for the common types of motor faults (turn-to-turn short-circuit, phase-to-phase short-circuit, loss of magnetism, inverter open-circuit, rotor eccentricity), a corresponding motor fault model is established. The stator current signals during motor operation are analyzed using wavelet packet transform, and energy features are extracted from them as feature vectors for fault diagnosis. Then, a BP neural network is constructed, and a genetic algorithm is used to optimize its initial weights and thresholds, thereby improving the network’s classification accuracy. The results show that the GA-BP model outperforms the SSA-PNN diagnostic model in terms of fault classification accuracy. In particular, for the diagnosis of normal operation, inverter open-circuit, and demagnetization faults, the accuracy rate reaches 100%. This method demonstrates high diagnostic accuracy and practical application value.

Keywords:

wavelet packet transform; genetic algorithm; BP neural network; permanent magnet synchronous motor; fault diagnosis

1. Introduction

Due to the advantages of high power density, precise torque control, compact size, and low noise, permanent magnet synchronous motors (PMSMs) have seen widespread adoption in a variety of industrial applications in recent years [1]. However, PMSMs also face several challenges, such as high power density, limited installation space, and heat dissipation difficulties, which contribute to their high failure rate [2]. As a core component in the system, the reliability of the motor is critical to the overall system performance. When a motor fails, it may trigger a chain reaction, potentially causing the collapse of the entire system [3]. Therefore, early fault detection in motors is of great significance in ensuring the normal operation of the system. While several experts and scholars have conducted extensive research on PMSM faults, most studies focus on the identification and diagnosis of a single or a few faults. In practical applications, however, multiple fault types often occur simultaneously, limiting the adaptability of single-fault diagnostic methods [4]. Consequently, fault diagnosis methods for multiple fault scenarios in PMSMs have become a research hotspot.

In motor fault diagnosis, signal analysis is a commonly used method for fault feature extraction, which can effectively detect potential faults by analyzing the signals collected during motor operation. Common stator current signal processing methods in practice include current normalization [5], wavelet transform [6], Fast Fourier Transform (FFT) [7], and Hilbert–Huang transform (HHT) [8]. These methods are used to extract fault features by transforming motor operation signals into the frequency domain. In [9], the authors analyzed the current and voltage harmonic components of a turn-to-turn short-circuit faulty motor and used Fast Fourier Transform (FFT) to obtain space vector spectral features for fault detection. In [10], the authors proposed an Analysis of Variance (ANOVA)-based method that uses experimental data to diagnose faults in Variable Frequency Drive (VFD)-powered induction motors. In [11], the authors coupled a transient finite element model of an asynchronous motor with an external circuit to investigate its end characteristics and applied discrete wavelet transform to fault diagnosis. In [12], the authors determined the eigenfrequency of the demagnetization fault through finite element analysis, then extracted the instantaneous frequency components of the stator current using Empirical Mode Decomposition (EMD), and analyzed each of the eigenmode components by Hilbert–Huang transform. In [13], the authors proposed a method combining vibration and stator current signals by obtaining the spectrograms of both signals using FFT, then fusing these spectrograms for more accurate fault diagnosis. However, using the Fourier transform to identify specific harmonics and their content in stator current as fault features may result in misclassification due to the various causes of harmonic variations [14]. For example, environmental factors and load changes can affect the spectral characteristics of the current signal. In contrast, wavelet packet analysis offers better time–frequency domain resolution and can decompose both the low- and high-frequency components of the fault signal simultaneously, thus more effectively extracting fault features [15]. Wavelet packet transform provides a more refined signal decomposition process and delivers more frequency band information, enabling it to better identify characteristic signals in complex fault scenarios.

Artificial intelligence methods have been widely applied in electric motor fault diagnosis, improving diagnostic efficiency and reliability. In [16], genetic algorithms are used to diagnose faults in fuzzy neural networks. Although the accuracy has improved compared to traditional diagnostic methods, the training data are overly simplistic and unrepresentative, leading to significant discrepancies between the selected data and actual fault types, indicating room for improvement. In [17], the authors applied the Radial Basis Function (RBF) neural network to fault diagnosis and compared the advantages and disadvantages of RBF and BP neural networks, highlighting RBF’s advantages in diagnosis. BP neural networks have a simple structure, making them suitable for solving complex internal mechanisms and capable of approximating any nonlinear function with arbitrary accuracy. Through the backpropagation algorithm, the BP neural network continuously adjusts the network weights and thresholds, minimizing the difference between actual and desired outputs [18]. This method has been widely adopted in fault diagnosis systems across various fields [19]. However, one major problem with BP neural networks is the random selection of initial weights and thresholds, which can lead to an unstable training process, failure to converge, and a tendency to fall into local minima, resulting in significant variation in results across different training sessions [20]. To overcome this issue, genetic algorithms (GA) can be used for the global optimization of BP neural networks’ initial weights and thresholds. The global optimization and robustness of genetic algorithms help avoid local minima and improve network classification accuracy [21]. By inputting the optimized results into the BP network for training, this approach not only leverages the powerful mapping capability of BP neural networks, but also significantly enhances the accuracy and reliability of fault diagnosis.

Combining the above analyses, this paper first establishes a simulation model for common fault types of permanent magnet synchronous motors using Matlab/Simulink(R2023a) and Ansoft/Maxwell (Ansoft Maxwell 16.0). To avoid the issues of using raw current signals as fault feature vectors, which may result in excessive input dimensions or dimensional catastrophe, this paper employs wavelet packet transform to extract the energy features of stator current signals during motor operation. The energy features of each frequency band are then extracted as input vectors for the diagnostic model. Next, a BP neural network is constructed. To address the convergence issues and the tendency to fall into local minima in traditional BP neural networks, this paper optimizes the network’s initial weights and thresholds using genetic algorithms to improve classification accuracy and robustness.

2. PMSM Fault Modeling

In this study, Matlab/Simulink is employed to model, simulate, and analyze turn-to-turn short-circuit faults, phase-to-phase short-circuit faults, demagnetization faults, and inverter open-circuit faults. While Simulink is primarily utilized for system-level dynamic simulations, particularly for rotor eccentricity faults, it lacks high-precision modeling capabilities for internal magnetic field distribution, air gap variations, and their detailed effects on motor performance. Therefore, Ansoft/Maxwell is used specifically for simulating rotor eccentricity faults. However, it is not employed for the simulation of other faults due to the following considerations.

As finite element analysis (FEA) software, Ansoft/Maxwell excels at accurately computing the electromagnetic field distribution of motors, making it well suited for studying the effects of air gap magnetization variations caused by rotor eccentricity faults. Conversely, for electrical faults such as short circuits, demagnetization, and inverter faults, system-level dynamic characteristics are more critical. Matlab/Simulink is more appropriate for modeling and analysis in these cases, as it can efficiently simulate motor control systems and visualize the effects of faults on the dynamic responses of current, voltage, and torque.

Additionally, finite element simulations are computationally intensive. Conducting all fault simulations in Ansoft/Maxwell would significantly increase computational costs. Therefore, to balance computational accuracy and efficiency, Ansoft/Maxwell is exclusively used for rotor eccentricity fault analysis. The motor parameters used in this study are presented in Table 1.

Based on the

i_{d} = 0

double closed-loop vector control principle, a fault simulation model for the PMSM is constructed using the Simulink modular modeling approach, as illustrated in Figure 1. This model serves as the foundation for simulating turn-to-turn short-circuit faults, phase-to-phase short-circuit faults, demagnetization faults, and inverter open-circuit faults.

Stator turn-to-turn short circuits are the most common type of fault in permanent magnet synchronous motors (PMSMs). They are caused by the breakage of the insulation inside the same-phase winding, resulting in a short circuit between multiple coil turns [22]. The equivalent circuit in the case of a short circuit between turns in phase B is shown in Figure 2. When an inter-turn short-circuit fault occurs in the motor, a nonlinear model of the permanent magnet synchronous motor inter-turn short-circuit fault is constructed by adding a short-circuit branch to the short-circuited phase, and the voltage equation of the fault model is as follows:

U_{a b c f}^{s} = R_{a b c f}^{s} I_{a b c f}^{s} + \frac{d ψ_{a b c f}^{s}}{d t}

(1)

\{\begin{matrix} \begin{matrix} U_{a b c f}^{s} = [\begin{matrix} U_{a}^{s} & U_{b}^{s} & U_{c}^{s} & U_{f}^{s} \end{matrix} \end{matrix}]^{T}, U_{f}^{s} = 0 \\ R_{a b c f}^{s} = [\begin{matrix} R_{a}^{s} & 0 & 0 & 0 \\ 0 & R_{b}^{s^{'}} & 0 & 0 \\ 0 & 0 & R_{c}^{s} & 0 \\ 0 & 0 & 0 & R_{f}^{s} \end{matrix} \\ I_{a b c f}^{s} = {[\begin{matrix} i_{a}^{s} & i_{b}^{s} & i_{c}^{s} & i_{f}^{s} \end{matrix}]}^{T} \\ ψ_{a b c f}^{s} = {[\begin{matrix} ψ_{a}^{s} & ψ_{b}^{s} & ψ_{c}^{s} & ψ_{f}^{s} \end{matrix}]}^{T} \end{matrix}

(2)

where

U_{a b c f}^{s}

,

R_{a b c f}^{s}

,

I_{a b c f}^{s}

, and

ψ_{a b c f}^{s}

denote the voltage, resistance, current, and winding chain of the three-phase and short-circuit phase windings, respectively.

When an inter-phase short-circuit fault occurs, a short-circuit fault between any two phases of the motor can be simulated by using the three-phase fault module in Simulink, setting the fault time and the location of the faulted phase. The three-phase fault module and its internal structure are shown in Figure 3.

When a demagnetization fault occurs, appropriate modifications can be made to the original mathematical model to reflect the fault. By introducing the demagnetization scale factor

σ

into the normal motor magnetic chain equation, the magnetic chain equation during the demagnetization fault can be expressed as

[\begin{matrix} ψ_{A} \\ ψ_{B} \\ ψ_{C} \end{matrix}] = [\begin{matrix} L_{A} & M_{AB} & M_{AC} \\ M_{BA} & L_{B} & M_{BC} \\ M_{CA} & M_{CB} & L_{C} \end{matrix}] [\begin{matrix} i_{A} \\ i_{B} \\ i_{C} \end{matrix}] + σ [\begin{matrix} ψ_{mA} \\ ψ_{mB} \\ ψ_{mC} \end{matrix}]

(3)

σ = 1 - \frac{ψ_{f}^{'}}{ψ_{f}}

(4)

When an open-circuit fault occurs in the IGBT of the inverter, the drive signals of the power devices are controlled by the high and low levels of the step module to simulate the occurrence of an open-circuit fault in the inverter. The result of the inverter open-circuit topology is shown in Figure 4.

Rotor eccentricity fault in a motor refers to the displacement of the rotor’s rotational axis from the geometric center of the stator–rotor system, resulting in a non-uniform circumferential distribution of the air gap. In the simulation, this fault is modeled by adjusting the rotor’s geometric center through displacement. Using the Maxwell platform and the eccentric modeling tool in the ACT extension module, static, dynamic, and mixed eccentricity modes can be configured.

In this study, the motor’s air gap length is set to 1 mm, and for the mixed eccentricity fault, both static and dynamic eccentricities are configured at 0.2 mm. The fault model is illustrated in Figure 5, and the simulation data for rotor eccentricity are based on the motor parameters presented in Table 1.

The simulation model of the drive system with normal double-closed-loop vector control is established using Matlab/Simulink, and faults such as turn-to-turn short-circuit, phase-to-phase short-circuit, permanent magnet demagnetization, and inverter open-circuit are simulated and analyzed. Simultaneously, the simulation analysis of rotor eccentricity faults is carried out using Ansoft/Maxwell. Through the comprehensive analysis of voltage, current, and torque waveforms under fault conditions, it was found that the stator current can serve as a characteristic signal for fault diagnosis.

3. Fault Feature Extraction

Wavelet packet transform is a signal processing method developed based on traditional wavelet analysis. Unlike the wavelet transform, which only decomposes the low-frequency part, the wavelet packet transform can simultaneously decompose both the low-frequency and high-frequency parts of the signal more finely, thereby effectively improving the time–frequency resolution. This method is especially suitable for processing gradient signals [23]. By wavelet packet decomposition, the detailed features of the signal can be captured more accurately, particularly in the high-frequency band, compensating for the shortcomings of traditional wavelet analysis in terms of poor resolution in the high-frequency range. Each permanent magnet synchronous motor fault waveform signal has unique time–frequency characteristics, and after wavelet packet decomposition, the energy distribution in each frequency band will vary. Therefore, the energy features obtained from wavelet packet decomposition can be used as feature vectors for fault diagnosis [24].

In Figure 6,

S

denotes the original signal, and

S_{(i, j)}^{m}

represents the value of the

j

node in the

i

layer of the wavelet packet decomposition tree, with

m

being the node number. The wavelet packet transform is used to extract the features of the fault signal, and the energy characteristics of the signal are utilized to construct the feature vector. The implementation steps are as follows:

(1) The three-phase currents at the time of the fault are collected, and after P-layer wavelet packet decomposition, the feature information of each frequency band of the Pth layer is extracted. The recursive formula for wavelet packet decomposition is as follows:

\{\begin{matrix} d_{k}^{j + 1, 2 n} & = & \sum_{k \in Z} d_{p}^{j, n} h_{(2 p - k)} \\ d_{k}^{j + 1, 2 n + 1} & = & \sum_{k \in Z} d_{p}^{j, n} g_{(2 p - k)} \end{matrix}

(5)

Low-frequency analysis is performed using the filter h, which extracts the low-frequency components of the signal to generate the sub-band

d_{k}^{j + 1, 2 n}

. High-frequency analysis is conducted using the filter g, which decomposes the high-frequency components and produces the corresponding high-frequency sub-band

d_{k}^{j + 1, 2 n + 1}

. Here,

d_{p}^{j, n}

represents the signal coefficients at the current decomposition level, while

h_{(2 p - k)}

and

g_{(2 p - k)}

denote the coefficients of the low-pass and high-pass filters used in wavelet packet decomposition, respectively.

(2) The wavelet packet decomposition coefficients are reconstructed, and the recursive formula for wavelet packet reconstruction is as follows:

d_{p}^{j, n} = \sum_{k \in Z} d_{k}^{j + 1, 2 n} h_{(p - 2 k)} + \sum_{k \in Z} d_{k}^{j + 1, 2 n + 1} g_{(p - 2 k)}

(6)

The terms

h_{(p - 2 k)}

and

g_{(p - 2 k)}

represent the coefficients of the low-pass and high-pass filters used in wavelet reconstruction, respectively. The notation

k \in z

denotes summation over all integer indices, k.

(3) Calculate the energy of each frequency band signal and construct the feature vector of the fault signal. Since the input and output signals are random signals, let

E_{p, j}

be the energy of the wavelet packet decomposition coefficient sequence

S_{(p, j)}^{m}

of node

j

in layer

p

, which can be calculated by Equation (7).

E_{p, j} = \int ∣ S_{(p, j)}^{m} (t) ∣^{2} d t = \sum_{k = 1}^{n} ∣ d_{j, k} ∣^{2}

(7)

where

d_{j, k} (j = 0, 1, 2, \dots, 7; k = 1, 2, \dots, n)

is the wavelet packet coefficient of node

S_{(p, j)}^{m}

. The term

\sum_{k = 1}^{n} ∣ d_{j, k} ∣^{2}

represents the total energy of the signal’s coefficients after the wavelet packet transform. This energy is approximated by squaring and summing the wavelet packet coefficients

d_{j, k}

.The energy feature vector of the fault signal can be expressed as

T = [E_{p, 0}, E_{p, 1}, \dots, E_{p, 2^{p} - 1}]

(8)

(4) Normalize the feature vectors computed in step (3) so that the total energy of the signal is

E

:

E = \sqrt{\sum_{j = 0}^{2 p - 1} {|E_{j}|}^{2}}

(9)

E_{j, m} = \frac{E_{j, m} - E_{j, \min}}{E_{j, \max} - E_{j, \min}}

(10)

The fault feature vector

T_{1}

after normalizing the total energy of the signal can be expressed as

T_{1} = [\begin{matrix} \frac{E_{p, 0}}{E} & \frac{E_{p, 1}}{E} & \dots & \frac{E_{p, 2^{p} - 1}}{E} \end{matrix}]

(11)

The current signals extracted from the Simulink fault simulation model undergo wavelet packet processing to enhance data quality and extract meaningful features. First, the db6 wavelet basis function, which is well-suited for fault diagnosis, is selected. The entropy type is set to “Shannon”, and the original signal is decomposed into three wavelet packet levels (P = 3). Following this decomposition, the signal is divided into eight frequency bands, each containing specific fault-related information. Figure 7 shows the energy distribution of the motor in six states under rated operating conditions, with each color corresponding to a fault type. The energy spectrum characteristics of different faults in each frequency band are different, indicating that wavelet packet analysis can effectively extract the characteristic information of different faults. Therefore, it is feasible to distinguish fault types through energy spectrum distribution, which further demonstrates the effectiveness of the drive system fault diagnosis method utilizing the wavelet packet energy spectrum.

4. BP Neural Network Fault Diagnosis

A BP neural network is a multilayer feed-forward neural network training based on the error backpropagation algorithm, consisting of an input layer, an output layer, and one or more uncoupled hidden layers [25].

Fault diagnosis based on BP neural networks consists of two steps: first, the network is trained using training samples to generate a model; then, the model is utilized to test the diagnostic samples. The dataset is randomly divided into 2 parts, with 800 groups used as training samples and 300 groups used as test samples to verify the diagnostic effect.

4.1. BP Neural Network Topology Design

Based on the constructed PMSM drive system fault simulation platform, a fault dataset is generated by configuring various speed–torque combination conditions, as shown in Table 2. A total of 100 samples are collected under normal conditions, while 200 samples are generated for each fault type, resulting in 1100 samples overall. The dataset is then split in an 8:3 ratio, with 800 samples allocated for training.

(1): Number of neurons in the input layer.

For the fault diagnosis problem, the number of neurons in the input layer is determined by the length of the fault feature vector. In this paper, the energy values of eight frequency bands obtained after the three-layer wavelet packet decomposition of the fault current signal are normalized and input into the neural network as feature vectors, so the number of neurons in the input layer is

N_{1} = 8

.

(2): Number of neurons in the output layer

This study examines five typical fault modes, including winding turn-to-turn short circuit and phase-to-phase short circuit, along with normal operating conditions. The output layer of the network employs a multi-classification structure, where each operating state corresponds to a distinct output node. Consequently, the number of neurons in the output layer is

N_{3} = 6

. The specific expected response vector parameters are detailed in Table 3.

(3): Normalization

The mapminmax function is utilized to scale the data to the interval [−1, 1]. The formula for normalization is as follows:

X_{n o r m} = 2 \cdot \frac{X - X_{m i n}}{X_{m a x} - X_{m i n}} - 1

(12)

where

X_{m a x}

and

X_{m i n}

are the maximum and minimum values, respectively,

X

and

X_{n o r m}

are the data before and after normalization.

(4): Activation function

The activation function of the hidden layer is “tansig,” which can map the output value of the hidden layer to [−1, 1]. For the output layer, the logarithmic S-shaped function “logsig” is used, which can map the output value of the network to [0, 1]. The training algorithm is “Levenberg–Marquardt.”

(5): Training parameters

The learning rate is set to 0.01, the mean square error is set to 0.03, the maximum number of iterations is set to 10,000, and the maximum number of failures is set to 6. The evaluation criterion for the training objectives is chosen as mean squared error (MSE), which is defined as follows:

M S E = \frac{1}{m} \sum_{k = 1}^{m} {(t_{k} - y_{k})}^{2}

(13)

where

t_{k}

is the desired output,

y_{k}

is the actual output, and

m

is the number of outputs.

(6): Structure of the implicit layer

The determination of the number of nodes in the implicit layer can refer to the following three empirical formulas:

\{\begin{matrix} N_{2} = \sqrt{N_{1} \cdot N_{3}} + n \\ N_{2} = 2 N_{1} + 1 \\ N_{2} = \sqrt{N_{1} + N_{3}} + n \end{matrix}

(14)

where

N_{2}

denotes the number of nodes in the implicit layer, and n has a range of values from 1 to 10 and is an integer. Usually, the range of the number of hidden layer nodes can be estimated by empirical formulas, and then optimized within the range by the trial-and-error method. Since the network training effect is closely related to the random initialization, the actual determination is carried out for 12 times of training, and after removing the maximum and minimum errors, the average error of the remaining 10 times of training is calculated as the evaluation index. Substituting

N_{1} = 8

and

N_{3} = 6

into the formula, the value range of the number of hidden layer nodes is [5, 17]. The training samples are utilized to traverse the case of different numbers of hidden layer nodes, and the average error and classification correctness of 10 training sessions are used as the evaluation criteria.

Figure 8 shows the classification correctness and training effect of the BP network corresponding to different numbers of hidden layer nodes. For the pattern classification problem, the higher the classification correct rate and the smaller the training error, the better. Considering these two indicators, the number of hidden layer nodes is finally determined to be

N_{2} = 14

. Therefore, the structure of the hidden layer BP network designed in this paper is 8-14-6.

4.2. Initialize Weights and Thresholds

Based on the established BP neural network structure, the total number of weights and thresholds can be determined, as shown in Table 4.

(1): Weight initialization

The weights are parameters that connect neurons and determine the significance of each neuron’s output. In this study, the “tansig” function is used as the activation function for the hidden layer, and the Xavier initialization method is applied to initialize the weights.

Since the tansig function has an output range of [−1, 1], its derivative is an around of 0 at the largest, but tends to saturate for large inputs, leading to gradient vanishing. To mitigate this issue, the Xavier initialization method assumes equal variance between the input and output, ensuring that gradients neither vanish nor explode during forward and backward propagation. The initialization formula is as follows:

W ~ U (- \frac{\sqrt{6}}{\sqrt{n_{i n} + n_{o u t}}}, \frac{\sqrt{6}}{\sqrt{n_{i n} + n_{o u t}}})

(15)

where

n_{i n}

is the number of neurons in the previous layer and

n_{o u t}

is the number of neurons in the current layer.

U (a, b)

stands for uniform distribution.

(2): Threshold initialization

The threshold term adjusts the activation threshold of neurons. Common initialization strategies include zero initialization

(b = 0)

, which helps prevent bias from affecting gradient updates, and small random number initialization

(b ~ U (- ε, ε))

, which prevents all neurons from learning identical features simultaneously, thereby improving training efficiency.

In this study, small random number initialization is used, where the hidden layer threshold

b_{1} \in R^{14}

and output layer threshold

b_{2} \in R^{6}

follow a uniform distribution:

b_{1} ~ U (- 0.1, 0.1), b_{2} ~ U (- 0.1, 0.1)

(16)

4.3. BP Neural Network Diagnostic Results

After the network training is completed, the remaining 300 sets of test data are input into the trained network for prediction. Figure 9 shows the confusion matrix of the BP model for classifying the test samples, and the results show that the accuracy of the BP neural network in identifying normal operation and inverter open-circuit faults is 100%; however, there is some error in diagnosing turn-to-turn short-circuit, phase-to-phase short-circuit, and loss-of-magnetism faults and rotor eccentricity.

The randomness of the initial weights and thresholds of the BP neural network, overfitting phenomenon, insufficient generalization ability, poor convergence, and the tendency to fall into local minima may lead to lower overall accuracy.

5. GA-BP Diagnostic Model Design

In order to overcome the shortcomings of a traditional BP neural network in terms of poor convergence and ability to fall into local minima, this paper introduces a genetic algorithm to search for the optimal values of its initial weights and thresholds. After the genetic algorithm finds the optimal solution through global search, the optimized weights and thresholds are given to the BP neural network to improve the training and prediction performance of the model.

Figure 10 shows the overall process of the GA-BP neural network, in which the left side is the optimization part of the genetic algorithm, and the right side is the training and prediction part of the BP neural network. The specific steps are as follows:

(1) Determining the topology and parameters of the BP neural network is a prerequisite for optimization using genetic algorithms [26]. In this paper, the initial weights and thresholds of the BP network are optimized, and the network structure is consistent with the previous paper: the number of nodes in the input layer is

N_{1} = 8

, the hidden layer

N_{2} = 14

, and the output layer

N_{3} = 6

.

(2) Choosing the appropriate coding method and coding length is a key step in optimizing the BP neural network. In this paper, real number coding is used, where a set of real numbers represents an individual, and the length of the chromosome is determined by the network structure, as shown in Figure 11. Since the network structure is “8-14-6”, there are 8 × 14 + 14 × 6 = 196 weights and 14 + 6 = 20 thresholds, so the individual coding length is 216.

(3) Calculate the fitness value of each individual and quantitatively evaluate its performance advantages and disadvantages, so as to select the good individuals to be inherited in the next generation. The genetic operation is executed according to the crossover probability and mutation probability set at the initialization, and the optimal individual is finally saved after several iterations until the expected performance is achieved or the termination conditions are satisfied. The fitness function, selection, crossover, and mutation methods in this paper are shown below:

Fitness function

After the weights and thresholds encoded in the individual are given to the network, the training data are utilized and the mean square error is used as the fitness value of the individual. The fitness function

F

is expressed as follows:

F = \sum_{i = 1}^{N} {(y_{i} - o_{i})}^{2}

(17)

where

N

is the total number of training samples, and

y_{i}

and

o_{i}

denote the expected and predicted values of output node

i

, respectively.

Selection operation

Each individual is quantitatively evaluated according to the fitness function, and the higher the fitness value, the higher the probability that the individual is selected into the new group. In this paper, we adopt the simple and widely used roulette method, in which the probability of an individual being selected is proportional to its fitness value. The expression for the probability

P_{i}

that the

i

th individual is selected is as follows:

P_{i} = \frac{f_{i}}{\sum_{i = 1}^{n} f_{i}} (i = 1, 2, \dots, n)

(18)

where

f_{i}

is the fitness value of individual

i

, and

n

is the total number of individuals in the population.

Crossover operation

In this paper, we adopt the real fusion crossover method for data crossover by randomly selecting two individuals and fusing their codes at a random location to generate a new individual. The specific expression is as follows:

\{\begin{matrix} a_{k j} = a_{k j} (1 - b) + a_{l j} b \\ a_{l j} = a_{l j} (1 - b) + a_{k j} b \end{matrix}

(19)

where

b

is a random number between [0, 1],

l

represents the crossover position, and

a_{k}

and

a_{j}

are the

k

th and

j

th chromosomes.

Mutation operation

Corresponding to the real number coding, the mutation method adopted in this paper is real number mutation, selecting the

n

th gene

a_{m n}

of individual

m

for mutation; the specific expression is as follows:

a_{m n} = \{\begin{matrix} a_{m n} + (a_{m n} - a_{m a x}) \cdot r_{1} \cdot (1 - \frac{g}{G_{m a x}}) & r_{2} > 0.5 \\ a_{m n} + (a_{m i n} - a_{m n}) \cdot r_{1} \cdot (1 - \frac{g}{G_{m a x}}) & r_{2} \leq 0.5 \end{matrix}

(20)

where

a_{m a x}

and

a_{m i n}

are the upper and lower bounds of the encoding,

g

denotes the current number of evolution,

G_{m a x}

is the maximum number of iterations,

r_{1}

is a random number, and

r_{2}

is a random number between 0–1.

(4) The optimal individual contains the optimal initialization connection weights and thresholds of the network. These initialization conditions are assigned to the BP network through the decoding operation, and the error is calculated according to the training process of the BP neural network until the termination conditions are satisfied.

(5) The initialization parameters of the genetic algorithm include population size, maximum iteration number, crossover rate, and mutation rate. After several experiments and comparisons, the population size M = 30, the number of iterations T = 200, the crossover rate Pe = 0.9, and the mutation rate Pm = 0.01 were finally selected.

6. Results and Discussion

6.1. GA-BP Network Diagnostic Results

In the optimization process of the genetic algorithm, the total number of iterations is set to 200. When the 161st generation is reached, the adaptation value is close to the optimal. First, a global optimization search is performed according to the initialization parameters of the genetic algorithm, and then the neural network is trained using the optimal weights and thresholds obtained. Randomly select 800 sets of samples as training data, and after 12 times of training, the network converges and reaches the set error, and the training is finished.

After the network training is completed, the remaining 300 sets of test data are fed into the trained GA-BP model for prediction. Figure 11 shows the classification results of the test samples, where the red triangles indicate the predicted fault categories, the blue circles indicate the real fault categories, and their overlap indicates correct classification.

Figure 12 demonstrates the confusion matrix of the GA-BP model for classifying 300 sets of test samples. It can be seen that 36 normal samples, 45 lost rotor eccentricity samples, and 52 inverter open-circuit samples are correctly identified; of the 57 turn-to-turn short-circuit samples, 2 are misclassified as phase-to-phase shorts, and the remaining 55 are correctly categorized; of the 43 phase-to-phase short-circuit samples, 1 is misclassified as turn-to-turn short-circuit, and the remaining 42 are correctly categorized; and of the 67 loss-of-magnetism fault samples, 3 are misclassified as rotor eccentricity, and the remaining 64 are correctly categorized.

6.2. Comparison of Diagnostic Results Between GA-BP Algorithm and SSA-PNN Algorithm

The method proposed in this study is compared with SSA-PNN (Sparrow Search Optimized Probabilistic Neural Network), and the comparison results are shown in Table 5. GA-BP (Genetic Algorithm-Optimized Backpropagation Neural Network) optimizes the initial weights and thresholds of the BP neural network using a genetic algorithm (GA), enhancing the model’s global search capability and mitigating the tendency of BP networks to become trapped in local optima. As a result, GA-BP improves both the accuracy and generalization of fault classification.

Experimental results indicate that for complex faults such as rotor eccentricity and demagnetization, the F1 scores of GA-BP reach 97.0% and 97.7%, respectively, outperforming SSA-PNN (95.0% and 93.8%). Additionally, GA-BP achieves F1 scores exceeding 96.0% for turn-to-turn and phase-to-phase short-circuit faults, demonstrating superior fault identification capability and robustness. Overall, GA-BP exhibits strong performance across most fault categories, particularly in classifying complex nonlinear faults. It effectively extracts fault features in a stable and efficient manner, thereby improving diagnostic accuracy and reliability.

However, GA-BP has certain limitations. Since genetic algorithms involve computationally intensive operations such as population initialization, selection, crossover, and mutation, the training process is time-consuming, potentially limiting its applicability in real-time systems or resource-constrained environments. Moreover, GA-BP is highly sensitive to hyperparameter settings, including crossover rate, mutation rate, and population size. Improper tuning of these parameters may result in unstable optimization, affecting convergence speed and final classification accuracy. Therefore, careful parameter tuning is required to maintain GA-BP’s high diagnostic performance under varying operating conditions.

Furthermore, when dealing with high-dimensional and complex data, GA-BP may still encounter local optima despite its improved global search capability. In cases of highly complex and noisy data distributions, it may suffer from unstable convergence or overfitting. To enhance its robustness, GA-BP could be integrated with adaptive genetic algorithms, dynamic learning rate mechanisms, or deep learning techniques to improve fault diagnosis accuracy and adaptability. Additionally, to address its high computational complexity, distributed computing or parallel optimization techniques could be employed to enhance training efficiency and reduce computational costs, making GA-BP more suitable for industrial applications.

7. Conclusions

By combining fault modeling, wavelet packet feature extraction, and genetic algorithm optimization of the BP neural network, the GA-BP model proposed in this paper effectively improves the accuracy and robustness of permanent magnet synchronous motor fault diagnosis. First, representative time–frequency features are extracted from the fault signal by wavelet packet transform, which can better reflect the state of the motor fault. Then, the initial weights and thresholds of the BP neural network are optimized using a genetic algorithm, which effectively avoids the local optimum problem that may occur in the traditional BP network and improves the convergence speed and classification performance of the model. The results show that the GA-BP model exhibits significant classification accuracy enhancement on most fault categories, especially in the identification of normal operation, inverter faults, and demagnetization faults, with the accuracy and recall reaching 100%. Compared with the SSA-PNN algorithm, the performance of the GA-BP model in the categories of turn-to-turn short-circuit and phase-to-phase short-circuit is also significantly improved, and the increase in the F1 score further proves its effectiveness. In summary, the BP neural network combined with wavelet packet feature extraction and genetic algorithm optimization has an important application prospect in electric motor fault diagnosis, which can effectively reduce leakage detection and misjudgment and improve the stability and reliability of the system.

Author Contributions

Conceptualization, M.Y. and R.G.; methodology, M.Y., R.G. and Z.P.; software, R.G.; validation, R.G., Z.P. and W.W.; formal analysis, M.Y.; investigation, M.Y.; resources, M.Y.; data curation, R.G. and Z.P.; writing—original draft preparation, R.G.; writing—review and editing, M.Y., R.G. and K.J.; visualization, R.G.; supervision, M.Y.; project administration, M.Y.; funding acquisition, M.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Science and Technology Innovation Key R&D Program of Chongqing (No. CSTB2022TIAD-STX0005).

Data Availability Statement

The original contributions presented in this study are included in the article, and further inquiries can be directed to the corresponding author.

Conflicts of Interest

Zhiyuan Peng was employed by the company Chongqing Tsingshan Industrial Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Soualhi, A.; Clerc, G.; Razik, H. Detection and Diagnosis of Faults in Induction Motor Using an Improved Artificial Ant Clustering Technique. IEEE Trans. Ind. Electron. 2013, 60, 4053–4062. [Google Scholar] [CrossRef]
Skowron, M.; Wolkiewicz, M.; Tarchala, G. Stator Winding Fault Diagnosis of Induction Motor Operating under the Field-Oriented Control with Convolutional Neural Networks. Bull. Pol. Acad. Sci.-Tech. Sci. 2020, 68, 1039–1048. [Google Scholar] [CrossRef]
Soualhi, M.; Nguyen, K.T.P.; Soualhi, A.; Medjaher, K.; Hemsas, K.E. Health Monitoring of Bearing and Gear Faults by Using a New Health Indicator Extracted from Current Signals. Measurement 2019, 141, 37–51. [Google Scholar] [CrossRef]
Skowron, M.; Orlowska-Kowalska, T.; Kowalski, C.T. Detection of Permanent Magnet Damage of Pmsm Drive Based on Direct Analysis of the Stator Phase Currents Using Convolutional Neural Network. IEEE Trans. Ind. Electron. 2022, 69, 13665–13675. [Google Scholar] [CrossRef]
Estima, J.O.; Cardoso, A.J.M. A New Algorithm for Real-Time Multiple Open-Circuit Fault Diagnosis in Voltage-Fed Pwm Motor Drives by the Reference Current Errors. IEEE Trans. Ind. Electron. 2012, 60, 3496–3505. [Google Scholar] [CrossRef]
Khlaief, A.; Boussak, M.; Gossa, M. Open Phase Faults Detection in Pmsm Drives Based on Current Signature Analysis. In Proceedings of the The XIX International Conference on Electrical Machines-ICEM 2010, Rome, Italy, 25 October 2010; pp. 1–6. [Google Scholar]
Kenny, A.; Palazzolo, A.B. Comparison of the Dynamic Response of Radial and Tangential Magnetic Flux Thrust Bearings. IEEE/ASME Trans. Mechatronics 2002, 7, 61–66. [Google Scholar] [CrossRef]
Rosero, J.; Romeral, L.; Ortega, J.; Urresty, J. Demagnetization Fault Detection by Means of Hilbert Huang Transform of the Stator Current Decomposition in Pmsm. In Proceedings of the 2008 IEEE International Symposium on Industrial Electronics, Cambridge, UK, 30 June–2 July 2008; pp. 172–177. [Google Scholar]
Çira, F.; Arkan, M.; Gümüş, B.; Goktas, T. Analysis of Stator Inter-Turn Short-Circuit Fault Signatures for Inverter-Fed Permanent Magnet Synchronous Motors. In Proceedings of the IECON 2016-42nd Annual Conference of the IEEE Industrial Electronics Society, Florence, Italy, 23–26 October 2016; pp. 1453–1457. [Google Scholar]
Shabbir, M.N.S.K.; Liang, X.; Chakrabarti, S. An Anova-Based Fault Diagnosis Approach for Variable Frequency Drive-Fed Induction Motors. IEEE Trans. Energy Convers. 2020, 36, 500–512. [Google Scholar] [CrossRef]
Mohammed, O.; Abed, N.; Ganu, S. Modeling and Characterization of Induction Motor Internal Faults Using Finite-Element and Discrete Wavelet Transforms. IEEE Trans. Magn. 2006, 42, 3434–3436. [Google Scholar] [CrossRef]
Espinosa, A.G.; Rosero, J.A.; Cusido, J.; Romeral, L.; Ortega, J.A. Fault Detection by Means of Hilbert–Huang Transform of the Stator Current in a Pmsm with Demagnetization. IEEE Trans. Energy Convers. 2010, 25, 312–318. [Google Scholar] [CrossRef]
Li, Y.; Liang, Y. The Correlation Analysis of Pm Inter-Turn Fault Based on Stator Current and Vibration Signal. In Proceedings of the 2015 IEEE International Conference on Mechatronics and Automation (ICMA), Beijing, China, 2–5 August 2015; pp. 1733–1737. [Google Scholar]
Ribeiro Junior, R.F.; dos Santos Areias, I.A.; Campos, M.M.; Teixeira, C.E.; da Silva, L.E.B.; Gomes, G.F. Fault Detection and Diagnosis in Electric Motors Using Convolution Neural Network and Short-Time Fourier Transform. J. Vib. Eng. Technol. 2022, 10, 2531–2542. [Google Scholar] [CrossRef]
Zhang, Z.; Wang, Y.; Wang, K. Fault Diagnosis and Prognosis Using Wavelet Packet Decomposition, Fourier Transform and Artificial Neural Network. J. Intell. Manuf. 2013, 24, 1213–1227. [Google Scholar] [CrossRef]
Yu, W.; Ruochen, L.; Guangjun, L. Neural Network Bogie Bearing Fault Diagnosis Based on Improved Genetic Algorithm. Urban Rail Transit Res. 2020, 23, 46–49. [Google Scholar]
Li, Y.; Hu, X.; Yao, Z.; Zhang, Y. Fault Diagnosis of Fire Control System Based on Genetic Algorithm Optimized Bp Neural Network. In Proceedings of the 2022 Prognostics and Health Management Conference (PHM-2022 London), London, UK, 27–29 May 2022; pp. 11–16. [Google Scholar]
Liu, J.; Kong, X.; Xia, F.; Bai, X.; Wang, L.; Qing, Q.; Lee, I. Artificial Intelligence in the 21st Century. IEEE Access 2018, 6, 34403–34421. [Google Scholar] [CrossRef]
Long, Q.; Guo, S.; Li, Q.; Sun, Y.; Li, Y.; Fan, Y. Research of Converter Transformer Fault Diagnosis Based on Improved Pso-Bp Algorithm. In Proceedings of the IOP Conference Series: Materials Science and Engineering, Singapore, 28–30 July 2017; p. 012015. [Google Scholar]
Deng, H.; Zhang, W.-x.; Liang, Z.-f. Application of Bp Neural Network and Convolutional Neural Network (Cnn) in Bearing Fault Diagnosis. In Proceedings of the IOP Conference Series: Materials Science and Engineering, Shaanxi, China, 8–11 October 2020; p. 042026. [Google Scholar]
Jiang, Q.; Huang, R.; Huang, Y.; Chen, S.; He, Y.; Lan, L.; Liu, C. Application of Bp Neural Network Based on Genetic Algorithm Optimization in Evaluation of Power Grid Investment Risk. IEEE Access 2019, 7, 154827–154835. [Google Scholar] [CrossRef]
Li, Y.; Wang, Y.; Zhang, Y.; Zhang, J. Diagnosis of Inter-Turn Short Circuit of Permanent Magnet Synchronous Motor Based on Deep Learning and Small Fault Samples. Neurocomputing 2021, 442, 348–358. [Google Scholar] [CrossRef]
Zhang, X.; Zhu, J.; Wu, Y.; Zhen, D.; Zhang, M. Feature Extraction for Bearing Fault Detection Using Wavelet Packet Energy and Fast Kurtogram Analysis. Appl. Sci. 2020, 10, 7715. [Google Scholar] [CrossRef]
Shao, R.; Hu, W.; Wang, Y.; Qi, X. The Fault Feature Extraction and Classification of Gear Using Principal Component Analysis and Kernel Principal Component Analysis Based on the Wavelet Packet Transform. Measurement 2014, 54, 118–132. [Google Scholar] [CrossRef]
Li, J.; Cheng, J.-h.; Shi, J.-y.; Huang, F. Brief Introduction of Back Propagation (Bp) Neural Network Algorithm and Its Improvement. In Proceedings of the Advances in Computer Science and Information Engineering, Zhengzhou, China, 19–20 May 2012; Volume 2, pp. 553–558. [Google Scholar]
Yin, F.; Mao, H.; Hua, L. A Hybrid of Back Propagation Neural Network and Genetic Algorithm for Optimization of Injection Molding Process Parameters. Mater. Des. 2011, 32, 3457–3464. [Google Scholar] [CrossRef]

Figure 1. PMSM fault simulation model.

Figure 2. Equivalent circuit diagram of B-phase turn-to-turn short circuit.

Figure 3. Schematic diagram of three-phase fault module and its internal structure.

Figure 4. Inverter open circuit fault topology diagram.

Figure 5. Schematic diagram of rotor eccentricity fault model and its air gap variation.

Figure 6. Schematic diagram of three-layer wavelet packet decomposition.

Figure 7. Energy distribution diagram.

Figure 8. Trial process for the number of nodes in the hidden layer.

Figure 9. BP network confusion matrix.

Figure 10. Flowchart of GA-BP algorithm.

Figure 11. Classification results of GA-BP network.

Figure 12. Confusion matrix of GA-BP network.

Table 1. Motor parameters.

Parameters	Numerical Value
Rated power (kW)	120
Rated voltage (V)	320
$Pole pair number p_{n}$	2
$D - axis inductance L_{d}$ (mH)	0.47
$Q - axis inductance L_{q}$ (mH)	0.18
Number of stator slots	24

Table 2. Motor failure simulation working condition settings.

Working Condition	Speed (r/min)	Torque (N-m)
1	5000	244
2	5000	122
3	5000	61
4	4000	244
5	4000	122
6	4000	61
7	3000	244
8	3000	122
9	3000	61
10	2000	244
11	2000	122
12	2000	61

Table 3. Target outputs corresponding to different failure modes.

Failure Mode	Target Output
Normal operation	1	0	0	0	0	0
Turn-to-turn short circuit	0	1	0	0	0	0
Phase-to-phase short circuit	0	0	1	0	0	0
De-magnetization fault	0	0	0	1	0	0
Inverter open circuit	0	0	0	0	1	0
Rotor eccentricity	0	0	0	0	0	1

Table 4. Number of weights and thresholds.

Input Layer and Implicit Layer Connection Weights	Implicit Layer Threshold	Implicit layer and Output Layer Connection Weights	Output Layer Threshold
112	14	84	6

Table 5. Comparison of evaluation indexes of GA-BP and SSA-PNN fault diagnosis algorithms.

Fault Category	Model	Classification Accuracy (%)	Recall Rate (%)	F1 Score
Normal operation	BP	100.0	100.0	100.0
	GA-BP	100.0	100.0	100.0
	SSA-PNN	95.0	95.0	95.0
Turn-to-turn short circuit	BP	91.1	89.1	90.1
	GA-BP	98.2	96.5	97.3
	SSA-PNN	97.2	94.6	95.6
Phase-to-phase short circuit	BP	91.4	93.0	92.2
	GA-BP	95.5	97.7	96.6
	SSA-PNN	93.8	92.5	93.2
De-magnetization fault	BP	88.7	96.5	92.5
	GA-BP	100.0	95.5	97.7
	SSA-PNN	91.8	95.3	93.8
Inverter open circuit	BP	100.0	100.0	100.0
	GA-BP	100.0	100.0	100.0
	SSA-PNN	100.0	100.0	100.0
Rotor eccentricity	BP	95.9	87.0	91.3
	GA-BP	94.0	100.0	97.0
	SSA-PNN	93.6	96.4	95.0

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Published by MDPI on behalf of the World Electric Vehicle Association. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ye, M.; Gong, R.; Wu, W.; Peng, Z.; Jia, K. Fault Diagnosis of Permanent Magnet Synchronous Motor Based on Wavelet Packet Transform and Genetic Algorithm-Optimized Back Propagation Neural Network. World Electr. Veh. J. 2025, 16, 238. https://doi.org/10.3390/wevj16040238

AMA Style

Ye M, Gong R, Wu W, Peng Z, Jia K. Fault Diagnosis of Permanent Magnet Synchronous Motor Based on Wavelet Packet Transform and Genetic Algorithm-Optimized Back Propagation Neural Network. World Electric Vehicle Journal. 2025; 16(4):238. https://doi.org/10.3390/wevj16040238

Chicago/Turabian Style

Ye, Ming, Run Gong, Wanjun Wu, Zhiyuan Peng, and Kelin Jia. 2025. "Fault Diagnosis of Permanent Magnet Synchronous Motor Based on Wavelet Packet Transform and Genetic Algorithm-Optimized Back Propagation Neural Network" World Electric Vehicle Journal 16, no. 4: 238. https://doi.org/10.3390/wevj16040238

APA Style

Ye, M., Gong, R., Wu, W., Peng, Z., & Jia, K. (2025). Fault Diagnosis of Permanent Magnet Synchronous Motor Based on Wavelet Packet Transform and Genetic Algorithm-Optimized Back Propagation Neural Network. World Electric Vehicle Journal, 16(4), 238. https://doi.org/10.3390/wevj16040238

Article Menu

Fault Diagnosis of Permanent Magnet Synchronous Motor Based on Wavelet Packet Transform and Genetic Algorithm-Optimized Back Propagation Neural Network

Abstract

1. Introduction

2. PMSM Fault Modeling

3. Fault Feature Extraction

4. BP Neural Network Fault Diagnosis

4.1. BP Neural Network Topology Design

4.2. Initialize Weights and Thresholds

4.3. BP Neural Network Diagnostic Results

5. GA-BP Diagnostic Model Design

6. Results and Discussion

6.1. GA-BP Network Diagnostic Results

6.2. Comparison of Diagnostic Results Between GA-BP Algorithm and SSA-PNN Algorithm

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI