1. Introduction
Induction machines (IMs) are among the most widely used power devices globally due to their robustness, low cost, ease of control, and high efficiency. Ensuring their continuous and safe operation is crucial for maintaining substantial economic benefits and reducing downtime. However, these machines are subject to various stresses contributing to their degradation and aging. These stresses are categorized under the TEAM (Thermal, Electrical, Ambient, and Mechanical) model [
1].
An induction motor comprises multiple mechanical and electrical components, including the motor frame, stator windings, rotor cage, rolling bearings, fan, and rotor shaft. Despite their inherently robust design, induction motors are exposed to external stressors [
2]. Short-circuit faults between windings are the most prevalent among the different types of motor faults. Such faults originate from the deterioration of insulating materials, ultimately leading to insulation breakdown between windings and incipient inter-turn short-circuit faults. Early detection of inter-turn short-circuits is essential to prevent further motor damage and mitigate economic losses [
3].
Empirical studies [
2,
4,
5,
6,
7] indicate that bearing faults account for approximately 40% of all failures in induction motors, with some research suggesting an even higher rate for small motors. Additionally, studies conducted by the Institute of Electrical and Electronics Engineers (IEEE) and the Electric Power Research Institute (EPRI) report that stator faults constitute 36% and 28% of IM failures, respectively. Furthermore, rotor faults account for 8–10% of induction motor failures [
2,
4]. Given these statistics, the early detection of such incipient faults has been a significant focus of research to prevent unexpected production downtimes and economic losses.
Over the past decades, various fault detection techniques have been developed, including vibration analysis, temperature monitoring, magnetic field analysis, current analysis, acoustic noise assessment, and torque profiling [
2,
6,
7]. Among these methods, current analysis is widely adopted due to its effectiveness in fault detection, which this study utilizes. Artificial intelligence (AI) techniques have increasingly been integrated into these methodologies in recent years to enhance detection accuracy [
7,
8,
9].
Several AI-based techniques have been extensively applied in IM fault diagnosis, including artificial neural networks (ANNs), support vector machines (SVM), genetic algorithms (GAs), fuzzy logic, adaptive neuro-fuzzy inference systems (ANFIS), and hybrid models. ANN, in particular, is a powerful data processing system capable of learning input–output relationships from training data. ANN comprises processing elements (neurons) interconnected through weighted links that enable information sharing [
7,
10].
Artificial neural networks (ANNs) are widely recognized for their capacity to learn, approximate, and analyze complex problems. Single-layer perceptron (SLP) networks, consisting of only input and output layers, represent the simplest form of ANNs but are inadequate for handling nonlinearly separable patterns. To overcome these limitations, researchers have developed multilayer perceptron (MLP) networks, the most frequently employed ANN model in the literature. MLPs offer advantages such as high learning potential, noise robustness, nonlinearity adaptability, parallel processing capability, fault tolerance, and excellent generalization capacity [
11].
Recent studies on IM fault detection employ diverse techniques, ranging from mathematical analyses to AI-based methods. Researchers have explored various ANN-based approaches for fault identification. For instance, Khanjani and Ezoji [
12] investigated the classification of thermal patterns of electrical faults in three-phase IMs as a non-invasive and non-destructive diagnostic approach, employing SVM due to its strong generalization capability. Osornio et al. [
13] proposed a methodology for processing electric current signals using the short-time Fourier transform (STFT). The diagnostic system was developed using digital hardware implementations, integrating an FPGA with a microprocessor (PYNQ Z2, Xilinx, Inc., San Jose, CA, USA) for efficient digital signal and image processing. Their method demonstrated high classification accuracy for various IM faults, including healthy (HLT), misalignment (MAMT), unbalance (UNB), damaged bearing (BDF), and broken rotor bar (BRB) conditions.
Aib et al. [
14] examined the implementation of fuzzy logic on Field Programmable Gate Array (FPGA) (Virtex-4, Xilinx, Inc., San Jose, CA, USA) based programmable logic circuits for diagnosing phase unbalance and phase failure in asynchronous machines. Khoualdia et al. [
15] developed a structured and optimized MLP-ANN model for online IM condition monitoring. Their ANN utilized eight indicators representing different motor states (healthy, broken rotor bars, bearing fault, and misalignment), while the output layer employed a coded matrix. The model demonstrated high accuracy. Ewert et al. [
16] presented a low-cost system for online IM-bearing condition monitoring and fault diagnosis using neural networks (NNs) in radial vibration measurements.
Bazan et al. [
17] proposed an approach for detecting stator winding short-circuit faults using phase-shifted stator current measurements. Their MLP-based online detection system was implemented in Matlab using a data acquisition card (USB-6221, National Instruments, Inc., Austin, TX, USA). Additionally, Bazan et al. [
18] developed a two-stage diagnosis method based on mutual information measurements of current signals, principal component analysis, and intelligent systems, achieving high classification accuracies.
Li et al. [
19] introduced an improved selective set deep learning method with beetle antenna search for enhancing bearing fault diagnostic performance. Their findings suggest that this approach outperforms conventional base models and ensemble learning methods in accuracy and robustness.
In summary, winding failures represent IMs’ second most common incipient faults. Existing fault detection methodologies for AC motors exhibit certain limitations, such as being primarily implemented in simulations, requiring invasive sensors, lacking in situ measurement capabilities, or having complex digital implementations. Few studies focus on developing specialized systems for real-time processing, online and in situ detection of short-circuit faults in IM windings. Most available solutions rely on personal computers, data acquisition cards (DAQ), and invasive sensors, highlighting the need for a dedicated system capable of real-time, efficient fault detection.
Reconfigurable computing, mainly through Field-Programmable Gate Arrays (FPGAs), offers a promising solution. FPGAs balance high-performance hardware and software flexibility, allowing post-manufacturing reprogramming to accommodate application-specific requirements. Compared to Application-Specific Integrated Circuits (ASICs), FPGAs offer cost efficiency, enhanced performance, and reduced power consumption [
20]. Additionally, advancements in FPGA-based deep learning accelerators have demonstrated substantial performance improvements in computational efficiency [
21].
This study aims to develop a non-invasive, purpose-specific system for real-time and online detection of short-circuit faults in induction motors. The proposed system incorporates current descriptors processed through an MLP neural network for fault classification. The primary advantages of this system include:
- -
High-speed, energy-efficient processing tailored to purpose-specific diagnostic tasks.
- -
Simple and reconfigurable digital implementation.
- -
FPGA-based architecture, enabling hardware upgrades and reuse while maintaining low cost and power consumption.
- -
Portability and practicality for standalone, online, in situ measurements using non-invasive sensors.
- -
Real-time acquisition, storage, display, and transmission of current measurement data.
- -
Utilization of MLP-ANNs, which offer superior learning potential, noise robustness, nonlinearity adaptation, and high fault tolerance.
3. Results
3.1. Motor Current Acquisition System
The neural network requires training. Therefore, generating a database of the motor current in the different load and fault states is necessary.
The current acquisition system proposed by González et al. [
22] was used to obtain the current values. The acquired current signal passes through a conditioning circuit, where it is filtered and rectified. The sampling frequency for the current acquisition was set to 10 kHz.
Subsequently, a finite state machine was implemented on an FPGA card to manage data storage, data processing, visualization, and transmission. The state machine algorithm first acquires the current values and stores them in memory until 150,000 pieces of data are obtained. Then, a slide switch determines the status of data transmission. If the decision is not to transmit them, the data are overwritten by new values. If you decide to transmit the data, it is sent through a serial port to a PC using the RS-232 [
23] protocol configured at a transmission speed of 19,200 bauds. Finally, the user can monitor the measured effective current and type of fault calculated in situ, both processed in real time through the present standalone system, and displayed using an LCD screen (
Figure 2).
3.2. Neural Network Design
Neural networks have been widely used to detect faults in induction motors, and the multilayer perceptron has stood out for its good results in fault classification [
2,
15,
24,
25,
26]. This work used a multilayer perceptron with backpropagation learning. Because the multilayer perceptron works under supervised learning, the input patterns and their corresponding expected outputs must be defined for the correct network training. To shape the input patterns (Equation (1)), a descriptor model was developed and applied to the acquired stream data. This consists of performing the square of the averaged sum of
r number of samples of the half-cycles of the acquired signal (Equation (2)), forming the training patterns every six half-cycles (
Figure 3).
where
P is the training pattern,
n is the number of patterns,
Cj is the ridge descriptor, and
x represents the half-cycle of the acquired current signal.
After applying the descriptor model to the dataset, 1788 patterns were formed for each failure, thus obtaining the input patterns to the six-dimensional neural network. Therefore, the neural network comprises six neurons in the input layer and six in the output layer, one for each type of failure. So, the architecture is a fully connected network with six nodes at the input, with nodes inside the intermediate layers, and with six nodes at the outputs. The procedure for calculating the number of neurons in the intermediate layers follows the expression suggested in Matlab (R2021b, Mathtworks, Natick, MA, USA) (Equation (3)):
where, using trial and error, we find the number from 1 to
n that will give the number of neurons in that hidden layer, determined by the minimum mean square error. Two hidden layers were used, the first with 15 neurons and the second with 22 neurons.
The hyperbolic tangent activated all neurons in the multilayer perceptron’s training stage. In the network’s implementation, the activation function of the output layer was changed to a linear function, where each output neuron represents one kind of failure. The most active output neuron indicates the type of failure the motor presents.
The network used for this proposal is second-generation and has the same convergence principle as the deep learning framed networks, such as backpropagation networks. However, these networks require much more significant amounts of memory and processing time than the network used in this work. In addition, convolutional networks focus on images, and deep networks focus on massive data inputs, which is not the case in this proposal, where we provide only six inputs.
Calculation of Training Patterns
Always remember that only the minimum peaks of a half-cycle are needed to sum up the current values. Therefore, the current signal is multiplied by a negative integer (−1) to generate these minimum peaks, thus obtaining the inverted signal (
Figure 4).
Once the inverted current signal is obtained, the training patterns are calculated. To calculate the training patterns and describe the model (Equation (2)), the first step is to locate the maximum current peaks of the signal half-cycles obtained during the acquisition stage. This can be achieved using the command “[PKS, LOCS] = findpeaks (data vector)” available in the Matlab tool. This command helps to find the maximum peaks (PKS) and their locations (LOCS) in a data vector.
Once the minimum current peaks are located, the generated vector is analyzed, and Equation (2) is subsequently implemented. The code to perform this calculation in each half-cycle is achieved with a double for cycle, one for the complete vector, and the other for each half-cycle within the vector. This code calculates the vector of descriptors, where the training patterns are formed for every six values. This process is performed for each of the 36 datasets. Thus, the 1788 patterns were formed by class and later divided into 70% for training and 30% for testing.
Once the number of neurons in the hidden layers is defined, as well as the training and testing patterns, with the support of the propagation algorithm, all layers’ weight and threshold vectors are initialized randomly. Subsequently, the algorithm allows controlling the epochs and each pattern with which the network will be trained. Within this last cycle are the three most important steps of the backpropagation algorithm: forward propagation of the input stimulation, backward propagation of the error in each layer, and finally, updating the synaptic weights of each layer. These cycles are repeated up to the number of epochs the user chooses. The root mean square error is plotted during training. Once the epochs have ended, it is verified if the network has been trained correctly. The test patterns are presented, and the percentage of network successes is calculated. As mentioned in
Section 3.2, there is a neuron for each type of failure, so the most active neuron indicates the type of failure presented in the motor. According to the above information, if neuron 1 is the most active, there are no faults; on the contrary, if the most active neuron is between 2 and 6, it represents that 1 to 5 loops are shorted, respectively.
Once the neural network is trained, a matrix with fixed weights is generated. Therefore, the accuracy values are subject to statistical evaluation. So, this accuracy was obtained by employing a statistical K-fold test. K-Fold cross-validation is a widely used technique in machine learning and statistics to evaluate the performance of predictive models. Its application is especially valuable in contexts where the amount of data is limited, as it allows for a more robust and reliable assessment of model performance. Through an iterative process, each observation in the dataset is used in both the training and testing phases, thus providing a comprehensive assessment of the model’s generalization ability [
27]. Compared to traditional validation methods, K-Fold cross-validation offers several advantages:
- -
More accurate performance estimation: By averaging the results obtained in multiple iterations, variability in the evaluation of the model is reduced, providing a more reliable measure of its performance.
- -
Efficient use of data: Each dataset instance is used for training and testing, which is particularly useful in small datasets.
- -
Reduction in bias and variance: A more stable and representative evaluation is obtained by reducing the dependence on the initial partition of the data.
- -
Improved generalization capacity: It more accurately evaluates the model’s performance on unseen data, reducing the risk of overfitting.
- -
Versatility: It can be applied to various models, including regression, classification, and clustering algorithms, making it a widely adopted strategy in data science.
Given its ability to provide more reliable evaluations and applicability to different models, K-Fold cross-validation is considered an essential technique in developing and optimizing AI models.
3.3. Implementation of the Neural Network on FPGA
FPGAs have advantages such as high-speed processing, high reconfigurability, and parallel acquisition, which ensures that all variables of interest are taken simultaneously. They are reprogrammable devices that create embedded systems, code portability, better architecture, sufficient input–output ports required for the application, and reconfigurable architecture. They are also a purpose-specific system. Due to these advantages, FPGAs are currently used in applications where high-demand computational resources and fast data acquisition are necessary, thus gaining great popularity [
28].
The implementation of the neural network in the FPGA is divided into four stages:
- -
Non-invasive and on-line current acquisition;
- -
In situ and standalone pattern creation;
- -
In situ and standalone pattern processing by the network;
- -
In situ display of the motor status on the LCD screen.
In the first stage, non-invasive and on-line current acquisition, a finite state machine controls the process every moment. As the online processing is performed pattern by pattern, 700 current samples are taken and stored in RAM in fixed-point binary vectors using an unsigned Q
12,10 format (12 bits for integers and 10 bits to represent the fractional part). In the second stage, in situ and standalone pattern creation, the current descriptor must be applied to the previously acquired signal. A moving window scrolls through the acquired data, comparing the magnitude of all samples in the window and allowing peaks to be found from a threshold. The address of the minimum value is stored in RAM. This process is repeated until the seven peaks corresponding to the six patterns are found. The moving window depends on the acquisition frequency of the system, as well as the size of the samples that are taken between peaks. It is essential to have an adequate choice regarding the window size, since if larger or smaller windows are considered, the minimums will be incorrect (
Figure 5, red windows). In the case of this project, 84 samples were taken based on moving window size (yellow window), as shown in
Figure 5.
Once all the directions of the peak values have been found, we continue implementing Equation (2). The resulting pattern is saved in RAM. The logic inputs and flags of the finite state machine designed in the FPGA control this process. The six pattern attributes are calculated in fixed-point binary vectors using an unsigned Q32,24 format (32 bits for the integer section and 24 bits for the fractional part).
The third stage, in situ and standalone pattern processing by the network, evaluates the pattern generated by the neural network. As mentioned above, the network comprises six neurons in the input layer, fifteen in the first hidden layer, twenty-two in the second hidden layer, and six in the output layer. The activation function used in the hidden layer neurons is a tangential sigmoid, while it is a linear function for the output layer. The architecture of the network designed in FPGA is simplified in the diagram in
Figure 6. The synaptic weights and polarizations obtained in the training stage using the Matlab tool are stored in ROM memories. On the other hand, the calculation of each of the neurons in the hidden layer is stored in RAM. The calculation of the output layer neurons is recorded and compared, thus determining the number of the most active neurons and, consequently, the fault classification. A control machine (CM) is responsible for propagating the pattern through the different blocks of the Arithmetic Logic Unit (ALU).
The ALU handles the mathematical calculations and evaluates the activation function (
Figure 7). The ALU consists of a multiplier (weights ∗ inputs), an accumulator for adding all weights by the inputs arriving at a neuron, and an adder for the bias. Finally, it is evaluated in the hyperbolic tangent activation function.
The digital resource requirements for synthesizing the sigmoid function on an FPGA depend on the chosen mathematical approach. The direct hardware implementation of the tangential sigmoid activation function is impractical due to its inherent division and exponential operations, which demand excessive logic resources and exhibit slow convergence, thereby increasing processing time. To address this limitation, this study employs a polynomial approximation method,
H(
x), to represent the sigmoid function, as described in Equation (4):
In this approach,
Pn1 and
Pn2 correspond to the polynomial approximations for the positive and negative sections of the function, respectively.
Table 3 presents the coefficients of the eighth-order polynomial, obtained using Matlab’s
polyfit function. These coefficients were subsequently transformed and approximated into fixed-point binary vectors using a signed Q
32,31 format (1 bit for the sign, 31 bits for the integer part, and 31 bits for the fractional section) (
Table 4). Such binary vectors are essential for the FPGA’s VHDL description and implementation.
Equation (5) shows the resulting polynomial of order 8; however, algebraic transformations allow it to be simplified into Equation (6). This simplification allows basic operations such as addition and multiplication, thus avoiding raising the polynomial to the eighth power, which allows designing a less complex FPGA implementation.
Figure 8 shows the architecture of the hyperbolic tangent function. The execution of the architecture consists of the following. When an input value (“
x”) is presented, a comparator determines in which interval of the function (Equation (4)) this value is located. Based on this, the polynomial evaluation is not required if “
x” is more significant than 4.4 or less than −4.4. A; “1” or “−1” is obtained in the output register according to the detected condition. On the other hand, if the value of “
x” is within the interval established by Equation (4), two output bits of the comparator select the coefficients of the polynomial section to which the input belongs. Such coefficients are stored in the 18-input, 9-output multiplexer block (18–9).
Based on Equation (6), a second 9-input, 1-output multiplexer (9–1) chooses the first coefficient to be summed with a zero, then this result is multiplied by the input value “x” and consequently recorded. This process executed by the multiplexer 9–1 is repeated until the last coefficient. Once the process is finished, the total sum is recorded in the output, thus obtaining the result of the polynomial evaluation after several iterations.
The last stage of the neural network implementation consists of displaying the type of fault resulting from the network via an interface. For this purpose, a liquid crystal display (LCD) of two lines with 16 characters per line is used. The first line shows the acquired current value in real time, and the second line indicates the number of shorted turns detected by the multilayer perceptron.
The bucket sorting algorithm is among the most popular algorithms based on sorting without comparison [
29]. Therefore, a bucket sorting module was implemented in the network output to obtain a device that can detect faults with better results (
Figure 9). Its purpose is to sort the classification of various patterns into six buckets, representing the neural network’s six output classes. In the end, the bucket with the most patterns is the one that represents the motor failure. The user can arbitrarily choose the number of patterns to sort, considering that the higher the number, the longer the classification delay. In the case of this study, ten patterns processed and sorted by the network were chosen.
The sorting module was developed with seven counters. Six counters represent the buckets, which are enabled by a selector conditioned by the information coming from the output of the neural network of each pattern. The seventh counter is responsible for numbering the ten patterns to be sorted. Each time the network finishes processing a pattern, a flag conditioned by the network function (NF) activates the 10-pattern counter and one of the buckets according to the network output. Once the maximum number of patterns to be sorted in the cuvettes has been reached, a cascade of comparators chooses the cuvette containing the most patterns. This information is then saved in a log and displayed in the in situ LCD screen.
3.4. Data Collection
Six load states were implemented using the electromagnetic brake at different speeds to extract data corresponding to the induction motor current, as mentioned in
Section 3.1.
Figure 10 shows the current peaks of the measurements taken at a speed of 1745 rpm, without fault induction and with the five types of faults. This figure presents the No-Fault line (WoF), corresponding to the reference sinusoidal signal. As incipient short-circuit faults between turns are introduced in a controlled manner, a progressive attenuation in the signal amplitude is observed. This phenomenon is attributed to the generation of harmonics in the signal, which alters its behavior. Since the tests are conducted in a controlled environment, a direct correlation can be established between these changes and the presence of the motor fault. Furthermore, this figure demonstrates that these alterations are identifiable in the time domain, supporting the feasibility of temporal analysis for early fault detection. This information is recorded and classified according to the neural network processing implemented in the FPGA.
3.5. Neural Network Design Results
The developed neural network satisfactorily classified the faults with a hit rate of 88.13%. This percentage was improved to 93.18% by adding the bucket sorting algorithm. It is important to remark that the experiment was performed n-times and the values presented are averaged. The results of the tests based on the percentage of hits for each fault condition were analyzed and are shown in
Table 5. This analysis showed that in the non-fault condition (zero short-circuited turns), there was an 81.81% hit rate. As for the state of three and four short-circuited loops, it was possible to reach 100% assertiveness. It was also determined that the higher the number of short-circuited turns, the higher the percentage of correct classification.
3.6. FPGA Implementation Results
The algorithm was implemented in an FPGA (Spartan 3A, Xilinx, Inc., San José, CA, USA) to demonstrate the lightness of the final designed architecture.
Table 6 shows the resources of suchFPGA, on which the neural network was implemented using VHDL code. These resources are considered practically low because no percentage usage value exceeds 65%. Considering that the Xilinx Spartan 3A is a low-resource development FPGA board, the project developed is optimized for implementation on low-resource boards.
The processing time of the neural network implemented on the FPGA is significantly faster than that achieved using Matlab. The acquisition of the current signal requires approximately 60 ms to collect sufficient data for generating the six descriptors of each input pattern, with this delay being dictated by the signal’s period. The pattern generation process is completed in 82.06 µs, while the classification of each pattern presented to the network is performed in 45.94 µs. Given these processing times, the proposed system qualifies as a real-time processing solution.
The definition of real-time processing depends on the sampling frequency used in industrial applications. The selected FPGA operates with a 50 MHz crystal, which, combined with its parallel processing capabilities, enables it to meet industry requirements. Notably, short-circuit coil faults are typically detected offline. However, due to the implemented system’s high processing speed and parallel processing architecture, data can be acquired and processed, and faults detected in real-time, allowing for online and in situ fault monitoring. It is important to highlight that motor control falls outside the scope of this study.
As for the error that can present the 32-bit arithmetic unit implemented in the FPGA concerning the calculation in Matlab in floating point, it yields an average absolute error of 0.01 and an average relative error of 0.01. On the other hand, the error present in the 32 bits of work processed in the FPGA and in the neural network in Matlab developed at a fixed point is zero bits. For this reason, for any pattern presented to the network, the output neuron with the highest excitation is the same in both platforms, representing the existing fault in the motor.
The last stage of implementing the neural network in the FPGA consists of displaying the type of fault resulting from the network through an LCD. FPGA can perform the neural network processing properly, which allows for visualizing the fault present in the motor in real time.
4. Discussion
Induction motors are widely used and are crucial in modern industrial systems. These motors present common faults such as short-circuits between stator winding turns, bearing wear, or broken rotor bars. Therefore, detection techniques capable of diagnosing these types of faults early on have particular attention from industry and researchers. If a predictive diagnostic method is developed, it will prevent sudden production stoppage and reduce maintenance costs and financial losses.
For predictive diagnostics in IM, computers and data acquisition cards (DAQ) with offline processing and invasive sensors that necessarily interrupt the process of performing current measurements have been implemented. A few works related to developing purpose-specific devices, such as FPGAs for online monitoring and detection of short-circuit faults in induction motor turns, have been reported in the literature.
The work of Mejia et al. [
30] presents a model based on multilayer neural networks to reproduce the current signatures associated with inter-turn short-circuit fault conditions. The model considers five fault severities. This model was implemented in a simulation based on a power hardware scheme, which is a tool for testing electrical and electronic systems based on a virtual representation of a physical system. Similarly, Ewert et al. [
16] present a system that allows online monitoring of induction motor bearings and subsequent fault diagnosis based on the analysis of vibration measurement data. A suitably trained neural network (NN) performs the bearing condition assessment based on spectral and envelope analysis of mechanical vibrations. It was also shown that the developed NN-based detectors are insensitive to other motor faults, such as stator winding short-circuits, broken rotor bars, or motor misalignment. Like our proposal, Mejia et al. [
30] and Ewert et al. [
16] employ an MLP-based diagnostic system for fault detection and identification. However, their approaches focus on different aspects, such as vibration measurement data analysis [
16]. Both studies are limited to simulation-only implementations, either using a power hardware scheme [
30] or the LabVIEW environment [
16]. In contrast, the approach presented in this article extends beyond software implementation by incorporating a portable, non-invasive, in situ, online, and purpose-specific deployment on a standalone, real-time digital system based on a low-cost, low-range FPGA.
Osornio et al. [
13] present a methodology based on the processing of electric current signals using time-domain and time-frequency-domain techniques in combination with an analysis of thermographic images that was achieved through various image processing techniques applying the short-time Fourier transform. The development of the diagnostic system has been performed with digital hardware implementations using a sensor platform based on an FPGA microprocessor running a series of algorithms developed for the online detection of multiple faults in electric motors. The system uses infrared thermograms and current signals to detect five different fault states: healthy, misaligned, unbalanced, damaged bearing, and broken bar, with an accuracy close to 99%. This work presented by Osorio et al. [
13] is similar to the one presented in this manuscript in that it is a non-invasive system, and its implementation is on an FPGA. However, the method used for fault detection is based on image processing and cannot detect short-circuit faults in the winding turns of induction motors.
The contribution of the research work developed by Aib et al. [
14] consists in the use of FPGA programmable logic circuits for fault diagnosis in an asynchronous machine for cases of phase unbalance and missing phase faults. This contribution introduces a fuzzy inference system in the analysis algorithm of the motor current signal by taking the effective signal of the stator phase current as the fault indicator signal. In that study, Aib et al. [
14] adapted the fuzzy logic for optimal implementation. The implementation must guarantee efficiency, speed of execution, and minimum possible space in the FPGA circuit. As can be seen, this study also uses an FPGA and analyzes the motor current signals for fault detection; however, it introduces a fuzzy inference system to determine unbalance in the stator phases. According to previous research, most stator-related faults are short-circuiting faults between turns of the same phase due to insulation degradation and are very difficult to detect. Therefore, the analysis of the current between the turns is more effective for fault detection than the starting vibration of a motor. For this reason, our research is focused on detecting this type of fault.
As mentioned, even though or research uses the data acquisition system presented in González et al. [
22], specific improvements are reached in the present article. The existing problem in creating the descriptors when choosing the parameters is eliminated since the summation of the complete half-cycle is performed and not from an empirically determined threshold. Also, the MLP implemented in this article has a higher classification rate than the network developed by Gonzales et al. [
22]. This classification presents a higher accuracy when applying the bucket sorting algorithm. Finally, it was possible to obtain a system embedded in a standalone, portable, non-invasive and purpose-specific device capable of performing online processing and showing, in real time, through an interface, the fault present in the engine.
Regarding fault diagnosis in induction motor coils using artificial intelligence, recent studies have explored advanced strategies, such as the integration of traditional regression approaches with machine learning techniques [
31]. Likewise, Long Short-Term Memory (LSTM) neural networks have been investigated for fault prediction [
32]. These architectures represent an advanced variant of ANN, with a more complex structure and functionality. While these methodologies offer high potential in data processing and analysis, they have certain limitations, such as higher computational cost, significant demand for memory and training time, and a propensity for overfitting. In addition, their interpretation is more complex compared to simpler approaches, such as the MLP used in the present study, due to the greater number of layers and hidden states involved in its operation.
In this context, although the current trend in AI favors massive data expansion and the exponential growth of neurons in intermediate layers—leveraging the computational power of cloud servers from companies such as Google (Mountain View, CA, USA) or IBM (Armonk, NY, USA), among others, via collaborative platforms—these strategies are not directly applicable to this work. The approach presented in this study focuses on the implementation of embedded systems for real-time applications, where large-scale data processing is not feasible. In this scenario, the use of a MLP is sufficient for fault detection, offering advantages in terms of energy efficiency, hardware requirements, computational resources, and processing time. Furthermore, relying on large-scale computing infrastructures would significantly increase operational costs in industrial applications, limiting their practicality in real-world production environments.
In summary, the present article proposes a portable system, which offers several advantages over other solutions presented in
Section 4. These include a short response time due to real-time processing, high accuracy in fault detection, and a digitally simple algorithm that enables implementation on a low-cost, low-range, purpose-specific FPGA board. Additionally, the system operates independently, without requiring external devices, making it a fully stand-alone solution.
5. Conclusions
This study presents a standalone, non-invasive, and portable system capable of detecting short-circuit faults in winding turns. The proposed system performs real-time processing, in situ and online measurements of variations in stator load conditions of induction motors without interrupting their operation.
A multilayer perception neural network (MLP) was successfully developed to classify current faults, achieving an accuracy rate of 93.18%. The performance analysis, based on the percentage of correct classifications for each fault condition, demonstrated that for the non-fault condition (zero short-circuit turns, WoF), the classification accuracy was 81.81%. For a single short-circuit turn, an accuracy of 86.26% was obtained, while for three and four short-circuit turns, the system achieved an accuracy of 100%. Additionally, for two and five short-circuit turns, the accuracy reached 95.45%.
The system incorporates a data acquisition stage, which proved to be effective and sufficiently accurate to obtain phase current values in a non-invasive manner. Furthermore, the descriptor model applied to the acquired current data (Equation (2)) successfully captured the relevant current characteristics, allowing the formation of patterns that could be effectively classified by the neural network. The final digital algorithm is computationally efficient, allowing its implementation on a low-cost, low-range, purpose-specific FPGA board for real-time classification of short-circuit faults in induction motor winding turns. The classification results are displayed on an LCD screen.
The findings of this study highlight that FPGA-based implementations offer high real-time processing capabilities at relatively low cost. The open and reconfigurable architecture of the FPGA facilitates upgrades and modular integration to adapt to changing requirements. Furthermore, its internal hardware parallelism enables fast and efficient processing for online diagnostic systems. In summary, this work successfully demonstrates the development of a portable electronic device capable of detecting and displaying fault classifications in real-time without interrupting engine operation, allowing users to take timely preventive measures. While the proposed approach offers a functional solution for engine fault diagnosis, certain limitations prevent it from being considered a fully general and comprehensive solution. Nevertheless, the results suggest that the proposed method has significant potential for effective and efficient application to other engine types and real-world operating conditions. Future work may focus on extending the system’s capability to detect more than five short-circuit loops, as well as investigating its applicability to detect other types of faults.