1. Introduction
Industries heavily rely on induction motors due to their simple design and consistent performance, making them integral for heavy-duty applications. However, the potential for sudden failures poses significant safety risks and economic burdens. A timely diagnosis of abnormal conditions in induction machines is crucial to avoid costly downtime and ensure reliable operations. These abnormal conditions encompass various types of faults: stator faults such as open circuits, short circuits, and inter-turn faults, rotor faults, mechanical faults in the rotor, including bearing damage, shaft misalignment, and eccentricity, failures in different components of the power electronics control system, and cracking or breaking of the magnetic material in rotating electrical machines [
1]. Detecting these diverse faults provides valuable insights for diagnostic purposes. The characteristics of multiple faults in induction machines manifest in various ways, including line currents, air-gap voltages, torque fluctuations, machine losses, acoustic emissions, and excessive heating. These signals contain crucial information for identifying and diagnosing abnormal conditions in induction machines [
2].
Diverse types of abnormal conditions commonly occur in induction machines and are categorized as either mechanical or electrical faults. The distribution of faults in induction motors is as follows: approximately 38% of faults are related to stator winding, 40% are related to bearings, 10% are related to rotors, and the remaining 12% are miscellaneous flaws [
3]. In steady-state conditions, condition monitoring can primarily detect broken rotor bar problems by analyzing harmonics. This involves monitoring fundamental supply surrounded by slip frequency sidebands in the current frequency spectrum. The amplitude of sidebands increases with the severity of consecutive broken rotor bars under the same pole. Statistics indicate that around 10% of induction machine issues are attributable to broken rotor bars [
4].
Figure 1 illustrates the distribution of failures that commonly occur in rotating electrical machines.
The conventional relays-based protection methods are gradually becoming obsolete, as advanced diagnostic algorithms capable of detecting faults at their incipient stages are emerging [
5]. Among these faults, broken rotor bars (BRBs) pose a significant challenge in squirrel cage induction machines. The primary causes of fractured rotor bars in these machines include manufacturing defects, harsh duty cycles, and pulsating mechanical loads. These factors lead to machine vibration and torque fluctuations, resulting in mechanical, thermal, and magnetic stresses on the rotor bars. Induction motors, serving industrial, commercial, and domestic purposes, often operate at variable speeds [
6]. Harmonics present in the motor line current can originate from three main sources: supply-side harmonics, internal harmonics, and motor faults. Principal and rotor slot harmonics are motor-generated harmonics resulting from non-sinusoidal winding distributions, slot openings, and material saturation [
7]. The most prevalent method for identifying patterns in squirrel cage induction motor global signals is Motor Current Signature Analysis (MCSA) [
8,
9]. BRB defects introduce magnetic and geometric imbalances, leading to the appearance of sidebands in the stator current spectrum. The analysis ofsideband amplitudes and frequencies serves as valuable input for training machine learning models [
10]. One notable advantage of MCSA is its non-invasiveness: sensor installation does not disturb motor operation [
11].
Todiagnose broken bars in motors operating under indirect field-oriented control, artificial neural networks and the stator current spectrum derived from the Hilbert transform have been recommended [
12,
13]. These methods provide effective diagnostic capabilities while maintaining the operational integrity of the motor and enterprises. In [
14], the proposal of an inverse thresholding technique applied to the current spectrogram aims to enhance the visibility of cracked rotor bars. Meanwhile, in [
15], the authors presents an integrated design strategy focusing on preserving closed-loop functionality while leveraging active fault diagnostic and tracking control to detect early flaws. Authors in [
16] advocate for the utilization of zonotopic observers and MANFIS models to ensure reliable detection of broken rotor bars (BRBs). However, [
17] highlights a challenge where the fracture of two bars, situated a pole pitch apart, conceals the fault harmonics, rendering fault detection seemingly impossible. In scenarios where fault data are limited or unrepresentative, expert systems are often employed to augment defect identification. Domain knowledge-based problem signature identification systems, as discussed in [
18], expedite outage correction by efficiently extracting fault features and patterns [
19]. Deep learning methodologies offer promising avenues for classifying machine flaws, even in the absence of fault data, for both classification and regression analyses. Techniques such as neural networks, autoencoders, and LSTMs demonstrate efficacy in processing raw sensor data [
20,
21]. Transfer learning emerges as a solution to overcome data constraints [
22,
23], particularly in addressing BRB fault issues.
Recent advancements encompass intelligent diagnostics [
24], real-time neural classifiers [
25], and the integration of high-precision hybrid statistical and machine learning approaches [
26]. Transfer learning combined with vibration imaging sheds light on fault-spectral behavior [
27], while dictionary learning aids in damage identification through vibration analysis [
28]. Furthermore, ongoing research emphasizes visual and time-series-based diagnostic solutions employing convolutional neural networks (CNNs). CNNs excel at capturing localized spatial correlations in both images and data, facilitating hierarchical feature learning across various scales [
29]. Surface damage detection is garnering increasing attention, as evidenced by studies exploring node currents [
30], convolutional neural networks [
31], damage picture categorization [
32], deep neural network feature extraction [
33], and sensor fusion [
31]. The main objectives and contributions of this paper can be summarized as follows:
An enhanced methodology is implemented to accurately identify and classify the number of broken rotor bars (BRBs) in inverter-fed induction motors across various loading conditions by integrating sparse stacked autoencoders with LightGBM (SSAE–LightGBM) using stratified group k-fold cross-validation.
This research introduces a novel cross-validation technique designed to balance the distributions of significant features in each fold, deviating from conventional stratification-based labeling. This refined approach exposes the model to diverse feature distributions, thereby augmenting its diagnostic capabilities and bolstering its reliability.
Extensive datasets are utilized across various loading circumstances to collect a broad spectrum of signals and incorporating SSAE toreduce dimensionality and manage noise in motor fault data.
The framework conducts an in-depth analysis of various operational parameters of the motor, including signatures of current, voltage, speed, and torque. By incorporating this multifaceted methodology, the framework enhances the breadth of fault detection and classification.
Leveraging the sparse stacked autoencoder (SSAE), the framework effectively extracts complex features from comprehensive datasets, substantially improving the system’s ability to identify BRB signatures across different loading conditions.
Exclusive datasets collected from induction motors powered by inverters and operating under varied loading scenarios (0%, 25%, 50%, 75%, and 100%) are utilized to ensure a comprehensive analysis.
2. Identification of Abnormal Conditions in Broken Rotor Bars (BRBs)
Detection of damaged rotor bars using vibration signals, acoustics, current, magnetic flux, voltage, torque, and speed are prevalent in the literature [
34]. A common approach in predictive maintenance involves segregating fundamental harmonics from faulty harmonics through spectral analysis [
35]. In our study, we employed current, voltage, speed, and torque harmonics to train the proposed algorithm. The following are the fundamental equations that wereused for the pre-analysis of the collected data.
2.1. Current Analysis
The motor current contains significant information about the health of the machine in the form of supply, winding, saturation, and fault-related harmonics [
34]. The stator current modulates at a predetermined frequency in defective conditions. An induction motor with damaged rotor bars has the following frequency components in its stator current spectrum:
where
s represents the slip,
denotes the frequency components of the current attributed to broken rotor bars in the frequency spectrum,
signifies the power supply frequency, and
k stands for any positive integer.
2.2. Mechanical Torque Analysis
Flaws in rotor bars can induce torque pulsations due to magnetic field irregularities, resulting in harmonics in currents and voltages. Variations in speed resulting from torque inconsistencies can impact motor stability and performance [
36]. Torque, being a function of electrical quantities, holds potential as a candidate for condition monitoring in electrical motors:
where
and
represent phase voltage and current, PF denotes the power factor, and
ωsignifies the angular frequency. Detection oftorque pulses or deviations from expected values can serve as an indicator of rotor faults.
2.3. Voltage Analysis
Broken rotor bars can cause voltage spectrum abnormalities, especially in severe defects [
34]. Detecting irregularities or distortions in the phase voltages can point out the BRBs in induction machines. Let
be the phase voltages. The imbalances in the voltage can be calculated as:
2.4. Speed Analysis
In transient conditions, the detection of unanticipated changes in motor speed may indicate failures in BRBs. The synchronous speed N in an induction machine can be calculated as:
2.5. Difficulties of BRB Diagonistics in Inverter-Driven Induction Motors
The diagnosis of broken rotor bars is particularly challenging in inverter-driven systems due to the unique operational profiles these drives afford, including variable speed and load conditions. Traditional diagnostic methods often fall short under these conditions, necessitating innovative approaches to fault detection and diagnosis. Our experimental setup comprised two similar induction motors connected back-to-back: one serving as a test motor with deliberately induced rotor bar and bearing faults, and the other functioning as a healthy loading motor. This configuration allowed us to simulate operational conditions closely resembling real-world scenarios while maintaining control over the testing environment. Diagnosing rotor bar breaks in inverter-driven induction motors is tricky because these motors change speed and load frequently, making it hard to spot problems. In the frequency domain, the signals we use to detect faults can get mixed up with other signals from the motor’s control system, making it even harder to find issues. Sometimes, these faults are so small that normal diagnostic methods cannotdetectthem early on, leading to bigger problems later. In addition, the electrical noise from the inverter can hide fault signals in the motor electrical signals. Rotor bar breaks also occur in complex patterns, and they can happen when the motor is under different loads. To detectthese issues early on, we need to monitor the motor in real time and use advanced tools such asmachine learning to help us find and fix them quickly.
3. Improved Sparse Stacked Auto-Encoder (SSAE) with LightGBM
SAE (Sparse Autoencoder) is a hierarchical deep neural network comprising deep autoencoders. In basic autoencoder architecture, there are input, hidden, and output layers. The encoder is responsible for extracting hidden attributes from input data, while the decoder reconstructs input data using features from the hidden layer [
37]. By closely matching input data, the autoencoder (AE) preserves fundamental properties. The input layer of the autoencoder receives the data, and the intermediate layers generate latent codes from the input data. Downsampling input data with principal component analysis (PCA) produces latent codes, the dimensionality of which depends on the nodes in the hidden layer [
38]. The final layer of the model deciphers these latent codes to reconstruct the initial input. Autoencoders convert input data into latent codes and use them for reconstruction. The typical representation for AE inputs is
x = [
,
]. T in
, where
is input dimension. The input
x is translated from the input layer to the hidden feature vector h, which contains
neurons in the hidden layer, during encoding. This conversion process utilizes the activation function f. The encoding layer encodes network inputs, while the decoding layer decodes them [
39]. Consequently, the neuron count in the decoding layer is determined by the input dimensionality. The primary objective of an autoencoder (AE) is to compute a reconstruction code h for an input instance x to accurately retrieve it. The formulation of a two-stage approximation function can be described as follows:
where
represents the function of the encoding layer and
denotes the function of the decoding layer. The neuron count in the encoding layer is typically lower compared to the input dimensionality. Consequently, the network is encouraged to reduce input dimensionality in this layer to eliminate redundancy. The typical back propagation technique with random weight initialization is well-suited for training a single autoencoder because it is a shallow neural network.
Figure 2 illustrates the workflow of the anticipated sparse stacked autoencoder with LightGBM architecture for BRBs diagnostics under various loading conditions.
Machine learning leverages the gradient-boosting framework LightGBM for tasks such as classification, regression, and ranking. Renowned for its ability to efficiently process massive datasets, LightGBM employs gradient boosting to construct a robust ensemble model from numerous weak learners, typically decision trees [
40]. Notably, LightGBM has undergone enhancements for improved performance and efficiency. Unlike traditional boosting methods, LightGBM preserves eigenvalues within histograms, allowing for the simultaneous and distributed construction of decision trees. By iterating through discrete and continuous eigenvalues within histograms, LightGBM identifies optimal division points without the need for preordered boosting. Moreover, LightGBM employs depth-limited leaf growth, prioritizing the discovery of leaf nodes with the highest split gain over unnecessary split points. This approach contrasts with conventional level-wise decision trees, which may be less effective in certain classification tasks [
41].
Consider the training sample set T of the LightGBM algorithm to be {(
,
) (
,
) … (
)} and the prediction output
. Follow these procedures to determine the strong combination of weak decision trees for the
i-th sample:
The decision tree is denoted as
, and
N represents the number of trees at the
n-th iteration. The feature space is divided into
L non-overlapping regions denoted by
.
is the feature subset space that corresponds to the leaf node l of the
n-th tree. To express
as an equation, it can be written as follows:
The prediction score for the
i-th leaf node is denoted by
, whereas the piecewise function is depicted by I (⋅), which takes on a value of either 1 or 0. Let us consider the hypothetical value of the
i-th sample at the
k-th iteration, denoted as
. At iteration
k, the objective function
can be estimated using the second-order Taylor expansion.
In which
and
represent the first and second derivatives of the logistic loss function with multiple classes. The effect of the regularisation term
on the increase in leaf node quantity and quality is minimal. Additionally, the regularization factors α and λ are applied to the parameters. Determine the non-overlapping area
and the optimal leaf node score
for each tree to decrease
during training. Rewriting the objective function
as follows:
After the process of segmentation is completed, the sample sets RL and RR correspond to the left and right leaf nodes, respectively. The original nodes are represented by R = RL → RR. Superior tree growth is the consequence of increased benefit values. During the segmentation of leaf nodes, the gain associated with the split point of the candidate feature is calculated utilizing the given equation. The expansion of the leaf-wise growth methodology to include the utilization of a stacked autoencoder for the detection of anomalous conditions is illustrated in
Figure 3.
LightGBM divides the leaf with the largest split gain, and feature scores are computed by summing split operation gains or segmentation passes. Innovative algorithms such as exclusive feature bundling (EFB) and gradient-based one-side sampling (GOSS) are employed in LightGBM. GOSS targets poorly trained samples while preserving baseline data dispersion, enhancing model efficiency. On the other hand, EFB minimizes feature information while retaining original feature information, thereby improving algorithm efficiency [
42]. Consequently, LightGBM enhances memory conservation and accelerates training.
3.1. Hyperparameter Setting of Improved SSAE and LightGBM
The optimal performance of an enhanced SSAE can be achieved through hyperparameter adjustment. Denoising, variational, or sparse autoencoders are some examples of changes that might be used in an enhanced SSAE. Take into account
Table 1 shows the following critical hyperparameters of stacked sparse autoencoder SSAE (
Table 1):
3.2. Bayesian Hyperparameters for Light GBM
The process of selecting hyperparameters is pivotal in the modeling process, and LightGBM provides a diverse set of hyperparameters to choose from. To enhance real-time efficiency in fault detection, only parameters with a significant impact on model performance were selected for hyperparameter tuning. Bayesian optimization was employed to effectively tune these hyperparameters, as it constructs a probabilistic model of the objective function and selects hyperparameters with the highest likelihood of success.
Table 2 lists some of the most critical hyperparameters targeted for improvement using Bayesian optimization in the LightGBM gradient boosting framework:
4. Test Bench Setup
For data collection, one, two, and three broken bar rotors were prepared by drilling radial holes, as shown in
Figure 4. The monitoring of various parameters produces diverse datasets, resulting in the acquisition of voltage, current, torque, and speed signals. For measurements and experiments, we utilized two different motors, with their parameters provided in
Table 3. Both motors were fixed to the same mechanical base and coupled with one another through shafts, as shown in
Figure 5. The overall flow of the proposed method is shown in
Figure 6. Both motors were supplied through industrial inverters; the load was changed by changing the torque of the loading machine working under direct torque control (DTC). A Dewetron transient recorder was utilized to measure the current, voltage, speed, and torque at a sampling frequency of 20 KHz. We divided the data into four categories: healthy, 1BRB (‘1’), 2BRB (‘2’), and 3BRB (‘3’). The training dataset contained 1.2 million samples at 20 kHz for both scenarios, and the validation dataset contained 300,000 samples. We converted the data from the time domain to the frequency domain as well to train a better model for more accurate testing outcomes. For the measurement phase, both motors were driven by ABB industrial drives (loading: ACS600, testing: ACS800), employing scalar and direct torque control (DTC) modes to vary the operational conditions. A Dewetron data acquisition system, alongside Fukei400 current probes (pending confirmation), captured the electrical signals at a sampling frequency of 20 kHz. The experimental approach involved the creation of broken rotor bar conditions by drilling holes in the rotor slots and carefully matching the slot depth to ensure a consistent and reproducible fault condition. Tests were conducted under a range of loading conditions (from 0% to 100% of nominal load) and control modes to assess the diagnostic challenges and identify potential signatures indicative of broken rotor bars.
Table 4 shows the main parameters of the machines under investigation, including a four-pole, three-phase induction motor with a star/delta connection, 36 non-skewed stator slots, and 28 skewed rotor slots. The motors were rated at 7.5 kW power, with terminal voltages of 690 V/400 V at 50 Hz, and rated currents of 8.8 A/13.5 A.
5. Results and Discussion
It is important to analyze the harmonic components of current, voltage, torque, and speed and understand how these components change with varying loading circumstances. Increasing the load might cause the harmonic components to change their behavior, a phenomenon which may hide fault-induced harmonics or make their consequences worse. Thus, it is crucial to precisely differentiate fault-related harmonics from load-induced changes for successful fault identification. Furthermore, the existence of harmonics close to supply and slotting frequencies emphasizes the need to thoroughly analyze particular frequency ranges to distinguish fault patterns from surrounding noise. Utilizing sophisticated signal processing methods and strong diagnostic approaches can improve the dependability and precision of fault diagnostics in motor systems, reducing downtime and maximizing operating efficiency. To detect abnormal conditions, the harmonic components of the current, voltage, torque, and speed should be considered. Fourier transforms of the mentioned signals can identify these constituents in the frequency domain. These frequency components are the function of load. Here, the load was found to increase and change harmonic component behavior. The fault-based harmonics were distributed in the entire spectrum. They significantly appeared near supply and slotting harmonics.
Figure 7 and
Figure 8 illustrate the frequency spectrum of current, voltage, speed, and torque with their FFTs at 100% loading conditions for the healthy and the three broken rotor bars, respectively.
We analyzed harmonic components and display signal frequency content in spectrograms under various operating parameters. By constructing spectrograms for normal operation and cases with one, two, or three broken rotor bars (BRBs), we can examine how signal frequency properties vary under fault conditions. Spectrograms analyze frequency distribution and intensity, revealing event-specific fault patterns. We can evaluate rotor bar spectrum fluctuations to develop effective motor fault diagnostic methods by comparing spectrograms. The spectrograms of all the healthy and faulty conditions are shown below to detect the abnormal conditions in the healthy, one BRB, two BRBs, and three BRBs broken rotor bars. These spectrograms may reveal patterns connected to several faulty states, helping identify and understand the machine’s health. When the load changes, these spectrograms demonstrate how the strength and emphasis of the frequency change.
Figure 9 shows the spectrograms of healthy and faulty conditions at 100%. It is evident that, as the number of broken rotor bars increases, the spectral power density increases in the entire frequency range. This is because the fault-based harmonics and their copies spread in the wideband frequency range. The number and the strength of those harmonics depend on the type and the severity of the fault.
The suggested diagnostic framework (SSAE–LightGBM) incorporates a stratified feature-based cross-validation approach applied to the broken rotor bars dataset obtained from the test setup. The dataset was divided into folds, preserving the proportion of relevant features in each fold. This strategy facilitates a more accurate and comprehensive assessment of model performance, particularly beneficial for unbalanced or critical datasets. Leveraging the advanced SSAE–LightGBM method alongside strict stratified feature-based cross-validation enables thorough diagnostic investigations. The algorithm underwent fine-tuning through extensive experiments conducted on data generated at various loading levels (0%, 25%, 50%, 75%, and 100%). The effectiveness with which the model was able to diagnose rotor bar health states is represented visually via the confusion matrices. Confusion matrices provide an evaluation of the performance of a classifier across different health states. The classifier’s ability to resolve between rotor bars that were healthy and those that were damaged to various degrees is afforded by the comprehensive analysis this provides. It effectively maps out the efficiency and adaptability of the diagnostic process. The study displays confusion matrices illustrating the performance of the diagnostic framework, sparse stacked autoencoder (SSAE), and LightGBM under varying load conditions ranging from 0% to 100% for both healthy motors and those with damaged rotor bars. The matrices present the classification results, demonstrating the accuracy of our model in distinguishing between healthy and malfunctioning motor conditions across different operational loads. The confusion matrices at various loading levels demonstrate the resilience of our diagnostic method across a broad spectrum of operational scenarios. SSAE and LightGBM can effectively identify and classify BRB problems at various load levels, demonstrating the model’s reliability and efficiency in real-world industrial settings. In
Figure 10, the results of the confusion matrices for healthy broken rotor bars at 0%, 25%, 50%, 75%, and 100% loading conditions are presented. As such, the model’s performance is depicted in different cases.
Additionally, ROC curves beyond the confusion matrices offer nuanced insights regarding the discriminatory ability of the model. These illustrate the trade-off between true positives and false positives across multiple loading conditions, revealing the model’s ability to discern BRB anomalies at differing operational thresholds. These ROC curves further reveal the model’s capacity to discern accurately rotor bar faults over numerous loading conditions, presented in
Figure 11 for comparison of abnormal conditions due to broken rotor bars in multiple loading conditions, providing further insights based on several operating scenarios.
The given
Table 5,
Table 6,
Table 7 and
Table 8 provide diagnostic performance parameters for undamaged rotor bars and different fault levels under different loading circumstances. Accuracy, precision, recall, F1-score, and processing time are performance measurements. Each table shows fault scenarios with undamaged rotor bars and one, two, or three damaged bars. The diagnostic framework detects faults accurately under diverse loading conditions. Precision, recall, and F1-score measures show the framework’s ability to detect and categorize rotor bar faults, with slight variances based on fault severity and operational load. The processing time also indicates the computational efficiency of the diagnostic procedure. The tables show the diagnostic framework’s reliability and effectiveness in industrial fault diagnosis applications. Increased motor load can impact diagnostic accuracy in several ways. Excessive loads might mask trouble indicators, reducing precision due to noise or interference. Increased loads can reduce precision and recall as a result of signal intricacy and fault signature concealment. The F1-score, which considers both precision and recall, might vary based on loading conditions, reflecting the efficiency of the diagnostic system. Signal analysis necessitates additional computational resources, hence larger workloads might lead to longer processing times. Thorough testing is necessary to guarantee the reliability of industrial applications as fault detection algorithms need to be tuned to perform consistently under varying loading conditions. To develop effective defect detection algorithms and assure the reliability of motor systems in practical applications, it is crucial to comprehend these implications.
Table 5,
Table 6,
Table 7 and
Table 8 provide the diagnostic performance measures of accuracy, precision, recall, F1-score, and performance time (that the proposed algorithm (SSAE–LightGBM) takes to predict the outcomes) for healthy and faulty BRBs at varying degrees under different loading situations.
7. Conclusions
The paper proposes an advanced diagnostic approach for identifying and classifying broken rotor bars (BRBs) in inverter-fed induction motors. This method combines a sparse stacked autoencoder (SSAE) with LightGBM, a gradient-boosting framework, to analyze crucial motor operating data features such as current, voltage, speed, and torque signatures. A unique stratified group k-fold cross-validation technique is introduced to maintain the distribution of class labels and grouping variables across folds, enhancing the model’s performance. SSAE is employed for feature extraction, automatically learning hierarchical representations of input data and identifying critical patterns, while LightGBM highlights important BRB features using the extracted traits. Extensive datasets exclusively sourced from inverter-powered motor systems operating at various loading percentages are utilized for training and testing. Evaluation metrics including vibrations, spectrograms, confusion matrices, and receiver operating characteristic curves are employed to assess the model’s performance. Bayesian optimization is utilized to fine-tune LightGBM hyperparameters, and the optimal feature subset is used for final classification. The experimental results demonstrate promising accuracy ranging from 91.55% to 99% across different loading scenarios, demonstratingthe efficacy of the proposed BRB diagnostic framework. Overall, the integration of advanced techniques such asSSAE and LightGBM enhances the robustness and effectiveness of the diagnostic architecture, offering significant improvements to BRB detection and classification in induction motors.