1. Introduction
Due to the frequent occurrence of accidents caused by lithium batteries, there is an urgent need to troubleshoot lithium-ion batteries. A lithium-ion power battery is a highly complex, nonlinear system in which internal state information cannot be directly observed, making it very difficult to diagnose faults. In fault development, even a small internal short circuit can lead to battery capacity degradation. In more serious cases, internal short circuits can lead to fire accidents [
1]. Cell faults in battery modules can only be characterized by parameters such as voltage, current, and temperature. Therefore, fault diagnosis methods must rely on limited data to identify and accurately assess fault characteristics. Battery faults in battery modules can only be characterized by parameters such as voltage, current, and temperature. As a result, fault diagnosis methods must rely on limited data to identify and accurately assess different fault characteristics.
The methods commonly used for diagnosing power battery faults can be categorized into three broad categories: empirically based fault diagnosis [
2], model-based fault diagnosis [
3], and data-driven fault diagnosis [
4]. The empirically based fault diagnosis method diagnoses battery faults by observing the characteristics of a battery system in different states. This method is effective for complex, nonlinear systems like lithium batteries as it does not rely on an accurate mathematical model. Although the working principle and diagnostic results of this method are easy to explain, it still necessitates extensive research on the battery’s failure mechanism and the characterization of prior knowledge when applied to battery fault diagnosis. The most widely used empirically based method in fault diagnosis is the threshold-based method. This method is advantageous due to its simplicity, ease of implementation, and wide application in the battery management systems of electric vehicles [
5,
6].
In certain cases, the single-threshold setting fails to differentiate between the normal state and the fault state of a battery. As a result, there are instances in which detection does not occur, leading to false alarms. This, in turn, adversely impacts the level of trust the battery management system commands from its owner. The empirical fault diagnosis method relies on the rational use of experience. As an auxiliary means of fault diagnosis, it can help extract the characteristic parameters of specific faults and improve the accuracy of fault diagnosis. However, when used as the main diagnosis method, it still faces several difficulties.
The model-based battery fault diagnosis method uses the difference between a model’s output and actual battery measurement data, known as the residual, to determine a fault [
7,
8]. Model-based methods can be classified into four categories: parameter estimation, state estimation, parity space methods, and structural analysis methods [
9]. State estimation methods primarily employ observers or filters to reconstruct or estimate the internal state of a system, including variables like the state of charge (SOC) and the internal temperature of a battery. The residual, which contains fault information, is derived by comparing the estimated signal with the value measured by the sensor [
10]. The main idea of the parameter-estimation-based fault diagnosis method is that faults affect the physical system and further lead to changes in the model parameters [
11,
12]. Therefore, detecting changes in the electrical and thermal model parameters of a battery can enable the identification and isolation of faults. The parity space method can determine the nonlinear relationship between a battery’s excitation signal and its response by analyzing the input and output values of the battery system [
13,
14]. The structural analysis method is employed to uncover and make use of the overdetermined structural portion of the system’s dynamic equations [
15]. Subsequently, the analysis focuses on the detectability and isolability of faults [
16,
17].
One current research focus for enhancing the identification of models for complex faults is the use of a multi-model fusion approach. This approach involves the fusion of models that exhibit greater accuracy in fault diagnosis together when compared to individual models. Sidhu developed an adaptive fault diagnosis technique for lithium-ion batteries. This technique employs multiple nonlinear models in the diagnosis process to capture battery-specific faults, such as over-charge and over-discharge faults, which can cause notable variations in model parameters [
18]. One of the main advantages of this method is that instead of providing an exact fault type result after diagnosing a fault, it gives the probability of a certain class of fault. Chen proposed an algorithm for diagnosing external short circuits in lithium-ion batteries using an improved first-order RC model [
19]. This method accurately diagnoses faults soon after they occur. Similarly, Yan proposed a fractional-order model and a first-order RC model for detecting external short-circuit faults in batteries [
20]. The parameters of the model are determined using a genetic algorithm. Building upon these ideas, a battery electrolyte leakage detection method based on a random forest model is also proposed. This method is practical for fault detection and considers various fault conditions. Dey proposed a diagnostic algorithm for thermal runaway faults in lithium-ion batteries [
21]. This method offers a more accurate simulation of the battery’s internal core temperature and can rapidly detect thermal runaway faults. However, there are still challenges in applying this method to actual vehicles, and further optimization is necessary.
The model-based fault diagnosis method is capable of detecting various types of faults. It can be effectively validated in laboratory test conditions to ensure accurate diagnostic performance. However, the application of this method in real vehicle operating conditions poses several challenges. The battery is often used as a reference in the modeling process. However, due to factors like its internal resistance, capacity, and other parameters that are subject to constant change, the output value of the model can differ significantly from the actual measured value [
22]. In real-world applications, it is necessary to periodically calibrate the battery parameters, which poses challenges for implementing this method in actual vehicles. Furthermore, the model-based approach fails to consider the different types of faults that can occur during the operation of electric vehicles. It can only identify specific faults in a targeted manner, limiting its overall applicability [
23].
The data-driven fault diagnosis approach is a method that detects faults by directly analyzing and processing operational data without relying on accurate analytical models or expert experience. This approach simplifies the fault detection process by eliminating the need to consider complex fault mechanisms and system structures, such as the thermal runaway and accelerated aging of the battery caused by various unidentified coupling factors. Additionally, partially data-driven methods have their limitations. These include the need for large amounts of historical data for training and the computational cost and complexity involved in analyzing such data [
24].
Data-driven methods are commonly used in real-world applications of electric vehicles. These methods are often combined with equivalent circuit models, expert experience, and other techniques to enhance their adaptability to real-world data. Kong proposed a diagnostic method for early micro-short-circuit faults in Li-ion batteries that occur before thermal runaway. The method is based on the variation in the remaining charge capacity among batteries [
25]. It has been shown to accurately estimate such faults in constant-current and constant-power charging experiments. However, its effectiveness on real vehicle data has not been verified. Zhao proposed a fault diagnosis method for electric vehicle battery systems. This method utilizes big data statistical methods to detect abnormal battery terminal voltages in battery modules. Machine learning algorithms and a 3-σ multi-level screening strategy are employed for the detection process [
26]. The method incorporates a neural network algorithm to combine fault and defect diagnosis results. It also utilizes big data statistical regulation to construct a comprehensive battery system fault diagnosis model. Xia proposed a battery fault diagnosis method based on correlation coefficients to detect capacity inconsistency among battery units in a battery module [
27]. While this method displays good robustness in detecting battery inconsistency, it is not effective in diagnosing other types of faults. To address this problem, Kang proposed an online multi-fault diagnosis method. This method is based on a non-redundant crossover measurement circuit and an improved correlation coefficient method [
28]. By improving the connection of voltage sensors without adding to the hardware cost, the method captures fault characteristics and assesses the severity of faults using the improved correlation coefficient method. As a result, this method can differentiate battery faults from other non-battery faults. Wang proposed a modified Shannon entropy-based in situ battery fault diagnosis method which can predict voltage faults in time by monitoring a battery voltage during vehicle operation [
29]. The method underwent experimental validation using much real-time voltage-monitoring data. The results demonstrated an accurate prediction of the time and location of voltage faults within the battery module.
In summary, the data-driven battery fault diagnosis method can diagnose faults in real-vehicle conditions but still suffers from poor robustness and poor adaptability to data. Therefore, this study aims to utilize the data-driven method as the primary approach to fault localization while also incorporating the empirically based method as a complementary tool for diagnosing faults in real-vehicle situations. By combining these two methods, the goal is to enhance the robustness of the diagnosis results and improve their adaptability to real vehicle data. Although the effectiveness of most power battery fault methods has been verified using experimental data, their performance in real-vehicle conditions has not yet been assessed.
However, the phenomenon of instability exists in the real-vehicle-condition data of power batteries, and the use of recently commonly used fault diagnosis methods such as the correlation coefficient method and the information entropy method does not work well in practical applications. Therefore, there is an urgent need for a method that can properly monitor faults in multiple situations in practical applications. The characterization of faults and data characteristics may be different in different working conditions, so it is necessary to combine multiple fault characterization methods in actual working conditions so there are at least corresponding fault characterization parameters that can characterize faults properly in different working conditions. Therefore, in this paper, traditional fault features are fused multidimensionally, and a density-based outlier algorithm is used to extract abnormal features to maximize the universality and robustness of fault diagnosis.
The remainder of the paper is organized as follows: an analysis of battery data is presented in
Section 2. The constructed fault characterization indicator and LOF algorithm are presented in
Section 3. The method proposed in this paper is tested in
Section 4. The key conclusions and outlook are summarized in
Section 5.
2. Battery Data Analysis
The actual operation data of all electric vehicle power batteries involved in this report come from the cloud-based commercial power battery big data platform in cooperation with the School of Electronics and Electronic Engineering of Hubei University of Technology [
30], as shown in
Figure 1. The data from the platform are transmitted in real time through a wireless communication module in the vehicle’s terminal system. The data collection for the vehicle terminal consists of three parts: the battery management system (BMS), the vehicle controller, and the Global Positioning System (GPS). The BMS gathers data on the battery’s voltage, current, temperature, and charge status. The vehicle controller gathers data on motor speed and torque. The GPS is responsible for collecting data on vehicle position and acceleration. The data collected by the above three parts are interacted with and transmitted by the Controller Area Network (CAN), and the data are transmitted to a wireless communication module in time sequence. The big data platform will store, decode, and monitor the received data.
The data stored in the cloud-based big data platform include various electrical parameters of batteries, such as the battery cell end voltage, battery module end voltage, current, SOC, etc. Additionally, other parameters like temperature, mileage, and so on are also recorded. The data are organized in a standardized time series format in which each row consists of a time stamp and its corresponding parameters. The data format is a standard time series; each line of data contains a time stamp and its corresponding parameters, and the time stamp interval is 10 s, in line with the national standard GB/T32960 [
31]. The data are from the period from March 2020 to September 2021.
The battery module consists of 96 NCM811 Li-ion ternary batteries (LiNi
0.8Co
0.1Mn
0.1O
2). These cells are connected in series, and each cell operates in a voltage range of 3.5 V to 4.3 V. Since each module is connected in series, the current data flowing through each cell remain constant. The current data fluctuate greatly under the discharge condition, as shown in
Figure 2, and it is difficult to determine the fault state of the battery through the current data. Regarding the temperature inside the battery module, due to a limited number of temperature probes, it can only be measured at 24 positions evenly distributed inside the module. As a result, it is not possible to measure the temperature of each battery cell which, together with the hysteresis of temperature parameters compared to electrical signals, makes the localization of early failures using temperature difficult to achieve at this stage.
This paper focuses on analyzing various battery system faults, specifically micro-short-circuit faults, severe internal short-circuit faults, battery inconsistency faults, and sampling circuit faults. It is important to note that non-electrical quantities such as temperature do not provide a visual representation of battery faults as they change slowly. Similarly, internal resistance and other parameters cannot be measured directly. Therefore, to estimate the state of a battery, additional data are required. In this study, the voltage value is chosen as the most sensitive and intuitive data for fault analysis and diagnosis.
Figure 3 shows the voltage curves of three different types of faults which come from the typical fault data stored in the big data platform and have data characteristics typical of different types of faults.
For the short-circuit fault within a battery, a first-order equivalent model of the battery can be used to analyze the fault, as follows:
where
denotes the battery’s terminal voltage,
denotes the battery’s open circuit voltage,
denotes the battery’s equivalent capacitance–voltage,
denotes the circuit current, and
is the equivalent resistance. A schematic diagram of the first-order equivalent model of the battery is shown in
Figure 4.
When an internal short circuit occurs in a battery, the resistance value of the equivalent resistance
decreases; as a result, the voltage at the end of the battery will drop. As shown in
Figure 3a, when a micro-short circuit occurs in a battery, the voltage drop is not obvious and is difficult to observe directly, while when the battery has a serious internal short circuit, as shown in
Figure 3b, its resistance value is lower, the voltage drops quickly, and the change in the voltage curve is more obvious. Therefore, distinguishing a battery’s micro-short circuit is more difficult compared to a serious internal short circuit. In the case of a severe internal short circuit, the impact is immediate and significant. Therefore, it is crucial to identify the faulty battery unit promptly to prevent any potential failure. It is more practical and effective to predict a fault before it occurs, rather than conducting a diagnosis after experiencing a failure.
Due to the scarcity of actual vehicle data and the rapid increase in battery fire incidents, it has become crucial to forecast severe internal short circuits well in advance, potentially even days before they occur. The causes of severe internal short circuits in batteries are multifaceted and can be attributed to thermal, electrical, or mechanical abuse, as well as defects in the battery manufacturing and assembly process. While micro-short circuits in batteries develop at a slower pace and cause less harm than serious internal shorts, there is still a potential for them to escalate to serious internal short circuits. Therefore, diagnosing micro-short circuits is essential for providing timely fault information to owners and manufacturers. This information can facilitate battery replacement and maintenance, ensuring the safety of owners. In this study, the focus will be on diagnosing this specific type of fault.
Unlike an internal short circuit, when there is inconsistency within a battery module, the voltages in some cells will be lower than in other battery singles, as shown in
Figure 3c. This is caused by differences in parameters such as the internal resistance of the cells, the resistance of the connections, power, and capacity. As shown in
Figure 5, under initial conditions, the rated capacity of cell A is low, and the initial SOC of cell C is low. During the discharge process, the inconsistency in the SOC will lead to an inconsistent voltage in the battery cells, which will lead to the over-discharge phenomenon occurring in battery C at the discharge cut-off, and the over-charge phenomenon will occur for battery A with a low initial capacity when it is re-charged. The cause of this type of failure may be subtle differences in the battery manufacturing and formation processes or the different working conditions of the batteries. This type of failure is the most common type of power battery failure. Due to its widespread occurrence, many researchers fail to recognize it as a battery system failure as it does not impact normal battery usage. While battery inconsistency may not cause major damage, certain individual battery cells within a module may experience frequent over-charging and over-discharging, resulting in accelerated battery capacity degradation and potentially compromising safety. An important characteristic of battery module inconsistency, as illustrated by Formula (1), is that the voltage change trend remains relatively uniform when subjected to the same current.
3. Fault Diagnosis Method
3.1. Fault Characterization Indicators
To identify the fault characteristics present in voltage data, it is necessary to abstract the a priori knowledge associated with power battery faults into a parameter that can effectively represent these faults. Given the variety of power battery faults that can occur in real-world operating conditions, the external characterization of each fault type will differ to some extent. Therefore, a fault feature characterization formula has been proposed for various fault types, as illustrated in
Table 1.
Both the extended Root Mean Square Error (RMSE) and the Euclidean metric are distance-based fault characterization formulas. These formulas are effective in identifying the consistency evolution process in power battery modules and can also detect changes in faults after a certain delay when a micro-short-circuit fault occurs in the battery. However, when dealing with batteries that have poor battery consistency, these formulas may not produce ideal results. In such cases, additional processing methods are required to correct the results and eliminate the interference of battery consistency in the diagnosis results.
Both cosine similarity and the Pearson correlation coefficient are mathematical formulas used to measure the similarity between two vectors. These methods are commonly used to determine the voltage condition of a battery, making them useful for the early detection of battery failures. However, it is important to note that these methods can be affected by data fluctuations when applied in real-vehicle conditions. Additionally, the accuracy of these methods relies heavily on the quality of the data collected during the battery charging process. In instances in which the battery data have a high level of randomness during the discharging process, the effectiveness of these methods may be diminished.
The method of using information entropy for battery fault diagnosis relies primarily on the data characteristics of the battery unit itself. It assesses battery faults by analyzing changes in the data vertically and comparing them horizontally with other battery units in the battery module. This method is particularly sensitive to battery waveform data and is effective in identifying minor fluctuations and faults in the early stages of battery failure. However, it is also impacted by the quality of the data, which limits its robustness in real-vehicle conditions.
3.2. LOF Algorithm
The power battery fault diagnosis method for electric vehicles typically relies on comparing individual cells within the battery module. This comparison serves as a basis for discrimination. Electric vehicle batteries typically consist of the same types of battery cells connected in series and parallel. When these batteries are subjected to the same external excitation, the voltage change in the same type of battery should follow a normal distribution [
34]. Hence, any battery unit that falls outside the probability range of the normal distribution can be considered anomalous.
However, comparing differences between cells cannot be the sole criterion for determining battery failure. When a battery fails, its deviation from historical operating conditions in terms of electrical quantity should also be taken into account. Therefore, when designing fault diagnosis methods, it is essential to consider multiple fault characteristic indicators simultaneously.
The literature [
33] discusses how the calculated values of the extended root mean square error can be mapped in a two-dimensional coordinate space. This transformation allows the data variation, initially expressed in a one-dimensional space, to be represented using two-dimensional space coordinates. As a result, fault features at different time scales can be effectively fused. Building on this approach, multiple fault feature indicators can also be mapped to a higher-dimension coordinate space, following the same format. This enables the fusion of multidimensional fault features.
The Local Outlier Factor (LOF) [
35] algorithm is an unsupervised anomaly detection algorithm that identifies anomalous data by comparing the density of data points. Unlike other algorithms, the LOF calculates a factor for each object that indicates its degree of outlierness rather than simply classifying it as normal or an outlier. As a result, the LOF performs well in detecting outliers in high-dimensional data spaces. Therefore, it is a suitable algorithm for identifying outliers in such spaces.
The precise description of local outlier factors is based on several definitions.
Definition 1. The k-distance of
In dataset D, the distance between object and object is denoted as . For any positive integer k, the k-distance of an object p is denoted as . , satisfying the following conditions, is equal to .
There exist at least k objects satisfying.
There exist at most k-1 objects satisfying.
For denser areas, the value of is smaller; for less dense areas, the value of is larger.
Definition 2. The k-distance neighborhood of
The k-distance neighborhood of an object is the set of all objects whose distance to does not exceed the k-distance, defined by the following formula.
Definition 3. The reachable distance of
Assuming that is a natural number, the reachable distance of object about object o is defined as follows:
According to this definition, the reachable distances (
k = 4) of object
and object
about object
in
Figure 6 are equal.
Definition 4. The local reachable density of
Assuming that is a positive integer, the locally reachable density of object is defined as follows:
The local reachable density of object is the inverse of the average local reachable distance of the k-nearest neighbors of object . If object has a high degree of outlying, i.e., few near neighbors in its surrounding region, then its k-distance neighborhood has a wide coverage. For an object o within the k-distance neighborhood of , the probability that object is within the k-distance neighborhood of object is small, and has a higher probability of taking . Conversely, if object lies in a cluster of classes, the probability that object is within the k-distance neighborhood of object o is large for object o within the k-distance neighborhood of p, and has a higher probability of taking . Based on the analysis above, it is evident that outlier objects have small local reachable densities, while the local reachable densities of objects in class clusters are close and have larger values. In other words, local reachable densities provide insight into the density of local spatial regions within a dataset.
Definition 5. The local outlier factors of .
The local outlier factor for object is defined as follows:
If point
is a point with a large outlier degree, its local reachable distance is small, while the local reachable distances of the objects in its neighborhood are large, i.e., the LOF of point
is large. On the contrary, if point
is an object in a taxon, the difference between the locally reachable distance of point
and the objects in its neighborhood is small, i.e., the LOF value of point
is close to 1. This feature eliminates the effect of density differences among different taxa. An illustration of the local outlier calculation procedure for anomaly detection, based on the definition of the LOF, is presented in
Figure 7.
During the operation of an electric vehicle’s battery module, the parameter changes in each battery unit inside it may not be synchronized at all times. This lack of synchronization can be attributed to signal transmission delays and current limitations of the hardware, which remain unresolved for the time being. Changes in battery monomer data often exhibit distinct differences between groups. In other words, the trend in battery monomer changes is generally more consistent within each group but inconsistent between groups. Traditional anomaly detection algorithms typically define anomalies as small numbers of outliers. However, when applied to actual operational data for power batteries, these methods may generate false reports due to significant fluctuations in the data. In contrast, the LOF algorithm utilizes cluster density as a discriminator, thereby identifying data points with genuine anomalies and theoretically reducing the likelihood of false positives.
3.3. Fault Diagnosis Process
The implementation process of the power battery fault diagnosis method incorporating multidimensional fault features is shown in
Figure 8.
Step 1: Extract the battery pack voltage data, i.e., , where M is the number of individual cells in the battery pack. A moving window is used to filter the data for processing the data stream.
Step 2: The fault feature parameters are calculated separately according to different fault feature calculation formulas to obtain different fault feature sequences , where T denotes the number of moving window moves and the fault feature sequences contain the Euclidean metric, Pearson correlation coefficient, information entropy, extended RMSE, and cosine similarity.
Step 3: According to the fault feature, the sequence is converted into coordinates; at this time, M five-dimensional spatial coordinate points
. At this point, the fault diagnosis problem is converted into an anomaly identification problem with five-dimensional spatial coordinate points where
denote the calculated values containing the Euclidean metric, Pearson correlation coefficient, information entropy, extended RMSE, and cosine similarity within the moving window, respectively. To facilitate the uniform calculation of data in each dimension, the calculated results of the above features are further processed using the Z-score normalization method. The formula for calculating the Z-score is as follows:
where
denotes the eigenvalue sequence,
denotes the length of the eigenvalue sequence, and
denotes the eigenfinger sequence after Z-score normalization.
Step 4: Using the above coordinate points as input values for the LOF algorithm, the LOF algorithm is used to calculate the abnormal value for each coordinate point. The result this calculation for a normal vehicle is used as the basis for setting the threshold value to ensure that the calculation result of this algorithm will not issue a fault alarm under normal conditions.
The fault judgment criterion is shown in Equation (7): if the fault feature exceeds the threshold
, an alarm signal is issued for the next judgment; if the fault feature does not exceed the threshold
, no alarm signal is issued, and the next cycle is entered.
4. Fault Diagnosis Based on Real Vehicle Data
To verify the effectiveness of the proposed method discussed in this paper, historical data from the onboard power battery of an electric vehicle currently in use were employed for testing purposes. The aforementioned data were obtained from the most recent discharge cycle preceding the sampling cutoff. The sampling interval is 10 s, and the voltage resolution is 1 mV.
Figure 9 contains the battery voltage data of two electric vehicles within the occurrence of a failure cycle, which were derived from a cloud-based big data platform.
Figure 9 shows the voltage variation curve for each battery cell in the battery pack during the electric vehicle’s operation. The curve represents 96 battery cells connected in series. The gray curve represents the voltage of normal, fault-free battery cells, while the red curve represents the voltage of cells experiencing a fault. From
Figure 9, it is evident that battery cell #12 in car 1′s battery pack and battery cell #31 in car 2 both experienced early internal short-circuit faults. It is worth noting that the voltage variation curves exhibit random perturbations under actual operating conditions, which could potentially impact the accuracy of the fault identification method.
To verify the effectiveness of the method proposed in this paper, we will first analyze how the parameters extracted from different feature indicators distinguish battery failure, using car 2 as an example. A moving window length of 50 is used, and the tolerance in the extended RMSE method is set to 0.5.
Figure 10 shows the results of applying the information entropy method to car 2. The blue curves represent the upper and lower limits of the 3σ threshold, while the red curve represents the curve of the faulty battery.
The horizontal coordinate in
Figure 10 indicates the sampling time point with a sampling interval of 10 s, and the vertical coordinate is the calculated value of information entropy. From
Figure 10, it can be seen that the information entropy method does not have a significant advantage in fault discrimination until the faulty battery is identified at the 3460th sampling point. This identification occurs well after a significant drop in battery voltage. However, the method does not produce any false alarm cases within a reasonable threshold setting range. Additionally, although the fault diagnosis results in a significant delay, this delay does not negatively impact the fault results in the case of multidimensional fusion.
Both
Figure 11 and
Figure 12 contain curves of the original computed results as well as curves of the results after Z-score processing. The curves in red are known as faulty cell curves.
Figure 11 depicts the results obtained from the cosine similarity method applied to car 2. The outcomes reveal that this method effectively differentiates faulty battery singletons between the 300th and 350th sampling points despite significant differences being observed. In contrast,
Figure 10a shows that a single threshold value fails to trigger the threshold alarm for the faulty battery monomer #31.
When examining the correlation coefficient method on car 2 (as seen in
Figure 12), similar results to the cosine similarity method are observed. However, this method is less sensitive to data fluctuations, making it unable to accurately identify faulty cells in the early stages of the fault without employing Z-score normalization. After Z-score normalization, the eigenvalue calculations are unified to a common scale, as illustrated in
Figure 11b and
Figure 12b. Thus, both methods can discriminate anomalous data using a single threshold value.
Figure 13 and
Figure 14 demonstrate the use of the extended RMSE method and the Euclidean metric on car 2, respectively. These methods are both distance-based, resulting in a similar overall trend in the calculated results. However, the extended RMSE method is more sensitive to faults due to its tolerance parameter settings, which leads to higher calculated values for faulty cells. Like the cosine similarity method, the distance-based method can identify faulty cells by using a single threshold at the 350th sampling point.
After converting the above fault characteristic indicators, the results of the LOF algorithm are used to calculate the abnormalities in car 1 and car 2. These results can be seen in
Figure 15 and
Figure 16. For the LOF algorithm, the proximity point parameter is set to 10. In
Figure 15, the red dashed line represents the fault threshold, while the red curve represents the LOF value curve of battery #31, which is the faulty battery. By using the threshold value, the single cell of battery #31 in car 2 can be correctly triggered at the 318th sampling point. In
Figure 16, the LOF value curve for battery #12 correctly triggers the threshold at the 4844th sampling point, indicating an alarm. To avoid inconsistencies in threshold adjustment across different cars, the outlier threshold value is uniformly set at 7.5. This allows batteries with slightly less consistency to avoid falsely triggering the threshold. Typically, a certain number of fault data points are used as training objects to set thresholds and meet the requirements of various application scenarios. In this study, car 2 was used to provide training data for the threshold adjustment, while car 1 was used to provide test data to verify the feasibility of the threshold setting. Results from real car data demonstrate that this method can be appropriately applied across different cars.
It is important to mention that if a certain threshold is set, there will be a slight delay in the fault diagnosis results. However, this delay is still considered acceptable. When it comes to early internal short-circuit faults in car 1, it is challenging to accurately locate these faults using conventional methods. This is because the voltage drop in the faulty cell is not noticeable, and changes in the voltage drop are less obvious. Thankfully, after implementing the LOF algorithm to detect outliers, we can gradually identify the outliers of faulty cell #12 as the fault progresses.
To reflect the state-of-the-art method proposed in this paper, the power battery fault diagnosis method using a differential method to deal with extended RMSE [
33] features is added as a comparison experiment. The method is also applicable to power battery fault diagnosis in real-vehicle conditions, but it uses only one dimension of indicators as a feature extraction method. Its experimental results are shown in
Figure 17 and
Figure 18. In this method, the original calculation of the LOF value is negative, which is the same as the realization of the method used in this paper. The method detects the fault in car 2 at 351 sampling points, with a delay of 330 s in detection time compared to the method proposed in this paper. In car 1, the fault was detected at the 4114th sample point. At the 3880th sample point, the proposed method can also detect the fault, but it needs to consider the fault alarm margin, so the threshold needs to be set as large as possible to minimize false fault alarms in extreme cases. When the threshold is set lower, the proposed method can detect the fault 2340 s earlier than the extended RMSE method. In the extended RMSE method, the threshold value is set low, which improves the sensitivity of fault detection but still lacks robustness. Taken together, the method proposed in this paper outperforms the extended RMSE method in terms of detection time, with an average improvement of 22 min. The results of a detailed comparison with other methods are shown in
Table 2, which compares the ability to detect faults with the time to detect faults and robustness. The results show that the method proposed in this paper ensures the accuracy of detecting faults while improving robustness, which verifies the feasibility of applying the method in real working conditions.
In an ideal state, conventional fault diagnosis methods such as the correlation coefficient method and information entropy method are not affected by data fluctuations, and they can accurately determine the occurrence of faults. However, in a non-ideal state, the data fluctuation of the power battery will cause these methods to produce frequent false alarms. The method proposed in this paper combines parts of fault extraction from traditional methods so that it can cope with various types of faults under ideal conditions. However, data fluctuations in real situations can cause them to experience frequent false alarms, so in the process of setting the threshold, although we only used the battery data of car 2 as a training object, the battery data of car 2 contain battery data under normal conditions and data under abnormal conditions. In the battery data under normal conditions, there are not only normal battery voltage data but also many sampling errors with large fluctuation conditions. These rough data were used for training during the threshold-setting process, making it possible to maximize the fault diagnosis process without false alarms. Similarly, we performed training with data from car 1 and validation with data from car 2 to achieve the same effect.