1. Introduction
Against the backdrop of global efforts to advance the “dual carbon” strategic goals (carbon peaking and carbon neutrality), new energy vehicles have emerged as a key enabler of global energy decarbonization. As such, the performance monitoring of their core power source—lithium-ion batteries (LIBs), has become increasingly critical [
1,
2,
3]. While LIBs have gained widespread adoption in the industry due to their high energy density and long cycle life, the irreversible aging that occurs during continuous charge–discharge cycles leads to capacity degradation and significantly increases the risk of thermal runaway [
4,
5,
6]. When state of health (SOH) drops below the 80% threshold, the LIB reaches its retirement criteria and can no longer meet the power and energy requirements of the device. Therefore, accurately estimating the SOH of LIBs helps to avoid the risks of overcharging and over-discharging, as well as safety accidents, in advance.
To date, the SOH estimation methods are mainly divided into two methods: model-based and data-driven [
7]. In the model-based method, common models include electrochemical models and equivalent circuit models. Xiong et al. simplified the traditional pseudo-P2D model by thoroughly analyzing the electrochemical principles of LIBs [
8]. Then, they identified the model parameters using an optimization algorithm. Subsequently, they selected five aging characteristics to describe the SOH of LIBs, with an error of no more than 3% in the SOH estimation. Hakeem et al. established an effective reduced equivalent circuit model and obtained high-frequency and mid-frequency electrochemical curves from it [
9]. Then, the LIB capacity was predicted based on these extracted data. The results showed that predicting battery capacity based on impedance related to charge transfer resistance can achieve satisfactory estimation accuracy. Model-based methods have the advantage of being physically interpretable and have been widely used in the past. However, the premise of such methods is a physical model that can accurately reflect the internal mechanism of the LIB. In addition, model parameter identification methods also pose challenges and cannot reliably and stably identify the key parameters of LIBs.
In recent years, deep learning technology has seen rapid development, directly promoting in-depth research into data-driven SOH estimation methods. This method has the advantage of not requiring knowledge of the internal mechanisms of LIBs and can build the SOH estimation model using historical aging data [
10]. Therefore, the focus of this method is on model selection and health feature extraction. Peng et al. used a long short-term memory (LSTM) model and estimated the LIB SOH by extracting physical quantities such as time and energy [
11]. He et al. extracted multiple health characteristics from different perspectives, such as time and incremental capacity, and achieved SOH estimation through an improved informer model [
12]. Buvansen et al. extracted nine different health characteristics from voltage, current, and temperature, and constructed a convolutional neural network–bidirectional LSTM model to achieve SOH estimation [
13]. The results show that the proposed method has significantly lower estimation errors than LSTM, bidirectional LSTM, and other models. The common feature of the above studies is that they all require the extraction of physical characteristics that reflect the SOH of LIBs. These characteristics may originate from physical quantities such as voltage, temperature and so on. However, the aforementioned studies have not fully exploited the degradation information contained in the charging curves of LIBs, especially in terms of the health characteristics of the charging temperature curves.
On the other hand, in model selection, deep neural networks have some hyperparameters that are difficult to manually adjust, such as the initial learning rate and regularization parameters. These hyperparameters are critical to the accuracy of model estimation. The genetic algorithm–particle swarm optimization (GAPSO) algorithm possesses efficient global and local search capabilities, enabling automatic optimization of the model’s critical hyperparameters [
14]. This effectively overcomes the blindness and tediousness of manual parameter tuning, significantly improving model construction efficiency. The bidirectional gated recurrent unit (BiGRU) neural network, with its unique bidirectional structure, can fully exploit the forward and backward long-term dependencies embedded in battery time-series data, accurately capturing the complex nonlinear dynamic features during the capacity degradation process [
15,
16,
17]. This complementary fusion strategy enables GAPSO-BiGRU to not only possess robust feature learning and pattern recognition capabilities but also demonstrate enhanced robustness, providing a strong foundation for reliable and stable prediction of battery SOH in complex real-world application scenarios.
Based on the above research, we propose a SOH estimation method based on multidimensional health characteristics and GAPSO-BiGRU. The main contributions are as follows:
- (1)
Differential thermal voltammetry (DTV) curves were established based on historical aging data from LIBs, and peak and trough values and their corresponding positions were extracted from these curves. The maximum charging temperature time and average temperature were extracted from the charging temperature curve. The correlation between the above characteristics and SOH was verified using Pearson’s correlation coefficient.
- (2)
Based on the extracted health features mentioned above, a SOH estimation model for LIBs was established using the BIGRU, which has the ability to describe time-series dependencies.
- (3)
To avoid manual adjustment of the model’s hyperparameters and improve estimation accuracy, GAPSO is used to optimize the number of hidden layer units, initial learning rate, and regularization parameters of the model.
The organization of the paper is as follows:
Section 2 introduces the aging dataset of LIBs and the extraction method of health characteristics.
Section 3 presents the establishment method of the SOH estimation model for LIBs, and explains the principles of BiGRU and GAPSO.
Section 4 presents the specific experimental setup and results.
Section 5 is the conclusion of the paper.
2. Multi-Dimensional Health Feature Extraction Based on the Charging Curves
The SOH of LIBs is an important indicator for assessing the working condition of batteries. SOH indicates the degree of battery aging and is usually defined as the ratio of the battery’s current capacity to its rated capacity, i.e.,
where
is the current capacity of the LIB.
is the initial nominal capacity of the LIB. Generally, the charging process of LIBs is relatively controllable, typically involving constant current-constant voltage (CCCV) charging. In contrast, the discharge process is influenced by usage conditions and application scenarios. During the lifespan of a LIB, its operating conditions may undergo significant changes, making it difficult to extract effective health characteristics that reflect aging. Therefore, extracting health characteristics that reflect the aging of LIBs from the charging curves is of practical significance and can be more effectively applied in actual industrial applications.
2.1. Lithium-Ion Battery Aging Dataset
The dataset used in this article was developed by researchers from the University of Oxford in the UK [
18]. The dataset comprises eight pouch-type experimental batteries produced by Kokam, featuring graphite anodes and LCO/NCO mixed cathodes.
Table 1 contains the detailed information on this type of battery. All batteries were subjected to charge–discharge cycling tests at a constant temperature of 40 °C, using a CC-CV charging scheme, and discharge based on the dynamic current profile of the Artemis urban driving cycle (with a peak rate of up to 3C). The experiment recorded parameters including voltage, current, temperature, and energy, with a total data volume of approximately 150,000 data points.
Figure 1 shows the SOH curves of the eight batteries. This dataset shows that even LIBs of the same model type exhibit different aging curves due to manufacturing constraints and subsequent usage. In
Figure 1, some LIBs age significantly faster than others. After a number of cycles, some of the batteries did not age as expected. In other words, the SOH of these batteries did not drop to the standard for retired LIBs. In addition, the SOH of one of these batteries decreased rapidly after about 4000 cycles. This could be due to abnormal changes in the internal materials of this battery. Therefore, we also excluded it. We finally used only cells 1–4 to validate the proposed method. Aging data from individual cells of the same type will help further validate the applicability of the proposed method.
2.2. Health Characteristics Based on the Differential Thermal Voltammetry
DTV analysis is a method based on changes in temperature and voltage during battery charging and discharging [
19]. LIBs generate thermal effects caused by internal resistance during charging and discharging, and changes in differential thermal voltammetry can reflect the thermodynamic characteristics and aging status of the battery. The DTV calculation formula is as follows:
where
is the temperature of a LIB.
is the terminal voltage of a LIB.
is the sampling time. The DTV analysis method records the differential of temperature with respect to voltage to form a DTV curve. Since the original data, especially temperature, is susceptible to measurement noise, advanced filtering methods are required to obtain an effective DTV curve. This paper uses an SG filter as a low-pass filter to smooth the curve. The DTV curve contains local features related to battery aging. The SG filter uses local polynomial fitting to smooth noise while retaining these key feature points, thereby avoiding the loss of important information due to excessive smoothing. The SG filtering method is as follows:
where
is the filtered output value.
is the local window data of the input signal.
is the pre-calculated filter coefficient.
is the coefficient of the window, and the size of the window is
. Considering the removal of noise and the retention of the basic characteristics of the DTV curve comprehensively, the parameters of the SG filter are: the window length is 23, and the polynomial order is 3.
Figure 2 shows the original DTV curve and the filtered DTV curve.
Figure 3 shows the DTV curves of four different batteries. In four LIBs, both the peak and valley values change as the lithium-ion battery ages. Specifically, as the battery gradually ages, its peak value gradually shifts to the right, while the valley value shifts in the opposite direction. This is due to the presence of positive and negative phase transitions between these phase transitions, which may be accompanied by entropy jumps appearing as inflection points on the DTV curve. Therefore, the peaks and valleys of the DTV curve can reflect the current positions of phase transitions in the cathode and anode. This means that we can extract the above physical quantities to reflect the SOH of the LIB.
The specific mathematical descriptions of the peaks and valleys can be expressed as follows:
where
is the mapping function between voltage and DTV.
and
represent the voltages at the previous sampling time point and the subsequent sampling time point, respectively. Specifically, the peaks and valleys of the DTV curve correspond to battery degradation and phase transitions in the combination of cathode and anode materials. In addition, the phase transition process of LIBs is accompanied by changes in temperature. Therefore, changes in the battery’s surface temperature can indicate the extent of phase transitions, and changes in terminal voltage indicate the type of phase transition. Thus, the DTV method can bridge macro-level signal characteristics and micro-level degradation characteristics through the phase transition characteristics of temperature and terminal voltage, which are relatively easy to measure. Based on the above analysis, the peak, peak position, valley, and valley position are extracted as HF1, HF2, HF3, and HF4, respectively.
2.3. The Average Temperature of the Charging Temperature Curve
During the aging process of LIBs, their internal resistance gradually increases, leading to an increase in their internal heat generation rate and, consequently, an increase in their temperature [
20]. Therefore, as the number of cycles increases, their charging temperature curve gradually changes.
Figure 4 shows the temperature curve during the charging phase of battery 1 at different cycle intervals.
Thus, we can extract the average temperature (HF5) from the charging temperature curve as a health characteristic of LIBs. The average charging temperature is defined as the arithmetic mean of all temperature sampling points during a complete charging cycle, as shown below:
where
represents the temperature value of the
sampling point.
represents the total number of sampling points for a single charging cycle.
represents the average temperature of this cycle.
2.4. The Time at Which the Maximum Temperature Occurs in the Charging Temperature Curve
During charging, the temperature of LIBs is affected by internal heat generation and heat exchange with the external environment. Therefore, in general, the temperature curve of LIBs will show a peak point. This peak point is significantly related to the internal heat generation of LIBs. Internal heat generation is in turn affected by the SOH of LIBs. Therefore, analyzing the temperature peak points of LIBs under different cycle periods can help us observe the internal aging trends of LIBs.
Define the start time of charging as
and the end time as
. The temperature data collected on the time series
is
. First, calculate the maximum value in the temperature sequence:
where
is the maximum temperature. Subsequently, determine the time point corresponding to the maximum temperature:
where
is the time when the maximum temperature occurs. Therefore, we define the time when the maximum temperature occurs as HF6.
Figure 5 shows the time at which the maximum temperature occurs in different cycles for different batteries. The results show that as LIBs age, the time at which their maximum temperature occurs gradually advances. This is because the internal resistance of LIBs increases during aging, leading to increased internal heat generation, which ultimately results in a faster rate of temperature increase. Therefore, this health indicator exhibits a trend that is highly similar to changes in SOH. Therefore, the use of this health feature should be effective in better estimating the SOH of LIBs.
2.5. Correlation Analysis of Health Characteristics
To quantitatively analyze the correlation between health characteristics and SOH, we used Pearson’s correlation coefficient, described by the formula [
21]:
where
is the health characteristics.
is the SOH of a LIB. Therefore, by calculating the Pearson correlation coefficients between different health characteristics and SOH, we can intuitively understand the linear correlation between different health characteristics and SOH, thereby better establishing subsequent LIB SOH models.
Figure 6 shows the heat map of the Pearson coefficients between the extracted health features and SOH, and the specific values are shown in
Table 2.
The results indicate that all six proposed health features exhibit good Pearson correlation coefficients, with the lowest absolute value reaching 0.68. This suggests that the proposed health features are effective in characterizing the SOH of LIBs across all four batteries. Among these, the peak, peak location, and valley exhibit relatively high Pearson correlation coefficients, with the lowest absolute value reaching 0.93. In contrast, the correlation between the SOH and the valley position, average temperature, and the time at which the maximum temperature occurs is relatively weaker. This may be related to noise in temperature measurements. Overall, all six health features are effective and can support SOH estimation.
3. Lithium-Ion Battery SOH Estimation Method Based on GAPSO-BiGRU
In this section, we first introduce the principles of BiGRU, then introduce the principles of GAPSO and its optimization process, and finally introduce the process of the GAPSO-BiGRU estimation model.
3.1. BiGRU Neural Network
The GRU serves as a simplified model of the LSTM network, offering fewer parameters while maintaining comparable modeling performance to LSTM. Theoretically, the cell state mechanism of LSTM may be more effective at capturing long-range temporal dependencies. However, in SOH estimation, the sequence length typically does not reach the theoretical threshold where LSTM demonstrates its advantages. Therefore, the advantage of the GRU model over the LSTM model is its ability to significantly reduce the overall training time and computational cost of the model. As a result, GRU has found widespread application in temporal modeling. BiGRU is a model composed of two stacked GRUs, with one processing data in the forward direction of the time series and the other processing data in the reverse direction. This configuration helps the model better understand the temporal relationships in the data.
Figure 7 shows the network structure of BiGRU and the single-layer cell structure of GRU.
3.2. Genetic Algorithm–Particle Swarm Optimization
The hyperparameters of neural network models are generally determined through experience or extensive trial and error. To address this issue, it is more efficient to use optimization algorithms to automatically optimize parameters. The GAPSO used in this paper is a hybrid intelligent optimization algorithm that combines the respective advantages of the well-known PSO and GA. The PSO has a strong convergence ability and can converge quickly, but it is easy to fall into the local optimum prematurely. The crossover operation of GA can combine the information of different particles to produce a completely new and possibly better search direction; the mutation operation can randomly perturb the particles to help the population jump out of the current local optimal region. This greatly enhances the algorithm’s ability to explore unknown regions. The GAPSO combines the two to allow particles to follow the historical optimum and, at the same time, jump out of the local optimum through genetic manipulation, so as to achieve a balance between “exploration” and “exploitation” capabilities. The GAPSO process is shown below.
- (1)
Randomly generate a population containing particles, with the position and velocity of each particle randomly initialized within the solution space:
where
is the uniformly distributed random numbers.
,
are the boundary of the solution space.
- (2)
For each particle , calculate its fitness value , which is the objective function (such as the validation set RMSE of BiGRU):
where
is the sample size.
is the true SOH.
is the predicted value of SOH.
- (3)
Sort by fitness, and retain the first optimal particles to directly enter the next generation:
- (4)
For the remaining particles, crossover is performed according to the probability to generate new individuals. For example, two-point crossover:
where
represents randomly selecting two parent particles
and
, and exchanging some of their dimension values.
- (5)
For the particles after crossover, perform mutation according to the probability , and randomly perturb the value of a certain dimension:
where
is the Gaussian perturbation.
controls the degree of variation.
- (6)
Particle swarm update:
For particle
, update its speed using the following formula:
where
is the inertia weight.
,
are the acceleration constant.
. Then, update its position:
When the iteration number
reaches the maximum iteration number
, or when the fitness convergence reaches the threshold
, stop:
This algorithm combines the respective advantages of GA and PSO. When the PSO particle swarm clusters in a certain region, the mutation operation of GA randomly disturbs the positions of some particles, reactivating the search space. GAPSO focuses on GA’s global exploration in early iterations and PSO’s local optimization in later iterations. This phased strategy performs better in complex nonlinear problems such as SOH estimation. The algorithm can automate hyperparameter tuning to reduce manual intervention, providing an efficient and reliable solution for battery health management.
3.3. SOH Estimation Model Based on Health Characteristics and GAPSO-BiGRU
This paper proposes the SOH estimation method based on the GAPSO-BiGRU model after extracting important health features. The six extracted health features are used as input, and SOH is used as output. GAPSO is used to optimize the number of hidden layer units, learning rate, and regularization parameters of the BiGRU model to establish the SOH estimation model. As shown in
Figure 8, the optimized block diagram of GAPSO-BIGRU is presented. The steps are as follows:
Step 1: Initialize the range of BiGRU hyperparameters. The parameter settings as shown in
Table 3.
Step 2: Calculate the fitness values for each particle and sort them based on these values. Based on each particle’s fitness value, determine the individual optimal position and population optimal position for each particle, and use each particle’s optimal position as the historical optimal position.
Step 3: Perform selection, crossover, and mutation operations accordingly.
Step 4: Update the particle positions, individual optimal positions, and population optimal positions of the particles.
Step 5: Repeat steps 2–4 until the termination condition in Formula (18) is met.
In
Figure 9, the SOH estimation framework used in this paper is shown. Firstly, the DTV curves are extracted from the aging data of lithium-ion batteries. Then, the SG filter is used for filtering, and the peaks, troughs, and their positions are extracted. At the same time, the time when the maximum temperature occurs and the average temperature in the charging temperature curve are also extracted. Subsequently, the early and mid-cycle data of each battery were used as the training set to train the BiGRU model, and at the same time, some key hyperparameters of the BiGRU were optimized using GAPSO. The remaining cycle data was used as the test set to evaluate the performance of the SOH estimation of the model. Specifically, the first 65% of the data is used as the training set, while the remaining 35% is used as the test set.
4. Results and Discussions
4.1. Evaluation Indicators
In statistical analysis, there are multiple statistical indicators that can describe the error between the predicted values and the actual values. Generally speaking, the most common ones are the mean absolute error (MAE) and the root mean square error (RMSE). As shown below:
where
is the sample size.
is the true value of the SOH.
is the predicted value of the SOH. The smaller the two statistical indicators mentioned above are, the higher the estimation accuracy of SOH will be.
4.2. Experimental Results of Different Batteries
In order to more fully verify the effectiveness of the proposed method, it was validated on different LIBs from the Oxford Battery Dataset. In addition, to compare the performance of the model, comparisons were made with some common models, such as BiGRU, BiLSTM, GAPSO-BiLSTM, and GAPSO-BiGRU.
Figure 10 shows the estimated results of the SOH for different batteries. The results indicate that the error in SOH estimation using GAPSO-BiGRU is smaller, enabling more accurate SOH estimation. Furthermore, compared with the other three methods, the error in SOH estimation based on the proposed method is the smallest. In addition, among the four LIBs, the SOH estimation results for Cell2 showed significant fluctuations, resulting in relatively poor estimation performance. This may be due to stronger internal dynamic nonlinearity, causing severe fluctuations in the SOH curve and preventing the model from effectively capturing its dynamic changes. The underlying deeper reasons for this might involve the growth of SEI films, loss of active materials, and lithium-ion exfoliation. However, the health features extracted in this paper are based solely on basic voltage, current and temperature data. Although Cell2 may have undergone abnormal aging, its basic attributes have not changed. It can still reflect the specific aging pattern of lithium-ion batteries through the defined health features, thereby achieving SOH estimation. To further quantify the errors of different methods,
Table 4 lists the MAE and RMSE of SOH estimation for different batteries.
The results indicate that, except for the MAE of the first battery, GAPSO-BiGRU achieved the best results in all statistical metrics for the remaining batteries. Overall, BiGRU had the largest error, BiLSTM had a relatively smaller error, and the statistical metrics of GAPSO-BiLSTM and GAPSO-BiGRU were further reduced compared to BiLSTM. This is because BiGRU and BiSLTM still rely on manually adjusting hyperparameters, which may prevent the models from fully realizing their modeling potential. In contrast, automatically optimizing the model’s hyperparameters via GAPSO can mitigate the issue of estimation errors caused by improper hyperparameters to some extent. The reason GAPSO-BiGRU outperforms GAPSO-BiLSTM is that BiGRU has fewer parameters than BiLSTM, making it easier to optimize via GAPSO and more effectively identify its optimal hyperparameters. In summary, compared with BiGRU, the proposed method reduced MAE and RMSE by 68.35% and 72.29% on average, respectively. Compared with BiLSTM, the proposed method reduced MAE and RMSE by 33.57% and 37.53% on average, respectively. Compared with GAPSO-BiLSTM, the proposed method reduced MAE and RMSE by 33.26% and 33.04% on average, respectively. Therefore, the proposed method is effective and can support high-precision LIB SOH estimation.
4.3. Further Discussions
In this study, the dataset used was generated in a laboratory environment where the ambient temperature was constant at 40 °C. It is undeniable that under different environmental temperatures or charging protocols, these six health characteristics may exhibit different characteristics. Although in this paper, these six health characteristics have a good correlation with the SOH of LIBs, their correlation may weaken under other conditions, thereby reducing the effectiveness of SOH estimation. In practical applications, the most direct method is to collect battery aging data covering a wide temperature range and various charging protocols, and then train the model. With carefully designed training, this can significantly improve the model’s ability to estimate SOH under different environmental temperatures and charging protocols. Additionally, another strategy is to develop compensation algorithms based on this paper, taking into account the effects of different environmental temperatures and charging protocols in the compensation algorithm, thereby avoiding the failure of health characteristics and the decline in the accuracy of the SOH estimation model under complex conditions.
In addition, the health features extracted in this work are entirely based on basic voltage, current, and temperature measurement data. Therefore, they should be applicable to different models and shapes of LIBs. However, for LIBs with different compositions, the correlation between the extracted health features and SOH may weaken, thereby affecting the accuracy of SOH estimation. To address this issue, we can additionally extract specific health features with strong correlations as input for the model. Diversified health features can still enable the model to capture more deep-level nonlinear features, thereby supporting the achievement of stronger SOH estimation performance.
5. Conclusions
To effectively estimate the SOH, we have proposed an SOH estimation method that integrates multi-dimensional health features and GAPSO-BiGRU. In terms of features, this paper extracts multiple health features, such as peaks and troughs, from the DTV curve. In addition, the average charging temperature and the time at which the maximum charging temperature occurs are also extracted. In terms of the model, to ensure the accuracy of the model, a high-precision SOH estimation model is established by optimizing BiGRU through GAPSO. The proposed method was experimentally validated by four different LIBs of the same model. The results show that this method can effectively estimate the SOH of LIBs, with a maximum RMSE of no more than 0.2583% and a maximum MAE of no more than 0.2771%. Compared with other methods, this method significantly reduces the estimation error. The RMSE was reduced by at least 33.04%, and the MAE was reduced by at least 33.26%. Despite the relatively good results of the proposed method, the limited battery dataset still restricts the validation of its generality. Future studies will further validate the applicability of the proposed method using more diverse aging data of LIBs.