Next Article in Journal
Precise Position Estimation of Road Users by Extracting Object-Specific Key Points for Embedded Edge Cameras
Previous Article in Journal
ATIRS: Towards Adaptive Threat Analysis with Intelligent Log Summarization and Response Recommendation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Joint Estimation Method for the SOC and SOH of Lithium-Ion Batteries Based on AR-ECM and Data-Driven Model Fusion

1
College of Instrumentation & Electrical Engineering, Jilin University, Changchun 130012, China
2
College of Chemistry, Jilin University, Changchun 130000, China
*
Author to whom correspondence should be addressed.
Electronics 2025, 14(7), 1290; https://doi.org/10.3390/electronics14071290
Submission received: 24 February 2025 / Revised: 18 March 2025 / Accepted: 19 March 2025 / Published: 25 March 2025

Abstract

:
Accurate estimations of State-of-Charge (SOC) and State-of-Health (SOH) are crucial for ensuring the safe and efficient operation of lithium-ion batteries in Battery Management Systems (BMSs). This paper proposes a novel joint estimation method integrating an Autoregressive Equivalent Circuit Model (AR-ECM) with a data-driven model to address the strong coupling between SOC and SOH. First, a multi-strategy improved Ivy algorithm (MSIVY) is utilized to optimize the hyperparameters of a Hybrid Kernel Extreme Learning Machine (HKELM). Key voltage interval features, including split voltage, differential capacity, and current–voltage product, are extracted and filtered using a sliding window approach to enhance SOH prediction accuracy. The estimated SOH is subsequently incorporated into the AR-ECM state-space equations, where an enhanced particle swarm optimization algorithm optimizes the model parameters. Finally, the Extended Kalman Filter (EKF) is applied to achieve collaborative SOC–SOH estimation. Experimental results demonstrate that the proposed method achieves SOH errors below 1% and SOC errors under 2% on public datasets, showcasing its robust generalization capability and real-time performance.

1. Introduction

As energy demand and pollution rise, countries urgently need sustainable development. Electric vehicles (EVs) are key solutions, with lithium-ion batteries (LIBs) as crucial components [1]. SOH and SOC are vital for safety, performance, and reliability. These parameters are hard to measure directly, making high-precision estimation methods essential for battery technology [2].
The SOC, a key battery performance metric, represents the ratio of remaining power to total capacity [3]. SOC estimation methods fall into three categories. Category 1: The Ah integration method calculates charge changes through current integration. While simple, it suffers from inaccuracies due to initial value errors, current measurement inaccuracies, and aging effects [4,5]. Consequently, it is often combined with model-based or data-driven approaches to enhance accuracy and robustness. Wang et al. [6] proposed a method based on a combination of ampere-hour integration and an extended Kalman filter, which effectively reduces cumulative errors over long-term use. Chang et al. [7] used a new approach of combining neural networks and ampere-hour integral compensation to improve the accuracy of charge-state prediction under various environmental and operating conditions. Category 2: Physical model-based methods use adaptive filtering and state-space expressions for SOC estimation, with SOC and polarization voltage as state variables derived from OCV tests. Zeng et al. [8] developed an SOC estimation method based on a fractional-order model, which better describes the dynamic characteristics of the battery and improves the estimation accuracy. Li et al. [9] proposed an SOC estimation method based on a Dual Cubature Kalman Filter, which effectively improves estimation accuracy by capturing the dynamic behavior of the battery on different time scales. While theoretically interpretable, these methods face limitations [10,11]. They rely on complex electrochemical parameters (e.g., OCV, polarization voltage, and internal resistance) requiring precise identification and increasing model complexity. Category 3: Data-driven methods. Data-driven methods use machine learning techniques to establish a nonlinear mapping relationship between battery operating parameters and the SOC [12], and commonly used algorithms include convolutional neural networks (CNNs), recurrent neural networks (RNNs) [13], gate recurrent units (GRUs) [14], long short-term memory (LSTM) [15], and support vector machines (SVMs) [16]. Ma et al. [17] proposed an SOC estimation method based on the combination of CNNs and UKFs, which took the output of a CNN as the input of a UKF and obtained high-precision SOC estimation through self-correction. El Fallah et al. [18] developed an SOC estimation model based on a deep neural networks (DNNs), which can effectively capture the dynamic characteristics of batteries. Compared to physical models, this approach minimizes reliance on battery characteristic modeling and avoids complex parameter identification. By analyzing extensive battery operation data, it achieves high-precision SOC estimation. However, it demands high-quality, large-scale data, particularly covering diverse operating conditions, as data quality directly impacts model training and prediction accuracy [19].
The SOH is used to measure the degree of battery aging, defined as the ratio of the currently available capacity to the rated capacity, and is an important metric for assessing battery performance and degradation [20]. SOH estimation methods can be classified into model-based methods and data-driven methods. Model-based methods describe battery degradation laws by building mathematical models, such as the equivalent circuit model (ECM), electrochemical model, and fractional order model [21,22]. Li et al. [23] proposed an SOH estimation method based on an improved equivalent circuit model (ECM), which reflects the degradation of battery capacity by identifying changes in model parameters. Chen et al. [24] developed an electrochemical aging model that considers the effect of temperature, improving the accuracy of SOH estimation in different temperature environments. Although these methods have high theoretical accuracy, they rely on many experimental tests and calibrations, and the models are complex and computationally expensive, making it difficult to meet the demands of real-time applications; additionally, aging models are more sensitive to parameters and environmental changes. Data-driven methods use machine learning techniques to capture the nonlinear characteristics of battery degradation and model the relationships between health indicators (e.g., incremental capacity, differential voltage, and partial energy characteristics) and SOH, and common algorithms include RNNs [25], LSTM [26], and Gaussian process regression (GPR), among others. Peng et al. [27] developed an SOH estimation model based on IGWO–LSTM, which combined with Dropout regularization to suppress model overfitting and improve generalization ability. He et al. [28] applied GPR to SOH estimation, quantifying the uncertainty of predictions through a probabilistic model and enhancing the reliability of decision-making. Data-driven methods do not require tedious parameter identification, but they have high requirements for data quality and quantity and high computational overheads.
In addition, since single estimation methods tend to ignore the strong coupling between SOC and SOH, methods for jointly estimating the two have gained much attention. Battery degradation has a significant effect on the accuracy of SOC estimation, while inaccurate SOC estimation results may in turn interfere with SOH calibration. To address this issue, researchers have developed machine learning, multi-timescale, and dual-filtering methods for the joint estimation of SOC and SOH. Zeng et al. [29] proposed two RBF-ARX models to capture the nonlinear dynamics of batteries and establish the association between SOC, SOH, and observed values. The initial state is then sampled and inferred using the MCMC method and finally combined with UKF for joint estimation. The experimental validation uses multiple data sets to demonstrate the effectiveness of the method. Yang et al. [30] proposed a complementary cooperative algorithm based on the combination of double Kalman filtering (DEKF) and pattern recognition, which achieves high-precision joint SOC–SOH estimation. Wei et al. [31] proposed an algorithm combining an adaptive central difference Kalman filter and discrete-time sliding mode observer (ACDKF-DSMO) for improving the robustness and accuracy of SOC and SOH estimation of lithium-ion batteries. In addition, Li et al. [32] showed that fractional-order models for joint SOC–SOH estimation have higher accuracy than traditional integer-order equivalent circuit-based models and can better describe the nonlinear characteristics of batteries.
Different neural networks have different hyperparameters. Selecting an appropriate optimization algorithm to automatically search for the best parameter combination solves the inefficiency and subjectivity of manual parameter tuning and can significantly improve model performance. Ge et al. [33] constructed an improved IBA-ELM model for joint SOC–SOH estimation, and improved the model performance through an optimization algorithm. Wang et al. [34] proposed an improved firefly algorithm (IFA), improved the prediction performance of the GPR model from the perspective of the internal prediction process, and applied it to the joint estimation of SOH and SOC. Ghasemi et al. [35] first proposed the theoretical framework of the basic Ivy algorithm, which can efficiently search for the optimal solution in the solution space by simulating the characteristics of ivy growing towards favorable resources. Zhang et al. [36] improved the basic Ivy algorithm by introducing an adaptive perturbation factor and adaptive growth rate, significantly improving the convergence speed and accuracy of the algorithm when dealing with high-dimensional nonlinear problems. To address the deficiencies of existing SOC–SOH joint estimation methods and optimization algorithm research, this paper proposes a novel hybrid framework that combines an autoregressive equivalent circuit model (AR-ECM) and a data-driven method to achieve high-precision joint estimation of SOC and SOH. The contributions of this research are briefly described as follows:
  • We propose a novel hybrid framework integrating AR-ECM with data-driven models for SOC–SOH joint estimation. The framework adaptively incorporates SOH into SOC calculations, enabling precise estimation under battery aging conditions.
  • A systematic feature selection methodology is developed combining voltage segmentation and multiple correlation analysis (MCA). This approach integrates key voltage characteristics (SV, dQ/dV, and InV) to enhance SOH prediction accuracy.
  • We present an improved Ivy algorithm (IVYA) with chaotic mapping and tangent flight operators for optimizing HKELM hyperparameters, significantly improving convergence speed and estimation precision.
  • Comprehensive experiments across multiple datasets demonstrate the framework’s effectiveness, achieving SOH errors below 1% and SOC errors under 2% under various operating conditions and temperatures.
The organization of the paper is as follows: Section 2 describes the dataset and the data processing process in detail. The SOC and SOH co-estimation method is described in Section 3. Section 4 shows that the predictions were accurate and that the proposed approach worked. The conclusions are presented in Section 5.

2. Data Acquisition and Processing

2.1. Dataset

This paper utilizes two datasets to evaluate the efficacy of the proposed method. Dataset I is the University of Michigan Battery Laboratory (UMBL) [37] for joint SOH–SOC estimation, and Dataset II is the McMaster University in Hamilton dataset (LG_HG2) [38] for validating the universality of the SOC estimation method. Figure 1a,b displays the Dataset I SOH and the voltages of Groups I-Cell 1. Figure 1c–e displays the voltage and SOC curves for various dynamic duty cycles. This paper uses three common temperatures: −20 °C, 25 °C, and 40 °C.

2.2. Relationship Between the SOC and SOH

The relationship between the SOC and SOH is outlined in the literature [39]. Setting the current SOH to 70.5%, the results are shown in Figure 2 without considering SOH, where the predicted SOC when the battery is charged to the upper cutoff voltage does not reach 100%. When considering the effect of SOH, the impact of SOH on SOC decreases gradually as SOH approaches the actual SOH value. Therefore, it is necessary to predict SOC and SOH jointly.

2.3. Data Processing

2.3.1. Segment Voltage Selection

Since charging mode is easier to control than discharge mode, two voltage time integrations of 3.8–4.0 V and 4.0–4.2 V in the charging phase of Group I-Cell 1 in Dataset I are taken as examples, as shown in Figure 3a. Figure 3b shows the capacity degradation curves for the two voltage time integration segments. The segmented capacity degradation curve closely mirrors the overall capacity degradation curve, accurately reflecting the capacity regeneration characteristics of the battery. The Pearson correlation coefficient (PCC) is used to quantify the relationship between these two curves, as indicated in Equation (4):
ρ x , y = cov ( X , Y ) σ X σ Y i = 1 n ( x i x ¯ ) ( y i y ¯ ) i = 1 n ( x i x ¯ ) 2 i = 1 n ( y i y ¯ ) 2
where x i denotes the feature, y i denotes the SOH, and x ¯ and y ¯ denote the average of the data x i and y i , respectively. The selection of the constant current charging segment is determined by two key factors. The voltage rise time and SOH’s PCC are the first points. The second is that 20% to 80% is the typical battery SOC. Figure 3c shows the PCCs between Group l-Cell 1 and the SOH for different up–down voltage ranges.
According to the above analyses, different voltage bands capture the full capacity degradation pattern differently. Furthermore, the voltage range has a significant effect on certain HIFs. The peak is approximately 3.7 V on the d Q / d V curve, for example. To enhance the correlation between HIFs and capacity, selecting an optimal voltage band is essential. The objective of this paper is to maximize the sum of the absolute PCCs between the four HIFs and the SOH selected within the voltage interval, as expressed in the following equation:
max   P m a x = i = 1 4 p i
where P max is the maximum correlation coefficient sum obtained and p is the correlation coefficient between the HIFs and the capacity at the i th time.
Real-world EV users usually start charging before their SOC reaches 0% and typically stop charging before reaching 100%, especially regarding rapid charging. This pattern is influenced primarily by concerns regarding overcharging, which is frequently exacerbated by anxiety associated with range limitations. Finding the most constrained voltage range is essential for improving this charging strategy. Adopting this method reduces computational expenses and enhances the availability of vital data in accurate capacity assessment. Taking these factors into account, this paper suggests certain limits on both the SOC and the voltage range that is allowed:
10 % < S O C < 80 % 0.05 V < V b V a < 0.25 V
where V a is the left endpoint of the voltage segment and V b is the right endpoint. According to Equations (5) and (6), the problem of voltage interval optimization is a multi-constrained nonlinear programming problem. This paper uses PSO along with adaptive inertia weights to solve it. The Dataset I voltage selection intervals are described in detail in Supplementary Table S1, and the solution algorithms are described in Supporting Document S2.

2.3.2. Feature Extraction

In feature extraction, it is often desirable to obtain high-quality features via simple data operations. Therefore, for the HIFs extracted from the time–voltage curves of the charging process and the d Q / d V curves, on the basis of the analysis in Section 2.3.1 and taking Group I-Cell 1 of Dataset I as an example, the voltage interval with the highest correlation coefficient is [3.53–3.81], as shown in Figure 4a, and the total sum of the correlation coefficients is 2.987. Within the same voltage range, as the number of cycles increases, the charging time and charging capacity decrease, and the rate of voltage rise (slope of the curve) increases. Therefore, four parameters, the voltage integral over time ( S ), the voltage rise time ( d v / d t ), the voltage slope ( K ), and the intercept ( b ) are selected as the basic features.
Noise significantly impacts test results if the voltage measurement interval is too short. Important information could be lost if the voltage interval is set too wide. Therefore, every 0.05 V was chosen and the d Q / d V still contains considerable noise. Therefore, filtering methods can further process the data and we utilize the moving average filtering method. The principle of the filter can be described as
y ( n ) = 1 w i n d o w s i z e ( x ( n ) + x ( n 1 ) + + x ( n ( w i n d o w s i z e 1 ) ) )
In Figure 4b, comparison of the filtered d Q / d V curve with the original curve obtained from the original d Q / d V data reveals that the filtered curve is smooth and retains the features on the curve well. Figure 4c shows that the peak of the d Q / d V decreases and shifts to the right with increasing numbers of cycles, and two indicators, the d Q / d V peak and d Q / d V peak position are intimately associated with battery aging, so they are chosen as the HIFs.
According to the literature [40], for different discharge current demands, the battery discharge voltage distribution may be different, but the product of I n * V is relatively constant. In this work, this characteristic is used to modify the voltage fragment data obtained from the analysis as follows. First, the new capacity ( Q I n V ) resulting from the product of the voltage data V ( t ) and its corresponding charging current increasing to a power of 0.05 ( V ( t ) * I ( t ) 0.05 ) was utilized, as shown in Figure 4f. Next, all the start times of the voltage segments were time-shifted so that each voltage segment started at time 0, yielding the T I n V Q I n V curve, as shown in Figure 4e. The I n V data were then normalized in the time domain. The T I n V Q I n V curve was generated by subtracting period 1 ( T I n V 1 ) from period i ( T I n V i ). The features of normalized T I n V and normalized Δ I n v were obtained, as shown in Figure 4f. The features extracted in this paper are shown in Table 1.

2.3.3. Feature Selection in Multiple Correlation Analysis

In machine learning, feature selection is critical. It is important to remove useless or redundant features to eliminate noise from irrelevant features and make model predictions more accurate. Five types of correlation analyses were used to analyze feature data via multiple correlation analysis (MCA) [41], including maximum information coefficient (MIC) [42], Pearson correlation coefficient (PCC), Kendall correlation (KC) [43], gray correlation analysis (GRA) [44], and Spearman correlation (SC) [45]. We take Group I-Cell 1 of Dataset I as an example, and the results are shown in Figure 5.
This study comprehensively evaluated the importance of each HIF. All the extracted features are shown in Figure 5a and different correlation results were obtained according to the MCA, as shown in Figure 5b. The obtained coefficients are sorted by features according to absolute value, and the highest-ranking HIFs are assigned the highest weight of 1, decreasing by 0.125 in turn. The weighted scores for each HIF are then added together to obtain the final total, as follows:
H i = s o r t ( H I F i , j ) * ( 1 ( j 1 ) * 0.125 ) S j = i = 1 5 H i , j ,   i = 1 , 2 , 5 , j = 1 , 2 , 8
where S j denotes the sum of the HIF coefficients. Figure 5c shows the ranking of all the HIFs on the basis of their importance in obtaining the features that are most closely linked to the cell’s SOH. The features above the average are selected as the input to the prediction model.

3. Method

3.1. SOH Estimation Method

3.1.1. HKELM

HKELM is an improved ELM method, which combines the advantages of global and local kernel functions to improve the generalization and prediction performance of the model. In order to improve the performance of the ELM algorithm, a kernel function is introduced. The polynomial kernel has strong generalization ability but is not suitable for local prediction, and the Gaussian kernel has good robustness but limited accuracy. Therefore, the hybrid kernel function was introduced to combine the two and strengthen the ELM model.
The hybrid kernel function is formulated as follows:
K ( x i , x j ) = μ · K local ( x i , x j ) + ( 1 μ ) K g l o b a l ( x i , x j )
Local kernel function:
K R B F ( x i , x j ) = μ · exp ( x i x j 2 2 σ 2 )
Global kernel function:
K p l o y ( x i , x j ) = ( 1 μ ) ( α ( x i , x j ) + 1 ) 2
The identity matrix a and penalty coefficient S are introduced to prevent the overfitting phenomenon caused by overcomplexity of the model.
β = H T ( I 0 z + H H T ) 1 L
F ( x ) = K ( H H T + I 0 z ) 1 L
The improved IVYA is used to optimize the (μ, σ, α, and z) parameters of HKELM.

3.1.2. Multi-Strategy Improvement for Ivy

IVYA

The IVYA is a metaheuristic based on group intelligence that was proposed in 2024 [35]. This algorithm focuses on modeling various stages of ivy life. An ivy population’s starting point in the search space is chosen at random via Equation (11). The expression is provided below:
I i = I min + r a n d ( 1 , D ) ( I max I min ) , i = 1 , , N p o p
where r a n d ( 1 , D ) denotes a D-dimensional vector of uniformly distributed random numbers in the interval [0, 1]. I max and I min are the upper and lower bounds of the search space, respectively, and denotes the Hadamard product of the two vectors.
The steps of researching and population searching in the proposed IVYA.
Step 1: Coordinate and order population growth.
Let G υ be the growth rate, ϕ be the growth rate, and φ be the correction factor for deviation from growth. Equation (12) is transformed into a difference equation for the growth rate G υ i ( t ) of member I i as follows:
d G υ ( t ) d t = ψ G υ ( t ) φ ( G υ ( t ) )
Δ G υ i ( t + 1 ) = r a n d 2 ( N ( 1 , D ) ) Δ G υ i ( t )
where the vectors Δ G υ i ( t ) and Δ G υ i ( t + 1 ) denote the discrete-time system growth rate and where r a n d 2 denotes that the probability density function is equal to 1 / ( 2 x ) . The D-dimensional random vector N ( 1 , D ) is a normal distribution random number.
Step 2: Ivy plants grow to obtain sunlight.
In the following equation, the member I i uses member I i i to climb and move logically toward the light source:
I i n e w = I i + N ( 1 , D ) ( I i i I i ) + N ( 1 , D ) Δ G υ i , i = 1 , 2 , , N p o p
with
Δ G υ i = I i ( I max I min ) , I t e r = 1 ; r a n d 2 ( N ( 1 , D ) Δ G υ i ) , I t e r > 1 ;
The vector N ( 1 , D ) has absolute values for its components N ( 1 , D ) and Hadamard divisions denoted by .
Step 3: Spread and evolution of the Ivy plants.
After a phase in which the member I i roams globally through the search space to the nearest and most important neighbor I i i , a phase occurs when I i tries to directly follow the best member of the whole population I B e s t , which is equivalent to searching for a better optimal solution around the member I B e s t . This is denoted as
I i n e w = I B e s t ( r a n d ( 1 , D ) + N ( 1 , D ) Δ G υ i )
Next, the new value of Δ G υ i , the current member growth rate I i n e w , is calculated via the following formula (this formula is identical to the initialization Δ G υ i formula):
Δ G υ i n e w = I i n e w ( I max I min )
Step 4: Survivor selection
To simulate the two alternating stages of an ivy tree’s life, i.e., “climbing” and “expanding”, the following decision-making method is used in the IVYA. When the objective function value f ( I i ) of the member I i is less than a multiple of f ( I B e s t ) , the parameter β = ( 2 + r a n d ) / 2 is used. Then, the ivy tree starts expanding the width of branches and leaves (given by Equation (17)). The ivy ascends and climbs, as described by Equation (19).

Improved Methods

IVYA performs well in functional testing and practical engineering problems, but it has some limitations, such as difficulty with global optimal searching and premature convergence. To this end, this study introduces three enhancement strategies to optimize the IVYA and reduce the risk of falling into local optimal searching.
Improvement 1: A chaotic sequence is generated by using the circular chaotic mapping distributed between [0, 1], a simple process generates the random sequence, and its chaotic characteristics are used to replace random initialization so that the population is more evenly distributed in the search space, which can be expressed as
x i + 1 = mod ( x i + b ( a 2 π ) sin ( 2 π x k ) , 1 ) , a = 0.5   and   b = 0.2 .
Improvement 2: The improved method proposed in this paper is mainly to improve the local searching by the IVYA, Cauchy distribution inverse cumulative distribution (CDICD) function, and Tangent flight operator (TFO); these operators are used as scaling factors to control the step size. The CDICD function can further enhance the search stability of the algorithm. TFO has the function of balancing exploitation and exploration, which enhances the convergence ability of the algorithm.
The Cauchy inverse cumulative distribution can be defined as
f ( x ; a , b ) = 1 π [ b ( x a ) 2 + b 2 ]
When a = 0 , b = 1 is a standard Cauchy distribution, the probability density function is as follows:
f ( x ; 0 , 1 ) = 1 π [ 1 x 2 + 1 2 ]
The cumulative distribution function is as follows:
f ( x ; a , b ) = 1 π arctan ( x a b ) + 1 2
Its Cauchy distribution function can be expressed as
f 1 ( p ; a , b ) = a + b tan ( π ( p 1 2 ) )
p = r a n d n ( 1 , d )
where p is a uniformly distributed random number in the range [0, 1] and where d is the function dimension a = 0 , b = 0.01 . Equation (16) is rewritten as
I i n e w = I i + N ( 1 , D ) ( I i i I i ) + N ( 1 , D ) Δ G υ i ( 0.01 tan ( π ( p 1 2 ) ) ) , i = 1 , 2 , , N p o p
It is similar to the Cauchy function in that it is a tangent function and is calculated as follows:
f = tan ( υ × π 2 )
υ = r a n d n ( 1 , d )
where υ is a uniformly distributed random number in the range [0, 1] and where d is the dimension of the function. Equation (18) is rewritten as
I i n e w = I B e s t ( r a n d ( 1 , D ) + N ( 1 , D ) Δ G υ i ) tan ( υ × π 2 )
To verify the effectiveness of the proposed improved method, this study compares three test functions on the CE2005 function set and the results are shown in Figure 6. The running results and fitness values are shown in Table 2. F1 and F4 are unimodal functions, while F9 is a multimodal function. The solution time and the optimal value are the average values of 30 calculations. The results show that the IIVY algorithm outperforms comparison algorithms such as Ivy, EWOA, IGWO, and PSO regarding convergence speed and solution quality. Especially in the complex multimodal function F9 test, the IIVY algorithm shows obvious performance advantages. Regarding solution time, IIVY is inferior to the PSO algorithm, but the PSO algorithm has lower accuracy in terms of solution quality. The above comparison verifies the effectiveness of the improved IVY algorithm in this paper.

Training and Testing Process

This paper predicts the SOH of LIBs by learning the mapping of feature inputs ( x ) to SOH outputs ( y ). The research employs a dataset comprising four features following MCA selection and SOH values from the preceding n cycles. Model training predicts the next cycle’s SOH value based on past cycles’ SOH values. The study set n = 30 (number of training examples) and F = 4 (number of features after selection), as shown in Figure 7.
The process is as follows:
Step 1: Select the training data via the sliding window method with a window size of n, which is immediately adjacent to the cycle to be predicted, k, to capture the battery degradation patterns and latest changes.
Step 2: Construct the training datasets, each of which includes the SOH values of the past n cycles with the four external feature variables of the previous cycle as inputs and the SOH values of the current cycle as outputs. A total of K sets of such training data were constructed.
Step 3: These data are used to train the MSIVY-HKELM model and determine the model parameters.
Step 4: Test the model in cycle k + 1 to predict new SOH values via past predicted SOH values and external feature variables.
Update cycle k and repeat Steps 2–4. The entire process selects the training data through a moving window of 30 cycles to ensure that the model can adapt to different battery operating conditions and accurately predict future SOH values.

3.2. SOC Estimation Method

3.2.1. AR Equivalent Circuit Model

In this paper, an AR-ECM is established, as shown in Figure 8. U o c v is the open-circuit voltage of the battery, U i n is the internal voltage of the battery, U d is the terminal voltage, and U A R is the internal polarization voltage of the battery. On the basis of the equation between each voltage in Figure 8
U d ( k ) = U o c v ( k ) + U i n ( k ) = U o c v R I ( k ) U A R ( k )
Considering that the polarization voltage of the battery is closely related to the current, using the current as input to the AR model inputs
U A R = i = 1 p a i ( k i )
where p is the model order, a i ( i = 1 , 2 ,   .   .   .   ,   p ) represents the AR model coefficients, and Equation (12) is rewritten as
U d ( k ) = U o c v ( k ) R I ( k ) i = 1 p a i ( k i )
When i = 0 , a 0 is the ohmic internal resistance and a 0 = R and a i ( i = 1 , 2 ,   .   .   .   ,   p ) are the AR model coefficients for the polarization voltage drop.

3.2.2. AR-ECM State-Space Equation

On the basis of Figure 8, U i n is represented as ( R I ( k ) i = 1 p a i ( k i ) ) at time k and the recursive expression is as follows:
U i n ( k + 1 ) = U i n + R ( I ( k + 1 ) I ( k ) ) + i = 1 p a i ( I ( k i + 1 ) I ( k i ) ) = U i n ( k ) + i = 1 p a i Δ I ( k + 1 i )
where Δ I ( k + 1 i ) = I ( k + 1 i ) I ( k i ) when i = 0 , a 0 is the ohmic internal resistance, and a 0 = R and a i are the coefficients of the polarization voltage drop AR model.
The SOC uses the Ansatz integral method to establish the equation of state in discrete form:
S O C ( k + 1 ) = S O C ( k ) + I ( k ) * η Δ t S O H ( K ) * C r a t e d
is the sampling time and nominal capacity, a coefficient related to the temperature and charge/discharge multiplier, and can be expressed considering SOH as being in a long-term changing state, as follows:
S O H ( k + 1 ) = S O H ( k )
Using Equations (34)–(36), the state-space equations are obtained as follows:
S O H ( k + 1 ) = S O H ( k ) S O C ( k + 1 ) = S O C ( k ) + I ( k ) * η Δ t S O H ( k ) * C r a t e d U i n ( k + 1 ) = U i n ( k ) + i = 0 p a i Δ I ( k + 1 i )
where the state transition function is x ( k ) = [ S O H ( k )   S O C ( k )   U i n ( k ) ] T and the measurement function is u ( k ) = [ I ( k + 1 )   I ( k ) , , I ( k + 1 p ) ] T . The terminal voltage is the observation variable and the observation equation is as follows:
U d ( k ) = G x ( k ) + U o c v ( S O C ( k ) ) + υ ( k )
According to the literature [46], U O C V can be expressed as a function of the battery S O C ( k ) at time k , as follows:
U O C V ( k ) = K 0 + K 1 S O C ( k ) + K 2 S O C ( k ) + K 3 ln ( S O C ( k ) ) + K 4 ln ( 1 S O C ( k ) )

3.2.3. Extended Kalman Filter SOC Estimation

For the power battery, its actual working conditions should be considered and, since the SOC cannot be measured directly and presents a nonlinear state throughout the charging and discharging process, it is more reasonable to use the extended Kalman filter [47].
The EKF implementation steps are as follows:
(1) Obtain the initial state SOC values and initial values of the error equation matrix, as follows:
x 0 = S O C 0 , P 0 = var ( x 0 )
where x 0 and P 0 are the initial values of the system state variables and the initial values of the state variable error covariance matrix, respectively.
(2) State variable prediction estimation:
State vector prediction
x ^ k + 1 | k = f ( x ^ k , u k )
A k 1 = f ( x k 1 , u k 1 ) x k 1 = 1
C k = y k x k = K 1 x k 2 + K 2 + K 3 x k K 4 1 x k
x k = x k 1 I * Δ t C n
y k = K 0 + K 1 x k + K 2 x k + K 3 ln ( x k ) + K 4 ln ( 1 x k ) + a 0 I k + a 1 I k 1 + a 2 I k 2 + a 3 I k 3
Calculate the error covariance matrix as follows:
P k + 1 | k = A k P k A k T + ω Q ω T
where P k is the state error covariance matrix at k ; Q is the system process noise covariance matrix; Q is a fixed value of 0.00012; k + 1 | k is the recursive result based on the k time for the k + 1 time variable; and ω is process noise.
(3) State variable measurement update
Kalman gain calculation
K k + 1 = P k + 1 | k C k T [ C k P k + 1 | k C k T + v R v T ] 1
where R is the measurement of noise covariance, using a fixed value of 0.00052, and ν is the observed noise. Calculate the SOC optimal estimates and error covariance optimal estimates, as follows:
z ^ k + 1 = g ( x ^ k | k , u k ) x ^ k + 1 = x ^ k + 1 | k + K k + 1 ( z k + 1 z ^ k + 1 )
Error covariance matrix update:
P k + 1 = ( I K k + 1 C k ) P k + 1 | k

3.2.4. AR Model Parameter Identification

Based on Ref. [11], the AR model order   p = 3 a i ( i = 1 , 2 ,   .   .   .   ,   p ) is selected. The parameters to be identified are [ K 0 , K 1 , K 2 , K 3 , K 4 , a 0 , a 1 , a 2 , a 3 ] , and the same algorithm as in Section 2.3.1 is used for parameter identification to reduce the solution time. In this study, we utilized the battery from Group I, specifically Cell 1, to conduct a series of tests over multiple cycles. The results obtained from these experiments are detailed in Figure 9. Throughout the testing, we observed that the error margin was consistently kept within 10 mV, which underscores the high level of accuracy achieved in our measurements. This reliable performance highlights the battery’s effectiveness under varying conditions.

3.3. Joint SOH–SOC Estimation Method

The analysis presented in Section 2.2 demonstrates that the SOH plays a crucial role in the accuracy of SOC estimation results. This paper introduces an innovative combined AR-ECM and data-driven model designed to jointly estimate both the SOC and SOH of LIBs. As illustrated in Figure 10, the approach comprises two main components: the online identification of battery parameters and the joint estimation of battery states, which includes both SOC and SOH estimations.

4. Results and Discussion

This study evaluated SOC and SOH prediction via three error matrices—MAE, MSE, and RMSE—to objectively reveal and compare method performance. During model comparison, MSE and RMSE values closer to zero indicate an acceptable model. The results for the MAE, MSE, RMSE and R 2 , respectively, were as follows:
M A E = 1 N n = 1 N C n C ^ n
M S E = 1 N n = 1 N ( C n C ^ n ) 2
R M S E = 1 N n = 1 N ( C n C ^ n ) 2
R 2 = 1 n = 1 N ( C n C ¯ n ) 2 n = 1 N ( C n C ¯ n ) 2
where C n is the actual battery capacity, C ^ n is the predicted battery capacity, and C ¯ n is the average actual battery capacity.

4.1. SOH Prediction Results

4.1.1. Prediction Results for Different Features

In this subsection, to compare the advantages of MCA feature selection versus single feature selection, following the SOH prediction process described in Section 3.1, the model parameters of the MSIVY-HKELM model are trained using the past 30 sets of training data, and then the obtained model with known parameters is used to predict the SOH values for the next cycle. For example, the SOH estimation results obtained for the Group I-Cell 1 battery in Dataset I with different eigenvalues for most inputs are shown in Figure 11.
The black curve in Figure 11a represents the real SOH recession trajectory, the red line indicates the SOH results obtained by using the features obtained from the voltage segmentation as model input, the blue line indicates the SOH results obtained by using d Q / d V as model input, the green line indicates the SOH results obtained by using the InV transformed features as model input, and the red line indicates the SOH results obtained by using the MCA features after selection as model input. The red line shows the SOH results obtained after MCA feature selection is used as the model input, and the figure shows that the most accurate results are obtained after feature selection is used as the model input. All the errors in Figure 11b are less than 2%, indicating that the model has high accuracy. The prediction results are box plotted in Figure 11c, avoiding extreme outliers. Based on the different feature inputs, the median of the MCA is almost 0, and the IQR range is the smallest among all the subgroups, which indicates that the overall trend in the errors is very close to the true value, with good stability and consistency. Figure 12d shows the results of various feature errors, in which the MAE, MSE, and RMSE of d Q / d V are the largest at 0.4585, 0.2975, and 0.5455, respectively, and the features selected by MCA obtain the smallest MAE, MSE and RMSE values, with values of 0.1451, 0.0251, and 0.1584, respectively. The results are shown in Table 3. This feature selection method is adaptable and accurate based on the experimental results.

4.1.2. Prediction Results for Different Kernel Functions

This section is written because, in HKELM, the choice of kernel function significantly impacts model performance. In this study, the mixed kernel function has obvious advantages compared with the single kernel function. The SOH estimation results for the Group I-Cell 1 battery in Dataset I were obtained according to Section 4.1.1, as shown in Figure 12.
The black curve in Figure 12a represents the accurate SOH decay trajectory, the red line represents the SOH result obtained by using mixed kernel functions, the blue line represents the SOH result obtained by using local RBF kernel functions, and the green line represents the SOH result obtained by global Poly kernel functions. The undesired sum and function errors in Figure 12b are all near 0 and relatively stable. This shows that the hybrid nuclei have high robustness. The results are shown in Table 4. The experimental results show that the mixed kernel function has higher accuracy.

4.1.3. Prediction Results for Different Steps

Under the general definition, battery life can be over when the SOH value decays below 80% of the initial value. The battery’s SOH can be predicted more accurately by predicting the capacity value. Figure 12 shows the prediction performance with different step size prognostications and Table 5 shows the prediction errors. Training and testing are the same but we set the forward period of L cycles to vary the output SOH value. Training with data from the previous cycle yields model parameters for each cycle. A 30-cycle moving window was also used in this experiment. The steps for 5, 10, 20, and 30 cycle forward periods are shown in Figure 13.
As shown in Figure 13a, this method is highly accurate for small forward-looking intervals L, but decreases as L increases. The longer the interval is, the less accurate the forward-looking prediction. The results show that the model better predicts the capacity after 30 cycles.
The black curve in Figure 13b represents the accurate SOH recession trajectory; the red line shows the predicted SOH results for the next five steps; the blue line shows the predicted SOH results for the next ten steps; the green line shows the predicted SOH results for the next twenty steps; and the violet line shows the predicted SOH results for an additional thirty steps. The error is shown in Figure 13c. The prediction accuracy decreases progressively with increasing future prediction step length, mainly when a relatively large deviation occurs after 350 loops for the 20-step and 30-step conditions. Figure 13d shows the box plot lines of the predicted results for different step lengths, and it can be seen that the outliers are particularly prominent in the positive direction with increasing step length, which indicates that episodic errors may have a significant impact on the results. Figure 13e shows the statistical results of the indicator, which clearly shows that the longer the prediction step length is, the larger the error, and the results are shown in Table 5.

4.1.4. Comparison with Different Methods

To further prove the superiority of the proposed SOH co-estimation method, this study is compared with four recently proposed methods. These methods propose different SOC–SOH co-estimation models, including LSTM [48], GPR [49], and LS-SVM [50]. Under the same experimental conditions, Group I-Cell 1 was used in this study for comparative experiments.
Figure 14 shows the SOH estimation results of the above method. The statistical results are shown in Table 6. It can be seen from the results that the method proposed in this paper has significant advantages in SOH prediction. Compared with the actual value and prediction curve in Figure 14a, the prediction trajectory of HKELM is closest to the real SOH decline trend. At the same time, the initial deviation of LS-SVM and the late error of GPR/LSTM are significant. The error percentage in Figure 14b shows that the error of HKELM is strictly controlled within ±0.5%, which is significantly better than that of LS-SVM/GPR (±1%) and LSTM (±2%). The Figure 14c box plot further verifies that HKELM has the most concentrated error distribution (the IQR is only 0.4%) and no outliers, while LSTM has extreme errors. As shown in Figure 14d, the MAE (0.1606), MSE (0.0019), and RMSE (0.1807) of HKELM levels are all the highest, and their comprehensive performances are far superior to LS-SVM, GPR, and LSTM, with severe error fluctuations. The results show that HKELM is the best model choice for SOH prediction, with high accuracy and strong stability.

4.1.5. Dataset I Results for Different Batteries

This subsection explicitly analyzes different cell monomers (Group I-Cell 1, Group II-Cell 5, Group III-Cell 9, and Group IV-Cell 10) in Dataset I to evaluate the model’s prediction ability at different temperatures and charging/discharging conditions. Each subplot consists of two parts: the upper part shows the actual value of the SOH concerning the number of cycles versus the predicted value, and the lower part shows the residual curve of the prediction error. This distribution can be used to visualize model prediction performance and error variation. In the upper central figure, the black curve indicates the actual SOH, reflecting the trend in the battery health state of gradually decaying with increasing charge/discharge cycles and the overall pattern of a nonlinear decline. The red curve indicates the SOH curve predicted by the model, which is compared with the actual SOH data in green, demonstrating the expected performance of the model on different battery monomers. In addition, the Monte Carlo (MC) method yields the probability distribution of the RUL as the purple part. From Figure 15a, the RUL of Cell 1 is 292 cycles, and the RUL probability density distribution (PDD) is 288~300 cycles. From Figure 15b, the RUL of Cell 5 is 284 cycles and the PDD is 288~300 cycles. From Figure 15c, the RUL of Cell 9 is 119 cycles and the PDD is 111~123 cycles. From Figure 15d, the RUL of Cell 10 is 385 cycles and the PDD is 382~394 cycles. The predicted RUL probability density is distributed within a narrow interval close to the failure cycles and accurately covers the actual effectiveness cycles. In the error plots below, each subplot shows how the predicted values deviate from the actual values, with an expected error of ±2%. The degree of fluctuation in the residual curves reflects the accuracy and stability of the model predictions. At some stages, the residual error is small, indicating a more favorable predictive performance of the model. The results are shown in Table 7.

4.2. SOC Estimation Results

4.2.1. Dataset I SOC Estimation Results

To evaluate the robustness of the SOC estimation method, in Dataset I of Group I-Cell 1, the actual initial value of the SOC is 100% when SOH = 0.9 and the initial values are set to 60% and 40%, respectively. The prediction results are shown in Figure 16. The results show that, under different initial values, the method can achieve the convergence of the estimated value and the actual value in approximately 0.2 h with high accuracy.
The final SOC estimation results for Group I-Cell 1 in Dataset I are shown in Figure 16. As shown in Figure 17a–c, joint SOC-SOH estimation can follow the actual value of the SOC of an Li-ion battery more accurately. Figure 17c depicts the errors (residuals) between the predicted and actual values, i.e., the time-domain performance of the estimation error, which is relatively small in magnitude, indicating that the EKF algorithm has good estimation accuracy. Figure 17a is a partial enlargement of Figure 17b, which compares the actual SOC and the EKF-estimated SOC in a specific period in more detail. As shown in Figure 17a, the joint estimation method is used to estimate the battery SOC, which can identify the parameters of the novel equivalent circuit online in real time according to the changes in the battery SOC and charge/discharge cycles and adjust the maximum usable capacity in real time according to the changes in the SOH. Therefore, it can follow the actual values of SOC for LIBs more accurately in the case of battery aging.

4.2.2. Dataset II SOC Estimation Results

Figure 18 shows the results of the battery SOC estimations via the EKF for Dataset II via AR-ECM under three typical operating conditions, namely, US06, UDDS, and LA92, at three temperatures, namely, −20 °C, 25 °C, and 40 °C, respectively. The statistical results are shown in Table 8. The black curve represents the actual SOC values, the red curve represents the SOC values estimated via the EKF, and the light blue curve represents the absolute errors between the estimated values and the actual values. The comparison shows that the EKF shows good estimation performance under different working conditions and temperatures, the estimated values are in high agreement with the actual values, and the absolute errors are always controlled within 2%. Further analysis reveals slight differences in the SOC estimation results under different operating conditions: under US06, owing to the high speed and frequent dynamic changes, the SOC decreases at a faster rate and the estimation error fluctuates slightly over time, whereas under UDDS and LA92, the SOC curves are relatively smooth and the fluctuations in the estimation errors are slight. In addition, the effect of temperature on the SOC estimation is more significant. The estimation errors are slightly greater at low temperatures (−20 °C), which may be due to the increase in internal resistance and the decrease in the electrochemical reaction rate of the battery at low temperatures and other characteristics concerning the accuracy of the model. The SOC estimation results are more stable and accurate in room temperature (25 °C) and high-temperature (40 °C) conditions.
The performance of the model was evaluated by comparative analysis with recent studies in the field, as shown in Table 9, focusing on RMSE and MAE indicators. The results show that the accuracy of the model is significantly improved compared to previous studies, reflecting improved performance under various evaluation criteria.
The reason for the good performance of the EKF model used in this study can be summarized as the combination with the AR-ECM equivalent circuit, which improves the accuracy of the model through the accurate parameter identification method. Overall, the EKF algorithm shows good adaptability and robustness under various complex operating conditions and temperatures. Its high-precision estimation results indicate that it can effectively cope with dynamic operating condition changes and temperature fluctuations, which provides reliable technical support for SOC estimation in practical applications. The results of this study have important theoretical value and practical significance for further improving the performance of battery management systems.

5. Conclusions

This paper proposes a joint SOH–SOC estimation method incorporating the AR-ECM and a data-driven model. First, a new equivalent circuit model that simultaneously considers the effects of battery ageing and the SOC is constructed, and its parameters are optimally solved via the PSO algorithm incorporating adaptive inertia weights. The battery AR model parameters are identified under different SOC and SOH conditions. Then, based on the features extracted from the charging process of LIBs, essential features are selected as model inputs by the MCA method, the battery SOH values are used as outputs, and the MSIVY-DLEM model is used for training, establishing the feature–SOH mapping relationship, and ultimately achieving accurate prediction of the SOH of LIBs. The predicted SOH value is subsequently multiplied by the rated capacity of the battery to calculate the actual capacity of the battery, which is used to update the AR-ECM spatial equation of state to estimate the SOC further. This paper validates the joint SOH–SOC estimation method via the University of Michigan Battery Laboratory and McMaster University Hamilton dataset. The results indicate that the process can track the SOC and SOH values of LIBs more accurately than other methods over the battery life cycle.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/electronics14071290/s1, Table S1. Data set I voltage selection interval. Table S2. Parameter Settings.

Author Contributions

Conceptualization, X.S., C.L. and H.L.; writing—original draft, Z.W.; Writing—review and editing, Y.L. and W.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Major Science and Technology Projects for Independent Innovation of China FAW Group Co., Ltd. (Grant No. 20230301021ZD) and the FAW-Volkswagen China Environmental Protection Foundation Automotive Eco-friendly Innovation Project.

Data Availability Statement

The authors have used publicly available data in this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Masias, A.; Marcicki, J.; Paxton, W.A. Opportunities and Challenges of Lithium Ion Batteries in Automotive Applications. ACS Energy Lett. 2021, 6, 621–630. [Google Scholar] [CrossRef]
  2. Hou, J.J.; Li, T.; Zhou, F.; Zhao, D.F.; Zhong, Y.D.; Yao, L.; Zeng, L. A Review of Critical State Joint Estimation Methods of Lithium-Ion Batteries in Electric Vehicles. World Electr. Veh. J. 2022, 13, 159. [Google Scholar] [CrossRef]
  3. Demirci, O.; Taskin, S.; Schaltz, E.; Demirci, B.A. Review of battery state estimation methods for electric vehicles—Part I: SOC estimation. J. Energy Storage 2024, 87, 111435. [Google Scholar] [CrossRef]
  4. Zheng, Y.J.; Ouyang, M.G.; Han, X.B.; Lu, L.G.; Li, J.Q. Investigating the error sources of the online state of charge estimation methods for lithium-ion batteries in electric vehicles. J. Power Sources 2018, 377, 161–188. [Google Scholar] [CrossRef]
  5. Cheng, K.W.E.; Divakar, B.P.; Wu, H.J.; Ding, K.; Ho, F.H. Battery-Management System (BMS) and SOC Development for Electrical Vehicles. IEEE Trans. Veh. Technol. 2011, 60, 76–88. [Google Scholar] [CrossRef]
  6. Wang, S.L.; Fernandez, C.; Shang, L.P.; Li, Z.F.; Yuan, H.F. An integrated online adaptive state of charge estimation approach of high-power lithium-ion battery packs. Trans. Inst. Meas. Control 2018, 40, 1892–1910. [Google Scholar] [CrossRef]
  7. Chang, W.E.; Kung, C.C. An Improved AhI Method With Deep Learning Networks for State of Charge Estimation of Lithium-Ion Battery. IEEE Access 2024, 12, 55465–55473. [Google Scholar] [CrossRef]
  8. Zeng, J.W.; Wang, S.L.; Zhang, M.Y.; Cao, W.; Fernandez, C.; Guerrero, J.M. Battery multi-time scale fractional-order modeling method for state of charge estimation adaptive to full parameters updating. J. Energy Storage 2024, 86, 111283. [Google Scholar] [CrossRef]
  9. Li, X.; Song, Y.W.H.; Ren, H.Q. State of Charge Estimation of Lithium-Ion Batteries Based on Fractional-Order Model with Mul-ti-Innovations Dual Cubature Kalman Filter Method. J. Electrochem. Soc. 2024, 171, 090510. [Google Scholar] [CrossRef]
  10. Kim, T.; Qiao, W.; Qu, L.Y. Online SOC and SOH Estimation for Multicell Lithium-ion Batteries Based on an Adaptive Hybrid Battery Model and Sliding-Mode Observer. In Proceedings of the IEEE Energy Conversion Congress and Exposition (ECCE), Denver, CO, USA, 15–19 September 2013; pp. 292–298. [Google Scholar]
  11. Liu, F.; Shao, C.; Su, W.X.; Liu, Y. Online joint estimator of key states for battery based on a new equivalent circuit model. J. Energy Storage 2022, 52, 104780. [Google Scholar] [CrossRef]
  12. Zahid, T.; Xu, K.; Li, W.M.; Li, C.M.; Li, H.Z. State of charge estimation for electric vehicle power battery using advanced machine learning algorithm under diversified drive cycles. Energy 2018, 162, 871–882. [Google Scholar] [CrossRef]
  13. Chen, L.P.; Song, Y.J.; Lopes, A.M.; Bao, X.Y.; Zhang, Z.Q.; Lin, Y. Joint Estimation of State of Charge and State of Energy of Lithium-Ion Batteries Based on Optimized Bidirectional Gated Recurrent Neural Network. IEEE Trans. Transp. Electrif. 2024, 10, 1605–1616. [Google Scholar] [CrossRef]
  14. Xiao, Y.J.; Song, W.H.; Liu, W.L.; Wan, F. Estimation of lithium battery state of charge using the LTG-SABO-GRU model. Meas. Sci. Technol. 2024, 35, 115106. [Google Scholar] [CrossRef]
  15. Zafar, M.H.; Khan, N.M.; Abou Houran, M.; Mansoor, M.; Akhtar, N.; Sanfilippo, F. A novel hybrid deep learning model for accurate state of charge estimation of Li-Ion batteries for electric vehicles under high and low temperature. Energy 2024, 292, 130584. [Google Scholar] [CrossRef]
  16. Hansen, T.; Wang, C.J. Support vector based battery state of charge estimator. J. Power Sources 2005, 141, 351–358. [Google Scholar] [CrossRef]
  17. Ma, H.L.; Bao, X.Y.; Lopes, A.; Chen, L.P.; Liu, G.Q.; Zhu, M. State-of-Charge Estimation of Lithium-Ion Battery Based on Convolutional Neural Network Combined with Unscented Kalman Filter. Batteries 2024, 10, 198. [Google Scholar] [CrossRef]
  18. El Fallah, S.; Kharbach, J.; Vanagas, J.; Vilkelyte, Z.; Tolvaisiene, S.; Gudzius, S.; Kalvaitis, A.; Lehmam, O.; Masrour, R.; Hammouch, Z.; et al. Advanced State of Charge Estimation Using Deep Neural Network, Gated Recurrent Unit, and Long Short-Term Memory Models for Lithium-Ion Batteries under Aging and Temperature Conditions. Appl. Sci. 2024, 14, 6648. [Google Scholar] [CrossRef]
  19. How, D.N.T.; Hannan, M.A.; Lipu, M.S.H.; Ker, P.J. State of Charge Estimation for Lithium-Ion Batteries Using Model-Based and Data-Driven Methods: A Review. IEEE Access 2019, 7, 136116–136136. [Google Scholar] [CrossRef]
  20. Alsuwian, T.; Ansari, S.; Zainuri, M.; Ayob, A.; Hussain, A.; Lipu, M.S.H.; Alhawari, A.R.H.; Almawgani, A.H.M.; Almasabi, S.; Hindi, A.T. A review of expert hybrid and co-estimation techniques for SOH and RUL estimation in battery management system with electric vehicle application. Expert. Syst. Appl. 2024, 246, 123123. [Google Scholar] [CrossRef]
  21. Zhang, X.G.; Gao, X.Y.; Duan, L.C.; Gong, Q.S.; Wang, Y.; Ao, X.Y. A novel method for state of health estimation of lithium-ion batteries based on fractional-order differential voltage-capacity curve. Appl. Energy 2025, 377, 124404. [Google Scholar] [CrossRef]
  22. Li, J.J.; Zhou, J.; Wu, X.; Liu, T. Wide temperature multi-condition SOC estimation for Li-ion batteries based on F-FOMIAEKF. Meas. Sci. Technol. 2025, 36, 025012. [Google Scholar] [CrossRef]
  23. Li, C.F.; Yang, L.; Li, Q.; Zhang, Q.S.; Zhou, Z.Y.; Meng, Y.Z.; Zhao, X.W.; Wang, L.; Zhang, S.M.; Li, Y.; et al. SOH estimation method for lithium-ion batteries based on an improved equivalent circuit model via electrochemical impedance spectroscopy. J. Energy Storage 2024, 86, 111167. [Google Scholar] [CrossRef]
  24. Chen, S.Q.; Zhang, Q.; Wang, F.C.; Wang, D.F.; He, Z.Q. An electrochemical-thermal-aging effects coupled model for lithium-ion batteries performance simulation and state of health estimation. Appl. Therm. Eng. 2024, 239, 122128. [Google Scholar] [CrossRef]
  25. Bharath, Y.K.; Anandu, V.P.; Vinatha, U.; Sudeep, S. Long-Term Estimation of SoH Using Cascaded LSTM-RNN for Lithium Batteries Subjected to Aging and Accelerated Degradation. Energy Storage 2024, 6, e70066. [Google Scholar] [CrossRef]
  26. Zhou, J.N.; Wang, S.L.; Cao, W.; Xie, Y.X.; Fernandez, C. State of health prediction of lithium-ion batteries based on SSA optimized hybrid neural network model. Electrochim. Acta 2024, 487, 144146. [Google Scholar] [CrossRef]
  27. Peng, S.M.; Wang, Y.J.; Tang, A.H.; Jiang, Y.X.; Kan, J.R.; Pecht, M. State of health estimation joint improved grey wolf optimization algorithm and LSTM using partial discharging health features for lithium-ion batteries. Energy 2025, 315, 134293. [Google Scholar] [CrossRef]
  28. He, Y.; Bai, W.; Wang, L.; Wu, H.; Ding, M. SOH estimation for lithium-ion batteries: An improved GPR optimization method based on the developed feature extraction. J. Energy Storage 2024, 83, 110678. [Google Scholar] [CrossRef]
  29. Zeng, X.Y.; Sun, Y.K.; Xia, X.Y.; Chen, L. A framework for joint SOC and SOH estimation of lithium-ion battery: Eliminating the dependency on initial states. Appl. Energy 2025, 377, 124624. [Google Scholar] [CrossRef]
  30. Yang, F.; Xu, Y.X.; Su, L.; Yang, Z.C.; Feng, Y.; Zhang, C.; Shao, T. State of Charge and State of Health Estimation of Lithium-Ion Battery Packs With Inconsistent Internal Parameters Using Dual Extended Kalman Filter. J. Electrochem. Energy Convers. Storage 2024, 21, 011004. [Google Scholar] [CrossRef]
  31. Wei, J.J.; Wan, Y.H.; Zhang, C.M.; Hua, P.; Tang, J. Enhanced SOC and SOH estimation for Li-ion batteries based on combining adaptive central difference Kalman filter and discrete-time sliding mode observer. J. Energy Storage 2025, 114, 115671. [Google Scholar] [CrossRef]
  32. Li, Z.X.; Shen, S.Y.; Zhou, Z.; Cai, Z.D.; Gu, W.M.; Zhang, F.Y. Novel method for modelling and adaptive estimation for SOC and SOH of lithium-ion batteries. J. Energy Storage 2023, 62, 106927. [Google Scholar] [CrossRef]
  33. Ge, D.D.; Jin, G.Y.; Wang, J.Q.; Zhang, Z.D. A novel data-driven IBA-ELM model for SOH/SOC estimation of lithium-ion batteries. Energy 2024, 305, 132395. [Google Scholar] [CrossRef]
  34. Wang, C.C.; Su, Y.Y.; Ye, J.L.; Xu, P.H.; Xu, E.Y.; Ouyang, T.C. Enhanced state-of-charge and state-of-health estimation of lithium-ion battery incorporating machine learning and swarm intelligence algorithm. J. Energy Storage 2024, 83, 110755. [Google Scholar] [CrossRef]
  35. Ghasemi, M.; Zare, M.; Trojovsky, P.; Rao, R.V.; Trojovská, E.; Kandasamy, V. Optimization based on the smart behavior of plants with its engineering applications: Ivy algorithm. Knowl. Based Syst. 2024, 295, 111850. [Google Scholar] [CrossRef]
  36. Zhang, C.Q.; Lin, W.Z.; Hu, G. An enhanced ivy algorithm fusing multiple strategies for global optimization problems. Adv. Eng. Softw. 2025, 203, 103862. [Google Scholar] [CrossRef]
  37. Mohtat, P.; Lee, S.; Siegel, J.B.; Stefanopoulou, A.G. Reversible and Irreversible Expansion of Lithium-Ion Batteries Under a Wide Range of Stress Factors. J. Electrochem. Soc. 2021, 168, 100520. [Google Scholar] [CrossRef]
  38. Vidal, C.; Kollmeyer, P.; Naguib, M.; Malysz, P.; Gross, O.; Emadi, A. Robust xEV Battery State-of-Charge Estimator Design Using a Feedforward Deep Neural Network. SAE Int. J. Adv. Curr. Pract. Mobil. 2020, 2, 2872–2880. [Google Scholar] [CrossRef]
  39. An, J.K.; Guo, W.; Lv, T.Y.; Zhao, Z.H.; He, C.G.; Zhao, H.S. Joint Prediction of the State of Charge and the State of Health of Lithium-Ion Batteries Based on the PSO-XGBoost Algorithm. Energies 2023, 16, 4243. [Google Scholar] [CrossRef]
  40. Ang, E.Y.M.; Paw, Y.C. Linear Model for Online State of Health Estimation of Lithium-Ion Batteries Using Segmented Discharge Profiles. IEEE Trans. Transp. Electrif. 2023, 9, 2464–2471. [Google Scholar] [CrossRef]
  41. Dai, H.D.; Lai, Y.; Huang, Y.Y.; Yu, H.; Yang, Y.X.; Zhu, L.Q. State-of-health estimation of lithium-ion batteries using multiple correlation analysis-based feature screening and optimizing echo state networks with the weighted mean of vectors. J. Power Sources 2024, 623, 235482. [Google Scholar] [CrossRef]
  42. Reshef, D.N.; Reshef, Y.A.; Finucane, H.K.; Grossman, S.R.; McVean, G.; Turnbaugh, P.J.; Lander, E.S.; Mitzenmacher, M.; Sabeti, P.C. Detecting Novel Associations in Large Data Sets. Science 2011, 334, 1518–1524. [Google Scholar] [CrossRef] [PubMed]
  43. Li, G.Z.; Li, B.; Li, C.; Wang, S. State-of-health rapid estimation for lithium-ion battery based on an interpretable stacking ensemble model with short-term voltage profiles. Energy 2023, 263, 126064. [Google Scholar] [CrossRef]
  44. Yang, Y.L.; Chen, S.Y.; Chen, T.; Huang, L.S. State of Health Assessment of Lithium-ion Batteries Based on Deep Gaussian Process Regression Considering Heterogeneous Features. J. Energy Storage 2023, 61, 106797. [Google Scholar] [CrossRef]
  45. Hong, J.N.; Chen, Y.C.; Chai, Q.Q.; Lin, Q.B.; Wang, W. State-of-health estimation of lithium-ion batteries using a novel dual-stage attention mechanism based recurrent neural network. J. Energy Storage 2023, 72, 109297. [Google Scholar] [CrossRef]
  46. Liu, F.; Liu, Y.P.; Su, W.X.; Jiao, C.P.; Liu, Y. Online estimation of lithium-ion batteries state of health during discharge. Int. J. Energy Res. 2021, 45, 10112–10128. [Google Scholar] [CrossRef]
  47. Gholizadeh, M.; Yazdizadeh, A. Systematic mixed adaptive observer and EKF approach to estimate SOC and SOH of lithium-ion battery. IET Electr. Syst. Transp. 2020, 10, 135–143. [Google Scholar] [CrossRef]
  48. Yu, M.; Zhu, Y.H.; Gu, X.; Li, J.L.; Shang, Y.L. Co-estimation and definition for states of health and charge of lithium-ion batteries using expansion. Energy 2024, 308, 132872. [Google Scholar] [CrossRef]
  49. Li, D.H.; Liu, X.; Cheng, Z. The co-estimation of states for lithium-ion batteries based on segment data. J. Energy Storage 2023, 62, 106787. [Google Scholar] [CrossRef]
  50. Sun, Y.K.; Liu, F.F.; Qin, W.; Li, J.; Cheng, X.F.; Zeng, J.B. Dynamic internal resistance modeling and thermal characteristics of lithium-ion batteries for electric vehicles by considering state of health. J. Power Sources 2024, 612, 234806. [Google Scholar] [CrossRef]
  51. Hu, C.S.; Cheng, F.J.; Ma, L.; Li, B.H. State of Charge Estimation for Lithium-Ion Batteries Based on TCN-LSTM Neural Networks. J. Electrochem. Soc. 2022, 169, 030544. [Google Scholar] [CrossRef]
  52. Li, H.; Fu, L.J.; Long, X.L.; Liu, L.; Zeng, Z.Q. A hybrid deep learning model for lithium-ion batteries state of charge estimation based on quantile regression and attention. Energy 2024, 294, 130834. [Google Scholar] [CrossRef]
  53. Wu, L.L.; Chen, C.; Li, Z.H.; Chen, Z.; Li, H. The Joint Estimation of SOC-SOH for Lithium-Ion Batteries Based on BiLSTM-SA. Electronics 2025, 14, 97. [Google Scholar] [CrossRef]
Figure 1. Dataset. (a,b) Dataset I SOH and the voltages of Group I-Cell 1. (ce) Voltage and SOC curves of Dataset II at 25 °C.
Figure 1. Dataset. (a,b) Dataset I SOH and the voltages of Group I-Cell 1. (ce) Voltage and SOC curves of Dataset II at 25 °C.
Electronics 14 01290 g001
Figure 2. Evolution of SOC under different SOH conditions.
Figure 2. Evolution of SOC under different SOH conditions.
Electronics 14 01290 g002
Figure 3. Voltage segment feature extraction and correlation coefficients. (a) Voltage segment time integration: curve 1 is the 3.8–4.0 V integration curve, curve 2 is the integration curve for 4.0–4.2 V time, and curve 3 is the actual SOH. (b) Voltage ageing evolution of Cell 1 battery. (c) PCC between integration time and capacity for different voltage intervals.
Figure 3. Voltage segment feature extraction and correlation coefficients. (a) Voltage segment time integration: curve 1 is the 3.8–4.0 V integration curve, curve 2 is the integration curve for 4.0–4.2 V time, and curve 3 is the actual SOH. (b) Voltage ageing evolution of Cell 1 battery. (c) PCC between integration time and capacity for different voltage intervals.
Electronics 14 01290 g003
Figure 4. Feature extraction. (a) Cell 1 charging voltage curve. (b) d Q / d V filter: curve 1 is the initial d Q / d V curve, and curve 2 is the filtered d Q / d V curve. (c) d Q / d V aging evolution of the d Q / d V curves. (df) I n * V process.
Figure 4. Feature extraction. (a) Cell 1 charging voltage curve. (b) d Q / d V filter: curve 1 is the initial d Q / d V curve, and curve 2 is the filtered d Q / d V curve. (c) d Q / d V aging evolution of the d Q / d V curves. (df) I n * V process.
Electronics 14 01290 g004
Figure 5. Multiple correlation coefficient selection. (a) Extracted features curves. (b) MCA for 8 HIFs with SOH. (c) Importance ranking of 8 HIFs with SOH.
Figure 5. Multiple correlation coefficient selection. (a) Extracted features curves. (b) MCA for 8 HIFs with SOH. (c) Importance ranking of 8 HIFs with SOH.
Electronics 14 01290 g005
Figure 6. Improved Ivy optimization results: (a) F1, (b) F4, and (c) F9.
Figure 6. Improved Ivy optimization results: (a) F1, (b) F4, and (c) F9.
Electronics 14 01290 g006
Figure 7. Training and testing process.
Figure 7. Training and testing process.
Electronics 14 01290 g007
Figure 8. AR-based model ECM principle.
Figure 8. AR-based model ECM principle.
Electronics 14 01290 g008
Figure 9. Parameter identification results for Group I-Cell 1 under various cycle number conditions.
Figure 9. Parameter identification results for Group I-Cell 1 under various cycle number conditions.
Electronics 14 01290 g009
Figure 10. Joint SOH–SOC estimation framework.
Figure 10. Joint SOH–SOC estimation framework.
Electronics 14 01290 g010
Figure 11. Prediction results for different features. (a) Prediction results for different features. (b) Prediction error for different features. (c) Error box plot. (d) Error statistical graph.
Figure 11. Prediction results for different features. (a) Prediction results for different features. (b) Prediction error for different features. (c) Error box plot. (d) Error statistical graph.
Electronics 14 01290 g011
Figure 12. Prediction results for different kernel functions. (a) Prediction results for different kernel functions. (b) Prediction error for different kernel functions. (c) Error box plot. (d) Error statistical graph.
Figure 12. Prediction results for different kernel functions. (a) Prediction results for different kernel functions. (b) Prediction error for different kernel functions. (c) Error box plot. (d) Error statistical graph.
Electronics 14 01290 g012
Figure 13. Prediction results for different step sizes. Training and testing processes for different step sizes. (a) Different step training and testing process. (b) Prediction results for different step sizes. (c) Prediction errors for different step sizes. (d) Error box plot. (e) Error statistical graph.
Figure 13. Prediction results for different step sizes. Training and testing processes for different step sizes. (a) Different step training and testing process. (b) Prediction results for different step sizes. (c) Prediction errors for different step sizes. (d) Error box plot. (e) Error statistical graph.
Electronics 14 01290 g013
Figure 14. Prediction results for different methods. (a) Prediction results for different methods. (b) Prediction error for different methods. (c) Error box plot. (d) Error statistical graph.
Figure 14. Prediction results for different methods. (a) Prediction results for different methods. (b) Prediction error for different methods. (c) Error box plot. (d) Error statistical graph.
Electronics 14 01290 g014
Figure 15. Prediction results of different battery cells. (a) Group I-Cell 1 predicted results and errors. (b) Group II-Cell 5 predicted results and errors. (c) Group III-Cell 9 predicted results and errors. (d) Group IV-Cell 10 predicted results and errors.
Figure 15. Prediction results of different battery cells. (a) Group I-Cell 1 predicted results and errors. (b) Group II-Cell 5 predicted results and errors. (c) Group III-Cell 9 predicted results and errors. (d) Group IV-Cell 10 predicted results and errors.
Electronics 14 01290 g015
Figure 16. Predicted results of different SOC initial values under the SOH = 0.9 condition. (a) SOC initial error 40%. (b) SOC initial error 60%.
Figure 16. Predicted results of different SOC initial values under the SOH = 0.9 condition. (a) SOC initial error 40%. (b) SOC initial error 60%.
Electronics 14 01290 g016
Figure 17. Dataset I SOC estimated results. (a) Magnified partial view of (b). (b) Dataset I SOC estimated results. (c) Times of SOC estimation errors.
Figure 17. Dataset I SOC estimated results. (a) Magnified partial view of (b). (b) Dataset I SOC estimated results. (c) Times of SOC estimation errors.
Electronics 14 01290 g017
Figure 18. Results of SOC estimation for Dataset II. (ac) SOC estimation results of US06, UDDS, and LA92 under −20 °C conditions. (df) SOC estimation results of US06, UDDS, and LA92 under 25 °C conditions. (hg) SOC estimation results of US06, UDDS, and LA92 under 40 °C conditions.
Figure 18. Results of SOC estimation for Dataset II. (ac) SOC estimation results of US06, UDDS, and LA92 under −20 °C conditions. (df) SOC estimation results of US06, UDDS, and LA92 under 25 °C conditions. (hg) SOC estimation results of US06, UDDS, and LA92 under 40 °C conditions.
Electronics 14 01290 g018
Table 1. Extracted HIFs.
Table 1. Extracted HIFs.
LabelNameDetail
HIF1 S Voltage integral over time
HIF2 d v / d t Voltage rise time
HIF3 K Voltage slope
HIF4 b voltage intercept
HIF5 d Q / d V p e a k d Q / d V Peak
HIF6 d Q / d V p o s d Q / d V peak position
HIF7 T i n v Poststandardization time
HIF8 Δ I n v Standardized time lag
Table 2. Prediction results of capacity with different methods.
Table 2. Prediction results of capacity with different methods.
FunctionIvyIIVYE_WOAIGWOPSO
F1Computing time (s)0.2340.1530.290.5470.037
Fitness value00000.280
F4Computing time (s)0.2340.1530.2970.5590.040
Fitness value000.37901.556
F9Computing time (s)0.2320.150.3070.5710.55
Fitness value0090.893216.664111.613
Table 3. Statistical results for different characteristics.
Table 3. Statistical results for different characteristics.
Feature UseMAE (%)MSE (%)RMSE (%)R2
SV0.18840.07390.27190.998
d Q / d V 0.45850.29750.54550.994
InV0.30510.17820.42210.996
MCA0.14510.02510.15840.999
Table 4. Prediction results of capacity with different kernel functions.
Table 4. Prediction results of capacity with different kernel functions.
Kernel FunctionsMAE (%)MSE (%)RMSE (%)R2
Hybird Kernel 0.10970.01790.13040.999
RBF_kernel0.37500.24880.49980.995
Poly_kernel0.46670.30120.54890.994
Table 5. Prediction results of capacity with different look-ahead periods.
Table 5. Prediction results of capacity with different look-ahead periods.
Look-Ahead PeriodMAE (%)MSE (%)RMSE (%)R2
50.16060.00190.18070.999
100.19530.00230.22730.999
200.23780.00290.38790.996
300.26860.00340.59230.992
Table 6. Prediction results for capacity with different methods.
Table 6. Prediction results for capacity with different methods.
MethodMAE (%)MSE (%)RMSE (%)R2Ate (s)
Our proposal0.16060.00190.18070.99964.644
LSTM0.19530.00230.22730.999992.086
GPR0.23780.00290.38790.99664.159
LS-SVM0.26860.00340.59230.99263.416
Table 7. SOH estimation errors of the test cells.
Table 7. SOH estimation errors of the test cells.
Cell No.MAE (%)MSE (%)RMSE (%)R2
10.15500.03890.19740.999
50.36980.17600.44950.995
90.29570.12400.35220.998
100.22510.00580.24220.998
Table 8. Proposed model performance evaluation.
Table 8. Proposed model performance evaluation.
ModelTemperatureMAE (%)MSE (%)RMSE (%)R2
US06−200.51000.0030.57040.9986
250.53280.00450.66930.9995
400.40200.00260.50510.9997
UDDS−200.98270.01290.67880.9986
250.93540.01210.64850.9986
400.35900.00220.46970.9997
LA92−200.34260.00210.46190.9997
250.77800.00870.63020.999
400.60650.04800.69640.9995
Avg. 0.60540.01060.59220.9992
Table 9. Performance comparison with the latest study.
Table 9. Performance comparison with the latest study.
MethodsMAE (%)RMSE (%)
TCN-LSTM [51]0.70.81
QTGA [52]0.80.96
BiLSTM-SA [53]0.841.20
Proposed0.60540.5922
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wei, Z.; Sun, X.; Li, Y.; Liu, W.; Liu, C.; Lu, H. A Joint Estimation Method for the SOC and SOH of Lithium-Ion Batteries Based on AR-ECM and Data-Driven Model Fusion. Electronics 2025, 14, 1290. https://doi.org/10.3390/electronics14071290

AMA Style

Wei Z, Sun X, Li Y, Liu W, Liu C, Lu H. A Joint Estimation Method for the SOC and SOH of Lithium-Ion Batteries Based on AR-ECM and Data-Driven Model Fusion. Electronics. 2025; 14(7):1290. https://doi.org/10.3390/electronics14071290

Chicago/Turabian Style

Wei, Zhiyuan, Xiaowen Sun, Yiduo Li, Weiping Liu, Changying Liu, and Haiyan Lu. 2025. "A Joint Estimation Method for the SOC and SOH of Lithium-Ion Batteries Based on AR-ECM and Data-Driven Model Fusion" Electronics 14, no. 7: 1290. https://doi.org/10.3390/electronics14071290

APA Style

Wei, Z., Sun, X., Li, Y., Liu, W., Liu, C., & Lu, H. (2025). A Joint Estimation Method for the SOC and SOH of Lithium-Ion Batteries Based on AR-ECM and Data-Driven Model Fusion. Electronics, 14(7), 1290. https://doi.org/10.3390/electronics14071290

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop