1. Introduction
As energy demand and pollution rise, countries urgently need sustainable development. Electric vehicles (EVs) are key solutions, with lithium-ion batteries (LIBs) as crucial components [
1]. SOH and SOC are vital for safety, performance, and reliability. These parameters are hard to measure directly, making high-precision estimation methods essential for battery technology [
2].
The SOC, a key battery performance metric, represents the ratio of remaining power to total capacity [
3]. SOC estimation methods fall into three categories. Category 1: The Ah integration method calculates charge changes through current integration. While simple, it suffers from inaccuracies due to initial value errors, current measurement inaccuracies, and aging effects [
4,
5]. Consequently, it is often combined with model-based or data-driven approaches to enhance accuracy and robustness. Wang et al. [
6] proposed a method based on a combination of ampere-hour integration and an extended Kalman filter, which effectively reduces cumulative errors over long-term use. Chang et al. [
7] used a new approach of combining neural networks and ampere-hour integral compensation to improve the accuracy of charge-state prediction under various environmental and operating conditions. Category 2: Physical model-based methods use adaptive filtering and state-space expressions for SOC estimation, with SOC and polarization voltage as state variables derived from OCV tests. Zeng et al. [
8] developed an SOC estimation method based on a fractional-order model, which better describes the dynamic characteristics of the battery and improves the estimation accuracy. Li et al. [
9] proposed an SOC estimation method based on a Dual Cubature Kalman Filter, which effectively improves estimation accuracy by capturing the dynamic behavior of the battery on different time scales. While theoretically interpretable, these methods face limitations [
10,
11]. They rely on complex electrochemical parameters (e.g., OCV, polarization voltage, and internal resistance) requiring precise identification and increasing model complexity. Category 3: Data-driven methods. Data-driven methods use machine learning techniques to establish a nonlinear mapping relationship between battery operating parameters and the SOC [
12], and commonly used algorithms include convolutional neural networks (CNNs), recurrent neural networks (RNNs) [
13], gate recurrent units (GRUs) [
14], long short-term memory (LSTM) [
15], and support vector machines (SVMs) [
16]. Ma et al. [
17] proposed an SOC estimation method based on the combination of CNNs and UKFs, which took the output of a CNN as the input of a UKF and obtained high-precision SOC estimation through self-correction. El Fallah et al. [
18] developed an SOC estimation model based on a deep neural networks (DNNs), which can effectively capture the dynamic characteristics of batteries. Compared to physical models, this approach minimizes reliance on battery characteristic modeling and avoids complex parameter identification. By analyzing extensive battery operation data, it achieves high-precision SOC estimation. However, it demands high-quality, large-scale data, particularly covering diverse operating conditions, as data quality directly impacts model training and prediction accuracy [
19].
The SOH is used to measure the degree of battery aging, defined as the ratio of the currently available capacity to the rated capacity, and is an important metric for assessing battery performance and degradation [
20]. SOH estimation methods can be classified into model-based methods and data-driven methods. Model-based methods describe battery degradation laws by building mathematical models, such as the equivalent circuit model (ECM), electrochemical model, and fractional order model [
21,
22]. Li et al. [
23] proposed an SOH estimation method based on an improved equivalent circuit model (ECM), which reflects the degradation of battery capacity by identifying changes in model parameters. Chen et al. [
24] developed an electrochemical aging model that considers the effect of temperature, improving the accuracy of SOH estimation in different temperature environments. Although these methods have high theoretical accuracy, they rely on many experimental tests and calibrations, and the models are complex and computationally expensive, making it difficult to meet the demands of real-time applications; additionally, aging models are more sensitive to parameters and environmental changes. Data-driven methods use machine learning techniques to capture the nonlinear characteristics of battery degradation and model the relationships between health indicators (e.g., incremental capacity, differential voltage, and partial energy characteristics) and SOH, and common algorithms include RNNs [
25], LSTM [
26], and Gaussian process regression (GPR), among others. Peng et al. [
27] developed an SOH estimation model based on IGWO–LSTM, which combined with Dropout regularization to suppress model overfitting and improve generalization ability. He et al. [
28] applied GPR to SOH estimation, quantifying the uncertainty of predictions through a probabilistic model and enhancing the reliability of decision-making. Data-driven methods do not require tedious parameter identification, but they have high requirements for data quality and quantity and high computational overheads.
In addition, since single estimation methods tend to ignore the strong coupling between SOC and SOH, methods for jointly estimating the two have gained much attention. Battery degradation has a significant effect on the accuracy of SOC estimation, while inaccurate SOC estimation results may in turn interfere with SOH calibration. To address this issue, researchers have developed machine learning, multi-timescale, and dual-filtering methods for the joint estimation of SOC and SOH. Zeng et al. [
29] proposed two RBF-ARX models to capture the nonlinear dynamics of batteries and establish the association between SOC, SOH, and observed values. The initial state is then sampled and inferred using the MCMC method and finally combined with UKF for joint estimation. The experimental validation uses multiple data sets to demonstrate the effectiveness of the method. Yang et al. [
30] proposed a complementary cooperative algorithm based on the combination of double Kalman filtering (DEKF) and pattern recognition, which achieves high-precision joint SOC–SOH estimation. Wei et al. [
31] proposed an algorithm combining an adaptive central difference Kalman filter and discrete-time sliding mode observer (ACDKF-DSMO) for improving the robustness and accuracy of SOC and SOH estimation of lithium-ion batteries. In addition, Li et al. [
32] showed that fractional-order models for joint SOC–SOH estimation have higher accuracy than traditional integer-order equivalent circuit-based models and can better describe the nonlinear characteristics of batteries.
Different neural networks have different hyperparameters. Selecting an appropriate optimization algorithm to automatically search for the best parameter combination solves the inefficiency and subjectivity of manual parameter tuning and can significantly improve model performance. Ge et al. [
33] constructed an improved IBA-ELM model for joint SOC–SOH estimation, and improved the model performance through an optimization algorithm. Wang et al. [
34] proposed an improved firefly algorithm (IFA), improved the prediction performance of the GPR model from the perspective of the internal prediction process, and applied it to the joint estimation of SOH and SOC. Ghasemi et al. [
35] first proposed the theoretical framework of the basic Ivy algorithm, which can efficiently search for the optimal solution in the solution space by simulating the characteristics of ivy growing towards favorable resources. Zhang et al. [
36] improved the basic Ivy algorithm by introducing an adaptive perturbation factor and adaptive growth rate, significantly improving the convergence speed and accuracy of the algorithm when dealing with high-dimensional nonlinear problems. To address the deficiencies of existing SOC–SOH joint estimation methods and optimization algorithm research, this paper proposes a novel hybrid framework that combines an autoregressive equivalent circuit model (AR-ECM) and a data-driven method to achieve high-precision joint estimation of SOC and SOH. The contributions of this research are briefly described as follows:
We propose a novel hybrid framework integrating AR-ECM with data-driven models for SOC–SOH joint estimation. The framework adaptively incorporates SOH into SOC calculations, enabling precise estimation under battery aging conditions.
A systematic feature selection methodology is developed combining voltage segmentation and multiple correlation analysis (MCA). This approach integrates key voltage characteristics (SV, dQ/dV, and InV) to enhance SOH prediction accuracy.
We present an improved Ivy algorithm (IVYA) with chaotic mapping and tangent flight operators for optimizing HKELM hyperparameters, significantly improving convergence speed and estimation precision.
Comprehensive experiments across multiple datasets demonstrate the framework’s effectiveness, achieving SOH errors below 1% and SOC errors under 2% under various operating conditions and temperatures.
The organization of the paper is as follows:
Section 2 describes the dataset and the data processing process in detail. The SOC and SOH co-estimation method is described in
Section 3.
Section 4 shows that the predictions were accurate and that the proposed approach worked. The conclusions are presented in
Section 5.