Battery Life Prediction for Ensuring Robust Operation of IoT Devices in Remote Metering

Yong, Taein; Lee, Chaebong; Kim, Seongseop; Kim, Jaeho

doi:10.3390/app15062968

Open AccessArticle

Battery Life Prediction for Ensuring Robust Operation of IoT Devices in Remote Metering

¹

Information and Communications Engineering Department, Sejong University, Gwangjin-gu, Seoul 05006, Republic of Korea

²

Vitzro Cell Co., Ltd., Dangjin-si 31816, Chungcheongnam-do, Republic of Korea

³

Korea Electronics Technology Institute, Seongnam-si 13509, Gyeonggi-do, Republic of Korea

⁴

Department of Electronic and Information Engineering, Sejong University, Gwangjin-gu, Seoul 05006, Republic of Korea

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(6), 2968; https://doi.org/10.3390/app15062968

Submission received: 20 January 2025 / Revised: 23 February 2025 / Accepted: 27 February 2025 / Published: 10 March 2025

(This article belongs to the Special Issue Recent Advances in Internet of Things and System Design)

Download

Browse Figures

Versions Notes

Abstract

:

Primary batteries are extensively employed as power sources in Internet of Things (IoT) devices for remote metering. However, primary batteries maintain a relatively consistent discharge voltage curve over a long period before experiencing a full discharge, making it challenging to predict the battery’s life. In this study, we introduce a battery life prediction method to ensure the robust operation of IoT devices in remote metering applications. The robust battery life prediction process is divided into two stages. The first stage involves predicting the state of charge (SOC) to enable real-time remote monitoring of the battery status of metering devices. In the second stage, IoT devices implement a hardware-based alerting mechanism to provide warnings prior to complete discharge, leveraging a custom-designed Multi-Stage Discharge battery architecture. In the first stage, we developed the CNN-Series Decomposition Transformer (C-SDFormer) model, which is capable of accurately predicting the SOC of primary batteries. This model was specifically designed to support the real-time monitoring of battery status in large-scale IoT deployments, enabling proactive maintenance and enhancing system reliability. To validate the performance of the C-SDFormer model, data were collected from smart remote meters installed in households. The model was trained using the collected data and evaluated through a series of experiments. The performance of the C-SDFormer model was compared with existing methods for SOC prediction. The results indicate that the C-SDFormer model outperformed the traditional methods. Specifically, the SOC prediction achieved a mean absolute error (MAE) of less than 4.1%, a root mean square error (RMSE) of less than 5.2%, a symmetric mean absolute percentage error (SMAPE) of less than 7.0%, and a coefficient of determination (R²) exceeding 0.96. These results demonstrate the effectiveness of the C-SDFormer model in accurately predicting the SOC of primary batteries. For the second stage, a Multi-Stage Discharge (MSD) primary battery was developed to ensure a hardware-based low battery alert before the battery is fully discharged. This battery was designed to ensure the reliable operation of IoT devices, especially those whose batteries are not proactively managed through real-time monitoring in the first stage. By providing a low battery alert, the MSD battery reduces the risk of unexpected device shutdowns. This feature enhances the overall reliability of IoT devices, ensuring their continuous operation in remote metering applications.

Keywords:

machine learning; state of charge (SOC); CNN-Series Decomposition Transformer (C-SDFormer); Multi-Stage Discharge (MSD) primary battery

1. Introduction

Internet of Things (IoT) devices in remote metering applications are typically installed in locations where continuous power supply is challenging, or where human access is limited [1,2]. These devices require a stable and long-lasting power source to operate reliably for several years, and potentially up to a decade [3,4]. Due to these requirements, primary batteries are predominantly used, as they provide the necessary longevity and stability without the need for frequent replacement or maintenance, making them ideal for applications where regular access is restricted [5,6].

However, due to the flat discharge curve characteristics of primary batteries [7], it is difficult to predict the battery life, and this often leads to unexpected fully discharge. Such unexpected power loss can have a significant impact on remote metering devices and cause property damage [8,9,10]. To mitigate this risk, backup batteries [11] are currently used, or batteries are replaced in advance. However, these approaches are expensive and lead to a significant waste of resources, making them unsustainable for wide-ranging or long-term applications.

In this paper, we propose a battery life prediction method to ensure the robust operation of IoT devices in remote metering applications, as shown in Figure 1. The robust battery life prediction process is divided into two stages. As the first stage, we focuses on predicting the state of charge (SOC), facilitating real-time remote monitoring of battery conditions in metering devices. The SOC is estimated using temperature, voltage and communication quantity data. This SOC inference is performed on a server, where the meter device periodically transmits its measured parameters. The high cost of SOC measurement equipment poses a significant challenge in practical IoT devices deployed for remote metering applications. Therefore, we employed alternative parameters for real-time prediction instead of utilizing SOC directly, enabling efficient and cost-effective monitoring. In the second stage, a hardware-based alert system is integrated into IoT devices to issue warnings before complete discharge, leveraging a custom-designed Multi-Stage Discharge battery architecture.

In the first stage, we developed the CNN-Series Decomposition Transformer (C-SDFormer) model, which is capable of accurately predicting the SOC of primary batteries. As mentioned earlier, three inputs are employed for SOC prediction. Among these, instances of communication induce transient voltage fluctuations, which represent significant factors for SOC prediction due to the inherent characteristics of IoT devices in remote metering applications. We incorporated a transformer layer into the model architecture to effectively capture and analyze voltage variations, leveraging its attention mechanism to focus on voltage fluctuation data. In addition, we combined a convolutional neural network (CNN) with series decomposition techniques to enhance the model’s ability to handle complex patterns of battery discharge behavior. This led to the development of the C-SDFormer model, which enables real-time discharge capacity prediction, providing a more accurate and robust approach to battery health monitoring in remote metering applications.

Environmental factors, including the installation location of IoT devices and other external influences, degrade the accuracy of SOC prediction during the first stage [12]. To address these challenges, a Multi-Stage Discharge (MSD) primary battery was developed for the second stage to ensure a hardware-based low battery alert before the battery is fully discharged. Specifically, the MSD primary battery was developed by mixing two electrolytes, which results in two distinct voltage drops during discharge. A low battery alert is triggered within the voltage range corresponding to the first of these voltage drops. This design also enhances the overall robustness against missing or noisy data in the first stage. Even if the C-SDFormer’s real-time SOC estimation is adversely affected by such uncertainties, the voltage drop in the MSD battery still provides a clear hardware-based signal to indicate a low battery state. As a result, the system can reliably detect when the battery is nearing depletion, regardless of model inaccuracies or data gaps. This battery was specifically designed to guarantee the reliable operation of IoT devices, particularly where batteries are not actively monitored in real time during the first stage. By issuing a low battery alert, the MSD battery mitigates the risk of unexpected device shutdowns. Hence, even under noise or missing data conditions, the second stage of the MSD battery architecture ensures continued device reliability and uninterrupted operation in remote metering applications.

To evaluate the effectiveness of the proposed C-SDFormer model, we collected data from smart remote meters deployed in households. The model was subsequently trained on this dataset and its performance was validated through a series of experimental analyses. These experiments were designed to demonstrate the model’s capability in accurately predicting SOC and improving battery life management for IoT devices in remote metering. The C-SDFormer model utilizes temperature, voltage, and number of communications as input features for SOC prediction. Furthermore, the efficiency of the C-SDFormer architecture was verified through ablation studies, where the contributions of individual layers to its performance were systematically analyzed. The model also exhibited superior accuracy and robustness compared to existing methods in SOC prediction. Furthermore, the low-battery alert enabled by the MSD primary battery enhanced the robustness of IoT devices in remote metering and ensured their continuous operation.

The rest of this paper is organized as follows. Section 2 introduces the related works on SOC estimation based on deep learning. Section 3 explains the details of the remote metering life prediction method. Section 4 describes the experimental setup. Section 5 demonstrates the SOC estimation and prediction performance in comparison with existing methods and provides a detailed discussion of the experimental results. Finally, Section 6 concludes this paper.

2. Related Work

There has been limited machine learning research focused on estimating the remaining capacity of primary batteries using voltage. This is due to the fact that, as previously discussed, conventional primary batteries exhibit notably flat discharge curves with just a single voltage drop.

Conversely, machine learning–based SOC prediction for secondary batteries has seen active development [13,14,15,16,17,18,19,20,21,22]. Early research employed a deep neural network (DNN) [13] as the foundational model, predicting SOC by considering three main attributes: instantaneous voltage, instantaneous current, and instantaneous temperature. In another study, a long short-term memory (LSTM) [14] architecture was used to examine various features for their correlation with SOC. Parameters such as voltage, current, and environmental factors (temperature, humidity, and wind speed) were compared against SOC. Given that voltage exhibited the strongest correlation, predictions were ultimately based solely on voltage. A convolutional neural network (CNN) [15], commonly applied in computer vision, was then introduced by applying temporal convolutions to relaxation voltage data collected over a 10-s window. Subsequently, a CNN-LSTM-DNN framework [16] was developed, combining the strengths of a DNN, LSTM, and CNN into a single model. In this approach, the CNN extracted temporal features, LSTM handled time series information, and DNN enhanced the estimation accuracy. As a result, the CNN–LSTM–DNN outperformed the basic LSTM, CNN, and CNN-LSTM models in SOC estimation for secondary batteries.

Additionally, some researchers adopted a bidirectional gated recurrent unit (Bi-GRU) [17] to reduce computational overheads while also leveraging bidirectional data analysis. This model processed voltage, current, and temperature inputs. To address gradient loss, the Nesterov Accelerated Gradient (NAG) algorithm was used for estimating the remaining capacity. Building on the Bi-GRU concept, the CNN-bidirectional weighted gated recurrent unit (CNN-BWGRU) [18] was subsequently proposed, integrating a CNN layer to extract input features. It too utilized voltage, current, and temperature data, passing these features through the Bi-GRU for SOC prediction.

Various Transformer-based models [19,20,21,22] have been employed for SoC estimation. The Transformer architecture is renowned for its ability to process more comprehensive information than traditional neural networks. The Transformer with an immersion and invariance adaptive observer [19] utilizes two separate encoders to simultaneously encode information on current temperature and voltage temperature. The outputs are then concatenated and used as input for the decoder to estimate the SoC. Next, TTSNet [20] proposes a sequence-to-sequence network that employs a temporal Transformer, using voltage, current, and temperature as inputs to estimate the SoC. In addition, another Transformer-based online battery life estimation model [21] improves the estimation accuracy by incorporating vehicle speed as an additional input feature. Finally, SGEformer [22] predicts the battery lifespan by leveraging a variety of input data, including the number of charge cycles, voltage, current, and load, combined with a lookback window technique.

Nevertheless, it is essential to note that these studies encompass data from both the charging and discharging processes, rendering them unsuitable for primary batteries, where no charging phase exists. Moreover, the discharge profiles of primary and secondary batteries differ significantly, necessitating distinct models. Consequently, attempting to estimate the SOC of primary batteries using a model trained exclusively on secondary battery data presents formidable challenges. To address these challenges in SOC estimation, we have introduced the MSD primary battery, incorporating machine learning techniques, particularly the C-SDFormer model, for SOC estimation tailored to primary batteries.

3. Remote Metering Life Prediction Method

3.1. Multi-Stage Discharge Primary Battery

As previously mentioned in the introduction, we focus on real-time SOC prediction and on triggering a low battery alert before complete discharge. To achieve this, we developed a Multi-Stage Discharge (MSD) primary battery, as illustrated in Figure 2, which exhibits two distinct voltage drop stages. In Figure 2, the point at which the voltage drop initiates is denoted as the primary point. The primary point represents the final moment at which real-time estimation can be performed. For enhanced life prediction reliability and to allow sufficient time for maintenance, we recommend battery replacement when the battery reaches the primary point rather than waiting for the final drop point. This innovative design incorporates two separate voltage drops rather than the single voltage drop characteristic of conventional primary batteries. The MSD primary battery is produced by blending two electrolytes,

S O_{2} C l_{2}

and

S O C l_{2}

, resulting in two discharge stages governed by the following chemical reactions (see Equations (1) and (2)):

\begin{matrix} F i r s t s t a g e : 4 L i + 2 S O_{2} C l_{2} & \to 4 L i C l + 2 S O_{2}, \end{matrix}

(1)

\begin{matrix} S e c o n d s t a g e : 4 L i + 2 S O C l_{2} & \to 4 L i C l + S O_{2} + S . \end{matrix}

(2)

The operation of the MSD primary battery commences with the reaction of

L i

with

S O_{2} C l_{2}

, which leads to a minor voltage drop to a point termed the primary point. This point can be moved to the 10% and 5% points of

S O C_{P r i m}

, depending on the ratio of the two electrolytes, allowing the MSD primary battery to be used to its full capacity. Subsequently,

L i

reacts with

S O C l_{2}

, inducing a further reduction in voltage to a lower level, and eventually the battery discharges.

In our implementation, the MSD primary battery demonstrates a total capacity of approximately

17.40 Ah

. During the first discharge stage, where the average operating voltage typically stabilizes at around

3.78 V

, the battery constitutes about

14.82 Ah

of its total capacity. We also observed that this first-stage voltage may vary with temperature: at 60 °C, the battery maintains around

3.60 V

, whereas at −20 °C, it drops to about

3.30 V

, and at ambient temperature (20 °C), it averages approximately

3.75 V

. After passing the primary point, the voltage transitions to the second stage, averaging around

3.47 V

. Considering that the remote metering device consumes approximately

11.8 mAh

per day, the battery sustains operation for about four years before reaching complete discharge.

Figure 2. Discharge curve of the MSD primary battery. Batteries produced through electrolyte mixing exhibit a primary point at 15% of

S O C_{P r i m}

.

D O D_{P r i m}

represents the depth of discharge at the primary point, while

S O C_{P r i m}

denotes the state of charge at the primary point.

Figure 2. Discharge curve of the MSD primary battery. Batteries produced through electrolyte mixing exhibit a primary point at 15% of

S O C_{P r i m}

.

D O D_{P r i m}

represents the depth of discharge at the primary point, while

S O C_{P r i m}

denotes the state of charge at the primary point.

3.2. Data Collection and Preprocessing

To analyze the discharge characteristics of MSD primary batteries in a real-world setting, we implemented the printed circuit board (PCB) embedded within the remote gas meter for collecting data (refer to Figure 3a). Figure 3b shows the MSD Primary battery installed in the remote gas meter. Figure 3c shows that the rear part of the board is equipped with a PMIC for overseeing voltage and current, a Cortex-M0 microcontroller’s temperature sensor to measure temperature, and an SX1276 Long Range Radio (LoRa) module for wireless data transmission. We deployed 50 remote gas meters within households (as illustrated in Figure 3d) and harnessed a power management integrated circuit (PMIC) embedded in each remote gas meter to capture the data. The collected data include timestamps, discharge capacity, temperature, voltage, and the communication count, as outlined in Table 1. We utilized temperature, voltage, and communication count as input data for the C-SDFormer model, while discharge capacity was employed as the ground truth for prediction. This approach was adopted because, in real-world scenarios, measuring discharge capacity requires expensive equipment, making it impractical for widespread applications such as smart metering. The data were recorded at 10 min intervals and subsequently transmitted to the server for storage at hourly intervals.

For clarity, Table 2 presents a sample excerpt of the collected data over a one-hour period, showing representative values for timestamps, voltage, temperature, and communication count. This sample illustrates the typical range of input values observed during operation and highlights how communication events affect voltage changes. Additionally, Table 3 presents summary statistics (mean, standard deviation (Std), minimum (Min), and maximum (Max)) for the measured input variables, providing an overview of the range of operating conditions observed in our dataset. These statistics illustrate the diversity in temperature, voltage, and communication count values, which helps the model capture variations in operating conditions and improve the accuracy of discharge capacity predictions.

Raw data acquired from remote gas meters frequently contain missing values and noise, which may hinder the performance of machine learning models. To address these challenges and improve model performance, we implemented a four-step data preprocessing procedure including missing value interpolation, noise reduction, primary point identification, and normalization. First, missing values in voltage, temperature, and communication count were addressed using Algorithm 1, while discharge capacity was handled through linear interpolation. To ensure the reliability of this process, missing values were analyzed across multiple test sets, and different interpolation techniques were compared to minimize bias in data imputation. Second, convolution-based smoothing [23] was employed to reduce noise in the discharge capacity and voltage data. This technique was selected based on empirical analysis comparing various denoising methods, ensuring that essential signal patterns were preserved while removing irrelevant fluctuations. Third, the primary point for triggering alerts was mathematically determined based on the first voltage drop. Lastly, min–max normalization was applied to enhance model performance.

During the data collection process, missing values in discharge capacity, temperature, voltage, and communication count can occur due to communication interruptions between the smart gas meter and the server. Algorithm 1 presents the pseudocode for imputing missing voltage values through interpolation, maintaining the voltage variation patterns attributed to LoRa communication, as observed in the measured data. Here, T represents a time series with N empty values,

T^{*}

is the imputed version of T, X signifies the previous data point for T, and Y stands for the subsequent data point for T. Line 3 assigns X to T, and lines 4 and 5 calculate the average of X and Y, respectively. Line 6 finds the bias, and lines 7 and 8 update T by subtracting the bias from each element of T. Line 9 allocates the updated T to

T^{*}

. The same procedure is applied to the temperature and the communication count for the same reasons. Missing values for discharge capacity were imputed through linear interpolation. Figure 4 illustrates the change in voltage data after implementing Algorithm 1.

Algorithm 1 Voltage Interpolation

Input: time series

T = [t_{m + 1}, t_{m + 2}, \dots, t_{m + n}]

, with empty value of length N

Output: filled time series

T^{*}

1:: Let $X = [t_{m - n + 1}, t_{m - n}, \dots, t_{m}]$ , Length N
2:: Let $Y = [t_{m + n + 1}, t_{m + n + 2}, \dots, t_{m + 2 n}]$ , Length N
3:: $T \leftarrow X$
4:: $x_{a v e r a g e} = \frac{1}{n} \sum_{i = m - n + 1}^{m} t_{i}$
5:: $y_{a v e r a g e} = \frac{1}{n} \sum_{i = m + n + 1}^{m + 2 n} t_{i}$
6:: $B i a s = \frac{x_{a v e r a g e} - y_{a v e r a g e}}{2}$
7:: for each t in T do
8:: Update t as $t \leftarrow t - B i a s$
9:: $T^{*} \leftarrow T$

During data collection, noise can significantly affect the training process. To address this, we applied a convolution-based smoothing technique commonly used in signal processing, as defined in Equation (3).

\begin{matrix} h (t) = f (t) * g (t) = \sum_{k = 0}^{N} f (k) g (t - k), \end{matrix}

(3)

where

g (t)

is the input filter, N is the length of the data collected for

f (t)

, t is the time parameter, and

h (t)

is the denoised data obtained by the convolution of

g (t)

and

f (t)

. Figure 5 shows a clear and noticeable reduction in noise after the data passes through the filter.

The process of determining the primary point utilized the second derivative of the daily voltage data, as shown in Figure 6a. These second derivative values capture significant changes in the voltage slope between consecutive days. A negative second derivative indicates a decreasing voltage slope, marking the onset of a voltage drop, whereas a positive second derivative reflects an increasing voltage slope, signifying the end of the voltage drop. The primary point was identified as the point with the smallest second derivative value (as defined by Equation (4)).

\begin{matrix} P r i m a r y p o i n t = M i n i m u m (\frac{d}{d^{2} t} V (t)), \end{matrix}

(4)

where

V (t)

represents the voltage discharge curve over time. Figure 6b visually illustrates the placement of the primary point on the discharge curve, while Figure 6c showcases the extracted data for learning, using the primary point as a reference.

For the final step, we applied a machine learning technique, normalization. This method aims to standardize the scales of input features, ensuring equitable significance during model training. To address the scale disparities within our collected data, encompassing discharge capacity, temperature, voltage, and the communication count, we opted for min–max normalization. This process scaled the original data to fall within the range of 0 to 1, as per Equation (5).

\begin{matrix} Y (t) = \frac{y (t) - y_{m i n}}{y_{m a x} - y_{m i n}}, \end{matrix}

(5)

In Equation (5), where t represents time,

y (t)

signifies the data before normalization,

y_{m i n}

denotes the minimum value of

y (t)

,

y_{m a x}

represents the maximum value of

y (t)

, and

Y (t)

is the normalized data.

3.3. C-SDFormer Model for Predicting the Discharge Capacity of MSD Primary Battery

Our objective is to predict the discharge capacity in real-time using deep learning techniques, with input parameters including voltage, temperature, and communication count. To achieve this, we employed a transformer-based model, which represents an evolution of the seq2seq model [24], featuring an encoder–decoder architecture. Diverging from traditional recurrent architectures [25,26,27], the transformer model relies on a neural network with an attention mechanism to analyze the relationships between outputs based on the voltage drop characteristics occurring during communication. While transformer models are predominantly used in natural language processing [28], they also find utility in analyzing time series data [29,30,31,32].

To predict the discharge capacity during the first discharge stage, we introduce the C-SDFormer model, which builds upon the encoder architecture of the Transformer [33]. Unlike conventional Transformer models, C-SDFormer is specifically designed to handle the unique challenges posed by transient voltage fluctuations in remote metering environments. Traditional models struggle to effectively capture these variations, particularly those induced by communication events. To address this, C-SDFormer integrates mechanisms to differentiate between short-term fluctuations and long-term discharge trends, making it more suitable for battery discharge prediction in practical applications.

Prior research has utilized traditional convolutional neural networks (CNNs [15]) for local feature extraction, as well as long short-term memory networks (LSTMs [14]) and gated recurrent units (GRUs [17]) for time series forecasting. However, these models exhibit limitations in capturing long-range dependencies and tend to suffer from vanishing gradients when processing extended sequences. Transformer-based models have emerged as a powerful alternative due to their capability to model long-term dependencies through self-attention mechanisms. Informer [29], Autoformer [30], and Fedformer [31] are notable Transformer-based architectures designed for time series forecasting. While these models improve efficiency and accuracy compared to RNN-based approaches, they are not explicitly optimized for battery discharge prediction in remote metering environments.

C-SDFormer distinguishes itself from existing Transformer-based time series models through its series decomposition module, which separates trend and seasonal components to better capture transient voltage variations caused by communication activities. Additionally, the CNN-based feature extraction module enhances the representation of voltage changes by identifying spatial patterns in time-series data, making it particularly effective in detecting sudden voltage drops induced by communication events. Furthermore, the self-attention mechanism in the Transformer encoder dynamically weighs the importance of different time steps and refines the extracted features from the series decomposition and CNN modules. This allows the model to focus on the most relevant temporal dependencies and spatial patterns, enhancing its ability to capture both short-term fluctuations and long-term battery discharge trends. This hybrid approach ensures that both short-term and long-term voltage dynamics are modelled accurately, leading to improved battery discharge prediction.

The architecture of C-SDFormer consists of several key components, including a CNN-based input embedding layer, a positional encoding layer, an encoder layer, a series decomposition, and a linear output layer. Figure 7 illustrates the detailed structure of C-SDFormer. Initially, the voltage, communication count, and temperature are obtained through a sliding window and stacked together. They then pass through the input embedding layer, which consists of a CNN and facilitates the feature extraction process within the encoder layer, mapping them to a 128-dimensional vector. The C-SDFormer model lacks a recursive architecture, necessitating the inclusion of position tokens to indicate the position of each element in the sequence. This aids the model in comprehending the sequence’s item order and retaining this order. Various techniques exist for incorporating positional information into sequences, but in this study, we employ sine and cosine functions, as defined in Equations (6) and (7).

\begin{matrix} P E_{(p o s, 2 i)} & = s i n (\frac{p o s}{{10, 000}^{(2 i / d_{m o d e l})}}), \end{matrix}

(6)

\begin{matrix} P E_{(p o s, 2 i + 1)} & = c o s (\frac{p o s}{{10, 000}^{(2 i / d_{m o d e l})}}), \end{matrix}

(7)

where

p o s

is the position of the embedding vector in the sequence and i is the index for each dimension of the embedding vector. If the index is even, we use the sine function. If the index is odd, we use the cosine function.

Subsequently, the input, now encompassing positional information, proceeds to the encoder block. This encoder block comprises multiple layers, including a multi-head self-attention layer, a series decomposition layer, a feed-forward network (FFN), residual connections (Add), and layer normalization (LayerNorm). Residual connections serve to mitigate the risk of gradient vanishing, while layer normalization aids in convergence learning and enhances the model’s resilience to data variations. The self-attention mechanism, a potent tool for learning dependencies within sequence components, enables the model to focus on the most pertinent information in the sequence. It operates by generating three matrices: Query (Q), Key (K), and Value (V) for the input sequence. These matrices are computed using distinct weight matrices,

W^{q}

,

W^{k}

, and

W^{v}

, learned during training. The Q and K matrices are multiplied and scaled by

\sqrt{d_{k}}

, and then converted into a probability distribution using the softmax function. Subsequently, the V matrix is multiplied by this probability distribution to yield the attention value, which provides the model with more detailed insights into the elements deserving of focus. The calculation of the attention value can be summarized by Equation (8).

\begin{matrix} A t t e n t i o n (Q, K, V) = s o f t m a x (\frac{Q K^{T}}{\sqrt{d_{k}}}) V, \end{matrix}

(8)

where

d_{k}

is the dimension of the K vector. Our model also incorporates multi-head self-attention, an enhancement of the self-attention mechanism. Multi-head self-attention simultaneously applies multiple self-attention mechanisms to model the relationships among input data from various perspectives and extracts comprehensive information. This approach empowers the model to distinguish between distinct features within the input sequence, thereby enhancing its learning capacity. After passing through Add&LayerNorm, the model employs a series decomposition layer to segregate the data into seasonal and trend components. This separation streamlines the identification of seasonal and trend information in isolation, with only the seasonal features undergoing processing through the FFN. This step allows for more detailed information extraction and pattern learning from the seasonal features.

Ultimately, following transit through the series of N encoder blocks, the discharge capacity is predicted utilizing M convolution layers and a dense layer. For specific model hyperparameters, please refer to Section 4 for detailed information.

3.4. SOC and Remaining Time Calculation Using the Predicted Discharge Capacity

The SOC indicates the remaining battery capacity relative to the rated capacity. The SOC value ranges from 0% to 100%, with 100% representing a fully charged battery and 0% representing a fully discharged battery. To estimate the SOC of the MSD primary battery, we use the discharge capacity predicted by the C-SDFormer. Equation (9) shows the

S O C_{t}

calculation process.

S O C_{t} = 100 % - \frac{C_{r e l e a s e d}}{C_{r a t e d}} \times 100 %,

(9)

where

S O C_{t}

is the SOC at time t,

C_{r e l e a s e d}

is the discharge capacity, and

C_{r a t e d}

is the rated capacity as specified by the manufacturer.

In addition, to enhance the practical relevance of our approach, we converted the predicted SOC into an estimated remaining time until complete battery discharge. Measuring current in real-world settings is challenging. Therefore, we defined

I_{a v g}

as the daily average current consumption derived from the predicted SOC based on the device’s power consumption profile data. This profile was estimated in real time because it varies with each device’s operating conditions, such as the number of retransmissions determined by the communication environment and the ambient temperature. The estimated remaining time,

T_{r e m a i n i n g}

, is then calculated using:

T_{r e m a i n i n g} = \frac{C_{r a t e d} - C_{r e l e a s e d}}{I_{a v g}},

(10)

where

T_{r e m a i n i n g}

represents the estimated remaining operational time until full discharge, measured in days, and

I_{a v g}

is the daily average of

C_{r e l e a s e d}

. This transformation enables the model to provide users with a practical, time-based prediction of battery life, thereby further improving the applicability of our battery management strategy.

4. Experimental Setup

In this section, we delve into a comprehensive exploration of the experimental components crucial to our study. We provide detailed insights into our experimental setup, encompassing hyperparameters, implementation specifics, and the evaluation metrics.

4.1. Hyperparameters of C-SDFormer

The hyperparameters of the C-SDFormer model are comprehensively outlined in Table 4. Here, the encoder block, input dimension of the encoder, number of attention heads, and feed-forward network dimension were set based on the optimal parameter values obtained from grid search. In the input embedding stage, we adopted an optimal CNN structure, determined through rigorous experimentation. This choice allows us to more effectively extract local patterns for voltage, temperature, and the communication count than a linear layer. We incorporated zero padding and the tanh activation function, notably omitting a pooling layer to preserve the temporal order of the time series data. Additionally, we employed Batch Normalization (BatchNorm) 1D to maintain the model’s training stability.

The encoder stage integrated four self-attention heads and eight encoder blocks to adeptly learn features and patterns. The feed-forward stage enhanced the model’s capability to represent various types of information through the inclusion of two linear layers, one dropout layer, and the GELU activation function. The output stage utilized two CNN layers and one linear layer to yield the final results. Only the values corresponding to the first vector in the output of the second CNN layer are used as input to the subsequent linear layer.

4.2. Implementation Details

For the prediction of discharge capacity using the C-SDFormer model, we established a standardized dataset for network training, represented as

D = {(X (1), C (1)), (X (2), C (2)), \dots, (X (τ), C (τ))}

, where

C (t)

denotes the discharge capacity value, and

X (t)

signifies the input vector or scalar at time step t. The input

X (t)

is

X_{V N T} (t) = [V (t), N (t), T (t)]

, where

V (t)

,

N (t)

, and

T (t)

correspond to voltage, the communication count, and temperature at time step t. To enable the model to grasp the temporal relationships among the input variables, we employed a sliding window algorithm with a window size of 24 h on the input data, with the subsequent 10 min being predicted as the output.

Data that did not reach the primary point, contained numerous missing values, or exhibited excessive noise were not used in the discharge profile data. Moreover, we excluded three test sets from the training process for model evaluation. The remaining data were divided into a training set (80%) and a validation set (20%). We implemented early stopping to combat overfitting, terminating training when the validation loss exhibited no improvement for 10 epochs, with a maximum limit of 100 epochs. The model was trained using the Adam optimizer and the mean absolute error (MAE) loss function, employing a batch size of 64 and a learning rate of 0.0001.

The C-SDFormer model was implemented in Python (version 3.8.11) using the PyTorch deep learning framework (version 2.2.1). Our experiments were conducted on a Linux-based system (Ubuntu 20.04) equipped with CUDA 12.1 and an NVIDIA GeForce RTX 3090.

4.3. Evaluation Metrics

To evaluate our model’s performance, we utilized four metrics: mean absolute error (MAE), root mean square error (RMSE), symmetric mean absolute percentage error (SMAPE), and the coefficient of determination (R²). The MAE, as the most intuitive metric, represents the average of the absolute differences between the predicted and actual values, making it effective for evaluating overall prediction accuracy. RMSE, the square root of the mean squared error, transforms the error metrics into the same units as the actual values and is particularly sensitive to larger errors, highlighting significant deviations. SMAPE evaluates the percentage difference between the predicted and actual values, considering their magnitude, and is less sensitive to outliers, making it suitable for assessing relative errors in balanced datasets. R² serves as a measure of the extent to which the dependent variable can be explained by the independent variable, serving as a key indicator of the model’s goodness of fit.

5. Experimental Results and Discussion

In this section, we conducted experiments on the SOC estimation capability up to the primary point and the prediction of the primary point using three MSD primary battery test sets (#1, #2, and #3). Specifically, we evaluated the efficiency of the C-SDFormer model through ablation studies. Additionally, we trained existing battery SOC estimation methods on MSD primary battery data and compared their performance on the same test set.

To assess the contributions of individual components in the C-SDFormer model, we conducted an ablation study, evaluating four modifications: (i) without the CNN embedding layer (which effectively replaces it with a linear layer), (ii) without the CNN output layer, (iii) without the transformer layer, and (iv) without the series decomposition layer. The experimental results, summarized in Table 5, indicate that the complete C-SDFormer architecture outperforms all of these modified configurations across all evaluation metrics. Specifically, our ablation study reveals that the CNN embedding layer is the critical component: its removal resulted in the largest performance drop across MAE, RMSE, and R² metrics. This finding underscores its essential role in extracting robust, informative features from the raw input data. Without this layer, the model’s ability to capture subtle variations in voltage, temperature, and communication count is markedly diminished, leading to poorer SOC estimation. Similarly, when the CNN output layer was omitted, we observed a notable decline in prediction accuracy. This layer is responsible for refining and consolidating the extracted features before they are passed to subsequent processing stages, and its absence reduces the model’s capacity to form a cohesive representation of the input signal. Moreover, a configuration of the model that entirely excluded the transformer component—relying solely on CNN layers—showed significantly reduced performance across all evaluated metrics. This result demonstrates that the self-attention mechanism inherent to the transformer is indispensable for capturing the temporal dependencies and transient voltage fluctuations induced by communication. These transient events, although brief, are crucial for accurately predicting the discharge capacity, and the transformer effectively highlights these features through its attention weights. Finally, the removal of the series decomposition layer led to a considerable performance reduction (over 0.6% increase in MAE, 1% increase in RMSE, 0.5% increase in SMAPE, and a decrease of 0.01 in R²). This layer plays a pivotal role in segregating seasonal patterns from trend components, enabling the model to address the nonlinear dynamics present in the data more effectively. Its exclusion confirms that without this separation, the model struggles to differentiate between long-term trends and periodic variations, thereby compromising prediction accuracy. Collectively, these results highlight the complementary roles of each component in the C-SDFormer architecture. The ablation study not only validates the design choices made in our model but also demonstrates how each module contributes significantly to the overall robustness and accuracy of SOC prediction in real-world conditions.

The previous experiments show that the structure of the proposed C-SDFormer model can successfully learn the characteristics of an MSD primary battery. Here, we performed a performance comparison with existing battery SoC estimation methods on the same dataset under identical conditions, and found that C-SDFormer was superior. Table 6 presents the performance comparison of SOC estimation for the test sets between the C-SDFormer and other methods. Transformer-based advanced models [19,20,21,22] utilize various features such as current and SOC as input to specific layers during training to improve SOC estimation. However, these models were not applicable to our study because our dataset does not include these features as training inputs. In Table 6, the C-SDFormer outperforms the other models on all performance metrics, generally, with RMSE being the best performer for all test sets. The average MAE of C-SDFormer is 4.021, showing that on average, the SOC estimates are closest to the actual values. The average RMSE is 5.186, indicating that large errors occur less frequently. The average SMAPE is 7.072, highlighting the model’s reduced sensitivity to relative differences, which underscores its robustness. The average R² is 0.965, demonstrating the model’s ability to accurately capture the patterns and variability in the data. Specifically, the average MAE and RMSE values demonstrate over 1% better performance compared to existing models, with the average RMSE showing a difference of approximately 1%, and the average R² value exhibiting a difference of over 0.02. In addition, C-SDFormer shows competitive results, with an MAE difference of only 0.003 compared to CNN-LSTM-DNN [16] for #2, and an R² difference of 0.02 compared to LSTM [14] for #1. These results highlight the effectiveness of the C-SDFormer model in enabling real-time SOC estimation and providing accurate and robust predictions without relying on discharge capacity as an input. The proposed battery life prediction method is particularly suitable for practical applications such as remote metering. It addresses critical considerations, including minimizing hardware costs and ensuring robust operation. The model demonstrates high prediction accuracy while processing data such as voltage, temperature, and communication count. This capability highlights its potential for deployment in real-world scenarios. Additional performance evaluations and extended analyses are provided in Table A1 in the appendix, offering further insights into the model’s robustness and superiority.

We recommend replacing the battery as soon as it reaches the primary point, ensuring a sufficient maintenance margin for life prediction stability. To validate this approach, we evaluated the SOC estimation error at the primary point. Figure 8 illustrates the SOC estimation error rates at the primary point for the C-SDFormer model compared to other methods. Our results show that the C-SDFormer model achieves the lowest error rates, with values of 0.27, 0.25, and 0.01, respectively. These findings demonstrate the superior predictive accuracy and robustness of the C-SDFormer model in estimating battery SOC. The ability to accurately predict the primary point before a sharp voltage drop occurs ensures that a hardware-based low battery alert is triggered in time. Moreover, even when unexpected SOC estimation errors result in missing SOC states, the voltage drop still effectively activates the low battery alert. This capability substantially mitigates the risk of unexpected power loss and supports proactive battery management, thereby enhancing the overall reliability and operational stability of our system in real-world applications.

6. Conclusions

In this paper, we propose a battery life prediction method to ensure the robust operation of IoT devices in remote metering applications. The robust battery life prediction process proposed in this paper is structured in two stages. The first stage focuses on predicting the SOC, enabling the real-time remote monitoring of battery conditions in metering devices. In the second stage, a hardware-based alerting mechanism is integrated into IoT devices to issue warnings prior to complete discharge. This mechanism utilizes a custom-designed MSD primary battery architecture to enhance the reliability and operational efficiency of IoT devices in remote metering applications. These two stages collectively contribute to improving battery life management and ensuring the robust operation of IoT systems.

For the real-time prediction of SOC, we developed the C-SDFormer model. C-SDFormer uses a self-attention mechanism to process sequences of voltage, temperature, and communication count data for SOC prediction. Specifically, the model uses transformer layers to effectively extract features from input and output sequences, and series decomposition is integrated into the transformer to mitigate noise. The effectiveness of this model architecture was validated through an ablation study.

To evaluate the SOC estimation, we deployed smart gas meters equipped with the MSD primary battery in households and collected discharge characteristic data. The collected temperature, voltage, and communication count data were used as model inputs, while the discharge capacity was used as the ground truth for SOC estimation. As a result, C-SDFormer demonstrated superior performance compared to existing SOC prediction methods, achieving an average MAE of 4.021%, an average RMSE of 5.186%, an average SMAPE of 7.072%, and an average R² of 0.965. In addition, C-SDFormer produced results closer to the ground truth in primary point estimation than other methods. These results confirm the C-SDFormer model’s robustness and accuracy in SOC estimation and primary point prediction for MSD primary batteries, offering a promising solution for real-world applications.

In the second stage of the proposed method, a Multi-Stage Discharge (MSD) primary battery was developed to enable a hardware-based low battery alert before the battery is fully discharged. This design ensures the reliable operation of IoT devices, particularly those not actively managed through real-time SOC monitoring in the first stage. By providing timely low battery alerts, the MSD battery minimizes the risk of unexpected device shutdowns, thereby enhancing the overall reliability and operational continuity of IoT devices in remote metering scenarios. These contributions collectively address critical challenges in battery management and pave the way for more robust IoT applications in real-world environments.Furthermore, our proposed method demonstrates significant potential for practical deployment, as it effectively reduces the reliance on expensive SOC measurement equipment while maintaining robust performance, even in challenging environments. However, challenges remain in integrating the model into resource-constrained devices and ensuring consistent accuracy across diverse operational conditions. Future work will focus on optimizing system integration and enhancing adaptability to further improve the scalability and reliability of the proposed approach in real-world applications.

Author Contributions

T.Y. proposed the idea and defined the concept for this study, and also prepared the data and scenario design. S.K. contributed to data collection, while C.L. led the design and development of the MSD battery. The first draft of the manuscript was written by T.Y. The review process, supervision, and approval of the final manuscript was done by J.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by the Technology Innovation Program (RS-2022-00154678, Development of Intelligent Sensor Platform Technology for Connected Sensor) funded By the Ministry of Trade, Industry & Energy (MOTIE, Republic of Korea); and in part by the Technology Innovation Program (20015961, The Development of Lifetime Prediction and State of Charge Estimating Techniques for Primary Lithium Batteries in Industrial IoT Devices) funded By the Ministry of Trade, Industry & Energy (MOTIE, Republic of Korea).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets presented in this article are not readily available because they are part of an ongoing study and contain proprietary experimental data. Requests to access the datasets should be directed to the corresponding author.

Conflicts of Interest

Author Chaebong Lee was employed by the company “Vitzro Cell Co., Ltd.”. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A. Supplementary Experimental Details

Appendix A.1. Additional Analysis and Metrics

This appendix provides supplementary information that supports the main text by including extended performance comparisons and additional analyses that further validate our results. The table below summarizes the performance of SOC estimation (State of Charge) by comparing the Mean Absolute Percentage Error (MAPE) and Mean Squared Error (MSE) across various methods, with the values for #1, #2, and #3 corresponding to three independent experiments and the ’Average’ column reporting their mean. MAPE quantifies errors as percentages, making it useful for interpreting performance relative to SOC values, while MSE penalizes larger errors more heavily due to the squaring effect, offering a sensitive measure for detecting significant deviations. The analysis shows that the proposed C-SDFormer outperforms the compared methods by achieving a notably lower MAPE of 12.82 and an MSE of only 0.275, demonstrating its superior accuracy and robustness in SOC estimation across diverse experimental conditions.

Table A1. Performance comparison of SOC estimation between C-SDFormer and other methods (MAPE and MSE).

Method	MAPE				MSE (%)
Method	#1	#2	#3	Average	#1	#2	#3	Average
DNN [13]	14.65	25.86	19.93	20.14	0.385	0.960	0.642	0.662
LSTM [14]	12.57	17.84	15.43	15.28	0.220	0.581	0.311	0.370
CNN [15]	12.50	16.70	18.93	16.04	0.234	0.457	0.495	0.395
CNN-LSTM-DNN [16]	13.84	14.92	16.01	14.92	0.326	0.415	0.597	0.446
Bi-GRU [17]	11.89	23.60	15.50	16.99	0.326	0.415	0.597	0.446
CNN-BWGRU [18]	12.07	18.55	13.17	14.59	0.221	0.697	0.383	0.433
C-SDFormer (Ours)	9.840	16.63	12.01	12.82	0.211	0.408	0.208	0.275

Note: Bold values indicate the best performance among the compared models.

References

Marahatta, A.; Rajbhandari, Y.; Shrestha, A.; Singh, A.; Thapa, A.; Gonzalez-Longatt, F.; Korba, P.; Shin, S. Evaluation of a lora mesh network for smart metering in rural locations. Electronics 2021, 10, 751. [Google Scholar] [CrossRef]
Da Cunha, M.P. High-temperature harsh-environment saw sensor technology. In Proceedings of the 2023 IEEE International Ultrasonics Symposium (IUS), Montreal, QC, Canada, 3–8 September 2023; pp. 1–10. [Google Scholar]
Callebaut, G.; Leenders, G.; Van Mulders, J.; Ottoy, G.; De Strycker, L.; Van der Perre, L. The art of designing remote iot devices—Technologies and strategies for a long battery life. Sensors 2021, 21, 913. [Google Scholar] [CrossRef]
Dai, Y.; Jing, Z.; Sun, Y. Reliability-centered prediction methods of clock battery low-output-voltage event for smart electricity meters. Microelectron. Reliab. 2022, 138, 114631. [Google Scholar] [CrossRef]
Shrestha, A.; Bishwokarma, R.; Chapagain, A.; Banjara, S.; Aryal, S.; Mali, B.; Thapa, R.; Bista, D.; Hayes, B.P.; Papadakis, A.; et al. Peer-to-peer energy trading in micro/mini-grids for local energy communities: A review and case study of Nepal. IEEE Access 2019, 7, 131911–131928. [Google Scholar] [CrossRef]
Duan, Z.; Yuan, Z.; Jiang, Y.; Zhao, Q.; Huang, Q.; Zhang, Y.; Liu, B.; Tai, H. Power generation humidity sensor based on primary battery structure. Chem. Eng. J. 2022, 446, 136910. [Google Scholar] [CrossRef]
Marom, R.; Amalraj, S.F.; Leifer, N.; Jacob, D.; Aurbach, D. A review of advanced and practical lithium battery materials. J. Mater. Chem. 2011, 21, 9938–9954. [Google Scholar] [CrossRef]
Quy, V.K.; Hau, N.V.; Anh, D.V.; Quy, N.M.; Ban, N.T.; Lanza, S.; Randazzo, G.; Muzirafuti, A. IoT-enabled smart agriculture: Architecture, applications, and challenges. Appl. Sci. 2022, 12, 3396. [Google Scholar] [CrossRef]
Kalsoom, T.; Ramzan, N.; Ahmed, S.; Ur-Rehman, M. Advances in sensor technologies in the era of smart factory and industry 4.0. Sensors 2020, 20, 6783. [Google Scholar] [CrossRef]
Schmidt, C.L.; Skarstad, P.M. The future of lithium and lithium-ion batteries in implantable medical devices. J. Power Sources 2001, 97, 742–746. [Google Scholar] [CrossRef]
Jackson, N.; Adkins, J.; Dutta, P. Capacity over capacitance for reliable energy harvesting sensors. In Proceedings of the 18th International Conference on Information Processing in Sensor Networks, Montreal, QC, Canada, 15–18 April 2019; pp. 193–204. [Google Scholar]
Bathre, M.; Das, P.K. Smart dual battery management system for expanding lifespan of wireless sensor node. Int. J. Commun. Syst. 2023, 36, e5389. [Google Scholar] [CrossRef]
How, D.N.; Hannan, M.A.; Lipu, M.S.H.; Sahari, K.S.; Ker, P.J.; Muttaqi, K.M. State-of-charge estimation of li-ion battery in electric vehicles: A deep neural network approach. IEEE Trans. Ind. Appl. 2020, 56, 5565–5574. [Google Scholar] [CrossRef]
Hong, J.; Wang, Z.; Chen, W.; Wang, L.Y.; Qu, C. Online joint-prediction of multi-forward-step battery SOC using LSTM neural networks and multiple linear regression for real-world electric vehicles. J. Energy Storage 2020, 30, 101459. [Google Scholar] [CrossRef]
Zhang, S.; Zhu, H.; Wu, J.; Chen, Z. Voltage relaxation-based state-of-health estimation of lithium-ion batteries using convolutional neural networks and transfer learning. J. Energy Storage 2023, 73, 108579. [Google Scholar] [CrossRef]
Zraibi, B.; Okar, C.; Chaoui, H.; Mansouri, M. Remaining useful life assessment for lithium-ion batteries using CNN-LSTM-DNN hybrid method. IEEE Trans. Veh. Technol. 2021, 70, 4252–4261. [Google Scholar] [CrossRef]
Zhang, Z.; Dong, Z.; Lin, H.; He, Z.; Wang, M.; He, Y.; Gao, X.; Gao, M. An improved bidirectional gated recurrent unit method for accurate state-of-charge estimation. IEEE Access 2021, 9, 11252–11263. [Google Scholar] [CrossRef]
Cui, Z.; Kang, L.; Li, L.; Wang, L.; Wang, K. A hybrid neural network model with improved input for state of charge estimation of lithium-ion battery at low temperatures. Renew. Energy 2022, 198, 1328–1340. [Google Scholar] [CrossRef]
Shen, H.; Zhou, X.; Wang, Z.; Wang, J. State of charge estimation for lithium-ion battery using Transformer with immersion and invariance adaptive observer. J. Energy Storage 2022, 45, 103768. [Google Scholar] [CrossRef]
Bao, Z.; Nie, J.; Lin, H.; Gao, K.; He, Z.; Gao, M. TTSNet: State-of-Charge Estimation of Li-ion Battery in Electrical Vehicles with Temporal Transformer-based Sequence Network. IEEE Trans. Veh. Technol. 2024, 73, 7838–7851. [Google Scholar] [CrossRef]
Nakano, K.; Tanaka, K. Transformer-Based Online Battery State of Health Estimation from Electric Vehicle Driving Data. In Proceedings of the 15th International Conference on Applied Energy (ICAE), Doha, Qatar, 3–7 December 2023; Volume 43. [Google Scholar]
Fauzi, M.R.; Yudistira, N.; Mahmudy, W.F. State-of-Health Prediction of Lithium-Ion Batteries using Exponential Smoothing Transformer with Seasonal and Growth Embedding. IEEE Access 2024, 12, 14659–14670. [Google Scholar] [CrossRef]
Madden, H.H. Comments on the Savitzky-Golay convolution method for least-squares-fit smoothing and differentiation of digital data. Anal. Chem. 1978, 50, 1383–1386. [Google Scholar] [CrossRef]
Bahdanau, D.; Cho, K.; Bengio, Y. Neural machine translation by jointly learning to align and translate. arXiv 2014, arXiv:1409.0473. [Google Scholar]
Yu, Y.; Si, X.; Hu, C.; Zhang, J. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 2019, 31, 1235–1270. [Google Scholar] [CrossRef]
Sherstinsky, A. Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Phys. D Nonlinear Phenom. 2020, 404, 132306. [Google Scholar] [CrossRef]
Dey, R.; Salem, F.M. Gate-variants of gated recurrent unit (GRU) neural networks. In Proceedings of the 2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS), Boston, MA, USA, 6–9 August 2017; pp. 1597–1600. [Google Scholar]
Wolf, T.; Debut, L.; Sanh, V.; Chaumond, J.; Delangue, C.; Moi, A.; Cistac, P.; Rault, T.; Louf, R.; Funtowicz, M.; et al. Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Online, 16–20 November 2020; pp. 38–45. [Google Scholar]
Zhou, H.; Zhang, S.; Peng, J.; Zhang, S.; Li, J.; Xiong, H.; Zhang, W. Informer: Beyond efficient transformer for long sequence time-series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual, 2–9 February 2021; Volume 35, pp. 11106–11115. [Google Scholar]
Wu, H.; Xu, J.; Wang, J.; Long, M. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Adv. Neural Inf. Process. Syst. 2021, 34, 22419–22430. [Google Scholar]
Zhou, T.; Ma, Z.; Wen, Q.; Wang, X.; Sun, L.; Jin, R. Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting. In Proceedings of the International Conference on Machine Learning. PMLR, Baltimore, MA, USA, 17–23 July 2022; pp. 27268–27286. [Google Scholar]
Nie, Y.; Nguyen, N.H.; Sinthong, P.; Kalagnanam, J. A time series is worth 64 words: Long-term forecasting with transformers. arXiv 2022, arXiv:2211.14730. [Google Scholar]
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30. [Google Scholar]

Figure 1. Battery life prediction method for IoT devices in remote metering applications.

Figure 3. Remote gas meters and data collection board. (a) Remote gas meter with built-in PCB. (b) MSD primary battery installed in remote gas meter. (c) Rear part of PCB for data measurement. (d) Remote gas meter installed in house (deployed to 50 households).

Figure 4. Voltage graphs before and after missing value imputation. Left (a) shows original voltage values, and right (b) shows imputed voltage values.

Figure 5. Graph before and after applying convolutional smoothing. The noise in the graph on the left (a) is greatly decreased in the graph on the right (b).

Figure 6. Process of determining the primary point. (a) The minimum of the second derivative indicates the primary point. (b) Location of the primary point in the discharge curve. (c) Truncated data before the primary point.

Figure 7. Structure of the proposed C-SDFormer model.

Figure 8. Estimation error rate of SOC at primary point.

Table 1. Data collected from smart gas meters.

Collected Data	Detail
Timestamp [Second]	Timestamp of data collection
Discharge Capacity [mAh]	Battery usage using Coulomb Counter (Used as the ground truth for SOC prediction)
Temperature [°C]	Battery operating temperature
Voltage [V]	Present voltage
Communication Count [count]	Number of communications attempts per 10 min

Table 2. Sample of input data.

Timestamp	Voltage [V]	Temperature [°C]	Communication Count [Count]
1658103487	3.814	32.81	0
1658104087	3.814	32.76	0
1658104687	3.785	32.86	3
1658105287	3.811	32.71	0
1658105887	3.811	32.69	0
1658106487	3.813	32.74	0

Table 3. Summary statistics of input data.

Input Variables	Mean	Std	Min	Max
Temperature [°C]	25.2	4.37	3.02	37.1
Voltage [V]	3.59	0.08	3.34	3.83
Communication Count [count]	0.55	1.56	0.00	41.0

Table 4. Hyperparameters of the C-SDFormer.

Stage	Order	Layers	Layers Parameters
Input Embedding	1	Xavier Initialization	gain = 1.0
	2	Conv1D	Number of filters = 32, Kernel size = 7
	3	BatchNorm1D	-
	4	Activation Function	Tanh
	5	Conv 1D	Number of filters = 64, Kernel size = 5
	6	BatchNorm 1D	-
	7	Activation Function	Tanh
	8	Conv 1D	Number of filters = 128, Kernel size = 3
	9	BatchNorm 1D	-
	10	Activation Function	Tanh
	11	Conv 1D	Number of filters = 128, kernel size = 1
	12	BatchNorm 1D	-
	13	Activation Function	Tanh
Encoder	1	Encoder layer	Self-attention heads = 4, Encoder blocks = 8
Feed Forward	1	Linear	Hidden units = 32, Activation = Gelu
	2	Drop out	Rate = 0.05
	3	Linear	Hidden units = 128
Outputs	1	Xavier Initialization	gain = 1.0
	2	Conv1D	Number of filters = 64, Kernel size = 3
	3	Activation Function	Tanh
	4	Conv1D	Number of filters = 32, Kernel size = 3
	5	Linear	Hidden units = 1

Table 5. Ablation study of C-SDFormer.

Ablated Layer	MAE (%)				RMSE (%)				SMAPE (%)				R²
Ablated Layer	#1	#2	#3	Average	#1	#2	#3	Average	#1	#2	#3	Average	#1	#2	#3	Average
w/o CNN Embedding	4.764	6.822	7.907	6.497	6.177	8.628	9.525	8.110	7.977	12.92	12.86	11.21	0.947	0.871	0.882	0.900
w/o CNN Output	4.524	4.634	7.064	5.407	5.597	6.447	7.064	6.369	7.235	8.431	11.49	9.052	0.954	0.933	0.915	0.934
w/o Transformer	5.273	6.816	5.644	5.911	6.178	8.338	6.777	7.097	10.33	10.89	9.117	10.11	0.928	0.854	0.928	0.903
w/o Series Decomposition	4.641	4.966	4.500	4.702	6.236	6.652	5.973	6.287	6.714	9.790	6.323	7.609	0.949	0.944	0.958	0.950
C-SDFormer (Ours)	3.408	5.123	3.534	4.021	4.602	6.391	4.567	5.186	5.070	11.60	4.547	7.072	0.972	0.950	0.974	0.965

Note: Bold values indicate the best performance among the ablated layers.

Table 6. Performance comparison of SOC estimation between C-SDFormer and other methods.

Method	MAE (%)				RMSE (%)				SMAPE (%)				R²
Method	#1	#2	#3	Average	#1	#2	#3	Average	#1	#2	#3	Average	#1	#2	#3	Average
DNN [13]	4.799	7.643	6.366	6.269	6.207	9.802	8.018	8.009	8.131	11.54	8.708	9.459	0.949	0.854	0.920	0.907
LSTM [14]	3.745	6.921	4.360	5.008	4.406	8.037	5.558	6.000	7.471	10.23	6.744	8.148	0.974	0.903	0.960	0.945
CNN [15]	3.793	5.337	6.169	5.099	4.840	6.763	7.035	6.212	7.366	8.144	9.441	8.317	0.964	0.921	0.928	0.937
CNN-LSTM-DNN [16]	4.228	5.120	6.241	5.196	5.716	6.448	7.729	6.631	7.260	11.03	9.226	9.172	0.954	0.943	0.920	0.939
Bi-GRU [17]	3.810	7.492	6.565	5.955	4.844	9.250	8.417	7.503	6.714	12.32	8.789	9.274	0.968	0.889	0.904	0.920
CNN-BWGRU [18]	3.777	6.586	4.995	5.119	4.704	8.353	6.192	6.416	6.834	12.48	6.898	8.737	0.970	0.904	0.948	0.940
C-SDFormer (Ours)	3.408	5.123	3.534	4.021	4.602	6.391	4.567	5.186	5.070	11.60	4.547	7.072	0.972	0.950	0.974	0.965

Note: Bold values indicate the best performance among the compared methods.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yong, T.; Lee, C.; Kim, S.; Kim, J. Battery Life Prediction for Ensuring Robust Operation of IoT Devices in Remote Metering. Appl. Sci. 2025, 15, 2968. https://doi.org/10.3390/app15062968

AMA Style

Yong T, Lee C, Kim S, Kim J. Battery Life Prediction for Ensuring Robust Operation of IoT Devices in Remote Metering. Applied Sciences. 2025; 15(6):2968. https://doi.org/10.3390/app15062968

Chicago/Turabian Style

Yong, Taein, Chaebong Lee, Seongseop Kim, and Jaeho Kim. 2025. "Battery Life Prediction for Ensuring Robust Operation of IoT Devices in Remote Metering" Applied Sciences 15, no. 6: 2968. https://doi.org/10.3390/app15062968

APA Style

Yong, T., Lee, C., Kim, S., & Kim, J. (2025). Battery Life Prediction for Ensuring Robust Operation of IoT Devices in Remote Metering. Applied Sciences, 15(6), 2968. https://doi.org/10.3390/app15062968

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Battery Life Prediction for Ensuring Robust Operation of IoT Devices in Remote Metering

Abstract

1. Introduction

2. Related Work

3. Remote Metering Life Prediction Method

3.1. Multi-Stage Discharge Primary Battery

3.2. Data Collection and Preprocessing

3.3. C-SDFormer Model for Predicting the Discharge Capacity of MSD Primary Battery

3.4. SOC and Remaining Time Calculation Using the Predicted Discharge Capacity

4. Experimental Setup

4.1. Hyperparameters of C-SDFormer

4.2. Implementation Details

4.3. Evaluation Metrics

5. Experimental Results and Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Supplementary Experimental Details

Appendix A.1. Additional Analysis and Metrics

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI