BWO–ICEEMDAN–iTransformer: A Short-Term Load Forecasting Model for Power Systems with Parameter Optimization
Abstract
:1. Introduction
- A novel ICEEMDAN–iTransformer forecasting model based on the BWO is proposed. By integrating data decomposition, optimization algorithms, and an improved Transformer model, the accuracy of short-term electricity load forecasting is enhanced.
- The data decomposition process is optimized by utilizing BWO to adjust ICEEMDAN parameters, maximizing the decomposition effectiveness and reducing the non-stationarity of electricity load data.
2. Methodology
2.1. Correlation Analysis
2.2. Beluga Whale Optimization (BWO)
2.2.1. Exploration Phase
2.2.2. Exploitation Phase
2.2.3. Whale Fall Phase
2.3. ICEEMDAN
- (1)
- White noise is introduced into the original signal to construct the signal:
- (2)
- The residual of the first stage is obtained as follows:
- (3)
- The modal component of the first stage is calculated as follows:
- (4)
- The residual and modal components of the second stage are obtained as follows:
- (5)
- The above steps are repeated to find the residual and modal component of the j-th stage:
- (6)
- The iteration stops when the residual becomes a monotonic function, or when the standard deviation of adjacent IMFs becomes less than 0.2.
2.4. iTransformer
- (1)
- A multilayer perceptron (MLP) to map to . Here, contains all the temporal changes of the corresponding variable over the past time, called a Variate Token.
- (2)
- A multivariate attention mechanism is used to analyze the correlation between each Variate Token.
- (3)
- Each Variate Token is normalized to follow a Gaussian distribution, ensuring that the features of all variables are under a relatively uniform distribution, thereby reducing differences in measurement units.
- (4)
- Extract the intrinsic properties of the sequences using feed-forward neural networks, followed by layer normalization.
- (5)
- Return the predicted sequence.
2.5. Criteria
3. Proposed BWO–ICEEMDAN–iTransformer Model
- (1)
- The raw power load data contains some anomalies and missing values that need to be preprocessed. For missing values, fill them with the average value of the same time point over the previous and subsequent three days. For anomalies, apply the three-sigma rule for screening and treat them as missing values.
- (2)
- Optimize the two parameters of ICEEMDAN, noise standard deviation (Nstd) and the number of realizations (NR), using BWO to maximize the decomposition effect.
- (3)
- Use ICEEMDAN to decompose the preprocessed electricity load data, generating modal components and residuas.
- (4)
- Use SCC to select factors with high correlation to electricity load, including electricity price, related humidity (RH), and temperature (T).
- (5)
- Reconstruct the decomposed subsequences with the selected relevant factors.
- (6)
- Use the iTransformer model to obtain the predicted results for each subsequence and then combine them to obtain the final prediction.
4. Results and Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
References
- Lee, C.-M.; Ko, C.-N. Short-term load forecasting using lifting scheme and ARIMA models. Expert Syst. Appl. 2011, 38, 5902–5911. [Google Scholar] [CrossRef]
- Li, Y.; Niu, D. Application of Principal Component Regression Analysis in power load forecasting for medium and long term. In Proceedings of the 2010 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE), Chengdu, China, 20–22 August 2010; IEEE: Piscataway, NJ, USA, 2010; Volume 3, pp. V3-201–V3-203. [Google Scholar]
- Al Amin, M.A.; Hoque, M.A. Comparison of ARIMA and SVM for short-term load forecasting. In Proceedings of the 2019 9th Annual Information Technology, Electromechanical Engineering and Microelectronics Conference (IEMECON), Jaipur, India, 13–15 March 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–6. [Google Scholar]
- Imani, M. Electrical load-temperature CNN for residential load forecasting. Energy 2021, 227, 120480. [Google Scholar] [CrossRef]
- Das, A.; Annaqeeb, M.K.; Azar, E.; Novakovic, V.; Kjærgaard, M.B. Occupant-centric miscellaneous electric loads prediction in buildings using state-of-the-art deep learning methods. Appl. Energy 2020, 269, 115135. [Google Scholar] [CrossRef]
- So, D.; Oh, J.; Jeon, I.; Moon, J.; Lee, M.; Rho, S. BiGTA-Net: A hybrid deep learning-based electrical energy forecasting model for building energy management systems. Systems 2023, 11, 456. [Google Scholar] [CrossRef]
- Li, C.; Li, G.; Wang, K.; Han, B. A multi-energy load forecasting method based on parallel architecture CNN-GRU and transfer learning for data deficient integrated energy systems. Energy 2022, 259, 124967. [Google Scholar] [CrossRef]
- Kim, T.Y.; Cho, S.B. Predicting residential energy consumption using CNN-LSTM neural networks. Energy 2019, 182, 72–81. [Google Scholar] [CrossRef]
- Yadav, K.; Singh, M. A novel energy management of public charging stations using attention-based deep learning model. Electr. Power Syst. Res. 2025, 238, 111090. [Google Scholar] [CrossRef]
- Lin, J.; Ma, J.; Zhu, J.; Cui, Y. Short-term load forecasting based on LSTM networks considering attention mechanism. Int. J. Electr. Power Energy Syst. 2022, 137, 107818. [Google Scholar] [CrossRef]
- Niu, D.; Yu, M.; Sun, L.; Gao, T.; Wang, K. Short-term multi-energy load forecasting for integrated energy systems based on CNN-BiGRU optimized by attention mechanism. Appl. Energy 2022, 313, 118801. [Google Scholar] [CrossRef]
- Gao, X.; Li, X.; Zhao, B.; Ji, W.; Jing, X.; He, Y. Short-term electricity load forecasting model based on EMD-GRU with feature selection. Energies 2019, 12, 1140. [Google Scholar] [CrossRef]
- Li, K.; Duan, P.; Cao, X.; Cheng, Y.; Zhao, B.; Xue, Q.; Feng, M. A multi-energy load forecasting method based on complementary ensemble empirical model decomposition and composite evaluation factor reconstruction. Appl. Energy 2024, 365, 123283. [Google Scholar] [CrossRef]
- Li, L.; Jing, R.; Zhang, Y.; Wang, L.; Zhu, L. Short-term power load forecasting based on ICEEMDAN-GRA-SVDE-BiGRU and error correction model. IEEE Access 2023, 11, 110060–110074. [Google Scholar] [CrossRef]
- Vaswani, A. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 5998–6008. [Google Scholar]
- Tian, Z.; Liu, W.; Jiang, W.; Wu, C. Cnns-transformer based day-ahead probabilistic load forecasting for weekends with limited data availability. Energy 2024, 293, 130666. [Google Scholar] [CrossRef]
- Hu, J.; Hu, W.; Cao, D.; Sun, X.; Chen, J.; Huang, Y.; Chen, Z.; Blaabjerg, F. Probabilistic net load forecasting based on transformer network and Gaussian process-enabled residual modeling learning method. Renew. Energy 2024, 225, 120253. [Google Scholar] [CrossRef]
- Xu, C.; Chen, G. Interpretable transformer-based model for probabilistic short-term forecasting of residential net load. Int. J. Electr. Power Energy Syst. 2024, 155, 109515. [Google Scholar] [CrossRef]
- Liu, Y.; Hu, T.; Zhang, H.; Wu, H.; Wang, S.; Ma, L.; Long, M. iTransformer: Inverted transformers are effective for time series forecasting. arXiv 2023, arXiv:2310.06625. [Google Scholar]
- Zhong, C.; Li, G.; Meng, Z. Beluga whale optimization: A novel nature-inspired metaheuristic algorithm. Knowl. Based Syst. 2022, 251, 109215. [Google Scholar] [CrossRef]
- Hassan, M.H.; Kamel, S.; Jurado, F.; Ebeed, M.; Elnaggar, M.F. Economic load dispatch solution of large-scale power systems using an enhanced beluga whale optimizer. Alex. Eng. J. 2023, 72, 573–591. [Google Scholar] [CrossRef]
- Wang, C.; Wang, K.; Liu, D.; Zhang, L.; Li, M.; Khan, M.I.; Li, T.; Cui, S. Development and application of a comprehensive assessment method of regional flood disaster risk based on a refined random forest model using beluga whale optimization. J. Hydrol. 2024, 633, 130963. [Google Scholar] [CrossRef]
- Hou, H.; Liu, C.; Wang, Q.; Wu, X.; Tang, J.; Shi, Y.; Xie, C. Review of load forecasting based on artificial intelligence methodologies. Electr. Power Syst. Res. 2022, 210, 108067. [Google Scholar] [CrossRef]
- Shen, Y.; Li, D.; Wang, W. Multi-energy load prediction method for integrated energy system based on fennec fox optimization algorithm and hybrid kernel extreme learning machine. Entropy 2024, 26, 699. [Google Scholar] [CrossRef] [PubMed]
- NEMS Prices [EB/OL]. 2024. Available online: https://www.nems.emcsg.com/nems-prices (accessed on 30 September 2024).
- Zhou, T.; Ma, Z.; Wen, Q.; Wang, X.; Sun, L.; Jin, R. Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting. In Proceedings of the International Conference on Machine Learning, Baltimore, MD, USA, 17–23 July 2022; PMLR: Cambridge, MA, USA, 2022; pp. 27268–27286. [Google Scholar]
- Liu, S.; Yu, H.; Liao, C.; Li, J.; Lin, W.; Liu, A.X.; Dustdar, S. Pyraformer: Low-complexity pyramidal attention for long-range time series modeling and forecasting. In Proceedings of the International Conference on Learning Representations, Virtual Event, 25–29 April 2022. [Google Scholar]
- Wu, H.; Xu, J.; Wang, J.; Long, M. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Adv. Neural Inf. Process. Syst. 2021, 34, 22419–22430. [Google Scholar]
- Yi, K.; Zhang, Q.; Fan, W.; Wang, S.; Wang, P.; He, H.; Lian, D.; An, N.; Cao, L.; Niu, Z. Frequency-domain MLPs are more effective learners in time series forecasting. Adv. Neural Inf. Process. Syst. 2024, 36, 76656–76679. [Google Scholar]
- Zhang, T.; Zhang, Y.; Cao, W.; Bian, J.; Yi, X.; Zheng, S.; Li, J. Less is more: Fast multivariate time series forecasting with light sampling-oriented mlp structures. arXiv 2022, arXiv:2207.01186. [Google Scholar]
- Nie, Y.; Nguyen, N.H.; Sinthong, P.; Kalagnanam, J. A time series is worth 64 words: Long-term forecasting with transformers. arXiv 2022, arXiv:2211.14730. [Google Scholar]
- Liu, Y.; Li, C.; Wang, J.; Long, M. Koopa: Learning non-stationary time series dynamics with koopman predictors. Adv. Neural Inf. Process. Syst. 2023, 36, 12271–12290. [Google Scholar]
- Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey wolf optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef]
- Mirjalili, S. Dragonfly algorithm: A new meta-heuristic optimization technique for solving single-objective, discrete, and multi-objective problems. Neural Comput. Appl. 2016, 27, 1053–1073. [Google Scholar] [CrossRef]
- Braik, M.S. Chameleon Swarm Algorithm: A bio-inspired optimizer for solving engineering design problems. Expert Syst. Appl. 2021, 174, 114685. [Google Scholar] [CrossRef]
Factors | SCC |
---|---|
Price | 0.5131 |
Temperature | 0.6830 |
Dew Point | 0.0694 |
Relative Humidity | −0.6440 |
Pressure | 0.0615 |
Wind Direction | 0.0639 |
Wind Speed | 0.2080 |
Factors | SCC |
---|---|
Price | 0.4521 |
Temperature | 0.5699 |
Relative Humidity | −0.4473 |
Parameter | Value |
---|---|
Population size | 30 |
Maximum number of iterations | 30 |
Nstd | [0.15, 0.6] |
NR | [10, 600] |
Batch_size | 72 |
Epochs | 10 |
Patience | 3 |
Learning rate | 0.0001 |
linear layer | 512 |
FFN layer | 2048 |
Method | R2 | MAE | RMSE |
---|---|---|---|
iTransformer | 0.9577 | 83.2604 | 120.8423 |
FEDformer [26] | 0.9316 | 114.4395 | 153.7248 |
Pyraformer [27] | 0.8888 | 135.9388 | 195.9085 |
Autoformer [28] | 0.8138 | 193.9856 | 253.5933 |
FreTS [29] | 0.9029 | 126.8714 | 183.1028 |
LightTS [30] | 0.8796 | 149.3750 | 203.8944 |
PatchTST [31] | 0.9162 | 118.5022 | 170.0605 |
Koopa [32] | 0.8654 | 158.0816 | 215.6215 |
ICEEMDN–iTransformer | 0.9738 | 71.1998 | 95.0231 |
CEEMD–iTransformer | 0.9728 | 72.0094 | 96.9611 |
EEMD–iTransformer | 0.9686 | 80.4151 | 104.0551 |
CSA–ICEEMDAN–iTransformer | 0.9867 | 49.1543 | 67.7606 |
DA–ICEEMADN–iTransformer | 0.9858 | 50.0338 | 69.9737 |
GWO–ICEEMADN–iTransformer | 0.9869 | 48.8129 | 67.3607 |
BWO–CEEMD–iTransformer | 0.9767 | 67.1662 | 89.6762 |
BWO–EEMD–iTransformer | 0.9689 | 78.1639 | 103.5629 |
BWO–ICEEMADN–iTransformer | 0.9873 | 48.0014 | 66.2221 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zheng, D.; Qin, J.; Liu, Z.; Zhang, Q.; Duan, J.; Zhou, Y. BWO–ICEEMDAN–iTransformer: A Short-Term Load Forecasting Model for Power Systems with Parameter Optimization. Algorithms 2025, 18, 243. https://doi.org/10.3390/a18050243
Zheng D, Qin J, Liu Z, Zhang Q, Duan J, Zhou Y. BWO–ICEEMDAN–iTransformer: A Short-Term Load Forecasting Model for Power Systems with Parameter Optimization. Algorithms. 2025; 18(5):243. https://doi.org/10.3390/a18050243
Chicago/Turabian StyleZheng, Danqi, Jiyun Qin, Zhen Liu, Qinglei Zhang, Jianguo Duan, and Ying Zhou. 2025. "BWO–ICEEMDAN–iTransformer: A Short-Term Load Forecasting Model for Power Systems with Parameter Optimization" Algorithms 18, no. 5: 243. https://doi.org/10.3390/a18050243
APA StyleZheng, D., Qin, J., Liu, Z., Zhang, Q., Duan, J., & Zhou, Y. (2025). BWO–ICEEMDAN–iTransformer: A Short-Term Load Forecasting Model for Power Systems with Parameter Optimization. Algorithms, 18(5), 243. https://doi.org/10.3390/a18050243