PredXGBR: A Machine Learning Framework for Short-Term Electrical Load Prediction †
Abstract
:1. Introduction
- Summary of Contribution:
- We have developed and implemented PredXGBR-1, a short-term feature-based XGBoost model with time-lagged features. PredXGBR-1 is designed to capture short-term fluctuations in electricity demand by leveraging data from the previous 24 h, and it has been rigorously evaluated and validated across five different datasets. The integration of time-lagged features significantly improved prediction accuracy, addressing a key gap in existing forecasting methods.
- We performed an extensive analysis to explore how different feature sets influence the performance of the model. This comparative study proved our intuition that short-term lag features are essential for enhancing predictive accuracy, especially in rapidly fluctuating demand environments.
- The proposed model demonstrated high accuracy, achieving an MAPE of 0.98–1.2% across all datasets. This result underscores the model’s robustness and reliability for short-term load forecasting in diverse scenarios.
- We pledge to share the whole code repository and the dataset with the community to promote reproducibility and advancements in the field of electric load forecasting.
2. Related Works
3. Background and Preliminaries
3.1. ARIMA and Time Series Methods
3.2. SVM
3.3. DL Approaches
3.3.1. RNN
3.3.2. LSTM
3.3.3. Temporal Convolutional Networks (TCN)s
3.3.4. Transformer
3.4. XGBoost
4. Proposed Model: PredXGBR-1
4.1. Challenges in Short-Term Electrical Load Forecasting
- Nonlinear Relationships: Electrical load is influenced by various external factors such as weather conditions, time of day, and sudden shifts in demand. These complex and nonlinear relationships are difficult to model using conventional linear methods.
- Unbalanced Data: Load datasets are often characterized by periods of stable usage interspersed with sudden spikes or drops in consumption. This imbalance can negatively impact the performance of standard regression models.
- Temporal Dependencies: Load at a particular time is dependent on both short-term and long-term historical data, making the selection of features and modeling of temporal dependencies critical.
4.2. How PredXGBR-1 Addresses These Challenges
- Tree-Based Regression: The model employs the classification and regression tree (CART) as a base learner, enabling it to capture complex, nonlinear relationships within the data. The tree-based structure allows the model to perform well in unbalanced datasets by focusing on regions of the data with the highest residuals.
- Boosting Mechanism: XGBoost uses boosting to iteratively refine predictions by correcting the residual errors from previous iterations. This iterative process enables PredXGBR-1 to focus on improving short-term predictions, which are typically more volatile and difficult to forecast.
- Feature Selection: The model utilizes short-term lag features—mean and standard deviation of load over the prior 6, 12, and 24 h intervals—which capture the immediate temporal dependencies. This is critical in load forecasting, where short-term variations can greatly impact overall prediction accuracy.
4.3. Model Structure and Formalization
4.4. Illustration of the Model Structure
4.5. Objective Function
4.6. Leaf Weight Optimization
4.7. Tree Quality Evaluation
4.8. Split Candidate Evaluation
4.9. Model Generalization and Overfitting Control
5. Datasets and Feature Extraction
5.1. Data Preprocessing
- Handling Missing Values:Missing entries in the datasets were primarily due to interruptions in data logging or transmission errors, which can introduce biases and disrupt model training. To address this, we applied two techniques: linear interpolation and forward filling. For extended periods with missing data, linear interpolation was used to generate intermediate values based on surrounding data points, creating smooth transitions and preserving underlying trends. This approach was particularly useful for restoring temporal continuity over multiple time steps. For isolated missing points, forward filling was employed, where the last available valid value was repeated to fill gaps. This method was beneficial for short, intermittent gaps, ensuring the continuity of time series patterns without distorting the data structure. By addressing missing values, we maintained the temporal integrity of the data, allowing the model to capture continuous patterns in load fluctuations accurately.
- Organizing Data in Temporal Order:Accurate temporal sequencing is essential for load forecasting models, especially those that rely on time-lagged features to capture dependencies over time. To achieve this, we standardized all timestamps across datasets to a 24 h format using Python’s datetime module, which enabled us to resolve inconsistencies, such as incorrect AM/PM labels. These inconsistencies, if left unaddressed, could lead to misalignment of hourly data, resulting in misleading trends and poor model performance. After standardizing the timestamps, we sorted records in ascending order by hour, ensuring that each observation followed a natural temporal progression. This careful sequencing allowed our model to accurately interpret time-dependent features and reliably capture the structure of load demand patterns.
- Removing Duplicate Entries:Duplicate entries in time series data can skew the model’s learning process by overemphasizing certain observations, potentially leading to biased predictions. We performed a systematic search for duplicate records within each dataset, focusing on entries with identical timestamps and load values. Once identified, these duplicates were removed to ensure that each data point represented a unique, distinct observation. This step preserved the dataset’s integrity, enabling the model to generalize well by learning from an unbiased representation of historical load patterns.
- Segmentation of Peak and Off-Peak Hours:Load demand often fluctuates significantly between peak and off-peak periods, driven by factors such as residential and industrial activity levels. To capture these fluctuations, we segmented each day’s load data into peak and off-peak hours. Specifically, we recorded the maximum load observed during peak hours (typically between 5 p.m. and 9 p.m.) and the minimum load during off-peak hours (usually from midnight to early morning). This segmentation helped the model to distinguish between periods of high and low demand, enhancing its ability to forecast accurately across different times of the day. By providing the model with these segmented values, we enabled it to capture and adapt to the distinct patterns characteristic of peak and off-peak demand, which are critical for short-term load forecasting accuracy.
- Resolving AM/PM Inconsistencies:Time inconsistencies related to AM/PM formatting were common in some datasets and could interfere with temporal ordering. For instance, an entry incorrectly marked as “PM” instead of “AM” could cause significant deviations in the load pattern analysis, leading to inaccurate predictions. Using the datetime module, we converted all timestamps to a uniform 24 h format, thus eliminating ambiguity and ensuring that each record corresponded to the correct time of day. This consistency allowed the model to extract reliable time-dependent features and improved its capability to capture daily load cycles accurately.
5.2. Feature Extraction and Analysis
5.2.1. Seasonal Decomposition
5.2.2. Temporal Features of Electric Load Consumption
- Hour of the Day:This feature captures how electricity consumption varies throughout the 24 h daily cycle. It provides a granular view of consumption patterns on an hourly basis, which is crucial for identifying peak and off-peak hours. Typically, demand is lower during late night and early morning hours (11 p.m. to 7 a.m.), when most residential, commercial, and industrial activities are minimal. Conversely, consumption often peaks during the morning and early evening, when individuals and businesses are most active. This feature allows for a detailed examination of daily demand cycles and helps in load forecasting and grid management.
- Day of the Week:The day of the week feature distinguishes between weekdays and weekends, capturing the variation in electricity demand that occurs based on the socio-economic activities typically scheduled during the week. Weekdays (Monday through Friday) usually show higher and more stable demand patterns due to the regular operation of industries, offices, and commercial establishments. Weekends, on the other hand, may exhibit a drop in demand, particularly in commercial and industrial sectors, though residential consumption may fluctuate depending on lifestyle habits.
- Month of the Year:This feature reflects the seasonal variation in electric load consumption over the twelve months, providing insight into how different times of the year impact electricity demand. Seasonal changes drive consumption patterns, with summer months (e.g., July, August) generally showing higher demand due to increased use of cooling systems, while winter months (e.g., December, January) may reflect higher consumption due to heating needs. Transitional seasons, such as fall and spring, tend to exhibit lower and more stable consumption levels compared to the extremes of summer and winter. Analyzing monthly data helps understand the impact of climatic conditions on load demand, allowing for better planning and resource allocation.
6. Performance Evaluation
6.1. Evaluation Metrics
6.2. Optimal Parameter Selection
6.3. PredXGBR Performance Across Different Datasets
6.4. PredXGBR Generalization Performance
6.5. Computational Complexity and Inference Time
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Saqib, N.; Haque, K.F.; Zabin, R.; Preonto, S.N. Analysis of grid integrated PV system as home RES with net metering scheme. In Proceedings of the 2019 International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST), Dhaka, Bangladesh, 10–12 January 2019; pp. 395–399. [Google Scholar]
- Rodrigues, F.; Cardeira, C.; Calado, J.M.; Melicio, R. Short-term load forecasting of electricity demand for the residential sector based on modelling techniques: A systematic review. Energies 2023, 16, 4098. [Google Scholar] [CrossRef]
- Habbak, H.; Mahmoud, M.; Metwally, K.; Fouda, M.M.; Ibrahem, M.I. Load forecasting techniques and their applications in smart grids. Energies 2023, 16, 1480. [Google Scholar] [CrossRef]
- Akhtar, S.; Shahzad, S.; Zaheer, A.; Ullah, H.S.; Kilic, H.; Gono, R.; Jasiński, M.; Leonowicz, Z. Short-term load forecasting models: A review of challenges, progress, and the road ahead. Energies 2023, 16, 4060. [Google Scholar] [CrossRef]
- Eren, Y.; Küçükdemiral, İ. A comprehensive review on deep learning approaches for short-term load forecasting. Renew. Sustain. Energy Rev. 2024, 189, 114031. [Google Scholar] [CrossRef]
- Cordeiro-Costas, M.; Villanueva, D.; Eguía-Oller, P.; Martínez-Comesaña, M.; Ramos, S. Load forecasting with machine learning and deep learning methods. Appl. Sci. 2023, 13, 7933. [Google Scholar] [CrossRef]
- Avendano, I.A.C.; Javan, F.D.; Najafi, B.; Moazami, A.; Rinaldi, F. Assessing the impact of employing machine learning-based baseline load prediction pipelines with sliding-window training scheme on offered flexibility estimation for different building categories. Energy Build. 2023, 294, 113217. [Google Scholar] [CrossRef]
- Zhang, D.; Wang, S.; Liang, Y.; Du, Z. A novel combined model for probabilistic load forecasting based on deep learning and improved optimizer. Energy 2023, 264, 126172. [Google Scholar] [CrossRef]
- Aduama, P.; Zhang, Z.; Al-Sumaiti, A.S. Multi-feature data fusion-based load forecasting of electric vehicle charging stations using a deep learning model. Energies 2023, 16, 1309. [Google Scholar] [CrossRef]
- Zhang, L.; Wen, J.; Li, Y.; Chen, J.; Ye, Y.; Fu, Y.; Livingood, W. A Review of Machine Learning in Building Load Prediction. Appl. Energy 2021, 285, 116452. [Google Scholar] [CrossRef]
- Saqib, N.; Haque, K.F.; Yanambaka, V.P.; Abdelgawad, A. Convolutional-neural-network-based handwritten character recognition: An approach with massive multisource data. Algorithms 2022, 15, 129. [Google Scholar] [CrossRef]
- Haque, K.F.; Zabin, R.; Yelamarthi, K.; Yanambaka, P.; Abdelgawad, A. An IoT based efficient waste collection system with smart bins. In Proceedings of the 2020 IEEE 6th World Forum on Internet of Things (WF-IoT), New Orleans, LA, USA, 2–16 June 2020; pp. 1–5. [Google Scholar]
- Chen, K.; Chen, K.; Wang, Q.; He, Z.; Hu, J.; He, J. Short-Term Load Forecasting with Deep Residual Networks. IEEE Trans. Smart Grid 2018, 10, 3943–3952. [Google Scholar] [CrossRef]
- Hammad, M.A.; Jereb, B.; Rosi, B.; Dragan, D. Methods and Mdels for Electric Load Forecasting: A Comprehensive Review. Logist. Sustain. Transp. 2020, 11, 51–76. [Google Scholar] [CrossRef]
- Aly, H.H. A Proposed Intelligent Short-Term Load Forecasting Hybrid Models of ANN, WNN and KF based on Clustering Techniques for Smart Grid. Electr. Power Syst. Res. 2020, 182, 106191. [Google Scholar] [CrossRef]
- Singh, S.; Hussain, S.; Bazaz, M.A. Short Term Load Forecasting using Artificial Neural Network. In Proceedings of the 2017 Fourth International Conference on Image Information Processing (ICIIP), Shimla, India, 21–23 December 2017; pp. 1–5. [Google Scholar]
- Khwaja, A.; Zhang, X.; Anpalagan, A.; Venkatesh, B. Boosted Neural Networks for Improved Short-Term Electric Load Forecasting. Electr. Power Syst. Res. 2017, 143, 431–437. [Google Scholar] [CrossRef]
- Amarasinghe, K.; Marino, D.L.; Manic, M. Deep Neural Networks for Energy Load Forecasting. In Proceedings of the 2017 IEEE 26th International Symposium on Industrial Electronics (ISIE), Edinburgh, UK, 19–21 June 2017; pp. 1483–1488. [Google Scholar]
- Ageng, D.; Huang, C.Y.; Cheng, R.G. A Short-Term Household Load Forecasting Framework using LSTM and Data Preparation. IEEE Access 2021, 9, 167911–167919. [Google Scholar] [CrossRef]
- Ogunjuyigbe, A.S.; Ayodele, T.R.; Lasarus, C.P.; Yusuff, A.A.; Mosetlhe, T.C. Comparative Analysis of Short-Term Load Forecasting Methods. In Proceedings of the 2021 IEEE AFRICON, Arusha, Tanzania, 13–15 September 2021; pp. 1–6. [Google Scholar]
- Mubashar, R.; Awan, M.J.; Ahsan, M.; Yasin, A.; Singh, V.P. Efficient residential load forecasting using deep learning approach. Int. J. Comput. Appl. Technol. 2022, 68, 205–214. [Google Scholar] [CrossRef]
- Bashir, T.; Haoyong, C.; Tahir, M.F.; Liqiang, Z. Short term electricity load forecasting using hybrid prophet-LSTM model optimized by BPNN. Energy Rep. 2022, 8, 1678–1686. [Google Scholar] [CrossRef]
- Neeraj, N.; Mathew, J.; Agarwal, M.; Behera, R.K. Long short-term memory-singular spectrum analysis-based model for electric load forecasting. Electr. Eng. 2021, 103, 1067–1082. [Google Scholar] [CrossRef]
- Yang, J.; Zhang, X.; Bao, Y. Short-term load forecasting of central China based on DPSO-LSTM. In Proceedings of the 2021 IEEE 4th International Electrical and Energy Conference (CIEEC), Wuhan, China, 28–30 May 2021; pp. 1–5. [Google Scholar]
- Nespoli, A.; Ogliari, E.; Pretto, S.; Gavazzeni, M.; Vigani, S.; Paccanelli, F. Electrical load forecast by means of lstm: The impact of data quality. Forecasting 2021, 3, 91–101. [Google Scholar] [CrossRef]
- Orr, M. Short-Term Electrical Load Forecasting for Irish Supermarkets with Weather Forecast Data. Ph.D. Thesis, National College of Ireland, Dublin, Ireland, 2021. [Google Scholar]
- Wang, Y.; Sun, S.; Chen, X.; Zeng, X.; Kong, Y.; Chen, J.; Guo, Y.; Wang, T. Short-term load forecasting of industrial customers based on SVMD and XGBoost. Int. J. Electr. Power Energy Syst. 2021, 129, 106830. [Google Scholar] [CrossRef]
- Ibrahim, B.; Rabelo, L. A deep learning approach for peak load forecasting: A case study on panama. Energies 2021, 14, 3039. [Google Scholar] [CrossRef]
- Zheng, H.; Yuan, J.; Chen, L. Short-term load forecasting using EMD-LSTM neural networks with a Xgboost algorithm for feature importance evaluation. Energies 2017, 10, 1168. [Google Scholar] [CrossRef]
- Imani, M. Electrical load-temperature CNN for residential load forecasting. Energy 2021, 227, 120480. [Google Scholar] [CrossRef]
- Alhussein, M.; Aurangzeb, K.; Haider, S.I. Hybrid CNN-LSTM model for short-term individual household load forecasting. IEEE Access 2020, 8, 180544–180557. [Google Scholar] [CrossRef]
- Barman, M.; Choudhury, N.D.; Sutradhar, S. A regional hybrid GOA-SVM model based on similar day approach for short-term load forecasting in Assam, India. Energy 2018, 145, 710–720. [Google Scholar] [CrossRef]
- Chodakowska, E.; Nazarko, J.; Nazarko, Ł. Arima models in electrical load forecasting and their robustness to noise. Energies 2021, 14, 7952. [Google Scholar] [CrossRef]
- Mohandes, M. Support vector machines for short-term electrical load forecasting. Int. J. Energy Res. 2002, 26, 335–345. [Google Scholar] [CrossRef]
- Kontopoulou, V.I.; Panagopoulos, A.D.; Kakkos, I.; Matsopoulos, G.K. A review of ARIMA vs. machine learning approaches for time series forecasting in data driven networks. Future Internet 2023, 15, 255. [Google Scholar] [CrossRef]
- Sharma, R.R.; Kumar, M.; Maheshwari, S.; Ray, K.P. EVDHM-ARIMA-based time series forecasting model and its application for COVID-19 cases. IEEE Trans. Instrum. Meas. 2020, 70, 6502210. [Google Scholar] [CrossRef]
- Sirisha, U.M.; Belavagi, M.C.; Attigeri, G. Profit prediction using ARIMA, SARIMA and LSTM models in time series forecasting: A comparison. IEEE Access 2022, 10, 124715–124727. [Google Scholar] [CrossRef]
- Atabay, F.V.; Pagkalinawan, R.M.; Pajarillo, S.D.; Villanueva, A.R.; Taylar, J.V. Multivariate time series forecasting using arimax, sarimax, and rnn-based deep learning models on electricity consumption. In Proceedings of the 2022 3rd International Informatics and Software Engineering Conference (IISEC), Ankara, Turkey, 15–16 December 2022; pp. 1–6. [Google Scholar]
- Ahn, E.; Hur, J. A short-term forecasting of wind power outputs using the enhanced wavelet transform and arimax techniques. Renew. Energy 2023, 212, 394–402. [Google Scholar] [CrossRef]
- He, K.; Yang, Q.; Ji, L.; Pan, J.; Zou, Y. Financial time series forecasting with the deep learning ensemble model. Mathematics 2023, 11, 1054. [Google Scholar] [CrossRef]
- Song, Z.; Yang, L. Statistical inference for ARMA time series with moving average trend. J. Nonparametr. Stat. 2022, 34, 357–376. [Google Scholar] [CrossRef]
- Ahmad, W.; Ayub, N.; Ali, T.; Irfan, M.; Awais, M.; Shiraz, M.; Glowacz, A. Towards short term electricity load forecasting using improved support vector machine and extreme learning machine. Energies 2020, 13, 2907. [Google Scholar] [CrossRef]
- Aisyah, S.; Simaremare, A.A.; Adytia, D.; Aditya, I.A.; Alamsyah, A. Exploratory weather data analysis for electricity load forecasting using SVM and GRNN, case study in Bali, Indonesia. Energies 2022, 15, 3566. [Google Scholar] [CrossRef]
- Emhamed, A.A.; Shrivastava, J. Electrical load distribution forecasting utilizing support vector model (SVM). Mater. Today Proc. 2021, 47, 41–46. [Google Scholar] [CrossRef]
- Pant, M.; Kumar, S. Fuzzy time series forecasting based on hesitant fuzzy sets, particle swarm optimization and support vector machine-based hybrid method. Granul. Comput. 2022, 7, 861–879. [Google Scholar] [CrossRef]
- Gao, W.; Li, Z.; Chen, Q.; Jiang, W.; Feng, Y. Modelling and prediction of GNSS time series using GBDT, LSTM and SVM machine learning approaches. J. Geod. 2022, 96, 71. [Google Scholar] [CrossRef]
- Ramadevi, B.; Bingi, K. Chaotic time series forecasting approaches using machine learning techniques: A review. Symmetry 2022, 14, 955. [Google Scholar] [CrossRef]
- Aseeri, A.O. Effective RNN-based forecasting methodology design for improving short-term power load forecasts: Application to large-scale power-grid time series. J. Comput. Sci. 2023, 68, 101984. [Google Scholar] [CrossRef]
- Zhao, C.; Ye, J.; Zhu, Z.; Huang, Y. FLRNN-FGA: Fractional-Order Lipschitz Recurrent Neural Network with Frequency-Domain Gated Attention Mechanism for Time Series Forecasting. Fractal Fract. 2024, 8, 433. [Google Scholar] [CrossRef]
- Zhang, X.; Zhong, C.; Zhang, J.; Wang, T.; Ng, W.W. Robust recurrent neural networks for time series forecasting. Neurocomputing 2023, 526, 143–157. [Google Scholar] [CrossRef]
- Misgar, M.M.; Mushtaq, F.; Khurana, S.S.; Kumar, M. Recognition of offline handwritten Urdu characters using RNN and LSTM models. Multimed. Tools Appl. 2023, 82, 2053–2076. [Google Scholar] [CrossRef]
- Masood, F.; Khan, W.U.; Ullah, K.; Khan, A.; Alghamedy, F.H.; Aljuaid, H. A hybrid CNN-LSTM random forest model for dysgraphia classification from hand-written characters with uniform/normal distribution. Appl. Sci. 2023, 13, 4275. [Google Scholar] [CrossRef]
- Abumohsen, M.; Owda, A.Y.; Owda, M. Electrical load forecasting using LSTM, GRU, and RNN algorithms. Energies 2023, 16, 2283. [Google Scholar] [CrossRef]
- Zaboli, A.; Tuyet-Doan, V.N.; Kim, Y.H.; Hong, J.; Su, W. An lstm-sae-based behind-the-meter load forecasting method. IEEE Access 2023, 11, 49378–49392. [Google Scholar] [CrossRef]
- Pavlatos, C.; Makris, E.; Fotis, G.; Vita, V.; Mladenov, V. Enhancing Electrical Load Prediction Using a Bidirectional LSTM Neural Network. Electronics 2023, 12, 4652. [Google Scholar] [CrossRef]
- Bai, S.; Kolter, J.Z.; Koltun, V. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv 2018, arXiv:1803.01271. [Google Scholar]
- Wu, Z.; Pan, S.; Long, G.; Jiang, J.; Zhang, C. Graph wavenet for deep spatial-temporal graph modeling. arXiv 2019, arXiv:1906.00121. [Google Scholar]
- Oreshkin, B.N.; Carpov, D.; Chapados, N.; Bengio, Y. N-BEATS: Neural basis expansion analysis for interpretable time series forecasting. arXiv 2019, arXiv:1905.10437. [Google Scholar]
- Vaswani, A. Attention Is All You Need. Advances in Neural Information Processing Systems. 2017. Available online: https://user.phil.hhu.de/~cwurm/wp-content/uploads/2020/01/7181-attention-is-all-you-need.pdf (accessed on 20 June 2022).
- Zhou, H.; Zhang, S.; Peng, J.; Zhang, S.; Li, J.; Xiong, H.; Zhang, W. Informer: Beyond efficient transformer for long sequence time-series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtually, 2–9 February 2021; Volume 35, pp. 11106–11115. [Google Scholar]
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Oukhouya, H.; El Himdi, K. Comparing machine learning methods—svr, xgboost, lstm, and mlp—for forecasting the moroccan stock market. In Proceedings of the Computer Sciences & Mathematics Forum, Online, 1–5 May 2023; MDPI: Basel, Switzerland, 2023; Volume 7, p. 39. [Google Scholar]
- Zhang, T.; Zhang, X.; Liu, Y.; Chow, Y.H.; Iu, H.H.; Fernando, T. Long-term energy and peak power demand forecasting based on sequential-XGBoost. IEEE Trans. Power Syst. 2023, 39, 3088–3104. [Google Scholar] [CrossRef]
- Zhang, L.; Jánošík, D. Enhanced short-term load forecasting with hybrid machine learning models: CatBoost and XGBoost approaches. Expert Syst. Appl. 2024, 241, 122686. [Google Scholar] [CrossRef]
- Kumar, B.; Yadav, N. A novel hybrid model combining βSARMA and LSTM for time series forecasting. Appl. Soft Comput. 2023, 134, 110019. [Google Scholar] [CrossRef]
- Kumar, I.; Tripathi, B.K.; Singh, A. Attention-based LSTM network-assisted time series forecasting models for petroleum production. Eng. Appl. Artif. Intell. 2023, 123, 106440. [Google Scholar] [CrossRef]
- Mounir, N.; Ouadi, H.; Jrhilifa, I. Short-term electric load forecasting using an EMD-BI-LSTM approach for smart grid energy management system. Energy Build. 2023, 288, 113022. [Google Scholar] [CrossRef]
- Mishra, P.; Al Khatib, A.M.G.; Yadav, S.; Ray, S.; Lama, A.; Kumari, B.; Sharma, D.; Yadav, R. Modeling and forecasting rainfall patterns in India: A time series analysis with XGBoost algorithm. Environ. Earth Sci. 2024, 83, 163. [Google Scholar] [CrossRef]
- PJM Interconnection LLC. Regional Transmission Organization (RTO) in the USA, Serving Delaware, Illinois, Indiana, Kentucky, Maryland, Michigan, New Jersey, North Carolina, Ohio, etc. (1998–2002); PJM Interconnection LLC: Norristown, PA, USA, 2024. [Google Scholar]
- PJM Interconnection. PJM East Hourly Data from the PJM East Region (2001–2018); PJM Interconnection: Norristown, PA, USA, 2018. [Google Scholar]
- PJM Interconnection. PJM West Hourly Data from the PJM West Region (2001–2018); PJM Interconnection: Norristown, PA, USA, 2018. [Google Scholar]
- American Electric Power (AEP). A Major Investor-Owned Electric Utility in the USA, Delivering Electricity Across 11 States (2004–2018); American Electric Power (AEP): Columbus, OH, USA, 2018. [Google Scholar]
- Dayton Power and Light Company. Serving Over 500,000 Customers Within a 6,000-Square-Mile Area in West Central Ohio, Around Dayton (2004–2018); Dayton Power and Light Company: Dayton, OH, USA, 2018. [Google Scholar]
Research | Model | Contribution | Limitation |
---|---|---|---|
Aly et al. [15] | ANN, WNN, KF | Six clustering hybrid models combining ANN, WNN, and KF for load forecasting. | Did not account for weekday/weekend patterns. |
Singh et al. [16] | Standard ANN | Regional load forecasting using historical temperature, humidity, and load data. | Excludes yearly holiday schedules; limited datasets. |
Khwaja et al. [17] | Boosted ANN | Iterative minimization of forecasting error using BooNN, improving prediction accuracy. | No specific limitations mentioned. |
Marino et al. [18] | LSTM, S2S | Comparison of LSTM and S2S architectures for building-level forecasting. | Only focused on a single building-level dataset. |
Ageng et al. [19] | LSTM, Data Preparation | Addressed data interpolation and de-noising for household load forecasting. | Weather and atmospheric conditions not considered. |
Ogunjuyigbe et al. [20] | LSTM, ARIMA | Comparative analysis of LSTM with ARIMA for improved accuracy. | Limited consideration of holidays and weather data. |
Mubashar et al. [21] | MLR, ANN, SVR | Use of Gaussian filtering and validation across academic, commercial, and residential datasets. | Did not consider long holidays or special events. |
Bashir & Haoyong [22] | Prophet, LSTM | Hybrid Prophet-LSTM model with residual nonlinear data trained by LSTM. | Excluded weekend/weekday patterns; limited dataset validation. |
Neeraj & Mathew [23] | SSA-LSTM | Proposed SSA-LSTM model with noise reduction via signal processing. | No weather- or climate-related data; holidays not considered. |
Yang et al. [24] | DPSO-LSTM | Combined DPSO algorithm with LSTM for weekly load forecasting. | Did not distinguish weekday patterns or consider weather conditions. |
Kong & Dong [25] | RNN, LSTM | Demonstrated improvement in forecast accuracy when using weather data with RNN-based LSTM. | Limited historical data used in evaluation. |
Amarasinghe et al. [18] | CNN | Benchmarking classical CNN against LSTM for peak load demand forecasting. | Model was not validated with diverse datasets. |
Imani et al. [30] | CNN, SVR | Proposed Nonlinear Relationship Extraction (NRE) using CNN and SVR for load–temperature correlation. | Socio-demographic data and household occupancy not considered. |
Alhussein et al. [31] | CNN-LSTM | Proposed hybrid CNN-LSTM model for feature extraction and sequence learning. | Did not consider socio-demographic data or household occupancy. |
Wang et al. [27] | XGBoost, VMD, SVMD | Hybrid XGBoost model combined with trend analysis using VMD for industrial load forecasting. | Model not tested on multiple datasets. |
Zheng et al. [29] | LSTM, XGBoost, EMD | Developed a hybrid model combining EMD, LSTM, and XGBoost for similarity-based forecasting. | No major limitations reported. |
Barman et al. [32] | GOA-SVM, GA-SVM, PSO-SVM | Proposed Grasshopper Optimization Algorithm-based SVM for minimizing forecasting errors. | Did not incorporate comprehensive regional climate factors. |
Dataset | Description | Time Span |
---|---|---|
PJM—PJM Interconnection LLC [69] | Regional transmission organization (RTO) in the USA, serving Delaware, Illinois, Indiana, Kentucky, Maryland, Michigan, New Jersey, North Carolina, Ohio, etc. | 1998–2002 |
PJME—PJM East [70] | Hourly data from the PJM East region. | 2001–2018 |
PJMW—PJM West [71] | Hourly data from the PJM West region. | 2001–2018 |
AEP—American Electric Power (AEP) [72] | A major investor-owned electric utility in the USA, delivering electricity across 11 states. | 2004–2018 |
Dayton—Dayton Power and Light Company [73] | Serving over 500,000 customers within a 6000-square-mile (16,000 km2) area in West Central Ohio, around Dayton. | 2004–2018 |
Model | Dataset | Model1 MAPE | Model2 MAPE | Model1 | Model2 |
---|---|---|---|---|---|
SVM | PJM | 5.13 | 6.87 | 0.96 | 0.71 |
PJME | 5.80 | 8.59 | 0.96 | 0.63 | |
PJMW | 2.80 | 8.42 | 0.96 | 0.63 | |
AEP | 6.23 | 8.08 | 0.94 | 0.57 | |
Dayton | 7.36 | 8.49 | 0.93 | 0.62 | |
RNN | PJM | 19.46 | 19.44 | 0.92 | 0.93 |
PJME | 9.49 | 9.58 | 0.93 | 0.93 | |
PJMW | 4.28 | 4.87 | 0.59 | 0.90 | |
AEP | 7.86 | 7.49 | 0.57 | 0.89 | |
Dayton | 12.74 | 15.54 | 0.62 | 0.87 | |
LSTM | PJM | 19.96 | 21.12 | 0.92 | 0.89 |
PJME | 9.21 | 9.57 | 0.93 | 0.92 | |
PJMW | 4.70 | 4.71 | 0.91 | 0.92 | |
AEP | 7.00 | 7.46 | 0.93 | 0.91 | |
Dayton | 10.80 | 15.46 | 0.92 | 0.89 | |
TCN | PJM | 19.46 | 19.44 | 0.92 | 0.93 |
PJME | 7.85 | 9.20 | 0.95 | 0.90 | |
PJMW | 3.90 | 4.55 | 0.88 | 0.91 | |
AEP | 7.86 | 7.49 | 0.57 | 0.89 | |
Dayton | 12.74 | 15.54 | 0.62 | 0.87 | |
Transformer | PJM | 19.96 | 21.12 | 0.92 | 0.89 |
PJME | 8.10 | 9.45 | 0.94 | 0.89 | |
PJMW | 4.05 | 4.60 | 0.89 | 0.90 | |
AEP | 7.00 | 7.46 | 0.93 | 0.91 | |
Dayton | 10.80 | 15.46 | 0.92 | 0.89 | |
PredXGBR | PJM | 1.07 | 6.87 | 0.99 | 0.71 |
PJME | 1.28 | 8.59 | 0.99 | 0.58 | |
PJMW | 1.07 | 8.42 | 0.98 | 0.59 | |
AEP | 0.98 | 8.08 | 0.99 | 0.57 | |
Dayton | 1.12 | 8.49 | 0.99 | 0.62 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zabin, R.; Haque, K.F.; Abdelgawad, A. PredXGBR: A Machine Learning Framework for Short-Term Electrical Load Prediction. Electronics 2024, 13, 4521. https://doi.org/10.3390/electronics13224521
Zabin R, Haque KF, Abdelgawad A. PredXGBR: A Machine Learning Framework for Short-Term Electrical Load Prediction. Electronics. 2024; 13(22):4521. https://doi.org/10.3390/electronics13224521
Chicago/Turabian StyleZabin, Rifat, Khandaker Foysal Haque, and Ahmed Abdelgawad. 2024. "PredXGBR: A Machine Learning Framework for Short-Term Electrical Load Prediction" Electronics 13, no. 22: 4521. https://doi.org/10.3390/electronics13224521
APA StyleZabin, R., Haque, K. F., & Abdelgawad, A. (2024). PredXGBR: A Machine Learning Framework for Short-Term Electrical Load Prediction. Electronics, 13(22), 4521. https://doi.org/10.3390/electronics13224521