Enhanced Machine-Learning Techniques for Medium-Term and Short-Term Electric-Load Forecasting in Smart Grids

Khan, Sajawal ur Rehman; Hayder, Israa Adil; Habib, Muhammad Asif; Ahmad, Mudassar; Mohsin, Syed Muhammad; Khan, Farrukh Aslam; Mustafa, Kainat

doi:10.3390/en16010276


by Sajawal ur Rehman Khan ¹,², Israa Adil Hayder ³, Muhammad Asif Habib ¹, Mudassar Ahmad ¹, Syed Muhammad Mohsin ⁴,⁵,*, Farrukh Aslam Khan ⁶,* and Kainat Mustafa ⁷

¹ Department of Computer Science, National Textile University, Faisalabad 37610, Pakistan
² Department of Textile and Clothing, National Textile University, Karachi Campus, Karachi 74900, Pakistan
³ Ministry of Education, General Directorate of Vocational Education, Department of Scientific Affairs, Baghdad 10053, Iraq
⁴ Department of Computer Science, COMSATS University Islamabad, Islamabad 45550, Pakistan
⁵ College of Intellectual Novitiates (COIN), Virtual University of Pakistan, Lahore 55150, Pakistan
⁶ Center of Excellence in Information Assurance (CoEIA), King Saud University, Riyadh 11653, Saudi Arabia
⁷ Department of Computer Science, Virtual University of Pakistan, Lahore 55150, Pakistan
* Authors to whom correspondence should be addressed.

Energies 2023, 16(1), 276; https://doi.org/10.3390/en16010276

Submission received: 1 November 2022 / Revised: 11 December 2022 / Accepted: 16 December 2022 / Published: 27 December 2022

(This article belongs to the Section F: Electrical Engineering)


Abstract

Electric load forecasting through data analytics has become one of the most active emerging research areas. It estimates the future consumption patterns of electric load. Because both electricity production and use fluctuate widely, balancing electricity supply and demand is a difficult task. Analyzing past consumption records to estimate the upcoming electricity load helps to address this fluctuating behavior. In this study, a framework for feature selection, extraction, and regression is put forward to carry out electric load prediction. The feature selection phase uses a combination of extreme gradient boosting (XGB) and random forest (RF) to determine the significance of each feature. In the feature extraction phase, redundant features are removed by applying recursive feature elimination (RFE). We propose an enhanced support vector machine (ESVM) and an enhanced convolutional neural network (ECNN) for the regression component. The hyperparameters of both proposed approaches are set using the random search (RS) technique. To illustrate the effectiveness of our proposed strategies, we compare them with state-of-the-art approaches. In addition, we perform statistical analyses to assess the significance of our proposed approaches. Simulation findings illustrate that our proposed approaches ECNN and ESVM achieve higher accuracies of 98.83% and 98.7%, respectively.

Keywords: smart grid; feature extraction; feature selection; load forecasting; random forest; recursive feature eliminator; support vector machine; convolutional neural network

1. Introduction

Electricity is an essential part of our daily lives and has a significant impact on the activities performed by individuals working in different fields. Due to the rapid population growth, there is a high demand for electricity in all parts of the globe [1]. Because of the limited capabilities of traditional electric grid stations, they are being replaced with the latest digital power grid system called smart grid (SG). Through SG, the management of electric-load distribution has become easier for utilities. It is also beneficial to minimize the discrepancy between electricity supply and demand. Controlling the generation, supply, and consumption of electricity is a considerably significant task for SG. One pathway of interaction between utilities and customers is smart meters (SM), which are part of the SG. Another important element of SG is demand side management (DSM), which is used to shift high-voltage equipment from hours of high consumption to hours of low consumption [2]. DSM is beneficial to control energy consumption according to power generation [3,4]. Figure 1 shows that load control, energy efficiency, renewable energy integration, better power quality, and plug-in hybrid electric vehicle functions are also possible using SG.

SG is also very useful in the planning of electric-load transmission and distribution. Transmission planning is necessary for utilities. It establishes the regions where electric-load expansion is needed to maintain the stability and pace of development. It also guarantees the efficient generation and distribution of electricity by the generators [5,6,7]. Similarly, distribution management is a robust method that helps meet specifications, such as determining the location, size, and installation of distribution facilities. By forecasting future load, electric-load generation and consumption can be better planned.

The forecasting of electric load helps utilities understand future demand patterns and plan according to the demand [8,9]. The utility risk is minimized by forecasting and understanding the energy patterns, which helps meet the energy demand. It helps the utilities determine the demand of the consumers and set an appropriate time-frame for maintaining the power supply in residential areas [10,11,12,13,14,15,16]. Predicting the amount of electricity needed to operate and manage the supply chain also helps the utility in avoiding excess power. In the event of increased load demand, the utility looks for a cost-effective generation while maintaining stability.

The authors in [17] state that short-term load forecasting (STLF) can forecast the electric load from one day to a week, and medium-term load forecasting (MTLF) technique is employed to anticipate electric load from one week to one year in the future. Long-term load forecasting (LTLF) is utilized if the future load forecast is required from one year to several years ahead. In this research, STLF and MTLF are performed using a large dataset. Data analysis is used to perform computational analysis of datasets.
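The horizon definitions above can be written as a small lookup. This is only an illustrative sketch: the function name `forecast_horizon` is hypothetical, and the handling of the exact boundaries (7 days and 365 days) is an assumption, since the text does not say which class a boundary value belongs to.

```python
def forecast_horizon(days_ahead: float) -> str:
    """Map a forecast lead time in days to the horizon label used in the text."""
    if days_ahead <= 0:
        raise ValueError("days_ahead must be positive")
    if days_ahead <= 7:
        return "STLF"  # one day up to one week (boundary choice is an assumption)
    if days_ahead <= 365:
        return "MTLF"  # one week up to one year (boundary choice is an assumption)
    return "LTLF"      # one year to several years ahead


# The one-week, one-month, and four-month horizons studied in this work.
for days in (7, 30, 120):
    print(days, forecast_horizon(days))
```

Under these assumed boundaries, the one-week forecast falls under STLF, while the one-month and four-month forecasts fall under MTLF.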

Data analysis is a technique to extract important information from hidden data patterns. In the real world, data have very complex forms [18,19]. In recent years, the scale of real-world data has become so large that we refer to it as big data. The information about electricity obtained from big data is used by utilities to perform load analysis, which can be used to create efficient and reliable electric-load management and distribution plans. In recent decades, power forecasting has been carried out using various models. However, industrialization and urbanization have increased energy consumption, which contributes to the depletion of natural resources. Therefore, there is an urgent need to improve the current power-forecasting frameworks and develop new ones to address these concerns and to increase the awareness of customers so as to make them active participants of the SG community [20,21].

The research community has presented several machine-learning (ML) and deep learning (DL) forecasting techniques to deal with the issue of electric-load forecasting [22]. However, each technique has its advantages and limitations. The primary purpose of load forecasting is to achieve reasonable accuracy rates. For large datasets, redundant features are an obstacle to more accurate results [23,24,25]. Furthermore, conventional methods are not suitable to deal with these large datasets [26] and the manual setting of hyperparameters is error-prone and increases computation time [15,27,28].

In this paper, we present a feature selection and extraction approach that excludes less important variables to solve the above problems in accurately predicting electrical load. In the regression section, an enhanced convolutional neural network (ECNN) with more layers and dynamic adjustment of hyperparameters using random search (RS) algorithm is proposed. An enhanced support vector machine (ESVM) is also proposed and RS is used to adjust its hyperparameters. Following are some of the key contributions of this study:

Focusing on big data generated by the SG community, an effective and efficient data-analytic approach is presented for electric-load prediction to ensure grid stability.
The parameters of the support vector machine (SVM) and convolutional neural network (CNN) are dynamically set using the RS method in our proposed ESVM and ECNN techniques for electric-load prediction.
A feature selection and extraction model using a mix of XGB and RF approaches is proposed to address the challenges of large datasets’ computational complexity, feature selection, and feature extraction.

The rest of the manuscript is organized as follows: State-of-the-art literature review, focused research areas, and motivation of this study are discussed in Section 2. Proposed system model is described in Section 3 and results of our study are presented in Section 4 along with relevant discussion. Finally, Section 5 concludes this study and presents potential future research directions.

2. Literature Review

To ensure grid stability and to effectively manage electricity user demand, SG continuously monitors and manages the electric load with the help of efficient and effective load-forecasting algorithms. The research community has proposed different electric-load forecasting techniques to support grid stability and efficient grid management. A brief description of the current literature on electric-load forecasting is given below.

Linear regression (LR) was used by the authors of [29,30] to analyze the dependent variables and to identify the independent variables in a regression-based approach. The independent variable was considered first because it varied the most. The dependent variable in load forecasting is generally energy demand, which depends on the electricity supply. Contrarily, the independent variables are typically weather-related, such as wind, temperature, and humidity. Simulations proved the efficacy of the proposed approach.

Artificial neural networks (ANN) may be used to carry out nonlinear simulations and adjustments as we do not need to know the load and weather elements in advance. The ANN reacts as new data come into view. Currently, it is employed to address issues in power systems such as topological immobility, alarm production, fault detection, and security assessment [27,31,32].

In [33], time-series analysis (TSA) was performed on consecutive data recorded at constant intervals to obtain optimal results. This technique is used to understand the sequence of data and predict the possible outcomes depending on previous values. The proposed approach was used to predict electric load over a limited time period. Expert systems have a high degree of intelligence. As new data are fed into the expert system, it can learn more and expand its knowledge. Knowledge engineers are used in expert systems to gain knowledge and develop new prediction models for load forecasting. The authors of [34] used expert systems for electric-load forecasting.

Fuzzy logic (FL) is a logical system similar to Boolean logic [35]. In Boolean logic, 1 and 0 are used as input values. In FL, comparisons are used to generate inputs. Mathematical formulas are not used in this method to convert input values into output values. Noise does not affect the fuzzy logic. Defuzzification is proposed in [35] to obtain an accurate electric-load forecast. In [36], the authors performed predictions using different datasets of smart homes with a data-analytic approach, but could not properly manage the big data. In [37], predictions were performed using the long short-term memory (LSTM) and other ML techniques. Only LSTM performed better in their work.

The authors in [38] presented a model to forecast the future electricity consumption record using feature-selection and regression models. In [15], the prediction was performed using two datasets. However, the values of the hyper-parameters of the ML techniques were set manually in their research work. In [39], the authors used a three-step model to perform load and price forecasting. In the first step, conditional mutual information and flexible wavelet packet transform methods were used to separate the signals into various frequencies. The second phase involved implementing a nonlinear least square SVM (NLSSVM) and a multi-input multi-output model to demonstrate the association between price and load. In the last section, the method TV-SABC, which is a modified version of the artificial bee colony (ABC) optimization algorithm based on time-varying coefficients, was used to improve the NLSSVM parameters through a learning process.

A new feature-selection method was presented in [40]. The main contribution of their proposed model was to develop an interaction between the relevance and redundancy of features for the best feature selection. However, the computational time of their research was increased due to the manual tuning of the hyperparameters of ML techniques. In [23], a load-forecasting model was implemented using CNN, EPNET and LSTM. The effectiveness of the suggested strategies was assessed using the mean square error. CNN surpassed the latest available methods; however, the presence of redundant features introduced redundancy to the data and increased the time complexity. In [24], the authors used SVM as a regression technique to perform electric-load forecasting. However, SVM was not able to handle a large dataset.

The authors of [41] employed grey correlation analysis to choose the features, and used kernel function and principal component analysis to extract the features. They eliminated the redundant features of the dataset but used the conventional SVM technique to predict the load. Deep LSTM with DNN was used to predict price and electric load. These techniques improved accuracy and provided good results on large datasets. Sophisticated results were obtained by implementing a deep auto-encoder technique in [25]. However, the authors did not take into account the need to remove unnecessary features. A gated recurrent unit was presented in [42] to forecast the record of electricity consumption and price. However, the authors failed to achieve high accuracy rates using this technique. In [43], the authors proposed ESDM and DCNN to predict the electric load. The parameters of SVM were dynamically adjusted and the number of layers of CNN was increased to obtain better results. The DCNN algorithm provided better results in their proposed work.

In [44], next-day prediction was performed by increasing the number of ANN layers and applying an optimization algorithm. The proposed algorithm was also compared with some traditional ML techniques and showed higher accuracy rates. In [45], the authors proposed a deep CNN for predicting the weekly load for the upcoming days. Nevertheless, only a small dataset was used. A hybrid of CNN and LSTM was utilized for electric consumption record prediction in [28]. Redundancy was not taken into account, and the authors used data from three years. The authors of [46] used SVM and extreme learning machine (ELM). However, they manually adjusted the parameters and worked with limited datasets. For electric-load prediction, the authors of [47] combined CNN and gated recurrent unit (GRU) methods. Additionally, earth-worm optimization (EWO) was used to dynamically modify the CNN–GRU hyperparameters. The suggested approach worked well; however, the authors examined electric-load data from three years only.

To improve the accuracy of the 168-hour forecasts, authors of [48] proposed a collection of ML models using historical load, weather, and holidays data. The authors did not consider removing less-important features and used conventional methods for forecasting. The primary goal of the analysis in [49] was to use a CNN-based model to incorporate traditional elements (weather, holidays, etc.) as well as current COVID-19 pandemic trends and how they relate to the STLF issue. Although the research is useful for future pandemics, it uses conventional techniques for prediction. The outcomes can be further enhanced by adjusting CNN’s hyperparameters.

The authors of [50] analyzed the ISO-NE dataset using a novel machine-learning technique, which contains daily electricity consumption data for eight years. The best features were extracted using DT and RF classifiers. The authors used SVM and CNN for the prediction of electric load and cost. Coronavirus herd immunity optimization technique was used to enhance efficiency by modifying the hyperparameters and the proposed technique was used as a classifier to enhance the performance. By adding an extra layer to the CNN and adjusting its parameters, the likelihood of an overfitting classifier is decreased. Statistical results inferred that the presented approach performed well.

To eliminate redundancy, the authors of [51] employed a three-step process comprising feature selection, feature extraction, and prediction. A hybrid of XGB and DT was used for feature selection. Redundant features were removed using RFE as the feature extraction approach. The classification and prediction capabilities of SVM and ELM were improved by tuning their hyperparameters: GA was used to modify the hyperparameters of ELM, and the grid search algorithm was used for SVM. The results show the superiority of the proposed strategy over its counterparts.

For feature importance, an XGBoost and a decision tree hybrid feature selector were proposed by the authors in [52]. The recommended structure discovered and produced a small set of fuzzy rules that focused on building electricity usage behavior using time-series data from prior operations. The responsiveness of the fuzzy system is shown and an assessment of its performance is presented, which reveals that the generated rule base has higher accuracy. Furthermore, a smaller overall set of rules is created and compared to the default decision tree configuration for assessment.

To enhance the performance of STLF, the authors in [53] presented a two-step PSO technique. Through the initial step, PSO was used to categorize the ideal input shapes for the neural network. Subsequently, the available training data was again divided into homogeneous clusters using PSO. A distinct neural network was used for each cluster. In a bus electricity consumption forecasting issue, experimental findings validated the resilience of the presented methodology, and the proposed approach was tested on a load-profiling issue, which performed better than the most popular techniques in the literature on load profiling.

The authors of [54] developed an inverse and discrete PSO method to improve the variable mode decomposition method for forecasting of next day electricity price on the basis of previously recorded weather data and utility bill information from the Greek electricity market. The forecast results were reviewed to determine if either of the two presented divide-and-conquer preprocessing approaches provided a better estimate of the short-term electric utility cost. The variational mode decomposition-based method that results in fewer mistakes in power price prediction was enhanced by the proposed variation of PSO, which had an average absolute percentage error value of 6.15%.

In [55], the authors proposed a framework for cluster-based ensemble prediction that uses an adaptive selection process to choose ensemble members for stacking and tweaking regressors. The prediction accuracy on peak and off-peak data were tested with the presented method for developing structurally adaptable estimators for each cluster. Results of the studies showed that, in this context, more reliable ensemble models were generated by associate selection techniques that focused on the effects of off-peak performance. Compared to the standalone estimators, the ensemble models performed better overall.

According to the literature review, most authors performed their predictions using conventional techniques, which have certain limitations. By improving these conventional techniques, the electric-load prediction accuracy rate can be increased. Most of the authors used small datasets, which are not very useful for predicting future loads. In this work, our main goal is to improve the accuracy rate using big data. Redundancy was also not considered in many articles. To eliminate redundancy, we used RF, XGBoost, and RFE techniques in our proposed ESVM and ECNN methods.

3. Proposed System Model

In this study, a three-stage model is proposed to accomplish STLF and MTLF. Two techniques, ECNN and ESVM, are proposed to perform accurate electric-load forecasting. The system model is shown in Figure 2, and its steps are described below:

3.1. Input Data

In the proposed work, eight years of electricity records, from January 2011 to December 2018, were used for electric-load forecasting. This large dataset was downloaded from the ISO-NE website [56]. The dataset comprises both dependent and independent variables, such as temperature, humidity, weather, and congestion. Our target is the column named “SYSLoad”, which represents the system load. The other features related to the target are the day-ahead cleared demand, clearing price for the regulation market, real-time demand, dew point temperature, local day-ahead marginal price, dry bulb temperature, day-ahead energy component, real-time marginal loss component, day-ahead congestion component, and real-time congestion component. We used eight years (96 months) of data because the consumption patterns of similar months are roughly the same. Finally, 80% of the data are selected for training, 10% for testing, and 10% for validation.
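The 80/10/10 partition can be sketched as follows. The text does not state whether the records are shuffled before splitting, so this sketch assumes a chronological split without shuffling, which is a common choice for time-series data; the function name `split_80_10_10` is hypothetical.

```python
def split_80_10_10(records):
    """Split a time-ordered sequence into 80% train, 10% test, 10% validation.

    Assumption: records are kept in chronological order (no shuffling).
    Integer arithmetic avoids floating-point rounding of the split sizes;
    any remainder goes to the validation part.
    """
    n = len(records)
    n_train = n * 80 // 100
    n_test = n * 10 // 100
    train = records[:n_train]
    test = records[n_train:n_train + n_test]
    validation = records[n_train + n_test:]
    return train, test, validation


# Placeholder data: 96 monthly indices standing in for the eight years of records.
months = list(range(96))
train, test, validation = split_80_10_10(months)
print(len(train), len(test), len(validation))
```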

3.2. Feature Selection and Extraction

The computational complexity of the model might be increased by several less significant features that are typically present in big datasets. If these less-important features are efficiently eliminated, then the complexity of the model can be minimized. Our proposed feature selection and extraction methods effectively eliminate the less-important features.

Statistical analysis is applied to the dataset in the feature-selection procedure of the presented model. The importance of each feature is calculated to select the most relevant features. Accurate results are obtained by combining the XGB and RF techniques, as shown in Figure 3. A threshold is also set to exclude less-important features. The feature selection is conducted according to Equation (1).

f(s) = \begin{cases} \text{Keep } f, & \text{if } XGB_i(f) + RF_i(f) \geq t, \\ \text{Drop } f, & \text{if } XGB_i(f) + RF_i(f) < t \end{cases} \quad (1)

XGB_i(f) denotes the importance of feature f calculated with XGB, and RF_i(f) denotes its importance calculated with RF. The symbol f represents a feature and t represents the threshold.
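The selection rule of Equation (1) can be illustrated with a short sketch. The function name `select_features`, the feature names, the importance values, and the threshold below are all hypothetical placeholders rather than values from this study; in practice, the two importance dictionaries would come from fitted XGB and RF models.

```python
def select_features(xgb_importance, rf_importance, threshold):
    """Apply Equation (1): keep a feature when the sum of its XGB and RF
    importances reaches the threshold t, otherwise drop it."""
    selected, dropped = [], []
    for name in xgb_importance:
        combined = xgb_importance[name] + rf_importance[name]
        if combined >= threshold:
            selected.append(name)
        else:
            dropped.append(name)
    return selected, dropped


# Hypothetical importances on a 0-to-1 scale (not results from the paper).
xgb_imp = {"feature_a": 0.90, "feature_b": 0.10, "feature_c": 0.55}
rf_imp = {"feature_a": 0.80, "feature_b": 0.20, "feature_c": 0.60}
kept, removed = select_features(xgb_imp, rf_imp, threshold=1.0)
print("kept:", kept)
print("removed:", removed)
```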

After feature selection, feature extraction is performed using RFE. Feature extraction retains only non-redundant features that have a significant influence on the target feature of the dataset. RFE recursively collects features, uses those features to build a model, and then evaluates the accuracy of that model. To forecast the target variable, RFE can combine many features. The following sub-sections describe the feature-selection and feature-extraction techniques in detail.

3.2.1. Extreme Gradient Boosting

This technique belongs to an open-source library and is the enhanced version of the decision tree (DT), which provides more reliable and accurate results than DT. It is based on the assumption that the overall prediction error is minimized when the best feasible future model is merged with earlier models. According to this work, the importance of each feature of XGB is calculated using a scale of 0 to 1, with the more significant feature containing a value close to 1 and the less-important feature having a value close to 0.

3.2.2. Random Forest

Numerous DT methods are combined to create the RF. By bagging or bootstrap aggregation, the “forest” that the RF algorithm creates is trained. By grouping machine-learning algorithms, the bagging meta-algorithm improves their accuracy. It predicts the mean or average value of the used DT techniques in the forest. Compared to DT, RF provides more accurate results by reducing the problem of overfitting the dataset. We calculate the importance of the features using RF.
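The bagging idea described above (train each learner on a bootstrap sample, then average the predictions) can be sketched in a few lines. This is a simplified illustration, not a random forest: the base learner is passed in as a hypothetical `fit_fn` callable, and the example uses a trivial mean predictor in place of a decision tree.

```python
import random


def bagged_predict(train, fit_fn, x, n_estimators=10, seed=0):
    """Average the predictions of learners fitted on bootstrap samples.

    train  : list of (input, target) pairs
    fit_fn : hypothetical callable that fits one base learner on a sample
             and returns a prediction function
    """
    rng = random.Random(seed)
    predictions = []
    for _ in range(n_estimators):
        # Bootstrap sample: draw len(train) items with replacement.
        sample = [rng.choice(train) for _ in train]
        model = fit_fn(sample)
        predictions.append(model(x))
    return sum(predictions) / len(predictions)


def fit_mean(sample):
    """Stand-in base learner: always predicts the mean target of its sample."""
    mean_target = sum(y for _, y in sample) / len(sample)
    return lambda x: mean_target


data = [(1, 10.0), (2, 12.0), (3, 11.0), (4, 13.0)]
print(bagged_predict(data, fit_mean, x=5))
```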

3.2.3. Recursive Feature Eliminator

This is a machine-learning technique employed for best feature selection and elimination of weak features. Through RFE, in this work, each feature is converted into a true or false dimension. Then, a threshold is set to eliminate less-important features and only the most important features are fed to the regression model.
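The elimination loop can be sketched as follows. This is a minimal illustration under stated assumptions: `score_fn` is a hypothetical callable standing in for fitting a model on the current feature subset and reading its feature importances, and the stopping rule here is a target number of features rather than the threshold described above. The returned true/false flags play the same role as the boolean `support_` mask exposed by scikit-learn's `RFE`.

```python
def recursive_feature_elimination(features, score_fn, n_keep):
    """Repeatedly drop the weakest feature until n_keep features remain.

    score_fn(remaining) is a hypothetical stand-in for refitting a model on
    the remaining features; it returns a dict of importance per feature.
    Returns a dict mapping every original feature to True (kept) or False.
    """
    remaining = list(features)
    while len(remaining) > n_keep:
        scores = score_fn(remaining)  # re-score on the current subset
        weakest = min(remaining, key=lambda f: scores[f])
        remaining.remove(weakest)
    return {f: (f in remaining) for f in features}


# Hypothetical fixed scores, used only to make the example runnable.
fixed_scores = {"feature_a": 0.5, "feature_b": 0.1, "feature_c": 0.3}
mask = recursive_feature_elimination(
    list(fixed_scores),
    lambda remaining: {f: fixed_scores[f] for f in remaining},
    n_keep=2,
)
print(mask)
```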

3.3. Regression of the Load

Many authors have applied various ML and DL techniques to perform regression. However, these techniques still suffer from overfitting and limited accuracy. In our work, we improve one ML method (SVM) and one DL method (CNN). We show that overfitting can be reduced and the accuracy of load forecasting can be enhanced if these methods are properly tuned based on the threshold value. RS is an optimization algorithm that relies on random sampling. We use RS to tune the hyperparameters of CNN and SVM, and we call the resulting techniques ECNN and ESVM, respectively.

3.3.1. Random Search Algorithm

The random search algorithm generates random inputs to the objective function and evaluates them. This is helpful because it does not assume anything about how the objective function is structured. It allows counter-intuitive solutions to be identified and can be helpful in problems where extensive prior expertise might otherwise influence or bias the optimization strategy.
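A minimal random-search loop might look like the sketch below. Several details are assumptions for illustration: the function name `random_search`, log-uniform sampling of each hyperparameter, the search bounds, and the toy objective, which stands in for a validation error. The default of 15 iterations mirrors the iteration count reported for ESVM in Section 3.3.3.

```python
import math
import random


def random_search(objective, bounds, n_iter=15, seed=0):
    """Sample hyperparameters at random and keep the best-scoring set.

    objective : callable mapping a params dict to an error (lower is better)
    bounds    : dict of name -> (low, high); sampled log-uniformly (assumption)
    """
    rng = random.Random(seed)
    best_params, best_score = None, math.inf
    for _ in range(n_iter):
        params = {
            name: 10 ** rng.uniform(math.log10(low), math.log10(high))
            for name, (low, high) in bounds.items()
        }
        score = objective(params)
        if score < best_score:
            best_params, best_score = params, score
    return best_params, best_score


# Toy objective (hypothetical): pretends the error is lowest near C=10, gamma=0.01.
def toy_error(p):
    return (math.log10(p["C"]) - 1) ** 2 + (math.log10(p["gamma"]) + 2) ** 2


best, err = random_search(toy_error, {"C": (0.1, 1000), "gamma": (1e-4, 1)})
print(best, err)
```

Because no structure of the objective is assumed, the same loop can tune either model; only the `objective` callable and the `bounds` change.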

3.3.2. Enhanced Convolutional Neural Network

The combination of RS and CNN is used to propose ECNN in this study. The CNN technique is a part of deep learning and can have several layers. The first layer of CNN is the convolutional layer, which serves as an input filter. When the same filter is applied multiple times, a feature map is created, which gives the positions and intensity of the detected features in the given big data. The dense layer is the second layer of our proposed CNN. This layer fully connects all of the neurons received from the preceding layers. After connecting all the features, overfitting may occur. To avoid this problem, the dropout layer can be used. The pooling layer is also a part of the CNN, and we have used the max-pooling layer in the proposed CNN. Max-pooling selects the most salient features from the feature map. To improve the efficiency of the CNN, we have added more layers. Moreover, the parameters of the CNN are dynamically set using RS. The individual layers and parameters are explained in Table 1, and the system model of the presented enhanced convolutional neural network (ECNN) is exhibited in Figure 4.
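To make the roles of the convolutional and max-pooling layers concrete, the sketch below applies one filter to a short sequence and then pools the resulting feature map. It assumes a one-dimensional input, which is natural for a load time series but is not stated in the text; the filter weights and input values are arbitrary illustrative numbers, and no training is involved.

```python
def conv1d(signal, kernel):
    """Slide one filter over the signal (valid positions only).

    As in most CNN libraries, this is cross-correlation: the kernel is not flipped.
    """
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]


def max_pool1d(feature_map, size):
    """Keep the largest value in each non-overlapping window of the given size."""
    return [
        max(feature_map[i:i + size])
        for i in range(0, len(feature_map) - size + 1, size)
    ]


load = [3, 5, 4, 8, 6, 7]              # arbitrary illustrative values
feature_map = conv1d(load, [-1, 1])    # difference filter: responds to increases
print(feature_map)                     # [2, -1, 4, -2, 1]
print(max_pool1d(feature_map, 2))      # [2, 4]
```

The feature map records where and how strongly the filter responds, and max-pooling keeps only the strongest response in each window.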

3.3.3. Enhanced Support Vector Machine

Support vector machine is a well-known supervised machine-learning algorithm, and jointly adjusting its parameters is a significant challenge. In our proposed ESVM, the parameters are adjusted using the RS algorithm. The radial basis function (RBF) is used as the kernel. The ESVM is run for 15 iterations, during which the C and gamma values of the SVM are tuned using RS. The proposed system model for the presented ESVM is shown in Figure 5.
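For reference, the RBF kernel has the standard form K(x, y) = exp(-gamma * ||x - y||^2), so gamma controls how quickly similarity decays with distance, while C controls how strongly training errors are penalized. The sketch below only evaluates the kernel; the function name `rbf_kernel` and the sample points are illustrative.

```python
import math


def rbf_kernel(x, y, gamma):
    """Radial basis function kernel: exp(-gamma * squared Euclidean distance)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


a, b = [1.0, 2.0], [2.0, 4.0]
for gamma in (0.01, 0.1, 1.0):
    # Larger gamma makes the same pair of points look less similar.
    print(gamma, rbf_kernel(a, b, gamma))
```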

4. Results and Discussions

The simulations are performed with Anaconda Spyder3 software on a system with 12 GB RAM and a sixth-generation Core i5 processor. The Python language is used for the implementation of this research.

4.1. Feature Selection and Extraction

The importance of features, calculated by the feature-selection method, is shown in Figure 6 and Figure 7. The importance of all features is represented on a scale from 0 to 1. Using RFE, the best features are chosen, while the worst features are removed. In this analysis, each feature is assigned a true or false value by RFE: features accepted by RFE are marked true, and rejected features are marked false. Features rejected by RFE are ‘DA_LMP’, ‘DA_EC’, ‘DryBulb’, and ‘DewPnt’. The threshold for XGB is 0.8 for feature importance calculation.

The RFE = True is set to eliminate irrelevant features. After the feature selection and extraction methods, three features, DA_LMP, DryBulb, and RT_CC, are removed from the total number of features by RFE. The threshold for RF is 0.7 for feature importance calculation. The remaining features are forwarded for regression.

4.2. Regression of Electric Load

After removing the less-important features, only the data from the most important features are passed to the regression model. Figure 8 shows the normal load.

In our work, STLF and MTLF are performed for 1 week, 1 month, and 4 months load forecasting, as shown in Figure 9, Figure 10, and Figure 11, respectively.

The different colored lines represent these predicted values. The red line represents the actual load. Lines most similar to the actual load represent high accuracy, whereas less-similar lines represent low accuracy. In the above figures, it can be seen that the prediction line of the proposed ECNN technique is more similar to the actual load line. The proposed ESVM has a lower similarity than ECNN. However, there is a big difference between the actual load line and the lines of the conventional techniques.

4.3. Performance Evaluation

Performance evaluation is very important to assess the efficiency and effectiveness of any proposed algorithm. In this study, we have used four evaluation metrics namely: mean absolute percentage error (MAPE), root mean square error (RMSE), mean absolute error (MAE), and mean squared error (MSE) to evaluate the performance of our proposed techniques. Figure 12 and Table 2 display the error rates of the proposed techniques and the most recent techniques.
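The four metrics follow their standard definitions, sketched below. The function names and sample values are illustrative; MAPE is expressed in percent and assumes that no actual value is zero.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mse(actual, predicted):
    """Mean squared error."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean square error: the square root of MSE."""
    return math.sqrt(mse(actual, predicted))


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be non-zero)."""
    return 100 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


actual = [100.0, 200.0]       # illustrative values, not data from the study
predicted = [110.0, 190.0]
print(mae(actual, predicted), mse(actual, predicted),
      rmse(actual, predicted), mape(actual, predicted))
```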

The results presented in Table 3 show a big difference in accuracy between the proposed techniques and the conventional techniques. The accuracy of load forecasting is increased by our proposed techniques. Table 4 lists numerous tests based on correlations, as well as parametric and nonparametric statistical analyses based on hypotheses for both the new methods and the traditional ones.

The results of the various statistical tests are covered in Table 4. A value of 0 indicates that the hypothesis is accepted, and a result greater than 0 indicates that it is rejected.

5. Conclusions

In this work, we studied the power-load prediction problem using an improved framework based on feature selection, extraction, and regression. The main objective of the efficient and effective electric-load prediction for big data is successfully achieved using our proposed ECNN and ESVM forecasting models. Furthermore, our proposed forecasting models helped decrease computational complexity of the forecasting model by eliminating less-important features using modern feature-selection and extraction methods. The numbers of layers of our proposed ECNN are increased and the hyperparameters of the proposed techniques ECNN and ESVM are dynamically adjusted. Simulation results of our proposed techniques are compared with conventional CNN and SVM techniques using four performance error estimators, i.e., MAE, RMSE, MAPE and MSE. The performance metrics proved that our proposed ECNN and ESVM electric-load forecasting models have the lowest error rates.

Given the growing worldwide interest in a reliable and sustainable energy supply, incorporating more renewable and alternative energy sources can reduce stress on existing electric transmission systems. The proposed schemes should help estimate power generation from distributed sources, as well as power consumption, accurately, which supports the smooth operation of the smart grid. The same framework can be applied to industrial power-management systems and should also be effective for smart agriculture systems.

A continuous network service is required for the smooth operation of the smart grid (SG). In a disaster situation, the smart grid faces significant performance or network-congestion issues. Mobile network operators cannot guarantee adequate service during severe weather events such as storms, torrential rain, or lightning strikes. In addition, because of vulnerabilities in the underlying infrastructure, smart meters could be hacked and exploited to alter recorded electricity consumption.

In future work, the proposed model will be further optimized using heuristic techniques, and further tests will be performed for long-term load forecasting (LTLF). Renewable energy sources will also be included to enhance the stability of the smart grid. We also intend to run experiments that pair our proposed model with another forecasting model to establish whether together they offer better accuracy and convergence time. The framework could be extended with a feedback module that controls the desired behavior of residential buildings according to thresholds set by the electricity provider. The proposed forecasting model could also help detect electricity theft using classifiers. Finally, authentication and access-control mechanisms should be implemented to reduce vulnerabilities in SG infrastructures.

Author Contributions

Conceptualization, S.u.R.K., M.A.H. and I.A.H.; methodology, S.u.R.K., M.A.H., I.A.H. and M.A.; software, S.u.R.K., M.A.H., I.A.H. and M.A.; validation, S.u.R.K., M.A.H., I.A.H., M.A., S.M.M. and F.A.K.; formal analysis, M.A., S.M.M. and F.A.K.; investigation, I.A.H., M.A., S.M.M., F.A.K. and K.M.; resources, M.A., S.M.M., F.A.K. and K.M.; data curation, S.u.R.K., M.A.H. and I.A.H.; writing—original draft preparation, S.u.R.K., M.A.H. and I.A.H.; writing—review and editing, S.M.M., F.A.K. and K.M.; visualization, M.A.H., I.A.H. and M.A.; supervision, M.A.H., I.A.H., M.A. and S.M.M.; project administration, M.A.H., I.A.H., M.A. and S.M.M.; funding acquisition, S.M.M., F.A.K. and K.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors agree to submit this version of the article to Energies (MDPI) and declare no known conflict of interest.


Figure 1. Smart grid.

Figure 2. Proposed system model for electric load forecasting.

Figure 3. Feature-selection and extraction model.

Figure 4. System model of proposed enhanced convolutional neural network (ECNN).

Figure 5. System model of proposed enhanced support vector machine (ESVM).

Figure 6. Feature importance calculated by XGB.

Figure 7. Feature importance calculated by RF.

Figure 8. Normal data for eight years.

Figure 9. One week load forecasting.

Figure 10. One month load forecasting.

Figure 11. Four months load forecasting.

Figure 12. Error values of different techniques for four months load forecasting.

Table 1. Layers and parameters used for proposed ECNN.

Parameters	Values
Sequential Model
Conv1D	No. of neurons = 64
	Kernel size = 2
	Activation function = ReLU
Dense	Parameters tuned with RS
	Units = 10
	Activation function = ReLU
Dropout	0.000001
MaxPooling1D	Pool size = 2
	Padding = same
Dense	Parameters tuned with RS
	Units = 50
	Activation function = ReLU
MaxPooling1D	Pool size = 2
	Padding = same
Dense	Parameters tuned with RS
Compiling Model
Loss Function	MSE
Metric	Accuracy
Optimizer	Adam
Training Model
Epochs	200
Verbose	0
Validation split	0.30
Batch size	10
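As a rough aid to reading Table 1, the sketch below traces how the sequence length changes through the listed layers. It rests on assumptions the table does not state: the convolution uses stride 1 with no padding, each pooling layer uses a stride equal to its pool size, and Dense and Dropout layers act per time step and so leave the length unchanged. The function names and the input length of 24 are hypothetical.

```python
import math


def conv1d_length(length, kernel_size, stride=1):
    """Output length of a 1-D convolution without padding (assumed)."""
    return (length - kernel_size) // stride + 1


def maxpool1d_same_length(length, pool_size, stride=None):
    """Output length of 1-D max pooling with 'same' padding."""
    stride = stride or pool_size  # assumed: stride defaults to the pool size
    return math.ceil(length / stride)


def trace_lengths(input_length):
    """List (layer, sequence length) pairs following the order in Table 1."""
    steps = []
    length = conv1d_length(input_length, kernel_size=2)
    steps.append(("Conv1D", length))
    steps.append(("Dense(10)", length))   # acts on the feature axis only
    steps.append(("Dropout", length))     # never changes the shape
    length = maxpool1d_same_length(length, pool_size=2)
    steps.append(("MaxPooling1D", length))
    steps.append(("Dense(50)", length))
    length = maxpool1d_same_length(length, pool_size=2)
    steps.append(("MaxPooling1D", length))
    steps.append(("Dense", length))
    return steps


# Hypothetical input of 24 time steps (the paper's input length is not given here).
result = trace_lengths(24)
for layer, length in result:
    print(f"{layer:>14}: {length}")
```

Under these assumptions, each pooling stage roughly halves the sequence length, so the final Dense layer operates on about a quarter of the original number of time steps.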

Table 2. Error values (%).

Techniques	MSE	MAE	MAPE	RMSE
SVM	12.4	10.5	1.7	12.3
Proposed ESVM	10	9.5	1.3	9.02
CNN	3.03	6.9	14.0	2.1
Proposed ECNN	1.17	1.2	11.9	1.4

Table 3. Accuracy rates (%).

Techniques	MSE	MAE	MAPE	RMSE
SVM	87.6	89.5	98.3	87.7
Proposed ESVM	90	91.5	98.7	90.2
CNN	96.97	93.1	86	97.9
Proposed ECNN	98.83	98.8	88.1	98.6

Table 4. Statistical analysis tests for proposed techniques and conventional techniques. Note: PSHT: Parametric statistical hypothesis tests; CT: Correlation test; NSHT: Nonparametric statistical hypothesis tests.

Techniques	Test	Wilcoxon Test (NSHT)	Kruskal Test (NSHT)	Pearson’s Test (CT)	Kendall’s Test (CT)	Chi-Squared Test (PSHT)	ANOVA Test (PSHT)
SVM	F-Statistics	104,549	26.0771	−0.0404	−0.036	158,449.28	30
SVM	p-value	0.000	0.000	0.257	0.143	0.000	0.000
Proposed ESVM	F-Statistics	132,003	0.2949	−0.037	−0.0362	164,404.40	0.064
Proposed ESVM	p-value	0.805	0.588	0.317	0.144	0.000	0.803
CNN	F-Statistics	37,953	1.4537	0.996	0.949	576.09	1.3971
CNN	p-value	0.000	0.228	0.000	0.000	1.000	0.238
Proposed ECNN	F-Statistics	131,225	0.0001	0.736	0.5321	37,915.93	0.6539
Proposed ECNN	p-value	0.655	0.990	0.000	0.000	0.000	0.418

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
