1. Introduction
In the present day electricity supply markets, the utility of load forecasting tool is high as it helps in managing the demand leading to a transparent price of electricity to the consumers. Furthermore, it helps in the security, decision making, reliability, and stability of the transmission system. After a deep extensive literature study, there are, namely, three prospectives of load forecasting on the behalf of time span: short-, mid-, and long-term perspective. Each approach has a different view as per data complexity and input data parameters utilized in coordination with seasonal, as well as environmental, factors. It has also been observed that the varying level of accuracy has been achieved depending on the foresting approach used [
1,
2].
In parallel, the pricing mechanism is also an important issue in electricity markets. Electricity has been traded through the bidding mechanism via the power exchange in which generating companies (GENCOs) can submit generation bids corresponding to their bidding prices, and consumers do the same with respect to their load demand. The market is cleared at an equilibrium point where both generation and demand bids meet. The quantity of electricity demand at this equilibrium point is called market clearing volume (MCV) and the lowest price at that point is called the market clearing price (MCP). At the MCP, generation companies must be satisfied to sell their generation, and consumers must be satisfied to purchase their electricity demand corresponding to their respective bids. Hence, the proper bidding strategy is a critical issue for market players in order to maximize their profit in electricity markets. Generally, limited information is available about the market, therefore; both generators and consumers rely on load demand forecast information available for preparing their strategies corresponding to their bids [
3,
4].
In the last 20 years, many efforts have been made to develop models for MCV forecasting such as statistical, artificial intelligence, signal processing, and data mining-based standalone and hybrid models. Among these, artificial intelligence (AI)-based models are promising because of their ability to find hidden relationships between inputs and outputs of the system. These AI-based models are also the most common, accurate, and efficient ones for load-profile estimation and have been utilized in three different ways: one is an individual neural network (NN) and the other two are known as hybrid models (evolutionary and pre-processing-based) [
1,
5,
6,
7,
8]. In forecasting, the main problem associated with the neural network (NN) is learning and data preprocessing. Therefore, most of the research available has been carried out by considering these factors. The parameters of NN are determined by gradient search algorithms associated with the problem of local minima and are also quite sensitive to the persistence of initial values that result in higher error rates (due to over and under training). Thus, for the initialization of parameters during the learning and training of NN, some other global search optimization techniques have been employed. In [
9], a traditional Genetic Algorithm (GA) for optimization of the fuzzy rule base of the hybrid fuzzy NN is utilized; whereas, a modified GA with new genetic operations has also been proposed to optimize the fuzzy rules of Neural Fuzzy Network (NFN) for hourly-load forecasting [
10]. The problem of over and under-forecasting during the learning process of the modified Radial Basis Function Neural Network (RBFNN) has been resolved using a GA-based optimization algorithm [
11]. For improving the forecasting accuracy of NN, the Particle Swarm Optimization (PSO)-based algorithm has been employed instead of the Levenberg Marquardt (LM) algorithm [
12]. By employing a four-step-ahead load forecast, the parameters of the Recurrent Support Vector Machine (RSVM) have been optimized using GA. Standard v-SVM suffers from high-frequency components; hence, a Gaussian loss function-based g-SVM has been proposed to approximate the load series with normal trend data and Embedded Chaotic PSO (ECPSO) has been used for parameters selection of g-SVM [
13]. In [
14], the PSO, based on Wang-Mendel (WM) for optimization of the fuzzy rule base of a load forecaster is demonstrated. To overcome the slow processing speed and over-training of SVM, Ant Colony Optimization (ACO) has been deployed for Wavelet Transform (WT) processed load sub-series [
15].
Time series-based wavelet neural network was proposed in 2001, in which, training data was processed through the time-series input selection technique; WT normalized the input data and then, the final prediction was done using multi-layer perceptron neural networks with denormalization of the data series [
16]. Reference [
17] designed four single hidden layer FFNNs with WT for a 24 h load prediction. The authors in [
18] proposed two separate three-layer perceptron networks for prediction of the next day load corresponding to low and high-frequency components decomposed by WT of a historically similar day load series vector. The hourly seasonal load series behavior has been characterized by different frequency components using WT, and final forecasting has been carried out using a PSO+NN model [
13]. Reference [
19] reported two hybrid models: the first is wavelet-based fuzzy neural networks (WFNN) and the second one is a fuzzy neural network based on Choquet Integral (FNCI) for peak and minimum load forecasting and achieved better results as compared with Adaptive Neuro-Fuzzy Inference System (ANFIS). To examine the behavior of the historical load patterns, reference [
20] uses the regression model, and the prediction has been done by LM algorithm-based wavelet neural network. The authors in [
21] utilized the WT+NN for primary forecasting and improving the accuracy of this model using a WT-based ANFIS approach.
Similarly, in order to handle the non-linear and non-stationary building heat load data, Gao (2020) et al. [
22] proposed a novel ensemble prediction model which integrates the Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) and support vector regression (SVR). The CEEMDAN algorithm automatically decomposes the heat load data patterns into an intrinsic modes function (IMF) signal as per the characteristics of patterns on multiple time scales. In the same year, Gao et al. [
23] deployed the CEEMDAN for the features extraction of hourly solar irradiance forecasting. A stable average RMSE of 38.49 W/m
has been achieved by using deep learning network-based models such as convolution neural network (CNN) and long short-term memory network (LSTM) on individual intrinsic time series [
23]. Further, a hybrid CEEMDAN and LSTM-based model has also been utilized to improve the forecasting accuracy of stock market prices [
24]. Similarly, CEEMDAN has also been utilized for the decomposition of wind speed data for improving the forecasting accuracy with NN’s [
25].
Zhang et al. [
26] introduced a variational mode decomposition (VMD) algorithm with an aim to improve the forecasting accuracy of wind speed, in which VMD was deployed to decompose the original wind series data into IMF’s, and then each subseries was forecasted using a GA-based NN. Similarly, the decomposed IMF’s of wind data has also been further denoised by using WT, and then final forecasting was performed by using back propagation (BP) and RBF-based NN’s [
27]. For the forecasting of wind power, a pattern recognition-based hybrid method is proposed in which VMD is utilized for data processing, Gram-Schmidt Orthogonalization (GSO) is used for feature selection, and in the last step, the forecasting Extreme Learning Machine (ELM) was utilized for the training of each feature-based sub-series [
28]. Similarly, VMD has been utilized for load forecasting with a long short-term memory network (LSTM) [
29].
The conventional WT decomposes the signal into low frequency and high-frequency components. However, for better accuracy in the results, the low-frequency component of the signal is further decomposed into the low and high-frequency components using multi-resolution analysis theory. The decomposed series has been further processed through the Group Method of Data Handling (GMDH)-based algorithm for the forecasting of load data series [
30]. In 2021, the same decomposition process was carried out for temperature data in which the mother wavelet was chosen on the basis of the energy-entropy ratio; for training and testing data, a different learning algorithm-based NN was proposed [
31]. On the other hand, WPD decomposes the load profile between higher and lower frequency components again into lower and higher frequency components with neural networks, and achieved almost 20% more accurate results compared to traditional WT [
32]. Advanced WT has been presented in which the entropy cost function is used to select the best wavelet basis for data decomposition, mutual information for feature selection, and neural networks for prediction of electricity load with a one and multi-step-ahead basis [
33]. In order to deal with the data noise of WPD, decomposed series correlation analysis has been deployed, and data with all the features has been trained through an improved weighted extreme learning machine [
34]. In this paper, to extract the maximum features of the input signal, the data was decomposed using the proposed signal processing technique, i.e., WPD. Unlike WT, it decomposes approximate and detailed components at the same time to achieve the maximum resolution to the input data.
As per the existing literature, it has been observed that the pre-processing of data is still an open issue from the forecasting point of view. Therefore, in this, the authors proposed a time series (statistical)-based forecasting model in which WPD is used as an input data pre-processing tool for MCV forecasting. The results of the proposed model have been compared with stand-alone NN and conventional WT-based models for a single step ahead of point forecasting. The contribution is summarised as follows: First, a practical and transparent approach with the newly demonstrated LNNTD model has been implemented to forecast MCV; the input neurons of this have been selected using ACF. Second, the proposed MCV forecast framework has been implemented to forecast MCV for a period of one year with all seasonal estimation weeks. The concept of a moving window has been adapted with a cyclic test period of one month up to a one year forecast using the one-year training data set. In WPD-based models, two types of input selection criteria have been adopted; first, one combination of WPD-based decomposed series with ACF-based time lags (TL’s) have been used as input vectors (neurons) of the model, and a similar theory has also been used for conventional WT-based models. In later (proposed) models, each WPD-based series has been forecasted individually, and the TL’s for this have been selected on the basis of mutual correlation among all time-lags using ACF. Third, as per the existing literature, for the very first time, TS’s of the forecasts have been measured for the validation of results on a single step-ahead of the forecasted values. The multiple step-ahead forecasting is conducted using an iterative approach up to the sixth step to check whether the forecast is applicable or not. Next,
Section 2 describes the strategy of the proposed model, the experimental work is presented in
Section 3, the Discussion is in
Section 4, and finally the paper is concluded in
Section 5.