Hybrid Load Forecasting Using Gaussian Process Regression and Novel Residual Prediction

Darab, Cosmin; Antoniu, Turcu; Beleiu, Horia Gheorghe; Pavel, Sorin; Birou, Iulian; Micu, Dan Doru; Ungureanu, Stefan; Cirstea, Stefan Dragos

doi:10.3390/app10134588

Open AccessArticle

Hybrid Load Forecasting Using Gaussian Process Regression and Novel Residual Prediction

by

Cosmin Darab

^1,*,

Turcu Antoniu

¹

,

Horia Gheorghe Beleiu

¹

,

Sorin Pavel

¹,

Iulian Birou

²,

Dan Doru Micu

³,

Stefan Ungureanu

¹

and

Stefan Dragos Cirstea

¹

Department of Electrical Engineering, Technical University of Cluj-Napoca, 28 Memorandumului Street, 400114 Cluj-Napoca, Romania

²

Department of Electric Machines and Drives, Faculty of Electrical Engineering, Technical University of Cluj-Napoca, 28 Memorandumului Street, 400114 Cluj-Napoca, Romania

³

Department of Electrotechnics and Measurements, Faculty of Electrical Engineering, Technical University of Cluj-Napoca, 28 Memorandumului Street, 400114 Cluj-Napoca, Romania

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2020, 10(13), 4588; https://doi.org/10.3390/app10134588

Submission received: 12 June 2020 / Revised: 27 June 2020 / Accepted: 29 June 2020 / Published: 2 July 2020

(This article belongs to the Section Electrical, Electronics and Communications Engineering)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Short-term electricity load forecasting has attracted considerable attention as a result of the crucial role that it plays in power systems and electricity markets. This paper presents a novel hybrid forecasting method that combines an autoregressive model with Gaussian process regression. Mixed-user, hourly, historical data are used to train, validate, and evaluate the model. The empirical wavelet transform was used to preprocess the data. Among the perturbing factors, the most influential predictors that were recorded were the weather factors and day type. The developed methodology is upgraded using a novel closed-loop algorithm that uses the forecasting values and influential factors to predict the residuals. Most performance indicators that are computed indicate that forecasting the residuals actually improves the method’s precision, decreasing the mean absolute percentage error from 5.04% to 4.28%. Measured data are used to validate the effectiveness of the presented approach, making it a suitable tool for use in load forecasting by utility companies.

Keywords:

energy forecasting; Gaussian process regression; machine learning models; empirical wavelet transform; residuals prediction

1. Introduction

In recent years, considerable effort has been invested in the reduction of energy use, the promotion of renewable energy sources (RES), and the implementation of mitigation measures that will lead to a reduction in the environmental cost. The industrial sector is believed to account for 38% of total global energy consumption. Studies have shown that by simply implementing the energy efficiency solutions that are currently available, by 2040, industry could produce almost double the value per unit of energy, compared to its current levels [1]. Considering the emerging trends in energy consumption coupled with rising energy prices, it is crucial that the implementation of new forecast methodology is customized to users’ specific profiles and capabilities of user profile.

Energy load forecasting has been one of the main focal points of energy-related research in both industry and academia in recent years. Nowadays, almost all decisions regarding the electricity market and utility industry use load forecasting as their primary criterion. For example, system operators use energy load forecasting to ensure power system reliability. It has also been proven to be a useful dynamical method for aggregators participating in the energy trading market or managing electricity consumption. Forecasting methods may essentially be classified as follows: very short-term load forecasting (a few minutes ahead to a several hours), short-term (one day ahead to one week ahead), medium-term load forecasting (ranging from one month to one year), and long-term load forecasting (more than one year) [2]. Short-term predictions are used in demand and supply management and to balance electricity generation [3], transmission, distribution and use on different scales [4].

Another aspect of load forecasting in the context of the energy sector is the integration of RES into the power grid [5]. The use of a high-quality forecasting model may increase the impact of renewable sources [6]. In terms of energy management, load forecasting is an indispensable tool in demand response (DR) strategies [7]. However, the execution of reliable energy load forecasting is not easy, and several challenges arise in the process [8]. For this reason, researchers focus on overcoming these obstacles using new technologies, such as big data [9].

When energy load forecasting was still nascent, simple methods, such as linear regression [10], hybrid regressions [11], Kalman filters [12], Kalman estimators [13], or autoregressive integrating moving average (ARIMA) [14] and metabolic nonlinear grey models [15], were used. While these models are effective when dealing with linear data, they are unable to provide accurate forecasting in complex, non-linear energy load time series. Load forecasting methods have changed in response to advances in artificial intelligence neural network development [16]. Paras et al. used a neural networks approach for very short energy forecasting [17] and Nima et al. integrated a feature selection to the neural network algorithm [18]. As these models used optimal parameters [19], hybrid methods—basically, artificial neural network optimized methods—were proposed [20]. Many of the optimization methods used rely on feature selection [21] or hybrid feature selection to improve the parameters used for forecasting methodology [22]. Recent research has shown that performing a feature selection when starting to train the algorithm yields better results, as the model will be more likely to learn the characteristics of the time series when dealing with the optimum input-output set [23].

Hybrid algorithms that combine at least two prediction methods when developing the forecasting model have been developed [24]. Huaiguang et al. proposed a support vector regression method enforced with feature selection of data [25]. Sibonelo Motepe also proposed a hybrid methodology but using deep learning algorithms [26]. Due to the proven efficiency of these methods, the research presented herein is also based on a hybrid approach.

Unlike traditional forecasting methods that are used to predict the load on a single node (e.g., a substation), the focus of this paper is on short-term total energy load forecasting, which yields greater accuracy by using the measured information in all nodes of interest. Another benefit of the proposed methodology over the classical methods mentioned above is the use of the residuals as predictors. This approach will customize the algorithm to better respond to the characteristics of the data. By doing this with each iteration the algorithm will better adapt to the data sets.

The hybrid algorithm described in Figure 1, uses the autoregressive (AR) time series model in combination with Gaussian process regression (GPR), coupled with a data pre-processing algorithm, the empirical wavelet transform (EWT), for data optimization. This combination of parametric and non-parametric time series, AR and GPR, showed great results in predicting the energy load consumption characteristics [27]. For optimizing the forecasting model, the use of rational quadratic (RQ) kernel function is proposed. This covariance function was chosen after testing different other kernels, due to its compatibility with the non-linear profile of load data.

The remainder of this paper is organized as follows: Section 2 presents the energy load data used to develop the forecasting method. The subsequent section deals with data preprocessing and presents the EWT used. Section 4 presents the GPR hybrid forecasting method proposed in this paper. Subsequently, the model performance indicators used to assess the algorithm are described in Section 5. Section 6 presents the results and evaluates the forecasting method’s performance. The final section of the paper concludes the study with some of the authors’ observations and recommendations for future research regarding the presented topic.

2. Data Analysis

This paper tests how different factors impact energy consumption behavior. To develop the forecasting algorithms, load data were gathered from the Romanian energy transmission and system operator [28] and correlated with weather measurements for the city of Bucharest [29]. The weather data were correlated with energy load data to provide information at hourly intervals on the same time samples. Energy load data consist of energy consumption for every hour, starting with 1 January 2018 and ending on 1 June 2019. The weather data that were taken into account consist of five variables that were sampled in the same time intervals. The weather factors that were considered to have the greatest impact on load consumption were temperature, changes in atmospheric pressure, dew point temperature, the level of precipitation in mm, and air humidity, all measured at two meters above ground reference. Another important factor considered was whether or not the day was a working day. This data type is binary and is 1 if the day is a weekend day or a national holiday and is 0 otherwise. The final factor to be considered was the date (month, day, day of the week); this criterion was added as, upon data analysis, this principle was observed to create patterns in load demand. Prior to algorithm implementation, all data were processed, and an outlier test was performed to improve the datasets.

Figure 2 illustrates how energy load fluctuates in accordance with day type; as expected a significant increase in energy load for working days could be observed. Figure 3 represents temperature influence on load demand. A clear dependency of the load with respect to the temperature could be observed. Further interesting pattern-forming characteristics for the datasets are presented in Figure 4 and Figure 5, which illustrate the influences of humidity and dew point temperature on power consumption, respectively. These characteristics are studied in order to determine if a feature selection would benefit the algorithm and which of the predictors influence less the energy load data.

3. Empirical Wavelet Transform

EWT [30] is used to preprocess the energy load data. This will ensure that the analyzed data will follow a smooth path that shapes accordingly to data changes without capturing the peak values introduced by field equipment. All data are preprocessed before using the forecasting algorithm. This method is not used in real time because data are collected until the end of the day. After acquiring the new data set, the EWT preprocess starts and the result is then added to the historical data. This wavelet method adapts to the processed signal by building a set of bandpass filters. To construct the wavelet filter bank, a robust peak detection is used prior to determining a maxima for the spectrum segmentation. This method is proposed based on multiple studies that have validated it as a perfect fit for non-stationary time signals as an energy load measurements vector [31].

To perform a short description of the algorithm, first, a limit, ω_n, is defined between each step segment. The segments are denoted with

Λ_{n} = [ω_{n - 1}, ω_{n}]

, and because ω_n ϵ (0,π), the

U_{n = 1}^{N} Λ_{n} = [0, π]

, where N is the number of segments. The scaling function is defined below:

{\hat{ϕ}}_{n} (ω) = {\begin{matrix} 1 & i f | ω | \leq (1 - γ) ω_{n} \\ c o s [\frac{π}{2} β (\frac{1}{2 γ ω_{n}} (| ω | - ω_{n} + γ ω_{n}))] & i f (1 - γ) ω_{n} \leq | ω | \leq (1 + γ) ω_{n} \\ 0 & o t h e r w i s e \end{matrix}

(1)

where function

β (x) = {\begin{matrix} 0 i f x \leq 0 \\ 1 i f x \geq 1 \end{matrix}

and

β (x) + β (1 - x) = 1, \forall x ϵ [0, 1]

and γ ϵ (0,1).

The corresponding wavelets are then defined as:

{\hat{ψ}}_{n} (ω) = {\begin{matrix} 1 & i f (1 + γ) ω_{n} \leq | ω | \leq (1 - γ) ω_{n + 1} \\ c o s [\frac{π}{2} β (\frac{1}{2 γ ω_{n + 1}} (| ω | - ω_{n + 1} + γ ω_{n + 1}))] & i f (1 - γ) ω_{n + 1} \leq | ω | \leq (1 + γ) ω_{n + 1} \\ s i n [\frac{π}{2} β (\frac{1}{2 γ ω_{n}} (| ω | - ω_{n} + γ ω_{n}))] & i f (1 - γ) ω_{n} \leq | ω | \leq (1 + γ) ω_{n} \\ 0 & o t h e r w i s e \end{matrix}

(2)

After defining the wavelet methodology, the detailed coefficients and the approximation coefficients required for the signal reconstruction must be defined. The detailed coefficients are obtained from the inner product of the signal with the empirical wavelets, and the approximation coefficients result from the inner product of the signal with the scaling function:

W_{f}^{ε} (n, t) = 〈 f, ψ_{n} 〉 = \int^{} f (γ ω) \bar{ψ_{n} (γ ω - t)} d ω

(3)

W_{f}^{ε} (0, t) = 〈 f, ϕ_{1} 〉 = \int^{} f (γ ω) \bar{ϕ_{1} (γ ω - t)} d ω

(4)

The signal reconstruction is obtained by summing the approximation with all the detailed coefficients, as shown below:

f (t) = W_{f}^{ε} (0, t) ⋆ ϕ_{1} (t) + \sum_{n = 1}^{N} W_{f}^{ε} (n, t) ⋆ ψ_{n} (t)

(5)

For the energy load signal, a level three decomposition with the proposed EWT method is applied. A graphical comparison of the original signal and the reconstructed signal after processing is illustrated below in Figure 6. As presented, the EWT, smoothens the input data which will prove very beneficial for the forecasting algorithm because it will help form patterns more easily.

4. Energy Load Forecasting Method

For the proposed energy load prediction, several forecasting models were tested. The best match for the above-presented historical data was an AR model combined with GPR and an error-adjusting function specific to the datasets.

First, a simple AR model is presented, beginning with the mathematical representation [32]:

y_{m} = c t + \sum_{i = 1}^{n} c_{i} y_{m - 1} + e_{m}

(6)

where c_i is the method coefficient for the current iteration and represents how the previous value influences the current value, e_m represents the error for the mth value, and ct is the first coefficient of the function with a constant value.

To compute the order of the AR model, a partial autocorrelation function is used with Yule–Walker equations:

p_{i} = \sum_{j = 1}^{k} Φ_{p j} * \sum_{t = 1}^{M - i} (y_{m} - \bar{y}) (y_{m + 1} - \bar{y}) \frac{1}{c t * M}

(7)

where k is the value at which the correlation function Φpk cuts off after exceeding the model order, M represents the number of observations, and ct is the mean value iteration of the finite time series.

The next step for the method after determining the order of the model is to estimate the model coefficients. To accomplish this, the first equation is expressed in vector form to express the interconnection between the input and the output:

\underset{y}{\underset{⏟}{[\begin{matrix} y_{m} \\ \begin{matrix} y_{m - 1} \\ ⋮ \end{matrix} \\ y_{m - ω + 1} \end{matrix}]}} = \underset{X}{\underset{⏟}{[\begin{matrix} 1 & y_{m - 1} & y_{m - 2} & \dots & y_{m - n} \\ 1 & y_{m - 2} & y_{m - 3} & \dots & y_{m - n + 1} \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ 1 & y_{m - ω} & y_{m - ω + 1} & \dots & y_{m - n + ω} \end{matrix}]}} \underset{C}{\underset{⏟}{[\begin{matrix} c_{t} \\ \begin{matrix} c_{1} \\ ⋮ \end{matrix} \\ c_{n} \end{matrix}]}} + [\begin{matrix} e_{1} \\ \begin{matrix} e_{2} \\ ⋮ \end{matrix} \\ e_{n + 1} \end{matrix}]

(8)

where ω represents the number of input/output pairs that are constructed using the original time series. Having written the equation in vector form, we can determine the coefficients of the model using the least squares method:

C = {(X^{T} X)}^{- 1} X^{T} y

(9)

Next, the proposed GPR model is presented [33]. It can be written as

f (x) ~ GP (m (x), k (x, x^{'}))

, an expression of its mean and covariance functions. The expression of the output regarding the n-order input vector is computed as follows:

y = f (x) + e

(10)

where e represents the noise, and may be approximated with a Gaussian distribution of zero-mean value, a variance of

σ_{n}^{2}

, x is the input vector, and y is the scalar output:

x_{i} = {[y_{m - i}, y_{m - i + 1}, \to y_{m - i + n - 1}]}^{T}

(11)

The model, together with the noise assumption, determines the likelihood that represents the probability density based on the parameters. The objective is to construct a training set

D = {(x_{i}, y_{i}) | i = 1, 2 \dots, ω}

using the input vector and scalar output described above:

p (y | f) = N (y | f, σ_{n}^{2} I)

(12)

The likelihood presented in the above expression is given by the Gaussian distribution with

y | f

mean and

σ_{n}^{2} I

variance,

y = {[y_{1}, y_{2} \dots y_{ω}]}^{T}

,

f = {[f (x_{1}), f (x_{2}) \dots f (x_{ω})]}^{T}

, and I being the unit matrix of

ω

order. Marginal distribution p(

f

) is defined as having mean zero and a Gram K matrix:

p (f) = N (f | 0, K), K_{i, j} = k (x_{j}, x_{j})

(13)

With the above knowledge, the marginal distribution of y is computed:

p (y) = \int^{} p (y | f) p (f) d f = N (f | 0, K + σ_{n}^{2} I)

(14)

To ground the predictions in theory, for the predicted output

y_{*}

with the input vector

x_{*}

, all the parameters’ values are averaged and weighted according to their posterior probability:

[\begin{matrix} y \\ y_{*} \end{matrix}] ~ N (0, [\begin{matrix} K_{y} & k_{*} \\ k_{*}^{T} & k_{* *} + σ_{n}^{2} \end{matrix}])

(15)

If there are

ω

training data points, then

k_{*}

is the

ω

order matrix of the covariances evaluated with all the data points for training and testing. Having considered the constraints of the conditioning Gaussians [33], the prediction can be determined:

p (y_{*} | y) ~ N (k_{*}^{T} K_{y}^{- 1} y | k_{* *} - k_{*}^{T} K_{y}^{- 1} k_{*} + σ_{n}^{2})

(16)

where

k_{* *} = k (x_{*}, x_{*})

and

K_{y}

is the covariance matrix.

The kernel, or covariance, function is that which defines the distance and similarity between the training data points. After testing various functions, the best performing kernel function was the RQ:

k (x, x^{'}) = σ_{n}^{2} {(1 + \frac{{(x - x^{'})}^{2}}{2 α l^{2}})}^{- α}

(17)

This can also be represented as an infinite sum of squared exponential covariance functions with different length scale characteristics (l) and scale mixture (α). RQ kernel was determined as the best choice to minimize the distance between two neighbors through experimental testing. It was decided that for this specific data string from all tested functions the best match was RQ. After integrating the kernel function, the prediction mean and covariance may be calculated:

{\begin{matrix} m (x_{*}) = c^{T} x_{*} + k_{*}^{T} K_{y}^{- 1} (y - m) \\ σ^{2} (x_{*}) = k_{* *} - k_{*}^{T} K_{y}^{- 1} k_{*} + σ_{n}^{2} \end{matrix}

(18)

This paper’s forecasting method is based on a hybrid of the autoregressive model and a double GPR method, which takes a predicted error into consideration:

y = h (x) + c^{T} x + ϵ (x)

(19)

where, h(x)

~ GP

(0,k(x,x′)), c represents the method coefficients matrix and

ϵ (x)

represents the predicted residuals. This formulation indicates that the output is the summation of a zero-mean Gaussian process, a set of fixed basis functions, and the zero-mean process of the residuals.

5. Results and Analysis

To test the method’s accuracy, three of the most commonly used performance metrics in machine learning were used: mean absolute percentage error (MAPE), root mean square error (RMSE) and mean absolute error (MAE) [34]. Each of these metrics measures the difference between the experimental and the predicted data. The smaller their value, the better the algorithm will perform:

M A P E = \frac{\sum_{i = 1}^{n} | \frac{y_{i} - y_{* i}}{y_{i}} |}{n} * 100 %

(20)

R M S E = \sqrt{\frac{\sum_{i = 1}^{n} {(y_{i} - y_{* i})}^{2}}{n}}

(21)

M A E = \frac{\sum_{i = 1}^{n} | y_{i} - y_{* i} |}{n}

(22)

where

y_{i} a n d y_{* i}

are the measured and predicted parameters at i time and n represents the number of the considered data.

The algorithm was trained and tested using the energy load historical data described above. Before starting the validation process, and in order to enhance the processm some measurements against overfitting are required. Overfitting represents the set of data that corresponds too closely or exactly to another particular set of data and may therefore fail to predict future observations reliably. To protect against overfitting, a fivefold cross-validation method was used. The described performance metrics of the algorithm are MAPE = 4.89%, RMSE = 245.3 MWh, MAE = 188.4 MWh. Figure 7 presents a comparison between the predicted and actual values of the load. The first observation is that the forecasted load mimics the characteristics of the historical data. Also, as computed above, the results are expected to generate solid forecasting data for this specific energy load trend.

The graphical representations from Figure 8 present the residuals with true and predicted responses, and the influences of temperature and month. It can be observed how the month of the year influences the residual precision, with the lowest values for the middle of the year. The same patterns can be observed for the temperature predictor; in this case, the residuals are higher for lower values of the temperature.

As proposed in the methodology section, to further improve the algorithm, the residuals were predicted using the same regression methodology. Applying this dual-layer prediction process improves the forecasting values and the performance metrics: MAPE = 4.09%, RMSE = 221.6 MWh, and MAE = 153.2 MWh. Figure 9 illustrates the load’s actual value in blue, the predicted value in red, and the dual-layer prediction in magenta.

To further validate and test the algorithm’s fitness for the specific load pattern, another test was performed. As presented above, the algorithm was implemented on historic energy load data from January 2018 to June 2019. Data collected from the month of July were used to test the algorithm’s performance. Figure 10 presents a graphical representation of the measured and predicted energy load values for this specific period of time. This graph validates the algorithm that uses residual prediction and also shows its superiority over the hybrid method without error prediction.

A comparison was made between the proposed forecasting method with and without error prediction. Table 1 presents the performance metrics for the July testing.

6. Conclusions

Accurate electricity load forecast plays an essential role in planning for utilities and electricity markets in electricity power networks. This paper proposes a novel hybrid GPR forecasting method of the aggregated energy load of a mixture of residential, commercial, and industrial clients. The relatively new EWT method was used for data preprocessing. To ensure a precise load prediction, some of the most important influencing factors were selected. Among these, the most influential were temperature, working/non-working day, and humidity. The hybrid method assumes the combination of an autoregressive time series model and GRP. Using the same methodology, further forecasting of the residuals was added. Following implementation of the algorithm, the method was validated on a new set of data. It was observed that a better approach would be to further apply an error prediction for the residuals. The proposed forecasting method was evaluated using three performance metrics. For the presented load data, this algorithm presents good results that mimic the characteristics of the aggregated plot. The novelty of the approach consists of applying EWT to preprocess the data, combining an AR method with GPR, and using the generated residuals as predictors to increase the algorithm’s precision. A future development of the method may be developed by adding a closed-loop system methodology in which the method is executed until the residuals decrease no more.

Author Contributions

All the authors contributed to this study. C.D., H.G.B., D.D.M.: conceptualization, investigation and writing of the original draft; T.A., S.P., I.B.: funding acquisition and project administration; T.A., S.U.: software and data curation; C.D., H.B.: resources; D.M., S.P., S.D.C.: supervision and writing of review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by a grant of the Romanian Ministry of Research and Innovation, CCCDI – UEFISCDI, project number PN-III-P1-1.2-PCCDI-2017-0404 / 31PCCDI/2018, within PNCDI III.

Conflicts of Interest

The authors declare no conflicts of interest.

Nomenclature

RES	Renewable Energy Sources
DR	Demand response
AR	Autoregressive
GPR	Gaussian process regression
EWT	Empiric wavelet transform
RQ	Rational quadratic
MAPE	Mean absolute percentage error
RMSE	Root mean square error
MAE	Mean absolute error

References

T.I.E. Agency. Global Energy & CO₂ Status Report. 2017. Available online: https://www.iea.org/ (accessed on 1 May 2020).
Jiang, P.; Liu, F.; Song, Y. A hybrid forecasting model based on date-framework strategy and improved feature selection technology for short term load forecasting. Energy 2017, 119, 694–709. [Google Scholar] [CrossRef]
Ahmad, T.; Chen, H. Utility companies strategy for short-term energy demand forecasting using machine learning based models. Sustain. Cities Soc. 2018, 39, 401–417. [Google Scholar] [CrossRef]
Chen, C.; Wang, F.; Zhou, B.; Chan, K.W.; Cao, Y.; Tan, Y. An interval optimization based day-ahead scheduling scheme for renewable energy management in smart distribution systems. Energy Convers. Manag. 2015, 106, 584–596. [Google Scholar] [CrossRef]
Wang, H.; Lei, Z.; Zhang, X.; Zhou, B.; Peng, J. A review of deep learning for renewable energy forecasting. Energy Convers. Manag. 2019, 198. [Google Scholar] [CrossRef]
Wang, H.; Yi, H.; Peng, J.; Wang, G.; Liu, Y.; Jiang, H.; Liu, W. Deterministic and probabilistic forecasting of photovoltaic power based on deep convolutional neural network. Energy Convers. Manag. 2017, 153, 409–422. [Google Scholar] [CrossRef]
Pinto, R.; Bessa, R.J.; Matos, M.A. Multi-period flexibility forecast for low voltage prosumers. Energy 2017, 141, 2251–2263. [Google Scholar] [CrossRef] [Green Version]
Mak, S.T. Advanced Metering and Demand Response Applications in New Technologies for Smart Grid Operation; IOP Publishing: Chicago, IL, USA, 2015; ISSN 978-0-7503-1158-8. [Google Scholar] [CrossRef]
Diamantoulakis, P.D.; Kapinas, V.M.; Karagiannidis, G.K. Big Data Analytics for Dynamic Energy Management in Smart Grids. Big Data Res. 2015, 2, 94–101. [Google Scholar] [CrossRef] [Green Version]
Enayatifar, R.; Sadaei, H.J.; Abdullah, A.H.; Gani, A. Imperialist competitive algorithm combined with refined high-order weighted fuzzy time series (RHWFTS–ICA) for short term load forecasting. Energy Convers. Manag. 2013, 76, 1104–1116. [Google Scholar] [CrossRef]
He, Y.; Zheng, Y. Short-term power load probability density forecasting based on Yeo-Johnson transformation quantile regression and Gaussian kernel function. Energy 2018, 154, 143–156. [Google Scholar] [CrossRef]
Takeda, H.; Tamura, Y.; Sato, S. Using the ensemble Kalman filter for electricity load forecasting and analysis. Energy 2016, 104. [Google Scholar] [CrossRef]
Lynch, C.; O’Mahony, M.J.; Guinee, R.A. Accurate day ahead temperature prediction using a 24 hour Kalman filter estimator. In Proceedings of the 2015 11th Conference on Ph.D. Research in Microelectronics and Electronics (PRIME), Glasgow, UK, 29 June–2 July 2015. [Google Scholar]
Eseye, A.T.; Zhang, J.; Zheng, D. Short-term photovoltaic solar power forecasting using a hybrid Wavelet-PSO-SVM model based on SCADA and Meteorological information. Renew. Energy 2018, 118, 357–367. [Google Scholar] [CrossRef]
Wang, Q.; Li, S.; Li, R.; Ma, M. Forecasting U.S. shale gas monthly production using a hybrid ARIMA and metabolic nonlinear grey model. Energy 2018, 160, 378–387. [Google Scholar] [CrossRef]
Beccali, M.; Cellura, M.; Brano, V.L.; Marvuglia, A. Forecasting daily urban electric load profiles using artificial neural networks. Energy Convers. Manag. 2004, 45, 2879–2900. [Google Scholar] [CrossRef]
Mandal, P.; Senjyu, T.; Funabashi, T. Neural networks approach to forecast several hour ahead electricity prices and loads in deregulated market. Energy Convers. Manag. 2006, 47, 2128–2142. [Google Scholar] [CrossRef]
Amjady, N.; Keynia, F. Day-ahead price forecasting of electricity markets by a new feature selection algorithm and cascaded neural network technique. Energy Convers. Manag. 2009, 50, 2976–2982. [Google Scholar] [CrossRef]
Singh, S.N.; Mohapatra, A. Repeated wavelet transform based ARIMA model for very short-term wind speed forecasting. Renew. Energy 2019, 136, 758–768. [Google Scholar] [CrossRef]
Sepasi, S.; Reihani, E.; Howlader, A.M.; Roose, L.R.; Matsuura, M.M. Very short term load forecasting of a distribution system with high PV penetration. Renew. Energy 2017, 106, 142–148. [Google Scholar] [CrossRef]
Capizzi, G.; Napoli, C.; Bonanno, F. Innovative Second-Generation Wavelets Construction with Recurrent Neural Networks for Solar Radiation Forecasting. IEEE Trans. Neural Netw. Learn. Syst. 2012, 23, 1805–1815. [Google Scholar] [CrossRef] [Green Version]
Semero, Y.K.; Zhang, J.; Zheng, D. PV power forecasting using an integrated GA-PSO-ANFIS approach and Gaussian process regression based feature selection strategy. CSEE J. Power Energy Syst. 2018, 4, 210–218. [Google Scholar] [CrossRef]
Yang, A.; Li, W.; Yang, X. Short-term electricity load forecasting based on feature selection and Least Squares Support Vector Machines. Knowl. Based Syst. 2019, 163, 159–173. [Google Scholar] [CrossRef]
Kushwaha, V.; Pindoriya, N.M. A SARIMA-RVFL hybrid model assisted by wavelet decomposition for very short-term solar PV power generation forecast. Renew. Energy 2019, 140, 124–139. [Google Scholar] [CrossRef]
Jiang, H.; Zhang, Y.; Muljadi, E.; Zhang, J.J.; Gao, D.W. A Short-Term and High-Resolution Distribution System Load Forecasting Approach Using Support Vector Regression With Hybrid Parameters Optimization. IEEE Trans. Smart Grid 2018, 9, 3341–3350. [Google Scholar] [CrossRef]
Motepe, S.; Hasan, A.N.; Stopforth, R. Improving Load Forecasting Process for a Power Distribution Network Using Hybrid AI and Deep Learning Algorithms. IEEE Access 2019, 7, 82584–82598. [Google Scholar] [CrossRef]
Zhang, C.; Wei, H.; Zhao, X.; Liu, T.; Zhang, K. A Gaussian process regression based hybrid approach for short-term wind speed prediction. Energy Convers. Manag. 2016, 126, 1084–1092. [Google Scholar] [CrossRef]
Transelectrica. Available online: http://transelectrica.ro/en/web/tel/despre-noi1 (accessed on 1 May 2020).
R. P. Ltd. Weather_in_the_World. Available online: https://rp5.ru/Weather_archive_in_Bucharest,_Filaret (accessed on 1 May 2020).
Gilles, J. Empirical wavelet transform. IEEE Trans. Signal Process. 2013, 61, 3999–4010. [Google Scholar] [CrossRef]
Dedinec, A.; Filiposka, S.; Dedinec, A.; Kocarev, L. Deep belief network based electricity load forecasting: An analysis of Macedonian case. Energy 2016, 115, 1688–1700. [Google Scholar] [CrossRef]
Jin, Y. Multi-Objective Machine Learning; Springer Science & Business Media: Warsaw, Poland, 2006. [Google Scholar]
Rasmussen, C.E.; Williams, C.K.I. Gaussian Processes for Machine Learning; MIT Press: Cambridge, MA, USA, 2006; ISBN 026218253X. Available online: www.GaussianProcess.org/gpml (accessed on 30 April 2020).
Wang, Y.; Zhang, N.; Tan, Y.; Hong, T.; Kirschen, D.S.; Kang, C. Combining Probabilistic Load Forecasts. IEEE Trans. Smart Grid 2019, 10, 3664–3674. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Schematic flow of the proposed method.

Figure 2. Energy consumption based on day type.

Figure 3. Temperature influence on energy load.

Figure 4. Energy load—humidity influence.

Figure 5. Energy load—dew point influence.

Figure 6. Comparison between reconstructed empirical wavelet transform (EWT) signal and the original signal.

Figure 7. Prediction comparison.

Figure 8. Residuals plot: (a) month predictor; (b) predicted response comparison; (c) temperature predictor; (d) true response comparison.

Figure 9. Comparison between historical data, predicted value, and forecasting value with error prediction.

Figure 10. Validation plot for July load forecasting.

Table 1. Performance metrics for July testing.

Performance Metrics	Simple Forecasting	With Error Prediction
MAPE (%)	5.04	4.28
RMSE (MWh)	205.4	158.7
MAE (MWh)	188.7	138.5

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Darab, C.; Antoniu, T.; Beleiu, H.G.; Pavel, S.; Birou, I.; Micu, D.D.; Ungureanu, S.; Cirstea, S.D. Hybrid Load Forecasting Using Gaussian Process Regression and Novel Residual Prediction. Appl. Sci. 2020, 10, 4588. https://doi.org/10.3390/app10134588

AMA Style

Darab C, Antoniu T, Beleiu HG, Pavel S, Birou I, Micu DD, Ungureanu S, Cirstea SD. Hybrid Load Forecasting Using Gaussian Process Regression and Novel Residual Prediction. Applied Sciences. 2020; 10(13):4588. https://doi.org/10.3390/app10134588

Chicago/Turabian Style

Darab, Cosmin, Turcu Antoniu, Horia Gheorghe Beleiu, Sorin Pavel, Iulian Birou, Dan Doru Micu, Stefan Ungureanu, and Stefan Dragos Cirstea. 2020. "Hybrid Load Forecasting Using Gaussian Process Regression and Novel Residual Prediction" Applied Sciences 10, no. 13: 4588. https://doi.org/10.3390/app10134588

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Hybrid Load Forecasting Using Gaussian Process Regression and Novel Residual Prediction

Abstract

1. Introduction

2. Data Analysis

3. Empirical Wavelet Transform

4. Energy Load Forecasting Method

5. Results and Analysis

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Nomenclature

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI