Equation Based New Methods for Residential Load Forecasting

Alam, S. M. Mahfuz; Ali, Mohd. Hasan

doi:10.3390/en13236378

Open AccessArticle

Equation Based New Methods for Residential Load Forecasting

by

S. M. Mahfuz Alam

and

Mohd. Hasan Ali

^*

Department of EECE, The University of Memphis, Memphis, TN 38152, USA

^*

Author to whom correspondence should be addressed.

Energies 2020, 13(23), 6378; https://doi.org/10.3390/en13236378

Submission received: 23 October 2020 / Revised: 20 November 2020 / Accepted: 27 November 2020 / Published: 2 December 2020

(This article belongs to the Section A1: Smart Grids and Microgrids)

Download

Browse Figures

Versions Notes

Abstract

:

This work proposes two non-linear and one linear equation-based system for residential load forecasting considering heating degree days, cooling degree days, occupancy, and day type, which are applicable to any residential building with small sets of smart meter data. The coefficients of the proposed nonlinear and linear equations are tuned by particle swarm optimization (PSO) and the multiple linear regression method, respectively. For the purpose of comparison, a subtractive clustering based adaptive neuro fuzzy inference system (ANFIS), random forests, gradient boosting trees, and long-term short memory neural network, conventional and modified support vector regression methods were considered. Simulations have been performed in MATLAB environment, and all the methods were tested with randomly chosen 30 days data of a residential building in Memphis City for energy consumption prediction. The absolute average error, root mean square error, and mean average percentage errors are tabulated and considered as performance indices. The efficacy of the proposed systems for residential load forecasting over the other systems have been validated by both simulation results and performance indices, which indicate that the proposed equation-based systems have the lowest absolute average errors, root mean square errors, and mean average percentage errors compared to the other methods. In addition, the proposed systems can be easily practically implemented.

Keywords:

adaptive neuro fuzzy inference system (ANFIS); random forest (RF); gradient boosting trees; long term short memory (LSTM); equation-based prediction system; load forecasting; smart buildings

1. Introduction

The energy utilization in residential and commercial buildings all over the USA is almost 40% of the overall energy generation. With the increase of luxury requirement of residents, the energy consumption is ever-increasing [1,2]. Therefore, providing the required power by grid is a hard task, especially during peak hours of the days. However, this problem can be solved in two ways. Firstly, by proper planning and allocation of energy resources by the grid, adequate power can be supplied to the consumers. Secondly, by implementing effective demand-side energy management system in the smart building that is capable of scheduling the load efficiently, the total cost of energy can be reduced by utilizing less loads that are operated by the grid power during the peak hours without affecting the consumers’ comfort demands [3,4]. An efficient load forecasting system helps the buildings’ energy management system schedule the loads ahead of time, operate the energy sources and energy storage systems effectively during peak hours to reduce the cost of energy and remove burden on the grids [5,6,7]. It also creates possibility for the smart building to sell energy to the grid during peak hours to achieve some incentives [8]. Moreover, with the knowledge of load forecasting, the grid can allocate resources ahead of time and efficiently to meet up with the load demands [9,10]. Therefore, researchers have been investigating on improved and effective load forecasting methods over the last two decades.

Based on the forecast horizon or time scale, the load forecasting is classified generally into three categories, namely short-term load forecasting (STLF), medium-term load forecasting (MTLF), and long-term load forecasting (LTLF) [11,12]. Moreover, among various types of methods that are found in the literature, the most common are either time series or regression model type. The performance of time series models such as exponential smoothing, autoregressive integrated moving average (ARIMA) model, etc., depend upon the correlation between the loads and their previous values and availability of very large data set [11,13,14,15]. Another popular conventional method that is found in the literature is regression trees [16,17]. The random forest is a homogeneous ensemble approach with the combination of many decision trees without dependency on each other [18]. The drawback of the random forest method is its inability to extrapolate meaning; its prediction range is confined by the range in the training data as it takes the average value of all the trees. Moreover, it can be overfitted if the data set is large or noisy. Gradient boosting tree is another ensemble method that has been used for prediction [19,20]. The contrast between gradient boosting tree and random forest is that gradient boosting utilizes one tree for error minimization based on the experience of the previous tree. However, the gradient boosting tree method is more vulnerable to be overfitted in the presence of noise during data training and more parameters are needed to be tuned as compared to random forest. However, once it is properly tuned, it performs better than the random forest approach. Moreover, multiple linear regression-based forecasting method is found in the literature. The drawback of this approach is that it performs well for linear systems only, whereas the buildings’ loads are mostly non-linear in nature and the power consumption is non-linear as well [21,22]. Therefore, researchers have been focusing more on artificial intelligent based prediction system because of its ability to predict non-linear loads well under different indoor and outdoor conditions [23].

The artificial intelligence-based methods include fuzzy logic (FL), adaptive neuro fuzzy inference system (ANFIS), artificial neural network (ANN), support vector machine (SVM), etc. Among the artificial intelligence systems, the ANN method is found to be popular for load forecasting [24,25,26,27,28]. Moreover, a multi block neural network with a view to predicting price and load has been proposed in a recent work [29]. In addition, the Ridgelet and Elman neural networks-based load forecasting has been described in another work [30]. However, the ANN method requires a lot of historical data during training and validation stage for future data prediction [11]. In addition to that, the performance of the ANN depends upon several factors such as correlation between the inputs and output, the proper and efficient tuning of weight and bias of the hidden and output layer [24].

Therefore, in order to get better prediction method, the authors proposed a new two input fuzzy logic system for residential load forecasting that performs better than the ANN system [31]. Between the two inputs of the fuzzy systems, one is the temperature, and the other is a variable that is calculated from the occupancy number and day type. In another work, the fuzzy based peak energy management system is proposed for the industrial consumer [32]. It is to note here that the fuzzy logic is a non-linear system that operates on IF-THEN logic [33]. In addition, a fuzzy system is a slow system as it operates on the fuzzy rules that depend on the number of inputs and the membership function for each input. If each input has m membership functions and there are n inputs in the fuzzy system, then mⁿ rules need to be evaluated for each iteration of the fuzzy system, and therefore it is practically not suitable to implement, especially if the number of inputs exceeds two as the system becomes slower. Moreover, a new subtractive clustering based ANFIS system is proposed by authors, where the temperature and another variable being calculated from occupancy and day type, are considered as inputs, and the proposed ANFIS system performed better than the conventional ANN system [34]. The ANFIS system is a combination of both neural network and fuzzy system. Therefore, the ANFIS system requires a lot of data for the training and validation of the system than is required for neural network. In addition, similar to the fuzzy system, the ANFIS system becomes a slower system to be implemented practically for prediction if the number of inputs exceeds two.

Moreover, in recent times, the upgraded version of the recurrent neural network, named the long short term memory (LSTM) model, has been popular for forecasting [35,36,37]. The LSTM operates well where the conventional recurrent network fails with a large scale of sequential input data. However, the LSTM is more complicated than conventional neural networks, and, being a black box, it lacks interpretability. In addition, it does not perform well in case of small input data, if the parameters are not properly tuned or input data are not sequential.

In addition, stochastic optimization-based prediction systems have been gaining popularity for forecasting. They are utilized mainly in the case of uncertainty in the system. A hybrid stochastic approach, for bidding strategies and demand uncertainties of large consumer, is proposed in a work [38]. Similarly, stochastic optimizations have been proposed for dealing with uncertainties in cooling demand [39] and for risk assessments of large consumer group [40].

However, the energy consumption of a residential building depends upon the habits of residents living there, responses to different environmental conditions, mode of comfort, etc. Although the consumers’ reference comfort temperature during the different conditions mentioned above can be different, however, in general, it is assumed that in USA, if the outside average temperature of the day is 65 °F, then no heating and cooling are required to be comfortable [41]. Therefore, if the energy consumption is categorized based on heating degree days (HDD), cooling degree days (CDD), number of occupants, and the day type, the uncertainty of energy reduces a lot. HDD is a term that is used for showing how much the day’s temperature is below the consumers’ reference comfort temperature (65 °F) and this constant temperature is used for HDD calculation all over USA for all the seasons [42,43]. Similarly, CDD is a term that defines how much the temperature is above the consumers’ reference comfort temperature (65 °F) [42,43].

Based on the above background, a new method, that is practically implementable and does not need a lot of data for training but works better than conventional prediction systems, requires attention and investigation. Therefore, having been motivated by this fact, this paper proposes a new method based on both non-linear and linear equations for residential load forecasting. The coefficients in the proposed non-linear equations have been tuned by the Particle Swarm optimization (PSO) algorithm. The PSO is a stochastic optimization technique that has advantageous inherent features such as a fast convergence rate as compared to other optimization techniques such as the genetic algorithm and provide more effective solutions. Moreover, it is practically implementable and has been applied in different applications [44,45,46,47,48]. The coefficients of the proposed linear equation-based system are tuned by multiple linear regression (MLR) using the least squares approach, which is available in MATLAB software.

The main contributions of this paper are summarized as follows:

Three generalized equations are developed for predicting load consumption based on the HDD, CDD, occupancy, and day type. The coefficients of the non-linear equations and linear equation are optimized by the well-known PSO and multiple linear regression method, respectively.
In order to see efficacy of the proposed equation-based methods, in predicting the loads, their performance have been compared with that of a recently published forecasting method such as the subtractive clustering based ANFIS approach, random forest, gradient boosting trees and LSTM, and conventional and modified support vector regression models.

In this work, the predicted data for all methods are simulated in MATLAB software and different errors are considered as performance indices to validate the efficacy of the proposed equations-based prediction systems.

The rest of the paper is organized as follows. In Section 2, the proposed equation-based prediction systems are described. Section 3 explains the conventional forecasting method, i.e., the ANFIS system, random forest, gradient boosting, LSTM, conventional and modified support vector regression. Simulation results are presented and explained in Section 4. The conclusion and future research directions are provided in Section 5. Finally, the references are enlisted.

2. Proposed Equation Based Prediction Methods

The load consumption of a building depends highly on temperature. The increase in temperature increases the load consumption if the temperature is above a certain temperature, which in general is 65 °F in USA, due to a higher cooling requirement. In addition, if the temperature goes below the same temperature mentioned above, the load consumption increases due to a higher requirement of heating. Therefore, energy consumption of a residential building is dependent upon HDD and CDD, which represent temperatures below or above 65 °F. Based on this fact, the energy consumption, e, can be expressed as the following:

e \propto H D D,

e \propto C D D .

Moreover, for the same temperature, HDD, CDD, the energy consumption increases with an increase in the number of occupants and vise versa in the same apartment. Therefore,

e \propto O c c u p a n t .

In addition, the energy consumption pattern of a building is different for a normal working day, weekend, any special day, or special occasion. The special day depends on the family living in a building when there may have some religious festival celebrations, some family events happening, or more than usual family members staying in the building for some reasons. In addition, it can be a normal working day or weekend or even holiday. Therefore,

e \propto D a y t y p e .

Therefore, based on the above discussion, three types of equations, as shown in (1) to (3) below, have been developed for load predictions in this work. The first equation is linear in nature as variable HDCC, occupant number (O), and day type (D), and values are linearly multiplied with the coefficients to predict the total energy consumption of the day. Moreover, the other two equations are non-linear in nature as some power values of HDCC, O are multiplied with the coefficients, whereas D is used as power for Equations (2) and (3). The exponential component is used in (2), whereas the variable is a constant whose values are determined by the optimization algorithm for (3).

e = C_{1} H D C C + C_{2} O + C_{3} D + C_{4},

(1)

e = C_{5} H D C C^{m} + C_{6} O^{n} + C_{7} e x p^{D} + C_{8},

(2)

e = C_{9} H D C C^{p} + C_{10} O^{q} + C_{11} a^{D} + C_{12},

(3)

where, e and O represent the total load consumptions in kWh in a day and number of occupants present on that day, respectively. HDCC represents the HDD values, which is the difference between the average day’s temperature and 65 °F if the temperature is below or equal to 65 °F. Moreover, HDCC represents the CDD values, which is the difference between 65 °F and the day’s average temperature values if the temperature is above 65 °F. The coefficients C₁, C₅, and C₉ depend on HDD or CDD values for (1) to (3), respectively. C₂, C₆, C₁₀ are the coefficients for number of occupants. The coefficients C₃, C₇, and C₁₁ vary with the day type. The coefficients C₄, C₈, and C₁₂ are considered to be off-sets that are dependent on HDD, CDD, occupancy and the day-type. The values for D for normal working days, weekend, and special days are considered to be 0, 1, and 2, respectively, for this work. The equations proposed in (1) and (2) to (3) are linear and non-linear in nature, respectively, whose performance certainly depends on the properly tuned values of the coefficients with the varying HDD, CDD values, occupancy, or the type of the days. Therefore, multiple linear regression method and PSO algorithm have been utilized to obtain the coefficients of the linear equation in (1) and non-linear equations proposed in (2) and (3), respectively, in order to predict the optimal total energy consumption of the day.

The working principle of the equation-based methods for residential load predictions, which consider the HDD, CDD, occupancy and the values of D based on normal working days, weekends, or special days as inputs (x), are shown in Figure 1. In this work, generalized equations are formulated based on the inputs. The dotted line portion shown in Figure 1 represents the equation-based prediction systems. First the inputs (x) are fed into the equation-based prediction system so that the ranges of the input variables are selected. Once the ranges of the variables are selected, the MLR/PSO tuned coefficient values are sent to the main equation block where Equations (1) to (3) are utilized to predict the energy consumption based on the inputs and the coefficients. The multiple linear regression method (MLR) or the PSO method provides the optimized coefficient (C₁…C₄/C₅…C₈/C₉…C₁₂) values for the proposed Equations (1)–(3) based on the range of inputs (i.e, HDD, CDD, O, D) which are summarized in Table 1, Table 2 and Table 3. These optimized tuned coefficients are obtained from the previous input data and the energy consumption data obtained from the smart meter.

2.1. Parameter Tuning by Multiple Linear Regression (MLR) Algorithm

In MATLAB, the command, regress is used for calculating the coefficients of the linear model, which has the following format:

e = C x,

(4)

subject to

\sum {(y - e)}^{2} = m i n i m u m,

where the input matrix, x = [HCDD; O; D; U], C = [C₁ C₂ C₃ C₄] and y represent the anticipated output obtained from the smart meter. U is a unity vector of length of HDCC vector to determine the values of C₄ by the multiple linear regression algorithm and introduced in the x matrix as dummy as for each set of data, the columns number of C matrix should be equal to the rows number of x matrix. By matrix multiplication of C and x matrix, the predicted output (e) is calculated and put to the condition shown above until the coefficient values (C₁,….C₄), for which the summation of square of the difference between the anticipated output (y) and predicted output (e) gets minimum.

2.2. Parameter Tuning by Particle Swarm Optimization (PSO) Algorithm

As already mentioned, in this work, the PSO method has been used for parameters tuning of the non-linear equations shown in (2) and (3). It has been widely applied in applications such as energy management [44,45], load predictions [46,47,48], etc. It is very easy to implement and has faster convergence speed and effective over other optimization algorithms such as the genetic algorithm [45].

In PSO, a random number of particles are chosen for search space and the objective function is defined. Based on the cost function at any current location, the optimal position and cost are determined and updated among the particles. Each particle then finds its new position based on its current position, previous velocity and global optimal location among the particles. After updating its positions and velocity vectors, again the best position and cost among the particles are circulated and updated. Therefore, by updating the situations (position and velocity vectors) and collaborating the information of optimal best location and optimal cost, the swarm as a group reaches its optimal goal.

The PSO algorithm is characterized by the two-model equations of velocity and position vector in an N-dimensional solution space as shown below:

v_{i}^{k + 1} = w v_{i}^{k} + c_{1} r_{1} (p_{i}^{k} - x_{i}^{k}) + c_{2} r_{2} (p_{g}^{k} - x_{i}^{k}),

(5)

x_{i}^{k + 1} = x_{i}^{k} + v_{i}^{k + 1},

(6)

where

v_{i}^{k + 1}

represents

i^{t h}

particle velocity of

{(k + 1)}^{t h}

iteration of N dimensional search space. Similarly,

x_{i}^{k}

corresponds to

i^{t h}

particle velocity of

k^{t h}

iteration.

p_{i}^{k}

and

p_{g}^{k}

correspond to the individual best position of the i particle and global best position of the swarm, respectively. Moreover,

r_{1}

and

r_{2}

are randomly chosen numbers, which are uniformly distributed between 0 and 1.

c_{1}

and

c_{2}

are known as learning factors which control the significance of the best solution. The values for both learning factors are chosen to be 2. The value for the inertia coefficient,

w

for each iteration number is calculated using the following equation:

w = w_{m a x} - \frac{t (w_{m a x} - w_{m i n})}{M a x I},

(7)

where,

w_{m a x}

and

w_{m i n}

represent the upper and lower value of w and

t

, respectively,

M a x I

correspond to the current iteration number and maximum iteration number, respectively.

The objective function for the current work is considered as follows:

f_{o b j} = y - e,

(8)

subject to

| y - e | = m i n i m u m .

The procedure of the PSO algorithm is described as follows:

• Initialization:

1. Load the input (x) and anticipated output (y) value based on the smart meter data.

2. Set the parameters of the PSO obtained from several trials which gave the optimal output.

Search space dimension = 1

Population size = 30

Maximum number of iterations = 150

w_{m a x}

= 0.9 and

w_{m i n}

= 0.2

Penalty factor = 500

• Iteration:

1. Randomly generation of velocity and position vectors, which is done by PSO.

2. Evaluate the cost function based on (5) to measure the fitness values for the corresponding inputs.

3. Start the iteration

Run the algorithm 150 times.
Based on (5) and (6), update the velocity and position vectors.

Determine the predicted energy (e) for the predicted horizon. If the constraint is violated, then add the penalty factor.

Determine the cost function.

Update the individual best and global best values based on the cost function.

Update the inertia weight.

Repeat step 3 until maximum number of iterations is reached.

After the optimal coefficients are obtained from the MLR and PSO, the coefficients are put into (1) to (3) to get the predicted outputs. The coefficients, based on different HDD, CDD, occupancy and day type condition, as determined by the MLR and PSO methods, are shown in Table 1, Table 2 and Table 3, respectively.

Interpretability is the main advantage of this proposed method. The model explains the energy consumption based on the heating degree days (HDD), cooling degree days (CDD), occupancy, and the day type. The proposed equation-based system is practically implementable as it needs only three parameters (temperature, number of occupants, type of the day.). The predicted temperature information for future days can be easily found online. The number of occupants can be inserted by the consumer, or a motion detector can be placed inside the building to count the number of occupants. Moreover, normal working days and weekend information can be available from an online calendar and the special day information can be inserted by the consumers. Once the coefficients and the temperature range are known to consumers, they even can calculate the energy consumption by hands. Moreover, it requires moderate amount of data (energy consumption, HDD, CDD, occupancy, day type) for parameter coefficient tuning by MLR and PSO. It is very convenient for practical implementation. However, the energy consumption of a residential building depends upon the habits of residents living there, responses to different environmental condition chance, mode of comfort (the usage of appliances based on consumer comfort desire under different conditions), etc. Therefore, these three equations can be implemented for any building provided that the coefficient is re-tuned based on the energy consumption pattern and other conditions such as country, region, and location.

In the first condition in Table 1, it refers to the temperatures for which CDD will be 17 °F above the reference temperature (65 °F). All temperatures equal to or higher than 82 °F (65 °F + 17 °F), would have an equivalent value for CDD of 17 °F or higher. Similarly, in the second condition, the values of CDD between 0 to less than 17 °F refer to all the values of temperatures from 65 °F to 81 °F (below 82 °F). Moreover, the value of HDD in the third condition refers to all the temperature less than or equal to 20 °F lower than the reference value 65 °F. In this case, all the temperatures that will be in the range 0 °F to 45 °F (65 °F − 20 °F) will be equivalent value for HDD of 20 °F or higher. Finally, all the temperatures in the range above 45 °F (65 °F − 20 °F) to 64.9 °F (65 °F − 0.1 °F) would be equivalent for HDD to have value less than 20 °F to 0.1 °F value. Therefore, by choosing these four ranges, all temperatures are considered. Similarly, the temperature of different ranges in terms of HDD and CDD are considered in Table 2 and Table 3.

It is important to note that the HDD and CDD values are calculated based on the constant reference temperature (65 °F) for USA. However, the consumers’ temperature comfort for different seasons and conditions can be different. Therefore, in order to cope with both conditions and predict the accurate energy consumption with HDD/CDD, the coefficients (C₁ for Equation (1), C₅ for Equation (2), and C₉ for Equation (3)) are tuned and based on HDD/CDD values for the defined range of HDD/CDD, and represent the energy variation with per degree variation of HDD/CDD (kWh/°F). Moreover, if the above methods are used for other residential places located in others countries, regions, etc., then the HDD/CDD values should be calculated based on that region’s reference temperature and the coefficient should be tuned accordingly.

3. Conventional Methods

As already mentioned, in this work, the performance of the proposed equation-based methods has been compared with that of the conventional methods such as the ANFIS, random forest, gradient boosting trees, and LSTM. These conventional methods are described below.

3.1. Adaptive Neuro Fuzzy Inference System (ANFIS) Based Load Forecasting

The ANFIS is an intelligent model with the inherent contribution of both a neural network and a fuzzy system. In this work, a Sugeno-type ANFIS system is considered. The ANFIS system is governed by two major stages, namely antecedent and conclusion. Both parts are related to each other by fuzzy rules. For the chosen Sugeno type ANFIS system, the fuzzy rules are formulated by the following equation [34]:

I f (x_{1} = A_{i}) a n d (x_{2} = B_{i}) t h e n f_{i} = p_{i} x_{1} + q_{i} x_{2} + r_{i}

(9)

where, x₁ and x₂ correspond to the inputs to the ANFIS system. Two inputs that have been chosen, are temperature (x₁) and a variable, R (x₂), as shown in (10). A_i and B_i represent the fuzzy sets. Therefore, f_i indicates the output that is governed by the fuzzy rules. For example, temperature corresponds to A₁ and R value corresponds to B₁, rule 1 of the output would be: f₁ = p₁A₁ + q₁B₁ + r₁. During the training process, the parameters (i.e., p_i, q_i, and r_i) are calculated. The input, R is determined by (10):

R = o c c u p a n t + 1.5 \times d .

(10)

The value of d can be 0, 1, and 2 based on normal working days, weekend, and special days, respectively. Therefore, if the number of occupants for a day is 5, and the day is a normal working day (d = 0), the value of R would be 5. If the day is a weekend (d = 1) or special day (d = 2), for the same number of occupants (5), the values of R would be 6.5 and 8, respectively.

In the ANFIS system, at first the data is utilized during the training process and the rules are extracted and membership functions types and their positions are determined through training and testing. Finally, the results are used for future predictions. For this work, during training, temperature, R values, and output energy consumption data of 304 days are provided. The parameters for the input (temperature, R) and output (total energy consumption) membership functions are tuned by the hybrid algorithm that utilizes the backpropagation method for the parameter of input membership function. In addition, output membership function parameters are optimized by the least square estimation method. Subtractive clustering defines the number of the fuzzy rules along with the number of membership functions and membership type. Therefore, the subtractive method is very useful if the data pattern is unknown, as well as if one is unsure as whether or not to choose the number of membership function with the membership type and center position.

The parameters of subtractive clustering are chosen from [34]. In normal fuzzy system, if both inputs have 10 membership functions, then the total fuzzy rules would have been 100, which have to be analyzed for each input data. However, for the chosen subtractive clustering parameters, each input has 10 membership functions and the total number of fuzzy rules is 10, as shown in Figure 2, which makes the subtractive clustering beneficial and the system faster. The minimum error and number of epochs are chosen to be 0 and 500, respectively. The minimal root-mean-square error is found to be 5.13 after 500 epochs. The tuned Gaussian fuzzy membership functions are shown in Figure 3. The parameters of ANFIS system are used from [34].

3.2. Random Forest Based Load Forecasting

Random forest is an ensemble approach that emphasizes the predictions of all the decision trees that are independent upon each other [49]. The sample size is randomly selected and fitted into a regression tree. The process is known as bagging and the selected sample is called bootstrap. This sample is replaced with another random sample each time. The probability of all the observations is assumed to be same. The bagging algorithm then implements the classification and regression tree (CART) algorithm to obtain a set of regression trees and finally averages the output of all trees based on the following equation:

{\hat{Y}}^{'} = \frac{1}{r} \sum_{i = 1}^{r} \hat{h} (X^{'}, S_{n}^{θ_{i}}),

(11)

where,

{\hat{Y}}^{'}

is the output estimation based on new input

X^{'}

and

\hat{h} (X^{'}, S_{n}^{θ_{i}})

is the predicted output of bootstrap sample of S_n. θ_i represents a randomly chosen variable having identical distribution.

For this method, the input variables considered are temperature, occupancy, and day type. The energy consumption per day is the output of the prediction system. The unbiased importance of input variables that are measured using the out of bag method and the number of levels, is shown in Figure 4.

The parameter of this method, optimized by the Bayesian optimization algorithm [50], are summarized in the Table 4.

3.3. Gradient Boosting Trees Based Load Forecasting

The gradient boosting is an additive model that is characterized by the following equation [51]:

F_{m} (x) = F_{m - 1} (x) + h_{m} (x),

(12)

where F_m(x) represents the prediction sum of all m regression trees and h_m(x) is the fixed sized regression trees. In MATLAB, the least square boosting (LSBoost) is used for regression [52,53]. At each iteration, the ensemble adds a new tree to the difference between the response observed and the summation of prediction of all trees used before. The LSBoost is efficient in minimizing the mean-squared error. Similar to the random forest method, the variables such as temperature, occupancy, and day type are considered as inputs for this method. The energy consumption per day is the output of the prediction system. The parameters of this method, optimized by the Bayesian optimization algorithm, are summarized in Table 5.

3.4. LSTM Based Load Forecasting

The LSTM is an improved version recurrent neural network (RNN) with added cell state and gates and thus it has the ability to overcome the gradient vanishing problem that the conventional RNN has [35,36]. The LSTM is characterized by the following sets of equations:

f_{t} = σ (W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f}),

(13)

i_{t} = σ (W_{f} \cdot [h_{t - 1}, x_{t}] + b_{i}),

(14)

o_{t} = σ (W_{o} \cdot [h_{t - 1}, x_{t}] + b_{o}),

(15)

\tilde{C_{t}} = t a n h (W_{C} \cdot [h_{t - 1}, x_{t}] + b_{C}),

(16)

C_{t} = f_{t} ʘ C_{t - 1} + i_{t} ʘ \tilde{C_{t}},

(17)

h_{t} = o_{t} * t a n h (C_{t}),

(18)

where, f_t represents forget gates that control the amount of previous states to be reflected on the current states. It is the input and o_t is the output gates that decide the amount of new information to update the cell state and to output depending on cell state. σ keeps the output values between 0 to 1. All the gates are updated based on current input x_t and previous output h_t−1. C_t and

\tilde{C_{t}}

represent cell state and the value required for calculating cell state, respectively. For the LSTM based load forecasting, the input variables are temperature, occupancy, and day type. The training of the LSTM approach is shown in Figure 5. For the LSTM model parameters, the Adam optimization approach is used [34] and the parameters for LSTM are shown in Table 6.

3.5. Conventional and Modified Support Vector Regression Based Load Forecasting

The modified support vector regression (SVR)-based prediction method involves three stages for residential buildings energy consumption predictions, as shown in Figure 6. In the first stage, the previous historical data inputs (x_tr) and known energy consumptions (y_tr) are fed into the SVR training stage, which produce the values of β₀, b₀. β₀ has 14 values which correspond to coefficients for 14 input parameters such as temperature, humidity, wind speed, etc. The obtained values of β₀, b₀ by the SVR training system are then considered as the initial values for the PSO stage. In the PSO stage, the predicted inputs (x) and anticipated consumption (y), which can be obtained from smart meter by similar day/input approach, are inserted. As already mentioned, energy consumption in a residential building depends on the temperature range, other environmental conditions range, occupancy, or even the day type. Therefore, more sets of parameter values are required to be considered based on temperature range to predict the energy consumption more accurately. Therefore, four sets of β_optn, b_optn values are generated by the PSO method based on the temperature range and one of four sets values of β_optn, b_optn based on the corresponding temperature is used by the SVR equation to predict the energy consumption of the residential building, as shown in Figure 6, where n = 1, 2,…4.

The support vector regression, because of its dependence on kernel function, is considered as a nonparametric technique [54]. In MATLAB, epsilon-insensitive support vector regression is available in which the set of training data of both predictor variables (x_tr) and observed response values (y_tr) are provided with a view to deriving a function f(x) which will deviate from all y within the limit of ε values. Therefore, the equation for the f(x) can be expressed as shown in (19) [54,55].

f (x) = x^{'} β + b,

(19)

where, x is the set of N observation, β and b represent the coefficients of input and bias, respectively. In order to formulate a convex optimization problem and to ensure that f(x) is as flat as possible, it is required to minimize the objective function, which can be represented by the following equation:

J (β) = \frac{1}{2} β^{'} β .

(20)

Subject to

\forall n : | y_{n} - (x^{'} β + b) | \leq ε,

where, ε is the residue. Since it might not be possible for f(x) to satisfy the constraint in (20) for all values of x, two slack variables

ξ_{n}

and

ξ_{n}^{*}

are included with a view of maintaining the constraint shown in (21) for all values of x. Therefore, the objective function presented in (20) can be rewritten as follows:

J (β) = \frac{1}{2} β^{'} β + C \sum_{n = 1}^{N} (ξ_{n} + ξ_{n}^{*}) .

(21)

Which subjects to:

\forall n : y_{n} - (x^{'} β + b) \leq ε + ξ_{n},

\forall n : (x^{'} β + b) - y_{n} \leq ε + ξ_{n}^{*},

\forall n : ξ_{n} \geq 0,

\forall n : ξ_{n}^{*} \geq 0,

where, C is known as the box constraint that has the ability to control the penalty when the observation does not fall within the ε margin. It also controls the trades between the flatness of f(x) and maximum tolerable values beyond ε margin.

The linear ε-insensitive loss function can be expressed as:

L_{ε} = {\begin{array}{l} 0 i f | y - f (x) | \leq ε \\ | y - f (x) | \leq ε o t h e r w i s e \end{array},

L (α) = \frac{1}{2} \sum_{i = 1}^{N} \sum_{j = 1}^{N} (α_{i} - α_{i}^{*}) (α_{j} - α_{j}^{*}) G (x_{i}, x_{j}) + ε \sum_{i = 1}^{N} (α_{i} + α_{i}^{*}) - \sum_{i = 1}^{N} y_{i} (α_{i} - α_{i}^{*}) .

(22)

The non-linear support vector regression can be achieved using Lagrange dual formulations. Then, the objective function becomes as shown in (22). The constraints in (22) are:

\sum_{n = 1}^{N} (α_{n} - α_{n}^{*}) = 0,

\forall n : 0 \leq α_{n} \leq C,

\forall n : 0 \leq α_{n}^{*} \leq C,

where, the linear Kernel function can be expressed as:

G (x_{i}, x_{j}) = x_{i}^{'} x_{j} .

(23)

The objective function shown in (22) can be solved by the quadratic programming techniques. In this work, sequential minimal optimization method (SMO), which is a very popular approach for SVR problems, is considered. In SMO, a series of two-point optimization is considered and these two points are selected by a selection rule that is governed by second-order information. In SVR, the gradient vector is updated after each iteration by the following equation:

{(\nabla L)}_{n} = {\begin{array}{l} \sum_{i = 1}^{N} (α_{i} - α_{i}^{*}) G (x_{i}, x_{j}) + ε - y_{n}, n \leq N \\ - \sum_{i = 1}^{N} (α_{i} - α_{i}^{*}) G (x_{i}, x_{j}) + ε + y_{n}, n > N \end{array} .

(24)

After the training process described in (19)–(24), the values of β₀, b₀ are obtained and then fed in the PSO stage for further optimizations. For PSO, all the methods and parameters are used the same, as described in Section 2.2.

After the optimal coefficients are obtained from the PSO based on the temperature range, input and anticipated output, the coefficients are put into (19) to get the predicted output.

Moreover, in this work, the conventional PSO tuned SVR method, as shown in Figure 7, has also been used. Likewise, the modified SVR system, the conventional system, also involves three stages for energy consumption predictions. The SVR training stage produces the β₀, b₀ for the PSO stage. Then, the PSO provides only one set of values of β_opt, b_opt based on the predicted inputs and anticipated consumption, which can be obtained from a smart meter using the similar day/input approach. Therefore, the SVR training system and the PSO stage are the same for both methods with the exception that the modified system considers the temperature range as an additional input. The coefficients, based on different temperatures for the modified SVR method and one set for all temperatures for the conventional SVR method are shown in Table 7, where all T values are in degree Fahrenheit (°F).

4. Simulation Results and Discussion

4.1. Simulation Data and Conditions

In this work, the daily total energy demand and the average temperature data of the day were collected from an apartment located in 3571 Midland Avenue, Memphis, TN. The smart energy meter (meter 54BKW988882) data is available in the MLGW web account. Moreover, the number of occupants present at any day and type of the day information were collected from the residents in the building. A total of 334 days of data (334 sets of data) of average temperatures for a given day, average number of occupants for the day, day type, were collected. Moreover, out of these data, randomly chosen 30 days (30 sets of data) data were used for the prediction of total energy consumption per day for comparison purposes and rest 304 days data were used for the ANFIS, random forest, LSBoost, and LSTM network methods for their training and validation. Similarly, 30 days of data of HDD/CDD, occupancy, and day type value (D) were used to get the tuned values of coefficients for the proposed equation-based systems. As for modified SVR and conventional SVR, 14 inputs (temperature, average dew points, relative humidity, specific humidity, indoor humidity, average wind speed, atmospheric pressure, average precipitation, insolation index and solar radiation, occupancy, normal weekdays/weekend/special holidays, HDD, CDD) were considered and 304 sets of data of 304 days were used for training and validations.

4.2. Effectiveness of Proposed Equation Based Prediction System over ANFIS, Random Forest, LSBoosting, and LSTM, Modified and Conventional SVR Methods

For all the prediction systems, as previously explained, randomly chosen 30 days of data were used for prediction and comparison purposes. For the ANFIS system, as previously explained, two inputs such as the temperature and P values were considered. For the equation-based systems, three inputs (HDD/CDD, occupancy, day type) and for other methods except modified and conventional SVR methods, three inputs (temperature, occupancy, day type) were considered. Since for all methods, occupancy and day type are common inputs, the data for the 30 predicted days were shown in Figure 8.

Figure 9 represents the comparison of prediction of energy consumptions by the proposed equations, ANFIS, random forest, LSBoosting, LSTM, modified and conventional SVR based prediction systems with actual energy consumption data. From the results, it is evident that the proposed equation-based prediction systems perform better as compared to all other systems.

Furthermore, the absolute percentage of error (%Err), the absolute average error (A.E), root mean square error (RMSE), and mean average percentage error (MAPE) for the prediction systems have been calculated using (25), (26), (27), and (28), respectively.

The absolute percentage error shows the percentage of prediction error per day total consumption and helps determine the maximum error that occurs within the considered time period. The absolute average error predicts the average error of prediction from the actual consumption with the considered time periods. Similarly, the RMSE and MAPE shows the mean error and mean percentage of error over a considered time period. These error methods are very standard for the comparison of performance. The lower values of these errors mean the system predicts very close to the actual predictions. Therefore, these errors are used to evaluate the best system performance and these errors have been used as performance indices in this work.

% E r r = | \frac{A c t u a l - P r e d i c t e d}{A c t u a l} | \times 100,

(25)

A . E = \frac{1}{N} \sum_{i = 1}^{N} | A c t u a l_{i} - P r e d i c t e d_{i} |,

(26)

R M S E = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(A c t u a l_{i} - P r e d i c t e d_{i})}^{2}},

(27)

M A P E = \frac{1}{N} \sum_{i = 1}^{N} % E r r_{i},

(28)

where N = 30 is used for Equations from (25) to (28). The percentage errors of proposed methods and other systems for predicting energy demands of chosen 30 days are shown in Figure 10.

Moreover, the average, root mean square and mean average percentage errors for all systems are shown in Table 8. From Table 8, it is evident that the average errors of equation-based prediction systems are less than those of ANFIS, random forest, LS boosting, LSTM, modified and conventional SVR based prediction systems. In this case, the proposed method shown in (1), (2), and (3) perform 29.75%, 47.97% and 48.63% better, respectively, than the ANFIS system. The modified SVR performs 2.87% better as compared to ANFIS system. However, the ANFIS system performs 106.8%, 96.31%, 109.01%, and 71.31% better as compared to random forest, LSBoosting, LSTM and conventional SVR methods, respectively.

Moreover, the RMSE values indicate that the equation-based systems proposed in (1) to (3) perform 48.72%, 50.83%, and 48.42% better, respectively, than the ANFIS system. The modified SVR performs 8.31% better as compared to the ANFIS system. However, the ANFIS system shows 44.18%, 59.38%, 54.87%, and 33.01% superior performance as compared to random forest, LSBoosting, LSTM and conventional SVR methods, respectively. In addition, the equation-based systems perform 19.62%, 35.21%, and 44.38% better, respectively, than the ANFIS system in terms of MAPE. Moreover, the ANFIS system performs 281.56%, 117.83%, 125.72%, 30.11%, and 170.42% better as compared to random forest, LSBoosting, LSTM, modified and conventional SVR methods, respectively. Therefore, the proposed equation-based prediction systems perform better than other methods in all cases. Moreover, the errors of the ANFIS system are considered as the reference system for all performance improvement calculations mentioned above.

In addition to the RMSE error calculation, the sum of squares due to error (SSE), the coefficient of determination (R² value) is used to evaluate the goodness of fit statistics analysis [56]. The R² values are calculated based on the following Equation (29):

R^{2} = 1 - \frac{S S E}{S S T} = 1 - \frac{\sum_{i = 1}^{N} {(y_{i} - e_{i})}^{2}}{\sum_{i = 1}^{N} {(y_{i} - \bar{y})}^{2}},

(29)

where, SST corresponds to sum of squares above the mean. Based on Equation (29), the R² value for the multiple linear regression optimization-based Equation (1) system is found to be 0.9804, which reflects that 98.04% of the total variation in the data (N = 30) are explained by the mentioned system. Moreover, SSE and SST values are found to be 139.867 and 7136.418, respectively.

5. Conclusions

This paper proposes new equations-based methods, based on HDD, CDD, occupancy and week/special days, for residential load forecasting. The performance of the proposed methods has been compared with that of the ANFIS, random forest, LSBoosting LSTM, modified and conventional SVR approaches. The forecasted energies by all methods are analyzed with actual energy consumption data for validation. The 304 days data are considered during training of the ANFIS, random forest, LSBoosting, LSTM, modified and conventional SVR systems. Moreover, 30 days of data of the same apartment are used for the prediction of all the methods. Based on the obtained simulation responses and performance indices, the following conclusions can be drawn.

The proposed equations-based methods are effective in predicting residential loads.
The proposed prediction systems require less computation and perform better than the ANFIS, random forest, LSBoosting, LSTM, and modified and conventional SVR systems. It is noteworthy that the energy consumption of a residential building depends upon the members living there with their habits, response to different environmental condition, mode of comfort, etc. Therefore, if the energy consumption is categorized based on HDD, CDD, number of occupants, the day type, the uncertainty of energy reduces much. From the Table 9 below, it is evident that if we consider the whole data range (bottom most row), the uncertainty of the system is high (12.91) in terms of standard deviation while the average energy consumption is 27.15 kWh. However, after dividing the data based on the conditions, it is evident that the average energy consumption is different than others and have much less uncertainty as compared to when considering the whole data. Moreover, this variation is seen because of various number of occupancies for a particular day and day type. That is why our proposed systems perform better than other considered systems. Moreover, our proposed systems do not require large data sets for training and sequential data for efficient prediction as it is required for LSTM but can efficiently predict any randomly chosen data.
The proposed equation-based systems have the lowest absolute average errors (1.72%, 1.27%, 1.25%), root mean square errors (2.16, 2.07, 2.17) and mean average percentage errors (7.50%, 6.05%, 5.19%) compared to other methods.
The proposed equations-based systems can easily be implemented in real practice.
In the near future, the performance of the equations-based prediction systems will be compared with other methods such as deep neural network, other new probabilistic prediction systems, etc. In addition, Bayesian optimization, which considers the data to have normal distribution, will be considered in the future work.

Author Contributions

Conceptualization, M.H.A. and S.M.M.A.; methodology, S.M.M.A.; software, S.M.M.A.; validation, S.M.M.A.; formal analysis, S.M.M.A.; investigation, S.M.M.A.; resources, S.M.M.A.; data curation, S.M.M.A.; writing—original draft preparation, S.M.M.A.; writing—review and editing, M.H.A.; visualization, S.M.M.A.; supervision, M.H.A.; project administration, M.H.A.; funding acquisition, M.H.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

Wells, L.; Rismanchi, B.; Aye, L. A review of Net Zero Energy Buildings with reflections on the Australian context. Energy Build. 2018, 158, 616–628. [Google Scholar] [CrossRef]
Lawrence, T.M.; Boudreau, M.-C.; Helsen, L.; Henze, G.P.; Mohammadpour, J.; Noonan, D.S.; Patteeuw, D.; Pless, S.; Watson, R.T. Ten questions concerning integrating smart buildings into the smart grid. Build. Environ. 2016, 108, 273–283. [Google Scholar] [CrossRef] [Green Version]
Ihsane, I.; Miegeville, L.; Ait-Ahmed, N.; Guerin, P. New Evaluation Metrics for Electrical Demand Forecasting: Application to the Residential Sector. In Proceedings of the 2018 AEIT International Annual Conference, Bari, Italy, 3–5 October 2018; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2018; pp. 1–6. [Google Scholar]
Vossen, J.; Feron, B.; Monti, A. Probabilistic Forecasting of Household Electrical Load Using Artificial Neural Networks. In Proceedings of the 2018 IEEE International Conference on Probabilistic Methods Applied to Power Systems (PMAPS), Boise, ID, USA, 24–28 June 2018; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2018; pp. 1–6. [Google Scholar]
Bayram, I.S.; Ustun, T.S. A survey on behind the meter energy management systems in smart grid. Renew. Sustain. Energy Rev. 2017, 72, 1208–1232. [Google Scholar] [CrossRef]
Amaral, H.L.M.D.; De Souza, A.N.; Gastaldello, D.S.; Palma, T.X.D.S.; Maranho, A.D.S.; Papa, J.P.; Haroldo, L.M.D.A. Use of virtual load curves for the training of neural networks for residential electricity consumption forecasting applications. In Proceedings of the 2018 13th IEEE International Conference on Industry Applications (INDUSCON), São Paulo, Brazil, 12–14 November 2018; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2018; pp. 85–90. [Google Scholar]
Zheng, J.; Chen, X.; Yu, K.; Gan, L.; Wang, Y.; Wang, K. Short-term Power Load Forecasting of Residential Community Based on GRU Neural Network. In Proceedings of the 2018 International Conference on Power System Technology (POWERCON), Guangzhou, China, 6–8 November 2018; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2018; pp. 4862–4868. [Google Scholar]
Li, G.; Wu, D.; Hu, J.; Li, Y.; Hossain, M.S.; Ghoneim, A. HELOS: Heterogeneous Load Scheduling for Electric Vehicle-Integrated Microgrids. IEEE Trans. Veh. Technol. 2017, 66, 5785–5796. [Google Scholar] [CrossRef]
Nokar, M.A.; Tashtarian, F.; Yaghmaee, M.H. Residential power consumption forecasting in the smart grid using ANFIS system. In Proceedings of the 2017 7th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, Iran, 26–27 October 2017; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2017; pp. 111–118. [Google Scholar]
Hong, T.; Fan, S. Probabilistic electric load forecasting: A tutorial review. Int. J. 2016, 32, 914–938. [Google Scholar] [CrossRef]
Daut, M.A.M.; Hassan, M.Y.; Abdullah, H.; Rahman, H.A.; Abdullah, M.P.; Hussin, F. Building electrical energy consumption forecasting analysis using conventional and artificial intelligence methods: A review. Renew. Sustain. Energy Rev. 2017, 70, 1108–1118. [Google Scholar] [CrossRef]
Hernandez, L.; Baladron, C.; Aguiar, J.M.; Carro, B.; Sanchez-Esguevillas, A.J.; Lloret, J.; Massana, J. A Survey on Electric Power Demand Forecasting: Future Trends in Smart Grids, Microgrids and Smart Buildings. IEEE Commun. Surv. Tutor. 2014, 16, 1460–1495. [Google Scholar] [CrossRef]
Yildiz, B.; Bilbao, J.; Sproul, A. A review and analysis of regression and machine learning models on commercial building electricity load forecasting. Renew. Sustain. Energy Rev. 2017, 73, 1104–1122. [Google Scholar] [CrossRef]
Yu, M.; Zhou, W.; Wang, B.; Jin, J. The short-term forecasting of wind speed based on EMD and ARMA. In Proceedings of the 2017 12th IEEE Conference on Industrial Electronics and Applications (ICIEA), Siem Reap, Cambodia, 18–20 June 2017; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2017; pp. 495–498. [Google Scholar]
Bantupalli, M.K.; Matam, S.K. Wind Speed forecasting using empirical mode decomposition with ANN and ARIMA models. In Proceedings of the 2017 14th IEEE India Council International Conference (INDICON), Roorkee, India, 15–17 December 2017; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2017; pp. 1–6. [Google Scholar]
Chowdhury, D.; Sarkar, M.; Haider, M.Z.; Alam, T. Zone Wise Hourly Load Prediction Using Regression Decision Tree Model. In Proceedings of the 2018 International Conference on Innovation in Engineering and Technology (ICIET), Dhaka, Bangladesh, 27–28 December 2018; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2018; pp. 1–6. [Google Scholar]
Chen, T.; Lehr, J.; Lavrova, O.; Martinez-Ramonz, M. Distribution-level peak load prediction based on Bayesian Additive Regression Trees. In Proceedings of the 2016 IEEE Power and Energy Society General Meeting (PESGM), Boston, MA, USA, 17–21 July 2016; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2016; pp. 1–5. [Google Scholar]
Wang, Z.; Wang, Y.; Zeng, R.; Srinivasan, R.S.; Ahrentzen, S. Random Forest based hourly building energy prediction. Energy Build. 2018, 171, 11–25. [Google Scholar] [CrossRef]
Persson, C.S.; Bacher, P.; Shiga, T.; Madsen, H. Multi-site solar power forecasting using gradient boosted regression trees. Sol. Energy 2017, 150, 423–436. [Google Scholar] [CrossRef]
Chen, C.-M.; Liang, C.-C.; Chu, C.-P. Long-term travel time prediction using gradient boosting. J. Intell. Transp. Syst. 2019, 24, 109–124. [Google Scholar] [CrossRef]
Aurangzeb, K. Short Term Power Load Forecasting using Machine Learning Models for energy management in a smart community. In Proceedings of the 2019 International Conference on Computer and Information Sciences (ICCIS), Sakaka, Saudi Arabia, 3–4 April 2019; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2019; pp. 1–6. [Google Scholar]
Nair, A.S.; Hossen, T.; Campion, M.; Ranganathan, P. Optimal Operation of Residential EVs using DNN and Clustering based Energy Forecast. N. Am. Power Symp. 2018, 1–6. [Google Scholar] [CrossRef]
Wang, Z.; Srinivasan, R.S. A review of artificial intelligence based building energy use prediction: Contrasting the capabilities of single and ensemble prediction models. Renew. Sustain. Energy Rev. 2017, 75, 796–808. [Google Scholar] [CrossRef]
Wang, L.; Lee, E.W.; Yuen, R.K. Novel dynamic forecasting model for building cooling loads combining an artificial neural network and an ensemble approach. Appl. Energy 2018, 228, 1740–1753. [Google Scholar] [CrossRef]
Delorme-Costil, A.; Bezian, J.-J. Forecasting Domestic Hot Water Demand in Residential House Using Artificial Neural Networks. In Proceedings of the 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), Cancun, Mexico, 18–21 December 2017; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2017; pp. 467–472. [Google Scholar]
Akarslan, E.; Hocaoglu, F.O. Electricity demand forecasting of a micro grid using ANN. In Proceedings of the 2018 9th International Renewable Energy Congress (IREC), Hammamet, Tunisia, 20–22 March 2018; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2018; pp. 1–5. [Google Scholar]
Liu, Y.; Li, D.; Pei, H.; Liu, K.; Li, Y.; Yang, L. Short-term load prediction method for power distributing method based on back-propagation neural network. In Proceedings of the 2017 12th IEEE Conference on Industrial Electronics and Applications (ICIEA), Siem Reap, Cambodia, 18–20 June 2017; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2017; pp. 881–886. [Google Scholar]
Alonso, R.; Chavez, A. Short term load forecast method using artificial neural network with artificial immune systems. IEEE URUCON 2017, 1–4. [Google Scholar] [CrossRef]
Gao, W.; Darvishan, A.; Toghani, M.; Mohammadi, M.; Abedinia, O.; Ghadimi, N. Different states of multi-block based forecast engine for price and load prediction. Int. J. Electr. Power Energy Syst. 2019, 104, 423–435. [Google Scholar] [CrossRef]
Ghadimi, N.; Akbarimajd, A.; Shayeghi, H.; Abedinia, O. Two stage forecast engine with feature selection technique and improved meta-heuristic algorithm for electricity load forecasting. Energy 2018, 161, 130–142. [Google Scholar] [CrossRef]
Alam, S.M.M.; Ali, M.H. A New Fuzzy Logic Based Method for Residential Loads Forecasting. Paper ID: 2020TD0342. In Proceedings of the IEEE PES Transmission & Distribution (T&D) Conference & Exposition, Chicago, IL, USA, 12–15 October 2020. [Google Scholar]
Khodaei, H.; Hajiali, M.; Darvishan, A.; Sepehr, M.; Ghadimi, N. Fuzzy-based heat and power hub models for cost-emission operation of an industrial consumer using compromise programming. Appl. Eng. 2018, 137, 395–405. [Google Scholar] [CrossRef]
Hossain, M.K.; Ali, M.H. Transient Stability Augmentation of PV/DFIG/SG-Based Hybrid Power System by Nonlinear Control-Based Variable Resistive FCL. IEEE Trans. Sustain. Energy 2015, 6, 1638–1649. [Google Scholar] [CrossRef]
Alam, S.M.M.; Ali, M.H. A New Subtractive Clustering Based ANFIS System for Residential Load Forecasting. In Proceedings of the 2020 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference (ISGT), Washington, DC, USA, 17–20 February 2020; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2020; pp. 1–5. [Google Scholar]
Wang, Y.; Zhu, S.; Li, C. Research on Multistep Time Series Prediction Based on LSTM. In Proceedings of the 2019 3rd International Conference on Electronic Information Technology and Computer Engineering (EITCE), Xiamen, China, 18–20 October 2019; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2019; pp. 1155–1159. [Google Scholar]
Kwon, B.-S.; Park, R.-J.; Bin Song, K. Short-Term Load Forecasting Based on Deep Neural Networks Using LSTM Layer. J. Electr. Eng. Technol. 2020, 15, 1501–1509. [Google Scholar] [CrossRef]
Cui, C.; He, M.; Di, F.; Lu, Y.; Dai, Y.; Lv, F. Research on Power Load Forecasting Method Based on LSTM Model. In Proceedings of the 2020 IEEE 5th Information Technology and Mechatronics Engineering Conference (ITOEC), Chongqing, China, 12–14 June 2020; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2020; pp. 1657–1660. [Google Scholar]
Abedinia, O.; Zareinejad, M.; Doranehgard, M.H.; Fathi, G.; Ghadimi, N. Optimal offering and bidding strategies of renewable energy based large consumer using a novel hybrid robust-stochastic approach. J. Clean. Prod. 2019, 215, 878–889. [Google Scholar] [CrossRef]
Saeedi, M.; Moradi, M.; Hosseini, M.; Emamifar, A.; Ghadimi, N. Robust optimization based optimal chiller loading under cooling demand uncertainty. Appl. Eng. 2019, 148, 1081–1091. [Google Scholar] [CrossRef]
Bagal, H.A.; Soltanabad, Y.N.; Dadjuo, M.; Wakil, K.; Ghadimi, N. Risk-assessment of photovoltaic-wind-battery-grid based large industrial consumer using information gap decision theory. Sol. Energy 2018, 169, 343–352. [Google Scholar] [CrossRef]
US Department of Commerce National Oceanic and Atmospheric Administration National Weather Service. Available online: https://www.weather.gov/key/climate_heat_cool (accessed on 29 November 2020).
Memphis Weather Forecast Office. Available online: https://w2.weather.gov/climate/xmacis.php?wfo=meg (accessed on 29 November 2020).
K3JAE’s Weather Station Bruceton Tennessee. Available online: https://www.k3jae.com/wxdegreeday.php (accessed on 29 November 2020).
Hossain, A.; Pota, H.R.; Squartini, S.; Zaman, F.; Muttaqi, K.M. Energy management of community microgrids considering degradation cost of battery. J. Energy Storage 2019, 22, 257–269. [Google Scholar] [CrossRef]
Hossain, A.; Pota, H.R.; Squartini, S.; Abdou, A.F. Modified PSO algorithm for real-time energy management in grid-connected microgrids. Renew. Energy 2019, 136, 746–757. [Google Scholar] [CrossRef]
Zhang, J.; Wang, S. Thermal Load Forecasting Based on PSO-SVR. In Proceedings of the 2018 IEEE 4th International Conference on Computer and Communications (ICCC), Chengdu, China, 7–10 December 2018; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2018; pp. 2676–2680. [Google Scholar]
Li, W.; Li, H. Electrical Load Forecasting Using Echo State Network and Optimizing by PSO Algorithm. In Proceedings of the 2017 10th International Conference on Intelligent Computation Technology and Automation (ICICTA), Changsha, China, 9–10 October 2017; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2017; pp. 394–397. [Google Scholar]
Shen, Y.; Zhang, J.; Liu, J.; Zhan, P.; Chen, R.; Chen, Y. Short-term load forecasting of power system based on similar day method and PSO-DBN. In Proceedings of the 2018 2nd IEEE Conference on Energy Internet and Energy System Integration (EI2), Beijing, China, 20–22 October 2018; Institute of Electrical and Electronics Engineers (IEEE): Bari, Italy, 2018; pp. 1–6. [Google Scholar]
Lahouar, A.; Slama, J.B.H. Day-ahead load forecast using random forest and expert input selection. Energy Convers. Manag. 2015, 103, 1040–1051. [Google Scholar] [CrossRef]
Wu, J.; Chen, X.; Zhang, H.; Xiong, L.; Lei, H.; Deng, S. Hyperparameter optimization for machine learning models based on bayesian optimization. J. Electron. Sci. Technol. 2019, 17, 26–40. [Google Scholar]
Touzani, S.; Granderson, J.; Fernandes, S. Gradient boosting machine for modeling the energy consumption of commercial buildings. Energy Build. 2018, 158, 1533–1543. [Google Scholar] [CrossRef] [Green Version]
MathWorks. Ensemble Algorithms. Available online: https://www.mathworks.com/help/stats/ensemble-algorithms.html (accessed on 29 November 2020).
MathWorks. What Functionality does MATLAB Offer for Gradient Boosting that Is Equivalent to XGBoost. Available online: https://www.mathworks.com/matlabcentral/answers/457195-what-functionality-does-matlab-offer-for-gradient-boosting-that-is-equivalent-toxgboost#:~:text=Gradient%20boosting%20technique%20has%20been%20supported%20in%20MATLAB%20since%20R2011a.&text=b)%20LSBoost%20in%20’fitrensemble’,through%20the%20’LSBoost’%20method (accessed on 29 November 2020).
MathWorks. Understanding Support Vector Machine Regression. Available online: https://www.mathworks.com/help/stats/understanding-support-vector-machine-regression.html (accessed on 29 November 2020).
Qiang, S.; Pu, Y. Short-Term Power Load Forecasting Based on Support Vector Machine and Particle Swarm Optimization. J. Algorithms Comput. Technol. 2018, 13, 2003–2008. [Google Scholar] [CrossRef] [Green Version]
MathWorks. Evaluating Goodness of Fit. Available online: https://www.mathworks.com/help/curvefit/evaluating-goodness-of-fit.html (accessed on 29 November 2020).

Figure 1. Block diagram of equation-based prediction system.

Figure 2. ANFIS structure. (It is a system-built figure, only blue dots are present as only “And” rules are used).

Figure 3. Input membership functions (a) input 1 (b) input 2.

Figure 4. Random forest predictors’ (a) importance, and (b) number of levels.

Figure 5. LSTM model training.

Figure 6. Block diagram of the modified PSO tuned support vector regression-based prediction system.

Figure 7. Block diagram of conventional PSO tuned support vector regression-based prediction system.

Figure 8. (a) Occupant data and (b) day type data used for all the methods during prediction.

Figure 9. Comparison of performance of all the methods used for energy consumption forecasting.

Figure 10. Comparison of the percentage error in the prediction of all the methods for all days.

Table 1. Coefficients of Equation (1) determined by the MLR method.

Condition	C₁	C₂	C₃	C₄
$C D D \geq 17 ℉, O \geq 0, D \geq 0$	0.882	1.499	10.260	0.00
$0 \leq C D D < 17 ℉, O \geq 0, D \geq 0$	1.857	23.054	2.920	−38.64
$H D D \geq 20 ℉, O \geq 0, D \geq 0$	−0.247	10.583	−0.676	6.48
$0.1 \leq H D D < 20 ℉, O \geq 0, D \geq 0$	1.825	5.195	4.092	−19.57

Table 2. Coefficients of Equation (2) determined by the PSO method.

Condition	m	n	C₅	C₆	C₇	C₈
$C D D$ ≥ 17; O ≥ 2; D ≥ 1	0.642	0.906	1.662	6.779	8.723	−4.370
$C D D$ ≥ 17; O ≥ 2; D = 0	0.462	0.389	14.455	11.955	5.289	−1.656
10 ≤ $C D D$ ≤ 16; O ≤ 1; D ≤ 1	0.001	1.356	8.487	10.121	2.624	−8.716
3 ≤ $C D D$ ≤ 9; O ≥ 3; D = 2	0.001	0.695	0.010	9.275	3.164	0.645
$0 \leq C D D$ < 3; O ≤ 2; D ≤ 2	0.368	0.974	2.964	6.079	2.269	−9.368
$H D D$ ≥ 30; O ≥ 0; D ≥ 0	0.253	0.852	8.723	14.206	0.010	−18.75
20 ≤ $H D D$ < 30; O ≥ 0; D ≥ 0	1.043	1.155	1.649	3.158	1.195	−17.32
10 ≤ $H D D$ < 20; O ≥ 0; D ≥ 0	0.534	1.262	1.059	17.918	0.010	1.127
0.1 ≤ $H D D$ < 10; O ≥ 0; D ≥ 0	0.0291	1.2687	0.0100	3.8381	0.0100	3.0354

Table 3. Coefficients of Equation (3) determined by the PSO method.

Condition	p	q	a	C₉	C₁₀	C₁₁	C₁₂
$C D D$ ≥ 17; O ≥ 2; D ≥ 1	0.952	1.056	−6.80	3.070	13.146	−3.545	−6.80
$C D D$ ≥ 17; O ≥ 2; D = 0	0.125	0.992	−10.9	12.784	4.393	−17.94	−10.9
10 ≤ $C D D$ ≤ 16; O ≤ 1; D ≤ 1	0.001	1.375	1.979	12.598	6.295	−10.99	1.979
3 ≤ $C D D$ ≤ 9; O ≥ 3; D = 2	0.001	0.389	0.574	0.010	13.474	20.000	0.574
$0 \leq C D D$ < 3; O ≤ 2; D ≤ 2	1.041	1.304	−2.438	13.281	7.641	10.555	−2.438
$H D D$ ≥ 30; O ≥ 0; D ≥ 0	0.728	1.100	0.407	0.551	9.557	−5.325	0.407
20 ≤ $H D D$ <30; O ≥ 0; D ≥ 0	0.885	1.115	1.330	1.736	5.021	−15.914	1.330
10 ≤ $H D D$ < 20; O ≥ 0; D ≥ 0	0.368	1.137	0.933	6.257	6.409	5.787	0.933
0.1 ≤ $H D D$ < 10; O ≥ 0; D ≥ 0	0.180	1.143	−1.270	11.732	5.839	−10.59	−1.270

Table 4. Tuned parameter for random forest-based system.

Parameter	Value
Maximum Number of Split	55
Minimum Leaf Size	6
Number of Learning Cycles	494

Table 5. Tuned parameter for LSBoost system.

Parameter	Value
Maximum Number of Split	2
Minimum Leaf Size	1
Number of Learning Cycles	395
Learn Rate	0.043934

Table 6. Tuned parameter for LSTM system.

Parameter	Value
Number of Hidden Units	500
Fully Connected Layer	150
Dropout Layer	0.1
Maximum Number of Epoch	250
Minimum Batch Size	3
Initial Learn Rate	0.005
Learn Rate Drop Period	125
Learn Rate Drop Factor	0.2

Table 7. Values of β_opt, b_opt Determined by the PSO Method.

	Conventional SVR	Modified SVR
Parameter	For all T	$T \geq 75$	$65 \leq T < 75$	$50 \leq T < 65$	$T < 50$
β_opt	−0.0050	−0.0050	−0.0050	−0.0050	−0.0050
	0.0011	0.0011	0.0011	0.0011	0.0011
	−0.0018	−0.0018	−0.0018	−0.0018	−0.0018
	−0.0002	0.0349	0.0349	−0.0002	−0.0002
	0.0256	0.0009	0.0009	0.1124	0.0382
	0.0001	0.0001	0.0001	0.0001	0.0122
	0.0242	0.0242	0.0242	0.0242	0.0242
	0.1662	0.1662	0.0008	0.1444	0.0008
	7.2933	10.976	10.4529	6.1048	10.3311
	2.3878	2.3878	2.3878	0.0119	0.0119
	0.8940	0.6272	0.3571	0.8940	0.0498
	0.3268	0.0074	1.4746	1.1093	0.5171
	0.00003	0.00003	0.00003	0.00003	0.00003
	0.0043	0.1285	0.0043	0.0043	0.0043
b_opt	−0.4182	−0.4182	−0.4182	−0.4182	−0.4182

Table 8. Errors of predicted systems.

	Error
	AVG (kWh)	RMSE (kWh)	MAPE (%)
Random Forest	5.09	6.07	35.60
LSBoost	4.79	6.71	22.19
LSTM	5.10	6.52	21.06
ANFIS	2.44	4.21	9.33
Modified SVR	2.37	3.86	12.14
Conventional SVR	4.18	5.60	28.04
Equation (1)	1.72	2.16	7.50
Equation (2)	1.27	2.07	6.05
Equation (3)	1.25	2.17	5.19

Table 9. Average energy consumption and standard deviation under various conditions.

Condition	Average Energy Consumption (kWh)	Standard Deviation	%Reduction
$C D D$ ≥ 17; O ≥ 2; D ≥ 1	43.85	12.73	1.39
$C D D$ ≥ 17; O ≥ 2; D = 0	33.59	6.80	47.32
10 ≤ $C D D$ ≤ 16; O ≤ 1; D ≤ 1	12.31	4.39	65.99
3 ≤ $C D D$ ≤ 9; O ≥ 3; D = 2	38.20	6.15	50.81
$0 \leq C D D$ < 3; O ≤ 2; D ≤ 2	18.35	6.35	50.81
$H D D$ ≥ 30; O ≥ 0; D ≥ 0	38.53	8.06	37.57
20 ≤ $H D D$ < 30; O ≥ 0; D ≥ 0	24.61	7.27	43.69
10 ≤ $H D D$ < 20; O ≥ 0; D ≥ 0	21.24	6.75	47.71
0.1 ≤ $H D D$ < 10; O ≥ 0; D ≥ 0	17.93	8.39	35.01
For all HDD, CDD, O, D	27.15	12.91	-

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alam, S.M.M.; Ali, M.H. Equation Based New Methods for Residential Load Forecasting. Energies 2020, 13, 6378. https://doi.org/10.3390/en13236378

AMA Style

Alam SMM, Ali MH. Equation Based New Methods for Residential Load Forecasting. Energies. 2020; 13(23):6378. https://doi.org/10.3390/en13236378

Chicago/Turabian Style

Alam, S. M. Mahfuz, and Mohd. Hasan Ali. 2020. "Equation Based New Methods for Residential Load Forecasting" Energies 13, no. 23: 6378. https://doi.org/10.3390/en13236378

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Equation Based New Methods for Residential Load Forecasting

Abstract

1. Introduction

2. Proposed Equation Based Prediction Methods

2.1. Parameter Tuning by Multiple Linear Regression (MLR) Algorithm

2.2. Parameter Tuning by Particle Swarm Optimization (PSO) Algorithm

3. Conventional Methods

3.1. Adaptive Neuro Fuzzy Inference System (ANFIS) Based Load Forecasting

3.2. Random Forest Based Load Forecasting

3.3. Gradient Boosting Trees Based Load Forecasting

3.4. LSTM Based Load Forecasting

3.5. Conventional and Modified Support Vector Regression Based Load Forecasting

4. Simulation Results and Discussion

4.1. Simulation Data and Conditions

4.2. Effectiveness of Proposed Equation Based Prediction System over ANFIS, Random Forest, LSBoosting, and LSTM, Modified and Conventional SVR Methods

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI