1. Introduction
Buildings in every shape and size are envisioned to be a huge potential for the efficiency improvement of the power grid, especially because residential and commercial buildings are responsible for about 30- to 40-percent of primary energy consumption worldwide. A rich body of literature for developing high performance energy management systems (EMS) for buildings can be found to achieve significant energy savings [
1,
2,
3]. Among the enabling technologies, load forecasting, in particular, short-term hourly load forecasting [
4,
5], should be the front-end application of the EMS, because it helps to better understand energy behavior and provides the baseline estimate of future real savings, especially under dynamic pricing. It helps analyze the load shape and variability and determine proper controls or demand response under a grid emergency, as well. The central aim of this paper is thus to develop an accurate and robust scheme for predicting the hourly power use of a building.
Several load forecasting techniques have been studied [
6,
7,
8]. These studies can be categorized into three types: regression techniques [
9,
10], artificial neural network (ANN)-related methods [
11,
12,
13] and time series approaches [
14,
15,
16]. In regression techniques, linear representations are applied as the main forecasting function, where mathematical relationships among electrical load demand, weather and exogenous variables are intended. This technique has a test-feasibility and short handling of non-stationary temporal cases as an advantage and disadvantage. The ANN uses historical load and weather data to identify the load model and, therefore, has good approximation capabilities for a wide range of nonlinear models. However, the ANN often converges slowly in training mode and needs to manually determine the network structure and parameters.
In time series techniques, load demands are treated as time series signals. This technique predicts load demand using time series analysis. Among these approaches, stochastic time series techniques have the advantage of finding models with a minimum mean square forecasting error. However, the gradient search-based technique used by the basic stochastic time series (STS) approach is prone to finding local optimal points that build an insufficient forecasting model, because the forecasting error function of this approach possesses multiple minimum points. In summary, this approach is sufficient, but involves computational risk caused by numerical instability.
In the case of buildings, the increase of an accumulated error from load characteristic change, caused by consumption patterns and business and working hour changes, can occur. The
Figure 1 shows that accumulated errors of the fixed model and our proposed model in load forecasting are increased dramatically when the load characteristic is changed.
Figure 1 illustrates a case where the forecasting scheme using a fixed model fails to perform best, and thus, the accumulated error increases dramatically when the load characteristics are changed. It also includes a desired performance obtained through proposed model switching, as detailed in
Section 3. Motivated by the observation that many of the previous forecasting methods using a fixed model do not perform well, as the above dynamic change may occur, we propose an accurate load forecasting technique that combines proven forecasting models with different model structures in order to improve the limited performance of existing techniques. In this research, the autoregressive moving average with exogenous variable (ARMAX) model with temperature as the exogenous variable is selected to represent the load behavior [
13,
17]. Our scheme has two core components: data pattern classification according to weekday and weekend characteristics and model switching in response to the significant change in the accumulated error. The improved robustness and resulting forecasting accuracy should help advance smart grid technologies, including energy management, security analysis, economic dispatch and power scheduling, by forecasting more accurate energy usage.
Figure 1.
Trends of increasing accumulated error.
Figure 1.
Trends of increasing accumulated error.
The remainder of this paper is organized as follows.
Section 2 describes the ARMAX model and Kalman filtering as the forecast model and a tool for estimating parameters of the model. In
Section 3, we propose the scheme for improving the accuracy and robustness by changing the model structure and reducing the accumulated error.
Section 4 provides numerical study results using MATLAB simulations.
Section 5 presents concluding remarks.
2. Background
The load pattern possesses nonlinear and dynamic characteristics, seasonal and diurnal variations and different weather conditions. The ARMAX model selection technique has been extensively studied, because it can describe the relationship between load and extra variables, including weather. In the ARMAX model, the load is denoted as a linear function that has an inaccessible white-noise input and an accessible exogenous input series. The characteristics of the load model are identified based on the variables in the function. The remaining problem becomes finding the proper values of variables in the model, namely parameter estimation.
By adequately constructing the load model and estimating parameters in the function, we can forecast the future load. Thus, an ARMAX-based load model and Kalman filtering for parameter estimation are introduced below as the processes of load forecasting.
2.1. ARMAX-Based Load Model
In this paper, the ARMAX model is used to define the relationship between load and temperature that is regarded as the exogenous variable influencing the load demand, because load demand in a building is susceptible to weather and the number of people in the building. The notation of the ARMAX model for this paper is represented as follows:
where
,
and
are the load, the exogenous variable and the white noise at time
t, respectively;
,
and
are the autoregressive (AR) part, the exogenous input part and the moving average (MA) part. Each part has a back-shift operator,
q and an order parameter and can be represented as follows:
where
, ⋯,
,
, ⋯,
and
, ⋯,
are parameters of the autoregressive part, the exogenous input part and the moving average part; and
n,
m and
r are the AR order, input order and MA order.
In determining the order number of each part, the sample autocorrelation function (ACF), the sample partial autocorrelation function (PACF) and the cross-correlation function (CCF) are employed [
18]. In general, it is challenging to select appropriate model orders, and it is therefore essential to use a technique derived from experience.
In this paper, we modify the ARMAX model for our research. First, we divide the back-shift operator into day and time back-shift operators. Because we concentrate on forecasting energy consumption, and since time and date are important factors in load forecasting, we use the day and the 24 h factor as follows:
where
d and
t are the day and time back-shift operator, respectively; and
j and
k are their respective order. Each order has the following definition:
where
j is a natural number; and
k is a number from 0 to 23. For example,
means that
j is 2 and
k is 7. Thus,
, and it means two days and seven hours ago. Second, we define the load, the exogenous variable and the noise term. For the load term, we use the electrical consumption (kWh) of building in an hour. The exogenous variable is defined as outdoor temperature (
). The last term (noise) is assumed to be white Gaussian noise.
2.2. Kalman Filter for the Parameter Estimation
This research briefly reviews the recursive discrete Kalman filter used for estimating parameters of the ARMAX model in line with the algorithm development. Details of the Kalman filtering approach to estimating parameters can be found in [
19,
20].
In order to define Kalman filtering, we have to consider the following discrete state equations:
where
,
,
,
,
and
are vectors of
system states, the
dimensions of the state transition matrix,
measurement vectors, the
output matrix,
system error and
measurement error, respectively. The noise vectors,
and
, are drawn from white Gaussian noise that has a mean of zero and no time correlation, as shown below.
and
, covariance matrices, are defined as follows:
Given the
a priori estimate of the state vector,
, and its error covariance matrix,
, we set
and then apply the basic Kalman filter algorithm to estimate the next state by recursively computing the following equations:
where
is the Kalman gain. In this Kalman filter algorithm, it is important to choose an
a priori estimate of the state
and its covariance error
, because an intelligent choice improves the accuracy and decreases the computational complexity of the algorithm. A few measurement vector samples can be considered as initial values for
and
as follow:
For our model, the discrete state equations in Equations (
5) and (
6) are defined for our forecasting model as follows:
The state transition matrix, , is a constant identity matrix;
The error covariance matrices, and , are constant identity matrices;
The state vector,
, has some parameters based on Equation (
2);
The time-varying output matrix, , is derived from the load demand and temperature.
The observation value,
, represents the load at time
k.
takes the following form, defined by Equations (
1–
3):
3. Enhanced Robustness of the Proposed Load Forecasting Scheme
Figure 2 shows the six steps of the proposed robust load forecasting. The first important step is data pattern classification. If the input data are well classified, we can determine a suitable model and reduce forecasting error. The second step, model selection, determines the structure of the forecasting model, model order and load and weather factor using the ARMAX model. The third step is model parameter estimation, during which optimal parameters of the forecasting model are estimated using the Kalman filter and database of past and present load and weather data. The third step leads to a forecast of the load at the next instant of time in the fourth step. The fifth and sixth steps are feedback processes.
Figure 2.
Steps for proposed load forecasting scheme.
Figure 2.
Steps for proposed load forecasting scheme.
In the fifth process, an accumulated error is obtained by summing up the absolute errors. If it exceeds the reasonable threshold value, K, the sixth step, model-switching scheme (MSS), is executed to replace the model structure. Through this process, we can respond to an increase of accumulated error and reduce total accumulated error. If it does not exceed the threshold value, the fourth step is executed. In this section, we describe data pattern classification for identifying the load model based on day characteristics and a model switching scheme for enhancing forecasting accuracy.
3.1. Data Pattern Classification for Selecting the Load Model
The main purpose of defining a forecasting model is to determine its order and the variables that have effects on the load. As mentioned in
Section 2, we use the ARMAX model, which depends on load and temperature. Thus, we collected hourly load data for a building in March 2012, as well as temperature data, in order to reflect day characteristics of input data. Each data point is categorized into two databases, one for the weekday and the other for the weekend.
Figure 3 and
Figure 4 show the load shape over 24 hours of weekdays and weekends in the building. These data were collected by Korea Energy Management Corporation (KEMCO) during March 2012 in Korea. As shown in
Figure 3 and
Figure 4, the graph shape differs for weekdays and weekends. Because people usually work on weekdays, the load characteristic of weekdays differs from that of the weekend: this research does not consider the holiday case on account of its irregularity. Temperature is also an influential factor. Based on the tendencies of loads due to weather and time, the input data sequences are divided as in
Figure 5.
Figure 3.
Sample of hourly electrical demand during the weekday.
Figure 3.
Sample of hourly electrical demand during the weekday.
Figure 4.
Sample of hourly electrical demand during the weekend.
Figure 4.
Sample of hourly electrical demand during the weekend.
Figure 5.
Sample of hourly electrical demand for i weeks.
Figure 5.
Sample of hourly electrical demand for i weeks.
Thus, in the weekday case, this model uses data, including days from Monday to Friday, such as
. Similarly, the weekend case uses data composed only of Saturday and Sunday:
. The value
i of the data sequence represents the
ith week data. Also, each datum has a 24-hourly load datum. In this data pattern classification, we reflect the characteristic of two data by using a general load model. By using Equations (
1) and (
2), this general model can be expressed as:
where at any instant,
t,
is the load;
is the temperature and
is the noise;
is the previous load at time;
, where
j is a day and
k is a time; Similarly,
and
are the previous temperature and noise at time,
, respectively. This basic load model illustrates the characteristics of weekday and weekend cases by using the categorized input data. In this paper, we define the order of the load model by the following equation:
Each term of the equation is labeled as follows: is the forecasted load at the next step; is the load value one hour before on the same day; is the load during the same hour 7 days prior; is the load one hour before the hour of 7 days prior; , and follow the same rule of load term. The last term, , represents total noise.
3.2. Division of Input Data Sequences
Most previous studies of load forecasting only dealt with estimating parameters of the load structure. Hence, this aspect tends to face increasing accumulated error caused by changes of input data characteristics. To overcome this problem, we substitute a candidate model for the current model when accumulated error exceeds a threshold value in our model. Therefore, candidate models reduce accumulated error as a back-up model. The candidate load models used in this paper and those characters are shown in
Table 1. Finding the proper threshold value is important for defining scheme characteristics. Too small of a threshold may cause unnecessary model switching, but too large of a value may not provide the desired robustness to the behavioral change in load in time. The determination of the optimal threshold value should be different for different cases and may require considerable experience. However, it may be advised to set a value with some margin, calculated when reasonably accurate forecasting performance is observed during the initial calibration period. In this research, this value is assumed to be 250 kWh. The proposed scheme also provides two operation modes in order to help determine the appropriate model. The first is an initial mode and the second is an executing mode. Details of these modes are presented in the following subsections.
Table 1.
Candidate load models.
Table 1.
Candidate load models.
Name | Model | Note |
---|
Candidate 1 | | Use data, an hour ago |
Candidate 2 | | Use data, a week ago |
Candidate 3 | | Use data, an hour and two hours ago |
3.2.1. Initial Mode
The initial mode of MSS aims to provide a new structure quickly. When accumulated error exceeds a threshold value, there are not enough data to select the best candidate model. Because it needs two weeks at least in order to obtain sufficient data, MSS activates training mode to reduce accumulated error until the system collects sufficient load data. As shown in
Figure 6, when this mode activates, our system generates estimated load data based on past data and chooses a model randomly or empirically among a set of models. Then, the basic model for load forecasting is substituted by the structure that was selected from the candidate models. Our scheme then forecasts the next time load using this model. Although there is a possibility that the accumulated error of the selected model again reaches the critical value, the initial mode plays a role in the lack of load data.
Figure 6.
Process of the initial mode.
Figure 6.
Process of the initial mode.
3.2.2. Executing Mode
While the purpose of the initial mode is to provide a model quickly in the case of a lack of load data, the executing mode aims to select the most accurate of the candidate structures by using sufficient data. When this mode activates, the system starts collecting sufficient load data to estimate the model coefficient for two weeks. This mode then simulates all candidates by using collected data, as shown in
Figure 7. Then, our system selects the candidate that has a minimum error. Finally, the model with the least accumulated error during this period replaces the current forecasting model. Though it takes a longer time when we have more models, this can provide the best one.
Figure 7.
Process of accurate mode.
Figure 7.
Process of accurate mode.
5. Conclusions
To improve the accuracy and robustness of the load forecasting, this paper has presented a new load forecasting strategy incorporating data pattern classification and automatic model switching and demonstrated the enhanced performance through case studies using the real energy consumption data. Specifically, this research improved accuracy by processing the input load data in terms of daily characteristics and reinforced the robustness against the structural bias error due to any change in electricity consumption pattern in the building by allowing the forecast model change. The proposed strategy has adopted ARMAX models with the Kalman filter for estimating their parameters, because they are easily implementable on top of their proven performance in time series analysis. However, it is worth noting that the forecasting model is not limited to the ARMAX models, as this study has investigated. Several candidate models with complementary structures should work with flexible and scalable architecture and interfaces, allowing for seamless transition from one model to the other in order to improve the accuracy and robustness. Accuracy and robustness against any uncertainty this research provides should help understand the energy behavior of buildings and enable the potential savings through proper load controls and demand responses, which should contribute to the cost-effective operation and stabilization of an entire generation and distribution systems, as well.