Next Article in Journal
Effects of Clay Minerals and External Pressures on Imbibition in Shales
Previous Article in Journal
Volatility and Dispersion of Hourly Electricity Contracts on the German Continuous Intraday Market
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of Baseline Load Forecasting Methodologies for Active and Reactive Power Demand

1
School of Social Sciences, Humanities and Law, Teesside University, Middlesbrough TS1 3BX, UK
2
School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough TS1 3BX, UK
3
ASM Terni S.p.A., 05100 Terni, Italy
*
Author to whom correspondence should be addressed.
Energies 2021, 14(22), 7533; https://doi.org/10.3390/en14227533
Submission received: 5 July 2021 / Revised: 28 October 2021 / Accepted: 9 November 2021 / Published: 11 November 2021
(This article belongs to the Topic Innovative Techniques for Smart Grids)

Abstract

:
Forecasting the electricity consumption is an essential activity to keep the grid stable and avoid problems in the devices connected to the grid. Equaling consumption to electricity production is crucial in the electricity market. The grids worldwide use different methodologies to predict the demand, in order to keep the grid stable, but is there any difference between making a short time prediction of active power and reactive power into the grid? The current paper analyzes the most usual forecasting algorithms used in the electrical grids: ‘X of Y’, weighted average, comparable day, and regression. The subjects of the study were 36 different buildings in Terni, Italy. The data supplied for Terni buildings was split into active and reactive power demand to the grid. The presented approach gives the possibility to apply the forecasting algorithm in order to predict the active and reactive power and then compare the discrepancy (error) associated with forecasting methodologies. In this paper, we compare the forecasting methodologies using MAPE and CVRMSE. All the algorithms show clear differences between the reactive and active power baseline accuracy. ‘Addition X of Y middle’ and ‘Addition Weighted average’ better follow the pattern of the reactive power demand (the prediction CVRMSE error is between 12.56% and 13.19%) while ‘Multiplication X of Y high’ and ‘Multiplication X of Y middle’ better predict the active power demand (the prediction CVRMSE error is between 12.90% and 15.08%).

1. Introduction

The rise in the quantity and diversity of electronic devices makes linking active and reactive power demand harder for every customer [1]. For such a reason, separate active and reactive power forecasting is a viable option. Every electrical grid uses a different approach to predict the electrical consumption. This paper will present popular algorithms used at present, and compare their accuracy, while at the same time showing the contrast between the active power and reactive power forecasting. Hence, we try to contribute to the scientific knowledge of efficient grid management, improving the prediction capacity of reactive power in the grid which facilitates exploitation of Distributed Energy Resources (DER) and integration of the renewable energy sources (RES) into the grid [2].
Forecasting methodologies are based on information from a single variable; that is, forecasting does not need standard deviation calculations that indicate the importance of each variable (weight) in the objective function, as is the case with regression methods, parameter estimation, or data reconciliation [3]. As such, consistency in calculation of the errors can be very important for comparing various algorithms [4] because it works like an indicator that is measured consistently. Additionally, coupling this with some sort of robust filtering like Hampel’s X84 rule [5,6].
The basic algorithm used in the electrical grids to generate Customer Baseline Load (CBL) is ‘X of Y’ methodology, used by, e.g., PJM ISO [7] or CAISO [8], New England ISO (ISONE) employs weighted average to forecasting [9], whereas comparable day and regression are used for short-term power forecasting of wind farms [10,11].
Numerous studies try to improve the baseline accuracy through innovative ideas, different methods, approaches, and assumptions to generate CBL [12]. Table 1 shows some of the relevant previous studies with obtained prediction errors.
While several studies presented in Table 1 used Mean Absolute Percentage Error (MAPE) to estimate the CBL predication error, some experts do not recommend using MAPE [18]. According to the International Performance Measurement and Verification Protocol (IPMVP), the best indicator for a proper baseline error estimation is Coefficient of Variation Root Mean Square Error (CVRMSE) with an established threshold for a successful baseline as CVRMSE < 20% when considering hourly energy use [19].
In addition to the studies listed in Table 1, projects using Artificial Neural Networks (ANN) resulted in more accurate forecasting, but strongly were dependent upon the type of buildings and inputted data quality [20]. We also did not apply Support Vector Regression (SVR) because the method to measure the error in this method is not consistent (the training error and the error of when performing the data validation are different in SVR [21]) with the other methods studied in this document, which would not give us the possibility of comparing SVR with the other methods shown [22].

2. Materials and Methods

This research examined electricity demand in 36 buildings situated in Terni (Italy) during the years 2016, 2017, and 2018. The data were collected by the local Distribution System Operator (DSO) ASM Terni S.p.A.; data were gathered by means of Advanced Metering Infrastructure (AMI) installed at the customers’ locations. With respect to the dataset analyzed in this paper, DSO was identified a group of customers for which data were collected every 15 min; it is worth noting that the Italian Authority prescribes to collect only a monthly aggregated measure.
The implemented procedure used the consumption over the previous days to predict the electricity demand in a window event of 30 min. The algorithms used to generate the baselines were based on the most popular statistical methods implemented in the actual electricity market worldwide: ‘X of Y days’, comparable day, weighted average, regression, and comparable day with addition or multiplication adjustments [23]. In order to analyze the data, a ‘Matlab’ script was developed. The script reads the data in ‘.xlsx’ format, generates the baselines for active and reactive power demand, and calculates the errors between the baseline and actual consumption. Mean Absolute Percentage Error (MAPE) [23] and Coefficient of Variation of the Root Mean Square Error (CVRMSE) [24] were calculated.
The baselines can be generated half an hour in advance, or one day in advance of the desired time window. Table 2 shows the inputs and outputs of the applied baseline generation methods. In some cases, the input is a previously generated baseline.

2.1. X of Y

‘X of Y’ is a statistical methodology which uses ‘X’ of the ‘Y’ days before an event. Two alternative approaches are possible: ‘X of Y middle’ and ‘X of Y high’. In the former, ‘X’ takes the middle values of the ‘Y’ quantities, while the latter excludes the lowest (Y-X) values. Usually, the ‘X of Y middle’ is more accurate while ‘X of Y high’ is a more conservative approach [25].
Equations (1) and (2) show the algorithms used to generate the baselines ‘X of Y middle’ and ‘X of Y high’ [26]. Note that subscript t here denotes timestep during the day. In the case of ‘X of Y middle’, the ‘X’ values will be those closest to the average value of the ‘Y’ values, while in the case of ‘X of Y high’ the ‘X’ values will be those closest to the highest value of the ‘Y’ values.
Equation (1) Baseline X of Y middle [26].
b t = 1 X d M i d X , Y d = X l d , t
Equation (2) Baseline X of Y high [26].
b t = 1 X d H i g h X , Y d = X l d ,   t
‘X of Y’ algorithms are used by Pennsylvania–New Jersey–Maryland interconnection (PJM), California Independent System Operator (CAISO), or New York Independent System Operator (NYISO) [26]. A study recommends that (0.4) ≤ (X/Y) ≤ (0.8) and that no data older than 60 days is used to generate the baseline as the weather change can affect the accuracy of the prediction [27]. For ‘X of Y’ algorithms, the current project took into consideration the demand data over a 5 week long period preceding each date and 0.6 ≤ (X/Y) ≤ 0.8, with Y = 5 and X = 3 or X = 4. Such choice is based upon a study [25] that recommended that the electricity demand data used to generate a baseline should be between 30 and 60 days. We selected 5 weeks (35 days) due to weather conditions, i.e., in order to minimize the impact of the changing seasons on predictions. The same study [25] used both Y = 5 and Y = 10. We chose Y = 5 because the number of previous days we selected (35) is closer to 30 than to 60. In the case when a period of time closer to 60 days would be selected, Y = 10 would be preferred.

2.2. Weighted Average

The weighted average algorithm awards more weight to the days further away from the event. The approach gives greater accuracy when there are many irregularities during the day closest to the event [28]. Equations (3) and (4) are used to generate the weighted average baseline [10].
Equation (3) Weighted average (1) [10].
W t = 1 X 1 d = 1 d = n 1 l d , t
Equation (4) Weighted average (2) [10].
b t =   1 γ W t + γ l n , t
In our study, value for the factor γ was taken as 0.9 according to recommendations [26], while the value of n = 5 was chosen for consistency with the ‘X of Y’ methodologies.

2.3. Comparable Day

The comparable day approach is simpler to apply, because it does not use any equation; the approach consists of generating a baseline with information about a ‘similar day’. While ‘similar’ is usually a subjective term, it can be determined by the weather, or day of the week, for example. The temperature is measured and used to forecast the demand using a previous day with a similar temperature; similarly, measurements of rain or snow fall may be considered [11]. Alternatively, our research took into consideration demand a week before an event to generate a new baseline. For example, to predict the electrical demand on a Monday, the data of the previous Monday were used.

2.4. Simple Regression

Simple regression makes a modification to the baseline (b1) and a more precise baseline (b2) using the temperature and the temperature historical data.
Equation (5) Simple regression.
ρ d = X i = 1 d 1 l i T i i = 1 d 1 l i i = 1 d 1 T i X i = 1 d 1 T i 2 i = 1 d 1 T i 2
Equation (6) Simple regression.
b 2 t = ρ d T t * T t + b 1 t
Equation (7) Simple regression.
T t = n = 1 X T n , t * X

2.5. Adjustments

Adjustments take a previously generated baseline and make an adjustment in the window before an event (half-hour in our research). Three types of adjustments were used in the current research, multiplication adjustment (Equations (8) and (9)), addition adjustment (Equations (10) and (11)), and linear regression adjustment (Equations (12)–(14)) [16].

2.5.1. Multiplication and Addition Adjustments

The multiplication and addition adjustment have the same concept: take a previously generated baseline and use a factor to reduce the error. In the case of multiplication adjustment, the factor multiplies the baseline, while in the case of addition adjustment the factor adds to or subtracts from the baseline. In our study, after making the adjustments, a subsequent physical analysis was undertaken, and we decided to specify zero baseline if the adjusted baseline took negative values. However, our approach is unable to predict negative demand, e.g., in the case when consumers generate electricity.
Equation (8) Multiplication factor.
M t = i = 1 t 1 l r i i = 1 t 1 b i
Equation (9) Multiplication adjusted Baseline.
b 2 t = M t b t
Equation (10) Addition factor.
S t = 1 t 1 i = 1 t 1 l r i b i
Equation (11) Addition adjusted baseline.
b 2 t = b t + S t

2.5.2. Linear Regression Adjustment

Linear regression is the most comprehensive method to forecast the demand because it takes into consideration the weather. Regression is used to predict the ‘total consumption’ during the day but is not the best way to predict the consumption in a short event window of half an hour [29]. Equations (6) through (8) are used to generate the regression baseline. The main difference between linear regression and simple regression methods is that simple regression does not use temperature, while linear regression does.
Equation (12) Linear regression factor [16].
ρ d , t = X d = 1 n l d , t T d , t d = 1 n l d , t d = 1 n T d , t X d = 1 n T d , t 2 d = 1 n T d , t 2
Equation (13) Linear regression baseline [16].
b 2 t = ρ d , t T t * T m d , t + b t
Equation (14) Average temperature [16].
T m d , t = i = 1 t T i t

2.6. Short Term Load Forecasting for Baseline Assessment

The electrical demand depends on numerous factors, e.g., temperature, solar radiation, rainfall, occupancy, etc. In order to compare the different algorithms, the current study calculated two baseline prediction errors: Mean Absolute Percentage Error (MAPE) and Coefficient of Variation Root Mean Square Error (CVRMSE). MAPE is the most widely used method to calculate the error [23], clearly showing the gap between the real and predicted values. However, MAPE exhibits drawbacks if the real value is zero or close to zero. In such cases, CVRMSE can prevent divergences [3]. CVRMSE is harder to calculate but does not have problems with the values close to zero. Equation (13) is used to calculate MAPE, while Equations (14) and (15) are used to calculate CVRMSE.
Equation (15) MAPE error.
M A P E = 100 N b t = 1 N b b t l t l t
Equation (16) CVRMSE error.
C V R M S E = 100 b m t = 1 N b b t l t 2 N b
Equation (17) Mean baseline value.
b m = t = 1 N b b t N b

2.7. Assessment Average

Our study analyzed different buildings and numerous methodologies. The results are presented as an average for each methodology following Equation (16) (tb is the number of buildings—normally 36—but in some cases can be less because of missing data to generate some of the baselines).
Equation (18) Average error.
A v e r a g e _ E r r o r = 1 t b i = 1 t b E r r o r i
An additional parameter considered in this study is the number of buildings in which each method exhibits the best results (with the lowest error). Such parameter is shown in Tables 3–6 in the Min columns, whereas the opposite counts of the buildings reporting the worst performance of a particular method are presented in the Max columns.

2.8. Data Normalization

Data normalization typically treats the development of clean data; in order to do this we used MATLAB. Buildings 9 and 36 did not have any indication of the suitable baseline generation methods, as insufficient data were available to generate baselines for them. Such a result may be due to problems with the meters. In building 32, consumption was zero for a large part of the 3 years resulting in mostly zero baseline. We considered such a case as an exception and omitted the results of building 32 from Table 7. The zero consumption could be due to the specific use of the building, e.g., as a warehouse/storage or holiday home, or due to malfunction of the equipment used to measure the electricity consumption.

3. Results and Discussion

For this study we had 3 years of data, therefore, it was very difficult graphically to see the baseline and the different methods in such a long period of time. For that reason, we included graphs for one week, one graph for active power baselines (Figure 1), and one for reactive power baseline (Figure 2). However, we followed the IPMVP [19] recommendations and used CVRMSE as principal indicator, and we complemented it with MAPE because it is a common indicator and easy to understand.

3.1. Active Power Baseline

Table 3 shows the average MAPE for all buildings. The low error average was obtained for the methodologies with multiplication and addition adjustment. The multiplication adjustment with ‘X of Y high’ has the overall lowest average error over the considered years ranging between 9.95% and 12.97% annually. Baselines generated without adjustment and using linear regression have a similar average error. The simple regression baselines have the highest average error of 34.57% in 2018. Columns ‘Min’ and ‘Max’ in Table 3 show how many buildings have low and high average errors using each of the considered methods. The normal simple regression has the high average error and at the same time is the most common method producing high errors.
The number of minimum and maximum values does not necessarily have to match the number of considered buildings (36), as insufficient electricity demand data for a particular period would prevent baseline generation, meaning that the total number of baselines would be less than 36. Another possibility is that, for the same building, multiple methods might have the same maximum or minimum errors, increasing the number of minimum or maximum counts.
Table 4 shows the average CVRMSE for all buildings and the number of buildings with the best and the worst averages for each method. In this case, baseline calculation methods with the highest and the lowest CVRMSE do not match those with the highest number of extreme error values. The lowest CVRMSE is for ‘Addition Weighted average’, ranging between 12.56% and 26.83%, while the same method also has the highest number of buildings with low errors overall during the considered 3-year period.
Depending on the method used to calculate the error, the best baselines are using multiplication setting according to the MAPE calculation or the methods with addition adjustment according to the CVRMSE calculation. Comparatively, MAPE errors were mostly lower than CVRMSE for all prediction methods.

3.2. Reactive Power Baseline

Table 5 shows the MAPE calculation for the reactive power baselines. The algorithm with the best average error is the ‘Addition X of Y middle’, while ‘Multiplication X of Y high’ has the highest number of buildings with the lowest average errors during the overall 3-year period, despite having approximately 4% higher average error each year.
Table 6 shows the CVRMSE calculation where ‘Addition Weighted average’ and ‘Addition X of Y high’ have the same average errors. Baseline calculation methodology with the highest number of buildings with the lowest average error during the overall 3-year period is ‘Addition Weighted average’.
The worst methodology to generate a baseline of reactive power demand is ‘Normal simple regression’ which has the highest average error and the highest number of buildings with high errors (in both cases MAPE and CVRMSE). Comparatively, CVRMSE errors were mostly lower than MAPE for all prediction methods.

3.3. Comparison between Active and Reactive Power Baseline (2016, 2017, and 2018)

Table 7 shows which algorithm is more accurate for each building. The building ID numbers are presented in the first column, while the other columns correspond to considered baseline generation methods. Each row corresponds to a single building, where A was placed for the baseline calculation method with the lowest MAPE for the active power, and R for the method with the lowest CVRMSE for the reactive power.
In nine buildings, the methods with the lowest error are the same but in the other 25 cases they are not coincident. In only 36% of the cases, we obtained the lowest set by applying a single method to predict active and reactive power; in the rest of the cases it would be better to apply different methods.

4. Conclusions and Future Work

Electrical grids need an upgrade in order to include the innovative technologies of electricity generation (wind turbines, solar panels) and the Distributed Energy Resources (DER). To control a process, it is necessary to understand and predict its main component factors, which in the case of the electrical grid, are the customers and generators. The presented research focuses on prediction of the customers’ characteristics, separating the demand in active and reactive power components for each building to improve the accuracy of demand prediction. The main limitation is the sample profile as all the buildings are located in Terni, Italy, in the Italian electrical market. We recommend that future research should consider other electrical markets subject to different usage patterns. While the used sample of selected buildings provided a representation of building performance, increasing the sample size may allow deeper analysis of singular cases such as the buildings with very rare or occasional electricity consumption.
When talking about active power, MAPE calculation is more important because the active power demand values are usually nonzero. For the reactive power, CVRMSE is more significant, because the reactive power demand can be zero, which makes MAPE calculations inaccurate. Consequently, the lowest error resulted from application of ‘Addition X of Y middle’ and ‘Addition Weighted average’ in prediction of reactive power (the forecasting error is between 12.9% and 15%), while ‘Multiplication X of Y high’ and ‘Multiplication X of Y middle’ provided better prediction of active power demand (the forecasting error is between 9.9% and 12.9%). However, the accuracy of the statistical methods used to generate baselines in this investigation also depended on the building to which they were applied. Better results were obtained by applying different baseline calculation methods to predict the demand for active and reactive power in different years.
Additional research is necessary to verify these findings on a larger customer dataset. In future research, it would be also interesting to investigate the accuracy of prediction methods for active and reactive power in the same building from one day to another, and whether it is better to apply different baseline forecasting methods on specific days. The recommendation would be to take a single building and explore the algorithms studied day by day to see how error varies and whether it is possible to determine which method provides more accurate results on particular days of the week/month/year.

Author Contributions

Conceptualization, E.S.; methodology, E.S. and V.V.; software, E.S.; validation, E.S.; formal analysis, E.S. and V.V.; investigation, E.S. and V.V.; resources T.B.; data curation, E.S. and T.B.; writing—original draft preparation E.S.; writing—review and editing, E.S. and V.V.; visualization, E.S. and V.V.; supervision V.V.; project administration, V.V.; funding acquisition, V.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partly funded by EU’s Horizon 2020 framework programme for research and innovation under grant agreement No 774478, project eDREAM—enabling new Demand REsponse Advanced, Market oriented and secure technologies, solutions and business models.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Nomenclature

S t Addition factor
b 2 t Adjusted baseline generated
ANNArtificial Neural Network
T m d , t Average temperature for a day (d)
b m Average value of the baseline
b t Baseline generated
CBLCustomer Baseline Load
CVRMSE Coefficient of Variation Root Mean Square Error (%)
CAISOCalifornia Independent System Operator
n Day before to the event bounded within X and Y
d Day number bounded within X and Y
l d , t Historical load measured
IPMVPInternational Performance Measurement and Verification Protocol
ISOIndependent System Operator
MAPE Mean Absolute Percentage Error (%)
M t Multiplication factor
N b Number of steps in the baseline
PJMPennsylvania, New Jersey, and Maryland
ρ d Regression parameter
T i Temperature historical data
T i * Temperature measured in the moment
XThe quantity of days for the method ‘X of Y’
t Time step during the day (half-hour in the current study, i.e., between 1 and 48)
t b Total number of buildings
W t Weighted parameter
γ Weighting factor

References

  1. Amber, K.; Aslam, M.; Hussain, S. Electricity consumption forecasting models for administration buildings of the UK higher education sector. Energy Build. 2015, 90, 127–136. [Google Scholar] [CrossRef]
  2. Jiayi, H.; Chuanwen, J.; Rong, X. A review on distributed energy resources and MicroGrid. Renew. Sustain. Energy Rev. 2008, 12, 2472–2483. [Google Scholar] [CrossRef]
  3. Di Martino, F.; Loia, V.; Sessa, S. Multi-species PSO and fuzzy systems of Takagi–Sugeno–Kang type. Inf. Sci. 2014, 267, 240–251. [Google Scholar] [CrossRef]
  4. de Menezes, D.; Prata, D.; Secchi, A.; Pinto, J. A review on robust M-estimators for regression analysis. Comput. Chem. Eng. 2021, 147, 107254. [Google Scholar] [CrossRef]
  5. Hampel, F.R. The breakdown points of the mean combined with some rejection rules. Technometrics 1985, 27, 95–107. [Google Scholar] [CrossRef]
  6. Chaudhary, N.; Chowdhury, D.R. Data preprocessing for evaluation of recommendation models in E-commerce. Data 2019, 4, 23. [Google Scholar] [CrossRef] [Green Version]
  7. Walawalkar, R.; Blumsack, S.; Apt, J.; Fernands, S. An economic welfare analysis of demand response in the PJM electricity market. Energy Policy 2008, 36, 3692–3702. [Google Scholar] [CrossRef]
  8. Goodin, J. California Independent System Operator demand response & proxy demand resources. In Proceedings of the 2012 IEEE PES Innovative Smart Grid Technologies (ISGT), Washington, DC, USA, 16–20 January 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 1–3. [Google Scholar]
  9. Ma, F.; Luo, X.; Litvinov, E. Cloud Computing for Power System Simulations at ISO New England—Experiences and Challenges. IEEE Trans. Smart Grid 2016, 7, 2596–2603. [Google Scholar] [CrossRef]
  10. Yang, X.; Ju, R.; Yuan, Z.; Zhang, Z. Short-term Power Forecasting Method and Application of Wind Farm. MATEC Web Conf. 2018, 232, 04001. [Google Scholar] [CrossRef]
  11. Ciulla, G.; D’Amico, A. Building energy performance forecasting: A multiple linear regression approach. Appl. Energy 2019, 253, 113500. [Google Scholar] [CrossRef]
  12. Ruiz, G.R.; Bandera, C.F. Validation of Calibrated Energy Models: Common Errors. Energies 2017, 10, 1587. [Google Scholar] [CrossRef] [Green Version]
  13. Won, J.-R.; Yoon, Y.-B.; Choi, J.-H.; Yi, K.-K. Case study of customer baseline (CBL) load determination in Korea based on statistical method. In Proceedings of the 2009 Transmission & Distribution Conference & Exposition: Asia and Pacific, Seoul, Korea, 26–30 October 2009; IEEE: Piscataway, NJ, USA, 2010; pp. 1–4. [Google Scholar]
  14. Escobar, E.; Cadena, A.; Correal, M.E.; Duran, H. Model to calculate the customer base-line for a demand response program in the colombian power market. In Proceedings of the 32nd Annual IAEE Conference “Energy, Economy, Environment: The Global View”, San Francisco, CA, USA, 21–24 June 2009; pp. 1–10. [Google Scholar]
  15. Zhou, X.; Gao, Y.; Yao, W.; Yu, N. A Robust Segmented Mixed Effect Regression Model for Baseline Electricity Consumption Forecasting. J. Mod. Power Syst. Clean Energy 2020, 1–10. [Google Scholar] [CrossRef]
  16. Song, T.; Li, Y.; Zhang, X.-P.; Li, J.; Wu, C.; Wu, Q.; Wang, B. A Cluster-Based Baseline Load Calculation Approach for Individual Industrial and Commercial Customer. Energies 2019, 12, 64. [Google Scholar] [CrossRef] [Green Version]
  17. Effinger, J.; Effinger, M.; Friedman, H. Overcoming barriers to whole building M&V in commercial buildings. In Proceedings of the ACEEE Summer Study on Energy Efficiency in Buildings, Pacific Grove, CA, USA, 12–17 August 2012. [Google Scholar]
  18. Tofallis, C. A better measure of relative prediction accuracy for model selection and model estimation. J. Oper. Res. Soc. 2015, 66, 1352–1362. [Google Scholar] [CrossRef] [Green Version]
  19. IPMVP Committee. International Performance Measurement and Verification Protocol: Concepts and Options for Determining Energy and Water Savings, Volume I; (No. DOE/GO-102001-1187; NREL/TP-810-29564); National Renewable Energy Laboratory: Golden, CO, USA, 2001.
  20. Yang, Y.; Meng, Y.; Xia, Y.; Lu, Y.; Yu, H. An efficient approach for short term load forecasting. In Proceedings of the International Multiconference of Engineers and Computer Scientists, Hong Kong, China, 16–18 March 2011. [Google Scholar]
  21. Dai, W.; Chuang, Y.-Y.; Lu, C.-J. A Clustering-based Sales Forecasting Scheme Using Support Vector Regression for Computer Server. Procedia Manuf. 2015, 2, 82–86. [Google Scholar] [CrossRef] [Green Version]
  22. Chen, Y.; Xu, P.; Chu, Y.; Li, W.; Wu, Y.; Ni, L.; Bao, Y.; Wang, K. Short-term electrical load forecasting using the Support Vector Regression (SVR) model to calculate the demand response baseline for office buildings. Appl. Energy 2017, 195, 659–670. [Google Scholar] [CrossRef]
  23. AleaSoft Energy Forecasting. European Electricity Markets Panorama: Italy. 2018. Available online: https://aleasoft.com/european-electricity-markets-panorama-italy (accessed on 25 July 2019).
  24. Taylor, J.W. Triple seasonal methods for short-term electricity demand forecasting. Eur. J. Oper. Res. 2010, 204, 139–152. [Google Scholar] [CrossRef] [Green Version]
  25. EnerNOC Utility Solutions. Energy Baseline Methodologies for Industrial Facilities; Northwest Energy Efficiency Alliance: Portland, OR, USA, 2013. [Google Scholar]
  26. Mohajeryami, S.; Doostan, M.; Asadinejad, A.; Schwarz, P. Error Analysis of Customer Baseline Load (CBL) Calculation Methods for Residential Customers. IEEE Trans. Ind. Appl. 2016, 53, 5–14. [Google Scholar] [CrossRef]
  27. Križanič, F.; Oplotnik, Z.J. Analysis of the Energy Market Operator Activity in Eight European Countries. Int. J. Energy Econ. Policy 2014, 4, 716–725. [Google Scholar]
  28. Lora, A.T.; Santos, J.M.R.; Expósito, A.G.; Ramos, J.L.M.; Santos, J.C.R. Electricity market price forecasting based on weighted nearest neighbors techniques. IEEE Trans. Power Syst. 2007, 22, 1294–1301. [Google Scholar] [CrossRef] [Green Version]
  29. Tsekouras, G.; Dialynas, E.; Hatziargyriou, N.; Kavatza, S. A non-linear multivariable regression model for midterm energy forecasting of power systems. Electr. Power Syst. Res. 2007, 77, 1560–1568. [Google Scholar] [CrossRef]
Figure 1. Active power baseline building 3.
Figure 1. Active power baseline building 3.
Energies 14 07533 g001
Figure 2. Reactive power baseline building 3.
Figure 2. Reactive power baseline building 3.
Energies 14 07533 g002
Table 1. Overview of relevant CBL prediction errors in literature.
Table 1. Overview of relevant CBL prediction errors in literature.
ReferenceDescriptionError
[13]The authors completed short term forecasting, for 10 industrial customers in Korea.MAPE 2.8–26%
[1]The authors examined daily electricity consumption of an administration building located at the Southwark campus of London South Bank University in London.RMSE 34%
[14]The authors undertook short term forecasting for eight users and implemented the baselines in a demand response program.Average real error 1.15–43%
[15]The authors completed short time load forecasting in buildings blocks situated in California.MAPE 10.88–17.97%
[11]The authors simulated 195 scenarios of a non-residential building.MAPE 22–40%
[16]The data used in this paper were collected from 162 industrial and commercial customers.Overall Performance Index 6–23.8%
[17]The data used in this paper were collected from three commercial buildings in EEUU.CVRMSE < 20%
Table 2. Inputs and outputs of baseline methodologies.
Table 2. Inputs and outputs of baseline methodologies.
TypeInputOutput
X of YLoad demand historicalBaseline with one week to one day in advance
Weighted averageLoad demand historicalBaseline with one week to one day in advance
Comparable dayLoad demand historicalBaseline with one week to one day in advance
RegressionLoad demand historical
Temperature historical
Measurement of temperature
Baseline with one step (half hour) in advance
AdjustmentsLoad demand historical
Measurement of load demand
Baseline with one step (half hour) in advance
Table 3. Baseline, active power, MAPE (highlighted in red are the worst results, highlighted in blue are the best results).
Table 3. Baseline, active power, MAPE (highlighted in red are the worst results, highlighted in blue are the best results).
Baseline, Active PowerMAPE [%] Min Max
201620172018201620172018201620172018
Normal X of Y high21.7425.6830.45000000
Normal X of Y middle20.4123.6125.28000000
Normal comparable day18.7218.3422.43001000
Normal weighted average21.5625.4330.10000002
Normal simple regression27.1632.5434.57000122321
Multiplication X of Y high10.429.9512.97456000
Multiplication X of Y middle10.469.9413.443128000
Multiplication comparable day11.5410.9314.79545000
Multiplication weighted average10.489.9713.05667000
Multiplication simple regression12.8912.5016.14001000
Addition X of Y high11.3910.7914.40000000
Addition X of Y middle10.9710.1513.38353000
Addition comparable day11.7311.0814.43012000
Addition weighted average11.2210.5514.03002000
Addition simple regression13.1912.1716.58000000
Linear regression X of Y high25.0926.3331.04000000
Linear regression X of Y middle25.0026.1430.71000000
Linear regression comparable day25.6626.8031.68000254
Linear regression weighted average25.0426.2630.90000000
Linear regression simple regression26.2427.1532.17000757
Table 4. Baseline, active power, CVRMSE (highlighted in red are the worst results, highlighted in blue are the best results).
Table 4. Baseline, active power, CVRMSE (highlighted in red are the worst results, highlighted in blue are the best results).
Baseline, Active PowerCVRMSE (%) Min Max
201620172018201620172018201620172018
Normal X of Y high23.2924.5045.68001000
Normal X of Y middle24.1825.7248.48000002
Normal comparable day24.6725.7350.49000003
Normal weighted average24.2325.5546.89000000
Normal simple regression31.0132.5954.1600091313
Multiplication X of Y high18.9634.3943.74535010
Multiplication X of Y middle22.5738.5644.40010114
Multiplication comparable day34.5239.3764.16000468
Multiplication weighted average19.3530.3638.82012012
Multiplication simple regression26.7848.4653.80000374
Addition X of Y high12.5913.2626.99366000
Addition X of Y middle12.5913.2327.001610000
Addition comparable day14.0514.9730.36261000
Addition weighted average12.5613.1926.838108000
Addition simple regression14.9215.2730.80001000
Linear regression X of Y high29.4127.7346.66000000
Linear regression X of Y middle29.4427.7446.69000000
Linear regression comparable day30.0028.5648.04000022
Linear regression weighted average29.4327.7246.61000000
Linear regression simple regression30.3928.5648.08000222
Table 5. Baseline, reactive power, MAPE (highlighted in red are the worst results, highlighted in blue are the best results).
Table 5. Baseline, reactive power, MAPE (highlighted in red are the worst results, highlighted in blue are the best results).
Baseline, Reactive PowerMAPE (%) Min Max
201620172018201620172018201620172018
Normal X of Y high38.0147.9448.60112000
Normal X of Y middle35.0950.0849.39002010
Normal comparable day31.9048.0446.91000010
Normal weighted average38.5148.5549.64000211
Normal simple regression113.02109.54109.84010172225
Multiplication X of Y high24.8340.0539.77986000
Multiplication X of Y middle22.4242.7643.35021043
Multiplication comparable day23.8743.6843.63010110
Multiplication weighted average25.1240.2640.14223100
Multiplication simple regression32.0644.7547.30245000
Addition X of Y high24.6535.8235.28123000
Addition X of Y middle20.8135.7035.19026000
Addition comparable day21.7936.9936.18422000
Addition weighted average24.5335.5635.09153100
Addition simple regression37.8751.3052.56000023
Linear regression X of Y high38.3548.2044.89020000
Linear regression X of Y middle38.5748.5344.90000000
Linear regression comparable day39.3049.4045.83001000
Linear regression weighted average38.3948.2344.84000000
Linear regression simple regression46.5456.8554.91100111
Table 6. Baseline, reactive power, CVRMSE (highlighted in red are the worst results, highlighted in blue are the best results).
Table 6. Baseline, reactive power, CVRMSE (highlighted in red are the worst results, highlighted in blue are the best results).
Baseline, Reactive Power CVRMSE (%) Min Max
201620172018201620172018201620172018
Normal X of Y high27.2026.6924.69000000
Normal X of Y middle29.4930.6328.44000001
Normal comparable day28.8128.9927.83000011
Normal weighted average28.7328.4925.89000000
Normal simple regression71.0460.7760.01253121113
Multiplication X of Y high31.7853.1728.47111120
Multiplication X of Y middle50.8550.5459.77000127
Multiplication comparable day61.4868.7164.87000374
Multiplication weighted average26.3957.8229.72101020
Multiplication simple regression23.2937.3924.97001011
Addition X of Y high15.0814.5212.90596000
Addition X of Y middle15.3614.9713.18023000
Addition comparable day16.7916.0614.65221000
Addition weighted average15.1614.6212.908810000
Addition simple regression40.8943.3639.65000112
Linear regression X of Y high32.6130.7722.48000000
Linear regression X of Y middle32.7330.9222.56000000
Linear regression comparable day33.2231.3723.36000100
Linear regression weighted average32.6630.8122.47000000
Linear regression simple regression49.3349.8743.28010010
Table 7. The most accurate methods used for active (A) and reactive (R) power calculation per building ID.
Table 7. The most accurate methods used for active (A) and reactive (R) power calculation per building ID.
IDabcdefghijklmnopqrst
1 A R
2 AR
3 R A
4 R A
5 RA
6 AR
7 AR
8 A R
9
10 AR
11 R A
12 A R
13 A
14R A
15 A R
16 A R
17 A R
18 A R
19 AR
20 ARAR
21 A R
22 R A
23 AR
24 A R
25 AR
26 A R A
27R A
28 RA
29 A R
30 AR
31 AR
32
33 AR
34 AR
35 A R
36
A count00100685700322010000
R count22000610153623000000
Normal
Multiplication
Addition
Linear regression
aNormal X of Y high
bNormal X of Y middle
cNormal comparable day
dNormal weighted average
eNormal simple regression
fMultiplication X of Y high
gMultiplication X of Y middle
hMultiplication comparable day
iMultiplication weighted average
jMultiplication simple regression
kAddition X of Y high
lAddition X of Y middle
mAddition comparable day
nAddition weighted average
oAddition simple regression
pLinear regression X of Y high
qLinear regression X of Y middle
rLinear regression comparable day
sLinear regression weighted average
tLinear regression simple regression
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Segovia, E.; Vukovic, V.; Bragatto, T. Comparison of Baseline Load Forecasting Methodologies for Active and Reactive Power Demand. Energies 2021, 14, 7533. https://doi.org/10.3390/en14227533

AMA Style

Segovia E, Vukovic V, Bragatto T. Comparison of Baseline Load Forecasting Methodologies for Active and Reactive Power Demand. Energies. 2021; 14(22):7533. https://doi.org/10.3390/en14227533

Chicago/Turabian Style

Segovia, Edgar, Vladimir Vukovic, and Tommaso Bragatto. 2021. "Comparison of Baseline Load Forecasting Methodologies for Active and Reactive Power Demand" Energies 14, no. 22: 7533. https://doi.org/10.3390/en14227533

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop