Generating Synthetic Electricity Load Time Series at District Scale Using Probabilistic Forecasts
Abstract
:1. Introduction
1.1. The Energy Market
- Consists of at least 50 natural persons;
- Involves at least 75% of the shares being held by natural persons who are located within one postal area and a radius of 50 km;
- Requires that no member possesses more than 10% of the shares.
- Mean, variance, and autocorrelation are not constant over time of non-stationary time series after removing seasonality.
- Discontinuous time series possess bounds in the sequence of observations.
1.2. Time Series Synthesis
1.3. Contributions
- A DECTS is the aggregated electricity consumption time series of N distinct households.
2. Data
2.1. Analysis and Data Transformation
2.2. Clustering ACORN Groups
- Normalizing each participant’s time series by its median (Equation (5));
- Calculating monthly mean values of normalized time series (Figure 5 (left));
- Calculating half-hourly mean values of normalized time series (Figure 5 (right));
- Applying Kmeans to , where the number of clusters equals 2, and separate time series with regard to their seasonal variations (more seasonal vs. less seasonal characteristics, Figure 5 (left));
- Applying Kmeans to to further separate time series with respect to their daily sequence (Figure 5 (right)). The number of clusters is adjusted dynamically to ensure that each cluster has a minimum of 15 participants. This criterion is crucial for generating artificial DECTS with diverse characteristics, as one DECTS represents the collective consumption of 10 households.
3. Methodology
3.1. Problem Description
3.2. Concept
3.3. Time Series Forecast
- Within a one-hot encoding, each class is represented by a binary vector. In this encoding, each class occurrence assigns to 1 and otherwise to 0.
- Periodic encodings are transformations of one-hot encodings into more continuous variables by using sine and cosine functions. This can only be applied to periodic variables like daytime, day of the week, or day of the year.
Difference filter with order d to eliminate non-stationarity and backshift operator depicting the shift to reference values. | |
, , | Regression parameters within an ARIX model |
n, m | Number of past and future exogenous variables that could differ |
x, y | Exogenous, endogenous variables |
3.4. Pre-Processing Independent Variables for Seasonal and Short-Term Model Training
- Include calendar data and exogenous/endogenous variables (temperature T, relative humidity RH). Holidays are considered to be uniform due to similar characteristics and consequently possess the same one-hot encoding (Definition 5).
- As partial autocorrelations of DECTS only show significant dependencies for the first two lags (Figure 9), p equals 2 within Equations (7) and (8) to train PNN-C. Longer but weaker temporal dependencies (see lags 5 to 8 in Figure 9) due to time-shifted activities like doing sport, cooking, or washing are neglected.
- While periodic encodings (Definition 6) of tod are neglected for FNN-S/PNN-S, they are included in PNN-C to analyze half-hourly effects on the district electricity consumption. Moreover, FNN-S and PNN-S use doy as periodic encoding in the input space to simulate seasonal variations.
- Additonally, seasonal and short-term models use different temperature observations in the input space, where the maximum daily value is taken for PNN-S and half-hourly values are the best choice for PNN-C.
3.5. Pre-Processing Dependent Variables for Seasonal and Short-Term Model Training
3.6. Sampling Training Data to Generate Artificial DECTS
4. Probabilistic Feedforward Neural Network
4.1. Architecture
4.2. Training Strategy
- MSE: Mean-Squared-Error;
- VE: Variance-Error;
- : Measurements;
- : Ouputs of the μ-Layer;
- : Ouputs of the σ-Layer.
4.3. Generating Synthetic Time Series
- Predict mean seasonal daily means with FNN-S and and generate half-hourly values using average seasonal daily profiles.
- Use to synthesize a set of time series of stochastic relative daily mean values with PNN-S and , .
- Multiply by to receive absolute stochastic daily means . As each is used again in the input array to predict , daily means are generated iteratively (Figure 12).
- After appending to , half-hourly values are generated iteratively (Figure 12) with PNN-C to synthesize individual short-term variations.
5. Results
5.1. Distribution of Half-Hourly Values
5.2. Examples of Daily Sequences on Different Days
5.3. Autocorrelation of Entire Time Series
5.4. Principal Components of Daily Sequences
5.5. Correlating Real vs. Synthetic Time Series
6. Discussion
7. Conclusions and Outlook
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
AR | Auto-Regressive |
ARIMA | Auto-Regressive Integrated Moving-Average |
ARIX | Auto-Regressive Integrated with eXogenous variables |
ARMA | Auto-Regressive Moving-Average |
ARX | Auto-Regressive with eXogenous variables |
cINN | Conditional Invertible Neural Networks |
DECTS | District Electricity Consumption Time Series |
dow | Day of Week |
doy | Day of Year |
EMS | Energy Management System |
FNN | Feedforward Neural Network |
GAN | Generative Adversarial Network |
hol | Holiday |
PNN | Probabilistic Neural Network |
RE | Regression Equation |
REC | Renewable Energy Communities |
RH | Relative Humidity |
T | Temperature |
tod | Time of Day |
Appendix A. ACORN User Segmentation
ACORN A | Lavish Lifestyles | ACORN J | Starting Out |
ACORN B | Executive Wealth | ACORN K | Student Life |
ACORN C | Mature Money | ACORN L | Modest Means |
ACORN D | City Sophisticates | ACORN M | Striving Families |
ACORN E | Career Climbers | ACORN N | Poorer Pensioners |
ACORN F | Countryside Communities | ACORN O | Young Hardship |
ACORN G | Successful Suburbs | ACORN P | Struggling Estates |
ACORN H | Steady Neighborhoods | ACORN Q | Difficult Circumstance |
ACORN I | Comfortable Seniors | ACORN U | Not Private Households |
References
- Available online: https://eur-lex.europa.eu/legal-content/en/TXT/?uri=CELEX:32019L0944 (accessed on 7 February 2024).
- Flemming, S.; Bender, T.; Surmann, A.; Pelka, S.; Martin, A.; Kühnbach, M. Vor-Ort-Systeme als flexibler Baustein im Energiesystem? Eine Cross-Sektorale Potenzialanalyse; Fraunhofer-Publica: Berlin, Germany, 2023. [Google Scholar] [CrossRef]
- Koch, A.; Schmelcher, S.; Sternkopf, T.; Wrede, M. Modellierung Sektorintegrierter Energieversorgung IM Quartier—Untersuchung der Vorteile der Optimierung von Energiesystemen auf Quartiersebene gegenüber der Optimierung auf Gebäudeebene. Available online: https://www.dena.de/fileadmin/dena/Publikationen/PDFs/2022/STUDIE_Modellierung_sektorintegrierter_Energieversorgung_im_Quartier.pdf (accessed on 5 January 2023).
- Energieforschungsprogramm der Bundesregierung. Available online: https://www.bmwk.de/Redaktion/DE/Publikationen/Energie/7-energieforschungsprogramm-der-bundesregierung.pdf?__blob=publicationFile&v=4 (accessed on 5 January 2023).
- Available online: https://www.bmwk.de/Redaktion/DE/Dossier/netze-und-netzausbau.html (accessed on 19 July 2023).
- Available online: https://wirtschaftslexikon.gabler.de/definition/energiemanagementsystem-53996 (accessed on 19 July 2023).
- Deutsch, M.; Timpe, P. The effect of age on residential energy demand. In 8. Dynamics of Consumption; European Council for an Energy Efficient Economy: Stockholm, Sweden, 2013. [Google Scholar]
- Estiri, H.; Zagheni, E. Age matters: Ageing and household energy demand in the United States. Energy Res. Soc. Sci. 2019, 55, 62–70. [Google Scholar] [CrossRef]
- Abrahamse, W.; Steg, L. Factors Related to Household Energy Use and Intention to Reduce It: The Role of Psychological and Socio-Demographic Variables. Hum. Ecol. Rev. 2011, 18, 30–40. [Google Scholar]
- Frederiks, E.; Stenner, K.; Hobman, E. The Socio-Demographic and Psychological Predictors of Residential Energy Consumption: A Comprehensive Review. Energies 2015, 8, 573–609. [Google Scholar] [CrossRef]
- Cerqueira, V.; Torgo, L.; Mozeti?, I. Evaluating time series forecasting models: An empirical study on performance estimation methods. Mach. Learn. 2020, 109, 1997–2028. [Google Scholar] [CrossRef]
- Hittmeir, M.; Ekelhart, A.; Mayer, R. On the Utility of Synthetic Data: An Empirical Evaluation on Machine Learning Tasks. In Proceedings of the 14th International Conference on Availability, Reliability and Security. Association for Computing Machinery (ARES ’19), Canterbury, UK, 26–29 August 2019. [Google Scholar] [CrossRef]
- Iwana, B.K.; Uchida, S. An empirical survey of data augmentation for time series classification with neural networks. PLoS ONE 2021, 16, e0254841. [Google Scholar] [CrossRef] [PubMed]
- Thapa, R.; Shimada, M.; Watanabe, M.; Motohka, T.; Shiraishi, T. The tropical forest in South East Asia: Monitoring and scenario modeling using Synthetic Aperture Radar data. Appl. Geogr. 2013, 41, 168–178. [Google Scholar] [CrossRef]
- Yilmaz, B.; Korn, R. Synthetic demand data generation for individual electricity consumers: Generative Adversarial Networks (GANs). Energy AI 2022, 9, 100161. [Google Scholar] [CrossRef]
- Available online: https://eur-lex.europa.eu/legal-content/DE/TXT/PDF/?uri=CELEX:02018L2001-20181221&from=EN (accessed on 5 January 2023).
- Shamshad, A.; Bawadi, M.; Wan Hussin, W.; Majid, T.; Sanusi, S. First and second order Markov chain models for synthetic generation of wind speed time series. Energy 2005, 30, 693–708. [Google Scholar] [CrossRef]
- Talbot, P.W.; Rabiti, C.; Alfonsi, A.; Krome, C.; Kunz, M.R.; Epiney, A.; Wang, C.; Mandelli, D. Correlated synthetic time series generation for energy system simulations using Fourier and ARMA signal processing. Int. J. Energy Res. 2020, 44, 8144–8155. [Google Scholar] [CrossRef]
- Richardson, I.; Thomson, M.; Infield, D. A high-resolution domestic building occupancy model for energy demand simulations. Energy Build. 2008, 40, 1560–1566. [Google Scholar] [CrossRef]
- Fischer, D.; Härtl, A.; Wille-Haussmann, B. Model for Electric Load Profiles With High Time Resolution for German Households. Energy Build. 2015, 92. [Google Scholar] [CrossRef]
- Naumann, S.; Klaiber, S.; Kummerow, A.; Bretschneider, P. Simulation of Coordinated Market Grid Operations considering Uncertainties. In Proceedings of the 2018 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT-Europe), Sarajevo, Bosnia and Herzegovina, 21–25 October 2018; pp. 1–6. [Google Scholar] [CrossRef]
- Klaiber, S. Analyse, Identifikation und Prognose Preisbeeinflusster Elektrischer Lastzeitreihen. Ph.D. Thesis, Technische Universität Ilmenau, Ilmenau, Germany, 2020. [Google Scholar]
- Asre, S.; Anwar, A. Synthetic Energy Data Generation Using Time Variant Generative Adversarial Network. Electronics 2022, 11, 355. [Google Scholar] [CrossRef]
- Zhang, C.; Kuppannagari, S.; Kannan, R.; Prasanna, V.K. Generative Adversarial Network for Synthetic Time Series Data Generation in Smart Grids. In Proceedings of the 2018 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm), Aalborg, Denmark, 29–31 October 2018. [Google Scholar] [CrossRef]
- Wang, K.; Gou, C.; Duan, Y.; Lin, Y.; Zheng, X.; Wang, F.Y. Generative adversarial networks: Introduction and outlook. IEEE/CAA J. Autom. Sin. 2017, 4, 588–598. [Google Scholar] [CrossRef]
- Heidrich, B.; Turowski, M.; Phipps, K.; Schmieder, K.; Süß, W.; Mikut, R.; Hagenmeyer, V. Controlling Non-Stationarity and Periodicities in Time Series Generation Using Conditional Invertible Neural Networks. Appl. Intell. 2022, 53, 8826–8843. [Google Scholar] [CrossRef]
- Available online: https://www.loadprofilegenerator.de (accessed on 2 August 2023).
- Available online: https://synpro-lastprofile.de/ (accessed on 2 August 2023).
- Tunnicliffe Wilson, G. Time Series Analysis: Forecasting and Control, 5th ed.; Box, G.E., Jenkins, G.M., Reinsel, G.C., Ljung, G.M., Eds.; John Wiley and Sons Inc.: Hoboken, NJ, USA, 2015; p.712, ISBN: 978-1-118-67502-1. J. Time Ser. Anal. 2016, 37, 709–711. [Google Scholar] [CrossRef]
- Available online: https://www.kaggle.com/datasets/jeanmidev/smart-meters-in-london (accessed on 19 July 2023).
- Available online: https://www.caci.co.uk/wp-content/uploads/2021/06/Acorn-User-Guide-2020.pdf (accessed on 19 July 2023).
- Savi, M.; Olivadese, F. Short-Term Energy Consumption Forecasting at the Edge: A Federated Learning Approach. IEEE Access 2021, 9, 95949–95969. [Google Scholar] [CrossRef]
- Makridakis, S.; Hibon, M. ARMA models and the Box-Jenkins methodology. J. Forecast. 1997, 16, 147–163. [Google Scholar] [CrossRef]
- Kang, J.; Reiner, D.M. What is the effect of weather on household electricity consumption? Empirical evidence from Ireland. Energy Econ. 2022, 111, 106023. [Google Scholar] [CrossRef]
- Pinheiro, M.; Madeira, S.; Francisco, A. Short-term electricity load forecasting? A systematic approach from system level to secondary substations. Appl. Energy 2023, 332, 120493. [Google Scholar] [CrossRef]
- Gasparin, A.; Lukovic, S.; Alippi, C. Deep Learning for Time Series Forecasting: The Electric Load Case. arXiv 2019, arXiv:1907.09207. [Google Scholar] [CrossRef]
- Available online: https://transparency.entsoe.eu (accessed on 28 November 2023).
- Heidrich, B.; Phipps, K.; Neumann, O.; Turowski, M.; Mikut, R.; Hagenmeyer, V. ProbPNN: Enhancing Deep Probabilistic Forecasting with Statistical Information. arXiv 2023, arXiv:2302.02597. [Google Scholar]
- Available online: https://www.tensorflow.org. (accessed on 13 March 2024).
- Tukey, J.W. Exploratory Data Analysis; Addison-Wesley: Boston, MA, USA, 1977. [Google Scholar]
- Jolliffe, I. Principal Component Analysis; Springer: Berlin/Heidelberg, Germany, 1986. [Google Scholar]
Process Variable | FNN-S | PNN-S | PNN-C |
---|---|---|---|
p | X | 1 | 2 |
X | 3 days | 3 days | |
X | 1 day | 1 day | |
X | 1 day | 1 day | |
X | 1 day | 1 day | |
X | 1 day | 1 day | |
X | 6 days | 6 days | |
X | 1 day | 1 day | |
Exogenous variables | X | , | , |
dow/H | one-hot-enc | one-hot-enc | one-hot-enc |
tod | X | X | periodic-enc |
doy | periodic-enc | periodic-enc | periodic-enc |
Model | Input | Target | Output |
---|---|---|---|
FNN-S | |||
PNN-S | , | ||
PNN-C |
Measure | 0 h | 6 h | 12 h | 18 h | |
---|---|---|---|---|---|
training data | [kW] [kW] | 0.295231 0.103369 | 0.318273 0.101921 | 0.404327 0.146100 | 0.693896 0.251876 |
synthetic data | [kW] [kW] | 0.315641 0.099668 | 0.323386 0.081332 | 0.409102 0.120735 | 0.712146 0.244308 |
PC1- | PC2- | PC1- | PC2- | |
---|---|---|---|---|
Training data | 0.009 | −0.007 | 0.225 | 0.070 |
Synthetic data | −0.009 | 0.007 | 0.232 | 0.053 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Richter, L.; Bender, T.; Lenk, S.; Bretschneider, P. Generating Synthetic Electricity Load Time Series at District Scale Using Probabilistic Forecasts. Energies 2024, 17, 1634. https://doi.org/10.3390/en17071634
Richter L, Bender T, Lenk S, Bretschneider P. Generating Synthetic Electricity Load Time Series at District Scale Using Probabilistic Forecasts. Energies. 2024; 17(7):1634. https://doi.org/10.3390/en17071634
Chicago/Turabian StyleRichter, Lucas, Tom Bender, Steve Lenk, and Peter Bretschneider. 2024. "Generating Synthetic Electricity Load Time Series at District Scale Using Probabilistic Forecasts" Energies 17, no. 7: 1634. https://doi.org/10.3390/en17071634
APA StyleRichter, L., Bender, T., Lenk, S., & Bretschneider, P. (2024). Generating Synthetic Electricity Load Time Series at District Scale Using Probabilistic Forecasts. Energies, 17(7), 1634. https://doi.org/10.3390/en17071634