energies-logo

Journal Browser

Journal Browser

Clustering of the Electricity Consumption Time Series in the Big Data Era

A special issue of Energies (ISSN 1996-1073). This special issue belongs to the section "C: Energy Economics and Policy".

Deadline for manuscript submissions: closed (31 December 2019) | Viewed by 14320

Special Issue Editors


E-Mail Website
Guest Editor
Department of Informatics, Warsaw University of Life Sciences, 02-776 Warsaw, Poland
Interests: machine learning; computational intelligence; data science; data mining; smart metering
Special Issues, Collections and Topics in MDPI journals

E-Mail Website1 Website2
Guest Editor
Department of Informatics, Faculty of Applied Informatics and Mathematics, Warsaw University of Life Sciences, Warsaw, Poland
Interests: machine learning; image analysis and pattern recognition; artificial neural networks; quantum information and reversible computing; classical and quantum entropies; physics of information; distributed computing; modeling of complex systems
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

We are living in the era of emerging technological advancements. The days when almost everything was done manually are gone, and now we live in the time where a lot of activities have been taken over by machines, software, and automatic processes. In this context, and machine learning (ML) has a special place in the advancements being made today. ML applies science to computers and machines to allow them to develop human-like intelligence. Through this technology, machines are able to perform some of the simple to complex tasks that humans need to do on a regular basis.

The concept of Big Data is defined by Gartner as high volume, high velocity, high variety and high veracity data that require new processing paradigms to enable insight discovery, improve decision making, and optimize processes. The potential of Big Data is highlighted in its definition; however, the realization of this potential depends on improving traditional approaches or developing new approaches capable of handling this data. Due to its potential, Big Data has been referred to as a revolution that will transform how we live, work, and think. The main purpose of this revolution is to make use of large amounts of data to enable knowledge discovery and better decision-making.

The ability to extract value from Big Data depends on data analytics, which can be carried out using ML systems. While ML provides significant support in various areas such as time series forecasting, the road to excellence is long. This is because ML has not been able to overcome a number of challenges—especially in the Big Data era—that still stand in the way of progress. Examples of those challenges in terms of Big Data characteristics are:
— for volume: processing performance, feature engineering; for variety: data heterogeneity, dirty and noisy data;

— for velocity: real time processing/streaming, concept drift; and

— for veracity: data provenance.

Data stream clustering has lately become an increasingly popular research topic due to the fact that nowadays more data is produced than there is available storage space. This makes traditional mining approaches that require looking at data more than once unusable. Despite many techniques having been proposed, few have tackled the problem of clustering data streams composed of multiple time series, which we refer to as the problem of clustering streaming time series. Researchers recognize that time series are pervasive in many scientific endeavors, such as electricity load forecasting.

Currently, most techniques for streaming time series clustering rely on a similarity measure based on the Pearson correlation, which is easily computable in an incremental manner, and quite useful in a streaming scenario. However, when relying only on this particular measure, the similarities detected are mainly based on time series shapes and trends, which do not necessarily reflect their true underlying model.

In learning from data streams, which most often occur in non-stationary environments, we can encounter concept drift—a phenomenon by which the data distribution changes and makes the current prediction model inaccurate or obsolete. This can cause the degradation of predictive accuracy if the learning process overlooks the drift and does not adapt to it. Detecting and adapting to concept drift is not a trivial task, and several methods exist that can be roughly split into performance monitoring algorithms and distribution comparing algorithms.

In the light of these challenges, the main scope of this Special Issue is to develop new methods applicable to various machine learning algorithms to improve their quality and robustness in the context of electricity load clustering.

Dr. Krzysztof Gajowniczek
Prof. Arkadiusz Orlowski
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Energies is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2600 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • Machine learning
  • Big Data
  • Smart Metering
  • Time Series Clustering
  • Stream Mining

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • e-Book format: Special Issues with more than 10 articles can be published as dedicated e-books, ensuring wide and rapid dissemination.

Further information on MDPI's Special Issue polices can be found here.

Published Papers (2 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

Jump to: Review

25 pages, 5373 KiB  
Article
Simulation Study on the Electricity Data Streams Time Series Clustering
by Krzysztof Gajowniczek, Marcin Bator, Tomasz Ząbkowski, Arkadiusz Orłowski and Chu Kiong Loo
Energies 2020, 13(4), 924; https://doi.org/10.3390/en13040924 - 19 Feb 2020
Cited by 1 | Viewed by 2216
Abstract
Currently, thanks to the rapid development of wireless sensor networks and network traffic monitoring, the data stream is gradually becoming one of the most popular data generating processes. The data stream is different from traditional static data. Cluster analysis is an important technology [...] Read more.
Currently, thanks to the rapid development of wireless sensor networks and network traffic monitoring, the data stream is gradually becoming one of the most popular data generating processes. The data stream is different from traditional static data. Cluster analysis is an important technology for data mining, which is why many researchers pay attention to grouping streaming data. In the literature, there are many data stream clustering techniques, unfortunately, very few of them try to solve the problem of clustering data streams coming from multiple sources. In this article, we present an algorithm with a tree structure for grouping data streams (in the form of a time series) that have similar properties and behaviors. We have evaluated our algorithm over real multivariate data streams generated by smart meter sensors—the Irish Commission for Energy Regulation data set. There were several measures used to analyze the various characteristics of a tree-like clustering structure (computer science perspective) and also measures that are important from a business standpoint. The proposed method was able to cluster the flows of data and has identified the customers with similar behavior during the analyzed period. Full article
Show Figures

Figure 1

Review

Jump to: Research

61 pages, 8942 KiB  
Review
A Review on Time Series Aggregation Methods for Energy System Models
by Maximilian Hoffmann, Leander Kotzur, Detlef Stolten and Martin Robinius
Energies 2020, 13(3), 641; https://doi.org/10.3390/en13030641 - 3 Feb 2020
Cited by 124 | Viewed by 11607
Abstract
Due to the high degree of intermittency of renewable energy sources (RES) and the growing interdependences amongst formerly separated energy pathways, the modeling of adequate energy systems is crucial to evaluate existing energy systems and to forecast viable future ones. However, this corresponds [...] Read more.
Due to the high degree of intermittency of renewable energy sources (RES) and the growing interdependences amongst formerly separated energy pathways, the modeling of adequate energy systems is crucial to evaluate existing energy systems and to forecast viable future ones. However, this corresponds to the rising complexity of energy system models (ESMs) and often results in computationally intractable programs. To overcome this problem, time series aggregation (TSA) is frequently used to reduce ESM complexity. As these methods aim at the reduction of input data and preserving the main information about the time series, but are not based on mathematically equivalent transformations, the performance of each method depends on the justifiability of its assumptions. This review systematically categorizes the TSA methods applied in 130 different publications to highlight the underlying assumptions and to evaluate the impact of these on the respective case studies. Moreover, the review analyzes current trends in TSA and formulates subjects for future research. This analysis reveals that the future of TSA is clearly feature-based including clustering and other machine learning techniques which are capable of dealing with the growing amount of input data for ESMs. Further, a growing number of publications focus on bounding the TSA induced error of the ESM optimization result. Thus, this study can be used as both an introduction to the topic and for revealing remaining research gaps. Full article
Show Figures

Graphical abstract

Back to TopTop