Next Article in Journal
Market Openness and Its Relationship to Connecting Markets Due to COVID-19
Previous Article in Journal
A Framework for Monitoring Ecosystems-Based Adaptation to Climate Change: Experience from The Gambia
Previous Article in Special Issue
Sustainable Development with Schumpeter Extended Endogenous Type of Innovation and Statistics in European Countries
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Anomaly Detection with Machine Learning Algorithms and Big Data in Electricity Consumption

by
Simona-Vasilica Oprea
1,*,
Adela Bâra
1,
Florina Camelia Puican
1 and
Ioan Cosmin Radu
2
1
Department of Economic Informatics and Cybernetics, Bucharest University of Economic Studies, Romana Square 6, 010374 Bucharest, Romania
2
Departament of Engineering in Foreign Languages, University Politehnica of Bucharest, Splaiul Independenței, No. 313, 060042 Bucharest, Romania
*
Author to whom correspondence should be addressed.
Sustainability 2021, 13(19), 10963; https://doi.org/10.3390/su131910963
Submission received: 23 August 2021 / Revised: 30 September 2021 / Accepted: 30 September 2021 / Published: 2 October 2021

Abstract

When analyzing smart metering data, both reading errors and frauds can be identified. The purpose of this analysis is to alert the utility companies to suspicious consumption behavior that could be further investigated with on-site inspections or other methods. The use of Machine Learning (ML) algorithms to analyze consumption readings can lead to the identification of malfunctions, cyberattacks interrupting measurements, or physical tampering with smart meters. Fraud detection is one of the classical anomaly detection examples, as it is not easy to label consumption or transactional data. Furthermore, frauds differ in nature, and learning is not always possible. In this paper, we analyze large datasets of readings provided by smart meters installed in a trial study in Ireland by applying a hybrid approach. More precisely, we propose an unsupervised ML technique to detect anomalous values in the time series, establish a threshold for the percentage of anomalous readings from the total readings, and then label that time series as suspicious or not. Initially, we propose two types of algorithms for anomaly detection for unlabeled data: Spectral Residual-Convolutional Neural Network (SR-CNN) and an anomaly trained model based on martingales for determining variations in time-series data streams. Then, the Two-Class Boosted Decision Tree and Fisher Linear Discriminant analysis are applied on the previously processed dataset. By training the model, we obtain the required capabilities of detecting suspicious consumers proved by an accuracy of 90%, precision score of 0.875, and F1 score of 0.894.
Keywords: anomaly detection; unsupervised and supervised machine learning; big data; smart grid; fraud detection anomaly detection; unsupervised and supervised machine learning; big data; smart grid; fraud detection

Share and Cite

MDPI and ACS Style

Oprea, S.-V.; Bâra, A.; Puican, F.C.; Radu, I.C. Anomaly Detection with Machine Learning Algorithms and Big Data in Electricity Consumption. Sustainability 2021, 13, 10963. https://doi.org/10.3390/su131910963

AMA Style

Oprea S-V, Bâra A, Puican FC, Radu IC. Anomaly Detection with Machine Learning Algorithms and Big Data in Electricity Consumption. Sustainability. 2021; 13(19):10963. https://doi.org/10.3390/su131910963

Chicago/Turabian Style

Oprea, Simona-Vasilica, Adela Bâra, Florina Camelia Puican, and Ioan Cosmin Radu. 2021. "Anomaly Detection with Machine Learning Algorithms and Big Data in Electricity Consumption" Sustainability 13, no. 19: 10963. https://doi.org/10.3390/su131910963

APA Style

Oprea, S.-V., Bâra, A., Puican, F. C., & Radu, I. C. (2021). Anomaly Detection with Machine Learning Algorithms and Big Data in Electricity Consumption. Sustainability, 13(19), 10963. https://doi.org/10.3390/su131910963

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop