Enabling Alarm-Based Fault Prediction for Smart Meters in District Heating Systems: A Danish Case Study
Highlights
- According to real-world data, smart meters in district heating systems display a plethora of alarms that are not tended to, and ensuring that they function correctly is critical for operating and managing district heating systems.
- A framework leveraging machine learning was developed that can predict if an alarm will occur in a future time span utilizing historical sensor and alarm data from smart meters in district heating networks.
- Prediction of alarms in smart meters can aid in ensuring proper operation of district heating systems, which includes billing of customers.
- Additionally, for implementation of future smart solutions, robust meter readings are necessary, and alarm prediction can aid in ensuring these.
Abstract
:1. Introduction
1.1. Related Works
- Occurrence-time prediction: does an event happen or not in a future period?
- Discrete-time prediction: in which among several time slots does an event occur?
- Continuous-time prediction: at which exact time does an event occur in the future?
1.2. Contributions
- A sensor fault prediction framework based on alarms generated in smart meters and operational data from the smart meters.
- First application of fault prediction in smart meter sensors in the DH domain, to the best of our knowledge.
- The application of the framework to a case study with real-world data.
2. Case Study and Data Preprocessing
2.1. Consumption Data
2.2. Alarm Data
2.3. Identification of Fault Prediction Possibilities
2.4. Modification of Alarm Definition
2.5. Preparation of Data for Prediction
3. Methodology
3.1. Development Phase Framework
- Determining machine learning algorithm: Execute the framework on each of the proposed machine learning methods calculating the KPIs with the static windowing parameters, p = 24, s = 0, and f = 24. This identifies the hyperparameters for each machine learning method given the ranges it can pick from (also applying to the default hyperparameters). The best-performing machine learning method is then used onward.
- Determining windowing parameters: Carry out three types of experiments where the three windowing parameters are adjusted individually to determine the effects on the KPIs. The best-performing mix of windowing parameters is determined (as mentioned in Section 2.5) by a mixture of maximizing the KPIs while also considering the usefulness of the prediction for maintenance crews, as that changes when adjusting the windowing parameters (this is why it cannot be performed automatically).
3.1.1. Description of Machine Learning Algorithms
- Random forest (RF).
- k-nearest neighbor (KNN).
- Support vector machine (SVM).
- Adaboost—using decision stumps.
- Gaussian naive Bayes (GNB).
Random Forest
Adaboost
K-Nearest Neighbor
Support Vector Machine
Gaussian Naive Bayes
3.1.2. Key Performance Indicators
3.1.3. Hyperparameter Tuning and Cross-Validation
3.2. Online Phase Framework
4. Results
4.1. Selection of Machine Learning Algorithm and Hyperparameter Tuning
4.2. Determination of Horizons
5. Discussion
6. Conclusions and Future Work
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
TPR | True-positive rate |
DH | District heating |
DHS | District heating system |
KPI | Key performance indicator |
KNN | k-nearest neighbor |
RF | Random forest |
GNB | Gaussian naive Bayes |
SVM | Support vector machine |
T-SNE | T-distributed stochastic neighbor embedding |
References
- Schaffer, M.; Tvedebrink, T.; Marszal, A. Three years of hourly data from 3021 smart heat meters installed in Danish residential buildings. Sci. Data 2022, 9, 420. [Google Scholar] [CrossRef] [PubMed]
- Yliniemi, K.; van Deventer, J.; Delsing, J. Sensor fault detection in a district heating substation. In Proceedings of the 10th IMEKO TC10 International Conference on Technical Diagnostics, Budapest, Hungary, 9–10 June 2005. [Google Scholar]
- The Danish Safety Technology Authority Regulation on Usage of Data Recording Instruments for Measuring of Usage of Water, Gas, Power or Heat. 2018. Available online: https://www.retsinformation.dk/eli/lta/2018/582 (accessed on 8 May 2024).
- Zheng, W.; Xu, S.; Liu, J.; Zhu, J.; Luo, Q. Participation of strategic district heating networks in electricity markets: An arbitrage mechanism and its equilibrium analysis. Appl. Energy 2023, 350, 121732. [Google Scholar] [CrossRef]
- Andresen, C.; Torsæter, B.N.; Haugdal, H.; Uhlen, K. Fault Detection and Prediction in Smart Grids. In Proceedings of the 2018 IEEE 9th International Workshop on Applied Measurements for Power Systems (AMPS), Bologna, Italy, 26–28 September 2018; pp. 1–6. [Google Scholar] [CrossRef]
- Massaro, A.; Kostadinov, D.; Silva, A.; Obeid Guzman, A.; Aghasaryan, A. Predicting Network Hardware Faults through Layered Treatment of Alarms Logs. Entropy 2023, 25, 917. [Google Scholar] [CrossRef] [PubMed]
- Betti, A.; Tucci, M.; Crisostomi, E.; Piazzi, A.; Barmada, S.; Thomopulos, D. Fault Prediction and Early-Detection in Large PV Power Plants Based on Self-Organizing Maps. Sensors 2021, 21, 1687. [Google Scholar] [CrossRef] [PubMed]
- Zhang, S.; Wang, Y.; Liu, M.; Bao, Z. Data-Based Line Trip Fault Prediction in Power Systems Using LSTM Networks and SVM. IEEE Access 2018, 6, 7675–7686. [Google Scholar] [CrossRef]
- Alqudah, M.; Kezunovic, M.; Obradovic, Z. Automated Power System Fault Prediction and Precursor Discovery Using Multi-Modal Data. IEEE Access 2023, 11, 7283–7296. [Google Scholar] [CrossRef]
- Mortensen, L.K.; Shaker, H.R.; Veje, C.T. Relative fault vulnerability prediction for energy distribution networks. Appl. Energy 2022, 322, 119449. [Google Scholar] [CrossRef]
- Chen, S.; Friedrich, D.; Yu, Z.; Yu, J. District Heating Network Demand Prediction Using a Physics-Based Energy Model with a Bayesian Approach for Parameter Calibration. Energies 2019, 12, 3408. [Google Scholar] [CrossRef]
- Dang, L.M.; Shin, J.; Li, Y.; Tightiz, L.; Nguyen, T.; Song, H.K.; Moon, H. Toward Explainable Heat Load Patterns Prediction for District Heating. Sci. Rep. 2023, 13, 7434. [Google Scholar] [CrossRef]
- Dahl, M.; Brun, A.; Kirsebom, O.S.; Andresen, G.B. Improving Short-Term Heat Load Forecasts with Calendar and Holiday Data. Energies 2018, 11, 1678. [Google Scholar] [CrossRef]
- Sakkas, N.P.; Abang, R. Thermal load prediction of communal district heating systems by applying data-driven machine learning methods. Energy Rep. 2022, 8, 1883–1895. [Google Scholar] [CrossRef]
- Månsson, S.; Kallioniemi, P.O.J.; Sernhed, K.; Thern, M. A machine learning approach to fault detection in district heating substations. Energy Procedia 2018, 149, 226–235. [Google Scholar] [CrossRef]
- Zhang, F.; Fleyeh, H. Anomaly Detection of Heat Energy Usage in District Heating Substations Using LSTM based Variational Autoencoder Combined with Physical Model. In Proceedings of the 2020 15th IEEE Conference on Industrial Electronics and Applications (ICIEA), Kristiansand, Norway, 9–13 November 2020; pp. 153–158. [Google Scholar] [CrossRef]
- Gadd, H.; Werner, S. Fault detection in district heating substations. Appl. Energy 2015, 157, 51–59. [Google Scholar] [CrossRef]
- Guelpa, E.; Verda, V. Automatic fouling detection in district heating substations: Methodology and tests. Appl. Energy 2020, 258, 114059. [Google Scholar] [CrossRef]
- Dey, M.; Rana, S.P.; Dudley, S. A Case Study Based Approach for Remote Fault Detection Using Multi-Level Machine Learning in A Smart Building. Smart Cities 2020, 3, 401–419. [Google Scholar] [CrossRef]
- Du, Z.; Fan, B.; Jin, X.; Chi, J. Fault detection and diagnosis for buildings and HVAC systems using combined neural networks and subtractive clustering analysis. Build. Environ. 2014, 73, 1–11. [Google Scholar] [CrossRef]
- Johra, H.; Leiria, D.; Heiselberg, P.; Marszal, A.; Tvedebrink, T. Treatment and analysis of smart energy meter data from a cluster of buildings connected to district heating: A Danish case. E3s Web Conf. 2020, 172, 12004. [Google Scholar] [CrossRef]
- Leiria, D.; Johra, H.; Marszal-Pomianowska, A.; Pomianowski, M.Z.; Kvols Heiselberg, P. Using data from smart energy meters to gain knowledge about households connected to the district heating network: A Danish case. Smart Energy 2021, 3, 100035. [Google Scholar] [CrossRef]
- Pedersen, A.; Ustrup, S.; Mortensen, L.; Shaker, H. Data Validation for Digitally Enabled Operation & Maintenance of District Heating Systems. In Proceedings of the 2nd International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME), Maldives, Maldives, 16–18 November 2022. [Google Scholar] [CrossRef]
- Liu, Y.; Dillon, T.; Yu, W.; Rahayu, W.; Mostafa, F. Missing Value Imputation for Industrial IoT Sensor Data with Large Gaps. IEEE Internet Things J. 2020, 7, 6855–6867. [Google Scholar] [CrossRef]
- Dzaferagic, M.; Marchetti, N.; Macaluso, I. Fault Detection and Classification in Industrial IoT in Case of Missing Sensor Data. IEEE Internet Things J. 2022, 9, 8892–8900. [Google Scholar] [CrossRef]
- Ara, T.; M, P.; Bali, M. Fault Prediction in Wireless Sensor Networks using Soft Computing. In Proceedings of the 2020 International Conference on Smart Technologies in Computing, Electrical and Electronics (ICSTCEE), Bengaluru, India, 9–10 October 2020; pp. 532–538. [Google Scholar] [CrossRef]
- Siryani, J.; Tanju, B.; Eveleigh, T.J. A Machine Learning Decision-Support System Improves the Internet of Things’ Smart Meter Operations. IEEE Internet Things J. 2017, 4, 1056–1066. [Google Scholar] [CrossRef]
- Zhao, L. Event Prediction in the Big Data Era: A Systematic Survey. ACM Comput. Surv. 2021, 54, 1–37. [Google Scholar] [CrossRef]
- Inceoglu, F.; Jeppesen, J.; Kongstad, P.; Hernández Marcano, N.; Jacobsen, R.; Karoff, C. Using Machine Learning Methods to Forecast if Solar Flares Will Be Associated with CMEs and SEPs. Astrophys. J. 2018, 861, 128. [Google Scholar] [CrossRef]
- De Caigny, A.; Coussement, K.; De Bock, K.W. A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees. Eur. J. Oper. Res. 2018, 269, 760–772. [Google Scholar] [CrossRef]
- Lin, Y.L.; Yen, M.F.; Yu, L.C. Grid-Based Crime Prediction Using Geographical Features. ISPRS Int. J. Geo-Inf. 2018, 7, 298. [Google Scholar] [CrossRef]
- Kamstrup A/S. 2023. Available online: https://www.kamstrup.com/en-en (accessed on 12 September 2023).
- Kamstrup A/S. MULTICAL 603—Data Sheet. 2023. Available online: https://www.kamstrup.com/en-en/heat-solutions/meters-devices/meters/multical-603/documents (accessed on 8 May 2024).
- Wang, P.G.; Scharling, M.; Wittchen, K.B.; Kern-Hansen, C. Teknisk Rapport 13–18: 2001–2010 Dansk Design Reference Year Supplerende Datasæt; Technical Report; Danish Metrological Institute: Copenhagen, Denmark, 2013. [Google Scholar]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Rokach, L.; Maimon, O. Data Mining with Decision Trees: Theory and Applications; World Scientific Publishing Co. Pte. Ltd.: River Edge, NJ, USA, 2015. [Google Scholar]
- Witten, I.H.; Frank, E. Practical Machine Learning Tools and Techniques, 2nd ed.; Elsevier: Amsterdam, The Netherlands, 2016. [Google Scholar]
- Alpaydin, E. Introduction to Machine Learning; MIT Press: Cambridge, MA, USA, 2010. [Google Scholar]
- Mitchell, T.M. Machine Learning: Generative and Discriminative Classifiers: Naive Bayes and Logistic Regression. 2005. Available online: https://api.semanticscholar.org/CorpusID:482346 (accessed on 8 May 2024).
Variable Name | Variable | Unit | Description |
---|---|---|---|
ConsumerID | ID | - | A unique ID for each consumer. |
DateTime | DT | - | It is in the format DD-MM-YYYY HH:MM. |
Energy consumption | E | [GJ] | The heat energy utilized over a one-hour period. |
Forward flow · forward temperature | [m3·°C] | The total water flow for each hour multiplied by the forward temperature. One can extract the temperature again by dividing by the flow, F. This is the setup, as you then can extract the mean forward temperature for that hour instead of an instantaneous value. | |
Return flow · return temperature | [m3·°C] | The total water flow for each hour multiplied by the return temperature. | |
Flow | F | [m3] | The total flow of water that hour. |
Alarm # | Description | Potential Manifestation in Data |
---|---|---|
8 | When the temperature sensor either violates a specified upper threshold (165 °C degrees in these data) or is disabled. | Forward temperature increasing values over time (drift). |
128 | Wrong (forward-return), meaning the difference between forward and return temperature. . | decreasing over time until it reaches zero and goes negative. |
2048 | The water flow rate violates a specified threshold for more than one hour. | Heightened flow values for each consecutive hour until reaching the set limit. |
RF | Parameters | n_estimators | max_depth | min_samples_split | min_samples_leaf | bootstrap |
Values | 10–150 | 3–50 | 2–10 | 1–5 | True, False | |
Default | 100 | None | 2 | 1 | True | |
KNN | Parameters | n_neighbors | leaf_size | p | weights | metric |
Values | 1–50 | 2–50 | 1–2 | Uniform, Distance | Minkowski, Chebyshev | |
Default | 5 | 30 | 2 | Uniform | Minkowski | |
AdaBoost | Parameters | n_estimators | learning_rate | |||
Values | 10–200 | 0.1–1 | ||||
Default | 50 | 1 |
ML Method | Setup | Precision | TPR | AP |
---|---|---|---|---|
RF | Default | 88% | 79% | 91% |
RGS | 93% | 66% | 89% | |
KNN | Default | 90% | 57% | 79% |
RGS | 95% | 4% | 63% | |
SVM | Default | 93% | 72% | 91% |
AdaBoost | Default | 89% | 75% | 90% |
RGS | 91% | 72% | 90% | |
GNB | Default | 90% | 74% | 89% |
Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | |
---|---|---|---|---|---|
KNN | 78% | 71% | 56% | 80% | 73% |
RF | 75% | 74% | 54% | 78% | 73% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Søndergaard, H.A.N.; Shaker, H.R.; Jørgensen, B.N. Enabling Alarm-Based Fault Prediction for Smart Meters in District Heating Systems: A Danish Case Study. Smart Cities 2024, 7, 1126-1148. https://doi.org/10.3390/smartcities7030048
Søndergaard HAN, Shaker HR, Jørgensen BN. Enabling Alarm-Based Fault Prediction for Smart Meters in District Heating Systems: A Danish Case Study. Smart Cities. 2024; 7(3):1126-1148. https://doi.org/10.3390/smartcities7030048
Chicago/Turabian StyleSøndergaard, Henrik Alexander Nissen, Hamid Reza Shaker, and Bo Nørregaard Jørgensen. 2024. "Enabling Alarm-Based Fault Prediction for Smart Meters in District Heating Systems: A Danish Case Study" Smart Cities 7, no. 3: 1126-1148. https://doi.org/10.3390/smartcities7030048
APA StyleSøndergaard, H. A. N., Shaker, H. R., & Jørgensen, B. N. (2024). Enabling Alarm-Based Fault Prediction for Smart Meters in District Heating Systems: A Danish Case Study. Smart Cities, 7(3), 1126-1148. https://doi.org/10.3390/smartcities7030048