1. Introduction
District heating systems (DHSs) are commonly used as an effective means of supplying energy to the residential sector. In 2021, district heat accounted for 8.6% and 18.3% of household energy consumption in the EU and Poland, respectively [1]. The total length of heating networks in Poland rose by 6.6% between 2012 and 2022 [2], reaching over 25,000 km, of which 96% was located in urban areas [3].
Heat load in DHSs is influenced by numerous factors [4,5]. Hence, for the efficient operation of DHS plants, various heat load prediction techniques have been developed during recent decades. In the literature [6,7,8], these techniques are divided into physical-based, data-driven and hybrid methods. Physical-based models produce correlations between heat load and physical input factors (DHS parameters, material characteristics, weather, etc.). As they require a large amount of input data, they are time consuming. Data-driven models use historical energy demand and influencing factors to predict thermal load without requiring a physical analysis of DHSs [9]. Taking advantage of this feature, in combination with the use of computer techniques, a wide range of approaches have been developed, especially machine learning solutions [10]. Among others, artificial neural networks (ANNs) are popular due to their good computing time [11], lower complexity in multi-variable problems [12] and easy adaptation to system nonlinearities [13] and various heat load profiles [14]. However, despite the rapidly rising number of applications, machine learning and deep learning still have low shares in DHS applications [15]. Therefore, this area of ANN use is of interest.
Most works have focused on heat load prediction in DHSs, applying various ANN-based models with different sets of input variables (Table 1).
In general, the authors mainly compared various machine learning solutions in terms of their prediction accuracy. Wei et al. [32] analysed seven models, concluding that the weather forecasting data most strongly influenced model performance. Li et al. [4] proposed an Elman neural network to model heat load using ambient temperature and a sunlight factor as input variables. Using experimental data from a heating company, they achieved a relative error for short-term prediction below 2%. A different approach was presented by Bujalski et al. [33]. The authors compared results from ANN predictions with those from an exponential regression model with ambient air temperature as input and obtained forecast accuracies of 6.86% and 12.92%, respectively. As more accurate planning of heat or electricity generation is of significant economic importance [34], there is a need to analyse the potential benefits of ANN-based tools relative to the commonly used heat load demand curves.
As presented in Table 1, most of the studies were devoted to 24 h ahead heat-load forecasts based on hourly weather data. The authors focused on comparing various ANN-based models. The use of heating curves was not shown, although in our experience these models are often used in numerous low-power facilities. In addition, such facilities often do not possess reliable archived hourly data on ambient climate and heat production; in practice, only general daily data are available for smaller-capacity facilities. In such cases, the question arises as to whether these data can be used to forecast heat production given only the basic parameter defining ambient conditions, i.e., the air temperature. This is the gap that emerged from the literature analysis presented.
The second task was the use of freely available software, so that the user is not restricted by the need for specialised commercial software or additional costs. Therefore, the Python language (version 3.10) was chosen, as it is often used for data analysis and scientific computing. It also contains various packages for machine learning applications [35], which makes this solution cheap and flexible in terms of future development.
This paper aimed to develop a neural network model to predict the heat demand met by a district heating plant and to assess its accuracy relative to typical heating curves. The resulting model was used to write a simple application that can be used in the operation of a district heating plant to forecast heat demand based on the ambient temperature on a given day.
This paper is structured as follows. In the design section, the process of analysing the obtained data and adapting them to the computational needs of the neural network model is presented and discussed. Then, the process of building the neural network model based on the available libraries, which are briefly presented, is described. Once the prepared model is trained and all validation calculations are performed, an interface is prepared for the heating plant staff. Then, the conclusions are given.
2. Materials and Methods
2.1. Research Strategy
The research algorithm is outlined in Figure 1. After briefly detailing the plant characteristics, a measurement data analysis was performed. All variables were outlined and inconsistencies were identified to prepare the input dataset for further analysis.
In the next step, two kinds of forecasting models were developed and examined. The architecture of the ANN model was selected after a literature review, and then several tests and validations were conducted, showing the model's ability to work properly with the specific input dataset. Finally, it was tested and its quality was evaluated using the proposed performance indicator. Similarly, using the same data, five regression models were developed and then tested. Lastly, a comparison of the models was presented.
2.2. Operation Parameters of the Heating Plant
This study uses measurement data obtained from a municipal heating plant located in western Poland. The plant is supplied mostly by natural gas, the consumption of which in 2022 was 2.4 × 10⁶ m³. This is the main fuel used at the heating plant, but heating oil can also be used; its consumption in 2022 was 644 m³. In the further analysis, daily data from 1 January 2019 to 30 June 2022 were used.
The purpose of a district heating plant is to supply heat to consumers with a system of pipelines. The heat supplied can be determined by measuring the parameters of the district heating network. Water at a set temperature feeds into the network and then returns to the district heating plant at a correspondingly lower temperature. The temperature difference, the mass of water flowing through the network and the specific heat of the water allow for the calculation of the heat that has been discharged into the district heating network.
Q = m · c · ΔT,

where

m = ṁ · Δτ = ρ · V̇ · Δτ,

with
m—the mass of the water in the heating network, kg;
ρ—the density of the water, kg/m³;
c—the specific heat of the water, J/(kg·K);
ṁ—the mass flow rate of the water in the heating network, kg/s;
V̇—the volume flow rate of the water in the heating network, m³/s;
Δτ—the time, s;
ΔT—the difference between the supply and return temperature, K.
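The relation above can be expressed as a short helper function. This is a minimal sketch; the function name and the default water properties (indicative values for hot network water) are illustrative assumptions, not the plant's actual software.

```python
def heat_discharged(volume_flow_m3s, time_s, delta_T_K,
                    density_kgm3=971.8, specific_heat_JkgK=4195.0):
    """Heat discharged into the network, Q = rho * V_dot * delta_tau * c * delta_T.

    Default density and specific heat are indicative values for hot network
    water (around 80 degrees C); in practice, measured values would be used.
    Returns the heat in joules.
    """
    mass_kg = density_kgm3 * volume_flow_m3s * time_s  # m = rho * V_dot * delta_tau
    return mass_kg * specific_heat_JkgK * delta_T_K    # Q = m * c * delta_T
```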
Some important aspects that may be of particular interest to energy engineers are issues such as the parameters of the district heating network, heat losses in the pipeline and ways of feeding the network. The parameters analysed and their influence on the operation of a district heating plant are given in Table 2.
The supply pressure of the heating water at the inlet of the network can be controlled in the range of 0.18–0.70 MPa. In this way, it is possible to increase the water mass flow rate. Together with the supply temperature, this controls the heating power of the plant [36].
The parameters presented above mostly depend strictly on the operation of the plant, and forecasting them would be pointless. For example, the temperature of the heating medium at the return of the network is a result of the supply temperature and the thermal energy transferred to the end-users. After an in-depth analysis, it becomes apparent that the key parameter affecting the heat demand of consumers is the ambient temperature (Figure 2); it was chosen for further analysis.
Figure 3 presents the relation between daily heat demand and ambient temperature for the whole analysed period. During the warm season, when the buildings connected to the considered network are not heated, the heat demand is almost constant (below 100 GJ), which can be attributed mainly to tap water heating and small commercial consumers.
Peak thermal power was 7.6 MW, 7.0 MW and 6.7 MW in 2021, 2020 and 2019, respectively (Figure 4). Despite differences in daily values, the presented heating power duration curves are very similar. Space heating of buildings lasted for about two-thirds of the year, until the 240th day.
2.3. Input Data Preparation and Preliminary Analysis
The measurement data obtained from the district heating company were prepared in Microsoft Excel 2007 spreadsheets, but without a consistent data storage format. In order to import the data, it was necessary to organise them in advance and check the validity of the data types (Table 3). To this end, after data import, information was obtained on the type of data stored in consecutive columns. The measurements in all columns except the date column are of the floating-point type, as expected.
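The type check described above can be sketched with pandas. The records and column names below are synthetic stand-ins for the plant spreadsheets; in practice, the unified workbooks would be loaded with pd.read_excel.

```python
import pandas as pd

# Synthetic records standing in for the plant spreadsheets.
df = pd.DataFrame({
    "date": pd.to_datetime(["2019-01-01", "2019-01-02"]),
    "heat_GJ": [512.3, 498.7],
    "t_ambient_C": [-2.1, 0.4],
})

# Confirm that every column except the date holds floating-point values.
numeric_ok = all(df[c].dtype == "float64" for c in df.columns if c != "date")
```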
Another problem encountered after data import was that unknown values were present in many records. The most significant gaps were observed in the last two columns of the data, i.e., the operating times of the boilers. Due to the lack of information on the operating principles of the heating block, these columns were omitted; this simplification did not take into account the breakdown of operation by individual boilers. Where the data were complete, periods were observed when the boilers operated simultaneously or in configurations dependent on servicing or fault repair. This simplification unfortunately loses the ability to account for the relationship between an individual unit and the parameters obtained during the operation of the heating plant. With only 1593 data records available, it was necessary to focus on the most important problem, which is heat demand forecasting.
The next step in the preliminary analysis of the imported data was to calculate basic statistical information on each of the variables, such as the sum, mean value, median value, standard deviation, minimum value, maximum value, and percentages in the ranges presented (Table 4).
Preliminary analysis of the data leads to the conclusion that the minimum values of the presented columns should not be equal to 0. Such a situation is possible, but not during the normal operation of the heat plant. Therefore, all zero records were deleted.
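The removal of zero records can be sketched as a one-line pandas filter. The frame below is a small synthetic example with illustrative column names, not the actual dataset.

```python
import pandas as pd

# Synthetic example: two of the three rows contain a zero measurement.
df = pd.DataFrame({
    "heat_GJ": [512.3, 0.0, 498.7],
    "t_supply_C": [82.0, 81.5, 0.0],
})

# Keep only records where no measurement column equals zero.
clean = df[(df != 0).all(axis=1)]
```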
After this operation, the set of records was reduced to 1534. Next, to determine the relationships between the individual data columns, a heat map was generated using the Seaborn library. This map (Figure 5) indicates the degree of correlation between the data, as determined by the value of Pearson's correlation coefficient.
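The heat map step can be sketched as follows. The data are synthetic (ambient temperature driving heat demand with added noise) and the variable names are illustrative; the study's map was generated from the cleaned measurement columns.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen, no display needed
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns

# Synthetic data: heat demand falls as ambient temperature rises.
rng = np.random.default_rng(0)
t_amb = rng.normal(5.0, 8.0, 200)
df = pd.DataFrame({
    "t_ambient": t_amb,
    "heat_GJ": 400.0 - 20.0 * t_amb + rng.normal(0.0, 30.0, 200),
})

# Pearson correlation matrix rendered as a heat map.
corr = df.corr(method="pearson")
sns.heatmap(corr, vmin=-1.0, vmax=1.0, annot=True, cmap="gray")
plt.savefig("heatmap.png")
```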
The black cells represent correlations close to −1, i.e., as the value of one parameter increases, the other decreases. The white cells represent correlations close to 1, i.e., as one parameter increases, the other also increases.
The data presented above include only measured values. Sampling was carried out once a day, so each measurement is linked to a reading date. Based on the date, two additional columns were created: the day of the week (Monday–Sunday), with values from 0 to 6, and the month (January–December), with values from 1 to 12.
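The two derived columns can be produced directly from the date column with pandas; the dates below are synthetic examples.

```python
import pandas as pd

# Synthetic reading dates: a Monday in January and a Sunday in December.
df = pd.DataFrame({"date": pd.to_datetime(["2019-01-07", "2019-12-01"])})

# Monday = 0 ... Sunday = 6, months numbered 1-12, as described above.
df["weekday"] = df["date"].dt.dayofweek
df["month"] = df["date"].dt.month
```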
The next step was to create graphs for each pair of parameters to illustrate the correlation between them (Figures 6–9). Some of the graphs show that certain data are redundant; for example, as the supply and return temperatures were available, there was no need to use their difference.
3. Simulation Programme
3.1. Programming Environment
The data analysis, as well as the entire process of creating the neural network, was carried out using the Python language, which is clear and versatile. The availability of numerous libraries, add-ons and modules that enable simple work with data makes this language commonly used in many industrial and scientific applications. As an open-source product, it is readily used on all major platforms, such as Windows, Linux and macOS. Python is a high-level language, which makes the programme code concise, clear and, therefore, understandable for the user.
3.2. Keras Library
Keras is a machine learning library for the Python language, first released in 2015. Keras offers consistent and simple APIs, minimises the number of user actions required for typical use cases and provides clear and useful error messages. It also includes extensive documentation and developer guides. Keras can run on a variety of engines, such as TensorFlow, Theano, PlaidML, MXNet and CNTK. In this study, the TensorFlow engine was used. As Keras runs on TensorFlow, it is fully GPU-compatible, which contributes significantly to speeding up model training on large training sets.
3.3. Simulation Algorithm
The programme developed consists of two files. The first one defines the computational model of the neural network and performs the training of the network based on the imported data; the result of running it is the file model.keras, which stores the exported network model. Then, to use the created model in practice, a file defining the user interface was written. Here, the windows and functions are defined (Figure 10).
3.4. Data Scaling
Due to the variety of input data, the different units used and the different orders of magnitude of the stored values, it was necessary to scale the data so that no column dominated merely because of the magnitude of its values. To this end, the column values were standardised: for each column, the arithmetic mean and standard deviation were calculated; the mean was then subtracted from each value and the result was divided by the standard deviation, centring the values around zero with unit standard deviation:

x′ = (x − x̄)/σ,

with
x—the original value in the column;
x̄—the arithmetic mean of the column;
σ—the standard deviation of the column.
To avoid the influence of the test set on the value of the arithmetic mean and standard deviation, these were calculated using the training set and were then used to scale part of the test data.
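The leakage-free scaling described above can be sketched with NumPy; the function name is illustrative and not part of the study's code.

```python
import numpy as np

def standardise(train, test):
    """Scale both sets with the mean and standard deviation of the
    training set only, so the test set cannot leak into the statistics."""
    mean = train.mean(axis=0)
    std = train.std(axis=0)
    return (train - mean) / std, (test - mean) / std
```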
The scaled data were fitted for further analysis. Scaling can take various forms, and in some cases, more complex methods are required, e.g., when dealing with graphics files, it is necessary to convert graphics into a sequence of numerical values.
An additional way to adapt the data to computational needs is normalisation, which reduces the data to values in the interval <0; 1>:

x′ = (x − x_min)/(x_max − x_min),

with
x_min—the minimum value of the column;
x_max—the maximum value of the column.
This method is typically used when the data do not follow a normal distribution; for normally distributed data, standardisation works well.
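Min–max normalisation can be sketched analogously; again, the function name is illustrative, and the bounds would be taken from the training data.

```python
import numpy as np

def normalise(x, x_min, x_max):
    """Min-max normalisation to the interval <0; 1>, with bounds
    (x_min, x_max) taken from the training data."""
    return (x - x_min) / (x_max - x_min)
```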
3.5. Model of the Neural Network
Among the vast variety of possible neural network architectures, the authors decided to use a deep neural network (DNN) with a quite simple architecture, a feedforward neural network (FNN). This kind of neural network allows for regression, classification and the processing of signals such as time series, while imposing low hardware requirements. This is achieved through several simplifications of the network design: the data flow is unidirectional, there are no loops or feedback, and the data do not return to previous layers. At the same time, such networks can have a multilayer structure, allowing for deeper representation, good generalisation, error reduction, flexibility in modelling and relatively high optimisation capabilities.
Due to the low number of input data records (~1500), there is a high probability of the network overfitting during the learning process. Building a small network is one way to minimise the occurrence of this phenomenon. To choose the right solution, we performed a literature review of similar solutions (Table 5).
Based on it, we developed a network with two hidden layers. It was created using the Sequential class from the Keras library, which means that layers are added sequentially one by one (Figure 11).
The layers added to the model are dense (fully connected) layers. The first and second dense layers have 64 units each (neurons, i.e., k = 64 in Figure 11) and use the Rectified Linear Unit (ReLU) activation function [45]. The last dense layer has a single unit, as the model is intended for regression (predicting continuous values). In this layer, no activation function was defined, which allows a full range of values at the output. If the sigmoid function were used, the generated values would lie in the range 0–1 and would require an additional transformation to a range appropriate for the forecasted parameter.
The model was compiled with the RMSprop optimiser. The MSE (mean squared error) loss function was used to optimise the model and was minimised during training. The MAE (mean absolute error) metric was used to assess the performance of the model; its output was not used when training the model.
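The architecture and compilation settings described above can be sketched as follows. The input dimension is a placeholder assumption; the study's model takes the prepared feature columns.

```python
from tensorflow import keras
from tensorflow.keras import layers

n_features = 1  # placeholder; set to the number of prepared input columns

model = keras.Sequential([
    layers.Input(shape=(n_features,)),
    layers.Dense(64, activation="relu"),  # first hidden layer, k = 64
    layers.Dense(64, activation="relu"),  # second hidden layer, k = 64
    layers.Dense(1),                      # linear output for regression
])
model.compile(optimizer="rmsprop", loss="mse", metrics=["mae"])
```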
3.6. Algorithm for k-Fold Cross-Validation
The k-fold cross-validation algorithm was used to divide the entire dataset into smaller parts and then use one of them to validate the learning process. The algorithm was executed as many times as needed so that each part, which by default belonged to the training set, at some point became the validation set. The use of this type of algorithm minimises the error resulting from the random selection of a validation set.
Figure 12 shows the data partitioning scheme used in the network learning process for k = 4.
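The partitioning scheme can be sketched as a small generator; the function name is illustrative.

```python
import numpy as np

def k_fold_splits(n_samples, k=4):
    """Yield (train_idx, val_idx) pairs; each fold serves exactly once
    as the validation set, as in the scheme of Figure 12."""
    folds = np.array_split(np.arange(n_samples), k)
    for i in range(k):
        train_idx = np.concatenate([f for j, f in enumerate(folds) if j != i])
        yield train_idx, folds[i]
```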
3.7. User Interface
In order to use the neural network model in an easy and intuitive way, an interface intended for the users, i.e., the process engineers of the district heating plant, had to be prepared. To this end, the PyQt5 module, dedicated to GUI window design, was used. A tool for creating and editing windows, Qt Designer, is available with the PyQt5 library. It is a very intuitive editor that significantly speeds up the programmer's work, offering many basic options for selecting common elements such as buttons, windows, checkboxes or labels. It is possible to create a new window and then use the drag-and-drop method to add the desired elements. Once the final visualisation of the window has been created, it can be saved as a file with the extension .ui, which then needs to be converted to a file with the extension .py in order to configure the logic behind the functions and individual elements.
The algorithm of this part of the developed tool is presented in Figure 13. After loading the developed Keras model, the programme's main window presents a welcome message and information about the measurement data used to create the neural network model. When the button is clicked, an auxiliary window opens with a request for the forecast ambient temperature for the day indicated by the user. After entering the value and confirming with the 'OK' button, a function computes the heat demand based on the entered temperature. The calculation requires the neural network model created by running the master file. The forecast value is presented in a message in the main window.
5. Conclusions
The main research objective of this study was to develop an efficient heat demand forecasting model using a neural network. A simple ANN model with two hidden layers was written in Python and built using freely available software. As it is commonly available among the variables measured in heating plants, the ambient air temperature was used as the input parameter to this model. The accuracy of the prediction, assessed by the mean absolute percentage error (MAPE = 15%), was at a level comparable with several studies. Some authors reported better results; however, they focused on hourly forecasts using hourly weather data, whereas in this study only a limited amount of daily input data was available. Therefore, it can be concluded that the developed model performed properly.
In the next step, using the same input data, four heating curves were developed and then used in the heat demand forecast. To compare their performance with the new model, a new indicator, prediction effectiveness, was proposed. The presented results showed that the ANN model performed best.
The conclusions confirm that neural networks are promising tools in the field of artificial intelligence. However, to fully exploit their potential, further research and development of the technology are needed, especially in the context of heat demand forecasting.
In further steps, it would be advisable to develop the presented topic in two ways. The first is the measurement of additional meteorological parameters influencing heating demand, such as solar irradiance or wind speed. The second is to perform hourly measurements instead of the current measurements with daily resolution. Then, a new, more accurate forecast model could be developed and practically used.