Towards Sustainable Energy: Predictive Models for Space Heating Consumption at the European Central Bank

Almeida, Fernando; Castelli, Mauro; Côrte-Real, Nadine

doi:10.3390/environments12040131

Open AccessArticle

Towards Sustainable Energy: Predictive Models for Space Heating Consumption at the European Central Bank

by

Fernando Almeida

^*

,

Mauro Castelli

and

Nadine Côrte-Real

NOVA Information Management School, Universidade NOVA de Lisboa, 1070-312 Lisboa, Portugal

^*

Author to whom correspondence should be addressed.

Environments 2025, 12(4), 131; https://doi.org/10.3390/environments12040131

Submission received: 19 February 2025 / Revised: 8 April 2025 / Accepted: 18 April 2025 / Published: 21 April 2025

(This article belongs to the Special Issue Life Cycle Assessment: Methods and Tools to Achieve Sustainable Decarbonization and Circular Economy in the Building Sector)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Space heating consumption prediction is critical for energy management and efficiency, directly impacting sustainability and efforts to reduce greenhouse gas emissions. Accurate models enable better demand forecasting, promote the use of green energy, and support decarbonization goals. However, existing models often lack precision due to limited feature sets, suboptimal algorithm choices, and limited access to weather data, which reduces generalizability. This study addresses these gaps by evaluating various Machine Learning and Deep Learning models, including K-Nearest Neighbors, Support Vector Regression, Decision Trees, Linear Regression, XGBoost, Random Forest, Gradient Boosting, AdaBoost, Long Short-Term Memory, and Gated Recurrent Units. We utilized space heating consumption data from the European Central Bank Headquarters office as a case study. We employed a methodology that involved splitting the features into three categories based on the correlation and evaluating model performance using Mean Squared Error, Mean Absolute Error, Root Mean Squared Error, and R-squared metrics. Results indicate that XGBoost consistently outperformed other models, particularly when utilizing all available features, achieving an R² value of 0.966 using the weather data from the building weather station. This model’s superior performance underscores the importance of comprehensive feature sets for accurate predictions. The significance of this study lies in its contribution to sustainable energy management practices. By improving the accuracy of space heating consumption forecasts, our approach supports the efficient use of green energy resources, aiding in the global efforts towards decarbonization and reducing carbon footprints in urban environments.

Keywords:

space heating consumption; office buildings; sustainable energy; machine learning; deep learning

1. Introduction

The study of building energy demand has gained significant importance due to the growing focus on energy sustainability, especially following the European Directive on Energy Performance of Buildings (EPB) implementation. In Europe, buildings are responsible for 40% of total energy consumption and 36% of total CO₂ emissions [1]. Accurate predicting building energy consumption is crucial for effective energy management, as it helps identify abnormal energy usage and diagnose potential causes, provided that sufficient historical data is available [2]. However, conventional energy prediction methods often fall short due to their reliance on rigid assumptions and limited adaptability to dynamic energy consumption patterns.

Recently, there has been a shift from merely calculating energy consumption to analyzing the actual energy use of buildings [3,4]. This shift is driven by the complexity of building energy systems and behavior, which non-calibrated models fail to accurately predict, thus requiring real-time data analysis of energy use. Traditionally, estimating building energy use involves applying a model with known system structures, properties, and external variables (forward approach). These engineering methods utilize physical principles to calculate thermal dynamics and energy behavior at the building or sub-component level [5]. Despite their theoretical accuracy, these physics-based models often struggle with scalability and require extensive domain expertise. Furthermore, their reliance on detailed building specifications, which may not always be available, limits their practical applicability in large-scale or heterogeneous building environments.

Various software tools, such as DOE-2 (e.g., version 2.2) [6], EnergyPlus (e.g., version 9.6.0) [7], and TRNSYS (e.g., version 18.02) [8], have been developed for this purpose. However, these tools require detailed knowledge of numerous building parameters and behaviors, which are often unavailable. Consequently, simplified methods for predicting building energy use have been developed. For instance, the steady-state method using degree days was presented in [9]. Yao and Steemers [10] also introduced a simple method for formulating load profiles for U.K. domestic buildings using a thermal dynamic model to predict daily energy demand profiles for appliances, domestic hot water, and space heating. While these methods provide useful approximations, they often lack the flexibility to capture non-linear dependencies between variables and fail to adapt to evolving building occupancy patterns or climatic variations.

Organizational measures for energy efficiency encompass a variety of strategies aimed at reducing consumption while maximizing resource utilization [11,12]. Such methods include the installation of energy-efficient lighting systems [13], improving HVAC for better performance [14], and innovative building technologies [15]. Although these measures significantly contribute to energy conservation, their effectiveness is often constrained by high initial costs, resistance to change, and the lack of real-time monitoring mechanisms. The absence of adaptive strategies also limits their ability to respond to fluctuating energy demands.

In recent years, the development of energy management, especially in space heating consumption (SHC), has been transformed by the introduction and implementation of Artificial Intelligence (AI) with Machine Learning (ML) [16]. AI refers to imitating intelligence processes by computer systems, including learning, reasoning, and self-correction. At the same time, ML is a subset of AI that exclusively deals with developing algorithms that allow computers to learn, make predictions, or take actions based on data [17]. These technologies, including models such as neural networks [18], DTs [19], and support vector machines [20], are developed and used to optimize energy utilization, predict demand, patterns, and costs, and detect inefficiencies in various energy systems. For instance, neural networks can be trained to examine complicated energy data and give insights into energy usage patterns, which will help with making informed decisions regarding energy optimization strategies [21]. However, while ML-based approaches provide substantial improvements in predictive accuracy, their performance is often model-dependent and sensitive to data quality [22]. Many studies fail to incorporate diverse data sources, leading to biased predictions that do not generalize well across different buildings or climatic conditions. Furthermore, the black-box nature of some ML models raises concerns regarding interpretability, making it difficult for facility managers to trust and act on predictions. With the potential for real-time monitoring and adaptive control, Internet of Things (IoT) devices could help in dynamic adjustments that optimize energy consumption according to the situation. Similarly, predictive maintenance models can detect equipment failures and operational deterioration, enabling proactive interventions to be carried out before interruptions and the subsequent wastage of energy resources [14].

While this study presents a novel approach by integrating multiple ML models and localized weather data for SHC prediction, it builds upon a growing body of research exploring data-driven energy forecasting methods. Numerous studies have leveraged ML and DL models for building energy demand estimation, yet many focus on single techniques or limited feature sets, restricting their applicability across different climatic and operational contexts. A comprehensive review of prior works is necessary to contextualize our study, highlight gaps in existing methodologies, and reinforce the need for a comparative ML framework in SHC forecasting.

With new developments in ML, it has been possible to achieve considerable progress in the past few years [23]. For instance, Xue et al. [24] proposed an ML-based framework that applies to multi-step-ahead district heating system load forecasting by testing SVR, deep neural network, and extreme gradient boosting (XGBoost) models. Furthermore, Li & Yao [25] developed a system for the prediction of building heating as well as cooling loads that includes occupant behavior as a predictor variable and also considers five ML models. Additionally, Jovanović, Sretenović, and Živković [26] investigate the prediction of heating energy consumption for a university campus, employing various artificial neural network architectures. However, challenges remain in refining predictive approaches due to the complex relationship between input variables and SHC, particularly in diverse campus settings.

On the other hand, Yuan et al. [27] introduce a novel sample data selection method (SDSM) to enhance the prediction accuracies of Back Propagation Neural Network (BPNN) and Multi-layer Perceptron MLP models for heating energy consumption, demonstrating significant reductions in training and prediction errors for BPNN models. Moreover, Potočnik, Škerl, and Govekar [28] present an ML-based approach for short-term heat demand forecasting in district heating systems, highlighting the superiority of Gaussian Process Regression (GPR) in achieving accurate forecasts for the most prominent Slovenian DH system. Jang et al. [29] focus on enhancing the prediction accuracy of building heat consumption using LSTM models, showing improved performance when operation pattern data from non-residential buildings is incorporated. However, challenges remain, such as the limited applicability of SDSM to MLP models and the need for more comprehensive data to enhance model accuracy in diverse building scenarios.

ML models have been applied to predict building heat loads, optimize energy consumption, and enhance sustainability. Dalipi et al. [30] introduce a supervised ML model for predicting heat load in a district heating system, evaluating SVR, Partial Least Square, and RF algorithms. This study focuses on different algorithms’ performance in a district heating context, highlighting their varying predictive capabilities. In another approach, Moradzadeh et al. [31] propose a methodology for forecasting heating and cooling loads in residential buildings using MLP and SVR techniques. This study explicitly targets residential buildings and emphasizes the predictive accuracy of these models in that context. Abdelkader et al. [32] address the need for energy-efficient buildings by comparing various ML models, finding that the radial basis neural network performs better. However, they identify limitations such as the focus on specific meteorological parameters and reliance on simulated data, which may impact the real-world applicability of these models.

In addressing the imperative for energy efficiency across diverse domains, Shen et al. [33] emphasize optimizing energy usage in greenhouses to reduce production costs, employing mathematical modeling and algorithmic optimization. Similarly, Moradzadeh et al. [34] contribute to advancing accurate building energy consumption prediction, focusing on residential buildings’ cooling and heating loads. Their novel hybrid model, Gaussian SVR, integrates the Group Method of Data Handling (GMDH) and SVR techniques, presenting promising results, albeit with complexities and applicability concerns. Furthermore, while Shen et al. achieve promising outcomes, challenges persist due to the intricacies of greenhouse energy exchange and seasonal variations, suggesting further research to expand the model’s scope to address summer cooling and optimize temperature settings throughout the week.

Building energy consumption prediction, particularly for heating loads, has been explored through various ML and optimization techniques such as SVR, neural networks, and hybrid models as shown in Table 1. These studies have demonstrated promising results despite the absence of certain critical variables, indicating the potential for further advancements in energy forecasting methodologies. However, the exclusion of essential parameters suggests the need for more comprehensive models that incorporate a broader range of influential factors. Addressing these gaps, this study aims to enhance the precision and generalizability of SHC predictions at the European Central Bank headquarters by evaluating and comparing multiple ML models using data from multiple weather stations. The goal is to improve demand forecasting accuracy, support sustainable energy management, and contribute to global decarbonization efforts.

The contributions of this study are as follows:

We conducted a thorough comparison of various ML and DL models, including K-Nearest Neighbor (KNN), Support Vector Regression (SVR), Decision Tree (DT), Linear Regression (LR), XGBoost, Random Forest (RF), Gradient Boosting (GB), AdaBoost, Long-Short-Term-Memory (LSTM), and Gated Recurrent Unit (GRU), to determine the most accurate model for SHC prediction.
We incorporated data from four distinct weather stations, improving the generalizability of the findings and ensuring that the models were evaluated across diverse climatic conditions.
We utilized two comprehensive feature sets (Feature Set 1 and Feature Set 2) derived from detailed weather data, enhancing the robustness and depth of the analysis.

Furthermore, this study underscores the potential for organizations to develop in-house energy management solutions in compliance with the EPB European Directive, promoting energy efficiency and sustainability.

2. Dataset

The dataset includes operational and environmental aspects of SHC consumption at the European Central Bank (ECB) Headquarters in Frankfurt. This state-of-the-art building has multiple sensor networks as part of an advanced Building Control System, enabling granular analysis.

Data from the district heating network supplying the building was collected. It was complemented with data from several weather stations located around and on top of the building, selected based on proximity. This comprehensive dataset offers a rich resource for understanding the intricate interplay between environmental variables and heating demands.

These data are used for historical analysis to understand building heating demands and make high-level estimations for the future. However, adopting ML techniques would significantly enhance the accuracy and efficiency of these forecasts. ML can analyze complex patterns and relationships within the data that traditional methods might overlook, leading to more precise and reliable predictions. This comprehensive dataset, therefore, offers a rich resource for understanding the intricate interplay between environmental variables and heating demands, providing a solid foundation for advanced predictive modeling through ML.

2.1. Heating Consumption Data

SHC data give an overview of energy consumption within a building by providing a list of important parameters, including SHC value, volume, inflow water temperature, and return water temperature. These data were obtained from a district heating network and represent the actual heating demand required to heat the entire building area, which is actively used by real-world ventures. The heating load in the studied building is primarily used for space heating and hot water supply. The contribution of each component varies, with space heating being more dependent on weather conditions such as outdoor temperature and humidity, while hot water supply exhibits a more stable demand pattern influenced by occupancy schedules. The district heating system is the baseload supplier, giving the building foundation a stable and diversified heat provision.

This data were collected by a smart meter, which is fitted to the premises’ technical areas. This device, outfitted with different sensors, detects and accurately documents the heating usage. The data monitor is situated at the interface between the district heating network provider and the office building’s heating system. This data monitor is an integral part of the overall data acquisition process.

The Heating Degree Day (HDD) is a metric that can estimate the energy demand required to heat a building. It represents the number of degrees by which a day’s average temperature falls below a specific base temperature, the threshold below which buildings require heating [35]. In this case, the 15 °C threshold has been selected based on expert knowledge, ensuring accuracy and relevance in measuring the building heating requirements. It is important to note that different climatic regions may employ different base temperatures to account for local variations in heating needs. Heating Degree Hours (HDH) is used in this study because it provides a more granular and precise measure of heating demand than HDD, particularly when evaluating hourly temperature data. The formula for calculating HDH is as follows:

\{\begin{matrix} \frac{15 - T_{o u t} (h)}{24} i f T_{o u t} (h) < 15 \\ 0 i f T_{o u t} (h) \geq 15 \end{matrix}\}

(1)

where T_out is the hourly outside temperature, and HDH represents the hourly degree hours.

By using HDH, we can capture short-term variations in temperature that influence heating needs, leading to more accurate predictions of energy consumption for Heating.

2.2. Weather Variables from the Building Weather Station (BWS)

The weather variable dataset consists of various meteorological parameters for evaluating atmospheric conditions, including temperature, humidity, wind speed, etc. These factors reflect weather pattern dynamics that influence building operations and energy usage. This dataset, collected from a weather station atop the ECB skyscraper, provides localized insights. It has up-to-date weather information that applies to the immediate surroundings of the building. Positioning the weather station at such an elevation maximizes the gathering of appropriate data that echo the building’s atmosphere.

Connection with the Building Automation System (BAS) enhances the efficiency and availability of data. The weather station has a smooth link with the BAS, making data transmission and storage much easier. With this integration in place, the BAS control rooms are adequately equipped to provide the building managers and operators with real-time monitoring of weather conditions, thus enabling them to make informed decisions based on up-to-date weather information. Additionally, the BAS’s historical data analysis functionality allows retrospective weather trends and patterns. The BAS makes it possible to archive weather data over time, allowing for deep analysis and enabling key actors to find connections between weather variables and building performance metrics.

2.3. Local Weather Stations

The local weather data comprise a comprehensive archive of meteorological information collected from three local weather stations: Frankfurt Airport (station 1420), Frankfurt am Main–Westend (station 1424), and Offenbach Weather Park (station 7341). These stations are placed explicitly at varying distances from the main study building, each allowing for the specific study of localized weather conditions that may influence operations and energy management. This dataset is sourced from DWD (Deutscher Wetterdienst) and their climate data center, which is renowned as the most reliable and authoritative source for historical weather data. The DWD obtains a broader picture to understand weather phenomena and trends using weather stations covering diverse areas. The data acquisition process from the DWD’s climate data center is precise and includes careful extraction and compilation processes. Researchers access historical weather data from selected weather stations through the Deutscher Wetterdienst Climate Data Center on the DWD website.

2.4. Trends

Figure 1 illustrates the total heating degree hours (HDH15) categorized by season across four weather stations: the BWS, 7341, 1420, and 1424. Observing the data, the BWS records the highest total heating degree hours during the winter season, surpassing 7000, while 1424 exhibits the lowest total, with around 1420 heating degree hours. Conversely, as the graph indicates, the total heating degree hours exhibit a consistent pattern across all stations, with the winter season consistently registering the highest values and the summer season showing the lowest.

The second feature set covers various environmental factors relating to weather, time, and environmental conditions, vital to understanding the intricate interactions affecting SHC inside the building. This representation excludes direct measurements, in this case, ‘Heating water volume m³’, ‘Return temperature °C’, and ‘Flow temperature °C’, aimed at capturing the broader context of heating demand. For meteorological variables, it incorporates the following values: humidity, temperature, dew point, air temperatures, vapor pressure, absolute humidity, visibility, and wind speed and temporal indicators, such as season, month, weekday, day, and hour, it provides a complete framework that embraces the external factors that affect energy demand for Heating. These features are selected based on their well-established impact on heating demand, as demonstrated in the existing literature on building energy modeling. Moreover, introducing the smart meters’ data transmission helps to include features like precipitation yes/no, air pressure, wind direction, and year. Thus, the study becomes more diversified as it allows the examination of long-term trends and seasonal variations in heating demand.

This trend can be attributed to the fundamental principle that colder temperatures in winter necessitate increased heating to maintain indoor comfort levels. As a result, wintertime calls for more space SHC than other seasons and, hence, higher heating degree hours. On the other hand, in the summer period, less heat is required due to higher outside temperatures, reducing the number of heating degree hours.

Examining variations among the weather stations, the BWS records the highest total heating degree hours during winter seasons, indicating that it experiences colder temperatures or requires more heating resources than the other stations during the winter. On the other hand, local weather station 1424 frequently shows the lowest total heating degree hours, which may be due to milder temperatures or lower heating requirements.

2.5. Feature Selection

This study considers Feature Set 1 and Feature Set 2, offering unique insights into the factors influencing SHC and building energy efficiency. Feature Set 1 will cover the operational features, such as those relating to the heating system, that enable the analysis and improvement of the building’s performance. The measurements, like ‘Heating water volume m³’, ‘Return temperature °C’, and ‘Flow temperature °C’ are very informative. They provide the operating mode of the heating infrastructure for accurate monitoring and control. The selection of these features is based on their direct relevance to heating system performance, as supported by prior studies on energy consumption modeling in buildings. Besides that, one of the ways of enhancing the dataset accuracy is by using other indicators such as humidity, dew point, air temperature, vapor pressure, absolute humidity, visibility, relative humidity, sunshine duration, month, precipitation yes/no, air pressure, wind speed, wind direction, and day.

2.6. Feature Selection Methodology

These features are selected for their ability to capture both direct and indirect influences on SHC patterns within the building. First, the features of Feature Set 1 are the three dimensions, ‘Heating water volume m³’, ‘Return temperature [°C]’, and ‘Flow temperature [°C]’, as they are the parameters that determine the operation of the heating system in the building. Furthermore, Feature Set 2 has a vast collection of meteorological, temporal, and environmental variables like humidity levels, air temperature, and precipitation. These features are selected for their indirect but significant impact on heating demands, reflecting the ambient conditions and external factors that influence indoor temperature regulation and energy usage.

Feature Divisions

In this study, features were categorized into three groups: 3 main features, 7 main features, and all features, based on correlation analysis and model interpretability. This selection ensures better generalization, computational efficiency, and a more robust training process by focusing on highly correlated attributes to simplify model complexity. It is pertinent to mention that this study did not use Principal Component Analysis (PCA) or Recursive Feature Elimination (RFE) for feature selection, as our primary objective was to analyze the impact of high-dimensional feature sets on heating consumption prediction. Instead of reducing dimensionality through PCA, which transforms features into principal components, or using RFE, which iteratively removes less significant features, we relied on correlation-based feature selection. This approach allowed us to retain interpretable variables and assess their individual and combined influence on model performance.

The correlations between features and the target variable, Heating, vary across different weather stations due to each location’s unique environmental conditions and building characteristics. For instance, in Feature Set 1, the correlation between ‘Heating water volume m³’ and Heating may be higher for one weather station than others, reflecting the specific heating system dynamics in that building. Similarly, in Feature Set 2, the correlation between ‘humidity temperature’ and Heating may differ among weather stations, indicating variations in the influence of meteorological factors on heating demand.

Table 2 presents the correlation values for Feature Set 1, (which includes operational features related to the heating system, such as ‘Heating water volume m³’, ‘Return temperature °C’, and ‘Flow temperature °C’) across all four weather stations reveals notable variations in the relationships between features and the target variable, Heating. As an illustration, ‘Water volume m³’ shows the most significant correlations throughout all stations, which proves it is the main factor determining energy consumption for office space heating. In contrast, attributes like ‘Hour’ and ‘Flow temperature °C’ are generally low correlations, implying a weaker association between heating demand and these features. The most apparent way correlations differ across weather stations is that there are prominent differences in correlation for some features among different weather stations. For instance, at the BWS, HDH15 emerges as the second-strongest correlation, while at the 1420 weather station, ‘humidity temperature’ occupies that place.

Similarly, the depiction of correlation in Table 3 for Feature Set 2 (which consists of environmental and temporal factors like humidity, temperature, dew point, wind speed, and time indicators such as season, month, and hour) across all four weather stations shows individual relations among attributes and the objective variable Heating. However, HDH15 seems to have robust correlations with all stations over the years, thus highlighting their effect on how heating demand is distributed. However, the ‘Hour’ and ‘Day’ variables are most weakly correlated within the entire group, representing the lowest degrees of relationship with the heating demand. Also, the associations among the parameters are slightly different for different weather stations. For example, ‘humidity temperature’ as a component for SHC yields the highest correlation value at the 1420 station, but ‘BWS global radiation’ shows the highest correlation at the BWS. These variations point out that such variables should be considered when forecasting the heating usage by a station and developing models for it.

3. Methodology

The methodology involves collecting weather data from multiple weather stations, which is then stored in a centralized database for processing. The data undergoes feature extraction, where it is categorized into technical (Feature Set 1) and non-technical (Feature Set 2) features. Feature selection is then performed to refine these sets for analysis. The selected features are further divided into three categories: a subset of 3 features, a subset of 7 features, and the entire feature set. These feature sets are subsequently used to model heating demand, with the performance of the models evaluated using Mean Squared Error (MSE), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and R-Squared metrics to determine accuracy and reliability. The proposed methodology is shown in Figure 2.

3.1. Preprocessing

The preprocessing starts by separating numeric columns, excluding ‘Year’, ‘Month’, and ‘Hour’, from non-numeric ones. Missing values are hierarchically imputed using group means, prioritized by ‘Year’, ‘Month’, and ‘Hour’. The remaining missing values are then filled with group means based on ‘Year’ and ‘Month’, followed by ‘Year’ and ‘Season’. Finally, any remaining missing values are imputed with the annual average. Categorical variables, including ‘Season’ and ‘Weekday’, are transformed into numerical format through label encoding. Moreover, hyperparameters used for each model were carefully selected through grid search and cross-validation techniques.

3.2. Modelling

The study assesses a range of models like KNN, SVR, DT, Linear Regression, XGBoost, RF, GB, AdaBoost, LSTM, and GRU to establish the significance of using localized weather data on the accuracy of predictive models for SHC. Hyperparameter tuning was conducted using grid search with cross-validation to ensure optimal performance of each model. Variations in hyperparameter tuning, such as learning rate and max depth, significantly impact the predictive accuracy of ensemble models. A lower learning rate allows for more gradual learning, reducing the risk of overfitting but requiring longer training times. In contrast, a higher learning rate speeds up convergence but may lead to suboptimal solutions. Similarly, increasing max depth enhances the model’s ability to capture complex patterns but can also increase the risk of overfitting. Details of these models are listed in Table 4.

The chosen building for this study, the European Central Bank Headquarters, was selected due to its availability of high-quality, detailed weather and energy consumption data. This building also represents a complex urban structure with diverse heating requirements, making it a suitable case study for testing the robustness of the models. Experiments on other buildings were not carried out due to the unavailability of similarly detailed datasets that include localized weather conditions and operational parameters. However, the methodology is designed to be generalizable and can be applied to other buildings with appropriate data availability.

3.3. Evaluation Metrics

In this research, various evaluation metrics are used to evaluate the developed models for measuring the SHC. The metrics are Mean Squared Error (MSE), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and R-squared (R²).

Mean Squared Error (MSE): MSE measures the average of the squares of the errors between actual (

y_{i}

) and predicted (

\hat{y_{i}}

) values [46]. This study quantifies the average squared difference between the observed and predicted SHC values, providing insight into the overall accuracy of the predictive model. MSE is particularly important for penalizing more significant errors more heavily, making it a critical metric when large prediction deviations are undesirable.

M S E : \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \hat{y_{i}})}^{2}

(2)

Mean Absolute Error (MAE): MAE calculates the average of the absolute differences between actual (

y_{i}

) and predicted (

\hat{y_{i}}

) values. It measures the average magnitude of prediction errors without considering their direction [47]. In the context of this study, MAE evaluates the average magnitude of errors in predicting SHC, offering a straightforward measure of model performance. Unlike MSE, MAE treats all errors equally, making it more interpretable and suitable for understanding the typical prediction error.

M A E : \frac{1}{n} \sum_{i = 1}^{n} {|y_{i} - \hat{y_{i}}|}^{2}

(3)

Root Mean Squared Error (RMSE): RMSE is the square root of the average of the squared differences between actual (

y_{i}

) and predicted (

\hat{y_{i}}

) values [48]. It measures the standard deviation of the prediction errors and is interpretable in the same units as the target variable. In this study, RMSE assesses the typical error magnitude of the predictive model in predicting SHC. RMSE is crucial when the scale of prediction errors needs to be expressed in the same unit as the target variable, making it easier to contextualize the error magnitude. It is also more sensitive to outliers than MAE, which may be advantageous in specific scenarios.

R M S E : \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \hat{y_{i}})}^{2}}

(4)

R-Squared (R²): R², also known as the coefficient of determination, measures the proportion of the variance in the dependent variable (y) that is explained by the independent variable(s) (

\bar{y}

) [49]. It ranges from 0 to 1, where 1 indicates that the model perfectly predicts the dependent variable based on the independent variables, and 0 indicates that the model does not explain any variability in the dependent variable. In this study, R² assesses the goodness of fit of the predictive model to the observed SHC data, indicating how well the model captures the variability in the target variable based on the features used for prediction. R² is particularly important for evaluating the model’s explanatory power and understanding how well the model generalizes to unseen data. It provides an intuitive measure of model effectiveness by comparing explained variance to total variance.

R^{2} = 1 - \frac{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \hat{y_{i}})}^{2}}{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(5)

Loss Function: The loss function was constructed using standard regression metrics, including Mean Squared Error (MSE) as the primary optimization criterion. MSE was chosen due to its sensitivity to large errors, ensuring that the model minimizes significant deviations in predictions. Additionally, we monitored Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) to assess model performance comprehensively. These loss functions were optimized using the gradient boosting framework of XGBoost, which iteratively reduces errors by adjusting model weights based on residuals from previous iterations.

3.4. Experimental Setup

Experiments are conducted using common Python libraries for data preprocessing, model development, and evaluation. Pandas is utilized for data manipulation and analysis, while NumPy supports numerical computing and array operations. Scikit-learn provides a range of algorithms for ML tasks, including model training, evaluation, and preprocessing. DL models are constructed using TensorFlow and PyTorch, which offer high-level APIs for building neural networks and optimizing performance. The matplotlib library facilitates data visualization to track dataset characteristics and model performance. Random seed values are set to ensure reproducibility across experiments. The dataset is divided into training and testing sets with an 80–20 split, where 80% of the data are for training and the rest 20% are for testing. In addition, a K-fold cross-validation with K = 5 is applied during the training phase to minimize the risk of overfitting and verify the models’ generalization capacity.

Several experiments are conducted based on the systematic division of the feature sets. The details of these feature sets are shown in Table 5.

4. Results

We tested the performance of the different predictive models built from various feature sets. Each model was evaluated considering MSE, MAE, RMSE, and R2 to assess its predictive accuracy.

4.1. Feature Set 1

The evaluation of predictive models using Feature Set 1 consists of a comprehensive array of operational and environmental features crucial for understanding SHC patterns. Various combinations of Feature Set 1 are utilized for model evaluation, with feature divisions based on correlations with the target variable. The main aim is to identify the most influential features and assess their impact on model performance in predicting SHC across different weather stations.

Across various weather stations, the XGBoost ensemble learning model consistently outperforms other ML and DL models in predicting SHC, as shown in Table 6. For weather station 1420, XGBoost excels with an MSE of 2.5, MAE of 0.085, RMSE of 15.7, and an R² value of 97.6 in Feature Set 1 with only 3 features. These results indicate that even with a minimal feature set, XGBoost effectively captures the underlying data patterns, making it a suitable choice for scenarios with limited data availability. It maintains its lead with excellent metric values in the 7-feature and all-feature sets, underscoring its accuracy and the strength of ensemble learning techniques. This suggests that adding more features enhances the model’s ability to generalize, further improving prediction performance. This trend continues for weather station 1424, where XGBoost achieves the lowest MSE of 2.5175, MAE of 8.4946, RMSE of 15.8668, and an R² value of 97.5322 in the 3-feature set. The model’s ability to perform well with different feature subsets demonstrates its adaptability and robustness in various climatic conditions. Consistent performance across different feature sets at this station further highlights XGBoost’s ability to capture complex data relationships.

The robustness of XGBoost is also evident at weather stations 7341 and the BWS, as shown in Table 7. For station 7341, XGBoost achieves an MSE of 2.906, MAE of 8.875, RMSE of 17.0471, and an R² value of 97.1514 in the 3-feature set. These findings reinforce the effectiveness of gradient boosting in refining predictions by iteratively reducing errors. It also achieves lower MSE, MAE, and RMSE values and higher R² values in the larger feature sets. Similarly, at the BWS, XGBoost demonstrates superior accuracy with an MSE of 2.1923, MAE of 8.0211, RMSE of 14.8064, and an R² value of 97.851 in the 3-feature set. These results indicate that XGBoost effectively learns from different environmental conditions, making it a viable option for diverse geographical locations. Its consistent top performance across all feature sets at this station highlights its effectiveness in predicting SHC based on local weather data. Such reliability across multiple weather stations confirms that XGBoost is a robust and scalable model for SHC prediction, offering practical applications for real-world energy forecasting scenarios.

Discussion Feature Set 1

The XGBoost model outperforms all other models within both divisions of 3 Features, 7 Features, and All Features datasets, as the findings from Feature Set 1 indicate. XGBoost is the algorithm that has the best MSE, MAE, RMSE, and R² values. This consistent performance underscores the strength of ensemble learning techniques, particularly XGBoost, in effectively capturing the nonlinear relationships between features and SHC. Unlike other models, XGBoost leverages its gradient boosting framework to iteratively refine weak learners, reducing bias and variance effectively. Using gradient-boosting algorithms enables better handling of complex interactions and outliers, contributing to its superior predictive accuracy compared to other models. Among all the models considered, the XGBoost model is consistently better than KNN, SVR, DT, Linear Regression, RF, GB, and AdaBoost, as well as Deep Learning models such as LSTM and GRU, which exhibit lower predictive performance in this study. The lower performance of LSTM and GRU may be attributed to the relatively small dataset size and the absence of sequential dependencies in the feature set, which limits the advantages of recurrent architectures. Irrespective of the dataset type and weather station type, XGBoost demonstrates superior accuracy.

When comparing the results achieved by considering different weather stations, BSW has the smallest MSE, MAE, and RMSE in all models and feature divisions among the other weather stations, thus supporting the better predictive accuracy resulting from this station. This means that the environment and the building characteristics at the BWS may be the ones that are more suitable for the accurate prediction of SHC at the ECB. The superior performance at the BSW suggests that this station’s localized weather conditions and building characteristics are more representative of the factors influencing SHC at the ECB. This finding emphasizes the importance of selecting relevant weather stations for predictive modeling and highlights the potential for station-specific optimization in future applications.

Regarding the division of features, the All Features set is the most consistent in producing less MSE, MAE, and RMSE for each weather station and model, implying better predictive ability compared to the 3 Features and 7 Features sets. This superior performance is attributed to XGBoost’s ability to effectively handle high-dimensional data and select the most relevant features through its built-in feature importance mechanism. Thus, combining operational and environmental features and behavioral patterns results in accurate model performance in predicting the amount of Heating consumed. Including a diverse combination of operational and environmental features, along with behavioral patterns, enhances the model’s ability to generalize and capture key predictors of SHC. This result justifies using comprehensive feature sets to improve the reliability and accuracy of predictions.

4.2. Feature Set 2

In this section, we present the results of Feature Set 2, where different combinations of features were evaluated to investigate their impact on predictive model performance. Like Feature Set 1, Feature Set 2 was also constructed based on correlations among weather parameters and their potential influence on SHC.

XGBoost consistently emerges as the top performer in predictive accuracy across various feature divisions, as shown in Table 8. In the 3-Features division, XGBoost achieves the lowest MSE of 28.7, MAE of 35.7, RMSE of 53.6, and an R² value of 71.8. This suggests that even a limited subset of features contributes significantly to accurate predictions, demonstrating the model’s efficiency in handling high-impact variables. Similarly, in the 7-Features and All Features divisions, XGBoost consistently maintains its lead with the lowest MSE, MAE, and RMSE values and the highest R² values. These observations highlight that adding more features enhances model performance but does not drastically alter the ranking of XGBoost as the best performer. These results demonstrate XGBoost’s robustness in effectively leveraging feature correlations to improve predictions, irrespective of the feature division. Its gradient-boosting mechanism allows it to capture complex relationships, providing a clear justification for its superior performance across all datasets. This ability to integrate non-linear dependencies among features further validates its suitability for SHC forecasting. This consistent performance highlights the robustness of XGBoost in leveraging different feature divisions to accurately predict SHC for weather station 1420. For weather station 1424, XGBoost is the top-performing model in predictive accuracy, as shown in Table 8. The strong correlation between predicted and actual values suggests that the model efficiently learns from historical data, making it a practical choice for real-world deployment. This reliable performance underscores XGBoost’s suitability in utilizing various feature divisions to accurately forecast SHC for weather station 1424.

Similarly, for weather station 7341, XGBoost consistently demonstrates superior performance across feature divisions within Feature Set 2, as shown in Table 9. In the 3-Features division, XGBoost achieves an MSE of 37.1, MAE of 41.8, RMSE of 60.9, and an R² value of 63.6. While the R² value is slightly lower in this case, it still indicates a strong relationship between predicted and actual SHC values, confirming the model’s dependability. In the 7-Features and All Features divisions, XGBoost excels with low MSE, MAE, and RMSE and high R² values. This further emphasizes the importance of incorporating a well-balanced feature set to enhance predictive accuracy. This performance emphasizes XGBoost’s capability to employ different feature divisions to precisely predict SHC for weather station 7341. Likewise, XGBoost consistently emerges as the top-performing model regarding predictive accuracy for the BWS, as shown in Table 9. In the 3-Features division, XGBoost achieves an MSE of 27.6, MAE of 35.3, RMSE of 52.5, and an R² value of 73. These values suggest that even with fewer features, XGBoost maintains a high level of accuracy, making it a computationally efficient option. In the 7-Features and All Features divisions, XGBoost maintains its lead with notably low MSE, MAE, and RMSE values, and high R² values as shown in Table 9. The findings indicate that XGBoost’s ensemble learning mechanism is highly effective in capturing SHC patterns, making it an optimal choice for energy demand forecasting in varying conditions.

Discussion Feature Set 2

In Feature Set 2, we find that XGBoost is the highest-scoring model for weather stations and other features such as temperature, humidity, and pressure. This superior performance is primarily due to XGBoost’s ability to handle missing data effectively, optimize decision trees through gradient boosting, and assign different weights to features, thereby improving model robustness. Unlike traditional machine learning models that rely on a fixed set of assumptions, XGBoost dynamically adjusts to patterns in the data, capturing complex dependencies and nonlinear relationships more effectively. In comparison, deep learning models such as LSTM and GRU exhibit lower predictive performance in this study. Their comparatively weaker results may be attributed to the limited dataset size and the absence of sequential dependencies in the features, which reduces the effectiveness of recurrent architectures for this particular task. This leads us to conclude that XGBoost is very useful in considering all the information given by Feature Set 2, which results in highly accurate predictions of SHC at different weather stations.

The best-performing weather station varies depending on the feature division. However, the BWS tends to exhibit the best results across different feature divisions. This indicates that the BWS has more representative or informative data for predicting SHC than other weather stations in the dataset.

The feature division aspect of XGBoost is comparable to other methods, consistently performing exceptionally well across all divisions. Notably, the ‘All Features’ division is the most effective, indicating that the comprehensive set of features in Feature Set 2 yields more precise predictions for SHC than the subsets of 3 or 7 features. Training a model using all available features likely enriches the data, providing a deeper understanding and more accurate modeling of complex heating behaviors.

To minimize the risk of overfitting, cross-validation techniques were rigorously applied during model training. We employed k-fold cross-validation to ensure that models generalized well to unseen data and to prevent excessive reliance on any particular subset of the dataset. Furthermore, XGBoost’s built-in regularization mechanisms, such as L1 and L2 penalties, help control model complexity, reducing overfitting risks while maintaining high predictive accuracy. The robustness of XGBoost across different feature sets and weather stations further supports the model’s ability to generalize well beyond the training data.

4.3. Comparison

Different studies have explored various machine learning techniques for heat load prediction, each utilizing distinct feature sets and methodologies to enhance forecasting accuracy. The MLR-ANN model [50] incorporates historical and current outdoor temperature, wind speed, solar radiation, and seasonal information, achieving a strong RMSE of 82 with an impressive R² of 98.2. However, the absence of reported MSE and MAE values limits a comprehensive evaluation of its performance. Similarly, the Bi-LSTM model [51] leverages both past and future weather information, demonstrating reasonable accuracy with an MAE of 14 and an RMSE of 19. Despite these promising results, the lack of an R² value makes it difficult to assess its overall fit compared to other models.

The Parallel Convolutional Neural Network–Long Short-Term Memory Attention (PCLA) model [52] enhances heat load forecasting by integrating spatial and temporal features using district heater-related variables, weather forecasts, and time factors. This approach results in a lower MSE of 66.2, an MAE of 57.1, and an R² of 94.2, demonstrating strong predictive capabilities. However, this study’s XGBoost model surpasses all previous techniques, achieving the lowest MSE (3), MAE (1.8), and RMSE (5.4), along with the highest R² (99.7). The comparison of this study with existing studies is shown in Table 10.

4.4. Deployment Considerations and Policy Implications

For real-world applications, the deployment of predictive models like XGBoost for SHC forecasting can be integrated into Building Management Systems (BMS) to enhance energy efficiency. The models can be deployed as part of an automated control system that continuously monitors operational and environmental variables, optimizing heating consumption in response to real-time data. Deployment can be achieved through cloud-based solutions or on-premises implementations, where the trained models are embedded in BMS software to provide real-time predictive insights. Additionally, an API-based integration could facilitate seamless communication between the predictive model and existing building automation platforms, ensuring adaptability to different infrastructure settings.

This study serves as a framework for EU institutions to enhance their understanding of building heating demands and develop more efficient, data-driven energy management strategies. Leveraging ML-based heating predictions enables policymakers to create adaptive and resource-efficient in-house solutions that align with EU energy regulations, including the Energy Performance of Buildings Directive (EPBD) and the EU Green Deal’s carbon neutrality goals. Integrating predictive modeling into energy policies facilitates real-time monitoring, proactive energy adjustments, and optimized resource utilization, leading to reduced carbon footprints and the promotion of sustainable energy practices.

5. Limitations of the Study and Models

Despite the strong predictive performance of XGBoost and other models evaluated in this study, several limitations must be acknowledged. First, the generalizability of the models is constrained by the specific dataset used, which is derived from the European Central Bank Headquarters. While the models demonstrated high accuracy within this setting, their effectiveness in other buildings with different structural characteristics, occupancy patterns, and heating systems remains uncertain. Second, although weather data from multiple stations were considered, variations in local microclimates and unaccounted environmental factors may impact the accuracy of predictions. Additionally, while XGBoost outperformed other models, its computational complexity and higher training time compared to simpler models such as Decision Trees or Linear Regression could be a limiting factor in real-time applications or scenarios with limited computational resources. Moreover, deep learning models like LSTM and GRU, despite their ability to capture temporal dependencies, exhibited lower performance due to limited historical data, indicating the need for larger datasets to leverage their full potential. Finally, the study primarily focused on supervised learning approaches, leaving room for future research to explore hybrid models incorporating unsupervised learning techniques for feature extraction or reinforcement learning for adaptive energy management strategies. Addressing these limitations through expanded datasets, feature engineering, and model optimization can further enhance the robustness and applicability of SHC prediction models.

6. Conclusions

This study employed various combinations of feature sets to investigate their impact on the accuracy of predictive models for SHC across different weather stations. We identified XGBoost as the top-performing model consistently across all feature divisions and weather stations through comprehensive evaluations. XGBoost demonstrated superior predictive accuracy, effectively utilizing the comprehensive information in Feature Set 2 to enhance prediction performance. For instance, at the BWS, XGBoost achieved an MSE of 2.1923, MAE of 8.0211, RMSE of 14.8064, and an R² value of 97.851 in the 3-feature set, showcasing its robustness. Similarly, at weather station 1420, XGBoost excelled with an MSE of 2.5, MAE of 8.5, RMSE of 15.7, and an R² value of 97.6. Our analysis revealed that the BWS generally exhibited the best results, indicating its potential for providing more representative data for SHC prediction. Moreover, the All Features division consistently outperformed the subsets of features (3 or 7), emphasizing the importance of utilizing comprehensive feature sets to capture intricate relationships within the data. For example, across all weather stations, the All Features division consistently resulted in lower error metrics, with XGBoost achieving its highest accuracy levels.

This study focuses on a single building due to data availability, but the methodology is designed to be generalizable to other office buildings with similar heating demand characteristics. The selected building, consisting of two interconnected skyscrapers, presents a unique case with complex heating dynamics influenced by its architectural design and urban environment. The proposed approach is highly scalable, as it can be implemented in other buildings with access to granular heating consumption data from Building Management Systems (BMS). These systems collect detailed operational data, which, when combined with weather data obtained from an open-source website, enables easy adaptation to various buildings and locations. However, the accuracy of the model relies heavily on the quality and granularity of the data, highlighting the importance of robust data management practices for effective implementation and scalability across different settings.

In future work, we will delve deeper into feature engineering to identify additional variables that could enhance predictive performance. Exploring advanced ensemble techniques or DL architectures explicitly tailored for the SHC prediction task could also be beneficial. Additionally, an analysis of computational efficiency, including training times and model complexity trade-offs, will be conducted to assess real-world implementation feasibility. Furthermore, incorporating external factors such as building characteristics, occupancy patterns, or socioeconomic factors could improve the robustness and generalizability of the predictive models. An extended evaluation of model interpretability using SHAP (Shapley Additive Explanations) values will also be explored to identify the most influential predictors, ensuring better transparency and trust in the predictions. Overall, continued research in this domain holds the potential to refine predictive models and contribute to more efficient energy management strategies in office building settings.

Moreover, this study’s findings can extend to residential buildings by adjusting feature selection and model training to account for occupancy patterns, insulation levels, and heating system variations. Residential heating consumption is more dynamic due to diverse user behaviors and seasonal changes. XGBoost’s ability to capture complex relationships between weather, operations, and heating suggests its potential for residential heating forecasts. Integrating smart meters and IoT-based monitoring can further enhance energy efficiency, helping homeowners and policymakers reduce costs and carbon footprints. Future work can validate the model’s adaptability using diverse residential datasets.

This study underscores organizations’ ability to develop in-house energy management solutions, enabling them to autonomously meet their energy responsibilities. By aligning with the EPB European Directive’s standards for energy performance in buildings, our approach bridges a critical gap in research, highlighting the practical application of advanced predictive models in enhancing energy efficiency. The insights gained from this study can guide organizations in leveraging their resources to meet stringent energy efficiency measures, ultimately contributing to sustainable energy management and compliance with European standards.

Author Contributions

Conceptualization, F.A.; methodology, F.A.; software, F.A.; validation, M.C. and N.C.-R.; formal analysis, F.A.; investigation, F.A.; resources, F.A.; data curation, F.A.; writing—original draft preparation, F.A.; writing—review and editing, M.C. and N.C.-R.; visualization, F.A.; supervision, M.C. and N.C.-R.; project administration, F.A.; funding acquisition, M.C. and N.C.-R. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by national funds through FCT (Fundação para a Ciência e a Tecnologia), under the project—UIDB/04152/2020 (DOI: 10.54499/UIDB/04152/2020)—Centro de Investigação em Gestão de Informação (MagIC)/NOVA IMS).

Data Availability Statement

Restrictions apply to the availability of these data. The data were obtained from the European Central Bank and are not publicly available. Access to these data is subject to the ECB’s approval. Interested researchers should contact the corresponding author, who will facilitate the request process with the ECB.

Conflicts of Interest

The authors have no competing interests to declare that are relevant to the content of this article.

References

EU. Council. Directive 2010/31/EU of the European Parliament and of the Council of 19 May 2010 on the Energy Performance of Buildings (Recast); European Commission: Brussels, Belgium, 2010. [Google Scholar]
Mirjalili, M.A.; Aslani, A.; Zahedi, R.; Soleimani, M. A comparative study of machine learning and deep learning methods for energy balance prediction in a hybrid building-renewable energy system. Sustain. Energy Res. 2023, 10, 8. [Google Scholar] [CrossRef]
Marín-García, D.; Bienvenido, D.H.; Nieto-Julián, E.; Campos, J.J.M.; Farinha, M.J.O.; Farinha, F. Analysis of the Regulations That Affect Energy Efficiency with Respect to Consumption of HVAC System for Residential Buildings in Southern Spain and Portugal; Springer: Cham, Switzerland, 2019. [Google Scholar]
Bienvenido-Huertas, D.; Sánchez-García, D.; Marín-García, D.; Rubio-Bellido, C. Analysing energy poverty in warm climate zones in Spain through artificial intelligence. J. Build. Eng. 2023, 68, 106116. [Google Scholar] [CrossRef]
Zhao, H.X.; Magoulès, F. A review on the prediction of building energy consumption. Renew. Sustain. Energy Rev. 2012, 16, 3586–3592. [Google Scholar] [CrossRef]
DOE. DOE 2. Available online: https://www.doe2.com/ (accessed on 22 May 2024).
E. Plus. Energy Plus. Available online: https://energyplus.net/ (accessed on 22 May 2024).
TRNSYS. Transient System Simulation Tool. Available online: https://www.trnsys.com/ (accessed on 22 May 2024).
Al-Homoud, M.S. Computer-aided building energy analysis techniques. Build. Environ. 2001, 36, 421–433. [Google Scholar] [CrossRef]
Yao, R.; Steemers, K. A method of formulating energy load profile for domestic buildings in the UK. Energy Build. 2005, 37, 663–671. [Google Scholar] [CrossRef]
Shi, C.; Zheng, J.; Wang, Y.; Gan, C.; Zhang, L.; Sheldon, B.W. Machine Learning-Driven Scattering Efficiency Prediction in Passive Daytime Radiative Cooling. Atmosphere 2025, 16, 95. [Google Scholar] [CrossRef]
Nuthakki, S.; Kulkarni, C.S.; Kathiriya, S.; Nuthakki, Y. Artificial Intelligence Applications in Natural Gas Industry: A Literature Review. Int. J. Eng. Adv. Technol. 2024, 13, 64–70. [Google Scholar] [CrossRef]
Muhamad, W.N.W.; Zain, M.Y.M.; Wahab, N.; Aziz, N.H.A.; Kadir, R.A. Energy Efficient Lighting System Design for Building. In Proceedings of the ISMS 2010—UKSim/AMSS 2010 International Conference on Intelligent Systems, Modelling and Simulation, Liverpool, UK, 27–29 January 2010; pp. 282–286. [Google Scholar] [CrossRef]
Yang, Y.; Hu, G.; Spanos, C.J. Stochastic Optimal Control of HVAC System for Energy-Efficient Buildings. IEEE Trans. Control Syst. Technol. 2022, 30, 376–383. [Google Scholar] [CrossRef]
Rocha, P.; Siddiqui, A.; Stadler, M. Improving energy efficiency via smart building energy management systems: A comparison with policy measures. Energy Build. 2015, 88, 203–213. [Google Scholar] [CrossRef]
Budler, L.C.; Gosak, L.; Stiglic, G. Review of artificial intelligence-based question-answering systems in healthcare. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2023, 13, e1487. [Google Scholar] [CrossRef]
Hartmann, T.; Moawad, A.; Schockaert, C.; Fouquet, F.; Le Traon, Y. Meta-Modelling Meta-Learning. In Proceedings of the 2019 ACM/IEEE 22nd International Conference on Model Driven Engineering Languages and Systems (MODELS), Munich, Germany, 15–20 September 2019; pp. 300–305. [Google Scholar] [CrossRef]
Mostafa, N.; Ramadan, H.S.M.; Elfarouk, O. Renewable energy management in smart grids by using big data analytics and machine learning. Mach. Learn. Appl. 2022, 9, 100363. [Google Scholar] [CrossRef]
Liu, X.; Ding, Y.; Tang, H.; Xiao, F. A data mining-based framework for the identification of daily electricity usage patterns and anomaly detection in building electricity consumption data. Energy Build. 2021, 231, 110601. [Google Scholar] [CrossRef]
Almuhaini, S.H.; Sultana, N. Forecasting Long-Term Electricity Consumption in Saudi Arabia Based on Statistical and Machine Learning Algorithms to Enhance Electric Power Supply Management. Energies 2023, 16, 2035. [Google Scholar] [CrossRef]
Behrang, M.A.; Assareh, E.; Assari, M.R.; Ghanbarzadeh, A. Using bees algorithm and artificial neural network to forecast world carbon dioxide emission. Energy Sources Part A Recovery Util. Environ. Eff. 2011, 33, 1747–1759. [Google Scholar] [CrossRef]
Nuthakki, S. Conversational AI and LLM’s Current and Future Impacts in Improving and Scaling Health Services. Int. J. Comput. Eng. Technol. 2023, 14, 149–155. [Google Scholar] [CrossRef]
Cui, X.; Zhu, J.; Jia, L.; Wang, J.; Wu, Y. A novel heat load prediction model of district heating system based on hybrid whale optimization algorithm (WOA) and CNN-LSTM with attention mechanism. Energy 2024, 312, 133536. [Google Scholar] [CrossRef]
Xue, P.; Jiang, Y.; Zhou, Z.; Chen, X.; Fang, X.; Liu, J. Multi-step ahead forecasting of heat load in district heating systems using machine learning algorithms. Energy 2019, 188, 116085. [Google Scholar] [CrossRef]
Li, X.; Yao, R. A machine-learning-based approach to predict residential annual space heating and cooling loads considering occupant behaviour. Energy 2020, 212, 118676. [Google Scholar] [CrossRef]
Jovanović, R.; Sretenović, A.A.; Živković, B.D. Ensemble of various neural networks for prediction of heating energy consumption. Energy Build. 2015, 94, 189–199. [Google Scholar] [CrossRef]
Yuan, T.; Zhu, N.; Shi, Y.; Chang, C.; Yang, K.; Ding, Y. Sample data selection method for improving the prediction accuracy of the heating energy consumption. Energy Build. 2018, 158, 234–243. [Google Scholar] [CrossRef]
Potočnik, P.; Škerl, P.; Govekar, E. Machine-learning-based multi-step heat demand forecasting in a district heating system. Energy Build. 2021, 233, 110673. [Google Scholar] [CrossRef]
Jang, J.; Han, J.; Leigh, S.B. Prediction of heating energy consumption with operation pattern variables for non-residential buildings using LSTM networks. Energy Build. 2022, 255, 111647. [Google Scholar] [CrossRef]
Dalipi, F.; Yildirim Yayilgan, S.; Gebremedhin, A. Data-Driven Machine-Learning Model in District Heating System for Heat Load Prediction: A Comparison Study. Appl. Comput. Intell. Soft Comput. 2016, 2016, 3403150. [Google Scholar] [CrossRef]
Moradzadeh, A.; Mansour-Saatloo, A.; Mohammadi-Ivatloo, B.; Anvari-Moghaddam, A. Performance evaluation of two machine learning techniques in heating and cooling loads forecasting of residential buildings. Appl. Sci. 2020, 10, 3829. [Google Scholar] [CrossRef]
Abdelkader, E.M.; Al-Sakkaf, A.; Ahmed, R. A comprehensive comparative analysis of machine learning models for predicting heating and cooling loads. Decis. Sci. Lett. 2020, 9, 409–420. [Google Scholar] [CrossRef]
Shen, Y.; Wei, R.; Xu, L. Energy consumption prediction of a greenhouse and optimization of daily average temperature. Energies 2018, 11, 65. [Google Scholar] [CrossRef]
Moradzadeh, A.; Mohammadi-Ivatloo, B.; Abapour, M.; Anvari-Moghaddam, A.; Roy, S.S. Heating and Cooling Loads Forecasting for Residential Buildings Based on Hybrid Machine Learning Applications: A Comprehensive Review and Comparative Analysis. IEEE Access 2022, 10, 2196–2215. [Google Scholar] [CrossRef]
Meng, Q.; Xi, Y.; Zhang, X.; Mourshed, M.; Hui, Y. Evaluating multiple parameters dependency of base temperature for heating degree-days in building energy prediction. Build. Simul. 2021, 14, 969–985. [Google Scholar] [CrossRef]
Guo, G.; Wang, H.; Bell, D.; Bi, Y.; Greer, K. KNN model-based approach in classification. In Proceedings of the on the Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE: OTM Confederated International Conferences, CoopIS, DOA, and ODBASE 2003, Catania, Sicily, Italy, 3–7 November 2003; pp. 986–996. [Google Scholar]
Zhang, F.; O’Donnell, L.J. Support Vector Regression; Academic Press: Cambridge, MA, USA, 2020. [Google Scholar]
Myles, A.J.; Feudale, R.N.; Liu, Y.; Woody, N.A.; Brown, S.D. An introduction to decision tree modeling. J. Chemom. A J. Chemom. Soc. 2004, 18, 275–285. [Google Scholar] [CrossRef]
Su, X.; Yan, X.; Tsai, C.L. Linear regression. Wiley Interdiscip. Rev. Comput. Stat. 2012, 4, 275–294. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Natekin, A.; Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. 2013, 7, 21. [Google Scholar] [CrossRef] [PubMed]
Solomatine, D.P.; Shrestha, D.L. AdaBoost. RT: A boosting algorithm for regression problems. In Proceedings of the IEEE International Joint Conference on Neural Networks, Budapest, Hungary, 25–29 July 2004; Volume 2, pp. 1163–1168. [Google Scholar] [CrossRef]
Gers, F.A.; Schmidhuber, J.; Cummins, F. Learning to forget: Continual prediction with LSTM. Neural Comput. 2000, 12, 2451–2471. [Google Scholar] [CrossRef] [PubMed]
Yao, K.; Cohn, T.; Vylomova, K.; Duh, K.; Dyer, C. Depth-Gated Recurrent Neural Networks. arXiv 2015, arXiv:1508.03790. [Google Scholar]
Alaraj, M.; Kumar, A.; Alsaidan, I.; Rizwan, M.; Jamil, M. Energy Production Forecasting from Solar Photovoltaic Plants Based on Meteorological Parameters for Qassim Region, Saudi Arabia. IEEE Access 2021, 9, 83241–83251. [Google Scholar] [CrossRef]
Dadhich, M.; Pahwa, M.S.; Jain, V.; Doshi, R. Predictive Models for Stock Market Index Using Stochastic Time Series ARIMA Modeling in Emerging Economy. In Advances in Mechanical Engineering; Springer: Berlin/Heidelberg, Germany, 2021; pp. 281–290. [Google Scholar] [CrossRef]
Shrivastava, S.; Bal, P.K.; Ashrit, R.; Sharma, K.; Lodh, A.; Mitra, A.K. Performance of NCUM global weather modeling system in predicting the extreme rainfall events over the central India during the Indian summer monsoon 2016. Model. Earth Syst. Environ. 2017, 3, 1409–1419. [Google Scholar] [CrossRef]
Brahimi, T. Using artificial intelligence to predict wind speed for energy application in Saudi Arabia. Energies 2019, 12, 4669. [Google Scholar] [CrossRef]
Hua, P.; Wang, H.; Xie, Z.; Lahdelma, R. District heating load patterns and short-term forecasting for buildings and city level. Energy 2024, 289, 129866. [Google Scholar] [CrossRef]
Cui, M. District heating load prediction algorithm based on bidirectional long short-term memory network model. Energy 2022, 254, 124283. [Google Scholar] [CrossRef]
Chung, W.H.; Gu, Y.H.; Yoo, S.J. District heater load forecasting based on machine learning and parallel CNN-LSTM attention. Energy 2022, 246, 123350. [Google Scholar] [CrossRef]

Figure 1. Trends of heating hours across weather stations in all seasons.

Figure 2. Proposed methodology.

Table 1. Overview of existing studies.

Ref.	Year	Region	Features	Technique (s)	Model (s)	Limitation	This Study
[24]	2019	China	Heat load, specific heat capacity of water, total mass flow rate of District Heating System, supply and return temperature at substations.	Machine Learning, Deep Learning, Ensemble Learning	Support Vector Regression, Deep Neural Network, Extreme Gradient Boosting	Further exploration is needed for application in model predictive control and DHS operation optimization.	XGBoost also performed well in our study, but its effectiveness varies based on data complexity.
[25]	2020	China	Exterior wall and window U-value, exterior window, inflation rate, % of time occupant stay at home, infiltration rate.	Machine Learning, Deep Learning	SVT, ANN	Suggested sample sizes for training and validation sets may not be universally applicable.	This study addresses feature variability by integrating dynamic environmental factors.
[26]	2015	Norway	Mean daily outside temperature, daily wind speed, total daily solar radiation, minimum daily temperature, maximum daily temperature, relative humidity, day of the week, month of the year, and SHC of the previous day.	Deep Learning	FFNN, RBFN, ANFIS	Complexity of the relationship between input variables and SHC.	This study used tree-based models for better interpretability of complex interactions.
[27]	2018	China	Meteorological parameters	Deep Learning	BPNN, MLR	Exclusion of significant factors like solar radiation due to limited practical engineering data.	GPR’s computational complexity limits its scalability; our approach prioritizes efficiency.
[28]	2021	Slovenia	Past aggregated statistics, currently available data sampled in 3 h, weather forecast, time, and seasonal cycle information.	Machine Learning	GPR	Need for systematic analysis to fine-tune model parameters for specific factors of different DH systems.	Ensemble models provide an alternative that balances accuracy and explainability.
[29]	2022	South Korea	Building environmental data, outdoor environmental data, patterns of energy consumption.	Deep Learning	LSTM	Need for more comprehensive data beyond essential sensor readings for improved model performance.	This study prioritizes real-time data over simulated inputs.
[30]	2016	Norway	Time of day, forward temperature, return temperature, flow rate, and heat load.	Machine Learning, Ensemble Learning	SVR, RF, PLS	Focus on specific meteorological parameters, potentially overlooking other influential factors like wind speed and humidity.	This study goes beyond seasonal variations by considering long-term operational trends.
[31]	2020	Greece	Relative compactness, surface area, wall area, roof area, overall height, heating area.	Machine Learning, Deep Learning	SVR, MLR	Reliance on simulated data may limit real-world applicability.
[32]	2020	-	Relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, and glazing area distribution.	Deep Learning	ANN, GRNN, RBNN	Reliance on simulated data and limited consideration of real-world factors may impact applicability.
[33]	2018	China	Actuator status data, environmental data, greenhouse real-time heating power.	Optimization Algorithms	PSO, GA	Limited applicability due to seasonal variations and complexity of energy exchange in greenhouses.
[34]	2022	Greece	Relative compactness, surface area, wall area, roof area, overall height, heating area	Machine Learning, Deep Learning	Hybrid model	Complexity and applicability to real-world data beyond energy consumption prediction.

Table 2. Correlation of Feature Set 1 (Operational + Environmental).

Feature	Correlation	Feature	Correlation	Feature	Correlation	Feature	Correlation
7341		1420		BWS		1424
Heating water volume m³	95.8874	Heating water volume m³	95.8874	Heating water volume m³	95.8874	Heating water volume m³	95.8874
Heating Degree Hours (HDH15)	77.1261	humidity temperature	78.8903	Heating Degree Hours (HDH15)	85.2723	Heating Degree Hours (HDH15)	80.6293
humidity temperature	74.1434	Heating Degree Hours (HDH15)	77.9527	air temperature	79.5549	wet bulb	78.6527
air temperature	72.5457	dew point	75.4706	relative humidity	39.2373	air temperature	75.2487
dew point	70.2496	air temperature	73.2	Return temperature °C	35.5769	dew point	71.782
vapor pressure	66.3501	vapor pressure	68.0134	Season	33.9704	vapor pressure	68.0187
absolute humidity	65.7995	absolute humidity	66.9944	global radiation	21.1192	absolute humidity	67.1705
Return temperature °C	35.5769	Visibility	43.3005	Brightness highest value	20.1686	Return temperature °C	35.5769
Season	33.9704	Return temperature °C	35.5769	Month	19.4003	Season	33.9704
visibility	30.4258	Season	33.9704	Year	8.9028	relative humidity	28.3693
relative humidity	22.5466	relative humidity	26.9666	wind speed	8.8434	Month	19.4003
Month	19.4003	sunshine duration	25.0405	amount precipitation	3.8948	Year	8.9028
wind speed	15.1184	Month	19.4003	air pressure	3.5429	precipitation yes/no	7.7676
precipitation yes/no	12.1546	precipitation yes/no	15.133	Weekday	2.7477	Weekday	2.7477
coverage clouds	10.2735	air pressure	10.9283	Day	2.6939	Day	2.6939
air pressure	9.9306	Year	8.9028	Flow temperature °C	1.9519	air pressure	2.3972
Year	8.9028	wind speed	5.6757	Hour	1.4639	Flow temperature °C	1.9519
highest wind peak	8.7917	wind direction	3.5276	precipitation yes/no	1.3629	Hour	1.4639
Weekday	2.7477	Weekday	2.7477	wind direction	1.1482	precipitation	0.4398
Day	2.6939	Day	2.6939	precipitation	0.9242
Flow temperature °C	1.9519	Flow temperature °C	1.9519
Hour	1.4639	Hour	1.4639
precipitation height	0.9622	Precipitation	0.925
wind direction	0.9327

Table 3. Correlation of Feature Set 2.

Feature	Correlation	Feature	Correlation	Feature	Correlation	Feature	Correlation
7341		1420		BWS		1424
Heating Degree Hours (HDH15)	77.1261	humidity temperature	78.8903	Heating Degree Hours (HDH15)	85.2723	Heating Degree Hours (HDH15)	80.6293
humidity temperature	74.1434	Heating Degree Hours (HDH15)	77.9527	air temperature	79.5549	wet bulb	78.6527
air temperature	72.5457	dew point	75.4706	relative humidity	39.2373	air temperature	75.2487
dew point	70.2496	air temperature	73.2	Season	33.9704	dew point	71.782
vapor pressure	66.3501	vapor pressure	68.0134	global radiation	21.1192	vapor pressure	68.0187
absolute humidity	65.7995	absolute humidity	66.9944	Brightness highest value	20.1686	absolute humidity	67.1705
Season	33.9704	visibility	43.3005	Month	19.4003	Season	33.9704
Visibility	30.4258	Season	33.9704	Year	8.9028	relative humidity	28.3693
relative humidity	22.5466	relative humidity	26.9666	wind speed	8.8434	Month	19.4003
Month	19.4003	sunshine duration	25.0405	amount precipitation	3.8948	Year	8.9028
wind speed	15.1184	Month	19.4003	air pressure	3.5429	precipitation yes/no	7.7676
precipitation yes/no	12.1546	precipitation yes/no	15.133	Weekday	2.7477	Weekday	2.7477
coverage clouds	10.2735	air pressure	10.9283	Day	2.6939	Day	2.6939
air pressure	9.9306	Year	8.9028	Hour	1.4639	air pressure	2.3972
Year	8.9028	wind speed	5.6757	precipitation yes/no	1.3629	Hour	1.4639
highest wind peak	8.7917	wind direction	3.5276	wind direction	1.1482	precipitation	0.4398
Weekday	2.7477	Weekday	2.7477	precipitation	0.9242
Day	2.6939	Day	2.6939
Hour	1.4639	Hour	1.4639
precipitation height	0.9622	precipitation	0.925
wind direction	0.9327

Table 4. Summary of models.

Method	Description	Advantages	Disadvantages	Peculiarities
Machine Learning
KNN [36]	Predicts based on the ‘k’ most similar data points	Simple and intuitive	Computationally intensive for large datasets	Uses local weather data; sensitive to the choice of k
SVR [37]	Finds the optimal hyperplane to minimize prediction errors	Handles non-linear relationships well	Sensitive to parameter selection and kernel choice	Suitable for non-linear effects between inputs and target
DT [38]	Recursively divides input space into smaller regions	Easy to interpret; captures non-linear relationships	Prone to overfitting	Provides insight into feature importance
Linear Regression [39]	Fits a linear equation to minimize differences between observed and predicted values	Easy to use, interpret, and implement	Limited to linear relationships	Simple and fast; serves as a baseline model
Ensemble Learning
XGBoost [40]	Ensemble method combining multiple decision trees	Handles complex non-linear relationships well; robust	Can be computationally expensive	Effective for large datasets; includes regularization
RF [41]	Combines predictions from multiple decision trees	Reduces overfitting; handles complex relationships	Less interpretable than single decision trees	Leverages diverse perspectives of individual trees
GB [42]	Iteratively trains decision trees to correct previous errors	High predictive accuracy	Prone to overfitting if not finely tuned	Optimizes a loss function; each tree corrects the residuals of the previous.
AdaBoost [43]	Sequentially trains weak learners, focusing on misclassified instances	Improves model performance iteratively	Sensitive to noisy data	Weights instances based on difficulty to predict correctly
Deep Learning
LSTM [44]	Recurrent neural network designed to handle long sequences and temporal dependencies	Suitable for sequential data; handles vanishing gradient problem	Requires significant computational resources	Includes 64 units in the LSTM layer; uses early stopping to prevent overfitting
GRU [45]	RNN variant that regulates information flow for modeling temporal dependencies	Simplified architecture compared to LSTM; less computationally intensive	Can still be computationally expensive	Uses 64 units in the GRU layer; early stopping is included to optimize performance

Table 5. Feature divisions for experiments.

Features	Feature Set 1	Feature Set 2
1420
3 Features	Heating water volume m³, Heating Degree Hours (HDH15), humidity temperature	Heating Degree Hours (HDH15), dew point, humidity temperature
7 Features	Heating water volume m³, air temperature, Heating Degree Hours (HDH15), dew point, absolute humidity, vapor pressure, humidity temperature	air temperature, Heating Degree Hours (HDH15), dew point, absolute humidity, vapor pressure, humidity temperature, visibility
All	All the features included in Table 2	All the features included in Table 3
1424
3 Features	Heating water volume m³, Heating Degree Hours (HDH15), wet bulb	air temperature, Heating Degree Hours (HDH15), wet bulb
7 Features	Heating water volume m³, air temperature, Heating Degree Hours (HDH15), absolute humidity, vapor pressure, dew point, wet bulb	Season, air temperature, Heating Degree Hours (HDH15), absolute humidity, vapor pressure, dew point, wet bulb
All	All the features included in Table 2	All the features included in Table 3
7341
3 Features	Heating water volume m³, Heating Degree Hours (HDH15), humidity temperature	air temperature, Heating Degree Hours (HDH15), humidity temperature
7 Features	Heating water volume m³, air temperature, Heating Degree Hours (HDH15), absolute humidity, vapor pressure, dew point, wet bulb	Season, air temperature, Heating Degree Hours (HDH15), absolute humidity, vapor pressure, humidity temperature, dew point
All	All the features included in Table 2	All the features included in Table 3
BWS
3 Features	Heating water volume m³, air temperature, Heating Degree Hours (HDH15)	air temperature, relative humidity, Heating Degree Hours (HDH15)
7 Features	Season, Return temperature °C, Heating water volume m³, air temperature, relative humidity, global radiation, Heating Degree Hours (HDH15)	Month, Season, air temperature, relative humidity, radiation, Brightness highest value, Heating Degree Hours (HDH15)
All	All the features included in Table 2	All the features included in Table 3

Table 6. Evaluation of models on 1420 and 1424 weather station dataset using Feature Set 1.

1420
3 Features					7 Features				All Features
Models	MSE	MAE	RMSE	R2	MSE	MAE	RMSE	R2	MSE	MAE	RMSE	R2
Machine Learning					Machine Learning				Machine Learning
KNN	3.06	9.22	17.49	97	3.08	9.47	17.5	96.9	3.06	9.2	17.4	97
SVR	3.67	11	19.17	96.4	4.08	11.02	20.1	96	3.67	11	19.1	96.4
DT	4.86	11.42	22.05	95.24	4.8	11.26	21.9	95	4.88	11.4	22.1	95.2
Linear Regression	6.81	14.37	26.1	93.32	6.67	13.89	25.8	93.4	6.81	14.3	26.1	93.3
Ensemble Learning					Ensemble Learning				Ensemble Learning
XGBoost	2.5	8.5	15.7	97.6	2.4	8.3	15.5	97.7	2.4	8.4	15.7	97.5
RF	2.8	8.9	16.6	97.3	2.5	8.5	15.9	97.5	2.7	8.8	16.6	97.2
GB	2.8	9.5	16.8	97.2	2.9	9.7	17	97.2	2.8	9.5	16.8	97.2
AdaBoost	7.8	20.8	27.9	92.4	8.7	22.3	29.5	91.5	9.1	22.8	30.1	91
Deep Learning					Deep Learning				Deep Learning
LSTM	3.2	9.5	17.9	96.9	3.2	10	18	96.7	9.3	20.1	30.6	90.7
GRU	3.3	10	18.3	96.7	30	96.1	17.4	97	8.1	18.5	28.5	92
1424
3 Features					7 Features				All Features
Machine Learning					Machine Learning				Machine Learning
KNN	3.1	9.1	17.5	97	3	9.3	17.3	97.1	0.7	3	8.3	99.3
SVR	3.5	10.6	18.6	96.6	3.8	10.8	19.5	96.3	1.6	6.6	12.7	98.4
DT	4.3	11.2	20.7	95.8	4.2	11	20.4	95.9	2.4	9.6	15.6	97.6
Linear Regression	6.5	13.9	25.6	93.6	6.5	13.7	25.6	93.6	5.9	13.3	24.2	94.3
Ensemble Learning					Ensemble Learning				Ensemble Learning
XGBoost	2.5	8.5	15.9	97.5	2.4	8.4	15.4	97.7	0.4	1.9	6.3	99.6
RF	2.6	8.8	16.2	97.4	2.5	8.6	15.8	97.5	0.4	2.6	6.4	99.6
GB	2.8	9.6	16.8	97.2	2.8	9.6	16.8	97.2	1.1	5.2	10.3	99
AdaBoost	11.3	26.8	33.6	88.9	11.3	27.6	33.6	89	10.2	27.9	31.9	89.6
Deep Learning					Deep Learning				Deep Learning
LSTM	3.2	9.5	17.9	96.9	3.2	10	18	96.7	9.3	20.2	30.6	90.8
GRU	3.3	10	18.3	96.7	30	96.1	17.4	97	7.3	17.7	27	92

Table 7. Evaluation of models on the 7341 and BWS weather station dataset using Feature Set 1.

7341
3 Features					7 Features				All Features
Models	MSE	MAE	RMSE	R2	MSE	MAE	RMSE	R2	MSE	MAE	RMSE	R2
Machine Learning					Machine Learning				Machine Learning
KNN	2.5	8.6	16	97.5	1.7	7.3	12.9	98.4	0.7	3.1	8.2	99.3
SVR	2.8	10	16.7	97.3	1.7	7	13.2	98.3	1.4	6.3	11.7	98.7
DT	3.6	10.4	18.8	96.5	2.1	6.7	14.4	98	3.1	10.9	17.7	96.9
Linear Regression	6.3	14.1	25	93.9	5.8	14	24	94.3	5.6	13.9	23.8	94.5
Ensemble Learning					Ensemble Learning				Ensemble Learning
XGBoost	2.2	8	14.8	97.9	1.1	4.9	10.6	98.9	0.3	1.8	5.4	99.7
RF	2.3	8.8	15.1	97.8	1.2	5.3	10.8	98.9	0.4	2.5	6.1	99.6
GB	2.5	8.8	15.9	97.5	1.7	7.2	13.1	98.3	1	5.1	10	99
AdaBoost	7.7	20.9	27.7	92.5	9	25.4	30.1	91.1	10.4	28.1	32.3	89.8
Deep Learning					Deep Learning				Deep Learning
LSTM	3.2	9.5	17.9	96.9	3.2	10	18	96.7	8.8	20	29.8	91.2
GRU	3.3	10	18.3	96.7	30	96.1	17.4	97	8	19.2	28.3	92.1
BWS
3 Features					7 Features				All Features
Machine Learning					Machine Learning				Machine Learning
KNN	3.3	9.5	18.1	96.8	3	9.3	17.3	97.1	0.7	3	8.1	99.4
SVR	4.1	11.2	20.2	96	3.8	10.8	19.5	96.3	1.6	6.6	12.7	98.4
DT	5.2	11.9	22.8	94.9	4.2	11	20.4	95.9	2.4	9.6	15.6	97.6
Linear Regression	3.3	9.5	18.1	96.8	3	9.3	17.3	97.1	0.7	3	8.1	99.4
Ensemble Learning					Ensemble Learning				Ensemble Learning
XGBoost	2.9	8.9	17	97.2	2.4	8.4	15.4	97.7	0.4	1.9	6.3	99.6
RF	3.1	9.9	17.6	97	2.5	8.6	15.8	97.5	0.4	2.6	6.4	99.6
GB	3.1	9.2	17.7	96.9	2.8	9.6	16.8	97.2	1.1	5.2	10.3	99
AdaBoost	9.1	22.2	30.2	91.1	9.9	23.8	31.4	90.3	10.2	27.1	32	89.9
Deep Learning					Deep Learning				Deep Learning
LSTM	3.2	9.5	17.9	96.9	3.2	10	18	96.7	9.9	21	31.5	90.2
GRU	3.3	10	18.3	96.7	30	96.1	17.4	97	7.4	18.1	27.3	92.6

Table 8. Evaluation of models on 1420 and 1424 weather station dataset using Feature Set 2.

1420
3 Features					7 Features				All Features
Models	MSE	MAE	RMSE	R2	MSE	MAE	RMSE	R2	MSE	MAE	RMSE	R2
Machine Learning					Machine Learning				Machine Learning
KNN	33.2	37.6	57.6	67.4	29.9	35.2	54.7	70.7	13.3	21.3	36.5	86.9
SVR	31.9	38.6	56.5	68.7	29.2	36	54	71.4	11.7	21.7	34.3	88.5
DT	40.4	40.9	63.6	60.4	49.4	43.4	70.3	51.6	10.2	18.4	32	90
Linear Regression	36.2	44.7	60.1	64.5	33.8	41.9	58.1	66.9	29	39.7	53.9	71.6
Ensemble Learning					Ensemble Learning				Ensemble Learning
XGBoost	28.7	35.7	53.6	71.8	26	33.5	51	74.5	4.6	14.1	21.4	95.5
RF	34.4	38	58.7	66.3	28.8	34.5	53.7	71.7	4.7	13.2	21.8	95.3
GB	30.3	37.9	55.1	70.3	27.7	36	52.6	72.9	9.1	20.6	30.1	91.1
AdaBoost	34.8	41.6	59	65.9	35.6	44.5	59.7	65.1	38.3	54.2	61.9	62.4
Deep Learning					Deep Learning				Deep Learning
LSTM	31	38.1	55	69.5	29.4	37.2	54.2	71.1	9.3	20.1	30.6	90.7
GRU	30.7	37.8	55	69.5	27.7	35.9	52.6	72.8	8.1	18.5	28.5	92
1424
3 Features					7 Features				All Features
Machine Learning					Machine Learning				Machine Learning
KNN	39	42.3	62.4	61.8	35.2	39.6	59.4	65.5	9.9	18.2	31.5	90.3
SVR	34.2	40.1	58.5	66.4	32.2	37.8	56.7	68.5	12.6	22.5	35.5	87.7
DT	44.5	44.6	66.7	56.4	52.5	46.6	72.5	48.5	9	16.9	29.9	91.2
Linear Regression	34.6	42.6	58.8	66.1	33.3	42.1	57.7	67.4	31.4	40.9	56	69.2
Ensemble Learning					Ensemble Learning				Ensemble Learning
XGBoost	34.2	40.4	58.5	66.5	31.5	38.1	56.1	69.1	4.4	13.4	20.9	95.7
RF	39.8	42.7	63	61	38.2	40.7	61.8	62.6	4.3	12.4	20.7	95.8
GB	33.5	40.4	57.9	67.2	30.8	38.6	55.5	69.8	8.4	19.5	29	91.8
AdaBoost	36.4	43.8	60.3	64.3	35.3	43.9	59.4	65.4	31.8	47.7	56.4	68.8
Deep Learning					Deep Learning				Deep Learning
LSTM	33	40.4	58	67	31.2	38.2	55.9	69.3	9.36	20.2	30.6	90.8
GRU	33.5	40.4	57.9	67	31.1	38.7	55.7	69.4	7.3	17.7	27.1	92.7

Table 9. Evaluation of models on the 7341 and BWS datasets using Feature Set 2.

7341
3 Features					7 Features				All Features
Models	MSE	MAE	RMSE	R2	MSE	MAE	RMSE	R2	MSE	MAE	RMSE	R2
Machine Learning					Machine Learning				Machine Learning
KNN	42.5	44	65.2	58.4	35.5	39.7	59.6	65.2	15.4	23.4	39.2	84.9
SVR	38	41.8	61.6	62.7	32.8	38.1	57.3	67.8	12.4	22.4	35.1	87.9
DT	48.6	46.3	69.7	52.3	52.4	46.3	72.4	48.6	11.4	19.1	33.7	88.8
Linear Regression	40.3	45.7	63.5	60.5	36.8	44.8	60.6	64	31.7	41.5	56.3	68.9
Ensemble Learning					Ensemble Learning				Ensemble Learning
XGBoost	37.1	41.8	60.9	63.6	31.9	38.1	56.5	68.7	4.7	14	21.7	95.4
RF	42.6	44	65.3	58.2	37.7	40.4	61.4	63	5	13.6	22.4	95.1
GB	36.8	42.4	60.7	63.9	32.1	39.4	56.7	68.5	9.2	20.7	30.3	91
AdaBoost	40.4	45.9	63.6	60.4	39.3	47.4	62.7	61.5	43.4	59.3	65.9	57.4
Deep Learning					Deep Learning				Deep Learning
LSTM	37.8	42.9	61.5	62.8	32.4	39.1	56.9	68.2	9.9	21	31.5	90.2
GRU	38.1	43.7	61.8	62.5	32.1	39.2	56.6	68.5	7.4	18.1	27.3	92.6
BWS
3 Features					7 Features				All Features
Machine Learning					Machine Learning				Machine Learning
KNN	31.1	36.9	55.7	69.5	22.4	29.4	47.3	78.1	11.2	19.6	33.5	89
SVR	27.3	35.4	52.3	73.2	22.2	29.7	47.2	78.2	11.1	20.7	33.3	89.1
DT	45.3	43.3	67.3	55.6	30.5	32	55.2	70.1	7.1	15	26.6	93.1
Linear Regression	27.7	36.7	52.6	72.8	26.7	36.8	51.7	73.8	25.3	36.1	50.3	75.2
Ensemble Learning					Ensemble Learning				Ensemble Learning
XGBoost	27.6	35.3	52.5	73	18	26.9	42.5	82.3	3.6	12.1	19	96.5
RF	32.4	37.6	57	68.2	17.4	25.5	41.7	83	3.5	11	18.6	96.6
GB	26.8	35.5	51.8	73.7	19.2	28.5	43.8	81.2	7.5	18.4	27.4	92.6
AdaBoost	28.1	37.7	53	72.4	26.1	38.1	51.1	74.4	30.7	47.8	55.4	69.9
Deep Learning					Deep Learning				Deep Learning
LSTM	27.2	35.7	52.2	73.2	23.6	32.3	48.6	76.7	8.89	20	29.8	91.2
GRU	27.8	36.4	52.8	72.6	21.3	30.3	46.2	79	8.05	19.2	28.3	92.1

Table 10. Comparison of this study with existing studies.

Ref.	Technique Name	Features	MSE	MAE	RMSE	R2
Hua et al. [50]	MLR-ANN	Historical and current outdoor temperature, wind speed, solar radiation, and season	-	-	82	98.2
Cui [51]	Bi-LSTM	Past and future weather information	-	14	19	-
Chung et al. [52]	Parallel Convolutional Neural Network–Long Short-Term Memory Attention (PCLA)	District heater-related variables, heat load-derived variables, weather forecasts, time factors	66.2	57.1	-	94.2
This Study	XGBoost	operational and environmental features	0.3	1.8	5.4	99.7

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Almeida, F.; Castelli, M.; Côrte-Real, N. Towards Sustainable Energy: Predictive Models for Space Heating Consumption at the European Central Bank. Environments 2025, 12, 131. https://doi.org/10.3390/environments12040131

AMA Style

Almeida F, Castelli M, Côrte-Real N. Towards Sustainable Energy: Predictive Models for Space Heating Consumption at the European Central Bank. Environments. 2025; 12(4):131. https://doi.org/10.3390/environments12040131

Chicago/Turabian Style

Almeida, Fernando, Mauro Castelli, and Nadine Côrte-Real. 2025. "Towards Sustainable Energy: Predictive Models for Space Heating Consumption at the European Central Bank" Environments 12, no. 4: 131. https://doi.org/10.3390/environments12040131

APA Style

Almeida, F., Castelli, M., & Côrte-Real, N. (2025). Towards Sustainable Energy: Predictive Models for Space Heating Consumption at the European Central Bank. Environments, 12(4), 131. https://doi.org/10.3390/environments12040131

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Towards Sustainable Energy: Predictive Models for Space Heating Consumption at the European Central Bank

Abstract

1. Introduction

2. Dataset

2.1. Heating Consumption Data

2.2. Weather Variables from the Building Weather Station (BWS)

2.3. Local Weather Stations

2.4. Trends

2.5. Feature Selection

2.6. Feature Selection Methodology

Feature Divisions

3. Methodology

3.1. Preprocessing

3.2. Modelling

3.3. Evaluation Metrics

3.4. Experimental Setup

4. Results

4.1. Feature Set 1

Discussion Feature Set 1

4.2. Feature Set 2

Discussion Feature Set 2

4.3. Comparison

4.4. Deployment Considerations and Policy Implications

5. Limitations of the Study and Models

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI