1. Introduction
The study of building energy demand has gained significant importance due to the growing focus on energy sustainability, especially following the European Directive on Energy Performance of Buildings (EPB) implementation. In Europe, buildings are responsible for 40% of total energy consumption and 36% of total CO
2 emissions [
1]. Accurate predicting building energy consumption is crucial for effective energy management, as it helps identify abnormal energy usage and diagnose potential causes, provided that sufficient historical data is available [
2]. However, conventional energy prediction methods often fall short due to their reliance on rigid assumptions and limited adaptability to dynamic energy consumption patterns.
Recently, there has been a shift from merely calculating energy consumption to analyzing the actual energy use of buildings [
3,
4]. This shift is driven by the complexity of building energy systems and behavior, which non-calibrated models fail to accurately predict, thus requiring real-time data analysis of energy use. Traditionally, estimating building energy use involves applying a model with known system structures, properties, and external variables (forward approach). These engineering methods utilize physical principles to calculate thermal dynamics and energy behavior at the building or sub-component level [
5]. Despite their theoretical accuracy, these physics-based models often struggle with scalability and require extensive domain expertise. Furthermore, their reliance on detailed building specifications, which may not always be available, limits their practical applicability in large-scale or heterogeneous building environments.
Various software tools, such as DOE-2 (e.g., version 2.2) [
6], EnergyPlus (e.g., version 9.6.0) [
7], and TRNSYS (e.g., version 18.02) [
8], have been developed for this purpose. However, these tools require detailed knowledge of numerous building parameters and behaviors, which are often unavailable. Consequently, simplified methods for predicting building energy use have been developed. For instance, the steady-state method using degree days was presented in [
9]. Yao and Steemers [
10] also introduced a simple method for formulating load profiles for U.K. domestic buildings using a thermal dynamic model to predict daily energy demand profiles for appliances, domestic hot water, and space heating. While these methods provide useful approximations, they often lack the flexibility to capture non-linear dependencies between variables and fail to adapt to evolving building occupancy patterns or climatic variations.
Organizational measures for energy efficiency encompass a variety of strategies aimed at reducing consumption while maximizing resource utilization [
11,
12]. Such methods include the installation of energy-efficient lighting systems [
13], improving HVAC for better performance [
14], and innovative building technologies [
15]. Although these measures significantly contribute to energy conservation, their effectiveness is often constrained by high initial costs, resistance to change, and the lack of real-time monitoring mechanisms. The absence of adaptive strategies also limits their ability to respond to fluctuating energy demands.
In recent years, the development of energy management, especially in space heating consumption (SHC), has been transformed by the introduction and implementation of Artificial Intelligence (AI) with Machine Learning (ML) [
16]. AI refers to imitating intelligence processes by computer systems, including learning, reasoning, and self-correction. At the same time, ML is a subset of AI that exclusively deals with developing algorithms that allow computers to learn, make predictions, or take actions based on data [
17]. These technologies, including models such as neural networks [
18], DTs [
19], and support vector machines [
20], are developed and used to optimize energy utilization, predict demand, patterns, and costs, and detect inefficiencies in various energy systems. For instance, neural networks can be trained to examine complicated energy data and give insights into energy usage patterns, which will help with making informed decisions regarding energy optimization strategies [
21]. However, while ML-based approaches provide substantial improvements in predictive accuracy, their performance is often model-dependent and sensitive to data quality [
22]. Many studies fail to incorporate diverse data sources, leading to biased predictions that do not generalize well across different buildings or climatic conditions. Furthermore, the black-box nature of some ML models raises concerns regarding interpretability, making it difficult for facility managers to trust and act on predictions. With the potential for real-time monitoring and adaptive control, Internet of Things (IoT) devices could help in dynamic adjustments that optimize energy consumption according to the situation. Similarly, predictive maintenance models can detect equipment failures and operational deterioration, enabling proactive interventions to be carried out before interruptions and the subsequent wastage of energy resources [
14].
While this study presents a novel approach by integrating multiple ML models and localized weather data for SHC prediction, it builds upon a growing body of research exploring data-driven energy forecasting methods. Numerous studies have leveraged ML and DL models for building energy demand estimation, yet many focus on single techniques or limited feature sets, restricting their applicability across different climatic and operational contexts. A comprehensive review of prior works is necessary to contextualize our study, highlight gaps in existing methodologies, and reinforce the need for a comparative ML framework in SHC forecasting.
With new developments in ML, it has been possible to achieve considerable progress in the past few years [
23]. For instance, Xue et al. [
24] proposed an ML-based framework that applies to multi-step-ahead district heating system load forecasting by testing SVR, deep neural network, and extreme gradient boosting (XGBoost) models. Furthermore, Li & Yao [
25] developed a system for the prediction of building heating as well as cooling loads that includes occupant behavior as a predictor variable and also considers five ML models. Additionally, Jovanović, Sretenović, and Živković [
26] investigate the prediction of heating energy consumption for a university campus, employing various artificial neural network architectures. However, challenges remain in refining predictive approaches due to the complex relationship between input variables and SHC, particularly in diverse campus settings.
On the other hand, Yuan et al. [
27] introduce a novel sample data selection method (SDSM) to enhance the prediction accuracies of Back Propagation Neural Network (BPNN) and Multi-layer Perceptron MLP models for heating energy consumption, demonstrating significant reductions in training and prediction errors for BPNN models. Moreover, Potočnik, Škerl, and Govekar [
28] present an ML-based approach for short-term heat demand forecasting in district heating systems, highlighting the superiority of Gaussian Process Regression (GPR) in achieving accurate forecasts for the most prominent Slovenian DH system. Jang et al. [
29] focus on enhancing the prediction accuracy of building heat consumption using LSTM models, showing improved performance when operation pattern data from non-residential buildings is incorporated. However, challenges remain, such as the limited applicability of SDSM to MLP models and the need for more comprehensive data to enhance model accuracy in diverse building scenarios.
ML models have been applied to predict building heat loads, optimize energy consumption, and enhance sustainability. Dalipi et al. [
30] introduce a supervised ML model for predicting heat load in a district heating system, evaluating SVR, Partial Least Square, and RF algorithms. This study focuses on different algorithms’ performance in a district heating context, highlighting their varying predictive capabilities. In another approach, Moradzadeh et al. [
31] propose a methodology for forecasting heating and cooling loads in residential buildings using MLP and SVR techniques. This study explicitly targets residential buildings and emphasizes the predictive accuracy of these models in that context. Abdelkader et al. [
32] address the need for energy-efficient buildings by comparing various ML models, finding that the radial basis neural network performs better. However, they identify limitations such as the focus on specific meteorological parameters and reliance on simulated data, which may impact the real-world applicability of these models.
In addressing the imperative for energy efficiency across diverse domains, Shen et al. [
33] emphasize optimizing energy usage in greenhouses to reduce production costs, employing mathematical modeling and algorithmic optimization. Similarly, Moradzadeh et al. [
34] contribute to advancing accurate building energy consumption prediction, focusing on residential buildings’ cooling and heating loads. Their novel hybrid model, Gaussian SVR, integrates the Group Method of Data Handling (GMDH) and SVR techniques, presenting promising results, albeit with complexities and applicability concerns. Furthermore, while Shen et al. achieve promising outcomes, challenges persist due to the intricacies of greenhouse energy exchange and seasonal variations, suggesting further research to expand the model’s scope to address summer cooling and optimize temperature settings throughout the week.
Building energy consumption prediction, particularly for heating loads, has been explored through various ML and optimization techniques such as SVR, neural networks, and hybrid models as shown in
Table 1. These studies have demonstrated promising results despite the absence of certain critical variables, indicating the potential for further advancements in energy forecasting methodologies. However, the exclusion of essential parameters suggests the need for more comprehensive models that incorporate a broader range of influential factors. Addressing these gaps, this study aims to enhance the precision and generalizability of SHC predictions at the European Central Bank headquarters by evaluating and comparing multiple ML models using data from multiple weather stations. The goal is to improve demand forecasting accuracy, support sustainable energy management, and contribute to global decarbonization efforts.
The contributions of this study are as follows:
We conducted a thorough comparison of various ML and DL models, including K-Nearest Neighbor (KNN), Support Vector Regression (SVR), Decision Tree (DT), Linear Regression (LR), XGBoost, Random Forest (RF), Gradient Boosting (GB), AdaBoost, Long-Short-Term-Memory (LSTM), and Gated Recurrent Unit (GRU), to determine the most accurate model for SHC prediction.
We incorporated data from four distinct weather stations, improving the generalizability of the findings and ensuring that the models were evaluated across diverse climatic conditions.
We utilized two comprehensive feature sets (Feature Set 1 and Feature Set 2) derived from detailed weather data, enhancing the robustness and depth of the analysis.
Furthermore, this study underscores the potential for organizations to develop in-house energy management solutions in compliance with the EPB European Directive, promoting energy efficiency and sustainability.
2. Dataset
The dataset includes operational and environmental aspects of SHC consumption at the European Central Bank (ECB) Headquarters in Frankfurt. This state-of-the-art building has multiple sensor networks as part of an advanced Building Control System, enabling granular analysis.
Data from the district heating network supplying the building was collected. It was complemented with data from several weather stations located around and on top of the building, selected based on proximity. This comprehensive dataset offers a rich resource for understanding the intricate interplay between environmental variables and heating demands.
These data are used for historical analysis to understand building heating demands and make high-level estimations for the future. However, adopting ML techniques would significantly enhance the accuracy and efficiency of these forecasts. ML can analyze complex patterns and relationships within the data that traditional methods might overlook, leading to more precise and reliable predictions. This comprehensive dataset, therefore, offers a rich resource for understanding the intricate interplay between environmental variables and heating demands, providing a solid foundation for advanced predictive modeling through ML.
2.1. Heating Consumption Data
SHC data give an overview of energy consumption within a building by providing a list of important parameters, including SHC value, volume, inflow water temperature, and return water temperature. These data were obtained from a district heating network and represent the actual heating demand required to heat the entire building area, which is actively used by real-world ventures. The heating load in the studied building is primarily used for space heating and hot water supply. The contribution of each component varies, with space heating being more dependent on weather conditions such as outdoor temperature and humidity, while hot water supply exhibits a more stable demand pattern influenced by occupancy schedules. The district heating system is the baseload supplier, giving the building foundation a stable and diversified heat provision.
This data were collected by a smart meter, which is fitted to the premises’ technical areas. This device, outfitted with different sensors, detects and accurately documents the heating usage. The data monitor is situated at the interface between the district heating network provider and the office building’s heating system. This data monitor is an integral part of the overall data acquisition process.
The Heating Degree Day (HDD) is a metric that can estimate the energy demand required to heat a building. It represents the number of degrees by which a day’s average temperature falls below a specific base temperature, the threshold below which buildings require heating [
35]. In this case, the 15 °C threshold has been selected based on expert knowledge, ensuring accuracy and relevance in measuring the building heating requirements. It is important to note that different climatic regions may employ different base temperatures to account for local variations in heating needs. Heating Degree Hours (HDH) is used in this study because it provides a more granular and precise measure of heating demand than HDD, particularly when evaluating hourly temperature data. The formula for calculating HDH is as follows:
where T
out is the hourly outside temperature, and HDH represents the hourly degree hours.
By using HDH, we can capture short-term variations in temperature that influence heating needs, leading to more accurate predictions of energy consumption for Heating.
2.2. Weather Variables from the Building Weather Station (BWS)
The weather variable dataset consists of various meteorological parameters for evaluating atmospheric conditions, including temperature, humidity, wind speed, etc. These factors reflect weather pattern dynamics that influence building operations and energy usage. This dataset, collected from a weather station atop the ECB skyscraper, provides localized insights. It has up-to-date weather information that applies to the immediate surroundings of the building. Positioning the weather station at such an elevation maximizes the gathering of appropriate data that echo the building’s atmosphere.
Connection with the Building Automation System (BAS) enhances the efficiency and availability of data. The weather station has a smooth link with the BAS, making data transmission and storage much easier. With this integration in place, the BAS control rooms are adequately equipped to provide the building managers and operators with real-time monitoring of weather conditions, thus enabling them to make informed decisions based on up-to-date weather information. Additionally, the BAS’s historical data analysis functionality allows retrospective weather trends and patterns. The BAS makes it possible to archive weather data over time, allowing for deep analysis and enabling key actors to find connections between weather variables and building performance metrics.
2.3. Local Weather Stations
The local weather data comprise a comprehensive archive of meteorological information collected from three local weather stations: Frankfurt Airport (station 1420), Frankfurt am Main–Westend (station 1424), and Offenbach Weather Park (station 7341). These stations are placed explicitly at varying distances from the main study building, each allowing for the specific study of localized weather conditions that may influence operations and energy management. This dataset is sourced from DWD (Deutscher Wetterdienst) and their climate data center, which is renowned as the most reliable and authoritative source for historical weather data. The DWD obtains a broader picture to understand weather phenomena and trends using weather stations covering diverse areas. The data acquisition process from the DWD’s climate data center is precise and includes careful extraction and compilation processes. Researchers access historical weather data from selected weather stations through the Deutscher Wetterdienst Climate Data Center on the DWD website.
2.4. Trends
Figure 1 illustrates the total heating degree hours (HDH15) categorized by season across four weather stations: the BWS, 7341, 1420, and 1424. Observing the data, the BWS records the highest total heating degree hours during the winter season, surpassing 7000, while 1424 exhibits the lowest total, with around 1420 heating degree hours. Conversely, as the graph indicates, the total heating degree hours exhibit a consistent pattern across all stations, with the winter season consistently registering the highest values and the summer season showing the lowest.
The second feature set covers various environmental factors relating to weather, time, and environmental conditions, vital to understanding the intricate interactions affecting SHC inside the building. This representation excludes direct measurements, in this case, ‘Heating water volume m³’, ‘Return temperature °C’, and ‘Flow temperature °C’, aimed at capturing the broader context of heating demand. For meteorological variables, it incorporates the following values: humidity, temperature, dew point, air temperatures, vapor pressure, absolute humidity, visibility, and wind speed and temporal indicators, such as season, month, weekday, day, and hour, it provides a complete framework that embraces the external factors that affect energy demand for Heating. These features are selected based on their well-established impact on heating demand, as demonstrated in the existing literature on building energy modeling. Moreover, introducing the smart meters’ data transmission helps to include features like precipitation yes/no, air pressure, wind direction, and year. Thus, the study becomes more diversified as it allows the examination of long-term trends and seasonal variations in heating demand.
This trend can be attributed to the fundamental principle that colder temperatures in winter necessitate increased heating to maintain indoor comfort levels. As a result, wintertime calls for more space SHC than other seasons and, hence, higher heating degree hours. On the other hand, in the summer period, less heat is required due to higher outside temperatures, reducing the number of heating degree hours.
Examining variations among the weather stations, the BWS records the highest total heating degree hours during winter seasons, indicating that it experiences colder temperatures or requires more heating resources than the other stations during the winter. On the other hand, local weather station 1424 frequently shows the lowest total heating degree hours, which may be due to milder temperatures or lower heating requirements.
2.5. Feature Selection
This study considers Feature Set 1 and Feature Set 2, offering unique insights into the factors influencing SHC and building energy efficiency. Feature Set 1 will cover the operational features, such as those relating to the heating system, that enable the analysis and improvement of the building’s performance. The measurements, like ‘Heating water volume m3’, ‘Return temperature °C’, and ‘Flow temperature °C’ are very informative. They provide the operating mode of the heating infrastructure for accurate monitoring and control. The selection of these features is based on their direct relevance to heating system performance, as supported by prior studies on energy consumption modeling in buildings. Besides that, one of the ways of enhancing the dataset accuracy is by using other indicators such as humidity, dew point, air temperature, vapor pressure, absolute humidity, visibility, relative humidity, sunshine duration, month, precipitation yes/no, air pressure, wind speed, wind direction, and day.
2.6. Feature Selection Methodology
These features are selected for their ability to capture both direct and indirect influences on SHC patterns within the building. First, the features of Feature Set 1 are the three dimensions, ‘Heating water volume m³’, ‘Return temperature [°C]’, and ‘Flow temperature [°C]’, as they are the parameters that determine the operation of the heating system in the building. Furthermore, Feature Set 2 has a vast collection of meteorological, temporal, and environmental variables like humidity levels, air temperature, and precipitation. These features are selected for their indirect but significant impact on heating demands, reflecting the ambient conditions and external factors that influence indoor temperature regulation and energy usage.
Feature Divisions
In this study, features were categorized into three groups: 3 main features, 7 main features, and all features, based on correlation analysis and model interpretability. This selection ensures better generalization, computational efficiency, and a more robust training process by focusing on highly correlated attributes to simplify model complexity. It is pertinent to mention that this study did not use Principal Component Analysis (PCA) or Recursive Feature Elimination (RFE) for feature selection, as our primary objective was to analyze the impact of high-dimensional feature sets on heating consumption prediction. Instead of reducing dimensionality through PCA, which transforms features into principal components, or using RFE, which iteratively removes less significant features, we relied on correlation-based feature selection. This approach allowed us to retain interpretable variables and assess their individual and combined influence on model performance.
The correlations between features and the target variable, Heating, vary across different weather stations due to each location’s unique environmental conditions and building characteristics. For instance, in Feature Set 1, the correlation between ‘Heating water volume m3’ and Heating may be higher for one weather station than others, reflecting the specific heating system dynamics in that building. Similarly, in Feature Set 2, the correlation between ‘humidity temperature’ and Heating may differ among weather stations, indicating variations in the influence of meteorological factors on heating demand.
Table 2 presents the correlation values for Feature Set 1, (which includes operational features related to the heating system, such as ‘Heating water volume m³’, ‘Return temperature °C’, and ‘Flow temperature °C’) across all four weather stations reveals notable variations in the relationships between features and the target variable, Heating. As an illustration, ‘Water volume m
3’ shows the most significant correlations throughout all stations, which proves it is the main factor determining energy consumption for office space heating. In contrast, attributes like ‘Hour’ and ‘Flow temperature °C’ are generally low correlations, implying a weaker association between heating demand and these features. The most apparent way correlations differ across weather stations is that there are prominent differences in correlation for some features among different weather stations. For instance, at the BWS, HDH15 emerges as the second-strongest correlation, while at the 1420 weather station, ‘humidity temperature’ occupies that place.
Similarly, the depiction of correlation in
Table 3 for Feature Set 2 (which consists of environmental and temporal factors like humidity, temperature, dew point, wind speed, and time indicators such as season, month, and hour) across all four weather stations shows individual relations among attributes and the objective variable Heating. However, HDH15 seems to have robust correlations with all stations over the years, thus highlighting their effect on how heating demand is distributed. However, the ‘Hour’ and ‘Day’ variables are most weakly correlated within the entire group, representing the lowest degrees of relationship with the heating demand. Also, the associations among the parameters are slightly different for different weather stations. For example, ‘humidity temperature’ as a component for SHC yields the highest correlation value at the 1420 station, but ‘BWS global radiation’ shows the highest correlation at the BWS. These variations point out that such variables should be considered when forecasting the heating usage by a station and developing models for it.
3. Methodology
The methodology involves collecting weather data from multiple weather stations, which is then stored in a centralized database for processing. The data undergoes feature extraction, where it is categorized into technical (Feature Set 1) and non-technical (Feature Set 2) features. Feature selection is then performed to refine these sets for analysis. The selected features are further divided into three categories: a subset of 3 features, a subset of 7 features, and the entire feature set. These feature sets are subsequently used to model heating demand, with the performance of the models evaluated using Mean Squared Error (MSE), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and R-Squared metrics to determine accuracy and reliability. The proposed methodology is shown in
Figure 2.
3.1. Preprocessing
The preprocessing starts by separating numeric columns, excluding ‘Year’, ‘Month’, and ‘Hour’, from non-numeric ones. Missing values are hierarchically imputed using group means, prioritized by ‘Year’, ‘Month’, and ‘Hour’. The remaining missing values are then filled with group means based on ‘Year’ and ‘Month’, followed by ‘Year’ and ‘Season’. Finally, any remaining missing values are imputed with the annual average. Categorical variables, including ‘Season’ and ‘Weekday’, are transformed into numerical format through label encoding. Moreover, hyperparameters used for each model were carefully selected through grid search and cross-validation techniques.
3.2. Modelling
The study assesses a range of models like KNN, SVR, DT, Linear Regression, XGBoost, RF, GB, AdaBoost, LSTM, and GRU to establish the significance of using localized weather data on the accuracy of predictive models for SHC. Hyperparameter tuning was conducted using grid search with cross-validation to ensure optimal performance of each model. Variations in hyperparameter tuning, such as learning rate and max depth, significantly impact the predictive accuracy of ensemble models. A lower learning rate allows for more gradual learning, reducing the risk of overfitting but requiring longer training times. In contrast, a higher learning rate speeds up convergence but may lead to suboptimal solutions. Similarly, increasing max depth enhances the model’s ability to capture complex patterns but can also increase the risk of overfitting. Details of these models are listed in
Table 4.
The chosen building for this study, the European Central Bank Headquarters, was selected due to its availability of high-quality, detailed weather and energy consumption data. This building also represents a complex urban structure with diverse heating requirements, making it a suitable case study for testing the robustness of the models. Experiments on other buildings were not carried out due to the unavailability of similarly detailed datasets that include localized weather conditions and operational parameters. However, the methodology is designed to be generalizable and can be applied to other buildings with appropriate data availability.
3.3. Evaluation Metrics
In this research, various evaluation metrics are used to evaluate the developed models for measuring the SHC. The metrics are Mean Squared Error (MSE), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and R-squared (R²).
Mean Squared Error (MSE): MSE measures the average of the squares of the errors between actual (
) and predicted (
) values [
46]. This study quantifies the average squared difference between the observed and predicted SHC values, providing insight into the overall accuracy of the predictive model. MSE is particularly important for penalizing more significant errors more heavily, making it a critical metric when large prediction deviations are undesirable.
Mean Absolute Error (MAE): MAE calculates the average of the absolute differences between actual (
) and predicted (
) values. It measures the average magnitude of prediction errors without considering their direction [
47]. In the context of this study, MAE evaluates the average magnitude of errors in predicting SHC, offering a straightforward measure of model performance. Unlike MSE, MAE treats all errors equally, making it more interpretable and suitable for understanding the typical prediction error.
Root Mean Squared Error (RMSE): RMSE is the square root of the average of the squared differences between actual (
) and predicted (
) values [
48]. It measures the standard deviation of the prediction errors and is interpretable in the same units as the target variable. In this study, RMSE assesses the typical error magnitude of the predictive model in predicting SHC. RMSE is crucial when the scale of prediction errors needs to be expressed in the same unit as the target variable, making it easier to contextualize the error magnitude. It is also more sensitive to outliers than MAE, which may be advantageous in specific scenarios.
R-Squared (R
2): R
2, also known as the coefficient of determination, measures the proportion of the variance in the dependent variable (y) that is explained by the independent variable(s) (
) [
49]. It ranges from 0 to 1, where 1 indicates that the model perfectly predicts the dependent variable based on the independent variables, and 0 indicates that the model does not explain any variability in the dependent variable. In this study, R² assesses the goodness of fit of the predictive model to the observed SHC data, indicating how well the model captures the variability in the target variable based on the features used for prediction. R² is particularly important for evaluating the model’s explanatory power and understanding how well the model generalizes to unseen data. It provides an intuitive measure of model effectiveness by comparing explained variance to total variance.
Loss Function: The loss function was constructed using standard regression metrics, including Mean Squared Error (MSE) as the primary optimization criterion. MSE was chosen due to its sensitivity to large errors, ensuring that the model minimizes significant deviations in predictions. Additionally, we monitored Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) to assess model performance comprehensively. These loss functions were optimized using the gradient boosting framework of XGBoost, which iteratively reduces errors by adjusting model weights based on residuals from previous iterations.
3.4. Experimental Setup
Experiments are conducted using common Python libraries for data preprocessing, model development, and evaluation. Pandas is utilized for data manipulation and analysis, while NumPy supports numerical computing and array operations. Scikit-learn provides a range of algorithms for ML tasks, including model training, evaluation, and preprocessing. DL models are constructed using TensorFlow and PyTorch, which offer high-level APIs for building neural networks and optimizing performance. The matplotlib library facilitates data visualization to track dataset characteristics and model performance. Random seed values are set to ensure reproducibility across experiments. The dataset is divided into training and testing sets with an 80–20 split, where 80% of the data are for training and the rest 20% are for testing. In addition, a K-fold cross-validation with K = 5 is applied during the training phase to minimize the risk of overfitting and verify the models’ generalization capacity.
Several experiments are conducted based on the systematic division of the feature sets. The details of these feature sets are shown in
Table 5.
6. Conclusions
This study employed various combinations of feature sets to investigate their impact on the accuracy of predictive models for SHC across different weather stations. We identified XGBoost as the top-performing model consistently across all feature divisions and weather stations through comprehensive evaluations. XGBoost demonstrated superior predictive accuracy, effectively utilizing the comprehensive information in Feature Set 2 to enhance prediction performance. For instance, at the BWS, XGBoost achieved an MSE of 2.1923, MAE of 8.0211, RMSE of 14.8064, and an R² value of 97.851 in the 3-feature set, showcasing its robustness. Similarly, at weather station 1420, XGBoost excelled with an MSE of 2.5, MAE of 8.5, RMSE of 15.7, and an R² value of 97.6. Our analysis revealed that the BWS generally exhibited the best results, indicating its potential for providing more representative data for SHC prediction. Moreover, the All Features division consistently outperformed the subsets of features (3 or 7), emphasizing the importance of utilizing comprehensive feature sets to capture intricate relationships within the data. For example, across all weather stations, the All Features division consistently resulted in lower error metrics, with XGBoost achieving its highest accuracy levels.
This study focuses on a single building due to data availability, but the methodology is designed to be generalizable to other office buildings with similar heating demand characteristics. The selected building, consisting of two interconnected skyscrapers, presents a unique case with complex heating dynamics influenced by its architectural design and urban environment. The proposed approach is highly scalable, as it can be implemented in other buildings with access to granular heating consumption data from Building Management Systems (BMS). These systems collect detailed operational data, which, when combined with weather data obtained from an open-source website, enables easy adaptation to various buildings and locations. However, the accuracy of the model relies heavily on the quality and granularity of the data, highlighting the importance of robust data management practices for effective implementation and scalability across different settings.
In future work, we will delve deeper into feature engineering to identify additional variables that could enhance predictive performance. Exploring advanced ensemble techniques or DL architectures explicitly tailored for the SHC prediction task could also be beneficial. Additionally, an analysis of computational efficiency, including training times and model complexity trade-offs, will be conducted to assess real-world implementation feasibility. Furthermore, incorporating external factors such as building characteristics, occupancy patterns, or socioeconomic factors could improve the robustness and generalizability of the predictive models. An extended evaluation of model interpretability using SHAP (Shapley Additive Explanations) values will also be explored to identify the most influential predictors, ensuring better transparency and trust in the predictions. Overall, continued research in this domain holds the potential to refine predictive models and contribute to more efficient energy management strategies in office building settings.
Moreover, this study’s findings can extend to residential buildings by adjusting feature selection and model training to account for occupancy patterns, insulation levels, and heating system variations. Residential heating consumption is more dynamic due to diverse user behaviors and seasonal changes. XGBoost’s ability to capture complex relationships between weather, operations, and heating suggests its potential for residential heating forecasts. Integrating smart meters and IoT-based monitoring can further enhance energy efficiency, helping homeowners and policymakers reduce costs and carbon footprints. Future work can validate the model’s adaptability using diverse residential datasets.
This study underscores organizations’ ability to develop in-house energy management solutions, enabling them to autonomously meet their energy responsibilities. By aligning with the EPB European Directive’s standards for energy performance in buildings, our approach bridges a critical gap in research, highlighting the practical application of advanced predictive models in enhancing energy efficiency. The insights gained from this study can guide organizations in leveraging their resources to meet stringent energy efficiency measures, ultimately contributing to sustainable energy management and compliance with European standards.