Developing a Data-Driven AI Model to Enhance Energy Efficiency in UK Residential Buildings

Seraj, Hamidreza; Bahadori-Jahromi, Ali; Amirkhani, Shiva

doi:10.3390/su16083151

Open AccessArticle

Developing a Data-Driven AI Model to Enhance Energy Efficiency in UK Residential Buildings

by

Hamidreza Seraj

^1,*

,

Ali Bahadori-Jahromi

¹

and

Shiva Amirkhani

²

¹

Department of Civil and Environmental Engineering, School of Computing and Engineering, University of West London, London W5 5RF, UK

²

Sustainability and Climate Change, WSP, 6 Devonshire Square, London EC2M 4YE, UK

^*

Author to whom correspondence should be addressed.

Sustainability 2024, 16(8), 3151; https://doi.org/10.3390/su16083151

Submission received: 5 March 2024 / Revised: 29 March 2024 / Accepted: 6 April 2024 / Published: 10 April 2024

(This article belongs to the Section Energy Sustainability)

Download

Browse Figures

Versions Notes

Abstract

:

Residential buildings contribute 30% of the UK’s total final energy consumption. However, with less than one percent of its housing stock being replaced annually, retrofitting existing homes has significant importance in meeting energy-efficiency targets. Consequently, many physics-based and data-driven models and tools have been developed to analyse the effects of retrofit strategies from various points of view. This paper aims to develop a data-driven AI model that predicts buildings’ energy performance based on their features under various retrofit scenarios. In this context, four different machine learning models were developed based on the EPC (Energy Performance Certificate) dataset for residential buildings and standard assessment procedure (SAP) guidelines in the UK. Additionally, an interface was designed that enables users to analyse the effect of different retrofit strategies on a building’s energy performance using the developed AI models. The results of this study revealed the artificial neural network as the most accurate predictive model, with a coefficient of determination (R²) of 0.82 and a mean percentage error of 11.9 percent. However, some conceptual irregularities were observed across all the models when dealing with different retrofit scenarios. All summary, such tools can be further improved to offer a potential alternative or support to physics-based models, enhancing the efficiency of retrofitting processes in buildings.

Keywords:

machine learning; energy performance certificate; building energy consumption

1. Introduction

The building sector accounts for the largest share of global total final energy consumption (more than 35 percent), and over 85% of this is allocated to the operational needs of buildings [1]. In response, various energy conservation plans for buildings have been developed to ensure their efficiency and sustainability. In this regard, building energy performance prediction is crucial for implementing energy efficiency plans in buildings, facilitating effective conservation measures that lead to reduced energy consumption.

Energy prediction models can be classified into two different categories: physics-based and data-driven (artificial intelligence) modelling approaches [2]. Physics-based models (e.g., IESEVE, EnergyPlus) utilize physical and thermodynamic laws and require a large set of input data regarding building characteristic details, such as envelope materials and thickness. These models often involve lengthy simulation times and lack flexibility when it comes to making changes to individual variables, making them less suitable for dynamic scenarios.

On the other hand, data-driven models utilize artificial intelligence algorithms (e.g., machine learning algorithms) to discover non-linear relationships between inputs (e.g., building features) and outputs (e.g., annual energy consumption). This capability enables them to learn consumption patterns from historical data, enhancing their predictive accuracy. Hence, many machine learning algorithms, including regression- and classification-based approaches, have been developed to predict building energy consumption, and various studies of different case studies have been conducted to assess the accuracy and effectiveness of these models in estimating buildings’ energy consumption.

Razak et al. [3] conducted a study to utilize ensemble learning classification-based methods including support vector machine (SVM), gradient boosting (GB), and extra trees (ET) to predict residential buildings’ energy performance certificates in the UK. The findings of this study highlighted the better performance of the ET algorithm; however, the accuracy index of all the developed models ranged between 0.56 and 0.64. This study was later extended by [4], in which nine different models including an artificial neural network (ANN), a deep neural network (DNN), and ensemble learning regression-based techniques were developed to predict annual energy consumption (KWh/m²) in residential buildings across the UK. The output of the developed models in this study revealed outstanding results for the deep neural network (DNN), artificial neural network (ANN) support vector machine (SVM), and gradient boosting (GB) algorithms, in which the R² values obtained were higher than 0.9. However, feature importance results in these models show that the output of the model is highly dominated by the “total floor area”, which is in contrast with the fact that the heating system is the most important factor in UK homes’ energy consumption [5].

Another study, conducted in the UK by Seyedzadeh et al. [6] to predict energy performance in non-domestic buildings, used a gradient boost regression tree algorithm to assist in building energy retrofit planning. This study utilized a comprehensive dataset encompassing detailed information on various building features such as air infiltration rates, the characteristics of each building facade, internal gains from equipment, etc. Their model also utilized advanced evolutionary algorithms for optimization, which resulted in less than a 2 percent error.

Some studies have incorporated on-site measurements to develop their ML models. In this regard, Shao et al. [7] utilized an SVM regression algorithm to predict daily energy consumption in a hotel building in China. They created their dataset using daily weather data (temperature and relative humidity) along with on-site measurements of building energy system characteristics. The result of this study revealed outstanding results in terms of R² and mean squared error metrics, which were recorded as 0.94 and 2.2%, respectively. Moreover, Dep et al. [8] found the recurrent neural network (RNN) as the most accurate algorithm in predicting hourly space heating demand for a single-family house located in Switzerland. They utilized 37 sensors to collect data and three different feature selection methods to analyse 41 features and create prediction models. The evaluation of the developed models shows that the RNN has better performance in terms of the R² metric.

Araujo et al. [9] utilized three different ML models, including the GB, ET, and ANN-multilayer perceptron (MLP), based on residential buildings’ EPC database to predict annual heating, cooling, and overall primary energy (KWh/m²) for a case study building. They selected 20 features including general details, construction elements, equipment, and the glazing system of a property. The performance indicator results of the model showed acceptable results for predicting total annual energy in which the R² value obtained was 0.79 for the ET model. On the other hand, its performance was less accurate when predicting annual cooling load, with an R² value not exceeding 0.41.

Pham et al. [10] utilized a random forest (RF) model to predict hourly energy consumption in five educational buildings on 1-step-ahead, 12-step-ahead, and 24-step-ahead bases. They also considered different scenarios to assess the effect of variations in the length of the training data (ranging from 67% to 92%) on the accuracy of the model. The study results showed the model performs acceptably for predicting 1-step-ahead, with mean absolute percentage error (MAPE) values between 5% and 15% for each dataset, and building and changing the model’s training size doesn’t necessarily affect its accuracy.

The accuracy of the above-mentioned predictive models is highly dependent on elements such as the extent of the dataset, designed features, target variable, step of prediction, and so on. Moreover, the absence of a user interface to facilitate assessing models’ accuracy from a conceptual point of view was notable, resulting in these studies relying solely on statistical metrics to evaluate the model performance. In order to fill this gap, this research focused on designing models’ features (feature engineering) based on the Standard Assessment Procedure (SAP) and utilizing four machine learning models (including XGB, RF, ET, and ANN-MLP) to develop an AI-based tool that predicts a residential building’s annual energy consumption based on its characteristics. Moreover, an interface will be designed that enables users to input a specific case study building’s characteristics and assess the impact of retrofit strategies on the building’s energy performance, addressing the following key research questions in this study:

What are the key building features that significantly impact energy consumption in residential buildings, and how do they contribute to the predictive accuracy of machine learning models?
To what extent can machine learning models effectively capture dynamic changes in energy consumption patterns in residential buildings?
How do different machine learning algorithms, such as XGB, RF, ET, and ANN-MLP, compare in terms of their effectiveness in predicting residential buildings’ annual energy consumption?

2. Materials and Methods

2.1. Dataset

The study utilized the Energy Performance Certificate dataset for residential buildings in the UK, published by the Department for Levelling Up, Housing and Communities [11]. While the dataset includes information from across the UK, this study specifically focuses on residential buildings in London. This dataset contains detailed numerical and categorical information on building envelope characteristics, energy systems, and estimated annual energy consumption. The data was collected by energy assessors (experts) and follows the Standard Assessment Procedure (SAP), ensuring a comprehensive and standardized assessment of energy performance in residential buildings.

2.2. Data Pre-Processing

Raw data often includes various irregularities, such as missing values, noise, inconsistencies, and redundancies. These anomalies can impact the effectiveness of subsequent learning processes. As a result, the preprocessing step is a common procedure to minimize the impact of any data irregularities on the quality and reliability of subsequent processing steps [12]. It should be noted that in this research, data processing and model development were conducted using the “Sklearn” Python package [13].

To reduce noise and data irregularities in the model, case studies with missing values and outliers were excluded from the dataset. This ensured that the model was built on more accurate observations. For instance, case studies where the floor height was less than 2 m, total annual energy consumption exceeded 600 kWh/m², or the floor area was less than 20 m² were removed from the dataset.

The dataset includes some categorical features, so two common methods for data encoding were utilized in the model: one-hot encoding and label encoding. Each method comes with its own set of pros and cons. While one-hot encoding is effective in representing categorical variables, it can increase dimensionality and complexity, potentially reducing model interpretability; On the other hand, label encoding assigns ordinal meaning to categories. As a result, both methods were applied in each ML model to assess their effectiveness.

Data normalization is another essential pre-processing step which involves the transformation of features in a defined range (within the range of 0 to 1 in this study) so that greater numeric feature values (e.g., total floor area and percentage of low-energy lighting) cannot dominate the model, which will lead to minimizing the bias of these features in the model [14].

Furthermore, a correlation analysis of the input variables was conducted in this study using a correlation matrix. A correlation matrix is a table (or figure) that represents the correlation coefficient between variables, and a correlation coefficient is a statistical measure that quantifies the relation between two variables. This value ranges between −1 and 1, where 1 indicates a perfect positive correlation, −1 shows a perfect negative correlation, and 0 indicates no correlation. A high correlation between two variables (close to 1 or −1) is an indication that they are highly dependent on each other.

In this regard, the correlation matrix of input variables of the study is shown in Figure 1. So, to reduce the complexity and dimension of the model, whenever the correlation coefficient was calculated at more than 0.8, the redundant variables were removed from the model. For instance, as illustrated in Figure 1, a noticeable correlation is evident between “number of heated rooms” and “number of habitable rooms” with “total floor area”. Consequently, both variables were considered redundant due to this high correlation and were subsequently removed from the model. Overall, correlation analysis eliminated four input variables, resulting in twenty variables remaining in the model. This boosted the model’s simplicity and eliminated unnecessary redundancy, ensuring a more focused and efficient model performance.

2.3. Feature Engineering

A “feature” is an attribute or variable used to describe some aspect of an individual data object, and the general idea of “feature engineering” includes the process of transformation, generation, extraction, selection, analysis, and evaluation of features within a dataset [15]. In this phase of the research, potential features related to both the building energy system and building physics were extracted from the dataset. Additionally, new features were generated based on the descriptions provided for building characteristics in the EPC dataset. Designing new features and their assigned values (or categories) follows the Standard Assessment Procedure (SAP). For instance, using the collected information about wall descriptions, two features were created: “wall type” (including categories like solid brick, cavity wall, and timber frame) and “wall insulation” (with categories such as insulated and as-built). Further details about the selected and designed features can be found in Table 1.

2.4. Data Exploration

An investigation into the 14,000 remaining case studies within the dataset (after data pre-processing) reveals significant trends across various parameters. Regarding property type, the majority comprises flats, accounting for approximately 58% of the dataset, followed by houses at 33%, maisonettes at 7%, and bungalows at 2%. Analysing the heating systems utilized, a clear dominance of gas boilers emerges, representing over 85% of the case studies. Other heating systems such as electric storage and electric room heaters also feature, although to a lesser extent. Notably, only 0.5% of the case studies employ air source heat pumps as their primary heating system. Looking into the construction age bands, nearly 40% of the properties were constructed before 1950, suggesting potential issues with building insulation standards, particularly if they have not been retrofitted. Conversely, only 3.5% of the properties were constructed post-2007, highlighting a modernization gap in the housing stock. Overall, the data shows a central tendency around 200 kWh/m² annual energy consumption, with an average consumption of 226 kWh/m², as shown in Figure 2.

2.5. Model Selection

In this study, a diverse set of ML algorithms was applied to predict building annual energy consumption, including ensemble learning methods as well as an artificial neural network with a multilayer perceptron. Ensemble learning methods integrate individual regressors to improve the predictions. It is accepted that each regressor is likely to learn different aspects of training data. So, combining multiple regressors using ensemble learning can enable the final ML model to search in a wide solution space [16]. In this regard, the ensemble learning methods chosen for this study are random forest (RF), XGBoost (XGB), and extra trees (ET).

These algorithms were chosen based on their accurate performance in the literature for predicting annual building energy consumption, effectiveness in handling both categorical and numerical input features [17], and their robustness in non-linear relationship modelling [16,18]. The following section will provide a highlight of the key features and algorithms employed in the selected ML models.

2.6. Models Theoretical Background

2.6.1. Random Forest

The random forest algorithm is an ensemble learning method that operates by constructing a multitude of decision trees during the training process. A decision tree is a tree-like model where an input is progressively split into subsets based on the values of particular features; and the goal is to create a model that predicts the target variable’s value by learning simple decision rules, inferred from the data features.

A random forest model generates hundreds or even thousands of these decision trees, which act as regression functions on their own, and the final output of the random forest regression is the average of the outputs of all the decision trees. If “X” represents the input vector containing “m” features with

X = \{x_{1}, x_{2}, . . ., x_{m}\}

, “Y” represents the output scalar and “S_n” the training set containing “n” samples, which can be expressed as:

S_{n} = \{(X_{1}, Y_{1}), (X_{2}, Y_{2}), \dots, (X_{n}, Y_{n})\} X \in R^{m}, Y \in R

(1)

So, the random forest model can be built by randomly sampling a feature subset for each decision tree, or by randomly sampling a training data subset for each decision tree. This randomly collected sample process is called a “bootstrap”. Each bootstrap sample is obtained by randomly selecting n observations with replacements from the original dataset, and each observation has a probability of 1/n to be selected.

Furthermore, the “bagging” algorithm selects several bootstrap samples

(S_{n}^{θ_{1}}, S_{n}^{θ_{2}}, \dots, S_{n}^{θ_{q}})

in order to build a collection of “q” prediction trees

\hat{h} (X, S_{n}^{θ_{1}}), \hat{h} (X, S_{n}^{θ_{2}}) \dots, \hat{h} (X, S_{n}^{θ_{q}})

. The ensemble produces q outputs corresponding to each tree,

\hat{Y_{1}} = \hat{h} (X, S_{n}^{θ_{1}}), \hat{Y_{2}} = \hat{h} (X, S_{n}^{θ_{2}}), \dots . ., \hat{Y_{q}} = \hat{h} (X, S_{n}^{θ_{q}})

. Then, the aggregation is performed by averaging the outputs of all the trees. Consequently, the estimation

\hat{Y}

can be obtained by Equation (1), where

\hat{Y_{1}}

is the output of the l-th tree, and l = 1, 2, …, q [19]:

\hat{Y} = \frac{1}{q} \sum_{l = 1}^{q} \hat{Y_{1}} = \frac{1}{q} \sum_{l = 1}^{q} \hat{h} (X, S_{n}^{θ_{1}})

(2)

The framework for using the random forest regression model is presented in Figure 3.

2.6.2. Extra Trees

Extremely randomized trees (extra trees) is an extension of the random forest algorithm that utilizes additional randomness, not only in the selection of feature subset, but also in the determination of the splitting thresholds for each node. This increased level of randomness aims to create a more diverse set of trees, potentially less likely to overfit a dataset [20].

Although extra trees employs the same principle as random forest, its two key differences are that it splits nodes by choosing cut points fully at random, and it uses the whole learning sample (rather than a bootstrap replica) to grow the trees [20,21]. From a computational point of view, the random splitting process in extra trees (rather than the optimization process for the splitting feature in random forest) is less computationally demanding. As a result, additional randomness in the splitting, robustness against overfitting, and computational efficiency makes extra trees a particularly useful model when dealing with high-dimensional datasets and complex relationships.

2.6.3. XGBoost

Extreme gradient boosting (known as XGBoost) is an end-to-end tree boosting which employs a sparsity-aware algorithm for sparse data and a weighted quantile sketch for tree learning [22]. One of the key factors associated with XGBoost is its scalability, where the system runs ten times faster than existing popular algorithms on a single machine [22]. In this regard, parallel and distributed computing accelerates the learning process, which results in quicker exploration of models.

While random forest builds a set of independent trees and averages their predictions, XGBoost sequentially adds new trees to the ensemble; and each tree aims to improve upon the correction of errors made by the previous trees. A brief description of the XGBoost algorithm for regression along with relevant equations is presented in the following paragraphs.

Given a dataset,

D = {(x_{i}, y_{i}) : i = 1,2, \dots, n x_{i} \in R^{m}, y_{i} \in R}

, n observations are obtained, each comprising m features, along with a corresponding variable, y. Let

\hat{y_{i}}

denote the result produced by an ensemble represented by the generalized model as Equation (3). In this equation,

f_{k}

represents a regression tree, and

f_{k} (x_{i})

denotes the score assigned by the k-th tree to the i-th observation in the dataset. To achieve the function

f_{k}

, the regularized objective function should be minimized according to Equation (4), where

L

is the loss function.

To avoid excessive model complexity, the penalty term Ω is incorporated, as shown in Equation (5). In this equation, T represents the total number of leaf nodes, and ω is the score of each leaf node. γ and λ are controlling factors employed to avoid overfitting [23,24]. Hence, the objective function that is minimized in the j-th iteration is re-written as Equation (6). This function can be simplified using a Taylor polynomial and approximated as Equation (7), where “

g_{i}

” is the first-order derivative, and “

h_{i}

” denotes the second-order derivative in Equations (8) and (9).

\hat{y_{i}} = ϕ (x_{i}) = \sum_{k = 1}^{K} f_{k} (x_{i})

(3)

L (ϕ) = \sum_{i} l (y_{i}, \hat{y_{i}}) + \sum_{k} Ω (f_{k})

(4)

Ω (f_{k}) = γ T + \frac{1}{2} {λ | |ω| |}^{2}

(5)

L^{j} = \sum_{i = 1}^{n} l (y_{i}, {\hat{y_{i}}}^{(j - 1)} + f_{j} (x_{i})) + Ω (f_{i})

(6)

L^{j} = \sum_{i = 1}^{n} [l (y_{i}, {\hat{y}}^{(j - 1)}) + g_{i} f_{j} (x_{i}) + \frac{1}{2} h_{i} f_{j}^{2} (x_{i})] + Ω (f_{i})

(7)

g_{i} = \partial_{{\hat{y}}_{i}^{(j - 1)}} l (y_{i}, {\hat{y}}_{i}^{(j - 1)})

(8)

h_{i} = {\partial^{2}}_{{\hat{y}}_{i}^{(j - 1)}} l (y_{i}, {\hat{y}}_{i}^{(j - 1)})

(9)

2.6.4. ANN-MLP

While ensemble learning techniques utilize the strengths of multiple simpler models to improve the overall predictive performance of the system, artificial neural network (ANN) models focus on building a single, complex system to find relationships within input data (as shown in Figure 4). In this regard, various types of ANN models have been developed, including the multilayer perceptron (MLP), convolutional neural network, and recurrent neural network. However, some research has indicated that an MLP neural network has the most reliable performance for prediction models, particularly in scenarios involving multiple input and output variables [25,26].

A MLP network consists of an input layer, one or more hidden layers, and an output layer. Each layer has several processing units (nodes), fully interconnected through weighted connections to subsequent layers. Figure 4 represents an overview of the conceptual framework of an ANN-MLP model, where

R_{t - 1}

to

R_{t - d}

is the input to the i-th node, and Q_t is the output of the output layer. The MLP transforms “n” inputs to “l” outputs using activation functions, such as Relu or sigmoid, based on Equation (10). In this equation,

σ

denotes the activation function,

x_{j}

the activation of the j-th hidden layer node,

w_{i j}

the interconnection weight between the j-th hidden layer node and the i-th output layer node, and

b_{i}

is the bias term for the i-th output layer.

a_{i} = σ (\sum_{j = 1}^{n} w_{i j} x_{j} + b_{i})

(10)

Similar to many machine learning algorithms, a loss function will be defined that measures the difference between the actual output and predicted output of the network. So, the aim is to reduce the error by adjusting the number and weights of the interconnections between layers using algorithms such as gradient descent back propagation. More details of the application and principals of the algorithm can be found in [27,28].

Figure 4. General structure of an ANN-MLP system [29].

2.7. Hyperparameter Tuning

Hyperparameter tuning plays a crucial role in enhancing models’ prediction performance. It involves finding the optimal set of parameters that define the main structure of the model and cannot be directly learned from training data. For example, in an XGBoost model, hyperparameters such as the learning rate, the maximum depth of each tree, and the minimum loss reduction are tuned to optimize the performance of the model. Similarly, in the ANN-MLP model, hyperparameters such as the number of hidden layers, number of nodes in each hidden layer, optimization algorithm used to update the weights, maximum number of iterations (epochs) for training the neural network, and activation function are adjusted during the tuning process.

In some cases, particularly when dealing with large and complex datasets, methods like random search can offer similar benefits to more sophisticated hyperparameter tuners while requiring fewer computational resources and being easier to implement. Empirical experiments have shown that a simple random search algorithm, sampling as few as 60 hyperparameter combinations, can perform as effectively as an exhaustive grid search spanning over 4000 hyperparameter values [18,30]. As a result, considering the complexity of the dataset, the randomized search method has been utilized in this study for the hyperparameter tuning process. Figure 5 presents the procedure of the research in this study.

2.8. Performance Evaluation

In order to assess the performance of the model, the dataset was split into training and test sets using an 80–20 split, with approximately 11,000 samples allocated for training and the remaining 20% for testing. Additionally, to ensure robustness in the evaluation, a k-fold cross-validation technique with k = 5 was employed. This approach allowed for iterative training and evaluation of the model on different subsets of the data, mitigating the risk of overfitting and providing a more reliable estimate of its generalization performance.

Moreover, to assess the performance of a trained machine learning model, many evaluation metrics have been developed. Evaluation metrics not only quantify the difference between actual and predicted values but also are designed to highlight issues such as overfitting, large errors, and outliers in the model. In this regard, the following metrics were utilized in this research to evaluate the models’ performance.

Coefficient of determination (R²): a statistical measure that represents the proportion of variance in the dependent variable that is explained by the independent variables in the model. It is calculated as the ratio of the explained variance to the total variance as Equation (11), where $y_{i}$ is the actual value of the dependent variable, and ${\hat{y}}_{i}$ is the predicted value.

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - {\bar{y}}_{i})}^{2}}

(11)

Normalized mean bias error (NMBE) measures the average bias in a set of predicted values, relative to the average of the actual values. It can be inferred from Equation (12) that a lower absolute value in the NMBE indicates that the model’s predictions are closer to actual values.

N M B E = \frac{1}{\bar{y}} . \frac{\sum_{i = 1}^{n} y_{i} - {\hat{y}}_{i}}{n}

(12)

Root mean square error measures the average deviation between the predicted values and the actual values in a dataset using Equation (13).

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

(13)

Mean percentage error measures the percentage deviation between the predicted values and the actual values in a dataset, as shown in Equation (14).

M P E = \frac{1}{n} \sum_{i = 1}^{n} \frac{|y_{i} - {\hat{y}}_{i}|}{y_{i}} \times 100 %

(14)

3. Results

As previously noted, the results of the developed models have been analysed from two perspectives. Firstly, the models were evaluated using standard machine learning metrics such as “R²”. Secondly, an interface was developed to investigate the models’ response to changes in building features (possible retrofits), such as adding insulation to external walls, or using triple-glazed windows. In the first step, the performance of the developed ML algorithms (XGBoost, RF, ANN-MLP, and ET) in predicting building energy performance is presented in Table 2. It can be observed that the ANN-MLP model showed the highest accuracy in terms of evaluation metrics with an R² of 0.82, RMSE of 36.21, MPE of 11.86, and NMBE of 11.9. The other three models performed similarly, with XGBoost showing slightly better results.

Considering the complexity of the objective function in this research, relying only on ML evaluation metrics may not provide a comprehensive understanding of the models’ performance. Therefore, the feature importance analysis for ensemble learning models, which highlights the relative significance of different feature in predicting building energy performance, is presented in Figure 6.

In this regard, the conducted analysis aligns with the fact that almost 63 percent of energy consumption in UK homes is related to space heating [5], in which both RF and XGB models highlight the “heating system” as the most important feature in predicting building energy performance, with relative importance scores of 0.26 and 0.29, respectively. Although the ET model also identifies “heating system” as the third most important feature, its relative score of 0.11 suggests a lower impact for this feature compared to the other models.

In the XGB model, other top important features are related to the building envelope (floor insulation, external wall type, and roof insulation) and have a direct impact on space heating. Moreover, “hot water system type”, which has the third ranking in the XGB model feature importance score, is another major energy consumer in UK homes, accounting for 17% of total energy demand [5]. These consistencies across all the models, particularly the XGB model, emphasize the accuracy of the developed model in terms of prioritizing critical features for building energy performance prediction.

In order to get more insight about models’ predictive performance, Figure 7 displays the difference between the predicted and actual values of buildings’ annual energy consumption (Kwh/m²) for 2800 test cases. This figure reveals that concentration of residuals (actual–predicted) occurs around the central line “y = 0”, especially between lines “X = 100” and “X = 250”; which indicates higher accuracy of the models for case studies with actual annual energy consumption from 100 to 250 Kwh/m². Moreover, in this area, the plot appears to be relatively symmetric with a slight positive bias and some outliers (especially in ensemble models), suggesting potential areas for model refinement.

Lighter-density data points below the central line “y = 0” can be observed in the right side of the “X = 400” line, which indicates the negative bias of the models for these test case studies. All in all, based on the overall distribution of residuals, the ANN-MLP model appears to perform more robustly, with a smaller number of outliers, more symmetric, and concentration around the central line.

Furthermore, to investigate the performance of the developed models and ensure their conceptual functionality, a user interface was designed using the “Streamlit” framework. Figure 8 illustrates a capture of this interface, which enables users to input building features for a specific case study and observe the models’ prediction.

Through this interface, various characteristics can be modified, such as adding insulation to external walls or changing the heating system. By observing the response of each model, it can be evaluated whether the predictions align with the expected outcomes. For instance, when insulation is added, it is expected that annual energy consumption will decrease. Through this interactive analysis, the aim is to validate the models’ predictive accuracy and verify their ability to capture real-world changes in building characteristics.

In this context, a case study characterized by the features outlined in Table 3 was initially considered, representing the base case scenario. Subsequently, various retrofit strategies detailed in Table 4 were examined to assess their potential impact on the case study’s energy performance.

It can be observed that adding insulation leads to a reduction in energy consumption across all the models, ranging from 20% in the ANN-MLP model to 4% in the ET model. This aligns with expectations as improved thermal insulation reduces heat loss and enhances energy efficiency. However, the amount of reduced energy consumption is more acceptable and realistic in the XGBoost and ANN-MLP models, which highlights their accuracy in this particular retrofit. On the other hand, filling cavity walls, which is another practice for energy efficiency in homes, does not result in an energy consumption reduction in the XGB and ANN-MLP models, which suggests an area for potential improvements in the models.

When utilizing an air source heat pump, only the ANN-MLP model predicts a reduction in energy consumption (around 7%), while a significant increase is recorded across all the other models. This may indicate a lack of accuracy among the ensemble learning models in this specific retrofit scenario. Overall, based on analysed data for this case study, it can be interpreted that the ANN-MLP model shows more reliability when dealing with potential retrofit interventions.

4. Discussion

The present study aimed to incorporate ML models along with SAP guidelines and the EPC dataset in the UK to design an AI tool which predicts annual primary energy consumption in residential buildings. The obtained results indicated that the model built on ANN-MLP showed the best performance in terms of both statistical metrics and predicting dynamic changes in building features. However, the validation results showed lower accuracy compared to the studies conducted by Seyedzadeh et al. [6] and Shao et al. [7]. This disparity could be justified considering that they utilized either a more detailed dataset for limited case studies or implemented on-site measurements.

Further comparisons with the works of Araujo et al. [9] and Razak et al. [4], who developed models based on residential buildings’ EPC certificate datasets in different countries with varying feature engineering approaches, revealed comparable levels of accuracy in terms of statistical metrics (R² value of around 0.8). However, it is important to note that the dynamic response of their models to building retrofits was not explicitly presented in their studies. This underscores the contribution of this research in evaluating the adaptability of AI tools to predict energy consumption changes resulting from building retrofits.

The analysis of feature importance in this study aligns with the findings of Razak et al. [3], particularly regarding the significance of building envelope-related features, such as wall type and floor insulation, which are among the top five factors influencing buildings’ energy consumption. On the other hand, while the heating system emerges as the most significant feature in this study and measured data [5], it is not among the top features affecting the predictive performance of the models developed by Razak et al. [3].

Moreover, the literature lacks detailed analyses of models’ predictive performance under specific conditions. Contrary to our results, which demonstrate minimal errors within the range of 150 to 250 kWh/m² of building energy consumption (as depicted in Figure 7), prior studies have not specified the conditions under which their models exhibit optimal performance. This gap highlights the need for more comprehensive investigations to better understand how contextual factors affect model performance and improve the reliability of energy consumption forecasts.

Moving forward, exploring the integration of additional factors such as control systems and occupant behaviour into predictive models could lead to more comprehensive and precise energy consumption forecasts.

5. Conclusions

In this paper, four machine learning models including XGBoost, random forest, extra trees, and an artificial neural network were developed based on the EPC database for residential buildings and standard assessment procedure (SAP) guidelines in the UK. Additionally, an interface was designed that enables user to analyse the effect of different retrofit strategies on a building’s energy performance. The results of this study indicate that the ANN-MLP model outperforms other machine learning models with a coefficient of determination (R²) of 0.82 and a mean percentage error of 11.9 percent. Across all the models, the heating system is the most influential feature in predicting building energy performance, followed by building envelope features such as wall type and insulation.

Furthermore, the obtained results suggest higher accuracy of the models in case studies with actual annual energy consumption ranging from 100 to 250 kWh/m², while some outliers have been observed in predicting case studies with annual energy consumption exceeding 400 kWh/m². Despite these challenges, these developed AI models offer a potential alternative or support to physics-based models in facilitating the retrofitting process of buildings. Utilizing the various datasets available, these models can contribute significantly to improving energy efficiency in residential buildings and ultimately help in achieving energy efficiency targets.

Author Contributions

Original draft, H.S.; Methodology, H.S.; Software, H.S.; Writing—original draft, A.B.-J. and H.S.; Writing—review & editing, A.B.-J. and S.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

Author Shiva Amirkhani was employed by the company Sustainability and Climate Change, WSP. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationship that could be constructed as a potential conflict of interest.

References

UNEP. Towards a Zero-Emissions, Efficient and Resilient Buildings and Construction Sector. Global Status Report for Buildings and Construction. 2020. pp. 9–10. Available online: https://globalabc.org/sites/default/files/inline-files/2020%20Buildings%20GSR_FULL%20REPORT.pdf (accessed on 4 March 2024).
Somu, N.; Raman M R, G.; Ramamritham, K. A deep learning framework for building energy consumption forecast. Renew. Sustain. Energy Rev. 2021, 137, 110591. [Google Scholar] [CrossRef]
Olu-Ajayi, R.; Alaka, H.; Sulaimon, I.; Grishikashvili, K.; Sunmola, F.; Oseghale, R.; Ajayi, S. Ensemble Learning for Energy Performance Prediction of Residential Buildings. Research.Herts.Ac.Uk. Available online: https://www.research.herts.ac.uk/ws/files/32433844/Ensemble_learning_for_energy_performance_prediction_of_residential_buildings.pdf (accessed on 23 October 2023).
Olu-Ajayi, R.; Alaka, H.; Sulaimon, I.; Sunmola, F.; Ajayi, S. Building energy consumption prediction for residential buildings using deep learning and other machine learning techniques. J. Build. Eng. 2022, 45, 103406. [Google Scholar] [CrossRef]
Committee on Climate Change. Energy Consumption in UK Homes; Committee on Climate Change: London, UK, 2019. [Google Scholar]
Seyedzadeh, S.; Rahimian, F.P.; Oliver, S.; Rodriguez, S.; Glesk, I. Machine learning modelling for predicting non-domestic buildings energy performance: A model to support deep energy retrofit decision-making. Appl. Energy 2020, 279, 115908. [Google Scholar] [CrossRef]
Shao, M.; Wang, X.; Bu, Z.; Chen, X.; Wang, Y. Prediction of energy consumption in hotel buildings via support vector machines. Sustain. Cities Soc. 2020, 57, 102128. [Google Scholar] [CrossRef]
Deb, C.; Dai, Z.; Schlueter, A. A machine learning-based framework for cost-optimal building retrofit. Appl. Energy 2021, 294, 116990. [Google Scholar] [CrossRef]
Araújo, G.R.; Gomes, R.; Ferrão, P.; Gomes, M.G. Optimizing building retrofit through data analytics: A study of multi-objective optimization and surrogate models derived from energy performance certificates. Energy Built Environ. 2023. [Google Scholar] [CrossRef]
Pham, A.D.; Ngo, N.T.; Truong, T.T.H.; Huynh, N.T.; Truong, N.S. Predicting energy consumption in multiple buildings using machine learning for improving energy efficiency and sustainability. J. Clean. Prod. 2020, 260, 121082. [Google Scholar] [CrossRef]
H. & C. Department for Levelling Up. Energy Performance Certificates and Display Energy Certificates Data for Buildings in England and Wales. Department for Levelling Up, Housing & Communities. 2023. Available online: https://epc.opendatacommunities.org/ (accessed on 10 November 2023).
Obaid, H.S.; Dheyab, S.A.; Sabry, S.S. The impact of data pre-processing techniques and dimensionality reduction on the accuracy of machine learning. In Proceedings of the 2019 9th Annual Information Technology, Electromechanical Engineering and Microelectronics Conference (IEMECON), Jaipur, India, 13–15 March 2019; pp. 279–283. [Google Scholar]
Pedregosa, F. Scikit-learn: Machine learning in python Fabian. J. Mach. Learn. Res. 2011, 12, 2825. [Google Scholar]
Singh, D.; Singh, B. Investigating the impact of data normalization on classification performance. Appl. Soft Comput. 2020, 97, 105524. [Google Scholar] [CrossRef]
Dong, G.; Liu, H. Feature Engineering for Machine Learning and Data Analytics; CRC press: Boca Raton, FL, USA, 2018. [Google Scholar]
Lu, X.; Zhou, W.; Ding, X.; Shi, X.; Luan, B.; Li, M. Ensemble Learning Regression for Estimating Unconfined Compressive Strength of Cemented Paste Backfill. IEEE Access 2019, 7, 72125–72133. [Google Scholar] [CrossRef]
Jin, Z.; Shang, J.; Zhu, Q.; Ling, C.; Xie, W.; Qiang, B. RFRSF: Employee Turnover Prediction Based on Random Forests and Survival Analysis. In Web Information Systems Engineering–WISE 2020; Springer: Cham, Switzerland, 2020; Volume 12343, pp. 503–515. [Google Scholar] [CrossRef]
Villalobos-Arias, L.; Quesada-López, C. Comparative study of random search hyper-parameter tuning for software effort estimation. In Proceedings of the 17th International Conference on Predictive Models and Data Analytics in Software Engineering, Athens, Greece, 19–20 August 2021; pp. 21–29. [Google Scholar] [CrossRef]
Li, Y.; Zou, C.; Berecibar, M.; Nanini-Maury, E.; Chan, J.C.W.; Van den Bossche, P.; Van Mierlo, J.; Omar, N. Random forest regression for online capacity estimation of lithium-ion batteries. Appl. Energy 2018, 232, 197–210. [Google Scholar] [CrossRef]
Geurts, P.; Ernst, D.; Wehenkel, L. Extremely randomized trees. Mach. Learn. 2006, 63, 3–42. [Google Scholar] [CrossRef]
Ahmad, M.W.; Reynolds, J.; Rezgui, Y. Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees. J. Clean. Prod. 2018, 203, 810–821. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef]
Pan, B. Application of XGBoost algorithm in hourly PM2.5 concentration prediction. IOP Conf. Ser. Earth Environ. Sci. 2018, 113, 012127. [Google Scholar] [CrossRef]
Dong, W.; Huang, Y.; Lehane, B.; Ma, G. XGBoost algorithm-based prediction of concrete electrical resistivity for structural health monitoring. Autom. Constr. 2020, 114, 103155. [Google Scholar] [CrossRef]
Zhang, H.; Feng, H.; Hewage, K.; Arashpour, M. Artificial Neural Network for Predicting Building Energy Performance: A Surrogate Energy Retrofits Decision Support Framework. Buildings 2022, 12, 829. [Google Scholar] [CrossRef]
Beccali, M.; Ciulla, G.; Brano, V.L.; Galatioto, A.; Bonomolo, M. Artificial neural network decision support tool for assessment of the energy performance and the refurbishment actions for the non-residential building stock in Southern Italy. Energy 2017, 137, 1201–1218. [Google Scholar] [CrossRef]
Haykin, S. Neural Networks: A Comprehensive Foundation; Prentice Hall PTR: Hoboken, NJ, USA, 1998. [Google Scholar]
Yilmaz, I.; Kaynar, O. Multiple regression, ANN (RBF, MLP) and ANFIS models for prediction of swell potential of clayey soils. Expert Syst. Appl. 2011, 38, 5958–5966. [Google Scholar] [CrossRef]
Khan, M.T.; Shoaib, M.; Hammad, M.; Salahudin, H.; Ahmad, F.; Ahmad, S. Application of machine learning techniques in rainfall–runoff modelling of the soan river basin, Pakistan. Water 2021, 13, 3528. [Google Scholar] [CrossRef]
Villalobos-Arias, L.; Quesada-López, C.; Martínez, A.; Jenkins, M. Hyper-parameter tuning of classification and regression trees for software effort estimation. In Trends and Applications in Information Systems and Technologies; Springer: Cham, Switzerland, 2021; Volume 1367, pp. 589–598. [Google Scholar]

Figure 1. Correlation matrix for input variables.

Figure 2. Frequency of annual energy consumption in number of case studies.

Figure 3. Illustrative diagram of random forest regression model [20].

Figure 5. Methodological approach of the research.

Figure 6. Feature importance analysis for (A) XGBoost, (B) random forest, and (C) extra trees models.

Figure 7. Model performance analysis for (A) XGBoost, (B) random forest, (C) extra trees, and (D) ANN-MLP models.

Figure 8. Snapshot of user interface for AI model to predict building energy consumption.

Table 1. List of designed and selected features (model inputs).

No.	Variable	Type	Range/Categories
1	Property type	Categorical	House, flat, maisonette, bungalow
2	Built form	Categorical	Detached, semi-detached, mid-terrace, end-terrace, enclosed end-terrace, enclosed mid-terrace
3	Glazing type	Categorical	Double, single. secondary, and triple glazing
4	Glazed area	Categorical	Normal, more than typical, less than typical, much more than typical
5	Hot water system	Categorical	Gas boiler, electric immersion, gas multipoint, electric instantaneous, electric boiler, gas instantaneous, air source heat pump
6	Main heating system	Categorical	Gas boiler, electric warm air system, electric room heaters, electric underfloor heating system, electric storage heater, gas-fired warm air system, air source heat pump, electric boiler, gas-fired room heater
7	Secondary heating system	Categorical	Coal/wood heaters, portable electric heaters, portable gas heaters, none
8	Ventilation system	Categorical	Natural, mechanical—extract only, mechanical—supply and extract
9	Construction age band	Categorical	Before 1900, 1900–1929, 1930–1949, 1950–1966, 1967–1975, 1976–1982, 1983–1990, 1991–1995, 1996–2002, 2003–2006, 2007 onwards
10	Heating distribution system	Categorical	Radiators, warm air, room heaters, underfloor system
11	Floor type	Categorical	Solid, premises/dwelling below, to unheated space, suspended, to external air
12	Floor insulation	Categorical	As-built, other dwelling/premises below, insulated
13	Wall type	Categorical	Solid, cavity wall, timber frame
14	Wall insulation	Categorical	As-built, filled cavity, internal insulation, filled cavity plus insulation, external insulation
15	Roof type	Categorical	Dwelling above, pitched roof, flat roof, room roof
16	Roof insulation	Categorical	Another dwelling above, insulated, as-built
17	Floor area	Numerical	20–500 square meters
18	Multi-glazed area of the total glazed area of the property	Numerical	0–100%
19	Floor height	Numerical	2–5 m
20	Percentage of low-energy lighting in the property	Numerical	0–100%

Table 2. Evaluation metrics for developed models.

	R²	MPE	RMSE	NMBE
XGBoost	0.78	12.63	39.37	12.29
Random forest	0.77	13.42	40.83	13.1
Extra trees	0.77	13.07	40.75	13.5
ANN-MLP	0.82	11.86	36.21	11.9

Table 3. Case study building characteristics.

Property type:	Flat	Construction year:	1976–1982	Glazing type:	Double glazed
Built form:	Enclosed mid-terrace	Floor type and insulation:	Suspended, as-built	Roof type and insulation:	Another dwelling above
Hot water system:	Gas boiler	Heating system:	Gas boiler	Ventilation system:	Natural
External wall and insulation:	Cavity wall, as-built	Floor area:	60	Low-energy lighting (%):	100

Table 4. Effect of selected retrofit on case study’s energy performance.

Annual Energy Consumption Based on Model Prediction (Kwh/m²)				Possible Retrofits
XGB	RF	ET	ANN-MLP	Possible Retrofits
176	190	191	181	Base case scenario
181	191	193	167	Changing windows to triple glazing
181	191	193	200	Single-glazed windows
284	192	243	169	Utilizing air source heat pump for hot water and heating system
217	190	197	188	Utilizing mechanical ventilation (supply + extract)
185	192	195	214	50% low-energy lighting
157	177	176	152	Adding internal insulation
156	180	186	140	Adding external insulation
188	177	176	186	Filling cavities and adding insulation

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Seraj, H.; Bahadori-Jahromi, A.; Amirkhani, S. Developing a Data-Driven AI Model to Enhance Energy Efficiency in UK Residential Buildings. Sustainability 2024, 16, 3151. https://doi.org/10.3390/su16083151

AMA Style

Seraj H, Bahadori-Jahromi A, Amirkhani S. Developing a Data-Driven AI Model to Enhance Energy Efficiency in UK Residential Buildings. Sustainability. 2024; 16(8):3151. https://doi.org/10.3390/su16083151

Chicago/Turabian Style

Seraj, Hamidreza, Ali Bahadori-Jahromi, and Shiva Amirkhani. 2024. "Developing a Data-Driven AI Model to Enhance Energy Efficiency in UK Residential Buildings" Sustainability 16, no. 8: 3151. https://doi.org/10.3390/su16083151

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Developing a Data-Driven AI Model to Enhance Energy Efficiency in UK Residential Buildings

Abstract

1. Introduction

2. Materials and Methods

2.1. Dataset

2.2. Data Pre-Processing

2.3. Feature Engineering

2.4. Data Exploration

2.5. Model Selection

2.6. Models Theoretical Background

2.6.1. Random Forest

2.6.2. Extra Trees

2.6.3. XGBoost

2.6.4. ANN-MLP

2.7. Hyperparameter Tuning

2.8. Performance Evaluation

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI