1. Introduction
Increasing societal awareness of environmental challenges has prompted a critical reassessment of our current energy paradigm. There is an urgent need for an energy transition, one that systematically reduces reliance on fossil fuels while enhancing the integration of renewable energy sources (RESs) into the electrical grid [1].
A clear trend in this context is the shift towards a more decentralised and distributed energy grid. In this model, energy production and consumption are localised, enabling solutions such as self-consumption (SC) or collective self-consumption (CSC). The widespread adoption of photovoltaic (PV) systems—facilitated by the feasibility of rooftop solar panel installations [2]—has further accelerated the deployment of SC. Nevertheless, the feasibility of SC depends on achieving an acceptable self-consumption ratio (SCR). To ensure this, energy management systems (EMSs) are usually implemented [3,4].
Within the SC framework, the aim of the EMS is usually to keep the consumption curve of the building following the production curve for as long as possible by, for example, acting on the flexible loads (FLs) of the building. A common FL used in demand-side management (DSM) or demand response (DR) is the heating, ventilation, and air-conditioning (HVAC) system [5]. HVAC systems offer a significant margin for energy efficiency improvements [6,7] and present a notable advantage over energy storage systems (ESSs), as they are already installed in most buildings, whereas batteries require additional investment and deployment. Despite this, numerous studies have proposed renewable energy-based grids incorporating ESSs as a means of energy balancing. For instance, ref. [8] presents an energy balancing methodology that utilises batteries while also integrating predictive models such as deep learning and support vector regression (SVR).
Recent research has increasingly focused on EMS strategies that incorporate predictive models, demonstrating that integrating future forecasts into control strategies enhances system performance [9,10]. As shown in [11], EMSs integrating predictions can improve optimisation results compared to an EMS that operates without predictions. In the context of SC, incorporating energy consumption and production forecasts into the EMS is especially valuable when the objective is to maximise SCR [12].
The accuracy of these forecasts, however, is influenced by numerous factors, including weather conditions, building occupancy levels, and user behaviour [13,14]. This can lead to prediction errors, which, as demonstrated in [15], influence the decision-making process of an EMS. The predictive models therefore need to be as robust and accurate as possible, as demonstrated in [4]; it is thus essential to properly select and design a predictive model capable of minimising prediction errors.
Different types of models have been proposed in the literature for prediction purposes, the most popular being data-driven predictive models [16]. Among these, two principal categories stand out: statistical analysis (SA)-based models—particularly autoregressive (AR) techniques—and machine learning (ML) methodologies.
SA models have been extensively evaluated in diverse forecasting applications, consistently yielding robust results. The Box–Jenkins method, which identifies, estimates, and diagnoses mainly autoregressive integrated moving average (ARIMA)-type models, increases the accuracy of predictions [17]. This is exemplified in [18], where the electricity price for the next 24 h is predicted through the Box–Jenkins method using data from the previous three days. Using a simple ARIMA model, the electricity price is predicted with a mean absolute percentage error (MAPE) of 3.55%. Seasonal autoregressive integrated moving average (SARIMA) models have also been widely used for energy consumption forecasting. This is the case in the work introduced in [19], where a SARIMA model identified by the Box–Jenkins method is used to perform short-term load forecasting (STLF) for a single university building. The model is able to forecast weekdays with an MAPE of 18.82%. However, SA models exhibit limitations when dealing with systems characterised by pronounced nonlinearities, restricting their applicability in complex scenarios.
Nonlinear predictive models are frequently employed for electricity consumption forecasting in buildings, as they effectively capture system nonlinearities [20]. In this sense, ML models, and more specifically neural networks (NNs), have gained relevance [16]. NNs can substantially enhance the performance of an EMS. This is evidenced by [21], where the authors implement an EMS in a building based on NN models that forecast the building’s consumption, weather conditions, and building comfort specifications. The model predictive control (MPC)-type EMS is implemented in simulation using EnergyPlus software and its performance is compared with a second MPC that does not include predictions as inputs. They conclude that the MPC fed with forecasts outperforms the conventional MPC.
Despite their advantages, complex NN architectures often require extensive datasets and computational resources for training. In [22], multilayer perceptron (MLP)-type networks are proposed with 12 inputs, between 24 and 48 neurons, and two or three hidden layers. This complex structure allows the network to predict building energy consumption with an MAPE of 1.71%, but requires 14 h of training. Similarly, the long short-term memory (LSTM) network presented in [23] employs 512 neurons, but delivers a suboptimal MAPE of 21.99%. Alternatively, in [24], a convolutional neural network (CNN) is combined with a double LSTM that comprises 10 layers and over 33,000 parameters. While the MAPE obtained is excellent, the training time is extremely high. This makes it difficult to regularly update the model, which is of great interest when there is a substantial change in, for example, weather conditions (a sudden increase in temperature) or in the pattern of the consumption curve when a high-consumption device is added.
A critical challenge in ML-based energy forecasting is the availability of long-term training datasets, particularly for newly constructed buildings equipped with smart meters but lacking historical data. In [25], an NN model trained on a three-day time window successfully forecasts small-scale loads within a building, achieving an MAPE of 7.05%. However, studies exploring ML models trained on limited datasets remain scarce. The ability to train models on small datasets offers the dual advantage of reduced computational cost and the flexibility for frequent retraining, enabling the model to rapidly adapt to dynamic conditions. In this regard, new types of NNs are emerging that are well suited to small-dataset scenarios. Examples are Kolmogorov–Arnold networks (KANs) [26] and liquid neural networks (LNNs) [27]. The former are presented as an alternative to MLPs and mainly differ in how learning is performed: while the neurons in MLPs are activated by fixed activation functions (sigmoid, linear, etc.), in KANs the activation functions are learnable (B-splines) [28]. As for LNNs, they stand out for their ability to adapt to noisy conditions and have a simple structure, which may make them suitable for applications where computationally low-cost predictions are required.
Moreover, while several studies in the literature explore ML-based approaches for predicting energy consumption in existing buildings, most rely on simulated data rather than real-world measurements. Consequently, the number of studies implementing ML models in fully operational systems remains limited.
To address this gap, this research proposes the design, implementation, and evaluation of two ML models deployed in a real system. These models perform day-ahead hourly consumption forecasting for a single building in real time, as required by the EMS. As the EMS needs to control the consumption of the HVAC system, the prediction of the building’s electricity consumption should not include the effect of the HVAC system. Therefore, the consumption corresponding to the HVAC system was automatically removed from the building’s total consumption curve.
Following the methodology outlined in [29], where three ML models were compared, this study evaluates the performance of two ML approaches: a nonlinear autoregressive model with exogenous inputs (NARX) and an SVR model. Comparing these two ML models is of interest, as such a comparison is rarely found in the literature on simple predictive models trained with limited data. To validate the effectiveness of the ML approaches, a benchmark model is also introduced for comparative analysis.
This study investigates the impact of training ML models, specifically NARX and SVR, with small datasets and how this affects their predictive performance. Unlike traditional approaches that rely on large-scale datasets, this research challenges the assumption that extensive data are necessary for effective model training. Additionally, it focuses on the development of real-time predictive models tailored for integration into an EMS operating in real-world conditions, ensuring that data acquisition and pre-processing can be performed in real time. A key aspect of the study is the analysis of recurrent terms in ML models for time series forecasting, particularly in the context of building energy consumption prediction. Since SVR lacks a recurrent component, this work introduces a novel approach by incorporating a time vector to compensate for this limitation, enhancing its ability to capture temporal dependencies.
The primary contributions of this work are threefold. First, it provides an evaluation of the impact of small-dataset training on NARX and SVR performance, offering insights that challenge the prevailing dependence on large datasets in ML-based energy forecasting. Second, a methodological framework for the development of real-time predictive models is proposed, addressing critical challenges of real-time data acquisition and data pre-processing within an operational EMS. Lastly, the study presents a novel approach for integrating temporal dependencies into SVR using a time vector, demonstrating its effectiveness in improving time series forecasting accuracy. These contributions collectively advance the field of real-time energy forecasting by enhancing model adaptability to limited data and improving the predictive capabilities of SVR in time-dependent scenarios.
The remainder of this paper is structured as follows. Section 2 presents the case study, to which the methodology introduced in Section 3 is applied. In Section 3, the detailed step-by-step design of the predictive models is introduced. Section 4 discusses the obtained results, and Section 5 summarises the main conclusions.
3. Methodology
Day-ahead forecasting of ESTIA 2 average hourly power was performed with the NARX and SVR models. With a sampling time of 1 h, the prediction of the following 24 h was carried out during one week of April. The difference in SVR model performance was analysed when a time vector was introduced as an additional input to the model.
Furthermore, the data used as input to the models were not measured data, but predicted data. Data acquisition and pre-processing were performed automatically (see Section 3.2). Both models, NARX and SVR, were designed, evaluated, and compared based on the minimisation of MAPE and R² (see Equations (2) and (3)). Likewise, both ML models were also compared with a benchmark model, the persistence model. The persistence model assumes that nothing changes between the present time and the future, meaning “tomorrow will be as today”; in a time series, the previous value is taken as the present value [30].
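For illustration, a minimal sketch of this persistence baseline is given below (Python; the function and the synthetic data are purely illustrative and not part of the deployed system):

```python
import numpy as np

def persistence_forecast(hourly_load: np.ndarray, horizon: int = 24) -> np.ndarray:
    """Day-ahead persistence baseline: 'tomorrow will be as today'.
    The forecast for the next `horizon` hours simply repeats the last
    `horizon` observed values of the consumption series."""
    return hourly_load[-horizon:].copy()

# Illustrative usage with synthetic hourly data
past_week = np.random.rand(7 * 24)       # one week of hourly consumption values
tomorrow_hat = persistence_forecast(past_week)
print(tomorrow_hat.shape)                # (24,)
```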
Figure 5 shows the steps followed to design the proposed ML models to carry out day-ahead forecasting of the consumption.
3.1. Evaluation Metrics
Widely referenced evaluation metrics in academic studies include the mean absolute error (MAE), the root mean square error (RMSE), and MAPE. The coefficient of determination (R²) is also commonly used to evaluate the performance of predictive models.
In this work, the MAPE and R² metrics were employed to evaluate model performance. MAPE represents the average deviation in absolute terms; it is therefore related to the average absolute error between the actual and predicted values.
MAPE evaluates the uniform forecast error in percentage [30]:

$$\mathrm{MAPE} = \frac{100}{N}\sum_{i=1}^{N}\left|\frac{y_i - \hat{y}_i}{y_i}\right| \qquad (2)$$

where $y_i$ and $\hat{y}_i$ are the measured and corresponding predicted values, and $N$ is the total number of data points in the dataset considered for performance evaluation.
R², in contrast, is a metric related to the mean square error (MSE) and the variance. It therefore reflects values that deviate significantly from the real value, i.e., the outliers of the predictions. Equation (3) calculates the coefficient of determination:

$$R^2 = 1 - \frac{\mathrm{MSE}(y, \hat{y})}{\sigma_y^2} \qquad (3)$$

where $\mathrm{MSE}(y, \hat{y})$ is the mean square error between the measured and predicted values and $\sigma_y^2$ is the variance of the measured dataset. Likewise, as in Equation (2), $y_i$ and $\hat{y}_i$ are the measured and corresponding predicted values.
A value of 1 for R² means a perfect fit between measured and predicted values. An R² of zero means that there is no relationship between the dependent (output) and independent (input) values; in other words, the regression line between them is completely horizontal [31] (see Equation (4)).
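For reference, both metrics can be computed as in the following sketch (hypothetical helper functions, implementing Equation (2) directly and Equation (3) as 1 − MSE/variance, consistent with the definitions above):

```python
import numpy as np

def mape(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Mean absolute percentage error, Equation (2)."""
    return 100.0 * np.mean(np.abs((y_true - y_pred) / y_true))

def r2(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Coefficient of determination, Equation (3): 1 - MSE / variance."""
    mse = np.mean((y_true - y_pred) ** 2)
    return 1.0 - mse / np.var(y_true)

# Illustrative values only
y_meas = np.array([10.0, 12.0, 11.0, 13.0])
y_hat = np.array([10.5, 11.5, 11.2, 12.4])
print(mape(y_meas, y_hat), r2(y_meas, y_hat))
```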
To assess the performance of the proposed prediction models over the analysed period (one week), it is essential to evaluate both the mean error of the models’ predictions and the occurrence of unusually extreme values or outliers. Both aspects are considered critical for evaluating the models’ effectiveness. Moreover, these two metrics provide intuitive values for comparison, distinguishing them from other commonly used metrics such as RMSE or MAE.
3.2. Automatic Data Pre-Processing
Robust data pre-processing is critical in ML, converting raw data into a structured format optimised for analysis and model training [32]. Real-world datasets frequently contain missing values, noise, and inconsistencies arising from measurement errors and variations in data collection methodologies. As noted, this study examined the ESTIA 2 building dataset, which includes the building load, the HVAC load from ten heat pumps, and meteorological data. Disparities in sampling rates, measurement units, and timestamp formats necessitate systematic data integration to ensure consistency and enhance forecasting accuracy.
The following pre-processing steps were performed automatically.
Data Quality Control: Missing values undergo treatment through deletion or imputation (linear interpolation, mean imputation), while time-based integrity checks identify and eliminate corrupted records.
Timestamp Standardisation: Timestamps in multiple formats (e.g., Unix time) are converted to ISO 8601 (UTC) to enable seamless dataset integration.
Time Series Aggregation: Given the hourly forecasting resolution, smart-meter time series are resampled using a rolling mean window of ±30 min, following prior work [29]:

$$\bar{x}_h = \frac{1}{N_h}\sum_{t \in T_h} x_t \qquad (5)$$

where $x_t$ represents the recorded measurement at time $t$, $T_h$ is the set of recorded timestamps within the rolling window for hour $h$, and $N_h$ is the total number of recorded measurements within that window.
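A possible implementation of this aggregation step is sketched below, assuming the raw smart-meter readings are held in a pandas Series indexed by timestamp (the function name and data layout are assumptions for illustration; the actual pipeline may differ):

```python
import pandas as pd

def resample_hourly(series: pd.Series) -> pd.Series:
    """Aggregate an irregularly sampled smart-meter series to hourly
    resolution using a centred +/-30 min mean around each full hour."""
    series = series.sort_index()
    hours = pd.date_range(series.index.min().ceil("60min"),
                          series.index.max().floor("60min"), freq="60min")
    values = []
    for h in hours:
        window = series[(series.index >= h - pd.Timedelta(minutes=30)) &
                        (series.index < h + pd.Timedelta(minutes=30))]
        values.append(window.mean())   # NaN if no readings fall in the window
    return pd.Series(values, index=hours)
```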
This rigorous pre-processing framework enhances data consistency, refines input quality, and improves predictive modelling accuracy.
3.3. Predictive Model Design
NARX neural networks, a subset of recurrent neural networks (RNNs), have been extensively employed in time series prediction due to their straightforward implementation and fast training process.
These networks are built upon an MLP architecture, comprising an input layer, a hidden layer, and an output layer, all interconnected through adjustable weights. The neurons in both the hidden and output layers are also associated with bias values [33]. During the training phase, the network iteratively adjusts these weights and biases to optimise their values, enhancing the relationship between the input and output data.
Equation (6) shows the input–output relationship using a NARX:

$$\hat{y}(t+1) = f\left(y(t), \ldots, y(t - d_y),\; x_1(t), \ldots, x_1(t - d_{x_1}),\; \ldots,\; x_p(t), \ldots, x_p(t - d_{x_p})\right) \qquad (6)$$

where $\hat{y}(t+1)$ is the future value of the target variable, $p$ is the total number of exogenous inputs, $d_{x_i}$ is the time lag of each exogenous input $x_i$, and $d_y$ is the time lag of the historical target values $y(t), \ldots, y(t - d_y)$.
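The lagged structure of Equation (6) can be illustrated with the following sketch, which builds the regressor matrix and fits a small MLP as a stand-in for the NARX network (the actual model in this work was implemented with Matlab’s Deep Learning Toolbox; the data here are synthetic and the lag values illustrative):

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

def build_lagged_matrix(y, X, d_y, d_x):
    """Arrange samples as in Equation (6): predict y(t+1) from
    y(t)...y(t-d_y) and each exogenous input x_i(t)...x_i(t-d_x)."""
    start = max(d_y, d_x)
    rows, targets = [], []
    for t in range(start, len(y) - 1):
        feats = list(y[t - d_y:t + 1])            # autoregressive terms
        for i in range(X.shape[1]):               # exogenous terms
            feats.extend(X[t - d_x:t + 1, i])
        rows.append(feats)
        targets.append(y[t + 1])
    return np.array(rows), np.array(targets)

rng = np.random.default_rng(0)
y = rng.random(21 * 24)                  # 21-day hourly training window
X = rng.random((21 * 24, 2))             # predicted temperature and occupancy
A, b = build_lagged_matrix(y, X, d_y=2, d_x=2)
model = MLPRegressor(hidden_layer_sizes=(10,), max_iter=2000,
                     random_state=0).fit(A, b)
```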
For the NARX model design, it is necessary to select the inputs that best describe the consumption behaviour, as well as the training window (TW), which defines the number of days used for model training. With regard to the selection of inputs and the TW, it was decided to use the results of the analysis carried out in [29]. In that work, the inputs to the model are selected by performing the prediction with all possible input combinations and seeking to minimise the MAPE. Accordingly, the predicted external temperature and occupancy were used as inputs; other possible combinations were discarded, as they did not provide better results. Moreover, a time vector was considered as an additional input to the SVR model, in order to compensate for the non-recurrent nature of this ML technique.
Regarding the TW, following the analysis carried out in [29], we chose to train the models with a TW of 21 days. In the mentioned work, predictions were carried out after training the model with different TWs (7, 14, or 28 days), and it was concluded that training the model with 21 days yielded the most accurate predictions.
Finally, the hyperparameters of the models were adjusted. The tuning of the NARX hyperparameters was carried out manually, following the criterion described below. The model performed day-ahead forecasts on a daily training basis. Since the weights of an NN model are randomly initialised each time it is trained, each candidate model was required to predict the entire selected week of April three consecutive times on a daily training basis. The average weekly MAPE over the three repetitions was then calculated. The process was repeated for each combination of hyperparameter values, i.e., for each model constructed, and the model with the lowest average MAPE was selected.
Regarding the NARX model, three different hyperparameters were adjusted to achieve the lowest MAPE: (i) the input and feedback delays, (ii) the number of neurons in the single hidden layer, and (iii) the activation function for the hidden and output layers. The simulation plan outlined in Table 2 was designed accordingly, combining all possible values for each of the hyperparameters. Each possible combination of the hyperparameter values defined in Table 2 constitutes an NARX model.
After carrying out the forecasts with all possible NARX models, the model able to forecast the selected week with the lowest mean MAPE was selected. The chosen NARX model hyperparameter values are given in Table 3.
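The selection procedure described above can be summarised with the sketch below; the grid values and the evaluation stub are illustrative placeholders (the values actually tested are those of Table 2, and the real evaluation trains the NARX daily over the selected week):

```python
import random
from itertools import product

delays = [1, 2, 3]                  # input and feedback delays (illustrative)
neurons = [5, 10, 20]               # neurons in the single hidden layer
activations = ["tansig", "logsig"]  # hidden/output activation functions

def weekly_mape(delay, n_neurons, activation):
    """Placeholder for the real evaluation: train the NARX on a daily basis
    and return the mean MAPE over the 7 day-ahead forecasts of the week.
    A random value is returned here so the sketch runs stand-alone."""
    return random.uniform(5, 25)

best, best_score = None, float("inf")
for combo in product(delays, neurons, activations):
    # Each combination is evaluated three times, since the network
    # weights are randomly initialised at every training run.
    score = sum(weekly_mape(*combo) for _ in range(3)) / 3
    if score < best_score:
        best, best_score = combo, score
print("Selected hyperparameter combination:", best)
```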
Support vector machines (SVMs) are a supervised learning approach commonly used for function estimation. While SVM is primarily applied to classification problems, it is also well suited for regression tasks. The regression variant of SVM is known as SVR, with the most widely used formulation being Vapnik’s ε-SVR [34]. For SVR with a radial basis kernel, three hyperparameters need to be determined: the penalty coefficient C, ε, and the kernel coefficient γ.
For the SVR model designed without the time vector as input, Bayesian optimisation [35] was employed to tune the hyperparameters; the resulting optimal values are given in Table 4.
For the SVR model that considers the time vector, the selected optimal hyperparameter values are given in Table 5.
Overfitting in the case of NARX was avoided thanks to the early stopping method of the Deep Learning Toolbox of Matlab [36]. To prevent the SVR models from overfitting or underfitting, cross-validation was applied while evaluating different model settings during training. Given the nature of the time series data, a specific implementation from scikit-learn, TimeSeriesSplit [37], was used. This method, a variation of k-fold cross-validation, ensures that the temporal structure of the data is preserved.
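A sketch of how such an SVR model could be set up is shown below, with the hour-of-day “time vector” added as a third input feature and TimeSeriesSplit used for cross-validation; note that a plain grid search is used here as a simpler stand-in for the Bayesian optimisation employed in this work, and all data and search ranges are illustrative:

```python
import numpy as np
from sklearn.svm import SVR
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import GridSearchCV, TimeSeriesSplit

rng = np.random.default_rng(0)
n = 21 * 24                                  # 21-day hourly training window
hour_of_day = np.tile(np.arange(24), 21)     # the "time vector" input
temperature = rng.normal(12, 4, n)           # predicted external temperature
occupancy = rng.integers(0, 2, n)            # predicted occupancy (0/1)
X = np.column_stack([temperature, occupancy, hour_of_day])
y = rng.random(n)                            # hourly consumption (placeholder)

param_grid = {                               # illustrative search space
    "svr__C": [1, 10, 100],
    "svr__epsilon": [0.01, 0.1],
    "svr__gamma": [0.01, 0.1, 1.0],
}
pipe = make_pipeline(StandardScaler(), SVR(kernel="rbf"))
search = GridSearchCV(pipe, param_grid, cv=TimeSeriesSplit(n_splits=5),
                      scoring="neg_mean_absolute_percentage_error")
search.fit(X, y)
print(search.best_params_)
```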
4. Results and Discussion
The forecasting results are presented in this section. In order to compare the NARX and SVR model performance, Table 6 and Table 7 record the MAPE and R² values obtained with the proposed ML models and the persistence model for each predicted day. A whole week was predicted.
Beginning with the comparison of the ML models against the reference model, it can be seen that the persistence model outperforms the SVR model that does not consider the time vector. However, it fails to improve on NARX on the vast majority of days, nor does it reach the accuracy of the SVR with the time vector. In general, the persistence model gives good results only when the curve does not vary from day to day. When predicting the consumption of a building, the consumption varies considerably, so the persistence model can hardly outperform the ML models.
Moreover, it is possible to observe a clear difference between the SVR results with and without the time vector. Considering the average MAPE of the week, the SVR with the time vector as input outperforms the SVR model without considering the mentioned input by 50.22%.
Figure 6 shows the consumption curves of NARX and SVR without the time vector compared with the real consumption during a week of April 2023, together with the persistence model. The persistence model clearly fails to represent the real curve, especially on Mondays and Saturdays, where there is a change in the consumption pattern. Likewise, the SVR that does not include the time vector input hardly follows the real consumption curve; its output closely resembles the occupancy curve of the building, which has two peaks during the day and a null value at night and at weekends.
The MAPE and R² outcomes clearly demonstrate that the SVR model that considers the time vector as input surpasses the NARX model on the majority of days. When examining the average MAPE achieved by the two models over the course of the week, SVR shows a 3.62% improvement compared to NARX. Although this difference in accuracy is not substantial, SVR proves to be superior in closely tracking the consumption pattern, as illustrated in Figure 7.
The NARX model evidently struggles to accurately forecast the consumption pattern during night hours. Looking at the curves at night, especially in the early mornings and the first hours of the night, it can be clearly observed that NARX suffers from sudden peaks. This may be due to the abrupt changes in the occupancy curve during early morning hours (from 0 to 1) and evenings (from 1 to 0). It can also be seen that the models perform worse at weekends.
On average, NARX and SVR with the time vector predict working days 37.64% and 28.27% better than weekends, respectively. There may be several reasons why the accuracy of the weekend consumption forecast drops. One may be a lack of sufficient weekend training data: when using a 21-day TW, only 6 weekend days are used to train the model. Likewise, the occupancy curve at weekends could be improved: during weekends, the occupancy is considered zero; however, some workers from start-ups work on Saturdays. This can result in the model not being able to accurately follow the weekend consumption pattern.
In order to study the calculated 3.62% improvement of the SVR with the time vector over the NARX model more thoroughly, a paired t-test and a Kolmogorov–Smirnov test were performed to assess whether this improvement was statistically significant.
Paired t-test: This test calculates the statistical difference between the means of the two curves using Equation (7):

$$t = \frac{\bar{d}}{s_d / \sqrt{n}} \qquad (7)$$

where $\bar{d}$ is the average of the value differences between the two compared curves, $s_d$ is the standard deviation of the differences, and $n$ is the number of samples.
The paired t-test yielded a p-value of 0.53, which exceeds the commonly used significance threshold of 0.05 (5%). This indicates that the null hypothesis cannot be rejected, suggesting that the mean differences between the two prediction curves are not statistically significant.
Kolmogorov–Smirnov test: In order to compare the full distributions of the predictions of the two models, and not only their means, the Kolmogorov–Smirnov test was performed. With a maximum cumulative difference (D) of 0.16 for a sample of n = 168 points and assuming the most commonly used significance level of 5%, the critical value according to the critical value table is 0.221. As 0.16 < 0.221, the null hypothesis cannot be rejected either, which suggests that there is no significant difference between the two distributions at the 5% significance level, i.e., the predictions of both models follow a similar statistical distribution.
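Both tests can be reproduced with standard SciPy routines, as sketched below on synthetic prediction arrays (168 hourly values per model, i.e., one week; the data are illustrative only):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
narx_pred = rng.random(168)                          # stand-in NARX forecasts
svr_pred = narx_pred + rng.normal(0, 0.05, 168)      # stand-in SVR forecasts

# Paired t-test on the per-hour differences (Equation (7))
t_stat, p_paired = stats.ttest_rel(svr_pred, narx_pred)

# Two-sample Kolmogorov-Smirnov test comparing the full distributions
d_stat, p_ks = stats.ks_2samp(svr_pred, narx_pred)

print(f"paired t-test p-value: {p_paired:.2f}")
print(f"KS statistic D: {d_stat:.2f} (p-value {p_ks:.2f})")
```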
5. Conclusions
Two ML models, NARX and SVR, were designed and implemented in the framework of the study presented in this paper. Their aim was to perform day-ahead forecasting of the average hourly power consumption of the ESTIA 2 building without considering heat pump consumption, as specified by the EMS. Indeed, the EMS aims to control the building’s HVAC system to maximise the SCR.
Due to the limited data available, the proposed ML models were trained on a small dataset. Predicted external temperature and building occupancy were used as inputs to the model.
As the forecasting models operate in a real system, an automatic pre-processing step was added in order to automatically obtain the predicted external temperature data from the MG meteorological agency and apply pre-processing techniques to all the required data. The automated data acquisition and pre-processing make it much easier to introduce the prediction of, for example, a second building into the analysis; thus, the methodology is scalable to more buildings. Likewise, the proposed methodology allows the range of data to be extended, i.e., more data can be used for training the ML models if so desired.
The proposed ML models were compared with a benchmark model, i.e., the persistence model. As is evident, this model presents problems when the consumption pattern changes from one day to the next, which is why NARX outperforms the reference model nearly every day. The SVR that does not consider the time vector fails to represent the real consumption curve and fails to improve on the persistence model. Nevertheless, when predicting with the SVR model that considers a time vector as input, the results improve substantially: it manages to predict the consumption of the analysed week with an average MAPE of 10.73% and an R² of 0.85, which is 50.22% better than the SVR without the time vector. Unlike NARX, SVR lacks a recurrent term, which in NARX allows the dynamics of the system to be represented; including the time vector as input may have helped SVR to predict more accurately.
In general terms, it can be stated that for this case study and under these conditions, the SVR with the time vector is the model that outperforms the rest, despite being only 3.62% more accurate than the NARX model. However, the paired t-test and Kolmogorov–Smirnov test suggest that, regarding mean values and distributions, there is no evidence that the predictions made by the time vector SVR and the NARX model are statistically different.
Moreover, it was observed that by removing the heat pump consumption from the total consumption curve of the building, the consumption curve became substantially less variable, i.e., smoother. This probably made the prediction much simpler. While it has been found that NARX can be very effective in predicting curves with high variability [29], it may be that in this case, where the curve is flatter, SVR had an advantage over NARX.
It is also worth mentioning that in general, all the models show worse results when predicting weekend consumption, especially NARX and SVR without the time vector. It could be possible in future work to improve the occupancy curve in order to bring it more in line with reality. Since actual occupancy data for the ESTIA 2 building are not available, it could be tested in another case study where real occupancy data would be available on a day-to-day basis.
Likewise, the methodology was tested in only one month of the year. For this reason, we believe it is necessary to test the methodology in other periods in future work, in order to generalise the models and be able to draw more general conclusions.
Moreover, seeing the relevance that hybrid models are gaining for prediction purposes, in future work, it might be interesting to see whether or not a hybrid model based on, for example, an SVR with ensemble learning could outperform the simple SVR model.
In other future work, the hyperparameter adjustment process could also be optimised by applying techniques such as genetic algorithms (GAs).
Finally, it may also be of interest to extend the study by calculating the confidence intervals of the predictions.