Next Article in Journal
Reversible Molten Catalytic Methane Cracking Applied to Commercial Solar-Thermal Receivers
Next Article in Special Issue
Assessment of the Impact of Occupants’ Behavior and Climate Change on Heating and Cooling Energy Needs of Buildings
Previous Article in Journal
Optimal Sizing of On-Board Energy Storage Systems and Stationary Charging Infrastructures for a Catenary-Free Tram
Previous Article in Special Issue
Effect on the Thermal Properties of Mortar Blocks by Using Recycled Glass and Its Application for Social Dwellings
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of Cooling Energy Consumption in Hotel Building Using Machine Learning Techniques

Faculty of Mining and Geoengineering, AGH University of Science and Technology, 30-059 Kraków, Poland
*
Author to whom correspondence should be addressed.
Energies 2020, 13(23), 6226; https://doi.org/10.3390/en13236226
Submission received: 22 October 2020 / Revised: 22 November 2020 / Accepted: 24 November 2020 / Published: 26 November 2020

Abstract

:
The diversification of energy sources in buildings and the interdependence as well as communication between HVAC installations in the building have resulted in the growing interest in energy load prediction systems that enable proper management of energy resources. In addition, energy storage and the creation of energy buffers are also important in terms of proper resource management, for which it is necessary to correctly determine energy consumption over time. It is obvious that the consumption of cooling energy depends on meteorological conditions. Knowing the parameters of the outside air and the number of users, it is, therefore, possible to determine the hourly energy consumption of a cooling system in a building with some accuracy. The article presents models of cooling energy prediction in summer for a hotel building in southern Poland. The paper presents two methods that are often used for energy prediction: neural networks and support vector machines. Meteorological data, time data, and occupancy level were used as input parameters. Based on the collected input and output data, various configurations were tested to identify the model with the best accuracy. As the analysis showed, higher prediction accuracy was obtained thanks to the use of neural networks. The best of the proposed models was characterized by the WAPE and CV coefficients of 19.93% and 27.03%, respectively.

Graphical Abstract

1. Introduction

Nowadays, people spend the majority of time indoors, which leads to increased costs associated with maintaining comfort conditions in buildings. Internal installations such as heating, cooling, and ventilation systems therefore play a key role and thus constitute the main source of costs. Hence, the combination of economic and environmental factors is an important task for manufacturers and designers, contributing to the development of new solutions to provide comfort conditions for small operating and investment outlays. In 2018, the industry sector accounted for about 32% of total global energy consumption. The transport sector accounted for 28% of the energy use. The building construction and operations accounts for the largest share of global final energy use—36%, of which 8% was connected with operation of the non-residential building, 22% with residential building operation and 6% with the construction industry [1]. Global final energy consumption in buildings [2] in 2018 increased 1% from 2017, and about 7% since 2010. According to the Energy efficiency indicators Report published by IEA, nearly 50% of building consumption is related to space heating and 4% with cooling. Increasing energy consumption in the European Union led to appearance of the general regulation in this field—Directive 201013/EU of the European Parliament and of the Council on the energy performance of buildings [3]. According to Article No. 8, Member States should set system requirements for the purpose of optimizing the energy use of technical systems in the buildings. Regulations cover at least heating, hot water, air-conditioning, and large ventilation systems. Furthermore, Member States may encourage the use of active control systems such as automation, control, and monitoring systems that aim to save energy.
In general, methods for estimating energy use have two purposes: design or optimization of the building and HVAC systems (forward modeling), and calculating retrofit savings or implementing model predictive control in existing buildings (data-driven modeling). Behavior of the system is described by a mathematical model, which includes input variables, system structure (physical description), and output variables (reaction to the input variables). Data-driven modeling may be divided into three groups: “Black-Box” (Empirical), Calibrated Simulation, and Gray-Box Approach, which differ in data requirements, time, and effort required to develop the appropriate models. In the first method, a simple or multivariate regression model that describes a relationship between measured energy use and the various input parameters is constructed. A Calibrated Simulation Approach uses a simulation computer program to evaluate existing buildings’ energy consumption and then calibrates the physical input parameters to the program. The Gray-Box Approach formulates a physical model and identifies important parameters by statistical analysis. This method is a mixture of physics-based and data-driven methods, and it could be implemented for fault detection and diagnosis (FDD) and online control [4,5].
Building energy consumption prediction is crucial to appropriate energy management, thus improving energy efficiency of systems and performance of the buildings. Generally, for building energy consumption prediction, two techniques are used: statistical methods and artificial intelligence methods. In recent years, artificial intelligence methods have become very popular. This technique is often applied to the prediction of energy consumption due to good, accurate prediction results [6]. Among the most popular data-driven prediction models using empirical approach modeling are artificial neural networks (ANNs) and support vector machines (SVM). One of the popular techniques is also decision tree (DT) and random forest (RF), which generates multiple decision trees that operate as an ensemble [7]. To improve their solutions, many authors use various methods mentioned above and choose the results of the best one [8,9,10]. The literature contains numerous interesting solutions using different methods. Table 1 illustrates applications of certain algorithms in literature.
Jovanović et al. [22] presented a prediction of heating energy using various neural networks: feed forward backpropagation neural network (FFNN), radial basis function network (RBFN), and adaptive neuro-fuzzy interference system (ANFIS). The subject of analysis was the university buildings in Norway. The authors have predicted building energy use based on the input feature: mean daily outside temperature, mean daily wind speed, total daily solar radiation, minimum daily temperature, maximum daily temperature, relative humidity, day of the week, month of the year, and heating consumption of the previous day. Data for the working days in the cold period for three years was used. For model FFNN, all input variables mentioned above are used; for model RBFN, seven most influencing parameters, and for model ANFIS, only three of them. The results showed that all three models have very good agreement with measured values. Sha et al. [16] presented a simplified energy prediction method based on the three input features: degree-day, day type, and month type. Their study adopted three machine learning algorithms: MLR, SVR, and ANN. The results showed that ANN and SVR methods have better performance than the MLR model. The authors mentioned that all of the methods do not have sufficient quality in heating energy prediction, due to the size of the training dataset. Manjarres et al. [20] proposed to implement the HVAC energy management system in a separate part of the office building in Spain. Solution includes a two-way communication system, enhanced database management system, and a set of machine learning algorithms based on random forest (RF) regression techniques. The proposed optimizer included information such as: indoor and outdoor temperatures, relative humidity, and occupancy level. The simulations took into account different modes of operation of HVAC systems. Data were collected from 63 days in summer and 46 days in winter, which was the basis during the simulation phase. The solution assumes the ON/OFF operation of the HVAC system and the operation of the mechanical ventilation system in accordance with the proposed solutions, which is to ensure minimization of energy consumption while maintaining the assumed temperature inside the rooms. The authors estimate that the implementation of the described system will contribute to the reduction of heat demand by 48% and 39% for cooling consumption. Ahmad et al. [21] presented a comparison between feed-forward back-propagation artificial neural network and random forest for HVAC electricity consumption of a hotel building in Madrid. Results showed that ANN performed marginally better than RF. Generally, both models have comparable quality of prediction and could be implemented in the building system.
In order to increase the accuracy of forecasting, as well as to expand the possibility of implementing solutions, many authors decide to modify classic prediction models or combine several approaches. The biggest problem of these patterns is the nonlinearity of relationships. Zhong et al. [18] proposed using a novel vector field-based support vector regression method. The purpose of the method is to define the optimal feature space by modifying the input data space. The resulting algorithm is then used to build a predictive model. The implementation of such a solution in an office building in China gave very good results compared to commonly used methods. The algorithm proposed in this article is used to determine the refrigeration load forecasting model based on the integrated data set. The input parameters are both external and internal. Casteleiro-Roca et al. [15] described an intelligent hybrid model to predict the short-term energy demand in a hotel, including three techniques: clustering, MLP, and LS-SVR. It was used for predicting the power load of the building for each hour in a 24-h horizon. The authors identified three input variables: the energy demand in the previous 24 h, the mean temperature of the previous day, and the occupancy rate of the hotel. The obtained results were compared with conventional forecast techniques based on ARIMAX modeling and a method based on tree models. The hybrid appeared to have better accuracy (lower mean absolute error) than the above-mentioned models.
Artificial Intelligence Algorithms are also widely used for electricity consumption forecasting, including district public consumption [23]. Güngör et al. [24] presented electricity consumption prediction for a variety of households using different prediction algorithms (Holt-Winters, ARIMA, LSTM i TESLA). Results showed that TESLA performance is better than other prediction methods. The authors have used five different classifiers (Logistic Regression, Stochastic Gradient Descent, K-Nearest Neighbors, Random Forest, Support Vector Machine). Another example of such algorithms is the forecast of meteorological conditions for the needs of the HVAC system proposed by Işik and Inalli. A back-propagation neural network perceptron model with seven inputs was used to predict temperature, solar radiation, and relative humidity. A comparison with the ANFIS model showed that the ANN model has better accuracy in terms of forecasting meteorological data for HVAC. [25]
In this study, cooling energy consumption predictive models based on an artificial neural network and support vector machine algorithms are presented. The paper includes a statistical analysis of historical data of hotel building. The cooling load and proposed variables during the summer season are considered. The main contribution of this paper is the development of high-accuracy predictive models and comparison neural network-based model with the support vectors approach. The structure of this study was organized as follows. Firstly, an introduction to energy prediction is presented. A short review of the prediction method was included. Section 2 provided the case of the study and used methods description, including the details of data collection, input variables, and proposed models. In the Results section, the main results obtained from models are presented. In these sections, real energy consumption and predicted load are compared and analyzed. Finally, conclusions and remarks are given in Section 5.

2. Materials and Methods

2.1. Case of Study

The Turówka hotel building located in Wieliczka near Kraków (the south-central part of Poland) is used as a case of study for this paper. The five-story building is a reconstruction of a historic salt store. The hotel has a floor area of 5525.00 m2 and a volume of 19,300 m3. Facilities include 50 double rooms, a hotel bar, a restaurant, a drink bar, a conference room, and a pool. A detailed description of the building with an analysis of the cooling and heating load was described in the previous article [26]. Based on the analysis of the summer period in the hotel building, it was found that the cooling system is responsible for 50–60% of the total energy load. Additionally, the analysis showed a clear relationship between the outside temperature and the cooling load; therefore, the presented predictive model applies to the cooling energy prediction during the summer. The data used in the research mainly include three parts: meteorological data, load data, and data related to operational conditions. The input variables used for the predictive models for the calculation of the cooling demands include weather conditions, occupancy level in the hotel, hour, and day of the week. The cooling energy load data is provided by meters systems installed in the building cooling system i.e., feed and return of the high and, depending on demand, the low parameter of the refrigerant. Data transmitted via a serial communications protocol—MODBUS RTU—is stored in a recording system. The measurement system consists of MULTICAL heat meters by Kamstrup and flow sensors submitted to a type of approval according to EN 1434 [27], which includes the 2400-h measurement stability test of the flow sensors. The meteorological data used in this paper are obtained from the National Research Institute—Polish Institute of Meteorology and Water Management; only the outside temperature was measured directly at the hotel area. The data used in predictive models are an hourly time series collected in a summer season in Poland, where the cooling load was observed. The hourly meteorological data were calculated as an average value based on the measurement with sampling time set to 10 min. Data directly from the analyzed object, i.e., outdoor temperature and cooling energy consumption, were collected with a sampling time of 1 min. The data used for the models were calculated as an arithmetic average of measurement from an hourly period of time. For this study, the measurement season was selected from 15 May to 15 September 2019. The framework of the case study is shown in Figure 1.

2.2. Methodology

2.2.1. Methodology of Artificial Neural Networks

There are many types of artificial neural networks (ANN) including simple feed forward networks, recurrent neural networks, and spiking neural networks, RBFNs. One of the most commonly used models is the multi-layer back-propagation neural network (BPNN). The BPNN architecture includes three types of the layers: an input layer (variable), an output layer (predicted value), and a hidden layer. A basic processing unit in this model is a neuron. A schematic diagram of an artificial neural network structure which consists of all three layers is shown in Figure 2.
For Multi-layer Perceptron (MLP), neurons in the input layer distribute the input signals xi to neurons in the hidden layer. Each neuron j in the hidden layer sums up its input signals xi after weighting them with the strengths of the respective connections wji from the input layer and computes its output yj as a function f of the sum [28]:
y j = f w j i x i
In this study, the Statistica Artificial Neural Network Package was used. A partition of the data into training (70%), validation (15%), and test (15%) is carried out.

2.2.2. Methodology of Support Vector Machines

Support Vector Machine is a supervised learning method increasingly used in solving nonlinear problems. This method is commonly used for classification, regression, and clustering. One of the main features of this model is the lack of local minima. It consists of two main parts: universal linear learning algorithm and a specific kernel that calculates the inner product of input points in feature space [29].
The universal linear function is as Equation (2):
f(x) = wT φ(x) + b
where f(x) means the forecasting values and the coefficients w and b are adjustable.
The main aim of SVM is to find the optimal hyperplane between classes, with the maximal margin. The margin is defined as the distance between the closest point in each class and hyperplane. For this purpose, the ε-insensitive loss function is used. Minimizing the overall errors is expressed by Equation (3) [30]:
min w , b , ξ * , ξ R w , ξ * , ξ = 1 2 w T w + C i = 1 N ξ i * + ξ i
with the constraints:
y i w T φ x i b , ε + ξ i *
y i + w T φ x i + b , ε + ξ i
ξ i * , ξ i 0
The method of operation of the support vector machine is shown in Figure 3.
There are four types of kernel function linear, polynomial, radial basis function (RBF), and sigmoidal function [29]. The most used kernel functions are the Gaussian RBF with a function of kernel defined by Equation (7):
K(xi,xj) = exp(−γ|xi − xj|2)
where xi and xj are vectors in the input space and γ is kernel parameter. An equivalent definition involves a σ parameter, where γ = 1/2σ2.
The forecasting accuracy of the SVM model is affected by hyperparameters. In the ε-SVR, three proper parameters need to be determined: C, ε, and the kernel parameter (γ). Parameter ε represents the width of the ε-insensitive loss function and can affect the number of support vectors in the model. The higher ε value results in fewer support vectors and more flat estimates. Parameter C is related to model complexity and the degree of the deviations larger than ε which are tolerated. In models with high C parameter values, the main objective is to minimize the empirical risk only, without regard to model complexity part in the optimization formulation. Parameters σ and γ describe the Gaussian function width [31]. In this paper, the radial basis function RBF function is used as a kernel function to estimate the cooling load of the hotel building. A grid-search technique was applied to find the optimal parameter values. The kernel parameter γ was selected in priori based on the literature. According to the Limsvm-2.6 [32], the kernel parameter is defined as γ = 1/n, where n means the number of input variables. For d-dimensional problems, Cherkassy and Ma [31] noticed that the width parameter σ depends on the number of input variables d in accordance with the formula σd~(0.2–0.5). On this basis, four values of γ parameters were found: 0.1, 0.3, 0.5, 0.7. The ten-fold cross-validation was applied to reduce the error of the model. The dataset was randomly divided into two parts: learning samples (75%) and testing samples (25%). The feasible ranges of the parameters are set as follows: C ∈ [0.5, 150] and ε ∈ [0.01, 0.5].

2.3. Model Evaluation Index

The performance of the proposed models is evaluated by the mean absolute error (MAE), root mean square error (RMSE), weighted absolute percentage error (WAPE), coefficient of variance (CV), and coefficient of determination (r) [11,12,18,33]. The indicators are calculated as follows:
M A E   = 1 n i = 1 n E A E P
R M S E   = 1 n i = 1 n E A E P 2
W A P E   = i = 1 n E A E P i = 1 n E A
C V = 1 n i = 1 n E A E P 2 E A ¯
r = c o v E A , E P σ E A σ E P
where n denates the entire number of observations, E A is the actual value, E A ¯ denotes the mean of actual values, and E P represents the predicted value.

3. Results

3.1. Preliminary Statistical Analysis

The first step in the analysis was a preliminary statistical analysis of the dataset. From the available data, nine parameters were selected that could potentially have an impact on cooling energy consumption. Apart from the most obvious values, such as temperature, air humidity, wind speed, and relative humidity, it was decided to use time variables such as the hour and day of the week as well as the occupancy level. Table 2 presents the information summary of the data used in this work.
In the beginning, the relationship between cooling energy consumption and the values of the variables was determined. Graphs for each of the variables are presented in Figure 4.
The Pearson correlation coefficient was the key parameter determining the acceptance of a given variable for analysis. Coefficients for each between the studied variables and the predicted output were provided. The results are summarized in Figure 5.
Variables with low correlation coefficients can be significant in more complex predictive models. Based on the analysis of the correlation coefficient, the rejection of two parameters was initially planned: day and occupancy level. Due to the low correlation coefficients of the variables which, according to the authors, may have a significant impact on the predicted values, a more detailed analysis of the cooling load variability overtime was performed. Figure 6 shows plots of variation over time for the raw data collected with a sampling time of 1 min, and also after taking the hourly mean which was then accepted for prediction models.
As shown in Figure 6, there is a clear relationship between the demand for cooling energy and the time it was recorded. This relationship is not linear, hence the low correlation coefficient in the previous analysis step. The adoption of averaged values for further analysis may of course affect the accuracy of prediction, due to the unstable variability of the demand value; however, due to practical aspects, it was decided that the hourly variables were more efficient and could give sufficient effects in the prediction models. Despite the low correlation coefficients for time parameters and the occupancy level, it was decided that these parameters may be of great importance for a hotel facility due to the variability of the energy load depending on the use by guests.

3.2. Prediction Models

3.2.1. Artificial Neural Networks

As mentioned in Section 2.2.1, the Statistica Artificial Neural Network Package was used to prepare predictive models. The data was divided into three groups. In addition, 70% of the data was used for the design of the network, 15% was used for validation, and 15% for testing. The hidden layer activation-functions, output layer activation-function, and the number of hidden neurons are selected using the methodology based on statistical tests and least-squares estimation. Models were created by combining different types of activation functions and a different number of hidden neurons. The selection of the function was made from identity, logistic, hyperbolic tangent, and exponential function. To find the optimum number of hidden neurons, various numbers of neurons were examined. The various combination of activation functions and numbers of hidden neurons as described above were tested. Sum-of-squares was selected as the error function during the network training process. Training Algorithm BFGS (Broyden–Fletcher–Goldfarb–Shanno) was chosen for this work. Five models with the best accuracy were selected and described in Table 3.
Figure 7 shows the schemes of regressions for all data sets according to each of the MLP models. The plots explain the correlation between the real values and the MLP model output. The solid line in each plot represents the best linear fit between the output and target values.
In Table 4, sensitivity coefficients are presented. The analysis describes the change in the system’s outputs due to variations in the parameters that affect the system. Performing the sensitivity analysis consists of controlling how the network error behaves in the case of fluctuations of the independent variables. For each input variable, its values are converted to the mean (from the training set) so that it does not contribute any information to the model. After supplying such modified data to the network input, the final prediction error is checked. A larger error value means that the model depends on the proposed variable. The higher the value of the sensitivity analysis coefficients, the greater the importance of a given variable for a good fit of the model.
According to the statistical analysis and the analysis of the sensitivity of neural networks, a clear impact on energy consumption is noticeable in the case of parameters such as relative humidity, maximum wind speed, occupancy level, hour, and day of the week

3.2.2. Support Vectors Machines

As mentioned above, ranges of gamma, C, and epsilon parameters were defined and tested in the search for optimal values. Below, in Table 5, models with given characteristic parameters are defined which were characterized by the best fit during the tests.
In Figure 8, the relationship between the real energy consumptions and the predicted values was plotted. The line of a theoretical perfect 1:1 match line was marked in red.
Based on the coefficients defined in Section 2.3, selected models of neural networks were compared. As previously mentioned, five matching indexes were selected for this purpose: MAE, RMSE, WAPE, CV, and correlation coefficient (r). The results for cooling consumption are presented in Table 6.
Similarly, for selected SVM models, the coefficient values are summarized in Table 7.
Figure 9 shows comparison of the real values and predicted results of the MLP and SVM models with the high accuracy; it is MLP 2 and an SVM 4 model.
The figure shows the real values of hourly consumption of cooling energy in the analyzed period with a dashed gray line. The blue color marks the results obtained with the MLP 2 model, which received the best results among the proposed ANN networks, and the red color indicates SVM 4, analogically the best among the presented SVM models. In both of the proposed models, the worst effects present at predicting extreme values, where the cooling load is very high or very low. Comparing the models with each other, it is noticeable that neural networks perform better in this matter, and the difference between the real load and the predicted value is significantly smaller.

4. Discussion

The analysis of the cooling energy load in the hotel building showed that, based on several parameters characterizing the external conditions, building using conditions and time, it is possible to estimate the overall cooling load of the building. The analysis used nine parameters, including hour, day of the week, average temperature, occupancy level, wind direction, average wind speed, maximum wind speed, total hourly precipitation, and relative humidity. The external temperature has an obvious influence on energy consumption, which determines the demand for cooling energy. As shown by the preliminary analysis, relative humidity, wind speed, and direction can also be considered significant factors. Despite the low correlation coefficients of the time parameters (hour and day of the week) and the occupancy level, it was decided that, in the general model, they may be significant; therefore, they were not omitted in further analysis. Based on the assumed parameters, five models of neural networks based on the MLP algorithm and four SVM models with different configurations of parameters defining the models were proposed. Five model evaluation indicators were used to check the quality of the models shown: the mean absolute error (MAE), root mean square error (RMSE), weighted absolute percentage error (WAPE), coefficient of variance (CV), and coefficient of determination (r). Among all the proposals, a better fit was found for the ANN models, the correlation coefficient of which did not fall below 0.9. As shown by the sensitivity analysis of inputs (Table 4), time and occupancy level played an important role in the model, despite the seemingly low importance of these parameters. Total hourly precipitation had the least impact on the overall result. The selected neural networks are characterized by the WAPE coefficient ranging from 19.93 to 22.77% and the MAE coefficient of 10.27 to 11.80 kWh/h. The best of the proposed models, i.e., MLP 2, achieved the value of the correlation coefficient r at the level of 0.93. For comparison, the mean absolute error for the proposed SVM models varies from 13.13 to 14.46 kWh/h, and the weighted absolute percentage error from 25.45 to 28.04%. The most precise SVM model—SVM 4—is characterized by a correlation coefficient of 0.88. The proposed prediction methods are widely used for energy prediction due to the possibility of their application in nonlinear dependencies of many variables. As the results show, the use of neural networks in energy prediction enables the achievement of better model fit coefficients. Both methods used were based on the nine variables mentioned above. The greatest differences between the proposed models are visible in the extreme values. The SVM model performs much worse at very low and very high cooling loads. In the event of high values of cooling energy consumption, both the ANN and SVM models lower the predicted values compared to the actual values. This is especially noticeable with the SVM model, which directly affects the lower fit factors.

5. Conclusions

Reducing energy consumption is an issue that is becoming more and more popular nowadays. This is related to both the obvious economic aspects as well as the growing awareness of the society regarding the limitation of human impact on the environment, including the exploitation of non-renewable resources and emissions of pollutants into the environment. There are new solutions on the market to reduce the energy demand of buildings. Systems of proper management of energy demand, especially in combined energy economies, are also becoming more and more popular. The key issue in such solutions is the ability to accurately determine the demand for energy at a given moment. As a consequence, machine learning techniques are also becoming more and more popular, used to predict the energy load of a building.
The article presents two popular methods of energy prediction: neural networks and support vector machines. For each of the methods, several models differing in characteristic parameters and selected optimization methods are presented. The subject of the analysis is a building of historical importance, the modernization of which is limited due to the minimization of interference with the cubature and the overall appearance of the facility. In addition to replacing the heat source and internal installations, one of the effective methods of reducing operating costs may be the proper management of energy installations, which requires a building energy consumption forecasting system. The models use parameters that may affect consumption, both characterizing external conditions, time of use, and the number of guests. Based on the collected data, five models of neural networks and four SVM models were presented. Each of them was compared according to the proposed forecasting accuracy coefficients. The analysis showed that, despite the initially insignificant influence of some parameters on the obtained results, they played an important role in the predictive model. The best fit coefficients were obtained for models using artificial neural networks to predict the heating load. The proposed model made it possible to estimate the demand for cooling energy with a matching coefficient of 0.93. The quality of the model can be improved by extending the analysis time to several summer seasons. Hotel buildings are a specific object due to the users who have different preferences as to the conditions inside the rooms. Individual control allows them to maintain thermal comfort by changing the temperature, and thus their behavior directly affects energy consumption for cooling purposes.

Author Contributions

Conceptualization, M.B. and K.Z.; methodology, M.B.; software, K.Z.; validation, M.B.; formal analysis, M.B.; investigation, K.Z.; resources, K.Z.; data curation, K.Z.; writing—Original draft preparation, K.Z.; writing—Review and editing, M.B.; visualization, K.Z.; supervision, M.B.; project administration, M.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the European Regional Development Fund, Intelligent Development Program, Grant No. POIR.01.01.01-00-0720/16.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

AHUAir Handling Unit
ANFISAdaptive Neuro-Fuzzy Interference System
ANNArtificial Neural Network
ARIMAAutoregressive, Integrated and Moving Average
ARIMAXAuto-Regressive Integrated Moving Average with eXplanatory variable
BFGSBroyden–Fletcher–Goldfarb–Shanno
BPNNBack Propagation Neural Network
BSIBlind System Identification
CVCoefficient of Variance
DTDecision Tree
ELMExtreme Learning Machine
FDDFault Detection and Diagnosis
FFNNFeed Forward Backpropagation Neural Network
GBDTGradient Boosting Decision Tree
GRNNGeneral Regression Neural Network
HVACHeating, Ventilation, Air Conditioning
kNNk Nearest Neighbors
LS-SVRLeast Squares SVR
LSTMLong Short-Term Memory Neural Networks
MAEMean Absolute Error
MLPMultilayer Perceptron
MLRMultiple Linear Regression
MOGAMulti-Objective Genetic Algorithm
NARX RNNNonlinear Autoregressive Exogenous Recurrent Neural Networks
PPDPredicted Percentage Dissatisfied
RBFRadial Basis Function
RBFNRadial Basis Function Network
RFRandom Forest
RMSERoot Mean Squared Error
SVMSupport Vector Machine
SVRSupport Vector Regression
WAPEWeighted Absolute Percentage Error
XGBoostExtreme Gradient Boosting

References

  1. International Energy Agency. Global Status Report for Buildings and Construction. Towards a Zero-Emissions, Efficient and Resilient Buildings and Construction Sector; United Nations Environment Programme: Paris, France, 2019. [Google Scholar]
  2. International Energy Agency. Energy Efficiency Indicators; Highlights, IEA: Paris, France, 2019. [Google Scholar]
  3. Directive 2010/31/EU of the European. Parliament and of the Council of 19 May 2010 on the Energy Performance of Buildings; IWA Publishing: London, UK, 2010. [Google Scholar]
  4. ASHRAE. Handbook of Fundamentals; ASHRAE: Atlanta, GA, USA, 2013. [Google Scholar]
  5. Lee, H.; Kim, S. Black-Box Classifier Interpretation Using Decision Tree and Fuzzy Logic-Based Classifier Implementation. Int. J. Fuzzy Log. Intell. Syst. 2016, 16, 27–35. [Google Scholar] [CrossRef] [Green Version]
  6. Wei, Y.; Zhang, X.; Shi, Y.; Xia, L.; Pan, S.; Wu, J.; Han, M.; Zhao, X. A review of data-driven approaches for prediction and classification of building energy consumption. Renew. Sustain. Energy Rev. 2018, 82, 1027–1047. [Google Scholar] [CrossRef]
  7. Yu, Z.; Haghighat, F.; Fung, B.C.M.; Yoshino, H. A decision tree method for building energy demand modeling. Energy Build. 2010, 42, 1637–1646. [Google Scholar] [CrossRef] [Green Version]
  8. Chou, J.S.; Bui, D.K. Modeling heating and cooling loads by artificial intelligence for energy-efficient building design. Energy Build. 2014, 82, 437–446. [Google Scholar] [CrossRef]
  9. Wang, R.; Lu, S.; Li, Q. Multi-criteria comprehensive study on predictive algorithm of hourly heating energy consumption for residential buildings. Sustain. Cities Soc. 2019, 39, 1016–1023. [Google Scholar] [CrossRef]
  10. Ahmad, T.; Chen, H. Nonlinear autoregressive and random forest approaches to forecasting electricity load for utility energy management systems. Sustain. Cities Soc. 2019, 45, 460–473. [Google Scholar] [CrossRef]
  11. Sendra-Arranz, R.; Gutiérrez, A. A long short-term memory artificial neural network to predict daily HVAC consumption in buildings. Energy Build. 2020, 216, 109952. [Google Scholar] [CrossRef]
  12. Wang, R.; Lu, S.; Feng, W. A novel improved model for building energy consumption prediction based on model integration. Appl. Energy 2020, 262, 114561. [Google Scholar] [CrossRef]
  13. Casteleiro-Roca, J.L.; Gómez-González, J.F.; Calvo-Rolle, J.L.; Jove, E.; Quintián, H.; Gonzalez Diaz, B.; Mendez Perez, J.A. Short-Term Energy Demand Forecast in Hotels Using Hybrid Intelligent Modeling. Sensors 2019, 19, 2485. [Google Scholar] [CrossRef] [Green Version]
  14. Jaber, A.A.; Saleh, A.M.A.; Ali, H.A.M. Prediction of Hourly Cooling Energy Consumption of Educational Buildings Using Artificial Neural Network. Int. J. Adv. Sci. Eng. Inf. Technol. 2019, 9, 159–166. [Google Scholar] [CrossRef]
  15. Nasruddin, N.; Sholahudin, S.; Satrio, P.; Mahlia, T.M.I.; Giannetti, N.; Saito, K. Optimization of HVAC system energy consumption in a building using artificial neural network and multi-objective genetic algorithm. Sustain. Energy Technol. Assess. 2018, 35, 48–57. [Google Scholar] [CrossRef]
  16. Shaa, H.; Xua, P.; Hub, C.; Lib, Z.; Chena, Y.; Chena, Z. A simplified HVAC energy prediction method based on degree-day. Sustain. Cities Soc. 2019, 51, 101698. [Google Scholar] [CrossRef]
  17. Wei, Y.; Xia, L.; Pan, S.; Wu, J.; Zhang, X.; Han, M.; Zhang, W.; Xie, J.; Li, Q. Prediction of occupancy level and energy consumption in office building using blind system identification and neural networks. Appl. Energy 2019, 240, 276–294. [Google Scholar] [CrossRef]
  18. Zhong, H.; Wang, J.; Jia, H.; Mu, Y.; Lv, S. Vector field-based support vector regression for building energy consumption prediction. Appl. Energy 2019, 242, 403–414. [Google Scholar]
  19. Koschwitz, D.; Frisch, J.; van Treeck, C. Data-driven heating and cooling load predictions for non-residential buildings based on support vector machine regression and NARX Recurrent Neural Network: A comparative study on district scale. Energy 2018, 165, 134–142. [Google Scholar] [CrossRef]
  20. Manjarres, D.; Mera, A.; Perea, E.; Lejarazu, A.; Gil-Lopez, S. An energy-efficient predictive control for HVAC systems applied totertiary buildings based on regression techniques. Energy Build. 2017, 152, 409–417. [Google Scholar] [CrossRef]
  21. Ahmad, M.W.; Mourshed, M.; Rezgui, Y. Trees vs Neurons: Comparison between random forest and ANN forhigh-resolution prediction of building energy consumption. Energy Build. 2017, 147, 77–89. [Google Scholar] [CrossRef]
  22. Jovanović, R.Ž.; Sretenović, A.A.; Živković, B.D. Ensemble of various neural networks for prediction of heating energyconsumption. Energy Build. 2015, 94, 189–199. [Google Scholar] [CrossRef]
  23. Kuo, P.H.; Huang, C.J. A High Precision Artificial Neural Networks Model for Short-Term Energy Load Forecasting. Energies 2018, 11, 213. [Google Scholar] [CrossRef] [Green Version]
  24. Güngör, O.; Akşanlı, B.; Aydoğan, R. Algorithm selection and combining multiple learners for residential energy prediction. Future Gener. Comput. Syst. 2019, 99, 391–400. [Google Scholar]
  25. Işik, E.; Inalli, M. Artificial neural networks and adaptive neuro-fuzzy inference systems approaches to forecast the meteorological data for HVAC: The case of cities for Turkey. Energy 2018, 154, 7–16. [Google Scholar] [CrossRef]
  26. Borowski, M.; Mazur, P.; Kleszcz, S.; Zwolińska, K. Energy Monitoring in a Heating and Cooling System in a Building Based on the Example of the Turówka Hotel. Energies 2020, 13, 1968. [Google Scholar] [CrossRef] [Green Version]
  27. European Standard EN-1434:2015 Thermal Energy Meters; BSI: London, UK, 2015.
  28. Pham, D.T.; Liu, X. Neural Networks for Identification, Prediction and Control; Springer: London, UK, 1995. [Google Scholar]
  29. Ke-Lin, D.; Swamy, M.N.S. Neural Networks and Statistical Learning; Springer: London, UK, 2014. [Google Scholar]
  30. Hong, W.-C. Hybrid Intelligent Technologies in Energy Demand in Forecasting; Springer International Publishing: Berlin/Heidelberg, Germany, 2020. [Google Scholar]
  31. Cherkassky, V.; Ma, Y. Practical Selection of SVM Parameters and Noise Estimation for SVM Regression. Neural Netw. 2004, 17, 113–126. [Google Scholar] [CrossRef] [Green Version]
  32. Chang, C.C.; Lin, C.J. LIBSVM. A Library for Support Vector Machines. 2001 Software. Available online: http://www.csie.ntu.edu.tw/~cjlin/libsvm (accessed on 30 September 2020).
  33. Schroeter, B.J.E. Artificial Neural Networks in Precipitation Nowcasting: An Australian Case Study. In Artificial Neural Network Modelling; Shanmuganathan, S., Samarasinghe, S., Eds.; Springer International Publishing: Cham, Switzerland, 2016; pp. 325–339. [Google Scholar]
Figure 1. The framework of the case study.
Figure 1. The framework of the case study.
Energies 13 06226 g001
Figure 2. An illustration of a typical ANN topology.
Figure 2. An illustration of a typical ANN topology.
Energies 13 06226 g002
Figure 3. An illustration of a typical ANN topology.
Figure 3. An illustration of a typical ANN topology.
Energies 13 06226 g003
Figure 4. Relationship between cooling energy consumption and selected variables.
Figure 4. Relationship between cooling energy consumption and selected variables.
Energies 13 06226 g004
Figure 5. Matrix of the Pearson correlation coefficients.
Figure 5. Matrix of the Pearson correlation coefficients.
Energies 13 06226 g005
Figure 6. Hourly variation of cooling energy consumption: (a) for the sample days (30 May, 30 June, 30 July, 30 August) based on data sampled every minute; (b) for example days (30 May, 30 June, 30 July, 30 August) on the basis of hourly average data; (c) mean values grouped by hour and categorized by month based on data sampled every minute.
Figure 6. Hourly variation of cooling energy consumption: (a) for the sample days (30 May, 30 June, 30 July, 30 August) based on data sampled every minute; (b) for example days (30 May, 30 June, 30 July, 30 August) on the basis of hourly average data; (c) mean values grouped by hour and categorized by month based on data sampled every minute.
Energies 13 06226 g006
Figure 7. Comparison between real cooling energy consumption and prediction of SVM model: (a) for the model MLP-1; (b) for the model MLP-2; (c) for the model MLP-3; (d) for the model MLP-4; (e) for the model MLP-5.
Figure 7. Comparison between real cooling energy consumption and prediction of SVM model: (a) for the model MLP-1; (b) for the model MLP-2; (c) for the model MLP-3; (d) for the model MLP-4; (e) for the model MLP-5.
Energies 13 06226 g007aEnergies 13 06226 g007b
Figure 8. Comparison between real cooling energy consumption and prediction of SVM model: (a) for SVM 1 model; (b) for SVM 2 model; (c) for SVM 3 model; (d) for SVM 4 model; performance of models.
Figure 8. Comparison between real cooling energy consumption and prediction of SVM model: (a) for SVM 1 model; (b) for SVM 2 model; (c) for SVM 3 model; (d) for SVM 4 model; performance of models.
Energies 13 06226 g008
Figure 9. Prediction performance during cooling season of the model with highest accuracy (a) neural network MLP 2; (b) Support Vector Machine SVM 4.
Figure 9. Prediction performance during cooling season of the model with highest accuracy (a) neural network MLP 2; (b) Support Vector Machine SVM 4.
Energies 13 06226 g009
Table 1. A review of predictive models for building energy consumption.
Table 1. A review of predictive models for building energy consumption.
AuthorType of BuildingInputsOutputsMethods
Sendra-Arranz et al., 2020 [11]Solar house MagicBoxoutdoor temperature, relative humidity, irradiance, indoor CO2 level, indoor temperature, reference temperature (set by the user)Power consumption by an HVAC systemLong Short-Term Memory Neural Networks (LSTM)
Wang et al., 2020 [12]Educational Building (2 buildings)meteorological data (including outdoor dry bulb temperature, wet bulb temperature, relative humidity, wind direction, wind speed, air pressure, horizontal total radiation), time variable (hour of the day, day type), historical data (energy consumption at the same time as the previous day).Building energy consumptionRF, GBDT, XGBoost, SVR, and kNN models (Random Forest, Gradient Boosted Decision Tree, Extreme Gradient Boosting, Support Vector Machine, and K-Nearest Neighbor)
Casteleiro-Roca J.L. et al., 2019 [13]Hotel Buildingthe energy demand in the previous 24 h, the mean temperature of the previous day and the occupancy rate of the hotel.Power demandHybrid model: clustering, LS-SVR, ANN (MLP, Tan-Sigmoid function), ARIMAX
Jaber et al., 2019 [14]University Buildingtime, outdoor dry-bulb temperature, orientation of the building, overall heat transfer coefficient, space volume and window to wall ratioHourly cooling energyfeed forward artificial neural network (ANN)
Nasruddina, 2019 [15]University Buildingcooling set point, RH set point, starting delay, stopping delay, supply air flow rate, window area, wall thickness, supply air temperature, supply radiant temperature, supply radiant flow rate.Annual energy consumption, PPDCombination of artificial neural network (ANN-MLP) and MOGA
Sha et al., 2019 [16]Large Retail Buildingdegree-day, day type, and month typeCooling/heating Energy HVAC electricity consumptionSVM, ANN, and MLR
Wei et al., 2019 [17]Office Buildingelectricity consumption of appliances, number of occupants, electricity consumption of lighting, solar radiation, electricity consumption of the fresh-air system, outdoor temperatureElectricity consumption by an air-conditioning systemFFNN, ELM with BSI
Zhong et al., 2019 [18]Office Buildingbarometric pressure, dry-bulb temperature, relative humidity, average wind speed, mean wind direction, mean wind direction indoor temperature and indoor relative humidityCooling loadNovel Vector Field-based SVR model
Koschwitz et. al., 2018 [19]200 non-residential buildings (district scale)dew point temperature, mean wind direction, mean wind velocity, outdoor temperature, precipitation intensity, precipitation quantity, relative humidity, school holiday time, working time scheduleHourly heating and cooling loadNARX RNN SVM-R (RBF Kernel and polynomial Kernel)
Manjarres et al., 2017 [20]Office Buildingdoor, outdoor temperatures and relative humidities and occupancy levels, and operation of HVACThermal consumption for heating, electrical consumptionRF regression
Muhammad Waseem Ahmad 2017 [21]Hotel Buildingoutdoor air temperature, dew point temperature, relative humidity, windspeed, hour of the day, day of the week, month of the year, number of guests for the day, number of rooms bookedHVAC electricity consumptionfeed forward backpropagation neural network (FFNN), RF
Jovanović et al., 2015 [22]University Campusesmean daily outside temperature, mean daily wind speed, total daily solar radiation, minimum daily temperature, maximum daily temperature, relative humidity, day of the week, month of the year, heating consumption of the previous day.Heating consumptionFFNN, radial basis function network (RBFN) and adaptive neuro-fuzzy interference system (ANFIS)
Table 2. Descriptive statistics for input and output variables.
Table 2. Descriptive statistics for input and output variables.
VariableUnitMinimumMaximumMeanMedian
Hour-From 0:00 to 23:00
Day of the week-From Monday to Sunday
Average temperature°C6.2436.1620.2219.75
Occupancy level%18.90100.0077.3583.00
Wind direction°From 12.50° to 341.00°
Average wind speedm/s0.005.070.630.40
Maximum wind speedm/s0.0013.122.222.07
Total hourly precipitationmm0.0015.000.120.00
Relative humidity%16.6598.0670.9374.60
Cooling energy consumptionkWh/h0.00141.3051.3542.34
Table 3. Network configurations tested.
Table 3. Network configurations tested.
Model NameHidden Layer Activation-FunctionNumber of Hidden UnitsOutput Layer Activation-FunctionCorrelation Coefficients
MLP 1Logistic90Logistic0.912
MLP 2Hyperbolic tangent68Logistic0.925
MLP 3Logistic83Logistic0.909
MLP 4Hyperbolic tangent71Logistic0.903
MLP 5Hyperbolic tangent54Logistic0.902
Table 4. Sensitivity analysis of inputs.
Table 4. Sensitivity analysis of inputs.
VariableMLP 1MLP 2MLP 3MLP 4MLP 5
Hour2.042.782.141.961.73
Day of the week2.012.531.891.581.71
Average temperature9.978.747.045.936.84
Occupancy level1.642.821.481.301.42
Wind direction1.391.421.261.281.20
Average wind speed1.781.931.721.441.31
Maximum wind speed1.931.971.831.551.51
Total hourly precipitation1.041.041.041.041.03
Relative humidity2.302.842.332.011.97
Table 5. Network configurations tested.
Table 5. Network configurations tested.
Model NameGammaCEpsilonNo. of VectorsCorrelation Coefficients
SVM 10.1100.208840.853
SVM 20.3100.208090.867
SVM 30.5100.178770.873
SVM 40.7100.139790.879
Table 6. Performance evaluation of different ANN models for cooling energy consumption.
Table 6. Performance evaluation of different ANN models for cooling energy consumption.
IndicatorsMLP 1MLP 2MLP 3MLP 4MLP 5
MAE11.0910.2711.4611.6911.80
RMSE15.0513.8815.3515.7115.78
WAPE21.54%19.93%22.34%22.77%22.77%
CV29.29%27.03%29.88%30.59%30.72%
r0.910.930.910.900.90
Table 7. Performance evaluation of different SVM models for cooling energy consumption.
Table 7. Performance evaluation of different SVM models for cooling energy consumption.
IndicatorsSVM 1SVM 2SVM 3SVM 4
MAE14.4613.7913.4813.13
RMSE18.8418.1017.8017.44
WAPE28.04%26.98%26.20%25.45%
CV36.69%35.24%34.65%33.95%
r0.850.870.870.88
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Borowski, M.; Zwolińska, K. Prediction of Cooling Energy Consumption in Hotel Building Using Machine Learning Techniques. Energies 2020, 13, 6226. https://doi.org/10.3390/en13236226

AMA Style

Borowski M, Zwolińska K. Prediction of Cooling Energy Consumption in Hotel Building Using Machine Learning Techniques. Energies. 2020; 13(23):6226. https://doi.org/10.3390/en13236226

Chicago/Turabian Style

Borowski, Marek, and Klaudia Zwolińska. 2020. "Prediction of Cooling Energy Consumption in Hotel Building Using Machine Learning Techniques" Energies 13, no. 23: 6226. https://doi.org/10.3390/en13236226

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop