1. Introduction
Governments, regulatory agencies, and public bodies have been promoting policies and measures for healthy and energy-efficient buildings, issuing directives such as the Directive (2018/844) [
1] developed by the European Parliament. Despite these efforts, a significant amount of energy is often wasted in industrial, commercial, and residential buildings, causing uncomfortable conditions for occupants. Typical examples of such inefficiencies are retail stores, in which energy managers struggle to find a good balance in the tradeoff between optimisation of comfort and minimisation of energy consumption by HVAC systems. Indeed, managers need customers to feel comfortable while shopping at any time of the year and in all environmental conditions. Furthermore, they seek to gain important indoor air quality (IAQ) certifications [
2,
3] in order to distinguish themselves against their competitors.
Currently, HVAC systems are often managed directly on site in buildings, manually operating on thermostats to regulate thermal comfort with not enough attention to energy efficiency. For instance, in commercial buildings, HVAC systems are often left active at the end of the day, thus continuing to work and consuming unnecessary energy during the night when shops are closed. Another common issue in the vast majority of buildings is that whatever HVAC configuration in terms of ON/OFF and set point (i.e., desired target temperature) is configured in the early morning is usually left unchanged throughout the day. However, according to outdoor weather and its impact on the indoor environments, it might be convenient applying different HVAC settings during the day (e.g., switching the devices off). In such scenarios, automated systems capable of continuously optimising HVAC devices over time have drawn the interest of managers. Indeed, these solutions aim to address the aforementioned challenges, efficiently meeting the comfort requirements throughout the day while reducing the energy footprint. This takes a central role in scenarios where a large amount of geographically distributed and physically heterogeneous sites are managed as each of them might require a different control strategy.
Multiple works from the literature deal with the topic of comfort maximisation jointly with energy consumption minimisation in buildings. Despite the progress in this field resulting in innovative solutions, e.g., based on advanced passive strategies [
4,
5] or model predictive control (MPC) [
6,
7,
8,
9], their major drawback is the limited scalability. This refers to the capability to replicate and automate a specific approach across different environments regardless of their physical characteristics. These solutions often require a comprehensive analysis of each building and the machinery installed therein to define tailored physical or mathematical models (e.g., [
4,
5,
8,
10,
11,
12]), which typically need manual updates over time. In parallel, other solutions use complex building-related information (e.g., [
6,
7,
13,
14]) and might require customisations within the monitored environment. While designing smart and adaptive solutions using data collected from Internet of Things (IoT) sensors is essential to optimise HVAC systems [
15], it is crucial to ensure that their deployment and replicability involve automated operations. This makes them attractive from a business perspective, especially for managers who need to control tens or hundreds of buildings. Finally, different works provide theoretical analysis techniques for comfort and energy optimisation but lack real-world validation [
16,
17,
18]. In this regard, Ngarambe et al. [
19] underline that experimental studies demonstrating the benefit of artificial intelligence (AI) control strategies (e.g., MPC approaches) in real environments take a central role.
In this work, we propose a novel solution called energy-efficient comfort optimisation (EECO) based on deep learning (DL) to regulate HVAC systems in an automated manner. It does not require any intervention of expert personnel or prior information of buildings (e.g., installed HVAC devices, layout, and materials) as it works on real data collected from the environment. In this regard, our aim is to analyse how the different agents, including passive phenomena, impact the parameters within the environment through the collected data. This allows us to provide an adaptive solution for the monitored environment that implicitly considers the influence of the different sources. From an applicability perspective, the proposed solution holds the potential for being applied in any building equipped with a control system capable of gathering environmental and energy consumption data and interfacing with local HVAC devices. Basically, after an initial configuration of the main parameters (e.g., the comfort interval throughout the day and some parameters of the comfort model), the proposed solution can effectively work just after its deployment, and it keeps up to date independently over time, resulting in an automated and practical solution. The objective of HVAC optimisation is to guarantee the comfort requirements, at least during opening hours, and then balance both thermal comfort and energy consumption concerns. Indoor comfort is modelled by means of predicted mean vote (PMV) [
20,
21], a thermal comfort index referenced by different indoor comfort standards all over the world, including European Standard EN 16798. A shallow 1D convolutional neural network (CNN) is used as DL architecture to predict the short-term evolution of future indoor environmental parameters (i.e., temperature, humidity, and carbon dioxide (CO
2)) and the energy consumption of the HVAC system. The idea behind the DL model is to predict the environmental and energy impact of a set of possible device configurations (ON/OFF and set point) for
m upcoming time periods. Basically, a tree of possible actuation strategies that keeps track of the environment evolution in the next future based on past (real or predicted) conditions is generated. Each branch of the resulting tree is then evaluated to select the strategy that maintains the best expected comfort at minimal energy cost.
Our work can contribute to reducing the carbon footprint of buildings caused by HVAC systems, improving the comfort conditions to occupants and saving on operating costs required to control thermal comfort. The designed approach has been tested during the summer and winter periods in a real environment of a small production plant belonging to a large retail company in northern Italy. Furthermore, an additional analysis based on software simulations is proposed.
The main contributions of this work are the following:
A practical solution, with no prior information of the local environment (e.g., installed HVAC devices and building features) or need for customisation or intervention of expert personnel, capable of selecting an efficient HVAC configuration in terms of ON/OFF and set point that aims to guarantee the given thermal comfort while minimising energy consumption.
An adaptive and continuous update of the actuations through short-term decisions based on long-term predictions of the environment.
A comparison analysis in terms of tradeoff between thermal comfort and energy consumption with the manual approach, which sets a static set point temperature throughout the day, and a greedy PMV-based solution, which configures the HVAC devices according to the current environmental conditions.
The remainder of this paper is organised as follows.
Section 2 describes the relevant literature.
Section 3 provides background regarding the neural network architecture and predicted mean vote (PMV) index.
Section 4 presents the proposed methodology.
Section 5 illustrates the experimental setup, while
Section 6 describes and presents the results.
Section 7 discusses the limitations of the proposed solution. Finally, conclusions are provided in
Section 8.
2. State of the Art
In the recent scientific literature, a number of research works have been proposed to achieve thermal comfort, trying to solve the problem of the tradeoff between comfort maximisation and energy minimisation from different perspectives. In this regard, different works [
16,
17,
18] tackle the problem through Pareto analysis. This approach provides a set of possible tradeoffs between comfort and energy consumption, each of which might be a feasible solution for the deployment. However, the mentioned research works provide static analysis with a limited number of software simulations and do not consider any prediction in the future for proactive decision making. Additionally, they model the objective functions through ad hoc mathematical models for the specific environment under evaluation, thus limiting their applicability across multiple sites. Finally, while calculating the Pareto front can be useful, a proper strategy is necessary to select a single configuration that guarantees good comfort at a low cost, and this is missing in these works.
Other research initiatives have tackled the problem from another perspective: they physically model the buildings through simulation software to provide either simulated environments for analysis or generate a large amount of data to train AI models [
7,
10,
11,
12]. For instance, Gao et al. [
12] propose a DL solution based on reinforcement learning validated by means of a simulated building thermal environment and an HVAC system; a large amount of hourly simulated data are used to train their AI models. Another solution based on reinforcement learning is presented by Valladares et al. [
11]. In their study, a reinforcement learning model is first trained with 10 years of simulated data, following a similar approach to Gao et al. [
12], before being deployed in real environments to evaluate the performance. By means of training data collected over a large time interval, they achieve a balance among indoor comfort, air quality, and the energy consumption of the air conditioning and ventilation systems. Unlike the research works proposed by Valladares et al. [
11] and Gao et al. [
12], a different solution based on model predictive control (MPC) is proposed by Ascione et al. [
10]. However, even in this case, it relies on simulation-based physical models to optimise the hourly set point temperature for the next 24 h. Furthermore, Jing et al. [
7] propose a simple PMV-based approach to keep the environment within the comfort level and overcome the typical temperature-based mechanism. Despite improvements in terms of daily energy savings, the proposed solution only focuses on thermal comfort, with no attention for a tradeoff between PMV index and energy consumption in the HVAC control strategy. Additionally, the proposed solution is validated and analysed using simulation models, without any validation in real environments. Finally, other works rely on advanced passive strategies. For instance, Liu et al. [
4] analyse the applicability and effectiveness of these technologies in residential buildings through physical models, resulting in significant energy savings. Additionally, de Araujo Passos et al. [
5], in their study, define a mathematical model to optimise a novel HVAC system by relying on passive energy sources (e.g., solar irradiance and heat exchangers) as much as possible. Significant energy-saving results have been achieved, demonstrating that over half of the energy demand can be met passively.
All the research works described above are based on building modelling. In addition to a significant manual effort to model various aspects of the environment (e.g., layout, materials, location, and installed HVAC machinery), this approach provides clear limitations. Firstly, detailed modelling of individual buildings impacts scalability, limiting their replicability across multiple sites with limited effort. Secondly, the usage of simulated data might hinder a faithful replication of real-world environments, which can be affected by unexpected events (e.g., windows or doors being opened or rapid increases in occupancy). In this regard, the validation of AI-control solutions in real environments is fundamental to demonstrate their benefits in the intelligent control of HVAC systems [
19].
Other approaches that do not rely on physical models of buildings are proposed in the literature. Chen et al. [
8] propose an MPC solution by modelling the building through mathematical models. However, complex building-specific information is used (e.g., conduction/convection coefficient, wall thickness, air mass flow rate, etc.). It is worth noting that, in this work, feedback from occupants takes a central role to adapt the thermal comfort based on personal perception, resulting in improved comfort outcomes. In this regard, other studies based on MPC delve into how personal preferences affect the optimisation of energy consumption and the well-being of occupants [
9]. To address the limitations of physical-based models, as per our goal, Manjarres et al. [
13] introduce a framework aimed at minimising energy consumption while ensuring indoor temperatures remain within predefined ranges. The proposed framework outlines an optimal schedule for HVAC ON/OFF and mechanical ventilation (MV) operation for the next 24 h. However, it requires the installation of specific sensors (e.g., in the outlet conduct of the air handling unit within the HVAC device). Additionally, it primarily considers indoor temperature rather than thermal comfort (e.g., PMV index) and does not account for updates to the operating schedule throughout the day in response to potential environmental changes. Similarly, Yang et al. [
6] propose an MPC approach designed to overcome the constraints associated with physical models by integrating AI. Additionally, they introduce an update mechanism over time to capture any possible environmental change. However, their solution requires customisations within the environments in terms of advanced sensors (e.g., combined temperature–humidity–pressure–lux (THPL) sensors) to be installed in specific locations as well as detailed information regarding chilled water of HVAC devices. This bounds their approach to the specific environment being evaluated. Another approach that effectively keeps up with with environmental changes but includes complex building-related information is proposed by Martell et al. [
14]. Indeed, the authors propose a multi-objective control architecture to estimate optimal set points where the computed Pareto front is updated hourly, thus selecting optimal temperature set points for each hour of the day. Despite the update mechanism, even in this case, complex parameters closely tied to the evaluated environment are considered. For instance, the authors use heat gains resulting from different natural phenomena (e.g., convection, ventilation, and infiltration) to model the indoor temperature behaviour, which might be different across different sites.
In summary, existing solutions for comfort optimisation present various limitations that might impact their applicability to real-world scenarios. Indeed, they provide theoretical analysis with no HVAC strategy selection and real-world validation [
16,
17,
18], rely on tailored physical (or mathematical) models [
4,
5,
7,
10,
11,
12], or use complex information of the local environment [
6,
8,
9,
13,
14]. Furthermore, no clear update mechanisms of HVAC settings over time are taken into account, except in rare cases [
6,
10,
14]. Our solution aims to overcome the above limitations. On one hand, it does not require preliminary analysis to define physical or mathematical models of the environment or gather building-specific information. Instead, it adapts to the monitored environment by learning from the collected data. On the other hand, we rely on long-term predictions to make short-term decisions and continuously select the actuation strategy that optimises comfort and minimises energy consumption over time.
4. Methodology
In this work, we tackle the problem of energy-efficient comfort optimisation in indoor environments. That is, we study and develop a methodology for the automated control of HVAC systems so that the defined comfort requirements within the considered environment (e.g., by managers) during the day are respected with minimal energy consumption. As introduced in
Section 3.3, the thermal comfort index (PMV), as defined by Fanger [
23], depends on a set of parameters (such as air temperature and humidity of the environment), which, in a real-world environment, can be influenced by the outdoor conditions. In this regard, adapting the HVAC optimisation according to outdoor weather takes a central role from a research perspective [
32]. At first glance, a trivial greedy PMV-based mechanism that activates the HVAC system when the thermal comfort level is outside the desired range, similar to the approach proposed by Jing et al. [
7], might be viewed as a viable solution. However, such an approach, which makes decisions only considering the current state, might not work as desired. In particular, let us first define four comfort states (represented in
Figure 2) based on the comfort interval defined throughout the day:
No Comfort (NC): the shop is closed (e.g., at night or on Sundays).
No Comfort then Comfort (NC-C): usually early morning before the opening.
Comfort (C): the shop is open (e.g., during a working day).
Comfort then No Comfort (C-NC): generally late afternoon before closing.
Indeed, a greedy approach might not be able to achieve the target comfort at the beginning of the working time (NC-C state), i.e., when the comfort level is far from the target value because of a long inactivity period (e.g., night closure, holiday, etc.). For the same reason, it might activate the HVAC system when the store closure is approaching (C-NC state), leading to inefficient energy utilisation.
Based on these premises, we propose an AI-based solution called EECO, in which a CNN is used to predict the future comfort level and energy consumption of the HVAC. Specifically, given a range of possible HVAC configurations (meaning, ON/OFF, and SP), the CNN predicts the effects of each choice on future comfort and energy consumption. At every quarter of an hour, the system computes the predictions for the next m quarters of an hour, generating an m-level tree of candidate sequences of HVAC configurations.
The ultimate goal of EECO is to select the branch of the tree (hence, a sequence of future HVAC configurations), which, based on the CNN predictions, will minimise an objective function defined as the weighted summation of thermal comfort index PMV and energy.
In the remainder of this section, we describe the whole process of comfort optimisation, including input/output of the CNN, the structure of the decision tree, and the logic behind the choice on the HVAC settings. This process is described in Algorithms 1 and 2 and illustrated in
Figure 3 and
Figure 4.
Algorithm 1 Tree building |
Input: Root node (), Historical data (X), Tree depth (m), Target comfort (), Operating mode (o) |
Output: Tree (t) |
1: | procedure BuildTree(,X,m,,o) |
2: | |
▹ Init tree node at level 0 |
3: | for do |
▹ Loop over tree levels |
4: | |
▹ Init level i |
5: | |
▹ Extract n-1 rows from X |
6: | for do |
▹ Loop over parents |
7: | |
▹ Init list of children of parent node |
8: | |
▹ Init list of actuations |
9: | if then |
10: | GetAct(, , , o) |
11: | end if |
12: | for do |
▹ with |
13: | |
14: | |
15: | |
16: | GetNode() |
17: | .insert() |
18: | end for |
19: | .insert() |
▹ Add nodes of list to level i |
20: | end for |
21: | t.insert() |
▹ Add level i to the tree |
22: | end for |
23: | return t |
24: | end procedure |
Algorithm 2 Get actuations |
Input: Current temperature (T), Current humidity (H), Target comfort (), Operating mode (o) |
Output: List of actuations (A) |
1: | procedure GetAct(T, H, , o) |
2: | |
▹ Init list of actuations |
3: | GetRangeTemperature(T, H, ) |
4: | if <= T<= then |
5: | |
6: | if o = HEATING then |
7: | while do |
8: | |
9: | |
10: | end while |
11: | else if o = COOLING then |
12: | while do |
13: | |
14: | |
15: | end while |
16: | end if |
17: | else |
18: | if o = HEATING then |
19: | |
20: | |
21: | else if o = COOLING then |
22: | |
23: | |
24: | end if |
25: | end if |
26: | return A |
27: | end procedure |
The decision tree is built every quarter of an hour (or time slot) using the output from the previous time slot as a root node. The process that builds the tree is formulated in Algorithm 1 BuildTree. BuildTree takes as input the current root node , historical data of HVAC settings, weather conditions, energy consumption, the target comfort value (a PMV value), and the operating mode o (either heating or cooling). The root node’s attributes include the current HVAC settings, i.e., the operational settings in time slot . In general, a node of the tree is characterised by a 3-digit label and range of attributes. The first digit of the label indicates the level of the tree to which the node belongs, the second digit is the index of the parent node, and the third digit is the index of the node. The attributes are the current HVAC settings at time , i.e., the pair of values , fan speed, and operating mode. Node’s attributes also include average energy consumption , indoor temperature , indoor humidity , and indoor CO.
Figure 3 illustrates a portion of the tree built during time slot
, starting from Level 0, which consists of root node
. Level 1 of the tree is populated with a set of children nodes
,
, each one defined with pair
, i.e., a set of possible HVAC configurations that could be applied during time slot
(Level 1 in
Figure 3). Like all the other tree levels, Level 1 includes the OFF actuation (line 8 of Algorithm 1) and a set of actuations that are computed with Algorithm 2 (called at line 10 of Algorithm 1) using the indoor temperature of the parent node (
for Level 1), the indoor humidity of the parent node (
for Level 1), the target comfort level
, and the HVAC’s operating mode
o (either HEATING or COOLING). Algorithm 2 defines the temperature range to be within the desired target comfort
and, based on that information and HVAC’s operating mode
o, selects the strategy to enter the comfort range or move within that through a couple of actuations.
One of the nodes at Level 1 is the output of the process executed during time slot and contains the HVAC configuration for time slot . Moreover, such a node will be the root node when the process is executed in time slot . Which is the right node? The selection of the most appropriate node is completed by populating the tree up to Level m using the predictions of the CNN to simulate the behaviour of the system in different conditions over the time (until time slot ). The solution is the node at Level 1 that belongs to the branch of the tree whose sequence of actuations guarantees the best comfort at the minimum energy consumption in the long term. The logic behind this decision is explained in the following steps:
Given a level
, and a parent node
, with
and
, the cardinality of Level
,
is the list of possible HVAC actuations for time slot
applied to the children nodes of parent
(lines 8, 10). In
Figure 3,
. For each HVAC configuration
, the system predicts the effects of such configuration on comfort and energy consumption starting from the parent’s conditions
of indoor temperature
, indoor humidity
, indoor CO
2 , and energy consumption
(line 14).
As sketched in
Figure 4, the prediction for node
k is obtained by feeding the CNN with an array of
rows of historical HVAC settings, environmental values, and other features (see
Table 1) observed from
to
. The n
line contains the node’s attributes
and other features related to
. This operation is summarised at line 16 with function
GetNode. Node
, generated using actuation
, is added to the list of children nodes
of parent
(line 17).
The list of children nodes is added to Level , which is then added to the tree when all the parents of the previous Level have been processed.
The above steps are repeated until the maximum tree depth m is reached.
The result of the process is a set
B of simulated sequences of HVAC configurations from time
to time
, which can also be viewed as a set of paths across the decision tree (or branches) from the root node to the leaves. The final step consists of choosing the best path, i.e., the path that minimises both PMV and energy values, as formally expressed in Equation (
1):
The objective function
is the weighted sum of predicted comfort and energy for branch
b, where
is a sum of the predicted values of thermal comfort on each node of the branch, while
is the sum of the predicted values of energy consumption. More precisely,
and
are computed as follows:
where
is a positive number smaller than 1, so that
(
at the power of
i) decreases as the tree level
i increases to provide less importance to the nodes far from the root (i.e., far in the future).
The energy is normalised with the estimation of the maximum energy consumed by the HVAC system in a quarter of an hour and multiplied by the number of time slots in a branch (m). controls the relative weight of comfort and energy values. In our analysis, we focus on a scenario where comfort holds priority. In this regard, we set .
For a given value of
, the solution is represented by the branch
such that
Hence, the output of the whole process is the HVAC configuration for the next time slot , i.e., the attributes of node at Level 1 of branch . The above process is executed every 15 min.
8. Conclusions
In this paper, we have presented an automated solution that leverages AI to continuously regulate HVAC devices with the aim to optimise comfort while minimising the energy footprint. It does not require any preliminary information of the local environment or any physical or mathematical modelling. Through the collected data, it implicitly evaluates the effect of different agents, including building features (e.g., wall thickness, orientation, and window presence) and passive phenomena (e.g., passive heating) on the monitored parameters, thus adapting to the observed environment.
We have tested our approach in a real warehouse of a small production plant belonging to an Italian retail company. Compared to a static approach where the HVAC set point is fixed at a specific temperature, the evaluation results in the real environment show that our solution can slightly improve the indoor comfort with minimal impact on the building’s energy footprint in summer (i.e., cooling mode). On the other hand, during cold months (i.e., heating mode), it achieves higher energy savings (up to approximately 16%) while providing slightly worse comfort conditions.
Due to clear limitations in comparing multiple approaches in a real environment, we have provided an additional comparison analysis based on software simulations between our solution and two other approaches (i.e., the fixed set point and a greedy PMV-based approach). In this regard, the simulated results show significant improvements during the winter months compared to the summer period, confirming the results obtained in the real-world scenario. Indeed, the simulations show slightly reduced performance in terms of comfort requirements but underline substantial energy savings (exceeding 20%). Despite the promising results in our evaluated scenario, the application of our solution on a large scale is subject to overcoming some limitations mentioned in the previous section. Nevertheless, in contrast to non-intelligent approaches that follow a single objective function, the obtained results demonstrate the capability of our solution to guarantee a tradeoff between the comfort level and the energy consumption by dynamically selecting the configuration (ON/OFF and set point) of the HVAC devices.