Applications of Deep Reinforcement Learning for Home Energy Management Systems: A Review
Abstract
1. Introduction
1.1. Modern and Future Home Energy Management Systems—Complexity and Advancements
1.2. Fundamentals of RL and DRL
- $S$: The set of states representing the environment’s possible configurations.
- $A$: The set of actions available to the agent.
- $P$: The transition probability, indicating the likelihood of moving from state $s$ to state $s'$ after taking action $a$.
- $R$: The reward function, providing feedback for transitioning between states due to an action.
- $\gamma$: The discount factor, which balances the importance of immediate rewards versus future rewards.
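The components listed above can be illustrated with a minimal sketch. It assumes a hypothetical two-state toy problem (electricity price is either low or high); the state names, action names, probabilities, and reward values are all invented for illustration and do not come from any of the reviewed studies. The sketch shows how transitions are sampled from the transition probabilities and how the discount factor weights a reward sequence.

```python
import random

# Hypothetical two-state toy problem; every name and number here is an assumption.
STATES = ["low_price", "high_price"]
ACTIONS = ["run_appliance", "defer"]

# P[(s, a)] maps each next state s' to the probability P(s' | s, a).
# In this toy model the price evolves independently of the action taken.
P = {(s, a): {"low_price": 0.7, "high_price": 0.3} for s in STATES for a in ACTIONS}

# R[(s, a)] is the immediate reward (a negative cost) for action a in state s.
R = {
    ("low_price", "run_appliance"): -1.0,
    ("high_price", "run_appliance"): -3.0,
    ("low_price", "defer"): -0.5,
    ("high_price", "defer"): -0.5,
}


def step(state, action, rng):
    """Sample one transition: return (next_state, reward) for the pair (state, action)."""
    next_states = list(P[(state, action)])
    weights = [P[(state, action)][s] for s in next_states]
    next_state = rng.choices(next_states, weights=weights, k=1)[0]
    return next_state, R[(state, action)]


def discounted_return(rewards, gamma):
    """Return sum over t of gamma**t * rewards[t] for a finite reward sequence."""
    if not 0.0 <= gamma <= 1.0:
        raise ValueError("gamma must lie in [0, 1]")
    total = 0.0
    # Accumulate backwards so each step needs a single multiplication by gamma.
    for r in reversed(rewards):
        total = r + gamma * total
    return total


# Roll out four steps with a fixed action and a seeded generator.
rng = random.Random(0)
state, rewards = "low_price", []
for _ in range(4):
    state, reward = step(state, "run_appliance", rng)
    rewards.append(reward)

episode_return = discounted_return(rewards, gamma=0.9)
```

With `gamma=0.0` only the first reward counts, whereas `gamma=1.0` yields the plain sum of all rewards; intermediate values trade off the two.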
- Observations: Information about the environment’s state that helps determine the agent’s next action. In some cases, the agent may not have access to all state variables, which can complicate the learning process.
- Actions: The choices the agent makes to influence the environment’s state. Actions can be discrete or continuous, depending on the problem domain.
- Reward: A scalar value returned by the environment after an action, indicating the desirability of that action. The agent aims to maximize the total reward over its interactions with the environment.
- Trajectory: A sequence of state-action-reward tuples, representing the agent’s interaction with the environment, e.g., $(s_0, a_0, r_0), (s_1, a_1, r_1), \ldots$
- Replay Buffer: A memory mechanism used to store past experiences $(s, a, r, s')$, which helps to break the correlation between consecutive samples and enables more effective optimization, especially in deep RL approaches [53].
- Policy: A function mapping observations to actions, defining the agent’s behavior. The policy can be deterministic or stochastic, depending on the application.
- Discount Factor: A parameter that balances the trade-off between immediate and future rewards, shaping the agent’s long-term strategy.
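The replay buffer described above can be sketched as a fixed-capacity queue with uniform random sampling. This is a minimal illustration rather than the implementation used in any cited work; the class name `ReplayBuffer`, the `Transition` field names, and the integer placeholder states are assumptions made for the example.

```python
import random
from collections import deque, namedtuple

# One stored experience; the field names are illustrative assumptions.
Transition = namedtuple("Transition", ["state", "action", "reward", "next_state", "done"])


class ReplayBuffer:
    """Fixed-capacity memory of past transitions with uniform random sampling."""

    def __init__(self, capacity, seed=None):
        # A bounded deque silently discards the oldest transition when full.
        self._memory = deque(maxlen=capacity)
        self._rng = random.Random(seed)

    def push(self, state, action, reward, next_state, done):
        """Store one transition observed while interacting with the environment."""
        self._memory.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size):
        """Draw a batch without replacement; random order breaks temporal correlation."""
        return self._rng.sample(list(self._memory), batch_size)

    def __len__(self):
        return len(self._memory)


# Usage: integer states stand in for real observations.
buffer = ReplayBuffer(capacity=3, seed=0)
for t in range(5):
    buffer.push(state=t, action=0, reward=-1.0, next_state=t + 1, done=(t == 4))

batch = buffer.sample(2)
```

Because the capacity is 3, only the three most recent transitions (states 2, 3, and 4) remain available for sampling after five insertions.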
- Deep Q-Network (DQN): The DQN extends Q-learning by approximating the Q-value function with a deep neural network, typically a convolutional neural network (CNN) when the input is image data. Experience replay and target networks stabilize learning in high-dimensional tasks. Experience replay stores agent–environment interactions and samples them at random to reduce data correlation. Target networks keep the learning objective stable by holding a separate copy of the network parameters that is updated only periodically. These mechanisms enabled DQN to achieve superhuman performance in several Atari games, thereby demonstrating the potential of DRL [56].
- Proximal Policy Optimization (PPO): PPO simplifies policy optimization by replacing complex constraints with a clipped surrogate objective, thereby ensuring stable updates and avoiding overly aggressive policy changes. The process alternates between sampling data and optimizing policies using minibatch gradient descent. The robustness and computational efficiency of PPO make it a preferred choice for robotics and scalable DRL applications [57].
- Advantage Actor–Critic (A2C): The A2C algorithm employs an actor–critic architecture, which combines policy optimization through the actor network and value estimation via the critic network. The actor network generates probabilistic actions based on states, while the critic evaluates these actions to refine policy gradients. A2C uses an advantage function to stabilize training; it quantifies how much better an action is than the expected value of the state in which it is taken. In a hydrocracking optimization study, A2C was integrated with a DNN surrogate model to adapt quickly to changing targets. This model serves as the environment for the agent, enabling accurate determination of optimal operating conditions with enhanced computational efficiency [58].
- Asynchronous Advantage Actor–Critic (A3C): A3C introduces the concept of asynchronous parallel agents interacting with separate environment instances, which serves to decorrelate data and stabilize the training process. The combination of policy-based (actor) and value-based (critic) methods allows for the efficient training of policies using n-step returns and entropy regularization, which encourages exploration. The versatility of this approach has made it an effective method for navigation, robotics, and continuous control tasks [59].
- Deep Deterministic Policy Gradient (DDPG): The DDPG algorithm extends reinforcement learning to continuous control tasks by employing an actor–critic architecture with two neural networks—an actor to generate deterministic actions and a critic to estimate Q-values. Experience replay and target networks are key stabilizing mechanisms, decoupling experience correlations and maintaining consistent targets for updates. An enhanced variant introduces prioritized experience replay, which ranks experiences based on temporal-difference errors, emphasizing higher-value experiences during training. This prioritization accelerates learning, improves stability, and mitigates sensitivity to hyperparameter changes [60].
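Three of the quantities mentioned in the algorithm descriptions above can be written as short scalar functions: the one-step target used with a DQN target network, the advantage used by actor–critic methods, and the clipped surrogate term of PPO. This is a hedged sketch on plain Python floats, not a training loop; the function names are hypothetical, and the default clipping range `epsilon=0.2` is a commonly used value assumed here for illustration.

```python
def dqn_target(reward, next_q_values, gamma, done):
    """One-step target y = r + gamma * max_a' Q_target(s', a'); just r if terminal.

    next_q_values are the target network's Q-value estimates for the next state.
    """
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def advantage(q_value, state_value):
    """Advantage A(s, a) = Q(s, a) - V(s): how much better a is than the state's average."""
    return q_value - state_value


def ppo_clipped_term(ratio, adv, epsilon=0.2):
    """Per-sample PPO surrogate: min(ratio * A, clip(ratio, 1 - eps, 1 + eps) * A).

    ratio is the new-to-old policy probability ratio for the sampled action.
    """
    clipped_ratio = max(1.0 - epsilon, min(ratio, 1.0 + epsilon))
    return min(ratio * adv, clipped_ratio * adv)


# Illustrative numbers only.
y = dqn_target(reward=1.0, next_q_values=[0.5, 2.0], gamma=0.9, done=False)
a = advantage(q_value=2.0, state_value=1.5)
gain_capped = ppo_clipped_term(ratio=1.5, adv=2.0)     # positive advantage: gain is capped
penalty_kept = ppo_clipped_term(ratio=1.5, adv=-2.0)   # negative advantage: penalty is not capped
```

The two PPO calls show the asymmetry of the clipped objective: when the advantage is positive, increasing the probability ratio beyond `1 + epsilon` yields no further gain, whereas a harmful change is still penalized in full.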
1.3. Deploying Artificial Intelligence/Machine Learning/RL in Building Automation—Trends and Significant Challenges
1.3.1. Data Collection Architectures
1.3.2. Scalability Considerations
1.3.3. Data Security and Privacy
1.4. An Original Contribution and the Paper Structure
- Synergy between RL and IoT for real-time smart home systems. While RL and IoT have individually shown promise in home and building automation [43,70,71,72], this review is among the first to extensively analyze how RL can be leveraged with IoT networks to achieve real-time monitoring and adaptive control in energy management. Moreover, it demonstrates the potential for more efficient and autonomous building operations through the utilization of IoT sensors to feed RL systems with real-time data on energy usage, occupancy, and environmental conditions;
- Innovative approaches to DSR optimization. This review identifies a novel application of RL in enhancing DSR programs, enabling homes, particularly prosumers, to dynamically respond to fluctuations in energy prices and grid conditions. By utilizing RL, homes and buildings can autonomously learn optimal strategies for shifting or reducing energy loads, contributing to grid stability and energy cost savings, particularly in the context of peak demand periods. The ability of RL to adapt to varying DR signals and building-specific constraints presents a significant advancement over traditional rule-based approaches;
- Advanced scheduling for energy and resource optimization. A unique focus of this review is the application of RL in scheduling algorithms for home automation systems, particularly in relation to energy consumption, occupancy prediction, and appliance usage. This review explores how RL and DRL can optimize multiobjective scheduling problems, balancing comfort, energy efficiency, and operational costs. Such applications are critical for ensuring flexible home and prosumer systems, capable of responding to dynamic energy demands and varying occupant needs;
- Integration of RL and DRL with RES and energy storage systems. One of the most novel aspects of this review is the examination of how RL and DRL techniques can be used to manage RESs, such as solar and wind, in conjunction with energy storage systems, especially important for modern and future prosumer applications. By enabling intelligent decision-making about when to store, use, or sell generated energy, RL and DRL algorithms can help maximize the self-consumption of renewables and ensure grid or microgrid independence. This is particularly important in homes and buildings aiming for net-zero energy performance, as RL-driven strategies can optimize the use of intermittent RES in real time;
- Bridging the gap between theory and practice. While much of the existing research on RL in building automation remains theoretical or simulation-based, this review uniquely emphasizes the need for practical case studies and real-world implementations. It identifies key challenges such as scalability, data availability, and heterogeneous system integration, offering insights into how these challenges can be overcome when deploying RL-based systems in operational environments.
2. Methodology of the Review
3. State of the Art and Practice
- IoT applications
- Algorithms used: DRL, Deep Q-learning, Q-learning, DDPG
- Objectives: Focuses on optimizing cost and comfort, with additional considerations for autonomy, personalization, and privacy
- Verification: All experiments and models are verified through simulations
- DSR applications
- Algorithms used: A variety including MORL, Q-learning (and its variations with Fuzzy Reasoning), DQN, MARL, PPO, Actor–Critic methods, among others
- Objectives: Primarily target cost and comfort optimization
- Verification: Both simulations and some evaluations using real-world data or physical testing setups (e.g., MATLAB and Arduino Uno)
- Scheduling applications
- Algorithms used: Q-learning, DQN, PPO, MADDPG, among others
- Objectives: Focus on cost and comfort optimization, with several entries solely targeting cost
- Verification: Predominantly simulations, with some studies using practical data from real-world networks
- RES and energy storage systems
- Algorithms used: TRPO, SAC, Q-learning, PPO, DDPG, and others
- Objectives: Aimed at optimizing cost and comfort, with a specific focus on energy systems integrating renewable sources and storage
- Verification: All studies verify their findings through simulations, with some using real-world data from energy markets and PV profiles.
- Electric vehicles
- Algorithms used: Q-learning, DDPG, MDP, DQN, and others
- Objectives: Aimed at optimizing cost and comfort, with a specific focus on grid stability and integrating renewable sources
- Verification: All studies verify their findings through simulations with real-world data.
4. Applications of Reinforcement Learning for Home Automation
Problems, Gaps and Challenges
- Mean Absolute Error (MAE): The mean of the absolute differences between predicted and actual values. It provides a straightforward measure of model accuracy, with lower values indicating better performance.
- Mean Squared Error (MSE): The mean of the squared differences between predicted and actual values, which emphasizes larger errors. This metric is particularly useful for identifying models that are prone to occasional significant errors.
- Root Mean Squared Error (RMSE): The square root of the MSE, which expresses prediction error in the same units as the target variable. A lower RMSE indicates better performance.
- Median Absolute Error: The median of the absolute differences between predicted and actual values. It is robust to outliers and reflects the typical prediction error.
- Mean Absolute Percentage Error (MAPE): The prediction error expressed as a percentage of the actual values, giving a normalized accuracy metric suitable for scenarios where the target variable varies in scale.
- R² value: Also known as the coefficient of determination, this statistical measure quantifies the proportion of variance in the actual data that is explained by the model. A value approaching 1 indicates a model that captures most of the data’s variability, whereas values approaching 0 indicate that it captures little of it.
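The metrics listed above can be computed directly from paired lists of actual and predicted values. The following is a minimal sketch using only the Python standard library; the function name `regression_metrics`, the dictionary keys, and the sample values are assumptions made for illustration and are not results from any reviewed study.

```python
import math
import statistics


def regression_metrics(y_true, y_pred):
    """Compute MAE, MSE, RMSE, median absolute error, MAPE (in %) and R^2."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    abs_errors = [abs(e) for e in errors]

    mae = sum(abs_errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    medae = statistics.median(abs_errors)

    # MAPE is undefined if any actual value is zero; report NaN in that case.
    if any(t == 0 for t in y_true):
        mape = float("nan")
    else:
        mape = 100.0 * sum(abs(e) / abs(t) for e, t in zip(errors, y_true)) / n

    # R^2 = 1 - SS_res / SS_tot; undefined when the actual values are constant.
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")

    return {"mae": mae, "mse": mse, "rmse": rmse, "medae": medae, "mape": mape, "r2": r2}


# Illustrative values only (e.g., hourly energy use in kWh).
actual = [2.0, 4.0, 6.0, 8.0]
predicted = [2.5, 3.5, 6.0, 9.0]
metrics = regression_metrics(actual, predicted)
```

For this sample, the single error of 1.0 raises the RMSE above the MAE, which illustrates how the squared-error metrics weight occasional large deviations more heavily.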
5. Opportunities and Future Perspective in Application of Reinforcement Learning in Home Automation and Home and Building Energy Management
- Adaptability and optimization: The utilization of DRL models in HEMS facilitates the dynamic realignment of energy management strategies (storage and consumption) in accordance with fluctuating market and weather conditions, as well as evolving user preferences. This approach has the potential to significantly enhance energy and cost savings [172,173];
- Integration with renewable energy sources: RL-based HEMS facilitate the integration of renewable energy sources, such as photovoltaic panels and wind turbines, thereby enhancing energy independence and reducing reliance on the power grid [172];
- HEMS as a part of the Smart Grid
- Data security and privacy: The implementation of DRL in HEMS requires sophisticated data protection and security methods to safeguard the confidentiality of user data and the integrity of the system against cyberattacks [29].
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Nomenclature
Nomenclature | Definition |
A2C | Advantage Actor–Critic |
A3C | Asynchronous Advantage Actor–Critic |
AI | Artificial Intelligence |
ANN | Artificial Neural Networks |
BACS | Building Automation and Control Systems |
BMS | Building Management Systems |
DDPG | Deep Deterministic Policy Gradient |
CDDPG | Charging Control Deep Deterministic Policy Gradient |
CNN | Convolutional Neural Networks |
DDQN | Double Deep Q-learning |
DERs | Distributed Energy Resources |
DQN | Deep Q-network |
DRL | Deep Reinforcement Learning |
DSM | Demand Side Management |
DSO | Distribution System Operator |
DSR | Demand Side Response |
DT | Digital Twin |
DTA | Dual Targeting Algorithm |
EPBD | Energy Performance of Buildings Directive |
EV | Electric Vehicle |
GDPR | General Data Protection Regulation |
HEMS | Home Energy Management Systems |
HVAC | Heating, Ventilation and Air Conditioning |
IFC | Industry Foundation Classes |
IoT | Internet of Things |
LLM | Large Language Models |
LSTM | Long Short-Term Memory |
MADDPG | Multi-agent Deep Deterministic Policy Gradient |
MARL | Multi-Agent Reinforcement Learning |
MDP | Markov Decision Process |
ML | Machine Learning |
MORL | Multi-Objective Reinforcement Learning |
PPO | Proximal Policy Optimization |
PV | Photovoltaic |
RES | Renewable Energy Sources |
RL | Reinforcement Learning |
SAC | Soft Actor–Critic |
SRI | Smart Readiness Indicator |
TD3 | Twin Delayed Deep Deterministic Policy Gradient |
TL | Transfer Learning |
TRPO | Trust Region Policy Optimization |
V2G | Vehicle-to-Grid |
V2H | Vehicle-to-Home |
References
- Filho, G.P.R.; Villas, L.A.; Gonçalves, V.P.; Pessin, G.; Loureiro, A.A.F.; Ueyama, J. Energy-Efficient Smart Home Systems: Infrastructure and Decision-Making Process. Internet Things 2019, 5, 153–167.
- Pratt, A.; Krishnamurthy, D.; Ruth, M.; Wu, H.; Lunacek, M.; Vaynshenk, P. Transactive Home Energy Management Systems: The Impact of Their Proliferation on the Electric Grid. IEEE Electrif. Mag. 2016, 4, 8–14.
- Diyan, M.; Silva, B.N.; Han, K. A Multi-Objective Approach for Optimal Energy Management in Smart Home Using the Reinforcement Learning. Sensors 2020, 20, 3450.
- Pau, G.; Collotta, M.; Ruano, A.; Qin, J. Smart Home Energy Management. Energies 2017, 10, 382.
- Umair, M.; Cheema, M.A.; Afzal, B.; Shah, G. Energy Management of Smart Homes over Fog-Based IoT Architecture. Sustain. Comput. Inform. Syst. 2023, 39, 100898.
- Deanseekeaw, A.; Khortsriwong, N.; Boonraksa, P.; Boonraksa, T.; Marungsri, B. Optimal Load Scheduling for Smart Home Energy Management Using Deep Reinforcement Learning. In Proceedings of the 2024 12th International Electrical Engineering Congress (iEECON), Pattaya, Thailand, 6–8 March 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–4.
- Ożadowicz, A.; Grela, J. An Event-Driven Building Energy Management System Enabling Active Demand Side Management. In Proceedings of the 2016 Second International Conference on Event-Based Control, Communication, and Signal Processing (EBCCSP), Krakow, Poland, 13–15 June 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–8.
- Verschae, R.; Kato, T.; Matsuyama, T. Energy Management in Prosumer Communities: A Coordinated Approach. Energies 2016, 9, 562.
- European Parliament. Directive (EU) 2024/1275 of the European Parliament and the Council on the Energy Performance of Buildings; The European Parliament and the Council of the European Union: Strasbourg, France, 2024.
- European Commission. Energy Roadmap 2050; European Commission: Brussels, Belgium, 2012.
- Fokaides, P.A.; Panteli, C.; Panayidou, A. How Are the Smart Readiness Indicators Expected to Affect the Energy Performance of Buildings: First Evidence and Perspectives. Sustainability 2020, 12, 9496.
- Märzinger, T.; Österreicher, D. Extending the Application of the Smart Readiness Indicator—A Methodology for the Quantitative Assessment of the Load Shifting Potential of Smart Districts. Energies 2020, 13, 3507.
- Ożadowicz, A. A Hybrid Approach in Design of Building Energy Management System with Smart Readiness Indicator and Building as a Service Concept. Energies 2022, 15, 1432.
- ISO 52120-1:2021; Energy Performance of Buildings: Contribution of Building Automation, Controls and Building Management. International Organization for Standardization: Geneva, Switzerland, 2021.
- Favuzza, S.; Ippolito, M.; Massaro, F.; Musca, R.; Riva Sanseverino, E.; Schillaci, G.; Zizzo, G. Building Automation and Control Systems and Electrical Distribution Grids: A Study on the Effects of Loads Control Logics on Power Losses and Peaks. Energies 2018, 11, 667.
- Mahmood, A.; Baig, F.; Alrajeh, N.; Qasim, U.; Khan, Z.; Javaid, N. An Enhanced System Architecture for Optimized Demand Side Management in Smart Grid. Appl. Sci. 2016, 6, 122.
- Hou, P.; Yang, G.; Hu, J.; Douglass, P.J.; Xue, Y. A Distributed Transactive Energy Mechanism for Integrating PV and Storage Prosumers in Market Operation. Engineering 2022, 12, 171–182.
- Kato, T.; Ishikawa, N.; Yoshida, N. Distributed Autonomous Control of Home Appliances Based on Event Driven Architecture. In Proceedings of the 2017 IEEE 6th Global Conference on Consumer Electronics (GCCE), Nagoya, Japan, 24–27 October 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1–2.
- Charbonnier, F.; Morstyn, T.; McCulloch, M.D. Scalable Multi-Agent Reinforcement Learning for Distributed Control of Residential Energy Flexibility. Appl. Energy 2022, 314, 118825.
- Delsing, J. Local Cloud Internet of Things Automation: Technology and Business Model Features of Distributed Internet of Things Automation Solutions. IEEE Ind. Electron. Mag. 2017, 11, 8–21.
- Yassine, A.; Singh, S.; Hossain, M.S.; Muhammad, G. IoT Big Data Analytics for Smart Homes with Fog and Cloud Computing. Future Gener. Comput. Syst. 2019, 91, 563–573.
- Machorro-Cano, I.; Alor-Hernández, G.; Paredes-Valverde, M.A.; Rodríguez-Mazahua, L.; Sánchez-Cervantes, J.L.; Olmedo-Aguirre, J.O. HEMS-IoT: A Big Data and Machine Learning-Based Smart Home System for Energy Saving. Energies 2020, 13, 1097.
- Bawa, M.; Caganova, D.; Szilva, I.; Spirkova, D. Importance of Internet of Things and Big Data in Building Smart City and What Would Be Its Challenges. In Smart City 360°; Leon-Garcia, A., Lenort, R., Holman, D., Staš, D., Krutilova, V., Wicher, P., Cagáňová, D., Špirková, D., Golej, J., Nguyen, K., Eds.; Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering; Springer International Publishing: Cham, Switzerland, 2016; Volume 166, pp. 605–616. ISBN 978-3-319-33680-0.
- Lawal, K.N.; Olaniyi, T.K.; Gibson, R.M. Leveraging Real-World Data from IoT Devices in a Fog–Cloud Architecture for Resource Optimisation within a Smart Building. Appl. Sci. 2023, 14, 316.
- Akter, M.N.; Mahmud, M.A.; Oo, A.M.T. A Hierarchical Transactive Energy Management System for Microgrids. In Proceedings of the 2016 IEEE Power and Energy Society General Meeting (PESGM), Boston, MA, USA, 17–21 July 2016; IEEE: Piscataway, NJ, USA, 2016; Volume 2016, pp. 1–5.
- Taghizad-Tavana, K.; Ghanbari-Ghalehjoughi, M.; Razzaghi-Asl, N.; Nojavan, S.; Alizadeh, A. An Overview of the Architecture of Home Energy Management System as Microgrids, Automation Systems, Communication Protocols, Security, and Cyber Challenges. Sustainability 2022, 14, 15938.
- Kiehbadroudinezhad, M.; Merabet, A.; Abo-Khalil, A.G.; Salameh, T.; Ghenai, C. Intelligent and Optimized Microgrids for Future Supply Power from Renewable Energy Resources: A Review. Energies 2022, 15, 3359.
- Chamana, M.; Schmitt, K.E.K.; Bhatta, R.; Liyanage, S.; Osman, I.; Murshed, M.; Bayne, S.; MacFie, J. Buildings Participation in Resilience Enhancement of Community Microgrids: Synergy Between Microgrid and Building Management Systems. IEEE Access 2022, 10, 100922–100938.
- Al-Ani, O.; Das, S. Reinforcement Learning: Theory and Applications in HEMS. Energies 2022, 15, 6392.
- Wang, Z.; Hong, T. Reinforcement Learning for Building Controls: The Opportunities and Challenges. Appl. Energy 2020, 269, 115036.
- Benjamin, A.; Badar, A.Q.H. Reinforcement Learning Based Cost-Effective Smart Home Energy Management. In Proceedings of the 2023 IEEE 3rd International Conference on Sustainable Energy and Future Electric Transportation (SEFET), Bhubaneswar, India, 9–12 August 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–5.
- Yu, L.; Qin, S.; Zhang, M.; Shen, C.; Jiang, T.; Guan, X. A Review of Deep Reinforcement Learning for Smart Building Energy Management. IEEE Internet Things J. 2021, 8, 12046–12063.
- Wei, T.; Wang, Y.; Zhu, Q. Deep Reinforcement Learning for Building HVAC Control. In Proceedings of the 54th Annual Design Automation Conference 2017, Austin, TX, USA, 18–22 June 2017; ACM: New York, NY, USA, 2017; pp. 1–6.
- Yu, L.; Xie, W.; Xie, D.; Zou, Y.; Zhang, D.; Sun, Z.; Zhang, L.; Zhang, Y.; Jiang, T. Deep Reinforcement Learning for Smart Home Energy Management. IEEE Internet Things J. 2020, 7, 2751–2762.
- Kodama, N.; Harada, T.; Miyazaki, K. Home Energy Management Algorithm Based on Deep Reinforcement Learning Using Multistep Prediction. IEEE Access 2021, 9, 153108–153115.
- Perez, K.X.; Baldea, M.; Edgar, T.F. Integrated Smart Appliance Scheduling and HVAC Control for Peak Residential Load Management. In Proceedings of the 2016 American Control Conference (ACC), Boston, MA, USA, 6–8 July 2016; IEEE: Piscataway, NJ, USA, 2016; Volume 2016, pp. 1458–1463.
- Tekler, Z.D.; Low, R.; Yuen, C.; Blessing, L. Plug-Mate: An IoT-Based Occupancy-Driven Plug Load Management System in Smart Buildings. Build. Environ. 2022, 223, 109472.
- Fambri, G.; Badami, M.; Tsagkrasoulis, D.; Katsiki, V.; Giannakis, G.; Papanikolaou, A. Demand Flexibility Enabled by Virtual Energy Storage to Improve Renewable Energy Penetration. Energies 2020, 13, 5128.
- Mancini, F.; Lo Basso, G.; de Santoli, L. Energy Use in Residential Buildings: Impact of Building Automation Control Systems on Energy Performance and Flexibility. Energies 2019, 12, 2896.
- Liu, Z.; Zhang, X.; Sun, Y.; Zhou, Y. Advanced Controls on Energy Reliability, Flexibility and Occupant-Centric Control for Smart and Energy-Efficient Buildings. Energy Build. 2023, 297, 113436.
- Babar, M.; Grela, J.; Ożadowicz, A.; Nguyen, P.; Hanzelka, Z.; Kamphuis, I. Energy Flexometer: Transactive Energy-Based Internet of Things Technology. Energies 2018, 11, 568.
- Chen, Y.; Yang, Y.; Xu, X. Towards Transactive Energy: An Analysis of Information-related Practical Issues. Energy Convers. Econ. 2022, 3, 112–121.
- Sheshalani, B.; Zapiee, M.K.; Mohana, D. Smart Home Automation System Using IOT. Int. J. Recent Technol. Appl. Sci. 2022, 4, 44–53.
- Yar, H.; Imran, A.S.; Khan, Z.A.; Sajjad, M.; Kastrati, Z. Towards Smart Home Automation Using IoT-Enabled Edge-Computing Paradigm. Sensors 2021, 21, 4932.
- Almusaylim, Z.A.; Zaman, N. A Review on Smart Home Present State and Challenges: Linked to Context-Awareness Internet of Things (IoT). Wirel. Netw. 2019, 25, 3193–3204.
- Sun, H.; Yu, H.; Fan, G.; Chen, L. Energy and Time Efficient Task Offloading and Resource Allocation on the Generic IoT-Fog-Cloud Architecture. Peer Peer Netw. Appl. 2020, 13, 548–563.
- García-Monge, M.; Zalba, B.; Casas, R.; Cano, E.; Guillén-Lambea, S.; López-Mesa, B.; Martínez, I. Is IoT Monitoring Key to Improve Building Energy Efficiency? Case Study of a Smart Campus in Spain. Energy Build. 2023, 285, 112882.
- Arif, S.; Khan, M.A.; Rehman, S.U.; Kabir, M.A.; Imran, M. Investigating Smart Home Security: Is Blockchain the Answer? IEEE Access 2020, 8, 117802–117816.
- Graveto, V.; Cruz, T.; Simöes, P. Security of Building Automation and Control Systems: Survey and Future Research Directions. Comput. Secur. 2022, 112, 102527.
- Parikh, S.; Dave, D.; Patel, R.; Doshi, N. Security and Privacy Issues in Cloud, Fog and Edge Computing. Procedia Comput. Sci. 2019, 160, 734–739.
- Abed, S.; Jaffal, R.; Mohd, B.J. A Review on Blockchain and IoT Integration from Energy, Security and Hardware Perspectives. Wirel. Pers. Commun. 2023, 129, 2079–2122.
- Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction, 2nd ed.; Adaptive Computation and Machine Learning; The MIT Press: Cambridge, MA, USA, 2018; ISBN 9780262039246. (Hardcover).
- Lillicrap, T.P.; Hunt, J.J.; Pritzel, A.; Heess, N.; Erez, T.; Tassa, Y.; Silver, D.; Wierstra, D. Continuous Control with Deep Reinforcement Learning. arXiv 2015, arXiv:1509.02971.
- Wang, X.; Wang, S.; Liang, X.; Zhao, D.; Huang, J.; Xu, X.; Dai, B.; Miao, Q. Deep Reinforcement Learning: A Survey. IEEE Trans. Neural Netw. Learn. Syst. 2024, 35, 5064–5078.
- Liu, X.; Zhang, J.; Hou, Z.; Yang, Y.I.; Gao, Y.Q. From Predicting to Decision Making: Reinforcement Learning in Biomedicine. WIREs Comput. Mol. Sci. 2024, 14, e1723.
- Roderick, M.; MacGlashan, J.; Tellex, S. Implementing the Deep Q-Network. arXiv 2017, arXiv:1711.07478.
- Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal Policy Optimization Algorithms. arXiv 2017, arXiv:1707.06347.
- Oh, D.-H.; Adams, D.; Vo, N.D.; Gbadago, D.Q.; Lee, C.-H.; Oh, M. Actor-Critic Reinforcement Learning to Estimate the Optimal Operating Conditions of the Hydrocracking Process. Comput. Chem. Eng. 2021, 149, 107280.
- Mnih, V.; Badia, A.P.; Mirza, M.; Graves, A.; Lillicrap, T.P.; Harley, T.; Silver, D.; Kavukcuoglu, K. Asynchronous Methods for Deep Reinforcement Learning. arXiv 2016, arXiv:1602.01783.
- Hou, Y.; Liu, L.; Wei, Q.; Xu, X.; Chen, C. A Novel DDPG Method with Prioritized Experience Replay. In Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Banff, AB, Canada, 5–8 October 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 316–321.
- Kumar Rachakatla, S.; Ravichandran, P.; Reddy Machireddy, J. Scalable Machine Learning Workflows in Data Warehousing: Automating Model Training and Deployment with AI. Aust. J. Mach. Learn. Res. Appl. 2022, 2, 262–286.
- Stone, G.B.; Talbert, D.A.; Eberle, W. A Survey of Scalable Reinforcement Learning. Int. J. Intell. Comput. Res. 2022, 13, 1118–1124.
- Sanz-Jimeno, R.; Álvarez-Díaz, S. A Tool Based on the Industry Foundation Classes Standard for Dynamic Data Collection and Automatic Generation of Building Automation Control Networks. J. Build. Eng. 2023, 78, 107625.
- Ruiz-Zafra, A.; Benghazi, K.; Noguera, M. IFC+: Towards the Integration of IoT into Early Stages of Building Design. Autom. Constr. 2022, 136, 104129.
- Tang, S.; Shelden, D.R.; Eastman, C.M.; Pishdad-Bozorgi, P.; Gao, X. BIM Assisted Building Automation System Information Exchange Using BACnet and IFC. Autom. Constr. 2020, 110, 103049.
- Sadeghi Eshkevari, S.; Tang, X.; Qin, Z.; Mei, J.; Zhang, C.; Meng, Q.; Xu, J. Reinforcement Learning in the Wild: Scalable RL Dispatching Algorithm Deployed in Ridehailing Marketplace. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 14–18 August 2022; ACM: New York, NY, USA, 2022; pp. 3838–3848.
- Mo, K.; Ye, P.; Ren, X.; Wang, S.; Li, W.; Li, J. Security and Privacy Issues in Deep Reinforcement Learning: Threats and Countermeasures. ACM Comput. Surv. 2024, 56, 1–39.
- Papernot, N.; McDaniel, P.; Sinha, A.; Wellman, M.P. SoK: Security and Privacy in Machine Learning. In Proceedings of the 2018 IEEE European Symposium on Security and Privacy (EuroS&P), London, UK, 24–26 April 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 399–414.
- Benyahya, M.; Kechagia, S.; Collen, A.; Nijdam, N.A. The Interface of Privacy and Data Security in Automated City Shuttles: The GDPR Analysis. Appl. Sci. 2022, 12, 4413.
- Ożadowicz, A. Generic IoT for Smart Buildings and Field-Level Automation—Challenges, Threats, Approaches, and Solutions. Computers 2024, 13, 45.
- Yu, J.; Kim, M.; Bang, H.C.; Bae, S.H.; Kim, S.J. IoT as a Applications: Cloud-Based Building Management Systems for the Internet of Things. Multimed. Tools Appl. 2016, 75, 14583–14596.
- Kastner, W.; Kofler, M.; Jung, M.; Gridling, G.; Weidinger, J. Building Automation Systems Integration into the Internet of Things. The IoT6 Approach, Its Realization and Validation. In Proceedings of the Emerging Technology and Factory Automation (ETFA), Barcelona, Spain, 16–19 September 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1–9.
- Stijin, V.; Dorien, A.; Glenn, R.; Yixiao, M.; Waide, P. Final Report on the Technical Support to the Development of a Smart Readiness Indicator for Buildings; European Commission: Brussels, Belgium, 2020.
- European Parliament Directive (EU) 2018/844 of the European Parliament and the Council on the Energy Performance of Buildings; The European Parliament and The Council of the European Union: Strasbourg, France, 2018.
- Ramezani, B.; da Silva, M.C.G.; Simões, N. Application of Smart Readiness Indicator for Mediterranean Buildings in Retrofitting Actions. Energy Build. 2021, 249, 111173.
- Janhunen, E.; Pulkka, L.; Säynäjoki, A.; Junnila, S. Applicability of the Smart Readiness Indicator for Cold Climate Countries. Buildings 2019, 9, 102.
- Ożadowicz, A.; Grela, J. Impact of Building Automation Control Systems on Energy Efficiency—University Building Case Study. In Proceedings of the 2017 22nd IEEE International Conference on Emerging Technologies and Factory Automation (ETFA), Limassol, Cyprus, 12–15 September 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1–8.
- Ożadowicz, A.; Grela, J. Energy Saving in the Street Lighting Control System—A New Approach Based on the EN-15232 Standard. Energy Effic. 2017, 10, 563–576.
- Laroui, M.; Nour, B.; Moungla, H.; Cherif, M.A.; Afifi, H.; Guizani, M. Edge and Fog Computing for IoT: A Survey on Current Research Activities & Future Directions. Comput. Commun. 2021, 180, 210–231.
- Genkin, M.; McArthur, J.J. B-SMART: A Reference Architecture for Artificially Intelligent Autonomic Smart Buildings. Eng. Appl. Artif. Intell. 2023, 121, 106063.
- Seitz, A.; Johanssen, J.O.; Bruegge, B.; Loftness, V.; Hartkopf, V.; Sturm, M. A Fog Architecture for Decentralized Decision Making in Smart Buildings. In Proceedings of the 2017 2nd International Workshop on Science of Smart City Operations and Platforms Engineering, in Partnership with Global City Teams Challenge, SCOPE 2017, Pittsburgh, PA, USA, 21 April 2017; Association for Computing Machinery, Inc.: New York, NY, USA, 2017; pp. 34–39.
- Mansour, M.; Gamal, A.; Ahmed, A.I.; Said, L.A.; Elbaz, A.; Herencsar, N.; Soltan, A. Internet of Things: A Comprehensive Overview on Protocols, Architectures, Technologies, Simulation Tools, and Future Directions. Energies 2023, 16, 3465.
- Yousefpour, A.; Fung, C.; Nguyen, T.; Kadiyala, K.; Jalali, F.; Niakanlahiji, A.; Kong, J.; Jue, J.P. All One Needs to Know about Fog Computing and Related Edge Computing Paradigms: A Complete Survey. J. Syst. Archit. 2019, 98, 289–330.
- Kastner, W.; Jung, M.; Krammer, L. Future Trends in Smart Homes and Buildings. In Industrial Communication Technology Handbook, 2nd ed.; Zurawski, R., Ed.; CRC Press Taylor & Francis Group: Boca Raton, FL, USA, 2015; pp. 59-1–59-20. ISBN 978-1-4822-0732-3.
- Lobaccaro, G.; Carlucci, S.; Löfström, E. A Review of Systems and Technologies for Smart Homes and Smart Grids. Energies 2016, 9, 348.
- Bouchabou, D.; Nguyen, S.M.; Lohr, C.; LeDuc, B.; Kanellos, I. A Survey of Human Activity Recognition in Smart Homes Based on IoT Sensors Algorithms: Taxonomies, Challenges, and Opportunities with Deep Learning. Sensors 2021, 21, 6037.
- Grela, J.; Ożadowicz, A. Building Automation Planning and Design Tool Implementing EN 15 232 BACS Efficiency Classes. In Proceedings of the 2016 IEEE 21st International Conference on Emerging Technologies and Factory Automation (ETFA), Berlin, Germany, 6–9 September 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–4.
- Sharda, S.; Sharma, K.; Singh, M. A Real-Time Automated Scheduling Algorithm with PV Integration for Smart Home Prosumers. J. Build. Eng. 2021, 44, 102828.
- Sangoleye, F.; Jao, J.; Faris, K.; Tsiropoulou, E.E.; Papavassiliou, S. Reinforcement Learning-Based Demand Response Management in Smart Grid Systems With Prosumers. IEEE Syst. J. 2023, 17, 1797–1807.
- Ożadowicz, A. A New Concept of Active Demand Side Management for Energy Efficient Prosumer Microgrids with Smart Building Technologies. Energies 2017, 10, 1771.
- Sierla, S.; Pourakbari-Kasmaei, M.; Vyatkin, V. A Taxonomy of Machine Learning Applications for Virtual Power Plants and Home/Building Energy Management Systems. Autom. Constr. 2022, 136, 104174.
- Razghandi, M.; Zhou, H.; Erol-Kantarci, M.; Turgut, D. Smart Home Energy Management: Sequence-to-Sequence Load Forecasting and Q-Learning. In Proceedings of the 2021 IEEE Global Communications Conference (GLOBECOM), Madrid, Spain, 7–11 December 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–6.
- Zhang, H.; Wu, D.; Boulet, B. A Review of Recent Advances on Reinforcement Learning for Smart Home Energy Management. In Proceedings of the 2020 IEEE Electric Power and Energy Conference (EPEC), Piscataway, NJ, USA, 9–10 November 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1–6.
- Lu, R.; Hong, S.H.; Yu, M. Demand Response for Home Energy Management Using Reinforcement Learning and Artificial Neural Network. IEEE Trans. Smart Grid 2019, 10, 6629–6639.
- Radhamani, R.; Karthick, S.; Kishore Kumar, S.; Gokulraj, M. Deployment of an IoT-Integrated Home Energy Management System Employing Deep Reinforcement Learning. In Proceedings of the 2024 2nd International Conference on Artificial Intelligence and Machine Learning Applications Theme: Healthcare and Internet of Things (AIMLA), Namakkal, India, 15–16 March 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–4.
- Dhayalan, V.; Raman, R.; Kalaivani, N.; Shrirvastava, A.; Reddy, R.S.; Meenakshi, B. Smart Renewable Energy Management Using Internet of Things and Reinforcement Learning. In Proceedings of the 2024 2nd International Conference on Computer, Communication and Control (IC4), Indore, India, 8–10 February 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–5.
- Wang, Y.; Xiao, R.; Wang, X.; Liu, A. Constructing Autonomous, Personalized, and Private Working Management of Smart Home Products Based on Deep Reinforcement Learning. Procedia CIRP 2023, 119, 72–77.
- Chen, S.-J.; Chiu, W.-Y.; Liu, W.-J. User Preference-Based Demand Response for Smart Home Energy Management Using Multiobjective Reinforcement Learning. IEEE Access 2021, 9, 161627–161637.
- Angano, W.; Musau, P.; Wekesa, C.W. Design and Testing of a Demand Response Q-Learning Algorithm for a Smart Home Energy Management System. In Proceedings of the 2021 IEEE PES/IAS PowerAfrica, Virtual Conference, 23–27 August 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–5.
- Amer, A.A.; Shaban, K.; Massoud, A.M. DRL-HEMS: Deep Reinforcement Learning Agent for Demand Response in Home Energy Management Systems Considering Customers and Operators Perspectives. IEEE Trans. Smart Grid 2023, 14, 239–250.
- Liu, W.; Wang, Y.; Jiang, F.; Cheng, Y.; Rong, J.; Wang, C.; Peng, J. A Real-Time Demand Response Strategy of Home Energy Management by Using Distributed Deep Reinforcement Learning. In Proceedings of the 2021 IEEE 23rd International Conference on High Performance Computing & Communications; 7th International Conference on Data Science & Systems; 19th International Conference on Smart City; 7th International Conference on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys), Haikou, China, 20–22 December 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 988–995.
- Alfaverh, F.; Denai, M.; Sun, Y. Demand Response Strategy Based on Reinforcement Learning and Fuzzy Reasoning for Home Energy Management. IEEE Access 2020, 8, 39310–39321.
- Li, H.; Wan, Z.; He, H. A Deep Reinforcement Learning Based Approach for Home Energy Management System. In Proceedings of the 2020 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference (ISGT), Washington, DC, USA, 17–20 February 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1–5.
- Mathew, A.; Roy, A.; Mathew, J. Intelligent Residential Energy Management System Using Deep Reinforcement Learning. IEEE Syst. J. 2020, 14, 5362–5372.
- Ding, H.; Xu, Y.; Chew Si Hao, B.; Li, Q.; Lentzakis, A. A Safe Reinforcement Learning Approach for Multi-Energy Management of Smart Home. Electr. Power Syst. Res. 2022, 210, 108120.
- Chu, Y.; Wei, Z.; Sun, G.; Zang, H.; Chen, S.; Zhou, Y. Optimal Home Energy Management Strategy: A Reinforcement Learning Method with Actor-Critic Using Kronecker-Factored Trust Region. Electr. Power Syst. Res. 2022, 212, 108617.
- Lissa, P.; Deane, C.; Schukat, M.; Seri, F.; Keane, M.; Barrett, E. Deep Reinforcement Learning for Home Energy Management System Control. Energy AI 2021, 3, 100043.
- Liu, Y.; Zhang, D.; Gooi, H.B. Optimization Strategy Based on Deep Reinforcement Learning for Home Energy Management. CSEE J. Power Energy Syst. 2020, 6, 572–582.
- Kumari, A.; Tanwar, S. Reinforcement Learning for Multiagent-Based Residential Energy Management System. In Proceedings of the 2021 IEEE Globecom Workshops (GC Wkshps), Madrid, Spain, 7–11 December 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–6.
- Kumari, A.; Kakkar, R.; Tanwar, S.; Garg, D.; Polkowski, Z.; Alqahtani, F.; Tolba, A. Multi-Agent-Based Decentralized Residential Energy Management Using Deep Reinforcement Learning. J. Build. Eng. 2024, 87, 109031.
- Amer, A.; Shaban, K.; Massoud, A. Demand Response in HEMSs Using DRL and the Impact of Its Various Configurations and Environmental Changes. Energies 2022, 15, 8235.
- Roslann, A.; Asuhaimi, F.A.; Ariffin, K.N.Z. Energy Efficient Scheduling in Smart Home Using Deep Reinforcement Learning. In Proceedings of the 2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET), Kota Kinabalu, Malaysia, 13–15 September 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1–6.
- Xiong, L.; Tang, Y.; Liu, C.; Mao, S.; Meng, K.; Dong, Z.; Qian, F. Meta-Reinforcement Learning-Based Transferable Scheduling Strategy for Energy Management. IEEE Trans. Circuits Syst. I Regul. Pap. 2023, 70, 1685–1695.
- Kahraman, A.; Yang, G. Home Energy Management System Based on Deep Reinforcement Learning Algorithms. In Proceedings of the 2022 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT-Europe), Novi Sad, Serbia, 10–12 October 2022; IEEE: Piscataway, NJ, USA, 2022; Volume 2022, pp. 1–5.
- Aldahmashi, J.; Ma, X. Real-Time Energy Management in Smart Homes Through Deep Reinforcement Learning. IEEE Access 2024, 12, 43155–43172.
- Seveiche-Maury, Z.; Arrubla-Hoyos, W. Proposal of a Decision-Making Model for Home Energy Saving through Artificial Intelligence Applied to a HEMS. In Proceedings of the 2023 IEEE Colombian Caribbean Conference (C3), Barranquilla, Colombia, 22–25 November 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–6.
- Wei, G.; Chi, M.; Liu, Z.-W.; Ge, M.; Li, C.; Liu, X. Deep Reinforcement Learning for Real-Time Energy Management in Smart Home. IEEE Syst. J. 2023, 17, 2489–2499.
- Jiang, F.; Zheng, C.; Gao, D.; Zhang, X.; Liu, W.; Cheng, Y.; Hu, C.; Peng, J. A Novel Multi-Agent Cooperative Reinforcement Learning Method for Home Energy Management under a Peak Power-Limiting. In Proceedings of the 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Toronto, ON, Canada, 11–14 October 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 350–355.
- Diyan, M.; Khan, M.; Zhenbo, C.; Silva, B.N.; Han, J.; Han, K.J. Intelligent Home Energy Management System Based on Bi-Directional Long-Short Term Memory and Reinforcement Learning. In Proceedings of the 2021 International Conference on Information Networking (ICOIN), Jeju Island, Republic of Korea, 13–16 January 2021; IEEE: Piscataway, NJ, USA, 2021; Volume 2021, pp. 782–787.
- Zenginis, I.; Vardakas, J.; Koltsaklis, N.E.; Verikoukis, C. Smart Home’s Energy Management Through a Clustering-Based Reinforcement Learning Approach. IEEE Internet Things J. 2022, 9, 16363–16371.
- Haq, E.U.; Lyu, C.; Xie, P.; Yan, S.; Ahmad, F.; Jia, Y. Implementation of Home Energy Management System Based on Reinforcement Learning. Energy Rep. 2022, 8, 560–566.
- Thattai, K.; Ravishankar, J.; Li, C. Consumer-Centric Home Energy Management System Using Trust Region Policy Optimization-Based Multi-Agent Deep Reinforcement Learning. In Proceedings of the 2023 IEEE Belgrade PowerTech, Belgrade, Serbia, 25–29 June 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–6.
- Langer, L.; Volling, T. A Reinforcement Learning Approach to Home Energy Management for Modulating Heat Pumps and Photovoltaic Systems. Appl. Energy 2022, 327, 120020.
- Xiong, S.; Liu, D.; Chen, Y.; Zhang, Y.; Cai, X. A Deep Reinforcement Learning Approach Based Energy Management Strategy for Home Energy System Considering the Time-of-Use Price and Real-Time Control of Energy Storage System. Energy Rep. 2024, 11, 3501–3508.
- Lee, S.; Choi, D.-H. Reinforcement Learning-Based Energy Management of Smart Home with Rooftop Solar Photovoltaic System, Energy Storage System, and Home Appliances. Sensors 2019, 19, 3937.
- Abedi, S.; Yoon, S.W.; Kwon, S. Battery Energy Storage Control Using a Reinforcement Learning Approach with Cyclic Time-Dependent Markov Process. Int. J. Electr. Power Energy Syst. 2022, 134, 107368.
- Härtel, F.; Bocklisch, T. Minimizing Energy Cost in PV Battery Storage Systems Using Reinforcement Learning. IEEE Access 2023, 11, 39855–39865.
- Xu, G.; Shi, J.; Wu, J.; Lu, C.; Wu, C.; Wang, D.; Han, Z. An Optimal Solutions-Guided Deep Reinforcement Learning Approach for Online Energy Storage Control. Appl. Energy 2024, 361, 122915.
- Wang, B.; Zha, Z.; Zhang, L.; Liu, L.; Fan, H. Deep Reinforcement Learning-Based Security-Constrained Battery Scheduling in Home Energy System. IEEE Trans. Consum. Electron. 2024, 70, 3548–3561.
- Kumar, P.P.; Nuvvula, R.S.S.; Tan, C.C.; Al-Salman, G.A.; Guntreddi, V.; Raj, V.A.; Khan, B. Energy-Aware Vehicle-to-Grid (V2G) Scheduling with Reinforcement Learning for Renewable Energy Integration. In Proceedings of the 2024 12th International Conference on Smart Grid (icSmartGrid), Setubal, Portugal, 27–29 May 2024; pp. 345–349.
- Almughram, O.; Abdullah ben Slama, S.; Zafar, B.A. A Reinforcement Learning Approach for Integrating an Intelligent Home Energy Management System with a Vehicle-to-Home Unit. Appl. Sci. 2023, 13, 5539.
- Zhang, F.; Yang, Q.; An, D. CDDPG: A Deep-Reinforcement-Learning-Based Approach for Electric Vehicle Charging Control. IEEE Internet Things J. 2021, 8, 3075–3087.
- Li, S.; Hu, W.; Cao, D.; Dragicevic, T.; Huang, Q.; Chen, Z.; Blaabjerg, F. Electric Vehicle Charging Management Based on Deep Reinforcement Learning. J. Mod. Power Syst. Clean Energy 2022, 10, 719–730.
- Alfaverh, F.; Denaï, M.; Sun, Y. Electrical Vehicle Grid Integration for Demand Response in Distribution Networks Using Reinforcement Learning. IET Electr. Syst. Transp. 2021, 11, 348–361.
- Maeng, J.; Min, D.; Kang, Y. Intelligent Charging and Discharging of Electric Vehicles in a Vehicle-to-Grid System Using a Reinforcement Learning-Based Approach. Sustain. Energy Grids Netw. 2023, 36, 101224.
- Ding, T.; Zeng, Z.; Bai, J.; Qin, B.; Yang, Y.; Shahidehpour, M. Optimal Electric Vehicle Charging Strategy With Markov Decision Process and Reinforcement Learning Technique. IEEE Trans. Ind. Appl. 2020, 56, 5811–5823.
- Kaewdornhan, N.; Srithapon, C.; Liemthong, R.; Chatthaworn, R. Real-Time Multi-Home Energy Management with EV Charging Scheduling Using Multi-Agent Deep Reinforcement Learning Optimization. Energies 2023, 16, 2357.
- Suleman, A.; Amin, M.A.; Fatima, M.; Asad, B.; Menghwar, M.; Hashmi, M.A. Smart Scheduling of EVs Through Intelligent Home Energy Management Using Deep Reinforcement Learning. In Proceedings of the 2022 17th International Conference on Emerging Technologies (ICET), Swabi, Pakistan, 29–30 November 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 18–24.
- Markiewicz, M.; Skała, A.; Grela, J.; Janusz, S.; Stasiak, T.; Latoń, D.; Bielecki, A.; Bańczyk, K. The Architecture for Testing Central Heating Control Algorithms with Feedback from Wireless Temperature Sensors. Energies 2023, 16, 5584.
- van Tilburg, J.; Siebert, L.C.; Cremer, J.L. MARL-IDR: Multi-Agent Reinforcement Learning for Incentive-Based Residential Demand Response. In Proceedings of the 2023 IEEE Belgrade PowerTech, Belgrade, Serbia, 25–29 June 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–8.
- Sun, Y.; Zhang, S.; Liu, M.; Zheng, R.; Dong, S. Energy Management Based on Safe Multi-Agent Reinforcement Learning for Smart Buildings in Distribution Networks. Energy Build. 2024, 318, 114410.
- Liu, J.; Liu, P.; Feng, L.; Wu, W.; Li, D.; Chen, Y.F. Automated Clash Resolution for Reinforcement Steel Design in Concrete Frames via Q-Learning and Building Information Modeling. Autom. Constr. 2020, 112, 103062.
- Li, A.; Xiao, F.; Fan, C.; Hu, M. Development of an ANN-Based Building Energy Model for Information-Poor Buildings Using Transfer Learning. Build. Simul. 2021, 14, 89–101.
- Pinto, G.; Wang, Z.; Roy, A.; Hong, T.; Capozzoli, A. Transfer Learning for Smart Buildings: A Critical Review of Algorithms, Applications, and Future Perspectives. Adv. Appl. Energy 2022, 5, 100084.
- Ali, S.M.M.; Augusto, J.C.; Windridge, D. A Survey of User-Centred Approaches for Smart Home Transfer Learning and New User Home Automation Adaptation. Appl. Artif. Intell. 2019, 33, 747–774.
- Arun, S.L.; Selvan, M.P. Intelligent Residential Energy Management System for Dynamic Demand Response in Smart Buildings. IEEE Syst. J. 2017, 12, 1329–1340.
- Ghenai, C.; Husein, L.A.; Al Nahlawi, M.; Hamid, A.K.; Bettayeb, M. Recent Trends of Digital Twin Technologies in the Energy Sector: A Comprehensive Review. Sustain. Energy Technol. Assess. 2022, 54, 102837.
- Cheng, N.; Wang, X.; Li, Z.; Yin, Z.; Luan, T.; Shen, X.S. Toward Enhanced Reinforcement Learning-Based Resource Management via Digital Twin: Opportunities, Applications, and Challenges; IEEE Network: New York, NJ, USA, 2024; p. 1.
- Henzel, J.; Wróbel, Ł.; Fice, M.; Sikora, M. Energy Consumption Forecasting for the Digital-Twin Model of the Building. Energies 2022, 15, 4318.
- Ceccolini, C.; Sangi, R. Benchmarking Approaches for Assessing the Performance of Building Control Strategies: A Review. Energies 2022, 15, 1270.
- Pan, Y.; Shen, Y.; Qin, J.; Zhang, L. Deep Reinforcement Learning for Multi-Objective Optimization in BIM-Based Green Building Design. Autom. Constr. 2024, 166, 105598.
- Shaqour, A.; Hagishima, A. Systematic Review on Deep Reinforcement Learning-Based Energy Management for Different Building Types. Energies 2022, 15, 8663.
- Qi, T.; Ye, C.; Zhao, Y.; Li, L.; Ding, Y. Deep Reinforcement Learning Based Charging Scheduling for Household Electric Vehicles in Active Distribution Network. J. Mod. Power Syst. Clean Energy 2023, 11, 1890–1901.
- Jendoubi, I.; Bouffard, F. Multi-Agent Hierarchical Reinforcement Learning for Energy Management. Appl. Energy 2023, 332, 120500.
- Qin, Y.; Ke, J.; Wang, B.; Filaretov, G.F. Energy Optimization for Regional Buildings Based on Distributed Reinforcement Learning. Sustain. Cities Soc. 2022, 78, 103625.
- Anvari-Moghaddam, A.; Rahimi-Kian, A.; Mirian, M.S.; Guerrero, J.M. A Multi-Agent Based Energy Management Solution for Integrated Buildings and Microgrid System. Appl. Energy 2017, 203, 41–56.
- Kumar Nunna, H.S.V.S.; Srinivasan, D. Multi-Agent Based Transactive Energy Framework for Distribution Systems with Smart Microgrids. IEEE Trans. Ind. Inform. 2017, 13, 2241–2250.
- Vamvakas, D.; Michailidis, P.; Korkas, C.; Kosmatopoulos, E. Review and Evaluation of Reinforcement Learning Frameworks on Smart Grid Applications. Energies 2023, 16, 5326.
- Zhang, L.; Gao, Y.; Zhu, H.; Tao, L. A Distributed Real-Time Pricing Strategy Based on Reinforcement Learning Approach for Smart Grid. Expert Syst. Appl. 2022, 191, 116285.
- Huang, X.; Zhang, D.; Zhang, X. Energy Management of Intelligent Building Based on Deep Reinforced Learning. Alex. Eng. J. 2021, 60, 1509–1517.
- Wang, Z.; Xiao, F.; Ran, Y.; Li, Y.; Xu, Y. Scalable Energy Management Approach of Residential Hybrid Energy System Using Multi-Agent Deep Reinforcement Learning. Appl. Energy 2024, 367, 123414.
- Knap, P.; Gerding, E. Energy Storage in the Smart Grid: A Multi-Agent Deep Reinforcement Learning Approach. In Trends in Clean Energy Research: Proceedings of the 9th International Conference on Advances on Clean Energy Research (ICACER 2024), Lille, France, 27–29 April 2024; Chen, L., Ed.; Springer Nature: Cham, Switzerland, 2024; pp. 221–235.
- Sobhani, A.; Khorshidi, F.; Fakhredanesh, M. DeePLS: Personalize Lighting in Smart Home by Human Detection, Recognition, and Tracking. SN Comput. Sci. 2023, 4, 773.
- Safaei, D.; Sobhani, A.; Kiaei, A.A. DeePLT: Personalized Lighting Facilitates by Trajectory Prediction of Recognized Residents in the Smart Home. Int. J. Inf. Technol. 2024, 16, 2987–2999.
- Manganelli, M.; Consalvi, R. Design and Energy Performance Assessment of High-Efficiency Lighting Systems. In Proceedings of the 2015 IEEE 15th International Conference on Environment and Electrical Engineering (EEEIC), Rome, Italy, 10–13 June 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 1035–1040.
- Liu, J.; Chen, H.-M.; Li, S.; Lin, S. Adaptive and Energy-Saving Smart Lighting Control Based on Deep Q-Network Algorithm. In Proceedings of the 2021 6th International Conference on Control, Robotics and Cybernetics (CRC), Shanghai, China, 9–11 October 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 207–211.
- Suman, S.; Rivest, F.; Etemad, A. Toward Personalization of User Preferences in Partially Observable Smart Home Environments. IEEE Trans. Artif. Intell. 2023, 4, 549–561.
- Almilaify, Y.; Nweye, K.; Nagy, Z. SCALEX: Scalability Exploration of Multi-Agent Reinforcement Learning Agents in Grid-Interactive Efficient Buildings. In Proceedings of the 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Istanbul, Turkey, 15–16 November 2023; ACM: New York, NY, USA, 2023; pp. 261–264.
- Khan, M.A.; Saleh, A.M.; Waseem, M.; Sajjad, I.A. Artificial Intelligence Enabled Demand Response: Prospects and Challenges in Smart Grid Environment. IEEE Access 2023, 11, 1477–1505.
- Gao, Y.; Li, S.; Xiao, Y.; Dong, W.; Fairbank, M.; Lu, B. An Iterative Optimization and Learning-Based IoT System for Energy Management of Connected Buildings. IEEE Internet Things J. 2022, 9, 21246–21259.
- Malagnino, A.; Montanaro, T.; Lazoi, M.; Sergi, I.; Corallo, A.; Patrono, L. Building Information Modeling and Internet of Things Integration for Smart and Sustainable Environments: A Review. J. Clean. Prod. 2021, 312, 127716.
- Pinthurat, W.; Surinkaew, T.; Hredzak, B. An Overview of Reinforcement Learning-Based Approaches for Smart Home Energy Management Systems with Energy Storages. Renew. Sustain. Energy Rev. 2024, 202, 114648.
- Sheng, R.; Mu, C.; Zhang, X.; Ding, Z.; Sun, C. Review of Home Energy Management Systems Based on Deep Reinforcement Learning. In Proceedings of the 2023 38th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2023, Hefei, China, 27–29 August 2023; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2023; pp. 1239–1244.
- Daneshvar, M.; Pesaran, M.; Mohammadi-ivatloo, B. Transactive Energy in Future Smart Homes. In The Energy Internet; Su, W., Huang, A.Q., Eds.; Elsevier: Amsterdam, The Netherlands, 2019; pp. 153–179.
- Rodrigues, S.D.; Garcia, V.J. Transactive Energy in Microgrid Communities: A Systematic Review. Renew. Sustain. Energy Rev. 2023, 171, 112999.
- Nizami, S.; Tushar, W.; Hossain, M.J.; Yuen, C.; Saha, T.; Poor, H.V. Transactive Energy for Low Voltage Residential Networks: A Review. Appl. Energy 2022, 323, 119556.
Database | Publication Type | Building Automation | Home Automation | Reinforcement Learning | Building Automation + Reinforcement Learning | Home Automation + Reinforcement Learning |
---|---|---|---|---|---|---|
Web of Science | Articles | 13,770 | 2368 | 46,764 | 164 | 20 |
Web of Science | Reviews | 888 | 179 | 2007 | 13 | 3 |
Scopus | Articles | 11,628 | 8481 | 51,883 | 103 | 101 |
Scopus | Reviews | 967 | 622 | 3206 | 9 | 3 |
Google Scholar | Any type | 3,150,000 | 3,170,000 | 4,680,000 | 250,000 | 204,000 |
Google Scholar | Reviews | 172,000 | 191,000 | 63,200 | 24,600 | 21,500 |
Database | Publication Type | Building Automation | Home Automation | Reinforcement Learning | Building Automation + Reinforcement Learning | Home Automation + Reinforcement Learning |
---|---|---|---|---|---|---|
Springer | Articles | 36,848 | 15,339 | 64,648 | 2434 | 1073 |
Springer | Reviews | 2898 | 1297 | 5232 | 462 | 173 |
Science Direct | Articles | 70,247 | 25,541 | 83,149 | 3760 | 1312 |
Science Direct | Reviews | 8619 | 4046 | 12,9646 | 1323 | 579 |
MDPI | Articles | 1346 | 379 | 3797 | 17 | 2 |
MDPI | Reviews | 133 | 47 | 261 | 7 | 1 |
IEEE Xplore | Conferences | 28,815 | 8194 | 31,831 | 430 | 65 |
IEEE Xplore | Journals | 5861 | 1032 | 876 | 202 | 24 |
Taylor and Francis | Articles | 154,033 | 54,445 | 310,108 | 16,794 | 9081 |
Taylor and Francis | Reviews | 4498 | 1737 | 5397 | 512 | 228 |
ACM Digital Library | All type | 149,403 | 28,663 | 47,165 | 13,237 | 4325 |
ACM Digital Library | Reviews | 201 | 43 | 62 | 18 | 4 |
Wiley Online Library | Journal | 236,565 | 71,939 | 223,645 | 18,307 | 10,196 |
Wiley Online Library | Books | 45,956 | 18,855 | 36,651 | 5294 | 3001 |
Reference/Year | Application | Algorithm Method | Objectives | Verification |
---|---|---|---|---|
[95] 2024 | IoT | Deep Reinforcement Learning (DRL) | Cost and Comfort | Simulation |
[96] 2024 | IoT | Deep Q-learning | Cost and Comfort | Simulation |
[97] 2023 | IoT | Q-learning | Other (Autonomy, Personalization, and Privacy) | Simulation |
[34] 2020 | IoT | Deep Deterministic Policy Gradients (DDPG) | Cost and Comfort | Simulation |
[98] 2021 | Demand Response | Multi-Objective Reinforcement Learning (MORL) | Cost and Comfort | Simulation |
[99] 2021 | Demand Response | Q-learning | Cost and Comfort | Real (Physical system testing using MATLAB and Arduino Uno) |
[100] 2023 | Demand Response | Deep Q-network (DQN) | Cost and Comfort | Simulation (evaluated using real-world data) |
[101] 2021 | Demand Response | MATD3—Multi-Agent Twin Delayed Deep Deterministic Policy Gradient | Cost and Comfort | Simulation (evaluated using real-world data) |
[102] 2020 | Demand Response | Q-learning combined with Fuzzy Reasoning | Cost | Simulation |
[94] 2019 | Demand Response | Multi-Agent Reinforcement Learning (MARL) combined with Artificial Neural Networks (ANN) | Cost and Comfort | Simulation |
[103] 2020 | Demand Response | Proximal Policy Optimization (PPO) | Cost | Simulation |
[31] 2023 | Demand Response | Q-learning combined with Fuzzy Reasoning | Cost and Comfort | Simulation |
[35] 2021 | Demand Response | DDPG with Dual Targeting Algorithm (DTA) | Cost and Comfort | Simulation |
[104] 2020 | Demand Response | DQN | Cost | Simulation |
[105] 2022 | Demand Response | Primal-Dual Deep Deterministic Policy Gradient (PD-DDPG) | Cost | Simulation |
[106] 2022 | Demand Response | Actor–Critic using Kronecker-Factored Trust Region (ACKTR) | Cost and Comfort | Simulation (evaluated using real-world data) |
[107] 2021 | Demand Response | DRL | Cost and Comfort | Simulation |
[108] 2020 | Demand Response | DQN and Double Deep Q-learning (DDQN) | Cost and Comfort | Simulation (validated using a real-world database combined with the household energy storage model) |
[109] 2021 | Demand Response | Q-learning | Cost | Simulation |
[110] 2024 | Demand Response | DQN | Cost and Comfort | Simulation |
[92] 2021 | Demand Response | Q-learning | Cost | Simulation |
[111] 2022 | Demand Response | DQN | Cost and Comfort | Simulation |
[6] 2024 | Scheduling | DQN, Advantage Actor–Critic (A2C), and Proximal Policy Optimization (PPO) | Cost | Simulation |
[112] 2022 | Scheduling | Q-learning | Cost and Comfort | Simulation |
[113] 2023 | Scheduling | Meta-Reinforcement Learning (Meta-RL) with Long Short-Term Memory (LSTM) | Cost | Simulation (using practical data from Australia’s electricity network) |
[114] 2022 | Scheduling | DQN, DDPG, and Twin Delayed Deep Deterministic Policy Gradient (TD3) | Cost | Simulation |
[115] 2024 | Scheduling | PPO | Cost | Simulation (using real-world datasets) |
[116] 2023 | Scheduling | DQN | Cost and Comfort | Real (using real-time data from a test bench with household devices) |
[117] 2023 | Scheduling | PPO | Cost and Comfort | Simulation (based on real-world data) |
[118] 2020 | Scheduling | Multi-agent Deep Deterministic Policy Gradient (MADDPG) | Cost | Simulation |
[119] 2021 | Scheduling | Q-learning | Cost and Comfort | Simulation |
[120] 2022 | Scheduling | DDPG | Cost and Comfort | Simulation |
[121] 2022 | Scheduling | Q-learning | Cost and Comfort | Simulation |
[3] 2020 | Scheduling | Q-learning | Cost and Comfort | Simulation |
[122] 2023 | RES + Storage | Trust Region Policy Optimization (TRPO) based Multi-Agent Deep Reinforcement Learning (DRL) | Cost and Comfort | Simulation (using real-world data from the Australian National Electricity Market and PV profiles) |
[123] 2022 | RES + Storage | DDPG | Cost and Comfort | Simulation |
[124] 2024 | RES + Storage | SAC | Cost | Simulation |
[125] 2019 | RES + Storage | Q-learning | Cost and Comfort | Simulation |
[126] 2022 | RES + Storage | Q-learning | Cost | Simulation |
[127] 2023 | RES + Storage | PPO with LSTM networks | Cost | Simulation |
[128] 2024 | RES + Storage | DRL, specifically DDPG and PPO | Cost | Simulation |
[129] 2024 | RES + Storage | Actor–Critic-based RL with Distributional Critic Net | Cost | Simulation |
[130] 2024 | EV (V2G) | Deep Q-Learning, MDP | Cost and grid stability | Simulation (based on real-world data) |
[131] 2023 | EV (V2G) | Q-Learning, RL-HCPV | Cost and Comfort | Simulation (based on real-world data) |
[132] 2021 | EV | Charging Control Deep Deterministic Policy Gradient (CDDPG) | Cost | Simulation (based on real-world data) |
[133] 2022 | EV | Deep Reinforcement Learning (DRL), LSTM | Cost | Simulation (based on real-world data) |
[134] 2021 | EV (V2G) | Q-Learning | Cost and Comfort | Simulation (based on real-world data) |
[135] 2023 | EV (V2G) | Model-Free RL | Cost | Simulation (based on real-world data) |
[136] 2020 | EV | DDPG, MDP | Cost and grid stability | Simulation (based on real-world data) |
[137] 2023 | EV + Scheduling | Multi-Agent Deep Reinforcement Learning (MADRL) | Cost and Comfort | Simulation (based on real-world data) |
[138] 2022 | EV + Scheduling | Deep Q-Network (DQN), Double DQN, Dueling DQN | Cost and Comfort | Simulation |
Opportunity | Home Automation | Building Automation |
---|---|---|
Demand Response and Load Shifting | | |
Integration with Renewable Energy | | |
Energy Storage Management | | |
Smart Lighting and Occupancy-based Control | | |
Scalability and Complexity | | |
Integration with Smart Grids and IoT | | |
Renewable Energy Prosumers | | |
Integration of EVs | | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Latoń, D.; Grela, J.; Ożadowicz, A. Applications of Deep Reinforcement Learning for Home Energy Management Systems: A Review. Energies 2024, 17, 6420. https://doi.org/10.3390/en17246420