Optimal Control Policy for Energy Management of a Commercial Bank
Abstract
:1. Introduction
2. Literature Review
- Design of specific MDP model for solving the building energy management problem for a bank branch building, in addition to a proposed computational saving mechanism by decomposing the problem into two sub-problems, i.e., lighting control and thermal control (Section 4).
- Specification of the practical implementation process involved with the proposed model (Section 3).
- Presentation of a sample case study and analysis of the simulation results based on the statistical data obtained from an actual bank branch building in Lahore, Pakistan (Section 5).
- Comparison of the proposed model with a generic MDP-based model for building energy management, highlighting the advantages of the proposed model over the existing approach (Section 6).
3. Background and Problem Formulation
3.1. Description of Problem
- A bank building has two types of occupants: one that is fixed, i.e., the number of bank officers and staff, and second, the customers.
- Number of customers visiting a branch can vary from 200 to 700 per day. This number increases before festivals and on salary days of the month.
- Customer Service is fixed by a specific time zones, normally from 9 a.m. to 5 p.m.
- It is mostly active for 5 days a week.
- It has an Automatic Teller Machine (ATM) room that is operational 24 h.
- It has an IT room that is also operational 24 h.
- Size of a branch of a bank is usually between 1300 sq. feet to 2000 sq. feet. However, the head office of a bank can be larger and can have multiple floors.
- Most of the staff have computers active during active hours of operation of the bank.
- Multiple air conditioning units are used in a branch.
3.2. Markov Decision Process (MDP)
- A set of states, S (assumed discrete).
- A set of actions, A (assumed discrete).
- Transition probabilities, P, which define the probability distribution over next states given the current state and current action P(St+1|St,At). Here the subscript represents the decision instant (or time).
- A policy is a mapping from states to actions.
- A value function for a policy that gives the expected sum of discounted rewards when acting under that policy .
3.3. Scalability Issues
4. Proposed MDP Model
4.1. Set of States
4.2. Actions
4.3. Transition Probabilities
4.4. Reward/Cost Functions
4.5. Implementation Methodology
5. Case Study
5.1. Parameter Values for Temperature Control
5.2. Parameter Values for Lighting Control
5.3. Calculation of Optimal Policies
6. Results
6.1. Simulation Case 1: No Change in Parameters
6.2. Simulation Case 2: Variation in Number of Customers
6.3. Simulation Case 3: Desired Temperature
6.4. Simulation Case 4: Effect of Power Sources
6.5. Simulation Case 5: Desired Lighting
6.6. Comparison with Existing Approaches
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Code Availability
Abbreviations
Symbol | Meaning |
Value of state under policy | |
Expected value | |
Discount factor | |
Reward of state | |
Set of states | |
An element of | |
Number of customers in ith state | |
Status of jth air conditioner in ith state | |
Status of jth light bulb in ith state | |
Amount of lighting required in ith state | |
Difference between the actual and the desired temperature in ith state | |
Active power source in ith state | |
Set of actions | |
An element of | |
Probability | |
Cost associated with state | |
Probability of increase in cost given cost | |
Probability of decrease in cost given cost |
References
- Rinkesh. Causes and Solutions to the Global Energy Crisis. Available online: https://www.conserve-energy-future.com/causes-and-solutions-to-the-global-energy-crisis.php (accessed on 13 March 2022).
- Dounis, A.; Caraiscos, C. Advanced control systems engineering for energy and comfort management in a building environment—A review. Renew. Sustain. Energy Rev. 2009, 13, 1246–1261. [Google Scholar] [CrossRef]
- Costa, A.; Keane, M.; Torrens, J.I.; Corry, E. Building operation and energy performance: Monitoring, analysis and optimization toolkit. Appl. Energy 2013, 101, 310–316. [Google Scholar] [CrossRef]
- Whitehouse, K.; Ranjan, J.; Lu, J.; Sookoor, T.; Saadat, M.; Burke, C.M.; Staengl, G.; Canfora, A.; Haj-Hariri, H. Towards occupancy-driven heating and cooling. IEEE Des.Test Comput. 2012, 29, 17–25. [Google Scholar] [CrossRef]
- Luo, Z.; Lu, Y.; Cang, Y.; Yang, L. Study on dual-objective optimization method of life cycle energy consumption and economy of office building based on HypE genetic algorithm. Energy Build. 2022, 256, 111749. [Google Scholar] [CrossRef]
- Hachem-Vermette, C.; Singh, K. Optimization of energy resources in various building cluster archetypes. Renew. Sustain. Energy Rev. 2022, 157, 112050. [Google Scholar] [CrossRef]
- Iqbal, N.; Kim, D.H. IoT Task Management Mechanism Based on Predictive Optimization for Efficient Energy Consumption in Smart Residential Buildings. Energy Build. 2022, 257, 111762. [Google Scholar]
- Xu, Y.; Zhang, G.; Yan, C.; Wang, G.; Jiang, Y.; Zhao, K. A two-stage multi-objective optimization method for envelope and energy generation systems of primary and secondary school teaching buildings in China. Build. Environ. 2021, 204, 108142. [Google Scholar] [CrossRef]
- Jain, A.; Behl, M.; Mangharam, R. Data Predictive Control for building energy management. In Proceedings of the 2017 American Control Conference (ACC), Seattle, WA, USA, 24–26 May 2017; pp. 44–49. [Google Scholar]
- Liu, J.; Tang, H.; Matsui, M.; Takanokura, M.; Zhou, L.; Gao, X.; Jing, L.; Hao, T.; Matsui, M.; Lei, Z.; et al. Optimal management of energy storage system based on reinforcement learning. In Proceedings of the 33rd Chinese Control Conference (CCC), Seattle, WA, USA, 24–26 May 2014. [Google Scholar]
- Dalamagkidis, K.; Kolokotsa, D.; Kalaitzakis, K.; Stavrakakis, G. Reinforcement learning for energy conservation and comfort in buildings. Build. Environ. 2007, 42, 2686–2698. [Google Scholar] [CrossRef]
- Ruelens, F.; Claessens, B.J.; Vandael, S.; De Schutter, B.; Babuska, R.; Belmans, R. Residential demand response of thermostatically controlled loads using batch reinforcement learning. IEEE Trans. Smart Grid 2017, 8, 2519–2531. [Google Scholar] [CrossRef] [Green Version]
- O’Neill, D.; Levorato, M.; Goldsmith, A.; Mitra, U. Residential demand response using reinforcement learning. Smart Grid Communications (SmartGridComm). In Proceedings of the 2010 First IEEE International Conference, Gaithersburg, MD, USA, 4–6 October 2010. [Google Scholar]
- Kasahara, M.; Matsuba, T.; Kuzuu, Y.; Yamazaki, T. Design and tuning of robust PID controller for HVAC systems. ASHRAE Trans. 1999, 105, 154. [Google Scholar]
- Mathews, E.H.; Arndt, D.C.; Piani, C.B.; Van Heerden, E. Developing cost efficient control strategies to ensure optimal energy use and sufficient indoor comfort. Appl. Energy 2000, 66, 135–159. [Google Scholar] [CrossRef]
- Henze, G.P.; Dodier, R.H.; Krarti, M. Development of a predictive optimal controller for thermal energy storage systems. HVACR Res. 1997, 3, 233–264. [Google Scholar] [CrossRef]
- Curtiss, P.S.; Kreider, J.F.; Shavit, G. Neural Networks Applied to Buildings—A Tutorial and Case Studies in Prediction and Adaptive Control; No. CONF-960254; American Society of Heating, Refrigerating and Air-Conditioning Engineers, Inc.: Atlanta, GA, USA, 1996. [Google Scholar]
- Yang, R.; Wang, L. Multi-zone building energy management using intelligent control and optimization. Sustain. Cities Soc. 2013, 6, 16–21. [Google Scholar] [CrossRef]
- Wang, L.; Wang, Z.; Yang, R. Intelligent multiagent control system for energy and comfort management in smart and sustainable buildings. IEEE Trans. Smart Grid 2012, 3, 605–617. [Google Scholar] [CrossRef]
- Kolokotsa, D.; Pouliezos, A.; Stavrakakis, G.; Lazos, C. Predictive control techniques for energy and indoor environmental quality management in buildings. Build. Environ. 2009, 44, 1850–1863. [Google Scholar] [CrossRef]
- Cook, D.; Youngblood, M.; Heierman, E.; Gopalratnam, K.; Rao, S.; Litvin, A.; Khawaja, F. MavHome: An Agent-Based Smart Home. In Proceedings of the First IEEE International Conference on Pervasive Computing and Communications, (PerCom 2003), Fort Worth, TX, USA, 4 March 2003. [Google Scholar]
- Ożadowicz, A. A Hybrid Approach in Design of Building Energy Management System with Smart Readiness Indicator and Building as a Service Concept. Energies 2022, 15, 1432. [Google Scholar] [CrossRef]
- Mariano-Hernández, D.; Callejo, L.H.; Zorita-Lamadrid, A.; Duque-Pérez, O.; García, F.S. A review of strategies for building energy management system: Model predictive control, demand side management, optimization, and fault detect & diagnosis. J. Build. Eng. 2021, 33, 101692. [Google Scholar]
- Yu, L.; Qin, S.; Zhang, M.; Shen, C.; Jiang, T.; Guan, X. A review of deep reinforcement learning for smart building energy management. IEEE Internet Things J. 2021, 8, 12046–12063. [Google Scholar] [CrossRef]
- Gomes, J.; Paiva, A.; Costa, S.; Balestrassi, P.; Paiva, E. Weighted multivariate mean square error for processes optimization: A case study on flux-cored arc welding for stainless steel claddings. Eur. J. Oper. Res. 2013, 226, 522–535. [Google Scholar] [CrossRef]
Sr. No. | Parameter Description | Values and Ranges |
---|---|---|
1 | No. of air conditioners | 0 to 4 |
2 | Modes of each air conditioner | 4 (0, 1, 2, 3) |
3 | Number of customers | 0 to 9 |
4 | Power modes | 2 (1, 2) |
5 | Range of desirable temperature | 24 to 27 |
6 | Range of room temperature | 24 to 41 |
7 | α1 | 1 |
8 | α2 | 2 |
9 | Total states | 368,639 |
Sr. No. | Range of Customers | Temperature Range | |
---|---|---|---|
1 | C ≥ 5 | t (24, 28) | 0.75 |
2 | C ≥ 5 | t (29, 35) | 0.8 |
3 | C ≤ 5 | t (24, 28) | 0.85 |
4 | C ≤ 5 | t (29, 35) | 0.9 |
5 | C ≥ 5 | t (36, 41) | 0.95 |
6 | C ≤ 5 | t (36, 41) | 0.98 |
Sr. No. | Parameter Description | Values and Ranges |
---|---|---|
1 | Number of lights | 10 |
2 | Mode of light | 2 (0, 1) |
3 | Desired lighting conditions | 0 to 10 |
4 | Number of customers | 0 to 9 |
5 | Power modes | 2 (1, 2) |
6 | α1 | 1 |
7 | α2 | 2 |
8 | Total states | 225,280 |
Sr. No. | Proposed Approach | Previous Work |
---|---|---|
1 | Specific for a particular type of building | Generic model was given |
2 | Time dependent | Time not taken into consideration |
3 | Different MDP model, i.e., decomposition is with respect to time | Model decomposition was goal based |
4 | Multiple air conditioning units | Only one centralized air conditioning unit |
5 | Multiple lighting switches with separate control of each | Only one lighting switch that controlled lighting levels |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Tahir, I.; Nasir, A.; Algethami, A. Optimal Control Policy for Energy Management of a Commercial Bank. Energies 2022, 15, 2112. https://doi.org/10.3390/en15062112
Tahir I, Nasir A, Algethami A. Optimal Control Policy for Energy Management of a Commercial Bank. Energies. 2022; 15(6):2112. https://doi.org/10.3390/en15062112
Chicago/Turabian StyleTahir, Ifrah, Ali Nasir, and Abdullah Algethami. 2022. "Optimal Control Policy for Energy Management of a Commercial Bank" Energies 15, no. 6: 2112. https://doi.org/10.3390/en15062112
APA StyleTahir, I., Nasir, A., & Algethami, A. (2022). Optimal Control Policy for Energy Management of a Commercial Bank. Energies, 15(6), 2112. https://doi.org/10.3390/en15062112