Artificial Intelligence-Based Adaptive Traffic Signal Control System: A Comprehensive Review
Abstract
1. Introduction
- Investigates ATSC systems by the number of road intersections (RIs) they control, viz., single intersection (SI) and multiple intersection (MI).
- Investigates the techniques used to design an ATSC system, viz., Fuzzy Logic (FL), Metaheuristic (MH), Dynamic Programming (DP), Reinforcement Learning (RL), Deep Reinforcement Learning (DRL), and Hybrid techniques.
- Further investigates these techniques in (a) a single-intersection environment (non-cooperative ATSC, in which the intersection is isolated) and (b) a multiple-intersection environment (cooperative ATSC).
- Evaluates gaps in knowledge and unaddressed challenges for ATSC and outlines possible future research directions.
2. Preliminaries
2.1. Traffic Signal Control System
2.2. Traffic Signal Timing
- Signal Phase: A signal phase is a period during which a specific movement, such as a vehicle or pedestrian movement, is given the right of way at an intersection controlled by traffic lights. It is part of the traffic signal cycle that manages traffic flow [4].
- Signal Cycle: One complete rotation of the signal in which each traffic movement is signaled in turn to allow safe passage through a specific road intersection.
- Signal Sequence: The order in which signals are phased during a signal cycle.
- Cycle Length: The number of seconds needed for a signal to go through one full signal cycle.
- Green Time: The number of seconds during which a certain traffic flow at an RI can proceed at the maximum flow rate in a particular traffic lane.
- Timing of Phases: The amount of time in seconds taken by a particular phase for a given direction during one complete signal cycle [4].
- Change Interval: The yellow interval that follows the green period of a movement.
- Offset: The temporal relationship between synchronized phases. Extending the green phase duration for a specific movement can decrease the number of stopped vehicles and the delay. On the other hand, a longer green period for one traffic flow typically increases the delay and the number of vehicles queuing in the competing traffic flows. Therefore, a proper traffic signal strategy allots time so as to optimize an overall traffic measure, such as the average waiting time.
- Average Waiting Time: The average time a vehicle takes to leave a road intersection after it stops at the intersection.
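The timing terms defined above can be tied together in a small data structure. The following is a minimal sketch, not a model taken from any of the reviewed studies: the class names `Phase` and `TimingPlan`, the helper methods, and all numeric values are illustrative assumptions, and all-red clearance intervals are ignored for simplicity.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class Phase:
    """One signal phase: a movement that is given the right of way."""
    name: str
    green_s: float   # green time in seconds (illustrative value)
    change_s: float  # change (yellow) interval in seconds (illustrative value)


@dataclass(frozen=True)
class TimingPlan:
    """Signal sequence for one intersection; list order is the phase order."""
    phases: List[Phase]
    offset_s: float = 0.0  # offset relative to a reference intersection

    def cycle_length(self) -> float:
        # Cycle length: seconds needed to serve every phase once.
        return sum(p.green_s + p.change_s for p in self.phases)

    def green_share(self, name: str) -> float:
        # Fraction of the cycle during which the named phase shows green.
        green = sum(p.green_s for p in self.phases if p.name == name)
        return green / self.cycle_length()

    def extend_green(self, name: str, extra_s: float) -> "TimingPlan":
        # Return a new plan with a longer green for one phase. The cycle
        # grows by the same amount, so competing phases wait longer.
        phases = [
            replace(p, green_s=p.green_s + extra_s) if p.name == name else p
            for p in self.phases
        ]
        return TimingPlan(phases, self.offset_s)


if __name__ == "__main__":
    plan = TimingPlan([Phase("north_south", 30.0, 4.0),
                       Phase("east_west", 20.0, 4.0)])
    longer = plan.extend_green("north_south", 10.0)
    print(plan.cycle_length(), longer.cycle_length())
    print(plan.green_share("east_west"), longer.green_share("east_west"))
```

Extending the north-south green in this sketch lengthens the cycle and lowers the east-west share of green, which is the trade-off between competing flows described in the Offset entry.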
2.3. Microsimulation Tools
2.3.1. Microscopic
- Open source;
- System:
  - Discrete control system (DC)
  - Continuous control system (CC);
- Visualization (2D/3D);
- Scope of application:
  - Regional (R)
  - City (C)
  - Country (Co);
- Output file format;
- Capability of importing maps;
- Programming language supported;
- Level of programming proficiency required.
2.3.2. Macroscopic
3. Review of Previous Research
- Number of road intersections considered (SI/MI);
- Objective;
- Methods/parameters used;
- Control system strategy;
- Data source;
- Microsimulation tool used.
3.1. Single-Intersection ATSC (SI-ATSC)
3.1.1. SI-ATSC Using Reinforcement Learning: (SI-ATSC-RL)
3.1.2. SI-ATSC Using Metaheuristic: (SI-ATSC-MH)
3.1.3. SI-ATSC Using Fuzzy Logic: (SI-ATSC-FL)
3.2. Multiple-Intersection ATSC (MI-ATSC)
3.2.1. MI-ATSC Using Reinforcement Learning: (MI-ATSC-RL)
3.2.2. MI-ATSC Using Metaheuristic: (MI-ATSC-MH)
3.2.3. MI-ATSC Using Hybrid Approach: (MI-ATSC-Hybrid)
4. Discussion
- In developing countries, the demand for transport infrastructure is increasing exponentially, but government agencies cannot provide this infrastructure in a short time frame. This creates an urgent need for an efficient traffic control system to handle the demand, and ATSC is one such solution.
- In contrast to traditional models, the ATSC system considers traffic patterns and vehicle movements and responds to them in real time [101,102]. It supports day-to-day operations by utilizing dynamic traffic flow and providing a user-friendly interface to commuters. TSC system optimization is a challenging and intricate problem. Stochastic processes are often included because traffic flow demand and behaviors are unpredictable [103]. The solution space for practical problems is so large that finding optimal solutions is difficult.
- By decreasing the time lost in heavy traffic, fuel waste, and the effects of pollution, which are always alarming on a global scale, ATSC will aid in boosting economic output. As a result, there is a greater need for improved traffic management technologies. To facilitate a smoother traffic flow, ATSC dynamically modifies the timing of traffic signals [104] based on current traffic circumstances.
- The findings from this review demonstrate that modern ATSC systems designed using various techniques offer substantial improvements in managing the dynamic density of the traffic flow.
- When applied to single intersections (SI) and multiple intersections (MI), ATSC systems have been shown to reduce travel time, vehicle idling, and emissions, contributing to a smoother traffic flow. A key observation from the studies is that the complexity of traffic signal optimization increases significantly in MI scenarios due to the interdependency between intersections. In contrast, SI systems are easier to manage but may create bottlenecks in adjacent areas because each intersection decides in isolation. Techniques such as Reinforcement Learning (RL), Deep Reinforcement Learning (DRL), Fuzzy Logic (FL), Dynamic Programming (DP), Metaheuristic (MH), and hybrid methods have shown promising results in dynamically adjusting signal timings based on real-time traffic data. Despite these advancements, gaps remain, particularly in applying ATSC systems in complex, real-world traffic environments with mixed vehicle types and pedestrian interactions.
- A few SI-ATSC systems utilizing RL techniques, such as SARSA, Q-Learning, and TD error [34,38,44], have reduced vehicle queue lengths by up to 19%. However, these approaches are limited by the small number of available actions. Future research should enlarge the action space by utilizing NNs.
- Several SI-ATSC systems employ MH techniques, such as PSO and heuristic GA [46,48,49], which are well suited to the proposed non-linear programming problems involving time-varying delay terms. However, a critical challenge with these methods in interrupted traffic flow is the difficulty of accurately defining the adaptive user equilibrium. The scalability of these methods is a crucial aspect that should be addressed in future work.
- Several SI-ATSC systems utilize FL techniques, such as FLP and fuzzy inference [52,55], which offer the advantage of scheduling anticipated signal times by adjusting the signal duration based on the number of vehicles at the intersection. However, these strategies do not account for factors such as driver categories, vehicle types, and road infrastructure. Further research using these techniques should include driver categories and vehicle types.
- The MI-ATSC systems employing RL techniques, such as FA, MARL, Q-Learning, and two-level hierarchical DRL (D3QN) [59,60,62,70,71], aim to optimize traffic signals by balancing safety, efficiency, and decarbonization goals. These techniques excel under high-traffic-demand conditions, successfully addressing all three objectives. However, a key limitation is that each agent must communicate throughout the modeling process, which can reduce the overall communication efficiency.
- Some MI-ATSC systems utilize MH techniques, such as PSO, GA, ACO, SA, and CS [46,82,83]. These techniques aim to manage road infrastructure (RI) in real-time through decentralized algorithms, reducing congestion by adjusting signal durations and collecting input data from the execution environment. Future research should incorporate traffic characteristics in oversaturated conditions into the optimization process.
- The review also highlights the continued dominance of fixed-time traffic control systems in simulation-based studies, with fewer real-time dynamic systems being explored. This indicates a need for more research and experimentation with real-time adaptive systems, especially in developing countries with prevalent infrastructure challenges. Integrating data from external sources such as weather, pedestrian activity, and unexpected road events has been identified as a critical factor in enhancing ATSC performance. As microsimulation tools like SUMO and VISSIM become more sophisticated, they offer greater opportunities to fine-tune and test these systems under various traffic conditions.
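To make the RL discussion above concrete, the following is a minimal sketch of tabular Q-Learning on a toy single intersection with a two-action space. It is not the method of any cited study: the environment `step`, the arrival probabilities, the discharge rate, the queue cap, and all hyperparameters are illustrative assumptions chosen only to keep the example small.

```python
import random
from collections import defaultdict

ACTIONS = (0, 1)  # 0: green for north-south, 1: green for east-west
MAX_QUEUE = 10    # assumed cap so the state space stays small (11 x 11 states)


def step(state, action, rng, arrival_p=(0.3, 0.2), discharge=2):
    """Toy intersection: serve the green approach, then add random arrivals."""
    queues = list(state)
    queues[action] = max(0, queues[action] - discharge)
    for i, p in enumerate(arrival_p):
        if rng.random() < p:
            queues[i] = min(MAX_QUEUE, queues[i] + 1)
    reward = -sum(queues)  # shorter total queue gives a higher reward
    return tuple(queues), reward


def train(steps=20000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Run epsilon-greedy Q-Learning and return the learned Q-table."""
    rng = random.Random(seed)
    q = defaultdict(lambda: [0.0, 0.0])  # state -> one Q-value per action
    state = (0, 0)
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)  # explore
        else:
            action = max(ACTIONS, key=lambda a: q[state][a])  # exploit
        next_state, reward = step(state, action, rng)
        # Q-Learning update: bootstrap from the best action in the next state.
        td_target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (td_target - q[state][action])
        state = next_state
    return q


def greedy_action(q, state):
    """Signal decision implied by the learned Q-table for a queue state."""
    return max(ACTIONS, key=lambda a: q[state][a])


if __name__ == "__main__":
    q_table = train()
    print(greedy_action(q_table, (6, 1)), greedy_action(q_table, (1, 6)))
```

With only two actions and clipped queues, the table stays small. The limitation noted above appears as soon as the action set or the state description grows, which is where function approximation with NNs becomes attractive.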
5. Conclusions and Future Work
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
AC | Actor–Critic |
Ac | Actuated Control |
ACO | Ant Colony Optimization |
ATSC | Adaptive Traffic Signal Control |
ABC | Artificial bee colony |
ATT | Average travel time |
BC | Bee Colony Algorithm |
Bi | Binary |
BP | Back Pressure |
CS | Cuckoo Search |
CWT | Cumulative waiting time |
Cycle L | Cycle length |
DP | Dynamic Programming |
DRL | Deep Reinforcement Learning |
FITS | Fuzzy Intelligence Transportation System |
FL | Fuzzy Logic |
FLATSC | Fuzzy Logic Adaptive Traffic Signal Control |
GA | Genetic Algorithm |
Gn | Green time |
HS | Harmony Search |
JADE | Java Agent Development Framework |
KS-DDPG | Knowledge Sharing Deep Deterministic Policy Gradient |
MARL | Multi-Agent Reinforcement Learning |
MBRL | Model-Based RL |
MH | Metaheuristic Algorithm |
MI | Multiple Intersection |
MOL | Multi-objective Learning |
MOLAC | Multi-objective Learning Agent Cooperation |
Nash-A2C | Nash Advantage Actor Critic |
Nash-A3C | Nash Asynchronous Advantage Actor Critic |
NN | Neural Network |
NSGA | Non-dominated Sorting Genetic Algorithm |
PL | Programming Language |
PSO | Particle Swarm Optimization |
QL | Q-Learning |
Ql | Queue length |
QLAC | Q-Learning Actor–Critic |
RBF | Radial Basis Function |
RL | Reinforcement Learning |
SA | Sarsa Algorithm |
SAFA | Sarsa Algorithm with Function approximation |
SI | Single Intersection |
SUMO | Simulation of Urban Mobility |
TSC | Traffic Signal Control |
TSS | Traffic Signal Scheduling |
TST | Traffic Signal Timing |
TSO | Traffic signal optimization |
VISSIM | Verkehr In Städten – SIMulationsmodell |
WCA | Water Cycle Algorithm |
References
1. Neelakandan, S.; Berlin, M.A.; Tripathi, S.; Devi, V.B.; Bhardwaj, I.; Arulkumar, N. IoT-Based Traffic Prediction and Traffic Signal Control System for Smart City. Soft Comput. 2021, 25, 12241–12248.
2. Nielsen, O.A.; Frederiksen, R.D.; Simonsen, N. Using Expert System Rules to Establish Data for Intersections and Turns in Road Networks. Int. Trans. Oper. Res. 1998, 5, 569–581.
3. Jing, P.; Huang, H.; Chen, L. An Adaptive Traffic Signal Control in a Connected Vehicle Environment: A Systematic Review. Information 2017, 8, 101.
4. Kim, M.; Schrader, M.; Yoon, H.-S.; Bittle, J.A. Optimal Traffic Signal Control Using Priority Metric Based on Real-Time Measured Traffic Information. Sustainability 2023, 15, 7637.
5. Zaghal, R.; Thabatah, K.; Salah, S. Towards a Smart Intersection Using Traffic Load Balancing Algorithm. In Proceedings of the 2017 Computing Conference, London, UK, 18–20 July 2017; pp. 485–491.
6. Mishra, S.; Singh, V.; Gupta, A.; Bhattacharya, D.; Mudgal, A. Adaptive Traffic Signal Control for Developing Countries Using Fused Parameters Derived from Crowd-Source Data. Transp. Lett. 2023, 15, 296–307.
7. Jovanović, A.; Teodorović, D. Pre-Timed Control for an under-Saturated and over-Saturated Isolated Intersection: A Bee Colony Optimization Approach. Transp. Plan. Technol. 2017, 40, 556–576.
8. Ahmed, E.K.E.; Khalifa, A.M.A.; Kheiri, A. Evolutionary Computation for Static Traffic Light Cycle Optimisation. In Proceedings of the 2018 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE), Khartoum, Sudan, 12–14 August 2018; pp. 1–6.
9. Noaeen, M.; Naik, A.; Goodman, L.; Crebo, J.; Abrar, T.; Abad, Z.S.H.; Bazzan, A.L.C.; Far, B. Reinforcement Learning in Urban Network Traffic Signal Control: A Systematic Literature Review. Expert Syst. Appl. 2022, 99, 116830.
10. Tian, Y.; Liu, S.; Yan, X.; Zhu, T.; Zhang, Y. Active Control Method of Traffic Signal Based on Parallel Control Theory. IEEE J. Radio Freq. Identif. 2024, 8, 334–340.
11. Li, D.; Zhu, F.; Wu, J.; Wong, Y.D.; Chen, T. Managing Mixed Traffic at Signalized Intersections: An Adaptive Signal Control and CAV Coordination System Based on Deep Reinforcement Learning. Expert Syst. Appl. 2024, 238, 121959.
12. Sawarkar, A.D.; Shrimankar, D.D.; Ali, S.; Agrahari, A.; Singh, L. Bamboo Plant Classification Using Deep Transfer Learning with a Majority Multiclass Voting Algorithm. Appl. Sci. 2024, 14, 1023.
13. Haydari, A.; Yılmaz, Y. Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey. IEEE Trans. Intell. Transp. Syst. 2022, 23, 11–32.
14. Agarwal, A.; Sahu, D.; Nautiyal, A.; Gupta, M.; Agarwal, P. Fusing Crowdsourced Data to an Adaptive Wireless Traffic Signal Control System Architecture. Internet Things 2024, 26, 101169.
15. Chen, L.; Englund, C. Cooperative Intersection Management: A Survey. IEEE Trans. Intell. Transp. Syst. 2016, 17, 570–586.
16. Anirudh, R.; Krishnan, M.; Kekuda, A. Intelligent Traffic Control System Using Deep Reinforcement Learning. In Proceedings of the 2022 International Conference on Innovative Trends in Information Technology (ICITIIT), Kottayam, India, 12–13 February 2022; pp. 1–8.
17. Saleem, M.; Abbas, S.; Ghazal, T.M.; Adnan Khan, M.; Sahawneh, N.; Ahmad, M. Smart Cities: Fusion-Based Intelligent Traffic Congestion Control System for Vehicular Networks Using Machine Learning Techniques. Egypt. Inform. J. 2022, 23, 417–426.
18. Liu, B.; Ding, Z. A Distributed Deep Reinforcement Learning Method for Traffic Light Control. Neurocomputing 2022, 490, 390–399.
19. Zhao, P.; Gao, Y.; Sun, X. How Does Artificial Intelligence Affect Green Economic Growth?—Evidence from China. Sci. Total Environ. 2022, 834, 155306.
20. Tajalli, M.; Hajbabaie, A. Traffic Signal Timing and Trajectory Optimization in a Mixed Autonomy Traffic Stream. IEEE Trans. Intell. Transp. Syst. 2022, 23, 6525–6538.
21. Majstorović, Ž.; Tišljarić, L.; Ivanjko, E.; Carić, T. Urban Traffic Signal Control under Mixed Traffic Flows: Literature Review. Appl. Sci. 2023, 13, 4484.
22. Kang, L.; Lu, W.; Liu, L. Research on Route Hierarchical Control Strategy from the Perspective of Macroscopic Traffic Network. J. Intell. Transp. Syst. 2022, 27, 818–833.
23. Ratrout, N.T.; Rahman, S.M. A Comparative Analysis of Currently Used Microscopic and Macroscopic Traffic. Science 2009, 34, 121–133.
24. Chevallier, E.; Leclercq, L. A Macroscopic Theory for Unsignalized Intersections. Transp. Res. Part B Methodol. 2007, 41, 1139–1150.
25. Gökçe, M.A.; Öner, E.; Işık, G. Traffic Signal Optimization with Particle Swarm Optimization for Signalized Roundabouts. Simulation 2015, 91, 456–466.
26. Ahmed, M.A.A.; Khoo, H.L.; Ng, O.-E. Discharge Control Policy Based on Density and Speed for Deep Q-Learning Adaptive Traffic Signal. Transp. B Transp. Dyn. 2023, 11, 1707–1726.
27. Tsitsokas, D.; Kouvelas, A.; Geroliminis, N. Two-Layer Adaptive Signal Control Framework for Large-Scale Dynamically-Congested Networks: Combining Efficient Max Pressure with Perimeter Control. Transp. Res. Part C Emerg. Technol. 2023, 152, 104128.
28. Zhao, D.; Dai, Y.; Zhang, Z. Computational Intelligence in Urban Traffic Signal Control: A Survey. IEEE Trans. Syst. Man, Cybern. Part C Appl. Rev. 2012, 42, 485–494.
29. Kolat, M.; Kővári, B.; Bécsi, T.; Aradi, S. Multi-Agent Reinforcement Learning for Traffic Signal Control: A Cooperative Approach. Sustainability 2023, 15, 3479.
30. Kang, D.; Li, Z.; Levin, M.W. Evasion Planning for Autonomous Intersection Control Based on an Optimized Conflict Point Control Formulation. J. Transp. Saf. Secur. 2022, 14, 2074–2110.
31. Levin, M.W.; Rey, D. Conflict-Point Formulation of Intersection Control for Autonomous Vehicles. Transp. Res. Part C Emerg. Technol. 2017, 85, 528–547.
32. Kaige, W.; Shiru, Q.; Yumei, Z. A Stochastic Adaptive Control Model for Isolated Intersections. In Proceedings of the 2007 IEEE International Conference on Robotics and Biomimetics (ROBIO), Sanya, China, 15–18 December 2007; pp. 2256–2260.
33. Shoufeng, L.; Ximin, L.; Shiqiang, D. Q-Learning for Adaptive Traffic Signal Control Based on Delay Minimization Strategy. In Proceedings of the 2008 IEEE International Conference on Networking, Sensing and Control, ICNSC, Sanya, China, 6–8 April 2008; pp. 687–691.
34. El-Tantawy, S.; Abdulhai, B.; Abdelgawad, H. Design of Reinforcement Learning Parameters for Seamless Application of Adaptive Traffic Signal Control. J. Intell. Transp. Syst. 2014, 18, 227–245.
35. Sharifzadeh, S.; Chiotellis, I.; Triebel, R.; Cremers, D. Learning to Drive Using Inverse Reinforcement Learning and Deep Q-Networks. arXiv 2016, arXiv:1612.03653.
36. Gao, J.; Shen, Y.; Liu, J.; Ito, M.; Shiratori, N. Adaptive Traffic Signal Control: Deep Reinforcement Learning Algorithm with Experience Replay and Target Network. arXiv 2017, arXiv:1705.02755.
37. Touhbi, S.; Babram, M.A.; Nguyen-Huu, T.; Marilleau, N.; Hbid, M.L.; Cambier, C.; Stinckwich, S. Adaptive Traffic Signal Control: Exploring Reward Definition for Reinforcement Learning. Procedia Comput. Sci. 2017, 109, 513–520.
38. Genders, W.; Razavi, S. Evaluating Reinforcement Learning State Representations for Adaptive Traffic Signal Control. Procedia Comput. Sci. 2018, 130, 26–33.
39. Wang, H.; Chen, H.; Wu, Q.; Ma, C.; Li, Y. Multi-Intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline. IEEE Open J. Intell. Transp. Syst. 2022, 3, 126–136.
40. Garg, D.; Chli, M.; Vogiatzis, G. Deep Reinforcement Learning for Autonomous Traffic Light Control. In Proceedings of the 2018 3rd IEEE International Conference on Intelligent Transportation Engineering (ICITE), Singapore, 3–5 September 2018; pp. 214–218.
41. Chin, Y.K.; Lee, L.K.; Bolong, N.; Yang, S.S.; Teo, K.T.K. Exploring Q-Learning Optimization in Traffic Signal Timing Plan Management. In Proceedings of the 2011 Third International Conference on Computational Intelligence, Communication Systems and Networks, Bali, Indonesia, 26–28 July 2011; pp. 269–274.
42. Liang, X.; Du, X.; Wang, G.; Han, Z. A Deep Reinforcement Learning Network for Traffic Light Cycle Control. IEEE Trans. Veh. Technol. 2019, 68, 1243–1253.
43. Tang, D.; Duan, Y. Traffic Signal Control Optimization Based on Neural Network in the Framework of Model Predictive Control. Actuators 2024, 13, 251.
44. Thorpe, T.L.; Anderson, C.W. Traffic Light Control Using SARSA with Three State Representations; IBM Corp.: Armonk, NY, USA, 1996.
45. El-Tantawy, S.; Abdulhai, B. An Agent-Based Learning towards Decentralized and Coordinated Traffic Signal Control. In Proceedings of the 13th International IEEE Conference on Intelligent Transportation Systems, Funchal, Portugal, 19–22 September 2010; pp. 665–670.
46. Panovski, D.; Zaharia, T. Simulation-Based Vehicular Traffic Lights Optimization. In Proceedings of the 2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS), Naples, Italy, 28 November–1 December 2016; pp. 258–265.
47. Li, Z.; Shahidehpour, M.; Bahramirad, S.; Khodaei, A. Optimizing Traffic Signal Settings in Smart Cities. IEEE Trans. Smart Grid 2017, 8, 2382–2393.
48. Yu, H.; Ma, R.; Zhang, H.M. Optimal Traffic Signal Control under Dynamic User Equilibrium and Link Constraints in a General Network. Transp. Res. Part B Methodol. 2018, 110, 302–325.
49. Jia, H.; Lin, Y.; Luo, Q.; Li, Y.; Miao, H. Multi-Objective Optimization of Urban Road Intersection Signal Timing Based on Particle Swarm Optimization Algorithm. Adv. Mech. Eng. 2019, 11, 1687814019842498.
50. Elgarej, M.; Khalifa, M.; Youssfi, M. Traffic Lights Optimization with Distributed Ant Colony Optimization Based on Multi-Agent System. In Networked Systems; Abdulla, P.A., Delporte-Gallet, C., Eds.; Springer International Publishing: Cham, Switzerland, 2016; pp. 266–279.
51. Chuo, H.S.E.; Tan, M.K.; Chong, A.C.H.; Chin, R.K.Y.; Teo, K.T.K. Evolvable Traffic Signal Control for Intersection Congestion Alleviation with Enhanced Particle Swarm Optimisation. In Proceedings of the 2017 IEEE 2nd International Conference on Automatic Control and Intelligent Systems (I2CACIS), Kota Kinabalu, Malaysia, 21 October 2017; pp. 92–97.
52. Yu, D.; Tian, X.; Xing, X.; Gao, S. Signal Timing Optimization Based on Fuzzy Compromise Programming for Isolated Signalized Intersection. Math. Probl. Eng. 2016, 2016, 1682394.
53. Jin, J.; Ma, X.; Kosonen, I. An Intelligent Control System for Traffic Lights with Simulation-Based Evaluation. Control Eng. Pract. 2017, 58, 24–33.
54. Tunc, I.; Soylemez, M.T. Fuzzy Logic and Deep Q Learning Based Control for Traffic Lights. Alexandria Eng. J. 2023, 67, 343–359.
55. Aksaç, A.; Uzun, E.; Özyer, T. A Real Time Traffic Simulator Utilizing an Adaptive Fuzzy Inference Mechanism by Tuning Fuzzy Parameters. Appl. Intell. 2012, 36, 698–720.
56. Vogel, A.; Oremović, I.; Šimić, R.; Ivanjko, E. Improving Traffic Light Control by Means of Fuzzy Logic. In Proceedings of the 2018 International Symposium ELMAR, Zadar, Croatia, 16–19 September 2018; pp. 51–56.
57. Ilgin Guler, S.; Menendez, M.; Meier, L. Using Connected Vehicle Technology to Improve the Efficiency of Intersections. Transp. Res. Part C Emerg. Technol. 2014, 46, 121–131.
58. Su, G.; Yang, J.J. Enhancing the Robustness of Traffic Signal Control with StageLight: A Multiscale Learning Approach. Eng 2024, 5, 104–115.
59. Prashanth, L.A.; Bhatnagar, S. Approximation for Traffic Signal Control. IEEE Trans. Intell. Transp. Syst. 2011, 12, 412–421.
60. El-Tantawy, S.; Abdulhai, B.; Abdelgawad, H. Multiagent Reinforcement Learning for Integrated Network of Adaptive Traffic Signal Controllers (MARLIN-ATSC): Methodology and Large-Scale Application on Downtown Toronto. IEEE Trans. Intell. Transp. Syst. 2013, 14, 1140–1150.
61. Zeinaly, Z.; Sojoodi, M.; Bolouki, S. A Resilient Intelligent Traffic Signal Control Scheme for Accident Scenario at Intersections via Deep Reinforcement Learning. Sustainability 2023, 15, 1329.
62. Abdoos, M.; Mozayani, N.; Bazzan, A.L.C. Hierarchical Control of Traffic Signals Using Q-Learning with Tile Coding. Appl. Intell. 2014, 40, 201–213.
63. Aslani, M.; Mesgari, M.S.; Wiering, M. Adaptive Traffic Signal Control with Actor-Critic Methods in a Real-World Traffic Network with Different Traffic Disruption Events. Transp. Res. Part C Emerg. Technol. 2017, 85, 732–752.
64. Aziz, H.M.A.; Zhu, F.; Ukkusuri, S.V. Learning-Based Traffic Signal Control Algorithms with Neighborhood Information Sharing: An Application for Sustainable Mobility. J. Intell. Transp. Syst. Technol. Plan. Oper. 2018, 22, 40–52.
65. Haddad, T.A.; Hedjazi, D.; Aouag, S. A Deep Reinforcement Learning-Based Cooperative Approach for Multi-Intersection Traffic Signal Control. Eng. Appl. Artif. Intell. 2022, 114, 105019.
66. Li, Z.; Yu, H.; Zhang, G.; Dong, S.; Xu, C.-Z. Network-Wide Traffic Signal Control Optimization Using a Multi-Agent Deep Reinforcement Learning. Transp. Res. Part C Emerg. Technol. 2021, 125, 103059.
67. Lin, Y.; Dai, X.; Li, L.; Wang, F.-Y. An Efficient Deep Reinforcement Learning Model for Urban Traffic Control. arXiv 2018, arXiv:1808.01876.
68. Bouktif, S.; Cheniki, A.; Ouni, A.; El-Sayed, H. Deep Reinforcement Learning for Traffic Signal Control with Consistent State and Reward Design Approach. Knowl.-Based Syst. 2023, 267, 110440.
69. Kumar, R.; Sharma, N.V.K.; Chaurasiya, V.K. Adaptive Traffic Light Control Using Deep Reinforcement Learning Technique. Multimed. Tools Appl. 2024, 83, 13851–13872.
70. Abdoos, M.; Bazzan, A.L.C. Hierarchical Traffic Signal Optimization Using Reinforcement Learning and Traffic Prediction with Long-Short Term Memory. Expert Syst. Appl. 2021, 171, 114580.
71. Zhang, G.; Chang, F.; Jin, J.; Yang, F.; Huang, H. Multi-Objective Deep Reinforcement Learning Approach for Adaptive Traffic Signal Control System with Concurrent Optimization of Safety, Efficiency, and Decarbonization at Intersections. Accid. Anal. Prev. 2024, 199, 107451.
72. Wang, T.; Zhu, Z.; Zhang, J.; Tian, J.; Zhang, W. A Large-Scale Traffic Signal Control Algorithm Based on Multi-Layer Graph Deep Reinforcement Learning. Transp. Res. Part C Emerg. Technol. 2024, 162, 104582.
73. Zhou, B.; Zhou, Q.; Hu, S.; Ma, D.; Jin, S.; Lee, D.-H. Cooperative Traffic Signal Control Using a Distributed Agent-Based Deep Reinforcement Learning With Incentive Communication. IEEE Trans. Intell. Transp. Syst. 2024, 25, 10147–10160.
74. Wu, Q.; Wu, J.; Shen, J.; Du, B.; Telikani, A.; Fahmideh, M.; Liang, C. Distributed Agent-Based Deep Reinforcement Learning for Large Scale Traffic Signal Control. Knowl.-Based Syst. 2022, 241, 108304.
75. Steingröver, M.; Schouten, R.; Peelen, S.; Nijhuis, E.; Bakker, B. Reinforcement Learning of Traffic Light Controllers Adapting to Traffic Congestion. Belgian/Netherlands Artif. Intell. Conf. 2005, 216–223.
76. Prabuchandran, K.J.; AN, H.K.; Bhatnagar, S. Multi-Agent Reinforcement Learning for Traffic Signal Control. In Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), Qingdao, China, 8–11 October 2014; pp. 2529–2534.
77. Khamis, M.A.; Gomaa, W. Enhanced Multiagent Multi-Objective Reinforcement Learning for Urban Traffic Light Control. In Proceedings of the 2012 11th International Conference on Machine Learning and Applications, Boca Raton, FL, USA, 12–15 December 2012; Volume 1, pp. 586–591.
78. Khamis, M.A.; Gomaa, W. Adaptive Multi-Objective Reinforcement Learning with Hybrid Exploration for Traffic Signal Control Based on Cooperative Multi-Agent Framework. Eng. Appl. Artif. Intell. 2014, 29, 134–151.
79. Prashanth, L.A.; Bhatnagar, S. Reinforcement Learning with Average Cost for Adaptive Control of Traffic Lights at Intersections. In Proceedings of the 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC), Washington, DC, USA, 5–7 October 2011; pp. 1640–1645.
80. Houli, D.; Zhiheng, L.; Yi, Z. Multiobjective Reinforcement Learning for Traffic Signal Control Using Vehicular Ad Hoc Network. EURASIP J. Adv. Signal Process. 2010, 2010, 1–7.
81. Jin, J.; Ma, X. A Multi-Objective Agent-Based Control Approach With Application in Intelligent Traffic Signal System. IEEE Trans. Intell. Transp. Syst. 2019, 20, 3900–3912.
82. Hajbabaie, A.; Benekohal, R.F. A Program for Simultaneous Network Signal Timing Optimization and Traffic Assignment. IEEE Trans. Intell. Transp. Syst. 2015, 16, 2573–2586.
83. Dakic, I.; Stevanovic, J.; Stevanovic, A. Backpressure Traffic Control Algorithms in Field-like Signal Operations. In Proceedings of the 2015 IEEE 18th International Conference on Intelligent Transportation Systems, Gran Canaria, Spain, 15–18 September 2015; pp. 137–142.
84. Nguyen, P.T.M.; Passow, B.N.; Yang, Y. Improving Anytime Behavior for Traffic Signal Control Optimization Based on NSGA-II and Local Search. In Proceedings of the 2016 International Joint Conference on Neural Networks (IJCNN), Vancouver, BC, Canada, 24–29 July 2016; pp. 4611–4618.
85. Wardrop, J.G. Road Paper. Some Theoretical Aspects of Road Traffic Research. Proc. Inst. Civ. Eng. 1952, 1, 325–362.
86. Guo, J.; Kong, Y.; Li, Z.; Huang, W.; Cao, J.; Wei, Y. A Model and Genetic Algorithm for Area-Wide Intersection Signal Optimization under User Equilibrium Traffic. Math. Comput. Simul. 2019, 155, 92–104.
87. El Hatri, C.; Boumhidi, J. Q-Learning Based Intelligent Multi-Objective Particle Swarm Optimization of Light Control for Traffic Urban Congestion Management. In Proceedings of the 2016 4th IEEE International Colloquium on Information Science and Technology (CiSt), Tangier, Morocco, 24–26 October 2016; pp. 794–799.
88. Gao, K.; Zhang, Y.; Sadollah, A.; Su, R. Optimizing Urban Traffic Light Scheduling Problem Using Harmony Search with Ensemble of Local Search. Appl. Soft Comput. 2016, 48, 359–372.
89. Srivastava, S.; Sahana, S.K. Nested Hybrid Evolutionary Model for Traffic Signal Optimization. Appl. Intell. 2017, 46, 113–123.
90. Massow, K.; Pfeifer, N.; Ketzler, F.; Radusch, I. Close-Range Coordination to Enhance Constant Distance Spacing Policies in Oversaturated Traffic Systems. Sensors 2024, 24, 4865.
91. Chen, S.; Sun, D.J. An Improved Adaptive Signal Control Method for Isolated Signalized Intersection Based on Dynamic Programming. IEEE Intell. Transp. Syst. Mag. 2016, 8, 4–14.
92. Lu, K.; Jiang, S.; Xin, W.; Zhang, J.; He, K. Algebraic Method of Regional Green Wave Coordinated Control. J. Intell. Transp. Syst. 2022, 27, 799–817.
93. Zhang, Z.; Zhang, W.; Liu, Y.; Xiong, G. Mean Field Multi-Agent Reinforcement Learning Method for Area Traffic Signal Control. Electronics 2023, 12, 4686.
94. Gao, K.; Zhang, Y.; Su, R.; Yang, F.; Suganthan, P.N.; Zhou, M. Solving Traffic Signal Scheduling Problems in Heterogeneous Traffic Network by Using Meta-Heuristics. IEEE Trans. Intell. Transp. Syst. 2019, 20, 3272–3282.
95. Jiang, C.-Y.; Hu, X.-M.; Chen, W.-N. An Urban Traffic Signal Control System Based on Traffic Flow Prediction. In Proceedings of the 2021 13th International Conference on Advanced Computational Intelligence (ICACI), Wanzhou, China, 14–16 May 2021; pp. 259–265.
96. Storani, F.; Di Pace, R.; De Schutter, B. A Traffic Responsive Control Framework for Signalized Junctions Based on Hybrid Traffic Flow Representation. J. Intell. Transp. Syst. 2022, 27, 606–625.
97. Manandhar, B.; Joshi, B. Adaptive Traffic Light Control with Statistical Multiplexing Technique and Particle Swarm Optimization in Smart Cities. In Proceedings of the 2018 IEEE 3rd International Conference on Computing, Communication and Security (ICCCS), Kathmandu, Nepal, 25–27 October 2018; pp. 210–217.
98. Bernas, M.; Płaczek, B.; Smyła, J. A Neuroevolutionary Approach to Controlling Traffic Signals Based on Data from Sensor Network. Sensors 2019, 19, 1776.
99. Bie, Y.; Cheng, S.; Liu, Z. Optimization of Signal-Timing Parameters for the Intersection with Hook Turns. Transport 2017, 32, 233–241.
100. Tarek, Z.; AL-Rahmawy, M.; Tolba, A. Fog Computing for Optimized Traffic Control Strategy. J. Intell. Fuzzy Syst. 2019, 36, 1401–1415.
101. Xu, H.; Zhang, N.; Li, Z.; Zhuo, Z.; Zhang, Y.; Zhang, Y.; Ding, H. Energy-Saving Speed Planning for Electric Vehicles Based on RHRL in Car Following Scenarios. Sustainability 2023, 15, 15947.
102. Zhao, Z.; Wang, K.; Wang, Y.; Liang, X. Enhancing Traffic Signal Control with Composite Deep Intelligence. Expert Syst. Appl. 2024, 244, 123020.
103. Mok, K.; Zhang, L. Adaptive Traffic Signal Management Method Combining Deep Learning and Simulation. Multimed. Tools Appl. 2022, 83, 15439–15459.
104. Li, T.; Guo, F.; Krishnan, R.; Sivakumar, A. An Analysis of the Value of Optimal Routing and Signal Timing Control Strategy with Connected Autonomous Vehicles. J. Intell. Transp. Syst. 2022, 28, 252–266.
Features | SUMO | VISSIM | AIMSUN | MATSim | CORSIM | Paramics |
---|---|---|---|---|---|---|
Open source | Y | N | N | Y | N | N |
System | CC | CC | CC | CC | DC | DC |
Visualization | 2D/3D | 2D/3D | 2D/3D | 2D | 2D/3D | 2D/3D |
Pedestrian | Y | Y | Y | N | Y | Y |
Scope of application | C | C/R | R/Co | C/R | C/R | C/R |
Output | XML file | XML file | Graph based | Text based | XML, CSV files | HTML, CSV, XML files |
Import maps | Y | Y | Y | Y | NA | NA |
Programming language | CPP, VB, Matlab, Python | CPP, VB, Matlab, Python | Python, CPP | NA | NA | NA |
Level of coding | Difficult | Easy | Difficult | NA | NA | NA |
Y: Yes; N: No; NA: Not available
Ref. | RL Approach | State | Action | Reward | Compared with |
---|---|---|---|---|---|
[44] | SARSA | #Vehicle fixed, Constant and variable vehicle distance | Bi phase | Fixed penalty (−1) | FTS different states |
[38] | QL | Queue size | Bi phase | Total latency | FTS |
[32] | SARSA | # of vehicles | Bi phase | Coefficients of state | FTS, Ac |
[45] | QL | Length of queue, Total delay | Gn phase | Change in total delay | FTS |
[34] | SARSA, TD error | Queue size, Total delay | Bi and Gn phases | Immediate delay, Total delay, Queue size | FTS, Ac |
[33] | QL | Total delay time | Time change in Gn phase | Total latency | FTS |
[37] | QL | Max left queue size | Gn phase time | Queue size, Total delay, Throughput | Variable vehicle demand |
FTS: Fixed-time scheduling; Ac: Actuated control; #: Number of; Gn: Green; Bi: Binary
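
To make the state, action, and reward columns of the table above concrete, the following is a minimal sketch of tabular Q-learning (QL) on a toy two-approach intersection. It is not the method of any cited study: the queue dynamics in `step`, the arrival probabilities, the queue cap `MAX_Q`, and all hyperparameters are illustrative assumptions. The state is the pair of queue sizes plus the current phase, the action is binary (keep or switch the phase), and the reward is the negative total queue length, used here as a simple stand-in for delay.

```python
import random
from collections import defaultdict

MAX_Q = 5          # assumed cap used to discretize queue sizes
ACTIONS = (0, 1)   # binary action: 0 = keep current phase, 1 = switch phase


def step(state, action, rng, arrival_p=(0.3, 0.3)):
    """Toy intersection dynamics (an assumption, not a calibrated model).

    state = (queue_ns, queue_ew, phase); phase 0 serves NS, phase 1 serves EW.
    """
    q = [state[0], state[1]]
    phase = state[2] ^ action            # flip the phase when action == 1
    if action == 0 and q[phase] > 0:     # a switch costs one lost time step
        q[phase] -= 1                    # discharge one vehicle on green
    for i in (0, 1):                     # Bernoulli arrivals on each approach
        if rng.random() < arrival_p[i]:
            q[i] = min(MAX_Q, q[i] + 1)
    reward = -(q[0] + q[1])              # fewer queued vehicles is better
    return (q[0], q[1], phase), reward


def train(episodes=300, horizon=100, alpha=0.1, gamma=0.9, eps=0.1, seed=0):
    """Learn a Q-table with epsilon-greedy exploration."""
    rng = random.Random(seed)
    Q = defaultdict(float)               # Q[(state, action)] -> value
    for _ in range(episodes):
        s = (0, 0, 0)
        for _ in range(horizon):
            if rng.random() < eps:       # explore
                a = rng.choice(ACTIONS)
            else:                        # exploit the current estimate
                a = max(ACTIONS, key=lambda x: Q[(s, x)])
            s2, r = step(s, a, rng)
            best_next = max(Q[(s2, x)] for x in ACTIONS)
            # Q-learning (off-policy) temporal-difference update
            Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
            s = s2
    return Q


def greedy_action(Q, state):
    """Action with the highest learned value in the given state."""
    return max(ACTIONS, key=lambda x: Q[(state, x)])
```

Replacing `best_next` with the value of the action actually taken in the next state would turn this update into SARSA, the on-policy alternative listed in the table.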
Objectives | Mathematical Model | Variables: Cycle L. | Variables: Green Time | Variables: Offsets | Variables: Phase Seq. | Intersection: SI | Intersection: MI | Control System Strategy: Fixed Time | Control System Strategy: Real Time | Source of Information: Sensor/Detector | Source of Information: Camera | Source of Information: Simulated/Various Sources | Simulator Used | Method | Ref. |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
↓ ATT | N | N | Y | N | N | Y | N | Y | N | N | Y | N | Vissim | PSO | [25] |
↓ Gn at RI | N | N | Y | N | Y | Y | N | N | Y | NA | NA | NA | JADE | ACO | [50] |
↓ ATT | N | N | Y | N | N | Y | N | Y | N | N | N | Y | Sumo | GA and Hyper heuristic | [8] |
↓ Ql at RI | N | Y | Y | Y | N | Y | N | Y | N | N | Y | NA | NA | PSO | [51] |
↓ Avg. delay at RI | Y | Y | Y | NA | NA | Y | NA | Y | NA | NA | NA | Y | NA | BC | [7] |
↓: Decrease; Y: Yes; N: No; NA: Not applicable; Avg: Average; Gn: Green time; Ql: Queue length; ATT: Average travel time
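
As an illustration of how a metaheuristic such as PSO can tune green time at a single intersection, the sketch below searches for the green split of a two-phase signal. It is a simplified example under stated assumptions, not the formulation of any study in the table: the cost function `delay_proxy`, the demand values, the fixed total green time, and the PSO coefficients are all hypothetical choices made only to keep the example small.

```python
import random


def delay_proxy(g1, total_green, demand):
    """Toy cost (an assumption): demand-weighted inverse green time per phase."""
    g2 = total_green - g1
    return demand[0] / g1 + demand[1] / g2


def pso_green_split(demand, total_green=80.0, g_min=10.0,
                    n_particles=20, iters=100, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Search the phase-1 green time g1; phase 2 receives total_green - g1."""
    rng = random.Random(seed)
    lo, hi = g_min, total_green - g_min          # keep both greens >= g_min
    pos = [rng.uniform(lo, hi) for _ in range(n_particles)]
    vel = [0.0] * n_particles
    pbest = pos[:]                               # personal best positions
    pbest_cost = [delay_proxy(p, total_green, demand) for p in pos]
    g_idx = min(range(n_particles), key=pbest_cost.__getitem__)
    gbest, gbest_cost = pbest[g_idx], pbest_cost[g_idx]
    for _ in range(iters):
        for i in range(n_particles):
            r1, r2 = rng.random(), rng.random()
            # inertia + pull toward personal best + pull toward global best
            vel[i] = (w * vel[i]
                      + c1 * r1 * (pbest[i] - pos[i])
                      + c2 * r2 * (gbest - pos[i]))
            pos[i] = min(hi, max(lo, pos[i] + vel[i]))   # clamp to bounds
            cost = delay_proxy(pos[i], total_green, demand)
            if cost < pbest_cost[i]:
                pbest[i], pbest_cost[i] = pos[i], cost
                if cost < gbest_cost:
                    gbest, gbest_cost = pos[i], cost
    return gbest, total_green - gbest, gbest_cost
```

Calling `pso_green_split((0.4, 0.1))` returns the two green times and the final cost; with this toy cost the heavier approach should receive the larger share of green. In practice the cost would come from a simulator run or a calibrated delay model rather than a closed-form proxy.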
Ref. | Approach | Solution Strategy | Scenario | Simulator | Compared with |
---|---|---|---|---|---|
[75] | MBRL | Congestion value sharing | 12 RIs | GDL | TC-1 [76] |
[70] | Bayesian trans func. | MOL | 12 RIs | GDL | TC-1 [76] |
[77] | Bayesian trans func. | MOLAC | 12 mixed intersections | GDL | TC-1 [76] |
[78] | MBRL with Bayesian trans func. | MOLAC | 22 RIs | GDL | TC-1 [76] |
[79] | QL, AC | FA | 2 × 2 grid, 5 RIs | GDL | FTS |
[59] | QL | FA | 2 × 2 grid, 3 × 3 grid, 5 RIs, 9 RIs | GDL | FTS |
[80] | MBRL | MOL | Real-world Beijing road map | Paramics | FTS, Ac, Single-agent RL |
[60] | QL | Direct coordination, Indirect coordination | RI in central Toronto | Paramics | FTS, Semi-Ac, Full Ac |
[81] | SAFA | MOL Threshold, lexicographic ordering | 3 RIs in Stockholm | Sumo | Multiple FA |
[62] | QL | 2-level hierarchical control | 3 × 3 grid | Aimsun | 1-level QL |
[63] | AC | Tile coding RBF | Real road map metropolitan Tehran | Aimsun | QL, FTS, Ac |
[64] | Avg reward | Multi-reward design | 8, 11 RIs | Vissim | QL, FTS, Ac |
FA: Function approximation; FTS: Fixed-time scheduling; Ac: Actuated control
Objectives | Mathematical Model | Variables: Cycle L. | Variables: Green Time | Variables: Offsets | Variables: Phase Seq. | Intersection: SI | Intersection: MI | Control System Strategy: Fixed Time | Control System Strategy: Real Time | Source of Information: Sensor/Detector | Source of Information: Camera | Source of Information: Simulated/Various Sources | Simulator Used | Method | Ref. |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
↑ Flow rate | N | Y | Y | N | N | N | Y | Y | N | N | N | Y | Sumo | PSO | [46] |
↓ Avg. delay, ↓ Ql, ↓ ATT, ↓ cost | Y | Y | Y | Y | Y | N | Y | N | Y | N | N | N | Sumo | NSGA-II, LS, NSGA-II, MODELA | [84] |
↑ Flow rate, ↓ Avg. delay | N | N | Y | N | Y | N | Y | Y | N | N | N | N | Sumo | Sumo, PSO | [87] |
Improve parameters for over saturated states length at RI | Y | Y | Y | Y | Y | N | Y | Y | N | N | Y | N | Corsim | GA | [82] |
↓ ATT | Y | Y | Y | Y | Y | N | Y | Y | N | N | N | N | Sumo | GA | [47] |
↓ ATT | N | Y | Y | N | Y | Y | Y | Y | N | N | N | Y | Paramics | SA, GA, CS | [47] |
↑: Increase; ↓: Decrease; Y: Yes; N: No; Avg: Average; Ql: Queue length; ATT: Average travel time
Objectives | Mathematical Model | Variables: Cycle L. | Variables: Green Time | Variables: Offsets | Variables: Phase Seq. | Intersection: SI | Intersection: MI | Control System Strategy: Fixed Time | Control System Strategy: Real Time | Source of Information: Sensor/Detector | Source of Information: Camera | Source of Information: Simulated/Various Sources | Simulator Used for Test | Method | Ref. |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
↓ Avg. delay at RI | Y | N | Y | N | Y | Y | N | N | Y | Y | N | N | Sumo | Statistical multiplexing, PSO | [97] |
↓ CWT | Y | Y | Y | Y | Y | N | Y | Y | N | N | N | N | NA | Artificial BC, HS, and Water Cycle Algo. | [94] |
↓ Congestion, ↓ Network interruption, ↓ CWT | N | N | N | N | N | N | N | N | Y | N | N | Y | CityFlow | Nash-A2C, Nash-A3C | [74] |
↓ ATT | N | Y | Y | N | N | N | Y | N | Y | N | N | N | Sumo | NeuroEvolution strategy | [98] |
↓ Avg. delay at RI | Y | Y | Y | N | Y | Y | N | Y | N | Y | N | N | NA | GA | [99] |
↓ ATT, ↓ Avg. delay | N | Y | Y | N | N | N | Y | N | Y | Y | N | N | NA | PSO, Three sub controller | [100] |
↓: Decrease; Y: Yes; N: No; Avg: Average; NA: Not available; CWT: Cumulative travel time
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Agrahari, A.; Dhabu, M.M.; Deshpande, P.S.; Tiwari, A.; Baig, M.A.; Sawarkar, A.D. Artificial Intelligence-Based Adaptive Traffic Signal Control System: A Comprehensive Review. Electronics 2024, 13, 3875. https://doi.org/10.3390/electronics13193875