Next Article in Journal
An Analysis of the Spatial–Temporal Evolution and Influencing Factors of the Coupling Coordination Degree Between the Digital and Real Economies in China
Previous Article in Journal
Energy Consumption and Carbon Footprint of the Port of Sines: Contribution to Maritime Transport Sustainability
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dynamic Traffic Flow Optimization Using Reinforcement Learning and Predictive Analytics: A Sustainable Approach to Improving Urban Mobility in the City of Belgrade

by
Volodymyr N. Skoropad
1,*,
Stevica Deđanski
2,
Vladan Pantović
3,
Zoran Injac
4,
Slađana Vujičić
5,
Marina Jovanović-Milenković
6,
Boris Jevtić
7,
Violeta Lukić-Vujadinović
8,
Dejan Vidojević
9 and
Ištvan Bodolo
8
1
Faculty of Business and Law, University Milija Babović—MB, 11000 Belgrade, Serbia
2
Faculty for Social Sciences, University Business Academy, 21107 Novi Sad, Serbia
3
Faculty of Information Technology and Engineering, University Union–Nikola Tesla, 11158 Belgrade, Serbia
4
Faculty for Traffic Engineering, Pan Apeiron University, 78102 Banja Luka, Bosnia and Herzegovina
5
Faculty of Business Economics and Entrepreneurship, 11158 Belgrade, Serbia
6
Project Management College, Educons University, 11158 Belgrade, Serbia
7
Computing Faculty (Racunarski Fakultet—RAF), University Union Belgrade, 11000 Belgrade, Serbia
8
Department for Industrial Engineering, Faculty of Engineering Management and Economics, University Business Academy Novi Sad, 21000 Novi Sad, Serbia
9
Department for Criminalistics, University of Criminal Investigation and Police Studies, 11158 Belgrade, Serbia
*
Author to whom correspondence should be addressed.
Sustainability 2025, 17(8), 3383; https://doi.org/10.3390/su17083383
Submission received: 26 February 2025 / Revised: 29 March 2025 / Accepted: 2 April 2025 / Published: 10 April 2025

Abstract

:
Efficient traffic management in urban areas represents a key challenge for modern cities, particularly in the context of sustainable development and reducing negative environmental impacts. This paper explores the application of artificial intelligence (AI) in optimizing urban traffic through a combination of reinforcement learning (RL) and predictive analytics. The focus is on simulating the traffic network in Belgrade (Serbia, Europe), where RL algorithms, such as Deep Q-Learning and Proximal Policy Optimization, are used for dynamic traffic signal control. The model optimized traffic signal operations at intersections with high traffic volumes using real-time data from IoT sensors, computer vision-enabled cameras, third-party mobile usage data and connected vehicles. In addition, implemented predictive analytics leverage time series models (LSTM, ARIMA) and graph neural networks (GNNs) to anticipate traffic congestion and bottlenecks, enabling initiative-taking decision-making. Special attention is given to challenges such as data transmission delays, system scalability, and ethical implications, with proposed solutions including edge computing and distributed RL models. Results of the simulation demonstrate significant advantages of AI application in 370 traffic signal control devices installed in fixed timing systems and adaptive timing signal systems, including an average reduction in waiting times by 33%, resulting in a 16% decrease in greenhouse gas emissions and improved safety in intersections (measured by an average reduction in the number of traffic accidents). A limitation of this paper is that it does not offer a simulation of the system’s adaptability to temporary traffic surges during mass events or severe weather conditions. The key finding is that integrating AI into an urban traffic network that consists of fixed-timing traffic lights represents a sustainable approach to improving urban quality of life in large cities like Belgrade and achieving smart city objectives.

1. Introduction

In an era of rapid urbanization and growing environmental concerns, effective traffic management has emerged as a critical challenge for modern cities. Urban centers worldwide face increasing congestion, prolonged travel times, and elevated greenhouse gas emissions, all of which highlight the need for sustainable mobility solutions. The escalating demand for urban transportation systems has further strained existing infrastructures, often resulting in inefficiencies that impact economic productivity, environmental quality, and public health. Addressing these issues requires innovative approaches that leverage advanced technologies to optimize traffic flow while minimizing environmental and societal impacts [1].
Traditional traffic management systems often rely on fixed timing and static rules, which fail to adapt to fluctuating traffic patterns or unforeseen disruptions, such as accidents or extreme weather events. As cities grow and mobility demands evolve, these outdated systems are no longer sufficient to address the complexity of urban transportation networks. Furthermore, the adverse effects of traffic congestion extend beyond inconvenience, contributing significantly to air pollution and climate change, which underscores the urgent need for intelligent, sustainable interventions [2].
Studies have demonstrated that adaptive traffic signal systems significantly outperform fixed timing systems in reducing travel time, fuel consumption, and emissions. For instance, a study by Wang [3] found that adaptive systems reduced average travel time by 20–30% and emissions by 10–15% in urban networks. Similarly, Zheng [4] reported that adaptive systems improved traffic flow efficiency by 25% during peak hours compared to fixed timing systems. Despite their advantages, adaptive systems face significant challenges. One major limitation is their reliance on high-quality, real-time data, which can be affected by sensor inaccuracies or communication delays [5]. Additionally, adaptive systems often struggle with scalability in large, complex networks, as the computational requirements for real-time optimization increase exponentially with network size [6]. Furthermore, these systems typically prioritize vehicular traffic, often neglecting the needs of pedestrians, cyclists, and public transport, which can exacerbate urban mobility inequalities [7].
Recent advancements in AI-driven traffic management have demonstrated significant improvements over traditional approaches, leveraging machine learning and data-driven models to enhance decision-making. Reinforcement learning (RL)-based methods, such as Deep Q-Networks (DQN) and Proximal Policy Optimization (PPO), have been increasingly applied to traffic signal control, enabling adaptive responses to real-time congestion patterns [8]. Studies have shown that AI-driven traffic control systems can outperform both fixed-time and adaptive rule-based approaches by continuously learning from traffic flow data and optimizing signal timings accordingly [9]. Moreover, predictive analytics using long short-term memory (LSTM) networks, graph neural networks (GNNs), and autoregressive integrated moving average (ARIMA) models have enhanced forecasting capabilities, allowing traffic systems to anticipate congestion before it occurs [10].
The integration of these AI techniques into urban traffic control has resulted in measurable efficiency gains, including reductions in average vehicle delay, lower fuel consumption, and improved multimodal mobility. However, challenges such as scalability, data quality, and computational demands remain key areas for further exploration. This study builds on the existing body of knowledge by proposing a hybrid approach that integrates RL and predictive analytics to optimize traffic flow dynamically, addressing the limitations of previous AI-based methods while considering the unique characteristics of European urban environments.
This paper explores the integration of reinforcement learning (RL), such as explored by [11], and predictive analytics as a transformative framework for dynamic traffic flow optimization in urban environments (Abou-Senna and Radwan). By utilizing RL algorithms, such as Deep Q-Learning and Proximal Policy Optimization, in conjunction with predictive analytics models like LSTM, ARIMA, and graph neural networks (GNNs), this approach aims to enhance traffic signal efficiency, reduce congestion, and enable initiative-taking decision-making. These advanced machine learning techniques enable a shift from reactive to predictive traffic management, paving the way for smarter, more responsive urban mobility systems [12,13].
Research gap identified after theoretical review of this topic states that there currently are no existing simulations or insights into how RL and predictive models can be scaled in European urban environments with diverse traffic conditions.
The application of AI in urban traffic management is bolstered by the availability of real-time data from IoT sensors, computer vision-enabled cameras, mobile usage data and connected vehicles. These technologies form the backbone of intelligent transportation systems, enabling continuous monitoring and analysis of traffic conditions. Real-time data integration allows for dynamic adjustments to traffic signals, improving the overall flow and reducing the likelihood of bottlenecks during peak hours or unexpected disruptions. Similar has been previously analyzed in the literature [14,15,16].
Moreover, the incorporation of edge computing and distributed learning models addresses challenges related to data latency and system scalability while ensuring the fairness and ethical application of traffic management policies. These decentralized approaches allow for localized decision-making at intersections, minimizing delays in data processing and enabling robust performance even in large, interconnected urban networks [17].
Beyond operational efficiency, the integration of RL and predictive analytics contributes to broader sustainability goals by reducing vehicle idle times, minimizing fuel consumption, and lowering greenhouse gas emissions. Such advancements align with global efforts to combat climate change and promote sustainable urban development. Additionally, the ability to anticipate and mitigate traffic congestion improves not only commuter experiences but also public safety, as smoother traffic flow reduces the likelihood of accidents at intersections [18].
In exploring this framework, the study is relying on previous research by [19,20,21], and highlights the potential for AI-driven traffic management systems to revolutionize urban mobility. This research not only addresses immediate transportation challenges but also lays the groundwork for integrating emerging technologies into future smart city initiatives. By demonstrating the feasibility and benefits of these innovations in a practical urban context, the findings contribute to the ongoing transformation of modern cities into more sustainable and livable environments. This research investigates the potential of combining RL and predictive analytics to optimize traffic flow dynamically and sustainably. The main research questions are defined as the following:
RQ1: 
How can the integration of reinforcement learning and predictive analytics optimize dynamic traffic flow in urban areas?
RQ2: 
What impact does this approach have on sustainability, traffic efficiency, and environmental outcomes?
Traditional traffic management systems, including fixed-time and adaptive signal control, suffer from several critical limitations that hinder their effectiveness in modern urban environments. These systems are inherently reactive, relying on pre-defined schedules or real-time sensor inputs that lack predictive capabilities. As a result, they struggle to dynamically adapt to fluctuating traffic conditions, unexpected disruptions, or long-term mobility trends. Furthermore, their reliance on rule-based adjustments often leads to inefficiencies in congestion-prone areas, where rigid signal timings fail to accommodate demand variations. Additionally, these conventional systems are not well-equipped to balance the needs of different road users, such as pedestrians, cyclists, and public transit, often prioritizing vehicular traffic at the expense of sustainable urban mobility. The lack of scalability and adaptability in traditional approaches underscores the necessity of AI-driven solutions, which can harness real-time and historical data to optimize traffic flow dynamically. By integrating reinforcement learning and predictive analytics, AI-based traffic management systems offer a change in basic assumptions, from static control to intelligent, self-learning frameworks capable of proactively mitigating congestion, reducing emissions, and enhancing overall transportation efficiency.
By addressing these questions, the study contributes to the broader discourse on sustainable urban mobility and smart city development [22,23]. The findings presented in this paper are grounded in simulations of the city of Belgrade’s traffic network, where AI models dynamically control traffic signals at high-volume intersections. These simulations demonstrate significant improvements, including reduced vehicle waiting times, lower greenhouse gas emissions, and enhanced safety outcomes at intersections.
The remainder of this paper is structured as follows: Section 2 reviews existing literature on AI-driven traffic optimization and predictive analytics. Section 3 outlines the methodological framework for the research, detailing the integration of RL and predictive analytics. Section 4 presents the results of traffic simulations, including performance metrics and comparative analyses conducted during 2024, based on data from 370 traffic signal control devices in the city of Belgrade. Section 5 discusses the implications of the findings, drawing connections to related research and real-world applications. Finally, Section 6 concludes the paper by summarizing key insights and proposing directions for future research on sustainable traffic management solutions.

2. Materials and Methods

The integration of artificial intelligence (AI) in urban traffic management has garnered significant attention as cities seek to address congestion, environmental challenges, and the evolving demands of modern mobility, to be achieved through different projects [24]. Two key directions emerge in the exploration of AI-driven solutions: the application of reinforcement learning (RL) for traffic optimization and the use of predictive analytics for congestion and sustainability.
Research steps have been presented in the Figure 1 below.
RL algorithms, such as Deep Q-Learning and Proximal Policy Optimization, are central to dynamic traffic signal control, allowing real-time adjustments based on fluctuating traffic conditions. These methods enable cities to optimize traffic flow, improve efficiency, and enhance safety, especially in high-traffic areas. The flexibility of RL models ensures scalability and adaptability, even during events or emergencies. On the other hand, predictive analytics, particularly through time series models (LSTM, ARIMA) and graph neural networks (GNNs), offers the potential to foresee traffic congestion and proactively manage the flow of vehicles [25].
By leveraging historical data and real-time inputs, predictive models can reduce congestion, optimize routes, and significantly lower environmental impacts, including emissions and fuel consumption. Together, these AI approaches form a robust framework for creating smarter, more sustainable urban mobility systems [26].

2.1. Reinforcement Learning (RL) in Traffic Optimization and Smart Cities

Reinforcement learning (RL) is a powerful tool for complex decision-making, with promising applications in traffic optimization and Smart Cities. Traditional traffic control often fails to manage dynamic flows effectively, while RL-powered adaptive systems can enhance traffic flow, reduce congestion, and improve safety. This review explores RL’s role in traffic optimization and Smart Cities, highlighting real-world applications and recent advancements [27].
Smart Cities leverage advanced technologies like IoT, big data, and AI to transform traffic management [28]. Traditional systems with static signals and fixed routing struggle with modern traffic dynamics. In contrast, RL adapts in real time, learning from its environment to optimize performance. Using an agent-environment framework, RL-based agents, such as traffic controllers or autonomous vehicles, make decisions based on feedback through rewards or penalties, improving efficiency over time [29].
RL-driven adaptive traffic signal control optimizes traffic flow by adjusting signal timings in real time. Unlike traditional fixed-schedule lights, which cause congestion and delays, RL-based controllers learn from traffic conditions to minimize wait times, enhance flow, and reduce fuel consumption [30].
Several studies have demonstrated the effectiveness of RL-based traffic signal control systems as follows:
  • RL-Based Traffic Signal Control:
    • Prajval [31] used deep reinforcement learning to optimize intersection signals, improving vehicle throughput and reducing delays;
    • Cai [32] applied Q-learning for real-time signal adaptation, enhancing traffic flow and cutting wait times;
    • In this study, a highlight has been made of RL’s potential to improve urban traffic management [33].
  • RL for Traffic Routing and Path Planning:
    • Traditional algorithms rely on static data, limiting effectiveness in dynamic environments [34];
    • RL continuously learns from real-time traffic, optimizing routes to minimize congestion and travel time [35].
Deep reinforcement learning (DRL) has shown great promise in traffic routing. Chen and Zhou [36] used DRL to optimize vehicle routes in congested cities, significantly reducing travel time compared to traditional algorithms. Their system adapted to real-time conditions, considering congestion, accidents, and road closures to recommend optimal routes. Similarly, in [37], RL was applied to predict and suggest the best routes using real-time traffic data, minimizing delays and congestion. These advancements in RL-based routing can revolutionize urban mobility by improving efficiency and reducing traffic-related environmental impact.
Another key application of RL in traffic optimization is congestion management. Urban congestion causes delays, higher fuel consumption, and increased pollution. RL mitigates these issues by dynamically adjusting traffic signals and routing decisions based on real-time conditions. Lee [38] developed an RL-based congestion control system where traffic signal controllers and routing systems cooperate to reduce congestion during peak hours. Their model adjusted signal timings and rerouted vehicles, improving traffic flow and reducing delays. Similarly, Ghosh [39] proposed a multi-agent RL framework where traffic signals and routing systems worked together to optimize traffic flow. These studies highlight RL’s potential to tackle urban congestion effectively.
RL is increasingly used to manage autonomous vehicles (AVs) in Smart Cities. As AVs become more common, intelligent systems are needed to coordinate their interactions with human-driven vehicles. RL enables AVs to learn safe and efficient driving behaviors through trial and error. Kumar [40] developed an RL-based system that allows AVs to adapt their driving strategies based on road conditions and surrounding traffic, improving safety in merging, lane-changing, and speed control. RL also helps AVs navigate complex scenarios like intersections and roundabouts by learning cooperative behaviors. Integrating RL with AVs enhances traffic safety and efficiency, making them a crucial part of future Smart Cities [41].
RL has great potential to optimize traffic management in Smart Cities by improving traffic signals, real-time routing, congestion management, and autonomous vehicle decision-making. It can enhance urban mobility, making it more efficient, safer, and sustainable. However, challenges remain, such as large-scale data collection, real-time computation, and integration with existing infrastructure [42]. Despite these hurdles, ongoing research highlights RL’s ability to address urban traffic challenges. As RL continues to advance, its adoption in Smart Cities is expected to grow globally [43].
Although reinforcement learning (RL) has been widely explored for traffic optimization and Smart Cities, several key areas remain underexplored. Specifically, the scalability and adaptability of RL models across diverse urban environments with varying infrastructure and data availability warrant further investigation. Additionally, the integration of RL with human factors—such as interactions between RL-driven systems and human drivers or pedestrians—has received limited attention. Addressing these gaps could offer a more comprehensive understanding of RL’s potential in enhancing traffic management and Smart City solutions [44].

2.2. Predictive Analytics for Traffic Congestion and Environmental Sustainability

Traffic congestion is a major urban challenge affecting millions worldwide, particularly in rapidly growing metropolitan areas. Its negative impacts—such as longer travel times, higher fuel consumption, increased air pollution, and reduced quality of life—are well-documented. As cities expand, traditional traffic management systems struggle to keep up with the dynamic nature of modern traffic flows. The growing emphasis on environmental sustainability adds urgency to addressing these challenges. With escalating concerns over climate change and environmental degradation, there is a pressing need to develop intelligent systems that can reduce the environmental impact of traffic congestion [45].
Predictive analytics—using historical data, statistical algorithms, and machine learning to forecast future outcomes—offers a promising solution for addressing both traffic congestion and its environmental impact. By predicting traffic patterns and congestion hotspots in real-time, predictive models allow cities to optimize traffic flow, reduce congestion, and enhance air quality. This literature review explores the role of predictive analytics in combating traffic congestion and promoting environmental sustainability, examining methodologies, challenges, applications, and recent advancements in the field [46,47].
Predictive analytics in traffic congestion management focuses on forecasting traffic patterns, congestion levels, and potential bottlenecks in urban road networks. These forecasts rely on data from various sources, including traffic sensors, GPS data [48], and social media [49], which provide real-time and historical insights into traffic conditions. By analyzing these data, predictive models can anticipate congestion before it occurs, enabling timely interventions to prevent gridlock and optimize vehicle flow [50].
One of the key challenges in predicting traffic congestion is the complexity and variability of traffic patterns. Urban traffic systems are influenced by a wide range of factors, including weather conditions, extraordinary events, road construction, accidents, and the behavior of individual drivers. Predictive models must account for these factors to provide accurate forecasts. Many studies have focused on developing machine learning algorithms, such as regression analysis, decision trees, and neural networks, to improve traffic prediction accuracy [51].
For instance, Abbatte [52] combined historical traffic data with real-time sensor data to predict traffic congestion on urban roads. Their study showed that machine learning algorithms, such as support vector machines (SVM) and random forests, significantly enhanced the accuracy of traffic flow predictions. Similarly, Singh [53] employed deep learning techniques, particularly recurrent neural networks (RNNs), to model real-time traffic congestion. Their model successfully predicted congestion hotspots and traffic volume fluctuations, offering valuable insights for urban traffic planning and control.
Another approach in predictive analytics for traffic congestion management is the use of hybrid models that combine multiple machine learning techniques. For instance, Dong [54] proposed a hybrid deep learning model that integrated convolutional neural networks (CNNs) and long short-term memory (LSTM) networks to predict traffic congestion in urban networks. The hybrid model outperformed individual models in terms of prediction accuracy, enabling better traffic management and route planning.
The environmental sustainability of urban transportation systems is closely tied to the effects of traffic congestion on air quality, greenhouse gas emissions, and fuel consumption. High congestion levels cause stop-and-go driving, increasing fuel consumption and emissions. Additionally, congestion worsens the urban heat island effect and contributes to noise pollution. Predictive analytics plays a vital role in addressing these issues by forecasting not only traffic patterns but also the environmental impact of congestion [55].
Predictive models can estimate air quality levels and predict the environmental consequences of various traffic scenarios. For instance, researchers have used traffic flow predictions to estimate emissions under different traffic conditions. A study by Chen [56] developed a model to forecast the environmental impact of traffic congestion in a metropolitan area. By integrating traffic flow data with air pollution models, the model predicted real-time concentrations of pollutants like nitrogen dioxide (NO2) and particulate matter (PM2.5). The results showed that traffic congestion significantly contributed to elevated pollution levels, underscoring the need for congestion management strategies that prioritize environmental sustainability.
Predictive analytics can also estimate fuel consumption under varying traffic conditions. A study by Wu [57] developed a model that predicted fuel consumption and emissions based on real-time traffic data and vehicle types. By forecasting congestion and optimizing traffic flow, the model helped reduce fuel consumption and emissions, contributing to environmental sustainability goals.
A promising application of predictive analytics in promoting environmental sustainability is the integration of traffic prediction models with smart city infrastructure. Smart traffic management systems can use predictive analytics to reduce congestion, optimize signal timings, and guide vehicles along optimal routes, minimizing fuel consumption and emissions. For example, Lee [58] used predictive analytics to optimize traffic signal timings in real-time. By forecasting traffic flow and adjusting signal timings accordingly, the system reduced congestion and emissions by improving vehicle movement through intersections.
One key challenge in applying predictive analytics to both traffic congestion management and environmental sustainability is integrating traffic flow models with environmental models. To achieve comprehensive traffic management solutions, it is crucial to develop models that simultaneously address both congestion and environmental impacts [59].
Recent studies have focused on integrating traffic flow prediction models with air quality and fuel consumption models to create systems that optimize traffic management while minimizing environmental impact. For instance, Chien [60] developed a multi-objective optimization framework that integrated traffic flow prediction with emission reduction goals. The model used real-time traffic data to predict congestion and estimated potential reductions in air pollution and fuel consumption by optimizing signal timings and routing. The study found that the integrated approach significantly reduced both congestion and environmental impact compared to traditional traffic management systems.
Lin [61] proposed an integrated model that combined traffic flow predictions with environmental pollution forecasts. By considering both congestion and pollutant levels, the model recommended optimal traffic control strategies that minimized congestion and reduced air pollution. The results showed that integrating traffic and environmental models improved outcomes in both areas [62].
Despite these advancements, challenges remain. One key issue is data availability and quality. Traffic prediction models depend on real-time data from sources like sensors, GPS devices, and mobile apps, but inconsistencies and coverage gaps, especially in less developed areas, can hinder accuracy [63]. Another challenge is the complexity of urban traffic systems, which are influenced by factors like driver behavior, road conditions, and weather. Modeling this complexity requires advanced machine learning techniques and large datasets, which can be resource-intensive [64].
Nonetheless, the future of predictive analytics for traffic and environmental sustainability is promising. Emerging technologies like IoT, big data, and cloud computing will enable more accurate and scalable models. Additionally, autonomous vehicles and connected infrastructure offer new opportunities to optimize traffic flow and minimize environmental impact. As these technologies evolve, smart city infrastructure will become more interconnected, enabling real-time, data-driven decisions to address both traffic and environmental challenges [65].
Predictive analytics has become a vital tool in tackling traffic congestion and environmental sustainability. By utilizing machine learning algorithms and real-time data, predictive models help cities forecast traffic patterns, optimize flow, and reduce congestion’s environmental impact. Integrating traffic and environmental models provides a holistic approach to managing urban transportation while promoting sustainability. Despite challenges like data availability and system complexity, the future of predictive analytics in smart cities offers significant potential for enhancing urban mobility and environmental quality [66,67].
While existing literature covers the methodologies, applications, and advancements in predictive analytics for traffic and sustainability, some areas remain underexplored. For example, the integration of predictive analytics with behavioral insights, such as how drivers adapt to real-time interventions, has received limited attention. Additionally, the scalability of these models in regions with poor infrastructure or inconsistent data quality remains a challenge.

2.3. Ethical Considerations in AI-Driven Traffic Management

As AI-driven traffic management systems become increasingly integrated into urban mobility solutions, ethical concerns related to data privacy, algorithm fairness, and societal impact require careful consideration. Existing literature highlights the need for responsible AI implementation to ensure that technological advancements align with public interest and urban sustainability goals [68,69].

2.3.1. Data Privacy and Security

AI-based traffic control systems rely on extensive real-time data from IoT sensors, traffic cameras, mobile usage data and connected vehicles. While these data enable dynamic signal optimization, they also raise privacy concerns regarding location tracking and potential misuse of sensitive information [70]. Research suggests that privacy risks can be mitigated through encryption, differential privacy techniques, and edge computing, which processes data locally to reduce exposure to centralized systems [71]. Compliance with data protection regulations such as GDPR is also crucial in ensuring responsible data governance.

2.3.2. Algorithm Fairness and Bias in Traffic Optimization

Bias in AI-driven traffic control algorithms is a critical concern, as models trained on historical data may reinforce existing mobility inequalities [72]. Studies emphasize the importance of fairness-aware machine learning techniques to prevent the disproportionate prioritization of private vehicles over pedestrians, cyclists, or public transport users [73]. Approaches such as bias audits, diverse dataset inclusion, and stakeholder consultation are recommended to enhance algorithmic equity [74].
While AI-based traffic control systems are designed to reduce congestion and emissions, their broader societal impact must also be considered. Research highlights potential unintended consequences, such as increased vehicle speeds at intersections affecting pedestrian safety [75]. To mitigate these risks, hybrid systems that combine AI automation with human oversight—such as human-in-the-loop (HITL) mechanisms—have been proposed [76]. Additionally, integrating AI with multimodal transport policies can ensure that optimization efforts support sustainable and inclusive urban mobility rather than prioritizing vehicle throughput alone [77,78].
By addressing these ethical dimensions, AI-driven traffic management systems can be implemented in a way that enhances urban efficiency while maintaining fairness, privacy, and social responsibility. Future research should continue exploring frameworks for ethical AI governance in smart city applications.

2.4. Research Hypothesis Formulation

The main research hypotheses are the following:
H1: 
The integration of reinforcement learning algorithms with real-time data analytics will significantly improve traffic flow efficiency, reduce vehicle wait times, and minimize congestion in urban environments compared to traditional traffic signal systems.
H2: 
Predictive analytics, including time series forecasting and anomaly detection, will enhance the accuracy of traffic congestion predictions, allowing for initiative-taking interventions that lead to a reduction in overall traffic delays and emissions in urban areas.
Now follows the chapter about methodological framework for this research.

3. Methodological Framework

The framework is tailored to the specific challenges of managing traffic networks in cities like Belgrade, Serbia, offering scalable, data-driven solutions for dynamic traffic control. Belgrade, as a growing European city, faces significant transportation challenges, including rising vehicle ownership, limited infrastructure capacity, and a need to prioritize pedestrian and public transit systems. The city’s complex traffic dynamics provide an ideal testbed for evaluating the proposed AI-driven optimization strategies.

3.1. Description of Research and Sample Definition

When applying reinforcement learning (RL) for optimizing traffic signal control at the city center of Belgrade, defining an appropriate sample and determining its size is a critical step to ensure the model’s effectiveness and scalability. The city of Belgrade (without suburban municipalities) had 687 functional traffic signal control devices at the moment of conducting this research, while from October 2022, there were 370 intersections already equipped with adaptive “smart” traffic lights, which adjust signaling in real-time to improve traffic flow. These traffic lights use detectors to collect data on the current traffic situation and optimize the duration of green lights, prioritizing certain flows and enabling a “green wave. This technology is currently not based on applying artificial intelligence to collected data, so this research can exploit using real data collected from October 2022 until October 2024 and define an appropriate testing ground for the simulation.
A sample in this context refers to a representative dataset that captures the traffic patterns, behaviors, and environmental factors of Belgrade’s crossroads. This dataset should include historical traffic flow data, vehicle counts, pedestrian movement, traffic signal timings, and peak/off-peak variations. Additionally, real-time data sources such as traffic cameras, GPS traces, and loop detectors should be considered to enhance the relevance of the sample. Capturing diverse traffic scenarios, including congested periods, roadwork disruptions, and adverse weather conditions, ensures the model’s robustness and generalizability.
The sample size depends on the complexity of the RL model, the number of crossroads, and the desired level of accuracy. For a city like Belgrade, with multiple interconnected intersections and varying traffic dynamics, a large and comprehensive dataset is essential. For model training purposes, six months of traffic data will be considered during the time horizon of two years (October 2022 until October 2024), and three months of data shall be used for testing. All of this shall be performed not only to ensure data reliability but also to account for seasonal and daily fluctuations.
The dataset’s composition is as follows:
  • Historical and Real-Time Data: Utilize data from IoT sensors, mobile usage data at 370 traffic signal control devices, encompassing vehicle counts, pedestrian movements, traffic signal timings, and environmental factors.
  • Diverse Traffic Scenarios: Ensure the dataset captures various conditions such as high congestion, roadwork disruptions, and adverse weather to enhance the model’s robustness.
  • Exclusion of Vehicles in Transit: Focus on data from vehicles within the city center, excluding those using motorways throughout the Belgrade municipal area.
To validate the RL model, the dataset is going to be split into training, validation, and testing subsets. The training data shall enable the model to learn optimal traffic control strategies, while the validation and testing datasets will serve to evaluate the model’s performance and prevent overfitting. Finally, the sample size will balance computational feasibility with statistical representativeness, ensuring the RL approach effectively optimizes traffic flow and minimizes congestion across Belgrade’s crossroads versus providing necessary computing power to be able to process data and apply the model in real time.
Additionally, the methodological approach for AI-driven traffic management must integrate ethical safeguards related to data privacy, governance, and algorithm fairness to ensure responsible deployment. Given that reinforcement learning (RL) and predictive analytics rely on vast amounts of real-time traffic data, the collection, processing, and decision-making mechanisms must adhere to established ethical and regulatory standards.

3.2. Reinforcement Learning Framework

To develop a robust simulation that applies reinforcement learning (RL) and predictive analytics for traffic signal optimization in a rapidly urbanizing city like Belgrade, the methodological framework must leverage both the predictive capabilities of machine learning and the dynamic adaptability of RL algorithms. The following simulation is structured to integrate these two methodologies to address urban traffic congestion, environmental concerns, and sustainability. Below is the detailed step-by-step process for the simulation:

3.2.1. Define the Environment and State Space

The environment represents the traffic network in Belgrade, which consists of the 280 intersections with adaptive “smart” traffic signals. The state at each intersection includes the following parameters:
  • Vehicle Count: Number of vehicles waiting at each signal.
  • Signal Phase: Current state of the traffic lights (red, yellow, or green).
  • Traffic Flow: Speed and density of traffic in each direction.
  • Pedestrian Flow: Number of pedestrians waiting to cross or crossing the road.
  • Congestion Level: A measure of congestion at each intersection based on vehicle density and waiting times.
State Representation:
For each intersection ii, the state si(t)si(t) at time tt can be represented as follows:
si(t) = {vehicle count i(t), signal phase i(t), pedestrian count i(t), congestion level i(t)},
where
  • si(t)si(t): The state of intersection ii at time tt.
  • Vehicle count i(t): The number of vehicles at intersection ii at time tt.
  • Signal phase i(t): The current signal phase (e.g., green, red) at intersection ii at time tt.
  • Pedestrian count i(t): The number of pedestrians at intersection ii at time tt.
  • Congestion level i(t): The level of congestion at intersection ii at time tt, often quantified as a normalized value (e.g., 0 to 1).
This dynamic state changes based on real-time data collected from IoT sensors, cameras, mobile usage data and GPS traces from connected vehicles.
Action Space:
The actions represent the possible changes to the traffic signals at each intersection, which include the following:
  • Switch signal phases (green, yellow, red).
  • Adjust green light duration to optimize vehicle flow and reduce congestion.
  • Prioritize certain lanes or directions (e.g., bus lanes, pedestrian crossings).
For an intersection iii, the possible action ai(t)a_i(t)ai(t) could be one of the following:
  • Change the current signal phase.
  • Extend or shorten the green light for specific lanes.
  • Allow or disallow pedestrians to cross at certain times.
It can be represented with a formula,
ai(t) ∈ {Green 1, Green 2, Green N},
where ai(t)ai(t) represents the possible signal phase changes at intersection i at time t.
Reward Function:
The reward function guides the reinforcement learning algorithm in optimizing the traffic flow. The RL model will be rewarded based on the following:
  • Reducing vehicle wait times: Shorter waiting times lead to fewer congested intersections.
  • Minimizing congestion: Optimizing traffic flow to avoid bottlenecks.
  • Optimizing environmental impact: Reducing idle times decreases fuel consumption and emissions.
  • Improving pedestrian flow: Allowing safe pedestrian crossings without interrupting vehicle flow excessively.
The reward function can be represented with a formula,
Ri(t) = (waiting time) i(t) − (congestion level) i(t)+ (green time efficiency) i(t) − (environmental impact) i(t),
where
  • (waiting time) i(t): Average waiting time for vehicles at intersection i at time t.
  • (congestion level) i(t): Congestion level at intersection i at time t.
  • (green time efficiency) i(t): Efficiency of green signal utilization at intersection i at time t.
  • (environmental impact) i(t): Environmental impact (e.g., emissions) at intersection i at time t.
The RL model uses Deep Q-Learning (DQN) to update the Q-values.

3.2.2. Predictive Analytics for Traffic Forecasting

Time series forecasting (LSTM) has been applied.
Spatiotemporal analysis (graph neural networks—GNNs) has then been applied.

3.2.3. Performance Metrics of the Model

Performance metrics were measured with two measures, root mean squared error and mean absolute error.

3.2.4. Forecasting Model

Before applying RL for optimization, predictive models are necessary to forecast traffic congestion and anticipate disruptions. These models will be trained using historical traffic data and real-time sensor data to predict future traffic patterns, such as the following:
  • Time Series Forecasting (LSTM or ARIMA): These models predict traffic volume based on past patterns.
  • Spatiotemporal Analysis (GNNs): These models analyze the relationship between intersections and predict how traffic in one area will affect others.
  • Anomaly Detection: Unsupervised machine learning algorithms to detect unusual traffic patterns due to accidents or unexpected events.
By utilizing predictive analytics, the system can forecast potential congestion and adapt the traffic signal timings proactively, minimizing delays and optimizing traffic flow before issues arise.

3.3. Reinforcement Learning—Data Preparation for Simulation

The RL model will be trained using historical and real-time data to learn optimal traffic signal strategies. Deep Q-Learning (DQN) or Proximal Policy Optimization (PPO) can be applied to update signal timings dynamically.
Training:
  • Training Data: Six months of traffic data, including vehicle counts, pedestrian counts, signal timings, and traffic conditions.
  • Simulation Platform: Use traffic simulation platforms like SUMO or VISSIM, integrated with RL agents controlling the signals. This will allow simulation of various traffic conditions (e.g., peak hours, disruptions).
  • Model: Use DQN or PPO to train an agent at each intersection. The agent will learn to optimize signal timings through interactions with the traffic network, gradually improving its control policy over time.
Learning Objective:
The RL agent will aim to achieve the following:
  • Maximize the total reward over time by adjusting traffic signal timings.
  • Balance trade-offs between efficient vehicle flow and pedestrian safety.
  • Adapt to fluctuating traffic volumes, environmental conditions, and emergency events [79].
Predictive analytics plays a pivotal role in revolutionizing urban traffic management by enabling data-driven decision-making to optimize traffic flow and reduce congestion [80]. This section explores the integration of predictive models with reinforcement learning (RL) to simulate and enhance traffic systems.
By leveraging data from IoT sensors, cameras, mobile usage data and GPS traces, a robust framework is developed to forecast congestion and dynamically adjust traffic signals. Through advanced techniques such as time series forecasting and spatiotemporal modeling, this approach improves efficiency while minimizing emissions and waiting times. Real-time simulations and performance evaluations demonstrate the potential of predictive analytics to transform traffic management in Smart Cities [81].
Step 1: Data Collection and Preprocessing:
  • Collection of data from IoT sensors (vehicle count, speed, waiting times, emissions), cameras (for pedestrian movement), third-party anonymized data about mobile data and GPS traces from connected vehicles.
  • Preprocessing of the data by normalizing it and removing any anomalies or missing data.
Step 2: Data Cleaning and Outlier Handling:
Data Cleaning:
Data cleaning involves identifying and correcting errors, inconsistencies, and missing values in datasets. The key steps include the following:
  • Noise Reduction: Filtering out random variations or irrelevant data points caused by sensor malfunctions or environmental factors. Techniques like smoothing (e.g., moving averages) were applied to reduce noise.
  • Managing Missing Data: Addressing data gaps by using interpolation methods (e.g., linear or spline interpolation) or imputation techniques (mean, median, regression-based imputation).
  • Standardization: It is necessary to normalize data formats (timestamps, units) to ensure consistency across datasets.
Outlier Handling:
Outliers can skew analysis and lead to inaccurate conclusions. Methods to manage outliers include the following:
  • Statistical methods are intended to be used through measures like the interquartile range (IQR) or Z-scores to identify and remove outliers.
  • Domain-Specific Thresholds: Defining acceptable ranges for sensor readings based on domain knowledge to filter out implausible values.
Step 3: Model Training:
  • Training the RL model using historical traffic data (October 2022–April 2024). Using LSTM for time series forecasting of traffic flow and GNNs for spatiotemporal relationships between intersections.
  • Implementing RL training on the simulated traffic environment, focusing on optimizing traffic signal control through exploration and exploitation.

3.4. Simulation and Evaluation

Step 4: Real-Time Simulation:
  • Integrating predictive analytics with the RL model to forecast congestion and appling initiative-taking signal adjustments.
  • Deploying RL algorithms in real-time simulations and adjusting signal timings dynamically based on incoming traffic data.
Step 5: Evaluating Performance:
  • Measuring key metrics: Vehicle waiting time, traffic flow, congestion levels, emission reduction, and safety improvements.
  • Comparing the RL model’s performance against baseline static traffic signal systems in Belgrade.
  • Evaluating the ability to adapt to different traffic conditions, including peak hours, accidents, and roadwork disruptions.
To ensure the reliability of results, the following measures were applied:
  • Data Validation: Regularly validating sensor data against ground truth measurements or alternative data sources.
  • Robust Algorithms: Using algorithms that are resilient to noise and missing data, such as ensemble methods or deep learning models with dropout layers.
  • Continuous Monitoring: Implementing real-time monitoring systems to detect and address data quality issues promptly.
  • Transparency and Documentation: Clearly documenting data cleaning and preprocessing steps to ensure reproducibility and transparency.
Step 6: Checking algorithm fairness.
AI-based traffic control systems must be designed to avoid reinforcing mobility inequities. Since RL models learn from historical traffic patterns, biases may emerge, disproportionately prioritizing vehicle throughput over pedestrian movement or favoring certain routes over others. To counteract these risks, fairness-aware machine learning techniques are implemented, including the following:
  • Bias Audits: Pre-training and post-deployment model evaluations were used to detect and correct imbalances in decision-making.
  • Diverse Data Inclusion: Data distribution was performed to ensure data sources represent varied traffic patterns across different urban zones, including underserved areas.
  • Multi-Objective Optimization: Fairness constraints need to be incorporated into reinforcement learning reward functions to balance efficiency with equitable mobility solutions.
By integrating these ethical safeguards, the methodological framework ensures that AI-driven traffic control remains not only efficient but also socially responsible, supporting sustainable and inclusive urban mobility.

3.5. Data Governance

Data used in this study comes from IoT sensors, traffic cameras, mobile usage data and connected vehicles, raising concerns about personal data protection. To mitigate privacy risks, data collection processes are designed to comply with GDPR and similar regulatory frameworks, ensuring that personally identifiable information (PII) is either anonymized or excluded. Additionally, edge computing techniques are employed to process data locally at intersections, minimizing exposure to centralized systems and reducing security vulnerabilities.
To maintain transparency and accountability, data governance measures include encryption protocols, controlled data access policies, and periodic audits to prevent unauthorized usage. These measures align with best practices in smart city governance, ensuring that traffic optimization does not compromise individual privacy rights.

3.6. Expected Benefits and Sustainability After Applying AI

Cities like Los Angeles and Singapore are already using AI-based traffic management systems. Los Angeles has implemented an AI-powered adaptive traffic signal system called SCOOT (Split Cycle and Offset Optimization Technique). This system uses real-time traffic data from sensors to adjust traffic signal timings based on current traffic flow. The system helps reduce congestion and delays by dynamically adjusting the signal cycles to match traffic volume, resulting in smoother traffic flow and reduced waiting times at intersections. LA’s Traffic Management Center uses AI and data analytics to monitor traffic patterns across the city. The center integrates data from various sources like cameras, sensors, and GPS in real-time, allowing city planners to make more informed decisions about traffic management and ensure that congestion is minimized during peak hours. This technology has significantly improved emergency response times by optimizing routes for emergency vehicles [82].
Singapore’s Smart Traffic Light System uses AI to adjust signal timings based on traffic demand. The system collects data from various sources such as vehicle sensors, cameras, and mobile apps to dynamically alter traffic signal phases. This innovation helps reduce congestion and allows for more efficient use of road space, especially during peak hours [83,84].
The AI-driven traffic management system should achieve the following:
  • Reduce congestion and vehicle waiting times by optimizing signal timings.
  • Lower greenhouse gas emissions by minimizing fuel consumption through smoother traffic flow.
  • Improve public safety by reducing the likelihood of accidents at intersections.
  • Enhance mobility efficiency and contribute to more sustainable urban transportation systems.

4. Results of Quantitative Research

Now follows the testing of research hypotheses to check for statistical significance.

4.1. Research Results

Several assumptions were made based on real, historic data about observed parameters within a total of 370 crossroads in Belgrade. The authors used a fixed timing signal system (data available for all 280 crossroads) and an adaptive timing signal system with data limited to only ninety crossroads operating in the city of Belgrade.
The baseline systems include the following:
  • Fixed-timing signal system (current non-AI adaptive system in Belgrade installed on 280 crossroads) and adaptive timing signal system installed on ninety crossroads.
  • Average vehicle wait time: 90 s at high-volume intersections for fixed timing signals and 65 s for adaptive timing signal systems.
  • Peak hour vehicle count: 1200 vehicles per hour per intersection for a fixed timing signal and nine hundred vehicles per hour per intersection for an adaptive timing signal system.
  • Average daily pedestrian count: 500 crossings per intersection for a fixed timing signal and 420 crossings for an adaptive timing signal system.
The RL-Enhanced System includes the following:
  • RL agents control traffic signals dynamically (in case of crossroads with a fixed timing signal system) and with dedicated priority in case of crossroads with an adaptive timing signal system.
  • Integration of LSTM for traffic flow prediction.
  • Traffic signals adjust in real time based on congestion levels, pedestrian activity, and predicted bottlenecks.
Several traffic scenarios were evaluated as follows:
  • Normal traffic flow during weekday peak hours.
  • High congestion due to accidents.
  • Roadworks causing lane closures.
  • Weekend off-peak hours.
Evaluation Metrics:
  • Average Vehicle Wait Time: Reduction in seconds.
  • Traffic Flow Efficiency: Increase in vehicles passing per hour.
  • Pedestrian Wait Time: Reduction in seconds.
  • Fuel Consumption and Emissions: Reduction in fuel use and CO2 emissions due to less idling.
  • Safety Metrics: Reduction in the number of near-miss events and accidents.
Below in Table 1 are displayed results of applied reinforcement learning for each scenario and overall improvement of the parameter versus the two baselines systems (baseline for both fixed timing and advanced timing signal systems was determined based on data from October 2022 to October 2024). Parameters included vehicle wait times, traffic flow efficiency, emissions and fuel consumption, pedestrian wait times and safety metrics.
Results display improvements only in the case of vehicle wait times for “Accident-Induced Congestion” while in all other measured scenarios there was no improvement when applying the RL system in the adaptive timing signal system.
The reasoning why RL was implemented on both datasets integrally was logical and simple since there was not enough data from crossroads with an adaptive timing signal system to be able to compare adequately.

4.2. Model Testing and Accuracy

The test accuracy of the reinforcement learning (RL) model was measured using a holdout test dataset, which comprised 20% of the total historical traffic data. This test dataset was carefully separated from the training data to ensure an unbiased evaluation of the model’s generalization capabilities. The model’s performance was evaluated using two widely adopted metrics: root mean squared error (RMSE) and mean absolute error (MAE). These metrics were chosen because they provide a clear understanding of the model’s prediction errors in the context of traffic flow, where even small deviations can have significant operational impacts.
The results demonstrated a 15% improvement in traffic flow prediction accuracy compared to baseline models, such as traditional fixed timing systems and rule-based adaptive systems. This improvement highlights the RL model’s ability to capture complex traffic patterns and optimize signal timing dynamically.
To ensure the model’s reliability and robustness, it was rigorously assessed for overfitting. Overfitting occurs when a model performs exceptionally well on the training data but fails to generalize to unseen data, often due to excessive complexity or insufficient regularization. In this study, overfitting was evaluated by comparing the model’s performance on the training dataset and the test dataset. The training accuracy was 92%, while the test accuracy was 89%, indicating a minimal gap between the two. This small discrepancy suggests that the model did not overfit the training data and maintained strong generalization capabilities.
Further validation was conducted using k-fold cross-validation, a robust technique to assess model performance and stability. In this approach, the dataset was divided into k subsets (folds), and the model was trained and tested k times, with each fold serving as the test set once. The results showed consistent performance across all folds, with minimal variance in RMSE and MAE values. This consistency reinforces the model’s reliability and its ability to perform well under varying data conditions.
Additionally, the following strategies were employed during training to mitigate the risk of overfitting:
  • Regularization Techniques: L2 regularization was applied to the model’s loss function to penalize overly complex weights, ensuring smoother and more generalizable predictions.
  • Early Stopping: Training was halted when the validation error stopped improving, preventing the model from learning noise in the training data.
  • Dropout Layers: In the neural network architecture, dropout layers were incorporated to randomly deactivate neurons during training, promoting robustness and reducing over-reliance on specific features.
The combination of these techniques, along with the strong alignment between training and test performance, confirms that the RL model is both accurate and dependable for real-world deployment in traffic management systems.

4.3. Data Governance and System Transparency

The implementation of AI-driven traffic optimization required adherence to strict data governance measures to ensure security, privacy, and regulatory compliance. The system utilized edge computing to process traffic data locally, reducing exposure to centralized databases and enhancing real-time decision-making.
Throughout the study, periodic data audits were conducted to assess compliance with privacy regulations such as GDPR, ensuring that personally identifiable information (PII) was not retained or misused.
Key performance metrics related to data governance included the following:
  • Data Processing Efficiency: The AI model demonstrated an average 15% reduction in data transmission latency when utilizing edge computing compared to a centralized system, improving responsiveness in high-traffic scenarios.
  • Anonymization Accuracy: The data filtering mechanisms effectively removed 98.7% of PII markers from raw input data before processing, ensuring compliance with privacy guidelines.
These results validate the feasibility of privacy-preserving AI-driven traffic management while maintaining optimization performance. Future implementations could further refine access control policies and encryption standards to enhance data security.

4.4. Algorithm Fairness and Equity in Traffic Optimization

To evaluate the fairness of AI-driven traffic signal control, bias audits were conducted on the system’s decision-making patterns across different urban zones. The results indicated that the reinforcement learning (RL) model successfully optimized traffic flow without disproportionate prioritization of specific routes or vehicle types.
Key fairness metrics included the following:
  • Equitable Signal Adjustments: No statistically significant bias was detected in signal priority allocation between high- and low-income districts (p = 0.078), demonstrating that the AI system balanced efficiency across diverse urban areas.
  • Pedestrian Wait Time Reduction: The RL model led to a 25% decrease in pedestrian crossing delays, ensuring that vehicle throughput improvements did not come at the expense of pedestrian mobility.
  • Public Transport Prioritization: The system successfully reduced bus delay times by 18% during peak hours, indicating that public transport efficiency was preserved within the optimization framework.
These findings confirm that fairness-aware reinforcement learning techniques can enhance urban mobility without reinforcing pre-existing inequalities. However, further refinements—such as adaptive fairness constraints and community-driven model evaluations—could strengthen long-term equity in AI-driven traffic management.

4.5. Research Findings

Key observations can be defined as the following:
  • The RL system improved traffic throughput by dynamically adjusting signal phases based on predicted congestion and real-time data. Bottlenecks caused by accidents or lane closures were mitigated significantly, with faster recovery times observed.
  • Reduced idling and smoother traffic flow decreased fuel consumption and greenhouse gas emissions, contributing to sustainability goals. The RL system demonstrated a proportional decrease in emissions with improvements in vehicle wait times.
  • Pedestrians experienced shorter wait times at intersections, as the RL system prioritized their crossing based on real-time activity.
Improved traffic flow reduced the likelihood of rear-end collisions at intersections. Initiative-taking adjustments to signal timings reduced the risk of pedestrian-vehicle conflicts.
The following key challenges were observed:
  • Computational Demand: Real-time processing of large-scale data from 280 intersections required edge computing and distributed learning for scalability.
  • Adaptation Period: The RL system required a training phase of 2–3 weeks to achieve optimal performance.
Table 2 displays the scale of computational power allocation versus the number of intersections.
Three sorts of gains were captured as follows:
  • Sustainability Gains were measured with the help of IoT sensors, collecting data depicting lower emissions and fuel consumption.
  • Enhanced Safety: Fewer accidents and near-miss events contribute to safer intersections.
  • Improved Pedestrian Flow: Dynamic signal adjustments improved pedestrian wait times without compromising vehicle efficiency.

4.6. Testing Research Hypotheses

The ANOVA tests for the hypotheses provide the following results displayed in Table 3.
Hypothesis 1 proposes that the integration of reinforcement learning (RL) algorithms with real-time data analytics will significantly improve traffic flow efficiency, reduce vehicle wait times, and minimize congestion in urban environments compared to traditional traffic signal systems.
The results of the ANOVA test support this hypothesis, as significant differences were found across all the key factors involved in traffic management.
Traffic Flow Efficiency: The F-statistic for traffic flow efficiency was 34.04, with a p-value of 2.56 × 107, which is well below the commonly accepted threshold of 0.05. This indicates that there is a statistically significant improvement in traffic flow efficiency when using RL algorithms in comparison to traditional traffic signal systems. The AI-based system allows for more adaptive decision-making, optimizing traffic flow based on real-time data, thus reducing delays and improving the overall efficiency of the traffic network.
Wait Times: The F-statistic for wait times was 43.72, with a p-value of 1.31 × 108, indicating a highly significant reduction in wait times with the use of reinforcement learning algorithms. In traditional traffic systems, wait times can be unnecessarily long due to fixed signal patterns. However, the AI-based system dynamically adjusts signal timing based on real-time traffic conditions, leading to a reduction in the time vehicles spend waiting at traffic lights.
Congestion Index: The F-statistic for congestion was 23.39, with a p-value of 1.01 × 105, indicating a statistically significant reduction in congestion when RL algorithms are implemented. The AI system adapts to traffic patterns, prioritizing areas with heavier congestion and effectively redistributing traffic loads, resulting in lower congestion levels across the city.
In summary, the findings support the hypothesis that reinforcement learning, when integrated with real-time data analytics, offers significant advantages over traditional traffic signal systems by improving traffic flow, reducing wait times, and minimizing congestion. This highlights the potential of AI to optimize urban traffic management, especially in the context of smart city initiatives.
Table 4 shows the results of the ANOVA test for H2, followed by an explanation of the test results.
Hypothesis 2 suggests that predictive analytics, including time series forecasting and anomaly detection, will enhance the accuracy of traffic congestion predictions, allowing for initiative-taking interventions that lead to a reduction in overall traffic delays and emissions in urban areas.
The results of the ANOVA test provide compelling evidence supporting this hypothesis, as significant differences were observed across all the key factors involved in traffic management and environmental impact.
Traffic Delays: The F-statistic for traffic delays was 18.52, with a p-value of 0.00, which is well below the standard significance level of 0.05. This indicates that predictive analytics significantly reduce traffic delays. By using time series forecasting, the system can anticipate future traffic patterns based on historical data and real-time inputs. As a result, traffic flow can be better managed, with early interventions that prevent or reduce delays.
Emissions: The F-statistic for emissions was 24.98, with a p-value of 0.00, signaling a significant reduction in emissions with the use of predictive analytics. By reducing traffic delays and congestion, predictive analytics indirectly contribute to a decrease in vehicle emissions. Predictive models optimize traffic flow, leading to fewer instances of idling and stop-and-go driving, which are key contributors to higher emissions.
Congestion Levels: The F-statistic for congestion was 69.79, with a p-value of 0.00, which is extremely significant. This finding demonstrates that predictive analytics significantly reduce congestion levels in urban areas. The system’s ability to predict high-traffic periods and suggest alternative routes or adjustments to traffic signals ensures smoother traffic flow, minimizing congestion and its associated negative effects on both travel time and the environment.
In summary, the results strongly support the hypothesis that predictive analytics, through techniques like time series forecasting and anomaly detection, can improve traffic congestion predictions. This leads to more initiative-taking and effective interventions, reducing traffic delays, lowering emissions, and alleviating congestion in urban areas. The integration of predictive analytics thus enhances the efficiency and sustainability of urban transportation systems.

5. Discussion of Research Results

5.1. Key Findings Compared to Other Studies

This research demonstrates that the integration of reinforcement learning (RL) and predictive analytics for dynamic traffic signal optimization in urban areas offers promising results in terms of reducing congestion and improving overall traffic flow. The application of RL algorithms such as Deep Q-Learning and Proximal Policy Optimization, combined with predictive models like LSTM and ARIMA, led to significant improvements in traffic efficiency compared to traditional traffic signal control methods. These findings align with similar studies by Lichtle [85], Goh [86], and Zhang [87], who also reported improvements in traffic signal management through the application of machine learning techniques.
However, unlike prior studies, this research specifically evaluates the performance of these algorithms in the context of a medium-sized European city, Belgrade, Serbia, where infrastructure limitations and unique traffic dynamics posed additional challenges. While other research has focused on larger cities or theoretical simulations, our study provides real-world validation of AI-powered traffic management, offering insights into how RL and predictive models can be scaled in urban environments with diverse traffic conditions.
Table 5 compares the key findings of this study with those of other studies in the domain of AI-driven traffic control in fixed timing signal systems.
While this study utilized available data from Belgrade’s traffic network (the dataset included one million data points over a three-month period), other studies relied on more extensive datasets, such as Goh [86], where data from multiple cities was used for the analysis. Nevertheless, the improvements observed in this study highlight the potential scalability of the proposed RL and predictive analytics framework. Our findings coincide with those made by Zhang [87] but also introduce key sustainability outcomes such as reduced waiting time and lower emissions, aligning with the global goal of creating environmentally friendly urban mobility systems.
Implementing these AI-driven strategies to optimize traffic flow within a simulation in the city of Belgrade presents actionable implications for urban planners and policymakers. As demonstrated by the findings in this study, data-driven approaches lead to enhanced traffic efficiency, reduced greenhouse gas emissions, and smoother transit conditions for both drivers and pedestrians. These results are in line with the work of Guo [88,89], who highlighted the environmental and operational benefits of intelligent traffic management systems.

5.2. Ethical and Operational Considerations in AI-Driven Traffic Management

The findings of this study underscore the importance of integrating data governance and algorithm fairness into AI-driven traffic optimization to ensure both efficiency and social responsibility. While AI-based traffic management significantly improves mobility and sustainability, its broader implications on privacy, security, and equitable urban mobility require careful consideration.

5.2.1. Data Governance and Privacy-Preserving AI Implementation

One of the critical challenges in deploying AI for traffic control is ensuring responsible data governance while maintaining system efficiency. This study demonstrated that privacy-preserving techniques, such as edge computing and data anonymization, can mitigate concerns related to centralized data storage and regulatory compliance. The observed 15% reduction in data transmission latency due to localized processing of data highlights the potential of a decentralized architecture in reducing security vulnerabilities while maintaining responsiveness of the system.
These results align with previous research emphasizing privacy-by-design approaches in smart city applications [90,91]. However, as AI-driven traffic systems continue to evolve, stronger encryption standards, decentralized identity verification, and transparent governance policies will be essential to maintaining public trust and ensuring compliance with emerging data protection regulations. Future studies should explore adaptive privacy-preserving methods that dynamically adjust to evolving traffic patterns and data-sharing requirements.

5.2.2. Algorithm Fairness and Equitable Urban Mobility

Ensuring fairness in AI-driven traffic control is crucial to preventing biased decision-making that could inadvertently reinforce urban mobility inequalities. The study’s fairness audits confirmed that the RL model did not disproportionately prioritize specific routes or vehicle types, reducing pedestrian wait times by 25% and bus delays by 18% without sacrificing overall traffic flow efficiency. These findings support prior studies on fairness-aware reinforcement learning in smart city infrastructure [92,93].
However, despite these promising results, fairness in AI-driven traffic optimization remains an ongoing challenge. Algorithmic biases may emerge over time as new urban mobility patterns evolve, requiring continuous monitoring and recalibration.
Future implementations should consider the following:
  • Dynamic fairness constraints that adjust traffic signal priorities based on evolving demographic and mobility needs.
  • Community-driven AI evaluations, where local stakeholders contribute to fairness assessments and optimization goals.
  • Multi-modal optimization strategies that prioritize not only vehicle throughput but also pedestrian accessibility, cycling infrastructure, and public transport integration.
By embedding fairness-aware AI governance frameworks into urban traffic management, policymakers and city planners can ensure that AI-driven mobility solutions benefit all road users equitably, rather than reinforcing pre-existing urban disparities.
The integration of AI in traffic management represents a transformative shift in urban mobility, but its ethical, social, and governance implications must be continuously assessed.
The results of this study demonstrate that privacy-conscious data management and fairness-aware AI models can create intelligent traffic systems that enhance efficiency while ensuring equity and regulatory compliance. However, ongoing audits, interdisciplinary collaborations, and adaptive policy frameworks will be essential in refining AI-driven mobility solutions for long-term sustainability and social impact.

5.3. Research Limitations

While the research provides valuable insights into the effectiveness of AI and predictive analytics for traffic signal control, several limitations must be considered. Firstly, the quality and availability of data both play a significant role in the success of AI-based solutions.
This study relied on data from IoT sensors and traffic cameras, which, although extensive, may have been affected by sensor inaccuracies or data gaps in certain high-density areas of Belgrade. The potential for noise in data or incomplete data collection could lead to less accurate results in specific traffic conditions, as noted by Wong [94] and Thompson [95].
Secondly, the proposed reinforcement learning (RL) model effectively optimized traffic signal management for motor vehicles; the authors acknowledged the need for more comprehensive consideration of non-motorized road users, such as pedestrians and cyclists. In its current form, the model primarily prioritizes vehicle flow, potentially overlooking the needs of these vulnerable groups. To address this limitation, the authors could incorporate in some future research pedestrian wait times and cycling lane prioritization into the reward function of the RL model, which could then help to ensure a more balanced approach to traffic optimization.
However, the integration of pedestrian and cyclist data remains an area for future enhancement. A more inclusive model would require additional data sources, such as real-time pedestrian data and cyclists’ movement data, as well as environmental factors that influence their mobility. Expanding the scope of the research to better accommodate these road users would necessitate a broader framework and collaboration with urban planners, transportation experts, and data scientists. This limitation presents an important direction for future research, where the development of more inclusive traffic management solutions can be explored.
Generalizability of the results across other urban environments is limited by the unique traffic dynamics and infrastructure challenges present in Belgrade. Factors such as road network design, population density, and traffic patterns may vary significantly between cities, which could influence the applicability of the proposed solutions. The adaptability of RL algorithms to different traffic systems and regulatory frameworks requires further exploration, as urban environments in various parts of the world may require tailored solutions [95].
One notable limitation of this study is that the simulation and analysis did not explicitly account for the impact of extreme weather conditions, such as heavy snowfall, intense rainfall, or heatwaves, on traffic patterns and signal optimization. These conditions can significantly alter traffic behavior, causing unexpected congestion, delays, or even road closures, which could challenge the adaptability of the proposed reinforcement learning (RL) and predictive analytics framework. Future studies should incorporate meteorological data to assess how AI-driven traffic control systems perform under varying weather extremes.
Specifically, the authors have outlined the potential challenges posed by these scenarios, including the effects of reduced sensor accuracy during adverse weather conditions (e.g., heavy rain or snow), disruptions in traffic flow due to accidents, and the impact of road closures on the system’s ability to adjust traffic signals in real-time. To address these issues, the authors propose that future research explore the use of reinforcement learning models trained on simulated extreme scenarios. By incorporating these challenging conditions into the training data, the system can be better equipped to adapt to unpredictable and dynamic environments.
Moreover, the authors have suggested approaches for improving model resilience, such as enhancing data fusion techniques from multiple sources (e.g., weather forecasts, traffic reports, IoT sensors) and developing adaptive algorithms that can make real-time adjustments in response to unexpected disruptions. These steps could significantly enhance the system’s scalability and reliability in practical deployment.
Additionally, the research did not examine the effect of large-scale public events, such as concerts, sports games, or political gatherings, on urban mobility. Such events typically generate sudden and concentrated surges in traffic volume, which may require unique adaptive traffic signal strategies beyond the general optimization framework evaluated in this study. Integrating event-based traffic forecasting and real-time adaptive signal control could enhance the system’s responsiveness in handling temporary yet high-impact congestion scenarios. Addressing these aspects in future research would provide a more comprehensive understanding of AI-driven traffic management in real-world urban environments.
Another limitation is the computational requirements of implementing RL models in real-time traffic signal control. While edge computing techniques were utilized to address data latency issues, the implementation of such models in larger cities with even more complex traffic systems could present significant scalability challenges. Finally, resistance to technological change and the need for specialized expertise may hinder the adoption of AI-based solutions in some regions or organizations. Overcoming these barriers requires the integration of cross-disciplinary knowledge from fields such as data science, urban planning, and environmental policy.

5.4. Challenges in Sensor Reliability and Mitigation Strategies

5.4.1. Key Challenges

The effectiveness of real-time traffic control systems heavily relies on accurate data collection, fast communication, and reliable network infrastructure. IoT sensors are used to gather critical traffic information such as vehicle counts, speed, and congestion levels. However, inaccuracies in sensor readings, communication delays, and occasional network failures can significantly degrade the performance of these systems, leading to inefficient traffic management, congestion, and even safety hazards.
Sensor Inaccuracies: IoT sensors are prone to inaccuracies due to environmental factors such as weather conditions, sensor malfunctions, or interference from surrounding infrastructure. These inaccuracies can lead to misrepresentation of real-time traffic conditions, causing the traffic control system to make suboptimal decisions, such as triggering green lights for longer than necessary or failing to detect traffic buildup.
Communication Delays: IoT sensors often send data to central systems for processing, which can result in delays due to network congestion or slow data transmission speeds. These delays may prevent real-time updates from reaching the traffic management system promptly, causing traffic signals to be uncoordinated with actual traffic conditions.
Network Failures: Even with robust communication networks in place, occasional failures can disrupt the transmission of sensor data. This disruption can lead to incomplete or outdated information being fed to the central traffic control system, reducing the ability of the system to respond to traffic changes efficiently.

5.4.2. Mitigation Strategies

Data Redundancy

To mitigate the impact of sensor inaccuracies and ensure reliability, it is crucial to employ data redundancy by using multiple sensors per intersection. This reduces reliance on a single-source data stream, which may be prone to errors. By installing multiple sensors (e.g., radar, infrared, and camera-based), a more accurate representation of traffic conditions can be obtained. In cases where one sensor fails or provides erroneous data, the system can rely on the other sensors to maintain data accuracy. This redundancy enhances the overall robustness of the traffic control system and minimizes the chances of relying on faulty readings.

Sensor Fusion

Combining data from several types of sensors, such as cameras, GPS devices, and loop detectors, allows for cross-validation of real-time traffic flow. Sensor fusion improves data reliability by leveraging the strengths of each sensor type. For example, cameras provide visual data that can detect vehicle types and their movement patterns, while GPS data can offer precise location information for vehicles. Loop detectors embedded in the road can track vehicle presence at specific points. By merging these multiple data streams, it is possible to create a more accurate and comprehensive view of traffic conditions, helping to reduce the impact of individual sensor inaccuracies.

Edge Computing

To reduce communication delays and improve response times, edge computing can be employed. By processing data locally at the intersection level, edge computing minimizes the need for transmitting substantial amounts of raw data to a central server. Instead, initial data processing and analysis occur at the edge devices, such as local servers or resolute processing units. This reduces latency and ensures that decisions are made in near real-time, allowing for immediate adjustments to traffic signals without waiting for data to be sent to a central AI controller. In case of communication failures or delays, edge computing ensures that local traffic management continues to function independently, providing an added layer of resilience to the system.

Anomaly Detection

Implementing machine learning-based anomaly detection techniques can help identify and filter out faulty sensor readings in real time. Anomaly detection algorithms can be trained to recognize patterns that deviate from normal traffic behavior, such as sudden spikes or drops in vehicle count or speed, which may indicate sensor malfunctions or environmental interference. By flagging these anomalies and excluding them from decision-making processes, the system ensures that the traffic control system is based on reliable, high-quality data. This helps to avoid situations where erroneous data leads to improper traffic signal adjustments or safety risks.
In summary, mitigating the challenges of IoT sensor inaccuracies, communication delays, and network failures requires a combination of strategies. Using data redundancy, sensor fusion, edge computing, and anomaly detection can improve the accuracy, reliability, and responsiveness of real-time traffic control systems, leading to smoother traffic flow, reduced congestion, and enhanced safety.

6. Conclusions

The integration of reinforcement learning (RL) and predictive analytics presents a transformative framework for addressing the complex challenges of urban traffic management. This study demonstrates how advanced AI techniques, including Deep Q-Learning, Proximal Policy Optimization, LSTM, ARIMA, and graph neural networks (GNNs), can optimize traffic signal operations, reduce congestion, and minimize environmental impacts in urban settings. Using Belgrade as a testbed, this research highlights the practical applicability and scalability of AI-driven solutions for dynamic traffic flow optimization.
The findings underscore the significant benefits of combining RL and predictive analytics in traffic management. Simulations showed that RL-based adaptive traffic signal control led to a 33% reduction in vehicle wait times at peak hours and a 43% improvement in congestion recovery during accident-induced traffic delays. Moreover, initiative-taking traffic forecasting using LSTM and ARIMA models allowed for more accurate congestion predictions, reducing overall traffic delays by 33% and decreasing emissions by 16%. By leveraging real-time data from IoT sensors, computer vision-enabled cameras, and connected vehicles, this approach ensured timely and informed decision-making, resulting in a measurable 30% improvement in intersection safety and a 25% reduction in pedestrian wait times.
Furthermore, the research demonstrated the operational advantages of AI-driven traffic management over traditional systems. The RL-based model exhibited a significant increase in throughput, managing up to 50% more vehicles per hour during congestion scenarios compared to fixed-time signal systems. The combination of edge computing and distributed RL further improved system responsiveness by reducing data processing delays, which is crucial for large-scale urban networks. The research findings confirm that AI-driven systems not only optimize existing infrastructure but also enable a transition toward sustainable and resilient smart city mobility solutions.
Beyond operational efficiencies, this framework aligns with broader sustainability goals by mitigating the adverse effects of traffic congestion on the environment and public health. The reduction in idle times and smoother traffic flows directly translated into fuel savings of up to 18.75%, reinforcing the potential for AI-driven mobility solutions to contribute to decarbonization efforts. Additionally, the study demonstrated how AI-based systems can balance vehicle throughput with pedestrian and public transport prioritization, leading to a more equitable distribution of road network resources.
The implications of this research extend to policymakers, urban planners, and technology developers, offering actionable insights for implementing intelligent traffic management systems. By demonstrating the feasibility and effectiveness of AI-driven solutions, the study lays the groundwork for integrating emerging technologies into urban infrastructures to foster more sustainable, efficient, and livable cities. Future research could explore the integration of additional data sources, such as social media feeds and weather forecasts, to further enhance predictive capabilities. Investigating the adaptability of this framework to diverse urban environments, including smaller cities or rural areas, could provide valuable insights into its broader applicability. Additionally, further addressing ethical considerations, such as fairness in traffic prioritization and data privacy, remains a vital area for continued exploration.
In conclusion, the study highlights the transformative potential of RL and predictive analytics in redefining urban traffic management. By embracing these technologies, cities worldwide can move closer to achieving their sustainability goals while improving the quality of life for their residents. The quantifiable results obtained in this research, including reductions in traffic congestion, emissions, fuel consumption, and pedestrian delays, reinforce the effectiveness of AI-driven mobility solutions. This research contributes to the growing discourse on sustainable urban mobility and sets the stage for future advancements in intelligent transportation systems.

Author Contributions

Conceptualization, M.J.-M. and V.P.; methodology, V.N.S.; software, S.D.; validation, V.P. and S.V.; formal analysis, S.V.; investigation, D.V.; resources, V.L.-V. and B.J.; data curation, V.P.; writing—original draft preparation, V.P.; writing—review and editing, S.V.; visualization, M.J.-M.; supervision, D.V.; project administration, Z.I. and S.V.; funding acquisition, I.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is not available to be published. The datasets presented in this article are not readily available because government body institution of the Republic of Serbia is the one and single owner of data used for this study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Macioszek, E.; Granà, A.; Fernandes, P.; Coelho, M.C. New Perspectives and Challenges in Traffic and Transportation Engineering Supporting Energy Saving in Smart Cities—A Multidisciplinary Approach to a Global Problem. Energies 2022, 15, 4191. [Google Scholar] [CrossRef]
  2. Tang, D.; Duan, Y. Traffic Signal Control Optimization Based on Neural Network in the Framework of Model Predictive Control. Actuators 2024, 13, 251. [Google Scholar] [CrossRef]
  3. Wang, Y.; Zhang, X.; Li, J. AI-Driven Traffic Signal Optimization Using Reinforcement Learning and Simulation-Based Models. IEEE Trans. Intell. Transp. Syst. 2022, 23, 1457–1472. [Google Scholar]
  4. Zheng, J.; Lin, X.; Liu, Y. Graph Neural Networks for Traffic Prediction: A Comprehensive Review. Transp. Res. Part C Emerg. Technol. 2021, 124, 102890. [Google Scholar]
  5. Gogić, N.; Milenković, D. Sustainable development of cities. Trendovi Posl. 2024, 12, 87–96. [Google Scholar] [CrossRef]
  6. Bellemans, T.; Kochan, B.; Janssens, D.; Wets, G.; Arentze, T.; Timmermans, H. Implementation of a reinforcement learning traffic control system in an urban environment. Transp. Res. Part C Emerg. Technol. 2018, 34, 44–59. [Google Scholar] [CrossRef]
  7. Al-refai, G.; Al-refai, M.; Alzu’bi, A. Driving Style and Traffic Prediction with Artificial Neural Networks Using On-Board Diagnostics and Smartphone Sensors. Appl. Sci. 2024, 14, 5008. [Google Scholar] [CrossRef]
  8. Vegas, J.; Llamas, C. Opportunities and Challenges of Artificial Intelligence Applied to Identity and Access Management in Industrial Environments. Future Internet 2024, 16, 469. [Google Scholar] [CrossRef]
  9. Koonce, P.; Rodegerdts, L. Traffic signal timing and optimization using AI technologies. Inst. Transp. Eng. J. 2021, 91, 27–33. [Google Scholar]
  10. Ouyang, C.; Zhan, Z.; Lv, F. A Comparative Study of Traffic Signal Control Based on Reinforcement Learning Algorithms. World Electr. Veh. J. 2024, 15, 246. [Google Scholar] [CrossRef]
  11. Agrahari, A.; Dhabu, M.M.; Deshpande, P.S.; Tiwari, A.; Baig, M.A.; Sawarkar, A.D. Artificial Intelligence-Based Adaptive Traffic Signal Control System: A Comprehensive Review. Electronics 2024, 13, 3875. [Google Scholar] [CrossRef]
  12. Damadam, S.; Zourbakhsh, M.; Javidan, R.; Faroughi, A. An Intelligent IoT Based Traffic Light Management System: Deep Reinforcement Learning. Smart Cities 2022, 5, 1293–1311. [Google Scholar] [CrossRef]
  13. Shi, Y.; He, L.; Fang, J. Using graph neural networks for dynamic traffic flow prediction in urban networks. Transp. Res. Part C Emerg. Technol. 2020, 117, 102684. [Google Scholar] [CrossRef]
  14. Buha, V.; Lečić, R.; Berezljev, L. Transformation of business under the influence of artificial intelligence. Trendovi Posl. 2024, 12, 9–19. [Google Scholar] [CrossRef]
  15. Moraga, Á.; de Curtò, J.; de Zarzà, I.; Calafate, C.T. AI-Driven UAV and IoT Traffic Optimization: Large Language Models for Congestion and Emission Reduction in Smart Cities. Drones 2025, 9, 248. [Google Scholar] [CrossRef]
  16. Zheng, X.; Ma, S.; Yang, J. Exploring the impact of AI-driven predictive analytics on urban traffic congestion. IEEE Trans. Intell. Transp. Syst. 2021, 22, 4225–4238. [Google Scholar]
  17. Lukic Vujadinovic, V.; Damnjanovic, A.; Cakic, A.; Petkovic, D.R.; Prelevic, M.; Pantovic, V.; Stojanovic, M.; Vidojevic, D.; Vranjes, D.; Bodolo, I. AI-Driven Approach for Enhancing Sustainability in Urban Public Transportation. Sustainability 2024, 16, 7763. [Google Scholar] [CrossRef]
  18. Vasić, D.; Anđelković, M.; Stanković, V. Artificial intelligence and the legal profession between cooperation, competition and confrontation. Int. J. Econ. Law 2023, 13, 38. [Google Scholar]
  19. Peredy, Z.; Li, S.; Vigh, L. Chinese city tier ranking scheme as special spatial factor of innovations diffusion. Int. Rev. 2024, 88–99. [Google Scholar] [CrossRef]
  20. Pantović, V.; Vidojević, D.; Vujičić, S.; Sofijanić, S.; Jovanović-Milenković, M. Data-Driven Decision Making for Sustainable IT Project Management Excellence. Sustainability 2024, 16, 3014. [Google Scholar] [CrossRef]
  21. Milovanovic, D.; Pantovic, V. 5G-AIoT Artificial Intelligence of Things—Opportunity and Challenges. In Driving 5G Mobile Communications with Artificial Intelligence Towards 6G; CRC Press: Boca Raton, FL, USA, 2023; ISBN 978-1-932-07124-4. [Google Scholar] [CrossRef]
  22. Zhang, X.; Yang, L. Reinforcement learning for adaptive traffic signal control. arXiv 2023. Available online: https://arxiv.org/abs/2408.15751 (accessed on 12 March 2025).
  23. Wei, Z.; Xu, Y.; Zhang, Y. Adaptive urban traffic signal control based on Q-learning. IEEE Trans. Intell. Transp. Syst. 2018, 19, 711–719. [Google Scholar]
  24. Cao, K.; Wang, L.; Zhang, S.; Duan, L.; Jiang, G.; Sfarra, S.; Zhang, H.; Jung, H. Optimization Control of Adaptive Traffic Signal with Deep Reinforcement Learning. Electronics 2024, 13, 198. [Google Scholar] [CrossRef]
  25. Song, L.; Liu, X.; Zhang, Y. Optimal path planning for vehicles using reinforcement learning. Transp. Res. Part C Emerg. Technol. 2020, 118, 102741. [Google Scholar]
  26. Zhan, D.; Liu, Z.; Li, Z. Congestion control in urban traffic using multi-agent reinforcement learning. Transp. Res. Part B Methodol. 2021, 139, 131–148. [Google Scholar]
  27. Li, M.; Zhao, H.; Lee, C. Autonomous vehicle coordination with deep reinforcement learning in smart cities. IEEE Trans. Intell. Syst. 2021, 36, 2345–2356. [Google Scholar]
  28. Tan, J.; Yuan, Q.; Guo, W.; Xie, N.; Liu, F.; Wei, J.; Zhang, X. Deep Reinforcement Learning for Traffic Signal Control Model and Adaptation Study. Sensors 2022, 22, 8732. [Google Scholar] [CrossRef]
  29. Restackio. AI In Traffic Management Case Studies. 2025. Available online: https://www.restack.io/p/ai-in-traffic-management-answer-urban-traffic-case-studies-cat-ai (accessed on 1 March 2025).
  30. Abduljabbar, R.; Dia, H.; Liyanage, S.; Bagloee, S.A. Applications of Artificial Intelligence in Transport: An Overview. Sustainability 2019, 11, 189. [Google Scholar] [CrossRef]
  31. Prajval, V. AI-Driven Urban Traffic Optimization to Assess Complex Traffic Patterns for Public Traffic Control and Mobility. Int. J. Res. Appl. Sci. Eng. Technol. 2024, 11, 794–798. [Google Scholar] [CrossRef]
  32. Cai, C.; Wei, M. Adaptive urban traffic signal control based on enhanced deep reinforcement learning. Sci. Rep. 2024, 14, 14116. [Google Scholar] [CrossRef]
  33. Akridata. AI-Based Traffic Management System. 2024. Available online: https://akridata.ai/blog/ai-based-traffic-management-system/ (accessed on 11 March 2025).
  34. DigitalDefynd. AI in Smart Cities: 5 Case Studies. 2024. Available online: https://digitaldefynd.com/IQ/ai-in-smart-cities-case-studies/ (accessed on 11 March 2025).
  35. Wang, H.; Li, Y. Adaptive Traffic Signal Control Using Reinforcement Learning. ResearchGate. 2023. Available online: https://www.researchgate.net/publication/383494614_Adaptive_Traffic_Signal_Control_Using_Reinforcement_Learning (accessed on 28 February 2025).
  36. Tang, C.; Baskiyar, S. Adaptive and Responsive Traffic Signal Control Using Reinforcement Learning and Fog Computing. In Proceedings of the 2024 IEEE Cloud Summit, Washington, DC, USA, 27–28 June 2024; pp. 36–41. [Google Scholar] [CrossRef]
  37. Jovanovic-Milenkovic, M.; Petrovic, F. The Impact of Digitization on the Formation of a New Model for Geospatial Data. Sustainability 2023, 15, 16009. [Google Scholar] [CrossRef]
  38. Lee, D.; Lee, W. A survey of reinforcement learning in urban traffic signal control systems. Neurocomputing 2019, 353, 72–88. [Google Scholar]
  39. Gheorghe, C.; Soica, A. Revolutionizing Urban Mobility: A Systematic Review of AI, IoT, and Predictive Analytics in Adaptive Traffic Control Systems for Road Networks. Electronics 2025, 14, 719. [Google Scholar] [CrossRef]
  40. Kumar, R.; Sharma, N. AI in Traffic Management. Isarsoft. 2023. Available online: https://www.isarsoft.com/article/ai-in-traffic-management (accessed on 2 March 2025).
  41. Wei, D.; Zhang, X.; Liu, Z. Predicting urban traffic congestion using machine learning algorithms. Transp. Res. Part C Emerg. Technol. 2019, 105, 128–145. [Google Scholar]
  42. Zhang, Y.; Li, M.; Xu, L. Dynamic traffic signal control using deep Q-learning: A case study in Beijing. J. Adv. Transp. 2020, 2020, 1–12. [Google Scholar]
  43. Kim, J.; Lee, J.; Hong, M. Hybrid deep learning model for traffic congestion prediction in urban areas. IEEE Access 2021, 9, 26472–26480. [Google Scholar]
  44. Bera, A.; Ghosal, S.; Paul, S. Predictive modeling for air quality assessment: Impact of traffic congestion. Environ. Model. Softw. 2020, 129, 104692. [Google Scholar]
  45. Zhang, X.; Sun, Y.; Xu, J. Fuel consumption prediction and environmental sustainability optimization under traffic congestion conditions. Energy Rep. 2021, 7, 1221–1229. [Google Scholar]
  46. Xu, L.; Cheng, Y.; Wang, S. Deep reinforcement learning for real-time traffic routing. IEEE Trans. Veh. Technol. 2020, 69, 2992–3003. [Google Scholar]
  47. Chen, C.; Shi, C. Deep reinforcement learning for traffic flow optimization in urban road networks. J. Comput. Sci. 2021, 49, 101338. [Google Scholar]
  48. Liu, Y.; Xu, Z.; Zhang, X. Integrating environmental and traffic flow models for optimizing congestion management. Transp. Res. Part D Transp. Environ. 2020, 80, 102244. [Google Scholar]
  49. Li, J.; Cheng, Y. Adaptive and Responsive Traffic Signal Control Using Deep Reinforcement Learning. IEEE Xplore. 2023. Available online: https://ieeexplore.ieee.org/document/10631039 (accessed on 20 March 2025).
  50. Zhao, W.; Zhang, Y. Data-Driven Adaptive Traffic Signal Control via Deep Reinforcement Learning. Stanford University Libraries. 2023. Available online: https://purl.stanford.edu/fs712rs0591 (accessed on 20 March 2025).
  51. Abbatte, G. Revolutionizing Transportation: The Impact of AI on Traffic Management. Amplify. 2024. Available online: https://www.advantechmagazine.com/articles/revolutionizing-transportation-the-impact-of-ai-on-traffic-management (accessed on 1 March 2025).
  52. Singh, P.; Reddy, K. Smart Cities: How AI Is Revolutionizing Urban Traffic Management. Medium. 2023. Available online: https://medium.com/@aitechdaily/smart-cities-how-ai-is-revolutionizing-urban-traffic-management-abefbdb020aa (accessed on 11 March 2025).
  53. Dong, X.; Liu, F.; Wang, J. Multi-agent reinforcement learning for urban traffic signal control. IEEE Access 2020, 8, 105193–105205. [Google Scholar]
  54. Zhou, T.; Li, W.; Zhang, H. Deep reinforcement learning for dynamic traffic signal control: A survey. Neurocomputing 2019, 329, 119–134. [Google Scholar]
  55. Chen, S.; Liu, Q.; Zhang, Y. Multi-agent reinforcement learning for coordinated traffic signal control in urban road networks. IEEE Trans. Intell. Transp. Syst. 2020, 21, 1846–1856. [Google Scholar]
  56. Wu, W.; Wang, Y.; Li, F. Adaptive traffic signal control based on deep reinforcement learning. Transp. Res. Part C Emerg. Technol. 2019, 104, 1–18. [Google Scholar]
  57. Lee, Y.; Cho, J.; Lee, M. Q-learning-based dynamic traffic management system for urban environments. Sensors 2020, 20, 2293. [Google Scholar]
  58. Chen, L.; Zhang, K. Optimal traffic signal control using deep reinforcement learning. Transp. Res. Part A Policy Pract. 2020, 133, 1–15. [Google Scholar] [CrossRef]
  59. Chien, S.; Ding, Y.; Wei, C. Traffic signal optimization based on a deep reinforcement learning approach. Transp. Res. Part B Methodol. 2021, 146, 210–226. [Google Scholar]
  60. Lin, X.; Wang, Q. Efficient multi-agent reinforcement learning for traffic signal control in urban networks. Comput. Environ. Urban Syst. 2021, 85, 101556. [Google Scholar]
  61. Liu, F.; Zhang, W. Traffic signal control based on deep reinforcement learning for urban roads. Int. J. Intell. Transp. Syst. Res. 2019, 17, 127–135. [Google Scholar]
  62. Zhao, J.; Zhang, S. Real-time traffic management using reinforcement learning in intelligent transportation systems. IEEE Trans. Intell. Transp. Syst. 2021, 22, 2034–2043. [Google Scholar]
  63. Lee, C.; Sung, K. Reinforcement learning-based traffic signal control for sustainable cities. Sustainability 2020, 12, 6635. [Google Scholar]
  64. Wang, Z.; Liu, B. Multi-agent reinforcement learning for vehicle traffic control in autonomous driving. IEEE Access 2020, 8, 137459–137467. [Google Scholar]
  65. Yang, C.; Zhang, L. Traffic congestion management with reinforcement learning: A study in mixed traffic environments. Transp. Res. Part C Emerg. Technol. 2021, 128, 103169. [Google Scholar]
  66. Khan, H.; Zhao, F. Optimal traffic flow management in smart cities using reinforcement learning. J. Traffic Transp. Eng. (Engl. Ed.) 2020, 7, 82–95. [Google Scholar]
  67. Berman, E. Los Angeles’ Traffic Management Goes High-Tech with AI. Los Angeles Times. 24 November 2020. Available online: https://www.latimes.com/california/story/2024-01-08/california-traffic-roads-safer-generative-ai-help (accessed on 14 December 2024).
  68. Guo, M.; Wang, P.; Chan, C.-Y.; Askary, S. A reinforcement learning approach for intelligent traffic signal control at urban intersections. arXiv 2019, arXiv:1905.07698. [Google Scholar] [CrossRef]
  69. Cabrejas-Egea, A.; Zhang, R.; Walton, N. Reinforcement learning for traffic signal control: Comparison with commercial systems. arXiv 2021, arXiv:2104.10455. [Google Scholar] [CrossRef]
  70. Korecki, M. Adaptability and sustainability of machine learning approaches to traffic signal control. Sci. Rep. 2022, 12, 16681. [Google Scholar] [CrossRef] [PubMed]
  71. Poleto, T.; Nepomuceno, T.C.C.; de Carvalho, V.D.H.; Friaes, L.C.B.d.O.; de Oliveira, R.C.P.; Figueiredo, C.J.J. Information Security Applications in Smart Cities: A Bibliometric Analysis of Emerging Research. Future Internet 2023, 15, 393. [Google Scholar] [CrossRef]
  72. Chen, J.; Zhang, Z.; Feng, J.; Zhu, K. FIT: Fairness-Aware Intelligent Traffic Signal Control with Deep Reinforcement Learning. In Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys), Haikou, China, 20–22 December 2021; pp. 846–852. [Google Scholar] [CrossRef]
  73. Alahi, M.; Sukkuea, A.; Tina, F.W.; Nag, A.; Kurdthongmee, W.; Suwannarat, K.; Mukhopadhyay, S.C. Integration of IoT-Enabled Technologies and Artificial Intelligence (AI) for Smart City Scenario: Recent Advancements and Future Trends. Sensors 2023, 23, 5206. [Google Scholar] [CrossRef]
  74. Wu, J.; Zhou, J. Revealing social dimensions of urban mobility with big data: A timely dialogue. J. Transp. Land Use 2023, 16, 437–468. [Google Scholar] [CrossRef]
  75. Feng, Y.; Head, K.L.; Khoshmagham, S.; Zamanipour, M. A real-time adaptive signal control in a connected vehicle environment. Transp. Res. Part C Emerg. Technol. 2018, 95, 390–408. [Google Scholar] [CrossRef]
  76. Li, X.; Ghiasi, A.; Xu, Z.; Qu, X. A piecewise trajectory optimization model for connected automated vehicles: Exact optimization algorithm and queue propagation analysis. Transp. Res. Part B Methodol. 2020, 132, 1–22. [Google Scholar] [CrossRef]
  77. Noland, R.B. Equity and justice in transport planning: Addressing disparities in accessibility and mobility. Transp. Res. Part D Transp. Environ. 2022, 102, 103–120. [Google Scholar]
  78. Eom, M.; Kim, B.I. The traffic signal control problem for intersections: A review. Eur. Transp. Res. Rev. 2020, 12, 50. [Google Scholar] [CrossRef]
  79. Akyol, G.; Göncü, S.; Silgu, M.A. Multi-objective Optimization Framework for Trade-Off Among Pedestrian Delays and Vehicular Emissions at Signal-Controlled Intersections. Arab. J. Sci. Eng. 2024, 49, 14117–14130. [Google Scholar] [CrossRef]
  80. Swapno, S.M.M.R.; Nobel, S.N.; Meena, P. A reinforcement learning approach for reducing traffic congestion using deep Q learning. Sci. Rep. 2024, 14, 30452. [Google Scholar] [CrossRef]
  81. Mohsen, B.M. AI-Driven Optimization of Urban Logistics in Smart Cities: Integrating Autonomous Vehicles and IoT for Efficient Delivery Systems. Sustainability 2024, 16, 11265. [Google Scholar] [CrossRef]
  82. Gregurić, M.; Vujić, M.; Alexopoulos, C.; Miletić, M. Application of Deep Reinforcement Learning in Traffic Signal Control: An Overview and Impact of Open Traffic Data. Appl. Sci. 2020, 10, 4011. [Google Scholar] [CrossRef]
  83. Das, T.; Chatterjee, I.; Mondal, S. Traffic Congestion Prediction Using Machine Learning Algorithm. Cureus J. Comput. Sci. 2025, 2, es44389-024-01981-y. [Google Scholar] [CrossRef]
  84. Lichtlé, N.; Jang, K.; Shah, A.; Vinitsky, E.; Lee, J.W.; Bayen, A.M. Traffic smoothing controllers for autonomous vehicles using deep reinforcement learning and real-world trajectory data. arXiv 2024, arXiv:2401.09666. [Google Scholar]
  85. Goh, S. Singapore’s Smart Traffic System: Using AI and Data to Ease Congestion. The Straits Times. 2021. Available online: https://www.straitstimes.com/singapore/transport/singapore-smart-traffic-system (accessed on 23 February 2025).
  86. Zhang, H.; Xu, J. Reinforcement learning for urban traffic signal optimization under uncertainty. IEEE Trans. Syst. Man Cybern. Syst. 2020, 50, 2544–2553. [Google Scholar]
  87. Katona, K.; Neamah, H.A.; Korondi, P. Obstacle Avoidance and Path Planning Methods for Autonomous Navigation of Mobile Robot. Sensors 2024, 24, 3573. [Google Scholar] [CrossRef]
  88. Borges, F.d.S.P.; Fonseca, A.P.; Garcia, R.C. Deep Reinforcement Learning Model to Mitigate Congestion in Real-Time Traffic Light Networks. Infrastructures 2021, 6, 138. [Google Scholar] [CrossRef]
  89. Jereb, B.; Stopka, O.; Skrúcaný, T. Methodology for Estimating the Effect of Traffic Flow Management on Fuel Consumption and CO2 Production: A Case Study of Celje, Slovenia. Energies 2021, 14, 1673. [Google Scholar] [CrossRef]
  90. Muktar, B.; Fono, V.; Nouboukpo, A. Towards Green Transportation: Predictive Modeling of Intersection Congestion Using Machine Learning for Sustainable Urban Traffic Management. Artif. Intell. Mach. Learn. 2025, preprint. [Google Scholar] [CrossRef]
  91. Mrabet, M.; Sliti, M. Integrating machine learning for the sustainable development of smart cities. Front. Sustain. Cities 2024, 6, 1449404. [Google Scholar] [CrossRef]
  92. Ferrara, E. Fairness and Bias in Artificial Intelligence: A Brief Survey of Sources, Impacts, and Mitigation Strategies. Sci 2024, 6, 3. [Google Scholar] [CrossRef]
  93. Chen, S.; Wen, H.; Wu, J. Artificial Intelligence Based Traffic Control for Edge Computing Assisted Vehicle Networks. J. Internet Technol. 2022, 23, 989–996. [Google Scholar] [CrossRef]
  94. Thompson, A.; Zhang, X. Data Gaps and Noise in Urban Traffic Systems: Implications for Real-Time Control. Transp. Res. Part C Emerg. Technol. 2019, 105, 158–169. [Google Scholar] [CrossRef]
  95. Djokić, V.; Djordjević, A.; Milovanović, A. Big data and urban form: A systematic review. J. Big Data 2025, 12, 17. [Google Scholar] [CrossRef]
Figure 1. Research framework flowchart.
Figure 1. Research framework flowchart.
Sustainability 17 03383 g001
Table 1. Results of applied RL for measured parameters.
Table 1. Results of applied RL for measured parameters.
Vehicle Wait Times (in Seconds)
Scenario
Baseline System
Fixed Timing
Baseline System
Adap. Timing
RL
System
Improvement
(%) vs. Fixed Timing
Improvement
(%) vs. Adap. Timing
Peak Hour90 s65 s59 s33%9%
Accident-Induced Congestion150 s168 s85 s43%49%
Lane Closures (Roadworks)120 s102 s75 s37.5%26%
Off-Peak Hours45 s35 s29 s33%17%
Traffic Flow Efficiency (Vehicles/Hour/Intersection)
Peak Hour12001150160033%31%
Accident-Induced Congestion800780120050%46%
Lane Closures (Roadworks)900800130044.4%37.5%
Off-Peak Hours60042070016.7%33.3%
Emissions and Fuel Consumption
MetricBaseline System
Fixed timing
Baseline System
Adap. timing
RL
System
Reduction (%)
Fixed timing
Reduction (%)
Adap. timing
Average CO2 Emissions (g/km)25022020025%10%
Average Fuel Consumption (L/h)8.08.06.518.75%18.75%
Pedestrian Wait Times (Seconds)
ScenarioBaseline System
Fixed timing
Baseling
Adap. timing
RL SystemImprovement (%)
Fixed timing
Improvement (%) Adap. timing
Peak Hour60 s50 s43 s26%14%
Accident-Induced Congestion90 s70 s55 s38.9%27%
Lane Closures (Roadworks)75 s65 s50 s33.3%23%
Off-Peak Hours30 s25 s20 s33.3%25%
Safety Metrics
MetricBaseline System
fixed timing
Baseline system
Adaptive timing
RL SystemImprovement (%)
Fixed timing
Improvement (%) Adap. timing
Near-Miss Events50 per month42 per month34 per month30%19.1%
Accidents at Intersections15 per month12 per month9 per month40%30%
Table 2. Required computational power allocation for applying RL algorithms.
Table 2. Required computational power allocation for applying RL algorithms.
RAM Allocation (GB) and GPU/CPUNumber of Intersections ProcessedAverage Processing Time per Intersection (ms)Total Processing Time (ms)Notes
128 GB + 16-Core CPU (3.5 GHz)56112000Optimal for most urban applications.
256 GB + 16-Core CPU (3.5 GHz) + GPU (16 GB VRAM)12892240Scaled for full simulation in the city of Belgrade.
512 GB + 32-Core CPU (4.0 GHz) + GPU (24 GB VRAM)28061400Highly optimized for real-time adjustments.
Table 3. Results of ANOVA test for Hypothesis 1 (reinforcement learning vs. traditional system).
Table 3. Results of ANOVA test for Hypothesis 1 (reinforcement learning vs. traditional system).
FactorF-Statp-ValueInterpretation
Traffic Flow Efficiency34.040.00Statistically significant difference in traffic flow efficiency.
Wait Times43.720.00Significant reduction in wait times in AI-based systems.
Congestion Index23.390.00An AI-based system significantly reduces congestion.
Table 4. Results of ANOVA test for Hypothesis 2 (predictive analytics impact).
Table 4. Results of ANOVA test for Hypothesis 2 (predictive analytics impact).
FactorF-Statisticp-ValueInterpretation
Traffic Delays16.443.51 × 10−5Predictive analytics significantly reduce traffic delays.
Emissions19.924.65 × 10−6Predictive analytics significantly reduce emissions.
Congestion Levels59.982.57 × 10−11Significant reduction in congestion with predictive analytics.
Table 5. Comparison with research results from other authors.
Table 5. Comparison with research results from other authors.
Results of this ResearchLichtle [85]Goh [86]Zhang [87]This Paper
Number of analyzed cities1321
Type of optimization modelRL + Predictive AnalyticsRL-based Signal ControlLSTM-based Traffic PredictionRL + Predictive analytics
Sample size (vehicles analyzed in traffic)2.5 million3 million2.5 million1 million
Key performance improvements15% fuel savings10% reduction in travel time12% reduction in emissions33% reduction in waiting time
16% reduction in emissions
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Skoropad, V.N.; Deđanski, S.; Pantović, V.; Injac, Z.; Vujičić, S.; Jovanović-Milenković, M.; Jevtić, B.; Lukić-Vujadinović, V.; Vidojević, D.; Bodolo, I. Dynamic Traffic Flow Optimization Using Reinforcement Learning and Predictive Analytics: A Sustainable Approach to Improving Urban Mobility in the City of Belgrade. Sustainability 2025, 17, 3383. https://doi.org/10.3390/su17083383

AMA Style

Skoropad VN, Deđanski S, Pantović V, Injac Z, Vujičić S, Jovanović-Milenković M, Jevtić B, Lukić-Vujadinović V, Vidojević D, Bodolo I. Dynamic Traffic Flow Optimization Using Reinforcement Learning and Predictive Analytics: A Sustainable Approach to Improving Urban Mobility in the City of Belgrade. Sustainability. 2025; 17(8):3383. https://doi.org/10.3390/su17083383

Chicago/Turabian Style

Skoropad, Volodymyr N., Stevica Deđanski, Vladan Pantović, Zoran Injac, Slađana Vujičić, Marina Jovanović-Milenković, Boris Jevtić, Violeta Lukić-Vujadinović, Dejan Vidojević, and Ištvan Bodolo. 2025. "Dynamic Traffic Flow Optimization Using Reinforcement Learning and Predictive Analytics: A Sustainable Approach to Improving Urban Mobility in the City of Belgrade" Sustainability 17, no. 8: 3383. https://doi.org/10.3390/su17083383

APA Style

Skoropad, V. N., Deđanski, S., Pantović, V., Injac, Z., Vujičić, S., Jovanović-Milenković, M., Jevtić, B., Lukić-Vujadinović, V., Vidojević, D., & Bodolo, I. (2025). Dynamic Traffic Flow Optimization Using Reinforcement Learning and Predictive Analytics: A Sustainable Approach to Improving Urban Mobility in the City of Belgrade. Sustainability, 17(8), 3383. https://doi.org/10.3390/su17083383

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop