A Division-of-Labour Approach to Traffic Light Scheduling

Raubenheimer, Hendrik; Engelbrecht, Andries

doi:10.3390/app14178022


by Hendrik Raubenheimer ^1,† and Andries Engelbrecht ^1,2,3,*,†

¹ Computer Science Division, Stellenbosch University, Stellenbosch 7600, South Africa
² Department of Industrial Engineering, Stellenbosch University, Stellenbosch 7600, South Africa
³ Center for Applied Mathematics and Bioinformatics, Gulf University for Science and Technology, Mubarak Al-Abdullah 32093, Kuwait
* Author to whom correspondence should be addressed.
† These authors contributed equally to this work.

Appl. Sci. 2024, 14(17), 8022; https://doi.org/10.3390/app14178022

Submission received: 15 March 2024 / Revised: 5 August 2024 / Accepted: 15 August 2024 / Published: 7 September 2024

(This article belongs to the Special Issue Exploration and Application of Swarm Intelligence and Evolutionary Computation)


Featured Application

Traffic light scheduling optimisation using a task allocation model based on division-of-labour behaviour as observed in insect colonies.

Abstract

Traffic light scheduling is a critical aspect of traffic management, and many recently developed solutions incorporate computational intelligence approaches. This paper presents a traffic light scheduling algorithm based on a task allocation model that simulates the division of labour among insects in a colony, specifically ant colonies. The developed algorithm switches the green light based on a probability calculated every second from the traffic volume around the traffic light. Applied to several benchmark simulated traffic scenarios, the algorithm performs competitively against five other traffic scheduling algorithms.

Keywords: division of labour; task allocation; traffic light scheduling; reinforcement learning

1. Introduction

Much of the productivity of modern society has become dependent on transport. The efficient management of traffic plays a particularly important role in the transport industry and, to an extent, society. Poor traffic management can cause accidents, and productivity is reduced by the resulting congestion. A study in Ghana found that traffic congestion reduced worker productivity by 9% per workday [1]. It is also estimated that 27% of global warming has been caused by transportation [2].

One of the most important aspects of traffic management, especially in urban areas, is traffic light scheduling. Many traffic light scheduling algorithms have been developed, as reviewed in [3,4,5], and newly developed algorithms tend to incorporate computational intelligence approaches to accommodate the dynamic nature of traffic [6,7,8,9]. Despite the availability of smart traffic light scheduling algorithms, static and deterministic timing schedules that do not take the dynamic nature of this optimisation problem into consideration are still in use. This is mainly due to the ease of implementation of these static approaches and their low demand for software and hardware resources.

This paper presents a flexible and efficient model based on the division of labour within ant colonies [10] for the traffic light scheduling problem. The dynamic self-organising behaviour seen in the division of labour in insect colonies makes models of division of labour for task allocation an ideal approach to cope with the dynamic nature of the traffic light scheduling problem. In addition, the task allocation models of division of labour are very simple to understand and implement. The proposed traffic scheduling algorithm is applied to eight different traffic scenarios. The effectiveness of the algorithm is assessed by comparing the performance of this algorithm to the performances of five other traffic scheduling algorithms. The developed division-of-labour algorithm and another dynamic algorithm, the max pressure algorithm [11], generally performed best across these scenarios.

The remainder of the paper is structured as follows: Section 2 provides background on traffic signal control and division of labour in insect colonies. The division-of-labour approach to traffic light scheduling is proposed in Section 3. Section 4 describes the empirical procedure followed to analyse the performance of the division-of-labour traffic light scheduling model. The results are presented and discussed in Section 5, and Section 6 concludes the paper.

2. Background

Section 2.1 provides background on traffic signal control and existing solutions. Division of labour in insect colonies is discussed in Section 2.2.

2.1. Traffic Signal Control

Traffic light control consists of showing a combination of signals that allow certain pre-defined movements of vehicles at traffic intersections from incoming lanes to outgoing lanes for a chosen period of time and disallowing other movements. A phase refers to a specific combination of movement signals that allows nonconflicting traffic movements. A cycle refers to a cyclic sequence of all possible phases of an intersection.

The simplest approach to traffic light control is the static controller, which allocates the same fixed amount of time to every phase in a cycle. This approach is stable but generally not efficient because the fixed timings prevent the controller from adapting to changing traffic distributions.

The Webster method [5] is a widely used adaptive traffic light scheduling algorithm. After a predefined period of time, it calculates a cycle length and splits the cycle length between the phases based on the volume of traffic since the last calculation. The calculation uses the time lost for every phase change, including the acceleration of any stationary vehicles, the number of cars travelling through the intersection, and the saturation flow rate. The saturation flow rate is defined as the highest amount of vehicular flow possible. It can be proved [5] that the Webster method minimises the travel time of vehicles passing through an intersection when the traffic distribution is uniform.
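The text above does not print the Webster formulas, so the following is only a minimal sketch of the classical textbook form: an optimal cycle length of (1.5 L + 5) / (1 - Y), with the effective green time split in proportion to each phase's flow ratio. The function name `webster_timing`, its parameters, and the proportional green split are assumptions made for illustration; the clamping bounds of 38 s and 250 s are the cycle limits reported in Section 4.

```python
def webster_timing(flow_ratios, lost_time, c_min=38.0, c_max=250.0):
    """Return (cycle_length, green_times) using Webster's classical formula.

    flow_ratios: critical flow ratio per phase (demand / saturation flow).
    lost_time:   total lost time per cycle in seconds.
    c_min/c_max: cycle length bounds (assumed defaults taken from Section 4).
    """
    total = sum(flow_ratios)
    if not 0 < total < 1:
        raise ValueError("sum of flow ratios must lie strictly between 0 and 1")
    # Webster's optimal cycle length, clamped to the allowed range.
    cycle = (1.5 * lost_time + 5.0) / (1.0 - total)
    cycle = min(max(cycle, c_min), c_max)
    # Split the effective green time in proportion to each phase's flow ratio.
    effective_green = cycle - lost_time
    greens = [effective_green * y / total for y in flow_ratios]
    return cycle, greens
```

For example, two phases with flow ratios 0.3 and 0.2 and 10 s of lost time give a 40 s cycle under this sketch, of which 30 s is shared between the phases.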

The max pressure traffic light scheduling algorithm [11] is a delay-related algorithm designed to reduce the risk of over-saturation at a traffic intersection by minimising the “pressure” of an intersection. The “pressure” of a phase is defined as the difference between the number of vehicles in the incoming lanes and the number of vehicles in the corresponding outgoing lanes. It can be proved [11] that max pressure maximises the throughput of a road network when all intersections use this algorithm.
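The pressure definition above can be sketched in a few lines. The snippet below is an illustrative reading, not the implementation evaluated in this paper: the function names, the dictionary layout, and the rule of selecting the phase with the highest pressure are assumptions.

```python
def phase_pressure(incoming_counts, outgoing_counts):
    """Pressure of a phase: vehicles in incoming lanes minus vehicles in outgoing lanes."""
    return sum(incoming_counts) - sum(outgoing_counts)


def max_pressure_phase(phases):
    """Return the name of the phase with the highest pressure.

    phases maps a phase name to a pair (incoming_counts, outgoing_counts).
    """
    return max(phases, key=lambda name: phase_pressure(*phases[name]))


# Hypothetical lane counts for a two-phase intersection.
example = {
    "north_south": ([8, 6], [2, 1]),  # pressure 11
    "east_west": ([4, 3], [5, 0]),    # pressure 2
}
selected = max_pressure_phase(example)
```

Because the pressure subtracts downstream occupancy, a phase whose outgoing lanes are already full is deprioritised even if many vehicles are queued for it.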

Wei et al. [5] presented an overview of classical traffic scheduling algorithms that have been successfully deployed in real-world traffic. The overview covers the deterministic timing-based Webster algorithm, the offset-based GreenWave and Maxband algorithms, the rule-based actuated control, self-organising traffic lights (SOTL), and Sydney coordinated adaptive traffic system (SCATS) algorithms, and the dynamic max pressure algorithm.

Yau et al. [8] presented six algorithms utilising reinforcement learning of some form. Reinforcement learning [12] is a machine learning paradigm where a model, referred to as an agent, learns which actions are best for accomplishing a specific task in a given environment through a trial-and-error-based learning process. A more in-depth discussion of reinforcement learning is beyond the scope of this paper, but further detail can be found in [13]. Genders and Razavi [14] implemented traffic scheduling algorithms that utilise the deep Q-network (DQN) [15] and deep deterministic policy gradient (DDPG) [16] reinforcement learning algorithms, both of which utilise neural networks.

Actuated control-based traffic scheduling algorithms are a broad range of real-time algorithms that utilise sensors and complex sets of rules. Most modern traffic scheduling algorithms in urban areas, such as SCATS and SCOOT [17], are considered actuated control. A more comprehensive review of modern actuated control-based traffic scheduling is beyond the scope of this paper, but more information on these approaches can be found in Eom et al. [18].

Simulation of urban mobility (SUMO) [19] is an established open-source program for traffic simulation. Useful features of this simulator include being able to apply custom traffic light scheduling algorithms to traffic lights and importing real-world road networks from OpenStreetMap (https://www.openstreetmap.org (accessed on 1 July 2023)). Genders and Razavi [14] developed a Python framework for the development and evaluation of adaptive traffic signal control models.

García-Nieto et al. [20] developed a traffic scheduling algorithm that implements particle swarm optimisation.

2.2. Division of Labour in Insect Colonies

The ecological success of many social insects has been attributed to the self-organised, self-adaptive, and distributed division of labour among the individuals of these social systems [21]. Division of labour occurs in a social biological system when all of the individuals of that system (e.g., a colony) are co-adapted through divergent task specialisation, such that there is a fitness gain as a consequence of such specialisation [22]. Various tasks occur within insect colonies, including reproduction, brood care, foraging, waste disposal, defence, cemetery organisation, and nest construction, amongst others. Task allocation and coordination occur without any central control; they are the result of self-organisation and self-adaptation. Individuals respond to simple local cues, for example, the pattern of interactions with other individuals or pheromone deposits left by other individuals [23]. The task allocation is dynamic, based on external or internal stimuli. Despite task specialisation, task switching does occur, so that the ratios of workers performing different tasks vary over time. Variations in these ratios are caused by internal perturbations or external environmental conditions, e.g., changes in climate, food availability, and predation.

Eugène N. Marais was the first to report observations of such task switching [24]. The maintenance of underground fungus gardens is very important to the survival of certain termite species. During a drought, termites are forced to transport water over very large distances to feed the gardens. During such times, all worker termites, including the soldiers, have been observed to perform the task of water transportation. Not even intentional harm to the nest structure causes the termites to deviate from the water transportation task. In less disastrous conditions, soldiers will immediately appear on the periphery of any “wounds” in the nest structure, while workers repair the structure.

It is this self-organised, dynamic, and distributed division of labour in nature that lends itself to multi-task decision-making problems that can be solved by multiple artificial agents.

Bonabeau et al. [10] developed a single task allocation model that simulates the division-of-labour behaviour among insects in colonies. The model works by assigning every ant a response threshold and every task a stimulus level based on the intensity of the stimuli of the task. The response threshold of an ant is the likelihood of the ant to react to the stimuli associated with a task. An ant will perform a task with high probability if the stimulus of the task exceeds the threshold of the ant. The stimulus level of a task increases at a fixed rate the longer it is not satisfied. The stimulus level of a task decreases when an ant performs that task.

The model lets an ant k perform a task j with a probability indicated by

$$P_{kj}(X = 0 \to X = 1) = \frac{s_{j}^{2}}{s_{j}^{2} + θ_{kj}^{2}} \qquad (1)$$

where X is the state of the ant ($X = 0$ indicates inactivity with respect to task j, and $X = 1$ indicates active performance on task j). The variable $s_{j}$ indicates the stimulus value of task j, and $θ_{kj}$ indicates the response threshold of ant k towards the task j.
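As a small illustration of Equation (1), the sketch below evaluates the response probability for a given stimulus and threshold. The function name and the convention of returning 0 when both values are zero are assumptions made here to avoid a division by zero.

```python
def response_probability(stimulus, threshold):
    """Probability that an inactive ant starts a task, following Equation (1)."""
    s2 = stimulus ** 2
    t2 = threshold ** 2
    if s2 + t2 == 0:
        # Assumed convention: no stimulus and no threshold means no response.
        return 0.0
    return s2 / (s2 + t2)
```

When the stimulus equals the threshold the probability is exactly one half, and it approaches one as the stimulus grows well beyond the threshold.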

An active ant will become inactive with respect to task j with probability p, i.e.,

$$P_{kj}(X = 1 \to X = 0) = p \qquad (2)$$

Therefore, on average, an ant will spend $1 / p$ time units working on the task.
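The average of 1 / p follows from the geometric distribution. The short derivation below is a sketch that assumes the stopping decision is made independently in every time unit; the symbol T for the number of time units spent on the task is introduced here for illustration only.

```latex
% T = number of time units an ant stays active before becoming inactive.
% In each time unit the ant stops with probability p, independently of the past.
\begin{align}
P(T = t) &= (1 - p)^{t - 1} p, \qquad t = 1, 2, \dots \\
E[T] &= \sum_{t = 1}^{\infty} t (1 - p)^{t - 1} p
      = \frac{p}{\bigl(1 - (1 - p)\bigr)^{2}}
      = \frac{1}{p}
\end{align}
```

For example, with p = 0.2 an ant works on the task for five time units on average.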

The stimulus intensity of task j is calculated as

$$s_{j}(t + 1) = s_{j}(t) + δ - α n_{\mathrm{act}} \qquad (3)$$

where $δ$ is the increase in stimulus intensity over time, $α$ is the work efficiency of the ants, and $n_{\mathrm{act}}$ is the number of active ants.
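Equations (1) to (3) together define a simple stochastic simulation for a single task. The following is a minimal sketch under stated assumptions: the function name, the order of updates within a time step, the seeded random generator, and the clamping of the stimulus at zero are all choices made here for illustration and are not specified in the text.

```python
import random


def simulate_colony(thresholds, delta, alpha, p, steps, s0=0.0, seed=0):
    """Simulate the single-task model; returns a list of (stimulus, n_active)."""
    rng = random.Random(seed)
    active = [False] * len(thresholds)
    s = s0
    history = []
    for _ in range(steps):
        for k, theta in enumerate(thresholds):
            if active[k]:
                # Equation (2): an active ant stops with probability p.
                if rng.random() < p:
                    active[k] = False
            else:
                # Equation (1): an inactive ant starts with the threshold response.
                denom = s * s + theta * theta
                prob = (s * s) / denom if denom > 0 else 0.0
                if rng.random() < prob:
                    active[k] = True
        n_act = sum(active)
        # Equation (3), clamped at zero (assumption: stimulus is non-negative).
        s = max(0.0, s + delta - alpha * n_act)
        history.append((s, n_act))
    return history
```

Running the sketch with a larger work efficiency keeps the stimulus low, whereas setting it to zero lets the stimulus grow by the fixed increment every step.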

Campos et al. [25] applied a method for assigning resources based on this variant of the single task-allocation model to a dynamic flow shop scheduling problem. Klazar [26] developed an algorithm inspired by the division of labour in ant colonies for the dynamic allocation of tasks in distributed computing.

3. Division-of-Labour Model for Traffic Light Scheduling

The developed division-of-labour-based traffic scheduling algorithm is designed for use in a complex intersection with multiple traffic phases. The single task-allocation model is used where the next traffic phase in the cycle is represented as the task.

The probability of the traffic phase changing every second is calculated as

$$P = \begin{cases} \dfrac{(\bar{s} \times s_{f})^{ω}}{(\bar{s} \times s_{f})^{ω} + \left(\frac{θ}{\bar{s} + θ} \times L + θ \times s_{f}\right)^{ω}} & \text{if } s + θ \neq 0 \\ 0 & \text{if } s + θ = 0 \end{cases} \qquad (4)$$

where $\bar{s} = \frac{s}{n - 1}$ and $ω = 2 + \frac{s + θ}{c}$. The number of traffic phases in the cycle is represented as n. The number of stationary vehicles waiting for the traffic phase to change is represented as s, and $θ$ is the number of vehicles that will become stationary if the traffic phase were to change. The saturation flow rate is represented as $s_{f}$, and L is the time lost when the traffic phase changes, which includes the yellow light duration in seconds, the number of seconds where all traffic lights are red, and the time lost while the vehicles accelerate. The variable c is a parameter that sets how sensitive the steepness threshold is to the total traffic volume around the intersection.

The variables $\bar{s}$ and $θ$ are scaled with the saturation flow rate $s_{f}$ to allow these variables to be compared to the time lost when the traffic phase changes, L. The variable $\bar{s}$ represents the average number of vehicles per phase not currently allocated the green light. The stimulus value, $s_{j}$, in Equation (1) is thereby set as $\bar{s} \times s_{f}$, the time needed to clear the average traffic volume in a phase not currently allocated the green light.

The variable L is scaled with $\frac{θ}{\bar{s} + θ}$ so that the effect of L is proportional to the share of vehicles shown the green light relative to the sum of that number and the average number of vehicles per phase not shown the green light. The threshold value, $θ_{kj}$, in Equation (1) is thereby set as $\frac{θ}{\bar{s} + θ} \times L + θ \times s_{f}$, a representation of the time lost if the traffic phase is changed.

A dynamic steepness threshold, $ω$, is used to prevent the traffic phase from constantly changing in high-intensity traffic scenarios and to allow for some flexibility in lower-intensity traffic scenarios. It was found that a linear relationship exists between the total amount of traffic around the intersection and the best value for $ω$, so $ω$ is set to increase linearly with the total amount of traffic around the intersection, $s + θ$. Two is added to $\frac{s + θ}{c}$ so that $ω$ starts from the default steepness threshold value of two as the total traffic volume increases from zero.
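The switching probability of Equation (4) can be sketched as a single function. The function and parameter names below are illustrative assumptions, and the sketch assumes at least two phases and a positive saturation flow rate so that no division by zero occurs.

```python
def switch_probability(s, theta, n, s_f, lost_time, c):
    """Probability of changing phase in the current second, following Equation (4).

    s:         stationary vehicles waiting for the phase to change
    theta:     vehicles that would become stationary if the phase changed
    n:         number of phases in the cycle (assumed to be at least 2)
    s_f:       saturation flow rate (assumed positive)
    lost_time: time lost per phase change, L
    c:         sensitivity parameter of the steepness threshold
    """
    if n < 2:
        raise ValueError("at least two phases are required")
    if s + theta == 0:
        return 0.0
    s_bar = s / (n - 1)                # average waiting vehicles per red phase
    omega = 2 + (s + theta) / c        # dynamic steepness threshold
    stimulus = s_bar * s_f
    threshold = theta / (s_bar + theta) * lost_time + theta * s_f
    numerator = stimulus ** omega
    return numerator / (numerator + threshold ** omega)
```

With a hypothetical lost time of 7 s, a saturation flow rate of 0.38, four phases, and c = 35, the sketch returns a low probability for 12 waiting vehicles against 3 moving ones and a probability above one half once 30 vehicles are waiting.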

The use of a probability in every time unit to determine whether the controller should continue to the next phase in the cycle is what distinguishes this approach from existing approaches. The developed division-of-labour-based traffic scheduling algorithm is formally described in Algorithm 1. At every time step, the current phase changes with probability P, subject to a minimum green light duration, $t_{\min}$, and a maximum green light duration, $t_{\max}$.

Algorithm 1 Division-of-Labour-Based Traffic Scheduling Algorithm

input: current phase duration t, minimum phase duration $t_{\min}$, maximum phase duration $t_{\max}$
for every time step do
    $t = t + 1$
    if $t \geq t_{\min}$ then
        r = random number between 0 and 1
        For each traffic phase, calculate P as in Equation (4)
        if $P > r$ or $t \geq t_{\max}$ then
            Set current phase to the next phase in the cycle
            $t = 0$
        end if
    end if
end for
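The control loop of Algorithm 1 can be sketched as a small class that is stepped once per second. This is an illustrative reading, not the authors' implementation: the class name, the `probability_fn` callback (a stand-in for Equation (4), evaluated here for the current phase only), and the optional random generator are assumptions.

```python
import random


class DivisionOfLabourController:
    """Steps through a phase cycle using a per-second switching probability."""

    def __init__(self, n_phases, t_min, t_max, probability_fn, rng=None):
        self.n_phases = n_phases
        self.t_min = t_min
        self.t_max = t_max
        # probability_fn(phase) -> P in [0, 1]; hypothetical hook for Equation (4).
        self.probability_fn = probability_fn
        self.rng = rng or random.Random()
        self.phase = 0
        self.t = 0  # seconds spent in the current phase

    def step(self):
        """Advance one time step and return the index of the active phase."""
        self.t += 1
        if self.t >= self.t_min:
            r = self.rng.random()
            p = self.probability_fn(self.phase)
            if p > r or self.t >= self.t_max:
                self.phase = (self.phase + 1) % self.n_phases
                self.t = 0
        return self.phase
```

Passing the probability as a callback keeps the timing logic separate from how vehicle counts are measured, so the same loop can be driven by a simulator or by recorded data.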

It should be noted that the computational cost of the division-of-labour algorithm is not significant. Most of the cost relates to the calculation of Equation (4), which is a simple calculation for the probability of a traffic phase change for each traffic phase. This results in an $O(n)$ calculation per time step. Sampling of the resulting probability distribution is also computationally efficient, as illustrated in Algorithm 1, requiring sampling of a random value for each traffic phase. Again, this results in $O(n)$ computational cost, with the overall cost per time step also being $O(n)$.

4. Empirical Procedure

The developed division-of-labour-based traffic scheduling algorithm was tested against the static, Webster, and max pressure algorithms, as well as the DQN and DDPG-based reinforcement learning algorithms. These algorithms were selected for their applicability to isolated intersections. The static and Webster algorithms were chosen for their ability to handle constant streams of traffic, and the max pressure algorithm was chosen for its ability to handle changing streams of traffic. Furthermore, these three algorithms were also chosen for being established from their frequent use in practice. The DQN and DDPG-based algorithms were chosen for their potential performance gains from the usage of reinforcement learning and neural networks.

Each simulation covered six hours of traffic on one of two intersection layouts. The first intersection layout consisted of a three-by-three grid of identical complex four-way intersections, as visualised in Figure 1. An individual intersection in this layout is shown in Figure 2. The first phase of the cycle of these intersections consisted of north-to-east and south-to-west moving traffic, and the second phase of the cycle consisted of north-to-south and south-to-north moving traffic. The third phase of the cycle consisted of west-to-north and east-to-south moving traffic, and the fourth phase of the cycle consisted of east-to-west and west-to-east moving traffic. Every road connected to the intersection consisted of three separate lanes. For this intersection layout, the algorithms were evaluated on five scenarios with different traffic characteristics:

Scenario One: Low traffic was constant in all directions, with an average of three vehicles entering the simulation every five seconds. This scenario tested the efficiency of an algorithm when controlling low amounts of traffic.
Scenario Two: High traffic was constant in all directions, with an average of three vehicles entering the simulation every second. This scenario tested the ability of an algorithm to handle high amounts of traffic.
Scenario Three: Traffic intensity in all directions was scaled by $\sin x$, where $x = \frac{t π}{21{,}600}$ and t was the elapsed time since the simulation started in seconds. This scenario tested the ability of an algorithm to handle varying levels of traffic that model a “rush hour”.
Scenario Four: Every incoming lane had a traffic intensity scaled by $\sin x$, where $x = \frac{k t π}{21{,}600}$, k is unique to every incoming lane, and $k \in \{7, 8, 9, 10, 11, 12\}$. This scenario tested the ability of an algorithm to handle fluctuating traffic where traffic intensity varies across the grid.
Scenario Five: Traffic was scaled as in Scenario Four, but the roads between traffic lights were half the length. This scenario tested the ability of an algorithm to handle traffic in a smaller network.
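The sinusoidal scaling used in Scenarios Three to Five can be written as a one-line function. The sketch below only evaluates the scaling factor; the function name is an assumption, and how the simulation treats negative values of the sine for k greater than one is not stated in the text, so the raw value is returned unchanged.

```python
import math

SIM_SECONDS = 21_600  # six hours of simulated traffic


def intensity_scale(t, k=1):
    """Scaling factor sin(k * pi * t / 21600) at elapsed time t in seconds.

    k = 1 corresponds to Scenario Three; k in {7, ..., 12} to Scenario Four.
    """
    return math.sin(k * math.pi * t / SIM_SECONDS)
```

With k = 1 the factor rises from zero at the start to one at the three-hour mark and returns to zero at six hours, which gives the single "rush hour" peak.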

The second intersection layout was an irregular layout modelled after a section of the road network in the town of Stellenbosch (South Africa), as visualised in Figure 3. Unlike the first intersection layout, the intersections are not identical and have cycles with traffic phases determined by the number of lanes entering either side of the intersection. An example of one of these intersections is visualised in Figure 4. Vehicles were allowed to enter and leave the simulation from the main roads and a cul-de-sac in the centre of the intersection layout. For this intersection, the algorithms were evaluated on three scenarios with different traffic characteristics as follows:

Scenario Six: Low traffic was constant from all areas that vehicles could enter from. This evaluated the efficiency of an algorithm when controlling low amounts of traffic in an irregular intersection layout.
Scenario Seven: High traffic was constant from all areas that vehicles could enter from. This evaluated the ability of an algorithm to handle high amounts of traffic in an irregular intersection layout.
Scenario Eight: Traffic intensity was scaled by $\sin x$, where $x = \frac{t π}{21{,}600}$ and t was the elapsed time since the simulation started in seconds. This scenario evaluated the ability of an algorithm to handle varying levels of traffic that model a “rush hour” in an irregular intersection layout.

A saturation flow rate of 0.38 was assumed for all simulations. The yellow light duration was set to two seconds, and the duration for which all traffic lights are set to red when changing phases was set to three seconds. The Webster, max pressure, and developed division-of-labour algorithms use a minimum green light duration of seven seconds to prevent extremely short green light durations. A maximum green light duration of 60 s is enforced on the Webster and division-of-labour algorithms to prevent situations where traffic phases with low traffic demand are never given green time.

With respect to the Webster algorithm, the cycle duration has a minimum of 38 s and a maximum of 250 s, and timing is recomputed every 450 s. The static uniform algorithm has green light durations that depend on the scenario to which it is applied. The durations for a scenario are calculated by applying the Webster algorithm to the point in the scenario with the highest traffic volume. Each algorithm was applied to every scenario 30 times, and the average vehicle delay and travel time caused by the algorithm were recorded over the 30 independent runs.

A value for the c parameter of the division-of-labour-based traffic scheduling algorithm, which sets how sensitive the steepness is to the total traffic volume around the intersection, was found through a grid search before any algorithms were applied to simulations. It was found that values for c smaller than one lead to poor performance, and values much larger than 50 lead to poor performance as well. For those reasons, the average vehicle delay was taken across 30 independent runs after applying the division-of-labour algorithm to Scenario Four, with values of c between and including 1 and 50. Scenario Four was chosen for the variety of traffic intensities that arise.
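The grid search over c described above amounts to averaging the delay over repeated runs for each candidate value and keeping the value with the lowest average. The sketch below assumes a hypothetical callback `evaluate_delay(c, seed)` that runs one simulation and returns the average vehicle delay; neither the callback nor the function name comes from the paper.

```python
from statistics import mean


def grid_search_c(evaluate_delay, candidates, runs=30):
    """Return (best_c, scores), where scores maps each c to its mean delay.

    evaluate_delay(c, seed) is a hypothetical stand-in for one simulation run.
    """
    scores = {
        c: mean(evaluate_delay(c, seed) for seed in range(runs))
        for c in candidates
    }
    best_c = min(scores, key=scores.get)
    return best_c, scores


# Toy objective used only to show the call pattern; it is not simulation data.
best, table = grid_search_c(lambda c, seed: (c - 35) ** 2, range(1, 51))
```

Using the run index as the seed keeps the 30 runs independent but reproducible for every candidate value.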

5. Results and Discussion

Table 1 shows the results of the parameter optimisation process followed for finding a value for c for the division-of-labour algorithm. Performance degrades as the value for c increases from 1 to 5, after which performance improves as the value for c increases from 5 to the best value, 35. Performance degrades again as the value for c increases from 35 to 50. The value for c has no significant impact on the average vehicle travel time.

Figures 5, 7, 9, 11, 13 and 15–17 show smoothed graphs of the trimmed average delay, along with their standard deviations, after the static, Webster, max pressure, DDPG, DQN, and division-of-labour-based traffic scheduling algorithms were applied to the scenarios. In these figures, the solid lines indicate the average delay and the colour ranges indicate the standard deviations. Figures 6, 8, 10, 12 and 14 show box plots of the average vehicle travel time for scenarios one to five. Travel time results were excluded for scenarios six, seven, and eight due to almost identical performances by the different traffic controllers. This is a result of the larger grid reducing the effect of the controllers on average travel time.

Figure 5 shows that, in terms of delay, the max pressure algorithm performed best in the scenario with a constant low-traffic intensity in the first intersection layout. The division-of-labour and static algorithms both performed second best, and the Webster, DQN, and DDPG algorithms performed fourth, fifth, and sixth best, respectively. Max pressure was best able to account for variations in the traffic which become more significant when there is sparse traffic. The reinforcement learning algorithms were the least capable of accounting for these variations.

Figure 6 shows that, in terms of travel time, the max pressure algorithm performed the best in scenario one because of the dynamic nature of the algorithm. The static, division-of-labour, and Webster algorithms performed second, third, and fourth best, respectively. The DDPG and DQN reinforcement learning algorithms performed worst, as both were trained to minimise delay instead of travel time.

Figure 7 shows that, in terms of delay, the max pressure algorithm performed best in the scenario with a constant high-traffic intensity in the first intersection layout. The division-of-labour and static algorithms performed second best, and the Webster algorithm performed fourth best. The DDPG and DQN algorithms were both unable to generalise to the high traffic intensity after training. Figure 8 shows that, in terms of travel time, the algorithms performed similarly, with the max pressure algorithm performing slightly better than the other algorithms.

Figure 9 shows that, in terms of delay, the max pressure algorithm performed best in the traffic scenario simulating a “rush hour” in the first intersection layout. The DDPG algorithm was able to generalise well after training and performed second best, while the division-of-labour algorithm performed third best. The high levels of overlap suggest an insignificant difference in performance between DDPG and the division-of-labour algorithm. The Webster and static algorithms performed the fourth best, with the Webster algorithm performing slightly better as the traffic intensity increased. The DQN algorithm was again unable to generalise after training.

Figure 10 shows that, in terms of travel time, the max pressure and division-of-labour algorithms performed the best in scenario three because of their dynamic nature. The Webster and static algorithms performed third and fourth best, respectively, whereas the DDPG algorithm performed worst due to being trained to minimise delay instead of travel time.

Figure 11 shows that, in terms of delay, the max pressure algorithm performed best in the traffic scenario where every incoming lane was allocated a different varying traffic intensity because of the dynamic nature of the algorithm. The division-of-labour, DDPG, and Webster algorithms performed second best, with the Webster algorithm performing slightly worse at the beginning and end of the simulation. The static algorithm performed worst because of the inability of the algorithm to adapt to changing traffic distributions, and the DQN algorithm was again unable to generalise after training.

Figure 12 shows that, in terms of travel time, the max pressure algorithm performed best because of the dynamic nature of the algorithm. The division-of-labour, DDPG, and Webster algorithms performed second best, and the static algorithm performed fifth best. This is the only scenario where DDPG performed reasonably well in terms of travel time.

Figure 13 and Figure 14 show that, in terms of both delay and travel time, the division-of-labour algorithm performed the best in the scenario simulating a smaller road network, with slightly better performance than the static and Webster algorithms. Both the DQN and DDPG algorithms were unable to generalise after training. The max pressure algorithm was unable to effectively handle the traffic in this scenario. This can be attributed to the fact that the smaller road lengths result in the outgoing lanes of traffic lights also being incoming lanes of adjacent traffic lights, and vice versa. Less effective pressure calculations are, therefore, made when queues form in incoming lanes.

Figure 15 shows that the max pressure and DDPG algorithms performed best in the scenario with a constant low-traffic intensity in the second intersection layout. The division-of-labour algorithm performed third best, and the static and Webster algorithms performed fourth best. The max pressure, DDPG, and division-of-labour algorithms performed best because of their dynamic nature. It allows these algorithms to account for variations in the traffic distribution, which become more significant when traffic is sparse, and to be applied to intersections with different shapes, as found in the second intersection layout. Despite the generally lower intensity of traffic, the DQN algorithm was again unable to generalise because of the variations in traffic distribution caused by the irregular intersection layout.

Figure 16 shows that the max pressure and division-of-labour algorithms performed first and second best, respectively. The static and Webster algorithms performed third best, with the static algorithm performing slightly better towards the end. The DDPG and DQN algorithms were both unable to generalise after training. Given the performances of the static and Webster algorithms relative to the division-of-labour algorithm in the first intersection for the constant high-traffic intensity scenario, it was expected that these algorithms would perform better.

Figure 17 shows the performances of the algorithms applied to the traffic scenario simulating a “rush hour” in the second intersection layout. The max pressure and division-of-labour algorithms performed first and second best, respectively. The static and Webster algorithms performed third best. The DDPG and DQN algorithms were both unable to generalise after training.

6. Conclusions

A traffic scheduling algorithm based on the division of labour in insect colonies was developed and applied to eight different traffic scenarios, along with the static, Webster, max pressure, and two reinforcement learning-based traffic scheduling algorithms. Of the algorithms that could handle all scenarios, the division-of-labour algorithm performed best. The max pressure algorithm performed best in the scenarios to which it could be applied but is not suitable for smaller road networks. Furthermore, max pressure has the weakness of not accounting for the waiting time of vehicles, and scenarios can arise where the max pressure algorithm causes some vehicles to wait for long periods of time. The DQN algorithm was unable to generalise after training. The DDPG algorithm generally performed well but also did not generalise to scenarios with high traffic intensities.

Many other traffic light scheduling algorithms exist that utilise reinforcement learning [8], and future work can compare some of these algorithms to the developed division-of-labour algorithm. Future work can also compare the performance of the developed division-of-labour algorithm to algorithms that optimise the offsets between fixed timing algorithms in a network of intersections, like the GreenWave and Maxband algorithms [5].

Traffic light scheduling is ultimately a dynamic multi-objective optimisation problem. This paper reported results for two objectives, namely minimisation of delay and minimisation of travel time, each considered individually. Future work will extend the division-of-labour model to a multi-objective division-of-labour algorithm.

Author Contributions

Conceptualisation, A.E.; methodology, H.R. and A.E.; software, H.R.; validation, H.R.; formal analysis, H.R.; investigation, H.R. and A.E.; resources, H.R.; data curation, H.R.; writing—original draft preparation, H.R.; writing—review and editing, A.E.; visualisation, H.R.; supervision, A.E.; project administration, H.R. and A.E. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated during and/or analysed during the current study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

DDPG	Deep deterministic policy gradients
DQN	Deep Q-network
SUMO	Simulation of urban mobility

References

1. Harriet, T.; Poku, K.; Emmanuel, A.K. An assessment of traffic congestion and its effect on productivity in urban Ghana. Int. J. Bus. Soc. Sci. 2013, 4, 10–20.
2. The U.S. Environmental Protection Agency. Global Greenhouse Gas Emissions Data. 2023. Available online: https://www.epa.gov/ghgemissions/global-greenhouse-gas-overview (accessed on 1 July 2023).
3. Qadri, S.; Gökce, M.; Öner, E. State-of-art review of traffic signal control methods: Challenges and opportunities. Eur. Transp. Res. Rev. 2020, 12, 55.
4. Tomar, I.; Sreedevi, I.; Pandey, N. State-of-Art Review of Traffic Light Synchronization for Intelligent Vehicles: Current Status, Challenges, and Emerging Trends. Electronics 2022, 11, 465.
5. Wei, H.; Zheng, G.; Gayah, V.; Li, Z. A Survey on Traffic Signal Control Methods. arXiv 2019, arXiv:1904.08117.
6. Arena, F.; Pau, G.; Ralescu, A.; Severiono, A.; You, I. An Innovative Framework for Dynamic Traffic Lights Management Based on the Combined Use of Fuzzy Logic and Several Network Architectures. J. Adv. Transp. 2022, 2022, 1383349.
7. Sun, G.; Qi, R.; Liu, Y.; Xo, F. A dynamic traffic signal scheduling system based on improved greedy algorithm. PLoS ONE 2024, 19, e0298417.
8. Yau, K.L.A.; Qadir, J.; Khoo, H.L.; Ling, M.H.; Komisarczuk, P. A Survey on Reinforcement Learning Models and Algorithms for Traffic Signal Control. ACM Comput. Surv. 2017, 50, 1–38.
9. Younes, M.B.; Boukerche, A.F.M. An efficient dynamic traffic light scheduling algorithm considering emergency vehicles for intelligent transportation systems. Wirel. Netw. 2017, 24, 2451–2463.
10. Bonabeau, E.; Theraulaz, G.; Deneubourg, J.L. Quantitative Study of the Fixed Response Threshold Model for the Regulation of Division of Labour in Insect Societies. Proc. R. Soc. B Biol. Sci. 1996, 263, 1565–1569.
11. Varaiya, P. The max-pressure controller for arbitrary networks of signalized intersections. In Advances in Dynamic Network Modeling in Complex Transportation Systems; Springer: Berlin/Heidelberg, Germany, 2013; pp. 27–66.
12. Sutton, R.S. Temporal Credit Assignment in Reinforcement Learning; University of Massachusetts Amherst: Amherst, MA, USA, 1984.
13. Sutton, R.; Barto, A. Reinforcement Learning: An Introduction, 2nd ed.; MIT Press: Cambridge, MA, USA, 2014.
14. Genders, W.; Razavi, S. An Open-Source Framework for Adaptive Traffic Signal Control. arXiv 2019, arXiv:1909.00395.
15. Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G.; et al. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533.
16. Lillicrap, T.; Hunt, J.; Pritzel, A.; Heess, N.; Erez, T.; Tassa, Y.; Silver, D.; Wierstra, D. Continuous control with deep reinforcement learning. arXiv 2015, arXiv:1509.02971.
17. Stevanovic, A.; Kergaye, C.; Martin, P.T. Scoot and scats: A closer look into their operations. In Proceedings of the 88th Annual Meeting of the Transportation Research Board, Washington, DC, USA, 11–15 January 2009.
18. Eom, M.; Kim, B.I. The traffic signal control problem for intersections: A review. Eur. Transp. Res. Rev. 2020, 12, 1–20.
19. Lopez, P.A.; Behrisch, M.; Bieker-Walz, L.; Erdmann, J.; Flötteröd, Y.P.; Hilbrich, R.; Lücken, L.; Rummel, J.; Wagner, P.; Wießner, E. Microscopic Traffic Simulation using SUMO. In Proceedings of the 21st IEEE International Conference on Intelligent Transportation Systems, Maui, HI, USA, 4–7 November 2018; IEEE: Piscataway, NJ, USA, 2018.
20. García-Nieto, J.; Olivera, A.C.; Alba, E. Optimal Cycle Program of Traffic Lights With Particle Swarm Optimization. IEEE Trans. Evol. Comput. 2013, 17, 823–839.
21. Naug, D.; Gadagkar, R. Flexible Division of Labor Mediated by Social Interactions in an Insect Colony—A Simulation Model. J. Theor. Biol. 1999, 197, 123–133.
22. Sendova-Franks, A.; Franks, N. Self-Assembly, Self-Organization and Division of Labour. Philos. Trans. R. Soc. Lond. 1999, 354, 1395–1405.
23. Gordon, D.; Mehdiabadi, N. Encounter Rate and Task Allocation in Harvester Ants. Behav. Ecol. Sociobiol. 1999, 45, 370–377.
24. Marais, E. Die Siel van die Mier (The Soul of the Ant), 5th ed.; J.L. van Schaik: Pretoria, South Africa, 1948.
25. Campos, M.; Bonabeau, E.; Theraulaz, G.; Deneubourg, J.L. Dynamic scheduling and division of labor in social insects. Adapt. Behav. 2000, 8, 83–95.
26. Klazar, R. Ant-Inspired Strategies for Opportunistic Load Balancing in the Distributed Computation of Solutions to Embarrassingly Parallel Problems. Master’s Thesis, University of Pretoria, Pretoria, South Africa, 2016.

Figure 1. A visualisation of the first intersection layout. Red dots represent traffic lights.

Figure 2. A complex four-way intersection in the first intersection layout; green indicates a green traffic light for traffic that may go, and red indicates a red light for traffic that has to stop.

Figure 3. A visualisation of the second intersection layout. Red dots represent traffic lights, the white dot represents a roundabout, and the blue dot represents the internal cul-de-sac through which vehicles could enter and leave the layout, in addition to the outer edges.

Figure 4. An example of an irregular four-way intersection in the second intersection layout; green indicates a green traffic light for traffic that may go, and red indicates a red light for traffic that has to stop.

Figure 5. Average vehicle delay during the constant low-traffic simulation (scenario one).

Figure 6. Average travel time during the constant low-traffic simulation (scenario one).

Figure 7. Average vehicle delay during the constant high-traffic simulation (scenario two).

Figure 8. Average travel time during the constant high-traffic simulation (scenario two).

Figure 9. Average vehicle delay during the “rush hour” simulation (scenario three).

Figure 10. Average travel time during the “rush hour” simulation (scenario three).

Figure 11. Average vehicle delay during the fluctuating traffic simulation (scenario four).

Figure 12. Average travel time during the fluctuating traffic simulation (scenario four).

Figure 13. Average vehicle delay during simulation in the smaller network (scenario five).

Figure 14. Average travel time during simulation in the smaller network (scenario five).

Figure 15. Average vehicle delay during the constant low-traffic simulation in Stellenbosch (scenario six).

Figure 16. Average vehicle delay during the constant high-traffic simulation in Stellenbosch (scenario seven).

Figure 17. Average vehicle delay during the “rush hour” simulation in Stellenbosch (scenario eight).

Table 1. Results of the grid search over the parameter c of the division-of-labour traffic scheduling algorithm.

c	Average Vehicle Delay
1	816.1956
5	831.2264
10	829.6111
15	818.5456
20	810.5541
25	802.7875
30	795.0609
35	782.9047
40	789.7331
45	811.7620
50	829.0596
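The selection step of the grid search can be sketched in a few lines. The values below are copied from Table 1; the variable names are illustrative assumptions, and the snippet only picks the value of c with the lowest average vehicle delay. It does not reproduce the simulations that generated the delays.

```python
# Average vehicle delay for each candidate value of c, as listed in Table 1.
grid_results = {
    1: 816.1956,
    5: 831.2264,
    10: 829.6111,
    15: 818.5456,
    20: 810.5541,
    25: 802.7875,
    30: 795.0609,
    35: 782.9047,
    40: 789.7331,
    45: 811.7620,
    50: 829.0596,
}

# The best parameter is the one that minimises the average vehicle delay.
best_c = min(grid_results, key=grid_results.get)
print(best_c, grid_results[best_c])  # prints "35 782.9047"
```

The delay falls steadily from c = 10 to c = 35 and rises again beyond it, so within this grid c = 35 gives the lowest average vehicle delay.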

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
