1. Introduction
The Beckmann model for searching static traffic equilibria in road networks is among the most widely used models by transportation planners [
1,
2]. The equilibria found are practical for evaluating the network efficiency and distribution of business centers and residential areas, and establishing urban development plans, etc. This model introduces a cost function on every link of a transportation network, which defines a dependence of the travel cost on the flow along the link. In practice, the BPR functions are usually employed [
3]:
where
are free flow times, and
are road capacities of a given network’s link
e. We take these functions with parameters
and
.
Nesterov and de Palma [
4] proposed an alternative model called the stable dynamics model, which takes an intermediate place between static and dynamic network assignment models. Namely, its equilibrium can be interpreted as the stationary regime of some dynamic process. Its key assumption is that we no longer introduce a complex dependence of the travel cost on the flow (as in the standard static models) but only pose capacity constraints, i.e., the flow value on each link imposes the feasible set of travel times
Unlike in the Beckmann model, there is no one-to-one correspondence between equilibrium travel times and flows on the links of the network. We can illustrate the difference on a simple example of two parallel routes (
Figure 1).
Let the input flow take values 1000, 2000, and 3000 veh/h. For the stable dynamics model, in the first and second cases, all drivers choose the upper route; the equilibrium travel time simply equals the upper route’s free flow time (0.5 h) in the first case and varies from 0.5 to 1 h (according to the model) in the second. In the third case, the input flow exceeds the upper route’s capacity, so the upper route’s flow is 2000 veh/h, the lower one is 1000 veh/h, and the equilibrium travel time is 1 h. All these equilibria can be interpreted as stationary regimes of some dynamic processes, e.g., the last case can be viewed as the result of the queue at the beginning of the upper route (since this route’s capacity is smaller than the input flow) created by drivers who wanted to take this route until the waiting time plus the route’s travel time reached the lower route’s travel time [
4]. In the Beckmann model, equilibria are as follows: for all three cases, only the upper route is used, and the equilibrium travel times are approximately 0.5, 0.6, and 0.9 h, respectively. Chudak, Dos Santos Eleuterio, and Nesterov [
5] conducted a detailed comparison—for large and small networks—of equilibria in these two models.
In the Beckmann model, searching equilibria reduces to minimization of a potential function. One of the most popular and effective approaches to solve this problem numerically is the famous Frank–Wolfe method [
6,
7] as well as its numerous modifications [
8,
9,
10,
11].
In the case of the stable dynamics model, one cannot directly apply the Frank–Wolfe method. However, an equilibrium can be found as a solution of a pair of primal and dual optimization problems. The same holds also for the Beckmann model, so in both cases we can apply primal-dual (sub)gradient methods.
In this work, we compare several primal-dual gradient methods for searching equilibria in both the Beckmann and the stable dynamics models, namely, the universal gradient method (UGM) [
12], the universal method of similar triangles (UMST) [
13], and the method of weighted dual averages (WDA) [
14]. The main advantage of the above universal methods is an automatic adjustment to a local (Hölder) smoothness of a minimized function, which is especially important since the dual problems we are dealing with are essentially non-smooth. Due to the primal-dual nature of these methods, one can use an adaptive stopping criterion guaranteeing required accuracy.
The main contributions of this paper include the following:
We propose a novel way to reconstruct admissible flows (i.e., meeting the capacity constraints and induced by flows on the paths) in the stable dynamics model and a novel computable duality gap, which can be used in a stopping criterion.
We provide theoretical upper bounds on the complexity of searching equilibria by the considered algorithms: UMST, UGD, and WDA.
We conducted numerical experiments comparing these algorithms on the Anaheim transportation network—the source code is available for use and can be found in [
15].
The paper is organized as follows. In
Section 2, we give a problem statement and define equilibria in the Beckmann and the stable dynamics models and corresponding optimization problems.
Section 3 is devoted to the complexity analysis of UGM, UMST, and WDA. We show that the number of iterations required to obtain an
-solution of primal and dual problems is
for UGM and UMST. In
Section 4, results of experiments on the Anaheim transportation network are presented. Finally, some conclusions are drawn in
Section 5.
2. Problem Statement
Let the urban road network be represented by a directed graph
, where vertices
V correspond to intersections or centroids [
16] and edges
E correspond to roads, respectively. Suppose we are given the travel demands: namely, let
(veh/h) be a trip rate for an origin–destination pair
w from the set
. Here,
is the set of all possible origins of trips, and
is the set of destination nodes. For OD pair
denote by
the set of all simple paths from
i to
j. Respectively,
is the set of all possible routes for all OD pairs. Agents traveling from node
i to node
j are distributed among paths from
, i.e., for any
there is a flow
along the path
p, and
. Flows from vertices from the set
O to vertices from the set
D create the traffic in the entire network
G, which can be represented by an element of
Note that the dimension of
X can be extremely large: e.g., for
Manhattan network
. To describe a state of the network, we do not need to know an entire vector
x but only flows on arcs:
where
. Let us introduce a matrix
such that
for
,
, so in vector notation we have
. To describe an equilibrium we use both path- and link-based notations
or
.
One of the key ideas behind the Beckmann model is that the cost (e.g., travel time, gas expenses) of passing a link
e is the same for all agents and depends solely on the flow
along it. In what follows, we denote this cost for a given flow
by
. Another essential point is a behavioral assumption on agents called the first Wardrop’s principle: we suppose that each of them knows the state of the whole network and chooses a path
p minimizing the total cost
The cost functions are supposed to be continuous, non-decreasing, and non-negative. Then
, where
, is an equilibrium state, i.e., it satisfies conditions
if and only if
is a minimum of the potential function:
and
[
1].
Another way to find an equilibrium numerically is by solving a dual problem. According to Theorem 4 from [
4], we can construct it in the following way:
where
is the conjugate function of
,
. Finally, we obtain the dual problem, the solution of which is
:
When we search for the solution to this problem numerically, on every step of an applied method, we can reconstruct primal variable
f from the current dual variable
t:
(see
Section 3.1). Then we can use the duality gap, which is always nonnegative, for the estimation of the method’s accuracy:
It vanishes only at the equilibrium .
Stable dynamics model [
4].
An equilibrium state
of the stable dynamics model satisfies the next conditions:
where
is defined earlier by (
2). The above formula can be reformulated in terms of an optimization problem:
Here, we add underlined constant terms to show that the pair
is an equilibrium if and only if it is a solution of the saddle-point problem
where its primal problem is
and its dual problem is
In contrast with the Beckmann model, the equilibrium state in the stable dynamics model is defined by pair (in particular, it differs from the system optimum in the model only by the time value).
3. Numerical Methods
We have the following objective functions
The stable dynamics model:
In both cases, it has the form
The optimization problem (
4) is convex, non-smooth, and composite. We use all these properties to identify the best optimization method to solve the considered problem.
3.1. Subgradient
In our research, we consider first-order methods, i.e., they require a subgradient of , the properties and effective computation of which we discuss in this section.
To get the subdifferential
, let us re-write
in the following way:
where the vector
encodes a path
p. Obviously, the shortest path may not be unique. Using the rules of subgradient calculus [
17] we get the following expression:
i.e., the subdifferential
is a sum of convex hulls of binary vectors that encode the shortest length paths. An important consequence is that for any
and
,
, the following bound holds:
where
H is the diameter of the graph
G.
Note that any element from the set has the form , where is a flow distribution on links induced by concentrated on the shortest paths for given times t (and vice versa: any such f corresponds to a subgradient of ).
In practice, the calculation of flows
f is the most expensive part, since we have to find the shortest paths for all pairs
. We use the following Algorithm 1. We use Dijkstra’s Algorithm [
18] to find the shortest paths in line 3, which runs in
time; finding the traversal order with topological sort (Section 22.4 in [
19]) and further flows aggregation have linear performance
. Hence, the total complexity of Algorithm 1 is
. When the transportation network is an (almost) planar graph or another sparse graph,
and the complexity is
. Moreover, flows reconstruction for every source
can be computed in parallel, and Dijkstra’s algorithm can also be parallelized and has efficient implementations [
20,
21].
Algorithm 1 Flows reconstruction. |
Input: times t- 1:
{flows on edges} - 2:
for origin o in O do - 3:
Get a shortest-path tree from o to all destinations in D with weights t - 4:
{sorting from furthest to closest vertices} - 5:
{total output flow from each vertex} - 6:
for - 7:
for v in do - 8:
Get predecessor p of v in - 9:
- 10:
- 11:
- 12:
end for - 13:
end for - 14:
return flows f
|
3.2. Reconstruction of Admissible Flows in SD Model
For given times
t considered, Algorithm 1 reconstructs feasible flows
f, i.e.,
for some
. These flows meet all the constraints in the Beckmann model, but they can violate the capacity constraints in the stable dynamics model. In the latter case, an additional step is required to obtain admissible flows from
f. Note that we could instead find flows that meet capacity constraints first (Theorem 8 from [
4]), but to reconstruct feasible flows from them is a more complex problem.
Suppose we are given some flows
such that
Then for any
we can construct admissible flows
in the following way: let
, then
In practice, we propose the following procedure to find admissible flows g: run some optimization method (e.g., UGM) for a small number of iterations for the same problem but with decreased capacities: instead of ; if obtained feasible flows satisfy , then take ; otherwise, run it again with capacities and check , etc.
The stopping criterion we use for the stable dynamics model is based on a duality gap
where
,
are estimates of an equilibrium
after
N iterations of the applied method. Note that here the duality gap with
is not applicable.
3.3. Universal Gradient Method
The method for solving non-smooth problems with smooth techniques was proposed by Nesterov [
12] and was called the universal gradient method. The pseudocode of UGM for the considered problem (
4) is provided in Algorithm 2. Here, the euclidean prox-structure is used. Note that we did not specify the stopping criterion as it can be different for different models.
Algorithm 2 Universal gradient method. |
Input:, accuracy - 1:
Set , - 2:
repeat - 3:
- 4:
while true do - 5:
- 6:
if then - 7:
break - 8:
else - 9:
- 10:
end if - 11:
end while - 12:
- 13:
until Stopping criterion is fulfilled
|
Now let us define
where
are the estimates of the local Lipschitz constant in UGM and UMST methods.
Convergence of the UGM was proved in [
12] and is summarized in the following lemma and theorem.
Appendix A is the proofs for UGM.
Lemma 1. After N iterations of UGM for the stable dynamics model, it holds thatwhere , , and are defined by (8), and is the distance from the starting point to a solution. Theorem 1. Let , where M comes from (5). Then, after at mostiterations of UGM for the stable dynamics model, it holds that . Moreover, the stopping criterion (7) is fulfilled after at mostiterations, where ξ comes from (6). Now we provide results on the rate of convergence for the Beckmann model. The stopping criterion in this case is the following:
Lemma 2. After N iterations of UGM for the Beckmann model, it holds thatwhere , , are defined by (8), and Theorem 2. Let , where M comes from (5). Then after at mostiterations of UGM for the Beckmann model, it holds that . Moreover, the stopping criterion (14) is fulfilled after at mostiterations, where 3.4. Universal Method of Similar Triangles
Let us introduce the following notations:
Flows are reconstructed in the following way:
Lemma 3. After N iterations of UMST for the stable dynamics model, it holds thatwhere is defined by (18) and is the distance from the starting point to a solution. Theorem 3. Let , where M comes from (5). Then, after at mostiterations of UGM for the stable dynamics model, it holds that . Moreover, the stopping criterion (7) with is fulfilled after at mostiterations, where ξ comes from (6). Theorem 4. Let , where M comes from (5). Then, after at mostiterations of UMST for the Beckmann model, it holds that . Moreover, the stopping criterion (14) with is fulfilled after at mostiterations, where is defined by (17). Appendix B is the proofs for UMST. 3.5. Method of Weighted Dual Averages
Convergence of the WDA method was proved in [
14] and is summarized in the following theorem.
Appendix C is the proofs for WAD.
Theorem 5. Non-composite WDA-method satisfies the following bounds
For the stable dynamics model: For the Beckmann model if :
4. Numerical Experiments
This section presents numerical results for the algorithms described above, namely, composite variants of UMST and UGM, both composite and non-composite WDA-method, on the Anaheim network [
5,
22]. The network consists of 38 zones, 416 nodes, and 916 links. Experiments and the source code in Python 3 [
23] can be found in [
15]. We used Dijkstra’s algorithm for finding the shortest paths in the network from the graph-tool library [
24], where it is implemented in C++. We also used the Numpy library [
25] for all vector operations.
Parameters of the network are adjusted to the Beckmann model, so we have to increase the capacities to ensure the existence of an equilibrium for the stable dynamics model. In our experiments, the capacities are multiplied by
. In
Figure 2, we plot the number of (inner) iterations of the algorithms required to fulfill the stopping criterion (
7) against
. We consider the number of inner iterations for Algorithms 3 and 2 since the complexity of an inner iteration in this case is similar to the complexity of an iteration of the other algorithms. Note that according to ([
12], formula (2.23)) the number
of inner iterations of UGM or UMST at step
k is bounded as
so asymptotic rates from Theorems 1–4 are still valid.
As we can see, the best results are shown by UMST, followed by UGM having similar performance. Both composite and non-composite WDA-method in Algorithm 4 are much slower.
Algorithm 3 Universal Method of Similar Triangles. |
Input:, accuracy - 1:
, , - 2:
repeat - 3:
- 4:
while true do - 5:
- 6:
if then - 7:
break - 8:
else - 9:
- 10:
end if - 11:
end while - 12:
- 13:
until Stopping criterion is fulfilled
|
Algorithm 4 Method of Weighted Dual Averages. |
Input: accuracy , constant - 1:
, , - 2:
repeat - 3:
Compute subgradient , set - 4:
Set , where - 5:
Set non-composite case: composite case:
- 6:
- 7:
until Stopping criterion is fulfilled
|
For the Beckmann model, we also compare our methods with the Frank–Wolfe algorithm (Algorithm 5), of which the theoretical convergence rate for a convex objective (with Lipschitz-continuous gradient) is
[
7,
26].
Figure 3 shows the convergence rates of the methods for the Beckmann model. The Frank–Wolfe method demonstrates the best results and is followed by UMST. Unlike the stable dynamics case, the composite WDA-method is faster than UGM. However, the non-composite WDA-method has the worst performance again.
Algorithm 5 Frank–Wolfe algorithm. |
Input: accuracy - 1:
, , - 2:
repeat - 3:
, - 4:
, - 5:
- 6:
until Stopping criterion is fulfilled
|
5. Conclusions
We considered several primal-dual subgradient methods for finding equilibria in the stable dynamics and the Beckmann models. We suggested a way to reconstruct admissible flows in the stable dynamics model, which provides us with a novel computable duality gap. Complexity bounds for UMST and UGM were presented in terms of the iterations number required to achieve a desired accuracy in the dual function value or the duality gap. Finally, we conducted numerical experiments comparing convergence of the considered algorithms on the Anaheim transportation network: UMST is the best one for optimization of the dual problems in both models. Furthermore, using the duality gap as a stopping criterion, we compared these methods with the Frank–Wolfe algorithm for the Beckmann model, which, as expected, remains the most suitable approach in this case (but it is not applicable for the stable dynamics model).
The reader may be interested in another related topic—searching stochastic traffic equilibria. In [
27,
28], we (with our colleagues) studied the application of the UMST for finding Nash–Wardrop stochastic equilibria in the Beckmann model. In this case, a driver selects a route randomly according to Gibbs’ distribution taking into account current time costs on the links of the network. It leads to iteration complexity
, where
is a stochasticity parameter (when
, the model boils down to the ordinary Beckmann model). However, the great decrease in the number of iterations comes along with a more expensive calculation of the objective function’s gradient.