Finding Equilibria in the Traffic Assignment Problem with Primal-Dual Gradient Methods for Stable Dynamics Model and Beckmann Model

Kubentayeva, Meruza; Gasnikov, Alexander

doi:10.3390/math9111217


by

Meruza Kubentayeva

^1,* and

Alexander Gasnikov

^1,2,3

¹

Institute for Information Transmission Problems, RAS, Bolshoy Karetny Per. 19, Build.1, 127051 Moscow, Russia

²

Moscow Institute of Physics and Technology, 9 Institutskiy Per., Dolgoprudny, 141701 Moscow Region, Russia

³

Higher School of Economics, 20 Myasnitskaya Ulitsa, 101000 Moscow, Russia

^*

Author to whom correspondence should be addressed.

Mathematics 2021, 9(11), 1217; https://doi.org/10.3390/math9111217

Submission received: 6 May 2021 / Revised: 22 May 2021 / Accepted: 24 May 2021 / Published: 27 May 2021

(This article belongs to the Special Issue Numerical Methods and Algorithms Applied in Intelligent Transportation Systems)


Abstract

In this paper, we consider the application of several gradient methods to the traffic assignment problem: we search for equilibria in the stable dynamics model (Nesterov and De Palma, 2003) and the Beckmann model. Unlike the celebrated Frank–Wolfe algorithm widely used for the Beckmann model, these gradient methods solve the dual problem and then reconstruct a solution to the primal one. We consider the universal gradient method, the universal method of similar triangles, and the method of weighted dual averages, and estimate their complexity for the problem. Due to the primal-dual nature of these methods, we use a duality gap in a stopping criterion. In particular, we present a novel way to reconstruct admissible flows in the stable dynamics model, which provides us with a computable duality gap.

Keywords:

stable dynamics model; Beckmann model; traffic equilibrium; universal gradient method; universal method of similar triangles; method of weighted dual averages; duality gap

1. Introduction

The Beckmann model for searching static traffic equilibria in road networks is among the most widely used models by transportation planners [1,2]. The equilibria found are practical for evaluating the network efficiency and distribution of business centers and residential areas, and establishing urban development plans, etc. This model introduces a cost function on every link of a transportation network, which defines a dependence of the travel cost on the flow along the link. In practice, the BPR functions are usually employed [3]:

τ_{e} (f_{e}) = {\bar{t}}_{e} (1 + ρ {(\frac{f_{e}}{{\bar{f}}_{e}})}^{\frac{1}{μ}}),

(1)

where $\bar{t}_{e}$ are free flow times, and $\bar{f}_{e}$ are road capacities of a given network’s link e. We take these functions with parameters $ρ = 0.15$ and $μ = 0.25$.

Nesterov and de Palma [4] proposed an alternative model called the stable dynamics model, which takes an intermediate place between static and dynamic network assignment models. Namely, its equilibrium can be interpreted as the stationary regime of some dynamic process. Its key assumption is that we no longer introduce a complex dependence of the travel cost on the flow (as in the standard static models) but only pose capacity constraints, i.e., the flow value on each link imposes the feasible set of travel times

τ_{e} (f_{e}) = \{\begin{matrix} {\bar{t}}_{e}, & 0 \leq f_{e} < {\bar{f}}_{e}, \\ [{\bar{t}}_{e}, \infty], & f_{e} = {\bar{f}}_{e}, \\ + \infty, & f_{e} > {\bar{f}}_{e} . \end{matrix}

(2)

Unlike in the Beckmann model, there is no one-to-one correspondence between equilibrium travel times and flows on the links of the network. We can illustrate the difference on a simple example of two parallel routes (Figure 1).

Let the input flow take values 1000, 2000, and 3000 veh/h. For the stable dynamics model, in the first and second cases, all drivers choose the upper route; the equilibrium travel time simply equals the upper route’s free flow time (0.5 h) in the first case and varies from 0.5 to 1 h (according to the model) in the second. In the third case, the input flow exceeds the upper route’s capacity, so the upper route’s flow is 2000 veh/h, the lower one is 1000 veh/h, and the equilibrium travel time is 1 h. All these equilibria can be interpreted as stationary regimes of some dynamic processes, e.g., the last case can be viewed as the result of the queue at the beginning of the upper route (since this route’s capacity is smaller than the input flow) created by drivers who wanted to take this route until the waiting time plus the route’s travel time reached the lower route’s travel time [4]. In the Beckmann model, equilibria are as follows: for all three cases, only the upper route is used, and the equilibrium travel times are approximately 0.5, 0.6, and 0.9 h, respectively. Chudak, Dos Santos Eleuterio, and Nesterov [5] conducted a detailed comparison—for large and small networks—of equilibria in these two models.
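The Beckmann travel times quoted in this example can be reproduced from the BPR function (1). The sketch below assumes the upper route has free flow time 0.5 h and capacity 2000 veh/h, as stated in the text; the function name `bpr_travel_time` is ours and is not taken from the paper's code.

```python
def bpr_travel_time(flow, free_flow_time, capacity, rho=0.15, mu=0.25):
    """BPR cost (1): t_bar * (1 + rho * (f / f_bar) ** (1 / mu))."""
    return free_flow_time * (1.0 + rho * (flow / capacity) ** (1.0 / mu))


# Upper route of the two-route example: t_bar = 0.5 h, f_bar = 2000 veh/h.
upper_times = [bpr_travel_time(flow, 0.5, 2000.0) for flow in (1000.0, 2000.0, 3000.0)]
for flow, time in zip((1000, 2000, 3000), upper_times):
    print(f"flow {flow} veh/h -> travel time {time:.4f} h")
```

Rounded to one decimal, the three values are 0.5, 0.6, and 0.9 h. All of them stay below the lower route's free flow time of 1 h, which is why only the upper route is used in the Beckmann equilibrium.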

In the Beckmann model, the search for equilibria reduces to the minimization of a potential function. One of the most popular and effective approaches to solving this problem numerically is the famous Frank–Wolfe method [6,7], together with its numerous modifications [8,9,10,11].

In the case of the stable dynamics model, one cannot directly apply the Frank–Wolfe method. However, an equilibrium can be found as a solution of a pair of primal and dual optimization problems. The same holds also for the Beckmann model, so in both cases we can apply primal-dual (sub)gradient methods.

In this work, we compare several primal-dual gradient methods for searching equilibria in both the Beckmann and the stable dynamics models, namely, the universal gradient method (UGM) [12], the universal method of similar triangles (UMST) [13], and the method of weighted dual averages (WDA) [14]. The main advantage of the above universal methods is an automatic adjustment to a local (Hölder) smoothness of a minimized function, which is especially important since the dual problems we are dealing with are essentially non-smooth. Due to the primal-dual nature of these methods, one can use an adaptive stopping criterion guaranteeing required accuracy.

The main contributions of this paper include the following:

We propose a novel way to reconstruct admissible flows (i.e., meeting the capacity constraints and induced by flows on the paths) in the stable dynamics model and a novel computable duality gap, which can be used in a stopping criterion.
We provide theoretical upper bounds on the complexity of searching equilibria by the considered algorithms: UMST, UGM, and WDA.
We conducted numerical experiments comparing these algorithms on the Anaheim transportation network—the source code is available for use and can be found in [15].

The paper is organized as follows. In Section 2, we give a problem statement and define equilibria in the Beckmann and the stable dynamics models and corresponding optimization problems. Section 3 is devoted to the complexity analysis of UGM, UMST, and WDA. We show that the number of iterations required to obtain an ε-solution of primal and dual problems is $O(1 / ε^{2})$ for UGM and UMST. In Section 4, results of experiments on the Anaheim transportation network are presented. Finally, some conclusions are drawn in Section 5.

2. Problem Statement

Let the urban road network be represented by a directed graph $G = (V, E)$, where vertices V correspond to intersections or centroids [16] and edges E correspond to roads. Suppose we are given the travel demands: namely, let $d_{w}$ (veh/h) be a trip rate for an origin–destination pair w from the set $OD \subseteq \{w = (i, j) : i \in O, j \in D\}$. Here, $O \subseteq V$ is the set of all possible origins of trips, and $D \subseteq V$ is the set of destination nodes. For an OD pair $w = (i, j)$, denote by $P_{w}$ the set of all simple paths from i to j. Accordingly, $P = \bigcup_{w \in OD} P_{w}$ is the set of all possible routes for all OD pairs. Agents traveling from node i to node j are distributed among paths from $P_{w}$, i.e., for any $p \in P_{w}$ there is a flow $x_{p} \in R_{+}$ along the path p, and $\sum_{p \in P_{w}} x_{p} = d_{w}$. Flows from vertices from the set O to vertices from the set D create the traffic in the entire network G, which can be represented by an element of

X = \{x \in R_{+}^{| P |} : \sum_{p \in P_{w}} x_{p} = d_{w}, w \in O D\} .

Note that the dimension of X can be extremely large: e.g., for an $n \times n$ Manhattan network, $\log |P| = Ω(n)$. To describe a state of the network, we do not need to know an entire vector x but only flows on arcs:

f_{e} (x) = \sum_{p \in P} δ_{e p} x_{p} for e \in E,

where $δ_{e p} = 1\{e \in p\}$. Let us introduce a matrix Θ such that $Θ_{e, p} = δ_{e p}$ for $e \in E$, $p \in P$, so in vector notation we have $f = Θ x$. To describe an equilibrium, we use both path- and link-based notations $(x, t)$ or $(f, t)$.
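As a small illustration of the relation $f = Θ x$, the following sketch aggregates path flows into link flows without forming the matrix Θ explicitly. The four-link network, the path list, and the name `link_flows` are hypothetical and serve only as an example.

```python
def link_flows(paths, path_flows, num_links):
    """Compute f = Theta x, where each path is given as a list of link indices."""
    flows = [0.0] * num_links
    for links, x_p in zip(paths, path_flows):
        for e in links:          # delta_{ep} = 1 exactly for the links on path p
            flows[e] += x_p
    return flows


# Hypothetical network: links 0:(a,b), 1:(b,d), 2:(a,c), 3:(c,d); one OD pair (a,d).
paths = [[0, 1], [2, 3]]         # the two simple paths from a to d
x = [700.0, 300.0]               # path flows, summing to the demand d_w = 1000
f = link_flows(paths, x, num_links=4)
print(f)
```

Storing paths as link lists keeps the cost proportional to the total length of the used paths, which matters because the number of paths $|P|$ can be exponentially large.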

Beckmann model.

One of the key ideas behind the Beckmann model is that the cost (e.g., travel time, gas expenses) of passing a link e is the same for all agents and depends solely on the flow $f_{e}$ along it. In what follows, we denote this cost for a given flow $f_{e}$ by $t_{e} = τ_{e}(f_{e})$. Another essential point is a behavioral assumption on agents called Wardrop’s first principle: we suppose that each agent knows the state of the whole network and chooses a path p minimizing the total cost

T_{p} (t) = \sum_{e \in p} t_{e} .

The cost functions are supposed to be continuous, non-decreasing, and non-negative. Then $(x^{*}, t^{*})$, where $t^{*} = {(t_{e}^{*})}_{e \in E}$, is an equilibrium state, i.e., it satisfies conditions

\begin{matrix} t_{e}^{*} = τ_{e} (f_{e}^{*}), where f^{*} = Θ x^{*}, \\ x_{p_{w}}^{*} > 0 ⟹ T_{p_{w}} (t^{*}) = T_{w} (t^{*}) = min_{p \in P_{w}} T_{p} (t^{*}), \end{matrix}

if and only if $x^{*}$ is a minimum of the potential function:

\begin{matrix} Ψ (x) = \sum_{e \in E} \underset{σ_{e} (f_{e})}{\underset{⏟}{\int_{0}^{f_{e}} τ_{e} (z) d z}} ⟶ min_{f = Θ x, x \in X} \\ ⟺ Ψ (f) = \sum_{e \in E} σ_{e} (f_{e}) ⟶ min_{f = Θ x : x \in X}, \end{matrix}

and $t_{e}^{*} = τ_{e}(f_{e}^{*})$ [1].

Another way to find an equilibrium numerically is by solving a dual problem. According to Theorem 4 from [4], we can construct it in the following way:

\begin{matrix} min_{f = Θ x : x \in X} Ψ (f) & = min_{x \in X, f} [Ψ (f) + sup_{t \in R^{| E |}} 〈 t, Θ x - f 〉] = sup_{t \in R^{| E |}} min_{x \in X, f} [Ψ (f) + 〈 t, Θ x - f 〉] \\ = sup_{t \in R^{| E |}} [- \sum_{e \in E} max_{f_{e}} {t_{e} f_{e} - σ_{e} (f_{e})} + min_{x \in X} \sum_{p} \sum_{e \in E} t_{e} δ_{e p} x_{p}] \\ = max_{t \in dom σ^{*}} - [\sum_{e \in E} σ_{e}^{*} (t_{e}) - \sum_{w \in O D} d_{w} T_{w} (t)] = - min_{t \geq \bar{t}} Q (t), \end{matrix}

where

σ_{e}^{*} (t_{e}) = sup_{f_{e} \geq 0} {t_{e} f_{e} - σ_{e} (f_{e})} = {\bar{f}}_{e} {(\frac{t_{e} - {\bar{t}}_{e}}{{\bar{t}}_{e} ρ})}^{μ} \frac{(t_{e} - {\bar{t}}_{e})}{1 + μ}

is the conjugate function of $σ_{e}(f_{e})$, $e \in E$. Finally, we obtain the dual problem, the solution of which is $t^{*}$:

Q (t) = - \sum_{w \in O D} d_{w} T_{w} (t) + \sum_{e \in E} σ_{e}^{*} (t_{e}) ⟶ min_{t \geq \bar{t}} .

(3)
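The closed form of the conjugate function used in (3) can be checked directly from the BPR function (1). The derivation below is a sketch of the intermediate steps, which the text leaves implicit; it assumes $t_{e} \geq \bar{t}_{e}$, so that the supremum is attained at the flow where $τ_{e}(f_{e}) = t_{e}$.

```latex
\begin{aligned}
\sigma_e(f_e) &= \int_0^{f_e} \tau_e(z)\,dz
  = \bar t_e\left(f_e + \frac{\rho\,\mu}{1+\mu}\,
    \frac{f_e^{\,1+1/\mu}}{\bar f_e^{\,1/\mu}}\right),\\[2pt]
t_e = \tau_e(f_e) \;&\Longleftrightarrow\;
  f_e = \bar f_e\left(\frac{t_e-\bar t_e}{\bar t_e\,\rho}\right)^{\mu},\\[2pt]
\sigma_e^*(t_e) &= t_e f_e - \sigma_e(f_e)
  = (t_e-\bar t_e)\,f_e - \frac{\mu}{1+\mu}\,(t_e-\bar t_e)\,f_e
  = \bar f_e\left(\frac{t_e-\bar t_e}{\bar t_e\,\rho}\right)^{\mu}
    \frac{t_e-\bar t_e}{1+\mu}.
\end{aligned}
```

The second equality in the last line uses $\bar{t}_{e} ρ (f_{e} / \bar{f}_{e})^{1/μ} = t_{e} - \bar{t}_{e}$, which is just the stationarity condition from the line above.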

When we search for the solution to this problem numerically, at every step of the applied method we can reconstruct the primal variable f from the current dual variable t: $f \in \partial \sum_{w \in OD} d_{w} T_{w}(t)$ (see Section 3.1). Then we can use the duality gap, which is always nonnegative, to estimate the method’s accuracy:

Δ (f, t) = Ψ (f) + Q (t) .

It vanishes only at the equilibrium $(f^{*}, t^{*})$.
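To make the duality gap concrete, the sketch below evaluates $Δ(f, t) = Ψ(f) + Q(t)$ for the two-route example from the Introduction with an input flow of 3000 veh/h. The upper route parameters (0.5 h, 2000 veh/h) and the lower route's free flow time (1 h) come from the text; the lower route's capacity of 2000 veh/h is an assumption made only for this illustration, and all function names are ours.

```python
RHO, MU = 0.15, 0.25


def bpr(f, t_bar, f_bar):
    """BPR cost function (1)."""
    return t_bar * (1.0 + RHO * (f / f_bar) ** (1.0 / MU))


def sigma(f, t_bar, f_bar):
    """sigma_e(f) = integral of the BPR function from 0 to f."""
    p = 1.0 / MU
    return t_bar * (f + RHO * f ** (p + 1.0) / ((p + 1.0) * f_bar ** p))


def sigma_conj(t, t_bar, f_bar):
    """Conjugate sigma_e^*(t), valid for t >= t_bar."""
    return f_bar * ((t - t_bar) / (t_bar * RHO)) ** MU * (t - t_bar) / (1.0 + MU)


def duality_gap(f, t, t_bar, f_bar, demand):
    """Delta(f, t) = Psi(f) + Q(t) for parallel one-link routes and one OD pair."""
    psi = sum(sigma(fe, tb, fb) for fe, tb, fb in zip(f, t_bar, f_bar))
    q = -demand * min(t) + sum(
        sigma_conj(te, tb, fb) for te, tb, fb in zip(t, t_bar, f_bar)
    )
    return psi + q


t_bar, f_bar, demand = [0.5, 1.0], [2000.0, 2000.0], 3000.0  # lower capacity assumed
f_eq = [3000.0, 0.0]                                         # all flow on the upper route
t_eq = [bpr(fe, tb, fb) for fe, tb, fb in zip(f_eq, t_bar, f_bar)]
gap_at_equilibrium = duality_gap(f_eq, t_eq, t_bar, f_bar, demand)
gap_elsewhere = duality_gap([1500.0, 1500.0], [0.6, 1.2], t_bar, f_bar, demand)
print(gap_at_equilibrium, gap_elsewhere)
```

At the equilibrium pair the gap is zero up to rounding, while for the arbitrary feasible pair it is strictly positive, in line with the statement above.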

Stable dynamics model [4].

An equilibrium state $(x^{*}, t^{*})$ of the stable dynamics model satisfies the following conditions:

\begin{matrix} t_{e}^{*} \in τ_{e} (f_{e}^{*}), \\ x_{p_{w}}^{*} > 0 ⟹ T_{p_{w}} (t^{*}) = T_{w} (t^{*}), \end{matrix}

where $τ(f)$ is defined by (2). These conditions can be reformulated in terms of an optimization problem:

\begin{matrix} x^{*} & = \underset{x \in X}{arg min} \sum_{w \in O D} \sum_{p \in P_{w}} x_{p} T_{p} (t^{*}) \\ = \underset{x \in X}{arg min} \sum_{e \in E} t_{e}^{*} f_{e} (x) \\ = \underset{x \in X}{arg min} \sum_{e \in E} [t_{e}^{*} f_{e} (x) - \underset{̲}{(t_{e}^{*} - {\bar{t}}_{e}) {\bar{f}}_{e}}], \end{matrix}

\begin{matrix} t_{e}^{*} \in τ_{e} (f_{e}^{*}) ⟺ t_{e}^{*} & = \underset{t_{e} \geq {\bar{t}}_{e}}{arg max} t_{e} (f_{e}^{*} - {\bar{f}}_{e}) \\ = \underset{t_{e} \geq {\bar{t}}_{e}}{arg max} [t_{e} (f_{e}^{*} - {\bar{f}}_{e}) + \underset{̲}{{\bar{t}}_{e} {\bar{f}}_{e}}] . \end{matrix}

Here, we add the underlined constant terms to show that the pair $(f^{*}, t^{*})$ is an equilibrium if and only if it is a solution of the saddle-point problem

\sum_{e \in E} [t_{e} f_{e} - (t_{e} - {\bar{t}}_{e}) {\bar{f}}_{e}] ⟶ min_{\begin{matrix} f = Θ x : \\ x \in X \end{matrix}} max_{t_{e} \geq {\bar{t}}_{e}},

where its primal problem is

\begin{matrix} Ψ (x) = sup_{t_{e} \geq {\bar{t}}_{e}} \sum_{e \in E} [t_{e} f_{e} - (t_{e} - {\bar{t}}_{e}) {\bar{f}}_{e}] = \sum_{e \in E} {\bar{t}}_{e} f_{e} + \sum_{e \in E} sup_{t_{e} \geq {\bar{t}}_{e}} (t_{e} - {\bar{t}}_{e}) (f_{e} - {\bar{f}}_{e}) ⟶ min_{f = Θ x : x \in X} \\ ⟺ Ψ (f) = \sum_{e \in E} f_{e} {\bar{t}}_{e} ⟶ min_{\begin{matrix} f = Θ x : \\ x \in X, f_{e} \leq {\bar{f}}_{e} \end{matrix}}, \end{matrix}

and its dual problem is

\begin{matrix} Q (t) & = - inf_{f = Θ x : x \in X} \sum_{e \in E} [t_{e} f_{e} - (t_{e} - {\bar{t}}_{e}) {\bar{f}}_{e}] \\ = - \sum_{w \in O D} d_{w} T_{w} (t) + 〈 t - \bar{t}, \bar{f} 〉 ⟶ min_{t_{e} \geq {\bar{t}}_{e}} . \end{matrix}

In contrast with the Beckmann model, the equilibrium state in the stable dynamics model is defined by the pair $(f^{*}, t^{*})$ (in particular, it differs from the system optimum $(f^{*}, \bar{t})$ in the model only by the time value).
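The primal and dual objectives of the stable dynamics model can be checked on the two-route example from the Introduction. The sketch below is an illustration under stated assumptions: the lower route's capacity (2000 veh/h) is not given in the text and is chosen arbitrarily, and the function names are ours.

```python
import math


def psi_sd(f, t_bar, f_bar):
    """Primal objective: sum of f_e * t_bar_e, +inf if a capacity is violated."""
    if any(fe > fb for fe, fb in zip(f, f_bar)):
        return math.inf
    return sum(fe * tb for fe, tb in zip(f, t_bar))


def q_sd(t, t_bar, f_bar, demand):
    """Dual objective for parallel one-link routes: -d * min(t) + <t - t_bar, f_bar>."""
    return -demand * min(t) + sum((te - tb) * fb for te, tb, fb in zip(t, t_bar, f_bar))


t_bar, f_bar = [0.5, 1.0], [2000.0, 2000.0]   # lower-route capacity is an assumption

# Input flow 3000 veh/h: equilibrium flows (2000, 1000), both routes take 1 h.
gap_3000 = psi_sd([2000.0, 1000.0], t_bar, f_bar) + q_sd([1.0, 1.0], t_bar, f_bar, 3000.0)

# Same flows paired with free flow times: feasible, but not an equilibrium.
gap_free_flow = psi_sd([2000.0, 1000.0], t_bar, f_bar) + q_sd(t_bar, t_bar, f_bar, 3000.0)

# Input flow 2000 veh/h: any upper-route time in [0.5, 1] closes the gap.
gap_2000 = psi_sd([2000.0, 0.0], t_bar, f_bar) + q_sd([0.75, 1.0], t_bar, f_bar, 2000.0)
print(gap_3000, gap_free_flow, gap_2000)
```

The zero gap for the 2000 veh/h case with an intermediate time of 0.75 h reflects the lack of a one-to-one correspondence between equilibrium flows and times in this model.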

3. Numerical Methods

We have the following objective functions:

The stable dynamics model:

$Q (t) = \underset{Φ (t)}{\underset{⏟}{- \sum_{w \in O D} d_{w} T_{w} (t)}} + \underset{h (t)}{\underset{⏟}{〈 t - \bar{t}, \bar{f} 〉}},$
The Beckmann model:

$Q (t) = \underset{Φ (t)}{\underset{⏟}{- \sum_{w \in O D} d_{w} T_{w} (t)}} + \underset{h (t)}{\underset{⏟}{\sum_{e \in E} {\bar{f}}_{e} {(\frac{t_{e} - {\bar{t}}_{e}}{{\bar{t}}_{e} ρ})}^{μ} \frac{(t_{e} - {\bar{t}}_{e})}{1 + μ}}} .$

In both cases, it has the form

Q (t) = Φ (t) + h (t) ⟶ min_{t \geq \bar{t}} .

(4)

The optimization problem (4) is convex, non-smooth, and composite. We use all these properties to identify the best optimization method to solve the considered problem.

3.1. Subgradient

In our research, we consider first-order methods, i.e., methods that require a subgradient of $Φ(t)$, whose properties and efficient computation we discuss in this section.

To get the subdifferential $\partial Φ(t)$, let us rewrite $Φ(t)$ in the following way:

Φ (t) = - \sum_{w \in O D} d_{w} T_{w} (t) = - \sum_{w \in O D} d_{w} min_{p \in P_{w}} 〈 t, a_{p} 〉,

where the vector $a_{p} = {(δ_{e p})}_{e \in E}$ encodes a path p. Note that the shortest path may not be unique. Using the rules of subgradient calculus [17], we get the following expression:

\partial Φ (t) = - \sum_{w \in O D} d_{w} \partial (min_{p \in P_{w}} 〈 t, a_{p} 〉) = - \sum_{w \in O D} d_{w} Conv {a_{p} : p \in P_{w}, T_{p} (t) = T_{w} (t)},

i.e., the subdifferential $\partial Φ(t)$ is a sum of convex hulls of binary vectors that encode the shortest paths. An important consequence is that for any $t_{1}, t_{2} \in R_{+}^{|E|}$ and $\nabla Φ(t_{1}) \in \partial Φ(t_{1})$, $\nabla Φ(t_{2}) \in \partial Φ(t_{2})$, the following bound holds:

{∥ \nabla Φ (t_{1}) - \nabla Φ (t_{2}) ∥}_{2} \leq M = \sqrt{2 H} \sum_{w \in O D} d_{w},

(5)

where H is the diameter of the graph G.

Note that any element of the set $\partial Φ(t)$ has the form $\nabla Φ(t) = -f$, where $f = Θ x$ is a flow distribution on links induced by $x \in X$ concentrated on the shortest paths for given times t (and vice versa: any such f corresponds to a subgradient of $Φ(t)$).

In practice, the calculation of flows f is the most expensive part, since we have to find the shortest paths for all pairs $w \in OD$. For this purpose, we use Algorithm 1. Dijkstra’s algorithm [18], which finds the shortest paths in line 3, runs in $O(|E| + |V| \log |V|)$ time; finding the traversal order with topological sort (Section 22.4 in [19]) and the subsequent aggregation of flows take linear time $O(|V|)$. Hence, the total complexity of Algorithm 1 is $O(|O| (|E| + |V| \log |V|))$. When the transportation network is an (almost) planar graph or another sparse graph, $|E| = O(|V|)$ and the complexity is $O(|O| \cdot |V| \log |V|)$. Moreover, flows reconstruction for every source $o \in O$ can be computed in parallel, and Dijkstra’s algorithm can also be parallelized and has efficient implementations [20,21].

Algorithm 1 Flows reconstruction.

Input: times t

1: $f ≔ 0_{| E |}$ {flows on edges}
2: for origin o in O do
3:     Get a shortest-path tree $T_{o}$ from o to all destinations in D with weights t
4:     $traversal_order ≔ TopologicalSort (T_{o})$ {sorting from furthest to closest vertices}
5:     $f_{out} ≔ 0_{| V |}$ {total output flow from each vertex}
6:     $f_{out} [v] ≔ d_{w}$ for $w = (o, v) \in O D$
7:     for v in $traversal_order$, $v \neq o$, do
8:         Get predecessor p of v in $T_{o}$
9:         $e ≔ (p, v)$
10:         $f [e] ≔ f [e] + f_{out} [v]$
11:         $f_{out} [p] ≔ f_{out} [p] + f_{out} [v]$
12:     end for
13: end for
14: return flows f
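Algorithm 1 can be sketched in plain Python as follows. This is a minimal illustration, not the authors' implementation (which relies on graph-tool): it uses a textbook binary-heap Dijkstra, and instead of an explicit topological sort it traverses vertices in the reverse of the order in which Dijkstra settles them, which is a valid furthest-to-closest order for the shortest-path tree. The function name, argument layout, and the test network are our assumptions.

```python
import heapq


def reconstruct_flows(num_vertices, edges, times, demands):
    """edges: list of (u, v); times: t_e per edge; demands: {(origin, dest): d_w}."""
    adjacency = [[] for _ in range(num_vertices)]
    for e, (u, v) in enumerate(edges):
        adjacency[u].append((v, e))

    flows = [0.0] * len(edges)
    for origin in sorted({o for o, _ in demands}):
        # Dijkstra from the origin; pred_edge encodes the shortest-path tree.
        dist = [float("inf")] * num_vertices
        pred_edge = [None] * num_vertices
        settled, done = [], [False] * num_vertices
        dist[origin] = 0.0
        heap = [(0.0, origin)]
        while heap:
            d, u = heapq.heappop(heap)
            if done[u]:
                continue
            done[u] = True
            settled.append(u)
            for v, e in adjacency[u]:
                if d + times[e] < dist[v]:
                    dist[v] = d + times[e]
                    pred_edge[v] = e
                    heapq.heappush(heap, (dist[v], v))

        # Total flow leaving the tree at each vertex (demand terminating there).
        f_out = [0.0] * num_vertices
        for (o, dest), d_w in demands.items():
            if o == origin:
                f_out[dest] += d_w

        # Children are settled after parents, so reversed order pushes flow upward.
        # Unreachable destinations are never settled; their demand is ignored here.
        for v in reversed(settled):
            e = pred_edge[v]
            if e is None:        # the origin itself has no predecessor
                continue
            flows[e] += f_out[v]
            f_out[edges[e][0]] += f_out[v]
    return flows


edges = [(0, 1), (1, 3), (0, 2), (2, 3)]
flows = reconstruct_flows(4, edges, [1.0, 1.0, 1.0, 2.0], {(0, 3): 10.0, (0, 1): 5.0})
print(flows)
```

In the usage example, both OD pairs share the link (0, 1), so its flow is the sum of the two demands, while the longer route through vertex 2 stays empty.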

3.2. Reconstruction of Admissible Flows in SD Model

For given times t, Algorithm 1 reconstructs feasible flows f, i.e., $f = Θ x$ for some $x \in X$. These flows meet all the constraints in the Beckmann model, but they can violate the capacity constraints in the stable dynamics model. In the latter case, an additional step is required to obtain admissible flows from f. Note that we could instead first find flows that meet the capacity constraints (Theorem 8 from [4]), but reconstructing feasible flows from them is a more complex problem.

Suppose we are given some feasible flows $g = Θ x$, $x \in X$, such that

ξ = 1 - max_{e \in E} g_{e} / {\bar{f}}_{e} > 0 .

(6)

Then for any feasible flows $f = Θ x'$, $x' \in X$, we can construct admissible flows $π(f)$ in the following way: let $η = \max_{e \in E} f_{e} / \bar{f}_{e} - 1$, then

π (f) = \{\begin{matrix} f, & η \leq 0, \\ \frac{ξ f + η g}{ξ + η}, & η > 0 . \end{matrix}
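The map π is a convex combination of f and g, so the result is still induced by path flows from X; it respects the capacities because $f_{e} \leq (1 + η) \bar{f}_{e}$ and $g_{e} \leq (1 - ξ) \bar{f}_{e}$ give $ξ f_{e} + η g_{e} \leq (ξ + η) \bar{f}_{e}$. A minimal sketch follows; the function name and the numbers in the example are illustrative assumptions.

```python
def admissible_flows(f, g, f_bar):
    """Mix feasible flows f with strictly admissible flows g so capacities hold."""
    xi = 1.0 - max(ge / fb for ge, fb in zip(g, f_bar))
    if xi <= 0.0:
        raise ValueError("g must satisfy g_e < f_bar_e on every link, see (6)")
    eta = max(fe / fb for fe, fb in zip(f, f_bar)) - 1.0
    if eta <= 0.0:
        return list(f)                      # f already meets the capacities
    return [(xi * fe + eta * ge) / (xi + eta) for fe, ge in zip(f, g)]


f_bar = [2000.0, 2000.0]
f = [3000.0, 0.0]        # feasible (total 3000) but exceeds the first capacity
g = [1500.0, 1500.0]     # feasible and strictly below both capacities
projected = admissible_flows(f, g, f_bar)
print(projected)
```

In this example ξ = 0.25 and η = 0.5, and the result saturates the first link exactly while keeping the total flow equal to 3000.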

In practice, we propose the following procedure to find admissible flows g: run some optimization method (e.g., UGM) for a small number of iterations on the same problem but with decreased capacities, $\frac{1}{2} \bar{f}$ instead of $\bar{f}$; if the obtained feasible flows $\hat{f}^{N}$ satisfy $\hat{f}^{N} \leq \frac{3}{4} \bar{f}$, then take $g = \hat{f}^{N}$; otherwise, run it again with capacities $\frac{3}{4} \bar{f}$ and check $\hat{f}^{N} \leq \frac{7}{8} \bar{f}$, etc.

Stopping criterion.

The stopping criterion we use for the stable dynamics model is based on a duality gap

Q ({\hat{t}}^{N}) + Ψ (π ({\hat{f}}^{N})) \leq ε,

(7)

where $\hat{f}^{N} \in \{Θ x : x \in X\}$, $\hat{t}^{N} \geq \bar{t}$ are estimates of an equilibrium $(f^{*}, t^{*})$ after N iterations of the applied method. Note that here the duality gap with $\hat{f}^{N}$ itself is not applicable, since $\hat{f}^{N}$ may violate the capacity constraints.

3.3. Universal Gradient Method

The universal gradient method, which solves non-smooth problems with techniques designed for smooth ones, was proposed by Nesterov [12]. The pseudocode of UGM for the considered problem (4) is provided in Algorithm 2. Here, the Euclidean prox-structure is used. Note that we do not specify the stopping criterion, as it differs between the models.

Algorithm 2 Universal gradient method.

Input: $L_{0} > 0$, accuracy $ε > 0$

Set $t^{0} ≔ \bar{t}$, $k ≔ 0$
repeat
    $L_{k + 1} ≔ L_{k} / 2$
    while true do
        $t^{k + 1} ≔ \underset{t \in dom h}{arg min} 〈 \nabla Φ (t^{k}), t - t^{k} 〉 + h (t) + L_{k + 1} \frac{{∥ t - t^{k} ∥}_{2}^{2}}{2}$
        if $Φ (t^{k + 1}) \leq Φ (t^{k}) + 〈\nabla Φ (t^{k}), t^{k + 1} - t^{k}〉 + L_{k + 1} \frac{{∥ t^{k + 1} - t^{k} ∥}_{2}^{2}}{2} + \frac{ε}{2}$ then
            break
        else
            $L_{k + 1} ≔ 2 L_{k + 1}$
        end if
    end while
    $k ≔ k + 1$
until Stopping criterion is fulfilled
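For the stable dynamics model, h is linear, so the arg min in Algorithm 2 has a closed form: with $\nabla Φ(t^{k}) = -f^{k}$, the update is the componentwise projection $t^{k+1} = \max\{\bar{t}, t^{k} + (f^{k} - \bar{f}) / L_{k+1}\}$. The sketch below performs one outer iteration of UGM, including the backtracking on L, for a toy network of two parallel one-link routes. It is an illustration under our own assumptions (toy oracle, function names, lower-route capacity of 2000 veh/h), not the authors' code.

```python
def phi_and_flows(t, demand):
    """Phi(t) = -d * min_e t_e and all-or-nothing link flows (two parallel routes)."""
    best = 0 if t[0] <= t[1] else 1
    flows = [0.0, 0.0]
    flows[best] = demand
    return -demand * t[best], flows


def ugm_iteration_sd(t_k, L_k, eps, t_bar, f_bar, demand):
    """One outer iteration of UGM for the stable dynamics dual; returns (t_next, L_next)."""
    phi_k, f_k = phi_and_flows(t_k, demand)
    grad = [-fe for fe in f_k]                       # subgradient of Phi is -f
    L = L_k / 2.0
    while True:
        # Closed-form prox step: projection onto {t >= t_bar}.
        t_next = [max(tb, tk - (g + fb) / L)
                  for tb, tk, g, fb in zip(t_bar, t_k, grad, f_bar)]
        phi_next, _ = phi_and_flows(t_next, demand)
        diff = [a - b for a, b in zip(t_next, t_k)]
        model = (phi_k + sum(g * d for g, d in zip(grad, diff))
                 + 0.5 * L * sum(d * d for d in diff) + eps / 2.0)
        if phi_next <= model:                        # acceptance test of Algorithm 2
            return t_next, L
        L *= 2.0                                     # otherwise increase L and retry


t_bar, f_bar, demand = [0.5, 1.0], [2000.0, 2000.0], 3000.0
t1, L1 = ugm_iteration_sd(t_bar, L_k=1.0, eps=1.0, t_bar=t_bar, f_bar=f_bar, demand=demand)
print(t1, L1)
```

Starting from $L_{0} = 1$, the backtracking loop doubles L until the step is short enough that the upper route remains the shortest one, which illustrates how the method adapts L to the local behavior of the non-smooth function Φ.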

Now let us define

{\hat{f}}^{N} = - \frac{1}{S_{N}} \sum_{k = 0}^{N - 1} \frac{\nabla Φ (t^{k})}{L_{k + 1}}, {\hat{t}}^{N} = \frac{1}{S_{N}} \sum_{k = 1}^{N} \frac{t^{k}}{L_{k}}, S_{N} = \sum_{k = 1}^{N} \frac{1}{L_{k}},

(8)

where $L_{k}$ are the estimates of the local Lipschitz constant in the UGM and UMST methods.

Convergence of the UGM was proved in [12] and is summarized in the following lemma and theorem. The proofs for UGM are given in Appendix A.

Lemma 1.

After N iterations of UGM for the stable dynamics model, it holds that

\begin{matrix} Q ({\hat{t}}^{N}) - Q (t^{*}) \leq \frac{R^{2}}{S_{N}} + \frac{ε}{2}, \end{matrix}

(9)

\begin{matrix} 0 \leq Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N}) + 〈 t^{*} - \bar{t}, {({\hat{f}}^{N} - \bar{f})}_{+} 〉 \leq \frac{R^{2}}{S_{N}} + \frac{ε}{2}, \end{matrix}

(10)

\begin{matrix} {∥ {({\hat{f}}^{N} - \bar{f})}_{+} ∥}_{2} \leq \frac{4 R}{S_{N}} + \sqrt{\frac{2 ε}{S_{N}}}, \end{matrix}

(11)

where $\hat{f}^{N}$, $\hat{t}^{N}$, and $S_{N}$ are defined by (8), and $R = {∥ t^{*} - \bar{t} ∥}_{2}$ is the distance from the starting point to a solution.

Theorem 1.

Let $L_{0} \leq \frac{M^{2}}{ε}$, where M comes from (5). Then, after at most

N_{Q} = 2 {(\frac{R M}{ε})}^{2}

(12)

iterations of UGM for the stable dynamics model, it holds that $Q ({\hat{t}}^{N}) - Q (t^{*}) \leq ε$. Moreover, the stopping criterion (7) is fulfilled after at most

N_{s t o p} = O ({(\frac{R M}{ε})}^{2} max \{1, {(\frac{〈 g - f^{*}, \bar{t} 〉}{ξ R {min}_{e} {\bar{f}}_{e}})}^{2}\})

(13)

iterations, where ξ comes from (6).

Now we provide results on the rate of convergence for the Beckmann model. The stopping criterion in this case is the following:

Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N}) \leq ε .

(14)

Lemma 2.

After N iterations of UGM for the Beckmann model, it holds that

\begin{matrix} Q ({\hat{t}}^{N}) - Q (t^{*}) \leq \frac{R^{2}}{S_{N}} + \frac{ε}{2}, \\ 0 \leq Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N}) \leq \frac{{∥ τ ({\hat{f}}^{N}) - \bar{t} ∥}_{2}^{2}}{S_{N}} + \frac{ε}{2}, \end{matrix}

where $\hat{f}^{N}$, $\hat{t}^{N}$, and $S_{N}$ are defined by (8), and $R = {∥ t^{*} - \bar{t} ∥}_{2}$.

Theorem 2.

Let $L_{0} \leq \frac{M^{2}}{ε}$, where M comes from (5). Then, after at most

N_{Q} = 2 {(\frac{R M}{ε})}^{2}

(15)

iterations of UGM for the Beckmann model, it holds that $Q ({\hat{t}}^{N}) - Q (t^{*}) \leq ε$. Moreover, the stopping criterion (14) is fulfilled after at most

N_{s t o p} = 2 {(\frac{\tilde{R} M}{ε})}^{2}

(16)

iterations, where

{\tilde{R}}^{2} = ρ^{2} \sum_{e \in E} \frac{{\bar{t}}_{e}^{2}}{{\bar{f}}_{e}^{2 / μ}} {(\sum_{w \in O D} d_{w})}^{2 / μ} .

(17)

3.4. Universal Method of Similar Triangles

Let us introduce the following notations:

ϕ_{0} (t) = \frac{1}{2} {∥ t - t^{0} ∥}_{2}^{2},

ϕ_{k + 1} (t) = ϕ_{k} (t) + α_{k + 1} [Φ (y^{k + 1}) + 〈\nabla Φ (y^{k + 1}), t - y^{k + 1}〉 + h (t)] .

Flows are reconstructed in the following way:

{\hat{f}}^{N} = - \frac{1}{A_{N}} \sum_{k = 1}^{N} α_{k} \nabla Φ (y^{k})

(18)

Lemma 3.

After N iterations of UMST for the stable dynamics model, it holds that

\begin{matrix} Q (t^{N}) - Q (t^{*}) \leq \frac{R^{2}}{A_{N}} + \frac{ε}{2}, \end{matrix}

(19)

\begin{matrix} 0 \leq Q (t^{N}) + Ψ ({\hat{f}}^{N}) + 〈 t^{*} - \bar{t}, {({\hat{f}}^{N} - \bar{f})}_{+} 〉 \leq \frac{R^{2}}{A_{N}} + \frac{ε}{2}, \end{matrix}

(20)

\begin{matrix} {∥ {({\hat{f}}^{N} - \bar{f})}_{+} ∥}_{2} \leq \frac{4 R}{A_{N}} + \sqrt{\frac{2 ε}{A_{N}}}, \end{matrix}

(21)

where $\hat{f}^{N}$ is defined by (18) and $R = {∥ t^{*} - \bar{t} ∥}_{2}$ is the distance from the starting point to a solution.

Theorem 3.

Let $L_{0} \leq \frac{4 M^{2}}{ε}$, where M comes from (5). Then, after at most

N_{Q} = 4 {(\frac{R M}{ε})}^{2}

(22)

iterations of UMST for the stable dynamics model, it holds that $Q (t^{N}) - Q (t^{*}) \leq ε$. Moreover, the stopping criterion (7) with ${\hat{t}}^{N} = t^{N}$ is fulfilled after at most

N_{s t o p} = O ({(\frac{R M}{ε})}^{2} max \{1, {(\frac{〈 g - f^{*}, \bar{t} 〉}{ξ R {min}_{e} {\bar{f}}_{e}})}^{2}\})

(23)

iterations, where ξ comes from (6).

Theorem 4.

Let $L_{0} \leq \frac{4 M^{2}}{ε}$, where M comes from (5). Then, after at most

N_{Q} = 4 {(\frac{R M}{ε})}^{2}

(24)

iterations of UMST for the Beckmann model, it holds that $Q (t^{N}) - Q (t^{*}) \leq ε$. Moreover, the stopping criterion (14) with ${\hat{t}}^{N} = t^{N}$ is fulfilled after at most

N_{s t o p} = 4 {(\frac{\tilde{R} M}{ε})}^{2}

(25)

iterations, where $\tilde{R}$ is defined by (17).

The proofs for UMST are given in Appendix B.

3.5. Method of Weighted Dual Averages

Convergence of the WDA method was proved in [14] and is summarized in the following theorem. The proofs for WDA are given in Appendix C.

Theorem 5.

The non-composite WDA method satisfies the following bounds:

For the stable dynamics model:

$Q ({\hat{t}}^{k}) - Q (t^{*}) = O (\frac{M + {∥ \bar{f} ∥}_{2}}{\sqrt{k}} (\frac{R^{2}}{χ} + χ)),$
For the Beckmann model if $μ \leq 1$ :

$Q ({\hat{t}}^{k}) - Q (t^{*}) = O (\frac{1}{\sqrt{k}} (M + max_{e} {\bar{f}}_{e} {[\frac{2 R + χ}{{\bar{t}}_{e} ρ}]}^{μ}) (\frac{R^{2}}{χ} + χ)) .$

4. Numerical Experiments

This section presents numerical results for the algorithms described above, namely, composite variants of UMST and UGM and both the composite and non-composite variants of the WDA method, on the Anaheim network [5,22]. The network consists of 38 zones, 416 nodes, and 916 links. Experiments and the source code in Python 3 [23] can be found in [15]. To find the shortest paths in the network, we used Dijkstra’s algorithm from the graph-tool library [24], where it is implemented in C++. We also used the NumPy library [25] for all vector operations.

Stable dynamics model.

Parameters of the network are adjusted to the Beckmann model, so we have to increase the capacities to ensure the existence of an equilibrium for the stable dynamics model. In our experiments, the capacities are multiplied by 2.5. In Figure 2, we plot the number of (inner) iterations of the algorithms required to fulfill the stopping criterion (7) against $1 / ε$. We count inner iterations for Algorithms 2 and 3, since the complexity of an inner iteration in this case is similar to the complexity of an iteration of the other algorithms. Note that according to ([12], formula (2.23)) the total number $N (k)$ of inner iterations of UGM or UMST performed during the first k steps is bounded as

N (k) \leq 2 k + {log}_{2} (\frac{M^{2}}{ε L_{0}}),

so asymptotic rates from Theorems 1–4 are still valid.

As we can see, the best results are shown by UMST, followed by UGM with similar performance. Both the composite and non-composite variants of the WDA method (Algorithm 4) are much slower.

Algorithm 3 Universal Method of Similar Triangles.

Input: $L_{0} > 0$, accuracy $ε > 0$

$u^{0} = t^{0} ≔ \bar{t}$, $A_{0} ≔ 0$, $k ≔ 0$
repeat
    $L_{k + 1} ≔ L_{k} / 2$
    while true do
        $α_{k + 1} ≔ \frac{1}{2 L_{k + 1}} + \sqrt{\frac{1}{4 L_{k + 1}^{2}} + \frac{A_{k}}{L_{k + 1}}}$, $A_{k + 1} ≔ A_{k} + α_{k + 1}$
        $y^{k + 1} ≔ \frac{α_{k + 1} u^{k} + A_{k} t^{k}}{A_{k + 1}}$, $u^{k + 1} ≔ \underset{t \in dom h}{arg min} ϕ_{k + 1} (t)$
        $t^{k + 1} ≔ \frac{α_{k + 1} u^{k + 1} + A_{k} t^{k}}{A_{k + 1}}$
        if $Φ (t^{k + 1}) \leq Φ (y^{k + 1}) + 〈\nabla Φ (y^{k + 1}), t^{k + 1} - y^{k + 1}〉 + \frac{L_{k + 1}}{2} {∥ t^{k + 1} - y^{k + 1} ∥}_{2}^{2} + \frac{α_{k + 1}}{2 A_{k + 1}} ε$ then
            break
        else
            $L_{k + 1} ≔ 2 L_{k + 1}$
        end if
    end while
    $k ≔ k + 1$
until Stopping criterion is fulfilled

Algorithm 4 Method of Weighted Dual Averages.

Input: accuracy $ε > 0$, constant $χ > 0$

$s^{0} ≔ \vec{0}$, $t^{0} ≔ \bar{t}$, $k ≔ 0$
repeat
    Compute subgradient $g^{k}$, set $s^{k + 1} ≔ s^{k} + \frac{1}{{∥ g^{k} ∥}_{2}} g^{k}$
        non-composite case: $g^{k} ≔ \nabla Φ (t^{k}) + \nabla h (t^{k})$
        composite case: $g^{k} ≔ \nabla Φ (t^{k})$
    Set $β_{k + 1} ≔ \frac{{\hat{β}}_{k + 1}}{χ}$, where ${\hat{β}}_{k + 1} = \sum_{i = 0}^{k} \frac{1}{{\hat{β}}_{i}}$, ${\hat{β}}_{0} = 1$
    Set $t^{k + 1}$
        non-composite case: $t^{k + 1} ≔ \underset{t \in dom h}{arg min} 〈 s^{k + 1}, t 〉 + \frac{β_{k + 1}}{2} {∥ t - t^{0} ∥}_{2}^{2}$
        composite case: $t^{k + 1} ≔ \underset{t \in dom h}{arg min} 〈 s^{k + 1}, t 〉 + \frac{β_{k + 1}}{2} {∥ t - t^{0} ∥}_{2}^{2} + \sum_{i = 0}^{k} \frac{1}{{∥ g^{i} ∥}_{2}} h (t)$
    $k ≔ k + 1$
until Stopping criterion is fulfilled

Beckmann model.

For the Beckmann model, we also compare our methods with the Frank–Wolfe algorithm (Algorithm 5), whose theoretical convergence rate for a convex objective with Lipschitz-continuous gradient is $O (1 / ε)$ [7,26].

Figure 3 shows the convergence rates of the methods for the Beckmann model. The Frank–Wolfe method demonstrates the best results and is followed by UMST. Unlike the stable dynamics case, the composite WDA-method is faster than UGM. However, the non-composite WDA-method has the worst performance again.

Algorithm 5 Frank–Wolfe algorithm.

Input: accuracy $ε > 0$

$t^{0} ≔ \bar{t}$, $f^{0} ≔ \underset{s \in \{Θ x : x \in X\}}{arg min} 〈 t^{0}, s 〉$, $k ≔ 0$
repeat
    $t_{e}^{k} ≔ \frac{\partial Ψ (f^{k})}{\partial f_{e}} = τ_{e} (f_{e}^{k})$, $s^{k} ≔ \underset{s \in \{Θ x : x \in X\}}{arg min} 〈 t^{k}, s 〉$
    $γ_{k} ≔ \frac{2}{k + 2}$, $f^{k + 1} ≔ (1 - γ_{k}) f^{k} + γ_{k} s^{k}$
    $k ≔ k + 1$
until Stopping criterion is fulfilled
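The steps of Algorithm 5 can be sketched on a toy network of parallel one-link routes, where the linear minimization step reduces to an all-or-nothing assignment to the currently fastest route. For $t = τ(f)$ the duality gap $Ψ(f) + Q(t)$ equals $〈 t, f - s 〉$ by the Fenchel equality, so the sketch uses this quantity as the stopping criterion. The network data (in particular the lower route's capacity), the iteration cap, and the function names are assumptions made for illustration.

```python
RHO, MU = 0.15, 0.25


def bpr(flow, t_bar, f_bar):
    """BPR cost function (1)."""
    return t_bar * (1.0 + RHO * (flow / f_bar) ** (1.0 / MU))


def frank_wolfe_parallel(demand, t_bar, f_bar, eps, max_iter=10000):
    """Frank-Wolfe with step 2 / (k + 2) for one OD pair and parallel routes."""

    def all_or_nothing(times):
        best = min(range(len(times)), key=lambda e: times[e])
        s = [0.0] * len(times)
        s[best] = demand
        return s

    f = all_or_nothing(t_bar)
    gap = float("inf")
    for k in range(max_iter):
        t = [bpr(fe, tb, fb) for fe, tb, fb in zip(f, t_bar, f_bar)]  # gradient of Psi
        s = all_or_nothing(t)                                         # linear minimization
        gap = sum(te * (fe - se) for te, fe, se in zip(t, f, s))      # <t, f - s>
        if gap <= eps:
            break
        gamma = 2.0 / (k + 2)
        f = [(1.0 - gamma) * fe + gamma * se for fe, se in zip(f, s)]
    return f, gap


t_bar, f_bar = [0.5, 1.0], [2000.0, 2000.0]      # lower-route capacity is assumed
f_3000, gap_3000 = frank_wolfe_parallel(3000.0, t_bar, f_bar, eps=1.0)
f_5000, gap_5000 = frank_wolfe_parallel(5000.0, t_bar, f_bar, eps=1.0)
print(f_3000, gap_3000)
print(f_5000, gap_5000)
```

With a demand of 3000 veh/h the initial all-or-nothing assignment is already the equilibrium, so the loop stops immediately; with 5000 veh/h both routes carry flow and the method has to iterate.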

5. Conclusions

We considered several primal-dual subgradient methods for finding equilibria in the stable dynamics and the Beckmann models. We suggested a way to reconstruct admissible flows in the stable dynamics model, which provides us with a novel computable duality gap. Complexity bounds for UMST and UGM were presented in terms of the number of iterations required to achieve a desired accuracy in the dual function value or the duality gap. Finally, we conducted numerical experiments comparing convergence of the considered algorithms on the Anaheim transportation network: UMST is the best one for optimization of the dual problems in both models. Furthermore, using the duality gap as a stopping criterion, we compared these methods with the Frank–Wolfe algorithm for the Beckmann model, which, as expected, remains the most suitable approach in this case (but it is not applicable to the stable dynamics model).

The reader may be interested in another related topic: searching for stochastic traffic equilibria. In [27,28], we (with our colleagues) studied the application of the UMST for finding Nash–Wardrop stochastic equilibria in the Beckmann model. In this case, a driver selects a route randomly according to Gibbs’ distribution, taking into account current time costs on the links of the network. It leads to iteration complexity $O (\frac{1}{\sqrt{γ ε}})$, where $γ > 0$ is a stochasticity parameter (when $γ \to 0$, the model boils down to the ordinary Beckmann model). However, the great decrease in the number of iterations comes along with a more expensive calculation of the objective function’s gradient.

Author Contributions

Conceptualization, A.G.; formal analysis, M.K. and A.G.; software, M.K.; supervision, A.G.; validation, M.K. and A.G.; writing (original draft), M.K.; writing (review and editing), M.K. and A.G. All authors have read and agreed to the published version of the manuscript.

Funding

The research of M. Kubentayeva was supported by the Russian Science Foundation (project 18-71-10108). The research of A. Gasnikov was partially supported by RFBR, project number 18-29-03071 mk, and was partially supported by the Ministry of Science and Higher Education of the Russian Federation (Goszadaniye) No. 075-00337-20-03.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We would like to thank Yu. Nesterov for fruitful discussions.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A. Proofs for UGM

Proof of Lemma 1.

Note that function

Φ (t)

satisfies (5). Then according to Theorem 1 in [12], applied with

ν = 0

, one has

\begin{aligned} Q ({\hat{t}}^{N}) & \leq \frac{1}{S_{N}} \sum_{k = 1}^{N} \frac{1}{L_{k}} Q (t^{k}) \\ & \leq \min_{t \geq \bar{t}} \{\frac{1}{S_{N}} \sum_{k = 0}^{N - 1} \frac{1}{L_{k + 1}} [Φ (t^{k}) + 〈 \nabla Φ (t^{k}), t - t^{k} 〉] + h (t) + \frac{{∥ t - t^{0} ∥}_{2}^{2}}{S_{N}}\} + \frac{ε}{2} . \end{aligned}

(A1)

Equation (9) follows immediately if one substitutes

t = t^{*}

. Now let us estimate the first term on the r.h.s.

\begin{aligned} & \min_{t \geq \bar{t}} \{\frac{1}{S_{N}} \sum_{k = 0}^{N - 1} \frac{1}{L_{k + 1}} [Φ (t^{k}) + 〈 \nabla Φ (t^{k}), t - t^{k} 〉] + h (t) + \frac{{∥ t - \bar{t} ∥}_{2}^{2}}{S_{N}}\} \\ & = \min_{t \geq \bar{t}} \{\frac{1}{S_{N}} \sum_{k = 0}^{N - 1} \frac{1}{L_{k + 1}} \underbrace{[Φ (t^{k}) + 〈 \nabla Φ (t^{k}), 0 - t^{k} 〉]}_{\leq Φ (0)} - 〈 {\hat{f}}^{N}, t 〉 + 〈 \bar{f}, t - \bar{t} 〉 + \frac{{∥ t - \bar{t} ∥}_{2}^{2}}{S_{N}}\} \\ & \leq Φ (0) - 〈 {\hat{f}}^{N}, \bar{t} 〉 + \min_{t \geq \bar{t}} \{〈 \bar{f} - {\hat{f}}^{N}, t - \bar{t} 〉 + \frac{{∥ t - \bar{t} ∥}_{2}^{2}}{S_{N}}\} \\ & = - Ψ ({\hat{f}}^{N}) - \frac{S_{N} {∥ {({\hat{f}}^{N} - \bar{f})}_{+} ∥}_{2}^{2}}{4} . \end{aligned}

Here, we used

Φ (0) = - \sum_{w \in OD} d_{w} T_{w} (0) = 0

. Therefore,

Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N}) + \frac{S_{N} {∥ {({\hat{f}}^{N} - \bar{f})}_{+} ∥}_{2}^{2}}{4} \leq \frac{ε}{2} .
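The last equality in the chain above comes from a coordinate-wise minimization. A worked version is sketched below, using the shorthand u = t - \bar{t} \geq 0 and c = {\hat{f}}^{N} - \bar{f}, which is introduced here only for brevity, so that 〈 \bar{f} - {\hat{f}}^{N}, t - \bar{t} 〉 = - 〈 c, u 〉:

```latex
\begin{aligned}
\min_{u \ge 0} \Big\{ -\langle c, u \rangle + \frac{\|u\|_2^2}{S_N} \Big\}
  &= \sum_{e \in E} \min_{u_e \ge 0} \Big\{ -c_e u_e + \frac{u_e^2}{S_N} \Big\}, \\
u_e^{\star} &= \frac{S_N}{2} (c_e)_+ , \\
-c_e u_e^{\star} + \frac{(u_e^{\star})^2}{S_N}
  &= -\frac{S_N (c_e)_+^2}{4}, \\
\text{hence}\quad
\min_{u \ge 0} \Big\{ -\langle c, u \rangle + \frac{\|u\|_2^2}{S_N} \Big\}
  &= -\frac{S_N \|(\hat f^N - \bar f)_+\|_2^2}{4}.
\end{aligned}
```

Links with c_e \leq 0 contribute nothing, which is why only the positive part of {\hat{f}}^{N} - \bar{f} appears in the result.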

Now, notice that since the flow

{\hat{f}}^{N}

is induced by some traffic distribution

x \in X

, we have

\begin{aligned} 0 & \leq Φ (t^{*}) + 〈 t^{*}, {\hat{f}}^{N} 〉 \\ & = Q (t^{*}) - 〈 t^{*} - \bar{t}, \bar{f} 〉 + Ψ ({\hat{f}}^{N}) - 〈 \bar{t}, {\hat{f}}^{N} 〉 + 〈 t^{*}, {\hat{f}}^{N} 〉 \\ & = Q (t^{*}) + Ψ ({\hat{f}}^{N}) + 〈 t^{*} - \bar{t}, {\hat{f}}^{N} - \bar{f} 〉 \\ & \leq Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N}) + 〈 t^{*} - \bar{t}, {({\hat{f}}^{N} - \bar{f})}_{+} 〉, \end{aligned}

hence

Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N}) \geq - 〈 t^{*} - \bar{t}, {({\hat{f}}^{N} - \bar{f})}_{+} 〉 \geq - R {∥ {({\hat{f}}^{N} - \bar{f})}_{+} ∥}_{2} .

This yields

\frac{S_{N} {∥ {({\hat{f}}^{N} - \bar{f})}_{+} ∥}_{2}^{2}}{4} - R {∥ {({\hat{f}}^{N} - \bar{f})}_{+} ∥}_{2} \leq \frac{ε}{2},

and thus

{∥ {({\hat{f}}^{N} - \bar{f})}_{+} ∥}_{2} \leq \frac{2 R}{S_{N}} (1 + \sqrt{1 + \frac{ε S_{N}}{2 R^{2}}}) \leq \frac{4 R}{S_{N}} + \sqrt{\frac{2 ε}{S_{N}}} .

□
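The last step of the proof solves a scalar quadratic inequality. The sketch below checks it numerically for sample values; `violation_bounds` is a hypothetical helper name, and the values of R, S_N and ε are arbitrary test inputs rather than quantities from the experiments.

```python
import math

def violation_bounds(R, S, eps):
    """Bounds on x = ||(f_hat - f_bar)_+||_2 implied by S*x^2/4 - R*x <= eps/2.

    Returns (exact, relaxed): the largest root of the quadratic and the
    simpler upper bound 4R/S + sqrt(2*eps/S) used in the lemma.
    """
    exact = (2.0 * R / S) * (1.0 + math.sqrt(1.0 + eps * S / (2.0 * R * R)))
    relaxed = 4.0 * R / S + math.sqrt(2.0 * eps / S)
    return exact, relaxed

# Arbitrary sample values (assumptions for illustration only).
R, S, eps = 3.0, 50.0, 1e-2
exact, relaxed = violation_bounds(R, S, eps)
# At the largest root the quadratic inequality holds with equality.
residual = S * exact**2 / 4.0 - R * exact - eps / 2.0
```

The relaxed bound follows from the elementary inequality sqrt(1 + a) <= 1 + sqrt(a) for a >= 0.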

Proof of Theorem 1.

Theorem 1 in [12] ensures that

L_{k} \leq \frac{M^{2}}{ε}

for all

k \geq 0

, thus

S_{N} \geq \frac{ε N}{M^{2}}

. Then the first bound (12) follows immediately from (9).

Now, let us prove the second bound. First, suppose

{\hat{f}}_{e}^{N} \leq {\bar{f}}_{e}

for all

e \in E

. Then

π ({\hat{f}}^{N}) = {\hat{f}}^{N}

, thus by (10) for

N = N_{Q}

Q ({\hat{t}}^{N}) + Ψ (π ({\hat{f}}^{N})) = Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N}) \leq \frac{R^{2}}{S_{N}} + \frac{ε}{2} \leq \frac{{(R M)}^{2}}{ε N} + \frac{ε}{2} \leq ε .
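The final inequality in the chain above holds as soon as (RM)^2 / (εN) <= ε/2. A minimal sketch of the resulting iteration count follows; the function names are hypothetical, and the formula is simply read off the displayed chain, so it need not coincide literally with the definition of N_Q in the main text.

```python
import math

def iterations_for_dual_accuracy(R, M, eps):
    """Smallest integer N with (R*M)**2 / (eps*N) <= eps/2."""
    return math.ceil(2.0 * (R * M / eps) ** 2)

def gap_bound(R, M, eps, N):
    """Right-hand side (R*M)^2/(eps*N) + eps/2 from the displayed chain."""
    return (R * M) ** 2 / (eps * N) + eps / 2.0

# Arbitrary sample values (assumptions for illustration only).
R, M, eps = 2.0, 5.0, 0.5
N = iterations_for_dual_accuracy(R, M, eps)
bound = gap_bound(R, M, eps, N)
```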

Otherwise, if

{\hat{f}}_{e}^{N} > {\bar{f}}_{e}

for at least one link, one has

π ({\hat{f}}^{N}) = \frac{ξ {\hat{f}}^{N} + η g}{ξ + η}

, where

η = {max}_{e \in E} {\hat{f}}_{e}^{N} / {\bar{f}}_{e} - 1

, hence (9) and (10) yield

\begin{aligned} Q ({\hat{t}}^{N}) + Ψ (π ({\hat{f}}^{N})) & \leq \frac{ξ}{ξ + η} (Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N})) + \frac{η}{ξ + η} (Q ({\hat{t}}^{N}) + Ψ (g)) \\ & = \frac{ξ}{ξ + η} (Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N})) + \frac{η}{ξ + η} (Q ({\hat{t}}^{N}) - Q (t^{*})) + \frac{η}{ξ + η} (Ψ (g) - Ψ (f^{*})) \\ & \leq \frac{R^{2}}{S_{N}} + \frac{ε}{2} + \frac{η}{ξ} 〈 g - f^{*}, \bar{t} 〉 . \end{aligned}

Finally, according to (11)

η = max_{e \in E} {\hat{f}}_{e}^{N} / {\bar{f}}_{e} - 1 = {∥ \frac{{({\hat{f}}^{N} - \bar{f})}_{+}}{\bar{f}} ∥}_{\infty} \leq \frac{1}{{min}_{e} {\bar{f}}_{e}} {∥ {({\hat{f}}^{N} - \bar{f})}_{+} ∥}_{2} \leq \frac{1}{{min}_{e} {\bar{f}}_{e}} (\frac{4 R}{S_{N}} + \sqrt{\frac{2 ε}{S_{N}}}) .

Combining all bounds together we obtain

Q ({\hat{t}}^{N}) + Ψ (π ({\hat{f}}^{N})) \leq \frac{R^{2} M^{2}}{ε N} + \frac{〈 g - f^{*}, \bar{t} 〉}{ξ {min}_{e} {\bar{f}}_{e}} (\frac{4 R M^{2}}{ε N} + \sqrt{\frac{2 M^{2}}{N}}) + \frac{ε}{2},

and substituting

N = N_{stop}

, we conclude that the stopping criterion (7) is fulfilled. □
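The convex combination used in the second case of the proof can be sketched as follows. The sketch assumes that g is a flow with g_e <= (1 - ξ) \bar{f}_e on every link for some ξ in (0, 1], which is the property that makes the combination satisfy the capacity constraints; the helper name `project_flow` and the toy numbers are illustrative and are not taken from the authors' code.

```python
import numpy as np

def project_flow(f_hat, f_bar, g, xi):
    """Mix f_hat with a strictly feasible flow g so that the result is <= f_bar.

    Assumes g <= (1 - xi) * f_bar componentwise with 0 < xi <= 1.
    """
    f_hat, f_bar, g = (np.asarray(a, dtype=float) for a in (f_hat, f_bar, g))
    # Largest relative capacity violation; non-positive means f_hat is admissible.
    eta = np.max(f_hat / f_bar) - 1.0
    if eta <= 0.0:
        return f_hat.copy()
    return (xi * f_hat + eta * g) / (xi + eta)

# Toy data (hypothetical): the second link is overloaded by 20%.
f_bar = np.array([10.0, 5.0, 8.0])
f_hat = np.array([9.0, 6.0, 4.0])
g = np.array([5.0, 2.5, 4.0])   # g = 0.5 * f_bar, so xi = 0.5 works
pi = project_flow(f_hat, f_bar, g, xi=0.5)
```

Since f_hat_e <= (1 + η) f_bar_e and g_e <= (1 - ξ) f_bar_e, the numerator is at most (ξ + η) f_bar_e on every link, so the result respects the capacities; the most overloaded link ends up exactly at capacity when g is tight there.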

Proof of Lemma 2.

First of all, note that

max_{t \geq \bar{t}} \{〈 {\hat{f}}^{N}, t 〉 - \sum_{e \in E} σ_{e}^{*} (t_{e})\} = \sum_{e \in E} σ_{e} ({\hat{f}}_{e}^{N}) = Ψ ({\hat{f}}^{N}),

and the maximum is attained at the point

t = \nabla Ψ ({\hat{f}}^{N}) = τ ({\hat{f}}^{N})

. As in the proof of Lemma 1, inequality (A1) also holds for the Beckmann model. The first term on the r.h.s. can then be estimated as follows:

\begin{aligned} & \min_{t \geq \bar{t}} \{\frac{1}{S_{N}} \sum_{k = 0}^{N - 1} \frac{1}{L_{k + 1}} [Φ (t^{k}) + 〈 \nabla Φ (t^{k}), t - t^{k} 〉] + h (t) + \frac{{∥ t - \bar{t} ∥}_{2}^{2}}{S_{N}}\} \\ & = \min_{t \geq \bar{t}} \{\frac{1}{S_{N}} \sum_{k = 0}^{N - 1} \frac{1}{L_{k + 1}} \underbrace{[Φ (t^{k}) + 〈 \nabla Φ (t^{k}), 0 - t^{k} 〉]}_{\leq Φ (0)} - 〈 {\hat{f}}^{N}, t 〉 + \sum_{e \in E} σ_{e}^{*} (t_{e}) + \frac{{∥ t - \bar{t} ∥}_{2}^{2}}{S_{N}}\} \\ & \leq Φ (0) + \{\sum_{e \in E} σ_{e}^{*} (τ_{e} ({\hat{f}}_{e}^{N})) - 〈 {\hat{f}}^{N}, τ ({\hat{f}}^{N}) 〉 + \frac{{∥ τ ({\hat{f}}^{N}) - \bar{t} ∥}_{2}^{2}}{S_{N}}\} \\ & = - Ψ ({\hat{f}}^{N}) + \frac{1}{S_{N}} {∥ τ ({\hat{f}}^{N}) - \bar{t} ∥}_{2}^{2}, \end{aligned}

and we finally get an upper bound on the duality gap:

0 \leq Q ({\hat{t}}^{N}) + Ψ ({\hat{f}}^{N}) \leq \frac{{∥ τ ({\hat{f}}^{N}) - \bar{t} ∥}_{2}^{2}}{S_{N}} + \frac{ε}{2} .

At the same time, substituting

t = t^{*}

one obtains

Q ({\hat{t}}^{N}) \leq Q (t^{*}) + \frac{{∥ t^{*} - \bar{t} ∥}_{2}^{2}}{S_{N}} + \frac{ε}{2} .

□

Proof of Theorem 2.

By construction,

{\hat{f}}_{e}^{N} \leq \sum_{w \in OD} d_{w}

for all

e \in E

, thus

{∥ τ ({\hat{f}}^{N}) - \bar{t} ∥}_{2} \leq \tilde{R}

. According to Theorem 1 in [12]

S_{N} \geq \frac{ε N}{M^{2}}

; thus, the statement follows immediately from Lemma 2. □

Appendix B. Proofs for UMST

Proof of Lemma 3.

According to the inequality (30) in [13]

Q (t^{N}) \leq min_{t \geq \bar{t}} \{\frac{1}{A_{N}} \sum_{k = 1}^{N} α_{k} [Φ (y^{k}) + 〈\nabla Φ (y^{k}), t - y^{k}〉] + h (t) + \frac{{∥ t - t^{0} ∥}_{2}^{2}}{2 A_{N}}\} + \frac{ε}{2} .

(A2)

Note that the above inequality has the same form as (A1), if one replaces

S_{N}

with

A_{N}

,

\frac{1}{L_{k + 1}}

with

α_{k}

,

t^{k}

with

y^{k}

, and

\frac{{∥ t - t^{0} ∥}_{2}^{2}}{S_{N}}

with

\frac{{∥ t - t^{0} ∥}_{2}^{2}}{2 A_{N}}

. Then the claim follows by the same reasoning as in the proof of Lemma 1. □

Proof of Theorem 3.

Due to (5) one has

Φ (t^{k + 1}) \leq Φ (y^{k + 1}) + 〈 \nabla Φ (y^{k + 1}), t^{k + 1} - y^{k + 1} 〉 + M {∥ t^{k + 1} - y^{k + 1} ∥}_{2} .

From Young’s inequality we get that

M {∥ t^{k + 1} - y^{k + 1} ∥}_{2} \leq \frac{α_{k + 1}}{2 A_{k + 1}} ε + \frac{A_{k + 1} M^{2}}{2 α_{k + 1} ε} {∥ t^{k + 1} - y^{k + 1} ∥}_{2}^{2} .

If

L_{k + 1} \geq \frac{A_{k + 1} M^{2}}{α_{k + 1} ε}

, then the stopping condition for inner iterations is fulfilled. Therefore, at the end of the k-th iteration either

L_{k + 1} < \frac{2 A_{k + 1} M^{2}}{α_{k + 1} ε}

or

L_{k + 1} = \frac{L_{k}}{2}

.

Now we are going to prove by induction that

α_{k} \geq \frac{ε}{2 M^{2}}

, which is equivalent to

L_{k} \leq \frac{2 M^{2}}{ε} + \frac{4 M^{4}}{ε^{2}} A_{k - 1}

, for all

k \geq 1

. For

k = 1

it follows from

A_{1} = α_{1}

and

L_{0} \leq \frac{4 M^{2}}{ε}

. In the case where

L_{k + 1} < \frac{2 A_{k + 1} M^{2}}{α_{k + 1} ε}

equation

A_{k + 1} = L_{k + 1} α_{k + 1}^{2}

immediately yields

α_{k + 1} \geq \frac{ε}{2 M^{2}}

. If

L_{k + 1} = \frac{L_{k}}{2}

, then by the induction hypothesis and monotonicity of the sequence

\{A_{k}\}_{k \in \mathbb{N}}

we obtain

L_{k + 1} \leq \frac{M^{2}}{ε} + \frac{2 M^{4}}{ε^{2}} A_{k - 1} < \frac{2 M^{2}}{ε} + \frac{4 M^{4}}{ε^{2}} A_{k} .

Therefore,

A_{N} \geq \frac{ε N}{2 M^{2}} .

(A3)

Arguing in the same way as in the proof of Theorem 1, we obtain that

Q (t^{N}) - Q (t^{*}) \leq \frac{2 R^{2} M^{2}}{ε N} + \frac{ε}{2}

and

Q ({\hat{t}}^{N}) + Ψ (π ({\hat{f}}^{N})) \leq \frac{2 R^{2} M^{2}}{ε N} + \frac{〈 g - f^{*}, \bar{t} 〉}{ξ {min}_{e} {\bar{f}}_{e}} (\frac{8 R M^{2}}{ε N} + \sqrt{\frac{4 M^{2}}{N}}) + \frac{ε}{2} .

After substitution

N = N_{Q}

or

N = N_{stop}

the claim follows. □
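The induction step relies on how α_{k+1} is determined by L_{k+1} and A_k. The sketch below assumes the relations A_{k+1} = A_k + α_{k+1} and A_{k+1} = L_{k+1} α_{k+1}^2 (the second is used in the proof above; the first is an assumption about the definition of the method in the main text) and checks numerically that the threshold on L_{k+1} corresponds exactly to α_{k+1} >= ε / (2 M^2). Function names and sample values are hypothetical.

```python
import math

def step_size(L, A_prev):
    """Positive root of L*alpha**2 - alpha - A_prev = 0.

    Follows from A_prev + alpha = L * alpha**2 (assumed UMST relations).
    """
    return (1.0 + math.sqrt(1.0 + 4.0 * L * A_prev)) / (2.0 * L)

def threshold(M, eps, A_prev):
    """Largest L for which step_size(L, A_prev) >= eps / (2*M**2)."""
    return 2.0 * M**2 / eps + 4.0 * M**4 / eps**2 * A_prev

# Arbitrary sample values (assumptions for illustration only).
M, eps, A_prev = 3.0, 0.5, 0.7
L_max = threshold(M, eps, A_prev)
alpha_at_threshold = step_size(L_max, A_prev)
alpha_below = step_size(0.5 * L_max, A_prev)
alpha_above = step_size(2.0 * L_max, A_prev)
lower = eps / (2.0 * M**2)
```

A smaller L gives a larger step, so any L_{k+1} below the threshold keeps α_{k+1} above ε / (2 M^2), and summing N such steps gives the bound (A3).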

Proof of Theorem 4.

Repeating the proof of Theorem 2, we obtain that

Q (t^{N}) - Q (t^{*}) \leq \frac{R^{2}}{A_{N}} + \frac{ε}{2}, Q (t^{N}) + Ψ ({\hat{f}}^{N}) \leq \frac{{\tilde{R}}^{2}}{A_{N}} + \frac{ε}{2} .

The claim then follows by applying (A3). □

Appendix C. Proof for WDA

Proof of Theorem 5.

According to Equation (3.5) from [14],

Q ({\hat{t}}^{k}) - Q (t^{*}) = O (\frac{L}{\sqrt{k}} (\frac{R^{2}}{χ} + χ)),

whenever

{∥ g_{k} ∥}_{2} \leq L

for all k.

In the case of the stable dynamics model

\nabla h (t) = \bar{f}

; thus, we can take

L = M + {∥ \bar{f} ∥}_{2}

.

For the Beckmann model

\frac{\partial h (t)}{\partial t_{e}} = {\bar{f}}_{e} {(\frac{t_{e} - {\bar{t}}_{e}}{{\bar{t}}_{e} ρ})}^{μ} .

Theorem 3 in [14] yields that

{∥ t^{k} - t^{*} ∥}_{2}^{2} \leq R^{2} + χ^{2}

for all k, thus

{∥ t^{k} - \bar{t} ∥}_{2} \leq 2 R + χ

. Then, using

μ \leq 1

one obtains

{∥ \nabla h (t^{k}) ∥}_{2} \leq {∥ t^{k} - \bar{t} ∥}_{2}^{μ} max_{e} \frac{{\bar{f}}_{e}}{{({\bar{t}}_{e} ρ)}^{μ}} \leq {(2 R + χ)}^{μ} max_{e} \frac{{\bar{f}}_{e}}{{({\bar{t}}_{e} ρ)}^{μ}},

thus we can take

L = M + max_{e} {\bar{f}}_{e} {(\frac{2 R + χ}{{\bar{t}}_{e} ρ})}^{μ} .

□

References

1. Beckmann, M.J.; McGuire, C.B.; Winsten, C.B. Studies in the Economics of Transportation; Yale University Press: New Haven, CT, USA, 1956.
2. Patriksson, M. The Traffic Assignment Problem: Models and Methods; Courier Dover Publications: Mineola, NY, USA, 2015.
3. US Bureau of Public Roads. Traffic Assignment Manual; Department of Commerce, Urban Planning Division: Washington, DC, USA, 1964.
4. Nesterov, Y.; De Palma, A. Stationary dynamic solutions in congested transportation networks: Summary and perspectives. Netw. Spat. Econ. 2003, 3, 371–395.
5. Chudak, F.A.; Dos Santos Eleuterio, V.; Nesterov, Y. Static traffic assignment problem: A comparison between Beckmann (1956) and Nesterov & de Palma (1998) models. In Proceedings of the 7th Swiss Transport Research Conference, Ascona, Switzerland, 12–14 September 2007.
6. Frank, M.; Wolfe, P. An algorithm for quadratic programming. Nav. Res. Logist. Q. 1956, 3, 95–110.
7. Jaggi, M. Revisiting Frank–Wolfe: Projection-free sparse convex optimization. In Proceedings of the 30th International Conference on Machine Learning, Atlanta, GA, USA, 16–21 June 2013; pp. 427–435.
8. Fukushima, M. A modified Frank–Wolfe algorithm for solving the traffic assignment problem. Transp. Res. Part B Methodol. 1984, 18, 169–177.
9. LeBlanc, L.J.; Helgason, R.V.; Boyce, D.E. Improved efficiency of the Frank–Wolfe algorithm for convex network programs. Transp. Sci. 1985, 19, 445–462.
10. Arezki, Y.; Van Vliet, D. A full analytical implementation of the PARTAN/Frank–Wolfe algorithm for equilibrium assignment. Transp. Sci. 1990, 24, 58–62.
11. Chen, A.; Jayakrishnan, R.; Tsai, W. Faster Frank–Wolfe Traffic Assignment with New Flow Update Scheme. J. Transp. Eng. ASCE 2002, 128.
12. Nesterov, Y. Universal gradient methods for convex optimization problems. Math. Program. 2015, 152, 381–404.
13. Gasnikov, A.V.; Nesterov, Y.E. Universal method for stochastic composite optimization problems. Comput. Math. Math. Phys. 2018, 58, 48–64.
14. Nesterov, Y. Primal-dual subgradient methods for convex problems. Math. Program. 2009, 120, 221–259.
15. Kubentayeva, M. TransportNet. 2021. Available online: https://github.com/MeruzaKub/TransportNet (accessed on 30 April 2021).
16. Sheffi, Y. Urban Transportation Networks; Prentice-Hall: Englewood Cliffs, NJ, USA, 1985; Volume 6.
17. Rockafellar, R.T. Convex Analysis; Princeton University Press: Princeton, NJ, USA, 2015.
18. Dijkstra, E.W. A note on two problems in connexion with graphs. Numer. Math. 1959, 1, 269–271.
19. Cormen, T.H.; Leiserson, C.E.; Rivest, R.L.; Stein, C. Introduction to Algorithms, 3rd ed.; MIT Press: Cambridge, MA, USA, 2009.
20. Crauser, A.; Mehlhorn, K.; Meyer, U.; Sanders, P. A parallelization of Dijkstra’s shortest path algorithm. In International Symposium on Mathematical Foundations of Computer Science; Springer: Berlin/Heidelberg, Germany, 1998; pp. 722–731.
21. Yin, C.; Wang, H. Developed Dijkstra shortest path search algorithm and simulation. In Proceedings of the 2010 International Conference on Computer Design and Applications, Qinhuangdao, China, 25–27 June 2010; Volume 1, pp. V1–V116.
22. Transportation Networks for Research Core Team. Transportation Networks for Research. 2021. Available online: https://github.com/bstabler/TransportationNetworks (accessed on 30 April 2021).
23. Van Rossum, G.; Drake, F.L. Python 3 Reference Manual; CreateSpace: Scotts Valley, CA, USA, 2009.
24. Peixoto, T.P. The graph-tool python library. Figshare 2014.
25. Harris, C.R.; Millman, K.J.; van der Walt, S.J.; Gommers, R.; Virtanen, P.; Cournapeau, D.; Wieser, E.; Taylor, J.; Berg, S.; Smith, N.J.; et al. Array programming with NumPy. Nature 2020, 585, 357–362.
26. Pedregosa, F.; Negiar, G.; Askari, A.; Jaggi, M. Linearly Convergent Frank-Wolfe with Backtracking Line-Search. In Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, Palermo, Italy, 3–5 June 2020.
27. Gasnikov, A.V.; Kubentayeva, M.B. Searching stochastic equilibria in transport networks by universal primal-dual gradient method. Comput. Res. Model. 2018, 10, 335–345.
28. Baimurzina, D.R.; Gasnikov, A.V.; Gasnikova, E.V.; Dvurechensky, P.E.; Ershov, E.I.; Kubentaeva, M.B.; Lagunovskaya, A.A. Universal method of searching for equilibria and stochastic equilibria in transportation networks. Comput. Math. Math. Phys. 2019, 59, 19–33.

Figure 1. Parallel routes.

Figure 2. Convergence rates of UMST, UGM, composite and non-composite WDA-methods for the stable dynamics model with the stopping criterion (7). Here,

\tilde{ε}

is the relative accuracy

ε / Δ_{0}

, where

Δ_{0}

is the duality gap at the start point.

Figure 3. Convergence rates of UMST, UGM, composite and non-composite WDA-methods, and the Frank–Wolfe method for the Beckmann model with the stopping criterion (14). Here,

\tilde{ε}

is the relative accuracy

ε / Δ_{0}

, where

Δ_{0}

is the duality gap at the start point.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kubentayeva, M.; Gasnikov, A. Finding Equilibria in the Traffic Assignment Problem with Primal-Dual Gradient Methods for Stable Dynamics Model and Beckmann Model. Mathematics 2021, 9, 1217. https://doi.org/10.3390/math9111217
