A Quadratic Mean Field Games Model for the Langevin Equation

Camilli, Fabio

doi:10.3390/axioms10020068

Dipartimento di Scienze di Base e Applicate per l’Ingegneria, Sapienza Università di Roma, Via Scarpa 16, 00161 Roma, Italy

Axioms 2021, 10(2), 68; https://doi.org/10.3390/axioms10020068

Submission received: 10 January 2021 / Revised: 25 March 2021 / Accepted: 16 April 2021 / Published: 19 April 2021

(This article belongs to the Special Issue Differential Models, Numerical Simulations and Applications)

Abstract

We consider a Mean Field Games model where the dynamics of the agents is given by a controlled Langevin equation and the cost is quadratic. An appropriate change of variables transforms the Mean Field Games system into a system of two coupled kinetic Fokker–Planck equations. We prove an existence result for the latter system, obtaining consequently existence of a solution for the Mean Field Games system.

Keywords:

Langevin equation; Mean Field Games system; kinetic Fokker–Planck equation; hypoelliptic operators

MSC:

35K40; 91A16

1. Introduction

The Mean Field Games (MFG in short) theory concerns the study of differential games with a large number of rational, indistinguishable agents and the characterization of the corresponding Nash equilibria. In the original model introduced in [1,2], an agent can typically act on its velocity (or other first order dynamical quantities) via a control variable. Mean Field Games where agents control the acceleration have been recently proposed in [3,4,5].

A prototype of a stochastic process involving acceleration is given by the Langevin diffusion process, which can be formally defined as

$\ddot{X}(t) = -b(X(t)) + σ \dot{B}(t),$

(1)

where $\ddot{X}$ is the second time derivative of the stochastic process $X$, $B$ is a Brownian motion and $σ$ is a positive parameter. The solution of (1) can be rewritten as a Markov process $(X, V)$ solving

$\begin{cases} \dot{X}(t) = V(t), \\ \dot{V}(t) = -b(X(t)) + σ \dot{B}(t). \end{cases}$

The probability density function of the previous process satisfies the kinetic Fokker–Planck equation

$\partial_{t} p - \frac{σ^{2}}{2} Δ_{v} p - b(x) \cdot D_{v} p + v \cdot D_{x} p = 0 \quad \text{in } (0, \infty) \times R^{d} \times R^{d}.$

The previous equation, in the case $b \equiv 0$, was first studied by Kolmogorov [6], who provided an explicit formula for its fundamental solution. It was then considered by Hörmander [7] as a motivating example for the general theory of hypoelliptic operators (see also [8,9,10]).
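As an illustration of the first order system for $(X, V)$, sample paths can be approximated with an explicit Euler–Maruyama discretization. The following is only a minimal sketch in one space dimension and is not part of the analysis of the paper; the function name `simulate_langevin`, its parameters and the default random seed are assumptions made for this example.

```python
import math
import random


def simulate_langevin(b, sigma, x0, v0, T, n_steps, rng=None):
    """Euler-Maruyama sketch for dX = V dt, dV = -b(X) dt + sigma dB (d = 1).

    All names here are illustrative; `b` is any callable drift R -> R.
    """
    rng = rng or random.Random(0)  # fixed seed so runs are reproducible
    dt = T / n_steps
    xs, vs = [x0], [v0]
    for _ in range(n_steps):
        x, v = xs[-1], vs[-1]
        dB = rng.gauss(0.0, math.sqrt(dt))  # Brownian increment over one step
        xs.append(x + v * dt)               # position driven by velocity only
        vs.append(v - b(x) * dt + sigma * dB)  # noise enters through velocity
    return xs, vs
```

For instance, `simulate_langevin(lambda x: x, 0.5, 0.0, 1.0, 1.0, 1000)` returns discrete positions and velocities on $[0, 1]$. Note that the noise acts only on the velocity component, which is the degeneracy in $x$ discussed in this section.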

We consider a Mean Field Games model where the dynamics of the single agent is given by a controlled Langevin diffusion process, i.e.,

$\begin{cases} \dot{X}(s) = V(s), & s \geq t, \\ \dot{V}(s) = -b(X(s)) + α(s) + σ \dot{B}(s), & s \geq t, \\ X(t) = x, \ V(t) = v, \end{cases}$

(2)

for $(t, x, v) \in [0, T] \times R^{d} \times R^{d}$. In (2), the control law $α : [t, T] \to R^{d}$, which is a progressively measurable process with respect to a fixed filtered probability space such that $E[\int_{t}^{T} |α(s)|^{2} ds] < +\infty$, is chosen to maximize the functional

$\begin{aligned} J(t, x, v; α) = E_{t, (x, v)} \Big\{ & \int_{t}^{T} \Big[ f(X(s), V(s), m(s)) - \frac{1}{2} |α(s)|^{2} \Big] ds \\ & + u_{T}(X(T), V(T)) \Big\}, \end{aligned}$

where $m(s)$ is the distribution of the agents at time $s$. Let $u$ be the value function associated with the previous control problem, i.e.,

$u(t, x, v) = \sup_{α \in A_{t}} \{ J(t, x, v; α) \},$

where $A_{t}$ is the set of the control laws. Formally, the couple $(u, m)$ satisfies the MFG system (see Section 4.1 in [3] for more details)

$\begin{cases} \partial_{t} u + \frac{σ^{2}}{2} Δ_{v} u - b(x) \cdot D_{v} u + v \cdot D_{x} u + \frac{1}{2} |D_{v} u|^{2} = -f(x, v, m), \\ \partial_{t} m - \frac{σ^{2}}{2} Δ_{v} m - b(x) \cdot D_{v} m + v \cdot D_{x} m + \mathrm{div}_{v}(m D_{v} u) = 0, \\ m(0, x, v) = m_{0}(x, v), \quad u(T, x, v) = u_{T}(x, v), \end{cases}$

(3)

for $(t, x, v) \in (0, T) \times R^{d} \times R^{d}$. The first equation is a backward Hamilton–Jacobi–Bellman equation, degenerate in the $x$-variable and with a quadratic Hamiltonian in the $v$-variable, and the second equation is a forward kinetic Fokker–Planck equation. In the standard setting, MFG systems with quadratic Hamiltonians have been extensively considered in the literature, both as a reference model for the general theory and because, thanks to the Hopf–Cole change of variable, the nonlinear Hamilton–Jacobi–Bellman equation can be transformed into a linear equation, which makes available all the tools developed for this type of problem (see for example [2,11,12,13,14,15]). Recently, a similar procedure has been used for ergodic hypoelliptic MFG with quadratic cost in [16] and for a flocking model involving kinetic equations in Section 4.7.3 of [17].

We study (3) by means of a change of variable introduced in [11,14] for the standard case. By defining the new unknowns $ϕ = e^{u / σ^{2}}$ and $ψ = m e^{-u / σ^{2}}$, the system (3) is transformed into a system of two kinetic Fokker–Planck equations

$\begin{cases} \partial_{t} ϕ + \frac{σ^{2}}{2} Δ_{v} ϕ - b(x) \cdot D_{v} ϕ + v \cdot D_{x} ϕ = -\frac{1}{σ^{2}} f(x, v, ψ ϕ) ϕ, \\ \partial_{t} ψ - \frac{σ^{2}}{2} Δ_{v} ψ - b(x) \cdot D_{v} ψ + v \cdot D_{x} ψ = \frac{1}{σ^{2}} f(x, v, ψ ϕ) ψ, \\ ψ(0, x, v) = \frac{m_{0}(x, v)}{ϕ(0, x, v)}, \quad ϕ(T, x, v) = e^{\frac{u_{T}(x, v)}{σ^{2}}}, \end{cases}$

(4)

for $(t, x, v) \in (0, T) \times R^{d} \times R^{d}$. In the previous problem, the coupling between the two equations is only in the source terms. Following [14], we prove existence of a weak solution to (4) by showing the convergence of an iterative scheme defined, starting from $ψ^{(0)} \equiv 0$, by solving alternately the backward problem

$\begin{cases} \partial_{t} ϕ^{(k + \frac{1}{2})} + \frac{σ^{2}}{2} Δ_{v} ϕ^{(k + \frac{1}{2})} - b(x) \cdot D_{v} ϕ^{(k + \frac{1}{2})} + v \cdot D_{x} ϕ^{(k + \frac{1}{2})} = -\frac{1}{σ^{2}} f(ψ^{(k)} ϕ^{(k + \frac{1}{2})}) ϕ^{(k + \frac{1}{2})}, \\ ϕ^{(k + \frac{1}{2})}(T, x, v) = e^{\frac{u_{T}(x, v)}{σ^{2}}}, \end{cases}$

(5)

and the forward one

$\begin{cases} \partial_{t} ψ^{(k + 1)} - \frac{σ^{2}}{2} Δ_{v} ψ^{(k + 1)} - b(x) \cdot D_{v} ψ^{(k + 1)} + v \cdot D_{x} ψ^{(k + 1)} = \frac{1}{σ^{2}} f(ψ^{(k + 1)} ϕ^{(k + \frac{1}{2})}) ψ^{(k + 1)}, \\ ψ^{(k + 1)}(0, x, v) = \frac{m_{0}(x, v)}{ϕ^{(k + \frac{1}{2})}(0, x, v)}. \end{cases}$

(6)

We show that the resulting sequence $(ϕ^{(k + \frac{1}{2})}, ψ^{(k + 1)})$, $k \in N$, monotonically converges to the solution of (4). Hence, by the inverse change of variable (see again [11,14] for details)

$u = σ^{2} \ln(ϕ), \quad m = ϕ ψ,$

(7)

we obtain a solution of the original problem (3). We have

Theorem 1.

The sequence $(ϕ^{(k + \frac{1}{2})}, ψ^{(k + 1)})$ defined by (5) and (6) converges in $L^{2}([0, T] \times R^{d} \times R^{d})$ and a.e. to a weak solution $(ϕ, ψ)$ of (4). Moreover, the couple $(u, m)$ defined by (7) is a weak solution to (3).
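The passage from (3) to (4) can be checked by a direct, purely formal computation. The sketch below assumes that $u$ and $m$ are smooth and that $ϕ > 0$; the shorthand $\mathcal{T}$ for the first order part of the operator is introduced only for this computation and does not appear elsewhere in the paper.

```latex
% Formal check that \phi = e^{u/\sigma^2}, \psi = m e^{-u/\sigma^2} maps (3) into (4).
% \mathcal{T} is a shorthand used only in this sketch.
\begin{align*}
\mathcal{T} &:= \partial_t - b(x)\cdot D_v + v\cdot D_x, \qquad
 u = \sigma^2 \ln\phi, \qquad m = \phi\psi,\\
D_v\phi &= \frac{\phi}{\sigma^2}\, D_v u, \qquad
\Delta_v\phi = \frac{\phi}{\sigma^2}\Big(\Delta_v u + \frac{1}{\sigma^2}|D_v u|^2\Big),\\
\mathcal{T}\phi + \frac{\sigma^2}{2}\Delta_v\phi
 &= \frac{\phi}{\sigma^2}\Big(\mathcal{T}u + \frac{\sigma^2}{2}\Delta_v u + \frac12 |D_v u|^2\Big)
 = -\frac{1}{\sigma^2}\, f(x,v,\psi\phi)\,\phi,\\
\operatorname{div}_v(m D_v u) &= \sigma^2 \operatorname{div}_v(\psi D_v\phi)
 = \sigma^2\big(D_v\psi\cdot D_v\phi + \psi\,\Delta_v\phi\big),\\
0 &= \mathcal{T}m - \frac{\sigma^2}{2}\Delta_v m + \operatorname{div}_v(m D_v u)
 = \phi\Big(\mathcal{T}\psi - \frac{\sigma^2}{2}\Delta_v\psi\Big)
 + \psi\Big(\mathcal{T}\phi + \frac{\sigma^2}{2}\Delta_v\phi\Big),\\
\mathcal{T}\psi - \frac{\sigma^2}{2}\Delta_v\psi
 &= \frac{1}{\sigma^2}\, f(x,v,\psi\phi)\,\psi .
\end{align*}
```

The quadratic term $\frac{1}{2} |D_{v} u|^{2}$ is absorbed exactly by the second derivative of the exponential, and the mixed terms $D_{v} ψ \cdot D_{v} ϕ$ cancel in the equation for $m$, which is why the two equations in (4) are coupled only through the zero order terms.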

The main difficulty in the study of problems (3) and (4) is due both to the degeneracy of the second order operator with respect to $x$ and to the unbounded dependence of the coefficients of the first order terms on $v$. To overcome these difficulties we rely on the results for linear kinetic Fokker–Planck equations developed in [18]. We mention that existence of weak solutions for the standard MFG problem, possibly degenerate, has been studied in [19], but the results in that paper do not cover the present setting. The previous iterative procedure also suggests a monotone numerical method for the approximation of (4), hence for (3). Indeed, by approximating (5) and (6) by finite differences and solving alternately the resulting discrete equations, we obtain an approximation of the sequence $(ϕ^{(k + \frac{1}{2})}, ψ^{(k + 1)})$. A corresponding procedure for the standard quadratic MFG system was studied in [14], where the convergence of the method is proved. We plan to study the properties of the previous numerical procedure in a future work.
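To give a rough idea of how such an alternating procedure could look in practice, the sketch below treats a deliberately simplified toy analogue: one velocity dimension, no $x$-dependence and no drift $b$, implicit finite differences in $v$ with homogeneous Dirichlet conditions on a truncated interval, and the nonlinearity $f$ evaluated at already computed values (a lagged, semi-implicit treatment). It is therefore not the scheme of [14], nor an exact discretization of (5) and (6). The function names, the coupling $f(s) = -s/(1+s)$, the terminal cost $u_{T}(v) = -v^{2}$ and the initial density are all hypothetical choices made for the example.

```python
import math


def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm; lower[0] and upper[-1] are ignored."""
    n = len(rhs)
    c, d = [0.0] * n, [0.0] * n
    c[0], d[0] = upper[0] / diag[0], rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def implicit_step(w, react, dt, dv, sigma):
    """Solve (I - dt*sigma^2/2*Lap_v + dt*diag(react)) w_new = w."""
    n = len(w)
    off = -0.5 * dt * sigma**2 / dv**2
    diag = [1.0 - 2.0 * off + dt * r for r in react]
    return solve_tridiagonal([off] * n, diag, [off] * n, w)


def alternate_scheme(n_v=41, n_t=40, T=1.0, sigma=1.0, v_max=4.0, n_iter=5):
    f = lambda s: -s / (1.0 + s)  # assumed coupling: bounded, <= 0, decreasing on s >= 0
    dv, dt = 2.0 * v_max / (n_v - 1), T / n_t
    v = [-v_max + i * dv for i in range(n_v)]
    phi_T = [math.exp(-vi**2 / sigma**2) for vi in v]  # e^{u_T/sigma^2}, u_T = -v^2 (assumed)
    m0 = [max(1.0 - vi**2, 0.0) for vi in v]           # compactly supported density (assumed)
    mass = sum(m0) * dv
    m0 = [mi / mass for mi in m0]
    psi = [[0.0] * n_v for _ in range(n_t + 1)]        # psi^(0) = 0
    for _ in range(n_iter):
        phi = [None] * (n_t + 1)
        phi[n_t] = phi_T
        for n in range(n_t - 1, -1, -1):               # backward sweep, toy analogue of (5)
            react = [-f(p * q) / sigma**2 for p, q in zip(psi[n], phi[n + 1])]
            phi[n] = implicit_step(phi[n + 1], react, dt, dv, sigma)
        new_psi = [[m / p for m, p in zip(m0, phi[0])]]  # psi(0) = m0 / phi(0)
        for n in range(n_t):                           # forward sweep, toy analogue of (6)
            react = [-f(p * q) / sigma**2 for p, q in zip(new_psi[n], phi[n + 1])]
            new_psi.append(implicit_step(new_psi[n], react, dt, dv, sigma))
        psi = new_psi
    return v, phi, psi, dv
```

Calling `alternate_scheme()` returns the velocity grid, the last computed iterates of $ϕ$ and $ψ$ as lists of time slices, and the mesh size. Since $f \leq 0$, each implicit step involves a diagonally dominant matrix with non-positive off-diagonal entries, so the discrete iterates stay non-negative; whether the discrete sequence inherits the monotone convergence of the continuous one is not addressed by this sketch.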

2. Well Posedness of the Kinetic Fokker–Planck System

In this section, we study the existence of a solution to system (4). The proof of the result follows the strategy implemented in Section 2 of [14] for the case of a standard MFG system with quadratic Hamiltonian and relies on the results for linear kinetic Fokker–Planck equations in Appendix A of [18]. We remark that the model studied here does not fit exactly the problem treated in [18] because of the presence of a zero order term in the Fokker–Planck equation. Hence some technical aspects should be analyzed in more detail; however, the present paper is mainly intended to give some idea of the change of variable for the kinetic MFG.

We fix the assumptions that hold throughout the paper. The vector field $b : R^{d} \to R^{d}$ and the coupling cost $f : R^{d} \times R^{d} \times R \to R$ are assumed to satisfy

$\begin{aligned} & b \in L^{\infty}(R^{d}), \\ & f \in L^{\infty}(R^{d} \times R^{d} \times R), \quad f \leq 0 \quad \text{and} \quad f(x, v, \cdot) \ \text{strictly decreasing}. \end{aligned}$

Moreover, the diffusion coefficient $σ$ is positive and the initial and terminal data satisfy

$\begin{aligned} & m_{0} \in L^{\infty}(R^{d} \times R^{d}), \quad m_{0} \geq 0, \quad \int \int m_{0}(x, v) \, dx \, dv = 1, \\ & \text{and} \ \exists R_{0} > 0 \ \text{s.t.} \ \mathrm{supp}\{m_{0}\} \subset R^{d} \times B(0, R_{0}) \end{aligned}$

(8)

and

$\begin{aligned} & u_{T} \in C^{0}(R^{d} \times R^{d}) \ \text{and} \ \exists C_{0}, C_{1} > 0 \ \text{s.t.} \ \forall (x, v) \in R^{d} \times R^{d} \\ & -C_{0}(|v|^{2} + |x|) - C_{0} \leq u_{T}(x, v) \leq -C_{1}(|v|^{2} + |x|) + C_{1}. \end{aligned}$

(9)

Note that (9) implies that $e^{u_{T} / σ^{2}} \in L^{\infty}(R^{d} \times R^{d}) \cap L^{2}(R^{d} \times R^{d})$. We denote by $(\cdot, \cdot)$ the scalar product in $L^{2}([0, T] \times R^{d} \times R^{d})$ and by $\langle \cdot, \cdot \rangle$ the pairing between $X = L^{2}([0, T] \times R_{x}^{d}; H^{1}(R_{v}^{d}))$ and its dual $X' = L^{2}([0, T] \times R_{x}^{d}; H^{-1}(R_{v}^{d}))$. We define the following functional space

$Y = \{ g \in L^{2}([0, T] \times R_{x}^{d}; H^{1}(R_{v}^{d})) : \ \partial_{t} g + v \cdot D_{x} g \in L^{2}([0, T] \times R_{x}^{d}; H^{-1}(R_{v}^{d})) \}$

and we set $Y_{0} = \{ g \in Y : g \geq 0 \}$. If $g \in Y$, then it admits (continuous) trace values $g(0, x, v)$, $g(T, x, v) \in L^{2}(R^{d} \times R^{d})$ (see [18], Lemma A.1) and therefore the initial/terminal conditions for (4) are well defined in the $L^{2}$ sense. We first prove the well posedness of problems (5) and (6).

Proposition 2.

We have

(i): For any $ψ \in Y_{0}$, there exists a unique solution $ϕ \in Y_{0}$ to

$\begin{cases} \partial_{t} ϕ + \frac{σ^{2}}{2} Δ_{v} ϕ - b(x) \cdot D_{v} ϕ + v \cdot D_{x} ϕ = -\frac{1}{σ^{2}} f(x, v, ψ ϕ) ϕ, \\ ϕ(T, x, v) = e^{\frac{u_{T}(x, v)}{σ^{2}}}. \end{cases}$

(10)

Moreover, $ϕ \in L^{\infty}([0, T] \times R^{d} \times R^{d})$ and, for any $R > 0$, there exist $δ_{R} \in \mathbb{R}$ and $ρ > 0$ such that

$ϕ(t, x, v) \geq C_{R} := e^{\frac{1}{σ^{2}}(δ_{R} - ρ T)} \quad \forall t \in [0, T], \ (x, v) \in B(0, R) \subset R^{d} \times R^{d}.$

(11)

(ii): Let $Φ : Y_{0} \to Y_{0}$ be the map which associates with $ψ$ the unique solution of (10). Then, if $ψ_{2} \leq ψ_{1}$, we have $Φ(ψ_{2}) \geq Φ(ψ_{1})$.

Proof.

We first prove existence of a solution to the nonlinear problem (10) by a fixed point argument, exploiting the results for the corresponding linear problem proved in [18]. For fixed $ψ \in Y_{0}$, consider the map $F = F(φ)$ from $L^{2}([0, T] \times R^{d} \times R^{d})$ into itself that associates with $φ$ the weak solution $ϕ \in L^{2}([0, T] \times R^{d} \times R^{d})$ of the linear problem

$\begin{cases} \partial_{t} ϕ + \frac{σ^{2}}{2} Δ_{v} ϕ - b(x) \cdot D_{v} ϕ + v \cdot D_{x} ϕ = -\frac{1}{σ^{2}} f(ψ φ) ϕ, \\ ϕ(T, x, v) = e^{\frac{u_{T}(x, v)}{σ^{2}}}. \end{cases}$

(12)

By Prop. A.2 of [18], $ϕ$ belongs to $Y$ and it coincides with the unique solution of (12) in this space. Moreover, the following estimate

$\| ϕ \|_{L^{2}([0, T] \times R_{x}^{d}; H^{1}(R_{v}^{d}))} + \| \partial_{t} ϕ + v \cdot D_{x} ϕ \|_{L^{2}([0, T] \times R_{x}^{d}; H^{-1}(R_{v}^{d}))} \leq C$

(13)

holds for some constant $C$ which depends only on $\| e^{u_{T} / σ^{2}} \|_{L^{2}}$, $\| f \|_{L^{\infty}}$ and $σ$. Hence $F$ maps $B_{C}$, the closed ball of radius $C$ of $L^{2}([0, T] \times R^{d} \times R^{d})$, into itself.

To show that the map $F$ is continuous on $B_{C}$, consider $\{φ_{n}\}_{n \in N}$, $φ \in L^{2}([0, T] \times R^{d} \times R^{d})$ such that $\| φ_{n} - φ \|_{L^{2}} \to 0$ and set $ϕ_{n} = F(φ_{n})$. Then $ϕ_{n} \in Y$ and, by the estimate (13), we get that, up to a subsequence, there exists $\bar{ϕ} \in Y$ such that $ϕ_{n} \to \bar{ϕ}$, $D_{v} ϕ_{n} \to D_{v} \bar{ϕ}$ in $L^{2}([0, T] \times R^{d} \times R^{d})$, $\partial_{t} ϕ_{n} + v \cdot D_{x} ϕ_{n} \to \partial_{t} \bar{ϕ} + v \cdot D_{x} \bar{ϕ}$ in $L^{2}([0, T] \times R_{x}^{d}; H^{-1}(R_{v}^{d}))$. Moreover, up to a further subsequence, $φ_{n} \to φ$ almost everywhere. By the definition of weak solution to (12), we have that

$\langle \partial_{t} ϕ_{n} + v \cdot D_{x} ϕ_{n}, w \rangle - \frac{σ^{2}}{2} (D_{v} ϕ_{n}, D_{v} w) - (b \cdot D_{v} ϕ_{n}, w) = \Big( -\frac{1}{σ^{2}} ϕ_{n} f(φ_{n} ψ), w \Big),$

(14)

for any $w \in D([0, T] \times R^{d} \times R^{d})$, the space of infinitely differentiable functions with compact support in $[0, T] \times R^{d} \times R^{d}$. Employing weak convergence for the left hand side of (14) and the Dominated Convergence Theorem for the right hand one, we get for $n \to \infty$

$\langle \partial_{t} \bar{ϕ} + v \cdot D_{x} \bar{ϕ}, w \rangle - \frac{σ^{2}}{2} (D_{v} \bar{ϕ}, D_{v} w) - (b \cdot D_{v} \bar{ϕ}, w) = \Big( -\frac{1}{σ^{2}} \bar{ϕ} f(φ ψ), w \Big)$

for any $w \in D([0, T] \times R^{d} \times R^{d})$. Hence $\bar{ϕ} = F(φ)$ and $F(φ_{n}) \to F(φ)$ for $n \to \infty$ in $L^{2}([0, T] \times R^{d} \times R^{d})$. The compactness of the map $F$ in $L^{2}([0, T] \times R^{d} \times R^{d})$ follows from the compactness of the set of the solutions to (12), see Theorem 1.2 of [20]. We conclude, by Schauder’s Theorem, that there exists a fixed point of the map $F$ in $L^{2}$, hence in $Y$, and therefore a solution to the nonlinear parabolic Equation (10).

Observe that, if $ϕ$ is a solution of (10), then $\tilde{ϕ} = e^{λ t} ϕ$ is a solution of

$\partial_{t} \tilde{ϕ} + \frac{σ^{2}}{2} Δ_{v} \tilde{ϕ} - b(x) \cdot D_{v} \tilde{ϕ} + v \cdot D_{x} \tilde{ϕ} - λ \tilde{ϕ} = -\frac{1}{σ^{2}} f(e^{-λ t} ψ \tilde{ϕ}) \tilde{ϕ}$

(15)

with the corresponding final condition. In the following, we assume that $λ > 0$. To show that $ϕ$ is non-negative, we will exploit the following property (see Lemma A.3 of [18]): given $ϕ \in Y$ and defining $ϕ^{\pm} = \max(\pm ϕ, 0)$, we have $ϕ^{\pm} \in X$ and

$\langle \partial_{t} ϕ + v \cdot D_{x} ϕ, ϕ^{-} \rangle = \frac{1}{2} \Big( \int \int |ϕ^{-}(0, x, v)|^{2} \, dx \, dv - \int \int |ϕ^{-}(T, x, v)|^{2} \, dx \, dv \Big).$

(16)

Let $ϕ$ be a solution of (15), multiply the equation by $ϕ^{-}$ and integrate. Then, since $ϕ(T, x, v)$ is non-negative, by (16) we get

$\begin{aligned} -\frac{1}{σ^{2}} (ϕ f(e^{-λ t} ϕ ψ), ϕ^{-}) & = \langle \partial_{t} ϕ + v \cdot D_{x} ϕ, ϕ^{-} \rangle - \frac{σ^{2}}{2} (D_{v} ϕ, D_{v} ϕ^{-}) - (b \cdot D_{v} ϕ, ϕ^{-}) - λ (ϕ, ϕ^{-}) \\ & = \frac{1}{2} \int \int |ϕ^{-}(0, x, v)|^{2} \, dx \, dv + \frac{σ^{2}}{2} (D_{v} ϕ^{-}, D_{v} ϕ^{-}) + λ (ϕ^{-}, ϕ^{-}) \\ & \geq λ (ϕ^{-}, ϕ^{-}), \end{aligned}$

where it has been exploited that, by integration by parts, $(b \cdot D_{v} ϕ, ϕ^{-}) = 0$. Since $f \leq 0$ and therefore

$-(ϕ f(e^{-λ t} ϕ ψ), ϕ^{-}) = (ϕ^{-} f(e^{-λ t} ϕ ψ), ϕ^{-}) \leq 0,$

we get $(ϕ^{-}, ϕ^{-}) \equiv 0$, hence $ϕ \geq 0$.

To prove the uniqueness of the solution to (10), consider two solutions $ϕ_{1}$, $ϕ_{2}$ of (15) and set $\bar{ϕ} = ϕ_{1} - ϕ_{2}$. Multiplying the equation for $\bar{ϕ}$ by $\bar{ϕ}$, integrating and using $\bar{ϕ}(T, x, v) = 0$, we get

$\begin{aligned} -\frac{1}{σ^{2}} (f(e^{-λ t} ψ ϕ_{1}) ϕ_{1} - f(e^{-λ t} ψ ϕ_{2}) ϕ_{2}, ϕ_{1} - ϕ_{2}) & = \langle \partial_{t} \bar{ϕ} + v \cdot D_{x} \bar{ϕ}, \bar{ϕ} \rangle - \frac{σ^{2}}{2} (D_{v} \bar{ϕ}, D_{v} \bar{ϕ}) - (b \cdot D_{v} \bar{ϕ}, \bar{ϕ}) - λ (\bar{ϕ}, \bar{ϕ}) \\ & = -\frac{1}{2} \int \int |\bar{ϕ}(0, x, v)|^{2} \, dx \, dv - \frac{σ^{2}}{2} (D_{v} \bar{ϕ}, D_{v} \bar{ϕ}) - λ (\bar{ϕ}, \bar{ϕ}) \\ & \leq -λ (ϕ_{1} - ϕ_{2}, ϕ_{1} - ϕ_{2}) \end{aligned}$

(17)

and, by the strict monotonicity of $f$, we conclude that $ϕ_{1} = ϕ_{2}$.

To prove that $ϕ$ is bounded from above, we observe that the function $\bar{ϕ}(t, x, v) = e^{(C_{1} + (T - t) \| f \|_{\infty}) / σ^{2}}$, where $C_{1}$ is as in (9), is a supersolution of the linear problem (12) for any $φ \in L^{2}([0, T] \times R^{d} \times R^{d})$, i.e., $\bar{ϕ}(T, x, v) \geq e^{u_{T}(x, v) / σ^{2}}$ and

$\partial_{t} \bar{ϕ} + \frac{σ^{2}}{2} Δ_{v} \bar{ϕ} - b(x) \cdot D_{v} \bar{ϕ} + v \cdot D_{x} \bar{ϕ} \leq -\frac{1}{σ^{2}} f(ψ φ) \bar{ϕ}.$

By the Maximum Principle (see Prop. A.3 (i) in [18]), we get that $\bar{ϕ} \geq ϕ$, where $ϕ$ is the solution of (12). Since the previous property holds for any $φ \in L^{2}([0, T] \times R^{d} \times R^{d})$, we conclude that $\bar{ϕ} \geq ϕ$, where $ϕ$ is the solution of the nonlinear problem (10).

A similar argument shows that $\underline{ϕ}(t, x, v) = e^{(-C_{0}(|v|^{2} + |x| + 1) - ρ (T - t)) / σ^{2}}$, where $C_{0}$ is as in (9) and $ρ$ is sufficiently large, is a subsolution of (12) for any $φ \in L^{2}([0, T] \times R^{d} \times R^{d})$. Indeed, substituting $\underline{ϕ}$ into the equation, we get that the inequality

$\begin{aligned} & \partial_{t} \underline{ϕ} + \frac{σ^{2}}{2} Δ_{v} \underline{ϕ} - b(x) \cdot D_{v} \underline{ϕ} + v \cdot D_{x} \underline{ϕ} \\ & \quad = \frac{\underline{ϕ}}{σ^{2}} \Big( ρ - C_{0} d σ^{2} + 2 C_{0}^{2} |v|^{2} + 2 C_{0} b(x) \cdot v - C_{0} v \cdot \frac{x}{|x|} \Big) \geq -\frac{1}{σ^{2}} f(ψ φ) \underline{ϕ} \end{aligned}$

is satisfied for $ρ$ large enough and, moreover, $\underline{ϕ}(T, x, v) \leq e^{u_{T}(x, v) / σ^{2}}$. Hence $\underline{ϕ} \leq ϕ$, where $ϕ$ is the solution of the nonlinear problem (10), and, from this estimate, we deduce (11).
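The computation behind the subsolution property can be made explicit as follows. This is a formal sketch for $x \neq 0$; the shorthand $g$ for the exponent is introduced only here, and the explicit threshold for $ρ$ given after the computation is one admissible choice, not a value stated in the paper.

```latex
% g denotes the exponent of the candidate subsolution (notation used only in this sketch).
\begin{align*}
g(t,x,v) &:= -C_0\big(|v|^2 + |x| + 1\big) - \rho\,(T - t), \qquad
 \underline{\phi} = e^{g/\sigma^2},\\
\partial_t g &= \rho, \quad D_v g = -2 C_0 v, \quad \Delta_v g = -2 C_0 d, \quad
 D_x g = -C_0 \frac{x}{|x|},\\
\partial_t\underline{\phi} + \frac{\sigma^2}{2}\Delta_v\underline{\phi}
 - b\cdot D_v\underline{\phi} + v\cdot D_x\underline{\phi}
 &= \frac{\underline{\phi}}{\sigma^2}\Big(\partial_t g + \frac{\sigma^2}{2}\Delta_v g
 + \frac12 |D_v g|^2 - b\cdot D_v g + v\cdot D_x g\Big)\\
 &= \frac{\underline{\phi}}{\sigma^2}\Big(\rho - C_0 d\,\sigma^2 + 2 C_0^2 |v|^2
 + 2 C_0\, b(x)\cdot v - C_0\, v\cdot\frac{x}{|x|}\Big),\\
2 C_0^2 |v|^2 + 2 C_0\, b(x)\cdot v - C_0\, v\cdot\frac{x}{|x|}
 &\geq 2 C_0^2 |v|^2 - C_0\big(2\|b\|_{\infty} + 1\big)|v|
 \geq -\frac{\big(2\|b\|_{\infty} + 1\big)^2}{8}.
\end{align*}
```

Since $-f \leq \| f \|_{\infty}$, the required inequality holds, for instance, as soon as $ρ \geq C_{0} d σ^{2} + (2 \| b \|_{\infty} + 1)^{2} / 8 + \| f \|_{\infty}$: the quadratic term in $v$ controls the terms that grow linearly in $|v|$, which is how the unbounded coefficient $v$ in the transport term is handled.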

We finally prove the monotonicity of the map $Φ$. Let $ψ_{2} \leq ψ_{1}$, set $ϕ_{i} = Φ(ψ_{i})$, $i = 1, 2$, and consider the equation satisfied by $\bar{ϕ} = e^{λ t} ϕ_{1} - e^{λ t} ϕ_{2}$, multiply it by $\bar{ϕ}^{+}$ and integrate. Performing a computation similar to (17), we get

$-\frac{1}{σ^{2}} (f(ϕ_{1} ψ_{1}) ϕ_{1} - f(ϕ_{2} ψ_{2}) ϕ_{2}, \bar{ϕ}^{+}) \leq -λ (\bar{ϕ}^{+}, \bar{ϕ}^{+}).$

Since, by monotonicity of $f$ and non-negativity of $ϕ_{i}$, we have

$\begin{aligned} -(f(ϕ_{1} ψ_{1}) ϕ_{1} - f(ϕ_{2} ψ_{2}) ϕ_{2}, \bar{ϕ}^{+}) = & -(f(ϕ_{1} ψ_{1}) (ϕ_{1} - ϕ_{2}), \bar{ϕ}^{+}) \\ & - ((f(ϕ_{1} ψ_{1}) - f(ϕ_{2} ψ_{2})) ϕ_{2}, \bar{ϕ}^{+}) \geq 0, \end{aligned}$

we get $(\bar{ϕ}^{+}, \bar{ϕ}^{+}) = 0$ and therefore $ϕ_{1} \leq ϕ_{2}$. □

We set

$Y_{R} = \{ ϕ \in Y_{0} : ϕ \geq C_{R} \ \forall (x, v) \in B(0, R), \ t \in [0, T] \},$

where $C_{R}$ is defined as in (11).

Proposition 3.

Given $R > R_{0}$, where $R_{0}$ is as in (8), we have

(i): For any $ϕ \in Y_{R}$, there exists a unique solution $ψ \in Y_{0}$ to

$\begin{cases} \partial_{t} ψ - \frac{σ^{2}}{2} Δ_{v} ψ - b(x) \cdot D_{v} ψ + v \cdot D_{x} ψ = \frac{1}{σ^{2}} f(x, v, ψ ϕ) ψ, \\ ψ(0, x, v) = \frac{m_{0}(x, v)}{ϕ(0, x, v)}. \end{cases}$

(18)

Moreover

$ψ(t, x, v) \leq \frac{\| m_{0} \|_{L^{\infty}}}{C_{R}} \quad \forall t \in [0, T], \ (x, v) \in R^{d} \times R^{d},$

(19)

where $C_{R}$ is as in (11).

(ii): Let $Ψ : Y_{R} \to Y_{0}$ be the map which associates with $ϕ \in Y_{R}$ the unique solution of (18). Then, if $ϕ_{2} \leq ϕ_{1}$, we have $Ψ(ϕ_{2}) \geq Ψ(ϕ_{1})$.

Proof.

First observe that, since $R > R_{0}$, $ψ(0, x, v)$ is well defined for $ϕ \in Y_{R}$. The proof of the first part of (i) is very similar to the one of the corresponding result in Proposition 2, hence we only prove the bound (19). If $ψ$ is a solution of (18), then $\tilde{ψ} = e^{-λ t} ψ$ is a solution of

$\partial_{t} \tilde{ψ} - \frac{σ^{2}}{2} Δ_{v} \tilde{ψ} - b(x) \cdot D_{v} \tilde{ψ} + v \cdot D_{x} \tilde{ψ} + λ \tilde{ψ} = \frac{1}{σ^{2}} f(x, v, e^{λ t} \tilde{ψ} ϕ) \tilde{ψ}.$

(20)

Let $ψ$ be a solution of (20), set $\bar{ψ} = ψ - e^{-λ t} \| m_{0} \|_{L^{\infty}} / C_{R}$ and observe that $\bar{ψ}(0) \leq 0$. Multiply the equation for $\bar{ψ}$ by $\bar{ψ}^{+}$ and integrate to obtain

$\begin{aligned} \frac{1}{σ^{2}} (ψ f(e^{λ t} ψ ϕ), \bar{ψ}^{+}) & = \langle \partial_{t} \bar{ψ} + v \cdot D_{x} \bar{ψ}, \bar{ψ}^{+} \rangle + \frac{σ^{2}}{2} (D_{v} \bar{ψ}, D_{v} \bar{ψ}^{+}) - (b(x) \cdot D_{v} \bar{ψ}, \bar{ψ}^{+}) + λ (\bar{ψ}, \bar{ψ}^{+}) \\ & \geq \frac{1}{2} \int \int |\bar{ψ}^{+}(T, x, v)|^{2} \, dx \, dv + λ (\bar{ψ}^{+}, \bar{ψ}^{+}) \geq λ (\bar{ψ}^{+}, \bar{ψ}^{+}). \end{aligned}$

Since $ψ \geq 0$ and $f \leq 0$, we have $(ψ f(e^{λ t} ψ ϕ), \bar{ψ}^{+}) \leq 0$ and therefore $\bar{ψ}^{+} \equiv 0$. Hence the upper bound (19).

Now we prove (ii). Let $ϕ_{2} \leq ϕ_{1}$, set $ψ_{i} = Ψ(ϕ_{i})$, $i = 1, 2$, and $\bar{ψ} = e^{-λ t} ψ_{1} - e^{-λ t} ψ_{2}$. Multiply the equation satisfied by $\bar{ψ}$ by $\bar{ψ}^{+}$ and integrate. By monotonicity and non-positivity of $f$, we have

$\begin{aligned} (f(e^{λ t} ϕ_{1} ψ_{1}) ψ_{1} - f(e^{λ t} ϕ_{2} ψ_{2}) ψ_{2}, \bar{ψ}^{+}) = & \ (f(e^{λ t} ϕ_{1} ψ_{1}) (ψ_{1} - ψ_{2}), \bar{ψ}^{+}) \\ & + (ψ_{2} (f(e^{λ t} ϕ_{1} ψ_{1}) - f(e^{λ t} ϕ_{2} ψ_{2})), \bar{ψ}^{+}) \leq 0. \end{aligned}$

Then, since $\bar{ψ}^{+}(0) = 0$,

$\begin{aligned} 0 & \geq \langle \partial_{t} \bar{ψ} + v \cdot D_{x} \bar{ψ}, \bar{ψ}^{+} \rangle + \frac{σ^{2}}{2} (D_{v} \bar{ψ}, D_{v} \bar{ψ}^{+}) - (b(x) \cdot D_{v} \bar{ψ}, \bar{ψ}^{+}) + λ (\bar{ψ}, \bar{ψ}^{+}) \\ & \geq \frac{1}{2} \int \int |\bar{ψ}^{+}(T, x, v)|^{2} \, dx \, dv + λ (\bar{ψ}^{+}, \bar{ψ}^{+}) \geq λ (\bar{ψ}^{+}, \bar{ψ}^{+}). \end{aligned}$

Hence $\bar{ψ}^{+} \equiv 0$ and therefore $ψ_{1} \leq ψ_{2}$. □

Proof of Theorem 1.

Given $ψ^{(0)} \equiv 0$, consider the sequence $(ϕ^{(k + \frac{1}{2})}, ψ^{(k + 1)})$, $k \in N$, defined in (5) and (6). It can be rewritten as

$\begin{cases} ϕ^{(k + \frac{1}{2})} = Φ(ψ^{(k)}), \\ ψ^{(k + 1)} = Ψ(ϕ^{(k + \frac{1}{2})}), \end{cases}$

(21)

where the maps $Φ$, $Ψ$ are as in Propositions 2 and 3, respectively. Observe that, by (11), we have $ϕ^{(k + \frac{1}{2})} \in Y_{R}$ for $R > R_{0}$ and $ψ^{(k + 1)} \geq 0$ for any $k$. Hence the sequence $(ϕ^{(k + \frac{1}{2})}, ψ^{(k + 1)})$ is well defined. We first prove by induction the monotonicity of the components of $(ϕ^{(k + \frac{1}{2})}, ψ^{(k + 1)})$. By non-negativity of solutions to (18), we have $ψ^{(1)} = Ψ(ϕ^{(\frac{1}{2})}) \geq 0$ and therefore $ψ^{(1)} \geq ψ^{(0)}$. Moreover, by the monotonicity of $Φ$, $ϕ^{(\frac{3}{2})} = Φ(ψ^{(1)}) \leq Φ(ψ^{(0)}) = ϕ^{(\frac{1}{2})}$. Now assume that $ψ^{(k + 1)} \geq ψ^{(k)}$. Then

$ϕ^{(k + \frac{3}{2})} = Φ(ψ^{(k + 1)}) \leq Φ(ψ^{(k)}) = ϕ^{(k + \frac{1}{2})}$

and

$ψ^{(k + 2)} = Ψ(ϕ^{(k + \frac{3}{2})}) \geq Ψ(ϕ^{(k + \frac{1}{2})}) = ψ^{(k + 1)},$

which proves the monotonicity of the two sequences.

Since $ϕ^{(k + \frac{1}{2})} \geq 0$ and, by (19), $ψ^{(k + 1)} \leq \| m_{0} \|_{L^{\infty}} / C_{R}$, the sequence $(ϕ^{(k + \frac{1}{2})}, ψ^{(k + 1)})$ converges, for $k \to \infty$, a.e. and in $L^{2}([0, T] \times R^{d} \times R^{d})$ to a couple $(ϕ, ψ)$. Taking into account the estimate (13), the a.e. convergence of the two sequences and repeating an argument similar to the one employed for the continuity of the map $F$ in Proposition 2, we get that the couple $(ϕ, ψ)$ satisfies, in the weak sense, the first two equations in (4). The terminal condition for $ϕ$ is obviously satisfied, while the initial condition for $ψ$, in the $L^{2}$ sense, follows by convergence of $ϕ^{(k + \frac{1}{2})}(0)$ to $ϕ(0)$.

We now consider the couple $(u, m)$ given by the change of variable in (7). We first observe that, by Theorem 1.5 of [10], we have $\partial_{t} ϕ + v \cdot D_{x} ϕ$, $D_{v} ϕ$, $Δ_{v} ϕ \in L^{2}([0, T] \times R^{d} \times R^{d})$ and a corresponding regularity for $ψ$. Taking into account the boundedness of $ϕ$ and the estimate in (11), we have that $u$, $\partial_{t} u + v \cdot D_{x} u$, $D_{v} u$, $Δ_{v} u \in L^{2}_{loc}([0, T] \times R^{d} \times R^{d})$. Hence we can write the equation for $u$ in weak form, i.e.,

$(\partial_{t} u + v \cdot D_{x} u, w) - \frac{σ^{2}}{2} (D_{v} u, D_{v} w) - (b \cdot D_{v} u, w) + \frac{1}{2} (|D_{v} u|^{2}, w) = -(f(m), w),$

for any $w \in D([0, T] \times R^{d} \times R^{d})$, with final datum in trace sense. In a similar way, since $m$, $\partial_{t} m + v \cdot D_{x} m$, $D_{v} m$, $Δ_{v} m \in L^{2}_{loc}([0, T] \times R^{d} \times R^{d})$ and $m$ is locally bounded, we can also rewrite the equation for $m$ in weak form, i.e.,

$(\partial_{t} m + v \cdot D_{x} m, w) + \frac{σ^{2}}{2} (D_{v} m, D_{v} w) - (b \cdot D_{v} m, w) - (m D_{v} u, D_{v} w) = 0,$

for any $w \in D([0, T] \times R^{d} \times R^{d})$, with the initial datum in trace sense. □

Funding

This research received no external funding.

Acknowledgments

The author wishes to thank Alessandro Goffi (Univ. di Padova) and Sergio Polidoro (Univ. di Modena e Reggio Emilia) for useful discussions.

Conflicts of Interest

The author declares no conflict of interest.

References

1. Huang, M.; Caines, P.E.; Malhame, R.P. Large-population cost-coupled LQG problems with non uniform agents: Individual-mass behaviour and decentralized ϵ-Nash equilibria. IEEE Trans. Autom. Control 2007, 52, 1560–1571.
2. Lasry, J.-M.; Lions, P.-L. Mean field games. Jpn. J. Math. 2007, 2, 229–260.
3. Achdou, Y.; Mannucci, P.; Marchi, C.; Tchou, N. Deterministic mean field games with control on the acceleration. Nodea Nonlinear Differ. Eq. Appl. 2020, 27, 33.
4. Bardi, M.; Cardaliaguet, P. Convergence of some Mean Field Games systems to aggregation and flocking models. arXiv 2020, arXiv:2004.04403.
5. Cannarsa, P.; Mendico, C. Mild and weak solutions of Mean Field Games problem for linear control systems. Minimax Theory Appl. 2020, 5, 221–250.
6. Kolmogoroff, A. Zufällige Bewegungen (zur Theorie der Brownschen Bewegung). Ann. Math. 1934, 35, 116–117.
7. Hörmander, L. Hypoelliptic second order differential equations. Acta Math. 1967, 119, 147–171.
8. Lanconelli, E.; Polidoro, S. On a class of hypoelliptic evolution operators. Rend. Sem. Mat. Univ. Politec. Torino 1994, 52, 29–63.
9. Armstrong, S.; Mourrat, J.-C. Variational methods for the kinetic Fokker-Planck equation. arXiv 2019, arXiv:1902.04037.
10. Bouchut, F. Hypoelliptic regularity in kinetic equations. J. Math. Pures Appl. 2002, 81, 1135–1159.
11. Guéant, O.; Lasry, J.; Lions, P. Mean field games and applications. In Paris-Princeton Lectures on Mathematical Finance 2010; Lecture Notes in Math; Springer: Berlin, Germany, 2011; Volume 2003, pp. 205–266.
12. Gomes, D.A.; Mitake, H. Existence for stationary mean-field games with congestion and quadratic Hamiltonians. Nodea Nonlinear Differ. Eq. Appl. 2015, 22, 1897–1910.
13. Gomes, D.A.; Pimentel, E.A.; Voskanyan, V. Regularity Theory for Mean-Field Game Systems; Springer Briefs in Mathematics; Springer: Berlin, Germany, 2016.
14. Guéant, O. Mean field games equations with quadratic Hamiltonian: A specific approach. Math. Models Methods Appl. Sci. 2012, 22, 37.
15. Ullmo, D.; Swiecicki, I.; Gobron, T. Quadratic mean field games. Phys. Rep. 2019, 799, 1–35.
16. Feleqi, E.; Gomes, D.; Tada, T. Hypoelliptic mean field games – A case study. Minimax Theory Appl. 2020, 5, 305–326.
17. Carmona, R.; Delarue, F. Probabilistic Theory of Mean Field Games with Applications. I Mean Field FBSDEs, Control, and Games; Probability Theory and Stochastic Modelling, 83; Springer: Cham, Switzerland, 2018.
18. Degond, P. Global existence of smooth solutions for the Vlasov-Fokker-Planck equation in 1 and 2 space dimensions. Ann. Sci. École Norm. Sup. 1986, 19, 519–542.
19. Cardaliaguet, P.; Graber, P.J.; Porretta, A.; Tonon, D. Second order mean field games with degenerate diffusion and local coupling. Nodea Nonlinear Differ. Eq. Appl. 2015, 22, 1287–1317.
20. Camellini, F.; Eleuteri, M.; Polidoro, S. A compactness result for the Sobolev embedding via potential theory. arXiv 2018, arXiv:1806.03606.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
