2.1. Notations and Differentiability in Wasserstein Space
The set of probability measures
on
is denoted by
, where
, that is, they are square integrable. For any
, the set of measurable functions
is denoted by
, where functions
are square integrable with respect to
. For any
, the set of measurable functions
is denoted by
, where functions
are square integrable with respect to the product measure
. Define
(respectively,
) is defined as the subset of elements
(respectively,
), where elements
are bounded
-a.e. (respectively,
-a.e.). We denote by
the essential supremum. The set of all square integrable random variables valued in
on
is denoted by
. For any random variable
X, its probability law is denoted by
under
. Without any loss of generality, we assume that
is sufficiently rich to take
-valued random variables with square integrable law, that is,
. We equip the measurable space
with the 2-Wasserstein distance
and endow the space
with the corresponding Borel
-field
.
We shall depend on the concept of derivative about probability measure, which is presented by P.L. Lions in [
10]. The concept is established by lifting the functions
into function
, which is defined on
and satisfies
. On the contrary, when function
is defined on
, we call function
u the inverse-lifted function of
, which is defined on
and satisfies
for
. When the lifted function
is Fréchet differentiable (respectively, Fréchet differentiable with respect to continuous derivatives) on
, then the function
u is differentiable (respectively,
on
. In terms of Riesz’ theorem:
, we use
to represent the Fréchet derivative
in this case, which is regarded as an element
of
, where derivative
is called as the derivative of function
u at
. Moreover,
for
. As mentioned in [
11], if the function
u is
and for all
, a version of the mapping
is continuous such that the mapping
is continuous at any point
and
, and if the mapping
is differentiable, whose derivative is jointly continuous at any point
and
, then the function
u is partially
. We denote by
the gradient of
. If the function
u is partially
,
, and for all compact set
of
,
then we say that the function
. As mentioned in [
11], when the lifted function
is twice continuously Fréchet differentiable on
and has Lipschitz continuous Fréchet derivative, then we have
. In terms of Riesz’ theorem, the second Fréchet derivative
in this case is viewed as a bilinear form or a symmetric operator (thus it is bounded) on
, and we also obtain the equality ([
12] Appendix A.2) that for all
,
,
where
is independent of
with zero mean and unit variance.
In order to guarantee the controlled reflected McKean–Vlasov, SDE (
1) is well-posed, we state the following assumptions.
()
(i) For all , there is a constant such that .
(ii) The drift coefficient b and diffusion coefficient are measurable functions on .
(iii) There is a positive constant
C such that for any
,
,
and
,
where
(iv) For all , the functions are continuous on .
Under (
), we see that
b and
meet the conditions in Theorem 3.2 of [
13], and domain
satisfies the condition on domain
in Assumption 2.5 of [
13], therefore Equation (
1) has a unique solution, which is denoted by
for
.
The items that appear in cost functional (
2) are supposed to fulfill the following conditions.
()
(i) The functions , and G are measurable, respectively, on , and .
(ii) There is a positive constant
C such that for any
,
(iii) The coefficients
, and
G, respectively, on
,
, and
are continuous, and are also local Lipschitz continuous, uniformly with respect to
: there is a positive constant
C such that for any
,
,
and
,
Under (
), for any
, the cost functional (
2) is well-posed and finite.
Considering Peng’s pioneering research [
14] about non-Markovian stochastic control, the dynamic programming principle related to the reflected McKean–Vlasov stochastic control indicates that the value function
V in (
3) is the first element of the pair
, which is the solution to the following BSPDE with Neumann boundary condition:
where
is the derivative of
v with respect to
t, and
is inverse-lifted function, respectively, of
.
Therefore, by the lifting identification, we regard the function
v as a function on
and reserve the same notation
(notice that
v is dependent on
just by its law), then we notice from the relation (
4) and (
5) between derivatives in the Wasserstein space
and in the Hilbert space
that the BSPDE (
6) is also written in
as
with Hamiltonian function
for
; here,
is independent of
.
2.2. Definition of Solutions to BSPDEs and Main Result
For a Banach space
, the set of
-measurable and square-integrable
-valued random variables is denoted by the space
, and the set of
-measurable càdlàg
-valued process
such that
is denoted by
, where
is the predictable
-algebra corresponding to
on
and
. The set of
-measurable
-valued processes
satisfying
is denoted by
. For simplicity, we neglect the subscript for the space
and space
, particularly if there is no confusion about the adaptedness and filtration.
is denoted as the Sobolev space of real-valued functions
and its up to
m-th order derivatives are in
, which is endowed with the Sobolev norm
,
and
. The space of trace-zero functions in
is denoted by
. For
,
. For simplicity,
and
can be expressed as
. The norm and the inner product in the usual Hilbert space
are denoted, respectively, by
and
, and the duality between Hilbert space
and their dual spaces is denoted by
when there is no confusion. For
, we set
and the two spaces are complete and equipped, respectively, with the norms
and
Now, the notion of solutions to BSPDEs with general nonlinear coefficients is introduced as follows.
Definition 1. For any and any , let R be a random function such thatis -measurable, and let . We say that there is a weak solution to the BSPDE:if and satisfies Equation (9) in the weak sense, that is, for all ,andIf the regularity has been proved, we call the above a strong solution. Particularly, we have a case of nonlinear term
R with
which is corresponding to BSPDE (
7).
In order to obtain that Equation (
7) is well-posed, we additionally take the assumptions below.
()
(i) Let
and along with another function
, the pair
belongs to
and satisfies BSPDE
in the weak sense with
.
(ii) For any
, we assume that
,
, and there are nonnegative constants
and
such that for any
,
,
(iii) There is an
-valued
-measurable function
satisfying
that is,
and for any
, there exists a unique solution to the controlled reflected McKean–Vlasov SDE (
1) associated with the drift coefficient
.
Finally, we summarize the main theorem. In the next section, we make some preparations to prove it.
Theorem 1. Suppose that (), (), and () hold. There is a unique strong solution to BSPDE (7). For the strong solution, we also obtain that . In addition, for any , , which is the value function (3). We obtain the optimal control and the associated state process , respectively, expressed as and We take into consideration the conditions (
) and (
) as standing assumptions in this context, and they are standard to ensure that the BSPDE (
7) is adapted and the controlled McKean–Vlasov SDE with reflection is well-posed.
By assumption (
) (i), in order to obtain
, it is standard for the requests of
G (by
-theory of BSPDE of [
15]); considering the Skorohod conditions of McKean–Vlasov SDE (
1) with reflection, one has
so the reflected control problem only involves
and
.
In assumption () (ii), we suppose that the Hamilton function is Lipschitz continuous with respect to u and , which implies that for any .