Stochastic Differential Games and a Unified Forward–Backward Coupled Stochastic Partial Differential Equation with Lévy Jumps

Dai, Wanyang

doi:10.3390/math12182891

Open AccessFeature PaperArticle

Stochastic Differential Games and a Unified Forward–Backward Coupled Stochastic Partial Differential Equation with Lévy Jumps

by

Wanyang Dai

School of Mathematics, Nanjing University, Nanjing 210093, China

Mathematics 2024, 12(18), 2891; https://doi.org/10.3390/math12182891

Submission received: 6 August 2024 / Revised: 8 September 2024 / Accepted: 9 September 2024 / Published: 16 September 2024

Download

Browse Figures

Versions Notes

Abstract

:

We establish a relationship between stochastic differential games (SDGs) and a unified forward–backward coupled stochastic partial differential equation (SPDE) with discontinuous Lévy Jumps. The SDGs have q players and are driven by a general-dimensional vector Lévy process. By establishing a vector-form Ito-Ventzell formula and a 4-tuple vector-field solution to the unified SPDE, we obtain a Pareto optimal Nash equilibrium policy process or a saddle point policy process to the SDG in a non-zero-sum or zero-sum sense. The unified SPDE is in both a general-dimensional vector form and forward–backward coupling manner. The partial differential operators in its drift, diffusion, and jump coefficients are in time-variable and position parameters over a domain. Since the unified SPDE is of general nonlinearity and a general high order, we extend our recent study from the existing Brownian motion (BM)-driven backward case to a general Lévy-driven forward–backward coupled case. In doing so, we construct a new topological space to support the proof of the existence and uniqueness of an adapted solution of the unified SPDE, which is in a 4-tuple strong sense. The construction of the topological space is through constructing a set of topological spaces associated with a set of exponents

{γ_{1}, γ_{2}, \dots}

under a set of general localized conditions, which is significantly different from the construction of the single exponent case. Furthermore, due to the coupling from the forward SPDE and the involvement of the discontinuous Lévy jumps, our study is also significantly different from the BM-driven backward case. The coupling between forward and backward SPDEs essentially corresponds to the interaction between noise encoding and noise decoding in the current hot diffusion transformer model for generative AI.

Keywords:

stochastic differential game; non-zero-sum game; zero-sum game; non-Gaussian noise; stochastic partial differential equation; discontinuous Lévy jump; forward and backward coupling; diffusion transformer

MSC:

91A15; 91A23; 60H15

1. Introduction

Recently, big data was conceptually characterized by its three-dimensional statistical features in De Mauro et al. [1]: high volume (amount of data), high velocity (speed of data in and out), and/or high variety (range of data types and sources) with respect to the movement and processing of time–space data clusters. Furthermore, Dai [2] used the general-dimensional vector Lévy process or its driven general-dimensional vector system of forward–backward coupled stochastic differential equations (FB-SDEs) to quantitatively capture the typical features of big data. Particularly, in the work of Dai [2], the Lévy jumps were used to model the instantaneous up and down movements of real-time, high-volume data clusters. From a statistical viewpoint, this process is a generalized high-dimensional Brownian motion by allowing its sample path to be discontinuous with Lévy jumps. Meanwhile, from the physical viewpoints, the Lévy jumps correspond to white non-Gaussian noises. They may be caused by random rates in semiconduct/superconduct and communication currents (see Dai [2,3] and Duncan [4] for more details), the batch arrival and processing particles in queuing systems (see, e.g., Dai and Jiang [5], Mandelbaum, and Pats [6]), and the quantum entanglement in particle systems (see, e.g., Kong et al. [7]).

More importantly, in the current and future industrial revolutions, the major concern will be about how to handle these big data optimally and fairly or how to handle them with the best efforts. Hence, based on or to build a high-performance computing or quantum-computing facility, we propose a generalized stochastic differential game (SDG) problem with Lévy jumps by unifying the concepts in Dai [3,8] and those in Karatzas and Li [9] together with artificial intelligence (AI)-aided random decision processes (i.e., AlphaGo and AlphaFold) in Silver et al. [10,11] and Jumper et al. [12], multi-game decisions in Lee et al. [13], and AI-based backward Monte Carlo simulation in Dai [14,15]. In this SDG problem, there are general q number of players with

q \in {1, \dots, \infty}

. Each player

l \in {1, \dots, q}

has his own value process

Λ_{l}^{u}

subject to a vector-form coupled FB-SDE with countable and discontinuous Lévy jumps, which is under an admissible control policy process u. The lth component

u_{l} (\cdot)

of u for each

l \in {1, \dots, q}

is the lth player’s strategy. Concerning the players’ strategies, we classify the SDG problem into two types of game problems: non-zero-sum ones and zero-sum ones.

In a non-zero-sum SDG problem (see, e.g., Dai [8], Karatzas and Li [9]), every player chooses a policy to maximize his own value process over an admissible set

C

while the summation of all the value processes is also maximized, i.e.,

\begin{matrix} {sup}_{u \in C} Λ_{l}^{u} (0) = Λ_{l}^{u^{*}} (0) \end{matrix}

(1)

for each

l \in {0, 1, \dots, q}

, where

\begin{matrix} Λ_{0}^{u} (t) = \sum_{l = 1}^{q} Λ_{l}^{u} (t), \end{matrix}

(2)

and

Λ_{0}^{u} (0)

in Equation (2) does not have to be a constant (e.g., zero). In other words, all the players in this game can be in a win–win situation to share the limited resources in a communication network or a Blockchain quantum-cloud computing system. More precisely, in this game, we are interested in finding a so-called Pareto optimal Nash equilibrium policy process that is not only optimal to the whole game system but also fair to all the users.

Definition 1.

u^{*} (\cdot)

is called a Nash equilibrium policy process for the non-zero-sum game in Equations (1) and (2) if no player can benefit by switching his own decision policy process unilaterally under the condition that all the others do not to change their policy processes. Mathematically, we have that

\begin{matrix} Λ_{l}^{u^{*}} (0) \geq Λ_{l}^{u_{- l}^{*}} (0) \end{matrix}

(3)

for each

l \in {1, \dots, q}

and any given admissible control policy process u, where

\begin{matrix} u_{- l}^{*} = (u_{1}^{*}, \dots, u_{l - 1}^{*}, u_{l}, u_{l + 1}^{*}, \dots, u_{q}^{*}) . \end{matrix}

Furthermore, if

u^{*} (\cdot)

is also an optimal one to the sum of all the q players’ value functions at time zero, i.e.,

\begin{matrix} Λ_{0}^{u^{*}} (0) \geq Λ_{0}^{u} (0), \end{matrix}

(4)

it is called a Pareto optimal Nash equilibrium policy process.

However, in a zero-sum SDG problem (see, e.g., Karatzas and Li [9]), while each player chooses a policy to maximize his own value process over an admissible set

C

, he also tries to minimize all the other player’s value processes, i.e.,

\begin{matrix} sup_{u \in C} Λ_{l}^{u} (0) = Λ_{l}^{u^{*}} (0), sup_{u \in C} (- Λ_{k}^{u} (0)) = - Λ_{k}^{u^{*}} (0), Λ_{0}^{u} (0) = C \end{matrix}

(5)

for a constant C, a given

l \in {1, \dots, q}

, and all

k \in {1, \dots, q}

with

k \neq l

.

Definition 2.

u^{*} (\cdot)

is called a saddle point policy process for the zero-sum game in Equation (5) if it represents the best win to any given player l and the best efforts (i.e., the least losses) to all the other players. Mathematically, we have that

\begin{matrix} Λ_{l}^{u^{*}} (0) & \geq & Λ_{l}^{u_{- l}^{*}} (0) f o r e a c h l \in {1, \dots, q}, \end{matrix}

(6)

\begin{matrix} - Λ_{k}^{u^{*}} (0) & \leq & - Λ_{k}^{u_{- k}^{*}} (0) f o r a l l k \in {1, \dots, q} w i t h k \neq l \end{matrix}

(7)

for any given admissible control policy process u, where

\begin{matrix} u_{- l}^{*} & = & (u_{1}^{*}, \dots, u_{l - 1}^{*}, u_{l}, u_{l + 1}^{*}, \dots, u_{q}^{*}), \\ u_{- k}^{*} & = & (u_{1}^{*}, \dots, u_{k - 1}^{*}, u_{k}, u_{k + 1}^{*}, \dots, u_{q}^{*}) . \end{matrix}

Note that in real-world applications, this type of game can be formulated to design admission control and routing policies for communication networks, power and energy grids, go games, etc. (see, e.g., Ash [16], Hamidi et al. [17], Jumper et al. [12], and Silver et al. [10,11]), with the support of high-performance quantum-cloud computing facilities (see, e.g., those proposed in Dai [8]).

The aim in studying the non-zero-sum SDG problem in Equations (1) and (2) or the zero-sum SDG problem in Equations (5) and (2) is to determine a control policy process, i.e., to determine a Pareto optimal Nash equilibrium policy process or a saddle point policy process. The key in doing so consists of two steps. The first step is to prove a vector-form It

\hat{o}

-Ventzell formula. The second step is to prove the unique existence of the solution of a generally unified forward–backward coupled vector-form SPDE with discontinuous Lévy jumps. Here, the term SPDE is the abbreviation of stochastic partial differential equation.

If there is only one player (

q = 1

) in the game, the problem in Equations (1) and (2) (or the one in Equations (5) and (2)) reduces to a conventional stochastic optimal control problem. One of the important solution methods for the SDE-based control problem is the dynamic programming. However, as pointed out in Musiela and Zariphopoulou [18], this dynamic programming method in general faces a regularity problem that is still open. Thus, Musiela and Zariphopoulou [18] derived a backward SPDE as an alternative method to solve this problem. In general, this backward SPDE is called a stochastic Hamilton–Jacobi–Bellman (HJB) equation (see, e.g., the specific backward SPDE with

q = 1

considered in Peng [19] with no jumps and Øksendal et al. [20] with Lévy jumps). However, for our proposed general SDG problem in Equations (1) and (2) (or the one in Equations (5) and (2)) with general q number of players (

q > 1

), a unified system of coupled forward and backward vector-form SPDEs with Lévy jumps will be concerned. The forward SPDE is a general initial-valued one and represents the system state dynamics. The backward SPDE is a terminal-valued one (or called a Cauchy problem). Each player actually corresponds to one backward SPDE in Equation (8) as its value process in the unified vector form system. The solution of the unified system in Equation (8) corresponding to a non-zero-sum game or zero-sum game is used to derive the corresponding Nash equilibrium point policy process or saddle point policy process for each player in the multi-player game.

Hence, in this paper, we also study the existence and uniqueness of the solution of the following unified system of forward–backward coupled SPDEs with Lévy jumps along the line of our recent achievement in Dai [14,15] for a general backward SPDE driven by Brownian motion (BM). More precisely, we study the adapted 4-tuple strong solution

(Υ, Λ, \bar{Λ}, \tilde{Λ})

to the unified system with respect to the time–position parameter

(t, x) \in R_{+} \times D

,

\begin{matrix} \{\begin{matrix} Υ (t, x) & = G (x) + \int_{0}^{t} L (s^{-}, x, Υ, Λ, \bar{Λ}, \tilde{Λ}) d s \\ + \int_{0}^{t} J (s^{-}, x, Υ, Λ, \bar{Λ}, \tilde{Λ}) d W (s) \\ + \int_{0}^{t} \int_{Z^{h}} I (s^{-}, x, Υ, Λ, \bar{Λ}, \tilde{Λ}, z) \tilde{N} (λ d s, d z), \\ Λ (t, x) & = H (x) + \int_{t}^{τ} \bar{L} (s^{-}, x, Υ, Λ, \bar{Λ}, \tilde{Λ}) d s \\ + \int_{t}^{τ} \bar{J} (s^{-}, x, Υ, Λ, \bar{Λ}, \tilde{Λ}) d W (s) \\ + \int_{t}^{τ} \int_{Z^{h}} \bar{I} (s^{-}, x, Υ, Λ, \bar{Λ}, \tilde{Λ}, z) \tilde{N} (λ d s, d z), \end{matrix} \end{matrix}

(8)

where

t \in [0, T]

,

Z^{h} = R^{h} - {0}

or

R_{+}^{h}

for an integer

h > 0

. Note that here, we use

Z^{h} = R^{h} - {0}

(not

R^{h}

) to support our Lévy measure since a Lévy measure may be singular at zero (see, e.g., Applebaum [21]). Of course, an alternative convention can also be introduced by defining a Lévy measure on

R^{h}

if we assign

ν ({0}) = 0

(see, e.g., Sato [22]). Furthermore,

s^{-}

in Equation (8) denotes the corresponding left limit at time point s. In particular,

D \in R^{p}

with a given

p \in N = {1, 2, \dots}

is a connected domain, for examples, a p-dimensional box, a p-dimensional ball (or a general manifold), a p-dimensional sphere (or a general Riemannian manifold), or the whole Euclidean space

R^{p}

of real numbers itself. The forward equation in Equation (8) is with the given initial random vector-field G, while the backward equation in Equation (8) has the known terminal random vector-field H. In Equation (8),

Υ

and

Λ

are r-dimensional and q-dimensional random vector-field processes, respectively. Furthermore, W denotes a standard BM that is a d-dimensional one. In addition, the notation

\tilde{N}

denotes an h-dimensional centered Lévy jump process (or centered subordinator). More precisely, the forward equation in Equation (8) is r-dimensional, i.e.,

Υ = {(Υ_{1}, \dots, Υ_{r})}^{'}

. Meanwhile, the backward equation in Equation (8) is q-dimensional, i.e.,

Λ = {(Λ_{1}, \dots, Λ_{q})}^{'}

. This type of vector SPDE exists in real-world applications such as color image processing and multi-mode generative AI. A color image can typically be represented by a vector PDE (see, e.g., Caselles et al. [23], Tschumperlé and Deriche [24,25,26]). In the forward image processing by noise encoding, it corresponds to a forward vector SPDE. The added noise can be a Gaussian noise or a non-Gaussian noise. In the Gaussian noise case, it corresponds to Brownian motion. In the non-Gaussion noise case, it corresponds to a Lévy process. Similarly, in the backward image processing by noise decoding for color image recovery, it corresponds to a backward vector SPDE. Furthermore, the forward image processing and backward image processing can be performed in a coupled way. In addition, the current hot area concerning multi-mode generative AI can also be explained in the same way (see, e.g., Kratsios [27], Peluchetti [28], and Vaswani et al. [29] for more details).

Concerning the explanation and importance of the 4-tuple solution

(Υ, Λ, \bar{Λ}, \tilde{Λ})

, we provide a brief introduction as follows. The random field

Υ

described by the forward SPDE in Equation (8) corresponds to a randomized Fokker Planck equation. It is a generalized representation from a traditional mean-field game problem (see, e.g., Huang et al. [30] and Lasry and Lions [31]) to a stochastic (quantum) mean-field game (see, e.g., Kolokoltsov [32]). More precisely,

Υ

models the observed random dynamics of particle density and distribution from a filtering system. Comparing it with the study by Kolokoltsov [32], we extend the case with Gaussian noise corresponding to the Brownian motion W in Equation (8) to the case with additional non-Gaussian noise corresponding to a pure jump Lévy process

\tilde{N}

in Equation (8). Furthermore, we also add the feedback information

(Λ, \bar{Λ}, \tilde{Λ})

from the backward SPDE in Equation (8) to the forward SPDE in Equation (8). Concerning the 3-tuple solution

(Λ, \bar{Λ}, \tilde{Λ})

of the backward SPDE in Equation (8), it is a generalized situation from the case with Gaussian noise in Dai [14,15] to the case with added non-Gaussian noise. Actually, the backward SPDE in Equation (8) is a generalized Hamilton–Jacobi–Bellman (HJB) equation corresponding to a certain optimization problem, i.e.,

\begin{matrix} Λ (t, x) = E_{Q^{*}} [H (T, x) | F_{t}, X (t) = x], \end{matrix}

(9)

where

Q^{*}

is the so-called variance optimal martingale measure (see, e.g., Dai [33]). Furthermore,

F_{t}

in Equation (9) is a sigma algebra generated by the Brownian motion and the Lévy process. Due to the martingale representation theorem for Lévy processes (see, e.g., Applebaum [21]),

Λ (t, x)

can be decomposed into a macro-trend part, a Gaussian noise micro-regulating part with coefficient

\bar{Λ}

, and a non-Gaussian noise micro-regulating part with coefficient

\tilde{Λ}

. The macro-trend part may correspond to a linear or a nonlinear regression function. In the recent study of Dai [14,15], such a decomposition is referred to as a big model regression.

Note that some special form of the unified system in Equation (8) is published in Dai [14], which is a Brownian motion-driven backward SPDE subsystem of Equation (8) with useful real-world explanations. Concerning the importance and meaningfulness of the forward–backward coupled system in Equation (8), it can be explained as a generalized and randomized form of the optimality equation of the well-known mean field game problem (see, e.g., Huang et al. [30] and Lasry and Lions [31]). However, compared with the second-order partial differential operators introduced in the optimality equation of mean-field games, the orders of our partial differential operator for each

A \in {L, J, \bar{L}, \bar{J}}

in Equation (8) can be a general high order. Furthermore, as mentioned previously, the coupling between the forward and backward SPDEs in Equation (8) can be used to explain the interaction between noise encoding and noise decoding in the current hot diffusion transformer model for generative AI (see, e.g., Kratsios [27], Peluchetti [28], and Vaswani et al. [29] for more details). Compared with the studies in Dai [2] and Peluchetti [28], our current SPDE-based coupling can be directly used to model the noise encoding and noise decoding interaction processes of two-dimensional plane images or images over high-dimensional manifolds (e.g., a sphere and a tori).

More precisely, the partial differential operators of r-dimensional vector

L

,

r \times d

-dimensional matrix

J

, and

r \times h

-dimensional matrix

I

are functionals of

Υ

,

Λ

,

\bar{Λ}

, and

\tilde{Λ}

, whose partial derivatives are up to the kth order for

k \in {0, 1, 2, 3, \dots}

, and so are the partial differential operators of q-dimensional vector

\bar{L}

,

q \times d

-dimensional matrix

\bar{J}

, and

q \times h

-dimensional matrix

\bar{I}

. Hereafter, for each

A \in {L, J, \bar{L}, \bar{J}}

, we define

\begin{matrix} A (s, x, Υ, Λ, \bar{Λ}, \tilde{Λ}, z) & \equiv & A (s, x, (Υ, \frac{\partial Υ}{\partial x_{1}}, \dots, \frac{\partial^{k} Υ}{\partial x_{1}^{i_{1}} \dots \partial x_{p}^{i_{p}}}) (s, x), \\ (Λ, \frac{\partial Λ}{\partial x_{1}}, \dots, \frac{\partial^{k} Λ}{\partial x_{1}^{i_{1}} \dots \partial x_{p}^{i_{p}}}) (s, x), \\ (\bar{Λ}, \frac{\partial \bar{Λ}}{\partial x_{1}}, \dots, \frac{\partial^{k} \bar{Λ}}{\partial x_{1}^{i_{1}} \dots \partial x_{p}^{i_{p}}}) (s, x), \\ (\tilde{Λ}, \frac{\partial \tilde{Λ}}{\partial x_{1}}, \dots, \frac{\partial^{k} \tilde{Λ}}{\partial x_{1}^{i_{1}} \dots \partial x_{p}^{i_{p}}}) (s, x, \cdot), ★), \end{matrix}

(10)

where the dot “·” in

\tilde{Λ} (s, x, \cdot)

and its associated partial derivatives denote the integration in terms of the well-known Lévy measure. Furthermore, the star “★” on the right-hand side of Equation (10) represents some stochastic factors that are supposed to be known. However, if

A \in {I, \bar{I}}

, the last line on the right-hand side of Equation (10) ought to be changed to an expression of the form

\begin{matrix} (\tilde{Λ}, \frac{\partial \tilde{Λ}}{\partial x_{1}}, \dots, \frac{\partial^{k} \tilde{Λ}}{\partial x_{1}^{i_{1}} \dots \partial x_{p}^{i_{p}}}) (s, x, z), z, ★ . \end{matrix}

(11)

Note that our partial differential operators presented in Equation (10) can be of general nonlinearity and a general high order. For example,

A

can be taken as

\begin{matrix} \frac{\partial^{k} Υ}{\partial x^{k}} + λ {(\frac{\partial Υ}{\partial x})}^{k} \end{matrix}

(12)

for

k \in {1, 2, 3, \dots}

in a single-dimensional position parameter state space.

Note that if k in Equation (12) equals 2, the associated operator corresponds to the well-known KPZ equation in Hairer [34] and Kardar et al. [35]. By Cole-Hopf transformation and Itô’s formula, a solution to an SPDE associated with a scaled constant can be determined. Then, by letting the scaling factor tend to zero, a solution to the KPZ equation can be constructed and described by rough path theory. However, in our current study, we aim to show the existence and uniqueness of a solution of the generally unified system of the coupled SPDEs in Equation (8). Thus, we have different study purposes between the work in Hairer [34] and our current one. More precisely, since our unified vector-form system in Equation (8) is of a general vector dimension, general nonlinearity, general orders of partial differential operators, and general discontinuous Lévy jumps, the conventional calculation (i.e., integral by parts)-based proving approach can not be applied. Thus, by generalizing the recent study in Dai [14] from the existing Brownian motion (BM)-driven backward case to a general Lévy-driven forward–backward coupled case, we can prove the existence and uniqueness of an adapted 4-tuple strong solution to the system in Equation (8). In doing so, we construct a new topological space to support the proof. The construction of the topological space is through constructing a set of topological spaces associated with a set of exponents

{γ_{1}, γ_{2}, \dots}

under a set of general local linear growth and Lipschitz conditions, which is significantly different from the construction of the single exponent case (see, e.g., Zhou and Yong [36]). Furthermore, due to the coupling from the forward SPDE and the involvement of the discontinuous Lévy jumps, our study is also significantly different from the BM-driven backward case as studied in Dai [14].

The comparison between our current work and our previous work published in Dai [14] can be summarized as follows. Our previously published work is on a sole BM-driven backward SPDE while our current work is on a generalized Lévy process-driven forward and backward coupled SPDE system. In our BM-driven backward case, the solution is a 2-tuple vector one. Meanwhile, in our current Lévy process-driven forward and backward coupled case, our solution is a 4-tuple vector one. The major difference between BM and Lévy a process is as follows: the sample path of BM is almost surely continuous while the sample path of a Lévy process can have at most countable discontinuous jump points. Furthermore, the jump size distribution needs to be controlled by an associated Lévy measure. Thus, a Lévy process is much more complicated than BM. In this sense, our Lévy process-driven SPDE is also much more complicated than a BM-driven SPDE. Since our current study is based on the forward and backward coupling, our current study has additional complexity concerning integrating the dynamics of the forward SPDE into the dynamics of the backward SPDE. Furthermore, the mentioned complexities make our newly constructed supporting topological space significantly different from the one in our previous study in Dai [14]. In addition, these complexities also make our other related discussions significantly different from those in our previous study in Dai [14].

The solution of the FB-SPDE in Equation (8) can be interpreted in a sample surface manner with a time–position parameter

(t, x)

(for example,

Λ (t, x)

can be roughly illustrated in Figure 1). Note that, the sample surfaces may involve jumps in a time parameter t since the system is driven by the Lévy type of noises.

Besides the applications in the SDG problems, our unified system of SPDEs also has importance in many other real-world systems. Interested readers are referred to the existing literature (see, e.g., Caselles et al. [23], Bouard and Debussche [37,38], Chai [39,40], Chang et al. [41], Dai [14,15,33,42], Hall [43], Karplus and Luttinger [44], Lions and Souganidis [45], Musiela and Zariphopoulou [18], Øksendal et al. [20], and Thouless [46]) for more details. In this regard, the initial random vector-field G, the terminal random vector-field H, and the 4-tuple solution process

(Υ, Λ, \bar{Λ}, \tilde{Λ})

are allowed to be complex-valued.

The rest of the paper is organized as follows. The system of FB-SDEs for our non-zero-sum or zero-sum SDG problem is introduced in Section 2. The Queuing game and Go game aided with AlphaGo or AlphaGo Zero are exactly modeled via the system. The main theorem for the SDG game and the FB-SPDEs with required conditions are presented in Section 2.2. Finally, our main Theorem is proved in Section 3 by developing related theory.

2. Main Theorem with Examples

2.1. State and Value Processes

First, we consider a fixed complete probability space denoted by

(Ω, F, P)

. On this probability space, we first define a standard d-dimensional BM

W \equiv {W (t), t \in [0, T]}

with

W (t) = {(W_{1} (t), \dots, W_{d} (t))}^{'}

for a given

T \in [0, \infty)

. Then, we define an h-dimensional general Lévy pure jump process

L \equiv {L (t), t \in [0, T]}

with

L (t) \equiv {(L_{1} (t), \dots, L_{h} (t))}^{'}

(see, e.g., Applebaum [21], Bertoin [47], and Sato [22]). Note that the prime appearing in this paper is used to denote the associate transpose of a vector or a matrix. Moreover, we suppose that W, L, and their components are mutually independent. For a fixed

λ = {(λ_{1}, \dots λ_{h})}^{'} > 0

, which is called a reversion rate vector in many applications, we let

L (λ s) = {(L_{1} (λ_{1} s), \dots, L_{h} (λ_{h} s))}^{'}

. Then, we denote a filtration by

{F_{t}}_{t \geq 0}

with

F_{t} \equiv σ {G, W (s), L (λ s) : 0 \leq s \leq t}

for every given

t \in [0, T]

. Note that here, the notation

G

denotes a

σ

-algebra and is independent of W and L. In addition, we use

ν_{i}

for an

i \in {1, \dots, h}

to denote a Lévy measure and use

I_{A} (\cdot)

to denote the index function over a set A. Then, we can introduce the well-known Poisson random measure with a deterministic time-homogeneous intensity measure

d s ν_{i} (d z_{i})

as follows:

\begin{matrix} N_{i} ((0, t] \times A) \equiv \sum_{0 < s \leq t} I_{A} (L_{i} (s) - L_{i} (s^{-})) . \end{matrix}

(13)

Thus, it follows from Theorem 13.4 and Corollary 13.7 in pages 237 and 239 of Kallenberg [48] that

L_{i}

for each

i \in {1, \dots, h}

have the following expression:

\begin{matrix} L_{i} (t) = a_{i} (t) + \int_{(0, t]} \int_{Z} z_{i} N_{i} (λ_{i} d s, d z_{i}), t \geq 0 . \end{matrix}

(14)

For convenience, we take the constant

a_{i}

to be zero.

Then, we can elaborate on the value process

V_{l}^{u}

with

l \in {1, \dots, q}

for the non-zero-sum SDG problem in Equations (1) and (2) or the zero-sum SDG problem in Equation (5) by a generalized system of coupled FB-SDEs with Lévy jumps under a given control rule u, i.e.,

\begin{matrix} \{\begin{matrix} X (t) & = x + \int_{0}^{t} b (s^{-}, X, Λ, \bar{Λ}, \tilde{Λ}, u) d s \\ + \int_{0}^{t} σ (s^{-}, X, Λ, \bar{Λ}, \tilde{Λ}, u) d W (s) \\ + \int_{0}^{t} \int_{Z^{h}} η (s^{-}, X, Λ, \bar{Λ}, \tilde{Λ}, u, z) \tilde{N} (d s, d z), \\ Λ (t) & = H (X (T), \cdot) + \int_{t}^{T} c (s^{-}, X, Λ, \bar{Λ}, \tilde{Λ}, u) d s \\ - \int_{t}^{T} α (s^{-}, X, Λ, \bar{Λ}, \tilde{Λ}, u) d W (s) \\ - \int_{t}^{T} \int_{Z^{h}} ζ (s^{-}, X, Λ, \bar{Λ}, \tilde{Λ}, u, z) \tilde{N} (d s, d z) . \end{matrix} \end{matrix}

(15)

Note that the coefficients in Equation (15) are assumed not to contain any partial derivative operators, i.e., the system in Equation (15) is the one of coupled SDEs (rather than SPDEs as in Equation (8)), and the abbreviated notion

f (t, x, X, Λ, \bar{Λ}, \tilde{Λ}, u, z)

for each functional

f \in {b, σ, c, α, η, ζ}

has the form

\begin{matrix} \{\begin{matrix} f (t, X (t), Λ (t), \bar{Λ} (t), \tilde{Λ} (t, \cdot), u (t, X (t)), \cdot), & f \in {b, σ, c, α}, \\ f (t, X (t), Λ (t), \bar{Λ} (t), \tilde{Λ} (t, z), u (t, X (t)), z, \cdot), & f \in {η, ζ} . \end{matrix} \end{matrix}

(16)

Furthermore, the proof concerning the existence and uniqueness of a solution of the system in Equations (15) and (16) is a case study for the unified FB-SDEs in Dai [2]. To be clear and to illustrate the usages of the system in Equations (15) and (16), we have the following examples.

2.2. Main Theorem

First, we present our main theorem concerning how to obtain a Pareto optimal Nash equilibrium policy process for the non-zero-sum game problem in Equations (1) and (2) subject to the constraint in Equation (15). More precisely, we define the special forms of partial differential operators

\bar{L}

,

\bar{J}

, and

\bar{I}

as follows, i.e., for each

l \in {0, 1, \dots, q}

,

\begin{matrix} {\bar{L}}_{l} (t, x, Υ, Λ, \bar{Λ}, \tilde{Λ}, u) \\ \equiv & \sum_{i, j = 1}^{p} {(σ σ^{'})}_{i j} (t, x, u) \frac{\partial^{2} Λ_{l} (t, x)}{\partial x_{i} \partial x_{j}} + \sum_{i = 1}^{p} b_{i} (t, x, u) \frac{\partial Λ_{l} (t, x)}{\partial x_{i}} \\ + \sum_{j = 1}^{d} \sum_{i = 1}^{p} σ_{j i} (t, x, u) \frac{\partial α_{l j} (t, x, u)}{\partial x_{i}} + c_{l} (t, x, u) \\ - \sum_{j = 1}^{h} \int_{Z} (Λ_{l} (t, x + η_{j} (t, x, u, z_{j})) - Λ_{l} (t, x) - \sum_{i = 1}^{p} \frac{\partial Λ_{l} (t, x)}{\partial x_{i}} η_{i j} (t, x, u, z_{j})) ν_{j} (d z_{j}) \\ - \sum_{j = 1}^{h} \int_{Z} (ζ_{l j} (t, x + η_{j} (t, x, u, z_{j}), u, z_{j}) - ζ_{l j} (t, x, u, z_{j})) ν_{j} (d z_{j}), \end{matrix}

(17)

where

η_{i j}

and

η_{j}

for each

i \in {1, 2, \dots, p}

and each

j \in {1, 2, \dots, h}

are the

(i, j)

th entry and the jth column of

η

, respectively. Furthermore,

\begin{matrix} c_{0} (t, x, u) & = & \sum_{l = 1}^{q} c_{l} (t, x, u), \end{matrix}

(18)

\begin{matrix} ζ_{0 j} (t, x, u, z_{j}) & = & \sum_{l = 1}^{q} ζ_{l j} (t, x, u, z_{j}), \end{matrix}

(19)

and

ζ_{l j}

, for

l \in {1, 2, \dots, q}

and

j \in {1, 2, \dots, h}

, is the

(i, j)

th entry of

ζ

. Note that the partial derivative

\begin{matrix} \frac{\partial α_{l j} (t, x, u)}{\partial x_{i}} f o r e a c h i \in {1, 2, \dots, p}, j \in {1, 2, \dots, d}, a n d l \in {0, 1, 2, \dots, q} \end{matrix}

should be interpreted according to chain rule since

α (t, x)

is also a function in x by

(Λ, \bar{Λ}, \tilde{Λ}) (t, x)

and

u (t, x)

, where

\begin{matrix} α_{0 j} (t, x, u) = \sum_{l = 1}^{q} α_{l j} (t, x, u) . \end{matrix}

(20)

Furthermore, we define

\begin{matrix} \bar{J} (t, x, Υ, Λ, \bar{Λ}, \tilde{Λ}) & = & - \bar{Λ} (t, x), \end{matrix}

(21)

\begin{matrix} \bar{I} (t, x, Υ, Λ, \bar{Λ}, \tilde{Λ}, z) & = & - \tilde{Λ} (t, x), \end{matrix}

(22)

\begin{matrix} Λ (T, x) & = & H (x) . \end{matrix}

(23)

Then, we have the following definitions.

Definition 3.

A set

C

of stochastic processes corresponding to the operators

\bar{L}

,

\bar{J}

, and

\bar{I}

in Equations (17), (21), and (22) is called the admissible set of adapted control policy processes if

{{\bar{L}}_{l} (t, x, Υ, Λ

,

\bar{Λ}, \tilde{Λ}, u), l \in {0, 1, \dots, q}}

together with

{L, J, I, \bar{J}, \bar{I}}

satisfies the conditions stated in Theorem 1.

Definition 4.

{{\bar{L}}_{l} (t, x, Υ, Λ, \bar{Λ}, \tilde{Λ}, u), l \in {0, 1, \dots, q}}

together with

{L, J, I}

is said to satisfy the comparison principle in terms of u if

\begin{matrix} {\bar{L}}_{l} (t, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, u^{1}) & \geq & {\bar{L}}_{l} (t, x, Υ^{2}, Λ^{2}, {\bar{Λ}}^{2}, {\tilde{Λ}}^{2}, u^{2}), \end{matrix}

(24)

\begin{matrix} H^{1} (x) & \geq & H^{2} (x) \end{matrix}

(25)

for any

u^{i} \in C

and

F_{T}

-measurable

H^{i}

with an associated solution

(Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}) (t, x)

of Equation (15), where

i \in {1, 2}

and

(t, x) \in [0, T] \times D

; then, we have

\begin{matrix} Λ^{1} (t, x) \geq Λ^{2} (t, x) . \end{matrix}

Therefore, based on these definitions, we have our main theorem for the non-zero-sum SDG problem in Equations (1) and (2) as follows.

Theorem 1.

For the operators

{\bar{L}, \bar{J}, \bar{I}}

in Equations (17)–(22) and under the terminal condition in Equation (23), if the

(r, q + 1)

-dimensional FB-SPDEs in Equation (8) have a 4-tuple solution

(Υ (t, x)

,

Λ (t, x)

,

\bar{Λ} (t, x)

,

\tilde{Λ} (t, x))

and

{{\bar{L}}_{l}

(t, x, Υ, Λ

,

\bar{Λ}

,

\tilde{Λ}, u)

,

l \in {0, 1, \dots, q}}

together with

{L, J, I, \bar{J}, \bar{I}}

satisfies the comparison principle, there is a Pareto optimal Nash equilibrium policy process

u^{*} (t, X (t))

over the admissible set

C

to the non-zero-sum SDG problem in Equations (1) and (2), where

(X (t)

,

Λ (t), \bar{Λ} (t), \tilde{Λ} (t, z))

is the unique adapted strong solution of the FB-SDE system in Equation (15) with the coefficients

f (t, x, X, Λ, \bar{Λ}, \tilde{Λ}, u^{*}, z)

corresponding to Equation (16) for each

f \in {b, σ, c, α, η, ζ}

. Furthermore, the solution has the following expressions for each

j \in {1, \dots, h}

:

\begin{matrix} Λ_{l} (t) & = & Λ_{l} (t, X (t)), \end{matrix}

(26)

\begin{matrix} {\bar{Λ}}_{l j} (t) & = & - (α_{l j} (t, X (t), u^{*}) + \sum_{i = 1}^{p} σ_{l i} (t, X (t), u^{*}) \frac{\partial Λ_{l} (t, X (t))}{\partial x_{i}}), \end{matrix}

(27)

\begin{matrix} {\tilde{Λ}}_{l j} (t, z)) & = & - (Λ_{l} (t, X (t) + η_{j} (t, X (t), u^{*}, z_{j})) - Λ_{l} (t, X (t))) \\ - ζ_{l j} (t, X (t) + η_{j} (t, X (t), u^{*}, z_{j}), u^{*}, z_{j}) . \end{matrix}

(28)

We provide a proof for Theorem 1 in Section 3. Here, we first present a counterpart for the zero-sum SDG problem in Equations (5) and (2). In this case, the system of FB-SPDEs in Equation (8) is

(r, q)

-dimensional for each

l \in {1, \dots, q}

since the terminal summation constraint should be a constant as shown in Equation (5).

Corollary 1.

For the operators

{\bar{L}, \bar{J}, \bar{I}}

in Equations (17)–(22) with the index l replaced by

m l

and under the terminal condition in Equation (23), if the

(r, q^{2})

-dimensional FB-SPDEs in Equation (8) have a 4-tuple solution

(Υ (t, x)

,

Λ (t, x)

,

\bar{Λ} (t, x)

,

\tilde{Λ} (t, x))

and

{{\bar{L}}_{m l}

(t, x, Υ, Λ

,

\bar{Λ}

,

\tilde{Λ}, u)

,

l \in {1, \dots, q}}

for each

m \in {1, \dots, q}

together with

{L, J, I, \bar{J}, \bar{I}}

satisfies the comparison principle, there is a saddle point policy process

u^{*} (t, X (t))

over the admissible set

C

to the zero-sum SDG problem in Equations (5) and (2) for each fixed

l \in {1, \dots, q}

, where

(X (t)

,

Λ (t), \bar{Λ} (t), \tilde{Λ} (t, z))

is a strong solution of the system of FB-SDEs in Equation (15). Furthermore, it is unique and adapted as given in Equations (26)–(28).

Note that the proof for Corollary 1 is similar to the one for Theorem 1. Based on Equation (5) and Definition 2, we can derive the required system of HJB equations as for Equation (17) and so on, which is a special form of FB-SPDE in Equation (8). Then, based on this system of HJB equations, we can discuss the comparison principle similar to that for Theorem 1.

2.3. Real-World Examples

2.3.1. Sharing vs. Competition in Cloud Services and Energy Grids

In this subsubsection, we conduct case studies in terms of resources-sharing and resources-competition in quantum-cloud computing services and power-energy grids. They can be either non-zero-sum game-oriented problems or zero-sum game-oriented problems. In doing so, we use queuing networks to model the dynamics of these systems’ internal data (or more fashionably called Big Data) flows. They typically consist of arrival processes, service processes, and buffer (quantum particle) storages with a certain kind of service regime and network architecture (see, e.g., an example in Figure 2).

In this example, two real-world physical queuing systems aided with quantum-cloud computing service centers are presented. The first one in the left-lower part of Figure 2 is with p-job classes, and it can be considered as a multi-input multi-output (MIMO) wireless channel presented in Dai [3,8]. The focus of this case is on how to allocate the common shared transmission rate capacity to different users fairly in a non-zero-sum game manner. The second one in the right-lower part of Figure 2 is also with p-job classes and it typically represents a power-energy grid. In this case, the users are competing with each other in order to gain access into the grid via admission control while establishing the best routing path in a BestGo way (i.e., in a zero-sum game manner). The quantum-cloud computing service centers in the right-upper part of Figure 2 are with J-job classes. They are equipped with blockchain software architecture for the purpose of system security and management as studied in Dai [8]. Note that both the physical systems and the quantum-cloud centers can be studied as independent queuing systems. For the purpose of the current paper, we use the physical queuing systems as illustrative examples.

The major performance measure for these queuing systems is the queue length process (see, e.g., Dai [3,49]), which is represented through

Q (\cdot) = {(Q_{1} (\cdot), \dots, Q_{p} (\cdot))}^{'}

. Each component

Q_{l} (t)

denotes the number of the lth class jobs stored in the ith buffer for each

l \in {1, \dots, p}

at a fixed time point t. If we use

Q (0)

to denote the initial queue length for the system, the queuing dynamics of these systems can be presented by

\begin{matrix} Q (t) = Q (0) + A (t) - D (t), \end{matrix}

(29)

where the lth component

A_{l} (t)

of

A (t)

for each

l \in {1, \dots, p}

is the total number of jobs that arrived to buffer l by time t, and the lth component

D_{l} (t)

of

D (t)

is the total number of jobs that departed from buffer l by time t. More precisely, we assume that each

A_{l} (\cdot)

for

l \in {1, \dots, p}

is a Lévy process with intensity measure

a_{l} (t, Q (t), z_{l}) d t ν_{l} (d z_{l})

. In this regard, it is the job arrival rate to buffer i at time t and depends on the queue state at that moment. Furthermore, we suppose that the Lévy process is time-inhomogeneous. Similarly, we assume that each

D_{i} (\cdot)

is also a time-inhomogeneous Lévy process with intensity measure

d_{l} (t, Q (t), z_{l}) d t ν_{i} (d z_{l})

that is the assigned service rate to buffer l at time t. Furthermore, we assume that the routing proportion from buffer j to buffer l for jobs finishing service at buffer j is

p_{j l} (t, Q (t), z_{j})

. Based on this primitive data flow structure, we can model the dynamics that can be represented by Equation (15) for two types of typical queuing systems.

More precisely, in an inventory system (see, e.g., Dai and Jiang [5], and references therein), the queue length process

Q_{l} (\cdot)

for each

l \in {1, \dots, p}

can be either positive or negative. If

Q_{l} (t) \geq 0

, it is called the inventory level for the lth queue at time t, and otherwise, it is called the back-order level. Furthermore, in a general queuing system, the non-negativity of

Q_{l} (\cdot)

is usually a system constraint. However, for such a constrained dynamical system, people are frequently interested in an input and/or service fluid rate control problem; for example, the stochastic fluid limit model derived in Dai [3] has the purpose of meeting such a goal. In a fluid limit model, the impact from the constraint

Q_{l} (\cdot) \geq 0

and its associated boundary reflection is washed out owing to fluid scaling and the functional strong law of large number.

Thus, by the discussion on pages 190–193 of Applebaum [21], the queue length process in Equation (29) for an inventory system or a fluid limit model can be further expressed by a forward SDE (a special form of the forward–backward system in Equation (15)) with

Z = R_{+}

, i.e.,

\begin{matrix} d Q_{l} (t) & = & \sum_{j = 1}^{h} \int_{Z} η_{l j} (t, Q (t), z_{j}) ν_{j} (d z_{j}) d t + \sum_{j = 1}^{h} \int_{Z} η_{l j} (t, Q (t), z_{j}) {\tilde{N}}_{j} (d t, d z_{j}), \end{matrix}

(30)

where

η_{l k} (t, Q (t), z_{j})

for each

l, k \in {1, \dots, p}

and

j \in {1, \dots, h}

is given as follows:

\begin{matrix} η_{l j} (t, Q (t), z_{j}) = \{\begin{matrix} a_{l} (t, Q_{l} (t), z_{l}) - d_{l} (t, Q_{l} (t), z_{l}) & if j = l, \\ \sum_{k \neq l} p_{k l} (t, Q_{k} (t), z_{j}) d_{k} (t, Q_{k} (t), z_{j}) & if j \neq l . \end{matrix} \end{matrix}

Note that the added Lévy process-driven term in Equation (30) is used to model the batch arrivals in an inventory system or the massive fluid data rate oscillation in a high-performance quantum-cloud computing based queuing system. Furthermore, the coefficients in Equation (30) may be discontinuous at the queue state

Q_{l} (t) = 0

for a stochastic fluid limit model. However, since the system in Equation (30) is designed in a controllable manner, the service rate

d_{l} (s, Q (s))

can always be set to zero when

Q_{l} (t) = 0

. Hence, the generalized Lipschitz and linear growth conditions required by Dai [2] may be imposed. Thus, the system in Equation (30) can still be well-posed even in this extreme case. Readers are also referred to Mandelbaum and Massey [50], Mandelbaum and Pats [6], and Konstantopoulos et al. [51] for some specific formulations of the system in Equation (30).

Next, owing to the system in Equation (30), we can formulate a queuing game problem with

q = p

. In this game, we try to schedule the service capacity over certain resource pool

D

in

R_{+}^{p}

(e.g., called the available transmission rate capacity region for the related discussion in Dai [3]) in the queuing system to different players. Under a given utility function

c_{l} (s^{-}, Q_{l}, Λ_{l}, {\bar{Λ}}_{l \cdot}, {\tilde{Λ}}_{l \cdot})

and a target terminal value

H_{l} (Q_{l} (T))

, the value process for each player

l \in {1, \dots, p}

can be represented by

\begin{matrix} Λ_{l} (t) & = & H_{l} (Q_{l} (T)) + \int_{t}^{T} c_{l} (s^{-}, Q_{l}, Λ_{l}, {\bar{Λ}}_{l \cdot}, {\tilde{Λ}}_{l \cdot}) d s \\ - \int_{t}^{T} {\bar{Λ}}_{l \cdot} (s^{-}) d W (s) - \int_{t}^{T} \int_{Z^{h}} {\tilde{Λ}}_{l \cdot} (s^{-}, z) \tilde{N} (d s, d z), \end{matrix}

(31)

where

{\bar{Λ}}_{l \cdot}

and

{\tilde{Λ}}_{l \cdot}

with

l \in {1, \dots, p}

are the lth rows of

\bar{Λ}

and

\tilde{Λ}

, respectively. Then, we first present the following claim.

Claim 2.

If

c_{l} (t, Q_{l}, Λ_{l}, {\bar{Λ}}_{l \cdot}, {\tilde{Λ}}_{l \cdot})

for each

l \in {1, \dots, p}

satisfies the generalized Lipschitz and linear growth conditions in Dai [2], there is a Pareto optimal Nash equilibrium policy for the non-zero-sum game problem in Equations (1) and (2) subject to the queuing constraint in Equation (30) and the associated value process in Equation (31).

The statement in Claim 2 is a special case of the maximum principle and our main theorem presented in Section 2.2. Note that “the generalized Lipschitz and linear growth conditions in Dai [2]” stated in Claim 2 are actually special cases of the conditions in Equations (59)–(62) in Section 3.1 of this paper. More precisely, we can present these conditions by using

f \in {b, σ, c, α}

in Equation (15) to replace

A \in {L, \bar{L}, J, \bar{J}}

in Equation (8), by using

f \in {η, ζ}

in Equation (15) to replace

A \in {I, \bar{I}}

in Equation (8), and by using the norm

∥ v ∥ = {∥ v ∥}_{C (D)}

without involving partial derivatives for the conditions in Dai [2] to replace the corresponding

{∥ u ∥}_{C^{k} (D)}

involved partial derivatives in Equations (59)–(62) in the current paper. Under the generalized Lipschitz and linear growth conditions in Dai [2], the system in Equation (15) is well-posed for any measurable control function u. Thus, the state constraint for our game-theoretic problems is feasible, which means that the related Nash equilibrium point policy process and saddle point policy process may be derived.

Furthermore, here, we remark that if we consider the servers in the left-lower part of Figure 2 as input ports in a power-energy grid (e.g., the one in the right-lower part of Figure 2), different users will compete with each other to obtain the best routing path according to some routing probability (e.g., the parameter p in the right-lower part of Figure 2), some value functions (see, e.g., Dai [52]), and some routing algorithms (see, e.g., Ash [16]). On certain occasions, this type of routing problem (e.g., the well-known AlphaGo and AlphaGo Zero for mastering the Go game) may be summarized as a zero-sum game problem as formulated in Equation (5).

2.3.2. Mastering the Zero-Sum Game of Go

In a recent hot paper (i.e., Silver et al. [10,11]), the authors designed and implemented AlphaGo and AlphaGo Zero policies to master the game of Go with the help of deep neural network-based AI technologies (see, e.g., the illustration in Dai [14]). In a game of Go, two players compete with each other with the hope to surround more territory than their opponents over a grid of lines on a square board (e.g., it can be modeled as a

19 \times 19

image). More precisely, each player wishes to obtain the larger number of intersections when the rule of area scoring (i.e., a player’s score is the number of stones that the player has on the board, plus the number of empty intersections surrounded by that player’s stones) is applied. Corresponding to the dynamical representation for a queuing process in Equation (29), we use

Q_{l} (t)

with

Q_{l} (0) = 0

for each

l \in {1, 2}

to denote the number of intersections won by the lth player by time t and call it the performance process for the lth player. The process

A_{l} (t)

in Equation (29) with

A_{l} (0) = 0

is called the “win” process in this game of Go. It denotes the cumulative number of intersections won by the lth player by time t. Furthermore, it corresponds to a sequence of independently and identically distributed (

i . i . d .

) time random variables

{τ_{1} (l), τ_{2} (l), \dots}

and a sequence of

i . i . d .

random rewards

{ξ_{1} (l), ξ_{2} (l), \dots}

.

Now, consider

τ_{n} (l)

with

τ_{0} (l) = 0

for each

l \in {1, 2}

and

n \in {1, 2, \dots}

as the thinking time required by the lth player to reach a decision in his nth move. In a competition of Go, the decision time is confined by some positive constant c (e.g., c takes 2 s in a competition between AlphaGo and a human expert, and meanwhile, c takes 0.2 s in a competition between AlphaGo Zero and AlphaGo). Therefore, for each

l \in {1, 2}

, define

\begin{matrix} τ_{n}^{c} (l) = τ_{n} (l) I_{{τ_{n} (l) \leq c}}, {\bar{τ}}_{n}^{c} (l) = τ_{n} (l) I_{{τ_{n} (l) > c}} \end{matrix}

(32)

and for

n \in {0, 1, \dots}

, let

\begin{matrix} T_{0} & = & 0, \end{matrix}

(33)

\begin{matrix} T_{1} & = & τ_{1}^{c} (1), \end{matrix}

(34)

\begin{matrix} T_{2} & = & τ_{1}^{c} (1) + τ_{1}^{c} (2), \end{matrix}

(35)

\begin{matrix} T_{2 n + 1} & = & T_{2 n} + τ_{1}^{c} (1), \end{matrix}

(36)

\begin{matrix} T_{2 n + 2} & = & T_{2 n + 1} + τ_{1}^{c} (2) . \end{matrix}

(37)

Furthermore, let

S (t)

denote the raw board representation of the position and its history for a game of Go at time t. In addition, let

I (t) = {I_{[T_{n}, T_{n + 1}]} (t), n \in {1, 2, \dots}}

be the decision regime-switching indicator process, i.e., if

I (t) = I_{[T_{2 n}, T_{2 n + 1}]} (t) = 1

for some

n \in {1, 2, \dots}

, the 1st player is in an active decision-making period while the 2nd player is in a waiting period. More precisely, at the end of each time interval

[T_{2 n}, T_{2 n + 1})

, the 1st player makes a move decision according to some probability distribution

p (I (t), S (t))

and a value function

v (I (t), S (t))

with

t = T_{2 n + 1}

. Note that this decision can either be made rationally by the 1st player (e.g., via AlphaGo Zero) if the thinking time

τ_{n} (1) \leq c

or be made irrationally (say, according to a given probability distribution

p (1)

) if

τ_{n} (1) > c

. Similarly, at the end of each interval

[T_{2 n + 1}, T_{2 n + 2})

, the 2nd player makes a move decision according to some probability distribution

p (I (t), S (t))

and a value function

v (I (t), S (t))

with

t = T_{2 n + 2}

. This decision can also either be made rationally by the 2nd player (e.g., via AlphaGo) if the thinking time

τ_{n} (2) \leq c

or be made irrationally (say, according to a given probability distribution

p (2)

) if

τ_{n} (2) > c

. The main difference between AlphaGo Zero and AlphaGo is the efficiency of their internal algorithms. In a real competition, AlphaGo Zero defeated AlphaGo with scores of 100:0. Essentially, AlphaGo Zero is improved AlphaGo by the removal of the dependence of human knowledge (i.e., by removing the supervised learning of policy networks used in AlphaGo); interested readers are referred to Silver et al. [10,11] for more details. After the movement (including pass), the lth player receives a reward

ξ_{n} (l)

. It can be a value of 1 or a random number representing an area he just wins, which can be a naturally surrounding one or the one by obtained by defeating his opponent by the rule of “life” or “mutual life”. Furthermore,

ξ_{n} (l)

can be position

S (t)

-dependent. The process

D_{l} (t)

in Equation (29) with

D_{l} (0) = 0

is called the “loss” process in this game of Go. Besides the

i . i . d .

random time sequence

{τ_{1} (l), τ_{2} (l), \dots}

, it is also associated with a sequence of random loss costs

{ζ_{1} (l), ζ_{2} (l), \dots}

. The cost

ζ_{n} (l)

for each

n \in {1, 2, \dots}

can be a value of zero or a random number representing the area he losses due to the rule of “death”. Moreover,

ζ_{n} (l)

can also be position

S (t)

-dependent.

In this study, we suppose that

τ_{n} (l)

, for each

l \in {1, 2}

and

n \in {1, 2, \dots}

, is exponentially distributed with parameter

λ (l)

. Let

N_{l} (t)

be the total number of moves made by the lth player during time interval

[0, t]

, i.e.,

\begin{matrix} N_{1} (t) & \equiv & max \{n, T_{2 n + 1} \leq t\} \\ = & max \{n, \sum_{i = 1}^{n} τ_{i} (1) \leq t - \sum_{i = 1}^{n - 1} τ_{i}^{c} (2) - \sum_{i = 1}^{n} {\bar{τ}}_{i}^{c} (1)\}, \end{matrix}

(38)

\begin{matrix} N_{2} (t) & \equiv & max \{n, T_{2 n + 2} \leq t\} \\ = & max \{n, \sum_{i = 1}^{n} τ_{i}^{c} (2) \leq t - \sum_{i = 1}^{n} τ_{i}^{c} (1) - \sum_{i = 1}^{n} {\bar{τ}}_{i}^{c} (2)\} . \end{matrix}

(39)

Then, for each

l \in {1, 2}

, the counterpart of the equation in Equation (30) can be written as

\begin{matrix} Q_{l} (t) = \sum_{i = 1}^{N_{l} (t)} ξ_{i} (l) - \sum_{i = 1}^{N_{2 - l + 1} (t)} ζ_{i} (l) . \end{matrix}

(40)

By the exponential distribution assumption and the fact that the thinking times between the two players are independent, it follows from centering operations for the two terms in the right-hand side of Equation (40) that

\begin{matrix} d Q_{l} (t) & = & \int_{Z} η_{l l} (t, u (I (t), Q (t)), z) ν_{l} (d z) d t + \int_{Z} η_{l l} (t, u (I (t), Q (t)), z) {\tilde{N}}_{l} (d t, d z) \\ + \int_{Z} η_{l (2 - l + 1)} (t, u (I (t), Q (t)), z) ν_{2 - l + 1} (d z) d t \\ + \int_{Z} η_{l (2 - l + 1)} (t, u (I (t), Q (t)), z) {\tilde{N}}_{2 - l + 1} (d t, d z) \end{matrix}

(41)

for each

l \in {1, 2}

with

Z = [0, 19 \times 19]

. in Equation (41),

ν_{l} (d z)

for each

l \in {1, 2}

is some Lévy measure. Meanwhile,

{\tilde{N}}_{l} (d t, d z)

is the corresponding compensated Poisson random measure. Since the process

Q (t) = {(Q_{1} (t), Q_{2} (t))}^{'}

at a time t is uniquely determined by

S (t)

, the move decision

u (I (t), Q (t))

in Equation (41) is given by

p (I (t), S (t))

.

The target for a player (say, the lth player) to find such a policy is to reach the final win in this game of Go. In other words, the scores in this game competition should satisfy

Q_{l} (T) > Q_{2 - l + 1} (T)

for each

l \in {1, 2}

, where the notation T is the terminal time for this game. To obtain a specific representation for the value process corresponding to the one in Equation (31), we let

\begin{matrix} H_{l} (Q (T)) & = & I_{{Q_{l} (T) > Q_{2 - l + 1} (T)}}, \end{matrix}

(42)

\begin{matrix} H (Q (T)) & = & {(H_{1} (Q (T)), H_{2} (Q (T)))}^{'} . \end{matrix}

(43)

Furthermore, let

F_{t} = σ {N_{1} (s), N_{2} (s), s \leq t}

. Then, we have that

\begin{matrix} Λ_{l} (t) & = & E [H_{l} (Q (T)) | I (t), S (t)] \\ = & E [H_{l} (Q (T)) | F_{t}] \\ = & H_{l} (Q (T)) - \int_{t}^{T} \int_{Z} {\tilde{Λ}}_{l \cdot} (s^{-}, u (I (s), Q (s)), z) \tilde{N} (d s, d z), \end{matrix}

(44)

where

{\tilde{Λ}}_{l \cdot} = ({\tilde{Λ}}_{l 1}, {\tilde{Λ}}_{l 2})

,

\tilde{N} (d s, d z) = {({\tilde{N}}_{1} (d s, d z), {\tilde{N}}_{2} (d s, d z))}^{'}

. The first equality in Equation (44) is due to the definition of a Markovian decision process, and the second equality in Equation (44) is due to the martingale representation theorem (see page 266 of Applebaum [21] for more details). Thus, based on the state process in Equation (41) and the value process in Equation (44), we can formulate a zero-sum game problem as in Equation (5) for this game of Go. Furthermore, by a direct verification or by approximating through Doob’s functional representation (see, e.g., Lemma 1.13 on page 7 of Kallenberg [48]), the coefficients in Equations (41) and (44) can be assumed to satisfy the generalized Lipschitz and linear growth conditions in Dai [2]. Then, we have the following claim.

Claim 3.

Under the generalized Lipschitz and linear growth conditions in Dai [2], there is a saddle point policy process to the zero-sum game problem in Equations (5)–(2) with

Λ_{1} (0) + Λ_{2} (0) = 1

and subject to the constraints in Equations (41) and (44).

Similar to the explanation for Claim 2, Claim 3 is a special case of Corollary 1. Furthermore, “the generalized Lipschitz and linear growth conditions in Dai [2]” stated in Claim 3 can be explained in the same way as for Claim 2.

3. Proof of Main Theorem

Since the proof heavily depends on the existence and uniqueness of a solution of the

(r, q + 1)

-dimensional FB-SPDEs in Equation (8), we first give a generalized discussion in Section 3.1 concerning the unified SPDE with the hope to be applied to more areas.

3.1. Unique Existence of Solution to the Unified SPDE

Let D be

R^{p}

or a domain in

R^{p}

and assume that there exists a sequence of nondecreasing closed and connected sets

{D_{n}, n \in N}

with

N = {0, 1, \dots}

such that

\begin{matrix} D = ⋃_{n = 0}^{\infty} D_{n} . \end{matrix}

(45)

For each

k \in N

and

l \in {r, q}

, let

C^{k} (D, R^{l})

be a Banach space endowed with the uniform norm as follows:

\begin{matrix} ∥ f ∥_{C^{k} (D, l)}^{2} \equiv \sum_{n = 0}^{\infty} ξ (n + 1) ∥ f ∥_{C^{k} (D_{n}, l)}^{2} . \end{matrix}

(46)

Furthermore, we suppose that this Banach space consists of all continuously differentiable functions f whose derivative orders are up to the integer k. In addition, for a function

ξ (n)

that is a discrete and fast decaying one with respect to each

n \in {0, 1, \dots}

. More precisely, we take

ξ (n)

as follows:

\begin{matrix} ξ (n) = \frac{1}{((n^{10})!) (η (n)!) e^{n}}, η (n) = {[max \{| x_{1} | + \dots + | x_{p} |, x \in D_{n}\}]}^{n} . \end{matrix}

(47)

Note that the notation

[a]

used in Equation (47) denotes the summation of the unity and the integral part of number

a \in R

. Furthermore, we take

\begin{matrix} ∥ f ∥_{C^{k} (D_{n}, l)} = max_{c \in {0, 1, 2, \dots, k}} max_{j \in {1, 2, \dots, r (c)}} sup_{x \in D_{n}} |f_{j}^{(c)} (x)|, \end{matrix}

(48)

where

r (c)

corresponding to a

c \in {0, 1, 2, \dots, k}

denotes the number summation of partial derivatives whose orders are c. Furthermore, we denote

\begin{matrix} f_{r, i_{1} \dots i_{p}}^{(c)} (x) = \frac{\partial^{c} f_{r} (x)}{\partial x_{1}^{i_{1}} \dots \partial x_{p}^{i_{p}}} \end{matrix}

(49)

satisfying

i_{1} + \dots + i_{p} = c

for

i_{l} \in {0, 1, 2, \dots, c}

with

l \in {1, 2, \dots, p}

and

r \in {1, 2, \dots, l}

. Hereafter, each

j \in {1, \dots, r (c)}

is indexed in a way that it corresponds to a p-tuple

(i_{1}, \dots, i_{p})

and a

r \in {1, \dots, l}

, i.e.,

\begin{matrix} f_{i_{1}, \dots, i_{p}}^{(c)} & \equiv & (f_{1, i_{1}, \dots, i_{p}}^{(c)}, \dots, f_{q, i_{1}, \dots, i_{p}}^{(c)}), \end{matrix}

(50)

\begin{matrix} f^{(c)} (x) & \equiv & (f_{1}^{(c)} (x), \dots, f_{r (c)}^{(c)} (x)) . \end{matrix}

(51)

Furthermore, whenever the partial derivative on the boundary

\partial D

is concerned, it is defined in a one-side manner.

Next, let

L_{F}^{2} ([0, T], C^{k} (D; R^{l}))

be the set consisting of

R^{l}

-valued random vector-field processes

Z (t, x)

. Furthermore, we suppose that these vector-field processes are measurable and adapted to the filtration

{F_{t}, t \in [0, T]}

corresponding to a

x \in D

. In the sequel, the “

R^{l}

-valued” is also called “

C^{k} (D; R^{l})

-valued”. In this sense, the vector-field processes

Z (t, x)

are in

C^{k} (D, R^{l})

for a given

t \in [0, T]

), satisfying

\begin{matrix} E [\int_{0}^{T} ∥ Z (t) ∥_{C^{k} (D, l)}^{2} d t] < \infty . \end{matrix}

(52)

In particular, for each

l \in {r, q}

, let

L_{G_{l}}^{2} (Ω, C^{k} (D; R^{l}))

be the set consisting of

R^{l}

-valued random vector-fields

ζ (x)

that are

G_{l}

-measurable with each

x \in D

and satisfy

\begin{matrix} ∥ ζ ∥_{L_{G}^{2} (Ω, C^{k} (D, R^{l}))}^{2} \equiv E [∥ ζ ∥_{C^{k} (D, l)}^{2}] < \infty, \end{matrix}

(53)

where

G_{r} = G

and

G_{q} = F_{T}

. Similarly, let

L_{p}^{2} ([0, T] \times Z^{h},

C^{k} (D, R^{l \times h}))

represent the set consisting of

R^{l \times h}

-valued random vector-field processes denoted by

\tilde{Λ} (t, x, z) =

({\tilde{Λ}}_{1} (t, x, z_{1}),

\dots,

{\tilde{Λ}}_{h} (t, x, z_{h}))

, which are predictable for every fixed point

x \in D

and

z \in Z^{h}

with the corresponding norm as follows:

\begin{matrix} E [\sum_{i = 1}^{h} \int_{0}^{T} \int_{Z} {∥{\tilde{Λ}}_{i} (t, z_{i})∥}_{C^{k} (D, l)}^{2} ν_{i} (d z_{i}) d t] < \infty . \end{matrix}

(54)

Thus, we can define

\begin{matrix} Q_{F}^{2} ([0, T] \times D) & \equiv & L_{F}^{2} ([0, T], C^{k} (D, R^{r})) \\ \times L_{F}^{2} ([0, T], C^{k} (D, R^{q})) \\ \times L_{F, p}^{2} ([0, T], C^{k} (D, R^{q \times d})) \\ \times L_{p}^{2} ([0, T] \times Z^{h}, C^{k} (D, R^{q \times h})) . \end{matrix}

(55)

Finally, let

\begin{matrix} L_{ν}^{2} (Z^{h}, C^{c} (D, R^{q \times h})) \\ \equiv \{\tilde{v} : Z^{h} \to C^{c} (D, R^{q \times h}), \sum_{i = 1}^{h} \int_{Z} {∥{\tilde{v}}_{i} (z_{i})∥}_{C^{c} (D, q)}^{2} ν_{i} (d z_{i}) < \infty\} \end{matrix}

(56)

which is endowed with the norm

\begin{matrix} ∥ \tilde{v} ∥_{D, ν, c}^{2} \equiv \sum_{i = 1}^{h} \int_{Z} {∥{\tilde{v}}_{i} (z_{i})∥}_{C^{c} (D, q)}^{2} λ_{i} ν_{i} (d z_{i}) \end{matrix}

(57)

for any

\tilde{v} \in L_{ν}^{2} (Z^{h}, C^{c} (D, R^{q \times h}))

and

c \in {0, 1, \dots, k}

. Furthermore, define

\begin{matrix} V^{k} (D) & \equiv & C^{k} (D, R^{r}) \\ \times C^{k} (D, R^{q}) \\ \times C^{k} (D, R^{q \times d}) \\ \times {\bar{L}}_{ν}^{2} (Z^{h}, C^{k} (D, R^{q \times h})) . \end{matrix}

(58)

Then, we can impose some conditions to guarantee the existence and uniqueness of a 4-tuple solution of the unified FB-SPDE in Equation (8), which is a strong and adapted solution.

First, for each

A \in {L, \bar{L}, J, \bar{J}}

and any

(u^{i}, v^{i}, {\bar{v}}^{i}, {\tilde{v}}^{i}) \in V^{k} (D)

with

i \in {1, 2}

, we define

\begin{matrix} Δ A (s, x, u^{1}, v^{1}, {\bar{v}}^{1}, {\tilde{v}}^{1}, u^{2}, v^{2}, {\bar{v}}^{2}, {\tilde{v}}^{2}) \\ \equiv & A (s, x, u^{1}, v^{1}, {\bar{v}}^{1}, {\tilde{v}}^{1}) - A (s, x, u^{2}, v^{2}, {\bar{v}}^{2}, {\tilde{v}}^{2}) . \end{matrix}

Then, we assume that the generalized local Lipschitz condition is true almost surely (a.s.),

\begin{matrix} ∥Δ A (s, x, u^{1}, v^{1}, {\bar{v}}^{1}, {\tilde{v}}^{1}, u^{2}, v^{2}, {\bar{v}}^{2}, {\tilde{v}}^{2})∥ \\ \leq & K_{D_{n}} (∥ u^{1} - u^{2} ∥_{C^{k} (D_{n}, r)} + ∥ v^{1} - v^{2} ∥_{C^{k} (D_{n}, q)} + ∥ {\bar{v}}^{1} - {\bar{v}}^{2} ∥_{C^{k} (D_{n}, q d)} + {∥ {\tilde{v}}^{1} - {\tilde{v}}^{2} ∥}_{D_{n}, ν, k}), \end{matrix}

(59)

where the constant

K_{D_{n}} \geq 0

depends on

D_{n}

and may be unbounded as

D_{n} \to D

along

n \in {0, 1, 2, \dots}

. Note that for a vector (or a matrix) A, we use

∥ A ∥

to denote the largest absolute value of its components (or entries). Furthermore, for each

A \in {I, \bar{I}}

, we suppose that

\begin{matrix} \sum_{i = 1}^{h} \int_{Z} {∥Δ A_{i} (s, x, u^{1}, v^{1}, {\bar{v}}^{1}, {\tilde{v}}^{1}, u^{2}, v^{2}, {\bar{v}}^{2}, {\tilde{v}}^{2}, z_{i})∥}^{2} λ_{i} ν_{i} (d z_{i}) \\ \leq & K_{D_{n}} (∥ u^{1} - u^{2} ∥_{C^{k} (D_{n}, r)}^{2} + ∥ v^{1} - v^{2} ∥_{C^{k} (D_{n}, q)}^{2} + ∥ {\bar{v}}^{1} - {\bar{v}}^{2} ∥_{C^{k} (D_{n}, q d)}^{2} + {∥ {\tilde{v}}^{1} - {\tilde{v}}^{2} ∥}_{D_{n}, ν, k}^{2}), \end{matrix}

(60)

where

A_{i}

is the ith column of

A

.

Second, for each

A \in {L, \bar{L}, J, \bar{J}}

and any

(u, v, \bar{v}, \tilde{v}) \in V^{k} (D)

, we assume that the generalized linear growth condition holds

\begin{matrix} ∥A (s, x, u, v, \bar{v}, \tilde{v})∥ \\ \leq & K_{D_{n}} ({∥ u ∥}_{C^{k} (D_{n}, r)} + {∥ v ∥}_{C^{k} (D_{n}, q)} + ∥ \bar{v} ∥_{C^{k} (D_{n}, q d)} + {∥ \tilde{v} ∥}_{D_{n}, ν, k}) . \end{matrix}

(61)

Similarly, for each

A \in {I, \bar{I}}

, we suppose that

\begin{matrix} \sum_{i = 1}^{h} \int_{Z} {∥A_{i} (s, x, u, v, \bar{v}, \tilde{v}, z_{i})∥}^{2} λ_{i} ν (d z_{i}) \\ \leq & K_{D_{n}} ({∥ u ∥}_{C^{k} (D_{n}, r)}^{2} + {∥ v ∥}_{C^{k} (D_{n}, q)}^{2} + ∥ \bar{v} ∥_{C^{k} (D_{n}, q d)}^{2} + {∥ \tilde{v} ∥}_{D_{n}, ν, k}^{2}) . \end{matrix}

(62)

Concerning the reasonability of conditions in Equations (59)–(62), it can be illustrated as follows. If all the concerned partial derivative operators are linear, these conditions are naturally satisfied (see the examples in Dai [14] for numerical simulations). Even if these partial derivative operators are strongly nonlinear, we can still use some approximation techniques to make these conditions useful in some applications (see, e.g., Dai [15]).

Now, let

C^{l}

be the l-dimensional complex Euclidean space and all the related norms are interpreted in the corresponding complex-valued sense. Then, we can present a proposition as follows.

Proposition 1.

Suppose that

(G, H) \in L_{G}^{2} (Ω, C^{k} (D; C^{r})) \times L_{F_{T}}^{2} (Ω, C^{k} (D; C^{q}))

and conditions in Equations (59)–(62) are true. Furthermore, assume that each

A \in {L, \bar{L}, J, \bar{J}, I, \bar{I}}

is

{F_{t}}

-adapted for every fixed

x \in D

,

z \in Z^{h}

, and any given

(u, v, \bar{v}, \tilde{v}) \in V^{k} (D)

with

\begin{matrix} L (\cdot, x, 0) \in L_{F}^{2} ([0, T], C^{k} (D, C^{r})), \end{matrix}

(63)

\begin{matrix} J (\cdot, x, 0) \in L_{F}^{2} ([0, T], C^{k} (D, C^{r \times d})), \end{matrix}

(64)

\begin{matrix} I (\cdot, x, 0, \cdot) \in L_{F}^{2} ([0, T] \times Z^{h}, C^{k} (D, C^{r \times h})), \end{matrix}

(65)

\begin{matrix} \bar{L} (\cdot, x, 0) \in L_{F}^{2} ([0, T], C^{k} (D, C^{q})), \end{matrix}

(66)

\begin{matrix} \bar{J} (\cdot, x, 0) \in L_{F}^{2} ([0, T], C^{k} (D, C^{q \times d})), \end{matrix}

(67)

\begin{matrix} \bar{I} (\cdot, x, 0, \cdot) \in L_{F}^{2} ([0, T] \times Z^{h}, C^{k} (D, C^{q \times h})) . \end{matrix}

(68)

Then, there uniquely exists a 4-tuple solution of the unified FB-SPDE in Equation (8), which is a strong and adapted solution, i.e.,

\begin{matrix} (Υ, Λ, \bar{Λ}, \tilde{Λ}) \in Q_{F}^{2} ([0, T] \times D), \end{matrix}

(69)

and

(Υ, Λ) (\cdot, x)

is càdlàg for each

x \in D

almost surely (a.s.).

Note that in Proposition 1, the forward equation is r-dimensional, i.e.,

Υ = {(Υ_{1}, \dots, Υ_{r})}^{'}

. Meanwhile, the backward equation is q-dimensional, i.e.,

Λ = {(Λ_{1}, \dots, Λ_{q})}^{'}

. This type of vector SPDE exists in real-world applications such as color image processing and multi-mode generative AI. The existence and uniqueness of such a vector solution proved in the proposition can guarantee the image processing and recovery in a multi-channel computer vision and network system. For example, a color image can typically be represented by a vector PDE (see, e.g., Caselles et al. [23], Tschumperlé and Deriche [24,25,26]). In the image forward processing by noise encoding, it corresponds to a forward vector SPDE. The added noise can be a Gaussian noise or a non-Gaussian noise. In the Gaussian noise case, it corresponds to Brownian motion. In the non-Gaussian noise case, it corresponds to a Lévy process. Similarly, in the backward image processing by noise decoding for color image recovery, it corresponds to a backward vector SPDE. Furthermore, the forward image processing and backward image processing can be in a coupled way. In addition, the current hot area concerning multi-mode generative AI can also be explained in the same way (see, e.g., Kratsios [27], Peluchetti [28], and Vaswani et al. [29] for more details) as introduced in the introduction of this paper. Finally, for the convenience of presentation, in the next subsection, we first prove our main theorem (Theorem 1) in Section 3.2 by assuming the truth of Proposition 1. Then, we prove Proposition 1 formally in Section 3.3.

3.2. Proof of Theorem 1

Proof.

First, if the claim in Proposition 1 is true, we can prove a general q-dimensional vector-form It

\hat{o}

-Ventzell formula with Lévy jumps. More precisely, consider the operators

{\bar{L}, \bar{J}, \bar{I}}

given in Proposition 1 and suppose

Λ (t, x)

is a solution of the q-dimensional vector-form B-SPDE with Lévy jumps:

\begin{matrix} d Λ (t, x) & = & \bar{L} (t^{-}, x, Υ, Λ, \bar{Λ}, \tilde{Λ}, u) d t + \bar{J} (t^{-}, x, Υ, Λ, \bar{Λ}, \tilde{Λ}, u) d W (t) \\ + \int_{Z^{h}} \bar{I} (t^{-}, x, Υ, Λ, \bar{Λ}, \tilde{Λ}, u, z) \tilde{N} (λ d t, d z) . \end{matrix}

(70)

Furthermore, assume that X is a solution of the p-dimensional vector-form F-SDE given by

\begin{matrix} d X (t) & = & b (t^{-}, X, Λ, \bar{Λ}, \tilde{Λ}, u) d t + σ (t^{-}, X, Λ, \bar{Λ}, \tilde{Λ}, u) d W (t) \\ + \int_{Z^{h}} η (t^{-}, X, Λ, \bar{Λ}, \tilde{Λ}, u, z) \tilde{N} (d t, d z) . \end{matrix}

(71)

In addition, for a

t \in [0, T]

, let

\begin{matrix} Λ (t) \equiv Λ (t, X (t)) . \end{matrix}

(72)

Then, by applying It

\hat{o}

’s formula presented in Theorem 1.16 of ksendal and Sulem [53], we can extend the It

\hat{o}

-Ventzell formula with Lévy jumps in single-dimensional case (see, e.g., Øksendal and Zhang [54]) to the general q-dimensional vector-form situation as follows:

\begin{matrix} d Λ (t) & = & {\bar{L} (t^{-}, X) + \sum_{i = 1}^{p} \frac{\partial Λ (t, X (t))}{\partial x_{i}} b_{i} (t^{-}, X) \\ + \sum_{i = 1}^{p} \frac{\partial \bar{J} (t^{-}, X)}{\partial x_{i}} {(σ^{'} (t^{-}, X))}_{i} \\ + \frac{1}{2} \sum_{i, j = 1}^{p} \frac{\partial Λ^{2} (t, X (t))}{\partial x_{i} \partial x_{j}} {((σ (t^{-}, X) σ^{'} (t^{-}, X)))}_{i j} \\ + \sum_{j = 1}^{h} \int_{z_{j} \in Z} (Λ (t^{-}, X (t^{-}) + η_{j} (t^{-}, X, z_{j})) - Λ (t^{-}, X (t^{-})) \\ - \sum_{i = 1}^{p} \frac{\partial Λ (t^{-}, X (t^{-}))}{\partial x_{i}} η_{j}^{i} (t^{-}, X, z_{j}) \\ + {\bar{I}}_{j} (t^{-}, X + η_{j} (X, z_{j}), z_{j}) - {\bar{I}}_{j} (t^{-}, X, z_{j})) λ_{j} ν_{j} (d z_{j})} d t \\ + \{\bar{J} (t^{-}, X) + \sum_{i = 1}^{p} \frac{\partial Λ (t^{-}, X (t^{-}))}{\partial x_{i}} b_{i} (t^{-}, X)\} d W (t) \\ + \sum_{j = 1}^{h} \int_{z_{j} \in Z} {Λ (t^{-}, X (t^{-}) + η_{j} (t^{-}, X, z_{j})) - Λ (t^{-}, X (t^{-})) \\ + {\bar{I}}_{j} (t^{-}, X + η_{j} (X, z_{j}), z_{j})} {\tilde{N}}_{j} (λ_{j} d t, d z_{j}), \end{matrix}

(73)

where

{(σ^{'})}_{i}

is the

i t h

column of the transpose of matrix

σ

, and

η_{j}^{i}

is the

i t h

component of vector

η_{j}

, etc. Furthermore, if

f \in {b, σ, \bar{L}, \bar{J}}

, it is given by

\begin{matrix} f (t, X) \equiv f (t, X, Λ, \bar{Λ}, \tilde{Λ}, u), \end{matrix}

and if

f \in {η, \bar{I}}

, it is represented by

\begin{matrix} f_{j} (t, X, z_{j}) \equiv f_{j} (t, X, Λ, \bar{Λ}, \tilde{Λ}, u, z_{j}) . \end{matrix}

Second, we show that there uniquely exists a 4-tuple strong solution

(X (t)

,

(Λ (t)

,

\bar{Λ} (t)

,

\tilde{Λ} (t, z))

of the system in Equation (15), which is a strong and adapted solution. Furthermore, it corresponds to a given control process

u \in C

and has the relationship in Equations (26)–(28).

In fact, by Proposition 1, there uniquely exists an adapted 4-tuple strong solution of the

(r, q + 1)

-dimensional coupled SPDEs in Equation (8), which corresponds to specific

{\bar{L}, \bar{J}, \bar{I}}

in Equations (17)–(22), the terminal condition in Equation (23), and the control process

u \in C

. For convenience, we use

(Υ (t, x)

,

Λ (t, x)

,

\bar{Λ} (t, x)

,

\tilde{Λ} (t, x, \cdot))

to denote this solution. Thus, by It

\hat{o}

’s-Ventzell formula in Equation (73), we have that

\begin{matrix} (Λ (t), \bar{Λ} (t), \tilde{Λ} (t, \cdot)) \equiv (Λ (t, X (t)), \bar{Λ} (t, X (t)), \tilde{Λ} (t, X (t), \cdot)) \end{matrix}

is the required solution to the system in Equation (15).

Finally, by combining the above study and the discussion on the single-dimensional case (i.e.,

p = q = 1

) for the related optimal control problem in Øksendal et al. [20], we can reach a proof for our general vector-form game problem as stated in Theorem 1. □

3.3. Proof of Proposition 1

As the first step, we prove a version of Proposition 1 under more strict conditions. In doing so, we assume that D is a closed domain in

R^{p}

and

C^{\infty} (D, R^{l})

is the Banach space

\begin{matrix} C^{\infty} (D, R^{l}) \equiv \{f \in ⋂_{c = 0}^{\infty} C^{c} (D, R^{l}), {∥ f ∥}_{C^{\infty} (D, l)} < \infty\}, \end{matrix}

(74)

where

\begin{matrix} {∥ f ∥}_{C^{\infty} (D, q)}^{2} = \sum_{c = 0}^{\infty} ξ (c) {∥ f ∥}_{C^{c} (D, l)}^{2} . \end{matrix}

(75)

Furthermore, as in Equation (47), we take

ξ (c) = \frac{1}{((c^{10})!) (η (c)!) e^{c}}

with

\begin{matrix} η (c) = {[max \{| x_{1} | + \dots + | x_{p} |, x \in D\}]}^{c} . \end{matrix}

Next, let

L_{F}^{2} ([0, T], C^{\infty} (D; R^{l}))

be the set consisting of

R^{l}

-valued random vector-field processes

Z (t, x)

. Furthermore, we suppose that these vector-field processes are measurable and adapted to the filtration

{F_{t}, t \in [0, T]}

corresponding to an

x \in D

. Hereafter, the “

R^{l}

-valued” is also called “

C^{\infty} (D; R^{l})

-valued”. In this sense, the vector-field processes

Z (t, x)

are in

C^{\infty} (D, R^{l})

for a given

t \in [0, T]

), satisfying

\begin{matrix} E [\int_{0}^{T} ∥ Z (t) ∥_{C^{\infty} (D, l)}^{2} d t] < \infty . \end{matrix}

(76)

In particular, let

L_{G_{l}}^{2} (Ω, C^{\infty} (D; R^{l}))

with

l \in {r, q}

represent the set consisting of

R^{l}

-valued random vector-fields

ζ (x)

that are

G_{l}

-measurable for each

x \in D

and satisfy

\begin{matrix} ∥ ζ ∥_{L_{G}^{2} (Ω, C^{\infty} (D, R^{l}))}^{2} \equiv E [∥ ζ ∥_{C^{\infty} (D, l)}^{2}] < \infty, \end{matrix}

(77)

where

G_{r} = G

and

G_{q} = F_{T}

. In addition, let

L_{p}^{2} ([0, T] \times Z^{h},

C^{\infty} (D, R^{l \times h}))

represent the set consisting of

R^{l \times h}

-valued random vector-field processes. Furthermore, these vector-field processes are denoted by

\tilde{Λ} (t, x, z) =

({\tilde{Λ}}_{1} (t, x, z_{1}),

\dots,

{\tilde{Λ}}_{h} (t, x, z_{h}))

, which are predictable for every fixed

x \in D

and

z \in Z^{h}

with the corresponding norm as follows:

\begin{matrix} E [\sum_{i = 1}^{h} \int_{0}^{T} \int_{Z} {∥{\tilde{Λ}}_{i} (t, z_{i})∥}_{C^{\infty} (D, l)}^{2} ν_{i} (d z_{i}) d t] < \infty . \end{matrix}

(78)

Thus, we can define

\begin{matrix} Q_{F}^{2} ([0, T] \times D) & \equiv & L_{F}^{2} ([0, T], C^{\infty} (D, R^{r})) \\ \times L_{F}^{2} ([0, T], C^{\infty} (D, R^{q})) \\ \times L_{F, p}^{2} ([0, T], C^{\infty} (D, R^{q \times d})) \\ \times L_{p}^{2} ([0, T] \times Z^{h}, C^{\infty} (D, R^{q \times h})) . \end{matrix}

(79)

Finally, let

\begin{matrix} L_{ν}^{2} (Z^{h}, C^{c} (D, R^{q \times h})) \\ \equiv \{\tilde{v} : Z^{h} \to C^{c} (D, R^{q \times h}), \sum_{i = 1}^{h} \int_{Z} {∥{\tilde{v}}_{i} (z_{i})∥}_{C^{c} (D, q)}^{2} ν_{i} (d z_{i}) < \infty\} \end{matrix}

(80)

which is endowed with the norm

\begin{matrix} ∥ \tilde{v} ∥_{ν, c}^{2} \equiv \sum_{i = 1}^{h} \int_{Z} {∥{\tilde{v}}_{i} (z_{i})∥}_{C^{c} (D, q)}^{2} λ_{i} ν_{i} (d z_{i}) \end{matrix}

(81)

for any

\tilde{v} \in L_{ν}^{2} (Z^{h}, C^{c} (D, R^{q \times h}))

and

c \in {0, 1, \dots, \infty}

. Furthermore, define

\begin{matrix} V^{\infty} (D) & \equiv & C^{\infty} (D, R^{r}) \\ \times C^{\infty} (D, R^{q}) \\ \times C^{\infty} (D, R^{q \times d}) \\ \times {\bar{L}}_{ν}^{2} (Z^{h}, C^{\infty} (D, R^{q \times h})) . \end{matrix}

(82)

Then, we impose some additional conditions to the unified system in Equation (8).

First, for each

A \in {L, \bar{L}, J, \bar{J}}

, every

c \in {0, 1, 2, \dots,}

, and any

(u^{i}, v^{i}, {\bar{v}}^{i}, {\tilde{v}}^{i}) \in V^{\infty} (D)

with

i \in {1, 2}

, we define

\begin{matrix} Δ A^{(c)} (s, x, u^{1}, v^{1}, {\bar{v}}^{1}, {\tilde{v}}^{1}, u^{2}, v^{2}, {\bar{v}}^{2}, {\tilde{v}}^{2}) \\ \equiv A^{(c)} (s, x, u^{1}, v^{1}, {\bar{v}}^{1}, {\tilde{v}}^{1}) - A^{(c)} (s, x, u^{2}, v^{2}, {\bar{v}}^{2}, {\tilde{v}}^{2}) . \end{matrix}

Then, we assume that the corresponding local Lipschitz condition is true a.s. along

c \in {0, 1, 2, \dots}

:

\begin{matrix} ∥Δ A^{(c + l + o)} (s, x, u^{1}, v^{1}, {\bar{v}}^{1}, {\tilde{v}}^{1}, u^{2}, v^{2}, {\bar{v}}^{2}, {\tilde{v}}^{2})∥ \\ \leq K_{D, c} (∥ u^{1} - u^{2} ∥_{C^{k + c} (D, r)} + {∥ v^{1} - v^{2} ∥}_{C^{k + c} (D, q)} \\ + ∥ {\bar{v}}^{1} - {\bar{v}}^{2} ∥_{C^{k + c} (D, q d)} + {∥ {\tilde{v}}^{1} - {\tilde{v}}^{2} ∥}_{ν, k + c}) . \end{matrix}

(83)

where

K_{D, c} \geq 0

in Equation (83) is a constant corresponding to a fixed

c \in {0, 1, 2, \dots}

. However, in contrast to the constant assumed in Equation (59), here it depends on not only D (the given domain) but also the order c of the associated derivatives. Furthermore, it can be unbounded when c tends to infinity or D tends to

R^{p}

. In addition, the integer

l \in {0, 1, 2}

represents the lth partial derivative order of

Δ A^{(c)} (s, x, u, v, \bar{v}, \tilde{v})

with respect to t (the time variable). Meanwhile, the integer

o \in {0, 1, 2}

represents the oth partial derivative order of

Δ A^{(c + l)} (s, x, u, v, \bar{v}, \tilde{v})

with respect to a component of u, v,

\bar{v}

, or

\tilde{v}

. In the end, for each

A \in {I, \bar{I}}

, we suppose that

\begin{matrix} \sum_{i = 1}^{h} \int_{Z} {∥Δ A_{i}^{(c + l + o)} (s, x, u^{1}, v^{1}, {\bar{v}}^{1}, {\tilde{v}}^{1}, u^{2}, v^{2}, {\bar{v}}^{2}, {\tilde{v}}^{2}, z_{i})∥}^{2} λ_{i} ν_{i} (d z_{i}) \\ \leq K_{D, c} (∥ u^{1} - u^{2} ∥_{C^{k + c} (D, r)}^{2} + {∥ v^{1} - v^{2} ∥}_{C^{k + c} (D, q)}^{2} \\ + ∥ {\bar{v}}^{1} - {\bar{v}}^{2} ∥_{C^{k + c} (D, q d)}^{2} + {∥ {\tilde{v}}^{1} - {\tilde{v}}^{2} ∥}_{ν, k + c}^{2}), \end{matrix}

(84)

where

A_{i}

is the ith column of

A

.

Second, for each

A \in {L, \bar{L}, J, \bar{J}}

, every

c \in {0, 1, 2, \dots,}

, and any

(u, v, \bar{v}, \tilde{v}) \in V^{\infty} (D)

, we suppose that the corresponding local linear growth condition holds

\begin{matrix} ∥A^{(c + l + o)} (s, x, u, v, \bar{v}, \tilde{v})∥ \\ \leq K_{D, c} (δ_{0 c} + {∥ u ∥}_{C^{k + c} (D, r)} + {∥ v ∥}_{C^{k + c} (D, q)} \\ + ∥ \bar{v} ∥_{C^{k + c} (D, q d)} + {∥ \tilde{v} ∥}_{ν, k + c}), \end{matrix}

(85)

where

δ_{0 c} = 1

if

c = 0

and

δ_{0 c} = 0

if

c > 0

. Similarly, for each

A \in {I, \bar{I}}

, we suppose that

\begin{matrix} \sum_{i = 1}^{h} \int_{Z} {∥A_{i}^{(c + l + o)} (s, x, u, v, \bar{v}, \tilde{v}, z_{i})∥}^{2} λ_{i} ν (d z_{i}) \\ \leq K_{D, c} (δ_{0 c} + {∥ u ∥}_{C^{k + c} (D, r)}^{2} + {∥ v ∥}_{C^{k + c} (D, q)}^{2} \\ + ∥ \bar{v} ∥_{C^{k + c} (D, q d)}^{2} + {∥ \tilde{v} ∥}_{ν, k + c}^{2}) . \end{matrix}

(86)

Then, we have the following claim.

Claim 4.

Suppose that

(G, H) \in L_{G}^{2} (Ω, C^{\infty} (D; R^{r})) \times L_{F_{T}}^{2} (Ω, C^{\infty} (D; R^{q}))

and conditions in Equations (83)–(86) are true. Furthermore, assume that each

A \in {L, \bar{L}, J, \bar{J}, I, \bar{I}}

is

{F_{t}}

-adapted for every fixed

x \in D

,

z \in Z^{h}

, and any given

(u, v, \bar{v}, \tilde{v}) \in V^{\infty} (D)

with

\begin{matrix} L (\cdot, x, 0) \in L_{F}^{2} ([0, T], C^{\infty} (D, R^{r})), \end{matrix}

(87)

\begin{matrix} J (\cdot, x, 0) \in L_{F}^{2} ([0, T], C^{\infty} (D, R^{r \times d})), \end{matrix}

(88)

\begin{matrix} I (\cdot, x, 0, \cdot) \in L_{F}^{2} ([0, T] \times Z^{h}, C^{\infty} (D, R^{r \times h})), \end{matrix}

(89)

\begin{matrix} \bar{L} (\cdot, x, 0) \in L_{F}^{2} ([0, T], C^{\infty} (D, R^{q})), \end{matrix}

(90)

\begin{matrix} \bar{J} (\cdot, x, 0) \in L_{F}^{2} ([0, T], C^{\infty} (D, R^{q \times d})), \end{matrix}

(91)

\begin{matrix} \bar{I} (\cdot, x, 0, \cdot) \in L_{F}^{2} ([0, T] \times Z^{h}, C^{\infty} (D, R^{q \times h})) . \end{matrix}

(92)

Then, there uniquely exists a 4-tuple solution of the unified FB-SPDE in Equation (8), which is a strong and adapted solution, i.e.,

\begin{matrix} (Υ, Λ, \bar{Λ}, \tilde{Λ}) \in Q_{F}^{2} ([0, T] \times D), \end{matrix}

(93)

and

(Υ, Λ) (\cdot, x)

is càdlàg for each

x \in D

almost surely (a.s.).

The proof of Claim 4 is divided into the following two parts due to its length. In the first part of the proof of Claim 4, we prove the following two lemmas.

Lemma 1.

Under the conditions in Claim 4, if we take a quadruplet for every fixed

x \in D

and

z \in Z^{h}

as follows,

\begin{matrix} (Υ^{1} (\cdot, x), Λ^{1} (\cdot, x), {\bar{Λ}}^{1} (\cdot, x), {\tilde{Λ}}^{1} (\cdot, x, z)) \in Q_{F}^{2} ([0, T] \times D) . \end{matrix}

(94)

then there exists another quadruplet

(Υ^{2} (\cdot, x), Λ^{2} (\cdot, x), {\bar{Λ}}^{2} (\cdot, x), {\tilde{Λ}}^{2} (\cdot, x, z))

such that

\begin{matrix} \{\begin{matrix} Υ^{2} (t, x) & = G (x) + \int_{0}^{t} L (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d s \\ + \int_{0}^{t} J (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d W (s) \\ + \int_{0}^{t} \int_{Z^{h}} I (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z) \tilde{N} (λ d s, d z), \\ Λ^{2} (t, x) & = H (x) + \int_{t}^{T} \bar{L} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d s \\ + \int_{t}^{T} (\bar{J} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \\ + {\bar{Λ}}^{1} (s^{-}, x) - {\bar{Λ}}^{2} (s^{-}, x)) d W (s) \\ + \int_{t}^{T} \int_{Z^{h}} (\bar{I} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z) \\ + {\tilde{Λ}}^{1} (s^{-}, x, z) - {\tilde{Λ}}^{2} (s^{-}, x, z)) \tilde{N} (λ d s, d z), \end{matrix} \end{matrix}

(95)

where for each

s \in [0, T]

and

z \in Z^{h}

,

\begin{matrix} \tilde{N} (λ d s, d z) & \equiv & {({\tilde{N}}_{1} (λ_{1} d s, d z_{1}), \dots, {\tilde{N}}_{h} (λ_{h} d s, d z_{h}))}^{'}, \end{matrix}

(96)

\begin{matrix} {\tilde{N}}_{i} (λ_{i} d s, d z_{i}) & = & N_{i} (λ_{i} d s, d z_{i}) - λ_{i} d s ν_{i} (d z_{i}), i \in {1, \dots, h} . \end{matrix}

(97)

Furthermore,

(Υ^{2}, Λ^{2})

is a pair of

{F_{t}}

-adapted càdlàg processes. Meanwhile,

({\bar{Λ}}^{2}, {\tilde{Λ}}^{2})

is the pair of its associated predictable processes. In addition, for each

x \in D

,

\begin{matrix} E [\int_{0}^{T} ∥ Υ^{2} (t, x) ∥^{2} d t] < \infty, \end{matrix}

(98)

\begin{matrix} E [\int_{0}^{T} ∥ Λ^{2} (t, x) ∥^{2} d t] < \infty, \end{matrix}

(99)

\begin{matrix} E [\int_{0}^{T} ∥ {\bar{Λ}}^{2} (t, x) ∥^{2} d t] < \infty, \end{matrix}

(100)

\begin{matrix} E [\sum_{i = 1}^{h} \int_{0}^{T} \int_{Z} {∥{\tilde{Λ}}_{i}^{2} (t, x, z_{i})∥}^{2} ν_{i} (d z_{i}) d t] < \infty . \end{matrix}

(101)

Proof.

Consider a point

x \in D

and a quadruplet as in Equation (94). By the conditions in Equations (83)–(92), we have that

\begin{matrix} L (\cdot, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \in L_{F}^{2} ([0, T], C^{\infty} (D, R^{r})), \end{matrix}

(102)

\begin{matrix} J (\cdot, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \in L_{F}^{2} ([0, T], C^{\infty} (D, R^{r \times d})), \end{matrix}

(103)

\begin{matrix} I (\cdot, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \in L_{F}^{2} ([0, T] \times Z^{h}, C^{\infty} (D, R^{r \times h})), \end{matrix}

(104)

\begin{matrix} \bar{L} (\cdot, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \in L_{F}^{2} ([0, T], C^{\infty} (D, R^{q})), \end{matrix}

(105)

\begin{matrix} \bar{J} (\cdot, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \in L_{F}^{2} ([0, T], C^{\infty} (D, R^{q \times d})), \end{matrix}

(106)

\begin{matrix} \bar{I} (\cdot, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \in L_{F}^{2} ([0, T] \times Z^{h}, C^{\infty} (D, R^{q \times h})) . \end{matrix}

(107)

By considering

L

,

J

, and

I

in Equations (102)–(104) as new and starting with

L (\cdot, x, 0, 0, 0, 0)

,

J (\cdot, x, 0, 0, 0, 0)

, and

I (\cdot, x, 0, 0, 0, 0)

, we can define

Υ^{2}

by the forward iteration in Equation (95). Furthermore,

Υ^{2}

is an

{F_{t}}

-adapted càdlàg process satisfying the condition in Equation (98).

Now, consider

\bar{L}

,

\bar{J}

, and

\bar{I}

in Equations (105)–(107) as new and starting at

\bar{L} (\cdot, x, 0, 0, 0, 0)

,

\bar{J} (\cdot, x, 0, 0, 0, 0)

, and

\bar{I} (\cdot, x, 0, 0, 0, 0)

. Then, by the martingale representation theorem stated in Theorem 5.3.5 of Applebaum [21], we know that there are unique predictable processes

{\bar{Λ}}^{2} (\cdot, x)

and

{\tilde{Λ}}^{2} (\cdot, x, z)

satisfying

\begin{matrix} {\hat{Λ}}^{2} (t, x) \\ \equiv & E [H (x) + \int_{0}^{T} \bar{L} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d s \\ + \int_{0}^{T} (\bar{J} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) + {\bar{Λ}}^{1} (s^{-}, x)) d W (s) \\ + \int_{0}^{T} \int_{Z} (\bar{I} (s^{-}, x, Υ^{1}, V^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z) + {\tilde{Λ}}^{1} (s^{-}, x, z)) \tilde{N} (λ d s, d z)| F_{t}] \\ = & {\hat{Λ}}^{2} (0, x) + \int_{0}^{t} {\bar{Λ}}^{2} (s^{-}, x) d W (s) + \int_{0}^{t} \int_{Z} {\tilde{Λ}}^{2} (s^{-}, x, z) \tilde{N} (λ d s, d z) . \end{matrix}

(108)

Furthermore,

{\bar{Λ}}^{2}

and

{\tilde{Λ}}^{2}

satisfy the conditions in Equations (100)–(101) and the following relationship:

\begin{matrix} {\hat{Λ}}^{2} (0, x) \\ = & {\hat{Λ}}^{2} (T, x) - \int_{0}^{T} {\bar{Λ}}^{2} (s^{-}, x) d W (s) - \int_{0}^{T} \int_{Z} {\tilde{Λ}}^{2} (s^{-}, x, z) \tilde{N} (λ d s, d z) \\ = & H (x) + \int_{0}^{T} \bar{L} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d s \\ + \int_{0}^{T} (\bar{J} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) + {\bar{Λ}}^{1} (s^{-}, x) - {\bar{Λ}}^{2} (s^{-}, x)) d W (s) \\ + \int_{0}^{T} \int_{Z} (\bar{I} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z) + {\tilde{Λ}}^{1} (s^{-}, x, z) - {\tilde{Λ}}^{2} (s^{-}, x, z)) \tilde{N} (λ d s, d z) . \end{matrix}

(109)

Thus, it follows from the discussion in page 8 of Protter [55] that we can take

{\hat{Λ}}^{2} (\cdot, x)

as a càdlàg process. Furthermore, let

Λ^{2}

be a process defined by

\begin{matrix} Λ^{2} (t, x) & = & E [H (x) + \int_{t}^{T} \bar{L} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d s \\ + \int_{t}^{T} (\bar{J} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) + {\bar{Λ}}^{1} (s^{-}, x)) d W (s) \\ + \int_{t}^{T} \int_{Z} (\bar{I} (s^{-}, x, Υ^{1}, Λ^{1}, z) + {\tilde{Λ}}^{1} (s^{-}, x, z)) \tilde{N} (λ d s, d z)| F_{t}] . \end{matrix}

(110)

Therefore, by Equations (85)–(86) and simple computation, we can conclude that

Λ^{2} (\cdot, x)

satisfies the condition in Equation (99). In addition, it follows from Equations (108)–(110) that

\begin{matrix} Λ^{2} (t, x) & = & {\hat{V}}^{2} (t, x) - \int_{0}^{t} \bar{L} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d s \\ - \int_{0}^{t} (\bar{J} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) + {\bar{Λ}}^{1} (s^{-}, x)) d W (s) \\ - \int_{0}^{t} \int_{Z} (\bar{I} (s^{-}, x, Υ^{1}, Λ^{1}, z) + {\tilde{Λ}}^{1} (s^{-}, x, z)) \tilde{N} (λ d s, d z), \end{matrix}

(111)

which implies that the process

Λ^{2} (\cdot, x)

is a càdlàg one.

Hence, corresponding to a quadruplet in Equation (94), it follows from Equations (108), (109), and (111) that the associated quadruplet (

Υ^{2} (\cdot, x)

,

Λ^{2} (\cdot, x),

{\bar{Λ}}^{2} (\cdot, x)

,

{\tilde{Λ}}^{2} (\cdot, x, z)

) satisfies the system in Equation (95) of the lemma.

Furthermore, we can conclude that

\begin{matrix} Λ^{2} (t, x) \\ \equiv & Λ^{2} (0, x) - \int_{0}^{t} \bar{L} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{2}) d s \\ - \int_{0}^{t} (\bar{J} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) + {\bar{Λ}}^{1} (s^{-}, x) - {\bar{Λ}}^{2} (s^{-}, x)) d W (s) \\ - \int_{0}^{t} \int_{Z} (\bar{I} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{-}, {\tilde{Λ}}^{1}, z) + {\tilde{Λ}}^{1} (s^{-}, x, z) - {\tilde{V}}^{2} (s^{-}, x, z)) \tilde{N} (λ d s, d z) . \end{matrix}

(112)

Thus, we reach a proof for Lemma 1. □

Lemma 2.

Under the conditions of Claim 4 and for a quadruplet introduced in Equation (94) with

x \in D

and

z \in Z^{h}

, let

(Υ (t, x), Λ (t, x), \bar{Λ} (t, x), \tilde{Λ} (t, x, z))

be given by Equation (95). Then,

(Υ^{(c)} (\cdot, x)

,

Λ^{(c)} (\cdot, x)

,

{\bar{Λ}}^{(c)} (\cdot, x)

,

{\tilde{Λ}}^{(c)} (\cdot, x, z))

for every

c \in {0, 1, \dots,}

a.s. exists such that

\begin{matrix} \{\begin{matrix} Υ_{i_{1} \dots i_{p}}^{(c)} (t, x) & = G_{i_{1} \dots i_{p}}^{(c)} (x) + \int_{0}^{t} L_{i_{1} \dots i_{p}}^{(c)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d s \\ + \int_{0}^{t} J_{i_{1} \dots i_{p}}^{(c)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d W (s) \\ + \int_{0}^{t} \int_{Z} I_{i_{1} \dots i_{p}}^{(c)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z) \tilde{N} (λ d s, d z), \\ V_{i_{1} \dots i_{p}}^{(c)} (t, x) & = H_{i_{1} \dots i_{p}}^{(c)} (x) + \int_{t}^{T} {\bar{L}}_{i_{1} \dots i_{p}}^{(c)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) d s \\ + \int_{t}^{T} ({\bar{J}}_{i_{1} \dots i_{p}}^{(c)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \\ + {\bar{Λ}}_{i_{1} \dots i_{p}}^{1, (c)} (s^{-}, x) - {\bar{Λ}}_{i_{1} \dots i_{p}}^{(c)} (s^{-}, x)) d W (s) \\ + \int_{t}^{T} \int_{Z} ({\bar{I}}_{i_{1} \dots i_{p}}^{(c)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z) \\ + {\tilde{Λ}}_{i_{1} \dots i_{p}}^{1, (c)} (s^{-}, x, z) - {\tilde{Λ}}_{i_{1} \dots i_{p}}^{(c)} (s^{-}, x, z)) \tilde{N} (λ d s, d z), \end{matrix} \end{matrix}

(113)

where

i_{1} + \dots + i_{p} = c

for each

i_{l} \in {0, 1, 2, \dots, c}

and

l \in {1, 2, \dots, p}

.

(Υ_{i_{1} \dots i_{p}}^{(c)}

,

Λ_{i_{1} \dots i_{p}}^{(c)})

corresponding to an integer

c \in {0, 1, 2, \dots}

is an

{F_{t}}

-adapted càdlàg process. Furthermore,

({\bar{Λ}}_{i_{1} \dots i_{p}}^{(c)}

,

{\tilde{Λ}}_{i_{1} \dots i_{p}}^{(c)})

is its associated predictable processes. All of them satisfy the conditions in Equations (99)–(101).

Proof.

Without loss of generality, we only consider an interior point x of D. Otherwise, we may employ the associate one-side derivative to replace the one in the following proof.

First, we prove the claim in the lemma to hold for

c = 1

. To do so, for a fixed

t \in [0, T], x \in D, z \in Z^{h}

, and

(Υ^{1} (t, x), Λ^{1} (t, x), {\bar{Λ}}^{1} (t, x), {\tilde{Λ}}^{1} (t, x, z))

as given in the lemma, define

\begin{matrix} (Υ_{i_{l}}^{(1)} (t, x), Λ_{i_{l}}^{(1)} (t, x), {\bar{Λ}}_{i_{l}}^{(1)} (t, x), {\tilde{Λ}}_{i_{l}}^{(1)} (t, x, z)) \end{matrix}

(114)

by Equation (95). However, we replace each generalized operator

A \in {L, J, I, \bar{L}, \bar{J}, \bar{I}}

by its corresponding first-order partial derivative

\begin{matrix} A_{i_{l}}^{(1)} \in \{L_{i_{l}}^{(1)}, J_{i_{l}}^{(1)}, I_{i_{l}}^{(1)}, {\bar{L}}_{i_{l}}^{(1)}, {\bar{J}}_{i_{l}}^{(1)}, {\bar{I}}_{i_{l}}^{(1)}\} \end{matrix}

with respect to

x_{l}

for

l \in {1, \dots, p}

if

i_{l} = 1

. Then, we can show that the quadruplet defined in Equation (114) corresponding to an integer l is the required first order of the partial derivative of

(Υ, Λ, \bar{Λ}, \tilde{Λ})

introduced in Equation (95) corresponding to the given

(Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})

.

In fact, for an interior point x of D, we can take constant

δ

that is sufficiently small such that

x + δ e_{l} \in D

. Furthermore, the notation

e_{l}

denotes the unit vector where only its lth component is the unity and other components are all zeroes. Without loss of generality, we can take

δ > 0

. Thus, for a function

f \in {Υ, Λ, \bar{Λ}, \tilde{Λ}, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}}

corresponding to each

i_{l} = 1

and

l \in {1, 2, \dots, p}

, we let

\begin{matrix} f_{i_{l}, δ} (t, x) \equiv f (t, x + δ e_{l}) . \end{matrix}

(115)

In addition, define

\begin{matrix} Δ f_{i_{l}, δ}^{(1)} (t, x) = \frac{f_{i_{l}, δ} (t, x) - f (t, x)}{δ} - f_{i_{l}}^{(1)} (t, x), \end{matrix}

(116)

and let

\begin{matrix} Δ A_{i_{l}, δ}^{(1)} (t, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \\ = & \frac{1}{δ} (A (t, x + δ e_{l}, Υ^{1} (t, x + δ e_{l}), Λ^{1} (t, x + δ e_{l}), {\bar{Λ}}^{1} (t, x + δ e_{l}), {\tilde{Λ}}^{1} (t, x + δ e_{l}, z)) \\ - A (t, x, Υ^{1} (s, x), Λ^{1} (t, x), {\bar{Λ}}^{1} (t, x), {\tilde{Λ}}^{1} (t, x, z))) \\ - A_{i_{l}}^{(1)} (t, x, Υ^{1} (s, x), Λ^{1} (t, x), {\bar{Λ}}^{1} (t, x), {\tilde{Λ}}^{1} (t, x, z)) \end{matrix}

(117)

for each

A \in {L, J, I, \bar{L}, \bar{J}, \bar{I}}

.

Next, we use Tr

(A)

to represent the trace of the matrix

A^{'} A

corresponding to a matrix A. Furthermore, we use

{(T r (A))}_{j}

to denote the jth term in the trace’s summation. In addition, let

\begin{matrix} Z_{δ} (t, x) & \equiv & ζ (Δ Υ_{i_{l}, δ}^{(1)} (t, x) + Δ Λ_{i_{l}, δ}^{(1)} (t, x)) \\ = & (T r (Δ Υ_{i_{l}, δ}^{(1)} (t, x)) + T r (Δ Λ_{i_{l}, δ}^{(1)} (t, x))) e^{2 γ t} . \end{matrix}

(118)

for a given

t \in [0, T]

,

δ > 0

, and

γ > 0

. Thus, by Equation (112) and It

\hat{o}

’s formula stated in Theorem 1.14 and Theorem 1.16 of Øksendal and Sulem [53], we know that

\begin{matrix} Z_{δ} (t, x) + \int_{t}^{T} T r (Δ {\bar{J}}_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \\ + Δ {\bar{Λ}}_{i_{l}, δ}^{1, (1)} (s^{-}, x) - Δ {\bar{Λ}}_{i_{l}, δ}^{(1)} (s, x)) e^{2 γ s} d s \\ + \sum_{j = 1}^{h} \int_{t}^{T} \int_{Z} (T r (Δ {\bar{I}}_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z) \\ {+ Δ {\tilde{Λ}}_{i_{l}, δ}^{1, (1)} (s^{-}, x, z_{j}) - Δ {\tilde{Λ}}_{i_{l}, δ}^{(1)} (s^{-}, x, z)))}_{j} e^{2 γ s} N_{j} (λ_{j} d s, d z_{j}) \\ = & 2 \int_{0}^{t} (- γ T r (Δ Υ_{i_{l}, δ}^{(1)} (s, x)) + {(Δ Υ_{i_{l}, δ}^{(1)} (s, x))}^{'} (Δ L_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}))) e^{2 γ s} d s \\ + 2 \int_{t}^{T} (- γ T r (Δ Λ_{i_{l}, δ}^{(1)} (s, x)) + {(Δ Λ_{i_{l}, δ}^{(1)} (s, x))}^{'} (Δ {\bar{L}}_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}))) e^{2 γ s} d s \\ - M_{δ} (t, x) \\ \leq & (- 2 γ + \frac{1}{\hat{γ}}) (\int_{0}^{t} T r (Δ Υ_{i_{l}, δ}^{(1)} (s, x)) e^{2 γ s} d s + \int_{t}^{T} T r (Δ Λ_{i_{l}, δ}^{(1)} (s, x)) e^{2 γ s} d s) \\ + \hat{γ} \int_{0}^{t} {∥Δ L_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s \\ + \hat{γ} \int_{t}^{T} {∥Δ {\bar{L}}_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s - M_{δ} (t, x) \\ = & \hat{γ} \int_{0}^{t} {∥Δ L_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s \\ + \hat{γ} \int_{t}^{T} {∥Δ {\bar{L}}_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s - M_{δ} (t, x) \end{matrix}

(119)

if, in the last equality, we take

\begin{matrix} \hat{γ} = \frac{1}{2 γ} > 0 . \end{matrix}

(120)

Note that

M_{δ} (t, x)

in Equation (119) is a martingale, which can be represented by a form as follows:

\begin{matrix} M_{δ} (t, x) \\ = & - 2 \sum_{j = 1}^{d} \int_{0}^{t} {(Δ Υ_{i_{l}, δ}^{(1)} (s^{-}, x))}^{'} Δ {(J_{j})}_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) e^{2 γ s} d W_{j} (s) \\ - 2 \sum_{j = 1}^{h} \int_{0}^{t} \int_{Z} {(Δ Υ_{i_{l}, δ}^{(1)} (s^{-}, x))}^{'} Δ {(I_{j})}_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z_{j}) e^{2 γ s} {\tilde{N}}_{j} (λ_{j} d s, d z_{j}) \\ + 2 \sum_{j = 1}^{d} \int_{t}^{T} {(Δ Λ_{i_{l}, δ}^{(1)} (s^{-}, x))}^{'} (Δ {({\bar{J}}_{j})}_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \\ + Δ {({\bar{Λ}}_{j}^{1})}_{i_{l}, δ}^{(1)} (s^{-}, x) - Δ {({\bar{Λ}}_{j})}_{i_{l}, δ}^{(1)} (s^{-}, x)) e^{2 γ s} d W_{j} (s) \\ + 2 \sum_{j = 1}^{h} \int_{t}^{T} \int_{Z} {(Δ Λ_{i_{l}, δ}^{(1)} (s^{-}, x))}^{'} (Δ {({\bar{I}}_{j})}_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z_{j}) \\ + Δ {({\tilde{Λ}}_{j}^{1})}_{i_{l}, δ}^{(1)} (s^{-}, x, z_{j}) - Δ {({\tilde{Λ}}_{j})}_{i_{l}, δ}^{(1)} (s^{-}, x, z_{j})) e^{2 γ s} {\tilde{N}}_{j} (λ_{j} d s, d z_{j}) . \end{matrix}

(121)

Therefore, it follows from Equation (119) and the martingale property that

\begin{matrix} E [Z_{δ} (t, x) + \int_{t}^{T} T r (Δ {\bar{J}}_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \\ + Δ {\bar{Λ}}_{i_{l}, δ}^{1, (1)} (s^{-}, x) - Δ {\bar{Λ}}_{i_{l}, δ}^{(1)} (s^{-}, x)) e^{2 γ s} d s \\ + \sum_{j = 1}^{h} \int_{t}^{T} \int_{Z} (T r (Δ {\bar{I}}_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z) \\ {+ Δ {\tilde{Λ}}_{i_{l}, δ}^{1, (1)} (s^{-}, x, z) - Δ {\tilde{Λ}}_{i_{l}, δ}^{(1)} (s^{-}, x, z)))}_{j} e^{2 γ s} N_{j} (λ_{j} d s, d z_{j})] \\ \leq & \hat{γ} E [\int_{0}^{t} {∥Δ L_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s] \\ + \hat{γ} [\int_{t}^{T} {∥Δ {\bar{L}}_{i_{l}, δ}^{(1)} (s, x, U^{1}, V^{1}, {\bar{V}}^{1}, {\tilde{V}}^{1})∥}^{2} e^{2 γ s} d s] . \end{matrix}

(122)

Furthermore, by Equations (119)–(122) and Burkholder–Davis–Gundy’s inequality given in Theorem 48 on page 193 of Protter [55]), the following fact holds:

\begin{matrix} E [sup_{0 \leq t \leq T} |M_{δ} (t, x)|] \\ \leq \hat{γ} K_{1} E [\int_{0}^{t} {∥Δ L_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s] \\ + \hat{γ} K_{1} [\int_{t}^{T} {∥Δ {\bar{L}}_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s], \end{matrix}

(123)

where the constant

K_{1} \geq 0

depends only on

K_{D, 0}

,

K_{D, 1}

, T, and d. Note that the detailed estimation procedure for the quantity on the right-hand side of Equation (123) is postponed to the same argument used for Equation (150) in the following part of the Proof of Claim 1 since more exact calculations are required there.

Now, for the random variable set

{Z_{δ} (t, x)

δ \in [0, σ]}

corresponding to a given

t \in [0, T]

,

x \in D

, and

σ > 0

. By Lemma 1.3 of Peskir and Shiryaev [56], we know that there exists a countable subset

C = {δ_{1}, δ_{2}, \dots} \subset [0, σ]

such that

\begin{matrix} {esssup}_{δ \in [0, σ]} Z_{δ} (t, x) = sup_{δ \in C} Z_{δ} (t, x), a . s ., \end{matrix}

(124)

where “esssup” represents the essential supremum. In addition, we let

\begin{matrix} \{\begin{matrix} {\bar{Z}}_{δ_{1}} (t, x) = Z_{δ_{1}} (t, x), \\ {\bar{Z}}_{δ_{n + 1}} (t, x) = {\bar{Z}}_{δ_{n}} (t, x) \lor Z_{δ_{n + 1}} (t, x) f o r n \in {1, 2, \dots} . \end{matrix} \end{matrix}

(125)

In Equation (125), the notation

α \lor β

denotes

max {α, β}

for two given real numbers

α

and

β

. Then, we have the observation that

\begin{matrix} \{\begin{matrix} Z_{δ} (t, x) \leq {\bar{Z}}_{δ} (t, x) & for each δ \in C \\ {\bar{Z}}_{δ_{1}} (t, x) \leq {\bar{Z}}_{δ_{2}} (t, x) & for any δ_{1}, δ_{2} \in C satisfying δ_{1} \leq δ_{2} . \end{matrix} \end{matrix}

(126)

By the second inequality in Equation (126), we can see that

\{{\bar{Z}}_{δ} (t, x), δ \in C\}

is an upwards directed set. Therefore, by Equation (124), we know that

\begin{matrix} E [{esssup}_{0 \leq δ \leq σ} Z_{δ} (t, x)] \\ \leq & E [{esssup}_{δ \in C} {\bar{Z}}_{δ} (t, x)] \\ = & lim_{n \to \infty} E [{\bar{Z}}_{δ_{n}} (t, x)] \\ = & lim_{n \to \infty} E [max_{δ \in {δ_{1}, \dots, δ_{n}}} Z_{δ} (t, x)] \end{matrix}

(127)

for the corresponding sequence of

{δ_{n}, n = 1, 2, \dots}

with

t \in [0, T]

,

x \in D

, and

σ > 0

. Furthermore, for a given

n \in {2, 3, \dots}

, define

\begin{matrix} {\bar{M}}_{δ_{n}} (t, x) = M_{δ_{n}} (t, x) I_{{Z_{δ_{n}} \geq {\bar{Z}}_{δ_{n - 1}}}} + M_{δ_{n - 1}} (t, x) I_{{Z_{δ_{n}} < {\bar{Z}}_{δ_{n - 1}}}} . \end{matrix}

(128)

Then, it follows from the induction method with respect to each

n \in {1, 2, \dots}

and Equation (119) that

\begin{matrix} E [max_{δ \in {δ_{1}, \dots, δ_{n}}} Z_{δ} (t, x)] \\ \leq & \hat{γ} lim_{n \to \infty} E [\int_{0}^{t} max_{δ \in {δ_{1}, \dots, δ_{n}}} {∥Δ L_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s \\ + \int_{t}^{T} max_{δ \in {δ_{1}, \dots, δ_{n}}} {∥Δ {\bar{L}}_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s] - lim_{n \to \infty} E [{\bar{M}}_{δ_{n}} (t, x)] \\ \leq & K E [\int_{0}^{t} {esssup}_{0 \leq δ \leq σ} {∥Δ L_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s \\ + \int_{t}^{T} {esssup}_{0 \leq δ \leq σ} {∥Δ {\bar{L}}_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s] \\ + \int_{0}^{T} {esssup}_{0 \leq δ \leq σ} {∥Δ J_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s \\ + \int_{0}^{T} \sum_{i = 1}^{h} \int_{Z} {esssup}_{0 \leq δ \leq σ} {∥Δ I_{i, i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z_{i})∥}^{2} e^{2 γ s} λ_{i} ν_{i} (d z_{i}) d s], \end{matrix}

(129)

where the constant

K \geq 0

depends only on

K_{D, 0}

, d, T, and

γ

. Furthermore, the second inequality of Equation (129) follows from the calculation in Equation (122) and the fact that

\begin{matrix} |E [{\bar{M}}_{δ_{n}} (t, x)]| \leq E [sup_{t \in [0, T]} ∥M_{δ_{n}} (t, x)∥] + E [sup_{t \in [0, T]} ∥M_{δ_{n - 1}} (t, x)∥] . \end{matrix}

(130)

Now, we recall the condition that

\begin{matrix} (Υ^{1} (\cdot, x), Λ^{1} (\cdot, x), {\bar{Λ}}^{1} (\cdot, x), {\tilde{Λ}}^{1} (\cdot, x, z)) \in Q_{F}^{2} ([0, T] \times D) . \end{matrix}

Then, we can conclude that

\begin{matrix} ∥(Υ^{1, (c)} (t, x + ξ e_{l}), Λ^{1, (c)} (t, x + ξ e_{l}), {\bar{Λ}}^{1, (c)} (t, x + ξ e_{l}), {\tilde{Λ}}^{1, (c)} (t, x + ξ e_{l}, z))∥ \\ \leq & ∥(max_{x \in D} ∥Υ^{1, (c)} (t, x)∥, max_{x \in D} ∥Λ^{1, (c)} (t, x)∥, max_{x \in D} ∥{\bar{Λ}}^{1, (c)} (t, x)∥, max_{x \in D} ∥{\tilde{Λ}}^{1, (c)} (t, x, z)∥)∥ \end{matrix}

(131)

for a given

x \in D

,

z \in Z^{h}

, any given

c \in {0, 1, 2, \dots}

, and an arbitrarily small number

ξ

satisfying

x + ξ e_{l} \in D

. Note that the related quantities on the right-hand side of Equation (131) are a.s. squarely integrable with respect to the Lebesgue measure and/or the Lévy measure. Therefore,

{\tilde{Λ}}^{1} (t, x, \cdot)

(the integration of

{\tilde{Λ}}^{1} (t, x, z)

in terms of the Lévy measure) is also infinitely smooth in each

x \in D

due to the dominated convergence theorem. Therefore, it follows from the mean-value theorem that

\begin{matrix} Δ A_{i_{l}, δ}^{(1)} (t, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \\ = & ξ_{1} A_{i_{l}}^{(2)} (t, x + ξ e_{l}, Υ^{1} (t, x + ξ e_{l}), Λ^{1} (t, x + ξ e_{l}), {\bar{Λ}}^{1} (t, x + ξ e_{l}), {\tilde{Λ}}^{1} (t, x + ξ e_{l}, \cdot)) \end{matrix}

(132)

a.s. for each

A \in {L, J, \bar{L}}

, where

ξ_{1} \in (0, δ)

and

ξ \in (0, ξ_{1})

are constants depending on

δ

. Furthermore, by Equations (83), (131), and (132), we know that the left-hand side of Equation (132) with respect to

δ

is bounded by a squarely-integrable random variable with respect to the measure

d t \times d P

. Similarly, for

A = \bar{J}

and each

z \in Z^{h}

, we a.s. have that

\begin{matrix} Δ A_{i_{l}, δ}^{(1)} (t, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z) \\ = & ξ_{1} A_{i_{l}}^{(2)} (t, x + ξ e_{l}, Υ^{1} (t, x + ξ e_{l}), Λ^{1} (t, x + ξ e_{l}), {\bar{Λ}}^{1} (t, x + ξ e_{l}), {\tilde{Λ}}^{1} (t, x + ξ e_{l}, z), z) . \end{matrix}

(133)

In addition, by Equations (84), (131), and (132), we know that the left-hand side of Equation (133) with respect to

δ

is bounded by a squarely-integrable random variable with respect to the measure

d t \times ν (d z) \times d P

. Therefore, by Equations (127)–(129) and the dominated convergence theorem, we have that

\begin{matrix} lim_{σ \to 0} E [{esssup}_{0 \leq δ \leq σ} Z_{δ} (t, x)] \\ \leq & K E [\int_{0}^{t} lim_{σ \to 0} {esssup}_{0 \leq δ \leq σ} {∥Δ L_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s \\ + \int_{t}^{T} lim_{σ \to 0} {esssup}_{0 \leq δ \leq σ} {∥Δ {\bar{L}}_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥}^{2} e^{2 γ s} d s \\ + \int_{0}^{T} lim_{σ \to 0} {esssup}_{0 \leq δ \leq σ} ∥Δ J_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})∥ e^{2 γ s} d s \\ + \int_{0}^{T} \sum_{i = 1}^{h} \int_{Z} lim_{σ \to 0} {esssup}_{0 \leq δ \leq σ} ∥Δ I_{i_{l}, δ}^{(1)} (s^{-}, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}, z_{i})∥ e^{2 γ s} λ_{i} ν_{i} (d z_{i}) d s] . \end{matrix}

(134)

Hence, it follows from Equation (134) and Fatou’s lemma that there exists a subsequence

N^{'} \subset N

satisfying

\begin{matrix} {esssup}_{0 \leq δ \leq σ_{n}} Z_{δ} (t, x)) \to 0 along n \in N^{'} a . s . \end{matrix}

(135)

corresponding to a given sequence

σ_{n}

such that

σ_{n} \to 0

along

n \in N

. Thus, by Equation (135), we know that the first-order partial derivatives of

Υ

and

Λ

with respect to

x_{l}

for every

l \in {1, 2, \dots, p}

exist. More exactly, they a.s. equal

Υ_{i_{l}}^{(1)} (t, x)

and

Λ_{i_{l}}^{(1)} (t, x)

respectively for a given

t \in [0, T]

and

x \in D

, which are all

{F_{t}}

-adapted.

Next, we provide a proof for the claim with respect to

\bar{Λ}

. In fact, by the proof as given in Equations (127)–(129), we can conclude that the following quantity is bounded by the one on the right-hand side of Equation (134):

\begin{matrix} lim_{σ \to 0} E [\int_{t}^{T} {esssup}_{0 \leq δ \leq σ} T r (Δ {\bar{J}}_{i_{l}, δ}^{(1)} (s, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) \\ + Δ {({\bar{Λ}}^{1})}_{i_{l}, δ}^{(1)} (s, x) - Δ {\bar{Λ}}_{i_{l}, δ}^{(1)} (s, x)) e^{2 γ s} d s] . \end{matrix}

(136)

Therefore, it follows from Equations (135) and (136) that

\begin{matrix} lim_{δ \to 0} Δ {\bar{Λ}}_{i_{l}, δ}^{(1)} (t, x) = lim_{δ \to 0} (Δ {\bar{J}}_{i_{l}, δ}^{(1)} (t, x, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}) + Δ {({\bar{Λ}}^{1})}_{i_{l}, δ}^{(1)} (t, x)) = 0, a . s . \end{matrix}

Hence, we know that the first-order partial derivative of

\bar{Λ}

with respect to

x_{l}

for every

l \in {1, 2, \dots, p}

exists. More precisely, it a.s. equals

{\bar{Λ}}_{i_{l}}^{(1)} (t, x)

for any given

t \in [0, T]

and

x \in D

, which is an

{F_{t}}

-predictable process. Similarly, we can make the conclusion for

{\tilde{Λ}}_{i_{l}}^{(1)} (t, x, z)

associated with each l, t, x, and z.

Second, we assume that there exists a 4-tuple

(Υ^{(c - 1)} (t, x)

,

Λ^{(c - 1)} (t, x),

{\bar{Λ}}^{(c - 1)} (t, x),

{\tilde{Λ}}^{(c - 1)} (t, x, z))

that corresponds to a given

(Υ^{1} (t, x),

Λ^{1} (t, x)

,

{\bar{Λ}}^{1} (t, x),

{\tilde{Λ}}^{1} (t, x, z))

\in Q_{F}^{2} ([0, T] \times D)

for a given

c \in {1, 2, \dots}

. Then, we can prove that the following vector of derivatives exists for the given integer

c \in {1, 2, \dots}

:

\begin{matrix} (Υ^{(c)} (t, x), Λ^{(c)} (t, x), {\bar{Λ}}^{(c)} (t, x), {\tilde{Λ}}^{(c)} (t, x, z)) \end{matrix}

(137)

In fact, for the given integer

c \in {1, 2, \dots}

and arbitrarily fixed integer numbers

i_{1} \geq 0

, …,

i_{p} \geq 0

such that

i_{1} + \dots + i_{p} = c - 1

, we take a function

f \in {Υ, Λ, \bar{Λ}, \tilde{Λ}}

and define

\begin{matrix} f_{i_{1} \dots (i_{l} + 1) \dots i_{p}, δ}^{(c - 1)} (t, x) \equiv f_{i_{1} \dots i_{p}}^{(c - 1)} (t, x + δ e_{l}) \end{matrix}

(138)

for each

l \in {1, 2, \dots, p}

and sufficiently small

δ > 0

, which correspond to the

(c - 1)

th-order partial derivative

A_{i_{1} \dots i_{p}}^{(c - 1)} (s, x + δ e_{l}, Υ^{1} (s, x + δ e_{l}), Λ^{1} (s, x + δ e_{l}))

of

A \in {L, J, I, \bar{L}, \bar{J}, \bar{I}}

via Equation (95). Similarly, let

\begin{matrix} (Υ_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (t, x), Λ_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (t, x), {\bar{Λ}}_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (t, x), {\tilde{Λ}}_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (t, x, z)) \end{matrix}

be given as in Equation (95), where we replace

A \in {L, J, I, \bar{L}, \bar{J}, \bar{I}}

by their corresponding cth-order partial derivatives

A_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)}

for a given

t, x

,

Υ^{1} (t, x)

,

Λ^{1} (t, x)

,

{\bar{Λ}}^{1} (t, x)

, and

{\tilde{Λ}}^{1} (t, x, z)

. Furthermore, for a function

f \in {Υ, Λ, \bar{Λ}, \tilde{Λ}, Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1}}

, let

\begin{matrix} Δ f_{i_{1} \dots (i_{l} + 1) \dots i_{p}, δ}^{(c)} (t, x) = \frac{f_{i_{1} \dots (i_{l} + 1) \dots i_{p}, δ}^{(c - 1)} (t, x) - f_{i_{1} \dots i_{p}}^{(c - 1)} (t, x)}{δ} - f_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (t, x) \end{matrix}

(139)

Then, we let

\begin{matrix} Δ A_{i_{1} \dots (i_{l} + 1) \dots i_{p}, δ}^{(c)} (t, x, Υ^{1}, Λ^{1}) \\ \equiv & \frac{1}{δ} (A_{i_{1} \dots i_{p}}^{(c - 1)} (t, x + δ e_{l}, Υ^{1} (t, x + δ e_{l}), Λ^{1} (t, x + δ e_{l}), \cdot) - A_{i_{1} \dots i_{p}}^{(c - 1)} (s, x, Υ^{1} (s, x), Λ^{1} (s, x) \cdot)) \\ - A_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (s, x, Υ^{1} (s, x), Λ^{1} (s, x) \cdot) \end{matrix}

(140)

for each

A \in {L, J, I, \bar{L}, \bar{J}, \bar{I}}

. Thus, by repeating the proof for the first step, it follows from It

\hat{o}

’s formula that the following partial derivatives exist for the given integer

c \in {1, 2, \dots}

and all integers

l \in {1, \dots, p}

:

\begin{matrix} (Υ_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (t, x), Λ_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (t, x), {\bar{Λ}}_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (t, x), {\tilde{Λ}}_{i_{1} \dots (i_{l} + 1) \dots i_{p}}^{(c)} (t, x, z)) . \end{matrix}

Hence, the claim in Equation (137) holds.

Third, by the continuity of all concerned partial derivatives with respect to

x \in D

, it follows from the induction method in terms of the integer number

c \in {1, 2, \dots}

that the claims presented in the lemma hold. Therefore, we reach a proof for Lemma 2. □

The second part of the Proof of Claim 4 is as follows.

Proof.

In the second part of the Proof of Claim 4, we let

D_{F}^{2} ([0, T]

,

C^{\infty} (D, R^{l}))

with

l \in {r, q}

be the set of

R^{l}

-valued

{F_{t}}

-adapted càdlàg processes that satisfy the condition in Equation (76). Furthermore, for any given number sequence

γ = {γ_{c}, c = 0, 1, 2, \dots}

and

γ_{c} \in R

, let

M_{γ}^{D} [0, T]

represent the following Banach space:

\begin{matrix} M_{γ}^{D} [0, T] & \equiv & D_{F}^{2} ([0, T], C^{\infty} (D, R^{r})) \\ \times D_{F}^{2} ([0, T], C^{\infty} (D, R^{q})) \\ \times L_{F, p}^{2} ([0, T], C^{\infty} (D, R^{q \times d})) \\ \times L_{p}^{2} ([0, T] \times R_{+}^{h}, C^{\infty} (D, R^{q \times h})) . \end{matrix}

(141)

Note that the space presented in Equation (141) is a generalized version of many existing studies (see, e.g., the studies in Yong and Zhou [36] and Situ [57] for stochastic ordinary differential equations). Our endowed norm here depends on partial derivatives. Furthermore, our space presented here is also different from the one studied in Dai [14] for solely a B-SPDE since our space here depends on additional dynamics of the F-BSPDE and the Lévy measure. More precisely, our endowed norm can be presented as follows

\begin{matrix} {∥(Υ, Λ, \bar{Λ}, \tilde{Λ})∥}_{M_{γ}^{D}}^{2} & \equiv & \sum_{c = 0}^{\infty} ξ (c) {∥(Υ, Λ, \bar{Λ}, \tilde{Λ})∥}_{M_{γ_{c}, c}^{D}}^{2} \end{matrix}

(142)

for any given

(Υ, Λ, \bar{Λ}, \tilde{Λ}) \in M_{γ}^{D} [0, T]

, and

\begin{matrix} {∥(Υ, Λ, \bar{Λ}, \tilde{Λ})∥}_{M_{γ_{c}, c}^{D}}^{2} & = & E [sup_{0 \leq t \leq T} {∥Υ (t)∥}_{C^{c} (D, q)}^{2} e^{2 γ_{c} t}] \\ + E [sup_{0 \leq t \leq T} {∥Λ (t)∥}_{C^{c} (D, q)}^{2} e^{2 γ_{c} t}] \\ + E [\int_{0}^{T} {∥\bar{Λ} (t)∥}_{C^{c} (D, q d)}^{2} e^{2 γ_{c} t} d t] \\ + E [\int_{0}^{T} {∥\tilde{Λ} (t)∥}_{ν, c}^{2} e^{2 γ_{c} t} d t] . \end{matrix}

(143)

Now, by Equation (95), we can define the following map:

\begin{matrix} Ξ : (Υ^{1} (\cdot, x), Λ^{1} (\cdot, x), {\bar{Λ}}^{1} (\cdot, x), {\tilde{Λ}}^{1} (\cdot, x, z)) \to (Υ (\cdot, x), Λ (\cdot, x), \bar{Λ} (\cdot, x), \tilde{Λ} (\cdot, x, z)) . \end{matrix}

Furthermore, we can prove that

Ξ

forms a contraction mapping in

M_{γ}^{D} [0, T]

. To give a proof for this claim, consider

\begin{matrix} (Υ^{i} (\cdot, x), Λ^{i} (\cdot, x), {\bar{Λ}}^{i} (\cdot, x), {\tilde{Λ}}^{i} (\cdot, x, z)) \in M_{γ}^{D} [0, T] \end{matrix}

for each

i \in {1, 2, \dots}

, satisfying

\begin{matrix} (Υ^{i + 1} (\cdot, x), Λ^{i + 1} (\cdot, x), {\bar{Λ}}^{i + 1} (\cdot, x), {\tilde{Λ}}^{i + 1} (\cdot, x, z)) \\ = & Ξ (Υ^{i} (\cdot, x), Λ^{i} (\cdot, x), {\bar{Λ}}^{i} (\cdot, x), {\tilde{Λ}}^{i} (\cdot, x, z)) . \end{matrix}

In addition, define

\begin{matrix} Δ f^{i} = f^{i + 1} - f^{i} w i t h f \in {Υ, Λ, \bar{Λ}, \tilde{Λ}} \end{matrix}

and let

\begin{matrix} ζ (Δ Υ^{i} (t, x) + Δ Λ^{i} (t, x)) = (T r (Δ Υ^{i} (t, x)) + T r (Δ Λ^{i} (t, x))) e^{2 γ_{0} t} . \end{matrix}

(144)

Therefore, it follows from Equation (83) and the similar argument as used in proving Equation (119) that

\begin{matrix} ζ (Δ Υ^{i} (t, x) + Δ Λ^{i} (t, x)) \\ + \int_{t}^{T} T r (Δ \bar{J} (s, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) \\ + Δ {\bar{Λ}}^{i - 1} (s, x) - Δ {\bar{Λ}}^{i} (s, x)) e^{2 γ_{0} s} d s \\ + \sum_{j = 1}^{h} \int_{t}^{T} \int_{Z} (T r (Δ \bar{I} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z) \\ {+ Δ {\tilde{Λ}}^{i - 1} (s^{-}, x, z) - Δ {\tilde{Λ}}^{i} (s^{-}, x, z)))}_{j} e^{2 γ_{0} s} N_{j} (λ_{j} d s, d z_{j}) \\ \leq & {\hat{γ}}_{0} (\int_{0}^{t} {∥Δ L (s, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1})∥}^{2} e^{2 γ_{0} s} d s \\ + \int_{t}^{T} {∥Δ \bar{L} (s, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1})∥}^{2} e^{2 γ_{0} s} d s) - M^{i} (t, x) \\ \leq & {\hat{γ}}_{0} K_{a, 0} N^{i - 1} (t) - M^{i} (t, x), \end{matrix}

(145)

for a constant

γ_{0} > 0

and a given

i \in {2, 3, \dots}

. Furthermore, the constant

K_{a, 0} > 0

in Equation (145) depends only on

K_{D, 0}

. Note that we have taken

\begin{matrix} {\hat{γ}}_{0} = \frac{1}{2 γ_{0}} > 0 . \end{matrix}

(146)

for the last inequality in Equation (145). Moreover,

N^{i - 1} (t)

in Equation (145) can be expressed by

\begin{matrix} N^{i - 1} (t) \\ = & \int_{0}^{t} {∥Δ Υ^{i - 1} (s)∥}_{C^{k} (D, r)}^{2} e^{2 γ_{0} s} d s \\ + \int_{t}^{T} ({∥Δ Λ^{i - 1} (s)∥}_{C^{k} (D, q)}^{2} + {∥Δ {\bar{Λ}}^{i - 1} (s)∥}_{C^{k} (D, q d)}^{2} + {∥Δ {\tilde{Λ}}^{i - 1} (s)∥}_{ν, k}^{2}) e^{2 γ_{0} s} d s . \end{matrix}

(147)

In addition,

M^{i} (t, x)

in Equation (145) is a martingale, which is of the following form:

\begin{matrix} M^{i} (t, x) = \\ - 2 \sum_{j = 1}^{d} \int_{0}^{t} {(Δ Υ^{i} (s^{-}, x))}^{'} \\ Δ J_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) e^{2 γ s} d W_{j} (s) \\ - 2 \sum_{j = 1}^{h} \int_{0}^{t} \int_{Z} {(Δ Υ^{i} (s^{-}, x))}^{'} \\ Δ I_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j}) e^{2 γ s} {\tilde{N}}_{j} (λ_{j} d s, d z_{j}) \\ + 2 \sum_{j = 1}^{d} \int_{t}^{T} {(Δ Λ^{i} (s^{-}, x))}^{'} (Δ {\bar{J}}_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) \\ + {(Δ {\bar{Λ}}^{i - 1})}_{j} (s^{-}, x) - {(Δ {\bar{Λ}}^{i})}_{j} (s^{-}, x)) e^{2 γ_{0} s} d W_{j} (s) \\ + 2 \sum_{j = 1}^{h} \int_{t}^{T} \int_{Z} {({(Δ Λ^{i})}_{j} (s^{-}, x))}^{'} (Δ {\bar{I}}_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j}) \\ + {(Δ {\tilde{Λ}}^{i - 1})}_{j} (s^{-}, x, z_{j}) - {(Δ {\tilde{Λ}}^{i})}_{j} (s^{-}, x, z_{j})) e^{2 γ_{0} s} {\tilde{N}}_{j} (λ_{j} d s, d z_{j}) . \end{matrix}

(148)

Then, it follows from Equations (145)–(148) and It

\hat{o}

’s integral properties (i.e., martingale properties) that

\begin{matrix} E [(ζ (Δ Υ^{i} (t, x) + Δ Λ^{i} (t, x))) e^{2 γ_{0} t} \\ + \int_{t}^{T} T r (Δ \bar{J} (s, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) \\ + Δ {\bar{Λ}}^{i - 1} (s, x) - Δ {\bar{Λ}}^{i} (s, x)) e^{2 γ_{0} s} d s \\ + \sum_{j = 1}^{h} \int_{t}^{T} \int_{Z} (T r (Δ I (s, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z) \\ {+ Δ {\tilde{Λ}}^{i - 1} (s^{-}, x, z) - Δ {\tilde{Λ}}^{i} (s^{-}, x, z)))}_{j} e^{2 γ_{0} s} λ_{j} d s ν_{j} (d z_{j})] \\ \leq & {\hat{γ}}_{0} (T + 1) K_{a, 0} {∥(Δ Υ^{i - 1}, Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1})∥}_{M_{γ_{0}, k}^{D}}^{2} . \end{matrix}

(149)

Next, it follows from Equation (148) that

\begin{matrix} E [sup_{0 \leq t \leq T} | M^{i} (t, x) |] \\ \leq & 2 \sum_{j = 1}^{d} E [sup_{0 \leq t \leq T} | \int_{0}^{t} {(Δ Υ^{i} (s^{-}, x))}^{'} \\ Δ J_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) e^{2 γ_{0} s} d W_{j} (s) |] \\ + 2 \sum_{j = 1}^{h} E [sup_{0 \leq t \leq T} | \int_{0}^{t} \int_{Z} {(Δ Υ^{i} (s^{-}, x))}^{'} \\ Δ I_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j}) e^{2 γ_{0} s} \tilde{N} (λ_{j} d s, d z_{j}) |] \\ + 4 \sum_{j = 1}^{d} E [sup_{0 \leq t \leq T} | \int_{0}^{t} {(Δ Λ^{i} (s^{-}, x))}^{'} \\ (Δ {\bar{J}}_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) \\ + {(Δ {\bar{Λ}}^{i - 1})}_{j} (s^{-}, x) - {(Δ {\bar{Λ}}^{i})}_{j} (s^{-}, x)) e^{2 γ_{0} s} d W_{j} (s) |] \\ + 4 \sum_{j = 1}^{h} E [sup_{0 \leq t \leq T} | \int_{0}^{t} \int_{Z} {(Δ Λ^{i} (s^{-}, x))}^{'} \\ (Δ {\bar{I}}_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j}) \\ + {(Δ {\tilde{Λ}}^{i - 1})}_{j} (s^{-}, x, z_{j}) - {(Δ {\tilde{Λ}}^{i})}_{j} (s^{-}, x, z_{j})) e^{2 γ_{0} s} \tilde{N} (λ_{j} d s, d z_{j}) |] . \end{matrix}

(150)

Then, it follows from Burkholder–Davis–Gundy’s inequality (as stated in Theorem 48 of Protter [55]) that the right-hand side of Equation (150) is bounded by

\begin{matrix} K_{b, 0} (\sum_{j = 1}^{d} E [(\int_{0}^{T} ∥ Δ Υ^{i} (s^{-}, x) ∥^{2} \\ ∥ {(Δ J^{i})}_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) ∥^{2} e^{4 γ_{0} s} d s)^{\frac{1}{2}}] \\ + \sum_{j = 1}^{h} E [(\int_{0}^{T} \int_{Z} ∥ Δ Υ^{i} (s^{-}, x) ∥^{2} \\ ∥ Δ I_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j}) ∥^{2} e^{4 γ_{0} s} λ_{j} ν_{j} (d z_{j}) d s)^{\frac{1}{2}}] \\ + \sum_{j = 1}^{d} E [(\int_{0}^{T} ∥ Δ Λ^{i} (s^{-}, x) ∥^{2} \\ ∥ {(Δ {\bar{J}}^{i})}_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) \\ + {(Δ {\bar{Λ}}^{i - 1})}_{j} (s^{-}, x) - {(Δ {\bar{Λ}}^{i})}_{j} (s^{-}, x) ∥^{2} e^{4 γ_{0} s} d s)^{\frac{1}{2}}] \\ + \sum_{j = 1}^{h} E [(\int_{0}^{T} \int_{Z} ∥ Δ Λ^{i} (s^{-}, x) ∥^{2} \\ ∥ Δ {\bar{I}}_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j}) \\ + {(Δ {\tilde{Λ}}^{i})}_{j} (s^{-}, x, z_{j}) - {(Δ {\tilde{Λ}}^{i})}_{j} (s^{-}, x, z_{j}) ∥^{2} e^{4 γ_{0} s} λ_{j} ν_{j} (d z_{j}) d s)^{\frac{1}{2}}]), \end{matrix}

(151)

where the constant

K_{b, 0} \geq 0

depends only on

K_{D, 0}

and T. Furthermore, it follows from the direct observation that the quantity in Equation (151) is bounded by

\begin{matrix} K_{b, 0} (E [{(sup_{0 \leq t \leq T} ∥ Δ Υ^{i} (t, x) ∥^{2} e^{2 γ_{0} t})}^{\frac{1}{2}} \\ (\sum_{j = 1}^{d} {(\int_{0}^{T} {∥Δ J_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1})∥}^{2} e^{2 γ_{0} s} d s)}^{\frac{1}{2}} \\ + \sum_{j = 1}^{h} (\int_{0}^{T} \int_{Z} {∥Δ I_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j})∥}^{2} \\ e^{2 γ_{0} s} λ_{j} ν_{j} (d z_{j}) d s)^{\frac{1}{2}})] \\ + & (E [{(sup_{0 \leq t \leq T} ∥ Δ Λ^{i} (t, x) ∥^{2} e^{2 γ_{0} t})}^{\frac{1}{2}} \\ (\sum_{j = 1}^{d} (\int_{0}^{T} ∥ Δ {\bar{J}}_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) \\ + {(Δ {\bar{Λ}}^{i - 1})}_{j} (s^{-}, x) - {(Δ {\bar{Λ}}^{i})}_{j} (s^{-}, x) ∥^{2} e^{2 γ_{0} s} d s)^{\frac{1}{2}} \\ + \sum_{j = 1}^{h} (\int_{0}^{T} \int_{Z} ∥ Δ {\bar{I}}_{j} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j}) \\ + {(Δ {\tilde{Λ}}^{i - 1})}_{j} (s^{-}, x, z_{j}) - {(Δ {\tilde{Λ}}^{i})}_{j} (s^{-}, x, z_{j}) ∥^{2} e^{2 γ_{0} s} λ_{j} ν_{j} (d z_{j}) d s)^{\frac{1}{2}})]) . \end{matrix}

(152)

In addition, by direct computation, we know that the quantity in Equation (152) is dominated by

\begin{matrix} \frac{1}{2} E [sup_{0 \leq t \leq T} ∥ Δ Υ^{i} (t, x) ∥^{2} e^{2 γ_{0} t}] \\ + d K_{b, 0}^{2} E [\int_{0}^{T} T r (Δ J (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) e^{2 γ_{0} s} d s] \\ + K_{b, 0}^{2} E [\sum_{j = 1}^{h} \int_{0}^{T} \int_{Z} T r {(Δ I (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j}))}_{j} \\ e^{2 γ_{0} s} λ_{j} ν_{j} (d z_{j}) d s] \\ + \frac{1}{2} E [sup_{0 \leq t \leq T} ∥ Δ Λ^{i} (t, x) ∥^{2} e^{2 γ_{0} t}] \\ + d K_{b, 0}^{2} E [\int_{0}^{T} T r (Δ \bar{J} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) \\ + Δ {\bar{Λ}}^{i - 1} (s^{-}, x) - Δ {\bar{Λ}}^{i} (s^{-}, x)) e^{2 γ_{0} s} d s] \\ + K_{b, 0}^{2} E [\sum_{j = 1}^{h} \int_{0}^{T} \int_{Z} T r (Δ \bar{I} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z_{j}) \\ + Δ {\tilde{Λ}}^{i - 1} (s^{-}, x, z) - Δ {\tilde{Λ}}^{i} (s^{-}, x, z_{j})) e^{2 γ_{0} s} λ_{j} ν_{j} (d z_{j}) d s] . \end{matrix}

(153)

Due to Equation (149), the quantity in Equation (153) is bounded by

\begin{matrix} \frac{1}{2} (E [sup_{0 \leq t \leq T} ∥ Δ Υ^{i} (t) ∥_{C^{0} (r)}^{2} e^{2 γ_{0} t}] + E [sup_{0 \leq t \leq T} ∥ Δ Υ^{i} (t) ∥_{C^{0} (q)}^{2} e^{2 γ_{0} t}]) \\ + {\hat{γ}}_{0} (T + 1) d K_{a, 0} K_{b, 0}^{2} {∥(Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1})∥}_{M_{γ_{0}, k}^{D}}^{2}, \end{matrix}

(154)

where the constant

K_{a, 0} \geq 0

depends only on T, d, and

K_{D, 0}

. Thus, it follows from Equations (83) and (145)–(154) that

\begin{matrix} E [sup_{0 \leq t \leq T} ∥ Δ Υ^{i} (t) ∥_{C^{0} (q)}^{2} e^{2 γ_{0} t}] + E [sup_{0 \leq t \leq T} ∥ Δ Λ^{i} (t) ∥_{C^{0} (q)}^{2} e^{2 γ_{0} t}] \\ \leq 2 (1 + d K_{b, 0}^{2}) K_{a, 0} {\hat{γ}}_{0} (T + 1) {∥(Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1})∥}_{M_{γ_{0}, k}^{D}}^{2} . \end{matrix}

(155)

Furthermore, it follows from Equation (145) and Equation (83) that for

i \in {3, 4, \dots}

,

\begin{matrix} E [\int_{t}^{T} T r (Δ {\bar{Λ}}^{i} (s, x)) e^{2 γ_{0} s} d s] \\ \leq & 2 E [\int_{t}^{T} T r (Δ \bar{J} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) \\ + Δ {\bar{Λ}}^{i - 1} (s^{-}, x) - Δ {\bar{Λ}}^{i} (s, x)) e^{2 γ_{0} s} d s] \\ + 2 E [\int_{t}^{T} T r (Δ \bar{J} (s^{-}, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}) \\ + Δ {\bar{Λ}}^{i - 1} (s^{-}, x)) e^{2 γ_{0} s} d s] \\ \leq & 2 {\hat{γ}}_{0} K_{C, 0} (∥ (Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1}) ∥_{M_{γ_{0}, k}^{D}}^{2} \\ + ∥ (Δ Υ^{i - 2}, Δ Λ^{i - 2}, Δ {\bar{Λ}}^{i - 2}, Δ {\tilde{Λ}}^{i - 2}) ∥_{M_{γ_{0}, k}^{D}}^{2}), \end{matrix}

(156)

where the constant

K_{C, 0} \geq 0

depends only on

K_{D, 0}

and T. Similarly, it follows from Equation (84) that

\begin{matrix} E [\sum_{j = 1}^{h} \int_{t}^{T} \int_{Z} {(T r (Δ {\tilde{Λ}}^{i} (s^{-}, x, z)))}_{j} e^{2 γ_{0} s} λ_{j} d s ν_{j} (d z_{j})] \\ \leq & 2 E [\sum_{j = 1}^{h} \int_{t}^{T} \int_{Z} (T r (Δ \bar{I} (s, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\bar{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z) \\ + Δ {\tilde{Λ}}^{i - 1} (s^{-}, x, z) - Δ {\tilde{Λ}}^{i} (s^{-}, x, z)))_{j} e^{2 γ_{0} s} λ_{j} d s ν_{j} (d z_{j})] \\ + 2 E [\sum_{j = 1}^{h} \int_{t}^{T} \int_{Z} (T r (Δ \bar{I} (s, x, Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}, Υ^{i - 1}, Λ^{i - 1}, {\bar{Λ}}^{i - 1}, {\tilde{Λ}}^{i - 1}, z) \\ + Δ {\tilde{Λ}}^{i - 1} (s^{-}, x, z)))_{j} e^{2 γ_{0} s} λ_{j} d s ν_{j} (d z_{j})] \\ \leq & 2 {\hat{γ}}_{0} K_{C, 0} (∥ (Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1}) ∥_{M_{γ_{0}, k}^{D}}^{2} \\ + ∥ (Δ Υ^{i - 2}, Δ Λ^{i - 2}, Δ {\bar{λ}}^{i - 2}, Δ {\tilde{Λ}}^{i - 2}) ∥_{M_{γ_{0}, k}^{D}}^{2}) . \end{matrix}

(157)

Thus, it follows from Equations (145) and (155)–(157), and the observation, that all the related functions and norms are continuous with respect to x, and we know that

\begin{matrix} {∥(Δ Υ^{i}, Δ Λ^{i}, Δ {\bar{Λ}}^{i}, Δ {\tilde{Λ}}^{i})∥}_{M_{γ_{0}, 0}^{D}}^{2} \\ \leq & {\hat{γ}}_{0} K_{d, 0} ({∥(Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1})∥}_{M_{γ_{0}, k}^{D}}^{2} \\ + {∥(Δ Υ^{i - 2}, Δ Λ^{i - 2}, Δ {\bar{Λ}}^{i - 2}, Δ {\tilde{Λ}}^{i - 2})∥}_{M_{γ_{0}, k}^{D}}^{2}), \end{matrix}

(158)

where the constant

K_{d, 0} \geq 0

depends only on

K_{D, 0}

and T.

Now, by Lemma 2 and along the same procedure in Equation (144), we define

\begin{matrix} ζ (Δ Υ^{c, i} (t, x) + Δ Λ^{c, i} (t, x)) \equiv (T r (Δ Υ^{c, i} (t, x)) + T r (Δ Λ^{c, i} (t, x))) e^{2 γ_{c} t}, \end{matrix}

(159)

for each

c \in {1, 2, \dots}

, where

\begin{matrix} Δ Υ^{c, i} (t, x)) & = & (Δ Υ^{(0), i} (t, x)), Δ Υ^{(1), i} (t, x) {), \dots, Δ Υ^{(c), i} (t, x))}^{'}, \\ Δ Λ^{c, i} (t, x)) & = & (Δ Λ^{(0), i} (t, x)), Δ Λ^{(1), i} (t, x) {), \dots, Δ Λ^{(c), i} (t, x))}^{'} . \end{matrix}

Then, by It

\hat{o}

’s formula and along the same procedure in Equation (158), we have that

\begin{matrix} {∥(Δ Υ^{i}, Δ Λ^{i}, Δ {\bar{Λ}}^{i}, Δ {\tilde{Λ}}^{i})∥}_{M_{γ_{c}, c}^{D}}^{2} \\ \leq & {\hat{γ}}_{c} K_{d, c} ({∥(Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1})∥}_{M_{γ_{c}, k + c}^{D}}^{2} \\ + {∥(Δ Υ^{i - 2}, Δ Λ^{i - 2}, Δ {\bar{Λ}}^{i - 2}, Δ {\tilde{Λ}}^{i - 2})∥}_{M_{γ_{c}, k + c}^{D}}^{2}) \\ \leq & \frac{δ}{({(c + 1)}^{10} {(c + 2)}^{10} \dots {(c + k)}^{10}) (η (c + 1) η (c + 2) \dots η (c + k))} \\ ({∥(Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1})∥}_{M_{γ_{k + c}, k + c}^{D}}^{2} \\ + {∥(Δ Υ^{i - 2}, Δ Λ^{i - 2}, Δ {\bar{Λ}}^{i - 2}, Δ {\tilde{Λ}}^{i - 2})∥}_{M_{γ_{k + c}, k + c}^{D}}^{2}), \end{matrix}

(160)

where we selected a sequence of numbers

γ

for the last inequality of Equation (160), which satisfies

γ_{0} < γ_{1} < \dots

and

\begin{matrix} {\hat{γ}}_{c} K_{d, c} ({(c + 1)}^{10} {(c + 2)}^{10} \dots {(c + k)}^{10}) (η (c + 1) η (c + 2) \dots η (c + k)) \leq δ \end{matrix}

for a

δ > 0

to make the number

2 \sqrt{e^{k} δ}

sufficiently small. Therefore, we have that

\begin{matrix} {∥(Δ Υ^{i}, Δ Λ^{i}, Δ {\bar{Λ}}^{i}, Δ {\tilde{Λ}}^{i})∥}_{M_{γ}^{D}}^{2} \\ \leq & e^{k} δ ({∥(Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1})∥}_{M_{γ}^{D}}^{2} \\ + {∥(Δ Υ^{i - 2}, Δ Λ^{i - 2}, Δ {\bar{Λ}}^{i - 2}, Δ {\tilde{Λ}}^{i - 2})∥}_{M_{γ}^{D}}^{2}) . \end{matrix}

(161)

Since

{(a^{2} + b^{2})}^{1 / 2} \leq a + b

for two real numbers

a, b \geq 0

, we know that

\begin{matrix} {∥(Δ Υ^{i}, Δ Λ^{i}, Δ {\bar{Λ}}^{i}, Δ {\tilde{Λ}}^{i})∥}_{M_{γ}^{D}} \\ \leq & \sqrt{e^{k} δ} ({∥(Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1})∥}_{M_{γ}^{D}} \\ + {∥(Δ Υ^{i - 2}, Δ Λ^{i - 2}, Δ {\bar{Λ}}^{i - 2}, Δ {\tilde{Λ}}^{i - 2})∥}_{M_{γ}^{D}}) . \end{matrix}

(162)

Therefore, by Equation (162), we know that

\begin{matrix} \sum_{i = 3}^{\infty} {∥(Δ Υ^{i}, Δ Λ^{i}, Δ {\bar{Λ}}^{i}, Δ {\tilde{Λ}}^{i})∥}_{M_{γ}^{D}} \\ \leq & \frac{\sqrt{e^{k} δ}}{1 - 2 \sqrt{e^{k} δ}} (2 {∥(Δ Υ^{2}, Δ Λ^{2}, Δ {\bar{Λ}}^{2}, Δ {\tilde{Λ}}^{2})∥}_{M_{γ}^{D}} \\ + {∥(Δ Υ^{1}, Δ Λ^{1}, Δ {\bar{Λ}}^{1}, Δ {\tilde{Λ}}^{1})∥}_{M_{γ}^{D}}) \\ < & \infty . \end{matrix}

(163)

Thus, from Equation (163), we see that

(Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i})

along

i \in {1, 2, \dots}

forms a Cauchy sequence in the generalized Banach space

M_{γ}^{D} [0, T]

. Thus, we can conclude that there exists some 4-tuple

(Υ, V, \bar{Λ}, \tilde{Λ})

satisfying

\begin{matrix} (Υ^{i}, Λ^{i}, {\bar{Λ}}^{i}, {\tilde{Λ}}^{i}) \to (Υ, Λ, \bar{Λ}, \tilde{Λ}) as i \to \infty in M_{γ}^{D} [0, T] . \end{matrix}

(164)

Finally, by Equation (164) and extending the discussion for Theorem 5.2.1 of Øksendal [58] to the generalized Banach space

M_{γ}^{D} [0, T]

, we can reach a proof for Claim 4. □

Proof

(Proof of Proposition 1). First of all, we generalize the previous discussion for Claim 4 to the corresponding case for a complex-valued system with an open (or partially open) domain D (e.g.,

R^{p}

or

R_{+}^{p}

) as presented in Equation (45). More precisely, let

C^{\infty} (D, C^{l})

with

l \in {r, q}

be the Banach space endowed with the norm

\begin{matrix} {∥ f ∥}_{C^{\infty} (D, l)}^{2} \equiv \sum_{n = 0}^{\infty} ξ (n + 1) {∥ f ∥}_{C^{\infty} (D_{n}, l)}^{2}, \end{matrix}

(165)

where the norm

{∥ f ∥}_{C^{\infty} (D_{n}, l)}^{2}

in Equation (165) is interpreted in the corresponding complex-valued sense. In addition, define

\begin{matrix} {\bar{Q}}_{F}^{2} ([0, τ] \times D) \end{matrix}

(166)

to be the corresponding space in Equation (79) if the terminal time T is replaced by a stopping time

τ \in [0, T]

and the norm in Equation (75) is substituted by the one in Equation (165). Finally, we use the same approach to interpret the spaces

L_{G}^{2} (Ω, C^{\infty} (D; C^{r}))

and

L_{F_{τ}}^{2} (Ω, C^{\infty} (D; C^{q}))

. Then, we have the following claim.

If

(G, H) \in L_{G}^{2} (Ω, C^{\infty} (D; C^{r})) \times L_{F_{τ}}^{2} (Ω, C^{\infty} (D; C^{q}))

and the system in Equation (8) satisfies the conditions in Equations (83)–(86) over

D_{n}

for each

n \in {0, 1, \dots}

with associated (local) linear growth and Lipshitz constant

K_{D_{n}, c}

; furthermore, assume that each

A \in {L, \bar{L}, J, \bar{J}, I, \bar{I}}

is

{F_{t}}

-adapted for every fixed

x \in D

,

z \in Z^{h}

, and any given

(u, v, \bar{v}, \tilde{v}) \in V^{\infty} (D)

with conditions in Equations (87)–(92) being true; then, for the system in Equation (8), there uniquely exists a 4-tuple solution, which is a strong and adapted solution, i.e.,

\begin{matrix} (Υ, Λ, \bar{Λ}, \tilde{Λ}) \in {\bar{Q}}_{F}^{2} ([0, τ] \times D), \end{matrix}

(167)

and

(Υ, Λ) (\cdot, x)

is càdlàg for each

x \in D

a.s.

To prove the claim in Equation (167), we first consider a real-valued system corresponding to the case that

τ = T

, whose proof is along the line of the one for Claim 4. More precisely, for any given number sequence

γ = {γ_{D_{c}}, c = 0, 1, 2, \dots}

with

γ_{D_{c}} \in R

, replace the norm for the Banach space

M_{γ}^{D} [0, T]

defined in Equation (141) by

\begin{matrix} {∥(Υ, Λ, \bar{Λ}, \tilde{Λ})∥}_{M_{γ}^{D}}^{2} & \equiv & \sum_{c = 0}^{\infty} ξ (c) {∥(Υ, Λ, \bar{Λ}, \tilde{Λ})∥}_{M_{γ_{D_{c}}, c}^{D_{c}}}^{2}, \end{matrix}

(168)

for any given

(Υ, Λ, \bar{Λ}, \tilde{Λ})

in this space, where

\begin{matrix} {∥(Υ, Λ, \bar{Λ}, \tilde{Λ})∥}_{M_{γ_{D_{c}}}^{D_{c}}}^{2} & = & E [sup_{0 \leq t \leq T} {∥Υ (t)∥}_{C^{c} (D_{c}, r)}^{2} e^{2 γ_{D_{c}} t}] \\ + E [sup_{0 \leq t \leq T} {∥Λ (t)∥}_{C^{c} (D_{c}, q)}^{2} e^{2 γ_{D_{c}} t}] \\ + E [\int_{0}^{T} {∥\bar{Λ} (t)∥}_{C^{c} (D_{c}, q d)}^{2} e^{2 γ_{D_{c}} t} d t] \\ + E [\int_{0}^{T} {∥\tilde{Λ} (t)∥}_{ν, c}^{2} e^{2 γ_{D_{c}} t} d t] . \end{matrix}

Then, it follows from a similar argument to that used for Equation (161) in the proof of Claim 4 that

\begin{matrix} (Υ^{1} (\cdot, x), Λ^{1} (\cdot, x), {\bar{Λ}}^{1} (\cdot, x), {\tilde{Λ}}^{1} (\cdot, x, z)) \in {\bar{Q}}_{F}^{2} ([0, T] \times D) \end{matrix}

with

(Υ^{0}, Λ^{0}, {\bar{Λ}}^{0}, {\tilde{Λ}}^{0}) = (0, 0, 0, 0)

, where

(Υ^{1}, Λ^{1}, {\bar{Λ}}^{1}, {\tilde{Λ}}^{1})

is defined through Equation (95) in Lemma 1. Furthermore, over each

D_{c}

with

c \in {0, 1, \dots}

, we have that

\begin{matrix} {∥(Δ Υ^{i}, Δ Λ^{i}, Δ {\bar{Λ}}^{i}, Δ {\tilde{Λ}}^{i})∥}_{M_{γ}^{D}}^{2} \\ \leq & e^{k} δ ({∥(Δ Υ^{i - 1}, Δ Λ^{i - 1}, Δ {\bar{Λ}}^{i - 1}, Δ {\tilde{Λ}}^{i - 1})∥}_{M_{γ}^{D}}^{2} \\ + {∥(Δ Υ^{i - 2}, Δ Λ^{i - 2}, Δ {\bar{Λ}}^{i - 2}, Δ {\tilde{Λ}}^{i - 2})∥}_{M_{γ}^{D}}^{2}), \end{matrix}

(169)

where

δ

is a constant that can be determined by suitably choosing a sequence of numbers

γ

satisfying

γ_{D_{0}} < γ_{D_{1}} < \dots

and

0 < \sqrt{e^{k} δ} / (1 - 2 \sqrt{e^{k} δ}) < 1

(note that

γ_{D_{c}}

may depend on both

D_{c}

and c for each

c \in {0, 1, \dots}

). Thus, it follows from Equation (169) that the remaining justification for the claim in Equation (167) can be conducted along the line of proof for Claim 4. Second, we consider a real-valued system corresponding to the case that

τ

is a general random stopping time. The detailed proof for this case can be accomplished by extending the proof corresponding to

τ = T

via the techniques developed in Dai [33,42] for both forward and backward SDEs, and the related discussions in Yong and Zhou [36]. Third, by directly generalizing the discussion concerning the real-valued system to complex-valued systems, we reach a proof for the claim in Equation (167).

Secondly, we present the rest of the proof for Proposition 1 by applying the claim in Equation (167). As explained previously, we only consider the case corresponding to a real-valued system. In fact, let

M

represent the defined space to hold the value of

(Υ, Λ, \bar{Λ}, \tilde{Λ})

and related partial derivatives in Equation (10). Thus, it follows from the convention imposed in Equations (10) and (11) of this paper and (7.1) of Ethier and Kurtz [59] that there exists a vector polynomial sequence

ρ_{n}^{A}

for each

A \in {L, J, I, \bar{L}, \bar{J}, \bar{I}}

,

t \in [0, T]

, and

z \in Z^{h}

such that

\begin{matrix} A^{n} (t, y, z) \equiv \int A (t, y, z) ρ_{n}^{A} (y - w) d w . \end{matrix}

(170)

Furthermore, in Equation (170), y and w are points in

\in M

, and the related convergence along

n \in {1, 2, \dots}

is uniform in terms of y over every compact subset S of

M

. In addition, the dimension of vector function

ρ_{n}^{A}

corresponds to

A

and makes the product between

A

and

ρ_{n}^{A}

meaningful. It follows from the explanation for (7.3) in Ethier and Kurtz [59] that the component (e.g., corresponding to each

Λ

in

R^{q}

) of

ρ_{n}^{A}

can be taken as

\begin{matrix} ρ_{n}^{v} (z) = n^{q} {(1 - \frac{{∥ z ∥}^{2}}{n^{2}})}^{n^{4}} π^{- q / 2} . \end{matrix}

(171)

Moreover, it follows from the proof of Proposition 7.1 in Ethier and Kurtz [59] that

A^{n} (t, y, z)

with respect to a given

n \in {1, 2, \dots}

is a polynomial vector in terms of y. Hence, it follows from the conditions in Equations (59)–(62) that

A^{n}

satisfies the conditions in Equations (83)–(86) over S if we replace every

A \in {L, J, I, \bar{L}, \bar{J}, \bar{I}}

in Equation (8) by its counterpart

A^{n}

in Equation (170). Thus, there uniquely exists an adapted 4-tuple strong solution

(Υ^{n}, Λ^{n}, {\bar{Λ}}^{n}, {\tilde{Λ}}^{n})

corresponding to S in the space of

{\bar{Q}}_{F}^{2} ([0, τ] \times D)

given by Equation (166) to the system of coupled FB-SPDEs in Equation (8). In other words, for the equation with respect to an

n \in {1, 2, \dots}

\begin{matrix} \{\begin{matrix} Υ^{n} (t, x) & = G (x) + \int_{0}^{t} L^{n} (s^{-}, x, Υ^{n}, Λ^{n}, {\bar{Λ}}^{n}, {\tilde{Λ}}^{n}) d s \\ + \int_{0}^{t} J^{n} (s^{-}, x, Υ^{n}, Λ^{n}, {\bar{Λ}}^{n}, {\tilde{Λ}}^{n}) d W (s) \\ + \int_{0}^{t} \int_{Z^{h}} I^{n} (s^{-}, x, Υ^{n}, Λ^{n}, {\bar{Λ}}^{n}, {\tilde{Λ}}^{n}, z) \tilde{N} (λ d s, d z), \\ Λ^{n} (t, x) & = H (x) + \int_{t}^{τ} {\bar{L}}^{n} (s^{-}, x, Υ^{n}, Λ^{n}, {\bar{Λ}}^{n}, {\tilde{Λ}}^{n}) d s \\ + \int_{t}^{τ} {\bar{J}}^{n} (s^{-}, x, Υ^{n}, Λ^{n}, {\bar{Λ}}^{n}, {\tilde{Λ}}^{n}) d W (s) \\ + \int_{t}^{τ} \int_{Z^{h}} {\bar{I}}^{n} (s^{-}, x, Υ^{n}, Λ^{n}, {\bar{Λ}}^{n}, {\tilde{Λ}}^{n}, z) \tilde{N} (λ d s, d z) \end{matrix} \end{matrix}

(172)

there uniquely exists a 4-tuple solution corresponding to S in

{\bar{Q}}_{F}^{2} ([0, τ] \times D)

, which is a strong and adapted solution.

Next, consider an operator

A \in {L, J, \bar{L}, \bar{J}}

and take sufficiently large positive integers m and n such that

n > m

. Then, by our imposed conditions in Equations (59)–(62) of this paper and by the related computations in (7.3)–(7.4) of Ethier and Kurtz [59], we know that the inequalities in Equations (83) and (85) are true on set S as shown in the following calculation:

\begin{matrix} ∥A^{n} (t, y^{n}, z) - A^{m} (t, y^{m}, z)∥ \\ \leq & \tilde{K} (∥ u^{n} - u^{m} ∥_{C^{k} (D, r)} + ∥ v^{n} - v^{m} ∥_{C^{k} (D, q)} + ∥ {\bar{v}}^{n} - {\bar{v}}^{m} ∥_{C^{k} (D, q d)} + {∥ {\tilde{v}}^{n} - {\tilde{v}}^{m} ∥}_{ν, k}) \\ + O (1 / m - 1 / n) + O ((1 / m^{4}) (1 / m^{4} - 1 / n^{4})), \end{matrix}

where

\tilde{K} \geq 0

denotes a constant. Meanwhile, for each

A \in {I, \bar{I}}

, it follows from the condition in Equation (60) of this paper and the related computations in (7.3)–(7.4) of Ethier and Kurtz [59] that the facts in Equations (84) and (86) are true on set S, e.g.,

\begin{matrix} \sum_{i = 1}^{h} \int_{Z} {∥A_{i}^{n} (t, y^{n}, z_{i}) - A_{i}^{m} (t, y^{m}, z_{i})∥}^{2} λ_{i} ν_{i} (d z_{i}) \\ \leq & \tilde{K} (∥ u^{n} - u^{m} ∥_{C^{k} (D, r)}^{2} + ∥ v^{n} - v^{m} ∥_{C^{k} (D, q)}^{2} + ∥ {\bar{v}}^{n} - {\bar{v}}^{m} ∥_{C^{k} (D, q d)}^{2} + {∥ {\tilde{v}}^{n} - {\tilde{v}}^{m} ∥}_{ν, k}^{2}) \\ + O (1 / m - 1 / n) + O ((1 / m^{4}) (1 / m^{4} - 1 / n^{4})) . \end{matrix}

Therefore, it follows from Equations (83)–(86) and the similar proving procedure for the claim in Equation (167) that the solution sequence

{(Υ^{n}, Λ^{n}, {\bar{Λ}}^{n}, {\tilde{Λ}}^{n}), n \in {1, 2, \dots}}

corresponding to S is a Cauchy sequence in the space of

{\bar{Q}}_{F}^{2} ([0, T] \times D)

as introduced in Equation (166) and, hence, in the space of

Q_{F}^{2} ([0, T] \times D)

as given in Equation (55). In addition, we know that there uniquely exists a limit process

Ξ = (Υ, Λ, \bar{Λ}, \tilde{Λ}) \in Q_{F}^{2} ([0, T] \times D)

such that it is the unique 4-tuple solution of the FB-SPDE in Equation (8). Furthermore, it corresponds to the given set S, and it is a strong and adapted solution.

In the end, we show that there uniquely exists a 4-tuple process

Ξ = (Υ, Λ, \bar{Λ}, \tilde{Λ}) \in Q_{F}^{2} ([0, T] \times D)

such that it is a strong and adapted solution of the FB-SPDE in Equation (8), and it corresponds to

M

. More precisely, let

S_{1} \subset S_{2} \subset \dots

be an increasing compact set sequence, which satisfies

S_{r} \to M

along

r \to \infty

. Then, the rest proof concerning the unique existence follows from repeating the previous procedure with respect to

r \in {1, 2, \dots}

, the justified claim in Equation (167), and the proofs for Lemma 4.1 of Dai [42] and Proposition 18 of Dai [33]. Therefore, we reach a proof for Proposition 1. □

4. Conclusions

In this paper, we establish a relationship between SDGs and a unified forward–backward coupled SPDE with discontinuous Lévy jumps. The SDGs have q players and are driven by a general-dimensional vector Lévy process. By establishing vector-form Ito-Ventzell’s formula and a 4-tuple vector-field solution to the unified SPDE, we obtain a Pareto optimal Nash equilibrium policy process or a saddle point policy process to the SDG in a non-zero-sum or zero-sum sense. The unified SPDE is in both general dimensional vector-form and forward–backward coupling manners. The partial differential operators in its drift, diffusion, and jump coefficients are in time-variable and position parameters over a domain. Since the unified SPDE is of general nonlinearity and a general high order, we extend our recent study from the existing BM-driven backward case to a general Lévy-driven forward–backward coupled case. In doing so, we construct a new topological space to support the proof of the existence and uniqueness of an adapted solution of the unified SPDE, which is in a 4-tuple strong sense. The construction of the topological space is through constructing a set of topological spaces associated with a set of exponents

{γ_{1}, γ_{2}, \dots}

under a set of general localized conditions, which is significantly different from the construction of the single exponent case. Furthermore, due to the coupling from the forward SPDE and the involvement of the discontinuous Lévy jumps, our study is also significantly different from the BM-driven backward case. The coupling between forward and backward SPDEs essentially corresponds to the interaction between noise encoding and noise decoding in the current hot diffusion transformer model for generative AI. Finally, our forward–backward coupled SPDE studied in this paper is a classical (integer) derivative-oriented one. It is possible to extend our current study to the Caputo fractional derivative case by constructing a suitable topological supporting space, which will be our future work.

Funding

Supported by National Natural Science Foundation of China with Grant No. 11771006.

Data Availability Statement

All data included in this study are available upon request by contacting the corresponding author.

Conflicts of Interest

The author declares that the publication of this paper has no conflicts of interest.

References

Mauro, A.D.; Greco, M.; Grimaldi, M. A formal definition of big data based on its essential features. Libr. Rev. 2016, 65, 122–135. [Google Scholar] [CrossRef]
Dai, W. A unified system of FB-SDEs with Lévy jumps and double completely-S skew reflections. Commun. Math. Sci. 2018, 16, 659–704. [Google Scholar] [CrossRef]
Dai, W. Optimal rate scheduling via utility-maximization for J-user MIMO Markov fading wireless channels with cooperation. Oper. Res. 2013, 6, 1450–1462. [Google Scholar] [CrossRef]
Duncan, T.E. Mutual information for stochastic signals and Lévy processes. IEEE Trans. Inf. Theory 2010, 56, 18–24. [Google Scholar] [CrossRef]
Dai, W.; Jiang, Q. Stochastic optimal control of ATO systems with batch arrivals via diffusion approximation. Probab. Eng. Inf. Sci. 2007, 21, 477–495. [Google Scholar] [CrossRef]
Mandelbaum, A.; Pats, G. State-dependent stochastic networks. Part I: Approximations and applications with continuous diffusion limits. Ann. Appl. Probab. 1998, 8, 569–646. [Google Scholar] [CrossRef]
Kong, J.R.; Jiménez-Martínez, R.; Troullinou, C.; Lucivero, V.G.; Tóth, G.; Mitchell, M.W. Measurement-induced, spatially-extended entanglement in a hot, strongly-interacting atomic system. Nat. Commun. 2020, 11, 2415. [Google Scholar] [CrossRef]
Dai, W. Game-theoretic policy computing and simulation for blockchained buffering system via diffusion approximation. Probab. Eng. Inf. Sci. 2024, 1–36. [Google Scholar] [CrossRef]
Karatzas, I.; Li, Q. BSDE approach to non-zero-sum stochastic differential games of control and stopping. In Stochastic Processes, Finance and Control; World Scientific Publishers: Singapore, 2012; pp. 105–153. [Google Scholar]
Silver, D.; Huang, A.; Maddison, C.J.; Guez, A.; Sifre, L.; Driessche, G.V.D.; Schrittwieser, J.; Antonoglu, I.; Panneershelvam, V.; Lanctot, M.; et al. Mastering the game of Go with deep neural networks and tree search. Nature 2016, 529, 484–489. [Google Scholar] [CrossRef]
Silver, D.; Schrittwieser, J.; Simonyan, K.; Antonoglu, I.; Huang, A.; Guez, A.; Hubert, T.; Baker, L.; Lai, M.; Bolton, A.; et al. Mastering the game of Go without human knowledge. Nature 2017, 550, 354–359. [Google Scholar] [CrossRef]
Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Zidek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
Lee, K.H.; Nachum, O.; Yang, M.; Lee, L.; Freeman, D.; Xu, W.; Guadarrama, S.; Fischer, I.; Jang, E.; Michalewski, H. Multi-game decision transformers. arXiv 2022, arXiv:2205.15241v1. [Google Scholar]
Dai, W. Convolutional neural network based simulation and analysis for backward stochastic partial differential equations. Comput. Math. Appl. 2022, 119, 21–58. [Google Scholar] [CrossRef]
Dai, W. Simulating a strongly nonlinear backward stochastic partial differential equation via efficient approximation and machine learning. AIMS Math. 2024, 9, 18688–18711. [Google Scholar] [CrossRef]
Ash, G.R. Dynamic Routing in Telecommunications Network; McGraw-Hill: New York, NY, USA, 1997. [Google Scholar]
Hamidi, V.; Smith, K.S.; Wilson, R.S. Smart grid technology review within the transmission and distribution sector. In Proceedings of the 2010 IEEE PES Innovative Smart Grid Technologies Conference Europe, Gothenburg, Sweden, 1–8 October 2010. [Google Scholar]
Musiela, M.; Zariphopoulou, T. Stochastic partial differential equations and portfolio choice. In Contemporary Quantitative Finance; Chiarella, C., Novikov, A., Eds.; Springer: Berlin/Heidelberg, Germany, 2009; pp. 195–216. [Google Scholar]
Peng, S. Stochastic Hamilton-Jacobi-Bellman equations. SIAM J. Control Optim. 1992, 30, 284–304. [Google Scholar] [CrossRef]
Øksendal, B.; Sulem, A.; Zhang, T. A Stochastic HJB Equation for Optimal Control of Forward-Backward SDEs. In The Fascination of Probability, Statistics and Their Applications; Springer International Publishing: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
Applebaum, D. Lévy Processes and Stochastic Calculus; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar]
Sato, K.I. Lévy Processes and Infinite Divisibility; Cambridge University Press: Cambridge, UK, 1999. [Google Scholar]
Caselles, V.; Sapiro, G.; Chung, D.H. Vector median filters, morphology, and PDE’s: Theoretical connections. In Proceedings of the International Conference on Image Processing, Kobe, Japan, 24–28 October 1999; Volume 4, pp. 177–184. [Google Scholar]
Tschumperlé, D.; Deriche, R. Regularization of orthonormal vector sets using coupled PDE’s. In Proceedings of the IEEE Workshop on Variational and Level Set Methods in Computer Vision 2001, Vancouver, BC, Canada, 13 July 2001; pp. 3–10. [Google Scholar]
Tschumperlé, D.; Deriche, R. Constrained and unconstrained PDE’s for vector image restoration. In Proceedings of the Scandinavian Conference on Image Analysis, Bergen, Norway, 11–14 June 2001. [Google Scholar]
Tschumperlé, D.; Deriche, R. Anisotropic diffusion partial differential equations for multichannel image regularization: Framework and applications. Adv. Imaging Electron Phys. 2007, 145, 149–209. [Google Scholar]
Kratsios, A.; Debarnot, V.; Dokmannić, I. Small transformers compute universal metric embeddings. J. Mach. Learn. Res. 2023, 24, 1–48. [Google Scholar]
Peluchetti, S. Diffusion bridge mixture transports, Schrödinger bridge problems and generative modeling. J. Mach. Learn. Res. 2023, 24, 1–51. [Google Scholar]
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 5998–6008. [Google Scholar]
Huang, M.; Malhame, R.P.; Caines, P.E. Large population stochastic dynamic games: Closed-loop Mckean-Vlason systems and the Nash certainty equivalence principle. Commun. Inf. Syst. 2006, 6, 221–252. [Google Scholar]
Lasry, J.M.; Lions, P.L. Mean field games. Jpn. J. Math. 2007, 2, 229–260. [Google Scholar] [CrossRef]
Kolokoltsov, V.N. Quantum mean-field games. Ann. Appl. Probab. 2022, 32, 2254–2288. [Google Scholar] [CrossRef]
Dai, W. Mean-variance hedging based on an incomplete market with external risk factors of non-Gaussian OU processes. Math. Eng. 2015, 2015, 625289. [Google Scholar] [CrossRef]
Hairer, M. Solving the KPZ equation. Ann. Math. 2013, 178, 559–664. [Google Scholar] [CrossRef]
Kardar, M.; Parisi, G.; Zhang, Y.C. Dynamic scaling of growing interfaces. Phys. Rev. Lett. 1986, 66, 889–892. [Google Scholar] [CrossRef]
Yong, J.; Zhou, X.Y. Stochastic Controls: Hamiltonian Systems and HJB Equations; Springer: New York, NY, USA, 1999. [Google Scholar]
de Bouard, A.; Debussche, A. A stochastic nonlinear Schrödinger equation with multiplicative noise. Commun. Math. Phys. 1999, 205, 161–181. [Google Scholar] [CrossRef]
de Bouard, A.; Debussche, A. The stochastic nonlinear Schrödinger equation in H¹. Stoch. Anal. Appl. 2003, 21, 97–126. [Google Scholar] [CrossRef]
Chai, J.D. Density functional theory with fractional orbit occupations. J. Chem. Phys. 2012, 136, 154104. [Google Scholar] [CrossRef]
Chai, J.D. Thermally-assisted-occupation density functional theory with generalized-gradient approximations. J. Chem. 2014, 140, 18A521. [Google Scholar] [CrossRef]
Chang, C.Z.; Zhang, J.; Feng, X.; Shen, J.; Zhang, Z.; Guo, M.; Li, K.; Ou, Y.; Wei, P.; Wang, L.L.; et al. Experimental observation of the quantum anomalous Hall effect in a magnetic topologic insulator. Science 2013, 340, 167. [Google Scholar] [CrossRef]
Dai, W. Mean-variance portfolio selection based on a generalized BNS stochastic volatility model. Int. J. Comput. 2011, 88, 3521–3534. [Google Scholar] [CrossRef]
Hall, E. On a new action of the magnet on electric currents. Am. J. Math. 1879, 2, 287–292. [Google Scholar] [CrossRef]
Karplus, R.; Luttinger, J.M. Hall effect in ferromagnetics. Phys. Rev. 1954, 95, 1154–1160. [Google Scholar] [CrossRef]
Lions, P.L.; Souganidis, T. Fully nonlinear stochastic partial differential equations Équations aux dérivés partielles stochastiques compltèement non-linéaires. Comptes Rendus l’Acad. Sci.—Ser. I—Math. 1998, 326, 1085–1092. [Google Scholar]
Thouless, D.J. The quantum Hall Effect and the Schrödinger equation with competing periods. In Number Theory and Physics: Proceedings of the Winter School, Les Houches, France, 7–16 March 1989; Springer: Berlin/Heidelberg, Germany, 1990; Volume 47, pp. 170–176. [Google Scholar]
Bertoin, J. Lévy Processes; Cambridge University Press: Cambridge, UK, 1996. [Google Scholar]
Kallenberg, O. Foundations of Modern Probability; Springer: New York, NY, USA, 1997. [Google Scholar]
Dai, W. Brownian Approximations for Queueing Networks with Finite Buffers: Modeling, Heavy Traffic Analysis and Numerical Implementations. Ph.D. Thesis, Georgia Institute of Technology, Atlanta, GA, USA, 1996. [Google Scholar]
Mandelbaum, A.; Massey, W.A. Strong approximations for time-dependent queues. Math. Oper. Res. 1995, 20, 33–64. [Google Scholar] [CrossRef]
Konstantopoulos, T.; Last, G.; Lin, S.J. On a class of Lévy stochastic networks. Queueing Syst. 2004, 46, 409–437. [Google Scholar] [CrossRef]
Dai, W. Optimal control with monotonicity constraints for a parallel-server loss channel serving multi-class jobs. Math. Comput. Model. Dyn. Syst. 2014, 20, 284–315. [Google Scholar] [CrossRef]
Øksendal, B.; Sulem, A. Applied Stochastic Control of Jump Diffusions; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
Øksendal, B.; Zhang, T. The Ito^-Ventzell formula and forward stochastic differential equations driven by Poisson random measures. Osaka J. Math. 2007, 44, 207–230. [Google Scholar]
Protter, P.E. Stochastic Integration and Differential Equations, 2nd ed.; Springer: New York, NY, USA, 2004. [Google Scholar]
Peskir, G.; Shiryaev, A. Optimal Stopping and Free-Boundary Probelms; Birkhäuser: Basel, Switzerland, 2006. [Google Scholar]
Situ, R. On solutions of backward stochastic differential equations with jumps and applications. Stoch. Processes Their Appl. 1997, 66, 209–236. [Google Scholar]
Øksendal, B. Stochastic Differential Equations, 6th ed.; Springer: New York, NY, USA, 2005. [Google Scholar]
Ethier, S.N.; Kurtz, T.G. Markov Process: Characterization and Convergence; Wiley: New York, NY, USA, 1986. [Google Scholar]

Figure 1. A rough illustration of sample surface solution to the coupled SPDEs.

Figure 2. A physical queuing system with quantum-cloud service centers and Blockchain.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Dai, W. Stochastic Differential Games and a Unified Forward–Backward Coupled Stochastic Partial Differential Equation with Lévy Jumps. Mathematics 2024, 12, 2891. https://doi.org/10.3390/math12182891

AMA Style

Dai W. Stochastic Differential Games and a Unified Forward–Backward Coupled Stochastic Partial Differential Equation with Lévy Jumps. Mathematics. 2024; 12(18):2891. https://doi.org/10.3390/math12182891

Chicago/Turabian Style

Dai, Wanyang. 2024. "Stochastic Differential Games and a Unified Forward–Backward Coupled Stochastic Partial Differential Equation with Lévy Jumps" Mathematics 12, no. 18: 2891. https://doi.org/10.3390/math12182891

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Stochastic Differential Games and a Unified Forward–Backward Coupled Stochastic Partial Differential Equation with Lévy Jumps

Abstract

1. Introduction

2. Main Theorem with Examples

2.1. State and Value Processes

2.2. Main Theorem

2.3. Real-World Examples

2.3.1. Sharing vs. Competition in Cloud Services and Energy Grids

2.3.2. Mastering the Zero-Sum Game of Go

3. Proof of Main Theorem

3.1. Unique Existence of Solution to the Unified SPDE

3.2. Proof of Theorem 1

3.3. Proof of Proposition 1

4. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI