Abstract
In this paper, we introduce controlled discrete-time semi-Markov random evolutions. These processes are random evolutions of discrete-time semi-Markov processes with a control applied to the values of the random evolution. The main results concern time-rescaled weak convergence limit theorems in a Banach space for the above stochastic systems, namely, averaging and diffusion approximation. Applications are given to controlled additive functionals, controlled geometric Markov renewal processes, and controlled dynamical systems. We provide dynamic programming principles for discrete-time dynamical systems such as controlled additive functionals and controlled geometric Markov renewal processes. We also derive dynamic programming equations (Hamilton–Jacobi–Bellman equations) for the limiting processes in the diffusion approximation, namely, controlled additive functionals, controlled geometric Markov renewal processes, and controlled dynamical systems. As an example, we consider the solution of Merton's portfolio optimization problem for the limiting controlled geometric Markov renewal process in the diffusion approximation scheme. The rates of convergence in the limit theorems are also presented.
Keywords:
semi-Markov chain; controlled discrete-time semi-Markov random evolutions; averaging; diffusion approximation; diffusion approximation with equilibrium; rates of convergence; controlled additive functional; controlled dynamical systems; controlled geometric Markov renewal processes; HJB equation; Merton problem; Banach space
1. Introduction
Random evolutions were introduced over 40 years ago; see, e.g., [1,2], and for their asymptotic theory, [3,4,5,6] and the references therein. Discrete-time random evolutions, induced by discrete-time Markov chains, were introduced by Cohen [7] and Keepler [8], and discrete-time semi-Markov random evolutions (DTSMRE) by Limnios [9]. See also [10]. Koroliuk and Swishchuk [4], Swishchuk and Wu [5], Anisimov [11,12,13,14], and Koroliuk and Limnios [3] studied discrete-time random evolutions induced by the embedded Markov chains of continuous-time semi-Markov processes. This is equivalent to a discrete-time Markov random evolution stopped at a (continuous) random time. One example of a discrete-time random evolution is the geometric Markov renewal process (GMRP). Applications of the GMRP in finance have been considered in [15,16,17]. Optimal stopping of the GMRP and the pricing of European and American options for underlying assets modelled by the GMRP have been studied in [18].
Discrete-time semi-Markov chains (SMC) have only recently been used in applications, especially in DNA analysis, image and speech processing, reliability theory, etc.; see [19] and the references therein. These applications have stimulated a research effort in this area. While the literature on discrete-time Markov chain theory and applications is quite extensive, the literature on SMCs is limited, and most of it is related to hidden semi-Markov models for estimation.
The present article is a continuation of our previous work [20]. Thus, we keep all our notation and definitions the same as in the latter paper. Compared with our previous work [20], where we studied random evolutions of semi-Markov chains, here we additionally consider a control on the random evolution, which we call a controlled discrete-time semi-Markov random evolution (CDTSMRE) in a Banach space, and we present time-rescaled convergence theorems. In particular, we obtain weak convergence theorems in Skorokhod space for càdlàg stochastic processes; see, e.g., [21]. The limit theorems include averaging, diffusion approximation, and diffusion approximation with equilibrium. For the above limit theorems we also present rates of convergence results. Finally, we give some applications of the above-mentioned results, especially to controlled additive functionals (CAF), CGMRPs, controlled dynamical systems (CDS), and optimization problems.
Regarding the optimization problems, we provide dynamic programming principles for discrete-time dynamical systems such as CAF and CGMRPs (see Section 2.4); see, e.g., [22,23,24]. We also derive dynamic programming equations (Hamilton–Jacobi–Bellman equations) for the limiting processes in the diffusion approximation, namely, CAF, CGMRP, and CDS. As an example, we consider the solution of Merton's portfolio optimization problem for the limiting CGMRP in DA (see Section 4.4). The Merton problem, or Merton's portfolio problem, is a problem in continuous-time finance concerning portfolio choice. In a security market consisting of a stock and a risk-free asset, an investor must choose how much to consume and must allocate their wealth between the stock and the risk-free asset in such a way as to maximize expected utility. The problem was formulated and first solved by Robert Merton in 1969, and published in 1971 [25].
The results presented here are new and deal with CDTSMREs on Banach spaces. This paper contains new and original results on the dynamic programming principle for CDTSMREs and the DPE (HJB equations) for the limiting processes in DA. One remarkable new result is the solution of the Merton portfolio problem for the limiting CGMRP in DA. The method of proof is based on the martingale approach together with the convergence of transition operators of the extended semi-Markov chain via the solution of a singular perturbation problem [3,4,26]. As in our previous work [20], the tightness of these processes is proved via Sobolev's embedding theorems [27,28,29]. It is worth mentioning that, as in the Markov case, the results presented here cannot be deduced directly from the continuous-time case. We should also note that DTSMREs have been studied in detail in [20]. For semi-Markov processes see, e.g., [30,31,32,33]. For Markov chains and additive functionals see, e.g., [34,35,36,37,38].
The paper is organized as follows. The definition and properties of discrete-time semi-Markov random evolutions and controlled DTSMREs, as well as particular stochastic systems as applications, are introduced in Section 2. The main results of this paper, limit theorems for CDTSMREs, namely, averaging, diffusion approximation, and diffusion approximation with equilibrium of controlled DTSMREs, are considered in Section 3. In Section 4, we provide three applications of averaging, diffusion approximation, and diffusion approximation with equilibrium of controlled DTSMREs: controlled additive functionals, controlled GMRPs, and controlled dynamical systems. Section 5 deals with the analysis of the rates of convergence in the limit theorems, presented in the previous sections, for controlled DTSMREs and for the CAF and CGMRP. In Section 6, we give the proofs of the theorems presented in the previous sections. The last section concludes the paper and indicates some future work.
2. Controlled Discrete-Time Semi-Markov Random Evolutions
2.1. Semi-Markov Chains
The aim of this section is to present some notation and to make this paper as self-contained as possible. The reader may refer to our article [20] for more details.
Let be a measurable space with countably generated -algebra and be a stochastic basis on which we consider a Markov renewal process in discrete time , with state space . Notice that is the set of non-negative integer numbers. The semi-Markov kernel q is defined by (see, e.g., in [9,19]),
We will denote also , where . The process is the embedded Markov chain of the MRP with transition kernel . The semi-Markov kernel q is written as
where , the conditional distribution of the sojourn time in state x given that the next visited state is y.
Define also the counting process of jumps , and the discrete-time semi-Markov chain by , for . Define now the backward recurrence time process , , and the filtration , .
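To make the construction above concrete, the following is a minimal simulation sketch of a discrete-time semi-Markov chain driven by a Markov renewal process, assuming a finite state space {0, …, m−1}. The data layout (EMC transition matrix p, conditional sojourn-time distributions f) mirrors the kernel decomposition above, but all names are ours and purely illustrative.

```python
import random

def simulate_smc(p, f, x0, n_steps, rng=None):
    """Simulate a discrete-time semi-Markov chain z_0, ..., z_{n_steps}.

    p[x][y]      : transition probabilities of the embedded Markov chain (EMC)
    f[x][y][k-1] : conditional sojourn-time distribution P(sojourn = k | x -> y)
    Returns the path of the semi-Markov chain, which stays in each state for
    its (random) sojourn time before jumping.
    """
    rng = rng or random.Random(0)
    path, x = [], x0
    while len(path) <= n_steps:
        # next state of the EMC
        y = rng.choices(range(len(p[x])), weights=p[x])[0]
        # sojourn time in state x given that the next state is y
        k = rng.choices(range(1, len(f[x][y]) + 1), weights=f[x][y])[0]
        path.extend([x] * k)  # z stays in x during the sojourn
        x = y
    return path[: n_steps + 1]
```

For instance, a two-state chain alternating between 0 and 1 with random sojourn times is obtained with p = [[0, 1], [1, 0]].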
Let us consider a separable Banach space B of real-valued measurable functions defined on , endowed with the sup norm and denote by its Borel -algebra. The Markov chain , has the following transition probability operator on B
where , and its stationary distribution, if it exists, is given by
where
and is the stationary distribution of the EMC , , and . The probability measure defined by is the stationary probability of the SMC . Define also the r-th moment of holding time in state ,
Of course, , for any .
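As an illustrative numerical sketch (not taken from the paper), the stationary law of the SMC can be computed from the stationary distribution of the EMC and the mean holding times via the standard semi-Markov proportionality π(x) ∝ ν(x) m(x). The function below assumes a finite state space and an aperiodic EMC; all names are ours.

```python
def smc_stationary(p, m, n_iter=1000):
    """Stationary law pi of the semi-Markov chain, via pi(x) ~ nu(x) * m(x).

    p : EMC transition matrix (list of rows); m : mean holding times m(x).
    nu (the EMC stationary distribution) is found by power iteration.
    """
    n = len(p)
    nu = [1.0 / n] * n
    for _ in range(n_iter):  # power iteration: nu <- nu P
        nu = [sum(nu[x] * p[x][y] for x in range(n)) for y in range(n)]
    w = [nu[x] * m[x] for x in range(n)]  # unnormalized pi
    s = sum(w)
    return [wx / s for wx in w]
```

For example, for p = [[0.5, 0.5], [0.2, 0.8]] the EMC stationary distribution is ν = (2/7, 5/7), and equal mean holding times give π = ν.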
Define now the stationary projection operator on the null space of the (discrete) generating operator ,
where for any , and . This operator satisfies the equations
The potential operator of , denoted by , is defined by
2.2. General Definition and Properties of DTSMREs
We define here controlled discrete-time semi-Markov random evolutions. Let U denote a compact Polish space representing the control, and let the control process be U-valued; we suppose that it is a Markov chain. We note that we could also define the process , which is a semi-Markov control process, considered in many papers (see, e.g., [39,40]). We suppose that the homogeneous Markov chain is independent of , with transition probability kernel .
Let us consider a family of bounded contraction operators , defined on B, where the maps are -measurable, . Denote by I the identity operator on B. Let be the null space, and be the range values space of operator . We will suppose here that the Markov chain is uniformly ergodic, that is, , as , for any . In that case, the transition operator is reducible-invertible on B. Thus, we have , the direct sum of the two subspaces. The domain of an operator A on B is denoted by .
Definition 1.
A controlled discrete-time semi-Markov random evolution (CDTSMRE) , on the Banach space B, is defined by
for , and for any . Thus we have
The process is a Markov chain on , adapted to the filtration , . We also note that is a Markov chain on with discrete generator
where , and
The process defined by
on B, is an -martingale. The random evolution can be written as follows
and then, the martingale (5) can be written as follows,
or
Finally, as , one takes
2.3. Some Examples
Example 1.
Controlled Additive Functional or Markov Decision Process.
Let define the following controlled additive functional,
If we define the operator on in the following way,
then the controlled discrete-time semi-Markov random evolution has the following presentation,
This process is usually called a Markov decision process in the literature (see, e.g., [41,42,43,44]).
Example 2.
Controlled geometric Markov renewal process.
The CGMRP is defined in the following way,
We suppose that
If we define the operator on in the following way,
then the controlled discrete-time semi-Markov random evolution can be given as follows,
To the authors' knowledge, this process is defined here for the first time in the literature, and the notion of a controlled GMRP is new as well.
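For illustration, a trajectory of a controlled GMRP can be simulated under the product form S_n = S_0 ∏_{k≤n} (1 + a(x_k, u_k)) familiar from the GMRP literature [15,16,17]; the rate function a and the independent chains (x_k), (u_k) below are illustrative assumptions, not the paper's notation.

```python
import random

def cgmrp_path(s0, a, p_x, p_u, n, rng=None):
    """One trajectory of a controlled GMRP of the assumed product form
        S_n = S_0 * prod_{k=1}^{n} (1 + a(x_k, u_k)),
    where the semi-Markov component (x_k) and the control chain (u_k)
    are independent Markov chains on {0, ..., m-1}.

    a    : rate function a(x, u) > -1
    p_x  : transition matrix of (x_k);  p_u : transition matrix of (u_k)
    """
    s, x, u = s0, 0, 0
    path = [s]
    for _ in range(n):
        rng = rng or random.Random(1)
        x = rng.choices(range(len(p_x)), weights=p_x[x])[0]
        u = rng.choices(range(len(p_u)), weights=p_u[u])[0]
        s *= 1.0 + a(x, u)  # multiplicative (geometric) update
        path.append(s)
    return path
```

With a ≡ 0 the price stays constant, which serves as a sanity check of the product form.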
2.4. Dynamic Programming for Controlled Models
Here, we present dynamic programming for the controlled models given in the examples of the previous section. Let us consider a Markov control model (see [45]). Here, E is the state space; A is the control or action set; Q is the transition kernel, i.e., a stochastic kernel on E given , where ; and is a measurable function called the cost-per-stage function.
We are interested in minimizing the finite-horizon performance criterion, either (see Example 1)
or (see Example 2)
where is the terminal cost function, is the set of control policies.
In this way, denoting by the value function
the problem is to find a policy such that
Example 3.
Controlled Additive Functional.
Let us provide an algorithm for finding both the value function and an optimal policy for the example with function (see Example 1).
Let be the functions on E defined from to by (backwards)
and
Suppose that there is a selector such that attains the minimum in the above expression for for all meaning for any and
Then, the optimal policy is the deterministic Markov one , and the value function equals i.e.,
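The backward-induction algorithm of this example can be sketched as follows, for a finite Markov control model; the data layout (Q indexed by action, then by state pair) and all names are our illustrative choices, not the paper's notation.

```python
def backward_induction(states, actions, Q, c, c_N, N):
    """Finite-horizon backward induction for a Markov control model.

    Q[a][x][y] : transition probability x -> y under action a
    c(x, a)    : cost-per-stage;  c_N(x) : terminal cost
    Returns (V, policy): V[n][x] is the value at stage n,
    policy[n][x] the minimizing (deterministic Markov) action.
    """
    V = [[0.0] * len(states) for _ in range(N + 1)]
    policy = [[None] * len(states) for _ in range(N)]
    V[N] = [c_N(x) for x in states]  # terminal condition
    for n in range(N - 1, -1, -1):   # backwards in time
        for x in states:
            best_a, best_v = None, float("inf")
            for a in actions:
                v = c(x, a) + sum(Q[a][x][y] * V[n + 1][y] for y in states)
                if v < best_v:
                    best_a, best_v = a, v
            V[n][x], policy[n][x] = best_v, best_a
    return V, policy
```

The selector of the example corresponds to policy[n][x], and the optimal policy is the deterministic Markov one formed by these minimizers.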
Example 4.
Controlled Geometric Markov Renewal Chain.
Let us provide an algorithm for finding both the value function and an optimal policy for the example with function (see Example 2). We will modify the expression for in Example 2. Let be a log-return, then
Thus, we are interested in minimizing the finite-horizon performance criterion for
Let be the functions on E defined from to by (backwards)
and
Suppose that there is a selector such that attains the minimum in the above expression for for all meaning for any and
Then, the deterministic Markov policy is optimal, and the value function equals i.e.,
3. Limit Theorems for Controlled Semi-Markov Random Evolutions
In this section, we present averaging, diffusion approximation, and diffusion approximation with equilibrium results for controlled discrete-time semi-Markov random evolutions. It is worth noticing that the main scheme of the results is almost the same as in our previous works, in particular [20]. Nevertheless, the additional control component allows us to study more interesting problems.
3.1. Averaging of CDTSMREs
We consider here CDTSMREs defined in Section 2. Let us now set and consider the continuous time process
We will prove here asymptotic results for this process as .
The following assumptions are needed for averaging.
- A1:
- The MC is uniformly ergodic with ergodic distribution .
- A2:
- The moments , are uniformly integrable.
- A3:
- The perturbed operators have the following representation on Bwhere operators on B are closed and is dense in B, . Operators are negligible, i.e., for any .
- A4:
- We have (See A7.)
- A5:
- There exist Hilbert spaces H and such that are compactly embedded in the Banach spaces B and , respectively, where is the dual space of
- A6:
- Operators and are contractive on Hilbert spaces H and respectively.
- A7:
- The MC , is independent of , and is uniformly ergodic with stationary distribution .
We note that if then is a Sobolev space, and and this embedding is compact (see [29]). For the spaces and the situation is the same.
We also note that the semi-Markov chain is uniformly ergodic on with stationary probabilities , which follows from conditions A1 and A7.
Theorem 1.
Under Assumptions A1–A7, the following weak convergence takes place,
where the limit random evolution is determined by the following equation,
or, equivalently,
where the limit contracted operator is then given by
This result generalizes the classical Krylov–Bogolyubov averaging principle [46] to Banach spaces and controlled settings.
3.2. Diffusion Approximation of CDTSMREs
For the diffusion approximation of CDTSMREs, we will consider a different time-scaling and some additional assumptions.
- D1:
- Let us assume that the perturbed operators have the following representation in B, where the operators on B are closed and is dense in B, ; the operators are negligible, i.e., .
- D2:
- The following balance condition holds,where
- D3:
- The moments , are uniformly integrable.
Theorem 2.
Under Assumptions A1, A5–A7 (see Section 3.1), and D1–D3, the following weak convergence takes place,
where the limit random evolution is a diffusion random evolution determined by the following generator
where
and
3.3. Diffusion Approximation with Equilibrium
The diffusion approximation with equilibrium or the normal deviation is obtained by considering the difference between the rescaled initial processes and the averaging limit process. This is of great interest when we have no balance condition as previously in the standard diffusion approximation scheme.
Consider now the controlled discrete-time semi-Markov random evolution averaged evolution (see Section 3.1) and the deviated evolution
Theorem 3.
Under Assumptions A1, A5–A6 (see Section 3.1), and D3, with the operators in A3 instead of D1, the deviated controlled semi-Markov random evolution converges weakly, as , to the diffusion random evolution defined by the following generator
where Π is defined in (9).
4. Applications to Stochastic Systems
In this section, we give applications in connection with the above results: to additive functionals, which have many applications, e.g., in storage, reliability, and risk theories (see, e.g., [3,4,19,47]); to geometric Markov renewal processes, which also have many applications, including finance (see [15,16,17,18]); and to dynamical systems. Our main goal here is to obtain the limiting processes and apply optimal control methods to obtain the solutions of optimization problems. Limiting results for MCs, such as the LLN and CLT, were considered in [11,12].
4.1. Controlled Additive Functionals
Let us consider here the CAF, , described previously in Example 1.
Averaging of CAF. Now, if we define the continuous time process
then from Theorem 1 it follows that this process has the following limit
where We suppose that
Diffusion Approximation of CAF. If we consider the continuous time process as follows
then under balance condition and we get that the limit process has the following form,
where , and
and is a standard Wiener process.
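Since the limiting CAF in the diffusion approximation is a diffusion process, its paths can be approximated numerically by the standard Euler–Maruyama scheme. In the sketch below, the drift b and volatility sigma are placeholders for the averaged coefficients of the limit (which are elided here); all names are illustrative.

```python
import math
import random

def euler_maruyama(z0, b, sigma, T, n_steps, rng=None):
    """Euler-Maruyama path of dZ_t = b(Z_t) dt + sigma(Z_t) dW_t on [0, T]."""
    rng = rng or random.Random(42)
    dt = T / n_steps
    z, path = z0, [z0]
    for _ in range(n_steps):
        dw = rng.gauss(0.0, math.sqrt(dt))  # Wiener increment, N(0, dt)
        z = z + b(z) * dt + sigma(z) * dw
        path.append(z)
    return path
```

With sigma ≡ 0 the scheme reduces to the deterministic Euler method, which recovers the averaged (first-order) limit of Section 3.1.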
Diffusion Approximation with Equilibrium of CAF. Let us consider the following normalized additive functional,
Then, this process converges to the following process, where
and is a standard Wiener process.
In this way, the AF may be presented in the following approximated form,
4.2. Controlled Geometric Markov Renewal Processes
The CGMRP is defined in the following way (see in [15,16]),
We suppose that
If we define the operator on in the following way,
then the controlled discrete-time semi-Markov random evolution has the following presentation,
Averaging of CGMRP. Now, define the following sequence of processes,
Then, under averaging conditions the limit process has the following form,
where
Diffusion Approximation of CGMRP. If we define the following sequence of processes,
then, in the diffusion approximation scheme, we have the following limit process,
where
It means that satisfies the following stochastic differential equation,
where is a standard Wiener process.
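Since the limiting CGMRP satisfies a linear SDE of geometric Brownian motion type, it can be sampled exactly from the closed-form solution S_t = S_0 exp((μ − σ²/2)t + σ W_t). In this sketch, mu and sigma stand in for the averaged drift and diffusion coefficients of the limit, which are not reproduced here.

```python
import math
import random

def gbm_sample(s0, mu, sigma, t, rng=None):
    """Exact sample of S_t for dS = mu*S dt + sigma*S dW (the assumed
    linear-SDE form of the limiting CGMRP), via the closed-form solution
    S_t = s0 * exp((mu - sigma^2/2) * t + sigma * W_t)."""
    rng = rng or random.Random(7)
    w = rng.gauss(0.0, math.sqrt(t))  # W_t ~ N(0, t)
    return s0 * math.exp((mu - 0.5 * sigma ** 2) * t + sigma * w)
```

With sigma = 0 the sample collapses to the deterministic exponential growth s0·exp(mu·t), matching the averaged limit.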
Diffusion Approximation with Equilibrium of CGMRP. Let us consider the following normalized GMRP:
It is worth noticing that in finance the expression represents the log-return of the underlying asset (e.g., a stock).
Then, this process converges to the following process, where
and is a standard Wiener process.
In this way, the GMRP may be presented in the following approximated form,
4.3. Controlled Dynamical Systems
We consider here discrete-time CDS and their asymptotic behaviour in the series scheme: averaging and diffusion approximation [9].
Define the measurable function C on . Let us consider the difference equation
switched by the SMC .
The perturbed operators , are defined now by
Averaging of CDS. Under averaging assumptions the following weak convergence takes place,
where is the solution of the following (deterministic) differential equation,
where .
Diffusion Approximation of CDS. Under diffusion approximation conditions the following weak convergence takes place
where , is a diffusion process, with initial value , determined by the operator
provided that , and drift and diffusion coefficients are defined as follows,
with:
, ,
, where means transpose of the vector C,
, ,
, .
4.4. The Dynamic Programming Equations for Limiting Models in Diffusion Approximation
In this section, we consider the DPE, i.e., the HJB equations, for the limiting models in DA from Section 4.1, Section 4.2, and Section 4.3. As all the limiting processes in DA in Section 4.1, Section 4.2, and Section 4.3 are diffusion processes, we set up a general approach to the control of diffusion processes; see [48].
Let be a diffusion process satisfying the following stochastic differential equation,
where is the control process, is a standard Wiener process. Let us also introduce the following performance criterion function,
where is a terminal reward function (uniformly bounded) and is a running penalty/reward function (uniformly bounded), . The problem is to maximize this performance criterion, i.e., to find the value function
where is the admissible set of strategies/controls which are -predictable, non-negative, and bounded.
The Dynamic Programming Principle (DPP) for diffusions states that the value function satisfies the DPP
for all
Moreover, the value function above satisfies the Dynamic Programming Equation (DPE) or Hamilton–Jacobi–Bellman (HJB) Equation:
where is an infinitesimal generator of the diffusion process above, i.e.,
•DPE/HJB Equation for the Limiting CAF in DA (see Section 4.1)
We remind that the limiting process in this case has the following form
where and
and is a standard Wiener process.
•DPE/HJB Equation for the Limiting CGMRP in DA (see Section 4.2)
We recall that we have the following limiting process in this case:
where
Furthermore, satisfies the following stochastic differential equation (SDE),
where is a standard Wiener process.
In this case, the DPE or HJB Equation (16) reads with the generator
and
•DPE/HJB Equation for the Limiting CDS in DA (see Section 4.3)
We remind that in the diffusion approximation the limiting process is a diffusion process with a generator
provided that , and drift and diffusion coefficients are defined as follows,
with
, ,
, where means transpose of the vector C,
, ,
, .
Remark 1.
Our construction here is equivalent, to some extent, to the “Recurrent Processes of semi-Markov type (RPSM)” first studied in [13,14], including limit theorems. Those results were described in more detail in [11,12]. In particular, “RPSM with Markov switching” reflects the case of independent Markov components and , and the “General case of RPSM” reflects the case when is dependent on .
•The Merton Problem
This is an example of the solution of the DPE/HJB equation for the limiting CGMRP in DA. Let us consider the portfolio optimization problem proposed by Merton (1971); see [25]. We apply this approach to the limiting CGMRP in DA above. In this problem, the agent seeks to maximize expected wealth by trading a risky asset and a risk-free bond (or bank account). She/he places of the total wealth in the risky asset and looks to obtain the value function (performance criterion)
which depends on the current wealth x, the asset price , and the optimal trading strategy ; here, is the agent's utility function (e.g., exponential or power ). We suppose that the asset price satisfies the following SDE
where
Here, represents the expected continuously compounded rate of growth of the traded asset, r is the continuously compounded rate of return of the risk-free asset (bond or bank account).
The wealth process follows the following SDE,
From the SDEs for and for above we conclude that the infinitesimal generator for the pair is
From the HJB equation for the limiting CGMRP in DA, it follows that the value function
should satisfy the equation
with terminal condition
The explicit solution of this PDE depends on the explicit form of the utility function . Let us take the exponential utility function
In this case we can find that the optimal amount to invest in the risky asset is a deterministic function of time
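As a hedged numerical sketch: for the exponential utility U(x) = −e^{−γx}, the classical Merton solution gives the optimal dollar amount in the risky asset as the deterministic function of time π*(t) = ((μ − r)/(γσ²)) e^{−r(T−t)}. Here μ and σ stand in for the drift and diffusion coefficients of the limiting CGMRP (elided in the text), and γ is the risk-aversion parameter; all values are illustrative.

```python
import math

def merton_exponential_amount(mu, r, sigma, gamma, t, T):
    """Optimal dollar amount in the risky asset under exponential utility
    U(x) = -exp(-gamma * x): the classical closed form
        pi*(t) = (mu - r) / (gamma * sigma**2) * exp(-r * (T - t)),
    a deterministic function of time, as stated in the text."""
    return (mu - r) / (gamma * sigma ** 2) * math.exp(-r * (T - t))
```

For example, with mu = 0.08, r = 0.02, sigma = 0.2, gamma = 1, the amount at the terminal time T is 0.06/0.04 = 1.5; at earlier times it is discounted by exp(−r(T − t)).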
5. Rates of Convergence in Averaging and Diffusion Approximations
The rate of convergence in a limit theorem is important in several ways, both theoretical and practical. We present here the rates of convergence of CDTSMRE in the averaging, diffusion approximation and diffusion approximation with equilibrium schemes and, as corollaries, we give the rates of convergence for CAF and CGMRP in the corresponding limits.
Proposition 1.
The Rate of Convergence of CDTSMRE in the Averaging has the following form,
where is a constant, and
The proof of this proposition is given in Section 6.4.
Proposition 2.
The Rate of Convergence of CDTSMRE in the Diffusion Approximation takes the following form,
where is a constant, and
Proposition 3.
The Rate of Convergence of CDTSMRE in Diffusion Approximation with Equilibrium has the following form,
where is a constant and
The proofs of Propositions 2 and 3 above are similar to the proof of Proposition 1. In what follows, we give some rate of convergence results (Corollaries 1 and 2) concerning applications.
Corollary 1.
The Rate of Convergence in the Limit Theorems for CAF:
- Rate of Convergence in Averaging:
where is a constant, and
- Rate of Convergence in Diffusion Approximation
where is a constant, and
- Rate of Convergence in diffusion approximation with equilibrium for CAF
where is a constant, and
Corollary 2.
The Rate of Convergence in the Limit Theorems for CGMRP:
- Rate of Convergence in Averaging
where is a constant, and
- Rate of Convergence in Diffusion Approximation
where is a constant, and
- Rate of Convergence in diffusion approximation with equilibrium
where is a constant, and
6. Proofs
The proofs here follow almost the same general construction scheme as in our paper [20], except that we also consider the control process. Let be the space of B-valued continuous functions defined on .
6.1. Proof of Theorem 1
The proof of the relative compactness of the CDTSMRE in the averaging approximation is based on the following four lemmas.
The CDTSMRE , see (3), is weakly compact in , with limit points in .
Lemma 1.
Under Assumptions A1–A7, the limit points of , , as , belong to .
Proof.
Assumptions A5–A6 imply that the discrete-time semi-Markov random evolution is a contractive operator in H and, therefore, is a supermartingale for any , where is the norm in the Hilbert space H ([4,9]). Obviously, the same properties are satisfied by the family . Using Doob's inequality for the supermartingale , we obtain
where is a compact set in B and is an arbitrarily small number. It means that the sequence is tight in . Taking into account conditions A1–A6, we obtain that the discrete-time semi-Markov random evolution is weakly compact in with limit points in
Let , and let be a compact set from compact containment condition . It is sufficient to show that weakly converges to zero. This is equivalent to the convergence of in probability as .
From the very definition of and A3, we obtain
where is the indicator of the set , and is the finite -set for . Then, for , we have
where , and
.
It is worth noticing that the operator is bounded when . The same holds for when .
Letting both and go to 0, we obtain the proof of this lemma. □
Let us now consider the continuous time martingale
Lemma 2.
The process
is an -martingale.
Proof.
As is a martingale, is an -martingale. Here, we have , which can be easily checked. □
Lemma 3.
The family is relatively compact for all , dual of the space .
Proof.
Let
Then,
As long as we obtain
Then,
as is bounded for any .
It means that the family , is relatively compact for any . □
Lemma 4.
The family is relatively compact for any , and any .
Proof.
It is worth noticing that the martingale can be represented in the form of the martingale differences
Then, using the equality
we get
for any . Now, from the above, we get
which proves the lemma. □
Now the proof of Theorem 1 is achieved as follows.
From Lemmas 2–4 and the representation (17) it follows that the family is relatively compact for any , and any .
Moreover, let , , be a family of perturbed operators defined on B as follows,
Then, the process
is an -martingale.
The following singular perturbation problem, for the non-negligible part of compensating operator, , denoted by ,
on the test functions , has the solution (see [3] Proposition 5.1): , , with , , and .
The limit operator is then given by
from which we get the contracted limit operator
We note that martingale has the following asymptotic representation,
where , as . The families and are weakly compact for all in a dense subset . It means that the family is also weakly compact. In this way, the sum converges, as , to the integral . The quadratic variation of the martingale tends to zero when ; thus, when , for any and for any . Passing to the limit in (23), when , we get , where is defined in (6).
The quadratic variation of the martingale , in the average approximation, is
where . Hence
and
Therefore,
Now, from (24) and (26), and from the boundedness of all operators in (26) with respect to , it follows that goes to 0 when , and the quadratic variation of the limit process , for the martingale , equals 0.
In this case, the limit martingale equals 0. Therefore, the limit equation for has the form (6). As the solution of the martingale problem for the operator is unique, it follows that the solution of Equation (6) is unique as well [49,50]. It is worth noticing that the operator is a first-order operator (, see (22)). Finally, the operator generates a semigroup; then , and the latter representation is unique.
6.2. Proof of Theorem 2
We can prove the relative compactness of the family in exactly the same way, following the same steps as above. However, in the case of the diffusion approximation, the limit continuous martingale for the martingale has a quadratic variation that is not zero, that is,
and so , for .
Moreover, the operator defined in Theorem 2 is a second-order operator, as it contains the operators and ; compare with the first-order operator in (7).
Let , , be a family of perturbed operators defined on B as follows,
Then, the process
is an -martingale with mean value zero.
For the non-negligible part of compensating operator, , denoted by , consider the following singular perturbation problem,
where . The solution of this problem is realized by the vectors (see in [3], Proposition 5.2)
with . Finally, the negligible term is
Of course, .
Now the limit operator is given by
from which, the contracted operator on the null space is
Moreover, due to the balance condition (8) we get the limit operator.
It is worth noticing that Assumptions A5–A7 and D1–D3 imply that the discrete-time semi-Markov random evolution is a contractive operator in H and, therefore, is a supermartingale for any , where is the norm in the Hilbert space H ([4,9]). By Doob's inequality for the supermartingale , we obtain
where is a compact set in B and is any positive small real number.
We conclude that under Assumptions A5–A7 and D1–D3, the family is tight and is weakly compact in with limit points in
Moreover, under Assumptions A5–A6 and D1–D2, the martingale has the following asymptotic presentation:
where , as . The families and are weakly compact for all and . It means that is also weakly compact and has a limit.
Let us denote the previous limit by ; then the sum converges to the integral . Let also be the limit martingale for when . Then, from the previous steps and (32), we obtain
As long as martingale has mean value zero, the martingale has also mean value zero. If we take the mean value from both parts of (33) we get
or, solving it, we get
The last equality means that the operator generates a semigroup, namely, . Now, the uniqueness of the limit evolution in the diffusion approximation follows from the uniqueness of the solution of the martingale problem for (uniqueness of the limit process under weak compactness). As the solution of the martingale problem for the operator is unique, it follows that the solution of Equation (34) is unique as well [49,50].
6.3. Proof of Theorem 3
We note that in (12) has the following presentation,
As the balance condition holds, we apply the diffusion approximation algorithm (see Section 3.2) to the right-hand side of (36) with the operators and instead of . It is worth mentioning that the family is weakly compact, and the result is proved (see Section 6.1 and Section 6.2).
6.4. Proof of Proposition 1
The proof of this proposition is based on the estimation of
for any , where .
We note that
As long as , , Equation (37) has the solution in domain , .
In this way,
where is a potential operator of .
From here we obtain
as are contractive operators.
We note also that
where , . This follows from standard argument about the convergence of Riemann sums in Bochner integral (see Lemma 4.14, p. 161, [4]).
We note that
where we applied representation .
We also note that satisfies the equation
Let us introduce the following martingale,
This is a zero-mean-value martingale,
which comes directly from (43).
Again, from (43), we get the following asymptotic representation
where , as , for any .
Now, from Equation (6) and expressions (44) and (45), we obtain the following representation
where .
Let us estimate in (46).
where is a compact set, because satisfies the compactness condition for any and any k.
In this way, we get from (46) that
Finally, from inequalities (38)–(41) and from (47)–(48), we obtain the desired rate of convergence of the CDTSMRE in the averaging scheme
where the constant
and is defined in (48). This completes the proof of Proposition 1.
Remark 2.
In a similar way, we can obtain the rate of convergence results in diffusion approximation (see Propositions 2–3).
7. Concluding Remarks and Future Work
In this paper, we introduced controlled semi-Markov random evolutions in discrete time in a Banach space. The main results concerned time-rescaled limit theorems, namely, averaging, diffusion approximation, and diffusion approximation with equilibrium, obtained by the martingale weak convergence method. We applied these results to several important families of stochastic systems, namely, controlled additive functionals, controlled geometric Markov renewal processes, and controlled dynamical systems. We provided dynamic programming principles for discrete-time dynamical systems such as controlled additive functionals and controlled geometric Markov renewal processes. We also derived dynamic programming equations (Hamilton–Jacobi–Bellman equations) for the limiting processes in the diffusion approximation, such as the CAF, CGMRP, and CDS. As an example, we considered the solution of Merton's portfolio optimization problem for the limiting CGMRP in the DA scheme. We also pointed out the importance of convergence rates and obtained them in the limit theorems for the CDTSMRE, CAF, CGMRP, and CDS.
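As a concrete illustration of the Merton solution mentioned above, the classical closed form for CRRA (power) utility can be sketched as follows. This is a generic sketch of Merton's 1971 result [25], not the paper's own derivation; the parameter names (`mu`, `r`, `sigma`, `gamma`) and the numerical values are hypothetical, standing for the drift and volatility of the limiting diffusion, the risk-free rate, and the investor's relative risk aversion.

```python
def merton_fraction(mu: float, r: float, sigma: float, gamma: float) -> float:
    """Optimal constant fraction of wealth in the risky asset under CRRA
    utility (Merton, 1971): pi* = (mu - r) / (gamma * sigma**2)."""
    if sigma <= 0 or gamma <= 0:
        raise ValueError("sigma and gamma must be positive")
    return (mu - r) / (gamma * sigma ** 2)


if __name__ == "__main__":
    # Hypothetical parameters: 8% drift, 2% risk-free rate,
    # 20% volatility, relative risk aversion 2.
    pi_star = merton_fraction(mu=0.08, r=0.02, sigma=0.20, gamma=2.0)
    print(round(pi_star, 4))  # 0.75
```

Note that the optimal fraction is independent of wealth and of the investment horizon, which is the hallmark of the CRRA case.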
Future work will address optimal control for the initial, not limiting, models, such as the CAF in Section 4.1, the CGMRP in Section 4.2, and the CDS in Section 4.3. Other optimal control problems would also be interesting to consider for the diffusion models with equilibrium, e.g., the CAF in Section 4.1 and the CGMRP in Section 4.2. In our future work, the latter models will be considered for the solution of Merton portfolio problems as well. We will also consider in our future research the case of dependent SMC and MC .
Author Contributions
These authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.
Funding
The research of the first author is partially supported by NSERC.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
Not applicable.
Acknowledgments
We thank the four anonymous referees for their valuable remarks and suggestions, which improved the paper. The research of the first author is partially supported by NSERC. The first author also thanks the Laboratory of Applied Mathematics of the Université de Technologie de Compiègne, Compiègne, France, for its hospitality during his visit.
Conflicts of Interest
The authors declare no conflict of interest.
Abbreviations
| SMC | Discrete-time semi-Markov chain; |
| DTSMRE | Discrete-time semi-Markov random evolution; |
| CDTSMRE | Controlled discrete-time semi-Markov random evolution; |
| CGMRP | Controlled geometric Markov renewal processes; |
| CAF | Controlled additive functionals; |
| CDS | Controlled dynamical systems; |
| HJB | Hamilton–Jacobi–Bellman (equation); |
| DPE | Dynamic programming equation; |
| DPP | Dynamic programming principle; |
| DA | Diffusion approximation; |
| SDE | Stochastic differential equation. |
References
- Griego, R.; Hersh, R. Random evolutions, Markov chains, and systems of partial differential equations. Proc. Natl. Acad. Sci. USA 1969, 62, 305–308. [Google Scholar] [CrossRef] [PubMed]
- Hersh, R. Random evolutions: A Survey of results and problems. Rocky Mountain J. Math. 1974, 4, 443–477. [Google Scholar] [CrossRef]
- Koroliuk, V.S.; Limnios, N. Stochastic Systems in Merging Phase Space; World Scientific: Singapore, 2005. [Google Scholar]
- Korolyuk, V.S.; Swishchuk, A. Evolution of System in Random Media; CRC Press: Boca Raton, FL, USA, 1995. [Google Scholar]
- Swishchuk, A.; Wu, J. Evolution of Biological Systems in Random Media: Limit Theorems and Stability; Kluwer: Dordrecht, The Netherlands, 2003. [Google Scholar]
- Swishchuk, A.V. Random Evolutions and their Applications; Kluwer AP: Dordrecht, The Netherlands, 1995. [Google Scholar]
- Cohen, J.E. Random Evolutions in Discrete and Continuous Time. Stoch. Proc. Appl. 1979, 9, 245–251. [Google Scholar] [CrossRef]
- Keepler, M. Random evolutions processes induced by discrete time Markov chains. Port. Math. 1998, 55, 391–400. [Google Scholar]
- Limnios, N. Discrete-time random evolutions—Difference equations and additive functionals. Commun. Statist. Theor. Methods 2011, 40, 1–11. [Google Scholar] [CrossRef]
- Yin, G.G.; Zhang, Q. Discrete-Time Markov Chains. Two-Time-Scale Methods and Applications; Springer: New York, NY, USA, 2005. [Google Scholar]
- Anisimov, V.V. Switching processes: Averaging principle, diffusion approximation and applications. Acta Appl. Math. 1995, 40, 95–141. [Google Scholar] [CrossRef]
- Anisimov, V.V. Switching Processes in Queueing Models; J. Wiley & Sons: London, UK; ISTE: London, UK, 2008. [Google Scholar]
- Anisimov, V.V. Averaging principle for switching recurrent sequences. Theory Probab. Math. Stat. 1991, 45, 1–8. [Google Scholar]
- Anisimov, V.V. Averaging principle for switching processes. Theory Probab. Math. Stat. 1992, 46, 1–10. [Google Scholar]
- Swishchuk, A.V.; Islam, M.S. The Geometric Markov Renewal Processes with application to Finance. Stoch. Anal. Appl. 2010, 29, 4. [Google Scholar] [CrossRef]
- Swishchuk, A.V.; Islam, M.S. Diffusion Approximations of the Geometric Markov Renewal Processes and option price formulas. Int. J. Stoch. Anal. 2010, 2010, 347105. [Google Scholar] [CrossRef]
- Swishchuk, A.V.; Islam, M.S. Normal Deviation and Poisson Approximation of GMRP. Communic. Stat. Theory Methods 2013, 1488–1501. [Google Scholar] [CrossRef]
- Swishchuk, A.V.; Limnios, N. Optimal stopping of GMRP and pricing of European and American options. In Proceedings of the 15th Intern. Congress on Insurance: Mathematics and Economics (IME 2011), Trieste, Italy, 14–17 June 2011. [Google Scholar]
- Barbu, V.; Limnios, N. Semi-Markov Chains and Hidden Semi-Markov Models. Toward Applications. Their Use in Reliability and DNA Analysis; Lecture Notes in Statistics; Springer: New York, NY, USA, 2008; Volume 191. [Google Scholar]
- Limnios, N.; Swishchuk, A. Discrete-time Semi-Markov Random Evolutions and their Applications. Adv. Appl. Probab. 2013, 45, 214–240. [Google Scholar] [CrossRef]
- Jacod, J.; Shiryaev, A.N. Limit Theorems for Stochastic Processes; Springer: Berlin/Heidelberg, Germany, 1987. [Google Scholar]
- Bertsekas, D.; Shreve, S. Stochastic Optimal Control: The Discrete-Time Case; Athena Scientific Publisher: Belmont, MA, USA, 1996. [Google Scholar]
- Fleming, W.H.; Rishel, R.W. Deterministic and Stochastic Optimal Control; Springer: New York, NY, USA, 1975. [Google Scholar]
- Kushner, H.J. Introduction to Stochastic Control Theory; Holt, Rinehart, Winston: New York, NY, USA, 1971. [Google Scholar]
- Merton, R. Optimum consumption and portfolio rules in a continuous-time model. J. Econ. Theory 1971, 3, 373–413. [Google Scholar] [CrossRef]
- Sviridenko, M.N. Martingale approach to limit theorems for semi-Markov processes. Theor. Probab. Appl. 1986, 34, 540–545. [Google Scholar] [CrossRef]
- Adams, R. Sobolev Spaces; Academic Press: New York, NY, USA, 1979. [Google Scholar]
- Rudin, W. Functional Analysis; McGraw-Hill: New York, NY, USA, 1991. [Google Scholar]
- Sobolev, S. Some Applications of Functional Analysis in Mathematical Physics, 3rd ed.; American Mathematical Society: Providence, RI, USA, 1991; Volume 90. [Google Scholar]
- Pyke, R. Markov renewal processes: Definitions and preliminary properties. Ann. Math. Statist. 1961, 32, 1231–1242. [Google Scholar] [CrossRef]
- Pyke, R. Markov renewal processes with finitely many states. Ann. Math. Statist. 1961, 32, 1243–1259. [Google Scholar] [CrossRef]
- Pyke, R.; Schaufele, R. Limit theorems for Markov renewal processes. Ann. Math. Statist. 1964, 35, 1746–1764. [Google Scholar] [CrossRef]
- Shurenkov, V.M. On the theory of Markov renewal. Theor. Probab. Appl. 1984, 19, 247–265. [Google Scholar] [CrossRef]
- Maxwell, M.; Woodroofe, M. Central limit theorems for additive functionals of Markov chains. Ann. Probab. 2000, 28, 713–724. [Google Scholar]
- Nummelin, E. General Irreducible Markov Chains and Non-Negative Operators; Cambridge University Press: Cambridge, UK, 1984. [Google Scholar]
- Revuz, D. Markov Chains; North-Holland: Amsterdam, The Netherlands, 1975. [Google Scholar]
- Skorokhod, A.V. Asymptotic Methods in the Theory of Stochastic Differential Equations; AMS: Providence, RI, USA, 1989; Volume 78. [Google Scholar]
- Skorokhod, A.V.; Hoppensteadt, F.C.; Salehi, H. Random Perturbation Methods with Applications in Science and Engineering; Springer: New York, NY, USA, 2002. [Google Scholar]
- Jaśkiewicz, A.; Nowak, A. Average optimality for semi-Markov control processes. Morfismos 2007, 11, 15–36. [Google Scholar]
- Vega-Amaya, O.; Luque-Vásquez, F. Sample-path average cost optimality for semi-Markov control processes on Borel spaces: Unbounded cost and mean holding times. Appl. Math. 2000, 27, 343–367. [Google Scholar] [CrossRef]
- Altman, E.; Shwartz, A. Markov decision problems and state-action frequencies. SIAM J. Control Optim. 1991, 29, 786–809. [Google Scholar] [CrossRef]
- Beutler, F.; Ross, K. Optimal policies for controlled Markov chains with a constraint. J. Mathem. Analysis Appl. 1985, 112, 236–252. [Google Scholar] [CrossRef]
- Borkar, V. Dynamic programming for ergodic control with partial observations. Stoch. Proc. Appl. 2003, 103, 293–310. [Google Scholar] [CrossRef]
- Boussement, M.; Limnios, N. Markov decision processes with asymptotic average failure rate constraint. Communic. Statis. Theory Methods 2004, 33, 1689–1714. [Google Scholar] [CrossRef]
- Hernández-Lerma, O.; Lasserre, J.B. Discrete-Time Markov Control Processes; Springer: New York, NY, USA, 1996. [Google Scholar]
- Krylov, N.; Bogolyubov, N. Introduction to Non-linear Mechanics; Princeton University Press: Princeton, NJ, USA, 1947. [Google Scholar]
- Limnios, N.; Oprişan, G. Semi-Markov Processes and Reliability; Birkhäuser: Boston, MA, USA, 2001. [Google Scholar]
- Cartea, A.; Jaimungal, S.; Penalva, J. Algorithmic and High-Frequency Trading; Cambridge University Press: Cambridge, UK, 2015. [Google Scholar]
- Ethier, S.N.; Kurtz, T.G. Markov Processes: Characterization and Convergence; J. Wiley: New York, NY, USA, 1986. [Google Scholar]
- Stroock, D.W.; Varadhan, S.R.S. Multidimensional Diffusion Processes; Springer: Berlin/Heidelberg, Germany, 1979. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).