Optimal Strategy of the Dynamic Mean-Variance Problem for Pairs Trading under a Fast Mean-Reverting Stochastic Volatility Model

Zhang, Yaoyuan; Xiong, Dewen

doi:10.3390/math11092191


by Yaoyuan Zhang and Dewen Xiong *

School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai 200240, China

* Author to whom correspondence should be addressed.

Mathematics 2023, 11(9), 2191; https://doi.org/10.3390/math11092191

Submission received: 8 March 2023 / Revised: 13 April 2023 / Accepted: 4 May 2023 / Published: 6 May 2023

(This article belongs to the Section Computational and Applied Mathematics)


Abstract:

We discuss the dynamic mean-variance (MV) problem for pairs trading with the assumptions that one of the security prices satisfies a stochastic volatility model (SVM) and the corresponding price spread follows an Ornstein–Uhlenbeck (OU) process. We provide a semi-closed-form of the optimal strategy based on the solution of a PDE, which is difficult to solve explicitly. Thus, we assume that one of the security prices satisfies the Scott model, a fast-mean-reverting volatility model, and give a closed-form approximation for the optimal strategy. Empirical studies, by using historical data from Chinese security markets, show that the Scott model produces a more stable strategy by better capturing mean-reverting volatility.

Keywords:

stochastic volatility; Ornstein–Uhlenbeck process; asymptotic analysis; dynamic mean-variance problem; pairs trading

MSC:

91G10; 91G80; 93E20; 35C20

1. Introduction

Pairs trading is a statistical arbitrage strategy that emerged from a Morgan Stanley quantitative group in the 1980s. Investors choose a pair of highly correlated securities, buy the relatively under-priced security, and simultaneously sell the relatively over-priced security, expecting to profit when the price spread reverts to its mean. In this paper, we discuss the dynamic mean-variance (MV) problem for pairs trading in the Scott model, a fast-mean-reverting volatility model.

The cointegration approach is popular in pairs trading and was proposed by Vidyamurthy [1]. Vidyamurthy assessed the co-movement of securities through cointegration testing and designed a trading rule based on a simple nonparametric threshold. Lin et al. [2] studied the optimal trading threshold problem by introducing a minimum profit condition for a cointegrated pair of stocks. In a recent work, Yan et al. [3] discussed pairs trading under a delayed cointegration model. In the cointegration framework, it is very important to model the price spread. Elliott et al. [4] described the spread using a mean-reverting Gaussian Markov chain model and developed an analytical framework for pairs trading strategies. Bertram [5] developed a statistical arbitrage model for the spread of two log price series under the assumption of an OU process. Many authors have viewed the optimal pairs trading problem as a stochastic control problem and have approached it by maximizing a variety of utility functions. Jurek and Yang [6] discussed asset allocation strategies between a mean-reverting arbitrage opportunity described by an OU process and a risk-free asset and derived a closed-form optimal allocation for CRRA utility over a finite time horizon. Suzuki [7,8] and Endres and Stübinger [9] solved an optimal regime switching problem with the constraints of finite transaction times and transaction fees. Liu and Timmermann [10] derived a closed-form optimal strategy based on the cointegration assumption with power utility over terminal wealth. Chiu and Wong [11] assumed that the log prices of the risky assets satisfy a linear stochastic differential equation with a constant matrix of cointegration coefficients. They considered the time-consistent dynamic mean-variance problem (according to dynamic programming) and provided a closed-form optimal strategy. Recently, Zhu et al. [12] assumed that the price spread follows an OU process and that one of the corresponding securities satisfies the geometric Brownian motion (GBM) model with constant volatility. They considered the time-consistent mean-variance problem (see Björk et al. [13]) for pairs trading and provided a closed-form optimal strategy.

However, in all of the models mentioned above, the drift and the volatility of the (log) price processes are deterministic functions or constants, which cannot capture stochastic volatility characteristics observed in many markets, such as mean reversion; see Cont [14,15], Teräsvirta and Zhao [16], and Gatheral et al. [17] for empirical evidence. Therefore, in this paper, we assume that the price spread follows an OU process and that the price of one of the securities satisfies a stochastic volatility model (SVM), and we discuss the dynamic mean-variance problem in the sense of Björk et al. [13]. Because an SVM describes price dynamics better, it is more likely to yield strategies that control trading risk more precisely and bring about greater utility. We first characterize the optimal strategy for pairs trading under a general SVM through a PDE. Then, we specify a fast mean-reverting stochastic volatility model, the Scott model [18], and discuss the approximate optimal strategy.

The main contributions of this paper are as follows: First, we provide a semi-closed-form of the optimal strategy under a general SVM based on the solution of a PDE. Second, by using the asymptotic analysis technique, we provide a closed-form approximation of the optimal strategy under a fast mean-reverting volatility model that captures the mean-reverting property of volatility. Our approximate formula is proven to have an error of order √ϵ, where ϵ is the reciprocal of the volatility mean-reversion speed, and it is far cheaper to evaluate than the traditional finite difference method (FDM), which is of great practical value. Finally, we calibrate the model parameters using securities data from the Chinese stock markets and demonstrate the effect of our approximate optimal strategy by comparing it with the optimal strategy described in Zhu et al. [12]. Empirical studies show that the Scott model can produce a more stable strategy by better capturing mean-reverting volatility.

The remainder of this paper is organized as follows: In Section 2, we describe the model specifications as well as the optimal dynamic MV problem and provide the semi-closed-form optimal strategy based on a PDE. In Section 3, we provide a closed-form approximation of the optimal strategy for the Scott volatility model. In Section 4, we validate the trading strategy empirically using Chinese securities market data. Section 5 concludes the paper.

2. The Dynamic Mean-Variance Problem for a General Stochastic Volatility Model

In this section, we set up the dynamic mean-variance (MV) problem for pairs trading under a general SVM. Since the MV problem is time-inconsistent, we discuss the optimal strategy according to the definition of the equilibrium strategy introduced by Björk et al. [13], which transforms the dynamic MV problem into a non-cooperative game and seeks its Nash equilibrium.

Assume that

(Ω, F, P)

is a complete probability space and that

W_{1} = {W_{1} (t); t \in [0, T]}

,

W_{2} = {W_{2} (t); t \in [0, T]}

and

W_{3} = {W_{3} (t); t \in [0, T]}

are Brownian motions with

d W_{1} (t) d W_{2} (t) = ρ_{12} d t, d W_{1} (t) d W_{3} (t) = ρ_{13} d t, d W_{2} (t) d W_{3} (t) = ρ_{23} d t .

We assume that

F = {F_{t}; t \in [0, T]}

is the filtration generated by

W_{1}

,

W_{2}

and

W_{3}

. The conditional expectation and conditional variance with respect to

F_{t}

are denoted as

E_{t} (\cdot)

and

{Var}_{t} (\cdot)

.

We assume that there is a pair of cointegrated securities denoted as P and Q. The price processes of P and Q are denoted by

P_{t}

and

Q_{t}

, and there is a tradable risk-free asset

Π

whose price process is denoted by

Π_{t}

. Furthermore, we also assume that the market is frictionless, i.e., there are no transaction costs and taxes, and that short selling is allowed.

Assume that the dynamic of the price

Q_{t}

satisfies the following stochastic volatility model:

\begin{matrix} d Q_{t} & = ξ Q_{t} d t + γ (y_{t}) Q_{t} d W_{1} (t), \\ d y_{t} & = α (y_{t}) d t + β (y_{t}) d W_{2} (t), \end{matrix}

(1)

where

ξ

is a constant. We assume that the spread of the log-prices of P and Q follows an OU process. Let

X_{t} = ln (P_{t}) - ln (Q_{t})

be the spread of the log-prices; then

X_{t}

satisfies the following SDE:

d X_{t} = κ (θ - X_{t}) d t + η d W_{3} (t),

(2)

where

κ

,

θ

, and

η

are all constants. The dynamic of the risk-free asset

Π_{t}

is given by

d Π_{t} = r Π_{t} d t .
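As an illustration of the spread dynamics (2), the following minimal sketch simulates one path of the OU spread. It is not taken from the paper: the function name `simulate_ou_spread` and its arguments are hypothetical, and it uses the exact Gaussian transition of the OU process (a standard fact) rather than any scheme described by the authors. It assumes κ > 0.

```python
import math
import random


def simulate_ou_spread(x0, kappa, theta, eta, dt, n_steps, rng=None):
    """Simulate dX = kappa * (theta - X) dt + eta dW on a grid of step dt.

    Hypothetical helper: it uses the exact Gaussian transition of the OU
    process, so there is no time-discretisation bias. Requires kappa > 0.
    """
    rng = rng or random.Random(0)
    decay = math.exp(-kappa * dt)
    # Standard deviation of X_{t+dt} conditional on X_t.
    step_std = eta * math.sqrt((1.0 - decay ** 2) / (2.0 * kappa))
    path = [x0]
    for _ in range(n_steps):
        cond_mean = theta + (path[-1] - theta) * decay
        path.append(cond_mean + step_std * rng.gauss(0.0, 1.0))
    return path
```

With η = 0 the path reduces to the deterministic relaxation θ + (x0 − θ)e^{−κt}, which is a convenient sanity check for the helper.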

Remark 1.

Since

P_{t} = Q_{t} e^{X_{t}}

, according to Itô’s formula, one can see that

P_{t}

satisfies the following SDE:

\begin{matrix} d P_{t} = & P_{t} [κ (θ - X_{t}) + ξ + \frac{1}{2} η^{2} + ρ_{13} η γ (y_{t})] d t \\ + P_{t} γ (y_{t}) d W_{1} (t) + P_{t} η d W_{3} (t) . \end{matrix}

(3)

We denote

h_{t}

as the portfolio weight at time t in a symmetric pairs trading strategy, held long in P and short in Q (so that the weights in P and Q are h_t and -h_t, respectively); the corresponding wealth process

V_{t}^{h}

is given by

d V_{t}^{h} = V_{t}^{h} (h_{t} \frac{d P_{t}}{P_{t}} - h_{t} \frac{d Q_{t}}{Q_{t}} + \frac{d Π_{t}}{Π_{t}}) .

(4)

Substituting (1) and (3) into (4), one can see that

d V_{t}^{h} = V_{t}^{h} h_{t} [(κ (θ - X_{t}) + \frac{1}{2} η^{2} + ρ_{13} η γ (y_{t})) d t + η d W_{3} (t)] + V_{t}^{h} r d t .

(5)

Let

π_{t} : = e^{- r t} V_{t}^{h} h_{t}

be the discounted money invested in the security P, which can be viewed as a strategy; then, the discounted wealth process

{\overset{\underset{̲}{}}{V}}_{t} (π) : = V_{t}^{h} e^{- r t}

is given by

d {\overset{\underset{̲}{}}{V}}_{t} (π) = π_{t} [(κ (θ - X_{t}) + \frac{1}{2} η^{2} + ρ_{13} η γ (y_{t}))] d t + π_{t} η d W_{3} (t) .

Assume that the discounted wealth at time

t \in [0, T)

is

{\overset{\underset{̲}{}}{V}}_{t}

, then

{\overset{\underset{̲}{}}{V}}_{T} (π) = {\overset{\underset{̲}{}}{V}}_{t} + \int_{t}^{T} π_{u} (κ (θ - X_{u}) + \frac{1}{2} η^{2} + ρ_{13} η γ (y_{u})) d u + \int_{t}^{T} π_{u} η d W_{3} (u) .

(6)

Let

J (t, {\overset{\underset{̲}{}}{V}}_{t}; π) : = E_{t} ({\overset{\underset{̲}{}}{V}}_{T} (π)) - λ {Var}_{t} ({\overset{\underset{̲}{}}{V}}_{T} (π)),

(7)

where

λ > 0

. We consider the following dynamic MV problem:

J (t, {\overset{\underset{̲}{}}{V}}_{t}) = sup_{π_{u}; u \in [t, T]} J (t, {\overset{\underset{̲}{}}{V}}_{t}; π) .

(8)

Because of the time-inconsistency of the MV problem (8), we introduce the optimal strategy according to the definition of equilibrium strategy provided in Björk et al. [13].

Definition 1.

The strategy

π^{*} = {π_{u}^{*}; u \in [0, T]}

is called an optimal strategy if for any perturbation

π_{u}^{\hat{π}, ε} = {\hat{π}}_{u} I_{u \in [t, t + ε)} + π_{u}^{*} I_{u \in [t + ε, T]}

,

\underset{ε \to 0}{lim sup} \frac{1}{ε} \{J (t, {\overset{\underset{̲}{}}{V}}_{t} (π^{*}); π^{\hat{π}, ε}) - J (t, {\overset{\underset{̲}{}}{V}}_{t} (π^{*}); π^{*})\} \leq 0

holds for any

t \in [0, T]

.

We have the following theorem.

Theorem 1

(Main result I). Define

M (x, y) : = κ (θ - x) + \frac{1}{2} η^{2} + ρ_{13} η γ (y)

. The optimal strategy for the dynamic MV problem (8) is given by

π_{t}^{*} = \frac{1}{2 λ η^{2}} M (X_{t}, y_{t}) - f_{x} (t, X_{t}, y_{t}) - \frac{ρ_{23} β (y_{t})}{η} f_{y} (t, X_{t}, y_{t}),

(9)

where

f (t, x, y)

is a solution to the following equation:

\begin{matrix} 0 = & \frac{1}{2 λ η^{2}} M^{2} (x, y) + f_{t} (t, x, y) - f_{x} (t, x, y) (\frac{1}{2} η^{2} + ρ_{13} η γ (y)) \\ & - \frac{ρ_{23} β (y)}{η} M (x, y) f_{y} (t, x, y) + α (y) f_{y} (t, x, y) \\ & + \frac{1}{2} η^{2} f_{x x} (t, x, y) + \frac{1}{2} β^{2} (y) f_{y y} (t, x, y) + ρ_{23} η β (y) f_{x y} (t, x, y), \end{matrix}

(10)

where

(x, y, t) \in R \times R \times (0, T]

with the terminal condition

f (T, \cdot, \cdot) = 0

.
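Formula (9) is a pointwise expression once the derivatives of f are known. The sketch below evaluates it under stated assumptions: `f_x` and `f_y` are numbers supplied by the caller (for instance from a numerical solution of (10), which is not provided here), `beta` and `gamma` are user-supplied callables, and all function names are hypothetical.

```python
import math


def drift_m(x, y, kappa, theta, eta, rho13, gamma):
    """M(x, y) = kappa * (theta - x) + eta^2 / 2 + rho13 * eta * gamma(y)."""
    return kappa * (theta - x) + 0.5 * eta ** 2 + rho13 * eta * gamma(y)


def equilibrium_strategy(x, y, f_x, f_y, lam, kappa, theta, eta,
                         rho13, rho23, beta, gamma):
    """Evaluate formula (9) at one state (x, y).

    f_x and f_y are the partial derivatives of f at (t, x, y); they are
    assumed to come from a separate PDE solver that is not shown here.
    """
    m = drift_m(x, y, kappa, theta, eta, rho13, gamma)
    myopic = m / (2.0 * lam * eta ** 2)        # first term of (9)
    hedge_x = f_x                              # correction from f_x
    hedge_y = rho23 * beta(y) / eta * f_y      # correction from f_y
    return myopic - hedge_x - hedge_y
```

For example, with κ = 1, θ = 0, η = 0.2, ρ_13 = 0, x = −0.1 and λ = 0.5, the first term equals 0.12 / 0.04 = 3, and the two corrections are subtracted from it.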

Lemma 1.

Let

π^{*}

be the strategy given in Theorem 1 and

f (t, x, y)

be a solution of the Equation (10); then,

f (t, X_{t}, y_{t}) = E_{t} [\int_{t}^{T} π_{u}^{*} (κ (θ - X_{u}) + \frac{1}{2} η^{2} + ρ_{13} η γ (y_{u})) d u] .

(11)

Proof.

Let

F_{t} : = f (t, X_{t}, y_{t})

. It follows from Itô’s formula that

\begin{matrix} d F_{t} = & [f_{t} + f_{x} κ (θ - X_{t}) + f_{y} α (y_{t}) + \frac{1}{2} f_{x x} η^{2} + \frac{1}{2} f_{y y} β^{2} (y_{t}) + f_{x y} η β (y_{t}) ρ_{23}] d t \\ + f_{x} η d W_{3} (t) + f_{y} β (y_{t}) d W_{2} (t) . \end{matrix}

Let

M_{t} : = F_{t} + \int_{0}^{t} π_{u}^{*} M (X_{u}, y_{u}) d u,

then

\begin{matrix} d M_{t} = & π_{t}^{*} M (X_{t}, y_{t}) d t + d F_{t} \\ = & [\frac{1}{2 λ η^{2}} M {(X_{t}, y_{t})}^{2} - f_{x} (t, X_{t}, y_{t}) (M (X_{t}, y_{t}) - κ (θ - X_{t})) \\ - \frac{ρ_{23} β (y_{t})}{η} f_{y} (t, X_{t}, y_{t}) M (X_{t}, y_{t}) + f_{t} (t, X_{t}, y_{t}) + f_{y} (t, X_{t}, y_{t}) α (y_{t}) \\ + \frac{1}{2} f_{x x} (t, X_{t}, y_{t}) η^{2} + \frac{1}{2} f_{y y} (t, X_{t}, y_{t}) β^{2} (y_{t}) + f_{x y} (t, X_{t}, y_{t}) η β (y_{t}) ρ_{23}] d t \\ + f_{x} (t, X_{t}, y_{t}) η d W_{3} (t) + f_{y} (t, X_{t}, y_{t}) β (y_{t}) d W_{2} (t) \\ = & f_{x} (t, X_{t}, y_{t}) η d W_{3} (t) + f_{y} (t, X_{t}, y_{t}) β (y_{t}) d W_{2} (t) . \end{matrix}

One can see that

M_{t}

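is a martingale. Since f (T, \cdot, \cdot) = 0, we have M_{T} = \int_{0}^{T} π_{u}^{*} M (X_{u}, y_{u}) d u, and taking conditional expectations gives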

F_{t} = f (t, X_{t}, y_{t}) = E_{t} [\int_{t}^{T} π_{u}^{*} M (X_{u}, y_{u}) d u],

which implies (11). □

Proof of Theorem 1.

For the given

ε > 0

, let

π_{u}^{\hat{π}, ε} : = {\hat{π}}_{u} I_{u \in [t, t + ε)} + π_{u}^{*} I_{u \in [t + ε, T]}

be any perturbation of

π^{*}

. For any strategy

π

, introduce

Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (π) = {\overset{\underset{̲}{}}{V}}_{t + ε} (π) - {\overset{\underset{̲}{}}{V}}_{t} (π), Δ_{ε} f_{t} = f (t + ε, X_{t + ε}, y_{t + ε}) - f (t, X_{t}, y_{t}) .

Since

π^{*}

is not dependent on the corresponding discounted wealth process

\overset{\underset{̲}{}}{V} (π^{*})

, it follows from (6) that

\begin{matrix} {\overset{\underset{̲}{}}{V}}_{T} (π^{\hat{π}, ε}) & = {\overset{\underset{̲}{}}{V}}_{t} (π^{*}) + \{{\overset{\underset{̲}{}}{V}}_{t + ε} (\hat{π}) - {\overset{\underset{̲}{}}{V}}_{t} (\hat{π})\} + \{{\overset{\underset{̲}{}}{V}}_{T} (π^{*}) - {\overset{\underset{̲}{}}{V}}_{t + ε} (π^{*})\} \\ & = {\overset{\underset{̲}{}}{V}}_{T} (π^{*}) + Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π} - π^{*}) . \end{matrix}

Because

\begin{matrix} {Var}_{t} ({\overset{\underset{̲}{}}{V}}_{T} (π^{\hat{π}, ε})) = E_{t} ({Var}_{t + ε} ({\overset{\underset{̲}{}}{V}}_{T} (π^{\hat{π}, ε}))) + {Var}_{t} (E_{t + ε} ({\overset{\underset{̲}{}}{V}}_{T} (π^{\hat{π}, ε}))) \\ = E_{t} ({Var}_{t + ε} ({\overset{\underset{̲}{}}{V}}_{T} (π^{*}))) + {Var}_{t} ({\overset{\underset{̲}{}}{V}}_{t + ε} (π^{*}) + Δ_{ε} f_{t} + Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π} - π^{*})) \\ = E_{t} ({Var}_{t + ε} ({\overset{\underset{̲}{}}{V}}_{T} (π^{*}))) + {Var}_{t} (Δ_{ε} f_{t} + Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π})), \end{matrix}

one can see that

\begin{matrix} J (t, {\overset{\underset{̲}{}}{V}}_{t} (π^{*}); π^{\hat{π}, ε}) = E_{t} ({\overset{\underset{̲}{}}{V}}_{T} (π^{\hat{π}, ε})) - λ {Var}_{t} ({\overset{\underset{̲}{}}{V}}_{T} (π^{\hat{π}, ε})) \\ = E_{t} ({\overset{\underset{̲}{}}{V}}_{T} (π^{*})) + E_{t} (Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π} - π^{*})) - λ E_{t} ({Var}_{t + ε} ({\overset{\underset{̲}{}}{V}}_{T} (π^{*}))) \\ - λ {Var}_{t} (Δ_{ε} f_{t} + Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π})) \\ = E_{t} (J (t + ε, {\overset{\underset{̲}{}}{V}}_{t + ε} (π^{*}); π^{*})) + E_{t} (Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π} - π^{*})) - λ {Var}_{t} (Δ_{ε} f_{t} + Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π})) . \end{matrix}

Furthermore, it follows from the proof of Lemma 1 and (6) that

\begin{matrix} Δ_{ε} f_{t} + Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π}) = \int_{t}^{t + ε} ({\hat{π}}_{u} - π_{u}^{*}) M (X_{u}, y_{u}) d u \\ + \int_{t}^{t + ε} \{{\hat{π}}_{u} + f_{x} (u, X_{u}, y_{u})\} η d W_{3} (u) + \int_{t}^{t + ε} f_{y} (u, X_{u}, y_{u}) β (y_{u}) d W_{2} (u), \end{matrix}

thus

\begin{matrix} & lim_{ε \to 0^{+}} \frac{1}{ε} (J (t, {\overset{\underset{̲}{}}{V}}_{t} (π^{*}); π^{\hat{π}, ε}) - J (t, {\overset{\underset{̲}{}}{V}}_{t} (π^{*}); π^{*})) \\ = & lim_{ε \to 0^{+}} \frac{1}{ε} [E_{t} (Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π} - π^{*})) - λ {Var}_{t} (Δ_{ε} f_{t} + Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (\hat{π})) + λ {Var}_{t} (Δ_{ε} f_{t} + Δ_{ε} {\overset{\underset{̲}{}}{V}}_{t} (π^{*}))] \\ = & ({\hat{π}}_{t} - π_{t}^{*}) M (X_{t}, y_{t}) - λ {({\hat{π}}_{t} + f_{x} (t, X_{t}, y_{t}))}^{2} η^{2} \\ & - 2 λ ({\hat{π}}_{t} + f_{x} (t, X_{t}, y_{t})) f_{y} (t, X_{t}, y_{t}) β (y_{t}) η ρ_{23} - λ f_{y} {(t, X_{t}, y_{t})}^{2} β {(y_{t})}^{2} \\ & + λ {(π_{t}^{*} + f_{x} (t, X_{t}, y_{t}))}^{2} η^{2} \\ & + 2 λ (π_{t}^{*} + f_{x} (t, X_{t}, y_{t})) f_{y} (t, X_{t}, y_{t}) β (y_{t}) η ρ_{23} + λ f_{y} {(t, X_{t}, y_{t})}^{2} β {(y_{t})}^{2} \\ = & - λ η^{2} {({\hat{π}}_{t} - π_{t}^{*})}^{2} \leq 0, \end{matrix}

which completes the proof. □
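The last equality in the proof is a purely algebraic identity: once π* is substituted from (9), the first-order gain and the change in instantaneous variance collapse to −λη²(π̂ − π*)². The following sketch checks this numerically at randomly drawn points. It is an illustrative verification only, with hypothetical function names; the scalars `m`, `f_x`, `f_y` and `beta` stand for M(X_t, y_t), f_x, f_y and β(y_t) evaluated at a fixed state.

```python
import random


def perturbation_limit(pi_hat, pi_star, m, f_x, f_y, beta, eta, rho23, lam):
    """Limit of (J(perturbed) - J(pi*)) / eps, as written in the proof."""
    def variance_rate(pi):
        # Instantaneous variance rate of d f + d V(pi).
        return ((pi + f_x) ** 2 * eta ** 2
                + 2.0 * (pi + f_x) * f_y * beta * eta * rho23
                + f_y ** 2 * beta ** 2)
    gain = (pi_hat - pi_star) * m
    return gain - lam * (variance_rate(pi_hat) - variance_rate(pi_star))


def max_identity_gap(n_trials=1000, seed=0):
    """Largest gap between the limit and -lam * eta^2 * (pi_hat - pi*)^2."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(n_trials):
        m, f_x, f_y, pi_hat = (rng.uniform(-2.0, 2.0) for _ in range(4))
        beta, eta, lam = (rng.uniform(0.5, 2.0) for _ in range(3))
        rho23 = rng.uniform(-1.0, 1.0)
        # Candidate equilibrium strategy from formula (9).
        pi_star = m / (2.0 * lam * eta ** 2) - f_x - rho23 * beta / eta * f_y
        lhs = perturbation_limit(pi_hat, pi_star, m, f_x, f_y,
                                 beta, eta, rho23, lam)
        rhs = -lam * eta ** 2 * (pi_hat - pi_star) ** 2
        worst = max(worst, abs(lhs - rhs))
    return worst
```

The returned gap should be at the level of floating-point rounding, which is consistent with the identity holding exactly.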

Remark 2.

From the proof of Theorem 1, we can see that

π_{t}^{*}

is the solution of the following HJB equation:

0 = sup_{π_{t}} \{E_{t} (d J (t, X_{t}, y_{t})) - λ {Var}_{t} (d f (t, X_{t}, y_{t}) + d {\overset{\underset{̲}{}}{V}}_{t} (π_{t}))\} .

Remark 3.

It is unlikely that a closed-form solution of the PDE (10) can be achieved without specifying

α, β, and γ

. Therefore, we will consider the Scott model, one of the most widely known mean-reverting stochastic volatility models, and will discuss the approximate solution.

3. Closed-Form Approximation under the Scott Model

In order to capture the fast mean-reverting characteristics of volatility (see Fouque [19,20] for empirical studies), we introduce the Scott model, initially proposed by Scott (1987) [18], which is a well-known mean-reverting volatility model. Under the Scott model, the underlying security price is modeled by:

\begin{matrix} d Q_{t} = & ξ Q_{t} d t + e^{y_{t}} Q_{t} d W_{1} (t), \\ d y_{t} = & a (b - y_{t}) d t + σ d W_{2} (t), \end{matrix}

where

a > 0

and

σ > 0

. In the notation of (1), this corresponds to

α (y) = a (b - y)

,

β (y) = σ

, and

γ (y) = e^{y}

, so the log-volatility y_t follows an OU process. We assume

a ≫ 1

to ensure the fast mean-reverting property and set

ρ_{23} = 0

for convenience. According to Theorem 1, the corresponding PDE for the optimal strategy is given by:

\begin{matrix} 0 = & \frac{1}{2 λ η^{2}} M^{2} (x, y) + f_{t} (t, x, y) - f_{x} (t, x, y) (\frac{1}{2} η^{2} + ρ_{13} η e^{y}) \\ + a (b - y) f_{y} (t, x, y) + \frac{1}{2} η^{2} f_{x x} (t, x, y) + \frac{1}{2} σ^{2} f_{y y} (t, x, y), \end{matrix}

(12)

where

(x, y, t) \in R \times R \times (0, T]

with the terminal condition

f (T, \cdot, \cdot) = 0

. It is difficult to solve the PDE (12) explicitly and to obtain a closed-form optimal strategy. Therefore, in this section, we will use the asymptotic analysis technique to find a closed-form approximation of the solution of the PDE (12).

Let

g (t, x, y) : = f_{x} (t, x, y)

; one can see from (12) that

g (t, x, y)

satisfies the following PDE:

\begin{matrix} 0 = & g_{t} (t, x, y) + \frac{1}{2} η^{2} g_{x x} (t, x, y) + \frac{1}{2} σ^{2} g_{y y} (t, x, y) \\ - g_{x} (t, x, y) (\frac{1}{2} η^{2} + ρ_{13} η e^{y}) + a (b - y) g_{y} (t, x, y) - \frac{κ}{λ η^{2}} M (x, y) . \end{matrix}

(13)

Let

ϵ = a^{- 1}

and

ν^{2} = \frac{ϵ}{2} σ^{2}

. Since

a ≫ 1

, it is reasonable to assume that

0 < ν < 1

. We introduce the following operator

L^{ϵ} = \frac{1}{ϵ} L_{0} + L_{1},

where

\begin{matrix} L_{0} = & ν^{2} \frac{\partial^{2}}{\partial y^{2}} + (b - y) \frac{\partial}{\partial y}, \\ L_{1} = & \frac{\partial}{\partial t} + \frac{1}{2} η^{2} \frac{\partial^{2}}{\partial x^{2}} - (\frac{1}{2} η^{2} + ρ_{13} η e^{y}) \frac{\partial}{\partial x} . \end{matrix}

Then the PDE (13) can be written in the following form

L^{ϵ} (g) = \frac{κ}{λ η^{2}} M (x, y), g (T, \cdot, \cdot) = 0 .

(14)

To provide an approximation for

g (t, x, y)

, we need to introduce

ϕ (y) : = \frac{1}{\sqrt{2 π} ν} e^{- \frac{{(y - b)}^{2}}{2 ν^{2}}}

and define the functional

Ψ (\cdot)

as

Ψ (h) : = \int_{- \infty}^{+ \infty} ϕ (y) h (y) d y

for all h such that

Ψ (| h |) < + \infty

. The following lemma can be found in Fouque [19].

Lemma 2.

The following equation

L_{0} u (t, x, y) = h (t, x, y)

only has a solution if

h (t, x, y)

satisfies

Ψ_{t, x} (h) : = \int_{- \infty}^{+ \infty} ϕ (y) h (t, x, y) d y = 0,

for each

t \in [0, T]

,

x \in (- \infty, + \infty)

.
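Since ϕ is the density of a normal distribution with mean b and variance ν², the functional Ψ is simply an expectation under that distribution. The sketch below approximates Ψ(h) with a composite Simpson rule and can be compared with the standard lognormal mean Ψ(e^y) = exp(b + ν²/2). The function names, the truncation width and the number of intervals are illustrative assumptions, not part of the paper.

```python
import math


def stationary_density(y, b, nu):
    """Density phi(y) of the normal distribution N(b, nu^2)."""
    z = (y - b) / nu
    return math.exp(-0.5 * z * z) / (math.sqrt(2.0 * math.pi) * nu)


def psi(h, b, nu, n_intervals=2000, width=10.0):
    """Approximate Psi(h), the integral of phi(y) * h(y) over the real line.

    Composite Simpson rule on [b - width * nu, b + width * nu]; the
    truncation is an assumption that is harmless for integrands growing
    no faster than exp(|y|). n_intervals must be even.
    """
    if n_intervals % 2:
        raise ValueError("n_intervals must be even")
    lower = b - width * nu
    step = 2.0 * width * nu / n_intervals
    total = 0.0
    for i in range(n_intervals + 1):
        y = lower + i * step
        if i in (0, n_intervals):
            weight = 1.0
        else:
            weight = 4.0 if i % 2 else 2.0
        total += weight * stationary_density(y, b, nu) * h(y)
    return total * step / 3.0
```

A centred integrand such as h(y) = e^y − exp(b + ν²/2) then gives Ψ(h) ≈ 0, which is the solvability condition required in Lemma 2.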

3.1. An Approximation for $g (t, x, y)$

Assume that g is a solution of (14). We can construct an asymptotic expansion with respect to

\sqrt{ϵ}

for g as follows:

g (t, x, y) = g_{0} (t, x, y) + \sqrt{ϵ} g_{1} (t, x, y) + ϵ g_{2} (t, x, y) + \dots .

Substituting it into (14), we obtain

\begin{matrix} \frac{κ}{λ η^{2}} M (x, y) = L^{ϵ} g (t, x, y) \\ = & \frac{1}{ϵ} L_{0} g_{0} (t, x, y) + \frac{1}{\sqrt{ϵ}} L_{0} g_{1} (t, x, y) + (L_{0} g_{2} (t, x, y) + L_{1} g_{0} (t, x, y)) \\ + \sqrt{ϵ} (L_{0} g_{3} (t, x, y) + L_{1} g_{1} (t, x, y)) + \dots . \end{matrix}

(15)

For simplicity, we assume that

g_{0} (t, x, y) \equiv g_{0} (t, x)

,

g_{1} (t, x, y) \equiv g_{1} (t, x)

. Since the operator

L_{0}

only involves partial derivatives with respect to y, one can see that

\begin{matrix} L_{0} g_{0} (t, x, y) & = 0, \\ L_{0} g_{1} (t, x, y) & = 0 . \end{matrix}

Then, Equation (15) can be simplified into the following form

\begin{matrix} \frac{κ}{λ η^{2}} M (x, y) = & (L_{0} g_{2} (t, x, y) + L_{1} g_{0} (t, x)) \\ + \sqrt{ϵ} (L_{0} g_{3} (t, x, y) + L_{1} g_{1} (t, x)) + \dots . \end{matrix}

(16)

It is natural to consider the following equations

\begin{matrix} L_{0} g_{2} (t, x, y) & = - L_{1} g_{0} (t, x) + \frac{κ}{λ η^{2}} M (x, y), \end{matrix}

(17)

\begin{matrix} L_{0} g_{3} (t, x, y) & = - L_{1} g_{1} (t, x) . \end{matrix}

(18)

To make sure the Equation (17) has a solution, it follows from Lemma 2 that for each

(t, x)

, the right part of (17) should satisfy the following equation

Ψ_{t, x} (L_{1} g_{0} (t, x) - \frac{κ}{λ η^{2}} M (x, y)) = 0 .

(19)

Since

L_{1} g_{0} (t, x) = \frac{\partial}{\partial t} g_{0} (t, x) + \frac{1}{2} η^{2} \frac{\partial^{2}}{\partial x^{2}} g_{0} (t, x) - (\frac{1}{2} η^{2} + ρ_{13} η e^{y}) \frac{\partial}{\partial x} g_{0} (t, x)

, if we let

c : = \frac{1}{2} η^{2} + η ρ_{13} Ψ (e^{y})

, one can see that

Ψ_{t, x} (L_{1} g_{0} (t, x)) = \frac{\partial}{\partial t} g_{0} (t, x) + \frac{1}{2} η^{2} \frac{\partial^{2}}{\partial x^{2}} g_{0} (t, x) - c \frac{\partial}{\partial x} g_{0} (t, x) .

Furthermore, from the definition of

M (x, y)

, one can see that

\begin{matrix} Ψ_{t, x} (M (x, y)) = & Ψ_{t, x} (κ (θ - x) + \frac{1}{2} η^{2} + ρ_{13} η e^{y}) \\ = & κ (θ - x) + c, \end{matrix}

which is a linear function of x. Setting the terminal value of

g_{0} (t, x)

as

g_{0} (T, \cdot) = 0

, the solution can be directly given by

g_{0} (t, x) = \frac{κ^{2}}{λ η^{2}} (t - T) (θ - x) - \frac{c κ^{2}}{2 λ η^{2}} {(t - T)}^{2} + \frac{c κ}{λ η^{2}} (t - T) .

(20)
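Formula (20) can be checked directly against the averaged equation it is meant to solve, namely ∂g_0/∂t + ½η² ∂²g_0/∂x² − c ∂g_0/∂x = (κ/(λη²)) (κ(θ − x) + c) with g_0(T, ·) = 0. The sketch below does this with central finite differences; the function names and the step size `h` are illustrative choices, and the check is a sanity test rather than part of the authors' argument.

```python
def g0(t, x, T, kappa, theta, eta, lam, c):
    """Leading-order term g_0(t, x) from formula (20)."""
    s = t - T
    k = kappa / (lam * eta ** 2)
    return (k * kappa * s * (theta - x)
            - 0.5 * c * k * kappa * s ** 2
            + c * k * s)


def averaged_pde_residual(t, x, T, kappa, theta, eta, lam, c, h=1e-3):
    """Residual of the averaged equation for g_0 at the point (t, x).

    Derivatives are approximated by central differences, which are exact
    up to rounding here because g_0 is quadratic in t and linear in x.
    """
    def g(tt, xx):
        return g0(tt, xx, T, kappa, theta, eta, lam, c)

    g_t = (g(t + h, x) - g(t - h, x)) / (2.0 * h)
    g_x = (g(t, x + h) - g(t, x - h)) / (2.0 * h)
    g_xx = (g(t, x + h) - 2.0 * g(t, x) + g(t, x - h)) / h ** 2
    source = kappa / (lam * eta ** 2) * (kappa * (theta - x) + c)
    return g_t + 0.5 * eta ** 2 * g_xx - c * g_x - source
```

A residual close to zero at arbitrary interior points, together with g_0(T, x) = 0, indicates that (20) satisfies the averaged problem.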

Similarly, to ensure that Equation (18) has a solution, we need

Ψ_{t, x} (L_{1} g_{1} (t, x)) = \frac{\partial}{\partial t} g_{1} (t, x) + \frac{1}{2} η^{2} \frac{\partial^{2}}{\partial x^{2}} g_{1} (t, x) - c \frac{\partial}{\partial x} g_{1} (t, x) = 0 .

With the terminal condition

g_{1} (T, \cdot) = 0

, one can see that

g_{1} (t, x) = 0

is a solution. Therefore, a natural approximation for

g (t, x, y)

is given by

\begin{matrix} g^{*} (t, x, y) = & g_{0} (t, x) + \sqrt{ϵ} g_{1} (t, x) . \end{matrix}

Thus, we have the following theorem:

Theorem 2

(Main result II). Denote

c : = \frac{1}{2} η^{2} + ρ_{13} η Ψ (e^{y})

. Let

g^{*} (t, x, y) = \frac{κ^{2}}{λ η^{2}} (t - T) (θ - x) - \frac{c κ^{2}}{2 λ η^{2}} {(t - T)}^{2} + \frac{c κ}{λ η^{2}} (t - T) .

(21)

Then, there exists a constant C that is independent of ϵ such that

|g (t, x, y) - g^{*} (t, x, y)| \leq \sqrt{ϵ} C (1 + e^{| y |}) .

(22)

An approximate optimal strategy is given by

\begin{matrix} {\hat{π}}_{t}^{*} = & \frac{1}{2 λ η^{2}} κ (θ - X_{t}) + \frac{1}{4 λ} + \frac{1}{2 λ η} ρ_{13} e^{y_{t}} - g^{*} (t, X_{t}, y_{t}) . \end{matrix}
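The approximate strategy of Theorem 2 only requires elementary arithmetic at the current market state. The following sketch evaluates it, under the assumptions that ν² = σ²/(2a) as defined above and that Ψ(e^y) = exp(b + ν²/2), the mean of a lognormal variable. The function name and argument order are hypothetical.

```python
import math


def approx_strategy(t, x, y, T, kappa, theta, eta, lam, rho13, a, b, sigma):
    """Approximate equilibrium strategy of Theorem 2 at the state (t, x, y).

    x is the current log-price spread and y the current log-volatility.
    """
    nu_sq = sigma ** 2 / (2.0 * a)               # nu^2 = eps * sigma^2 / 2
    psi_exp = math.exp(b + 0.5 * nu_sq)          # Psi(e^y) under N(b, nu^2)
    c = 0.5 * eta ** 2 + rho13 * eta * psi_exp
    s = t - T
    k = kappa / (lam * eta ** 2)
    # g*(t, x, y) from formula (21); it does not depend on y.
    g_star = (k * kappa * s * (theta - x)
              - 0.5 * c * k * kappa * s ** 2
              + c * k * s)
    # M(x, y) / (2 * lam * eta^2), written out term by term.
    myopic = (kappa * (theta - x) / (2.0 * lam * eta ** 2)
              + 1.0 / (4.0 * lam)
              + rho13 * math.exp(y) / (2.0 * lam * eta))
    return myopic - g_star
```

At t = T the correction g* vanishes and the strategy reduces to M(x, y)/(2λη²); for t < T the correction depends on the remaining horizon T − t.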

Remark 4.

Compared with the traditional finite difference method (FDM), the asymptotic analysis technique yields an explicit solution with a significant advantage in computational complexity. In practice, at each decision moment, we only need to calculate the optimal strategy at one specific point

(x, y, t)

, which describes the market state at that time, rather than all values on a series of grid points. In addition, since our problem is defined on an unbounded domain, applying the FDM requires truncating the solution domain to a bounded region and adding artificial boundary conditions. This introduces additional boundary errors and increases the complexity of the theoretical analysis and numerical calculations.

Remark 5.

From the error estimation given in Theorem 2, we can see that the accuracy of our closed-form approximation of the optimal strategy is mainly controlled by volatility mean-reversion speed parameter a. Faster mean-reversion speed implies a more accurate strategy.

3.2. Some Auxiliary Approximations

We establish the following lemmas to help prove Theorem 2.

Lemma 3.

(i)

If

y < ν^{2} + b

, then

\begin{matrix} \frac{1}{ϕ (y)} \int_{- \infty}^{y} ϕ (z) d z & \leq \sqrt{2 π} ν e^{\frac{1}{2}}, \end{matrix}

(23)

\begin{matrix} \frac{1}{ϕ (y)} \int_{- \infty}^{y} e^{z} ϕ (z) d z & \leq \sqrt{2 π} ν e^{y} . \end{matrix}

(24)

(ii)

If

y \geq ν^{2} + b

, then

\begin{matrix} \frac{1}{ϕ (y)} \int_{y}^{+ \infty} e^{z} ϕ (z) d z \leq & \sqrt{2 π} ν e^{y} . \end{matrix}

(25)

Lemma 4.

Given

h (y)

that satisfies

Ψ (h) = 0

and

| h (y) | \leq C_{1} (1 + e^{y})

, for some positive constant

C_{1}

, let

X (y)

satisfy the following equation

L_{0} X (y) = h (y) .

Then,

| X (y) | < C_{2} (1 + e^{| y |}) ϵ^{- \frac{1}{2}}

for some positive constant

C_{2}

independent of ϵ.

This lemma provides an estimation for the linear operator

L_{0}

. If we choose

C_{1} = 0

in Lemma 4, then by using (A1), we obtain the following corollary:

Corollary 1.

Let

X (y)

be the solution of the equation

L_{0} X (y) = 0

; then, there exists a positive constant

C_{2}

independent of ϵ such that

|X (y)| \leq C_{2} .

Lemma 5.

Let

{y_{t}; t \in [0, T]}

be the OU process in the Scott model; then, for all

τ \geq 0

, there exists a positive constant

\hat{C}

independent of ϵ such that

E [e^{| y_{t + τ} |} | y_{t} = y] \leq \hat{C} e^{| y |} .

(26)

The proofs of Lemmas 3–5 are given in Appendix A, Appendix B and Appendix C, respectively.

3.3. The Proof of Theorem 2

We only need to establish the error estimate (22). Define the residual as

R_{ϵ} (t, x, y) : = g_{0} (t, x) + \sqrt{ϵ} g_{1} (t, x) + ϵ g_{2} (t, x, y) + ϵ \sqrt{ϵ} g_{3} (t, x, y) - g (t, x, y) .

Recalling

L^{ϵ} g (t, x, y) = \frac{κ}{λ η^{2}} M (x, y)

and

\begin{matrix} L_{0} g_{0} (t, x) = & 0, \\ L_{0} g_{1} (t, x) = & 0, \\ L_{0} g_{2} (t, x, y) = & - L_{1} g_{0} (t, x) + \frac{κ}{λ η^{2}} M (x, y), \\ L_{0} g_{3} (t, x, y) = & - L_{1} g_{1} (t, x), \end{matrix}

(27)

we have

\begin{matrix} L^{ϵ} R_{ϵ} (t, x, y) = & \frac{1}{ϵ} L_{0} g_{0} (t, x) + \frac{1}{\sqrt{ϵ}} L_{0} g_{1} (t, x) + \{L_{0} g_{2} (t, x, y) + L_{1} g_{0} (t, x)\} \\ + \sqrt{ϵ} \{L_{0} g_{3} (t, x, y) + L_{1} g_{1} (t, x)\} \\ + ϵ L_{1} g_{2} (t, x, y) + ϵ \sqrt{ϵ} L_{1} g_{3} (t, x, y) - L^{ϵ} g (t, x, y) \\ = & ϵ (L_{1} g_{2} (t, x, y) + \sqrt{ϵ} L_{1} g_{3} (t, x, y)) . \end{matrix}

Define

\begin{matrix} G_{ϵ} (t, x, y) = & L_{1} g_{2} (t, x, y) + \sqrt{ϵ} L_{1} g_{3} (t, x, y), \\ H_{ϵ} (x, y) = & g_{2} (T, x, y) + \sqrt{ϵ} g_{3} (T, x, y) . \end{matrix}

One can see that

R_{ϵ} (t, x, y)

solves the following equation:

\{\begin{matrix} L^{ϵ} R_{ϵ} (t, x, y) & = ϵ G_{ϵ} (t, x, y), \\ R_{ϵ} (T, \cdot, \cdot) & = ϵ H_{ϵ} (\cdot, \cdot) . \end{matrix}

Applying the Feynman–Kac formula,

R_{ϵ}

admits the following probabilistic representation:

R_{ϵ} (t, x, y) = ϵ E_{t} [H_{ϵ} ({\hat{X}}_{T}, y_{T}) + \int_{t}^{T} G_{ϵ} (s, {\hat{X}}_{s}, y_{s}) d s | {\hat{X}}_{t} = x, y_{t} = y],

(28)

where

{\hat{X}}_{t}

is driven by

d {\hat{X}}_{t} = - (\frac{1}{2} η^{2} + ρ_{13} η e^{y_{t}}) d t + η d W_{3} (t) .

From (28), one can see that the bound on

R_{ϵ}

is controlled by both

G_{ϵ}

and

H_{ϵ}

. In what follows, we provide estimates for these two parts.

Let

ψ (y)

solve

L_{0} ψ (y) = e^{y} - Ψ (e^{y}) .

According to Lemma 4,

|ψ (y)| \leq C_{2} (e^{| y |} + 1) ϵ^{- \frac{1}{2}}

(29)

holds for some positive constant

C_{2}

. We choose

\begin{matrix} g_{2} (t, x, y) : = & \frac{κ ρ_{13}}{λ η} [κ (T - t) + 1] ψ (y), \end{matrix}

(30)

\begin{matrix} g_{3} (t, x, y) : = & 0 . \end{matrix}

(31)

One can see from (19) that

\begin{matrix} L_{0} g_{2} (t, x, y) & = \frac{κ ρ_{13}}{λ η} [κ (T - t) + 1] (e^{y} - Ψ (e^{y})) \\ & = - \{L_{1} g_{0} (t, x) - Ψ_{t, x} (L_{1} g_{0} (t, x))\} + \frac{κ}{λ η^{2}} \{M (x, y) - Ψ_{t, x} (M (x, y))\} \\ & = - L_{1} g_{0} (t, x) + \frac{κ}{λ η^{2}} M (x, y), \\ L_{0} g_{3} (t, x, y) & = 0 . \end{matrix}

Thus,

g_{0} (t, x)

,

g_{1} (t, x)

,

g_{2} (t, x, y)

, and

g_{3} (t, x, y)

satisfy Equation (27). Furthermore, one can easily see that there exists a constant

C^{*}

such that

\begin{matrix} | g_{2} (t, x, y) | & \leq C^{*} (e^{| y |} + 1) ϵ^{- \frac{1}{2}}, \\ |G_{ϵ} (t, x, y)| & = |L_{1} g_{2} (t, x, y) + \sqrt{ϵ} L_{1} g_{3} (t, x, y)| \\ \leq C^{*} (e^{| y |} + 1) ϵ^{- \frac{1}{2}}, \\ |H_{ϵ} (x, y)| & \leq C^{*} (e^{| y |} + 1) ϵ^{- \frac{1}{2}} . \end{matrix}

Using Lemma 5, one can see that

\begin{matrix} |R_{ϵ} (t, x, y)| = & ϵ |E [H_{ϵ} ({\hat{X}}_{T}, y_{T}) + \int_{t}^{T} G_{ϵ} (s, {\hat{X}}_{s}, y_{s}) d s | {\hat{X}}_{t} = x, y_{t} = y]| \\ \leq & C^{*} ϵ^{\frac{1}{2}} E [e^{| y_{T} |} + 1 + \int_{t}^{T} (e^{| y_{s} |} + 1) d s | y_{t} = y] \\ \leq & C^{*} ϵ^{\frac{1}{2}} (E_{t} [e^{| y_{t + (T - t)} |}] + T + 1) + C^{*} ϵ^{\frac{1}{2}} \int_{0}^{T - t} E_{t} [e^{| y_{t + τ} |}] d τ \\ \leq & C^{*} ϵ^{\frac{1}{2}} (\hat{C} e^{| y |} + T + 1) + C^{*} ϵ^{\frac{1}{2}} T \hat{C} e^{| y |} \\ \leq & [C^{*} (\hat{C} + 1) (T + 1)] (e^{| y |} + 1) ϵ^{\frac{1}{2}} . \end{matrix}

Thus, since g_{3} \equiv 0 by (31), we obtain

\begin{matrix} |g_{0} (t, x) - g (t, x, y)| = & |R_{ϵ} (t, x, y) - ϵ g_{2} (t, x, y) - ϵ \sqrt{ϵ} g_{3} (t, x, y)| \\ \leq & |R_{ϵ} (t, x, y)| + ϵ |g_{2} (t, x, y)| + ϵ \sqrt{ϵ} |g_{3} (t, x, y)| \\ \leq & [C^{*} (\hat{C} + 1) (T + 1) + C^{*}] (e^{| y |} + 1) ϵ^{\frac{1}{2}} . \end{matrix}

Denote

C = C^{*} [(\hat{C} + 1) (T + 1) + 1],

then (22) follows. The approximate optimal strategy is given by directly applying Theorem 1.

4. Empirical Experiments

In this section, we compare the performance of our strategy (the approximate optimal strategy under the Scott model) with that of the strategy proposed by Zhu et al. [12] (the optimal strategy under the constant volatility model) in both real and simulated scenarios. Using the standard cointegration testing method mentioned by Chambers [21], we select three stock pairs (listed in Table 1) traded on the Chinese security markets SSE and SZSE to illustrate our results. To estimate the Scott model, we combine maximum likelihood estimation (MLE) with the extended Kalman filter to produce an online updated estimate (see Wang et al. [22] and Simon [23] for details); see also Aihara [24] for an alternative robust filtering estimation. Then, we empirically validate the strategies given in Section 3 based on real market data from the SSE and SZSE.

The sample period is from 1 June 2019 to 1 December 2022. The pairs span different industries, but the two stocks in each pair are in the same industry and are highly correlated in terms of both fundamentals and price series. The data were obtained from TDX software, and we used only the daily closing prices. Specifically, we used the forward-adjusted prices to avoid the dividend effect. Figure 1 presents the forward-adjusted stock prices and the dynamics of the corresponding price spread for the first pair as an example.

Starting from 1 December 2021, we performed out-of-sample testing for all cases using the moving-window method. Parameters were updated every day using data from the last 375 trading days (one and a half years). We chose

λ = 0.5

and equally allocated the initial endowment of 100 units among the selected pairs. The paths of the wealth processes obtained from our strategy under the Scott model and from the strategy proposed by Zhu et al. are shown in Figure 2. Table 2 presents some commonly used statistics of strategy performance.
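The moving-window scheme described above can be sketched as an index generator. This is only an illustration: the function name `moving_windows` and its arguments are hypothetical, and the estimation step itself (MLE combined with the extended Kalman filter) is not reproduced here.

```python
from typing import Iterator, Tuple


def moving_windows(n_obs: int, window: int, first_test: int) -> Iterator[Tuple[range, int]]:
    """Yield (training index range, test index) pairs for daily re-estimation.

    For each test day t, the parameters would be re-estimated on the
    `window` observations immediately preceding t (hypothetical helper).
    """
    if first_test < window:
        raise ValueError("not enough history before the first test day")
    for t in range(first_test, n_obs):
        yield range(t - window, t), t


# Toy usage: 380 daily observations, a 375-day window, testing from day 375 on.
windows = list(moving_windows(n_obs=380, window=375, first_test=375))
```

Each training range ends the day before its test day, so no test-day information enters the estimate used on that day.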

Figure 2 and Table 2 indicate that, in the out-of-sample testing, our strategy under the Scott model outperforms the strategy proposed by Zhu et al. under the constant volatility model with respect to important indicators such as the Sharpe ratio, the profit-loss ratio and the win rate.
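For readers who want to compute indicators of this kind from a wealth path, a minimal sketch follows. The exact definitions behind Table 2 are not stated in the text, so the conventions below are assumptions: win rate as the share of days with a positive return, profit-loss ratio as average gain over average loss, and an annualized Sharpe ratio with 250 trading days per year. The function name and the toy wealth path are hypothetical.

```python
import math
from statistics import mean, stdev


def performance_stats(wealth, periods_per_year=250, rf_per_period=0.0):
    """Summary statistics of a wealth path sampled once per trading day."""
    returns = [w1 / w0 - 1.0 for w0, w1 in zip(wealth, wealth[1:])]
    gains = [r for r in returns if r > 0]
    losses = [-r for r in returns if r < 0]

    # The running peak gives the drawdown at each date; keep the worst one.
    peak, max_drawdown = wealth[0], 0.0
    for w in wealth:
        peak = max(peak, w)
        max_drawdown = min(max_drawdown, w / peak - 1.0)

    excess = [r - rf_per_period for r in returns]
    return {
        "win_rate": len(gains) / len(returns),
        "profit_loss_ratio": mean(gains) / mean(losses) if gains and losses else float("nan"),
        "average_profit": mean(returns),
        "max_drawdown": max_drawdown,
        "sharpe": mean(excess) / stdev(excess) * math.sqrt(periods_per_year),
    }


# Toy wealth path (made-up numbers, not data from the paper).
stats = performance_stats([100.0, 102.0, 101.0, 103.0, 102.5, 104.0])
```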

To further compare the value of the mean-variance objective function J for these two strategies, we implemented simulations with respect to the parameters estimated using real market data for the stock pairs listed in Table 1. Parameters for the simulation are given in Table 3. As for the strategy proposed by Zhu et al., we used

e^{| b |}

as the constant volatility, which is the long-term average volatility in the Scott model.

We chose

T = 1.5, r = 0.03, d t = 1 / 250, X_{0} = 0, V_{0} = 100

, and simulated each pair 1000 times with risk aversion

λ

ranging from

0.5

to

1.5

. The statistics of the discounted terminal wealth for each pair are shown in Table 4, Table 5 and Table 6. J is the value of the objective function defined in (7).

Table 4, Table 5 and Table 6 indicate the effectiveness of both strategies by comparing the average terminal wealth with the initial asset

V_{0} = 100

. Both strategies yield a discounted final wealth greater than

V_{0}

in every case, which means that the profit always exceeds the risk-free return. Furthermore, comparing the statistics of the two strategies in each case, we found that the standard deviation of the terminal wealth of our strategy is always smaller than that of the strategy proposed by Zhu et al. Although the mean is lower, the J value of our strategy is always greater than that of the strategy proposed by Zhu et al. This suggests that the approximate optimal strategy under the Scott model outperforms the optimal strategy under the constant volatility model by producing more stable profits. It is noteworthy that

λ

plays a critical role in controlling the uncertainty of the outcome of both strategies. The mean and the standard deviation of the terminal wealth decrease as

λ

increases. Intuitively, a larger

λ

indicates more risk aversion, which leads to a smaller allocation to the risky assets and thus lower uncertainty.
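The trade-off between mean and standard deviation can be made concrete by recomputing J from the reported statistics. The objective is defined in (7), outside this section, so the form used below, mean minus λ times variance, is an assumption; it does reproduce the J column of Table 4 to within rounding. The function name `mv_objective` is hypothetical, and the rows are copied from Table 4.

```python
def mv_objective(mean_wealth: float, sd_wealth: float, lam: float) -> float:
    """Assumed mean-variance objective: E[V] - lam * Var[V]."""
    return mean_wealth - lam * sd_wealth ** 2


# (lambda, mean, S.D., reported J) for the strategy of Zhu et al., Table 4.
zhu_rows = [
    (0.25, 767.267, 43.255, 299.519),
    (1.0, 270.276, 10.814, 153.339),
    (1.5, 215.055, 7.209, 137.096),
]
# Same layout for our strategy, Table 4.
our_rows = [
    (0.25, 548.729, 27.084, 365.340),
    (1.0, 215.641, 6.771, 169.794),
    (1.5, 178.631, 4.514, 148.067),
]

# Largest gap between the recomputed and the reported J over all rows.
max_gap = max(
    abs(mv_objective(m, sd, lam) - j_reported)
    for lam, m, sd, j_reported in zhu_rows + our_rows
)
```

Under this assumed form, the lower variance of our strategy more than offsets its lower mean in every listed row, which is why its J is larger.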

Both strategies are effective on simulated data and on real out-of-sample data. The comparison of the strategy proposed by Zhu et al. with our approximate strategy shows that the Scott model can better capture the mean-reverting characteristic of volatility, resulting in a more stable trading strategy.

5. Conclusions

In this paper, we provide a semi-closed-form optimal strategy of the mean-variance problem for pairs trading by assuming that one of the security prices satisfies a general stochastic volatility model and that the corresponding price spread follows the Ornstein–Uhlenbeck process. Then, we provide a closed-form approximate formula for the optimal strategy in the Scott model using the asymptotic analysis technique. Because the approximate formula is in closed form, it is cheap to evaluate, and its accuracy is established by the error bound (22). We implemented our approximate optimal strategy on real historical data selected from Chinese security markets and compared it with the optimal strategy under the constant volatility model proposed by Zhu et al. [12]. The numerical results show that both strategies are effective and that the Scott model produces a more stable strategy by better capturing mean-reverting volatility.

Author Contributions

Conceptualization, Y.Z.; methodology, Y.Z. and D.X.; writing—original draft preparation, Y.Z.; writing—review and editing, Y.Z. and D.X.; visualization, Y.Z.; supervision, D.X. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under Grant No. 11671257.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Lemma 3

(i) Let

c_{0} (y) : = \frac{y - b}{\sqrt{2} ν} .

One can see that

c_{0} (y) \leq \frac{ν}{\sqrt{2}}

. Through direct computation, we have

\begin{matrix} \frac{1}{ϕ (y)} \int_{- \infty}^{y} ϕ (z) d z = & e^{\frac{{(y - b)}^{2}}{2 ν^{2}}} \int_{- \infty}^{y} e^{- \frac{{(z - b)}^{2}}{2 ν^{2}}} d z = \sqrt{2} ν e^{c_{0}^{2} (y)} \int_{- \infty}^{c_{0} (y)} e^{- x^{2}} d x . \end{matrix}

If

c_{0} (y) \leq 0

, then

\begin{matrix} \frac{1}{ϕ (y)} \int_{- \infty}^{y} ϕ (z) d z = & \sqrt{2} ν e^{c_{0}^{2} (y)} \int_{- \infty}^{0} e^{- {(u + c_{0} (y))}^{2}} d u \\ \leq & \sqrt{2} ν \int_{- \infty}^{0} e^{- u^{2}} d u = \frac{\sqrt{2 π}}{2} ν; \end{matrix}

if

c_{0} (y) > 0

, then

\begin{matrix} \frac{1}{ϕ (y)} \int_{- \infty}^{y} ϕ (z) d z \leq & \sqrt{2} ν e^{c_{0} {(y)}^{2}} \int_{- \infty}^{+ \infty} e^{- x^{2}} d x \leq \sqrt{2 π} ν e^{\frac{ν^{2}}{2}} . \end{matrix}

Recalling that

0 < ν < 1

, (23) follows. Furthermore, if we introduce

c_{1} (y) = \frac{y - b - ν^{2}}{\sqrt{2} ν},

then

\begin{matrix} \frac{1}{ϕ (y)} \int_{- \infty}^{y} e^{z} ϕ (z) d z = & e^{\frac{{(y - b)}^{2}}{2 ν^{2}}} \int_{- \infty}^{y} e^{z - \frac{{(z - b)}^{2}}{2 ν^{2}}} d z \\ = & e^{\frac{{(y - b)}^{2}}{2 ν^{2}} + b + \frac{1}{2} ν^{2}} \int_{- \infty}^{y} e^{- \frac{{(z - b - ν^{2})}^{2}}{2 ν^{2}}} d z \\ = & \sqrt{2} ν e^{\frac{{(y - b)}^{2}}{2 ν^{2}} + b + \frac{1}{2} ν^{2}} \int_{- \infty}^{c_{1} (y)} e^{- x^{2}} d x \\ = & \sqrt{2} ν e^{y + {c_{1} (y)}^{2}} \int_{0}^{+ \infty} e^{- {(x - c_{1} (y))}^{2}} d x \\ \leq & \sqrt{2} ν e^{y + {c_{1} (y)}^{2}} \int_{0}^{+ \infty} e^{- (x^{2} + {c_{1} (y)}^{2})} d x \\ \leq & \sqrt{2 π} ν e^{y}, \end{matrix}

and (24) follows.

(ii) If

y \geq b + ν^{2}

, then

c_{1} (y) \geq 0

, and one can see that

\begin{matrix} \frac{1}{ϕ (y)} \int_{y}^{+ \infty} e^{z} ϕ (z) d z = & \sqrt{2} ν e^{y + {c_{1} (y)}^{2}} \int_{0}^{+ \infty} e^{- {(x + c_{1} (y))}^{2}} d x \\ \leq & \sqrt{2} ν e^{y + {c_{1} (y)}^{2}} \int_{0}^{+ \infty} e^{- (x^{2} + {c_{1} (y)}^{2})} d x \\ \leq & \sqrt{2 π} ν e^{y}, \end{matrix}

and (25) follows.
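The bounds derived in this appendix can be checked numerically, since each ratio reduces to a complementary error function. The sketch below assumes, as the computation above implies, that ϕ(z) is proportional to exp(−(z − b)²/(2ν²)); the values b = −0.7 and ν = 0.5 are illustrative choices, not parameters from the paper, and all function names are hypothetical.

```python
import math

SQRT2, SQRT_PI = math.sqrt(2.0), math.sqrt(math.pi)


def lower_ratio(y, b, nu, with_exp):
    """(1/phi(y)) * integral over (-inf, y] of phi(z), or of e^z * phi(z) if with_exp."""
    shift = nu ** 2 if with_exp else 0.0
    c = (y - b - shift) / (SQRT2 * nu)  # c_1(y) if with_exp, else c_0(y)
    growth = math.exp(y + c * c) if with_exp else math.exp(c * c)
    # integral of exp(-x^2) over (-inf, c] equals (sqrt(pi) / 2) * erfc(-c)
    return SQRT2 * nu * growth * 0.5 * SQRT_PI * math.erfc(-c)


def upper_ratio(y, b, nu):
    """(1/phi(y)) * integral over [y, +inf) of e^z * phi(z)."""
    c1 = (y - b - nu ** 2) / (SQRT2 * nu)
    return SQRT2 * nu * math.exp(y + c1 * c1) * 0.5 * SQRT_PI * math.erfc(c1)


def lemma3_bounds_hold(b, nu, span=3.0, n=300):
    """Check the three bounds on a grid on either side of y = b + nu^2."""
    pivot = b + nu ** 2
    cap = SQRT2 * SQRT_PI * nu  # sqrt(2 * pi) * nu
    for k in range(n + 1):
        y_low, y_high = pivot - span * k / n, pivot + span * k / n
        if lower_ratio(y_low, b, nu, False) > cap * math.exp(nu ** 2 / 2):
            return False
        if lower_ratio(y_low, b, nu, True) > cap * math.exp(y_low):
            return False
        if upper_ratio(y_high, b, nu) > cap * math.exp(y_high):
            return False
    return True


ok = lemma3_bounds_hold(b=-0.7, nu=0.5)
```

The grid is kept within three units of b + ν² so that the product of a large exponential and a tiny `erfc` value stays well inside floating-point range.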

Appendix B. Proof of Lemma 4

One can easily see that

L_{0} X (y) = \frac{ν^{2}}{ϕ (y)} \frac{\partial}{\partial y} (ϕ (y) \frac{\partial X}{\partial y}) = h (y),

thus

X_{y} (y) = \frac{1}{ν^{2} ϕ (y)} \int_{- \infty}^{y} h (z) ϕ (z) d z = - \frac{1}{ν^{2} ϕ (y)} \int_{y}^{+ \infty} h (z) ϕ (z) d z .

(A1)

(i) If

y \geq ν^{2} + b

, we can see from Lemma 3 (ii) that

\begin{matrix} |X_{y} (y)| = & \frac{1}{ν^{2} ϕ (y)} |\int_{y}^{+ \infty} h (z) ϕ (z) d z| \leq \frac{C_{1}}{ν^{2} ϕ (y)} \int_{y}^{+ \infty} (1 + e^{z}) ϕ (z) d z \\ \leq & \frac{C_{1}}{ν^{2} ϕ (y)} e^{- b - ν^{2}} \int_{y}^{+ \infty} (e^{z} + e^{z + b + ν^{2}}) ϕ (z) d z \\ \leq & \frac{C_{1} (1 + e^{- b})}{ν^{2} ϕ (y)} \int_{y}^{\infty} e^{z} ϕ (z) d z \\ \leq & \frac{C_{1}}{ν} \sqrt{2 π} (1 + e^{- b}) e^{y} . \end{matrix}

Since

X (y) = \int_{ν^{2} + b}^{y} X_{y} (z) d z + X (ν^{2} + b),

one can see that:

\begin{matrix} |X (y)| \leq & \int_{ν^{2} + b}^{y} |X_{y} (s)| d s + |X (ν^{2} + b)| \\ \leq & \frac{C_{1}}{ν} \sqrt{2 π} (1 + e^{- b}) \int_{ν^{2} + b}^{y} e^{z} d z + |X (ν^{2} + b)| \\ = & \frac{C_{1}}{ν} \sqrt{2 π} (1 + e^{- b}) (e^{y} - e^{ν^{2} + b}) + |X (ν^{2} + b)| \\ \leq & \sqrt{2} σ^{- 1} ϵ^{- \frac{1}{2}} [C_{1} \sqrt{2 π} (1 + e^{- b}) + | X (ν^{2} + b) |] (e^{y} + 1) \\ \leq & {\hat{C}}_{0} ϵ^{- \frac{1}{2}} (e^{| y |} + 1), \end{matrix}

(A2)

where

{\hat{C}}_{0} = σ^{- 1} [C_{1} \sqrt{2 π} (1 + e^{- b}) + | X (ν^{2} + b) |]

.

(ii) If

y < ν^{2} + b

, from Lemma 3 (i), we can see that

\begin{matrix} |X_{y} (y)| \leq & \frac{C_{1}}{ν^{2} ϕ (y)} \int_{- \infty}^{y} (1 + e^{z}) ϕ (z) d z \\ = & \frac{C_{1}}{ν^{2} ϕ (y)} \int_{- \infty}^{y} e^{z} ϕ (z) d z + \frac{C_{1}}{ν^{2} ϕ (y)} \int_{- \infty}^{y} ϕ (z) d z \\ \leq & \frac{C_{1}}{ν} \sqrt{2 π} e^{y} + \frac{C_{1}}{ν} \sqrt{2 π} e^{\frac{1}{2}} \\ \leq & \frac{C_{1}}{ν} \sqrt{2 π} (e^{| y |} + e^{\frac{1}{2} + | y |}) \\ \leq & \frac{C_{1}}{ν} (1 + e^{\frac{1}{2}}) e^{| y |} . \end{matrix}

Since

X (y) = - \int_{y}^{ν^{2} + b} X_{y} (z) d z + X (ν^{2} + b),

one can see that:

\begin{matrix} |X (y)| \leq & \int_{y}^{ν^{2} + b} |X_{y} (s)| d s + |X (ν^{2} + b)| \\ \leq & \frac{C_{1}}{ν} (1 + e^{\frac{1}{2}}) \int_{y}^{ν^{2} + b} e^{| z |} d z + |X (ν^{2} + b)| . \end{matrix}

Furthermore, if

ν^{2} + b < 0

, we have

\begin{matrix} |X (y)| \leq & \frac{C_{1}}{ν} (1 + e^{\frac{1}{2}}) \int_{y}^{ν^{2} + b} e^{- z} d z + |X (ν^{2} + b)| \\ \leq & \frac{C_{1}}{ν} (1 + e^{\frac{1}{2}}) (e^{- y} - e^{- (ν^{2} + b)}) + |X (ν^{2} + b)| \\ \leq & \frac{C_{1}}{ν} (1 + e^{\frac{1}{2}}) e^{| y |} + |X (ν^{2} + b)| \\ \leq & {\hat{C}}_{1} ϵ^{- \frac{1}{2}} (e^{| y |} + 1), \end{matrix}

(A3)

where

{\hat{C}}_{1} = σ^{- 1} [C_{1} \sqrt{2 π} (1 + e^{\frac{1}{2}}) + |X (ν^{2} + b)|]

. If

ν^{2} + b \geq 0

, we have

\begin{matrix} |X (y)| \leq & \frac{C_{1}}{ν} (1 + e^{\frac{1}{2}}) (\int_{- | y |}^{0} e^{- z} d z + \int_{0}^{ν^{2} + b} e^{z} d z) + |X (ν^{2} + b)| \\ \leq & \frac{C_{1}}{ν} (1 + e^{\frac{1}{2}}) (e^{| y |} + e^{ν^{2} + b}) + |X (ν^{2} + b)| \\ \leq & \frac{C_{1}}{ν} (1 + e^{\frac{1}{2}}) (1 + e^{ν^{2} + b}) e^{| y |} + |X (ν^{2} + b)| \\ \leq & {\hat{C}}_{2} ϵ^{- \frac{1}{2}} (e^{| y |} + 1), \end{matrix}

(A4)

where

{\hat{C}}_{2} = σ^{- 1} [C_{1} \sqrt{2 π} (1 + e^{\frac{1}{2}}) (1 + e^{1 + b}) + |X (ν^{2} + b)|]

. In view of (A2)–(A4), defining

C_{2} = σ^{- 1} [C_{1} \sqrt{2 π} (1 + e^{\frac{1}{2}}) (1 + e^{1 + | b |}) + \sup_{y \in (b, 1 + b)} |X (y)|],

we know that

C_{2}

is greater than

{\hat{C}}_{0}, {\hat{C}}_{1}, {\hat{C}}_{2}

and is independent of

ϵ

. Naturally, we have

|X (y)| \leq C_{2} (1 + e^{| y |}) ϵ^{- \frac{1}{2}} .

Appendix C. Proof of Lemma 5

We denote

E_{t} [\cdot] = E [\cdot | y_{t} = y], {Var}_{t} [\cdot] = Var [\cdot | y_{t} = y]

for convenience. Since

y_{t + τ}

can be written as

y_{t + τ} = (1 - e^{- a τ}) b + y_{t} e^{- a τ} + σ \int_{t}^{t + τ} e^{- a (t + τ - s)} d W_{3} (s),

one can see that

\begin{matrix} μ_{τ} : = & E_{t} [y_{t + τ}] = b + (y - b) e^{- a τ}, \\ σ_{τ}^{2} : = & {Var}_{t} [y_{t + τ}] = σ^{2} \frac{1 - e^{- 2 a τ}}{2 a}, \end{matrix}

and

y_{t + τ}

follows the Gaussian law

N (μ_{τ}, σ_{τ}^{2})

; thus,

\begin{matrix} E_{t} [e^{| y_{t + τ} |}] = & \frac{1}{σ_{τ} \sqrt{2 π}} [\int_{0}^{+ \infty} e^{x} e^{- \frac{{(x - μ_{τ})}^{2}}{2 σ_{τ}^{2}}} d x + \int_{- \infty}^{0} e^{- x} e^{- \frac{{(x - μ_{τ})}^{2}}{2 σ_{τ}^{2}}} d x] \\ = & \frac{1}{\sqrt{π}} e^{μ_{τ} + \frac{1}{2} σ_{τ}^{2}} \int_{- (\frac{μ_{τ}}{\sqrt{2} σ_{τ}} + \frac{\sqrt{2}}{2} σ_{τ})}^{+ \infty} e^{- z^{2}} d z + \frac{1}{\sqrt{π}} e^{- μ_{τ} + \frac{1}{2} σ_{τ}^{2}} \int_{\frac{μ_{τ}}{\sqrt{2} σ_{τ}} - \frac{\sqrt{2}}{2} σ_{τ}}^{+ \infty} e^{- z^{2}} d z \\ \leq & 2 e^{| μ_{τ} | + \frac{1}{2} σ_{τ}^{2}} \\ \leq & 2 e^{| b | (1 - e^{- a τ}) + \frac{σ^{2}}{4 a} (1 - e^{- 2 a τ}) + | y | e^{- a τ}} \\ \leq & 2 e^{| b | + \frac{σ^{2}}{4 a}} e^{| y |} . \end{matrix}

Since

a > 1

, let

\hat{C} = 2 e^{| b | + \frac{σ^{2}}{4}}

, then (26) follows.
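The bound (26) can be checked against the closed-form value of the conditional expectation, which follows from the two Gaussian integrals above. The sketch below uses the parameters of pair 1 in Table 3 (a = 5.324, b = −0.695, σ = 2.826); the grids of starting values and horizons, and all function names, are illustrative assumptions.

```python
import math


def ou_moments(y, tau, a, b, sigma):
    """Conditional mean and variance of y_{t+tau} given y_t = y."""
    mu = b + (y - b) * math.exp(-a * tau)
    var = sigma ** 2 * (1.0 - math.exp(-2.0 * a * tau)) / (2.0 * a)
    return mu, var


def expected_exp_abs(mu, var):
    """E[exp(|Y|)] for Y ~ N(mu, var), written with erfc as in the proof."""
    s = math.sqrt(var)
    up = 0.5 * math.exp(mu + 0.5 * var) * math.erfc(-(mu + var) / (math.sqrt(2.0) * s))
    down = 0.5 * math.exp(-mu + 0.5 * var) * math.erfc((mu - var) / (math.sqrt(2.0) * s))
    return up + down


def lemma5_bound_holds(a, b, sigma, ys, taus):
    """Check E_t[exp(|y_{t+tau}|)] <= C_hat * exp(|y|) on a grid (requires a > 1)."""
    c_hat = 2.0 * math.exp(abs(b) + sigma ** 2 / 4.0)
    return all(
        expected_exp_abs(*ou_moments(y, tau, a, b, sigma)) <= c_hat * math.exp(abs(y))
        for y in ys
        for tau in taus
    )


# Pair 1 of Table 3; the grids below are arbitrary illustrative choices.
holds = lemma5_bound_holds(
    a=5.324, b=-0.695, sigma=2.826,
    ys=[-3.0, -0.554, 0.0, 2.0],
    taus=[0.004, 0.1, 0.5, 1.5],
)
```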

References

1. Vidyamurthy, G. Pairs Trading: Quantitative Methods and Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2004; Volume 217.
2. Lin, Y.X.; McCrae, M.; Gulati, C. Loss protection in pairs trading through minimum profit bounds: A cointegration approach. Adv. Decis. Sci. 2006, 2006, 73803.
3. Yan, T.; Chiu, M.C.; Wong, H.Y. Pairs trading under delayed cointegration. Quant. Financ. 2022, 22, 1627–1648.
4. Elliott, R.J.; Van Der Hoek, J.; Malcolm, W.P. Pairs trading. Quant. Financ. 2005, 5, 271–276.
5. Bertram, W.K. Analytic solutions for optimal statistical arbitrage trading. Phys. A Stat. Mech. Its Appl. 2010, 389, 2234–2243.
6. Jurek, J.W.; Yang, H. Dynamic portfolio selection in arbitrage. In Proceedings of the EFA 2006 Meetings Paper, Zurich, Switzerland, 23–26 August 2006.
7. Suzuki, K. Optimal switching strategy of a mean-reverting asset over multiple regimes. Automatica 2016, 67, 33–45.
8. Suzuki, K. Optimal pair-trading strategy over long/short/square positions—Empirical study. Quant. Financ. 2018, 18, 97–119.
9. Endres, S.; Stübinger, J. A flexible regime switching model with pairs trading application to the S&P 500 high-frequency stock returns. Quant. Financ. 2019, 19, 1727–1740.
10. Liu, J.; Timmermann, A. Optimal convergence trade strategies. Rev. Financ. Stud. 2013, 26, 1048–1086.
11. Chiu, M.C.; Wong, H.Y. Dynamic cointegrated pairs trading: Mean–variance time-consistent strategies. J. Comput. Appl. Math. 2015, 290, 516–534.
12. Zhu, D.M.; Gu, J.W.; Yu, F.H.; Siu, T.K.; Ching, W.K. Optimal pairs trading with dynamic mean-variance objective. Math. Methods Oper. Res. 2021, 94, 145–168.
13. Björk, T.; Murgoci, A.; Zhou, X.Y. Mean–variance portfolio optimization with state-dependent risk aversion. Math. Financ. Int. J. Math. Stat. Financ. Econ. 2014, 24, 1–24.
14. Cont, R. Empirical properties of asset returns: Stylized facts and statistical issues. Quant. Financ. 2001, 1, 223–236.
15. Cont, R. Volatility clustering in financial markets: Empirical facts and agent-based models. In Long Memory in Economics; Springer: Berlin/Heidelberg, Germany, 2007; pp. 289–309.
16. Teräsvirta, T.; Zhao, Z. Stylized facts of return series, robust estimates and three popular models of volatility. Appl. Financ. Econ. 2011, 21, 67–94.
17. Gatheral, J.; Jaisson, T.; Rosenbaum, M. Volatility is rough. Quant. Financ. 2018, 18, 933–949.
18. Scott, L.O. Option pricing when the variance changes randomly: Theory, estimation, and an application. J. Financ. Quant. Anal. 1987, 22, 419–438.
19. Fouque, J.P.; Papanicolaou, G.; Sircar, K.R. Mean-reverting stochastic volatility. Int. J. Theor. Appl. Financ. 2000, 3, 101–142.
20. Fouque, J.P.; Lorig, M.J. A fast mean-reverting correction to Heston’s stochastic volatility model. SIAM J. Financ. Math. 2011, 2, 221–254.
21. Chambers, M.J. The estimation of continuous time models with mixed frequency data. J. Econom. 2016, 193, 390–404.
22. Wang, X.; He, X.; Bao, Y.; Zhao, Y. Parameter estimates of Heston stochastic volatility model with MLE and consistent EKF algorithm. Sci. China Inf. Sci. 2018, 61, 042202.
23. Simon, D. Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches; John Wiley & Sons: Hoboken, NJ, USA, 2006.
24. Aihara, S. Estimation of stochastic volatility in the Hull-White model. Appl. Math. Financ. 2000, 7, 153–181.

Figure 1. Stock prices and price spread dynamics for Cecep Wind Power and Cecep Solar Energy.

Figure 2. The wealth dynamics of the out-of-sample testing. Strategy proposed by Zhu et al. [12].

Table 1. Selected stock pairs.

Label	Stock P	Stock Q	Industry
1	Cecep Wind Power	Cecep Solar Energy	Clean Energy Industry
2	Shanxi Lu’An Environmental Energy	Shanxi Coking Coal Energy	Coal Industry
3	Haitong Securities	Citic Securities	Security Industry

Table 2. Statistics for the out-of-sample testing.

Model	Win Rate	Profit-Loss Ratio	Average Profit	Max Drawdown	Sharpe Ratio
Our strategy	56.967%	1.762	0.140%	−2.136%	3.582
Strategy proposed by Zhu et al. [12]	55.328%	1.670	0.122%	−1.909%	3.209

Table 3. Parameters for simulation.

Label	$κ$	$θ$	$η$	$ξ$	a	b	$σ$	$ρ_{13}$	$y_{0}$
1	11.633	−0.233	0.377	1.206	5.324	−0.695	2.826	−0.338	−0.554
2	5.183	−0.173	0.366	0.792	4.109	−0.796	2.986	−0.348	−0.876
3	7.961	−0.244	0.205	−0.022	9.987	−1.246	3.235	−0.398	−1.314

Table 4. Cecep Wind Power and Cecep Solar Energy.

$λ$	Mean (Zhu et al. [12])	S.D. (Zhu et al. [12])	J (Zhu et al. [12])	Mean (Our Strategy)	S.D. (Our Strategy)	J (Our Strategy)
0.25	767.267	43.255	299.519	548.729	27.084	365.340
0.5	435.940	21.627	202.065	326.670	13.542	234.976
0.75	325.497	14.418	169.581	252.651	9.028	191.521
1.0	270.276	10.814	153.339	215.641	6.771	169.794
1.25	237.143	8.651	143.593	193.435	5.417	156.758
1.5	215.055	7.209	137.096	178.631	4.514	148.067

Table 5. Shanxi Lu’An Environmental Energy and Shanxi Coking Coal Energy.

$λ$	Mean (Zhu et al. [12])	S.D. (Zhu et al. [12])	J (Zhu et al. [12])	Mean (Our Strategy)	S.D. (Our Strategy)	J (Our Strategy)
0.25	220.566	18.173	138.004	181.554	8.569	163.198
0.5	162.589	9.086	121.308	143.083	4.284	133.905
0.75	143.263	6.058	115.743	130.259	2.856	124.141
1.0	133.600	4.543	112.960	123.847	2.142	119.258
1.25	127.803	3.635	111.290	120.000	1.714	116.329
1.5	123.938	3.029	110.177	117.436	1.428	114.376

Table 6. Haitong Securities and Citic Securities.

$λ$	Mean (Zhu et al. [12])	S.D. (Zhu et al. [12])	J (Zhu et al. [12])	Mean (Our Strategy)	S.D. (Our Strategy)	J (Our Strategy)
0.25	981.336	50.993	331.271	518.334	17.535	441.463
0.5	542.969	25.496	217.937	311.468	8.768	273.033
0.75	396.847	16.998	180.158	242.513	5.845	216.889
1.0	323.786	12.748	161.269	208.035	4.384	188.818
1.25	279.949	10.199	149.936	187.349	3.507	171.975
1.5	250.725	8.499	142.380	173.558	2.923	160.746

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
