Markovian Approximation of the Rough Bergomi Model for Monte Carlo Option Pricing

Zhu, Qinwen; Loeper, Grégoire; Chen, Wen; Langrené, Nicolas

doi:10.3390/math9050528

Open AccessArticle

Markovian Approximation of the Rough Bergomi Model for Monte Carlo Option Pricing

¹

School of Mathematical Sciences, Nanjing Normal University, Nanjing 210023, China

²

School of Mathematics & Centre for Quantitative Finance and Investment Strategies, Monash University, Clayton, VIC 3800, Australia

³

Data61, Commonwealth Scientific and Industrial Research Organisation, Melbourne, VIC 3008, Australia

^*

Author to whom correspondence should be addressed.

Mathematics 2021, 9(5), 528; https://doi.org/10.3390/math9050528

Submission received: 17 December 2020 / Revised: 24 February 2021 / Accepted: 25 February 2021 / Published: 3 March 2021

(This article belongs to the Special Issue Application of Stochastic Analysis in Mathematical Finance)

Download

Browse Figures

Versions Notes

Abstract

:

The recently developed rough Bergomi (rBergomi) model is a rough fractional stochastic volatility (RFSV) model which can generate a more realistic term structure of at-the-money volatility skews compared with other RFSV models. However, its non-Markovianity brings mathematical and computational challenges for model calibration and simulation. To overcome these difficulties, we show that the rBergomi model can be well-approximated by the forward-variance Bergomi model with wisely chosen weights and mean-reversion speed parameters (aBergomi), which has the Markovian property. We establish an explicit bound on the L2-error between the respective kernels of these two models, which is explicitly controlled by the number of terms in the aBergomi model. We establish and describe the affine structure of the rBergomi model, and show the convergence of the affine structure of the aBergomi model to the one of the rBergomi model. We demonstrate the efficiency and accuracy of our method by implementing a classical Markovian Monte Carlo simulation scheme for the aBergomi model, which we compare to the hybrid scheme of the rBergomi model.

Keywords:

rough fractional stochastic volatility; forward variance model; markovian representation; volatility skew; Volterra integral; rough heston; hybrid scheme; sum of ornstein-uhlenbeck processes

1. Introduction

The rough Bergomi (rBergomi) model introduced by Bayer et al. [1] has gained acceptance for stochastic volatility modelling due to its power-law at-the-money (ATM) volatility skew, which is consistent with empirical studies (see Forde and Zhang [2], Fukasawa [3], Gatheral et al. [4]) and with the effect of the no-arbitrage assumption on the market impact function (see Jusselin and Rosenbaum [5]). However, the stochastic process which characterizes this volatility model is rougher than that of a Brownian motion; in particular, the lack of Markovianity makes classical pricing methods infeasible.

In order to price options under an rBergomi model, Bayer et al. [6] proposed hierarchical adaptive sparse grids, Jacquier et al. [7] developed pricing algorithms for VIX futures and options, and McCrickerd and Pakkanen [8] developed a “turbocharged” Monte Carlo pricing method. A number of short-term approximations have been proposed to obtain fast approximations for short maturities—see, for example, Fukasawa [3], El Euch et al. [9], Bayer et al. [10], and Friz et al. [11]. Regarding the pricing of exotic options in the rBergomi model, Tomas [12] considered the pricing of Asian options, and Bayer et al. [13] and Bayer et al. [14] considered the pricing of American put options. Besides pricing, the calibration of the rBergomi model is also a challenge, for which Bayer et al. [15], Zeron and Ruiz [16], and Horvath et al. [17] propose to use deep learning methods. In spite of this number of recent efforts, the inherent challenges brought by the rBergomi model still prevent its widespread adoption in the industry.

Inspired by the technique by Abi Jaber and El Euch [18], Gatheral and Keller-Ressel [19], and Harms and Stefanovits [20], in which the authors designed a multi-factor stochastic volatility model with Markovian structure to approximate the rough Heston model, we establish an analogous multi-factor affine structure for the rBergomi model. Indeed, the Volterra kernel of the rBergomi model corresponds to a superposition of infinitely many Ornstein-Uhlenbeck (OU) processes with different speeds of mean reversion. Truncating this infinite sum into a finite sum of OU processes yields an approximation of the rBergomi model which is a classical Markovian multi-factor Bergomi model. We refer to this affine, Markovian approximation of the rBergomi model as the aBergomi model. We prove the existence and uniqueness of the solution to this aBergomi model, and show that its affine structure converges to the one of the rBergomi model. Finally, we implement a Monte Carlo scheme for the aBergomi model, and compare it to the hybrid scheme of the rBergomi model (Bennedsen et al. [21]). Our numerical tests demonstrate that using 20 exponential terms in the aBergomi kernel is sufficient to obtain accurate implied volatility curvatures while remaining computationally efficient.

The idea to interpret the conventional two-factor Bergomi model as a Markovian approximation of the rBergomi model was originally briefly suggested by Bayer et al. [1] (p. 892). Our work explores and expands upon this intuition by testing the number of factors to use in the Bergomi model and establishing their respective parameters for best approximation of the rBergomi model. For comprehensiveness, one can mention the alternative Markovian approximation proposed in Carr and Itkin [22] of the rough volatility version of the mean-reverting lognormal volatility model of Sepp [23], Langrené et al. [24], based on a closed-form vol-of-vol expansion for solving the pricing PDE arising from the use of the Dobrić-Ojeda process (Dobrić and Ojeda [25]) to approximate the fractional Brownian motion.

Compared to alternative pricing methods for the rBergomi model, the main advantage of our proposed Markovian approximation approach is that it does not require pricing methods specifically designed for rough volatility models; instead, classical Markovian pricing methods can be used for both vanilla and exotic options. In practice, the Monte Carlo pricing method is the method of choice for the aBergomi model in view of the number of terms needed for good accuracy. The computational cost of simulating our proposed aBergomi model is proportional to the number of time-steps N, which makes it an interesting alternative to the approximate

O (N log N)

hybrid scheme of Bennedsen et al. [21] and the exact

O (N^{3})

covariance-based scheme of Bayer et al. [1], Bayer et al. [10]. The main downside is that the approximation of the rBergomi power kernel by a sum of exponential terms introduces some error, for which we provide an explicit bound in the

L^{2}

sense. In particular, as in the case of the Riemann-sum scheme of Bennedsen et al. [21], the Fourier-based scheme of Benth et al. [26], or the approximation by a Dobrić-Ojeda process in Carr and Itkin [22], a truncation of the power kernel singularity at

s = t

cannot be avoided.

The paper is organized as follows. In Section 2, we introduce the Bergomi and rBergomi models and discuss their respective ATM volatility skews. The rBergomi model is closely related to the RFSV model introduced in Alòs et al. [27], for which the ATM volatility is proved to be equivalent to the power

T^{H - \frac{1}{2}}

for short maturity using the Malliavin technique, a result confirmed in Fukasawa [3] using a martingale expansion approach. We prove in this section that a similar result holds for the rBergomi model, while this does not hold for the Bergomi model (Equation (9)). We also establish the quasi-affine structure of the rough Bergomi model. Section 3 is dedicated to the approximation of the rough Bergomi model by a multi-factor Bergomi model, both theoretically and numerically. Finally, Section 4 compares numerical simulations of the rBergomi model with our approximated Bergomi (aBergomi) model with a finite number of terms, showing the effectiveness of our approximation.

2. Rough Bergomi Skew and Quasi-Affine Structure

Firstly, this section introduces the Bergomi and rough Bergomi stochastic volatility models (Definitions 1 and 2), along with the corresponding notations used throughout the paper.

We consider a filtered probability space

(Ω, F, {(F_{t})}_{t \geq 0}, Q)

, which supports two-dimensional correlated Brownian motions W and B. A log price process

X_{t} : = log (S_{t})

is assumed to follow the dynamics

d X_{t} = - \frac{1}{2} V_{t} d t + \sqrt{V_{t}} d W_{t},

(1)

where

V_{t} \geq 0

is the instantaneous spot variance process. Let

ξ_{t}^{u}, u \geq t

be the instantaneous forward variance for date u observed at time t; in particular,

ξ_{t}^{t} = V_{t}

corresponds to the spot variance.

Bayer et al. [1] proposed the so-called rough Bergomi model where the forward variance follows

d ξ_{t}^{u} = ξ_{t}^{u} η \sqrt{2 α + 1} (u - t)^{α} d B_{t}, u \geq t,

(2)

where W and B have correlation

ρ

,

α ≜ H - \frac{1}{2} \in (- \frac{1}{2}, 0)

is a negative exponent depending on the Hurst exponent

H \in (0, \frac{1}{2})

of the underlying fractional Brownian motion, and

η

is a positive parameter depending on H. The definition of the rBergomi model is summarized below:

Definition 1.

The rBergomi stochastic volatility model takes the form

\{\begin{matrix} d X_{t} = - \frac{1}{2} V_{t} d t + \sqrt{V_{t}} d W_{t}, \\ d ξ_{t}^{u} = ξ_{t}^{u} η \sqrt{2 α + 1} {(u - t)}^{α} d B_{t}, \end{matrix}

(3)

where

α = H - \frac{1}{2} \in (- \frac{1}{2}, 0)

, and

d {〈W, B〉}_{t} = ρ d t

.

By contrast, the two-factor Bergomi model is defined as follows.

Definition 2.

The two-factor Bergomi model (Bergomi [28], Bergomi [29]) is defined by:

\{\begin{matrix} d X_{t} = - \frac{1}{2} V_{t} d t + \sqrt{V_{t}} d W_{t}^{S}, \\ d ξ_{t}^{u} = ξ_{t}^{u} α_{θ} ω ((1 - θ) e^{- κ_{X} (u - t)} d W_{t}^{X} + θ e^{- κ_{Y} (u - t)} d W_{t}^{Y}), \end{matrix}

(4)

with

\begin{matrix} d {〈 W^{S}, W^{X} 〉}_{t} = ρ_{S X} d t, \\ d {〈 W^{S}, W^{Y} 〉}_{t} = ρ_{S Y} d t, \\ d {〈 W^{X}, W^{Y} 〉}_{t} = ρ_{X Y} d t, \end{matrix}

where

ξ_{t}^{t} = V_{t} = ω

is the lognormal volatility of the instantaneous variance under the normalizing factor

α_{θ} = ((1 - θ)^{2} + 2 ρ_{X Y} θ (1 - θ) + θ^{2})^{- \frac{1}{2}}

and θ is a mixing parameter of the short-term factor driven by

W^{X}

and the long-term factor driven by

W^{Y}

(

κ_{X} > κ_{Y}

).

Assumption 1.

Without loss of generality, we assume throughout the paper that the initial forward variance curve

ξ_{0}^{u}, u \geq 0

is flat. This simplification is common in the rBergomi literature; see, for example, Bayer et al. [1], Bayer et al. [6], and Bayer et al. [15]. We henceforth use the notation

ξ_{0}

for the constant initial forward variance curve.

2.1. ATM Volatility Skew

This subsection derives the ATM volatility skew of the rBergomi and Bergomi models, as the more realistic ATM volatility skew of the rBergomi model over the one of the Bergomi model is one of the motivations behind the introduction of the rBergomi model.

From Bergomi and Guyon [30], we can define the price and the volatility dynamics of a generic stochastic volatility model as follows:

\{\begin{matrix} d X_{t} = - \frac{1}{2} V_{t} d t + \sqrt{V_{t}} d W_{t}, \\ d ξ_{t}^{u} = λ (t, u, ξ_{t}^{u}) d B_{t}, \end{matrix}

(5)

where

X_{t} = ln (S_{t})

is the log-spot,

V_{t}

is the instantaneous spot variance,

ξ_{t}^{u}

is the instantaneous forward variance for date u observed at time t, and

λ = (λ_{1}, \dots, λ_{d})

is the volatility of forward instantaneous variances which takes values in

R^{d}

where d is the dimension of the Brownian motion B. Note that in this formulation, the covariance between spot and variance is modelled through the first component of

λ

, see Bergomi and Guyon [30] for more details.

One can derive the following second-order expression (w.r.t. volatility of volatility) for the Black-Scholes implied volatility:

σ_{B S} (k, T) = {\hat{σ}}_{T}^{A T M} + S_{T} k + C_{T} k^{2} + O (ε^{3}),

(6)

where

k = ln (\frac{K}{S_{0}})

, K is the strike and

ε

is a dimensionless scaling factor for the volatility of variances. The ATM volatility and the two coefficients

S_{T}

and

C_{T}

are given by

\begin{matrix} {\hat{σ}}_{T}^{A T M} & = {\hat{σ}}_{T}^{V S} [1 + \frac{ε}{4 v} C^{X ξ} + \frac{ε^{2}}{32 v^{3}} (12 (C^{X ξ})^{2} - v (v + 4) C^{ξ ξ} + 4 v (v - 4) C^{μ})], \\ S_{T} & = {\hat{σ}}_{T}^{V S} [\frac{ε}{2 v^{2}} C^{X ξ} + \frac{ε^{2}}{8 v^{3}} (4 C^{μ} v - 3 {(C^{X ξ})}^{2})], \\ C_{T} & = {\hat{σ}}_{T}^{V S} \frac{ε^{2}}{8 v^{4}} [4 C^{μ} v + C^{ξ ξ} v - 6 (C^{X ξ})^{2}], \end{matrix}

where

v = \int_{0}^{T} ξ_{0}^{s} d s

is the total variance to expiration T,

{\hat{σ}}_{T}^{V S} = \sqrt{\frac{v}{T}} = \sqrt{\frac{\int_{0}^{T} ξ_{0}^{s} d s}{T}}

is the effective volatility. Here,

ξ_{0}^{u} = ξ_{0}

for any

u \geq 0

under Assumption 1, which means that

v = ξ_{0} T

and

{\hat{σ}}_{T}^{V S} = \sqrt{ξ_{0}}

.

From Bergomi and Guyon [30], we can derive the following second-order expansion for the autocorrelations

C^{X ξ}, C^{ξ ξ}, C^{μ}

:

$C_{t}^{X ξ} (ξ) = \int_{t}^{T} d s \int_{s}^{T} d u μ (s, u, ξ) = \int_{t}^{T} d s \int_{s}^{T} d u \frac{E [d X_{s} d ξ_{s}^{u}]}{d s}$ is the doubly integrated spot-variance covariance function,
$C^{X ξ} = C_{0}^{X ξ} (ξ_{0}) = \int_{0}^{T} d s \int_{s}^{T} d u \frac{E [d X_{s} d ξ_{0}^{u}]}{d s}$ .
$C_{t}^{ξ ξ} (ξ) = \int_{t}^{T} d s \int_{s}^{T} d u \int_{s}^{T} d u^{'} ν (s, u, u^{'}, ξ) = \int_{t}^{T} d s \int_{s}^{T} d u \int_{s}^{T} u^{'} \frac{E [d ξ_{s}^{u} d ξ_{s}^{u^{'}}]}{d s}$ is the triply integrated variance/variance covariance function,
$C^{ξ ξ} = C_{0}^{ξ ξ} (ξ_{0}) = \int_{0}^{T} d t \int_{s}^{T} d u \int_{s}^{T} d u^{'} \frac{E [d ξ_{0}^{u} d ξ_{0}^{u^{'}}]}{d s}$ .
$C_{t}^{μ} (ξ) = \int_{t}^{T} d s \int_{s}^{T} d u μ (s, u, ξ) \partial_{ξ_{0}^{u}} (C_{s}^{X ξ} (ξ))$ is the double time-integral of the instance spot variance covariance function times the sensitivity of $C_{t}^{X ξ} (ξ)$ with respect to instantaneous forward variances,
$C^{μ} = C_{0}^{μ} (ξ_{0}) = \int_{0}^{T} d s \int_{s}^{T} d u \frac{E [d X_{s} d ξ_{0}^{u}]}{d s} \partial_{ξ_{0}^{u}} (C_{s}^{X ξ} (ξ))$ ,

where

μ

and

ν

are given by

\begin{matrix} μ (t, u, y) = \sqrt{y^{t}} λ_{1} (t, u, y) = \frac{E [d X_{t} d ξ_{t}^{u} | ξ_{t} = y]}{d t} = \frac{E [\frac{d S_{t}}{S_{t}} d ξ_{t}^{u} | ξ_{t} = y]}{d t}, \\ ν (t, u, u^{'}, y) = \sum_{i = 1}^{d} λ_{i} (t, u, y) λ_{i} (t, u^{'}, y) = \frac{E [d ξ_{t}^{u} d ξ_{t}^{u^{'}} | ξ_{t} = y]}{d t} . \end{matrix}

(7)

2.1.1. ATM Volatility Skew in the rBergomi Model

Theorem 1.

In the rBergomi model (3), the ATM volatility skew

ψ (T)

satisfies

ψ (T) ≜ {|\frac{\partial}{\partial_{k}} σ_{B S} (k, T)|}_{k = 0} \sim T^{H - \frac{1}{2}} .

(8)

Proof.

We first explicit the autocorrelation functional in the rBergomi model. Using the fact that

\frac{E [d X_{t} d ξ_{t}^{u}]}{d t} = ρ η \sqrt{2 α + 1} {(u - t)}^{α} \sqrt{ξ_{t}^{t}} ξ_{t}^{u}

, the autocorrelation functionals

C^{X ξ}

and

C^{ξ ξ}

are given by

\begin{matrix} C^{X ξ} & = \int_{0}^{T} d s \int_{s}^{T} d u \frac{E [d X_{s} d ξ_{0}^{u}]}{d s} \\ = ρ η \sqrt{2 α + 1} \int_{0}^{T} \sqrt{ξ_{0}^{s}} d s \int_{s}^{T} ξ_{0}^{u} {(u - s)}^{α} d u + O (ε^{3}), \\ C^{ξ ξ} & = \int_{0}^{T} d s \int_{s}^{T} d u \int_{s}^{T} d u^{'} \frac{E [d ξ_{0}^{u} d ξ_{0}^{u^{'}}]}{d s} \\ = \int_{0}^{T} d s \int_{s}^{T} d u \int_{s}^{T} d u^{'} η^{2} (2 α + 1) {(u - s)}^{α} {(u^{'} - s)}^{α} ξ_{0}^{u} ξ_{0}^{u^{'}} \\ = η^{2} (2 α + 1) \int_{0}^{T} d s {(\int_{0}^{T} ξ_{0}^{u} {(u - s)}^{α} d u)}^{2} + O (ε^{4}) . \end{matrix}

Then, using the fact that

\begin{matrix} \partial_{ξ_{s}^{u}} (C_{s}^{X ξ} (ξ)) & = ρ η \sqrt{2 α + 1} [\int_{s}^{T} d t \sqrt{ξ_{s}^{t}} {(u - t)}^{α} 1_{u > t} + \frac{1}{2 \sqrt{ξ_{s}^{u}}} \int_{u}^{T} ξ_{s}^{t} {(t - u)}^{α} d t] \\ = ρ η \sqrt{2 α + 1} [\int_{s}^{u} d t \sqrt{ξ_{s}^{t}} {(u - t)}^{α} + \frac{1}{2 \sqrt{ξ_{s}^{u}}} \int_{u}^{T} ξ_{s}^{t} {(t - u)}^{α} d t], \end{matrix}

we obtain

\begin{matrix} C^{μ} = & \int_{0}^{T} d s \int_{s}^{T} d u \frac{E [d X_{s} d ξ_{0}^{u}]}{d t} \partial_{ξ_{0}^{u}} (C_{s}^{X ξ} (ξ)) \\ = & ρ^{2} η^{2} (2 α + 1) \int_{0}^{T} \sqrt{ξ_{0}^{s}} d s \int_{s}^{T} {(u - s)}^{α} d u \\ \times [\int_{s}^{u} \sqrt{ξ_{0}^{t}} ξ_{0}^{u} {(u - t)}^{α} d t + \frac{\sqrt{ξ_{0}^{u}}}{2} \int_{u}^{T} ξ_{0}^{t} {(t - u)}^{α} d t] + O (ε^{4}) . \end{matrix}

Therefore, using Assumption 1, we obtain the following explicit first-order approximation:

C^{X ξ} = ρ η \sqrt{2 H} \int_{0}^{T} \sqrt{ξ_{0}} d s \int_{s}^{T} ξ_{0} (u - s)^{α} d u + O (ε^{3}) \approx C_{H} ρ ξ_{0}^{\frac{3}{2}} T^{H + \frac{3}{2}},

where

C_{H}

is a constant depending on H. We are then able to compute the first-order approximations of the three correlation values

C^{X ξ}, C^{ξ ξ}, C^{μ}

explicitly. The first-order approximation of

σ_{B S} (k, T)

can be written as follows:

\begin{matrix} σ_{B S} (k, T) & = {\hat{σ}}_{T}^{V S} + \frac{1}{4 v} C^{x ξ} {\hat{σ}}_{T}^{V S} ε + \frac{1}{2 v^{2}} C^{X ξ} {\hat{σ}}_{T}^{V S} ε k \\ = {\hat{σ}}_{T}^{V S} + (\frac{1}{4 v} + \frac{k}{2 v^{2}}) C_{H} ρ ξ_{0}^{\frac{3}{2}} T^{H + \frac{3}{2}} {\hat{σ}}_{T}^{V S} ε \\ = \sqrt{ξ_{0}} + (\frac{ξ_{0} T}{4} + \frac{k}{2}) C_{H} ρ T^{H - \frac{1}{2}} ε . \end{matrix}

Thus, the ATM volatility skew generated by the rBergomi model satisfies (8), which is consistent with empirical evidence (see for example, Gatheral et al. [4]). □

Remark 1.

Besides the rBergomi model, there exist other fractional volatility models which also satisfy Equation (8); see, for example, Fukasawa [31] (subsection 3.3).

2.1.2. ATM Volatility Skew in the Two-Factor Bergomi Model

We now compare this result to the volatility skew in the classical two-factor Bergomi model.

Theorem 2.

In the two-factor Bergomi model, the ATM volatility skew satisfies

ψ (T) \sim \frac{C_{1} (κ_{X} T - 1 + e^{- κ_{X} T})}{T^{2}} + \frac{C_{2} (κ_{Y} T - 1 + e^{- κ_{Y} T})}{T^{2}} .

(9)

Proof.

The Brownian motions

W^{S}, W^{X}, W^{Y}

can be decomposed as:

\begin{matrix} W^{S} = W^{1}, \\ W^{X} = ρ_{S X} W^{1} + \sqrt{1 - ρ_{S X}^{2}} W^{2}, \\ W^{Y} = ρ_{S Y} W^{1} + χ \sqrt{1 - ρ_{S Y}^{2}} W^{2} + \sqrt{(1 - χ^{2}) (1 - ρ_{S Y}^{2})} W^{3}, \end{matrix}

where

W^{1}, W^{2}, W^{3}

are three independent Brownian motions and

χ ≜ \frac{ρ_{X Y} - ρ_{S X} ρ_{S Y}}{\sqrt{1 - ρ_{S X}^{2}} \sqrt{1 - ρ_{S Y}^{2}}}

. Thus, the volatilities of variance

λ = (λ_{1}, λ_{2}, λ_{3})

in the general formulation (5) can be written as:

\begin{matrix} λ_{1} (t, u, ξ) = α_{θ} ω ξ_{0}^{u} [(1 - θ) ρ_{S X} e^{- κ_{X} (u - t)} + θ ρ_{S Y} e^{- κ_{Y} (u - t)}], \\ λ_{2} (t, u, ξ) = α_{θ} ω ξ_{0}^{u} [(1 - θ) \sqrt{1 - ρ_{S X}^{2}} e^{- κ_{X} (u - t)} + θ χ \sqrt{1 - ρ_{S Y}^{2}} e^{- κ_{Y} (u - t)}], \\ λ_{3} (t, u, ξ) = α_{θ} ω ξ_{0}^{u} θ \sqrt{(1 - χ^{2}) (1 - ρ_{S Y}^{2})} e^{- κ_{Y} (u - t)}, \end{matrix}

or equivalently:

λ_{i} (t, u, ξ) = α_{θ} ω ξ_{0}^{u} (ω_{i X} e^{- κ_{X} (u - t)} + ω_{i Y} e^{- κ_{Y} (u - t)}),

where

\begin{matrix} {(ω_{i X})}_{i = 1, 2, 3} ≜ ((1 - θ) ρ_{S X}, (1 - θ) \sqrt{1 - ρ_{S X}^{2}}, 0)^{⊤}, \\ {(ω_{i Y})}_{i = 1, 2, 3} ≜ (θ ρ_{S Y}, θ χ \sqrt{1 - ρ_{S Y}^{2}}, θ \sqrt{(1 - χ^{2}) (1 - ρ_{S Y}^{2})})^{⊤} . \end{matrix}

The corresponding covariances can be expressed similarly as:

\begin{matrix} C^{X ξ} = & \int_{0}^{T} d u \int_{0}^{u} d t \sqrt{ξ_{0}^{t}} λ_{1} (t, u, ξ_{0}) \\ = & α_{θ} ω [(1 - θ) ρ_{S X} \int_{0}^{T} d u ξ_{0}^{u} \int_{0}^{u} d t \sqrt{ξ_{0}^{t}} e^{- κ_{X} (u - t)} + θ ρ_{S Y} \int_{0}^{T} d u ξ_{0}^{u} \int_{0}^{u} d t \sqrt{ξ_{0}^{t}} e^{- κ_{Y} (u - t)}], \\ C^{ξ ξ} = & \sum_{i = 1}^{3} \int_{0}^{T} d s (\int_{s}^{T} d u λ_{i} (s, u, ξ_{0}))^{2} \\ = & α_{θ}^{2} ω^{2} \sum_{i = 1}^{3} \int_{0}^{T} d s (ω_{i X} \int_{s}^{T} d u ξ_{0}^{u} e^{- κ_{X} (u - s)} + ω_{i Y} \int_{s}^{T} d u ξ_{0}^{u} e^{- κ_{Y} (u - s)})^{2}, \\ C^{μ} = & \int_{0}^{T} d s \int_{s}^{T} d u \sqrt{ξ_{0}^{s}} λ_{1} (s, u, ξ_{0}) (\frac{1}{2 \sqrt{ξ_{0}^{u}}} \int_{u}^{T} d t λ_{1} (u, t, ξ_{0}) + \int_{s}^{u} d r \sqrt{ξ_{0}^{r}} \partial_{ξ_{0}^{u}} λ_{1} (r, u, ξ)) . \end{matrix}

Once again using Assumption 1 and the autocorrelations provided by Bergomi and Guyon [30], we obtain

\begin{matrix} C^{X ξ} = & α_{θ} ω ξ_{0}^{\frac{3}{2}} T^{2} (ω_{1 X} J (κ_{X} T) + ω_{1 Y} J (κ_{Y} T)), \\ C^{ξ ξ} = & α_{θ}^{2} ω ξ_{0}^{2} T^{3} (ω_{0} + ω_{X} I (κ_{X} T) + ω_{Y} I (κ_{Y} T) + ω_{X X} I (2 κ_{X} T) + ω_{Y Y} I (2 κ_{Y} T) + ω_{X Y} I ((κ_{X} + κ_{Y}) T)), \end{matrix}

where

\begin{matrix} ω_{0} & = \sum_{i = 1}^{3} {(\frac{ω_{i X}}{κ_{X} T} + \frac{ω_{i Y}}{κ_{Y} T})}^{2}, ω_{X} = - 2 \sum_{i = 1}^{3} \frac{ω_{i X}}{κ_{X} T} (\frac{ω_{i X}}{κ_{X} T} + \frac{ω_{i Y}}{κ_{Y} T}), ω_{Y} = - 2 \sum_{i = 1}^{3} \frac{ω_{i Y}}{κ_{Y} T} (\frac{ω_{i X}}{κ_{X} T} + \frac{ω_{i Y}}{κ_{Y} T}), \end{matrix}

ω_{X X} = \sum_{i = 1}^{3} \frac{ω_{i X}^{2}}{κ_{X}^{2} T^{2}}, ω_{Y Y} = \sum_{i = 1}^{3} \frac{ω_{i Y}^{2}}{κ_{Y}^{2} T^{2}}, ω_{X Y} = 2 \sum_{i = 1}^{3} \frac{ω_{i X} ω_{i Y}}{κ_{X} κ_{Y} T^{2}},

and

I (z) = \frac{1 - e^{- z}}{z}, J (z) = \frac{z - 1 + e^{- z}}{z^{2}}, K (z) = \frac{1 - e^{- z} - z e^{- z}}{z^{2}}, H (z) = \frac{J (z) - K (z)}{z} .

Similarly, we have

C^{μ} = α_{θ}^{2} ω^{2} ξ_{0}^{2} T^{3} (C_{1}^{μ} + C_{2}^{μ})

, with the coefficients

\begin{matrix} C_{1}^{μ} = \frac{1}{2} ω_{1 X}^{2} H (κ_{X} T) + \frac{1}{2} ω_{1 Y}^{2} H (κ_{Y} T) - ω_{1 X} ω_{1 Y} \frac{J (κ_{Y} T) - J (κ_{X} T)}{(κ_{X} + κ_{Y}) T}, \\ C_{2}^{μ} = ω_{X}^{″} J (κ_{X} T) + ω_{Y}^{″} J (κ_{Y} T) + ω_{X X}^{″} J (2 κ_{X} T) + ω_{Y Y}^{″} J (2 κ_{Y} T) + ω_{X Y}^{″} J ((κ_{X} + κ_{Y}) T), \end{matrix}

and

\begin{matrix} ω_{X}^{″} = \frac{ω_{1 X}^{2}}{κ_{X} T} + \frac{ω_{1 X} ω_{1 Y}}{κ_{Y} T}, & ω_{Y}^{″} = \frac{ω_{1 Y}^{2}}{κ_{Y} T} + \frac{ω_{1 X} ω_{1 Y}}{κ_{Y} T}, \\ ω_{X X}^{″} = - \frac{ω_{1 X}^{2}}{κ_{X} T}, & ω_{Y Y}^{″} = - \frac{ω_{1 Y}^{2}}{κ_{Y} T}, & ω_{X Y}^{″} = - \frac{ω_{1 X} ω_{1 Y}}{κ_{X} T} - \frac{ω_{1 X} ω_{1 Y}}{κ_{Y} T} . \end{matrix}

Since

C^{X ξ} \sim T^{2} (C_{1} \cdot \frac{κ_{X} T - 1 + e^{- κ_{X} T}}{(κ_{X} T)^{2}} + C_{2} \cdot \frac{κ_{Y} T - 1 + e^{- κ_{Y} T}}{(κ_{Y} T)^{2}})

and

C_{1}, C_{2}

are constants, we can derive the term structure of the ATM volatility skew as in Equation (9) with the first order in

ε

. □

However, this result derived for the Bergomi model by the Bergomi-Guyon expansion [30] is inconsistent with empirical evidence; see, for example, Bayer et al. [1]. This suggests that the power-law kernel of the forward variance curve in the rBergomi model will lead to more realistic and accurate pricing and hedging results than the exponential kernel of the forward variance curve in the Bergomi model.

2.2. Markovian Representation of the Rough Bergomi Model

The purpose of this section is to establish the infinite-dimensional affine nature and Markovianity of the rBergomi model.

Definition 3.

An Ornstein-Uhlenbeck (OU) process

Y_{t}^{x}

is the solution of the following stochastic differential equation (SDE):

d Y_{t}^{x} = x (a - Y_{t}^{x}) d t + σ d B_{t},

(10)

where

x > 0

is the mean-reversion speed,

a > 0

is the mean-reversion level, and

B_{s}

is a standard Brownian motion. Its strong solution is explicitly given by

Y_{t}^{x} = Y_{0} + σ \int_{0}^{t} e^{- x (t - s)} d B_{s} .

(11)

Assumption 2.

In the rest of the paper, we always assume that

\begin{matrix} a & ≜ Y_{0}, \end{matrix}

(12)

\begin{matrix} σ & ≜ η \sqrt{2 α + 1}, \end{matrix}

(13)

where η and α come from Definition 1 of the rBergomi model (see Bayer et al. [1]).

Definition 4.

Without loss of generality, we define, for

H < \frac{1}{2}

, the sigma-finite measure

μ (d x)

on

(0, \infty)

as

μ (d x) = \frac{d x}{x^{\frac{1}{2} + H} Γ (\frac{1}{2} - H)} .

2.2.1. Volterra-Type Integral as a Functional of a Markov Process

Theorem 3.

Using Definitions 3 and 4, the Volterra-type integral

{\tilde{X}}_{t} ≜ \int_{0}^{t} {(t - s)}^{H - \frac{1}{2}} d B_{s}

in the rBergomi model has the Markovian representation

σ {\tilde{X}}_{t} = \int_{0}^{\infty} (Y_{t}^{x} - Y_{0}) μ (d x) .

(14)

Proof.

The Laplace transform of the measure

μ

in Definition 4 is

L (μ) (τ) = \int_{0}^{\infty} e^{- τ x} μ (d x) = \int_{0}^{\infty} \frac{e^{- τ x} x^{- \frac{1}{2} - H}}{Γ (\frac{1}{2} - H)} d x = τ^{H - \frac{1}{2}},

which can be recognised as the power-law kernel in the Volterra-type integral. Consequently, we have

σ {\tilde{X}}_{t} = \int_{0}^{t} \int_{0}^{\infty} σ e^{- x (t - s)} μ (d x) d B_{s}

, and using Fubini’s stochastic theorem, see Protter [32], we obtain

σ {\tilde{X}}_{t} = \int_{0}^{\infty} \int_{0}^{t} σ e^{- x (t - s)} d B_{s} μ (d x)

. From Definition 3, where

\int_{0}^{t} σ e^{- x (t - s)} d B_{s} = Y_{t}^{x} - Y_{0}

, we obtain the Markovian representation given by Equation (14). □

Theorem 4.

The OU process (11) has the affine structure

\begin{matrix} E [exp (\int_{0}^{\infty} Y_{t}^{x} μ (d x)) | F_{s}] & = & exp (\frac{σ^{2}}{2} \int_{0}^{t - s} {(\int_{0}^{\infty} e^{- s x} μ (d x))}^{2} d s + \int_{0}^{\infty} Y_{s}^{x} e^{- (t - s) x} μ (d x)) . \end{matrix}

Proof.

From Fubini’s stochastic theorem,

\int_{0}^{\infty} Y_{t}^{x} μ (d x)

is Gaussian under the filtration

F_{s}

for

0 \leq s \leq t

, with mean

E [\int_{0}^{\infty} Y_{t}^{x} μ (d x) | F_{s}] = \int_{0}^{\infty} Y_{s}^{x} e^{- (t - s) x} μ (d x) .

Furthermore, using Itō’s isometry, we have the conditional variance:

\begin{matrix} Var (\int_{0}^{\infty} Y_{t}^{x} μ (d x) | F_{s}) & = σ^{2} \int_{s}^{t} {(\int_{0}^{\infty} e^{- (t - s) x} μ (d x))}^{2} d s \\ = σ^{2} \int_{0}^{t - s} {(\int_{0}^{\infty} e^{- s x} μ (d x))}^{2} d s . \end{matrix}

Thus,

\begin{matrix} E [exp (\int_{0}^{\infty} Y_{t}^{x} μ (d x)) | F_{s}] & = exp (\frac{1}{2} Var (\int_{0}^{\infty} Y_{t}^{x} μ (d x) | F_{s}) + E [\int_{0}^{\infty} Y_{t}^{x} μ (d x) | F_{s}]) \\ = exp (\frac{σ^{2}}{2} \int_{0}^{t - s} {(\int_{0}^{\infty} e^{- s x} μ (d x))}^{2} d s + \int_{0}^{\infty} Y_{s}^{x} e^{- (t - s) x} μ (d x)) . \end{matrix}

□

2.2.2. Quasi-Affine Structure in the rBergomi Model

From Definition 1 and Theorem 3, the rBergomi model can be rewritten in the following form:

\{\begin{matrix} d X_{t} & = - \frac{1}{2} V_{t} d t + \sqrt{V_{t}} d W_{t}, \\ log \frac{V_{t}}{ξ_{0}} & = \int_{0}^{\infty} (Y_{t}^{x} - Y_{0}) μ (d x), \end{matrix}

where

X_{t}

is the log stock price,

ξ_{0}

is the initial flat forward variance curve, and

W, B

are two Brownian motions with correlation

d {〈 W, B 〉}_{t} = ρ d t

and

ρ \in [- 1, 1]

. Our aim is now to write the log stock price

X_{t}

in a quasi-affine form as the first coordinate of an infinite-dimensional affine process. To do so, we introduce the following symmetric non-negative tensor:

L^{1} (μ) \otimes_{s} L^{1} (μ) = \{y^{\otimes 2} : y \in L^{1} (μ)\} \subset L^{1} {(μ)}^{\otimes 2} \subset L^{1} (μ^{\otimes 2}),

where we used the notation

y^{\otimes 2} ≜ y \otimes y

. Let

Π_{t} = (i \otimes 1) {(Y_{t}^{x})}^{\otimes 2} \in i L^{1} (μ) \otimes_{s} L^{1} (μ)

, where i is the imaginary unit (

i \times i = - 1

). The relation

{(\int_{0}^{\infty} Y_{t}^{x} μ (d x))}^{2} = \int_{0}^{\infty} (i \otimes 1) {(Y_{t}^{x})}^{\otimes 2} μ^{\otimes 2} (d x)

holds. Therefore, the log stock price dynamics can be written as

\begin{matrix} d X_{t} & = \sqrt{ξ_{0}} \cdot (E^{\frac{\int_{0}^{\infty} Π_{t} μ^{\otimes 2} (d x)}{4}} d W_{t} - \frac{1}{2} E^{\int_{0}^{\infty} Y_{t}^{x} μ (d x)}) \\ = \sqrt{ξ_{0}} e^{\frac{\int_{0}^{\infty} Π_{t} μ^{\otimes 2} (d x)}{4}} e^{- \frac{η^{2}}{4} t^{2 α + 1}} d W_{t} - \frac{\sqrt{ξ_{0}}}{2} e^{\int_{0}^{\infty} Y_{t}^{x} μ (d x)} e^{- \frac{η^{2}}{2} t^{2 α + 1}} d t, \end{matrix}

where

E

is the Doléans-Dade stochastic exponential.

Theorem 5.

The process

Π_{t} = (i \otimes 1) {(Y_{t}^{x})}^{\otimes 2}

satisfies the affine structure

E [e^{\int_{0}^{\infty} Π_{t} μ^{\otimes 2} (d x)} | F_{s}] = e^{Φ_{1} + Φ_{2}},

(15)

where

\begin{matrix} Φ_{1} & ≜ - \frac{1}{2} log (1 - 2 \int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u), \end{matrix}

(16)

\begin{matrix} Φ_{2} & ≜ \frac{\int_{0}^{\infty} Π_{s} {(e^{- (t - s) x})}^{\otimes 2} μ^{\otimes 2} (d x)}{σ^{2} - 2 σ^{2} \int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u} . \end{matrix}

(17)

Proof.

From Fubini’s stochastic theorem,

\frac{\int_{0}^{\infty} Y_{t}^{x} μ (d x)}{σ \sqrt{\int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u}}

is Gaussian under the filtration

F_{s}

for

0 \leq s \leq t

, with conditional mean

E [\frac{\int_{0}^{\infty} Y_{t}^{x} μ (d x)}{σ \sqrt{\int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u}} | F_{s}] = \frac{\int_{0}^{\infty} Y_{s}^{x} e^{- (t - s) x} μ (d x)}{σ \sqrt{\int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u}}

and conditional variance

Var (\frac{\int_{0}^{\infty} Y_{t}^{x} μ (d x)}{σ \sqrt{\int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u}} | F_{s}) = 1 .

Then, the random variable defined as

\frac{\int_{0}^{\infty} Π_{t} μ^{\otimes 2} (d x)}{σ^{2} \int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u} = {(\frac{\int_{0}^{\infty} Y_{t}^{x} μ (d x)}{σ \sqrt{\int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u}})}^{2}

is a noncentral

χ^{2}

distribution with one degree of freedom and noncentrality parameter

\frac{{(\int_{0}^{\infty} Y_{s}^{x} e^{- (t - s) x} μ (d x))}^{2}}{σ^{2} \int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u} = \frac{\int_{0}^{\infty} Π_{s} {(e^{- (t - s) x})}^{\otimes 2} μ^{\otimes 2} (d x)}{σ^{2} \int_{0}^{t - s} {(\int_{0}^{\infty} e^{- u x} μ (d x))}^{2} d u} .

Thus, the Formulas (16) and (17) for

Φ_{1}

and

Φ_{2}

follow from the characteristic function of the noncentral

χ^{2}

distribution, which concludes the proof. □

Corollary 1.

The rBergomi model admits an infinite-dimensional Markovian representation.

Proof.

This corollary follows from Theorem 5 which exhibits that the rBergomi model has an exponential-affine dependence on x; hence, the model is Markovian in each dimension. □

3. Rough Bergomi Approximation and Monte Carlo Schemes

In this Section, we first introduce the aBergomi model which is used to approximate the rBergomi model (3). After that, we will demonstrate the existence and uniqueness of the solution of this aBergomi model. We also prove that the aBergomi model is well-defined and the solution of the aBergomi model converges to that of the rBergomi model when the number of terms n in the aBergomi model goes to infinity. At the same time, we show that the aBergomi model inherits the affine structure of the Bergomi model.

3.1. Approximation of the Rough Bergomi Model by an n-Term Bergomi Model

Since the rBergomi model can be represented by

\{\begin{matrix} d S_{t} & = S_{t} \sqrt{V_{t}} d W_{t}, \\ log \{\frac{V_{t}}{ξ_{0}}\} & = \int_{0}^{\infty} σ \int_{0}^{t} e^{- x (t - s)} d B_{s} μ (d x), \end{matrix}

and the n-term Bergomi model with the same Brownian motion in the variance process can be represented by

\{\begin{matrix} d S_{t} & = S_{t} \sqrt{V_{t}} d W_{t}, \\ log \{\frac{V_{t}}{ξ_{0}}\} & = \int_{0}^{t} (\sum_{i = 1}^{n} α_{i} e^{- κ_{i} (t - s)}) d B_{s}, \end{matrix}

(18)

we can view the rBergomi model as a continuous infinite-term Bergomi model under the measure

μ (\cdot)

, in which the mean-reversion speed x has been integrated from 0 to ∞, with respect to the Brownian motion

B_{s}

. We can therefore approximate the rBergomi model by an n-term exponential kernel

K_{\exp} = \sum_{i = 1}^{n} α_{i} e^{- κ_{i} (t - s)}

instead of the power kernel

K_{pow} = \sqrt{2 α + 1} {(t - s)}^{α}

of the Volterra process in the rBergomi model.

Following Equation (18), after approximating the exponential kernel

K (τ) = \int_{0}^{\infty} e^{- x τ} μ (d x)

by the kernel

K^{n} (τ) = \sum_{i = 1}^{n} α_{i}^{n} e^{- τ x_{i}^{n}}

, we can rewrite the aBergomi model (18) as follows:

\{\begin{matrix} d S_{t}^{n} & = S_{t}^{n} \sqrt{V_{t}^{n}} d W, \\ log \{\frac{V_{t}^{n}}{ξ_{0}}\} & = \sum_{i = 1}^{n} α_{i}^{n} V_{t}^{n, i}, \\ d V_{t}^{n, i} & = - x_{i}^{n} (a - V_{t}^{n}) d t + σ d B_{t} a = Y_{0}, σ = η \sqrt{2 α + 1}, \end{matrix}

(19)

where

{(α_{i}^{n})}_{1 \leq i \leq n}

are positive weights,

{(x_{i}^{n})}_{1 \leq i \leq n}

are mean-reverting speeds, and

{〈 W, B 〉}_{t} = ρ d t

, with initial conditions

S_{0}^{n} = S_{0} = 1

and

V_{0}^{n, i} = V_{0} = 0

.

3.1.1. Existence and Uniqueness of $(S^{n}, V^{n})$

We rewrite

V^{n}

in (19) as the following stochastic equation

log (\frac{V_{t}^{n}}{ξ_{0}}) = σ \int_{0}^{t} K^{n} (t - s) d B_{s} .

(20)

Theorem 6.

Under the conditions of the model (19), there exists a unique, strong, non-negative solution

V^{n}

to Equation (20).

Proof.

Øksendal and Zhang [33] imply that there exists a unique, strong, non-negative solution

V^{n}

to Equation (20) under the conditions of the model (19). □

Then, the strong existence and uniqueness of

(S^{n}, V^{n})

follows, along with its Markovianity w.r.t. the spot price

S^{n}

and the factors

V^{n, i}

for

i \in {1, \dots, n}

.

3.1.2. Convergence of $(S^{n}, V^{n})$ to $(S, V)$

To prove that the solution of the aBergomi model

(S^{n}, V^{n})

converges to the solution of the rBergomi model

(S, V)

, we need to choose a suitable

K^{n} (τ) = \sum_{i = 1}^{n} α_{i}^{n} e^{- x_{i}^{n} τ}

to approximate

K (τ) = τ^{H - \frac{1}{2}}

. When

n \to + \infty

,

{(V^{n})}_{n \geq 1} \to V

(see Carmona et al. [34], Muravlev [35], Harms and Stefanovits [20]).

Theorem 7.

There exist weights

{(α_{i}^{n})}_{1 \leq i \leq n} > 0

, mean reversion speeds

{(x_{i}^{n})}_{1 \leq i \leq n} > 0

, and a constant C depending on H and T only such that

∥ K^{n} {- K ∥}_{2, T} \leq C n^{\frac{- 4 H}{5}},

where

{∥ \cdot ∥}_{2, T}

is the

L^{2} ([0, T], R)

norm. In particular,

‖ K^{n} {- K ‖}_{2, T} \to 0

when

n \to \infty

.

The proof of this theorem can be found in Appendix A.

Applying the previous computations and the Kolmogorov tightness criterion, we can get that the sequence

(S^{n}, V^{n})

is tight for the uniform topology and the limit satisfies the model (19).

3.2. Affine Structure of the aBergomi Model

In this section, we detail the affine property of the aBergomi model.

Theorem 8.

The process

V^{n}

(Equation (20)) has the following affine structure

E [V_{t}^{n} ∣ F_{s}] = ξ_{0} exp \{\frac{σ^{2}}{2} \sum_{i = 1}^{n} α_{i}^{n} (\frac{1}{x_{i}^{n}} - \frac{e^{- (t - s) x_{i}^{n}}}{x_{i}^{n}}) + \sum_{i = 1}^{n} V_{s}^{n, i} α_{i}^{n} e^{- (t - s) x_{i}^{n}}\} .

Proof.

Using Theorem 4, we have

\begin{matrix} E [V_{t}^{n} ∣ F_{s}] & = ξ_{0} exp \{\frac{σ^{2}}{2} \int_{0}^{t - s} (K^{n} (s))^{2} d s + \sum_{i = 1}^{n} V_{s}^{n, i} α_{i}^{n} e^{- (t - s) x_{i}^{n}}\} \\ = ξ_{0} exp \{\frac{σ^{2}}{2} \int_{0}^{t - s} (\sum_{i = 1}^{n} α_{i}^{n} e^{- s x_{i}^{n}}) d s + \sum_{i = 1}^{n} V_{s}^{n, i} α_{i}^{n} e^{- (t - s) x_{i}^{n}}\} \\ = ξ_{0} exp \{\frac{σ^{2}}{2} \sum_{i = 1}^{n} α_{i}^{n} (\frac{1}{x_{i}^{n}} - \frac{e^{- (t - s) x_{i}^{n}}}{x_{i}^{n}}) + \sum_{i = 1}^{n} V_{s}^{n, i} α_{i}^{n} e^{- (t - s) x_{i}^{n}}\} . \end{matrix}

Similarly, we can derive the affine structure of

S^{n}

by Theorem 5. □

Then, we describe the so-called hybrid scheme and introduce an algorithm to approximate the rBergomi model by the aBergomi model.

3.3. Hybrid Scheme for the rBergomi Model

Recalling Equation (3), the rough Bergomi model with time horizon

T > 0

under an equivalent martingale measure

P

can be written as:

\{\begin{matrix} d S_{t} & = S_{t} \sqrt{V_{t}} d W_{t}, \\ \frac{d ξ_{s}^{t}}{ξ_{s}^{t}} & = η \sqrt{2 α + 1} {(t - s)}^{α} d B_{s}, \end{matrix}

(21)

where

W, B

are two standard Brownian motions with correlation

ρ

. We recall from Assumption 1 that the forward variance curve

ξ_{0}^{t}

is flat for all

t \in [0, T]

:

ξ_{0}^{t} = ξ_{0} > 0

. Thus, the spot variance

V_{t}

in Equation (21) is given by

V_{t} = ξ_{0} exp (η \sqrt{2 α + 1} \int_{0}^{t} {(t - s)}^{α} d B_{s} - \frac{η^{2}}{2} t^{2 α + 1}) .

To simulate the Volterra-type integral

\tilde{X} = \sqrt{2 α + 1} \int_{0}^{t} {(t - s)}^{α} d B_{s}

, we apply the hybrid scheme proposed in Bennedsen et al. [21], which approximates the kernel function of the Brownian semi-stationary processes by a Wiener integral of the power function at

t = s

and a Riemann sum elsewhere. Let

(Ω, F, {(F_{t})}_{t \in R}, P)

be a filtered probability space which supports a standard Brownian motion

W = {(W_{t})}_{t \in R}

. We consider a Brownian semi-stationary process (Bss):

{\bar{X}}_{t} = \int_{- \infty}^{t} g (t - s) σ_{s} d W_{s} t \in R,

(22)

where

σ = {(σ_{t})}_{t \in R}

is an

{(F_{t})}_{t \in R}

-predictable process which captures the stochastic volatility of

\bar{X}

and

g : (0, \infty) \to [0, \infty)

is a Borel-measurable kernel function. We assume that

E [σ_{t}^{2}] < \infty

for all

t \in R

and the process is covariance-stationary, namely,

\begin{matrix} E [σ_{s}] & = E [σ_{t}], \\ cov (σ_{s}, σ_{t}) & = cov (σ_{0}, σ_{| s - t |}), s, t \in R . \end{matrix}

These assumptions imply that

\bar{X}

is covariance-stationary. However, the process

\bar{X}

need not be strictly stationary.

Assumption 3.

The assumptions regarding the kernel function g are as follows:

(A1): For some $α \in (- \frac{1}{2}, \frac{1}{2}) ∖ {0}$ ,

$g (x) = x^{α} L_{g} (x), x \in (0, 1],$

where $L_{g} : (0, 1] \to [0, \infty)$ is continuously differentiable, slowly varying at 0 and bounded away from 0. Moreover, there exists a constant $C > 0$ such that the derivative $L_{g}^{'}$ of $L_{g}$ satisfies

$| L_{g}^{'} (x) | \leq C (1 + \frac{1}{x}), x \in (0, 1] .$
(A2): The function g is continuously differentiable on $(0, \infty)$ , and the derivative $g^{'}$ is ultimately monotonic and satisfies $\int_{1}^{\infty} g^{'} {(x)}^{2} d x < \infty$ .
(A3): For some $β \in (- \infty, - \frac{1}{2})$ ,

$g (x) = O (x^{β}), x \to \infty .$

In order to implement the hybrid scheme to the rBergomi model, we need to introduce a particular class of non-stationary processes, namely, truncated Brownian semi-stationary (

tBss

) processes,

{\tilde{X}}_{t} = \int_{0}^{t} g (t - s) σ_{s} d W_{s} t \geq 0,

(23)

where the kernel function

g (t)

, the volatility process

σ_{s}

, and the driving Brownian motion

W_{s}

are as defined in the definition of

Bss

processes.

{\tilde{X}}_{t}

can also be seen as the truncated stochastic integral at 0 of the

Bss

process

{\bar{X}}_{t}

. Equation (23) is integrable since

g (t)

is differentiable on

(0, \infty)

.

Now, we can discretise Equation (23) in time. Let N be the total number of time-steps,

Δ t = T / N

be the time-step size, and

t_{0} = 0 \leq \dots \leq t_{j} = j Δ t \leq \dots \leq t_{N} = T

be a time grid on the interval

[0, T]

.

According to Bennedsen et al. [21], the observations

{\tilde{X}}_{t_{j}}^{N}, j = 0, 1, \dots, N

can be computed via (

κ = 1

case)

{\tilde{X}}_{t_{j}}^{N} = L_{g} (Δ t) σ_{j - 1}^{N} W_{j - 1, 1}^{N} + \sum_{k = 1}^{j} g (b_{k}^{*} Δ t) σ_{j - k}^{N} {\bar{W}}_{j - k}^{N}

(24)

using the random vectors

W_{j}^{N}, j = 0, 1, \dots, N - 1,

the random variables

σ_{j}^{N}, j = 0, 1, \dots, N - 1,

where

b_{k}^{*} = {(\frac{k^{α + 1} - {(k - 1)}^{α + 1}}{α + 1})}^{\frac{1}{α}}

, and the random vectors

{\bar{W}}_{i}^{N} ≜ \int_{\frac{i}{N}}^{\frac{i + 1}{N}} d W_{s}

(see Proposition 2.8 in Bennedsen et al. [21]). To simulate the Volterra process

\tilde{X}

, we use:

\{\begin{matrix} L_{g} & \equiv 1, \\ g (x) & \equiv x^{H - \frac{1}{2}}, \\ σ (\cdot) & \equiv \sqrt{2 α + 1} . \end{matrix}

Then,

\begin{matrix} W_{j - 1, 1}^{N} & = \int_{t_{j - 1}}^{t_{j}} {(t_{j} - s)}^{α} d W_{s} \approx {(\frac{Δ t}{2})}^{α} (W_{t_{j}} - W_{t_{j - 1}}) \\ {\bar{W}}_{j}^{N} & = \int_{t_{j}}^{t_{j + 1}} d W_{s} = W_{t_{j + 1}} - W_{t_{j}} \\ σ_{j}^{N} & = σ_{t_{j}} . \end{matrix}

The corresponding matrix representation takes the form of

[\begin{matrix} {\tilde{X}}_{t_{1}} \\ {\tilde{X}}_{t_{2}} \\ {\tilde{X}}_{t_{3}} \\ ⋮ \\ {\tilde{X}}_{t_{N}} \end{matrix}] = [\begin{matrix} W_{0, 1} & 0 & \dots & 0 & 0 \\ W_{1, 1} & g (b_{2}^{*} Δ t) {\bar{W}}_{0} & \dots & 0 & 0 \\ W_{2, 1} & g (b_{2}^{*} Δ t) {\bar{W}}_{1} & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ W_{N - 1, 1} & g (b_{2}^{*} Δ t) {\bar{W}}_{N - 2} & \dots & g (b_{N - 1}^{*} Δ t) {\bar{W}}_{1} & g (b_{N}^{*} Δ t) {\bar{W}}_{0} \end{matrix}] [\begin{matrix} σ_{t_{1}} \\ σ_{t_{2}} \\ σ_{t_{3}} \\ ⋮ \\ σ_{t_{N}} \end{matrix}] .

(25)

In the rBergomi model,

σ_{t_{i}} = σ

is a constant for

i = 1, 2, . . ., N

defined in Equation (13). When simulating

{\tilde{X}}_{t_{i}}

, we need to perform a matrix multiplication, the computational complexity of which is of order

O (N^{2})

when using the conventional matrix multiplication algorithm. However, multiplying a lower triangular Toeplitz matrix can be regarded as a discrete convolution which can be evaluated efficiently by fast Fourier transform. Therefore, the computational complexity can be reduced to

O (N log N)

. The algorithm to simulate the Volterra process

\tilde{X}

is described in Algorithm 1 below. Then, we can use a standard Euler scheme to simulate the price

(S_{t_{1}}, S_{t_{2}}, \dots, S_{t_{N}})

, as shown in Algorithm 2.

Algorithm 1: Volterra process

\tilde{X}

Table 1 reports the parameters used for our numerical experiments, which are the same as in Bayer et al. [1] and Bennedsen et al. [21]. Recall from the definition of

α

that the chosen value

α = - 0.43

corresponds to the Hurst exponent

H = 0.07

. Such small values of H are indeed consistent with empirical experiments, and one can refer to the recent works Forde et al. [36] and Gerhold [37] about the behaviour of the rBergomi model for small H.

Algorithm 2: Rough Bergomi model

3.4. Markovian Scheme for the aBergomi Model

For the sake of simplicity, we start by deriving the approximation of the rBergomi model by a Bergomi model with two terms. The same approach can be used when the number of terms is greater than two. The two-term Bergomi model (4) that we used to approximate the rBergomi model is given by

\{\begin{matrix} d S_{t} & = S_{t} \sqrt{V_{t}} d W_{t}, \\ d ξ_{s}^{t} & = η ξ_{s}^{t} (α_{1} e^{- κ_{1} (t - s)} + α_{2} e^{- κ_{2} (t - s)}) d B_{s}, \end{matrix}

(26)

where

s \in [0, t)

. Here, we introduce the process

y_{s}^{t}

defined as

\{\begin{matrix} y_{s}^{t} & = α_{1} e^{- κ_{1} (t - s)} Y_{s}^{1} + α_{2} e^{- κ_{2} (t - s)} Y_{s}^{2}, \\ d Y_{s}^{1} & = - κ_{1} Y_{s}^{1} d s + d B_{s} Y_{0}^{1} = 0, \\ d Y_{s}^{2} & = - κ_{2} Y_{s}^{2} d s + d B_{s} Y_{0}^{2} = 0, \end{matrix}

(27)

where the two parameters

κ_{1}

and

κ_{2}

come from the exponential kernel

K_{\exp}

, and

Y_{s}^{1}

and

Y_{s}^{2}

are two OU processes. Hence, the process

y_{s}^{t}

can be written as a driftless Gaussian process as follows:

d y_{s}^{t} = α_{1} e^{- κ_{1} (t - s)} d B_{s} + α_{2} e^{- κ_{2} (t - s)} d B_{s},

and its quadratic variation is given by

{〈 d y^{t}, d y^{t} 〉}_{s} = ς^{2} (t - s) d s

where

ς (u) = \sqrt{α_{1}^{2} e^{- 2 κ_{1} u} + α_{2}^{2} e^{- 2 κ_{2} u} + 2 α_{1} α_{2} e^{- (κ_{1} + κ_{2}) u}}

. The forward variation process

ξ_{s}^{t}

can be written as

d ξ_{s}^{t} = η_{s}^{t} d y_{s}^{t}

. Thus, the solution of the forward variation process is

ξ_{s}^{t} = ξ_{0} f^{t} (s, y_{s}^{t})

, where

f^{t} (s, y) = exp (η y - \frac{η^{2}}{2} χ (s, t))

and

\begin{matrix} χ (s, t) & = & \int_{t - s}^{t} ς^{2} (u) d u \\ = & \int_{t - s}^{t} α_{1}^{2} e^{- 2 κ_{1} u} + α_{2}^{2} e^{- 2 κ_{2} u} + 2 α_{1} α_{2} e^{- (κ_{1} + κ_{2}) u} d u \\ = & α_{1}^{2} e^{- κ_{1} (t - s)} \frac{1 - e^{- 2 κ_{1} s}}{2 κ_{1}} + α_{2}^{2} e^{- 2 κ_{2} (t - s)} \frac{1 - e^{- 2 κ_{2} s}}{2 κ_{2}} + 2 α_{1} α_{2} e^{- (κ_{1} + κ_{2}) (t - s)} \frac{1 - e^{- (κ_{1} + κ_{2}) s}}{κ_{1} + κ_{2}} \end{matrix} .

(28)

Recall that

V_{t} = ξ_{t}^{t} = ξ_{0} exp (η y_{t}^{t} - \frac{η^{2}}{2} χ (t, t))

and

χ (t, t) \underset{s \to t}{≃} t^{2 α + 1}

when

s \to t

when the number of terms n is large enough.

Using the approximation by the Bergomi model, we consider the parameters

{\{α_{i}, κ_{i}\}}_{(i = 1, 2, \dots, n)}

in the exponential kernel

K_{\exp} = \sum_{i = 1}^{n} α_{i} e^{- κ_{i} (t - s)}

on

s \in [0, t)

. Note that when

s \to t

, the power kernel

K_{pow} \to \infty

while

K_{\exp}

is finite. To compute the approximation numerically, we need to truncate the kernel

K_{\exp}

. To do so, we can use the

scipy . optimize

module in

Python

or the

nlinfit

function in

MATLAB

for the nonlinear regression of the parameters

{\{α_{i}, κ_{i}\}}_{(i = 1, 2, \dots, n)}

and the simulated price

\{S_{t}\}

. We exemplify the truncation of

K_{\exp}

by letting

s \in [0, T - Δ t]

, the truncated parameter

θ = T - \frac{T}{N} = T - Δ t

, and let

T = 1

.

We define the integral

I_{trunc}

on the truncated region

[0, θ t)

and apply the scaling property of Brownian motion as follows:

I_{trunc} = \sum_{i = 1}^{n} α_{i} \int_{0}^{\frac{θ t}{T}} e^{- κ_{i} (t - s)} d B_{s} = \sum_{i = 1}^{n} α_{i} \sqrt{\frac{θ}{T}} \int_{0}^{t} e^{- κ_{i} (1 - \frac{θ}{T}) s} d B_{s} .

After scaling

B_{s}

, the process

y_{s}

has to remain driftless Gaussian and satisfy

y_{s} = \sum_{i = 1}^{n} α_{i} e^{- κ_{i} (1 - \frac{θ}{T}) s} Y_{s}^{i}

, where

d Y_{s}^{i} = κ_{i} (1 - \frac{θ}{T}) Y_{s}^{i} d s + d B_{s}, Y_{0}^{i} = 0

. Then, the process

y_{s}

can be written as

d y_{s} = \sum_{i = 1}^{n} α_{i} e^{- κ_{i} (1 - \frac{θ}{T}) s} d B_{s}

. Thus, the kernel in the rBergomi model on

[0, \frac{θ}{T} t)

can be approximated by

I_{trunc} = \sqrt{\frac{θ}{T}} y_{t}

.

In view of Equations (26) and (27) and the derivations in this subsection, a simple Monte Carlo simulation scheme for the n-term aBergomi model is given by Algorithm 3. In practice, the truncation of the rBergomi power kernel means that, as is the case for the Riemann-sum scheme of Bennedsen et al. [21], this scheme is able to capture the shape of the implied volatility smile, but not its level. A multiplication factor is used in Algorithm 3 for each time-step to correct for this phenomenon. In practice, these factors can be estimated using another calibrated scheme, or more simply, from quoted option prices.

Algorithm 3: n-term aBergomi model when

T = 1

4. Simulation Results

In this section, we compare the simulated volatilities of the rBergomi and aBergomi models. To demonstrate the approximation’s accuracy and efficiency, we investigate the Mean Absolute Error (MAE) of simulated results for different number of terms and number of time-steps in numerical tests.

Figure 1 displays the power kernel

K_{pow}

in the rBergomi model and the

K_{\exp}

kernel of the 20-term aBergomi model with

T = 1

and

N = 100

. This figure suggests that this

K_{\exp}

obtained by nonlinear regression is sufficiently accurate, with a MAE of

4.05806 \times 10^{- 6}

.

The volatility smiles in Figure 2 are obtained by simulating the rBergomi model as described in Section 3.3, and the aBergomi model as described in Section 3.4 using the multiplication factors reported in Table 2. From Figure 2, we note that the at-the-money calibration is better with 50 time-steps at the cost of a worse out-of-the-money calibration. Meanwhile, 100 time-steps can approximate the rBergomi model better than 50 time-steps for almost all strikes.

We compute the MAE of the implied volatility approximation with different numbers of terms in the aBergomi model and different time-steps in Figure 3, and compare the pricing speed in Table 3. As expected, the higher the number of terms in the aBergomi model, the lower the MAE for all time-steps, but the difference between the models decreases when the number of time-steps decreases. Another expected result is that the computational time increases with both the number of terms and time-steps. The number of terms and time-step combinations provide a good trade-off between speed and accuracy, such as the 20-term aBergomi model with 100 time-steps and 20,000 Monte Carlo paths.

5. Conclusions

In this paper, we proved the power-law behavior of the ATM volatility skew as time to maturity goes to zero of the rough Bergomi model (rBergomi), and proposed an approximate Bergomi (aBergomi) model with a finite number of forward variance terms to approximate the rBergomi model. The approximation enables the adoption of classical pricing methods, while keeping the fractional feature of the model. We theoretically prove the convergence of the aBergomi model towards the rBergomi model when the number of terms is large enough, and verify this convergence numerically. We numerically compared the fast hybrid scheme for the rBergomi model to the Euler scheme for the aBergomi model. The numerical simulation results illustrate the accuracy and efficiency of the approximation. The parameters of the aBergomi model are numerically obtained by nonlinear regression on the power-law kernel of the rBergomi model. Other alternative calibration and truncation methods are worth investigating for future research, as well as further comparisons on more complex options.

Author Contributions

Conceptualization, Q.Z. and G.L.; Formal analysis, Q.Z. and W.C.; Investigation, Q.Z.; Methodology, Q.Z. and G.L.; Software, Q.Z. and N.L.; Supervision, G.L. and W.C.; Validation, Q.Z., G.L., W.C. and N.L.; Visualization, Q.Z.; Writing—original draft, Q.Z.; Writing—review & editing, Q.Z., G.L., W.C. and N.L. All authors have read and agreed to the published version of the manuscript.

Funding

The Centre for Quantitative Finance and Investment Strategies has been supported by BNP Paribas.

Acknowledgments

The authors thank the three anonymous reviewers for their useful comments which helped us to improve the article significantly.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Proof of Theorem 7

This subsection is devoted to the proof of Theorem 7.

Proof.

Let

(p_{i}^{n})_{0 \leq i \leq n}

be auxiliary mean reversion speeds such that

p_{i - 1}^{n} \leq x_{i}^{n} \leq p_{i}^{n}

for

i \leq {1, \dots, n}

and

p_{0}^{n} = 0

. Recall that

K (τ) = \int_{0}^{\infty} e^{- x τ} μ (d x)

. We have

\begin{matrix} ∥ K^{n} {- K ∥}_{2, T} & = {∥\sum_{i = 1}^{n} α_{i}^{n} e^{- x_{i}^{n} τ} - \int_{0}^{\infty} e^{- x τ} μ (d x)∥}_{2, T} \\ \leq \int_{0}^{\infty} {∥e^{- x (\cdot)}∥}_{2, T} μ (d x) + \sum_{i = 1}^{n} {∥α_{i}^{n} e^{- x_{i}^{n} (\cdot)} - \int_{p_{i - 1}^{n}}^{p_{i}^{n}} e^{- x (\cdot)} μ (d x)∥}_{2, T} . \end{matrix}

(A1)

The first term on the RHS of the inequality (A1) can be estimated as below:

\int_{p_{n}^{n}}^{\infty} {∥e^{- x (\cdot)}∥}_{2, T} μ (d x) = \int_{p_{n}^{n}}^{\infty} \sqrt{\frac{1 - e^{- 2 x T}}{2 x}} μ (d x) \leq \frac{(p_{n}^{n})^{- H}}{\sqrt{2} H Γ (\frac{1}{2} - H)} .

For the second term, applying a second-order Taylor expansion of the exponential function

e^{x} = 1 + x + \frac{x^{2}}{2} + \int_{0}^{x} \frac{{(x - u)}^{3}}{6} d u

for

t \in [0, T]

, choosing

α_{i}^{n} = \int_{p_{i - 1}^{n}}^{p_{i}^{n}} μ (d x)

and

x_{i}^{n} = {(\frac{\int_{p_{i - 1}^{n}}^{p_{i}^{n}} x^{4} μ (d x)}{\int_{p_{i - 1}^{n}}^{p_{i}^{n}} μ (d x)})}^{\frac{1}{4}}

, yields

\begin{matrix} \begin{matrix} |α_{i}^{n} e^{- x_{i}^{n} t} - \int_{p_{i - 1}^{n}}^{p_{i}^{n}} e^{- x t} μ (d x)| = & |α_{i}^{n} (1 + (- x_{i}^{n} t) + \frac{{(- x_{i}^{n} t)}^{2}}{2}) - \int_{p_{i - 1}^{n}}^{p_{i}^{n}} (1 + (- x t) + \frac{{(- x t)}^{2}}{2}) μ (d x)| \\ + |α_{i}^{n} (\int_{0}^{x_{i}^{n} t} \frac{{(x_{i}^{n} t - u)}^{3}}{6} d u) - \int_{p_{i - 1}^{n}}^{p_{i}^{n}} \int_{0}^{x t} \frac{{(x t - u)}^{3}}{6} d u μ (d x)| \\ = & \int_{p_{i - 1}^{n}}^{p_{i}^{n}} (x t - x_{i}^{n} t) + \frac{{(- x_{i}^{n} t)}^{2} - {(- x t)}^{2}}{2} μ (d x) \\ \leq & \frac{t^{2}}{2} \int_{p_{i - 1}^{n}}^{p_{i}^{n}} {(x - x_{i}^{n})}^{2} μ (d x) \end{matrix} \end{matrix}

since

\begin{matrix} \int_{p_{i - 1}^{n}}^{p_{i}^{n}} \{\int_{0}^{x_{i}^{n} t} \frac{{(x_{i}^{n} t - u)}^{3}}{6} d u - \int_{0}^{x t} \frac{{(x t - u)}^{3}}{6} d u\} μ (d x) \\ = & \int_{p_{i - 1}^{n}}^{p_{i}^{n}} \{x_{i}^{n} t \int_{0}^{1} \frac{{(x_{i}^{n} t - x_{i}^{n} t s)}^{3}}{6} d s - x t \int_{0}^{1} \frac{{(x t - x t s)}^{3}}{6} d s\} μ (d x), s = \frac{u}{x t} \\ = & \int_{p_{i - 1}^{n}}^{p_{i}^{n}} \{{(x_{i}^{n} t)}^{4} \int_{0}^{1} \frac{{(1 - s)}^{3}}{6} d s - {(x t)}^{4} \int_{0}^{1} \frac{{(1 - s)}^{3}}{6} d s\} μ (d x) \\ = & \{t^{4} \int_{0}^{1} \frac{{(1 - s)}^{3}}{6} d s\} \int_{p_{i - 1}^{n}}^{p_{i}^{n}} \{{(x_{i}^{n})}^{4} - {(x)}^{4}\} μ (d x) \\ = & \{t^{4} \int_{0}^{1} \frac{{(1 - s)}^{3}}{6} d s\} \int_{p_{i - 1}^{n}}^{p_{i}^{n}} \{{(\frac{\int_{p_{i - 1}^{n}}^{p_{i}^{n}} x μ (d x)}{\int_{p_{i - 1}^{n}}^{p_{i}^{n}} μ (d x)})}^{4} - {(x)}^{4}\} μ (d x) \\ = & 0 . \end{matrix}

Hence,

\sum_{i = 1}^{n} {∥α_{i}^{n} e^{- x_{i}^{n} (\cdot)} - \int_{p_{i - 1}^{n}}^{p_{i}^{n}} e^{- x (\cdot)} μ (d x)∥}_{2, T} \leq \frac{T^{\frac{5}{2}}}{2 \sqrt{5}} \sum_{i = 1}^{n} \int_{p_{i - 1}^{n}}^{p_{i}^{n}} (x - x_{i}^{n})^{2} μ (d x) .

Thus, the convergence of

K^{n}

depends on the weights

α_{i}

and mean reversions

x_{i}

. Let

p_{i}^{n} = i π_{n}

for each

i \in {1, \dots, n}

and

π_{n} > 0

. We have

\begin{matrix} \sum_{i = 1}^{n} \int_{p_{i - 1}^{n}}^{p_{i}^{n}} {(x - x_{i}^{n})}^{2} μ (d x) & \leq π_{n}^{2} \int_{0}^{p_{n}^{n}} μ (d x) = \frac{π_{n}^{\frac{5}{2} - H} n^{\frac{1}{2} - H}}{(\frac{1}{2} - H) Γ (\frac{1}{2} - H)} \end{matrix}

We can also proceed to get the explicit expressions of

α_{i}^{n}

and

x_{i}^{n}

as follows:

\begin{matrix} α_{i}^{n} & = \frac{{(i π_{n})}^{\frac{1}{2} - H} - {[(i - 1) π_{n}]}^{\frac{1}{2} - H}}{(\frac{1}{2} - H) Γ (\frac{1}{2} - H)}, & x_{i}^{n} = \frac{1 - 2 H}{3 - 2 H} \cdot \frac{{(i π_{n})}^{\frac{3}{2} - H} - {[(i - 1) π_{n}]}^{\frac{3}{2} - H}}{{(i π_{n})}^{\frac{1}{2} - H} - {[(i - 1) π_{n}]}^{\frac{1}{2} - H}} . \end{matrix}

Since

p_{n}^{n} = n π_{n} \to \infty

, we have

π_{n}^{\frac{5}{2} - H} n^{\frac{1}{2} - H} \to 0

as

n \to + \infty

when

π_{n} < n^{- \frac{1}{6}}

,

\begin{matrix} {∥K^{n} - K∥}_{2, T} & \leq & \frac{1}{\sqrt{2} H Γ (\frac{1}{2} - H)} [{(p_{n}^{n})}^{- H} + \frac{T^{\frac{5}{2}} H}{\sqrt{10} (\frac{1}{2} - H)} {(p_{n}^{n})}^{\frac{1}{2} - H} π_{n}^{2}] \\ = & \frac{1}{\sqrt{2} H Γ (\frac{1}{2} - H)} [n^{- H} π_{n}^{- H} + \frac{T^{\frac{5}{2}} H}{\sqrt{10} (\frac{1}{2} - H)} n^{\frac{1}{2} - H} π_{n}^{\frac{5}{2} - H}] \\ = & a x^{- H} + b x^{\frac{5}{2} - H} \end{matrix}

(A2)

Let

x = π_{n}

,

y = a x^{- H} + b x^{\frac{5}{2} - H}

and

y^{^{'}} = - a H x^{- H - 1} + b (\frac{5}{2} - H) x^{\frac{3}{2} - H} = 0

; solving for x, we obtain

x^{\frac{2}{5}} = \frac{a H}{b (\frac{5}{2} - H)}

, where

a = n^{- H}

and

b = \frac{T^{\frac{5}{2}} H}{\sqrt{10} (\frac{1}{2} - H)} n^{\frac{1}{2} - H}

\begin{matrix} x & = & π_{n} = {[\frac{n^{- H} H \sqrt{10} (\frac{1}{2} - H)}{T^{\frac{5}{2}} H n^{\frac{1}{2} - H} (\frac{5}{2} - H)}]}^{\frac{2}{5}} = {[\frac{n^{- \frac{1}{2}} \sqrt{10} (\frac{1}{2} - H)}{T^{\frac{5}{2}} (\frac{5}{2} - H)}]}^{\frac{2}{5}} = \frac{n^{- \frac{1}{5}}}{T} {[\frac{\sqrt{10} (\frac{1}{2} - H)}{(\frac{5}{2} - H)}]}^{\frac{2}{5}} \end{matrix}

When

π_{n} = \frac{n^{- \frac{1}{5}}}{T} {[\frac{\sqrt{10} (\frac{1}{2} - H)}{(\frac{5}{2} - H)}]}^{\frac{2}{5}}

, the RHS of Equation (A2) attains its minimum and

∥ K^{n} {- K ∥}_{2, T} \leq C n^{\frac{- 4 H}{5}}

where

C = \frac{1}{\sqrt{2} H Γ (\frac{1}{2} - H)} T^{H} {[\frac{\sqrt{10} (\frac{1}{2} - H)}{\frac{5}{2} - H}]}^{- \frac{5}{2} H} \frac{\frac{5}{2}}{\frac{5}{2} - H}

is a constant. □

References

Bayer, C.; Friz, P.; Gatheral, J. Pricing under rough volatility. Quant. Financ. 2016, 16, 887–904. [Google Scholar] [CrossRef]
Forde, M.; Zhang, H. Asymptotics for rough stochastic volatility models. SIAM J. Financ. Math. 1993, 8, 114–145. [Google Scholar] [CrossRef] [Green Version]
Fukasawa, M. Short-time at-the-money skew and rough fractional volatility. Quant. Financ. 2017, 17, 189–198. [Google Scholar] [CrossRef] [Green Version]
Gatheral, J.; Jaisson, T.; Rosenbaum, M. Volatility is rough. Quant. Financ. 2018, 18, 933–949. [Google Scholar] [CrossRef]
Jusselin, P.; Rosenbaum, M. No-arbitrage implies power-law market impact and rough volatility. Math. Financ. 2020, 30, 1309–1336. [Google Scholar] [CrossRef] [Green Version]
Bayer, C.; Ben Hammouda, C.; Tempone, R. Hierarchical adaptive sparse grids and quasi-Monte Carlo for option pricing under the rough Bergomi model. Quant. Financ. 2020, 20, 1457–1473. [Google Scholar] [CrossRef] [Green Version]
Jacquier, A.; Martini, C.; Muguruza, A. On VIX futures in the rough Bergomi model. Quant. Financ. 2018, 18, 45–61. [Google Scholar] [CrossRef]
McCrickerd, R.; Pakkanen, M. Turbocharging Monte Carlo pricing for the rough Bergomi model. Quant. Financ. 2018, 18, 1877–1886. [Google Scholar] [CrossRef] [Green Version]
El Euch, O.; Fukasawa, M.; Gatheral, J.; Rosenbaum, M. Short-term at-the-money asymptotics under stochastic volatility models. SIAM J. Financ. Math. 2019, 10, 491–511. [Google Scholar] [CrossRef] [Green Version]
Bayer, C.; Friz, P.; Gulisashvili, A.; Horvath, B.; Stemper, B. Short-time near-the-money skew in rough fractional volatility models. Quant. Financ. 2019, 19, 779–798. [Google Scholar] [CrossRef]
Friz, P.K.; Gassiat, P.; Pigato, P. Short dated smile under rough volatility: Asymptotics and numerics. arXiv 2020, arXiv:2009.08814. [Google Scholar]
Tomas, A. Pricing of Asian Options in the Rough Bergomi Model. Ph.D Thesis, Technische Universität Wien, Wien, Austria, 2018. [Google Scholar]
Bayer, C.; Tempone, R.; Wolfers, S. Pricing American options by exercise rate optimization. Quant. Financ. 2020, 20, 1749–1760. [Google Scholar] [CrossRef]
Bayer, C.; Qiu, J.; Yao, Y. Pricing options under rough volatility with backward SPDEs. arXiv 2020, arXiv:2008.01241. [Google Scholar]
Bayer, C.; Horvath, B.; Muguruza, A.; Stemper, B.; Tomas, M. On deep calibration of (rough) stochastic volatility models. arXiv 2019, arXiv:1908.08806. [Google Scholar]
Zeron, M.; Ruiz, I. Tensoring volatility calibration Calibration of the rough Bergomi volatility model via Chebyshev Tensors. arXiv 2020, arXiv:2012.07440. [Google Scholar]
Horvath, B.; Muguruza, A.; Tomas, M. Deep learning volatility: A deep neural network perspective on pricing and calibration in (rough) volatility models. Quant. Financ. 2021, 21, 11–27. [Google Scholar] [CrossRef]
Abi Jaber, E.; El Euch, O. Multifactor approximation of rough volatility models. SIAM J. Financ. Math. 2019, 10, 309–349. [Google Scholar] [CrossRef] [Green Version]
Gatheral, J.; Keller-Ressel, M. Affine forward variance models. Financ. Stochastics 2019, 23, 501–533. [Google Scholar] [CrossRef] [Green Version]
Harms, P.; Stefanovits, D. Affine representations of fractional processes with applications in mathematical finance. Stoch. Process. Their Appl. 2019, 129, 1185–1228. [Google Scholar] [CrossRef] [Green Version]
Bennedsen, M.; Lunde, A.; Pakkanen, M. Hybrid scheme for Brownian semistationary processes. Financ. Stochastics 2017, 21, 931–965. [Google Scholar] [CrossRef] [Green Version]
Carr, P.; Itkin, A. ADOL: Markovian approximation of a rough lognormal model. Risk Mag. 2019, 32. Available online: https://www.risk.net/cutting-edge/banking/7209816/adol-markovian-approximation-of-a-rough-lognormal-model (accessed on 21 February 2021).
Sepp, A. Log-Normal Stochastic Volatility Model: Affine Decomposition of Moment Generating Function and Pricing of Vanilla Options. 2016. Available online: https://dx.doi.org/10.2139/ssrn.2522425 (accessed on 21 February 2021).
Langrené, N.; Lee, G.; Zhu, Z. Switching to nonaffine stochastic volatility: A closed-form expansion for the Inverse Gamma model. Int. J. Theor. Appl. Financ. 2016, 19, 1–37. [Google Scholar] [CrossRef] [Green Version]
Dobrić, V.; Ojeda, F. Fractional Brownian fields, duality, and martingales. In High Dimensional Probability; Institute of Mathematical Statistics Lecture Notes—Monograph Series; Institute of Mathematical Statistics: Beachwood, OH, USA, 2006; Volume 51, pp. 77–95. [Google Scholar]
Benth, F.E.; Eyjolfsson, H.; Veraart, A. Approximating Lévy semistationary processes via Fourier methods in the context of power markets. SIAM J. Financ. Math. 2014, 5, 71–98. [Google Scholar] [CrossRef] [Green Version]
Alòs, E.; León, J.; Vives, J. On the short-time behavior of the implied volatility for jump-diffusion models with stochastic volatility. Financ. Stochastics 2007, 11, 571–589. [Google Scholar] [CrossRef] [Green Version]
Bergomi, L. Smile dynamics II. Risk Mag. 2005, 18. Available online: https://www.risk.net/derivatives/equity-derivatives/1500225/smile-dynamics-ii (accessed on 21 February 2021). [CrossRef]
Bergomi, L. Smile dynamics IV. Risk Mag. 2009, 22. Available online: https://www.risk.net/derivatives/equity-derivatives/1564129/smile-dynamics-iv (accessed on 21 February 2021). [CrossRef]
Bergomi, L.; Guyon, J. Stochastic volatility’s orderly smiles. Risk Mag. 2012, 25. Available online: https://www.risk.net/derivatives/2171452/stochastic-volatilitys-orderly-smiles (accessed on 21 February 2021).
Fukasawa, M. Asymptotic analysis for stochastic volatility: Martingale expansion. Financ. Stochastics 2011, 15, 635–654. [Google Scholar] [CrossRef] [Green Version]
Protter, P. Stochastic differential equations. In Stochastic Integration and Differential Equations; Springer: Berlin/Heidelberg, Germany, 2005; pp. 249–361. [Google Scholar]
Øksendal, B.; Zhang, T.-S. The stochastic Volterra equation. Barcelona Seminar on Stochastic Analysis; Birkhäuser: Basel, Switzerland, 1993; pp. 168–202. [Google Scholar]
Carmona, P.; Coutin, L.; Montseny, G. Approximation of some Gaussian processes. Stat. Inference Stoch. Process. 2000, 3, 161–171. [Google Scholar] [CrossRef]
Muravlev, A. Representation of a fractional Brownian motion in terms of an infinite-dimensional Ornstein-Uhlenbeck process. Russ. Math. Surv. 2011, 66, 439–441. [Google Scholar] [CrossRef]
Forde, M.; Fukasawa, M.; Gerhold, S.; Smith, B. The Rough Bergomi Model as H→0—Skew Flattening/Blow up and Non-Gaussian Rough Volatility; King’s College London Working Paper; King’s College: London, UK, 2020. [Google Scholar]
Gerhold, S. Asymptotic analysis of a double integral occurring in the rough Bergomi model. Math. Commun. 2020, 25, 171–184. [Google Scholar]

Figure 1. The power kernel

K_{pow}

in the rBergomi model and the exponential

K_{\exp}

in the 20-term aBergomi model when

T = 1

and

N = 100

.

Figure 1. The power kernel

K_{pow}

in the rBergomi model and the exponential

K_{\exp}

in the 20-term aBergomi model when

T = 1

and

N = 100

.

Figure 2. Volatility smiles for rBergomi and 20-term aBergomi models with

T = 1

using 20,000 Monte Carlo paths.

Figure 2. Volatility smiles for rBergomi and 20-term aBergomi models with

T = 1

using 20,000 Monte Carlo paths.

Figure 3. MAE of the implied volatility smiles of the aBergomi model with respect to the number of time-steps, for four different numbers of terms (10, 15, 20, and 25), using 20,000 Monte Carlo paths.

Table 1. Parameters in the rBergomi model.

$ξ_{0}$	$η$	$α$
0.026	1.9	–0.43

Table 2. Square of multiplication factors for different steps.

Time-Steps	Square of Multiplication Factors
50	0.750324
100	0.550448
150	0.485093
200	0.450392

Table 3. Runtime (in s) of the rBergomi model and the aBergomi model for different time-steps with

T = 1

and 20,000 Monte Carlo paths.

Table 3. Runtime (in s) of the rBergomi model and the aBergomi model for different time-steps with

T = 1

and 20,000 Monte Carlo paths.

Time-Steps	rBergomi	10-Term aBergomi	15-Term aBergomi	20-Term aBergomi	25-Term aBergomi
50	0.4081	0.0994	0.1392	0.1721	0.2164
100	0.5001	0.2369	0.3130	0.4024	0.4543
150	0.5602	0.3650	0.4499	0.5589	0.6869
200	0.5861	0.4257	0.5908	0.7258	0.8727

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhu, Q.; Loeper, G.; Chen, W.; Langrené, N. Markovian Approximation of the Rough Bergomi Model for Monte Carlo Option Pricing. Mathematics 2021, 9, 528. https://doi.org/10.3390/math9050528

AMA Style

Zhu Q, Loeper G, Chen W, Langrené N. Markovian Approximation of the Rough Bergomi Model for Monte Carlo Option Pricing. Mathematics. 2021; 9(5):528. https://doi.org/10.3390/math9050528

Chicago/Turabian Style

Zhu, Qinwen, Grégoire Loeper, Wen Chen, and Nicolas Langrené. 2021. "Markovian Approximation of the Rough Bergomi Model for Monte Carlo Option Pricing" Mathematics 9, no. 5: 528. https://doi.org/10.3390/math9050528

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Markovian Approximation of the Rough Bergomi Model for Monte Carlo Option Pricing

Abstract

1. Introduction

2. Rough Bergomi Skew and Quasi-Affine Structure

2.1. ATM Volatility Skew

2.1.1. ATM Volatility Skew in the rBergomi Model

2.1.2. ATM Volatility Skew in the Two-Factor Bergomi Model

2.2. Markovian Representation of the Rough Bergomi Model

2.2.1. Volterra-Type Integral as a Functional of a Markov Process

2.2.2. Quasi-Affine Structure in the rBergomi Model

3. Rough Bergomi Approximation and Monte Carlo Schemes

3.1. Approximation of the Rough Bergomi Model by an n-Term Bergomi Model

3.1.1. Existence and Uniqueness of $(S^{n}, V^{n})$

3.1.2. Convergence of $(S^{n}, V^{n})$ to $(S, V)$

3.2. Affine Structure of the aBergomi Model

3.3. Hybrid Scheme for the rBergomi Model

3.4. Markovian Scheme for the aBergomi Model

4. Simulation Results

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A

Appendix A.1. Proof of Theorem 7

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Markovian Approximation of the Rough Bergomi Model for Monte Carlo Option Pricing

Abstract

1. Introduction

2. Rough Bergomi Skew and Quasi-Affine Structure

2.1. ATM Volatility Skew

2.1.1. ATM Volatility Skew in the rBergomi Model

2.1.2. ATM Volatility Skew in the Two-Factor Bergomi Model

2.2. Markovian Representation of the Rough Bergomi Model

2.2.1. Volterra-Type Integral as a Functional of a Markov Process

2.2.2. Quasi-Affine Structure in the rBergomi Model

3. Rough Bergomi Approximation and Monte Carlo Schemes

3.1. Approximation of the Rough Bergomi Model by an n-Term Bergomi Model

3.1.1. Existence and Uniqueness of ( S n , V n )

3.1.2. Convergence of ( S n , V n ) to ( S , V )

3.2. Affine Structure of the aBergomi Model

3.3. Hybrid Scheme for the rBergomi Model

3.4. Markovian Scheme for the aBergomi Model

4. Simulation Results

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A

Appendix A.1. Proof of Theorem 7

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.1.1. Existence and Uniqueness of $(S^{n}, V^{n})$

3.1.2. Convergence of $(S^{n}, V^{n})$ to $(S, V)$