1. Introduction
The rough Bergomi (rBergomi) model introduced by Bayer et al. [
1] has gained acceptance for stochastic volatility modelling due to its power-law at-the-money (ATM) volatility skew, which is consistent with empirical studies (see Forde and Zhang [
2], Fukasawa [
3], Gatheral et al. [
4]) and with the effect of the no-arbitrage assumption on the market impact function (see Jusselin and Rosenbaum [
5]). However, the stochastic process which characterizes this volatility model is rougher than that of a Brownian motion; in particular, the lack of Markovianity makes classical pricing methods infeasible.
In order to price options under an rBergomi model, Bayer et al. [
6] proposed hierarchical adaptive sparse grids, Jacquier et al. [
7] developed pricing algorithms for VIX futures and options, and McCrickerd and Pakkanen [
8] developed a “turbocharged” Monte Carlo pricing method. A number of short-term approximations have been proposed to obtain fast approximations for short maturities—see, for example, Fukasawa [
3], El Euch et al. [
9], Bayer et al. [
10], and Friz et al. [
11]. Regarding the pricing of exotic options in the rBergomi model, Tomas [
12] considered the pricing of Asian options, and Bayer et al. [
13] and Bayer et al. [
14] considered the pricing of American put options. Besides pricing, the calibration of the rBergomi model is also a challenge, for which Bayer et al. [
15], Zeron and Ruiz [
16], and Horvath et al. [
17] propose to use deep learning methods. In spite of this number of recent efforts, the inherent challenges brought by the rBergomi model still prevent its widespread adoption in the industry.
Inspired by the technique by Abi Jaber and El Euch [
18], Gatheral and Keller-Ressel [
19], and Harms and Stefanovits [
20], in which the authors designed a multi-factor stochastic volatility model with Markovian structure to approximate the rough Heston model, we establish an analogous multi-factor affine structure for the rBergomi model. Indeed, the Volterra kernel of the rBergomi model corresponds to a superposition of infinitely many Ornstein-Uhlenbeck (OU) processes with different speeds of mean reversion. Truncating this infinite sum into a finite sum of OU processes yields an approximation of the rBergomi model which is a classical Markovian multi-factor Bergomi model. We refer to this affine, Markovian approximation of the rBergomi model as the aBergomi model. We prove the existence and uniqueness of the solution to this aBergomi model, and show that its affine structure converges to the one of the rBergomi model. Finally, we implement a Monte Carlo scheme for the aBergomi model, and compare it to the hybrid scheme of the rBergomi model (Bennedsen et al. [
21]). Our numerical tests demonstrate that using 20 exponential terms in the aBergomi kernel is sufficient to obtain accurate implied volatility curvatures while remaining computationally efficient.
The idea to interpret the conventional two-factor Bergomi model as a Markovian approximation of the rBergomi model was originally briefly suggested by Bayer et al. [
1] (p. 892). Our work explores and expands upon this intuition by testing the number of factors to use in the Bergomi model and establishing their respective parameters for best approximation of the rBergomi model. For comprehensiveness, one can mention the alternative Markovian approximation proposed in Carr and Itkin [
22] of the rough volatility version of the mean-reverting lognormal volatility model of Sepp [
23], Langrené et al. [
24], based on a closed-form vol-of-vol expansion for solving the pricing PDE arising from the use of the Dobrić-Ojeda process (Dobrić and Ojeda [
25]) to approximate the fractional Brownian motion.
Compared to alternative pricing methods for the rBergomi model, the main advantage of our proposed Markovian approximation approach is that it does not require pricing methods specifically designed for rough volatility models; instead, classical Markovian pricing methods can be used for both vanilla and exotic options. In practice, the Monte Carlo pricing method is the method of choice for the aBergomi model in view of the number of terms needed for good accuracy. The computational cost of simulating our proposed aBergomi model is proportional to the number of time-steps
N, which makes it an interesting alternative to the approximate
hybrid scheme of Bennedsen et al. [
21] and the exact
covariance-based scheme of Bayer et al. [
1], Bayer et al. [
10]. The main downside is that the approximation of the rBergomi power kernel by a sum of exponential terms introduces some error, for which we provide an explicit bound in the
sense. In particular, as in the case of the Riemann-sum scheme of Bennedsen et al. [
21], the Fourier-based scheme of Benth et al. [
26], or the approximation by a Dobrić-Ojeda process in Carr and Itkin [
22], a truncation of the power kernel singularity at
cannot be avoided.
The paper is organized as follows. In
Section 2, we introduce the Bergomi and rBergomi models and discuss their respective ATM volatility skews. The rBergomi model is closely related to the RFSV model introduced in Alòs et al. [
27], for which the ATM volatility is proved to be equivalent to the power
for short maturity using the Malliavin technique, a result confirmed in Fukasawa [
3] using a martingale expansion approach. We prove in this section that a similar result holds for the rBergomi model, while this does not hold for the Bergomi model (Equation (
9)). We also establish the quasi-affine structure of the rough Bergomi model.
Section 3 is dedicated to the approximation of the rough Bergomi model by a multi-factor Bergomi model, both theoretically and numerically. Finally,
Section 4 compares numerical simulations of the rBergomi model with our approximated Bergomi (aBergomi) model with a finite number of terms, showing the effectiveness of our approximation.
2. Rough Bergomi Skew and Quasi-Affine Structure
Firstly, this section introduces the Bergomi and rough Bergomi stochastic volatility models (Definitions 1 and 2), along with the corresponding notations used throughout the paper.
We consider a filtered probability space
, which supports two-dimensional correlated Brownian motions
W and
B. A log price process
is assumed to follow the dynamics
where
is the instantaneous spot variance process. Let
be the instantaneous forward variance for date
u observed at time
t; in particular,
corresponds to the spot variance.
Bayer et al. [
1] proposed the so-called rough Bergomi model where the forward variance follows
where
W and
B have correlation
,
is a negative exponent depending on the Hurst exponent
of the underlying fractional Brownian motion, and
is a positive parameter depending on
H. The definition of the rBergomi model is summarized below:
Definition 1. The rBergomi stochastic volatility model takes the formwhere , and . By contrast, the two-factor Bergomi model is defined as follows.
Definition 2. The two-factor Bergomi model (Bergomi [28], Bergomi [29]) is defined by:withwhere is the lognormal volatility of the instantaneous variance under the normalizing factor and θ is a mixing parameter of the short-term factor driven by and the long-term factor driven by (). Assumption 1. Without loss of generality, we assume throughout the paper that the initial forward variance curve is flat. This simplification is common in the rBergomi literature; see, for example, Bayer et al. [1], Bayer et al. [6], and Bayer et al. [15]. We henceforth use the notation for the constant initial forward variance curve. 2.1. ATM Volatility Skew
This subsection derives the ATM volatility skew of the rBergomi and Bergomi models, as the more realistic ATM volatility skew of the rBergomi model over the one of the Bergomi model is one of the motivations behind the introduction of the rBergomi model.
From Bergomi and Guyon [
30], we can define the price and the volatility dynamics of a generic stochastic volatility model as follows:
where
is the log-spot,
is the instantaneous spot variance,
is the instantaneous forward variance for date
u observed at time
t, and
is the volatility of forward instantaneous variances which takes values in
where
d is the dimension of the Brownian motion
B. Note that in this formulation, the covariance between spot and variance is modelled through the first component of
, see Bergomi and Guyon [
30] for more details.
One can derive the following second-order expression (w.r.t. volatility of volatility) for the Black-Scholes implied volatility:
where
,
K is the strike and
is a dimensionless scaling factor for the volatility of variances. The ATM volatility and the two coefficients
and
are given by
where
is the total variance to expiration
T,
is the effective volatility. Here,
for any
under Assumption 1, which means that
and
.
From Bergomi and Guyon [
30], we can derive the following second-order expansion for the autocorrelations
:
is the doubly integrated spot-variance covariance function,
.
is the triply integrated variance/variance covariance function,
.
is the double time-integral of the instance spot variance covariance function times the sensitivity of with respect to instantaneous forward variances,
,
where
and
are given by
2.1.1. ATM Volatility Skew in the rBergomi Model
Theorem 1. In the rBergomi model (3), the ATM volatility skew satisfies Proof. We first explicit the autocorrelation functional in the rBergomi model. Using the fact that
, the autocorrelation functionals
and
are given by
Then, using the fact that
we obtain
Therefore, using Assumption 1, we obtain the following explicit first-order approximation:
where
is a constant depending on
H. We are then able to compute the first-order approximations of the three correlation values
explicitly. The first-order approximation of
can be written as follows:
Thus, the ATM volatility skew generated by the rBergomi model satisfies (
8), which is consistent with empirical evidence (see for example, Gatheral et al. [
4]). □
Remark 1. Besides the rBergomi model, there exist other fractional volatility models which also satisfy Equation (8); see, for example, Fukasawa [31] (subsection 3.3). 2.1.2. ATM Volatility Skew in the Two-Factor Bergomi Model
We now compare this result to the volatility skew in the classical two-factor Bergomi model.
Theorem 2. In the two-factor Bergomi model, the ATM volatility skew satisfies Proof. The Brownian motions
can be decomposed as:
where
are three independent Brownian motions and
. Thus, the volatilities of variance
in the general formulation (
5) can be written as:
or equivalently:
where
The corresponding covariances can be expressed similarly as:
Once again using Assumption 1 and the autocorrelations provided by Bergomi and Guyon [
30], we obtain
where
and
Similarly, we have
, with the coefficients
and
Since
and
are constants, we can derive the term structure of the ATM volatility skew as in Equation (
9) with the first order in
. □
However, this result derived for the Bergomi model by the Bergomi-Guyon expansion [
30] is inconsistent with empirical evidence; see, for example, Bayer et al. [
1]. This suggests that the power-law kernel of the forward variance curve in the rBergomi model will lead to more realistic and accurate pricing and hedging results than the exponential kernel of the forward variance curve in the Bergomi model.
2.2. Markovian Representation of the Rough Bergomi Model
The purpose of this section is to establish the infinite-dimensional affine nature and Markovianity of the rBergomi model.
Definition 3. An Ornstein-Uhlenbeck (OU) process is the solution of the following stochastic differential equation (SDE):where is the mean-reversion speed, is the mean-reversion level, and is a standard Brownian motion. Its strong solution is explicitly given by Assumption 2. In the rest of the paper, we always assume thatwhere η and α come from Definition 1 of the rBergomi model (see Bayer et al. [1]). Definition 4. Without loss of generality, we define, for , the sigma-finite measure on as 2.2.1. Volterra-Type Integral as a Functional of a Markov Process
Theorem 3. Using Definitions 3 and 4, the Volterra-type integral in the rBergomi model has the Markovian representation Proof. The Laplace transform of the measure
in Definition 4 is
which can be recognised as the power-law kernel in the Volterra-type integral. Consequently, we have
, and using Fubini’s stochastic theorem, see Protter [
32], we obtain
. From Definition 3, where
, we obtain the Markovian representation given by Equation (
14). □
Theorem 4. The OU process (11) has the affine structure Proof. From Fubini’s stochastic theorem,
is Gaussian under the filtration
for
, with mean
Furthermore, using Itō’s isometry, we have the conditional variance:
2.2.2. Quasi-Affine Structure in the rBergomi Model
From Definition 1 and Theorem 3, the rBergomi model can be rewritten in the following form:
where
is the log stock price,
is the initial flat forward variance curve, and
are two Brownian motions with correlation
and
. Our aim is now to write the log stock price
in a quasi-affine form as the first coordinate of an infinite-dimensional affine process. To do so, we introduce the following symmetric non-negative tensor:
where we used the notation
. Let
, where
i is the imaginary unit (
). The relation
holds. Therefore, the log stock price dynamics can be written as
where
is the Doléans-Dade stochastic exponential.
Theorem 5. The process satisfies the affine structurewhere Proof. From Fubini’s stochastic theorem,
is Gaussian under the filtration
for
, with conditional mean
and conditional variance
Then, the random variable defined as
is a noncentral
distribution with one degree of freedom and noncentrality parameter
Thus, the Formulas (
16) and (17) for
and
follow from the characteristic function of the noncentral
distribution, which concludes the proof. □
Corollary 1. The rBergomi model admits an infinite-dimensional Markovian representation.
Proof. This corollary follows from Theorem 5 which exhibits that the rBergomi model has an exponential-affine dependence on x; hence, the model is Markovian in each dimension. □
3. Rough Bergomi Approximation and Monte Carlo Schemes
In this Section, we first introduce the aBergomi model which is used to approximate the rBergomi model (
3). After that, we will demonstrate the existence and uniqueness of the solution of this aBergomi model. We also prove that the aBergomi model is well-defined and the solution of the aBergomi model converges to that of the rBergomi model when the number of terms
n in the aBergomi model goes to infinity. At the same time, we show that the aBergomi model inherits the affine structure of the Bergomi model.
3.1. Approximation of the Rough Bergomi Model by an n-Term Bergomi Model
Since the rBergomi model can be represented by
and the
n-term Bergomi model with the same Brownian motion in the variance process can be represented by
we can view the rBergomi model as a continuous infinite-term Bergomi model under the measure
, in which the mean-reversion speed
x has been integrated from 0 to
∞, with respect to the Brownian motion
. We can therefore approximate the rBergomi model by an
n-term exponential kernel
instead of the power kernel
of the Volterra process in the rBergomi model.
Following Equation (
18), after approximating the exponential kernel
by the kernel
, we can rewrite the aBergomi model (
18) as follows:
where
are positive weights,
are mean-reverting speeds, and
, with initial conditions
and
.
3.1.1. Existence and Uniqueness of
We rewrite
in (
19) as the following stochastic equation
Theorem 6. Under the conditions of the model (19), there exists a unique, strong, non-negative solution to Equation (20). Proof. Øksendal and Zhang [
33] imply that there exists a unique, strong, non-negative solution
to Equation (
20) under the conditions of the model (
19). □
Then, the strong existence and uniqueness of follows, along with its Markovianity w.r.t. the spot price and the factors for .
3.1.2. Convergence of to
To prove that the solution of the aBergomi model
converges to the solution of the rBergomi model
, we need to choose a suitable
to approximate
. When
,
(see Carmona et al. [
34], Muravlev [
35], Harms and Stefanovits [
20]).
Theorem 7. There exist weights , mean reversion speeds , and a constant C depending on H and T only such thatwhere is the norm. In particular, when . The proof of this theorem can be found in
Appendix A.
Applying the previous computations and the Kolmogorov tightness criterion, we can get that the sequence
is tight for the uniform topology and the limit satisfies the model (
19).
3.2. Affine Structure of the aBergomi Model
In this section, we detail the affine property of the aBergomi model.
Theorem 8. The process (Equation (20)) has the following affine structure Proof. Similarly, we can derive the affine structure of by Theorem 5. □
Then, we describe the so-called hybrid scheme and introduce an algorithm to approximate the rBergomi model by the aBergomi model.
3.3. Hybrid Scheme for the rBergomi Model
Recalling Equation (
3), the rough Bergomi model with time horizon
under an equivalent martingale measure
can be written as:
where
are two standard Brownian motions with correlation
. We recall from Assumption 1 that the forward variance curve
is flat for all
:
. Thus, the spot variance
in Equation (
21) is given by
To simulate the Volterra-type integral
, we apply the hybrid scheme proposed in Bennedsen et al. [
21], which approximates the kernel function of the Brownian semi-stationary processes by a Wiener integral of the power function at
and a Riemann sum elsewhere. Let
be a filtered probability space which supports a standard Brownian motion
. We consider a Brownian semi-stationary process (
Bss):
where
is an
-predictable process which captures the stochastic volatility of
and
is a Borel-measurable kernel function. We assume that
for all
and the process is covariance-stationary, namely,
These assumptions imply that is covariance-stationary. However, the process need not be strictly stationary.
Assumption 3. The assumptions regarding the kernel function g are as follows:
- (A1)
For some ,where is continuously differentiable, slowly varying at 0 and bounded away from 0. Moreover, there exists a constant such that the derivative of satisfies - (A2)
The function g is continuously differentiable on , and the derivative is ultimately monotonic and satisfies .
- (A3)
For some ,
In order to implement the hybrid scheme to the rBergomi model, we need to introduce a particular class of non-stationary processes, namely, truncated Brownian semi-stationary (
) processes,
where the kernel function
, the volatility process
, and the driving Brownian motion
are as defined in the definition of
processes.
can also be seen as the truncated stochastic integral at 0 of the
process
. Equation (
23) is integrable since
is differentiable on
.
Now, we can discretise Equation (
23) in time. Let
N be the total number of time-steps,
be the time-step size, and
be a time grid on the interval
.
According to Bennedsen et al. [
21], the observations
can be computed via (
case)
using the random vectors
the random variables
where
, and the random vectors
(see Proposition 2.8 in Bennedsen et al. [
21]). To simulate the Volterra process
, we use:
The corresponding matrix representation takes the form of
In the rBergomi model,
is a constant for
defined in Equation (13). When simulating
, we need to perform a matrix multiplication, the computational complexity of which is of order
when using the conventional matrix multiplication algorithm. However, multiplying a lower triangular Toeplitz matrix can be regarded as a discrete convolution which can be evaluated efficiently by fast Fourier transform. Therefore, the computational complexity can be reduced to
. The algorithm to simulate the Volterra process
is described in Algorithm 1 below. Then, we can use a standard Euler scheme to simulate the price
, as shown in Algorithm 2.
Algorithm 1: Volterra process |
|
Table 1 reports the parameters used for our numerical experiments, which are the same as in Bayer et al. [
1] and Bennedsen et al. [
21]. Recall from the definition of
that the chosen value
corresponds to the Hurst exponent
. Such small values of
H are indeed consistent with empirical experiments, and one can refer to the recent works Forde et al. [
36] and Gerhold [
37] about the behaviour of the rBergomi model for small
H.
Algorithm 2: Rough Bergomi model |
|
3.4. Markovian Scheme for the aBergomi Model
For the sake of simplicity, we start by deriving the approximation of the rBergomi model by a Bergomi model with two terms. The same approach can be used when the number of terms is greater than two. The two-term Bergomi model (
4) that we used to approximate the rBergomi model is given by
where
. Here, we introduce the process
defined as
where the two parameters
and
come from the exponential kernel
, and
and
are two OU processes. Hence, the process
can be written as a driftless Gaussian process as follows:
and its quadratic variation is given by
where
. The forward variation process
can be written as
. Thus, the solution of the forward variation process is
, where
and
Recall that and when when the number of terms n is large enough.
Using the approximation by the Bergomi model, we consider the parameters in the exponential kernel on . Note that when , the power kernel while is finite. To compute the approximation numerically, we need to truncate the kernel . To do so, we can use the module in or the function in for the nonlinear regression of the parameters and the simulated price . We exemplify the truncation of by letting , the truncated parameter , and let .
We define the integral
on the truncated region
and apply the scaling property of Brownian motion as follows:
After scaling , the process has to remain driftless Gaussian and satisfy , where . Then, the process can be written as . Thus, the kernel in the rBergomi model on can be approximated by .
In view of Equations (
26) and (
27) and the derivations in this subsection, a simple Monte Carlo simulation scheme for the
n-term aBergomi model is given by Algorithm 3. In practice, the truncation of the rBergomi power kernel means that, as is the case for the Riemann-sum scheme of Bennedsen et al. [
21], this scheme is able to capture the shape of the implied volatility smile, but not its level. A multiplication factor is used in Algorithm 3 for each time-step to correct for this phenomenon. In practice, these factors can be estimated using another calibrated scheme, or more simply, from quoted option prices.
Algorithm 3: n-term aBergomi model when |
|
4. Simulation Results
In this section, we compare the simulated volatilities of the rBergomi and aBergomi models. To demonstrate the approximation’s accuracy and efficiency, we investigate the Mean Absolute Error (MAE) of simulated results for different number of terms and number of time-steps in numerical tests.
Figure 1 displays the power kernel
in the rBergomi model and the
kernel of the 20-term aBergomi model with
and
. This figure suggests that this
obtained by nonlinear regression is sufficiently accurate, with a MAE of
.
The volatility smiles in
Figure 2 are obtained by simulating the rBergomi model as described in
Section 3.3, and the aBergomi model as described in
Section 3.4 using the multiplication factors reported in
Table 2. From
Figure 2, we note that the at-the-money calibration is better with 50 time-steps at the cost of a worse out-of-the-money calibration. Meanwhile, 100 time-steps can approximate the rBergomi model better than 50 time-steps for almost all strikes.
We compute the MAE of the implied volatility approximation with different numbers of terms in the aBergomi model and different time-steps in
Figure 3, and compare the pricing speed in
Table 3. As expected, the higher the number of terms in the aBergomi model, the lower the MAE for all time-steps, but the difference between the models decreases when the number of time-steps decreases. Another expected result is that the computational time increases with both the number of terms and time-steps. The number of terms and time-step combinations provide a good trade-off between speed and accuracy, such as the 20-term aBergomi model with 100 time-steps and 20,000 Monte Carlo paths.