On the Calibration of the Kennedy Model

Tóth-Lakits, Dalma; Arató, Miklós

doi:10.3390/math12193059

Open AccessFeature PaperArticle

On the Calibration of the Kennedy Model

by

Dalma Tóth-Lakits

^*,†

and

Miklós Arató

^†

Department of Probability Theory and Statistics, Eötvös Loránd University, 1117 Budapest, Hungary

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics 2024, 12(19), 3059; https://doi.org/10.3390/math12193059

Submission received: 31 August 2024 / Revised: 26 September 2024 / Accepted: 27 September 2024 / Published: 29 September 2024

(This article belongs to the Special Issue Mathematical Finance: Statistical Inference, Stochastic Modeling, and Advanced Algorithms)

Download

Browse Figures

Versions Notes

Abstract

The Kennedy model offers a robust framework for modeling forward rates, leveraging Gaussian random fields to accommodate emerging phenomena such as negative rates. In our study, we employ maximum likelihood estimations to determine the parameters of the Kennedy field, utilizing Radon–Nikodym derivatives for enhanced accuracy. We introduce an efficient simulation method for the Kennedy field and develop a Black–Scholes-like analytical pricing formula for diverse financial assets. Additionally, we present a novel parameter estimation algorithm grounded in numerical extreme value optimization, enabling the recalibration of parameters based on observed financial product prices. To validate the efficacy of our approach, we assess its performance using real-world par swap rates in the latter part of this article.

Keywords:

Kennedy model; calibration; term structure model; option pricing; interest rate swap; Gaussian random field; Heath–Jarrow–Morton framework; HJM model

MSC:

91G15; 62M40; 60G60

1. Introduction

In the 2010s, a new phenomenon, negative rates, appeared in the financial markets, which brought extreme uncertainty to the world, resulting in the mathematical models used to describe the dynamics of interest rates being reconsidered. The model, defined by Kennedy in the 1990s, describes the dynamics of the forward rates with Gaussian random fields [1,2]. This approach has several advantages; for example, it offers a solution to handle negative rates naturally and can be connected to the industry standard Heath–Jarrow–Morton (HJM) framework [3]. Additionally, maximum likelihood estimations of the parameters and analytical Black–Scholes-like pricing formulas for different financial assets can be derived due to the standard distribution properties of the Gaussian random fields.

This article summarizes the most critical issues related to using the Kennedy model in the financial world. In Section 2, we present the term structure model for describing forward interest rates based on the Gaussian random fields proposed by Kennedy. Among other things, we present the condition for the martingale property of the discounted bond price and show which cases coincide with the Gaussian Heath–Jarrow–Morton framework. Section 3 introduces the theoretical background of the parameter estimations. The results of the Radon–Nikodym derivative of Gaussian measures with different means are shown. This section derives the maximum likelihood and probability one estimation for the parameters in the Kennedy field. Section 4 shows a practical, simple, and fast way to simulate the Kennedy field with the help of the Brownian sheet. The following Section 5 contains the analytical, fair price of various financial products (caplet, floorlet, and swap). Section 6 summarizes the calibration methods for different financial products, in our case, the optimization algorithm, which is based on a numerical extreme value search to estimate the parameters of the field. Finally, use of the previously presented calibration algorithm on real swap par rate data can be found in Section 7.

2. Kennedy Model

The development of forward rates in the model proposed by Kennedy is described in the following equation.

F (s, t) = α (s, t) + X (s, t)

(1)

where

X (s, t)

is a centered Gaussian random field with covariance structure specified by

c o v [X (s_{1}, t_{1}), X (s_{2}, t_{2})] = c (s_{1} \land s_{2}, t_{1}, t_{2}), 0 \leq s_{i} \leq t_{i}, i = 1, 2 .

(2)

The function c is given and satisfies

c (0, t_{1}, t_{2}) = 0

. We assume that the drift function

α (s, t)

is deterministic and continuous for

0 \leq s \leq t

, and the initial term structure of

α (0, t), (where t \geq 0)

is specified. Additionally, we also have

E F (0, t) = α (0, t)

for

t \geq 0

. The covariance function

c (s_{1} \land s_{2}, t_{1}, t_{2})

is symmetric in

t_{1}

and

t_{2}

, and it is non-negative definite in pairs

(s_{1}, t_{1})

and

(s_{2}, t_{2})

. The dependence on

s_{1} \land s_{2}

ensures that the Gaussian random field

X (s, t)

exhibits independent increments.

A sufficient condition for the drift surface is established to guarantee that the discounted zero-coupon bond prices are martingales. Therefore, the model can be used to price financial products in the future.

First, let us introduce the following notations, where

0 \leq s \leq t

.

\begin{matrix} R (t) & = F (t, t) \end{matrix}

(3)

\begin{matrix} F^{Δ} (s, t) & = \frac{1}{Δ} \int_{t}^{t + Δ} F (s, u) d u \end{matrix}

(4)

\begin{matrix} P (s, t) & = e^{- \int_{s}^{t} F (s, u) d u} \end{matrix}

(5)

\begin{matrix} Z (s, t) & = e^{- \int_{0}^{s} R (u) d u} P (s, t) \end{matrix}

(6)

\begin{matrix} F (s) & = σ {F (u, v), 0 \leq u \leq s, u \leq v} \end{matrix}

(7)

where

R (t)

denotes the spot rate at time t,

P (s, t)

represents the price at time s of a bond paying one unit at time

t \geq s

.

Z (s, t)

defines the discounted price of the previously defined bond at time 0, with the information available at time s captured in the

F (s)

σ

-algebra, indicating that the entire yield curve is observable at each time point. We also introduce a new notation,

F^{Δ} (s, t)

, for the continuously compounded forward rate over the interval

[t, t + Δ], (where Δ > 0

), which can be interpreted as an average of the forward rate for the current period, at time s.

An important theorem is emphasized in Kennedy’s article, which states the following [2].

Theorem 1

(Kennedy (1997) [2]). In the independent-increments case the following statements are equivalent:

(a): The discounted bond-price process ${Z (s, t), F (s), (0 \leq s \leq t)}$ is a martingale for each $t \geq 0$ ;
(b): $P (s, t) = E [e^{- \int_{s}^{t} R (u) d u} | F (s)]$ , for all $(s, t),$ $(0 \leq s \leq t)$ ; and
(c): $α (s, t) = α (0, t) + \int_{0}^{t} [c (s \land v, v, t) - c (0, v, t)] d v$ for all $(s, t)$ , $(0 \leq s \leq t)$ .

The proof of the theorem is accessible in the original article written by Kennedy [1]. Furthermore, a different derivation of the theorem can be found in Appendix A.1. To complete the proof, it is necessary to include an additional statement, which formulates an equivalent form of defining the drift term with the covariance function.

Remark 1.

The two statements for the drift term in the Kennedy model are equivalent. For all

0 \leq s \leq t

\begin{matrix} α (s, t) = α (t, t) + \int_{s}^{t} [c (s, v, t) - c (v, v, t)] d v \\ if and only if \end{matrix}

(8)

\begin{matrix} α (s, t) & = α (0, t) + \int_{0}^{t} [c (s \land v, v, t) - c (0, v, t)] d v . \end{matrix}

(9)

Proof of Remark 1.

The proof is given by showing that both directions are correct.

\begin{matrix} \Rightarrow & α (s, t) - α (0, t) = α (t, t) + \int_{s}^{t} [c (s, v, t) - c (v, v, t)] d v - α (t, t) - \int_{0}^{t} [c (0, v, t) - c (v, v, t)] d v \end{matrix}

(10)

\begin{matrix} = \int_{s}^{t} [c (s, v, t) - c (0, v, t)] d v + \int_{0}^{s} [c (v, v, t) - c (0, v, t)] d v = \int_{0}^{t} [c (s \land v, v, t) - c (0, v, t)] d v \end{matrix}

(11)

\begin{matrix} \Leftarrow & α (s, t) - α (t, t) = α (0, t) + \int_{0}^{t} [c (s \land v, v, t) - c (0, v, t)] d v - α (0, t) - \int_{0}^{t} [c (t \land v, v, t) - c (0, v, t)] d v \end{matrix}

(12)

\begin{matrix} = \int_{0}^{t} [c (s \land v, v, t) - c (v, v, t)] d v = \int_{s}^{t} [c (s, v, t) - c (v, v, t)] d v \end{matrix}

(13)

□

Connection Between HJM and the Kennedy Model

The Heath–Jarrow–Morton framework is a widely used model considered an industry standard [3]. This is also a term structure model, which creates a connection between bonds with different maturities. The HJM model is an infinite-dimensional framework. Therefore, the whole yield curve evolves in forward time instead of at a specific point.

This framework’s key point is to recognize an explicit relationship between drift and volatility parameters of the forward rate dynamics in a no-arbitrage world [4]. The critical assumption of the Heath–Jarrow–Morton model is that there is an elementary bond for each maturity. Overall, in an arbitrage-free term structure model, the forward rates must evolve like the following stochastic differential equation.

d F (s, t) = σ (s, t) (\int_{s}^{t} σ (s, u) d u) d s + σ (s, t) d W (s)

(14)

Hereinafter, the main statement of the Heath–Jarrow–Morton model is that if there are no arbitrage opportunities, the forward rate drift is driven by volatility, known as the HJM no-arbitrage drift condition. This drift condition arises from the fact that the discounted process must be martingale [3].

Kennedy stated in his article that the Kennedy model encompasses the Heath–Jarrow–Morton framework in scenarios where the coefficients

β (s, t)

and

σ_{i} (s, t)

in the underlying stochastic differential equations are deterministic, resulting in the rates

F (s, t)

being Gaussian [2]. Therefore, in this section, we will show precisely in which cases the two models can correspond to each other.

The notations of the HJM model are written consistently with those found in the book by Shreve ([4] on pages 423–435). We examine the case when a single Wiener process drives the forward interest rates, and

β (s, t)

and

σ (s, t)

are deterministic processes; then, the dynamics can be written as follows:

F (s, t) = F (0, t) + \int_{0}^{s} β (u, t) d u + \int_{0}^{s} σ (u, t) d W (u)

(15)

where

F (0, t)

refers to the initial forward year curve known at time 0,

W (u)

is a Wiener process under the actual measure, and

β (s, t)

and

σ (s, t)

are deterministic processes in the variable s. Let us denote

{ξ (t)}_{t \geq 0} = {F (0, t)}_{t \geq 0}

, which is independent from process

{W (t)}_{t \geq 0}

and is a Gaussian process.

The expected value and the covariance function of the Heath–Jarrow–Morton framework and the key Kennedy field conditions can be written in the following form.

The expected value function from the Heath–Jarrow–Morton model can be calculated in the following way.

$α (s, t) = E F (s, t) = E ξ (t) + \int_{0}^{s} β (u, t) d u = m (t) + \int_{0}^{s} β (u, t) d u$

(16)
Similarly to the previously calculated expected value function, the covariance function is calculated as follows. Let us denote the covariance function between $ξ (t_{1}), ξ (t_{2})$ with $c o v (ξ (t_{1}), ξ (t_{2})) = r (t_{1}, t_{2})$

$c (s_{1}, s_{2}, t_{1}, t_{2}) = c o v (F (s_{1}, t_{1}), F (s_{2}, t_{2})) = c o v (ξ (t_{1}), ξ (t_{2})) + \int_{0}^{min (s_{1}, s_{2})} σ (u, t_{1}) σ (u, t_{2}) d u$

(17)

The covariance function is specified as a function of $s_{1} \land s_{2}$ . This ensures that the Gaussian random field $X (s, t)$ has independent increments in time s, which is also fulfilled due to point 2 in the HJM framework. This confirms that all Gaussian HJM models (where the drift and the volatility terms are deterministic) are the well-known Kennedy model.
By adding the martingale property into the Kennedy model (like in point (c) in Theorem 1), this guarantees that the conditional expected value of the discounted bond-price process is a martingale under the risk-neutral measure. As a result, the model is arbitrage-free. Then, by matching the equations of the expected values to each other, we obtain the famous condition of the HJM model, according to which the drift term can be obtained in the form below.

$\begin{matrix} α (0, t) + \int_{0}^{s} β (u, t) d u & = α (0, t) + \int_{0}^{t} [c (s \land v, v, t) - c (0, v, t)] d v \end{matrix}$

(18)

$\begin{matrix} α (0, t) + \int_{0}^{s} β (u, t) d u & = α (0, t) + \int_{0}^{t} [r (v, t) + \int_{0}^{m i n (s, v)} σ (u, v) σ (u, t) d u - r (v, t)] d v \end{matrix}$

(19)

$\begin{matrix} \int_{0}^{s} β (u, t) d u & = \int_{0}^{t} \int_{0}^{m i n (s, v)} σ (u, v) σ (u, t) d u d v \end{matrix}$

(20)

$\begin{matrix} β (s, t) & = σ (s, t) \int_{s}^{t} σ (s, v) d v \end{matrix}$

(21)

where the last equation equals the famous no-arbitrage HJM condition.
By adding the Markov property to the previous conditions, where the discounted bond price process is martingale, we obtain an even narrower class of models. We first define the following concepts based on Kennedy’s article for a random field ${F (s, t) : 0 \leq s \leq t}$ [2].
Definition 1
(first Markov property). F satisfies the first Markov property if for all $0 \leq s_{1} \leq s_{2} < s_{3}$ , $s_{1} \leq t_{1}$ and $s_{3} \leq t_{2}$ the following holds: $F (s_{1}, t_{1}) ⊥ F (s_{3}, t_{2}) | F (s_{2}, t_{2})$ .
Definition 2
(second Markov property). F satisfies the second Markov property if for all $0 \leq s_{1} < s_{2}$ and for any $t_{1}$ , $t_{2}$ with $s_{2} \leq t_{1} \land t_{2}$ the following condition holds: $F (s_{1}, t_{1}) ⊥ F (s_{2}, t_{2}) | F (s_{2}, t_{1})$ .
Definition 3
(Markov property). F is considered Markovian if it satisfies both the first and second Markov properties.
Definition 4
(Markov in t-direction). F is said to be Markovian in the t-direction, meaning in the maturity-time coordinate, if for all $s \leq t_{1} \leq t_{2} \leq t_{3}$ the following condition holds

$F (s, t_{1}) ⊥ F (s, t_{3}) | F (s, t_{2})$

Definition 5
(strict Markov property). F is considered strictly Markovian if it is both Markov and Markovian in the t-direction.
Kennedy stated (in theorem 3.1 in [2]) that if a random field of forward rates is Markovian and satisfies the independent increments property, then the covariance function can be expressed in the following form.

$c (s, t_{1}, t_{2}) = f (s) g (t_{1}, t_{2}),$

(22)

where f is a monotone increasing, and g is a symmetric and positive, semidefinite function. This property can be written as follows for the HJM model.

$r (t_{1}, t_{2}) + \int_{0}^{s} σ (u, t_{1}) σ (u, t_{2}) d u = f (s) g (t_{1}, t_{2})$

(23)

Then, by deriving (23) according to the variable s we obtain

$σ (s, t_{1}) σ (s, t_{2}) = f^{'} (s) g (t_{1}, t_{2})$

(24)

By setting $t_{1}$ and $t_{2}$ equal to each other $(t_{1} = t_{2} = t)$ , we obtain the following equality

$σ^{2} (s, t) = f^{'} (s) g (t, t)$

(25)

Consequently,

$σ (s, t) = b (s) g (t),$

(26)

where $b (s) = \sqrt{f^{'} (s)}$ and $g (t) = \sqrt{g (t, t)}$ . Therefore, it is shown that if the HJM model is Markovian, then the $σ (s, t)$ function appears in the form of Equation (26). We thus obtained that in the Markovian case, the volatility function must be separable in the time parameters. Hence,

$σ (s, t_{1}) σ (s, t_{2}) = b^{2} (s) g (t_{1}) g (t_{2})$

(27)

For $s = 0$ , Equation (23) can be written in the following form

$r (t_{1}, t_{2}) = f (0) \cdot g (t_{1}, t_{2}) .$

(28)

From Equations (23), (28), and (39) it can be stated that

$\begin{matrix} r (t_{1}, t_{2}) & = f (s) g (t_{1}, t_{2}) - g (t_{1}) g (t_{2}) \int_{0}^{s} b^{2} (u) d u \end{matrix}$

(29)

$\begin{matrix} f (0) g (t_{1}, t_{2}) & = f (s) g (t_{1}, t_{2}) - g (t_{1}) g (t_{2}) \int_{0}^{s} b^{2} (u) d u \end{matrix}$

(30)

$\begin{matrix} g (t_{1}, t_{2}) & = g (t_{1}) g (t_{2}) \int_{0}^{s} f^{'} (u) d u \end{matrix}$

(31)

If the function $f (s)$ is constant, then we obtain the trivial case when $σ (s, t) = 0$ for all $(s, t)$ . In the non-trivial case, we obtain from Equation (31) that $f (s)$ is not constant. Therefore, we obtain

$\begin{matrix} r (t_{1}, t_{2}) & = c g (t_{1}) g (t_{2}) \end{matrix}$

(32)

Hence, we have shown that if the HJM model is Markovian, then functions $σ$ and r occur in the previously derived form. Now, we show the opposite direction: if our covariance function has this shape, then the HJM model will be Markovian.

$\begin{matrix} c (s, t_{1}, t_{2}) & = c g (t_{1}) g (t_{2}) + g (t_{1}) g (t_{2}) \int_{0}^{s} b^{2} (u) d u \end{matrix}$

(33)

$\begin{matrix} = \underset{g (t_{1}, t_{2})}{\underset{⏟}{g (t_{1}) g (t_{2})}} \underset{f (s)}{\underset{⏟}{(c + \int_{0}^{s} b^{2} (u) d u)}} \end{matrix}$

(34)

$\begin{matrix} = f (s) g (t_{1}, t_{2}) \end{matrix}$

(35)

which is exactly the necessary condition (22).
In 1992, Cheyette published an article in which a restriction was applied to the Heath–Jarrow–Morton model, which formed a subset of the original HJM models to make the model Markovian. This so-called Cheyette model is an arbitrage-free term structure model that is Markovian in a finite number of state variables and is consistent with any arbitrary initial term structure. Due to these favorable properties, the Cheyette model quickly spread throughout the industry and became widely used [5].
In this case, the volatility function has to be separable into time- and maturity-dependent factors given by the following structure [6].

$σ (s, t) = α (t) \frac{β (s)}{α (s)}$

(36)

However, this condition is completely identical to the previously derived condition for the volatility term in the Markov case in the Kennedy model.
Kennedy further narrowed the model class by requiring stationarity in addition to the Markov property and the independent increments property (stated in Theorem 3.2 in [2]).
Definition 6
(stationary). F is stationary if, for each $t > 0$ , the joint distributions of ${F (s, t) : 0 \leq s \leq t}$ are identical to those of ${F (s + u, t + u) : 0 \leq s \leq t}$ for any fixed $u > 0$ .
Therefore, the covariance function takes the form below:

$c (s, t_{1}, t_{2}) = e^{λ (s - (t_{1} \land t_{2}))} \cdot h (| t_{1} - t_{2} |)$

(37)

where $λ \geq 0,$ $| H (x) | \leq h (0) e^{- λ \frac{x}{2}}$ and $x \geq 0$ .
For the HJM framework, it was shown that $r (t_{1}, t_{2}) = 0$ , hence, according to point 5,

$\begin{matrix} c (s, t_{1}, t_{2}) & = f (s) g (t_{1}, t_{2}) = g (t_{1}) g (t_{2}) (c + \int_{0}^{s} b^{2} (u) d u) \end{matrix}$

(38)

$\begin{matrix} = e^{λ (s - (t_{1} \land t_{2}))} \cdot h (| t_{1} - t_{2} |) \end{matrix}$

(39)

For $s = 0$ and $t_{1} = t_{2} = t$ , it can be written

$\begin{matrix} c g^{2} (t) & = e^{- λ t} h (0) \end{matrix}$

(40)

$\begin{matrix} g (t) & = \sqrt{\frac{h (0)}{c}} \cdot e^{\frac{- λ t}{2}} \end{matrix}$

(41)

Returning to Equation (38)

$\begin{matrix} c + \int_{0}^{s} b^{2} (u) d u & = \frac{1}{g (t_{1}) g (t_{2})} e^{λ (s - (t_{1} \land t_{2}))} \cdot h (| t_{1} - t_{2} |) \end{matrix}$

(42)

$\begin{matrix} c + \int_{0}^{s} b^{2} (u) d u & = \frac{c}{h (0)} e^{\frac{λ t_{1}}{2}} e^{\frac{λ t_{2}}{2}} e^{λ (s - (t_{1} \land t_{2}))} \cdot h (| t_{1} - t_{2} |) \end{matrix}$

(43)

Now substituting $s = 0$

$\begin{matrix} c \cdot h (0) & = c exp \{\frac{λ}{2} (t_{1} + t_{2}) - λ (t_{1} \land t_{2})\} h (| t_{1} - t_{2} |) \end{matrix}$

(44)

$\begin{matrix} h (0) & = c exp \{\frac{λ}{2} | t_{1} - t_{2} |\} h (| t_{1} - t_{2} |) \end{matrix}$

(45)

$\begin{matrix} h (u) & = h (0) exp \{\frac{- λ}{2} u\} \end{matrix}$

(46)

Returning again to Equation (39), while substituting Equation (41)

$\begin{matrix} \frac{h (0)}{c} exp \{\frac{- λ}{2} (t_{1} + t_{2})\} (c + \int_{0}^{s} b^{2} (u) d u) & = e^{λ (s - (t_{1} \land t_{2}))} \cdot h (| t_{1} - t_{2} |) \end{matrix}$

(47)

$\begin{matrix} = e^{λ (s - (t_{1} \land t_{2}))} \cdot h (0) exp \{\frac{- λ}{2} | t_{1} - t_{2} |\} \end{matrix}$

(48)

$\begin{matrix} \frac{1}{c} (c + \int_{0}^{s} b^{2} (u) d u) & = e^{λ s} \end{matrix}$

(49)

$\begin{matrix} \int_{0}^{s} b^{2} (u) d u & = c e^{λ s} - c \end{matrix}$

(50)

By deriving the integral equation according to the variable s, we obtain the following solution

$b^{2} (u) = c λ e^{λ u}$

(51)

Therefore, the covariance function of the forward rates ${F (s, t) : 0 \leq s \leq t}$ , when the rates are stationary, strictly Markov, and satisfy the independent-increments property, can be described with the following set of four parameters ${σ, λ \geq 0, μ \geq \frac{λ}{2}, ν}$ and is of the form

$c o v [F (s_{1}, t_{1}), F (s_{2}, t_{2})] = σ^{2} e^{λ min (s_{1}, s_{2}) + (2 μ - λ) min (t_{1}, t_{2}) - μ (t_{1} + t_{2})}$

(52)

The function of the expected value of the Gaussian random field can be easily derived from the covariance function.

$\begin{matrix} α (s, t) & = ν - σ^{2} (\frac{1}{μ} - e^{- μ (t - s)} (\frac{1}{μ} + \frac{1}{λ - μ}) + e^{- λ (t - s)} \frac{1}{λ - μ}) \end{matrix}$

(53)

3. Parameter Estimation

In finance, where uncertainty reigns supreme and decisions are often made with incomplete information, accurate modeling of interest rate dynamics is paramount. This is where parameter estimation comes into play as a fundamental aspect of financial modeling, particularly in the context of Kennedy-type term structure models. While calibration is a widely adopted practice in finance, parameter estimation also holds significant importance.

The central assumption of these models is that interest rates follow stochastic processes, the parameters of which govern their behavior over time. These parameters determine the shape of the yield curve and influence the pricing of various financial instruments, such as bonds, options, and derivatives. Therefore, obtaining reliable estimates of these parameters is essential for making informed investment decisions, managing risk, and accurately pricing financial products.

Parameter estimation techniques enable practitioners to calibrate these models to observed market data, such as bond prices or interest rate derivatives. Among the most commonly used methods are maximum likelihood estimation (MLE), estimation with probability 1, and Radon–Nikodym derivatives, which allow determining parameter values that maximize the likelihood of observing the given market data under the model assumptions. Through rigorous statistical inference, these techniques provide a systematic framework for extracting information from observed market prices and estimating the underlying dynamics of interest rates.

3.1. Maximum Likelihood Estimations

This section presents the theoretical background of maximum likelihood estimations for Gaussian functionals, drawing on the work of Rozanov and Arató [7,8]. The following definitions, theorems, and theoretical results were published in a conference proceedings in Barcelona, where we gave a presentation [9].

Definition 7

(Gaussian functional). Let

(Ω, A, P)

be a probability space, and let T be a parameter set. In this context,

ξ : Ω \times T \to R

is considered a Gaussian functional if, for any

n \in N

and

c_{1}, \dots, c_{n} \in R

,

t_{1}, \dots, t_{n} \in T

, the following expression is normally distributed:

\sum_{i = 1}^{n} c_{i} ξ_{t_{i}}

(54)

In this case, P is referred to as a Gaussian measure on

(Ω, F_{ξ})

. For simplicity, we can assume that

A = F_{ξ} .

The expected value and the covariance of the Gaussian functional are denoted as follows:

m (t) = E ξ (t), B (s, t) = c o v [ξ (s), ξ (t)]

(55)

It is well-established that two Gaussian measures are either equivalent or orthogonal.

3.1.1. The Case of Different Expected Values

Let

ξ : Ω \times T \to R

be a Gaussian functional. Assume that the expected value of the Gaussian functional under the measure P is 0, while the expected value under the measure

P_{1}

is m.

E_{P} ξ (t) = 0, E_{P_{1}} ξ (t) = m (t), t \in T

(56)

Let U represent the linear space of variables structured in the following manner, while

\bar{U}

is the Hilbert space obtained by closing U:

\sum_{i = 1}^{n} c_{i} ξ_{t_{i}}, n \in N, c_{1}, \dots, c_{n} \in R, t_{1}, \dots, t_{n} \in T .

(57)

In addition, take the following scalar product:

< u, v > = \int_{Ω} u v d P .

(58)

The upcoming theorem can be found in [8].

Theorem 2

(Rozanov). The measures P and

P_{1}

are equivalent if and only if there exists an

η \in \bar{U}

such that

m (t) = \int_{ω} ξ (t) η (t) d P, t \in T .

(59)

The Radon–Nikodym derivative of the two measures in the case of equivalence is given by

\frac{d P_{1}}{d P} = e^{η - \frac{< η, η >}{2}}

(60)

A straightforward consequence of this theorem is the following statement made by Arató [7].

Theorem 3

(Arató). Let

ξ : Ω \times T \to R

be a Gaussian functional. Assume that the expected value of the Gaussian functional under the measure P is 0 and under the measure

P_{1}

is

m \cdot a (t)

. The measures P and

P_{1}

are equivalent if and only if there exists an

η \in \bar{U}

such that

a (t) = \int_{ω} ξ (t) η (t) d P, t \in T .

(61)

The Radon–Nikodym derivative of the two measures in the case of equivalence is given by

\frac{d P_{1}}{d P} = e^{m η - \frac{m^{2} < η, η >}{2}}

(62)

Theorem 4

(Maximum likelihood estimation). Using the notations from the previous theorem, the maximum likelihood estimate of m is given by

\hat{m} = \frac{η}{< η, η >} .

(63)

The estimation is normally distributed and unbiased, and the standard deviation is

D_{P_{1}}^{2} \hat{m} = \frac{1}{< η, η >}

(64)

Proof of Theorem 4.

The form of the estimation is directly obtained from the Radon–Nikodym derivative. To ascertain the expected value and the variance, we must calculate the following expected value if

X \sim N (0, σ^{2})

.

E (X^{k} e^{m X}) = \int_{- \infty}^{\infty} x^{k} e^{m x} \frac{1}{\sqrt{2 π} σ} e^{- \frac{x^{2}}{2 σ^{2}}} d x = e^{\frac{m^{2} σ^{2}}{2}} \int_{- \infty}^{\infty} x^{k} \frac{1}{\sqrt{2 π} σ} e^{- \frac{{(x - m σ^{2})}^{2}}{2 σ^{2}}} d x

(65)

we obtain the following values by calculating the first two moments:

if $k = 1$ , then → $E (X e^{m X}) = m σ^{2} e^{\frac{m^{2} σ^{2}}{2}}$
if $k = 2$ , then → $E (X^{2} e^{m X}) = (σ^{2} + m^{2} σ^{4}) e^{\frac{m^{2} σ^{2}}{2}}$

For the expected value, we obtain the following:

\begin{matrix} E_{P_{m}} \hat{m} & = \frac{1}{< η, η >} \int_{Ω} η d P_{m} = \frac{1}{< η, η >} \int_{Ω} η e^{m η - \frac{m^{2} < η, η >}{2}} d P \end{matrix}

(66)

\begin{matrix} = \frac{1}{< η, η >} e^{- \frac{m^{2} < η, η >}{2}} m < η, η > e^{\frac{m^{2} < η, η >}{2}} = m \end{matrix}

(67)

Similarly to the first moment, we can derive the second moment.

\begin{matrix} E_{P_{m}} {\hat{m}}^{2} & = \frac{1}{< η, η >^{2}} \int_{Ω} η^{2} d P_{m} = \frac{1}{< η, η >^{2}} \int_{Ω} η^{2} e^{m η - \frac{m^{2} < η, η >}{2}} d P \end{matrix}

(68)

\begin{matrix} = \frac{1}{< η, η >^{2}} e^{- \frac{m^{2} < η, η >}{2}} (< η, η > + m^{2} < η, η >^{2}) e^{\frac{m^{2} < η, η >}{2}} = \frac{1}{< η, η >} + m^{2} . \end{matrix}

(69)

The standard deviation can be derived directly from these previous calculations. Given that these are Gaussian functionals, the normality is evident. □

3.1.2. The Case of Constant Expected Value

In cases where the expected value of our Gaussian process is constant,

a (t) = 1

for every

t \in T

. Let

F = σ {(ξ (t) - ξ (s)

),

s, t \in T}

. Fix a point

t_{0} \in T

and define

h (ξ) = E_{P} [ξ (t_{0}) ∣ F)] .

Assuming that

D^{2} [ξ (t_{0}) - h (ξ)] > 0

, the maximum likelihood estimate of m is given by

\tilde{m} = ξ (t_{0}) - h (ξ) .

(70)

Proof.

The proof uses the law of total expectation and the

E_{p} (\tilde{m} (ξ (t_{0}) - h (ξ))) = D_{P}^{2} (\tilde{m})

statement.

\begin{matrix} E_{P} (\tilde{m} ξ (t)) & = E_{P} (\tilde{m} (ξ (t) - ξ (t_{0}) + ξ (t_{0}) - h (ξ) + h (ξ))) \end{matrix}

(71)

\begin{matrix} = E_{P} (\tilde{m} (ξ (t) - ξ (t_{0}) + h (ξ))) + D_{P}^{2} (\tilde{m}) \end{matrix}

(72)

\begin{matrix} = E_{P} (E_{P} ((\tilde{m} (ξ (t) - ξ (t_{0}) + h (ξ)) ∣ F)) + D_{P}^{2} (\tilde{m}) = D_{P}^{2} (\tilde{m}) \end{matrix}

(73)

Based on the previous derivations, the maximum likelihood estimation is the following:

\hat{m} = \frac{\tilde{m} / D_{P}^{2} (\tilde{m})}{D_{P}^{2} (\tilde{m} / D_{P}^{2} (\tilde{m}))}

(74)

□

3.1.3. Some Simple Examples

The established results of various stochastic processes commonly used to model financial processes directly stem from the previous theorems.

For instance, consider a Gaussian process with an expected value of m and the same covariance as the Wiener process over the interval

[a, b]

, where

a > 0

and

a < b

. In this case, the maximum likelihood estimation of the Wiener process is the value of the process at the start,

\tilde{m} = ξ (a) .

Similarly, a stationary Ornstein–Uhlenbeck process can be observed over the interval

[0, T]

. In this case, the value of

λ > 0

is known in advance, as well as the expected value and the covariance matrix of the process.

\begin{matrix} E_{P_{m}} [ξ (t)] & = m \end{matrix}

(75)

\begin{matrix} c o v_{P_{m}} [(ξ (s), ξ (t))] & = σ^{2} e^{- λ | t - s |}, s, t \in [0, T] \end{matrix}

(76)

Therefore, the following covariances can be easily determined:

E_{P} [ξ (0) ξ (t)] = σ^{2} e^{- λ t}, E_{P} [ξ (t) ξ (T)] = σ^{2} e^{- λ (T - t)}

(77)

E_{P} (\int_{0}^{T} ξ (s) d s \cdot ξ (t)) = σ^{2} \frac{2 - e^{- λ t} - e^{- λ (T - t)}}{λ} .

(78)

By leveraging the fact that, in this case, the maximum likelihood estimate is unbiased, we obtain the well-known result from Grenander [10]:

\hat{m} = \frac{ξ (0) + ξ (T) + λ \int_{0}^{T} ξ (s) d s}{2 + λ T} .

(79)

3.2. Parameter Estimations of the Kennedy Field

From now on, we investigate the case when the random field of forward rates is strictly Markov, stationary, and satisfies the property of independent increments. Then, as we have seen previously, the covariance and expected value functions have the form (52) and (53), and these functions are defined by four parameters (

ν, μ, α

and

σ

). Therefore, the expected initial forward curve is easily obtained.

\begin{matrix} α (0, t) & = ν - \frac{σ^{2}}{μ} + \frac{σ^{2}}{μ} e^{- μ t} + \frac{σ^{2}}{λ - μ} e^{- μ t} - \frac{σ^{2}}{λ - μ} e^{- λ t} \end{matrix}

(80)

\begin{matrix} = ν + \frac{σ^{2}}{μ} (e^{- μ t} - 1) + \frac{σ^{2}}{λ - μ} (e^{- μ t} - e^{- λ t}) \end{matrix}

(81)

In addition, from the equation above, it can be straightforwardly seen that the parameter

ν

refers to the expected value of the spot curve.

E F (s, s) = E R (s) = α (s, s) = ν - \frac{σ^{2}}{μ} + \frac{σ^{2}}{μ} + \frac{σ^{2}}{λ - μ} - \frac{σ^{2}}{λ - μ} = ν

(82)

It is evident that the

F (s, s + t)

field constitutes an Ornstein–Uhlenbeck process with respect to the variable s, therefore

c o v [F (s_{1}, s_{1} + t), F (s_{2}, s_{2} + t)] = σ^{2} e^{- λ t} e^{- μ | s_{1} - s_{2} |}

(83)

This implies that if we can observe the

F (s, s + t)

process over an interval defined by s for some specific value of t, then

σ^{2} e^{- λ t} μ

is determined with probability 1. If we can achieve this for two distinct values of t, then both

σ^{2} μ

and

λ

are defined with probability 1.

Examining another covariance from the field,

c o v [F (\frac{log s_{1}}{λ}, t), F (\frac{log s_{2}}{λ}, t)] = σ^{2} e^{- λ t} min (s_{1}, s_{2})

(84)

This means that

σ^{2} e^{- λ t}

is defined with probability 1, which in turn implies that both

σ^{2}

and

μ

are also defined with probability 1.

In the following, we observe the field on a region marked with T. The following

ξ (s, t)

auxiliary random field is introduced, where the expected value under the measure

P_{ν}

is

ν

.

ξ (s, t) = F (s, t) + σ^{2} (\frac{1}{μ} - e^{- μ (t - s)} (\frac{1}{μ} - \frac{1}{λ - μ}) + e^{- λ (t - s)} \frac{1}{λ - μ}) .

(85)

where

W (x_{i}, y_{j}) = \sum_{k = 1}^{i} \sum_{l = 1}^{j} ξ (k, l)

(86)

We demonstrate that the following estimate is the maximum likelihood estimate of this parameter.

\hat{ν} = \frac{\frac{e^{λ b_{1}}}{μ} ξ (a, b_{1}) + \frac{e^{λ b_{2}}}{μ - λ} ξ (a, b_{2}) + \int_{b_{1}}^{b_{2}} e^{λ ν} ξ (a, v) d v}{e^{λ b_{2}} (\frac{1}{λ} + \frac{1}{μ - λ}) + e^{λ b_{1}} (\frac{1}{μ} - \frac{1}{λ})} .

(87)

First, we obtain that

E_{P_{0}} (ξ (s, t) \hat{ν})

gives the same value for every

(s, t) \in T

. On the other hand,

E_{P_{ν}} (\hat{ν}) = ν

. Thus, based on Theorems 2 and 3,

\hat{ν}

is the maximum likelihood estimate.

4. Simulation of the Kennedy Field

This section aims to simulate the Kennedy field in

n \times m

points. We can consider this as an

n \times m

normally distributed vector whose expected value and covariance matrix are known. However, for sufficiently large n and m, simulating a multidimensional, normally distributed vector becomes exceedingly slow, due to the size of the covariance matrix. A much more effective, faster, and simpler way is to observe that if

W (x, y)

is a Brownian sheet, then

α (s, t) + σ e^{- μ t} W (e^{λ s}, e^{(2 μ - λ) t})

forms a Kennedy field with the appropriate covariance structure.

The question is how can we most efficiently generate a Brownian sheet at the

(x_{i}, y_{j})

points

(x_{1} < \dots < x_{n}, y_{1} < \dots < y_{m})

, where the division is not necessarily equidistant. Let us take independent random variables with distributions

N (0, (x_{i} - x_{i - 1}) (y_{j} - y_{j - 1})),

(where x_{0} = y_{0} = 0)

and denote them as

η (i, j)

. Consequently, the Brownian sheet can be expressed in the following form:

W (x_{i}, y_{j}) = \sum_{k = 1}^{i} \sum_{l = 1}^{j} η (k, l)

(88)

Therefore, the subsequent matrix operation should be implemented efficiently to obtain the desired results.

A \to B : B (i, j) = \sum_{k = 1}^{i} \sum_{l = 1}^{j} A (k, l)

(89)

Fortunately, ready-made, fast algorithms exist for this double summation.

In Figure 1, two different realizations of the forward rate field (denoted as

F (s, t), 0 \leq s \leq t

) are depicted, generated using the simulation algorithm described earlier. The figures contain 10,000 simulated points (

100 \times 100 = 10,000

). The higher rates are represented in light yellow, while as the values decrease, they transition into dark blue.

Since the rates have not yet been calibrated to market data, the field exhibits considerable volatility. However, it is noteworthy that the Kennedy model is capable of producing negative forward rate values. It is also important to highlight that the figures clearly show a general increase in forward rates for longer maturities, which is consistent with the empirical observations typically seen in the market.

5. Option Pricing

This section aims to show that fair prices for various financial assets can be derived analytically if we assume that the forward rates evolve according to a Gaussian random field.

5.1. European Caplet

In the case of options, compounded forward rates are used instead of instantaneous forward rates. Consequently, it is necessary to transition from the instantaneous forward rate described earlier by the Kennedy field to a discrete forward rate for a given time period, often denoted as

L (t, T_{i})

, concerning the LIBOR rate. Consistently with the following discretization scheme, the discretized version of the HJM framework, which is considered the industry standard, is the LIBOR market model (LMM).

1 + L (s, t) Δ = e^{\int_{t}^{t + Δ} F (s, u) d u} = e^{Δ F^{Δ} (s, t)}

(90)

This derivation is equivalent to the one derived by Kennedy but uses a different approach. We aim to calculate the price of an interest rate caplet with a strike K for the period from t to

t + Δ

. This can be interpreted as a European option on the forward rate given by

F^{Δ} (s, t) = \frac{1}{Δ} \int_{t}^{t + Δ} F (s, u) d u

which is exercised at time t if

f^{Δ} (t, t) > K

, resulting in a payoff at time

t + Δ

. The payoff function of this transaction is shown below.

V (t, K) = {[(e^{Δ F^{Δ} (t, t)} - 1) - (e^{Δ K} - 1)]}_{+} = {[e^{Δ F^{Δ} (t, t)} - e^{Δ K}]}_{+}

(91)

The discount factor from time s to time t is defined as follows.

D (s, t) = e^{- \int_{s}^{t} r (u) d u}

(92)

A cap normally consists of a series of such options for successive time periods; however, it is sufficient in this context to consider only a single time period. The discounted payoff of the option at time s is given by the following expression:

D (s, t + Δ) V (t, K) = e^{- \int_{s}^{t + Δ} r (u) d u} {(e^{Δ F^{Δ} (t, t)} - e^{Δ K})}_{+}

(93)

The price of a financial asset is obtained by taking the expected value of the discounted payoff function. The definition of the drift term guarantees that the model is under a risk-neutral measure, just like in the Heath–Jarrow–Morton framework.

P_{c a p l e t} (s) = E [e^{- \int_{s}^{t + Δ} r (u) d u} {(e^{Δ F^{Δ} (t, t)} - e^{Δ K})}_{+}]

(94)

For the sake of simplicity, two additional variables are introduced (

ξ (s, t)

and

η (s, t)

) to denote the time range over which the forward rate is integrated.

\begin{matrix} ξ (s, t) & = \int_{s}^{t} r (u) d u = \int_{s}^{t} F (u, u) d u \end{matrix}

(95)

\begin{matrix} η (s, t) & = \int_{s}^{t} F (t, u) d u \end{matrix}

(96)

Hence, in the case of a caplet, we deal with the following special case of

ξ

and

η

.

\begin{matrix} ξ (s, t + Δ) & = \int_{s}^{t + Δ} r (u) d u = \int_{s}^{t + Δ} F (u, u) d u \end{matrix}

(97)

\begin{matrix} η (t, t + Δ) & = Δ F^{Δ} (t, t) = \int_{t}^{t + Δ} F (t, u) d u \end{matrix}

(98)

Due to the properties of the Gaussian random field,

(ξ (s_{1}, t_{1}), η (s_{2}, t_{2}))

follows a multivariate normal distribution. Henceforth, except for necessary cases, we omit the corresponding time indices to indicate the expected value, standard deviation, and correlation between

ξ

and

η

. Consequently, let us denote them with the following notations

E ξ = μ_{1}

,

D^{2} (ξ) = σ_{1}^{2}

,

E η = μ_{2}

, and

D^{2} (η) = σ_{2}^{2}

. From now on, the conditional normal distribution theorem can be used. As a result, the conditional distribution of

ξ

given

η

is the following:

ξ | η \sim N (μ_{1} + ρ σ_{1} \frac{η - μ_{2}}{σ_{2}}, σ_{1}^{2} (1 - ρ^{2}))

(99)

where

c o r r (ξ (s_{1}, t_{1}), η (s_{2}, t_{2})) = ρ (s_{1}, t_{1}, s_{2}, t_{2})

. Therefore, the fair price of the European option can be calculated as follows.

\begin{matrix} E [e^{- \int_{s}^{t + Δ} r (u) d u} {(e^{Δ F^{Δ} (t, t)} - e^{Δ K})}_{+}] = E [e^{- ξ} {(e^{η} - e^{Δ K})}_{+}] \end{matrix}

(100)

\begin{matrix} = E [E (e^{- ξ} {(e^{η} - e^{Δ K})}_{+} | η] = E [{(e^{η} - e^{Δ K})}_{+} \cdot E ((e^{- ξ}) | η)] \end{matrix}

(101)

During the derivations, the law of total expectation and the fact that

{(e^{η} - e^{Δ K})}_{+}

is measurable for

η

is used. As we can see,

ξ \sim N (μ_{1}, σ_{1})

is normally distributed; therefore,

- ξ \sim N (- μ_{1}, σ_{1})

, where

c o r r (- ξ, η) = - ρ

. Therefore,

- ξ | η \sim N (- μ_{1} - ρ σ_{1} \frac{η - μ_{2}}{σ_{2}}, σ_{1}^{2} (1 - ρ^{2}))

. Since the conditional distribution of

- ξ

given

η

is known,

E [e^{- ξ} | η]

can be calculated as the expectation of a lognormally distributed random variable.

\begin{matrix} E [e^{- ξ} | η] = e^{- μ_{1} - ρ σ_{1} \frac{η - μ_{2}}{σ_{2}} + \frac{1}{2} σ_{1}^{2} (1 - ρ^{2})} \end{matrix}

(102)

Returning to the pricing formula,

\begin{matrix} E [{(e^{η} - e^{Δ K})}_{+} \cdot & E [e^{- ξ} | η]] = E [{(e^{η} - e^{Δ K})}_{+} \cdot e^{- μ_{1} - ρ σ_{1} \frac{η - μ_{2}}{σ_{2}} + \frac{1}{2} σ_{1}^{2} (1 - ρ^{2})}] \end{matrix}

(103)

\begin{matrix} = e^{- μ_{1} + \frac{1}{2} σ_{1}^{2} (1 - ρ^{2}) + ρ μ_{2} \frac{σ_{1}}{σ_{2}}} E [{(e^{η} - e^{Δ K})}_{+} \cdot e^{- ρ η \frac{σ_{1}}{σ_{2}}}] \end{matrix}

(104)

\begin{matrix} = e^{- μ_{1} + \frac{1}{2} σ_{1}^{2} (1 - ρ^{2}) + ρ μ_{2} \frac{σ_{1}}{σ_{2}}} \int_{Δ K}^{\infty} (e^{x (1 - ρ μ_{2} \frac{σ_{1}}{σ_{2}})} - e^{Δ K - x ρ \frac{σ_{1}}{σ_{2}}}) \frac{1}{\sqrt{2 π} σ_{2}} e^{\frac{- {(x - μ_{2})}^{2}}{2 σ_{2}^{2}}} d x \end{matrix}

(105)

\begin{matrix} = e^{μ_{2} - μ_{1} + \frac{σ_{1}^{2} + σ_{2}^{2}}{2} - ρ σ_{1} σ_{2}} \int_{Δ K}^{\infty} \frac{1}{\sqrt{2 π} σ_{2}} e^{\frac{- {(x - μ_{2} - σ_{2}^{2} + ρ σ_{1} σ_{2})}^{2}}{2 σ_{2}^{2}}} d x \end{matrix}

(106)

\begin{matrix} - e^{Δ K - μ_{1} + \frac{1}{2} σ_{1}^{2}} \int_{Δ K}^{\infty} \frac{1}{\sqrt{2 π} σ_{2}} e^{\frac{- {(x - μ_{2} + ρ σ_{1} σ_{2})}^{2}}{2 σ_{2}^{2}}} d x \end{matrix}

(107)

Finally, by subtracting the values of the two integrals from each other, we obtain the analytical pricing formula for the European call option in the case of the Kennedy fields.

\begin{matrix} P_{c a p l e t} (s) = e^{μ_{2} - μ_{1} + \frac{1}{2} (σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2})} Φ (\frac{μ_{2} + σ_{2}^{2} - ρ σ_{1} σ_{2} - Δ K}{σ_{2}}) - e^{Δ K - μ_{1} + \frac{1}{2} σ_{1}^{2}} Φ (\frac{μ_{2} - ρ σ_{1} σ_{2} - Δ K}{σ_{2}}) \end{matrix}

(108)

Expected Values and Variances

Based on the calculations in Appendix A.2 for the caplet pricing, the expected value of

ξ

and

η

, their standard deviation, and the correlation between them are as follows.

\begin{matrix} μ_{1} = & E ξ (s, t + Δ) = (ν - \frac{σ^{2}}{μ}) (t + Δ - s) \end{matrix}

(109)

\begin{matrix} μ_{2} = & E η (t, t + Δ) = (ν - \frac{σ^{2}}{μ}) Δ - \frac{σ^{2}}{μ^{2}} (e^{- μ Δ} - 1) - \frac{σ^{2}}{μ (λ - μ)} (e^{- μ Δ} - 1) + \frac{σ^{2}}{λ (λ - μ)} (e^{- λ Δ} - 1) \end{matrix}

(110)

\begin{matrix} σ_{1}^{2} = & D^{2} ξ (s, t + Δ) = \frac{2 σ^{2}}{μ^{2}} ((t + Δ - s) μ + e^{- μ (t + Δ - s)} - 1) \end{matrix}

(111)

\begin{matrix} σ_{2}^{2} = & D^{2} η (t, t + Δ) = \end{matrix}

(112)

\begin{matrix} = & \frac{σ^{2}}{(λ - μ) λ} (e^{- λ Δ} - 1) + \frac{σ^{2}}{μ (μ - λ)} (e^{- μ Δ} - 1) + \frac{σ^{2}}{μ (μ - λ)} (e^{- μ Δ} - e^{- λ Δ}) + \frac{σ^{2}}{λ μ} (1 - e^{- λ Δ}) \end{matrix}

(113)

\begin{matrix} c o v = & c o v (ξ (s, t + Δ), η (t, t + Δ)) \end{matrix}

(114)

\begin{matrix} = & \frac{σ^{2}}{μ^{2}} (1 - e^{- μ Δ} - e^{- μ (t - s)} + e^{- μ (t + Δ - s)}) + \end{matrix}

(115)

\begin{matrix} + (\frac{σ^{2}}{λ (μ - λ)} + \frac{σ^{2}}{λ μ}) (e^{λ Δ} - 1) + \frac{2 σ^{2}}{μ (μ - λ)} (e^{λ Δ - μ Δ} - 1) \end{matrix}

(116)

\begin{matrix} ρ = & c o r r (ξ (s, t + Δ), η (t, t + Δ)) = \frac{c o v (ξ (s, t + Δ), η (t, t + Δ))}{D ξ (s, t + Δ) D η (t, t + Δ)} = \frac{c o v}{σ_{1} σ_{2}} \end{matrix}

(117)

\begin{matrix} c o v (ξ, η) = & \frac{σ^{2}}{μ^{2}} (1 - e^{- μ Δ} - e^{- μ (t - s)} + e^{- μ (t + Δ - s)}) + \end{matrix}

(118)

\begin{matrix} + (\frac{σ^{2}}{λ (μ - λ)} + \frac{σ^{2}}{λ μ}) (e^{λ Δ} - 1) + \frac{2 σ^{2}}{μ (μ - λ)} (e^{λ Δ - μ Δ} - 1) \end{matrix}

(119)

5.2. European Floorlet

Similarly to the previously derived European caplet, the price of an interest rate floorlet is now derived. Thereby, using the put-call parity, the pricing formula of the swap can be easily calculated. The payoff function of this transaction is shown below.

V (t, K) = {[(e^{Δ K} - 1) - (e^{Δ F^{Δ} (t, t)} - 1)]}_{+} = {[e^{Δ K} - e^{Δ F^{Δ} (t, t)}]}_{+}

(120)

The discount factor from time s to time t is defined as follows:

D (s, t) = e^{- \int_{s}^{t} r (u) d u}

. A floorlet typically consists of a series of such options for successive time periods; however, it is sufficient in this context to consider only a single time period. The following expression gives the discounted payoff of the option at time s:

D (s, t + Δ) V (t, K) = e^{- \int_{s}^{t + Δ} r (u) d u} {(e^{Δ K} - e^{Δ F^{Δ} (t, t)})}_{+}

(121)

The price of a financial asset is obtained by taking the expected value of the discounted payoff function. The definition of the drift term guarantees that the model is under a risk-neutral measure, just like in the Heath–Jarrow–Morton framework.

P_{f l o o r l e t} (s) = E [e^{- \int_{s}^{t + Δ} r (u) d u} {(e^{Δ K} - e^{Δ F^{Δ} (t, t)})}_{+}]

(122)

The derivation is exactly the same as the price of the caplet product presented earlier and can be found in Appendix A.3. Therefore, the analytical pricing formula for the European floorlet option in the case of Kennedy fields is as follows.

\begin{matrix} P_{f l o o r l e t} (s) = e^{- μ_{1} + \frac{1}{2} σ_{1}^{2} (1 - ρ^{2})} (e^{Δ K + \frac{1}{2} ρ^{2} σ_{1}^{2}} Φ (\frac{Δ K - μ_{2} + ρ σ_{1} σ_{2}}{σ_{2}}) - e^{μ_{2} + \frac{1}{2} {(σ_{2} - ρ σ_{1})}^{2}} Φ (\frac{Δ K - μ_{2} - σ_{2}^{2} + ρ σ_{1} σ_{2}}{σ_{2}})) \end{matrix}

(123)

\begin{matrix} = e^{Δ K - μ_{1} + \frac{1}{2} σ_{1}^{2}} Φ (\frac{Δ K - μ_{2} + ρ σ_{1} σ_{2}}{σ_{2}}) - e^{μ_{2} - μ_{1} + \frac{1}{2} (σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2})} Φ (\frac{Δ K - μ_{2} - σ_{2}^{2} + ρ σ_{1} σ_{2}}{σ_{2}}) \end{matrix}

(124)

5.3. Swap

An interest rate swap is a forward contract exchanging a floating and fixed rate for a predetermined period. The special financial asset in which the floating versus fixed rate exchange only applies to one period is called a swaplet. In this section, the fair price, and hence the conditional expected value of the discounted payoff function under the risk-neutral measure for one period, is derived.

We first examine the simplest case, a one-period swap. In this case, the interest rate exchange takes place at only one point in time, T in the swap product. In this case, the swap is similar to a caplet with an extreme cap value, where the product is definitely worth calling. The price of the swaplet product at time s is as follows.

P_{s w a p l e t} (s) = E [e^{- \int_{s}^{s + Δ} F (u, u) d u} (e^{\int_{s}^{s + Δ} F (s, u) d u} - e^{Δ K})]

(125)

In this case, the previously introduced

ξ

and

η

are interpreted in the following time period.

\begin{matrix} ξ & (s, s + Δ) = \int_{s}^{s + Δ} F (u, u) d u \end{matrix}

(126)

\begin{matrix} η & (s, s + Δ) = Δ F^{Δ} (s, s) = \int_{s}^{s + Δ} F (s, u) d u \end{matrix}

(127)

As we can see, the definition of

η

is unchanged; therefore, only the value of

μ_{1}

,

σ_{1}

, and the covariance change.

\begin{matrix} μ_{1} = & (ν - \frac{σ^{2}}{μ}) Δ \end{matrix}

(128)

\begin{matrix} σ_{1}^{2} = & D^{2} ξ = \frac{2 σ^{2}}{μ^{2}} (Δ μ + e^{- μ Δ} - 1) \end{matrix}

(129)

\begin{matrix} c o v (ξ, η) = & \frac{σ^{2}}{(λ - μ) λ} (e^{- λ Δ} - 1) + \frac{σ^{2}}{μ (μ - λ)} (e^{- μ Δ} - 1) \end{matrix}

(130)

\begin{matrix} + \frac{σ^{2}}{μ (μ - λ)} (e^{- μ Δ} - e^{- λ Δ}) + \frac{σ^{2}}{λ μ} (1 - e^{- λ Δ}) \end{matrix}

(131)

\begin{matrix} σ_{2}^{2} = & D^{2} η = c o v (ξ, η) = ρ σ_{1} σ_{2} \end{matrix}

(132)

As we can see, in that case, the covariance of

ξ

and

η

equals the variance of

η

.

This calculation is easy to understand if we use the previously introduced multidimensional normally distributed random variables (

ξ

, and

η

) because, in this case,

e^{ξ}

and

e^{η}

random variables are lognormally distributed. It is well-known that the quotient of two lognormally distributed random variables with correlation

ρ

is also lognormally distributed with the following expected value and standard deviation.

ξ \sim N (μ_{1}, σ_{1}), η \sim N (μ_{2}, σ_{2}) \to (η - ξ) \sim N (μ_{2} - μ_{1}, σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2})

(133)

Hence,

\begin{matrix} E (e^{- ξ}) & = e^{- μ_{1} + \frac{1}{2} σ_{1}^{2}} \end{matrix}

(134)

\begin{matrix} E (e^{- ξ} e^{η}) & = E (\frac{e^{η}}{e^{ξ}}) = E (e^{η - ξ}) = e^{μ_{2} - μ_{1} + \frac{1}{2} (σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2})} \end{matrix}

(135)

Therefore, we can easily obtain the previously calculated result.

P_{s w a p l e t} (s) = e^{μ_{2} - μ_{1} + \frac{1}{2} (σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2})} - e^{Δ K - μ_{1} + \frac{1}{2} σ_{1}^{2}}

(136)

Furthermore, the price of a one-period long swap, the so-called swaplet at time s, can be easily obtained using the previously derived caplet and floorlet pricing formulas and the put-call parity. Therefore, this is he difference between the calculated fair price of the caplet and the floorlet option.

The fair price of a fixed vs. floating swap for more time periods at time 0 can be found in Appendix A.2.

5.4. Par Swap Rate

In the previous section, we derived the fair price of a one-period swap, the so-called swaplet.

P_{s w a p l e t} (0) = e^{μ_{2} - μ_{1} + \frac{1}{2} (σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2})} - e^{Δ K - μ_{1} + \frac{1}{2} σ_{1}^{2}}

(137)

Therefore, let us first adjust the previously defined

ξ

and

η

variables to the following time periods.

\begin{matrix} ξ & (s, s + Δ) = \int_{s}^{s + Δ} F (u, u) d u \end{matrix}

(138)

\begin{matrix} η & (s, s + Δ) = Δ F^{Δ} (s, s) = \int_{s}^{s + Δ} F (s, u) d u \end{matrix}

(139)

The so-called swap quote can be easily expressed from that equality, which equals the par swap rate. The par rate is the fixed rate that results in the swap having a zero present value, meaning it is the rate that equates the value of both legs of the swap. The derivation of this rate is important because, in many cases, the financial data contain par swap rates instead of swap prices; in other words, this financial product is quoted using the par swap rate.

\begin{matrix} P_{s w a p l e t} (0) = 0 & = e^{μ_{2} - μ_{1} + \frac{1}{2} (σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2})} - e^{Δ K - μ_{1} + \frac{1}{2} σ_{1}^{2}} \end{matrix}

(140)

\begin{matrix} e^{μ_{2} - μ_{1} + \frac{1}{2} (σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2})} & = e^{Δ K - μ_{1} + \frac{1}{2} σ_{1}^{2}} \end{matrix}

(141)

\begin{matrix} μ_{2} - μ_{1} + \frac{1}{2} (σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2}) & = Δ K - μ_{1} + \frac{1}{2} σ_{1}^{2} \end{matrix}

(142)

\begin{matrix} μ_{2} + \frac{1}{2} σ_{2}^{2} - ρ σ_{1} σ_{2} & = Δ K \end{matrix}

(143)

\begin{matrix} K & = \frac{1}{Δ} (μ_{2} + \frac{1}{2} σ_{2}^{2} - ρ σ_{1} σ_{2}) \end{matrix}

(144)

After that, we want to express the par swap rate with the original parameters of the Kennedy field.

\begin{matrix} Δ K = & μ_{2} + \frac{1}{2} σ_{2}^{2} - ρ σ_{1} σ_{2} = μ_{2} + \frac{1}{2} σ_{2}^{2} - σ_{2}^{2} = μ_{2} - \frac{1}{2} σ_{2}^{2} \end{matrix}

(145)

\begin{matrix} = & (ν - \frac{σ^{2}}{μ}) Δ - \frac{σ^{2}}{μ^{2}} (e^{- μ Δ} - 1) - \frac{σ^{2}}{μ (λ - μ)} (e^{- μ Δ} - 1) + \frac{σ^{2}}{λ (λ - μ)} (e^{- λ Δ} - 1) \end{matrix}

(146)

\begin{matrix} - \frac{σ^{2}}{2 (λ - μ) λ} (e^{- λ Δ} - 1) + \frac{σ^{2}}{2 μ (λ - μ)} (e^{- μ Δ} - 1) + \frac{σ^{2}}{2 μ (λ - μ)} (e^{- μ Δ} - e^{- λ Δ}) + \frac{σ^{2}}{2 λ μ} (e^{- λ Δ} - 1) \end{matrix}

(147)

\begin{matrix} = & (ν - \frac{σ^{2}}{μ}) Δ + (e^{- μ Δ} - 1) (- \frac{σ^{2}}{μ^{2}} - \frac{σ^{2}}{μ (λ - μ)} + \frac{σ^{2}}{2 μ (λ - μ)} + \frac{σ^{2}}{2 μ (λ - μ)}) \end{matrix}

(148)

\begin{matrix} + (e^{- λ Δ} - 1) (\frac{σ^{2}}{λ (λ - μ)} + \frac{σ^{2}}{2 λ (λ - μ)} + \frac{σ^{2}}{2 λ μ} - \frac{σ^{2}}{2 μ (λ - μ)}) \end{matrix}

(149)

\begin{matrix} = & (ν - \frac{σ^{2}}{μ}) Δ + \frac{σ^{2}}{μ^{2}} (1 - e^{- μ Δ}) - (\frac{σ^{2}}{2 λ (λ - μ)} + \frac{σ^{2}}{2 μ λ} - \frac{σ^{2}}{2 μ (λ - μ)}) (1 - e^{- λ Δ}) \end{matrix}

(150)

\begin{matrix} = & (ν - \frac{σ^{2}}{μ}) Δ + \frac{σ^{2}}{μ^{2}} (1 - e^{- μ Δ}) \end{matrix}

(151)

From here, the par swap rate can be easily written with the original parameters of the Kennedy field.

K = ν - \frac{σ^{2}}{μ} + \frac{σ^{2}}{μ^{2} Δ} (1 - e^{- μ Δ})

(152)

Therefore, if the swap par rate can be observed for at least four different tenors, then three parameters of the original four (

ν, μ

, and

σ

) can be determined with a probability of 1. However, it is worth mentioning that the parameter

λ

is omitted from the description of the par swap rate.

6. Calibration on Simulated Data

Simulated financial caplet, floorlet, and swaplet prices were generated using the previously derived analytical pricing formulas with different maturities and strikes. Therefore, first of all, we just wanted to test the punctuality of the calibration engine.

Numerical calibration is an extreme value optimization problem. The method aimed to find the parameter set that minimized the squared deviation error between the previously generated financial caplet, floorlet, and swaplet data and the analytically calculated prices with the calibrated parameters. The calibration engine was based on the stochastic gradient descent method. The extreme value optimization was based on the article of Mikhaliov and Nögel, and the implementation in Python was based on the work of Emerick and Tatsat [11,12].

The Figure 2 shows that the prices calculated with the back-estimated parameters fit our synthetic dataset almost perfectly. The calibration returned the used parameters; the difference was negligible and could be considered a numerical error.

7. Calibration on Real Data

A time series of swap par rates was obtained with the help of the Bloomberg terminal. The financial dataset contains the par swap rates of the USD SOFR fixed versus floating interest rate swaplet from July 2018 to April 2023. The historical dataset includes par swap rates for 28 different maturities daily.

We calibrated the model daily for different maturities for par swap rates with the calibration algorithm going 100 days back. As we can see in Figure 3, the Kennedy field fit the dataset nicely; however, it slightly overestimated the values for shorter maturities, while slightly underestimating the par swap rates at long maturities.

In addition to the analytical results, it can also be seen that

λ

did not play a role in the numerical implementation, since this back-estimated parameter value is highly volatile. Meanwhile, the value of the other three parameters varied on a much smaller scale, and similar trends could be observed in Figure 4.

As a result, our guess is that the

λ

parameter, which is not included in the par swap rate, describes a temporal relationship; in other words, the term structure of the model;

σ

, greatly influences the standard deviation of the field; while

ν

is used to describe the level of the yield curves, since

ν

is the parameter that describes the expected value of the spot rate (

ν = E F (s, s))

.

In the following, we plotted Kennedy fields (shown in Figure 5) for describing forward interest rates with the three parameters back-estimated from the par swap rate dataset (

ν = 0.05171817

,

μ = 0.56028928

and

σ = 0.11315586

) and three different

λ

values, to see what rates would be generated in a realistic case.

8. Discussion

Kennedy introduced a model based on Gaussian random fields for modeling forward interest rates in the 1990s, which, due to its normal distribution, can generate negative interest rates. At that time, this scenario was not considered feasible; however, in the interest rate environment of the 2010s, negative interest rates emerged. Since this model naturally handles them, this underscored the relevance of the model. Calibration on actual par swap rates demonstrated that our model fits well with the current interest rate environment and effectively describes the market.

Moving forward, our primary objective is to go beyond analytical pricing formulas and utilize models based on artificial intelligence, including LSTM and neural networks, for parameter estimation. We aim to compare the accuracy of parameters recovered through calibration with these AI-based models. Additionally, we plan to investigate the temporal stability of parameters and compare them with industry-standard models such as SABR.

9. Conclusions

Our article focused on the mathematical model based on Gaussian random fields introduced by Kennedy to describe forward interest rates. Among other things, we provided a novel proof for the equivalence of conditions regarding the martingale property of discounted bond prices. We demonstrated the relationship between the Kennedy model and the HJM framework in special cases (Markov property, stationarity). Additionally, utilizing Radon–Nikodym derivatives, we derived maximum likelihood estimates and estimates with a probability of one for the original parameters of the field. We presented a new, efficient method, based on Brownian sheets, to simulate the Kennedy field.

Subsequently, we derived analytical pricing formulas resembling Black–Scholes for various financial products, including caplets, floorlets, swaplets, and swaps. Finally, we calibrated the field using a numerical extreme value search algorithm based on stochastic gradient descent on a simulated synthetic dataset to recover the original parameters. We then calibrated it on actual par swap rates to examine how our model performed in a market environment.

In summary, we can conclude that a new result has been derived in estimating the parameters of the Kennedy field, leading to the development of an effective calibrator. This serves as a strong foundation for further investigation and comparison with other, more complex models.

Additionally, our research aims to calibrate the parameters of negative interest rate models using machine learning algorithms, enabling us to compare these results with previously derived analytical estimates.

Author Contributions

Conceptualization, M.A.; methodology, M.A. and D.T.-L.; software, D.T.-L.; validation, M.A.; formal analysis, D.T.-L.; investigation, D.T.-L.; resources, M.A.; data curation, D.T.-L.; writing—original draft preparation, D.T.-L.; writing—review and editing, M.A.; visualization, D.T.-L.; supervision, M.A.; project administration, M.A.; funding acquisition, M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the KDP-2021 program and the ELTE TKP 2021-NKTA-62 funding scheme of the Ministry of Innovation and Technology from the source of the National Research, Development and Innovation Fund.

Data Availability Statement

Data used in this study include historical par swap rates of the USD SOFR fixed vs. floating interest rate swaps (Bloomberg ticker: USOSFR1Z BGN Curncy) obtained from Bloomberg. The data spans from 20 April 2018 to 20 April 2023, with daily frequency in USD, sourced from BGN. Due to proprietary restrictions, these data are not publicly available. Access to the data is subject to Bloomberg’s terms and conditions and is not shared openly. For more information, please contact the authors.

Acknowledgments

We would like to take this opportunity to thank Csaba Kőrössy, for his help and dedication to the research. Even though he joined the research late on, he spent much time understanding it, and his comments and suggestions immensely helped this article. We want to thank Fáth Gábor, who involved us in Risklab and since then has regularly consulted, given ideas, and helped on this topic, in which he also has great expertise. Finally, we would like to thank András Ványolos for his help, interest, and the enthusiasm with which he joined our project; who is motivated simply by his love for mathematical research. We enjoy doing math with you.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A.1

The proof of Theorem 1 can be found in the original article by Kennedy [1]. However, we present an alternative derivation to prove the statement. To facilitate this, we first introduce an auxiliary lemma, which will be utilized in the proof of the theorem.

Lemma A1.

{X, ξ (u), u \in U}

is a Gaussian system, where

D^{2} ξ (u) > 0

for every

u \in U

. Then, the following statements are equivalent:

(a): $E (e^{X} | ξ (u), u \in U) = 1$ and
(b): X and $ξ (u)$ are independent for all $u \in U$ and $E X + \frac{1}{2} D^{2} X = 0$ .

Proof of Lemma A1.

The equivalence is proven by deriving both directions from each other.

$(a)$ ⟹ $(b)$
First, we prove the direction of the (a) to (b). If $E (e^{X} | ξ (u), u \in U) = e^{0} = 1$ is true, then it can be stated that

$\begin{matrix} e^{E (X | ξ (v)) + \frac{1}{2} D^{2} (X | ξ (v))} = E (e^{X} | ξ (v)) = E (E (e^{X} | ξ (u), u \in U) | ξ (v)) = 1 \end{matrix}$

(A1)

Thus,

$\begin{matrix} 0 & = E (X | ξ (v)) + \frac{1}{2} D^{2} (X | ξ (v)) \end{matrix}$

(A2)

$\begin{matrix} = E X + corr (X, ξ (v)) \frac{D X}{D ξ (v)} (ξ (v) - E ξ (v)) + \frac{1}{2} D^{2} X (1 - corr {(X, ξ (v))}^{2}) \end{matrix}$

(A3)

Therefore, we can conclude that $corr (X, ξ (v)) = 0$ and $E X + \frac{1}{2} D^{2} X = 0$ .
$(b)$ ⟹ $(a)$
Then, we deduce that if (b) is fulfilled, then statement (a) is also true. If X and $ξ (u)$ are independent for all $u \in U$ and $E X + \frac{1}{2} D^{2} X = 0$ , then

$E (e^{X} | ξ (u), u \in U) = E (e^{X}) = e^{E X + \frac{1}{2} D^{2} X} = 1$

(A4)

□

Proof of Theorem 1.

It is obvious that

R (u)

, where

0 \leq u \leq s

, and

P (s, t)

are

F (s) -

measurable random variables, just as

Z (s, t)

. Therefore, it can be stated that there is no problem with the existence of expected values. The equivalence of the statements is proved circularly.

$(a)$ ⟹ $(b)$
Let us start with the statement that the discounted bond price is a martingale. Hence,

$\begin{matrix} Z (s, t) & = E [Z (t, t) | F (s)] = E [e^{- \int_{0}^{t} R (u) d u} | F (s)] \end{matrix}$

(A5)

$\begin{matrix} ⟹ P (s, t) & = e^{\int_{0}^{s} R (u) d u} Z (s, t) = E [e^{- \int_{s}^{t} R (u) d u} | F (s)] \end{matrix}$

(A6)

From statement (a), we quickly deduced that the discount factor occurs in the given form.
$(b)$ ⟹ $(c)$
Henceforth, we derive the drift term from the discount factor

$\begin{matrix} E [e^{\int_{s}^{t} (F (s, u) - R (u)) d u} | F (s)] = 1 \end{matrix}$

(A7)

According to Lemma A1, this is equivalent to the fact that $ξ (s, t)$ and $F (v_{1}, v_{2})$ are independent and $E ξ (s, t) + \frac{1}{2} D^{2} ξ (s, t) = 0$ , where $v_{1} \leq s, v_{1} \leq v_{2},$ $ξ (s, t) = \int_{s}^{t} (F (s, u) - R (u)) d u$ . Since we are dealing with Gaussian variables, it is sufficient to examine the covariance.

$\begin{matrix} cov (F (s, u) - R (u), F (v_{1}, v_{2})) = c (s \land v_{1}, u, v_{2}) - c (u \land v_{1}, u, v_{2}) = c (v_{1}, u, v_{2}) - c (v_{1}, u, v_{2}) = 0 \end{matrix}$

(A8)

Since $ξ (s, t)$ is equal to $\int_{s}^{t} (F (s, u) - R (u)) d u$ , therefore $ξ (s, t)$ and $F (v_{1}, v_{2})$ are independent.

$\begin{matrix} E ξ (s, t) = \int_{s}^{t} (α (s, u) - α (u, u)) d u \end{matrix}$

(A9)

The variance is a bit more complicated to calculate.

$\begin{matrix} D^{2} ξ (s, t) & = \int_{s}^{t} \int_{s}^{t} c (u \land v, u, v) d v d u + \int_{s}^{t} \int_{s}^{t} c (s, u, v) d v d u - 2 \int_{s}^{t} \int_{s}^{t} c (s, u, v) d v d u \end{matrix}$

(A10)

$\begin{matrix} = \int_{s}^{t} \int_{s}^{t} (c (u \land v, u, v) - c (s, u, v)) d v d u \end{matrix}$

(A11)

Let us apply the Leibniz integral rule to the following function.

$\begin{matrix} f (t) & = E ξ (s, t) + \frac{1}{2} D^{2} ξ (s, t) = \int_{s}^{t} (α (s, u) - α (u, u)) d u + \frac{1}{2} \int_{s}^{t} \int_{s}^{t} (c (u \land v, u, v) - c (s, u, v)) d v d u \end{matrix}$

(A12)

$\begin{matrix} \frac{d f}{d t} & = α (s, t) - α (t, t) + \frac{1}{2} (\int_{s}^{t} (c (t \land v, t, v) - c (s, t, v)) d v + \int_{s}^{t} (c (u \land t, u, t) - c (s, u, t)) d u), \end{matrix}$

(A13)

from which

$\begin{matrix} \frac{d f}{d t} = α (s, t) - α (t, t) + \int_{s}^{t} (c (v, v, t) - c (s, v, t)) d v . \end{matrix}$

(A14)

Since $f (s) = 0$ , thus

$\begin{matrix} E ξ (s, t) + \frac{1}{2} D^{2} ξ (s, t) = 0, \forall 0 \leq s \leq t \Leftrightarrow α (s, t) = α (t, t) + \int_{s}^{t} [c (s, v, t) - c (v, v, t)] d v, \end{matrix}$

(A15)

for all $0 \leq s \leq t$ . Using Remark 1, we have established the implication from (b) to (c).
$(c)$ ⟹ $(a)$
First, we show that part (b) of the theorem is satisfied by showing that the drift term has the form (c), and this is sufficient because part (b) immediately demonstrates that $Z (s, t)$ is a regular martingale. It can be easily seen that in Lemma A1, using the previous notations, $ξ (s, t) = \int_{s}^{t} (F (s, u) - R (u)) d u$ and $F (v_{1}, v_{2})$ are independent. During the derivations, Remark 1 is also used.

$\begin{matrix} E ξ (s, t) = & \int_{s}^{t} α (s, u) - α (u, u) d u = \int_{s}^{t} α (u, u) - \int_{s}^{u} c (s, v, u) - c (v, v, u) d v - α (u, u) d u \end{matrix}$

(A16)

$\begin{matrix} = & \int_{s}^{t} \int_{s}^{u} c (s, v, u) - c (v, v, u) d v d u \end{matrix}$

(A17)

$\begin{matrix} D^{2} ξ (s, t) = & \int_{s}^{t} \int_{s}^{t} (c (u \land v, u, v) - c (s, u, v)) d v d u \end{matrix}$

(A18)

$\begin{matrix} = & \int_{s}^{t} \int_{s}^{u} (c (v, u, v) - c (s, u, v)) d v d u + \int_{s}^{t} \int_{u}^{t} (c (u, u, v) - c (s, u, v)) d v d u \end{matrix}$

(A19)

Let us apply the Leibniz rule again for the following

f (t)

function using the fact that the covariance function

c (s_{1} \land s_{2}, t_{1}, t_{2})

is symmetric in

t_{1}

and

t_{2}

.

\begin{matrix} f (t) = & E ξ (s, t) + \frac{1}{2} D^{2} ξ (s, t) = \int_{s}^{t} \int_{s}^{u} c (s, v, u) - c (v, v, u) d v d u \end{matrix}

(A20)

\begin{matrix} + \frac{1}{2} \int_{s}^{t} \int_{s}^{u} (c (v, u, v) - c (s, u, v)) d v d u + \frac{1}{2} \int_{s}^{t} \int_{u}^{t} (c (u, u, v) - c (s, u, v)) d v d u \end{matrix}

(A21)

\begin{matrix} = & \frac{1}{2} \int_{s}^{t} \int_{s}^{u} c (s, u, v) - c (v, u, v) d v d u + \frac{1}{2} \int_{s}^{t} \int_{u}^{t} (c (u, u, v) - c (s, u, v)) d v d u \end{matrix}

(A22)

\begin{matrix} = & \frac{1}{2} \int_{s}^{t} \int_{s}^{t} c (s, u, v) - c (v, u, v) d v d u - \frac{1}{2} \int_{s}^{t} \int_{s}^{t} (c (u, u, v) - c (s, u, v)) d v d u \end{matrix}

(A23)

\begin{matrix} = & \frac{1}{2} \int_{s}^{t} \int_{s}^{t} c (s, u, v) - c (v, u, v) - c (u, u, v) + c (s, u, v) d v d u = 0 \end{matrix}

(A24)

Finally, the theorem is proved. □

Appendix A.2

This subsection calculates the expected value and standard deviation of the expressions previously marked with

ξ (s, t)

and

η (s, t)

and their correlation.

\begin{matrix} ξ (s, t) & = \int_{s}^{t} F (u, u) d u = \int_{s}^{t} r (u) d u \end{matrix}

(A25)

\begin{matrix} μ_{1} (s, t) & = E ξ (s, t) = E \int_{s}^{t} F (u, u) d u = \int_{s}^{t} E F (u, u) d u \end{matrix}

(A26)

\begin{matrix} = \int_{s}^{t} ν - \frac{σ^{2}}{μ} + \frac{σ^{2}}{μ} e^{- μ (u - u)} + \frac{σ^{2}}{λ - μ} e^{- μ (u - u)} - \frac{σ^{2}}{λ - μ} e^{- λ (u - u)} d u \end{matrix}

(A27)

\begin{matrix} = \int_{s}^{t} ν - \frac{σ^{2}}{μ} d u = (ν - \frac{σ^{2}}{μ}) (t - s) \end{matrix}

(A28)

Now, let us move on to the expected value of

η (s, t)

. The steps of the derivation are similar to what we have seen above.

\begin{matrix} η (s, t) & = \int_{s}^{t} F (s, u) d u \end{matrix}

(A29)

\begin{matrix} μ_{2} (s, t) & = E η (s, t) = E \int_{s}^{t} F (s, u) d u = \int_{s}^{t} E F (s, u) d u \end{matrix}

(A30)

\begin{matrix} = \int_{s}^{t} ν - \frac{σ^{2}}{μ} + \frac{σ^{2}}{μ} e^{- μ (u - s)} + \frac{σ^{2}}{λ - μ} e^{- μ (u - s)} - \frac{σ^{2}}{λ - μ} e^{- λ (u - s)} d u \end{matrix}

(A31)

\begin{matrix} = \int_{s}^{t} ν - \frac{σ^{2}}{μ} + \frac{σ^{2}}{μ} e^{- μ u + μ s} + \frac{σ^{2}}{λ - μ} e^{- μ u + μ s} - \frac{σ^{2}}{λ - μ} e^{- λ u + λ s} d u \end{matrix}

(A32)

\begin{matrix} = {[ν u - \frac{σ^{2}}{μ} u - \frac{σ^{2}}{μ^{2}} e^{- μ u + μ s} - \frac{σ^{2}}{μ (λ - μ)} e^{- μ u + μ s} + \frac{σ^{2}}{λ (λ - μ)} e^{- λ u + λ s}]}_{u = s}^{u = t} \end{matrix}

(A33)

\begin{matrix} = (ν - \frac{σ^{2}}{μ}) (t - s) - \frac{σ^{2}}{μ^{2}} (e^{- μ (t - s)} - 1) - \frac{σ^{2}}{μ (λ - μ)} (e^{- μ (t - s)} - 1) + \frac{σ^{2}}{λ (λ - μ)} (e^{- λ (t - s)} - 1) \end{matrix}

(A34)

After the derivation of the expected values, the standard deviation of the expressions marked with

ξ (s, t)

and

η (s, t)

are calculated.

\begin{matrix} c o v [F (u, u), F (v, v)] & = σ^{2} exp {2 μ min (u, v) - μ (u + v)} = σ^{2} e^{- μ | v - u |} \end{matrix}

(A35)

\begin{matrix} D^{2} ξ (s, t) & = \int_{s}^{t} \int_{s}^{t} σ^{2} e^{- μ | v - u |} d u d v \end{matrix}

(A36)

At first, let us deal with the inner integral, then move on to the outer integral.

\begin{matrix} \int_{s}^{t} e^{- μ | v - u |} d u = \int_{s}^{v} e^{μ u} e^{- μ v} d u + \int_{v}^{t} e^{μ v} e^{- μ u} d u = e^{- μ v} {[\frac{e^{μ u}}{μ}]}_{u = s}^{u = v} + e^{μ v} {[\frac{e^{- μ u}}{- μ}]}_{u = v}^{u = t} \end{matrix}

(A37)

\begin{matrix} = \frac{1}{μ} (1 - e^{- μ (v - s)} - e^{- μ (t - v)} + 1) = \frac{1}{μ} (2 - e^{- μ (v - s)} - e^{- μ (t - v)}) \end{matrix}

(A38)

\begin{matrix} \frac{σ^{2}}{μ} \int_{s}^{t} (2 - e^{- μ (v - s)} - e^{- μ (t - v)}) d v = \frac{σ^{2}}{μ} (2 (t - s) - e^{μ s} {[\frac{e^{- μ v}}{- μ}]}_{v = s}^{v = t} - e^{- μ t} {[\frac{e^{μ v}}{μ}]}_{v = s}^{v = t}) \end{matrix}

(A39)

\begin{matrix} = \frac{σ^{2}}{μ} (2 (t - s) + \frac{1}{μ} (e^{- μ (t - s)} - 1) - \frac{1}{μ} (1 - e^{- μ (t - s)})) = \frac{σ^{2}}{μ^{2}} (2 μ (t - s) + 2 e^{- μ (t - s)} - 2) \end{matrix}

(A40)

\begin{matrix} D^{2} ξ (s, t) = \frac{2 σ^{2}}{μ^{2}} (μ (t - s) + e^{- μ (t - s)} - 1) \end{matrix}

(A41)

The variance of

η (s, t)

can be similarly derived.

\begin{matrix} c o v [F (s, u), F (s, v)] & = σ^{2} exp {λ s + (2 μ - λ) min (u, v) - μ (u + v)} \end{matrix}

(A42)

\begin{matrix} D^{2} η (s, t) & = \int_{s}^{t} \int_{s}^{t} σ^{2} e^{λ s + (2 μ - λ) min (u, v) - μ (u + v)} d u d v \end{matrix}

(A43)

Similarly to the previous calculation, the derivation starts by calculating the inner integral without the

σ^{2}

multiplier.

\begin{matrix} \int_{s}^{t} e^{λ s + (2 μ - λ) min (u, v) - μ (u + v)} d u = \int_{s}^{v} e^{λ s + (2 μ - λ) u - μ u - μ v} d u + \int_{v}^{t} e^{λ s + (2 μ - λ) v - μ u - μ v} d u \end{matrix}

(A44)

\begin{matrix} = \int_{s}^{v} e^{λ s + (μ - λ) u - μ v} d u + \int_{v}^{t} e^{λ s + (μ - λ) v - μ u} d u = e^{λ s - μ v} \int_{s}^{v} e^{(μ - λ) u} d u + e^{λ s} \int_{v}^{t} e^{μ v - λ v - μ u} d u \end{matrix}

(A45)

\begin{matrix} = e^{λ s - μ v} {[\frac{e^{(μ - λ) u}}{μ - λ}]}_{u = s}^{u = v} + e^{λ s + μ v - λ v} {[\frac{e^{- μ u}}{- μ}]}_{u = v}^{u = t} \end{matrix}

(A46)

\begin{matrix} = e^{λ s - μ v} \frac{1}{μ - λ} (e^{μ v - λ v} - e^{μ s - λ s}) + e^{λ s + μ v - λ v} \frac{1}{μ} (e^{- μ v} - e^{- μ t}) \end{matrix}

(A47)

\begin{matrix} = \frac{1}{μ - λ} e^{λ s - λ v} - \frac{1}{μ - λ} e^{μ s - μ v} - \frac{1}{μ} e^{λ s + (μ - λ) v - μ t} + \frac{1}{μ} e^{λ s - λ v} \end{matrix}

(A48)

Now, let us move to the outer integral per term.

\begin{matrix} ① & = \frac{1}{μ - λ} e^{λ s} \int_{s}^{t} e^{- λ v} d v = \frac{e^{λ s}}{μ - λ} {[\frac{e^{- λ v}}{- λ}]}_{v = s}^{t} = \frac{e^{λ s}}{(λ - μ) λ} (e^{- λ t} - e^{- λ s}) \end{matrix}

(A49)

\begin{matrix} = \frac{1}{(λ - μ) λ} (e^{- λ (t - s)} - 1) \end{matrix}

(A50)

\begin{matrix} ② & = \frac{- 1}{μ - λ} e^{μ s} \int_{s}^{t} e^{- μ v} d v = \frac{- 1}{μ - λ} e^{μ s} {[\frac{e^{- μ v}}{- μ}]}_{u = s}^{t} = \frac{1}{μ (μ - λ)} e^{μ s} (e^{- μ t} - e^{- μ s}) \end{matrix}

(A51)

\begin{matrix} = \frac{1}{μ (μ - λ)} (e^{- μ (t - s)} - 1) \end{matrix}

(A52)

\begin{matrix} ③ & = \int_{s}^{t} \frac{- 1}{μ} e^{λ s - μ t} e^{(μ - λ) v} d v = \frac{1}{μ (μ - λ)} e^{λ s - μ t} (e^{(μ - λ) s} - e^{(μ - λ) t}) \end{matrix}

(A53)

\begin{matrix} = \frac{1}{μ (μ - λ)} (e^{- μ (t - s)} - e^{- λ (t - s)}) \end{matrix}

(A54)

\begin{matrix} ④ & = \int_{s}^{t} \frac{1}{μ} e^{λ s - λ v} d v = \frac{1}{μ} e^{λ s} \int_{s}^{t} e^{- λ v} d v = \frac{- 1}{λ μ} e^{λ s} (e^{- λ t} - e^{- λ s}) = \frac{1}{λ μ} (1 - e^{- λ (t - s)}) \end{matrix}

(A55)

Therefore, by adding the

σ^{2}

multiplier, we obtain the variance of

η (s, t)

.

D^{2} η (s, t) = \frac{σ^{2}}{(λ - μ) λ} (e^{- λ (t - s)} - 1) + \frac{σ^{2}}{μ (μ - λ)} (2 e^{- μ (t - s)} - e^{- λ (t - s)} - 1) + \frac{σ^{2}}{λ μ} (1 - e^{- λ (t - s)})

(A56)

The last variable to be calculated is

ρ (s_{1}, s_{2}, t_{1}, t_{2})

, indicating the correlation between

ξ (s_{1}, t_{1}) = \int_{s_{1}}^{t_{1}} F (u, u) d u

and

η (s_{2}, t_{2}) = \int_{s_{2}}^{t_{2}} F (s_{2}, v) d v

. However, for all financial products used in the article, the values of

t_{1}

and

t_{2}

were equal, denoted by t. Furthermore, we can assume that

s_{1} \leq s_{2}

, since

s_{1}

represents the time at which we discount, the time at which we want the value of the financial product, while

s_{2}

is the starting time of the transaction, which can start now or even later. Thus, let us suppose that

s_{1} < s_{2} < t

.

\begin{matrix} c o v [F (u, u), F (s_{2}, v)] & = σ^{2} exp {λ min (u, s_{2}) + (2 μ - λ) min (u, v) - μ (u + v)} \end{matrix}

(A57)

\begin{matrix} c o v (ξ (s_{1}, t), η (s_{2}, t)) & = \int_{s_{1}}^{t} \int_{s_{2}}^{t} σ^{2} exp {λ min (u, s_{2}) + (2 μ - λ) min (u, v) - μ (u + v)} d v d u \end{matrix}

(A58)

\begin{matrix} = \int_{s_{1}}^{s_{2}} \int_{s_{2}}^{t} σ^{2} exp {λ u + (2 μ - λ) u - μ (u + v)} d v d u \end{matrix}

(A59)

\begin{matrix} + \int_{s_{2}}^{t} \int_{s_{2}}^{t} σ^{2} exp {λ t + (2 μ - λ) min (u, v) - μ (u + v)} d v d u \end{matrix}

(A60)

The two terms of the summation are calculated separately.

\begin{matrix} ① = & σ^{2} \int_{s_{1}}^{s_{2}} \int_{s_{2}}^{t} exp {μ u - μ v} d v d u = σ^{2} \int_{s_{1}}^{s_{2}} e^{- μ u} d u \int_{s_{2}}^{t} e^{- μ v} d v \end{matrix}

(A61)

\begin{matrix} = & σ^{2} {[\frac{e^{μ u}}{μ}]}_{u = s_{1}}^{s_{2}} {[\frac{e^{- μ v}}{- μ}]}_{v = s_{2}}^{t} = \frac{σ^{2}}{μ^{2}} (e^{μ s_{2}} - e^{μ s_{1}}) (e^{- μ s_{2}} - e^{- μ t}) \end{matrix}

(A62)

\begin{matrix} = & \frac{σ^{2}}{μ^{2}} (1 - e^{- μ (t - s_{2})} - e^{- μ (s_{2} - s_{1})} + e^{- μ (t - s_{1})}) \end{matrix}

(A63)

\begin{matrix} ② = & σ^{2} \int_{s_{2}}^{t} \int_{s_{2}}^{t} exp {λ t + (2 μ - λ) min (u, v) - μ (u + v)} d v d u \end{matrix}

(A64)

\begin{matrix} = & σ^{2} \int_{s_{2}}^{t} \int_{s_{2}}^{u} exp {λ t + (2 μ - λ) min (u, v) - μ (u + v)} d v d u \end{matrix}

(A65)

\begin{matrix} + σ^{2} \int_{s_{2}}^{t} \int_{u}^{t} exp {λ t + (2 μ - λ) min (u, v) - μ (u + v)} d v d u \end{matrix}

(A66)

\begin{matrix} = & σ^{2} e^{λ t} \int_{s_{2}}^{t} e^{- μ u} \int_{s_{2}}^{u} e^{μ v - λ v} d v d u + σ^{2} e^{λ t} \int_{s_{2}}^{t} e^{μ u - λ u} \int_{u}^{t} e^{- μ v} d v d u \end{matrix}

(A67)

\begin{matrix} = & σ^{2} e^{λ t} \int_{s_{2}}^{t} e^{- μ u} {[\frac{e^{(μ - λ) v}}{μ - λ}]}_{v = s_{2}}^{u} d u + σ^{2} e^{λ t} \int_{s_{2}}^{t} e^{(μ - λ) u} {[\frac{e^{- μ v}}{- μ}]}_{v = u}^{t} d u \end{matrix}

(A68)

\begin{matrix} = & \frac{σ^{2}}{μ - λ} e^{λ t} \int_{s_{2}}^{t} e^{- μ u} (e^{(μ - λ) u} - e^{(μ - λ) s_{2}}) d u + \frac{σ^{2}}{μ} e^{λ t} \int_{s_{2}}^{t} e^{(μ - λ) u} (e^{- μ u} - e^{- μ t}) d u \end{matrix}

(A69)

\begin{matrix} = & \frac{σ^{2}}{μ - λ} \int_{s_{2}}^{t} e^{λ t - λ u} - e^{λ (t - s_{2}) - μ (u - s_{2})} d u + \frac{σ^{2}}{μ} \int_{s_{2}}^{t} e^{λ (t - u)} - e^{- μ (t - u) - λ (t - u)} d u \end{matrix}

(A70)

\begin{matrix} = & \frac{σ^{2}}{μ - λ} e^{λ t} ({[\frac{e^{- λ u}}{- λ}]}_{u = s_{2}}^{t} - {[\frac{e^{- μ u + μ s_{2} - λ s_{2}}}{- μ}]}_{u = s_{2}}^{t}) + \frac{σ^{2}}{μ} e^{λ t} ({[\frac{e^{- λ u}}{- λ}]}_{u = s_{2}}^{t} - {[\frac{e^{μ u - λ u - μ t}}{μ - λ}]}_{u = s_{2}}^{t}) \end{matrix}

(A71)

\begin{matrix} = & \frac{σ^{2}}{λ (μ - λ)} e^{λ t} (e^{- λ s_{2}} - e^{- λ t}) + \frac{σ^{2}}{μ (μ - λ)} e^{λ t} (e^{- μ t + μ s_{2} - λ s_{2}} - e^{- μ s_{2} + μ s_{2} - λ s_{2}}) \end{matrix}

(A72)

\begin{matrix} + \frac{σ^{2}}{λ μ} e^{λ t} (e^{- λ s_{2}} - e^{- λ t}) + \frac{σ^{2}}{μ (μ - λ)} e^{λ t} (e^{(μ - λ) s_{2} - μ t} - e^{(μ - λ) t - μ t}) \end{matrix}

(A73)

\begin{matrix} = & \frac{σ^{2}}{λ (μ - λ)} (e^{λ (t - s_{2})} - 1) + \frac{σ^{2}}{μ (μ - λ)} (e^{λ (t - s_{2}) - μ (t - s_{2})} - 1) + \frac{σ^{2}}{λ μ} (e^{λ (t - s_{2})} - 1) + \frac{σ^{2}}{μ (μ - λ)} (e^{λ (t - s_{2}) - μ (t - s_{2})} - 1) \end{matrix}

(A74)

\begin{matrix} = & (\frac{σ^{2}}{λ (μ - λ)} + \frac{σ^{2}}{λ μ}) (e^{λ (t - s_{2})} - 1) + \frac{2 σ^{2}}{μ (μ - λ)} (e^{λ (t - s_{2}) - μ (t - s_{2})} - 1) \end{matrix}

(A75)

By adding the calculated two terms back together, we obtain the covariance between

ξ (s_{1}, t)

and

η (s_{2}, t)

.

\begin{matrix} c o v (ξ (s_{1}, t), η (s_{2}, t)) = & \frac{σ^{2}}{μ^{2}} (1 - e^{- μ (t - s_{2})} - e^{- μ (s_{2} - s_{1})} + e^{- μ (t - s_{1})}) \end{matrix}

(A76)

\begin{matrix} + (\frac{σ^{2}}{λ (μ - λ)} + \frac{σ^{2}}{λ μ}) (e^{λ (t - s_{2})} - 1) + \frac{2 σ^{2}}{μ (μ - λ)} (e^{λ (t - s_{2}) - μ (t - s_{2})} - 1) \end{matrix}

(A77)

Therefore, the correlation between

ξ

and

η

is calculated as follows:

c o r r (ξ, η) = \frac{c o v (ξ, η)}{D ξ D η}

(A78)

Appendix A.3

In this subsection of the appendix, the analytical fair price of the European floorlet option is also derived, similarly to the previously derived European caplet option. As we have seen before, the fair price of the floorlet is the expected value of the payoff function under the risk-neutral measure.

P_{f l o o r l e t} (s) = E [e^{- \int_{s}^{t + Δ} r (u) d u} {(e^{Δ K} - e^{Δ F^{Δ} (t, t)})}_{+}]

(A79)

We use the previously introduced variables:

ξ

and

η

.

\begin{matrix} ξ (s, t + Δ) & = \int_{s}^{t + Δ} r (u) d u = \int_{s}^{t + Δ} F (u, u) d u \end{matrix}

(A80)

\begin{matrix} η (t, t + Δ) & = Δ F^{Δ} (t, t) = \int_{t}^{t + Δ} F (t, u) d u \end{matrix}

(A81)

Similarly to the previously derived pricing formula for the caplet, the conditional expected value of

ξ

to

η

follows a normal distribution. Hence, the conditional standard distribution theorem can also be used with the previously defined parameters in this case. Therefore, the fair price of the European floorlet can be calculated as follows:

\begin{matrix} E [e^{- \int_{s}^{t + Δ} r (u) d u} {(e^{Δ K} - e^{Δ F^{Δ} (t, t)})}_{+}] = E [e^{- ξ} {(e^{Δ K} - e^{η})}_{+}] \end{matrix}

(A82)

\begin{matrix} = E [E (e^{- ξ} {(e^{Δ K} - e^{η})}_{+} | η] = E [{(e^{Δ K} - e^{η})}_{+} \cdot E (e^{- ξ} | η)] \end{matrix}

(A83)

During the derivations, the law of total expectation and the fact that

{(e^{Δ K} - e^{η})}_{+}

is measurable for

η

is used.

As we can see,

ξ \sim N (μ_{1}, σ_{1})

is normally distributed, therefore

- ξ \sim N (- μ_{1}, σ_{1})

, where

c o r r (- ξ, η) = - ρ

. Therefore,

- ξ | η \sim N (- μ_{1} - ρ σ_{1} \frac{η - μ_{2}}{σ_{2}}, σ_{1}^{2} (1 - ρ^{2}))

. Since the conditional distribution of

- ξ

given

η

is known,

E [e^{- ξ} | η]

can be calculated as the expected value of a lognormal distribution.

\begin{matrix} E [e^{- ξ} | η] = e^{- μ_{1} - ρ σ_{1} \frac{η - μ_{2}}{σ_{2}} + \frac{1}{2} σ_{1}^{2} (1 - ρ^{2})} \end{matrix}

(A84)

The integral returns the expected value of a random variable that is lognormally distributed. Returning to the pricing formula

\begin{matrix} E [{(e^{Δ K} - e^{η})}_{+} \cdot E [e^{- ξ} | η]] & = E [{(e^{Δ K} - e^{η})}_{+} \cdot e^{- μ_{1} - ρ σ_{1} \frac{η - μ_{2}}{σ_{2}} + \frac{1}{2} σ_{1}^{2} (1 - ρ^{2})}] \end{matrix}

(A85)

\begin{matrix} = e^{- μ_{1} + \frac{1}{2} σ_{1}^{2} (1 - ρ^{2})} E [{(e^{Δ K} - e^{η})}_{+} \cdot e^{- ρ σ_{1} \frac{η - μ_{2}}{σ_{2}}}] \end{matrix}

(A86)

\begin{matrix} = e^{Δ K + \frac{1}{2} ρ^{2} σ_{1}^{2}} \int_{- \infty}^{Δ K} \frac{1}{σ_{2} \sqrt{2 π}} e^{- \frac{{(x - (μ_{2} - ρ σ_{1} σ_{2}))}^{2}}{2 σ_{2}^{2}}} d x \end{matrix}

(A87)

\begin{matrix} - e^{μ_{2} + \frac{{(σ_{2}^{2} - ρ σ_{1} σ_{2})}^{2}}{2 σ_{2}^{2}}} \int_{- \infty}^{Δ K} \frac{1}{σ_{2} \sqrt{2 π}} e^{- \frac{{(x - (μ_{2} + σ_{2}^{2} - ρ σ_{1} σ_{2}))}^{2}}{2 σ_{2}^{2}}} d x \end{matrix}

(A88)

Therefore, the analytical pricing formula for the European floorlet option in Kennedy fields is as follows:

\begin{matrix} P_{f l o o r l e t} (s) = e^{Δ K - μ_{1} + \frac{1}{2} σ_{1}^{2}} Φ (\frac{Δ K - μ_{2} + ρ σ_{1} σ_{2}}{σ_{2}}) - e^{μ_{2} - μ_{1} + \frac{1}{2} (σ_{1}^{2} + σ_{2}^{2} - 2 ρ σ_{1} σ_{2})} Φ (\frac{Δ K - μ_{2} - σ_{2}^{2} + ρ σ_{1} σ_{2}}{σ_{2}}) \end{matrix}

(A89)

Appendix A.4

The fair price of a fixed vs. floating swap for several periods (k) at time s can be written using the formula below. Let us denote the time periods by

s < T_{0} < T_{1} < \dots < T_{k}

and

τ_{j} = T_{j} - T_{j - 1}

.

\begin{matrix} P_{s w a p} (s) & = E [\sum_{j = 1}^{k} e^{- \int_{s}^{T_{j}} r (u) d u} (e^{Δ F^{Δ} (T_{j - 1}, T_{j - 1})} - e^{τ_{j} K})] \end{matrix}

(A90)

\begin{matrix} = E [\sum_{j = 1}^{k} e^{- \int_{s}^{T_{j}} r (u) d u + Δ F^{Δ} (T_{j - 1}, T_{j - 1})} - e^{- \int_{s}^{T_{j}} r (u) d u + τ_{j} K}] \end{matrix}

(A91)

\begin{matrix} = \sum_{j = 1}^{k} E [e^{- \int_{s}^{T_{j}} r (u) d u + Δ F^{Δ} (T_{j - 1}, T_{j - 1})}] - \sum_{j = 1}^{k} E [e^{- \int_{s}^{T_{j}} r (u) d u + τ_{j} K}] \end{matrix}

(A92)

\begin{matrix} = \sum_{j = 1}^{k} E [e^{- \int_{s}^{T_{j}} r (u) d u + Δ F^{Δ} (T_{j - 1}, T_{j - 1})}] - \sum_{j = 1}^{k} e^{τ_{j} K} E [e^{- \int_{s}^{T_{j}} r (u) d u}] \end{matrix}

(A93)

As we have done previously, more additional variables are introduced,

ξ_{j}

and

η_{j}

, in the following way.

\begin{matrix} ξ_{j} (s, T_{j}) & = \int_{s}^{T_{j}} r (u) d u = \int_{s}^{T_{j}} F (u, u) d u \end{matrix}

(A94)

\begin{matrix} η_{j} (T_{j - 1}, T_{j}) & = Δ F^{Δ} (T_{j - 1}, T_{j - 1}) = \int_{T_{j - 1}}^{T_{j}} F (T_{j - 1}, u) d u \end{matrix}

(A95)

Referring to previous calculations, we know the expected value and standard deviation of

ξ

and

η

. Therefore, the expected values of the variables are the following:

\begin{matrix} μ_{ξ_{j}} & = E ξ_{j} (s, T_{j}) = E \int_{s}^{T_{j}} F (u, u) d u = (ν - \frac{σ^{2}}{μ}) (T_{j} - s) \end{matrix}

(A96)

\begin{matrix} μ_{η_{j}} & = E η_{j} (T_{j - 1}, T_{j}) = E Δ F^{Δ} (T_{j - 1}, T_{j - 1}) = \int_{T_{j - 1}}^{T_{j}} E F (T_{j - 1}, u) d u \end{matrix}

(A97)

\begin{matrix} = (ν - \frac{σ^{2}}{μ}) τ_{j} - \frac{σ^{2}}{μ^{2}} (e^{- μ τ_{j}} - 1) - \frac{σ^{2}}{μ (λ - μ)} (e^{- μ τ_{j}} - 1) + \frac{σ^{2}}{λ (λ - μ)} (e^{- λ τ_{j}} - 1) \end{matrix}

(A98)

and the covariance is

\begin{matrix} σ_{ξ_{j}}^{2} & = D^{2} ξ_{j} (s, T_{j}) = \frac{2 σ^{2}}{μ^{2}} ((T_{j} - s) μ + e^{- μ (T_{j} - s)} - 1) \end{matrix}

(A99)

\begin{matrix} σ_{η_{j}}^{2} & = D^{2} η_{j} (T_{j - 1}, T_{j}) = \frac{σ^{2}}{(λ - μ) λ} (e^{- λ τ_{j}} - 1) + \frac{σ^{2}}{μ (μ - λ)} (e^{- μ τ_{j}} - 1) \end{matrix}

(A100)

\begin{matrix} + \frac{σ^{2}}{μ (μ - λ)} (e^{- μ τ_{j}} - e^{- λ τ_{j}}) + \frac{σ^{2}}{λ μ} (1 - e^{- λ τ_{j}}) \end{matrix}

(A101)

Due to the properties of the Gaussian random field,

(ξ_{j}, η_{j})

always follows a multivariate normal distribution, with the covariance matrix shown before.

\begin{matrix} c o v (ξ_{j}, η_{j}) = & \frac{σ^{2}}{μ^{2}} (1 - e^{- μ (T_{j} - T_{j - 1})} - e^{- μ (T_{j - 1} - s)} + e^{- μ (T_{j} - s)}) \end{matrix}

(A102)

\begin{matrix} + (\frac{σ^{2}}{λ (μ - λ)} + \frac{σ^{2}}{λ μ}) (e^{λ (T_{j} - T_{j - 1})} - 1) + \frac{2 σ^{2}}{μ (μ - λ)} (e^{λ (T_{j} - T_{j - 1}) - μ (T_{j} - T_{j - 1})} - 1) \end{matrix}

(A103)

The correlation between

ξ_{j}

and

η_{j}

is the value of the covariance normalized with the standard deviation of

ξ_{j}

and

η_{j}

.

c o r r (ξ_{j}, η_{j}) = \frac{c o v (ξ_{j}, η_{j})}{D ξ_{j} D η_{j}}

(A104)

Because of the properties of the normal distribution, the distribution of

- ξ_{j} + η_{j}

is also normally distributed where the mean of the convolution is the sum of the means, and the variance is the following.

- ξ_{j} + η_{j} \sim N (- μ_{ξ_{j}} + μ_{η_{j}}, σ_{ξ_{j}}^{2} + σ_{η_{j}}^{2} - 2 ρ σ_{ξ_{j}} σ_{η_{j}})

(A105)

Therefore, the price of the interest rate swap can be easily calculated:

\begin{matrix} P_{s w a p} (s) & = \sum_{j = 1}^{k} E (e^{- ξ_{j} + η_{j}}) - \sum_{j = 1}^{k} e^{τ_{j} K} E (e^{- ξ_{j}}) \end{matrix}

(A106)

\begin{matrix} = \sum_{j = 1}^{k} e^{- μ_{ξ_{j}} + μ_{η_{j}} + \frac{1}{2} (σ_{ξ_{j}}^{2} + σ_{η_{j}}^{2} - 2 ρ σ_{ξ_{j}} σ_{η_{j}})} - \sum_{j = 1}^{k} e^{τ_{j} K} e^{- μ_{ξ_{j}} + \frac{1}{2} σ_{ξ_{j}}^{2}} \end{matrix}

(A107)

\begin{matrix} = \sum_{j = 1}^{k} e^{- μ_{ξ_{j}} + μ_{η_{j}} + \frac{1}{2} (σ_{ξ_{j}}^{2} + σ_{η_{j}}^{2} - 2 ρ σ_{ξ_{j}} σ_{η_{j}})} - \sum_{j = 1}^{k} e^{τ_{j} K - μ_{ξ_{j}} + \frac{1}{2} σ_{ξ_{j}}^{2}} \end{matrix}

(A108)

References

Kennedy, D.P. The term structure of interest rates as a Gaussian random field. Math. Financ. 1994, 4, 247–258. Available online: https://ideas.repec.org/a/bla/mathfi/v4y1994i3p247-258.html (accessed on 15 September 2021). [CrossRef]
Kennedy, D.P. Characterizing Gaussian models of the term structure of interest rates. Math. Financ. 1997, 7, 107–116. Available online: https://ideas.repec.org/a/bla/mathfi/v7y1997i2p107-118.html (accessed on 15 September 2021). [CrossRef]
Heath, D.C.; Jarrow, R.A.; Morton, A. Bond pricing and term structure of interest rates: A new methodology for contigent claims valuation. Econometrica 1992, 60, 77–105. [Google Scholar] [CrossRef]
Shreve, S.E. Stochastic Calculus for Finance I-II, 1st ed.; Springer Finance: Pittsburg, PA, USA, 2004. [Google Scholar]
Cheyette, O. Markov representation of the Heath-Jarrow-Morton model. SSRN Electron. J. 2001. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6073 (accessed on 12 February 2024). [CrossRef]
Beyna, I.; Wystup, U. On the Calibration of the Cheyette Interest Rate Model; CPQF Working Paper Series No. 25; Frankfurt School of Finance and Management: Frankfurt am Main, Germany, 2010. [Google Scholar]
Arató, N.M. Mean estimation of Brownian sheet. Comput. Math. Appl. 1997, 33, 12–25. Available online: https://core.ac.uk/download/pdf/82766418.pdf (accessed on 3 November 2021). [CrossRef]
Rozanov, J. Infinite-dimensional Gaussian distributions: Proceedings of the Steklov Institute of Mathematics, No. 108 (1968), 3rd ed.; American Mathematical Society: Providence, Rhode Island, 1971. [Google Scholar]
Arató, M.; Tóth-Lakits, D. Modeling Negative Rates. In Contributions to Risk Analysis: RISK 2022; Fundacion MAPFRE: Madrid, Spain, 2022; pp. 251–260. [Google Scholar]
Grenander, U. Stochastic Processes and Statistical Inference. Ark. Mat. 1950, 1, 195–277. Available online: https://archive.ymsc.tsinghua.edu.cn/pacm_download/116/6848-11512_2007_Article_BF02590638.pdf (accessed on 2 February 2022). [CrossRef]
Emerick, J.; Tatsat, H. Stochastic Volatility Models—Heston Model Calibration to option prices. QuantPy 2022. Available online: https://quantpy.com.au/stochastic-volatility-models/heston-model-calibration-to-option-prices/ (accessed on 10 January 2023).
Mikhailov, S.; Nögel, U. Heston’s Stochastic Volatility Model Implementation, Calibration and Some Extensions. Fraunhofer Inst. Ind. Math. 2003, 74–79. Available online: https://www.maths.univ-evry.fr/pages_perso/crepey/Equities/051111_mikh%20heston.pdf (accessed on 15 January 2023).

Figure 1. Simulated Kennedy fields

Figure 2. Simulated Monte Carlo market prices (mesh) vs. calibrated Kennedy model prices (markers) for a caplet.

Figure 3. Par swap rate market prices (mesh) vs. calibrated Kennedy model prices (markers).

Figure 4. Historical parameter estimations in time.

Figure 5. Differently parameterized Kennedy fields for describing forward rates.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tóth-Lakits, D.; Arató, M. On the Calibration of the Kennedy Model. Mathematics 2024, 12, 3059. https://doi.org/10.3390/math12193059

AMA Style

Tóth-Lakits D, Arató M. On the Calibration of the Kennedy Model. Mathematics. 2024; 12(19):3059. https://doi.org/10.3390/math12193059

Chicago/Turabian Style

Tóth-Lakits, Dalma, and Miklós Arató. 2024. "On the Calibration of the Kennedy Model" Mathematics 12, no. 19: 3059. https://doi.org/10.3390/math12193059

APA Style

Tóth-Lakits, D., & Arató, M. (2024). On the Calibration of the Kennedy Model. Mathematics, 12(19), 3059. https://doi.org/10.3390/math12193059

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

On the Calibration of the Kennedy Model

Abstract

1. Introduction

2. Kennedy Model

Connection Between HJM and the Kennedy Model

3. Parameter Estimation

3.1. Maximum Likelihood Estimations

3.1.1. The Case of Different Expected Values

3.1.2. The Case of Constant Expected Value

3.1.3. Some Simple Examples

3.2. Parameter Estimations of the Kennedy Field

4. Simulation of the Kennedy Field

5. Option Pricing

5.1. European Caplet

Expected Values and Variances

5.2. European Floorlet

5.3. Swap

5.4. Par Swap Rate

6. Calibration on Simulated Data

7. Calibration on Real Data

8. Discussion

9. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Appendix A.1

Appendix A.2

Appendix A.3

Appendix A.4

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI