Pricing of Commodity and Energy Derivatives for Polynomial Processes

Benth, Fred Espen

doi:10.3390/math9020124

Open AccessArticle

Pricing of Commodity and Energy Derivatives for Polynomial Processes

by

Fred Espen Benth

Department of Mathematics, University of Oslo, P.O. Box 1053, Blindern, N-0316 Oslo, Norway

Mathematics 2021, 9(2), 124; https://doi.org/10.3390/math9020124

Submission received: 16 November 2020 / Revised: 4 January 2021 / Accepted: 5 January 2021 / Published: 7 January 2021

(This article belongs to the Special Issue Stochastic Modelling with Applications in Finance and Insurance)

Download Versions Notes

Abstract

:

Operating in energy and commodity markets require a management of risk using derivative products such as forward and futures, as well as options on these. Many of the popular stochastic models for spot dynamics and weather variables developed from empirical studies in commodity and energy markets belong to the class of polynomial jump diffusion processes. We derive a tailor-made framework for efficient polynomial approximation of the main derivatives encountered in commodity and energy markets, encompassing a wide range of arithmetic and geometric models. Our analysis accounts for seasonality effects, delivery periods of forwards and exotic temperature forwards where the underlying “spot” is a nonlinear function of the temperature. We also include in our derivations risk management products such as spread, Asian and quanto options.

Keywords:

financial derivatives; polynomial processes; polynomial basis

1. Introduction

In commodity markets, forwards and futures are traded actively in various markets and over-the-counter as a means of hedging production, controlling price risk or for pure speculation. Other derivatives, such as plain-vanilla call and put options are also widely offered for trade at organised exchanges. Such options are typically written on forward and futures contracts, but there exist also a wide zoology of tailor-made derivatives, which have payoff structures to mitigate risk exposure towards various factors beyond price, for example temperature in a power market context. To effectively apply forward, futures and derivatives in commodity and energy markets for risk management, one must have available efficient methods for quantifying the prices of these assets. This paper provides a theoretical framework for the application of polynomials to price derivatives.

The no-arbitrage theory in financial mathematics determines prices in a complete market situation as the risk-neutral expected payoff (see, e.g., [1]). In incomplete markets, which is the typical situation in commodity and energy markets, prices can be valued by the expected payoff as well, but then under a pricing measure that must be determined (the reader is referred to the works of Geman [2] and Eydeland and Wolyniec [3] for a general commodity market analysis and that or Benth, Šaltytė Benth and Koekebakker [4] for stochastic modelling and pricing). In either case, a straightforward method for computing prices is to simulate the payoff, known as a Monte Carlo approach. In a few rare cases, even formulas are available, for example the famous Black–Scholes or Black76 formulas for call and put options or Margrabe’s formula for spread options. These formulas require the very restrictive assumption of a geometric Brownian motion describing the spot dynamics. Alternatively, in a more general Markovian setting for the underlying stochastic price dynamics (for the spot price of the forward/futures prices), one can resort to the accompanying partial differential equation and numerical methods to solve this. Another approach, which requires knowledge of the characteristic function of the underlying asset dynamics, is numerical integration of Fourier-based representations of the price. For an analysis of these methods in the context of energy markets, the interested reader is referred to the work of Benth, Šaltytė Benth and Kokebakker [4].

Polynomial processes provide an attractive alternative to the pricing approaches mentioned above. A polynomial process is, roughly speaking, a process for which the conditional moments become polynomials of lower order of the process. Polynomials are extremely efficient to compute on a computer, and thus such processes provide a framework for pricing. Indeed, polynomial processes have gained, since their introduction by Cuchiero [5], a lot of attention in the research literature and in various application areas, maybe foremost within finance (see the works of Filipović and Larsson [6] and Cuchiero, Keller-Ressel and Teichmann [7] for polynomial processes in finance). Most of the models used in commodity and energy markets are within the polynomial class, and pricing of forwards, futures and derivatives are feasible using methods from polynomial process theory.

Forwards and futures are settled on commodity and energy spot prices, which can be modelled by a polynomial process or some function thereof. Such functions can be exponentials, which has a series representation in terms of monomials or, as in weather markets or power, nonlinear functions of max/min type. The latter appears for example in connection with HDD and CDD weather futures traded at the CME exchange in the US.

We analyse the problem of deriving forward and futures pricing in these different situations, where we include seasonality, multi-factor models and delivery periods into our framework. The polynomial processes are of jump-type, following the recent introduction and analysis of Filipović and Larsson [8]. When we face nonlinear functions, we rely on series expansions of the function in terms of polynomials which are related to the ratio between the distribution of the polynomial process and a target distribution. Our analysis extends in this respect the ideas and derivations by Ackerer, Filipović and Pulido [9], who studied call and put options on an asset price with stochastic volatility. Recently, Ware [10] proposed and empirically estimated a polynomial model for power prices in Alberta, Canada, while Kleisinger-Yu, Komaric, Larsson and Regez [11] studied long-term forwards in the EEX power market using some specific polynomial processes. The current paper provides a general polynomial framework for forward and futures pricing in commodity and energy markets.

We also price general options on forwards and futures. These can, in some cases, be viewed as compound options. For example, options on temperature futures are options on options on temperature. This is a new application of polynomial processes in derivatives pricing, which, as we device here, relies again on the existence of polynomials for the ratio between the polynomial process distribution and a target distribution. An example is the class of multivariate Hermite polynomials, which are closely related to the Gaussian distribution, or the Laguerre polynomials, which are associated with the Gamma distribution. We also provide series expansions for the prices of spread and quanto options, exotic derivatives which are relevant in the energy markets. It is worth noticing that the pricing formulas based on polynomial analysis gives regression-type expressions for the derivatives price in terms of the current price of the underlying (or the factors thereof). Such relations can be proven useful in statistical studies of derivatives prices.

The analysis of this paper is presented as follows. In Section 2, we give a brief account on polynomial jump-diffusions, followed up by a short survey section of different stochastic dynamical models used in pricing forwards and options in commodity and energy markets. Section 4 provides a detailed analysis of polynomial processes and pricing of commodity and energy forward contracts. In Section 5, we price options on forwards based on polynomial processes. Finally, we end the paper with conclusions and outlook.

2. Background on Polynomial Jump-Diffusion Processes

We first recall the definition of a polynomial jump-diffusion process with values in

E \subseteq R^{d}

. Our presentation is adopted from Filipović and Larsson [8].

Throughout the paper, we let

(Ω, F, P)

be a probability space equipped with a filtration

{(F_{t})}_{t \geq 0}

satisfying the usual conditions (see [12]). For a subset

E \subseteq R^{d}, d \in N

, we denote

{Pol}_{n} (E)

the set of real-valued polynomials on E with order at most

n \in N_{0} : = N \cup {0}

, while

Pol (E)

is the algebra of all polynomials on E. To be precise, a polynomial

p \in Pol (E)

is defined as the restriction

{p = q |}_{E}

of a polynomial

q \in {Pol}_{n} (R^{d})

, with degree

\deg (p) = min {\deg (q) : p = q |_{E}, q = Pol (R^{d})}

.

Consider a stochastic process

{(X (t))}_{t \geq 0}

with state space E being a special semimartingale and having the property that

f (X (t)) - f (X (0)) - \int_{0}^{t} G f (X (s)) d s

is a local martingale for all

f \in C_{b}^{2} (R^{d})

, the space of bounded twice continuously differentiable real-valued functions on

R^{d}

. Here,

G f (x) = a {(x)}^{⊤} \nabla f (x) + \frac{1}{2} Tr (σ (x) \nabla^{2} f (x)) + \int_{R^{d}} (f (x + z) - f (x) - z^{⊤} \nabla f (x)) ℓ (x, d z)

for measurable functions

a : R^{d} \to R^{d}

and

σ : R^{d} \to S_{+}^{d}

,

S_{+}^{d}

is the space of positive-semidefinite symmetric

d \times d

matrices. Furthermore,

ℓ (x, \cdot)

is the Lévy jump measure on

R^{d}

, with properties

ℓ (x, {0}) = 0

and

\int_{R^{d}} {min (| z |, | z |}^{2}) ℓ (x, d z) < \infty

for all

x \in R^{d}

. Additionally, we assume that

G f (x) = 0

for

x \in E

for all

f \in Pol (R^{d})

with the property that

f (x) = 0

for

x \in E

and that the Lévy measure has finite moments of all orders, i.e.,

\int_{R^{d}} {| z |}^{n} ℓ (x, d z) < \infty

for all

x \in R^{d}

and

n \geq 2

. Filipović and Larsson [8] referred to this as

G

being well defined. Following Filipović and Larsson ([8], Definition 1), the E-valued jump-diffusion process

{(X (t))}_{t \geq 0}

with extended generator

G

is said to be a polynomial process if

G

maps

{Pol}_{n} (E)

into itself for each

n \in N

. By Lemma 1 in Filipović and Larsson [8], a characterisation of the polynomial processes is given as

\begin{matrix} a & \in {Pol}_{1} (E) \\ σ + \int_{R^{d}} z z^{⊤} ℓ (\cdot, d z) & \in {Pol}_{2} (E) \\ \int_{R^{d}} z^{α} ℓ (\cdot, d z) & \in {Pol}_{| α |} (E) \end{matrix}

for all multi-indices

α = (α_{1}, \dots, α_{d}) \in N_{0}^{d}

where

| α | : = α_{1} + \dots + α_{d} \geq 3

. Moreover,

z^{α} = z_{1}^{α_{1}} \dots z_{d}^{α_{d}}

. This is an “if and only if” characterisation.

The attractive property of polynomial processes is that they are stable under conditional expectations of polynomials applied to the process. This property will play a key role in our analysis of derivatives pricing in energy and commodity markets. To this end, introduce a polynomial basis vector for

{Pol}_{n} (E)

, denoted by

H_{n, d} (x) = {(1, v_{1} (x), \dots, v_{K} (x))}^{⊤}

(1)

for

x \in E

and

K : = K (n, d) : = \dim {Pol}_{n} (E) - 1

. Sometimes, we may for notational reasons write

v_{0} (x)

for the basis vector 1. In the basis

H_{n, d} (x)

, we have

v_{i} \in {Pol}_{n} (E), i = 0, 1, \dots, K

.

Remark 1.

The dimension of

{Pol}_{n} (R^{2})

is easily shown to satisfy

dim {Pol}_{n} (R^{2}) = n + 1 + dim {Pol}_{n - 1} (R^{2})

. Since

dim {Pol}_{0} (R^{2}) = 1

, we find that

dim {Pol}_{n} (R^{2}) = 1 + \sum_{i = 1}^{n + 1} i = \frac{1}{2} (n + 2) (n + 1) .

In general, we have

dim {Pol}_{n} (R^{d}) = (\binom{n + d}{n})

(see, e.g., [8]).

For

p \in {Pol}_{n} (E)

, let

p_{n} \in R^{K + 1}

be the coefficient vector such that

p (x) = p_{n}^{⊤} H_{n, d} (x)

. Furthermore, let

G_{n, d}

be the

(K + 1) \times (K + 1)

-matrix representation of

G

restricted to

{Pol}_{n} (E)

, which is determined by

G H_{n, d} (x) = G_{n, d} H_{n, d} (x) .

(2)

It follows that

G p (x) = p_{n}^{⊤} G_{n, d} H_{n, d} (x) .

We end this section with the main result of importance for our analysis ([8] Theorem 1):

Theorem 1.

Assume

{(X (t))}_{t \geq 0}

is an E-valued polynomial process. Then, for any

n \in N_{0}

and

p \in {Pol}_{n} (E)

, it holds that

E [p (X (T)) | F_{t}] = p_{n}^{⊤} exp (G_{n, d} (T - t)) H_{n, d} (X (t))

for any

T \geq t

, with

p_{n} \in R^{K + 1}

being such that

p (x) = p_{n}^{⊤} H_{n, d} (x)

.

In this paper, we mostly deal with polynomial processes having state space

E = R^{d}

. However, from time to time, we also encounter other state spaces in the discussion.

3. Forwards and Options on Energy and Commodities in a Polynomial Context

We review some models which are popular to apply in a commodity and energy market modelling context. The focus is on stochastic models for the spot price dynamics, including other relevant “spots” such as temperature and wind speed indices. We argue for the overwhelming evidence of polynomial processes in energy and more generally commodities markets modelling.

3.1. Commodity “Spot” Dynamics

The classical commodity spot model is given by the Schwartz model (see [13]): let

S (t) = exp (X (t))

(3)

where X follows an Ornstein–Uhlenbeck process

d X (t) = (μ - α X (t)) d t + σ d B (t) .

(4)

Here,

μ, α

and

σ

are constants, with

σ

and

α

being positive, and B is a standard Brownian motion. The Schwartz–Smith/Gibson–Schwartz model (see [14,15]) is a two-factor extension of this which adds to X another drifted Brownian motion or Ornstein–Uhlenbeck model (see the works of Lucia and Schwartz [16] for a power market application and Prokopczuk [17] on freight futures). General multi-factor Ornstein–Uhlenbeck models which also include Lévy processes as drivers for the noise were analysed by Benth, Šaltytė-Benth and Koekebakker [4]. That is, a general spot model for commodities and energies may be

S (t) = exp (γ^{⊤} X (t))

(5)

where

γ \in R^{d}

and X follows a d-dimensional Ornstein–Uhlenbeck process

d X (t) = (μ - A X (t)) d t + d L (t) .

(6)

for

μ \in R^{d}

, A a

d \times d

-matrix and L a d-dimensional Lévy process. Ornstein–Uhlenbeck processes in

R^{d}

belong to the class of polynomial processes. Nomikos and Soldatos [18] proposed a two-factor model of this kind for spot power prices in the NordPool electricity market, where they assume a Brownian motion and jump-driven Ornstein–Uhlenbeck process. Seasonality is added to the exponential stochastic dynamics, and the model is even extended to allow for a regime switch in the level

μ

of the Gaussian factor to account for the impact of reservoir filling.

The Pilipović model (see [19] (Page 64)) is another class of polynomial mean-reverting two-factor process for the spot price of energies. Here, the spot price is assumed to follow the dynamics

d S (t) = α (L (t) - S (t)) d t + σ S (t) d B (t),

with a stochastic long-term equilibrium level L following a geometric Brownian motion and

α, σ

positive constants. By fixing the long-term level L to be a constant, this model coincides with what is called a GARCH diffusion by Filipović and Larsson [8] (Example 2).

A typical feature of many commodity markets, in particular energy and weather markets, is seasonality. In geometric models such as (5), one typically lets

t \mapsto Λ (t)

be a positive-valued seasonality function and assumes

S (t) = Λ (t) exp (γ^{⊤} X (t)) .

A simple rewriting gives,

S (t) = exp (λ (t) + γ^{⊤} X (t)) .

(7)

with

λ (t) : = ln Λ (t)

. Cartea and Figueroa [20] proposed such a model for electricity spot prices in England and Wales, where the X is a univariate Ornstein–Uhlenbeck process of the form (6) with L being a Lévy prices with both Brownian motion and a compound Poisson process present. Interestingly, Mirantes, Población and Serna [21] proposed a stochastic seasonality model for the spot dynamics of Henry Hub natural gas prices. In their model,

λ

is assumed to follow a sum of complex Ornstein–Uhlenbeck processes in order to capture a trigonometric seasonal variation which is affected by random fluctuations. If we were to allow for complex-valued processes in our framework, this would mean that we could extend the size of the vector X by this number of complex-valued Ornstein–Uhlenbeck processes and let

λ (t) = 0

in (7).

So-called arithmetic models have also been proposed, taking the form

S (t) = λ (t) + γ^{⊤} X (t)

(8)

Temperature (see the work of Benth and Šaltytė Benth [22] for empirical analysis and stochastic modeling) may be described by an arithmetic CARMA-process, which is given by (8) and a special case of (6) with A being a particular matrix and L is a Brownian or Lévy noise in just the last dimension and zero otherwise. For example, choosing

γ = u_{1}

, the canonical unit vector in

R^{d}

,

u_{1}^{⊤} X (t)

will form a continuous autoregressive process of order d, a so-called CAR(d)-model. CAR(3)-processes have been used to model the temperature dynamics in several locations across the globe (see, e.g., the works of Härdle and Lopez-Cabrera [23] for US cities and Asian cities and Swishchuk and Cui [24] for Canadian cities). Geometric multi-factor CARMA-processes have been proposed in the context of commodity futures pricing (see the work of Paschke and Prokopczuk [25] for crude oil).

Power spot markets have a cap on the range of possible prices, which defends the introduction of a stochastic model with values in a given interval. Ware [10] proposed to model the power spot price dynamics in Alberta, Canada using a polynomial transform of the Jacobi process,

d X (t) = α (μ - X (t)) d t + σ \sqrt{X (t) (1 - X (t))} d B (t)

(9)

with

α > 0

,

θ \in [0, 1]

and

σ > 0

. The Jacobi process takes values in the unit interval

[0, 1]

and is an example of a polynomial process. We remark that the Jacobi process was proposed as a stochastic volatility model by Ackerer, Filipović and Pulido [9] in financial derivatives pricing. Kleisinger-Yu et al. [11] proposed to model power spot by a quadratic polynomial of a two-factor Schwartz–Smith dynamics in a theoretical and empirical hedge study of long-term power forward contracts. They also considered a stochastic correlation between the two factors modelled as the Jacobi process.

Wind futures based on relative wind production to total capacity require a “spot” which is a process taking values in the unit interval

[0, 1]

. Hence, the Jacobi process is a potential candidate. However, Benth and Pircalabu [26] suggested an exponential process of the form (7) with X being the negative of a univariate Ornstein–Uhlenbeck process as in (6) driven by a subordinator Lévy process. This leads to the exponential of polynomial process with jumps, with state space

[0, 1]

. The Cox–Ingersoll–Ross (CIR) stochastic process was applied by Bensoussan and Brouste [27] to model 10-min wind speed data at rotor height in Wyoming, USA. The CIR stochastic dynamics is an example of a polynomial process. Benth and Rohde [28] extended the study of Bensoussan and Brouste [27] in two different directions. Considering a finite sum of squared CARMA processes, they defined a so-called CIR-CARMA model on the one hand. As an alternative to this, they defined and analysed a CARMA-process with exponential jumps. Both models fit wind speed data very well and are within the class of polynomial processes (the former in fact being a polynomial transform of a polynomial process).

There also exists some research on stochastic volatility models with polynomial structure in the context of energy markets. Kyriakou et al. [29] proposed a mean-reverting exponential model with jumps and a Heston stochastic volatility for the spot dynamics of oil prices. They calibrated their model to different refined oil futures price series in Europe and the US. Kleisinger-Yu et al. [11] modelled the correlation by a Jacobi process in a polynomial power dynamics model.

All these mentioned processes are polynomial, demonstrating from an empirical and theoretical point of view the relevance of this class of dynamical models for risk management in commodity and energy markets.

3.2. Plain-Vanilla Forward Contracts

The forward price

F (t, T)

at time

t \geq 0

of a plain-vanilla forward contract delivering at time

T \geq t

is given as

F (t, T) = E [S (T) | F_{t}],

(10)

assuming that S is integrable. We suppose in this work that the risk-free interest rate is fixed to be

r > 0

. Thus, forwards and futures prices are identical, and we do not make any distinction between them. Notice also that we assume

P

to be the pricing measure, which is not necessarily equal to the market/objective probability. If the spot can be liquidly traded, the pricing measure is equal to the risk-neutral probability (i.e., the equivalent martingale measure).

We notice from (10) that, when S follows a geometric model (7), we find from the power expansion of the exponential function

F (t, T) = \sum_{n = 0}^{\infty} \frac{1}{n!} E [{(λ (T) + γ^{⊤} X (T))}^{n} | F_{t}]

(11)

assuming that we can interchange summation and expectation. The forward price can thus be computed by conditional expectations of polynomials of X. One can also use other polynomial expansions of the exponential function, tailor-made to the distributional properties of X. An arithmetic spot model (8) gives a very simple first-order polynomial in the conditional expectation, while Ware [10], for example, assumed the spot to be a polynomial of the Jacobi process, hence leading to a forward price of a finite sum of conditional expectations of polynomials to be computed.

3.3. Exotic Forward Contracts

Temperature forwards traded at the Chicago Mercantile Exchange (CME) are written on cooling-degree days (CDD) and heating-degree-days (HDD), which can be formulated as, respectively, call and put options on temperature spot (see Jewson and Brix [30]). The German energy exchange EEX launched in 2015 and 2016 intraday cap and floor futures, which have very similar settlement as temperature forwards (see the work of Hinderks and Wagner [31] for a discussion and analysis of these cap and floor futures). If temperature S is modelled by an arithmetic process (8), a CDD-forward price is

F (t, T) = E [max (S (T) - c, 0) | F_{t}]

(12)

where c is a threshold temperature, which at the CME is set to 18 °C. The max-function is obviously not polynomial, however, as presented by Ackerer et al. [9] in the context of option pricing with stochastic volatility, and discussed in a broader context in this paper below, one can represent the forward price (12) via a series of polynomials. Such series representations may be utilised in practice by a truncation of the infinite series followed by an efficient computation of the price resorting to the polynomial property of the process.

We remark that, in power and weather markets, as well as in freight markets, the forwards deliver over a specified period of time

T_{1} \leq T \leq T_{2}

rather than at a fixed time point T. This means that the forward price becomes simply a sum (or integral) over the specified time period,

F (t, T_{1}, T_{2}) = \sum_{T = T_{1}}^{T_{2}} F (t, T)

For example, in the temperature market, one is summing over the daily CDD or HDD, which again is based on the average of the maximum and minimum temperature on a given day. In the power market, one is aggregating over the hourly spot prices in the delivery period. These delivery periods are typically given as weeks, months, quarters or even years.

3.4. Options in Energy and Commodities

Exchange-traded options in energy and commodity markets are typically plain-vanilla call and put options written on forward contracts. In the EEX market, e.g., call and put options are listed on forwards delivering over months and quarters and years. The price of such contracts is

C (t, τ) : = e^{- r (τ - t)} E [max (F (τ, T_{1}, T_{2}) - K, 0) | F_{t}]

for a call option with strike K and exercise time

τ

, written on a forward delivering over the period

[T_{1}, T_{2}]

, where

τ \leq T_{1}

. Other markets with similar contracts include temperature derivatives at CME and options on the forward freight rate agreements (FFA) at the Baltic Exchange in London, UK.

In the oil market, e.g., the options are written on forwards with fixed-delivery time, i.e., the contract is settled on

F (τ, T)

with

τ \leq T

. Such options are traded, e.g., at the New York Mercantile Exchange (NYMEX). NYMEX also offers trade in spread options on different blends of oil. Spreads play an important role in energy and commodity markets, for example the spark and dark spreads between, respectively, power and gas and power and coal. In the OTC-market, there exists an abundance of various options and derivatives based on such spreads but also other variations of options on several underlyings as the quanto options. Quanto options are settled on the product of two payoff functions with respect to price and a volume measure such as temperature (see, e.g., [32]).

Asian-style options on spot prices are also attractive products in energy and commodity markets. Asian options on the spot price of electricity were traded on the Nord Pool power exchange and at the Imarex shipping exchange a few decades ago (see [17,33]), and Asian-like payoff structures on spots appear in many tailor-made commodity derivatives contracts traded OTC. Fusai, Marena and Roncoroni [34] advocated Fourier-based pricing methods for discretely-monitored Asian options in corn and gas markets using a square-root (Cox–Ingersoll–Ross) stochastic process for the spot dynamics. Discrete and continuous-time Asian options in the context of crude oil were priced by an iterative numerical method by Kyriakou, Pouliasis and Papapostolou [35] based on the Heston stochastic volatility model and its Bates’ extension where the log-price dynamics have jumps (see also [29]). The Heston along with the SABR stochastic volatility models were proposed for oil futures price dynamics by Shiraya and Takahashi [36], who derived approximation formulas by asymptotic expansions for Asian options. The models are calibrated using WTI futures American options.

4. Polynomial Processes and Forward Pricing

As shown in the previous section, the problem of finding the forward price in commodity and energy markets can be reduced to computing

E [p (λ (t) + γ^{⊤} X (t)) | F_{s}]

(13)

for some

t \geq s \geq 0

, with X being an

R^{d}

-valued polynomial process,

γ \in R^{d}

and

λ (t)

a deterministic function. We assume

p \in {Pol}_{n} (R)

.

We see from Expression (13) that we are facing the following two problems when computing the conditional expectation. First, we shift the polynomial process by a seasonality function

λ (T)

, which means that we need to know how the polynomials act under shifting. We express this in terms of a shift in monomials, followed by a matrix for basis change on

{Pol}_{n} (R)

. Secondly, we consider polynomials on real-valued affine transformations of multi-dimensional polynomial processes. This requires an understanding of how one transforms

p (γ^{⊤} x)

for a real-valued polynomial p into a polynomial on

R^{d}

, with

γ, x \in R^{d}

. We can assign a linear transformation between the polynomial bases in

{Pol}_{n} (R)

and

{Pol}_{n} (R^{d})

. In the next two lemmas, we spell this out.

The first lemma is a simple consequence of the binomial formula:

Lemma 1.

Let

M_{n} (x) = {(1, x, x^{2}, \dots, x^{n})}^{⊤}

be a vector of monomials on

R

up to order

n \in N

. Then,

M_{n} (λ + x) = Λ_{n} (λ) M_{n} (x)

where

Λ_{n} \in R^{(n + 1) \times (n + 1)}

is a lower triangular matrix with elements

{(Λ_{n} (λ))}_{i j} = (\binom{i - 1}{j - 1}) λ^{i - j}

for

i = 1, 2, \dots n + 1

and

j \leq i

.

Proof.

Let

m \leq n

. Then, by the binomial formula,

{(λ + x)}^{m} = \sum_{k = 0}^{m} (\binom{m}{k}) λ^{m - k} x^{k}

This yields the result. □

If we are given a basis

H_{n} (x) : = H_{n, 1} (x)

of polynomials in

{Pol}_{n} (R)

,

H_{n} (x) : = {(h_{0} (x) : = 1, h_{1} (x), \dots, h_{n} (x))}^{⊤}

(14)

where

h_{k} (x) \in {Pol}_{n} (R)

, then one can find an invertible matrix

C_{n} \in R^{(n + 1) \times (n + 1)}

such that

H_{n} (x) = C_{n} M_{n} (x)

(15)

In forward pricing, we consider polynomials which are shifted by the seasonal function

λ (t)

,

p (λ (t) + x)

. Hence, if

p_{n} \in R^{n + 1}

is such that

p (x) = p_{n}^{⊤} H_{n} (x)

, for

H_{n} (x)

as in Lemma above, then we find

p (λ + x) = p_{n}^{⊤} H_{n} (λ + x) = p_{n}^{⊤} C_{n} M_{n} (λ + x) = p_{n}^{⊤} C_{n} Λ_{n} (λ) M_{n} (x) = p_{n} {(λ)}^{⊤} H_{n} (x)

with

p_{n} {(λ)}^{⊤} : = p_{n}^{⊤} C_{n} Λ_{n} (λ) C_{n}^{- 1} .

(16)

We have the following technical result on changing from univariate to multivariate polynomial bases:

Lemma 2.

There exists an

(n + 1) \times (K + 1)

-dimensional matrix

Γ_{n, d}

such that

H_{n} (γ^{⊤} x) = Γ_{n, d} H_{n, d} (x)

for any

γ, x \in R^{d}

. Here,

K + 1

is the dimension of

{Pol}_{n} (R^{d})

.

Proof.

From (15), we have that

H_{n} (γ^{⊤} x) = C_{n} M_{n} (γ^{⊤} x)

. Moreover, by the multinomial formula, it holds for

1 \leq k \leq n

,

{(γ^{⊤} x)}^{k} = {(γ_{1} x_{1} + γ_{2} x_{2} + \dots + γ_{d} x_{d})}^{k} = \sum_{k_{1} + \dots + k_{d} = k} (\binom{k}{k_{1}, k_{2}, \dots, k_{d}}) Π_{i = 1}^{d} γ_{i}^{k_{i}} x_{i}^{k_{i}},

where

(\binom{k}{k_{1}, k_{2}, \dots, k_{d}}) = \frac{k!}{k_{1}! \dots k_{d}!}

is the multinomial coefficient. Hence,

{(γ^{⊤} x)}^{k} \in {Pol}_{k} (R^{d})

and

M_{n} (γ^{⊤} x)

is an

n + 1

-dimensional vector of polynomials in

{Pol}_{n} (R^{d})

. Therefore, we can find a matrix

{\tilde{Γ}}_{n, d}

such that

M_{n} (γ^{⊤} x) = {\tilde{Γ}}_{n, d} H_{n, d} (x)

and the lemma follows. □

Typically, a seasonality function

λ (t)

may be a finite series of sine and cosine functions. Since, for example, the pair of functions

(sin (k t), cos (k t))

satisfies a two-dimensional first order linear system of ordinary differential equations, we see that truncated series of trigonometric functions fits into the framework of polynomial processes. This was pointed out and discussed by Filipović and Willems [37] in their study of dividend derivatives. As mentioned in Section 3, complex-valued Ornstein–Uhlenbeck processes were proposed by Mirantes, Población and Serna [21] to model stochastic seasonality. Their model can be viewed as an extension of the seasonality dynamics by Filipović and Willems [37] with additive (complex-valued) noise. More generally, one can allow for polynomial complex-valued processes of polynomial type to model stochastic seasonality. Such an approach would in our framework imply that we ignore the seasonality function

λ

, i.e., let

λ (t) = 0

and extend the dimensionality of the vector-valued polynomial process X (as well as allowing for more generally complex-valued polynomial processes). The price we would pay for interpreting the seasonality

λ

as a polynomial process would be that the dimensionality d of X increases and hence the dimension

K + 1

of

{Pol}_{n} (R^{d})

.

4.1. Plain-Vanilla Forward Prices

We find the following result, collecting the notation introduced in this section:

Proposition 1.

Let

p \in {Pol}_{n} (R^{d})

. For

T \geq t \geq 0

, it holds,

E [p (λ (t) + γ^{⊤} X (T)) | F_{t}] = p_{n} {(λ (T))}^{⊤} Γ_{n, d} exp (G_{n, d} (T - t)) H_{n, d} (X (t))

where

p_{n} (λ)

is given in (16),

Γ_{n, d}

in Lemma 2 and

G_{n, d}

is the polynomial transition matrix defined in (2).

Proof.

First note that a polynomial process has finite conditional moments of all orders (see [8] (Theorem 1)). Hence, the conditional expectation is well-defined. From Lemma 2, we have for

γ, x \in R^{d}

and

λ \in R

,

p (λ + γ^{⊤} x) = p {(λ)}^{⊤} H_{n} (γ^{⊤} x) = p_{n} {(λ)}^{⊤} Γ_{n, d} H_{n, d} (x) .

Hence,

E [p (λ (T) + γ^{⊤} X (T)) | F_{t}] = p_{n} {(λ (T))}^{⊤} Γ_{n, d} E [H_{n, d} (X (T)) | F_{t}]

The result follows from the polynomial process property of X (see Theorem 1). □

If the spot dynamics follows an arithmetic model (8), then we find the forward price (10) as

F (t, T) = p_{1} {(λ (T))}^{⊤} Γ_{1, d} exp (G_{1, d} (T - t)) H_{1, d} (X (t))

where

p_{1} : = {(p_{1, 0}, p_{1, 1})}^{⊤}

is the vector such that

x = p_{1, 0} h_{0} (x) + p_{1, 1} h_{1} (x)

. Of course, if we choose

H_{1} (x) = M_{1} (x)

, then

p_{1} = {(0, 1)}^{⊤}

,

C_{1} = I_{2}

, the

2 \times 2

-identity matrix and

Λ_{1} (λ) = [\begin{matrix} 1 & 0 \\ λ & 1 \end{matrix}]

Then,

p_{1} {(λ)}^{⊤} = (λ, 1) .

Furthermore,

Γ_{1, d}

is in this case the matrix mapping

M_{1} (γ^{⊤} x) = {(1, γ_{1} x_{1} + \dots + γ_{d} x_{d})}^{⊤}

into

H_{1, d} (x) = {(v_{0} (x), \dots, v_{d} (x))}^{⊤}

, with

v_{i} (x)

being polynomials of order 1 on

R^{d}

. If

v_{0} (x) = 1

and

v_{i} (x) = x_{i}

for

i = 1, \dots, d

, then

Γ_{1, d} = [\begin{matrix} 1 & 0 & \dots & 0 \\ 0 & γ_{1} & \dots & γ_{d} \end{matrix}]

Thus, we have a simple relationship when the monomials are chosen as the basis of

H_{1, d} (x)

. Indeed, the forward price is

F (t, T) = (λ (T), γ_{1}, \dots, γ_{d}) exp (G_{1, d} (T - t)) (\begin{matrix} 1 \\ X_{1} (t) \\ \cdot \\ \cdot \\ X_{d} (t) \end{matrix})

Remark 2.

Proposition 1 is a simple extension of the formulas by Kleisinger-Yu et al. [11] who focussed on the case of

p \in {Pol}_{2} (R^{d})

. Ware [10] also discussed such formulas in his polynomial approach to power spot models.

Let us discuss the forward price dynamics in Proposition 1 from an empirical viewpoint. To put our discussion into context, we note that Ware [10] proposed, among other models, a one-factor polynomial diffusion process combined with a fifth-order polynomial as a spot dynamics, i.e.,

S (t) = p (X (t))

with

X \in R

and

p \in {Pol}_{5} (R)

. Here, we ignore seasonality for simplicity in the discussion. Then, using

τ : = T - t

, the time to maturity, we get

f (t, τ) : = F (t, t + τ) = θ^{⊤} exp (G_{5, 1} τ) H_{5, 1} (X (t))

where

θ : = Γ_{5, 1}^{⊤} p_{5} \in R^{6}

. Using the monomials as basis, we have

H_{5, 1} (x) = {(1, x, x^{2}, x^{3}, x^{4}, x^{5})}^{⊤}

. Moreover, as X is polynomial, we have that its drift

a (x) : = a_{0} + a_{1} x

is in

{Pol}_{1} (R)

and diffusion

σ (x) = σ_{0} + σ_{1} x + σ_{2} x^{2}

is in

{Pol}_{2} (R)

. If

v_{k} (x) = x^{k}, k = 0, \dots, 5

, it is easy to see that

G v_{0} (x) = 0

,

G v_{1} (x) = a (x)

and

G v_{k} (x) = a (x) k v_{k - 1} (x) + \frac{1}{2} σ^{2} (x) k (k - 1) v_{k - 1} (x)

for

k = 2, \dots, 5

. Hence,

G_{5, 1}

will be a lower triangular matrix with zero in the first row and diagonal elements

k a_{1} + \frac{1}{2} k (k - 1) σ_{2}

for

k = 1, \dots, 5

. (Note that first diagonal element is zero, the second is

a_{1}

, the next is

2 a_{1} + σ_{2}

, etc. to the last which is

5 a_{1} + 10 σ_{2}

.) The distinct eigenvalues of the matrix then becomes

λ_{0} = 0

and

λ_{k} = k a_{1} + \frac{1}{2} k (k - 1) σ_{2}

for

k = 1, \dots, 5

. We can find a basis of

R^{6}

of eigenvectors

w_{0}, w_{1}, \dots, w_{5}

, where

w_{0}

can be chosen to be the first canonical basis vector of

R^{6}

with 1 in first coordinate and zeros otherwise. From this, we find

exp (G_{5, 1} τ) H_{5, 1} (X (t)) = w_{0} + \sum_{k = 1}^{5} e^{λ_{k} τ} w_{k}^{⊤} H_{5, 1} (X (t)) w_{k}

(17)

In commodity markets, one typically expects forward prices to be flat in the long end of the curve as long as spot prices are expected to possess some stationarity properties. It is evident from above that the forward prices for large

τ

’s tend to

p_{5}^{⊤} w_{0}

whenever the eigenvalues

λ_{k}

are negative. Hence, we obtain a flat forward curve in the long end when

k a_{1} + \frac{1}{2} k (k - 1) σ_{2} < 0

for

k = 1, \dots, 5

. For example, if

σ_{2} = 0

, then this is achieved when

a_{1} < 0

. On the other hand, in the Jacobi model suggested by Ware [10],

σ_{1} = - 1

, which again implies that

a_{1} < 0

. This is indeed the case in his model.

By an application of Ito’s formula, one furthermore observes that the volatility of

f (t, τ)

will satisfy the Samuelson effect, as the volatility will be determined by the diffusion term from

X (t)

and scaled by exponentials

e^{λ_{k} τ}

. Whenever

λ_{k} < 0

, these exponentials will tend to 1 when

τ

tends to zero, which gives a Samuelson effect as the forward volatility converges to the spot volatility in this case.

It is also worth noticing that the sum of exponentials in (17) gives rise to several humps in the forward term structure, that is, the curve

τ \mapsto f (t, τ)

may have several local maxima and minima according to the values of the eigenvalues. A hump shape behaviour of the forward curve is reasonable from an economic viewpoint, indicating differences in risk preferences of the traders along the forward curve. We refer to the work of Benth, Šaltytė Benth and Koekebakker [4] for a discussion of the various stylised facts of forward curves in power and commodity markets.

The analysis above can be generalised to arbitrary polynomials p, as well as also extending the dimension of the polynomial process beyond

d = 1

.

If the spot follows a geometric model (7), then, from the Taylor series expansion of the forward price in (11), one chooses the basis for

{Pol}_{n} (R)

to be the monomials, i.e.,

H_{n} (x) = M_{n} (x)

. The basis for

{Pol}_{n} (R^{d})

is arbitrary selected. We find:

Proposition 2.

Suppose the commodity spot dynamics is given by

S (t) = exp (λ (t) + γ^{⊤} X (t))

as in (7). Assume that

exp (| γ^{⊤} X (T) |) \in L^{1} (P)

. Then, the forward price in (11) is given by

F (t, T) = \sum_{n = 0}^{\infty} \frac{1}{n!} u_{n + 1}^{⊤} Λ_{n} (λ (T)) Γ_{n, d} exp (G_{n, d} (T - t)) H_{n, d} (X (t)),

where

u_{n + 1}

is the

n + 1

-canonical unit vector in

R^{n + 1}

(i.e., the vector with 1 in coordinate

n + 1

and zero otherwise),

Γ_{n, d}

in Lemma 2,

Λ_{n} (λ)

in Lemma 1 and

G_{n, d}

in (2).

Proof.

From the exponential integrability condition on

γ^{⊤} X (T)

, it follows from monotone convergence (see [38] (Theorem 2.15)) that

\begin{matrix} \sum_{n = 0}^{\infty} \frac{1}{n!} E [| λ (T) + γ^{⊤} X (T) |^{n}] & = E [\sum_{n = 0}^{\infty} \frac{1}{n!} | λ (T) + γ^{⊤} X (T) |^{n}] \\ = E [exp (| λ (T) + γ^{⊤} X (T) |] \\ \leq exp (| Λ (T) |) E [exp (| γ^{⊤} X (T) |)] < \infty \end{matrix}

Hence, in particular, we find that

E [| γ^{⊤} X (T) |^{n}] < \infty

for every

n \in N

. As

{({(γ^{⊤} X (T))}^{n})}_{n \in N}

therefore is a sequence in

L^{1} (P)

and

\sum_{n = 0}^{\infty} \frac{1}{n!} E [| γ^{⊤} X (T) |^{n}] < \infty

, it follows from dominated convergence theorem (see [38] (Theorem 2.25)) that

E [exp (λ (T) + γ^{⊤} X (T)) | F_{t}] = \sum_{n = 0}^{\infty} \frac{1}{n!} E [{(λ (T) + γ^{⊤} X (T))}^{n} | F_{t}] .

By Lemma 1, we derive

\begin{matrix} E [{(λ (T) + γ^{⊤} X (T))}^{n} | F_{t}] & = u_{n + 1}^{⊤} E [M_{n} (λ (T) + γ^{⊤} X (T)) | F_{t}] \\ = u_{n + 1}^{⊤} Λ_{n} (λ (T)) E [M_{n} (γ^{⊤} X (T)) | F_{t}] . \end{matrix}

Next, by Lemma 2 followed by the polynomial property of X yield,

\begin{matrix} E [M_{n} (γ^{⊤} X (T)) | F_{t}] & = Γ_{n, d} E [H_{n, d} (X (T)) | F_{t}] \\ = Γ_{n, d} exp (G_{n, d} (T - t)) H_{n, d} (X (t)) \end{matrix}

The result follows. □

The Proposition above allows for stating the forward price as

F (t, T) = \sum_{n = 0}^{\infty} \frac{1}{n!} f_{n} (t, T; X (t))

where

x \mapsto f_{n} (t, T; x) \in {Pol}_{n} (R^{d})

. In practical computations, one would of course truncate the sum. One could also make use of the (truncated) sum in a regression study, where one empirically could reveal the structure of the

f_{n}

’s by regressing observed forward prices against polynomials of X. Such a study requires knowledge of the state of

X (t)

, which can be recovered from the spot prices. Such recovery may involve stochastic filtering if

d > 1

.

The exponential integrability condition

exp (| γ^{⊤} X (T) |) \in L^{1} (P)

in Proposition 2 is rather restrictive. Geometric Brownian motion, e.g., does not satisfy this condition for any

γ

. From Example 2 of Filipović and Larsson [8], the GARCH diffusion process

d X (t) = κ (θ - X (t)) d t + \sqrt{2 κ} X (t) d B (t)

for constants

κ, θ \in R_{+}

is a polynomial process with ergodic solution being inverse Gaussian distributed with shape parameter 2 and

1 / θ

as scale. In the invariant case, X does not have a finite variance and hence not being exponentially integrable either. By letting

θ

be a geometric Brownian motion, the GARCH diffusion becomes the Pilipović model (see Pilipović [19]) briefly discussed in Section 3. This model will not in general be exponentially integrable. Ornstein–Uhlenbeck processes driven by compound Poisson processes having exponential jumps will have a gamma distributed stationary solution. This will also not be exponentially integrable, except under restrictive conditions on the parameters.

4.2. Exotic Forward Prices

Next, consider CDD-forwards on temperature (or a floor electricity forward) with price given in (12). We need some preparatory material on polynomial expansions of call and put payoff functions. For

n \in N_{0}

, let

ξ_{n} (x)

denote the nth Hermite polynomial (known as the “probabilistic” Hermite polynomial) defined as

ξ_{n} (x) = {(- 1)}^{n} \frac{1}{w (x)} \frac{d^{n}}{d x^{n}} w (x)

(18)

where

w (x) = \frac{1}{\sqrt{2 π}} e^{- x^{2} / 2}

(19)

is the density of the standard normal distribution function. We notice that

ξ_{0} (x) = 1

. Further, define

e_{n} (x) = \frac{ξ_{n} (x)}{\sqrt{n!}},

(20)

for

n \in N_{0}

with the usual convention that

0! = 1

. It is known that

{(e_{n})}_{n \in N_{0}}

is an ONB of the Hilbert space

L_{w}^{2} : = L^{2} (R, w (x) d x)

. Considering

f (x) = max (x - c, 0)

, we readily find that

f \in L_{w}^{2}

as

f (x) = (x - c)

for

x > c

and zero for

x < c

and w integrates any polynomial. Moreover, for any

f \in L_{w}^{2}

, we find that

{| f |}_{w}^{2} : = \int_{R} f^{2} (x) w (x) d x = E [f^{2} (Y)]

with

Y \sim N (0, 1)

, a standard normal random variable. Moreover, from elementary functional analysis, we have

f (x) = \sum_{n = 0}^{\infty} \int_{R} f (y) e_{n} (y) w (y) d y e_{n} (x) .

The following simple result holds:

Lemma 3.

Suppose

f \in L_{w}^{2}

and denote by

f^{m} (x) : = \sum_{n = 0}^{m} f_{n} e_{n} (x)

. Then, for

Y \sim N (0, 1)

,

{(f^{m} (Y))}_{m \in N}

converges to

f (Y)

in

L^{2} (P)

. Moreover,

E [f (Y)] = \sum_{n = 0}^{\infty} f_{n} E [e_{n} (Y)]

Proof.

Obviously,

f^{m} \in L_{w}^{2}

and

f - f^{m} \to 0

in

L_{w}^{2}

as

m \to \infty

. The latter means

\begin{matrix} 0 & = lim_{m \to \infty} {| f - f^{m} |}_{w}^{2} \\ = lim_{m \to \infty} \int_{R} {(f (x) - f^{m} (x))}^{2} w (x) d x \\ = lim_{m \to \infty} E [{(f (Y) - f^{m} (Y))}^{2}] \end{matrix}

Hence, the first claim follows. For the second claim, the Cauchy–Schwarz inequality implies

\begin{matrix} | E [f (Y)] - \sum_{n = 0}^{m} f_{n} E [e_{n} (Y)] |^{2} & = | E [f (Y) - f^{m} (Y)] |^{2} \\ \leq E [| f (Y) - f^{m} (Y) |^{2}] . \end{matrix}

Invoking the first claim proves the Lemma. □

We extend the previous result to more general random variables Y in the next lemma:

Lemma 4.

Suppose Y is a random variable with probability density

ϕ_{Y}

satisfying

ϕ_{Y} (y) \leq C w_{a, b^{2}} (y)

for

a . e .

y \in R

, where

C \geq 1

is a constant and

w_{a, b^{2}}

is the normal density function with mean a and variance is

b^{2}

with

0 < b < 1

. If

f \in L_{w}^{2}

then

E [f (Y)] = \sum_{n = 0}^{\infty} f_{n} E [e_{n} (Y)]

Proof.

By the Cauchy–Schwarz inequality,

\begin{matrix} | E [f (Y)] - \sum_{n = 0}^{m} f_{n} E [e_{n} (Y)] |^{2} & = | E [f (Y) - f^{m} (Y)] |^{2} \\ \leq E [| f (Y) - f^{m} (Y) |^{2}] \\ = \int_{R} {(f (y) - f^{m} (y))}^{2} ϕ_{Y} (y) d y \\ \leq C \int_{R} {(f (y) - f^{m} (y))}^{2} w_{a, b^{2}} (y) d y \\ = C \int_{R} {(f (y) - f^{m} (y))}^{2} \frac{w_{a, b^{2}} (y)}{w (y)} w (y) d y . \end{matrix}

Consider the positive function

u (y) : = \frac{w_{a, b^{2}} (y)}{w (y)} = exp (\frac{1}{2} y^{2} (1 - b^{- 2}) + \frac{a}{b^{2}} y - \frac{a^{2}}{2 b^{2}})

As

0 < b < 1

, it holds that

1 - b^{- 2} < 0

and thus u has a maximum value on

R

. It follows that

\begin{matrix} | E [f (Y)] - \sum_{n = 0}^{m} f_{n} E [e_{n} (Y)] |^{2} & \leq C sup_{y \in R} u (y) {| f - f^{m} |}_{w}^{2} . \end{matrix}

Since

f \in L_{w}^{2}

and

f^{m}

is its truncation in the basis representation, the result follows after passing to the limit. □

We recall that

f (x) = max (x - c, 0)

satisfies the requirement that

f \in L_{w}^{2}

. Moreover, CARMA(

p, q

)-processes driven by Brownian motion are normally distributed, and hence the above result applies with

ϕ_{Y}

being a normal distribution with variance less than 1. We also recall from Ackerer et al. [9] that the Jacobi volatility process has a distribution which is absolutely continuous with respect to the normal distribution. In addition, rather than using the Taylor series representation

exp (x) = \sum_{k = 0}^{\infty} \frac{1}{k!} x^{k}

, we may use the Hermite polynomials as series expansion for the exponential function since obviously

exp (x) \in L_{w}^{2}

.

The condition that the variance b is strictly less than one is very restrictive. However, one can overcome this by a change in the Hermite basis or by appropriate rescaling the function f. We provide a thorough discussion of this in Section 4.3, where we take a more general perspective. For the moment, we note that Ackerer et al. [9] used an affine transform of the Hermite polynomials as basis.

Remark 3.

We notice that the condition on the probability density of Y in Lemma 4 implies that the distribution

Φ_{Y} (d y) : = ϕ_{Y} (y) d y

is absolutely continuous with respect to

w_{a, b^{2}} (y) d y

. By assuming instead that the distribution of Y,

Φ_{Y}

is dominated by that of

w_{a, b^{2}} (y) d y

, i.e.,

Φ (d y) \leq C w_{a, b^{2}} (y) d y

in the sense

Φ_{Y} (U) \leq C \int_{U} w_{a, b^{2}} (y) d y

for every Borel set

U \subset R

, we find that there exists an

a . e .

non-negative Radon–Nikodym density

ℓ_{Y} \in L^{1} (R, w_{a, b^{2}} (y) d y)

. In this case, we can define a probability density

ϕ_{Y} (y) : = ℓ_{Y} (y) w_{a, b^{2}} (y)

. However, then,

ϕ_{Y} (y) \leq C w_{a, b^{2}} (y), a . e . y \in R

because, if this is not the case there exists a measurable set U with strictly positive mass such

ϕ_{Y} (y) > C w_{a, b^{2}} (y), y \in U

, which implies that

Φ_{Y} (U) = \int_{U} ϕ_{Y} (y) d y > C \int_{U} w_{a, b^{2}} (y) d y

being a contradiction. Further notice that the constant C must be greater than or equal to 1 simply because we have distributions with total mass 1 on both sides of the bound.

In the next subsection, we take a more general perspective where the distribution of the polynomial process does not need to be bounded by a Gaussian but other suitable classes of distributions for which we can associate polynomials. At the current stage of our exposition, we focus on the Gaussian case as this is the most relevant in connection with temperature forwards, where the underlying dynamics have empirical evidence for being normally distributed (recall discussions in Section 3).

We next show a polynomial expression for the CDD-temperature forward price: to this end, choose the basis

H_{n} (x) = {(e_{0} (x), e_{1} (x), \dots, e_{n} (x))}^{⊤}

for

{Pol}_{n} (R)

, where

{(e_{n} (x))}_{n \in N_{0}}

are the normalised Hermite polynomials defined in (20). For

{Pol}_{n} (R^{d})

, we fix an arbitrary basis

H_{n, d} (x)

.

Proposition 3.

Suppose the commodity spot dynamics is given by

S (t) = λ (t) + γ^{⊤} X (t)

as in (8), where X is a d-dimensional polynomial process. Assume that the random variable

γ^{⊤} X (T)

has an

F_{t}

-conditional probability density which is bounded by a normal density

w_{a, b^{2}}

as in Lemma 4. Then, the forward price in (12) is given by

F (t, T) = \sum_{n = 0}^{\infty} f_{n} u_{n + 1}^{⊤} C_{n} Λ_{n} (λ (T)) C_{n}^{- 1} Γ_{n, d} exp (G_{n, d} (T - t)) H_{n, d} (X (t)),

where

u_{n + 1}

is the

n + 1

-canonical unit vector in

R^{n + 1}

(i.e., the vector with 1 in coordinate

n + 1

and zero otherwise),

C_{n}

is given in (15),

Γ_{n, d}

in Lemma 2,

Λ_{n} (λ)

in Lemma 1 and

G_{n, d}

in (2).

Proof.

We find that

f (x) = max (x - c, 0) \in L_{w}^{2}

, and therefore

f (x) = \sum_{n = 0}^{\infty} f_{n} e_{n} (x)

From the condition on the density of

γ^{⊤} X (T)

given

F_{t}

, it holds from Lemma 4 that the conditional expectation is well-defined as integrability holds, and that we can commute sum and conditional expectation. That is,

\begin{matrix} F (t, T) & = E [max (λ (T) + γ^{⊤} X (T) - c, 0) | F_{t}] \\ = E [f (λ (T) + γ^{⊤} X (T)) | F_{t}] \\ = \sum_{n = 0}^{\infty} f_{n} E [e_{n} (λ (T) + γ^{⊤} X (T)) | F_{t}] \\ = \sum_{n = 0}^{\infty} f_{n} u_{n + 1}^{⊤} E [H_{n} (λ (T) + γ^{⊤} X (T)) | F_{t}] \end{matrix}

By the translation of the monomial basis in Lemma 1, we find that

E [H_{n} (λ (T) + γ^{⊤} X (T)) | F_{t}] = C_{n} Λ_{n} (λ (T)) C_{n}^{- 1} E [H_{n} (γ^{⊤} X (T)) | F_{t}]

Furthermore, invoking Lemma 2 gives

E [H_{n} (γ^{⊤} X (T)) | F_{t}] = Γ_{n, d} E [H_{n, d} (X (T)) | F_{t}] .

Finally, applying the polynomial property of X, we find

E [H_{n, d} (X (T)) | F_{t}] = exp (G_{n, d} (T - t)) H_{n, d} (X (t))

The result follows. □

We remark in passing that the above Proposition could also have been developed for other functions f than the one appearing for the CDD-temperature forwards. In fact, any function

f \in L_{w}^{2}

would do, with the only difference that the coefficients

f_{n}

appearing in the expression for F in Proposition 3 would change (as they depend explicitly on f, of course). For example, using

f (x) = exp (x)

, which defines a function in

L_{w}^{2}

, Proposition 3 provides an alternative forward price series expression to Proposition 2 for geometric spot price models.

To efficiently compute the CDD-temperature forward price by exploiting the polynomial structure of X, we truncate the infinite sum. All the matrices and vectors involved are explicitly given, except the coefficient functions

f_{n}

. We recall these to be defined as

f_{n} : = \int_{R} max (x - c, 0) e_{n} (x) w (x) d x = E [max (Y - c, 0) e_{n} (Y)]

for Y being standard normally distributed. We can compute these coefficients once for a given function

f \in L_{w}^{2}

, as they are independent of the polynomial process X.

Remark 4.

In the market for temperature forwards, the contracts are settled over a pre-specified period of time. In that case, a CDD-temperature forward is

F (t, T_{1}, T_{2}) = \sum_{n = 0}^{\infty} f_{n} u_{n + 1}^{⊤} C_{n} (\sum_{T = T_{1}}^{T_{2}} Λ_{n} (λ (T)) C_{n}^{- 1} Γ_{n, d} exp (G_{n, d} (T - t))) H_{n, d} (X (t))

after appealing to the Fubini–Tonelli theorem to commute sums.

If temperature follows a CARMA-dynamics driven by a Brownian motion, then the dimension d will indicate the autoregressive order. Moreover,

γ^{⊤} X (t)

will be Gaussian and X a d-dimensional Ornstein–Uhlenbeck process, and thus the conditions in Proposition 3 hold. We recall that the temperature dynamics is conveniently modelled by a CAR(3)-process (see [22]), which means that

d = 3

and

γ = u_{1}

, the canonical unit vector in

R^{3}

with 1 in first coordinate and zero otherwise.

Interestingly, Asian options are closely related to the above CARMA-situation by the following argument: consider an Asian option with payoff

max (T^{- 1} \int_{0}^{T} {\tilde{γ}}^{⊤} \tilde{X} (s) d s - c, 0)

at exercise time T. Here,

\tilde{X}

is a d-dimensional polynomial process and

\tilde{γ} \in R^{d}

, with the spot price being

S (t) = {\tilde{γ}}^{⊤} \tilde{X} (t)

(we ignore seasonality here in this short discussion). Let now X be the process in

R^{d + 1}

defined as

X = (\tilde{X}, X_{d + 1})

, where

d X_{d + 1} (t) = {\tilde{γ}}^{⊤} \tilde{X} (t) d t .

It follows that X is a polynomial process, and we have that the Asian option payoff can be written as

T^{- 1} max (γ^{⊤} X (T) - T c, 0)

with

γ = u_{d + 1}

, the canonical unit vector in

R^{d + 1}

with one in the last coordinate and zero otherwise. In particular, assuming that

\tilde{X}

is a multivariate Gaussian Ornstein–Uhlenbeck process, we will have that X is a Gaussian Ornstein–Uhlenbeck process and we find ourselves in a situation which is closely resembling the forward price of a CDD-temperature contract analysed above.

4.3. A General Polynomial Approach to Forward Pricing

In this subsection, we take a general perspective on forward pricing, providing a unifying expression for the forward price in markets with a polynomially based “spot”-process. The approach requires some additional conditions on the polynomial process, but on the other hand gives an attractive treatment of options on forwards, a topic which is analysed in Section 5.

Suppose that the “spot price” dynamics is given by

S (t) = g (X (t); λ (t))

(21)

for some measurable function

g : R^{d} \times R \to R

, seasonality function

λ

and X being a d-dimensional polynomial process. Examples of relevance can be

g (x; λ) : = ξ (λ + γ^{⊤} x)

for

γ \in R^{d}

,

λ \in R

and

ξ

being one of the following functions:

ξ (x) = x

(arithmetic spot model),

ξ (x) = exp (x)

(geometric spot model) or

ξ (x) = max (x - c, 0)

(spot for an exotic forward such as temperature futures). Our aim is to compute the forward price, defined as

F (t, T) = E [g (X (T); λ (T)) | F_{t}]

(22)

To achieve this goal, we employ a multivariate generalisation of the space

L_{w}^{2}

along with an integrability assumption on the conditional probability distribution of

X (T)

given

F_{t}

. In fact, without losing any generality for practical purposes in commodity and energy markets, we assume that X is also a Markovian process. (Note that non-Markovian polynomial jump diffusion processes exist, see., e.g., [8] (Page 71).)

Let us start by introducing a multi-dimensional generalisation of the space

L_{w}^{2}

. To this end, let

ρ

be a probability density function on

R^{d}

, and for

d \in N

, denote by

L_{ρ, d}^{2}

the Hilbert space of real-valued functions on

R^{d}

for which

{| g |}_{ρ, d}^{2} : = \int_{R^{d}} g^{2} (x) ρ (x) d x

with inner product

{〈 g, h 〉}_{ρ, d} : = \int_{R^{d}} g (x) h (x) ρ (x) d x

Assume further that there exists an ONB for

L_{ρ, d}^{2}

of polynomials, given by

{(v_{n_{d}})}_{n_{d} \in N_{0}^{d}}

using the multi-index notation

n_{d} : = (n_{1}, \dots, n_{d})

. We use the notation

| n_{d} | = n_{1} + \dots + n_{d}

for the order of the multi-index, where it is supposed that

v_{n_{d}} \in {Pol}_{| n_{d} |} (R^{d})

. Furthermore,

{(v_{n_{d}})}_{| n_{d} | \leq N}

is a basis of polynomials of order N, which we use as

H_{N, d} (x)

. Ranking the basis functions

v_{n_{d}}

according to their polynomial order is convenient and natural when doing approximations in practical applications of this theory.

Next, denote by

ϕ (x, d y; t, T)

the transition probability distribution on

R^{d}

of

X (T)

given

X (t) = x

. Following Filipović and Larsson [8] (Sect. 7), introduce the likelihood ratio as the function

ℓ (x, y; t, T)

such that

ϕ (x, d y; t, T) = ℓ (x, y; t, T) ρ (y) d y

(23)

We assume that such a likelihood ratio of

ϕ

with respect to

ρ

exists. In the next theorem, we state a general series representation in terms of polynomials for the forward price along with a computationally convenient truncation.

Theorem 2.

Assume that

g (\cdot; λ (T)) \in L_{ρ, d}^{2}

and

ℓ (x, \cdot; t, T) \in L_{ρ, d}^{2}

for any

0 \leq t \leq T < \infty

and

x \in R

, where g and ℓ are defined, respectively, in (21) and (23). Then, we have that

F^{N} (t, T) \to F (t, T)

(pointwise) when

N \to \infty

, where F is the forward price in (22) with representation

F (t, T) = \sum_{n \in N_{0}^{d}} g_{n_{d}} (λ (T)) ℓ_{n_{d}} (X (t); t, T)

while for any

N \in N

,

F^{N} (t, T) = \sum_{n \in N_{0}^{d}, | n_{d} | \leq N} g_{n_{d}} (λ (T)) ℓ_{n_{d}} (X (t); t, T) .

Here,

g_{n_{d}} (λ (T)) = \int_{R^{d}} g (y; λ (T)) v_{n_{d}} (y) ρ (y) d y

and

ℓ_{n_{d}} (x; t, T) = u_{n_{d}}^{⊤} exp (G_{| n_{d} |, d} (T - t)) H_{| n_{d} |, d} (x)

with

u_{n_{d}} \in R^{K (| n_{d} |, d) + 1}

is such that

v_{n_{d}} (x) = u_{n_{d}}^{⊤} H_{| n_{d} |, d} (x)

. We recall

K (n, d) = dim {Pol}_{n} (R^{d}) - 1

, and G given in (2).

Proof.

Notice first by the Markovian property of X that

F (t, T) = f (X (t); t, T)

, where

f (x; t, T) : = E [g (X (T); λ (T)) | X (t) = x] .

We find from the assumptions

g (\cdot; λ (T)), ℓ (x, \cdot; t, T) \in L_{ρ, d}^{2}

that

\begin{matrix} f (x; t, T) & = \int_{R^{d}} g (y; λ (T)) ϕ (x, d y; t, T) \\ = \int_{R^{d}} g (y; λ (T)) ℓ (x, y; t, T) ρ (y) d y \\ = {〈 g (\cdot; λ (T)), ℓ (x, \cdot; t, T) 〉}_{ρ, d} \end{matrix}

Therefore, by Parseval’s identity,

f (x; t, T) = \sum_{n_{d} \in N_{0}^{d}} g_{n_{d}} ℓ_{n_{d}} (x; t, T)

where

g_{n_{d}} (λ (T)) : = {〈 g (\cdot, λ (T)), v_{n_{d}} 〉}_{ρ, d}

and

ℓ_{n_{d}} (x; t, T) : = {〈 ℓ (x, \cdot; t, T), v_{n_{d}} 〉}_{ρ, d}

are the coefficients in the ONB representation of

g (\cdot; λ (T))

and

ℓ (x, \cdot; t, T)

in

L_{ρ, d}^{2}

, respectively. Tracing back the definitions, we find

\begin{matrix} ℓ_{n_{d}} (x; t, T) & = \int_{R^{d}} v_{n_{d}} (y) ϕ (x, d y; t, T) = E [v_{n_{d}} (X (T)) | X (t) = x] . \end{matrix}

By assumption on the polynomial basis,

v_{n_{d}} \in {Pol}_{| n_{d} |} (R^{d})

. Thus, there is a vector with length equal to the dimension of

{Pol}_{| n_{d} |} (R^{d})

such that

v_{n_{d}} (x) = u_{n_{d}}^{⊤} H_{| n_{d} |, d} (x)

. We then conclude the desired form,

ℓ_{n_{d}} (x; t, T) = u_{n_{d}}^{⊤} E [H_{| n_{d} |, d} (X (T)) | X (t) = x] = u_{n_{d}}^{⊤} exp (G_{| n_{d} |, d} (T - t)) H_{| n_{d} |, d} (x)

after appealing to the polynomial property of X. This proves the representation of

F (t, T)

.

Define for each

N \in N

the approximation

f^{N} (x; t, T) : = {〈 g (\cdot; λ (T)), ℓ^{N} (x, \cdot; t, T) 〉}_{ρ, d} = \sum_{n_{d} \in N_{0}^{d}, | n_{d} | \leq N} g_{n_{d}} (λ (T)) ℓ_{n_{d}} (x; t, T)

with the notation

ℓ^{N} (x, \cdot; t, T) : = \sum_{n_{d} \in N_{0}^{d}, | n_{d} | \leq N} ℓ_{n_{d}} (x; t, T) v_{n_{d}} (\cdot) .

We observe that

ℓ^{N} (x, \cdot; t, T)

is nothing but

y \mapsto ℓ (x, y; t, T)

projected down on the finite dimensional subspace of

L_{ρ, d}^{2}

spanned by

{(v_{n_{d}})}_{n_{d} \in N_{0}^{d}, | n_{d} | \leq N}

. This gives us

F^{N} (t, T)

.

Notice that by Parseval’s identity,

{\infty > | ℓ (x, \cdot; t, T) |}_{ρ, d}^{2} = \sum_{n_{d} \in N_{0}^{d}} ℓ_{n_{d}}^{2} (x; t, T)

From the very definitions of f and

f^{N}

, we find by the Cauchy–Schwarz inequality and Parseval’s identity,

\begin{matrix} | f (x; τ, T) - f^{N} {(x; τ, T) |}^{2} & = | {〈 g, ℓ (x, \cdot; τ, T) - ℓ^{N} (x, \cdot; τ, T) 〉}_{ρ, d} |^{2} \\ \leq {| g |}_{ρ, d}^{2} {| ℓ (x, \cdot; τ, T) - ℓ^{N} (x, \cdot; τ, T) |}_{ρ, d}^{2} \\ \leq {| g |}_{ρ, d}^{2} \sum_{n_{d} \in N_{0}^{d}, | n_{d} | > N} ℓ_{n_{d}}^{2} (x; t, T) \end{matrix}

In conclusion,

f^{N} (x; t, T) \to f (x; t, T)

for every

x \in R^{d}

when

N \to \infty

. The proof is complete. □

We notice that the dependency on the seasonality component is merged into the coefficients

g_{n_{d}} (λ (T))

and is as such not material in the analysis above. We include it simply because seasonality is present in relevant models, and we prefer to have it explicit. Additionally, it highlights a difference with the other polynomial expansions which we present in this section. Furthermore, we observe that we may compute the coefficients

g_{n_{d}} (λ (T))

by numerical integration methods, for example Gaussian quadrature or Monte Carlo simulation. Indeed, we have

g_{n_{d}} (λ (T)) = E [g (Z; λ (T)) v_{n_{d}} (Z)]

where Z is a d-dimensional random variable with probability density

ρ

.

Remark 5.

In the case X is not a polynomial process, we see by inspection of the proof of Theorem 2 that we still have an interesting representation of the forward price in terms of the polynomial moments of

X (T)

conditional on

X (t)

. Indeed, removing the polynomial property, we see that all conclusions in the theorem holds, except that

\begin{matrix} ℓ_{n_{d}} (x; t, T) & = E [v_{n_{d}} (X (T)) | X (t) = x], \end{matrix}

which will not be explicit in terms of polynomials of x. Of course, we still need the regularity assumptions of g and the likelihood ratio ℓ to hold. We can approximate the forward prices by polynomial moments of the process up to a certain order.

The main assumption in our general approach to forward pricing is the existence of a density

ρ

admitting a polynomial basis for

L_{ρ, d}^{2}

, such that there is likelihood ratio function being an element of this space. This problem is classical, and has a long history in probability and physics, where we refer to the works of Asmussen, Goffard and Laub [39] and Eggers [40] for some recent applications and studies. One thinks of

ρ

as the reference measure, and for a target distribution

ϕ

the goal is to have a Gram–Charlier series with efficiently computable polynomials. Following the discussion in Asmussen et al. [39], if

ρ

in Dimension 1 has all moments finite, there exists an orthogonal sequence of polynomials, which, moreover, defines a basis in

L_{ρ, 1}^{2}

if

ρ

has finite exponential moment. One can easily build up multivariate reference measures in general dimensions d by tensorising. For example, we may define

ρ (x) : = w^{\otimes d} (x) : = w (x_{1}) \dots w (x_{d})

. This will provide a d-dimensional version of the space

L_{w}^{2}

based on Hermite polynomials. In Rahman [41], a general multivariate basis of Hermite polynomials are defined appealing to the Rodrigues formula, i.e., based on the derivatives of the multivariate Gaussian distribution function with mean zero and general covariance. A special case of this, choosing the covariance matrix to be the identity, leads back to the definition of

ρ (x) = w^{\otimes d} (x)

. Another example could be a reference measure in

d = 1

defined by the gamma-distribution (see the work of Asmussen et al. [39], where Laguerre polynomials appear).

We discuss the case

ρ (x) = w^{\otimes d} (x)

in some more detail. We start by introducing a multi-dimensional version of

L_{w}^{2}

. To this end, for

d \in N

, denote by

L_{w, d}^{2}

the Hilbert space of real-valued functions on

R^{d}

for which

{| g |}_{w, d}^{2} : = \int_{R^{d}} g^{2} (x) w^{\otimes d} (x) d x

with inner product

{〈 g, h 〉}_{w, d} : = \int_{R^{d}} g (x) h (x) w^{\otimes d} (x) d x

where we recall

w^{\otimes d} (x) : = w (x_{1}) \dots w (x_{d})

. An ONB for

L_{w, d}^{2}

is given by

{(e_{n_{d}})}_{n_{d} \in N_{0}^{d}}

using the multi-index notation

n_{d} : = (n_{1}, \dots, n_{d})

, and

e_{n_{d}} (x) : = e_{n_{1}} \otimes \dots \otimes e_{n_{d}} (x) = e_{n_{1}} (x_{1}) \dots e_{n_{d}} (x_{d})

. Here, we recall

{(e_{n})}_{n \in N_{0}}

to be an ONB of

L_{w}^{2}

.

Let us look at some particular cases of the function g in the case of

L_{w, d}^{2}

, which are of relevance to commodity and energy markets. First, in an exponential spot price model, we have

g (x; λ) = exp (λ + γ^{⊤} x)

for

γ, x \in R^{d}

and

λ \in R

. Since

R ∋ y \mapsto exp (2 γ_{i} y) w (y)

is integrable, we see that

g (\cdot; λ) \in L_{w, d}^{2}

. We compute the coefficients

g_{n_{d}} (λ (T))

:

\begin{matrix} g_{n_{d}} (λ (T)) & = \int_{R^{d}} exp (λ (T) + γ^{⊤} x) e_{n_{d}} (x) w^{\otimes d} (x) d x \\ = e^{λ (T)} \int_{R^{d}} e^{γ_{1} x_{1}} \dots e^{γ_{d} x_{d}} e_{n_{1}} (x_{1}) \dots e_{n_{d}} (x_{d}) w (x_{1}) \dots w (x_{d}) d x_{1} \dots d_{x_{d}} \\ = e^{λ (T)} {〈 e^{γ_{1} \cdot}, e_{n_{1}} 〉}_{w} \dots {〈 e^{γ_{d} \cdot}, e_{n_{d}} 〉}_{w} \end{matrix}

In the arithmetic case, the spot takes the form

g (x, λ) = λ + γ^{⊤} x,

with again

x, γ \in R^{d}

and

λ \in R

. Since

R ∋ y \mapsto {(γ_{i} y)}^{2} w (y)

is integrable and

g (\cdot, λ) \in L_{w, d}^{2}

. We find the coefficients in this arithmetic case as follows:

\begin{matrix} g_{n_{d}} (λ (T)) & = \int_{R^{d}} (λ (T) + γ^{⊤} x) e_{n_{d}} (x) w^{\otimes d} (x) d x \\ = λ (T) \int_{R^{d}} e_{n_{d}} (x) w^{\otimes d} (x) d x \\ + \sum_{i = 1}^{d} γ_{i} \int_{R^{d}} e_{n_{1}} (x_{1}) w (x_{1}) d x_{1} \dots \int_{R^{d}} e_{n_{i - 1}} (x_{i - 1}) w (x_{i - 1}) d x_{i - 1} \\ \times \int_{R} x_{i} e_{n_{i}} (x_{i}) w (x_{i}) d x_{i} \times \int_{R^{d}} e_{n_{i + 1}} (x_{i + 1}) w (x_{i + 1}) d x_{i + 1} \\ \dots \int_{R} e_{n_{d}} (x_{d}) w (x_{d}) d x_{d} \\ = λ (T) {〈 e_{0}, e_{n_{1}} 〉}_{w} \dots {〈 e_{0}, e_{n_{d}} 〉}_{w} \\ + \sum_{i = 1}^{d} γ_{i} {〈 e_{0}, e_{n_{1}} 〉}_{w} \dots {〈 e_{0}, e_{n_{i - 1}} 〉}_{w} {〈 e_{1}, e_{n_{i}} 〉}_{w} {〈 e_{0}, e_{n_{i + 1}} 〉}_{w} \dots {〈 e_{0}, e_{n_{d}} 〉}_{w} . \end{matrix}

Now, for

n_{j} \geq 1

it holds that

\int_{R} e_{n_{j}} (y) w (y) d y = {〈 e_{0}, e_{n_{j}} 〉}_{w} = 0

. Thus, the only non-zero terms in the sum above are those where

n_{d} = (0, \dots, 0, 1, 0, \dots, 0)

, where 1 is appearing in coordinate i, in which case

g_{n_{d}} (λ (T)) = γ_{i}

. All other

n_{d}

will give

g_{n_{d}} (λ (T)) = 0

, except

n_{d} = (0, \dots, 0)

where we get

g_{n_{d}} (λ (T)) = λ (T)

. This is a very unsurprising result, of course.

Next, let us consider the case of a CDD-temperature forward or a floor electricity forward, for which we have that

g (x; λ) = ξ (λ + γ^{⊤} x)

for

ξ (z) = max (z - c, 0)

. As the max-function grows at most linearly, we have

0 \leq g (x; λ) \leq | λ | + | γ^{⊤} x |

, and it follows that

g (\cdot; λ) \in L_{w, d}^{2}

. We may represent the coefficients as

g_{n_{d}} (λ (T)) = E [max (λ (T) - c + γ^{⊤} Z, 0) e_{n_{d}} (Z)]

for

Z \sim N (0, I)

with I being the

d \times d

identity matrix. By iterated conditional expectation, conditioning on

Z_{i}

,

i = 1, \dots, d - 1

, we can define for R being a standard normal random variable

g^{d} (z_{1}, \dots, z_{d - 1}) : = E [max (λ (T) - c + γ_{1} z_{1} + \dots γ_{d - 1} z_{d - 1} + γ_{d} R, 0) e_{n_{d}} (R)]

and iteratively backwards

i = d - 1, d - 2, \dots, 1

,

g^{i} (z_{1}, \dots, z_{i - 1}) : = E [g^{i + 1} (z_{1}, \dots, z_{i - 1}, R) e_{n_{i}} (R)]

yielding

g_{n_{d}} (λ (T)) = g^{1}

.

A more detailed discussion of the condition on the likelihood ratio is in place. We first recall from Ackerer et al. [9] that the Jacobi volatility model has a likelihood ratio with respect to the Gaussian density which satisfies the condition of square integrability. Hence, the Jacobi volatility model allows itself to a series expansion in terms of the Hermite polynomials, as conducted in detail by Ackerer et al. [9]. Many of the interesting polynomial models are such that

{X (T) |}_{F_{t}}

is Gaussian. For example, we have the two-factor models of Lucia and Schwartz or CARMA-models driven by Brownian motion. This results in a conditional distribution function

ϕ (x, d y; t, T)

being a Gaussian distribution with mean

μ \in R^{d}

and covariance matrix

V \in R^{d \times d}

. Here, we collapse the notation to make our discussion more transparent. Hence, we find that the likelihood ratio is

\begin{matrix} ln ℓ (x, y; t, T) & \sim - \frac{1}{2} {(y - μ)}^{⊤} V^{- 1} (y - μ) + \frac{1}{2} y^{⊤} y \\ = - \frac{1}{2} y^{⊤} (V^{- 1} - I) y + (μ^{⊤} V^{- 1}) y - \frac{1}{2} μ^{⊤} V^{- 1} μ . \end{matrix}

Here, I is the

d \times d

identity matrix. It is evident that the function

y \mapsto ℓ (x, y; t, T) \in L_{w, d}^{2}

if and only if

- (V^{- 1} - I) - \frac{1}{2} I < 0

, or, equivalently,

V < 2 I

. This is not always true, as we can have two-factor models with independent Brownian motions having variance each strictly bigger that 2. Then, V is a diagonal matrix with variances on the diagonal which is dominating

2 I

, and the required integrability of the likelihood ratio fails.

In such cases, we can re-scale the polynomial process X. To this end, let C be some

d \times d

matrix such that

C V C^{⊤} < 2 I

. If we have available such a matrix, we can define a new stochastic process

Y (t) : = C X (t)

. Since any matrix transform of a polynomial process again is a polynomial process, Y is a polynomial process. If further C is invertible, then

X (t) = C^{- 1} Y (t)

, and we have

g (x; λ) = g (C^{- 1} y; λ) .

In Theorem 2, we assume that

g (C^{- 1} \cdot; λ (T)) \in L_{w, d}^{2}

. Furthermore, for the polynomial process Y we find that the likelihood ratio function is (as a matrix transformation of the Gaussian variable

{X (T) |}_{F_{t}}

),

\begin{matrix} ln ℓ (x, y; t, T) & \sim - \frac{1}{2} {(y - C μ)}^{⊤} {(C V C^{⊤})}^{- 1} (y - C μ) + \frac{1}{2} y^{⊤} y \\ = - \frac{1}{2} y^{⊤} ({(C V C^{⊤})}^{- 1} - I) y + (μ^{⊤} C^{⊤} {(C V C^{⊤})}^{- 1}) y \\ - \frac{1}{2} μ^{⊤} C^{⊤} {(C V C^{⊤})}^{- 1} C μ \end{matrix}

Hence, we have that

y \mapsto ℓ (x, y; t, T) \in L_{w, d}^{2}

whenever C is such that

C V C^{⊤} < 2 I

.

Here is an example of a re-scaling: Let

{X (T) |}_{F_{t}}

be bivariate Gaussian with covariance matrix

V = [\begin{matrix} σ_{1}^{2} & σ_{1} σ_{2} ρ \\ σ_{1} σ_{2} ρ & σ_{2}^{2} \end{matrix}]

where

σ_{i}, σ_{2}

are two strictly positive constants (being the marginal standard deviations) and

- 1 < ρ < 1

(which is the correlation). For example, this is the situation with the two-factor model of Lucia and Schwartz, or a CARMA-model in

d = 2

. Observe that a diagonalisation of V is given by

V = [\begin{matrix} 1 & 0 \\ \frac{σ_{2}}{σ_{1}} ρ & \sqrt{1 - ρ^{2}} \end{matrix}] [\begin{matrix} σ_{1}^{2} & 0 \\ 0 & σ_{2}^{2} \end{matrix}] [\begin{matrix} 1 & \frac{σ_{2}}{σ_{1}} ρ \\ 0 & \sqrt{1 - ρ^{2}} \end{matrix}]

Let, for some positive constant

c < 2

C : = \frac{\sqrt{c}}{max (σ_{1}, σ_{2})} {[\begin{matrix} 1 & 0 \\ \frac{σ_{2}}{σ_{1}} ρ & \sqrt{1 - ρ^{2}} \end{matrix}]}^{- 1}

Then, we find

C V C^{⊤} = \frac{c}{max {(σ_{1}, σ_{2})}^{2}} [\begin{matrix} σ_{1}^{2} & 0 \\ 0 & σ_{2}^{2} \end{matrix}] < 2 I

since

c < 2

. In conclusion, we find a scaling C of the original polynomial process, for which the covariance matrix can be dominated by 2 times the identity. Then, the likelihood ratio has the desired integrability, but we must adjust slightly the integrability condition on g. For most interesting functions g, this is not any added restriction, as for example the cases considered above.

Rather than re-scaling, we could use the multivariate Hermite polynomials introduced by Rahman [41] for a sufficiently big covariance matrix. In the above case of re-scaling, we need to have some knowledge of the matrix C before doing the computations. However, the advantage then is that one can simply apply the standard one-dimensional Hermite polynomials as basis. An approach using the multivariate Hermite polynomials requires knowledge of a suitable covariance matrix, which essentially is such that the target density

ϕ

can be dominated by this. The multivariate Hermite polynomials can then be derived, a task that must be tailor-made to the choice of matrix.

Next, we consider a case with non-Gaussian reference probability

ρ

, focussing on the one-dimensional situation. We recall that factor models with Ornstein–Uhlenbeck dynamics driven by jump processes are relevant for power price and wind speed modelling. In particular, Ornstein–Uhlenbeck processes with exponential jump processes leading to invariant

Γ

-distributions are applied (recall discussion from [4,26,28] in Section 4, say). In addition, we have CIR-processes as a model for wind speeds as we recall from [27]. The CIR-process is skewed

χ^{2}

-distributed at each time instant, a distribution which is closely related to the

Γ

-distribution. Let now

ξ

be the density of the

Γ

-distribution with scale

r > 0

and shape

m > 0

, given as

ξ (y) = \frac{1}{Γ (r) m^{r}} y^{r - 1} e^{- y / m} .

Suppose we have a target distribution

ϕ

which behaves as

y^{s - 1}

,

s > 0

for

y \sim 0

and

e^{- y / k}

for

y \sim \infty

, then the likelihood ratio will be

y^{s - 1} / y^{r - 1}

close to zero, and

exp (- (1 / k - 1 / m) y)

for

y \sim \infty

. However, integrating the square of the likelihood function against

ξ

, yields finiteness whenever

s > r / 2

and

k < 2 m

. Such conditions were found by Asmussen et al. [39] as well. Thus, tuning the m to be sufficiently large and r to be sufficiently small, we can obtain a target

Γ

-distribution such that the likelihood ratio is square integrable with respect to

ξ

. Furthermore, as is well-known, the basis of orthogonal polynomials for

L_{ξ}^{2} : = L^{2} (R_{+}, ξ (y) d y)

is the generalised Laguerre polynomials (see [42]). If we have a two-factor model, with one Gaussian and one exponential jump Ornstein–Uhlenbeck process, we can consider the tensorised space

L_{ρ, 2}^{2}

with

ρ (x) = w \otimes ξ (x_{1}, x_{2}) = w (x_{1}) ξ (x_{2})

and the canonically generated polynomials from the respective marginal densities.

5. Pricing of Options on Forwards

This section is concerned with the problem of pricing options on forwards in the framework of polynomial processes.

5.1. Options on Plain-Vanilla Forwards

Consider a European option written on a plan-vanilla forward with payoff

ζ (F (τ, T))

at time

τ \leq T

for a payoff function

ζ

, with the forward price

F (t, T)

given as in Proposition 1; that is, for some

d, n \in N

, we have

F (t, T) = h {(t, T)}^{⊤} H_{n, d} (X (t))

(24)

where

h {(t, T)}^{⊤} = p_{n} {(λ (T))}^{⊤} Γ_{n, d} exp (G_{n, d} (T - t)) .

(25)

In the formulation of Proposition 1,

H_{n, d} (x)

is some basis of the nth order polynomials on

R^{d}

. It is convenient, however, for the purpose of option pricing, to turn to the polynomial ONB

{(v_{n_{d}} (x))}_{n_{d} \in N_{0}^{d}}

of

L_{ρ, d}^{2}

, as used in Section 4.3 above. We fix

H_{n, d} (x)

to be the basis

{(v_{n_{d}} (x))}_{| n_{d} | \leq n}

from now on. Furthermore, we suppose that X is a Markovian process.

Let the price of the option (with risk-free interest rate set to zero) at time

t \leq τ

be

P (t, τ, T) = E [ζ (F (τ, T)) | F_{t}]

(26)

where we assume

ζ (F (τ, T)) \in L^{1} (P)

. The following result is essentially a repetition of Theorem 2 and is therefore formulated as a corollary.

Corollary 1.

Assume for all

0 \leq t \leq T

that

R^{d} ∋ x \mapsto ζ (h {(t, T)}^{⊤} H_{n, d} (x)) \in L_{ρ, d}^{2}

with h given as in (25). Let F be given in (24) with X being a polynomial process on

R^{d}

for which the likelihood ratio function defined in (23) satisfies

R^{d} ∋ y \mapsto ℓ (x, y; t, T) \in L_{ρ, d}^{2}

. Then, we have that

P^{N} (t, τ, T) \to P (t, τ, T)

(pointwise) when

N \to \infty

, where

P (t, τ, T)

is the option price in (26) with representation

P (t, τ, T) = \sum_{n \in N_{0}^{d}} ζ_{n_{d}} (τ, T) ℓ_{n_{d}} (X (t); t, τ)

while for any

N \in N

,

P^{N} (t, τ, T) = \sum_{n \in N_{0}^{d}, | n_{d} | \leq N} ζ_{n_{d}} (τ, T) ℓ_{n_{d}} (X (t); t, τ) .

Here,

ζ_{n_{d}} (τ, T) = \int_{R^{d}} ζ (h {(τ, T)}^{⊤} y) v_{n_{d}} (y) w^{\otimes d} (y) d y

and

ℓ_{n_{d}} (x; t, τ) = u_{n_{d}}^{⊤} exp (G_{| n_{d} |, d} (τ - t)) H_{| n_{d} |, d} (x)

with

u_{n_{d}} \in R^{K (| n_{d} |, d) + 1}

is such that

v_{n_{d}} (x) = u_{n_{d}}^{⊤} H_{| n_{d} |, d} (x)

. We recall

K (n, d) = dim {Pol}_{n} (R^{d}) - 1

, and G given in (2).

Proof.

The proof is identical to the argument of Theorem 2, but now using

P (t, τ, T) = f (X (t); t, τ, T)

where

f (x; t, τ, T) : = E [ζ (h {(τ, T)}^{⊤} X (τ)) | X (t) = x]

Notice that

τ

now plays the role of T in the proof of Theorem 2, and that

ζ

is g. We also observe that

ζ (F (τ, T)) \in L^{1} (P)

under the assumptions on g and X. □

If

ζ (z) = max (z - K, 0)

, the payoff function of a call option, we find that

0 \leq ζ (h {(t, T)}^{⊤} H_{n, d} (x)) \leq K + ∥ h (t, T) ∥ ∥ H_{n, d} (x) ∥

, using the notation

∥ \cdot ∥

for the Euclidean 2-norm on, e.g.,

R^{d}

. This shows readily that

ζ (h {(t, T)}^{⊤} \cdot) \in L_{ρ, d}^{2}

as long as

L_{ρ, d}^{2}

supports polynomials of degree n. This is the case if we choose the space

L_{w, d}^{2}

. Another popular class of derivatives in commodity and energy markets is spread options, which we discuss next.

Assume we have two commodity forwards, with respective forward prices

F_{i} (t, T)

,

i = 1, 2

which are given by (24) for two different functions

h_{i} (t, T), i = 1, 2

. Thus, the dynamics of both forward prices are driven by the same polynomial process X. The spread option payoff at time

τ

is

ζ (F_{1} (τ, T_{1}) - F_{2} (τ, T_{2}))

, with

τ \leq min (T_{1}, T_{2})

. Hence, we have potentially two different maturities of the forwards. Notice also that the order n and dimensionality d of

H_{n, d} (x)

are the same for both forwards, which is not a lack of generality as we can extend the dimensionality of both canonically, if necessary. If

x \mapsto ζ ((h_{1} {(t, T_{1})}^{⊤} - h_{2} {(t, T_{2})}^{⊤}) H_{n, d} (x)) \in L_{ρ, d}^{2}

, we can apply Corollary 1 with

h (t, T_{1}, T_{2}) : = h_{1} (t, T_{1}) - h_{2} (t, T_{2})

. A typical example of a spread is

ζ (F_{1} - F_{2}) = max (F_{1} - F_{2}, 0)

, which satisfies the regularity condition for at least

L_{w, d}^{2}

.

It is remarked in passing that we can also treat quanto options in a similar manner. A quanto option pays

ζ_{1} (F_{1} (τ, T_{1})) ζ_{2} (F_{2} (τ, T_{2}))

for two payoff functions

ζ_{1}

and

ζ_{2}

. These may be of the form of two calls, or a call and put, or two puts. Defining

ζ (H_{n, d} (x); t, T) : = ζ_{1} (h_{1} {(t, T_{1})}^{⊤} H_{n, d} (x)) ζ_{2} (h_{2} {(t, T_{2})}^{⊤} H_{n, d} (x))

and assuming

R^{d} ∋ x \mapsto ζ (H_{n, d} (x); t, T) \in L_{ρ, d}^{2}

puts us again in the situation where Corollary 1 may be applied.

Remark 6.

To include forward contracts with delivery period into the pricing framework is straightforward. Since we have

F (t, T_{1}, T_{2}) = \sum_{T = T_{1}}^{T_{2}} F (t, T) = (\sum_{T = T_{1}}^{T_{2}} h {(t, T)}^{⊤}) H_{n, d} (X (t))

we can simply redefine the meaning of

h (t, T)

in order to apply Corollary 1.

From a computational point of view, it is important to notice that the polynomial representation of the price P in Corollary 1 is split into coefficients

ζ_{n_{d}} (τ, T)

and

ℓ_{n_{d}} (x; t, τ)

. The latter family of coefficients,

ℓ_{n_{d}} (x; t, τ)

, is only dependent on the underlying stochastic model and the choice of polynomial basis, and thus can be computed irrespective of the option in question. As noted by Ackerer et al. [9], these coefficients may be relatively costly to compute, but one can do this once and apply the coefficients for the numerical evaluation of different options. The option payoff is encoded in the parameters

ζ_{n_{d}} (τ, T)

.

At this instance, we make some further comments on the numerical implementation and performance of polynomial pricing found in the literature. The already mentioned paper by Ackerer et al. [9] presents three numerical case studies for the Jacobi stochastic volatility model. Pricing a call option for given realistic model parameters for the stock market, they show that the price error is accurate in two decimal points for N chosen between 10 and 15, where the “exact” price is determined using Monte Carlo simulations. They also considered a forward start call option and an Asian option, where dimension is increased due to the payoff structure. In these two cases, the level N must be increased to above 15, which requires rather many coefficients to be computed. In [9], various approaches for the computation of the so-called Fourier coefficients (being

ζ_{n_{d}} (τ, T)

in our notation) are discussed, including recurrence relations based on properties of Hermite polynomials and Gaussian cubature integration. Moreover, there are references to efficient methods for the computation of matrix exponentials, which we encounter in the numerical computation of

ℓ_{n_{d}} (x; t, τ)

for rather high-dimensional matrices

G_{| n_{d} |, d}

. Further numerical studies on polynomial volatility models and option pricing can be found in the work of Ackerer and Filipović [43].

Related numerical studies based on polynomial processes are found in the works of Kleisinger-Yu et al. [11] and Benth and Lavagnini [44]. Kleisinger-Yu et al. [11] computed the quadratic risk minimising hedging strategy of long-term delivery forwards in the power markets. For polynomial models, this entails in rather explicit expressions which are efficiently computed for low-dimensional polynomial processes as stochastic models for the forward price dynamics. Benth and Lavagnini [44] aimed at the computation of correlators, which occur for example in the iterative definition of discretely-sampled path-dependent options or in a series expansion of Fourier-based pricing of options in stochastic volatility models. Polynomial processes allow for explicit matrix representations of the correlators, and numerical case studies show a good performance even for high-dimensional situations compared with Monte Carlo methods.

5.2. A General Polynomial Approach to Option Pricing

In this subsection, we price options on forwards which have price expressions developed in the context of Section 4.3. To this end, for

d \in N

, recall the Hilbert space

L_{ρ, d}^{2}

introduced in the previous section. Consider the “doubled” space

L_{ρ_{2}, 2 d}^{2}

which is the Hilbert space of real-valued functions on

R^{2 d}

for which

{| g |}_{ρ_{2}, 2 d}^{2} : = \int_{R^{2 d}} g^{2} (x, y) ρ_{2} (x, y) d x d y

with inner product

{〈 g, h 〉}_{ρ_{2}, 2 d} : = \int_{R^{2 d}} g (x, y) h (x, y) ρ_{2} (x, y) d x d y

where

ρ_{2} (x, y) : = ρ^{\otimes 2} (x, y) : = ρ (x) ρ (y)

. An ONB for

L_{ρ_{2}, 2 d}^{2}

is given by

{(v_{n_{d}} \otimes v_{k_{d}})}_{(n_{d}, k_{d}) \in N_{0}^{2 d}}

where we recall

{(v_{n_{d}})}_{n_{d} \in N_{0}^{d}}

to be an ONB of

L_{ρ, d}^{2}

. We also recall the notation

g (\cdot; λ (T))

from (21) for the forward price

F (t, T) = E [g (X (T); λ (T)) | F_{t}]

in (22), where X is a polynomial process in

R^{d}

which in addition is assumed to be Markovian. We further recall that we denote by

ϕ (x, d y; t, T)

the probability distribution function of

X (T)

given

X (t) = x \in R^{d}

for

t \geq T

. Under suitable conditions, Theorem 2 provides an expression and approximation of F. Consider an option written on the forward with exercise time

τ \leq T

and payoff function

ζ (F (τ, T))

for some function

ζ : R \to R

such that

ζ (F (τ, T)) \in L^{2} (P)

. The arbitrage-free option price at time

t \leq τ

(for risk-free interest rate set to zero) is given by

P (t, τ, T)

as defined in (26).

Theorem 3.

Assume

g (\cdot; λ (T)) \in L_{ρ, d}^{2}

and suppose that the likelihood function

ℓ (x, y; t, T)

defined in (23) satisfies

R^{d} \times R^{d} ∋ (x, y) \mapsto ℓ (x, y; t, T) \in L_{ρ_{2}, 2 d}^{2}

for any

0 \leq t \leq T < \infty

. If

ζ : R \to R

is Lipschitz continuous and of linear growth, then

P^{N, K} (t, τ, T) \to P (t, τ, T),

where

P (t, τ, T) = \sum_{k_{d} \in N_{0}^{d}} {(ζ \circ f)}_{k_{d}} (τ, T) ℓ_{k_{d}} (X (t); t, τ),

with

{(ζ \circ f)}_{k_{d}} (τ, T) : = {〈 ζ (f (\cdot; τ, T)), v_{k_{d}} 〉}_{ρ, d} .

and, moreover,

P^{N, K} (t, τ, T) = \sum_{k_{d} \in N_{0}^{d}, | k_{d} | \leq K} {(ζ \circ f^{N})}_{k_{d}} (τ, T) ℓ_{k_{d}} (X (t); t, τ)

Here,

ℓ_{k_{d}} (x; t, τ)

is defined in Theorem 2,

f (x; τ, T) = \sum_{n_{d} \in N_{0}^{d}} g_{n_{d}} ℓ_{n_{d}} (x; τ, T)

with

g_{n_{d}} = {〈 g (\cdot; λ (T)), v_{n_{d}} 〉}_{ρ, d}

and

f^{N} (x; τ, T) = \sum_{n_{d} \in N_{0}^{d}, | n_{d} | \leq N} g_{n_{d}} ℓ_{n_{d}} (x; τ, T) .

Proof.

By assumption

ℓ (\cdot, \cdot; t, T) \in L_{ρ_{2}, 2 d}^{2}

and therefore

y \to ℓ (x, y; t, T) \in L_{ρ, d}^{2}

,

a . e ., x \in R^{d}

. Hence, since

g (\cdot; λ (T)) \in L_{w, d}^{2}

we find a polynomial expression of F as given in Theorem 2 along with an approximation

F^{N}

. From the proof of Theorem 2, we also recall the function

f (x; τ, T) = E [g (X (T); λ (T)) | F_{τ}]

along with its representation and series expansions found in the proof of that result.

Let us show that the map

x \mapsto f (x; τ, T)

belongs to

L_{ρ, d}^{2}

: indeed, by definition of

f (x; τ, T)

, we find from the Cauchy–Schwarz inequality,

\begin{matrix} {| f (\cdot; τ, T) |}_{ρ, d}^{2} & = \int_{R^{d}} {| {〈 g (\cdot; λ (T)), ℓ (x, \cdot; τ, T) 〉}_{ρ, d} |}^{2} ρ (x) d x \\ \leq {| g |}_{ρ, d}^{2} \int_{R^{d}} \int_{R^{d}} ℓ^{2} (x, y; τ, T) ρ (y) ρ (x) d y d x \\ = {| g |}_{ρ, d}^{2} {| ℓ (\cdot, \cdot; τ, T) |}_{ρ_{2}, 2 d}^{2} \end{matrix}

which is finite by the assumption on the likelihood ratio function. It follows that

f (\cdot; τ, T) \in L_{ρ, d}^{2}

. Moreover, from Theorem 2, we find that

f^{N} (\cdot; τ, T) \to f (\cdot; τ, T)

in

L_{ρ, d}^{2}

when

N \to \infty

.

By assumption,

ζ

has linear growth. Hence, for some constant

k > 0

, it follows from

f (\cdot; τ, T) \in L_{ρ, d}^{2}

that

\int_{R^{d}} ζ {(f (x; τ, T))}^{2} ρ (x) d x \leq k \int_{R^{d}} (1 + f^{2} (x; τ, T)) ρ (x) d x < \infty .

In other words,

x \mapsto ζ (f (x; τ, T)) \in L_{ρ, d}^{2}

. Hence,

\begin{matrix} c (x; t, τ, T) : & = E [ζ (f (X (τ); τ, T)) | X (t) = x] \\ = \int_{R^{d}} ζ (f (y; τ, T)) ϕ (x, d y; t, τ) \\ = {〈 ζ (f (\cdot; τ, T)), ℓ (x, \cdot; t, τ) 〉}_{ρ, d} \\ = \sum_{k_{d} \in N_{0}^{d}} {(ζ \circ f)}_{k_{d}} (τ, T) ℓ_{k_{d}} (x; t, τ) \end{matrix}

where

{(ζ \circ f)}_{k_{d}} (τ, T) : = 〈 ζ {(f (\cdot; τ, T), v_{k_{d}} 〉}_{ρ, d} .

We find

P (t, τ, T) = c (X (t); t, τ, T)

.

We truncate this sum at multi-indices of order up to K and consider the approximation

c^{N, K} (x; t, τ, T) : = {〈 ζ (f^{N} (\cdot; τ, T)), ℓ^{K} (x, \cdot; t, τ) 〉}_{ρ, d} .

By the triangle inequality, after subtracting and adding

{〈 ζ (f (\cdot; τ, T)), ℓ^{K} (x, \cdot; t, τ) 〉}_{ρ, d}

, we find from the Cauchy–Schwarz inequality

\begin{matrix} | c (x; t, τ, T) - c^{N, K} (x; t, τ, T) | & \leq | {〈 ζ (f (\cdot; τ, T)), ℓ (x, \cdot; t, τ) - ℓ^{K} (x, \cdot; t, τ) 〉}_{ρ, d} | \\ + | {〈 ζ (f (\cdot; τ, T)) - ζ (f^{N} (\cdot; τ, T)), ℓ^{K} (x, \cdot; t, τ) 〉}_{ρ, d} | \\ \leq {| ζ (f (\cdot; τ, T)) |}_{ρ, d} {| ℓ (x, \cdot; t, τ) - ℓ^{K} (x, \cdot; t, τ) |}_{ρ, d} \\ + | ζ (f (\cdot; τ, T)) - ζ (f^{N} (\cdot; τ, T)) |_{ρ, d} {| ℓ^{K} (x, \cdot; t, τ) |}_{ρ, d} \\ \leq {| ζ (f (\cdot; τ, T)) |}_{ρ, d} {| ℓ (x, \cdot; t, τ) - ℓ^{K} (x, \cdot; t, τ) |}_{ρ, d} \\ + k | f (\cdot; τ, T) - f^{N} {(\cdot; τ, T) |}_{ρ, d} {| ℓ^{K} (x, \cdot; t, τ) |}_{ρ, d} \end{matrix}

In the last inequality, we appealed to the Lipschitz-continuity of

ζ

(denoting the Lipschitz constant

k > 0

). Recall by the definitions of

ℓ (x, y; t, τ)

and

ℓ^{K} (x, y; t, τ)

and the analysis in the proof of Theorem 2 that

ℓ^{K} (x, \cdot; t, τ) \to ℓ (x, \cdot; t, τ)

in

L_{ρ, d}^{2}

when

K \to \infty

. Moreover,

| ℓ^{K} {(x, \cdot; t, τ) |}_{ρ, d} \leq {| ℓ (x, \cdot; t, τ) |}_{ρ, d}

(indeed, the norm of

ℓ^{K} (x, \cdot; t, τ)

converges to that of

ℓ (x, \cdot; t, τ)

!) To conclude, it remains to recall that

f^{N} (\cdot; τ, T)

converges to

f (\cdot; τ, T)

in

L_{ρ, d}^{2}

, for

N \to \infty

, as shown above. □

To apply Theorem 3 in practice, we need to compute the coefficients

{(ζ \circ f^{N})}_{k_{d}}

for all

k_{d} \in N_{0}^{d}

such that

| k_{d} | \leq K

. We have that

f^{N}

is again a truncated sum, but the coefficients

g_{n_{d}}

of this is available from the computation of forward prices (or approximations thereof). This representation can be used to calculate

{(ζ \circ f^{N})}_{k_{d}}

, which require numerical integration or possibly Monte Carlo simulation by drawing from a random variable distributed according to

ρ

.

Theorem 3 provides us with an approximation of call and put option prices on forwards that again have option-like structures, i.e., an approximation of compound options. As noted above, the options in the temperature market can be viewed as a class of compound options, although we recall that temperature forwards have a measurement (delivery) period which is not allowed for by a direct use of Theorem 3. However, we can easily adjust the above arguments to account for a measurement (delivery) period in the forward contract. In this case, we have that the option price in (26) is given by

P (t, τ, T_{1}, T_{2}) = E [ζ (\sum_{T = T_{1}}^{T_{2}} F (τ, T)) | F_{t}]

To apply the arguments of Theorem 3, we must assume that

\sum_{T = T_{1}}^{T_{2}} g (\cdot; λ (T)) \in L_{ρ, d}^{2}

. If

g (\cdot; λ (T)) \in L_{ρ, d}^{2}

for any T, then this condition holds as

L_{ρ, d}^{2}

is a vector space. Tracing the steps in the proof of Theorem 3 results in

P (t, τ, T_{1}, T_{2}) = \sum_{k_{d} \in N_{0}^{d}} {(ζ \circ \tilde{f})}_{k_{d}} (τ, T_{1}, T_{2}) ℓ_{k_{d}} (X (t); t, τ),

with

{(ζ \circ \tilde{f})}_{k_{d}} (τ, T) : = {〈ζ (\sum_{T = T_{1}}^{T_{2}} f (\cdot; τ, T)), v_{k_{d}}〉}_{ρ, d} .

On the other hand, as long as

ζ

is of bounded linear growth and Lipschitz continuous,

P^{N, K} (t, τ, T_{1}, T_{2}) \to P (t, τ, T_{1}, T_{2})

with

P^{N, K} (t, τ, T_{1}, T_{2}) = \sum_{k_{d} \in N_{0}^{d}, | k_{d} | \leq K} {(ζ \circ {\tilde{f}}^{N})}_{k_{d}} (τ, T) ℓ_{k_{d}} (X (t); t, τ)

and where we find that

{\tilde{f}}^{N} (x; τ, T_{1}, T_{2}) = \sum_{n_{d} \in N_{0}^{d}, | n_{d} | \leq N} g_{n_{d}} \sum_{T = T_{1}}^{T_{2}} ℓ_{n_{d}} (x; τ, T) .

In fact,

{\tilde{f}}^{N} (x; τ, T_{1}, T_{2}) = \sum_{T = T_{1}}^{T_{2}} f^{N} (x; τ, T)

, thus, to approximate the option price on a temperature HDD or CDD futures, we simply add up a finite sum of terms

f^{N} (x; τ, T)

over T when computing the coefficients

{(ζ \circ {\tilde{f}}^{N})}_{k_{d}}

rather than using only one

f^{N} (x; τ, T)

.

The Greeks, or sensitivities with respect to different parameters of the option price, are important for hedging purposes. The so-called “delta”, the derivative of the option price with respect to the present value of the underlying asset, is readily computed in terms of the derivatives of the polynomials

ℓ_{k_{d}} (x; τ, T)

with respect to x. This lowers the polynomial order of

ℓ_{k_{d}} (x; τ, T)

by one, and we have available an explicit series representations and approximation under appropriate conditions. Other Greeks, for example the sensitivity with respect to the volatility, can also be represented as derivatives of these polynomials as the volatility will be inherent in the specification of X. This further emphasises the attractiveness of polynomial models and their expressions of option prices.

From a computational perspective, several challenging issues arise. Focussing on temperature futures options, we note above that CARMA-processes are suitable for modelling the temperature dynamics. This calls for higher-dimensional models, i.e., it is natural to have the dimension d to be around 3 or even higher. If we were to specify the seasonality also as a polynomial process, we would reach a much higher dimensionality of the underlying polynomial dynamics. As noted above, the numerical studies of Ackerer et al. [9] indicate that one needs the order of re-scaled Hermite polynomials to be about

N = 10

for call options on a stochastic volatility model. In our situation, which is of greater dimensionality, we would expect even higher order to reach a satisfactory convergence. This at the level of approximating the forward price, where, additionally, we need to aggregate up the

ℓ_{n_{d}} (x; τ, T)

over the delivery period. Then, on the next level, we again must approximate a call option, where we also may need high-order polynomials. On the other hand, we know from the theory that the sums are converging and thus the terms must tend to zero despite the involved polynomials occurring in

ℓ_{n_{d}} (x; τ, T)

. The convergence speed for these expressions should be further analysed. These are challenging computational problems which we leave for future studies.

6. Conclusions and Outlook

We derive polynomial series representations for forward prices and options on forwards based on polynomial models of the spot dynamics in commodity markets. Commodity markets have special features such as seasonality and delivery period, as well as exotic payment structures for forwards found in, e.g., power and temperature markets. In a review of the literature on modelling of price risk in energy and commodity, we present many different polynomial models, which we use as motivation and foundation for further derivatives pricing. We also note some empirical facts on polynomial models and issues and challenges concerning numerical applications.

When considering prices based on nonlinear payoff structures, such as those appearing in temperature futures and options on forwards, the successful derivation of a polynomial series expansion rests on the relationship between the generating probability density of a class of polynomials, the probability distribution of the polynomial process and the square-integrability of likelihood function. It is an interesting area of further research to gain a deeper understanding of this connection. It is also interesting to analyse further numerical implementations of some of the derivatives analysed in this paper, where dimensionality becomes a challenge.

Funding

The author acknowledges financial support from the thematic research group SPATUS funded by UiO:Energy, University of Oslo.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

Two anonymous referees are thanked for their careful reading and constructive criticism improving the presentation of the paper.

Conflicts of Interest

The author declares no conflict of interest.

References

Björk, T. Arbitrage Theory in Continuous Time Finance, 3rd ed.; Oxford University Press: Oxford, UK, 2009. [Google Scholar]
Geman, H. Commodities and Commodity Derivatives; John Wiley & Sons: Chichester, UK, 2005. [Google Scholar]
Eydeland, A.; Wolyniec, K. Energy and Power Risk Management; Wiley-Finance; John Wiley & Sons: Hoboken, NJ, USA, 2003. [Google Scholar]
Benth, F.E.; Benth, J.X.; Koekebakker, S. Stochastic Modelling of Electricity and Related Markets; World Scientific: Singapore, 2008. [Google Scholar]
Cuchiero, C. Affine and Polynomial Processes. Ph.D. Thesis, ETH, Zürich, Switzerland, 2011. [Google Scholar]
Filipović, D.; Larsson, M. Polynomial diffusions and applications in finance. Financ. Stoch. 2016, 4, 931–972. [Google Scholar] [CrossRef] [Green Version]
Cuchiero, C.; Keller-Ressel, M.; Teichmann, J. Polynomial processes and their applications to mathematical finance. Financ. Stoch. 2012, 16, 711–740. [Google Scholar] [CrossRef]
Filipović, D.; Larsson, M. Polynomial jump-diffusion models. Stoch. Syst. 2020, 10, 71–97. [Google Scholar] [CrossRef] [Green Version]
Ackerer, D.; Filipović, D.; Pulido, S. The Jacobi stochastic volatility model. Financ. Stoch. 2018, 22, 667–700. [Google Scholar] [CrossRef] [Green Version]
Ware, T. Polynomial processes for power prices. Appl. Math. Financ. 2019, 26, 453–474. [Google Scholar] [CrossRef] [Green Version]
Kleisinger-Yu, X.; Komaric, V.; Larsson, M.; Regez, M. A multi-factor polynomial framework for long-term electricity forwards with delivery period. SIAM J. Finan. Math. 2020, 11, 928–957. [Google Scholar] [CrossRef]
Karatzas, I.; Shreve, S.E. Brownian Motion and Stochastic Calculus, 2nd ed.; Springer: New York, NY, USA, 1991. [Google Scholar]
Schwartz, E.S. The stochastic behaviour of commodity prices: Implications for valuation and hedging. J. Financ. 1997, 52, 923–973. [Google Scholar] [CrossRef]
Gibson, R.; Schwartz, E.S. Stochastic convenience yield and the pricing of oil contingent claims. J. Financ. 1990, 45, 959–976. [Google Scholar] [CrossRef]
ESchwartz, S.; Smith, J.E. Short-term variations and long-term dynamics in commodity prices. Manag. Sci. 2000, 46, 893–911. [Google Scholar] [CrossRef] [Green Version]
Lucia, J.J.; Schwartz, E.S. Electricity prices and power derivatives: Evidence from the Nordic Power Exchange. Rev. Deriv. Res. 2001, 5, 5–50. [Google Scholar] [CrossRef]
Prokopczuk, M. Pricing and hedging in the freight futures market. J. Futures Mark. 2011, 31, 440–464. [Google Scholar] [CrossRef]
Nomikos, N.K.; Soldatos, O. Using affine jump diffusion models for modelling and pricing electricity derivatives. Appl. Math. Financ. 2008, 15, 41–71. [Google Scholar] [CrossRef]
Pilipović, D. Energy Risk–Valuing and Managing Energy Derivatives; McGraw-Hill: New York, NY, USA, 1998. [Google Scholar]
Cartea, A.; Figueroa, M. Pricing in electricity markets: A mean reverting jump diffusion model with seasonality. Appl. Math. Financ. 2005, 12, 313–335. [Google Scholar] [CrossRef] [Green Version]
Mirantes, A.G.; Población, J.; Serna, G. The stochastic seasonal behaviour of natural gas prices. Europ. Financ. Manag. 2012, 18, 410–443. [Google Scholar] [CrossRef]
Benth, F.E.; Benth, J.X. Modeling and Pricing in Financial Markets for Weather Derivatives; World Scientific: Singapore, 2013. [Google Scholar]
Härdle, W.; Lopez-Cabrera, B. The implied market price of weather risk. Appl. Math. Financ. 2012, 18, 59–95. [Google Scholar] [CrossRef]
Swishchuk, A.; Cui, K. Weather derivatives with applications to Canadian data. J. Math. Financ. 2013, 3, 81–95. [Google Scholar] [CrossRef] [Green Version]
Paschke, R.; Prokopczuk, M. Commodity derivatives valuation with autoregressive and moving average components in the price dynamics. J. Bank. Financ. 2010, 34, 2742–2752. [Google Scholar] [CrossRef]
Benth, F.E.; Pircalabu, A. A non-Gaussian Ornstein-Uhlenbeck model for pricing wind power futures. Appl. Math. Financ. 2018, 25, 36–65. [Google Scholar] [CrossRef] [Green Version]
Bensoussan, A.; Brouste, A. Cox-Ingersoll-Ross model for wind speed modeling and forecasting. Wind Energy 2016, 19, 1355–1365. [Google Scholar] [CrossRef]
Benth, F.E.; Rohde, V. On non-negative modeling with CARMA processes. J. Math. Anal. Appl. 2019, 476, 196–214. [Google Scholar] [CrossRef]
Kyriakou, I.; Nomikos, N.K.; Papapostolou, N.C.; Pouliasis, P.K. Affine-structure models and the pricing of energy commodity derivatives. Eur. Financ. Manag. 2016, 22, 853–881. [Google Scholar] [CrossRef] [Green Version]
Jewson, S.; Brix, A. Weather Derivative Valuation; Cambridge University Press: Cambridge, UK, 2005. [Google Scholar]
Hinderks, W.J.; Wagner, A. Pricing German energiewende products: Intraday cap/floor futures. Energy Econ. 2019, 81, 287–296. [Google Scholar] [CrossRef]
Caporin, M.; Preś, J.; Torro, H. Model based Monte Carlo pricing of energy and temperature quanto options. Energy Econ. 2012, 34, 1700–1712. [Google Scholar] [CrossRef] [Green Version]
Weron, R. Market price of risk implied by Asian-style electricity options and futures. Energy Econom. 2008, 30, 1098–1115. [Google Scholar] [CrossRef] [Green Version]
Fusai, G.; Marena, M.; Roncoroni, A. Analytical pricing of discretely monitored Asian-style options: Theory and application to commodity markets. J. Bank. Financ. 2008, 32, 2033–2045. [Google Scholar] [CrossRef] [Green Version]
Kyriakou, I.; Pouliasis, P.K.; Papapostolou, N.C. Jumps and stochastic volatility in crude oil prices and advances in average option pricing. Quant. Financ. 2016, 16, 1859–1873. [Google Scholar] [CrossRef] [Green Version]
Shiraya, K.; Takahashi, A. Pricing average options on commodities. J. Futures Mark. 2011, 31, 407–439. [Google Scholar] [CrossRef]
Filipović, D.; Willems, S. A Term structure model for dividends and interest rates. Math. Financ. 2020, 40, 1461–1496. [Google Scholar] [CrossRef] [Green Version]
Folland, G.B. Analysis–Modern Techniques and Their Applications; John Wiley & Sons: Hoboken, NJ, USA, 1984. [Google Scholar]
Asmussen, S.; Goffard, P.-O.; Laub, P.J. Orthonormal Polynomial Expansions and Lognormal Sum Densities. In Risk and Stochastics: Ragnar Norberg; Barrieu, P., Ed.; World Scientific: Singapore, 2019; Chapter 6; pp. 127–150. [Google Scholar]
Eggers, H.C. From Gram-Chalier series to orthogonal polynomials. Acta Phys. Pol. 2009, 40, 1209–1215. [Google Scholar]
Rahman, S. Wiener-Hermite polynomial expansion for multivariate Gaussian probability measures. J. Math. Anal. Appl. 2017, 454, 303–334. [Google Scholar] [CrossRef] [Green Version]
Szegö, G. Orthogonal Polynomials; American Mathematical Society Colloquium Publications: New York, NY, USA, 1939; Volume XXIII. [Google Scholar]
Ackerer, D.; Filipović, D. Option pricing with orthogonal polynomial expansions. Math. Financ. 2020, 30, 47–84. [Google Scholar] [CrossRef] [Green Version]
Benth, F.E.; Lavagnini, S. Correlators of polynomial processes. arXiv 2020, arXiv:1906:11320. [Google Scholar]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Benth, F.E. Pricing of Commodity and Energy Derivatives for Polynomial Processes. Mathematics 2021, 9, 124. https://doi.org/10.3390/math9020124

AMA Style

Benth FE. Pricing of Commodity and Energy Derivatives for Polynomial Processes. Mathematics. 2021; 9(2):124. https://doi.org/10.3390/math9020124

Chicago/Turabian Style

Benth, Fred Espen. 2021. "Pricing of Commodity and Energy Derivatives for Polynomial Processes" Mathematics 9, no. 2: 124. https://doi.org/10.3390/math9020124

APA Style

Benth, F. E. (2021). Pricing of Commodity and Energy Derivatives for Polynomial Processes. Mathematics, 9(2), 124. https://doi.org/10.3390/math9020124

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Pricing of Commodity and Energy Derivatives for Polynomial Processes

Abstract

1. Introduction

2. Background on Polynomial Jump-Diffusion Processes

3. Forwards and Options on Energy and Commodities in a Polynomial Context

3.1. Commodity “Spot” Dynamics

3.2. Plain-Vanilla Forward Contracts

3.3. Exotic Forward Contracts

3.4. Options in Energy and Commodities

4. Polynomial Processes and Forward Pricing

4.1. Plain-Vanilla Forward Prices

4.2. Exotic Forward Prices

4.3. A General Polynomial Approach to Forward Pricing

5. Pricing of Options on Forwards

5.1. Options on Plain-Vanilla Forwards

5.2. A General Polynomial Approach to Option Pricing

6. Conclusions and Outlook

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI