Least-Squares Estimators of Drift Parameter for Discretely Observed Fractional Ornstein–Uhlenbeck Processes

Kříž, Pavel; Szała, Leszek

doi:10.3390/math8050716

Open AccessArticle

Least-Squares Estimators of Drift Parameter for Discretely Observed Fractional Ornstein–Uhlenbeck Processes

by

Pavel Kříž

^*

and

Leszek Szała

Department of Mathematics, Faculty of Chemical Engineering, University of Chemistry and Technology Prague, 16628 Prague, Czech Republic

^*

Author to whom correspondence should be addressed.

Mathematics 2020, 8(5), 716; https://doi.org/10.3390/math8050716

Submission received: 3 April 2020 / Revised: 20 April 2020 / Accepted: 22 April 2020 / Published: 3 May 2020

(This article belongs to the Special Issue Stochastic Modeling in Biology)

Download

Browse Figures

Versions Notes

Abstract

:

We introduce three new estimators of the drift parameter of a fractional Ornstein–Uhlenbeck process. These estimators are based on modifications of the least-squares procedure utilizing the explicit formula for the process and covariance structure of a fractional Brownian motion. We demonstrate their advantageous properties in the setting of discrete-time observations with fixed mesh size, where they outperform the existing estimators. Numerical experiments by Monte Carlo simulations are conducted to confirm and illustrate theoretical findings. New estimation techniques can improve calibration of models in the form of linear stochastic differential equations driven by a fractional Brownian motion, which are used in diverse fields such as biology, neuroscience, finance and many others.

Keywords:

fractional Brownian motion; Ornstein–Uhlenbeck process; drift parameter estimation

MSC:

60G22; 62M09

1. Introduction

Stochastic models with fractional Brownian motion (fBm) as the noise source have attained increasing popularity recently. This is because fBm is a continuous Gaussian process, increments of which are positively, or negatively correlated if Hurst parameter

H > 1 / 2

, or

H < 1 / 2

, respectively. If

H = 1 / 2

fBm coincides with classical Brownian motion and its increments are independent. The ability of fBm to include memory into the noise process makes it possible to build more realistic models in such diverse fields as biology, neuroscience, hydrology, climatology, finance and many others. The interested reader may check monographs [1,2], or more recent paper [3] and the references therein for more information.

Let

{B_{t}^{(H)}}_{t \in [0, \infty)}

be a fractional Brownian motion with Hurst parameter H defined on an appropriate probability space

{Ω, A, P}

. Fractional Ornstein–Uhlenbeck process (fOU) is the unique solution to the following linear stochastic differential equation

\begin{matrix} d X_{t} = - λ X_{t} d t + σ d B_{t}^{(H)}, X_{0} = x_{0} \in R, t \geq 0, \end{matrix}

(1)

where

λ > 0

is a drift parameter (we consider ergodic case only) and

σ > 0

is a noise intensity (or volatility). Recall that solution to Equation (1) can be expressed by the exact analytical formula:

\begin{matrix} X_{t} = e^{- λ t} x_{0} + \int_{0}^{t} e^{- λ (t - u)} σ d B_{u}^{(H)} . \end{matrix}

(2)

A single realization of the random process

{X_{t} (ω)}_{t \in [0, \infty)}

for a particular

ω \in Ω

is the model for the single real-valued trajectory, part of which is observed. Two examples of such trajectories are given in Figure 1. We assume

H > 1 / 2

throughout this paper so that the fOU exhibits long-range dependence. For an example of application, see a neuronal model based on fOU described in the recent work [4].

The aim of this paper is to study the problem of estimating drift parameter

λ

based on an observation of a single trajectory of a fOU in discrete time instants

t = 0, h, \dots, N h

with fixed mesh size

h > 0

and increasing time horizon

T = N h \to \infty

(long-span asymptotics). Estimating drift parameter of a fOU observed in continuous time has been considered in [5,6], where least-squares estimator (LSE) and ergodic-type estimator are studied. These have advantageous asymptotic properties, they are strongly consistent and, if

H \leq 3 / 4

, also asymptotically normal. Ergodic-type estimator is easy to implement, but it has greater asymptotic variance compared to LSE, requires a priori knowledge of H and

σ

and does not provide acceptable results for non-stationary processes with limited time horizon.

A straightforward discretization of the least-squares estimator for a fOU has been introduced and studied in [7] for

H > 1 / 2

and in [8] for

H < 1 / 2

. For the precise formula, see (8). This estimator is consistent provided that both the time horizon

T = N h \to \infty

and the mesh size

h \to 0

(mixed in-fill and long-span asymptotics). However, it is not consistent when h is fixed and

T \to \infty

. This has led us to construct and study LSE-type estimators that converge in this long-span setting.

An easy modification of the ergodic-type estimator to discrete-time setting with fixed time step was given in [9], see (10) for precise formula, and its strong consistency (assuming

H \geq 1 / 2

) and asymptotic normality (for

1 / 2 \leq H < 3 / 4

) when

N \to \infty

were proved, but with possibly incorrect technique (as pointed out in [10]). Correct proofs of asymptotic normality for

0 < H \leq 3 / 4

and strong consistency for

0 < H < 1

of this estimator were provided (in more general setup) in [10]. Note that the use of this discrete ergodic estimator requires the knowledge of parameter

σ

(in contrast to the estimators of least-squares type introduced below). Other works related to estimating drift parameter for discretely observed fOU include [11,12,13], but this list is by no means complete.

This work contributes to the problem of estimating drift parameter of fOU by introducing three new LSE-type estimators: least-squares estimator from exact solution, asymptotic least-squares estimator and conditional least-squares estimator. These estimators are tailored to discrete-time observations with fixed time step. We provide proofs of their asymptotic properties and identify situations, in which these new estimators perform better than the already known ones. In particular, we eliminate the discretization error (the LSE from exact solution), construct strongly consistent estimators in the long-span regime without assuming in-fill condition (the asymptotic LSE and the conditional LSE), and eliminate the bias in the least-squares procedure caused by autocorrelation of the noise term (the conditional LSE). Especially the conditional LSE demonstrates outstanding performance in all studied scenarios. This suggests that the newly introduced (to our best knowledge) concept of conditioning in the least-squares procedure applied to the models with fractional noise provides a powerful framework for parameter estimation in this type of models. The proof of its strong consistency, presented within this paper, is rather non-trivial and may serve as a starting point for investigation of similar estimators in possibly different settings. A certain disadvantage of the conditional LSE is its complicated implementation (involving optimization procedure), which is in contrast to the other studied estimators.

Let us explain the strength of the conditional least-squares estimator in more detail. Comparison of the two trajectories in Figure 1 demonstrates the effect of different values of

λ

on trajectories of fOU. In particular, it affects the speed of exponential decay in initial non-stationary phase and the variability in stationary phase. As we illustrate below, the discretized least-squares estimator, cf. (8), utilizes information about

λ

from the exponential decay in initial phase, but is not capable to make use of the information contained in the variability in stationary phase. As a consequence, it is not consistent (in long-span setting). On the contrary, the ergodic-type estimator, cf. (10), is derived from the variance of the stationary distribution of the process. It works well for stationary processes (and is consistent), but leaves idle (and even worse, it is corrupted by) the observation of the process in its initial non-stationary phase. In result, neither of these estimators can efficiently estimate drift from long trajectories with far-from-stationary initial values. This gap is best filled with the conditional least-squares estimator, cf. (25), which effectively utilizes both information stored in non-stationary phase and in stationary phase of the observed process. This unique property is demonstrated in Results and Discussion, where the conditional LSE (denoted by

{\hat{λ}}_{5}

) dominates the other estimators.

For the three newly introduced estimators the value of the Hurst parameter H is considered to be known a priori, whereas the knowledge of volatility parameter

σ

is not required, which is an advantage of these methods. If H is not known, it can be estimated in advance by some of many methods, such as methods based on quadratic variations (cf. [14]), sample quantiles or trimmed means (cf. [15]), or on a wavelet transform (cf. [16]), to name just a few. Another useful works in this direction include simultaneous estimation of

σ

and H using the powers of the second order variations (see [17], Chapter 3.3). The estimates of H (obtained independently from

λ

) can subsequently be used in the LSE-type estimators of lambda introduced below in a way similar to [18].

In Section 2, some elements of stochastic calculus with respect to fBm are recalled, stationary fOU is introduced and precise formulas for two existing drift estimators

{\hat{λ}}_{1}

and

{\hat{λ}}_{2}

are provided. Section 3 is devoted to construction of a new LSE type estimator (

{\hat{λ}}_{3}

) based on exact formula for fOU. A certain modification of

{\hat{λ}}_{3}

(denoted as

{\hat{λ}}_{4}

), which ensures long-span consistency, is introduced in Section 4. In Section 5, we rewrite the linear model using conditional expectations to overcome the bias in LSE caused by autocorrelation of the noise. Least-squares method, applied to the conditional model with explicit formulas for conditional expectations, results in the conditional least-squares estimator (

{\hat{λ}}_{5}

). We prove strong consistency of this estimator. The actual performance of the newly introduced estimators

{\hat{λ}}_{3}

,

{\hat{λ}}_{4}

and

{\hat{λ}}_{5}

as well as its comparison to the already-known

{\hat{λ}}_{1}

and

{\hat{λ}}_{2}

, is studied by Monte Carlo simulations in various scenarios and reported in Section 6. The simulated trajectories have been obtained in software R with YUIMA package (see [19]). Section 7 summarizes key points of the article and provides possible future extensions.

2. Preliminaries

For reader’s convenience we briefly review the basic concepts from theory of stochastic models with fractional noise in this section, including definition of fBm, Wiener integral of deterministic functions w.r.t. fBm and stationary fOU. This exposition follows [2,20]. For further reading, see also the monograph [1]. In the end of this section, we also recall formulas for discretized LSE and discrete ergodic estimator.

Fractional Brownian motion with Hurst parameter

H \in (0, 1)

is a centered (zero-mean) continuous Gaussian process

{B_{t}^{(H)}}_{t \in [0, \infty)}

starting from zero (

B_{0}^{(H)} = 0

and having the following covariance structure

\begin{matrix} E (B_{s}^{(H)} B_{t}^{(H)}) = \frac{1}{2} (s^{2 H} + t^{2 H} - {| s - t |}^{2 H}), s, t \geq 0 . \end{matrix}

Note that for the purpose of construction of the stationary fOU, we need a two-sided fBm

{B_{t}^{(H)}}_{t \in R}

with t ranging over the whole

R

. In this case we have

\begin{matrix} E (B_{s}^{(H)} B_{t}^{(H)}) = \frac{1}{2} ({| s |}^{2 H} + {| t |}^{2 H} - {| s - t |}^{2 H}), s, t \in R . \end{matrix}

As a consequence, the increments of fBm are negatively correlated for

H < 1 / 2

, independent for

H = 1 / 2

and positively correlated for

H > 1 / 2

.

Consider a two-sided fBm with

H > 1 / 2

and define Wiener integral of a deterministic step function with respect to the fBm by formula

\begin{matrix} \int_{R} \sum_{i = 1}^{N} α_{i} I_{[t_{i}, t_{i + 1}]} (s) d B_{s}^{(H)} = \sum_{i = 1}^{N} α_{i} (B_{t_{i + 1}}^{(H)} - B_{t_{i}}^{(H)}) \end{matrix}

for any positive integer N, real-valued coefficients

α_{1}, \dots α_{N} \in R

and a partition

- \infty < t_{1} \leq \dots \leq t_{N + 1} < \infty

. This definition constitutes the following isometry for any pair of deterministic step functions f and g

\begin{matrix} E (\int_{R} f (t) d B_{t}^{(H)} \int_{R} g (s) d B_{s}^{(H)}) = \underset{R^{2}}{\int \int} f (t) g (s) φ (t, s) d t d s = {〈 f, g 〉}_{H}, \end{matrix}

(3)

where

φ (t, s) = {H (2 H - 1) | t - s |}^{2 H - 2}

. Using this isometry, we can extend the definition of the Wiener integral w.r.t. fBm to all elements of the space

L_{H}^{2} (R)

, defined as the completions of the space of deterministic step functions w.r.t. the scalar product

{〈 ., . 〉}_{H}

defined above. In result, the formula (3) holds true for any

f, g \in L_{H}^{2} (R)

, see also [21]. We will frequently use this formula in what follows, mainly to calculate the covariances of Wiener integrals.

Let

{B_{t}^{(H)}}_{t \in R}

be again a two-sided fBm with

H > 1 / 2

. Define

\begin{matrix} Z_{0} = \int_{- \infty}^{0} e^{- λ (0 - u)} σ d B_{u}^{(H)}, \end{matrix}

and denote by

{Z_{t}}_{t \in [0, \infty)}

the solution to (1) with initial condition

Z_{0}

in the sense that it satisfies

\begin{matrix} d Z_{t} = - λ Z_{t} d t + σ d B_{t}^{(H)} . \end{matrix}

This process is referred to as the stationary fOU and it can be expressed as

\begin{matrix} Z_{t} = \int_{- \infty}^{t} e^{- λ (t - u)} σ d B_{u}^{(H)}, t \geq 0 . \end{matrix}

Note that the stationary fOU is an ergodic stationary Gaussian process (its autocorrelation function vanishes at infinity).

Consider now a stationary fOU

{Z_{t}}_{t \in [0, \infty)}

observed at discrete time instants

t = 0, h, 2 h, \dots

. The ergodicity and the formula for the second moment of stationary fOU (see e.g., [20]) imply

\frac{1}{N} \sum_{n = 0}^{N - 1} Z_{n h}^{2} \overset{N \to \infty}{⟶} E Z_{0}^{2} = \frac{σ^{2}}{λ^{2 H}} H Γ (2 H) a . s .

(4)

Analogously,

\frac{1}{N} \sum_{n = 0}^{N - 1} Z_{n h} Z_{(n + 1) h} \overset{N \to \infty}{⟶} E Z_{0} Z_{h} a . s .,

(5)

and the expectation can be calculated using (3) and the change-of-variables formula

\begin{matrix} E Z_{0} Z_{h} & = E Z_{0} (e^{- λ h} Z_{0} + \int_{0}^{h} e^{- λ (h - u)} σ d B_{u}^{(H)}) \\ = e^{- λ h} \frac{σ^{2}}{λ^{2 H}} H Γ (2 H) + \int_{0}^{h} \int_{- \infty}^{0} e^{- λ (h - u)} e^{- λ (0 - v)} σ^{2} {(u - v)}^{2 H - 2} H (2 H - 1) d v d u \\ = e^{- λ h} \frac{σ^{2}}{λ^{2 H}} H Γ (2 H) [1 + \frac{2 H - 1}{Γ (2 H)} \int_{0}^{λ h} \int_{- \infty}^{0} e^{r + s} {(r - s)}^{2 H - 2} d s d r] . \end{matrix}

(6)

The rest of this section is devoted to the two popular estimators of the drift parameter of fOU observed at discrete time instants described in Introduction—the discretized LSE and the discrete ergodic estimator. Start with the former. Consider a straightforward discrete approximation of the Equation (1):

\begin{matrix} X_{n + h} - X_{n} \approx - λ X_{n} h + σ (B_{n + h}^{(H)} - B_{n}^{(H)}) . \end{matrix}

(7)

Application of the standard least-squares procedure to the linear approximation above provides the discretized LSE studied in [7,8], which takes the form

\begin{matrix} {\hat{λ}}_{1} = - \frac{1}{h} (F_{N} - 1), \end{matrix}

(8)

where h is the mesh size (time step) and

\begin{matrix} F_{N} = \frac{\sum_{n = 0}^{N - 1} X_{n h} X_{(n + 1) h}}{\sum_{n = 0}^{N - 1} X_{n h}^{2}}, \end{matrix}

(9)

with

X_{n h}

and

X_{(n + 1) h}

being the observations at adjacent time instants

t = n h

and

t = (n + 1) h

respectively, of the process

X_{t}

defined by (1) or (2). Note that having

{\hat{λ}}_{1}

expressed in term of

F_{N}

simplifies its comparison with the estimators newly constructed in this paper. Recall that for consistency of

{\hat{λ}}_{1}

mixed in-fill and long-span asymptotics is required due to the approximation error in (7).

The discrete ergodic estimator is derived from asymptotic behavior of the (stationary) fOU. Recall the convergence in (4). Rearranging the terms provides an asymptotic formula for drift parameter

λ

expressed in terms of the limit of the second sample moment of the stationary fOU. Substituting the stationary fOU by the observed fOU

X_{h}, X_{2 h}, \dots X_{N h}

in the asymptotic formula results in the discrete ergodic estimator:

\begin{matrix} {\hat{λ}}_{2} = {(\frac{1}{N σ^{2} H Γ (2 H)} \sum_{i = 1}^{N} X_{i h})}^{- \frac{1}{2 H}}, \end{matrix}

(10)

which was studied in [9,10]. Recall that this estimator is strongly consistent in the long span regime (no in-fill condition needed), however, it heavily builds upon the asymptotic (stationary) behavior of the process and fails for processes with non-stationary initial phase (as illustrated by numerical experiments below).

3. Least-Squares Estimator from Exact Solution

Since the estimator

{\hat{λ}}_{1}

obtained from naive discretization of (1) provides reasonable approximations only for non-stationary solutions with short time horizon and small time step

h > 0

(as seen from numerical simulations below), we eliminate discretization error by considering exact analytical formula for

X_{t}

, see (2), and corresponding exact discrete formula for

X_{t + h}

,

\begin{matrix} X_{t + h} = β X_{t} + ξ_{t}, \end{matrix}

(11)

where

\begin{matrix} β = e^{- λ h}, and ξ_{t} = \int_{t}^{t + h} e^{- λ (t + h - u)} σ d B_{u}^{(H)} . \end{matrix}

The least-squares estimator for

β

w.r.t. linear model (11) is given by

\hat{β} = F_{N}

, cf. (9), and the estimator for

λ

can be defined as

\begin{matrix} {\hat{λ}}_{3} = - \frac{1}{h} log F_{N} . \end{matrix}

(12)

Numerical simulations show that

{\hat{λ}}_{3}

works well for non-stationary solutions and short time horizon (

T = 10

in simulations). The results for

H = 0.6

are presented in Figure 2. Simulation results for

H \in {0.75, 0.9}

are similar.

On the other hand, estimator

{\hat{λ}}_{3}

does not provide good results for observations with long time horizon (

T = 1000

in simulations) since

{\hat{λ}}_{3}

is not consistent if

N \to \infty

and

h > 0

is fixed. The reason is that

ξ_{t}

and

X_{t}

in (11) are correlated. In fact, we can calculate the almost sure limit of

{\hat{λ}}_{3}

exactly. The limit is provided in Theorem 1. Its proof uses the following simple lemma (see [22]) to show the diminishing effect of the initial condition on limiting behaviour of sample averages. It is later used in the proof of Lemma 3 as well.

Lemma 1.

Consider real-valued sequences

{(a_{n})}_{n = 1}^{\infty}

and

{(b_{n})}_{n = 1}^{\infty}

such that

\begin{matrix} \frac{1}{N} \sum_{n = 1}^{N} | b_{n} | \overset{N \to \infty}{⟶} K < \infty, and a_{n} \overset{n \to \infty}{⟶} 0 . \end{matrix}

Then

\frac{1}{N} \sum_{n = 1}^{N} a_{n} b_{n} \overset{N \to \infty}{⟶} 0

.

Theorem 1.

Let

h > 0

be fixed and define

f : (0, \infty) \to R

by

\begin{matrix} f (x) = e^{- x} [1 + \frac{2 H - 1}{Γ (2 H)} \int_{0}^{x} (\int_{- \infty}^{0} e^{s} e^{r} {(r - s)}^{2 H - 2} d s) d r] . \end{matrix}

(13)

Then

\begin{matrix} lim_{N \to \infty} {\hat{λ}}_{3} = - \frac{1}{h} log f (λ h) a l m o s t s u r e l y . \end{matrix}

(14)

In particular,

lim_{N \to \infty} {\hat{λ}}_{3} < λ

.

Proof.

Recall the asymptotic behavior of a stationary fOU described in formulas (4)–(6).

Since the effect of the initial condition vanishes at infinity, the limit behaviour of the non-stationary solution

{(X_{n h})}_{n = 0}^{\infty}

is same. Indeed,

\begin{matrix} \frac{1}{N} \sum_{n = 0}^{N - 1} X_{n h} X_{(n + 1) h} - Z_{n h} Z_{(n + 1) h} = \frac{1}{N} \sum_{n = 0}^{N - 1} (X_{n h} - Z_{n h}) X_{(n + 1) h} + \frac{1}{N} \sum_{n = 0}^{N - 1} (X_{(n + 1) h} - Z_{(n + 1) h}) Z_{n h} . \end{matrix}

The convergence of the first summand to zero follows from the facts that

\begin{matrix} (X_{n h} - Z_{n h}) \overset{n \to \infty}{⟶} 0 a . s ., \frac{1}{N} \sum_{n = 0}^{N - 1} X_{(n + 1) h} \overset{N \to \infty}{⟶} 0 a . s . \end{matrix}

and Lemma 1. Similar argument guarantees convergence of the second summand to zero as well. The convergence

\begin{matrix} \frac{1}{N} \sum_{n = 0}^{N - 1} X_{n h}^{2} - Z_{n h}^{2} \overset{N \to \infty}{⟶} 0 \end{matrix}

can be shown correspondingly. In result, we obtain the almost sure convergence

\frac{\sum_{n = 0}^{N - 1} X_{n h} X_{(n + 1) h}}{\sum_{n = 0}^{N - 1} X_{n h}^{2}} \overset{N \to \infty}{⟶} e^{- λ h} [1 + \frac{2 H - 1}{Γ (2 H)} \int_{0}^{λ h} \int_{- \infty}^{0} e^{r + s} {(r - s)}^{2 H - 2} d s d r] = f (λ h) .

(15)

The claim follows immediately from definition of

{\hat{λ}}_{3}

. □

Remark 1.

Note that the convergence in (14) holds true also for

H = 1 / 2

(the double integral in f disappears) and for

H < 1 / 2

(utilizing the fact that a relation analogous to (3) is true even for

H < 1 / 2

, if the two domains of integration are disjoint). Consequently

\begin{matrix} lim_{N \to \infty} {\hat{λ}}_{3} = - \frac{1}{h} log f (λ h) \{\begin{matrix} < λ, & H > 1 / 2, \\ = λ, & H = 1 / 2, \\ > λ, & H < 1 / 2 . \end{matrix} a . s . \end{matrix}

4. Asymptotic Least-Squares Estimator

Our goal in this section is modifying

{\hat{λ}}_{3}

so that it converges to

λ

when

N \to \infty

and

h > 0

is fixed. Combination of (12) and (14) yields

F_{N} \to f (λ h)

a.s. Thus, we can define the asymptotic least-squares estimator

{\hat{λ}}_{4}

by relation

f (h {\hat{λ}}_{4}) = F_{N}

. Since f is one-to-one (see below), the explicit formula for

{\hat{λ}}_{4}

reads

\begin{matrix} {\hat{λ}}_{4} = \frac{1}{h} f^{- 1} (F_{N}) . \end{matrix}

(16)

The following lemma justifies invertibility of f in the definition of

{\hat{λ}}_{4}

.

Lemma 2.

In our setting

H > 1 / 2

the function f defined by (13) is strictly decreasing on

[0, \infty)

.

Proof.

Calculate the derivative

\begin{matrix} f^{'} (x) & = - e^{- x} [1 + \frac{2 H - 1}{Γ (2 H)} \int_{0}^{x} (\int_{- \infty}^{0} e^{s} e^{r} {(r - s)}^{2 H - 2} d s) d r] + e^{- x} [\frac{2 H - 1}{Γ (2 H)} (\int_{- \infty}^{0} e^{s} e^{x} {(x - s)}^{2 H - 2} d s)] \\ = e^{- x} \frac{2 H - 1}{Γ (2 H)} [\int_{- \infty}^{0} (e^{s} e^{x} {(x - s)}^{2 H - 2} - \int_{0}^{x} e^{s} e^{r} {(r - s)}^{2 H - 2} d r) d s - \frac{Γ (2 H)}{2 H - 1}] . \end{matrix}

Continue with

\begin{matrix} e^{s} e^{x} {(x - s)}^{2 H - 2} - \int_{0}^{x} e^{s} e^{r} {(r - s)}^{2 H - 2} d r < {(x - s)}^{2 H - 2} e^{s} (e^{x} - \int_{0}^{x} e^{r} d r) = {(x - s)}^{2 H - 2} e^{s} < {(- s)}^{2 H - 2} e^{s} . \end{matrix}

Plug this estimate into the formula for

f^{'} (x)

to see

\begin{matrix} f^{'} (x) < e^{- x} \frac{2 H - 1}{Γ (2 H)} [\int_{- \infty}^{0} {(- s)}^{2 H - 2} e^{s} d s - \frac{Γ (2 H)}{2 H - 1}] = e^{- x} \frac{2 H - 1}{Γ (2 H)} [Γ (2 H - 1) - Γ (2 H - 1)] = 0 . \end{matrix}

(17)

□

Remark 2.

Note that f is monotonous also for

H = 1 / 2

, but it is not monotonous if

H < 1 / 2

, which rules out the possibility to use estimator

{\hat{λ}}_{4}

in this singular case.

Theorem 2.

The asymptotic least-squares estimator

{\hat{λ}}_{4}

is strongly consistent, i.e.,

\begin{matrix} lim_{N \to \infty} {\hat{λ}}_{4} = λ a . s . \end{matrix}

Proof.

Recall the definition of

{\hat{λ}}_{4}

in (16) and the limit in (15):

\begin{matrix} F_{N} \overset{N \to \infty}{⟶} f (λ h) a . s . \end{matrix}

Further recall that

f : (0, \infty) \to (0, \infty)

is differentiable with strictly negative derivative (see (17)). Thus,

f^{- 1}

is also differentiable with strictly negative derivative and this implies

\begin{matrix} lim_{N \to \infty} {\hat{λ}}_{4} = lim_{N \to \infty} \frac{1}{h} f^{- 1} (F_{N}) = \frac{1}{h} f^{- 1} (f (λ h)) = λ a . s . \end{matrix}

□

The strongly consistent estimator

{\hat{λ}}_{4}

works well for stationary solutions or observations with long time horizon (see Figure 3). Moreover it does not require explicit knowledge of

σ

(in contrast to

{\hat{λ}}_{2}

, which is also strictly consistent). On the other hand it does not provide adequate results for non-stationary solutions and short time horizon, since the correction function f reflects stationary behavior of the process (see Figure 4).

5. Conditional Least-Squares Estimator

Non-stationary trajectories with long time horizon contain a lot of information about

λ

, which is encoded mainly in two aspects: speed of decay in initial non-stationary phase and variance in stationary phase (see Figure 1). However, neither of the estimators

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

,

{\hat{λ}}_{3}

or

{\hat{λ}}_{4}

can utilize all the information effectively. This motivates us to introduce another estimator. Recall that

{\hat{λ}}_{3}

fails to be consistent because of bias in LSE caused by the correlation between

X_{t}

and

ξ_{t}

in Equation (11). To eliminate the correlation between explanatory variable and noise term in the linear model, we switch to conditional expectations. Start from the following equation, which defines

η_{t}

:

\begin{matrix} X_{t + h} = E [X_{t + h} | X_{t}] + η_{t} = E_{Λ = λ} [X_{t + h} | X_{t}] + η_{t}, \end{matrix}

(18)

where

λ

is the true value of the unknown drift parameter and

E_{Λ}

the (conditional) expectation with respect to the measure generated by the fOU

{X_{t}}_{t \in [0, \infty)}

with drift value

Λ

and initial condition

x_{0}

. (

Λ

stands for an unknown throughout this section). In other words,

E_{Λ} [X_{t + h} | X_{t}]

means the conditional expectation of

X_{t + h}

, conditioned by

X_{t}

, where the process X is given by (2) with drift

λ = Λ

. Hence,

E_{λ}

has the same meaning as

E

in previous sections.

Obviously

E_{λ} [η_{t} | X_{t}] = 0

and, consequently,

c_{t} (X_{t}, Λ) = E_{Λ} [X_{t + h} | X_{t}]

and

η_{t}

are uncorrelated. Indeed,

\begin{matrix} E_{λ} ((c_{t} (X_{t}, Λ) - E_{λ} [c_{t} (X_{t}, Λ)]) η_{t}) & = & E_{λ} (E_{λ} [(c_{t} (X_{t}, Λ) - E_{λ} [c_{t} (X_{t}, Λ)]) η_{t} | X_{t}]) \\ = & E_{λ} ((c_{t} (X_{t}, Λ) - E_{λ} [c_{t} (X_{t}, Λ)]) E_{λ} [η_{t} | X_{t}]) = 0 . \end{matrix}

In result, we apply the least-squares technique to Equation (18), where

λ

is to be estimated, i.e., we would like to minimize

\begin{matrix} min_{Λ} \sum_{n = 0}^{N - 1} {(X_{(n + 1) h} - E_{Λ} [X_{(n + 1) h} | X_{n h}])}^{2} . \end{matrix}

To calculate

c_{t} (X_{t}, Λ) = E_{Λ} [X_{t + h} | X_{t}]

explicitly, use (11) and obtain

\begin{matrix} E_{Λ} [X_{t + h} | X_{t}] = e^{- Λ h} X_{t} + E_{Λ} [ξ_{t} | X_{t}] . \end{matrix}

(19)

Note that random vector

(ξ_{t}, X_{t})

has 2-dimensional normal distribution (dependent on parameter

Λ

)

\begin{matrix} [\begin{matrix} ξ_{t} \\ X_{t} \end{matrix}] \sim N ([\begin{matrix} 0 \\ e^{- Λ t} x_{0} \end{matrix}], [\begin{matrix} σ_{ξ}^{2} (Λ) & σ_{ξ, X} (Λ) \\ σ_{ξ, X} (Λ) & σ_{X}^{2} (Λ) \end{matrix}]), \end{matrix}

and we can use explicit expression for its conditional expectation to write

\begin{matrix} E_{Λ} [ξ_{t} | X_{t}] = (X_{t} - e^{- Λ t} x_{0}) \frac{σ_{ξ, X} (Λ)}{σ_{X}^{2} (Λ)} . \end{matrix}

(20)

With respect to the exact formula for

X_{t}

given by (2) and relation (3) we get

\begin{matrix} σ_{X}^{2} (Λ) & = & σ^{2} H (2 H - 1) \int_{0}^{t} \int_{0}^{t} e^{Λ (u - t)} e^{Λ (v - t)} {| u - v |}^{2 H - 2} d v d u \\ = & \frac{σ^{2} H (2 H - 1)}{Λ^{2 H}} \int_{- Λ t}^{0} \int_{- Λ t}^{0} e^{r} e^{s} {| r - s |}^{2 H - 2} d s d r, \end{matrix}

where we used change-of-variable formula in the last step. Analogously

\begin{matrix} σ_{ξ, X} (Λ) & = & σ^{2} H (2 H - 1) \int_{t}^{t + h} (\int_{0}^{t} e^{Λ (u - t - h)} e^{Λ (v - t)} {(u - v)}^{2 H - 2} d v) d u \\ = & \frac{σ^{2} H (2 H - 1)}{Λ^{2 H}} e^{- Λ h} \int_{0}^{Λ h} (\int_{- Λ t}^{0} e^{r} e^{s} {(r - s)}^{2 H - 2} d s) d r . \end{matrix}

Using the expressions for

σ_{ξ, X} (Λ)

and

σ_{X}^{2} (Λ)

in (20) we obtain

\begin{matrix} E_{Λ} [ξ_{t} | X_{t}] = (X_{t} - e^{- Λ t} x_{0}) e^{- Λ h} \frac{\int_{0}^{Λ h} (\int_{- Λ t}^{0} e^{r} e^{s} {(r - s)}^{2 H - 2} d s) d r}{\int_{- Λ t}^{0} \int_{- Λ t}^{0} e^{r} e^{s} {| r - s |}^{2 H - 2} d s d r} . \end{matrix}

(21)

Combining formula (21) with (19) yields

E_{Λ} [X_{t + h} | X_{t}] = c_{t} (X_{t}, Λ) = X_{t} A_{Λ t, Λ h} - B_{Λ t, Λ h},

with

A_{τ, x} = e^{- x} (1 + \frac{\int_{0}^{x} (\int_{- τ}^{0} e^{r} e^{s} {(r - s)}^{2 H - 2} d s) d r}{\int_{- τ}^{0} \int_{- τ}^{0} e^{r} e^{s} {| r - s |}^{2 H - 2} d s d r}),

(22)

and

B_{τ, x} = e^{- τ} e^{- x} x_{0} \frac{\int_{0}^{x} (\int_{- τ}^{0} e^{r} e^{s} {(r - s)}^{2 H - 2} d s) d r}{\int_{- τ}^{0} \int_{- τ}^{0} e^{r} e^{s} {| r - s |}^{2 H - 2} d s d r} .

(23)

We can thus reformulate the Equation (18) for the observed process X as the following model (linear in

X_{t}

, but non-linear in

Λ

):

\begin{matrix} X_{t + h} = {[X_{t} A_{Λ t, Λ h} - B_{Λ t, Λ h}]}_{Λ = λ} + η_{t} . \end{matrix}

(24)

Now we aim to apply the least-squares method to the reformulated model to get the conditional least-squares estimator

{\hat{λ}}_{5}

. To ensure the existence of global minima, we choose a closed interval

[Λ_{L}, Λ_{U}] \subset (0, \infty

) and define

{\hat{λ}}_{5}

as the minimizer of sum-of-squares function on this interval:

S_{N} ({\hat{λ}}_{5}) = min_{Λ \in [Λ_{L}, Λ_{U}]} {S_{N} (Λ)},

(25)

with criterion function

S_{N}

defined as

S_{N} (Λ) = \sum_{n = 0}^{N - 1} {[X_{(n + 1) h} - (X_{n h} A_{Λ n h, Λ h} - B_{Λ n h, Λ h})]}^{2},

(26)

where we used (24) with

t = n h

.

Note that

S_{N} (Λ)

is continuous in

Λ

and therefore a minimum on the compact interval

[Λ_{L}, Λ_{U}]

exists. Although model (24) is linear in

X_{t}

, the coefficients A and B depend on t and that complicates the numerical minimization of

S_{N}

.

Remark 3.

Let

{Z_{t}}_{t \in [0, \infty)}

be the stationary solution to (1). Then

\begin{matrix} E_{Λ} [Z_{t + h} | Z_{t}] = Z_{t} A_{\infty, Λ h} + 0 = Z_{t} f (Λ h), \end{matrix}

where f is defined in (13) and

Λ > 0

is arbitrary. Since the coefficient

f (Λ h)

does not depend on t, it is possible to calculate LSE for

f (λ h)

explicitly and to construct the estimator of λ by applying

f^{- 1}

. Such estimator coincides with

{\hat{λ}}_{4}

introduced in previous chapter. Thus

{\hat{λ}}_{4}

can be understood as the special case of conditional LSE for the stationary solution.

In order to prove strong consistency of the estimator

{\hat{λ}}_{5}

we need to verify uniform convergence of

(1 / N) S_{N} (Λ)

to a function

S_{\infty} (Λ)

specified below. Let us start with the following proposition on uniform convergence of

A_{τ, x}

and

B_{τ, x}

. This proposition will help us in the sequel to investigate limiting behaviour of the two terms

A_{Λ n h, Λ h}

and

B_{Λ n h, Λ h}

in the sum-of-squares function

S_{N}

.

Proposition 1.

Consider

A_{τ, x}

and

B_{τ, x}

defined by (22) and (23), and f defined by (13). Fix arbitrary

0 < x_{L} < x_{U} < \infty

. Then

lim_{τ \to \infty} sup_{x \in [x_{L}, x_{U}]} | A_{τ, x} - f (x) | = 0,

(27)

and

lim_{τ \to \infty} sup_{x \in [x_{L}, x_{U}]} | B_{τ, x} | = 0 .

(28)

Proof.

In order to simplify the notation, denote

\begin{matrix} I (τ) = \int_{- τ}^{0} \int_{- τ}^{0} e^{r} e^{s} {| r - s |}^{2 H - 2} d s d r, J (τ) = \int_{- τ}^{0} e^{s} {(- s)}^{2 H - 2} d s, \end{matrix}

and notice that

\begin{matrix} I (τ) \overset{τ \to \infty}{⟶} \frac{Γ (2 H)}{2 H - 1} = Γ (2 H - 1), J (τ) \overset{τ \to \infty}{⟶} Γ (2 H - 1) . \end{matrix}

Begin with (27):

\begin{matrix} sup_{x \in [x_{L}, x_{U}]} | A_{τ, x} - f (x) | \\ = sup_{x \in [x_{L}, x_{U}]} e^{- x} |\frac{\int_{0}^{x} (\int_{- τ}^{0} e^{r} e^{s} {(r - s)}^{2 H - 2} d s) d r}{\int_{- τ}^{0} \int_{- τ}^{0} e^{r} e^{s} {| r - s |}^{2 H - 2} d s d r} - \frac{\int_{0}^{x} (\int_{- \infty}^{0} e^{s} e^{r} {(r - s)}^{2 H - 2} d s) d r}{Γ (2 H - 1)}| \\ \leq e^{- x_{L}} sup_{x \in [x_{L}, x_{U}]} |\frac{\int_{0}^{x} (\int_{- τ}^{0} e^{r} e^{s} {(r - s)}^{2 H - 2} d s) d r - \int_{0}^{x} (\int_{- \infty}^{0} e^{s} e^{r} {(r - s)}^{2 H - 2} d s) d r}{I (τ)}| \\ + e^{- x_{L}} sup_{x \in [x_{L}, x_{U}]} |\frac{[Γ (2 H - 1) - I (τ)] \int_{0}^{x} (\int_{- \infty}^{0} e^{s} e^{r} {(r - s)}^{2 H - 2} d s) d r}{I (τ) Γ (2 H - 1)}| \\ \leq \frac{e^{- x_{L}}}{I (τ)} \int_{0}^{x_{U}} e^{r} (\int_{- \infty}^{- τ} e^{s} {(r - s)}^{2 H - 2} d s) d r \\ + \frac{e^{- x_{L}} |Γ (2 H - 1) - I (τ)|}{I (τ) Γ (2 H - 1)} \int_{0}^{x_{U}} e^{r} (\int_{- \infty}^{0} e^{s} {(r - s)}^{2 H - 2} d s) d r \\ \leq \frac{e^{- x_{L}} (Γ (2 H - 1) - J (τ))}{I (τ)} \int_{0}^{x_{U}} e^{r} d r + \frac{e^{- x_{L}} |Γ (2 H - 1) - I (τ)|}{I (τ)} \int_{0}^{x_{U}} e^{r} d r \overset{τ \to \infty}{⟶} 0 . \end{matrix}

Similarly

\begin{matrix} sup_{x \in [x_{L}, x_{U}]} | B_{τ, x} | \leq e^{- τ} e^{- x_{L}} x_{0} \frac{J (τ)}{I (τ)} \int_{0}^{x_{U}} e^{r} d r \overset{τ \to \infty}{⟶} 0 . \end{matrix}

□

Choose any

0 < Λ_{L} < Λ_{U} < \infty

and recall that

h > 0

is fixed. The uniform convergences in (27) and (28) imply the following convergences uniformly in

Λ \in [Λ_{L}, Λ_{U}]

:

lim_{n \to \infty} sup_{Λ \in [Λ_{L}, Λ_{U}]} | A_{Λ n h, Λ h} - f (Λ h) | = 0,

(29)

and

lim_{n \to \infty} sup_{Λ \in [Λ_{L}, Λ_{U}]} | B_{Λ n h, Λ h} | = 0,

(30)

respectively. Indeed, set

x_{L} = Λ_{L} h

and

x_{U} = Λ_{U} h

and fix any

ε > 0

. There is

τ_{0} > 0

such that for any

τ > τ_{0}

,

\begin{matrix} sup_{x \in [x_{L}, x_{U}]} | A_{τ, x} - f (x) | < ε . \end{matrix}

If

n > \frac{τ_{0}}{Λ_{L} h}

, then

Λ n h > τ_{0}

for any

Λ \in [Λ_{L}, Λ_{U}]

. Consequently

\begin{matrix} sup_{Λ \in [Λ_{L}, Λ_{U}]} | A_{Λ n h, Λ h} - f (Λ h) | < ε, \end{matrix}

which proves (29). The convergence in (30) can be shown analogously. These uniform convergences will be helpful in the proof of the following Lemma, which provides uniform convergence of

\frac{1}{N} S_{N} (Λ)

to a limiting function

S_{\infty} (Λ)

. This uniform convergence is the key ingredient for the convergence of the minimizers

{\hat{λ}}_{5}

.

Lemma 3.

Let f be defined by (13) and let

S_{N} (Λ)

be defined by (26), where

{X_{t}}_{t \in [0, \infty)}

is the observed process with drift value λ. Denote

S_{\infty} (Λ) = \frac{σ^{2}}{λ^{2 H}} H Γ (2 H) (1 - 2 f (Λ h) f (λ h) + f^{2} (Λ h)) .

(31)

Then

lim_{N \to \infty} sup_{Λ \in [Λ_{L}, Λ_{U}]} |\frac{1}{N} S_{N} (Λ) - S_{\infty} (Λ)| = 0 a . s .

(32)

Proof.

First consider the stationary solution

{Z_{t}}_{t \in [0, \infty)}

to (1) corresponding to drift value

λ

. Comparison of (6) with (13) yields

\begin{matrix} E_{λ} Z_{0} Z_{h} = f (λ h) \frac{σ^{2}}{λ^{2 H}} H Γ (2 H) . \end{matrix}

It enables us to write

\begin{matrix} S_{\infty} (Λ) = E_{λ} Z_{0}^{2} - 2 f (Λ h) E_{λ} Z_{0} Z_{h} + f^{2} (Λ h) E_{λ} Z_{0}^{2} \end{matrix}

for any

Λ > 0

, and, consequently

\begin{matrix} \frac{1}{N} S_{N} (Λ) - S_{\infty} (Λ) & = (\frac{1}{N} \sum_{n = 0}^{N - 1} X_{(n + 1) h}^{2} - E_{λ} Z_{0}^{2}) \\ - 2 (\frac{1}{N} \sum_{n = 0}^{N - 1} A_{Λ n h, Λ h} X_{(n + 1) h} X_{n h} - f (Λ h) E_{λ} Z_{0} Z_{h}) \\ + (\frac{1}{N} \sum_{n = 0}^{N - 1} X_{n h}^{2} A_{Λ n h, Λ h}^{2} - f^{2} (Λ h) E_{λ} Z_{0}^{2}) \\ + (\frac{1}{N} \sum_{n = 0}^{N - 1} B_{Λ n h, Λ h} (2 X_{(n + 1) h} - 2 A_{Λ n h, Λ h} X_{n h} + B_{Λ n h, Λ h})) . \end{matrix}

(33)

Recall that

{Z_{t}}_{t \in [0, \infty)}

is ergodic and

| Z_{t} - X_{t} |

vanishes at infinity. Using Lemma 1 in the same way as in the proof of Theorem 1 implies

\begin{matrix} sup_{Λ \in [Λ_{L}, Λ_{U}]} |\frac{1}{N} \sum_{n = 0}^{N - 1} X_{(n + 1) h}^{2} - E_{λ} Z_{0}^{2}| \overset{N \to \infty}{⟶} 0 a . s . \end{matrix}

For the second term, write

\begin{matrix} sup_{Λ \in [Λ_{L}, Λ_{U}]} |\frac{1}{N} \sum_{n = 0}^{N - 1} A_{Λ n h, Λ h} X_{(n + 1) h} X_{n h} - f (Λ h) E_{λ} Z_{0} Z_{h}| \\ \leq sup_{Λ \in [Λ_{L}, Λ_{U}]} | \frac{1}{N} \sum_{n = 0}^{N - 1} (A_{Λ n h, Λ h} - f (Λ h)) X_{(n + 1) h} X_{n h} \\ + \frac{1}{N} \sum_{n = 0}^{N - 1} f (Λ h) (X_{(n + 1) h} X_{n h} - E_{λ} Z_{0} Z_{h}) | \\ \leq \frac{1}{N} \sum_{n = 0}^{N - 1} (sup_{Λ \in [Λ_{L}, Λ_{U}]} |A_{Λ n h, Λ h} - f (Λ h)|) |X_{(n + 1) h} X_{n h}| \\ + (sup_{Λ \in [Λ_{L}, Λ_{U}]} |f (Λ h)|) |\frac{1}{N} \sum_{n = 0}^{N - 1} X_{(n + 1) h} X_{n h} - E_{λ} Z_{0} Z_{h}| . \end{matrix}

Application of Lemma 1, the convergence in (29) and the continuity of f ensure the convergence with probability one of both summands to zero as

N \to \infty

.

The uniform convergence of the third term can be shown analogously:

\begin{matrix} sup_{Λ \in [Λ_{L}, Λ_{U}]} |\frac{1}{N} \sum_{n = 0}^{N - 1} X_{n h}^{2} A_{Λ n h, Λ h}^{2} - f^{2} (Λ h) E_{λ} Z_{0}^{2}| \overset{N \to \infty}{⟶} 0 a . s ., \end{matrix}

where we use

\begin{matrix} lim_{n \to \infty} sup_{Λ \in [Λ_{L}, Λ_{U}]} | A_{Λ n h, Λ h}^{2} - f^{2} (Λ h) | = 0, \end{matrix}

which follows directly from (29) and the continuity of f.

The last term in (33) can be treated similarly:

\begin{matrix} sup_{Λ \in [Λ_{L}, Λ_{U}]} |\frac{1}{N} \sum_{n = 0}^{N - 1} B_{Λ n h, Λ h} (2 X_{(n + 1) h} - 2 A_{Λ n h, Λ h} X_{n h} + B_{Λ n h, Λ h})| \leq \frac{1}{N} \sum_{n = 0}^{N - 1} (sup_{Λ \in [Λ_{L}, Λ_{U}]} | B_{Λ n h, Λ h} |) C_{n}, \end{matrix}

where

\begin{matrix} C_{n} & = & 2 | X_{(n + 1) h} | + 2 (sup_{Λ \in [Λ_{L}, Λ_{U}]} | A_{Λ n h, Λ h} - f (Λ h) |) | X_{n h} | \\ + 2 (sup_{Λ \in [Λ_{L}, Λ_{U}]} | f (Λ h) |) | X_{n h} | + (sup_{Λ \in [Λ_{L}, Λ_{U}]} | B_{Λ n h, Λ h} |) . \end{matrix}

By (30)

\begin{matrix} sup_{Λ \in [Λ_{L}, Λ_{U}]} | B_{Λ n h, Λ h} | \overset{n \to \infty}{⟶} 0, \end{matrix}

and

\begin{matrix} \frac{1}{N} \sum_{n = 0}^{N - 1} C_{n} \overset{N \to \infty}{⟶} E_{λ} | Z_{0} | (1 + 2 sup_{Λ \in [Λ_{L}, Λ_{U}]} | f (Λ h) |) < \infty a . s . \end{matrix}

Lemma 1 concludes the proof:

\begin{matrix} \frac{1}{N} \sum_{n = 0}^{N - 1} (sup_{Λ \in [Λ_{L}, Λ_{U}]} | B_{Λ n h, Λ h} |) C_{n} \overset{N \to \infty}{⟶} 0 a . s . \end{matrix}

□

Previous considerations lead to the convergence of

{\hat{λ}}_{5}

, being the minimizers of

\frac{1}{N} S_{N} (Λ)

to the minimizer of

S_{\infty} (Λ)

. Next lemma ensures that this minimizer coincides with the true drift value

λ

.

Lemma 4.

S_{\infty}

defined by (31) is continuous on

(0, \infty)

and λ is the unique minimizer of

S_{\infty}

, i.e.,

\begin{matrix} S_{\infty} (λ) < S_{\infty} (Λ) \forall Λ > 0, Λ \neq λ . \end{matrix}

Proof.

By definition

\begin{matrix} S_{\infty} (Λ) & = & \frac{σ^{2}}{λ^{2 H}} H Γ (2 H) (1 - 2 f (Λ h) f (λ h) + f^{2} (Λ h)) \\ = & \frac{σ^{2}}{λ^{2 H}} H Γ (2 H) (1 - f^{2} (λ h) + f^{2} (λ h) - 2 f (Λ h) f (λ h) + f^{2} (Λ h)) \\ = & \frac{σ^{2}}{λ^{2 H}} H Γ (2 H) (1 - f^{2} (λ h) + {[f (λ h) - f (Λ h)]}^{2}) . \end{matrix}

The claim follows immediately, because f is one-to-one (it is strictly decreasing).

Continuity of

S_{\infty}

is a direct consequence of the continuity of f. □

Now we are in a position to prove the strong consistency of

{\hat{λ}}_{5}

.

Theorem 3.

Consider bounds

0 < Λ_{L} < Λ_{U} < \infty

so that they cover the true drift λ of the observed solution

{X_{t}}_{t \in [0, \infty)}

to Equation (1), i.e.,

λ \in (Λ_{L}, Λ_{U})

. Then

{\hat{λ}}_{5}

defined in (25) is strongly consistent, i.e.,

\begin{matrix} {\hat{λ}}_{5} \overset{N \to \infty}{⟶} λ a . s . \end{matrix}

Proof.

The proof follows standard argumentation from nonlinear regression and utilizes Lemma 3 and Lemma 4. Choose

ε > 0

sufficiently small so that

[Λ_{L}, Λ_{U}] \ (λ - ε, λ + ε) \neq \emptyset

and set

\begin{matrix} δ = min {S_{\infty} (Λ) - S_{\infty} (λ) : Λ \in [Λ_{L}, Λ_{U}] \ (λ - ε, λ + ε)} > 0 . \end{matrix}

Consider a set of full measure on which the uniform convergence (32) holds and take

N_{0} > 0

such that

\begin{matrix} sup_{Λ \in [Λ_{L}, Λ_{U}]} |\frac{1}{N} S_{N} (Λ) - S_{\infty} (Λ)| < \frac{δ}{3} \forall N \geq N_{0} . \end{matrix}

Fix any

N \geq N_{0}

. Then for arbitrary

Λ \in [Λ_{L}, Λ_{U}] \ (λ - ε, λ + ε)

we get

\begin{matrix} \frac{1}{N} S_{N} (λ) < S_{\infty} (λ) + \frac{δ}{3} < S_{\infty} (Λ) - \frac{δ}{3} < \frac{1}{N} S_{N} (Λ) . \end{matrix}

As

{\hat{λ}}_{5}

minimizes

S_{N}

, for all

N \geq N_{0}

we have

\begin{matrix} | {\hat{λ}}_{5} - λ | < ε . \end{matrix}

Since

ε

was arbitrary (if small enough), we obtain the convergence

\begin{matrix} {\hat{λ}}_{5} \overset{N \to \infty}{⟶} λ \end{matrix}

on a set of full measure. □

6. Results and Discussion

In Table 1 we present comparison of the root mean square errors (RMSE) of all considered estimators for

λ = 0.5

and several combinations of

x_{0}

, T and H. Estimators

{\hat{λ}}_{1}

and

{\hat{λ}}_{3}

demonstrate good performance in scenarios with far-from-zero initial (

x_{0} = 100

) condition and short time horizon (

T = 10

) This illustrates the fact that these estimators reflect mainly the speed of convergence to zero of the observed process in its initial phase. Increasing time horizon to

T = 1000

adds a stationary phase to the observed trajectories, which distorts the estimators

{\hat{λ}}_{1}

and

{\hat{λ}}_{3}

.

Estimators

{\hat{λ}}_{2}

and

{\hat{λ}}_{4}

perform well in settings with stationary-like initial condition (

x_{0} = 0

) and long time horizon (

T = 1000

). This is because they are constructed from the stationary behavior of the process. Taking far-from-zero initial condition ruins these estimators, unless trajectory is very long.

The conditional LSE,

{\hat{λ}}_{5}

, shows reasonable performance in all studied scenarios and it significantly outperforms the other estimators in scenario with far-from-stationary initial condition (

x_{0} = 100

) and long time horizon (

T = 1000

). This results from the unique ability of this estimator to reflect and utilize information about the drift from both non-stationary (decreasing) phase and stationary (oscillating) phase. This is also illustrated on Figure 5. On the other hand, evaluation of

{\hat{λ}}_{5}

is the most numerically demanding compared the other studied estimators.

If

x_{0} = 0

and

T = 10

,

{\hat{λ}}_{5}

shows greater RMSE than

{\hat{λ}}_{1}

and

{\hat{λ}}_{3}

in Table 1 due to

λ = 1 / 2

being relatively close to zero. This causes that

{\hat{λ}}_{1}

and

{\hat{λ}}_{3}

have smaller variance (although greater bias) compared to

{\hat{λ}}_{5}

(see Figure 6). In order to present this effect we have calculated RMSE for simulations in same scenario but with

λ = 3 / 2

(see Table 2).

{\hat{λ}}_{5}

provides smaller RMSE than the other estimators in this setting.

7. Conclusions

Three new estimators were defined and studied:

The least-squares estimator from exact solution ( ${\hat{λ}}_{3}$ ), which improves the popular discretized LSE ( ${\hat{λ}}_{1}$ ) by eliminating the discretization error. It is easy to implement, since it can be calculated by a closed formula. However, it fails to be strongly consistent in long-span regime.
The asymptotic least-squares estimator ( ${\hat{λ}}_{4}$ ), which is a modification of ${\hat{λ}}_{3}$ with respect to its asymptotic behavior. In result, ${\hat{λ}}_{4}$ is strongly consistent in the long-span regime and behaves similarly to the well-established discrete ergodic estimator ( ${\hat{λ}}_{2}$ ). The advantage of ${\hat{λ}}_{4}$ is that it does not require a priori knowledge of the volatility $σ$ . On the other hand, its implementation includes a root-finding numerical procedure.
The conditional least-squares estimator ( ${\hat{λ}}_{5}$ ), which eliminates the bias in the least-squares procedure by considering the conditional expectation of the response as the explanatory variable. The possibility to express the conditional expectation explicitly makes this approach feasible. This conditioning idea (which is new in the context of the models with fractional noise, to our best knowledge) provides exceptionally reliable estimator, which outperforms all the other studied estimators. We proved the strong consistency (in long-span regime) of this estimator. The implementation comprises solving an optimization problem.

These new estimating procedures can help practitioners or scientists from various fields to improve the calibration of their models based on available data with autocorrelated noise (these are typically observed/measured in discrete time instants) and, consequently, obtain more reliable conclusions from the calibrated models.

An interesting future extension would certainly be to explore the potential of the promising idea of the conditioning within least-squares procedure to more general models and settings (including d-dimensional fOU, fOU with

H < 1 / 2

, non-linear drift, multiplicative noise, etc.).

Author Contributions

Conceptualization, P.K.; methodology, P.K.; software, L.S.; writing—original draft, P.K. and L.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the grant LTAIN19007 Development of Advanced Computational Algorithms for Evaluating Post-surgery Rehabilitation.

Acknowledgments

We are grateful to four anonymous reviewers for their valuable comments, which helped improve this paper significantly.

Conflicts of Interest

The authors declare no conflict of interest.

References

Mishura, Y. Stochastic Calculus for Fractional Brownian Motion and Related Processes; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
Biagini, F.; Hu, Y.; Øksendal, B.; Zhang, T. Stochastic Calculus for Fractional Brownian Motion and Applications; Springer: London, UK, 2008. [Google Scholar]
Abundo, M.; Pirozzi, E. On the Integral of the Fractional Brownian Motion and Some Pseudo-Fractional Gaussian Processes. Mathematics 2019, 7, 991. [Google Scholar] [CrossRef] [Green Version]
Ascione, G.; Mishura, Y.; Pirozzi, E. Fractional Ornstein–Uhlenbeck Process with Stochastic Forcing, and its Applications. Methodol. Comput. Appl. Probab. 2019. [Google Scholar] [CrossRef]
Hu, Y.; Nualart, D. Parameter estimation for fractional Ornstein–Uhlenbeck processes. Stat. Probab. Lett. 2010, 80, 1030–1038. [Google Scholar] [CrossRef] [Green Version]
Hu, Y.; Nualart, D.; Zhou, H. Parameter estimation for fractional Ornstein–Uhlenbeck processes of general Hurst parameter. Stat. Inference Stoch. Process. 2019, 22, 111–142. [Google Scholar] [CrossRef] [Green Version]
Es-Sebaiy, K. Berry-Esseen bounds for the least squares estimator for discretely observed fractional Ornstein–Uhlenbeck processes. Stat. Probab. Lett. 2013, 83, 2372–2385. [Google Scholar] [CrossRef] [Green Version]
Kubilius, K.; Mishura, Y.; Ralchenko, K.; Seleznjev, O. Consistency of the drift parameter estimator for the discretized fractional Ornstein–Uhlenbeck process with Hurst index H is an element of (0,1/2). Electron. J. Stat. 2015, 9, 1799–1825. [Google Scholar] [CrossRef]
Hu, Y.; Song, J. Parameter estimation for fractional Ornstein–Uhlenbeck processes with discrete observations. In Malliavin Calculus and Stochastic Analysis; Springer: Boston, MA, USA, 2013; Volume 34, pp. 427–442. [Google Scholar] [CrossRef] [Green Version]
Es-Sebaiy, K.; Viens, F. Optimal rates for parameter estimation of stationary Gaussian processes. Stoch. Process. Their. Appl. 2019, 129, 3018–3054. [Google Scholar] [CrossRef] [Green Version]
Azmoodeh, E.; Viitasaari, L. Parameter estimation based on discrete observations of fractional Ornstein–Uhlenbeck process of the second kind. Stat. Inference Stoch. Process. 2015, 18, 205–227. [Google Scholar] [CrossRef] [Green Version]
Neuenkirch, A.; Tindel, S. A least square-type procedure for parameter estimation in stochastic differential equations with additive fractional noise. Stat. Inference Stoch. Process. 2014, 17, 99–120. [Google Scholar] [CrossRef] [Green Version]
Xiao, W.; Zhang, W.; Xu, W. Parameter estimation for fractional Ornstein–Uhlenbeck processes at discrete observation. Appl. Math. Model. 2011, 35, 4196–4207. [Google Scholar] [CrossRef]
Istas, J.; Lang, G. Quadratic variations and estimation of the local Hölder index of a Gaussian process. Annales de l’I.H.P. Probabilités et Statistiques 1997, 33, 407–436. [Google Scholar] [CrossRef] [Green Version]
Coeurjolly, J. Hurst exponent estimation of locally self-similar Gaussian processes using sample quantiles. Ann. Stat. 2008, 36, 1404–1434. [Google Scholar] [CrossRef]
Rosenbaum, M. Estimation of the volatility persistence in a discretely observed diffusion model. Stoch. Process. Their. Appl. 2008, 118, 1434–1462. [Google Scholar] [CrossRef]
Berzin, C.; Latour, A.; León, J. Inference on the Hurst Parameter and Variance of Diffusions Driven by Fractional Brownian Motion; Springer International Publishing: Cham, Switzerland, 2014. [Google Scholar]
Brouste, A.; Iacus, S. Parameter estimation for the discretely observed fractional Ornstein–Uhlenbeck process and the Yuima R package. Comput. Stat. 2013, 28, 1529–1547. [Google Scholar] [CrossRef] [Green Version]
Brouste, A.; Fukasawa, M.; Hino, H.; Iacus, S.; Kamatani, K.; Koike, Y.; Masuda, H.; Nomura, R.; Ogihara, T.; Shimuzu, Y.; et al. The YUIMA Project: A Computational Framework for Simulation and Inference of Stochastic Differential Equations. J. Stat. Softw. 2014, 4, 1–51. [Google Scholar] [CrossRef]
Kubilius, K.; Mishura, Y.; Ralchenko, K. Parameter Estimation in Fractional Diffusion Models; Springer International Publishing AG: Cham, Switzerland, 2017. [Google Scholar]
Pipiras, V.; Taqqu, M. Integration questions related to fractional Brownian motion. Probab. Theory Relat. Fields 2000, 118, 251–291. [Google Scholar] [CrossRef]
Kříž, P.; Maslowski, B. Central limit theorems and minimum-contrast estimators for linear stochastic evolution equations. Stochastics 2019, 91, 1109–1140. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Two single trajectories of fOU with different values of λ, where σ = 2, x₀ = 20, T = 20, H = 0.6.

Figure 2. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

and

{\hat{λ}}_{3}

for 100 trajectories, where

H = 0.6

,

x_{0} = 100

,

T = 10

,

h = 0.1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Figure 2. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

and

{\hat{λ}}_{3}

for 100 trajectories, where

H = 0.6

,

x_{0} = 100

,

T = 10

,

h = 0.1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Figure 3. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

,

{\hat{λ}}_{3}

and

{\hat{λ}}_{4}

for 100 trajectories, where

H = 0.6

,

x_{0} = 0

,

T = 1000

,

h = 1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Figure 3. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

,

{\hat{λ}}_{3}

and

{\hat{λ}}_{4}

for 100 trajectories, where

H = 0.6

,

x_{0} = 0

,

T = 1000

,

h = 1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Figure 4. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

,

{\hat{λ}}_{3}

and

{\hat{λ}}_{4}

for 100 trajectories, where

H = 0.6

,

x_{0} = 100

,

T = 10

,

h = 0.1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Figure 4. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

,

{\hat{λ}}_{3}

and

{\hat{λ}}_{4}

for 100 trajectories, where

H = 0.6

,

x_{0} = 100

,

T = 10

,

h = 0.1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Figure 5. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

,

{\hat{λ}}_{3}

,

{\hat{λ}}_{4}

and

{\hat{λ}}_{5}

for 100 trajectories, where

H = 0.75

,

x_{0} = 100

,

T = 1000

,

h = 1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Figure 5. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

,

{\hat{λ}}_{3}

,

{\hat{λ}}_{4}

and

{\hat{λ}}_{5}

for 100 trajectories, where

H = 0.75

,

x_{0} = 100

,

T = 1000

,

h = 1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Figure 6. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

,

{\hat{λ}}_{3}

,

{\hat{λ}}_{4}

and

{\hat{λ}}_{5}

for 100 trajectories, where

H = 0.6

,

x_{0} = 0

,

T = 10

,

h = 0.1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Figure 6. Comparison of

{\hat{λ}}_{1}

,

{\hat{λ}}_{2}

,

{\hat{λ}}_{3}

,

{\hat{λ}}_{4}

and

{\hat{λ}}_{5}

for 100 trajectories, where

H = 0.6

,

x_{0} = 0

,

T = 10

,

h = 0.1

,

σ = 2

; the horizontal line shows the true value of the estimated parameter

λ = 0.5

.

Table 1. Root mean square errors of the studied estimators calculated using 100 numerical simulations with

λ = 1 / 2

.

Table 1. Root mean square errors of the studied estimators calculated using 100 numerical simulations with

λ = 1 / 2

.

$x_{0}$	T	H	${\hat{λ}}_{1}$	${\hat{λ}}_{2}$	${\hat{λ}}_{3}$	${\hat{λ}}_{4}$	${\hat{λ}}_{5}$
0	1000	0.6	0.216161	0.0364895	0.167108	0.0464931	0.0462385
0	1000	0.75	0.350571	0.0484904	0.33802	0.0637274	0.0643273
0	1000	0.9	0.444239	0.120954	0.442423	0.174213	0.170646
100	1000	0.6	0.158561	0.237689	0.0840923	0.125846	0.0297993
100	1000	0.75	0.244412	0.160116	0.205152	0.358587	0.0497393
100	1000	0.9	0.326662	0.105573	0.309256	1.17871	0.0454533
0	10	0.6	0.333834	0.513897	0.347278	0.539418	0.488992
0	10	0.75	0.439545	0.590848	0.438978	0.608744	0.552812
0	10	0.9	0.528586	0.841335	0.527932	0.840721	0.635887
100	10	0.6	0.0259539	0.49369	0.024092	0.441152	0.0277489
100	10	0.75	0.031949	0.480362	0.029853	1.53454	0.0411947
100	10	0.9	0.0398104	0.457155	0.0376366	4.54265	0.0291719

Table 2. Root mean square errors of the studied estimators calculated using 100 numerical simulations with

λ = 3 / 2

.

Table 2. Root mean square errors of the studied estimators calculated using 100 numerical simulations with

λ = 3 / 2

.

T	H	${\hat{λ}}_{1}$	${\hat{λ}}_{2}$	${\hat{λ}}_{3}$	${\hat{λ}}_{4}$	${\hat{λ}}_{5}$
10	0.6	0.68946	0.641613	0.686183	0.761677	0.399386
10	0.75	1.11793	0.761793	1.10949	0.877707	0.457095
10	0.9	1.38463	1.21928	1.38299	1.52812	0.504919

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kříž, P.; Szała, L. Least-Squares Estimators of Drift Parameter for Discretely Observed Fractional Ornstein–Uhlenbeck Processes. Mathematics 2020, 8, 716. https://doi.org/10.3390/math8050716

AMA Style

Kříž P, Szała L. Least-Squares Estimators of Drift Parameter for Discretely Observed Fractional Ornstein–Uhlenbeck Processes. Mathematics. 2020; 8(5):716. https://doi.org/10.3390/math8050716

Chicago/Turabian Style

Kříž, Pavel, and Leszek Szała. 2020. "Least-Squares Estimators of Drift Parameter for Discretely Observed Fractional Ornstein–Uhlenbeck Processes" Mathematics 8, no. 5: 716. https://doi.org/10.3390/math8050716

APA Style

Kříž, P., & Szała, L. (2020). Least-Squares Estimators of Drift Parameter for Discretely Observed Fractional Ornstein–Uhlenbeck Processes. Mathematics, 8(5), 716. https://doi.org/10.3390/math8050716

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Least-Squares Estimators of Drift Parameter for Discretely Observed Fractional Ornstein–Uhlenbeck Processes

Abstract

1. Introduction

2. Preliminaries

3. Least-Squares Estimator from Exact Solution

4. Asymptotic Least-Squares Estimator

5. Conditional Least-Squares Estimator

6. Results and Discussion

7. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI