Arbitrary-Order Finite-Time Corrections for the Kramers–Moyal Operator

Rydin Gorjão, Leonardo; Witthaut, Dirk; Lehnertz, Klaus; Lind, Pedro G.

doi:10.3390/e23050517

¹ Forschungszentrum Jülich, Institute for Energy and Climate Research-Systems Analysis and Technology Evaluation (IEK-STE), 52428 Jülich, Germany

² Institute for Theoretical Physics, University of Cologne, 50937 Köln, Germany

³ Department of Epileptology, University Hospital Bonn, Venusberg Campus 1, 53127 Bonn, Germany

⁴ Helmholtz-Institute for Radiation and Nuclear Physics, University of Bonn, Nussallee 14–16, 53115 Bonn, Germany

⁵ Interdisciplinary Center for Complex Systems, University of Bonn, Brühler Straße 7, 53175 Bonn, Germany

⁶ Department of Computer Science, OsloMet – Oslo Metropolitan University, P.O. Box 4 St. Olavs plass, N-0130 Oslo, Norway

* Author to whom correspondence should be addressed.

Entropy 2021, 23(5), 517; https://doi.org/10.3390/e23050517

Submission received: 25 March 2021 / Revised: 15 April 2021 / Accepted: 20 April 2021 / Published: 24 April 2021

(This article belongs to the Special Issue From Time Series to Stochastic Dynamic Models)

Abstract

With the aim of improving the reconstruction of stochastic evolution equations from empirical time-series data, we derive a full representation of the generator of the Kramers–Moyal operator via a power-series expansion of the exponential operator. This expansion is necessary for deriving the different terms in a stochastic differential equation. With the full representation of this operator, we are able to separate finite-time corrections of the power-series expansion of arbitrary order into terms with and without derivatives of the Kramers–Moyal coefficients. We arrive at a closed-form solution expressed through conditional moments, which can be extracted directly from time-series data with finite sampling intervals. We provide all finite-time correction terms for parametric and non-parametric estimation of the Kramers–Moyal coefficients for discontinuous processes, which can be easily implemented, employing Bell polynomials, in time-series analyses of stochastic processes. With exemplary cases of insufficiently sampled diffusion and jump-diffusion processes, we demonstrate the advantages of our arbitrary-order finite-time corrections and their impact in distinguishing diffusion and jump-diffusion processes strictly from time-series data.

Keywords:

stochastic processes; Kramers–Moyal equation; Kramers–Moyal coefficients; Fokker–Planck equation; arbitrary-order approximations; non-parametric estimators; Bell polynomials

1. Introduction

The reconstruction of stochastic evolution equations from time-series data in terms of the Langevin equation and the corresponding Fokker–Planck equation is often challenged by the inevitably finite temporal sampling of time-series data. Moreover, the Fokker–Planck equation is restricted to continuous stochastic processes, i.e., diffusion, and thus cannot adequately describe discontinuous transitions in time-series data. A more general description of continuous and discontinuous stochastic processes can be constructed using the Kramers–Moyal equation [1,2] given by

\frac{\partial}{\partial τ} p (x, t + τ | x^{'}, t) = \sum_{n = 1}^{\infty} {(- \frac{\partial}{\partial x})}^{n} D_{n} (x) p (x, t + τ | x^{'}, t),

of which the Fokker–Planck equation is a particular case (

D_{n} \equiv 0

for

n > 2

). The Kramers–Moyal equation serves as a stepping stone to adequately describe time-series data with both diffusive and discontinuous characteristics, but it is nevertheless challenged by finite-time sampling in real-world data. Recent applications of the Kramers–Moyal equation include brain [3,4] and heart dynamics [5], stochastic harmonic oscillators [6], renewable-energy generation [7], solar irradiance [8], turbulence [9], nano-scale friction [10], and X-ray imaging [11,12].

Previous work has demonstrated that a finite sampling interval

Δ t

not only influences the first- and second-order Kramers–Moyal (KM) coefficients [13] but also causes non-vanishing, higher-order (>2) coefficients [4,14,15,16,17,18,19,20]. A more recent example is the jump-diffusion process discussed in reference [3]

d X_{t} = a (X_{t}, t) d t + b (X_{t}, t) d W (t) + ξ (X_{t}, t) d J (t),

(1)

where

a (X_{t}, t)

is a drift function,

b (X_{t}, t)

is the diffusion associated with an uncorrelated Brownian motion or Wiener process

W (t)

,

J (t)

is a Poisson process with jump rate

λ

, independent of

W (t)

, and

ξ (X_{t}, t)

is Gaussian-distributed

N (0, s)

with zero mean and variance s. For such jump-diffusion processes, additional influences of the finite temporal sampling need to be taken into account. As shown in reference [8], jump events produce terms of order

O (Δ t)

in the KM coefficients of even orders and the jump rate and amplitude induce terms of order

O (Δ t^{2})

in all coefficients. These influences are heightened for bivariate jump-diffusion processes [21], since terms of order

O (Δ t^{i}), i \geq 3

impact higher-order (≥4) coefficients [22].

Although most of the aforementioned studies reported on finite-time corrections for KM coefficients and/or conditional moments of various orders, we still lack an explicit arbitrary-order correction or a closed-form solution in which the conditional moments are represented as functions of the KM coefficients and vice versa. In this article, we derive a full expansion of the generator of the Kramers–Moyal operator in exponential form for one-dimensional Markovian processes. This is equivalent to van Kampen’s system-size expansion, which is taken over a finite time interval

τ

[23,24]. The derivations presented henceforth are generally applicable to Markovian diffusion as well as jump-diffusion processes.

On a more general level, our solution is an explicit approximate solution of the Kramers–Moyal equation [1,2], which generalises the Fokker–Planck equation for discontinuous processes [13,23,25]. Our approximation of the Kramers–Moyal operator can be taken to arbitrary order. In particular, we focus on the solution of this partial differential equation by representing the Kramers–Moyal operator in an exponential form and equating the conditional moments with the KM coefficients after representing the exponential operator as a power series. This representation of the exponential operator can similarly be used in other problems with an equivalent formulation [26,27,28] or similar discontinuous stochastic processes with different jump distributions, e.g., the Gamma distribution [29,30].

2. Mathematical Background

The Fokker–Planck(–Kolmogorov) equation (also known as the Kolmogorov forward equation or Smoluchowski equation), well known in physics and mathematics, describes the propagation in time and space of any diffusion (thus continuous) process. For the conditional probability density

p (x, t + τ | x^{'}, t)

, it is given by [31]

\frac{\partial}{\partial τ} p (x, t + τ | x^{'}, t) = [- \frac{\partial}{\partial x} D_{1} (x, t) + \frac{\partial^{2}}{\partial x^{2}} D_{2} (x, t)] p (x, t + τ | x^{'}, t) .

(2)

We restrict our investigation to stationary processes, hence

D_{n} (x, t) = D_{n} (x)

. Equation (2) describes the evolution of, for instance, a Brownian particle (for the case

D_{1} (x) = 0

), which results in the known heat equation, or more complicated Markovian motions with drift. Here, one recognises the function

D_{1} (x)

, the first KM coefficient, commonly denoted as drift, and the function

D_{2} (x)

, the second KM coefficient, commonly denoted as diffusion or volatility. The Fokker–Planck equation is, nevertheless, only valid for continuous motions and thus cannot describe jump-diffusion processes as in the case in Equation (1) or other stochastic motions with discontinuous paths.

A more general equation—the so-called Kramers–Moyal equation—takes higher-order KM coefficients

D_{n} (x), n \in N

into account

\frac{\partial}{\partial τ} p (x, t + τ | x^{'}, t) = L_{KM} p (x, t + τ | x^{'}, t),

(3)

where

L_{KM}

denotes the Kramers–Moyal operator defined as the power series [1,2]

L_{KM} = \sum_{n = 1}^{\infty} {(- \frac{\partial}{\partial x})}^{n} D_{n} (x),

which we will subsequently solve for

τ

and an appropriate starting condition by exponentiating the Kramers–Moyal operator

L_{KM}

.

When examining a stochastic process in terms of time-series data, there is no direct access to the KM coefficients

D_{n} (x)

but rather to the conditional moments of the data. The

n

th-order conditional moment

M_{n} (x, τ)

is given by

M_{n} (x^{'}, τ) = \int_{- \infty}^{\infty} {(x - x^{'})}^{n} p (x, t + τ | x^{'}, t) d x .

(4)

The KM coefficients

D_{n} (x)

can be retrieved from the conditional moments

M_{n} (x, τ)

via

D_{n} (x) = \frac{1}{n!} lim_{τ \to 0} \frac{M_{n} (x, τ)}{τ} .

When dealing with real-world data, we do not have access to infinite temporal resolution, meaning that the above limit

τ \to 0

is not possible. A best-case scenario is to analyse the smallest possible temporal differences. If the data are sampled at

Δ t

time steps, take

D_{n} (x) = \frac{1}{n!} \frac{1}{Δ t} M_{n} (x, Δ t) .

In order to non-parametrically retrieve the conditional moments

M_{n} (x, τ)

from data, a set of histogram or Nadaraya–Watson estimators can be utilised (see Refs. [29,32] for details). Here, we will focus not on how to estimate the conditional moments but rather on how to derive a set of finite-time corrections to estimate the KM coefficients from conditional moments. These can be retrieved from data with software packages like kramersmoyal [33] or JumpDiff [34] in Python or Langevin [35] in R.
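As a rough illustration of the finite-Δt estimate above, the following sketch bins a time series by its starting value and averages powers of the increments within each bin. It is a plain histogram estimator written with NumPy only; the names `conditional_moments` and `km_first_order` and their arguments are hypothetical and are not the interface of kramersmoyal, JumpDiff, or Langevin.

```python
import numpy as np

def conditional_moments(x, bin_edges, max_order):
    """Histogram estimate of M_n(x', dt) for n = 1..max_order.

    Returns an array of shape (max_order, n_bins); bins that are never
    visited are filled with NaN.
    """
    x = np.asarray(x, dtype=float)
    increments = x[1:] - x[:-1]              # x(t + dt) - x(t)
    # Bin index of the starting point x(t) of every increment.
    idx = np.digitize(x[:-1], bin_edges) - 1
    n_bins = len(bin_edges) - 1
    moments = np.full((max_order, n_bins), np.nan)
    for b in range(n_bins):
        inc = increments[idx == b]
        if inc.size == 0:
            continue
        for n in range(1, max_order + 1):
            moments[n - 1, b] = np.mean(inc ** n)
    return moments

def km_first_order(moments, dt):
    """Uncorrected estimate D_n ~ M_n / (n! dt) for every bin."""
    orders = np.arange(1, moments.shape[0] + 1)
    factorials = np.cumprod(orders).astype(float)   # 1!, 2!, 3!, ...
    return moments / (factorials[:, None] * dt)
```

A kernel-based (Nadaraya–Watson) estimator would replace the hard bin assignment with smooth weights; the histogram version is used here only because it is the shortest to write down.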

3. The Formal Solution of the Kramers–Moyal Equation and Its Approximations

First, we explicitly derive the corrective terms and subsequently link these to the results in reference [36], connecting them to the relation between statistical cumulants and moments [13].

Let us assume that a well-defined initial state of the Kramers–Moyal equation is given by

δ (x - x^{'})

. The formal solution of the time-dependent Kramers–Moyal equation (3) is given by

p (x, t + τ | x^{'}, t) = exp (τ L_{KM}) δ (x - x^{'}) = \sum_{k = 0}^{\infty} \frac{{(τ L_{KM})}^{k}}{k!} δ (x - x^{'}),

(5)

where

p (x, t + τ | x^{'}, t)

is a normalisable function, such that

\int_{- \infty}^{\infty} p (x, t + τ | x^{'}, t) d x = 1, \forall (t, τ)

. We will now proceed to show the first-, second-, third-order, and arbitrary-order approximation to the solution of this partial differential equation with this particular initial condition.

3.1. The First- and Second-Order Approximations

The first-order approximation of the formal solution of Equation (5) is given by

p (x, t + τ | x^{'}, t) = exp (τ L_{KM}) δ (x - x^{'}) = [1 + τ L_{KM} + O (τ^{2})] δ (x - x^{'}),

yielding for the conditional moments

M_{n} (x^{'}, τ)

in Equation (4)

\begin{matrix} M_{n} (x^{'}, τ) & ≃ M_{n}^{[1]} (x^{'}, τ) = \int_{- \infty}^{\infty} {(x - x^{'})}^{n} [1 + τ L_{KM}] δ (x - x^{'}) d x \\ = \int_{- \infty}^{\infty} {(x - x^{'})}^{n} δ (x - x^{'}) d x + τ \int_{- \infty}^{\infty} {(x - x^{'})}^{n} \sum_{m = 1}^{\infty} {(- \frac{\partial}{\partial x})}^{m} D_{m} (x) δ (x - x^{'}) d x \\ = 0 + τ \sum_{m = 1}^{\infty} {(- 1)}^{m} \int_{- \infty}^{\infty} D_{m} (x) [{(- \frac{\partial}{\partial x})}^{m} {(x - x^{'})}^{n}] δ (x - x^{'}) d x \\ = τ \sum_{m = 1}^{\infty} \int_{- \infty}^{\infty} D_{m} (x) \frac{n!}{(n - m)!} {(x - x^{'})}^{n - m} δ (x - x^{'}) d x \\ = τ \sum_{m = 1}^{\infty} D_{m} (x^{'}) \frac{n!}{(n - m)!} δ_{n, m} \\ = τ (n!) D_{n} (x^{'}), \end{matrix}

where the large square brackets indicate that the differentiation is limited to the terms within the brackets. The superscript

[1]

indicates the order of approximation.

The second-order approximation is obtained in a similar fashion, now including the quadratic term from the exponential representation Equation (5), i.e.,

\begin{matrix} p (x, t + τ | x^{'}, t) & = [1 + τ L_{KM} + \frac{τ^{2}}{2} L_{KM} L_{KM} + O (τ^{3})] δ (x - x^{'}) . \end{matrix}

To alleviate the notation, we refer to the KM coefficient without explicit state dependencies, i.e.,

D_{n}

. The second-order approximation

M_{n}^{[2]} (x^{'}, τ)

of the n-th conditional moment in Equation (4) reads

\begin{matrix} M_{n}^{[2]} (x^{'}, τ) & = \int_{- \infty}^{\infty} {(x - x^{'})}^{n} [1 + τ L_{KM} + \frac{τ^{2}}{2} L_{KM} L_{KM}] δ (x - x^{'}) d x \\ = M_{n}^{[1]} (x^{'}, τ) + \frac{τ^{2}}{2} \int_{- \infty}^{\infty} {(x - x^{'})}^{n} \sum_{p = 1}^{\infty} {(- \frac{\partial}{\partial x})}^{p} D_{p} \sum_{m = 1}^{\infty} {(- \frac{\partial}{\partial x})}^{m} D_{m} δ (x - x^{'}) d x \\ = M_{n}^{[1]} (x^{'}, τ) + \frac{τ^{2}}{2} \sum_{p, m = 1}^{\infty} \int_{- \infty}^{\infty} {(x - x^{'})}^{n} {(- \frac{\partial}{\partial x})}^{p} D_{p} {(- \frac{\partial}{\partial x})}^{m} D_{m} δ (x - x^{'}) d x \\ = M_{n}^{[1]} (x^{'}, τ) + \frac{τ^{2}}{2} \sum_{p, m = 1}^{\infty} \int_{- \infty}^{\infty} \frac{n!}{(n - p)!} {(x - x^{'})}^{n - p} D_{p} {(- \frac{\partial}{\partial x})}^{m} D_{m} δ (x - x^{'}) d x \\ = M_{n}^{[1]} (x^{'}, τ) + \frac{τ^{2}}{2} \sum_{p, m = 1}^{\infty} \int_{- \infty}^{\infty} \frac{n! (n - p)! {(x - x^{'})}^{n - p - m}}{(n - p)! (n - p - m)!} D_{p} D_{m} δ (x - x^{'}) d x \\ + \frac{τ^{2}}{2} \sum_{p, m = 1}^{\infty} \sum_{s = 0}^{m - 1} \int_{- \infty}^{\infty} \frac{n! {(x - x^{'})}^{n - p - s}}{(n - p - s)!} (\binom{m}{s}) [{(- \frac{\partial}{\partial x})}^{m - s} D_{p}] D_{m} δ (x - x^{'}) d x . \end{matrix}

The first integral is only non-vanishing if

n - p - m = 0

and the second integral is only non-vanishing if

n - p - s = 0

, with

s < n

. Hence,

\begin{matrix} M_{n}^{[2]} (x^{'}, τ) & = M_{n}^{[1]} (x^{'}, τ) + \frac{τ^{2}}{2} (n!) \sum_{m = 1}^{n - 1} D_{n - m} (x^{'}) D_{m} (x^{'}) \\ + \frac{τ^{2}}{2} (n!) \sum_{s = 0}^{n - 1} \sum_{m = s + 1}^{\infty} (\binom{m}{s}) [{(\frac{\partial}{\partial x^{'}})}^{m - s} D_{n - s} (x^{'})] D_{m} (x^{'}) . \end{matrix}

Separating the terms between those with explicit derivatives of the KM coefficients and those without, it is immediately clear that the second-order approximation follows a structure given by the partial ordinary Bell polynomials

{\hat{B}}_{n, m}

[37]

{\hat{B}}_{n, m} (x_{1}, x_{2}, \dots, x_{n - m + 1}) = \sum \frac{m!}{j_{1}! j_{2}! \dots j_{n - m + 1}!} x_{1}^{j_{1}} x_{2}^{j_{2}} \dots x_{n - m + 1}^{j_{n - m + 1}} .

(6)

where the summation is taken over

j_{1}, \dots, j_{n - m + 1} \in \{0, 1, 2, \dots, n - m + 1\}

such that

\sum_{r = 1}^{n - m + 1} j_{r} = m \quad \text{and} \quad \sum_{r = 1}^{n - m + 1} r j_{r} = n .

(7)
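For readers who want to evaluate Equation (6) numerically, a minimal sketch follows. Rather than enumerating the index sets of Equation (7), it uses the standard recurrence that peels off one factor at a time, which generates the same terms. The function name `ordinary_bell` and the zero-based list convention are choices made for this illustration only.

```python
from functools import lru_cache

def ordinary_bell(n, m, x):
    """Partial ordinary Bell polynomial B^_{n,m}(x_1, ..., x_{n-m+1}).

    `x` is a sequence with x[0] = x_1, x[1] = x_2, and so on; it must
    hold at least n - m + 1 entries.
    """
    x = tuple(x)

    @lru_cache(maxsize=None)
    def rec(n, m):
        if n == 0 and m == 0:
            return 1
        if n <= 0 or m <= 0:
            return 0
        # Choose the index i of one factor x_i; the remaining m - 1
        # factors must have indices summing to n - i.
        return sum(x[i - 1] * rec(n - i, m - 1)
                   for i in range(1, n - m + 2))

    return rec(n, m)
```

For instance, `ordinary_bell(4, 2, [x1, x2, x3])` evaluates 2 x1 x3 + x2^2, the three ordered ways of writing 4 as a sum of two positive indices.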

The first- and second-order approximations can be written with the help of the partial ordinary Bell polynomials with

m = 1

and

m = 2

, respectively,

\begin{matrix} M_{n}^{[1]} (x^{'}, τ) & = (n!) τ {\hat{B}}_{n, 1} (D_{1}, \dots, D_{n}), \\ M_{n}^{[2]} (x^{'}, τ) & = (n!) [τ {\hat{B}}_{n, 1} (D_{1}, \dots, D_{n}) + \frac{τ^{2}}{2} {\hat{B}}_{n, 2} (D_{1}, \dots, D_{n - 1}) + Φ_{n}^{[2]}], \end{matrix}

(8)

where

Φ_{n}^{[2]}

incorporates all derivatives of the KM coefficients from the second-order corrections, and is given by

\begin{matrix} Φ_{n}^{[2]} = \frac{τ^{2}}{2} \sum_{s = 0}^{n - 1} \sum_{m = s + 1}^{\infty} (\binom{m}{s}) [{(\frac{\partial}{\partial x^{'}})}^{m - s} D_{n - s} (x^{'})] D_{m} (x^{'}) . \end{matrix}

To simplify the description, we introduce a short-hand notation in which the superscript

(m)

on a KM coefficient denotes its m-th derivative:

D_{p}^{(m)} (x^{'}) = {(\frac{\partial}{\partial x^{'}})}^{m} D_{p} (x^{'})

.

These results are in line with those reported for diffusion-type processes [16,17,18,19,38], where the Kramers–Moyal operator

L_{KM} = L_{FP}

reduces to the Fokker–Planck operator and we are solely left with the first two KM coefficients, as in Equation (2). In particular, applying the second-order approximation in Equation (8) to the two first KM coefficients results in

\begin{matrix} M_{1}^{[2]} & = τ D_{1} + \frac{τ^{2}}{2} \sum_{m = 1}^{\infty} D_{m} D_{1}^{(m)}, \\ M_{2}^{[2]} & = 2 τ D_{2} + τ^{2} D_{1}^{2} + τ^{2} [\sum_{m = 1}^{\infty} D_{m} D_{2}^{(m)} + \sum_{m = 2}^{\infty} m D_{m} D_{1}^{(m - 1)}], \end{matrix}

and truncating the sums at second order yields the expressions in reference [16].
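One way to sanity-check these two expressions is the Ornstein–Uhlenbeck process dX = -a X dt + b dW, for which D_1 = -a x, D_2 = b^2/2 and the exact conditional moments are known in closed form. The sketch below is an illustration under that assumption, with hypothetical function names; it compares the exact moments with the second-order approximations, which should differ only at order τ^3.

```python
import math

def ou_exact_moments(x, tau, a, b):
    """Exact first and second conditional moments of dX = -a X dt + b dW."""
    decay = math.exp(-a * tau)
    m1 = x * (decay - 1.0)
    var = b**2 / (2.0 * a) * (1.0 - math.exp(-2.0 * a * tau))
    return m1, m1**2 + var

def ou_second_order_moments(x, tau, a, b):
    """Second-order approximations M_1^[2] and M_2^[2] for the same process.

    With D_1 = -a x and D_2 = b^2 / 2, the only non-zero derivative is
    D_1^(1) = -a, so the infinite sums collapse to single terms.
    """
    d1, d1_prime, d2 = -a * x, -a, 0.5 * b**2
    m1 = tau * d1 + 0.5 * tau**2 * d1 * d1_prime
    m2 = 2.0 * tau * d2 + tau**2 * d1**2 + tau**2 * (2.0 * d2 * d1_prime)
    return m1, m2
```

The first-order estimate 2 τ D_2 of the second moment misses both the τ^2 D_1^2 term and the derivative term, so it is visibly worse at the same τ.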

3.2. The Third-Order Approximation

Before we introduce the general formalism for the arbitrary-order approximation, we explicitly derive the third-order approximation

p (x, t + τ | x^{'}, t) = [1 + τ L_{KM} + \frac{τ^{2}}{2} L_{KM} L_{KM} + \frac{τ^{3}}{6} L_{KM} L_{KM} L_{KM} + O (τ^{4})] δ (x - x^{'}),

which leads to

\begin{matrix} M_{n}^{[3]} (x^{'}, τ) = & M_{n}^{[2]} (x^{'}, τ) \\ + \frac{τ^{3}}{6} \sum_{q, p, m = 1}^{\infty} \int_{- \infty}^{\infty} {(x - x^{'})}^{n} {(- \frac{\partial}{\partial x})}^{q} D_{q} {(- \frac{\partial}{\partial x})}^{p} D_{p} {(- \frac{\partial}{\partial x})}^{m} D_{m} δ (x - x^{'}) d x \\ = & M_{n}^{[2]} (x^{'}, τ) \\ + \frac{τ^{3}}{6} \sum_{q, p, m = 1}^{\infty} \int_{- \infty}^{\infty} \frac{n!}{(n - q - p - m)!} {(x - x^{'})}^{n - q - p - m} D_{q} D_{p} D_{m} δ (x - x^{'}) d x \\ + \frac{τ^{3}}{6} \sum_{q = 1}^{n} \sum_{p, m = 1}^{\infty} \sum_{s = 0}^{p} \int_{- \infty}^{\infty} \frac{n! {(x - x^{'})}^{n - q - s}}{(n - q - s)!} (\binom{p}{s}) [{(- \frac{\partial}{\partial x})}^{p - s} D_{q}] \\ \times D_{p} {(- \frac{\partial}{\partial x})}^{m} D_{m} δ (x - x^{'}) d x \\ = & M_{n}^{[2]} (x^{'}, τ) \\ + \frac{τ^{3}}{6} \sum_{q, p, m = 1}^{\infty} \int_{- \infty}^{\infty} \frac{n!}{(n - q - p - m)!} {(x - x^{'})}^{n - q - p - m} D_{q} D_{p} D_{m} δ (x - x^{'}) d x \\ + \frac{τ^{3}}{6} \sum_{q = 1}^{n} \sum_{p, m = 1}^{\infty} \sum_{s = 0}^{p} \sum_{k = 0}^{m} \sum_{r = 0}^{m - k} \int_{- \infty}^{\infty} \frac{n! {(x - x^{'})}^{n - q - s - k}}{(n - q - s - k)!} (\binom{p}{s}) (\binom{m}{k}) (\binom{m - k}{r}) \\ \times [{(- \frac{\partial}{\partial x})}^{p - s + r} D_{q}] [{(- \frac{\partial}{\partial x})}^{m - k - r} D_{p}] D_{m} δ (x - x^{'}) d x . \end{matrix}

Notice that the first integral is only non-vanishing for the combination

q + p + m = n

, which can again be expressed via the partial ordinary Bell polynomial

{\hat{B}}_{n, m}

, where

m = 3

, for the third-order approximation. The second expression requires

q + s + k = n

as well as

p + r \neq s \land m - r \neq k

. Separating these again into two expressions, one with and another without derivatives, we can express the third-order approximation as

\begin{matrix} M_{n}^{[3]} (x^{'}, τ) & = (n!) [τ {\hat{B}}_{n, 1} (D_{1}, \dots, D_{n}) + \frac{τ^{2}}{2} {\hat{B}}_{n, 2} (D_{1}, \dots, D_{n - 1}) + Φ_{n}^{[2]} \\ + \frac{τ^{3}}{6} {\hat{B}}_{n, 3} (D_{1}, \dots, D_{n - 2}) + Φ_{n}^{[3]}], \end{matrix}

(9)

where

Φ_{n}^{[3]}

incorporates all derivatives of the KM coefficients from the third-order corrections

\begin{matrix} Φ_{1}^{[3]} & = \frac{τ^{3}}{6} \sum_{p, m = 1}^{\infty} \sum_{r = 0}^{m} (\binom{m}{r}) [{(- \frac{\partial}{\partial x})}^{p + r} D_{1} (x)] [{(- \frac{\partial}{\partial x})}^{m - r} D_{p} (x)] D_{m} (x) . \end{matrix}

(10)

Here, we compare our derivation to the derivation of the third-order approximation by Gottschall and Peinke [16]. Our derivation uses the general Kramers–Moyal operator, of which the Fokker–Planck operator is a special case. From Equation (9), we recover the expression for the Fokker–Planck operator reported in reference [16]. Since the Fokker–Planck operator is limited to second-order terms, i.e.,

D_{n} \equiv 0

for

n \geq 3

, the sum in Equation (10) can be expressed in full. For the first conditional moment

M_{1}^{[3]}

, we obtain the corrective terms

{\tilde{Φ}}_{1}^{[3]}

given by

\begin{matrix} {\tilde{Φ}}_{1}^{[3]} & = \frac{τ^{3}}{6} \sum_{p, m = 1}^{2} \sum_{r = 0}^{m} (\binom{m}{r}) [{(- \frac{\partial}{\partial x})}^{p + r} D_{1} (x)] [{(- \frac{\partial}{\partial x})}^{m - r} D_{p} (x)] D_{m} (x) \\ = \frac{τ^{3}}{6} [D_{1}^{(1)} D_{1}^{(1)} D_{1} + D_{1}^{(2)} D_{1} D_{1} + 3 D_{1}^{(1)} D_{1}^{(2)} D_{2} + 2 D_{1}^{(3)} D_{1} D_{2} \\ + D_{1}^{(2)} D_{2}^{(1)} D_{1} + D_{1}^{(2)} D_{2}^{(2)} D_{2} + 2 D_{1}^{(3)} D_{2}^{(1)} D_{2} + D_{1}^{(4)} D_{2} D_{2}], \end{matrix}

which is identical to Equation (A1) in the Appendix of reference [16]. Similarly, for the second conditional moment

M_{2}^{[3]}

, we obtain the corrective terms

{\tilde{Φ}}_{2}^{[3]}

\begin{matrix} {\tilde{Φ}}_{2}^{[3]} & = \frac{τ^{3}}{6} \sum_{p, m = 1}^{2} \sum_{r = 0}^{m} p (\binom{m}{r}) [{(- \frac{\partial}{\partial x})}^{p + r - 1} D_{1} (x)] [{(- \frac{\partial}{\partial x})}^{m - r} D_{p} (x)] D_{m} (x) \\ + \frac{τ^{3}}{6} \sum_{p, m = 1}^{2} \sum_{r = 0}^{m - 1} m (\binom{m - 1}{r}) [{(- \frac{\partial}{\partial x})}^{p + r} D_{1} (x)] [{(- \frac{\partial}{\partial x})}^{m - r - 1} D_{p} (x)] D_{m} (x) \\ + \frac{τ^{3}}{6} \sum_{p, m = 1}^{2} \sum_{r = 0}^{m} (\binom{m}{r}) [{(- \frac{\partial}{\partial x})}^{p + r} D_{2} (x)] [{(- \frac{\partial}{\partial x})}^{m - r} D_{p} (x)] D_{m} (x) \\ = \frac{τ^{3}}{3} [3 D_{1} D_{1}^{(1)} D_{1} + 7 D_{1}^{(2)} D_{1} D_{2} + 4 D_{1}^{(1)} D_{1}^{(1)} D_{2} + 3 D_{1}^{(1)} D_{2}^{(1)} D_{1} \\ + 4 D_{1}^{(1)} D_{2}^{(2)} D_{2} + 7 D_{1}^{(2)} D_{2}^{(1)} D_{2} + 4 D_{1}^{(3)} D_{2} D_{2} + D_{2}^{(2)} D_{1} D_{1} \\ + 2 D_{2}^{(3)} D_{2} D_{1} + D_{2}^{(2)} D_{2}^{(1)} D_{1} + D_{2}^{(2)} D_{2}^{(2)} D_{2} + 2 D_{2}^{(3)} D_{2}^{(1)} D_{2} + D_{2}^{(4)} D_{2} D_{2}] \end{matrix}

which is in agreement with Equation (A2) in the Appendix of reference [16]. A similar derivation can be found in Appendix B of reference [8], which also yields congruent findings for the first two conditional moments of jump-diffusion processes. However, no explicit expression for all terms is given in either publication.

As a simple rule of thumb, one can check whether a result is correct as follows: for every term, the sum of the orders of the KM coefficients minus the total number of derivatives must equal n, the order of the conditional moment being calculated. In the notation used in this work, the sum of subscripts minus the sum of superscripts must equal the order n of the conditional moment under investigation.
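The rule of thumb lends itself to a mechanical check. In the sketch below, each product of KM coefficients is encoded as a list of (subscript, superscript) pairs; this encoding and the function name `satisfies_order_rule` are assumptions made for the illustration.

```python
def satisfies_order_rule(term, n):
    """Check one product of KM coefficients against the rule of thumb.

    `term` is a list of (subscript, superscript) pairs, e.g.
    D_1^(2) D_2^(1) D_1  ->  [(1, 2), (2, 1), (1, 0)].
    Returns True when subscripts minus superscripts sum to n.
    """
    subscripts = sum(p for p, _ in term)
    superscripts = sum(m for _, m in term)
    return subscripts - superscripts == n
```

For the first conditional moment, D_1^(4) D_2 D_2 passes the check (1 + 2 + 2 - 4 = 1), whereas a mistyped D_1^(4) D_1 D_1 would fail it (1 + 1 + 1 - 4 = -1).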

3.3. Arbitrary-Order Approximation

We now derive the arbitrary-order corrections of the Kramers–Moyal operator. This is done by induction from the previous derivations, whilst disregarding any emerging terms with derivatives of the KM coefficients

M_{n}^{[m]} (x^{'}, τ) = \int_{- \infty}^{\infty} {(x - x^{'})}^{n} \sum_{k = 1}^{m} \frac{τ^{k}}{k!} L_{KM}^{k} δ (x - x^{'}) d x = n! \sum_{k = 1}^{m} [\frac{τ^{k}}{k!} \prod_{σ (k, n)}^{m} D_{σ (k, n)} + Φ_{n}^{[k]}],

with

σ (k, n)

a partition of a set of

k \in N

obeying Equation (7). This, in turn, is the same as a collection of partial Bell polynomials, namely

M_{n}^{[m]} (x^{'}, τ) = (n!) \sum_{k = 1}^{m} [\frac{τ^{k}}{k!} {\hat{B}}_{n, k} (D_{1}, D_{2}, \dots, D_{n - k + 1}) + Φ_{n}^{[k]}],

where we combine terms with derivatives in

Φ_{n}^{[k]}

. If we disregard the derivative terms, the summation has an upper bound, namely

m \leq n

. This is directly seen as the Bell polynomials are similarly bounded, and thus we arrive at

M_{n} (x^{'}, τ) = (n!) \sum_{k = 1}^{n} \frac{τ^{k}}{k!} {\hat{B}}_{n, k} (D_{1}, D_{2}, \dots, D_{n - k + 1}) .

(11)

neglecting the derivative terms

Φ

.

From the perspective of estimation, the aim is to determine the KM coefficients

D_{n} (x^{'})

. What we have derived so far, however, is a relation for the conditional moments

M_{n} (x^{'}, τ)

. As we now have an explicit relation in terms of partial Bell polynomials, we will invert the relation and express the KM coefficients

D_{n} (x^{'})

as functions of the conditional moments

M_{1} (x^{'}, τ), \dots, M_{n} (x^{'}, τ)

.

Note that the first conditional moment

M_{1} (x^{'}, τ)

is solely a function of the first KM coefficient

D_{1} (x^{'})

. The second conditional moment

M_{2} (x^{'}, τ)

is a function of the second KM coefficient

D_{2} (x^{'})

, and by substitution, a function of the first conditional moment

M_{1} (x^{'}, τ)

, given by Equation (11). Subsequently

M_{3} (x^{'}, τ)

is a function of

D_{3} (x^{'})

,

M_{2} (x^{'}, τ)

, and

M_{1} (x^{'}, τ)

. Thus, by recursively substituting the

n - 1

KM coefficients by their expressions via the conditional moments, we obtain a relation of

D_{n} (x^{'})

as a function of the

M_{n} (x^{'}, τ), M_{n - 1} (x^{'}, τ), \dots, M_{1} (x^{'}, τ)

conditional moments.

To this end, we rewrite Equation (11) in terms of the partial exponential Bell polynomials

B_{n, m}

B_{n, m} (x_{1}, x_{2}, \dots, x_{n - m + 1}) = \sum \frac{n!}{j_{1}! j_{2}! \dots j_{n - m + 1}!} {(\frac{x_{1}}{1!})}^{j_{1}} {(\frac{x_{2}}{2!})}^{j_{2}} \dots {(\frac{x_{n - m + 1}}{(n - m + 1)!})}^{j_{n - m + 1}},

where the summation terms obey the constraints of the Bell polynomials given in Equation (7). This can be expressed through the partial ordinary Bell polynomials in Equation (6) as

{\hat{B}}_{n, m} (x_{1}, x_{2}, \dots, x_{n - m + 1}) = \frac{m!}{n!} B_{n, m} (1! \cdot x_{1}, 2! \cdot x_{2}, \dots, (n - m + 1)! \cdot x_{n - m + 1}) .

Thus, Equation (11) reads

M_{n} (x^{'}, τ) = \sum_{k = 1}^{n} B_{n, k} (1! τ D_{1}, 2! τ D_{2}, \dots, (n - k + 1)! τ D_{n - k + 1}) .

We can then utilise the reciprocal relations of the partial exponential Bell polynomials: for a set of variables

y_{1}, \dots, y_{n}

, defined as functions of n other variables

x_{1}, \dots, x_{n}

given by

y_{n} = \sum_{k = 1}^{n} B_{n, k} (x_{1}, x_{2}, \dots, x_{n - k + 1}),

(12)

the inverse relation holds

x_{n} = \sum_{k = 1}^{n} {(- 1)}^{k - 1} (k - 1)! B_{n, k} (y_{1}, y_{2}, \dots, y_{n - k + 1}) .

(13)

With this, we can finally express any KM coefficients

D_{n} (x^{'})

from the nth-order power series expansion, neglecting the derivative terms

Φ

, as

D_{n} (x^{'}, τ) = \frac{1}{n!} \frac{1}{τ} \sum_{k = 1}^{n} {(- 1)}^{k - 1} (k - 1)! B_{n, k} (M_{1}, M_{2}, \dots, M_{n - k + 1}) .

(14)
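Equation (14) can be evaluated with a few lines of code. The sketch below implements the partial exponential Bell polynomials through their standard recurrence and then applies the alternating sum; derivative terms Φ are neglected, as in the text. The function names and the convention `moments[0] = M_1` are hypothetical choices for this example.

```python
from math import comb, factorial

def exponential_bell(n, k, y):
    """Partial exponential Bell polynomial B_{n,k}(y_1, ..., y_{n-k+1}).

    `y` is a sequence with y[0] = y_1, y[1] = y_2, and so on.
    """
    if n == 0 and k == 0:
        return 1.0
    if n <= 0 or k <= 0:
        return 0.0
    return sum(comb(n - 1, i - 1) * y[i - 1]
               * exponential_bell(n - i, k - 1, y)
               for i in range(1, n - k + 2))

def km_coefficient(n, moments, tau):
    """Finite-time corrected D_n from the conditional moments M_1..M_n."""
    total = sum((-1) ** (k - 1) * factorial(k - 1)
                * exponential_bell(n, k, moments)
                for k in range(1, n + 1))
    return total / (factorial(n) * tau)
```

As a usage example, feeding in moments generated from D_1 = 0.3, D_2 = 0.2 and vanishing higher coefficients at τ = 0.1 returns those coefficients, including a vanishing third coefficient, whereas the uncorrected estimate M_3 / (3! τ) would be non-zero.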

We note here that these relations are equivalent to the relation between cumulants and (non-central) moments of a probability distribution [13,36]. Let

M (y)

be the moment-generating function, such that

M (y) = 1 + \sum_{n = 1}^{\infty} \frac{μ_{n}^{'} y^{n}}{n!} = \exp [\sum_{n = 1}^{\infty} \frac{κ_{n} y^{n}}{n!}] = \exp [K (y)],

with

μ_{n}^{'}

the (non-central) moments and

K (y)

the cumulant-generating function. For

n < 4

, the cumulants

κ_{n}

and the central moments are the same (e.g., the mean and variance). This is not the same for higher cumulants and moments. The relation between the cumulants

κ_{n}

and the (non-central) moments

μ_{n}^{'}

is given by the reciprocal relation of the Bell polynomials, as in Equations (12) and (13). This is in line with our exponential representation of the Kramers–Moyal operator. Here, the KM coefficients are the cumulants (with the exception of the

τ

term).

4. Exemplary Cases with Constant Diffusion and Constant Jumps

Here, we present two illustrative examples: first, a constant diffusion process, the Ornstein–Uhlenbeck process; secondly, we augment this process with jumps to obtain a jump-diffusion process. We implement the corrective terms derived thus far to show the impact of the finite-time corrections. This choice of parameters, i.e., constant diffusion and constant jumps, considerably simplifies Equation (1) to

d X_{t} = - a X_{t} d t + b d W (t) + ξ d J (t),

(15)

where

- a X_{t}

is the state-dependent linear drift function, with

a > 0

, also denoted mean-reverting strength,

b > 0

a constant diffusion,

W (t)

a Brownian motion or Wiener process,

ξ

a state-independent and normally distributed jump amplitude with zero mean and variance s, and

J (t)

a Poisson process with jump rate

λ

. Note that the conventional Ornstein–Uhlenbeck process is recovered if we omit the jump process.
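A minimal Euler–Maruyama sketch of Equation (15) is given below. The treatment of the jump term is an assumption of this illustration: in each step the number of jumps is drawn from a Poisson distribution with mean λ Δt, and the summed amplitudes of k jumps are drawn as a single Gaussian with variance k s. The function name and signature are hypothetical and do not reproduce the interface of any of the packages cited above.

```python
import numpy as np

def simulate_jump_ou(a, b, s, lam, dt, n_steps, x0=0.0, seed=None):
    """Euler-Maruyama path of dX = -a X dt + b dW + xi dJ.

    Returns an array of length n_steps + 1 starting at x0.
    """
    rng = np.random.default_rng(seed)
    x = np.empty(n_steps + 1)
    x[0] = x0
    dw = rng.normal(0.0, np.sqrt(dt), size=n_steps)    # Wiener increments
    n_jumps = rng.poisson(lam * dt, size=n_steps)       # jumps per step
    # The sum of k independent N(0, s) amplitudes is N(0, k s).
    jumps = rng.normal(0.0, 1.0, size=n_steps) * np.sqrt(s * n_jumps)
    for k in range(n_steps):
        x[k + 1] = x[k] - a * x[k] * dt + b * dw[k] + jumps[k]
    return x
```

Setting `lam=0` recovers the conventional Ornstein–Uhlenbeck process, and additionally setting `b=0` reduces the scheme to the deterministic decay x_{k+1} = (1 - a Δt) x_k, which is convenient for testing.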

We have derived an expression for the conditional moments

M_{n} (x, τ)

as a function of the KM coefficients

D_{n} (x)

, given by Equation (11), which is valid for any Markovian diffusion or jump-diffusion process. For our particular application to the Poissonian jump-diffusion process in Equation (1) we require at least the first six KM coefficients/first six moments. These are given by

\begin{matrix} M_{1} & = τ D_{1}, \\ M_{2} & = 2 τ D_{2} + τ^{2} D_{1}^{2}, \\ M_{3} & = 6 τ D_{3} + 6 τ^{2} D_{1} D_{2} + τ^{3} D_{1}^{3}, \\ M_{4} & = 24 τ D_{4} + 12 τ^{2} (2 D_{1} D_{3} + D_{2}^{2}) + 12 τ^{3} D_{1}^{2} D_{2} + τ^{4} D_{1}^{4}, \\ M_{5} & = 120 τ D_{5} + 120 τ^{2} (D_{1} D_{4} + D_{2} D_{3}) + 60 τ^{3} (D_{1}^{2} D_{3} + D_{1} D_{2}^{2}) + 20 τ^{4} D_{1}^{3} D_{2} + τ^{5} D_{1}^{5}, \\ M_{6} & = 720 τ D_{6} + 360 τ^{2} (2 D_{1} D_{5} + 2 D_{2} D_{4} + D_{3}^{2}) + 120 τ^{3} (3 D_{1}^{2} D_{4} + 6 D_{1} D_{2} D_{3} + D_{2}^{3}) \\ + 60 τ^{4} (2 D_{1}^{3} D_{3} + 3 D_{1}^{2} D_{2}^{2}) + 30 τ^{5} D_{1}^{4} D_{2} + τ^{6} D_{1}^{6} . \end{matrix}

We invert this expression explicitly using Equation (14) and report on the KM coefficients as functions of the conditional moments, which are given by

\begin{matrix} D_{1} & = \frac{1}{1!} lim_{τ \to 0} \frac{1}{τ} M_{1}, \\ D_{2} & = \frac{1}{2!} lim_{τ \to 0} \frac{1}{τ} [M_{2} - M_{1}^{2}], \\ D_{3} & = \frac{1}{3!} lim_{τ \to 0} \frac{1}{τ} [M_{3} - 3 M_{1} M_{2} + 2 M_{1}^{3}], \\ D_{4} & = \frac{1}{4!} lim_{τ \to 0} \frac{1}{τ} [M_{4} - 4 M_{1} M_{3} - 3 M_{2}^{2} + 12 M_{1}^{2} M_{2} - 6 M_{1}^{4}], \\ D_{5} & = \frac{1}{5!} lim_{τ \to 0} \frac{1}{τ} [M_{5} - 5 M_{1} M_{4} - 10 M_{2} M_{3} + 30 M_{1} M_{2}^{2} + 20 M_{1}^{2} M_{3} \\ - 60 M_{1}^{3} M_{2} + 24 M_{1}^{5}], \\ D_{6} & = \frac{1}{6!} lim_{τ \to 0} \frac{1}{τ} [M_{6} - 6 M_{1} M_{5} - 10 M_{3}^{2} - 15 M_{2} M_{4} + 30 M_{2}^{3} + 120 M_{1} M_{2} M_{3} \\ + 30 M_{1}^{2} M_{4} - 270 M_{1}^{2} M_{2}^{2} - 120 M_{1}^{3} M_{3} + 360 M_{1}^{4} M_{2} - 120 M_{1}^{6}] . \end{matrix}

(16)

We again note that these expressions are valid for any case of diffusion and jump-diffusion processes. In the first case, where there are no jump terms in Equation (15), i.e., the Ornstein–Uhlenbeck process, we know that all KM coefficients

D_{n} (x)

with

n \geq 3

are zero. However, this is not the case when estimating the coefficients from time-series data, i.e., from one realisation of the stochastic process sampled at finite resolution. It is common to find that these terms do not vanish due to finite-time effects. In our second case with a jump-diffusion process, the KM coefficients

D_{n} (x)

with

n \geq 3

can be related directly to the jump parameters. These relations were derived in reference [3], and are given by

\begin{matrix} D_{1} (x) & = a (x), \\ D_{2} (x) & = \frac{1}{2} [b {(x)}^{2} + s λ], \\ D_{2 n} (x) & = \frac{s^{n} λ}{2^{n} (n!)}, \end{matrix}

(17)

where

〈 ξ^{2 n} 〉 = \frac{(2 n)!}{2^{n} (n!)} {〈 ξ^{2} 〉}^{n} = \frac{(2 n)!}{2^{n} (n!)} s^{n}

, for Gaussian distributions with zero mean and variance s.
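For constant b, s, and λ, the even-order coefficients in Equation (17) reduce to plain numbers that can serve as reference values when checking an estimator. The helper below is a sketch under that constant-parameter assumption; its name is hypothetical.

```python
from math import factorial

def jump_diffusion_km(order, b, s, lam):
    """Theoretical even-order KM coefficient D_order for constant b, s, lam."""
    if order % 2 or order < 2:
        raise ValueError("only even orders >= 2 are covered here")
    n = order // 2
    jump_part = s**n * lam / (2**n * factorial(n))
    # The diffusion contribution b^2 / 2 enters the second coefficient only.
    return jump_part + (0.5 * b**2 if order == 2 else 0.0)
```

With b = 0.5, s = 0.75, and λ = 0.6, this gives D_2 = 0.35, D_4 = 0.0421875, and D_6 = 0.0052734375.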

We will now compare the derived theoretical corrections to KM coefficients estimated from numerically generated time-series data. In Figure 1 and Figure 2, we display the second-, fourth-, and sixth-order KM coefficients

D_{2} (x)

,

D_{4} (x)

, and

D_{6} (x)

estimated with the first-order, second-order, and full-order approximations given by Equation (16) (or in general Equation (14)). The full-order approximations have the same order as the KM coefficients, i.e., second-, fourth-, and sixth-order approximation for

D_{2} (x)

,

D_{4} (x)

, and

D_{6} (x)

, respectively. For the data shown in Figure 1, we use an Euler–Maruyama scheme to numerically integrate the Ornstein–Uhlenbeck process in Equation (15) (without the jump terms) with parameters: drift

a = 1.0

and diffusion

b = 0.5

(

λ = s = 0.0

). We numerically integrate this process with a coarse time-step

Δ t = 0.1

to deliberately emphasise the finite-time effects on the aforementioned KM coefficients. For example, the second-order KM coefficient

D_{2} (x)

takes a quadratic form, despite the fact that the diffusion term is constant. The KM coefficients

D_{4} (x)

and

D_{6} (x)

are not estimated as exactly zero, as would be expected for purely diffusive processes [39,40], because of finite-time effects, but the full-order finite-time correction approximates the theoretical values far more accurately.
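The quadratic artefact in the second KM coefficient can be reproduced with a few lines of standard-library Python. The sketch below is not the code used for Figure 1. It assumes the Ornstein–Uhlenbeck process has the form dx = -a x dt + b dW, uses a shorter time series and a plain histogram binning, and the function names `simulate_ou` and `d2_estimates` are hypothetical.

```python
import math
import random


def simulate_ou(a, b, dt, n_steps, seed=0):
    """Euler-Maruyama path of dx = -a*x*dt + b*dW (assumed form of the process)."""
    rng = random.Random(seed)
    x = [0.0]
    for _ in range(n_steps - 1):
        x.append(x[-1] - a * x[-1] * dt + b * math.sqrt(dt) * rng.gauss(0.0, 1.0))
    return x


def d2_estimates(x, dt, lo=-0.6, hi=0.6, n_bins=12):
    """Per-bin D_2 from M_2/(2 dt) and from (M_2 - M_1^2)/(2 dt)."""
    width = (hi - lo) / n_bins
    count = [0] * n_bins
    sum1 = [0.0] * n_bins
    sum2 = [0.0] * n_bins
    for x0, x1 in zip(x[:-1], x[1:]):
        k = int((x0 - lo) // width)
        if 0 <= k < n_bins:  # ignore points outside the binned range
            dx = x1 - x0
            count[k] += 1
            sum1[k] += dx
            sum2[k] += dx * dx
    centres = [lo + (k + 0.5) * width for k in range(n_bins)]
    first, second = [], []
    for k in range(n_bins):
        m1 = sum1[k] / count[k]  # conditional moment M_1 in bin k
        m2 = sum2[k] / count[k]  # conditional moment M_2 in bin k
        first.append(m2 / (2 * dt))
        second.append((m2 - m1 * m1) / (2 * dt))
    return centres, first, second


x = simulate_ou(a=1.0, b=0.5, dt=0.1, n_steps=200_000)
centres, first, second = d2_estimates(x, dt=0.1)
for c, f, s in zip(centres, first, second):
    print(f"x = {c:+.2f}  first-order D2 = {f:.4f}  second-order D2 = {s:.4f}")
```

Under these assumptions the first-order estimate grows towards the outer bins, whereas the second-order estimate stays close to b²/2 = 0.125 across the binned range.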

For the data shown in Figure 2 we follow a similar approach, now augmenting the Ornstein–Uhlenbeck process with Poissonian jumps, i.e., as given in Equation (15). The parameters are as follows: drift

a = 0.5

, diffusion

b = 0.5

, jump amplitude with a Gaussian distribution with variance

s = 0.75

and zero mean, a Poissonian jump rate

λ = 0.6

, and a time step

Δ t = 0.05

. For this process, we know the higher-order KM coefficients

D_{4} (x)

and

D_{6} (x)

reflect the presence of discontinuous paths, and for our particular case of the Poissonian jump-Ornstein–Uhlenbeck process their explicit relation to the jump parameters is given by Equation (17) (cf. reference [3]). For the chosen coarse time step, we notice that the estimations do not correspond exactly to the theoretical values, regardless of the order of finite-time correction chosen. This can likely be traced back to the limitations of the Kramers–Moyal equation to fully capture discontinuous stochastic processes (cf. reference [41]). Nevertheless, the higher-order finite-time corrections approximate the theoretical values with greater accuracy.

We note here that the parameter estimation from data heavily depends on the number of data points and the sampling rate of numerically simulated or real-world time-series data. Real-world time-series data can often be sampled at higher sampling rates, but not always with such a large number of data points. A closer inspection of the limitations of both the sampling rate and the number of data points in parameter estimation is necessary, but falls outside the scope of this publication. Moreover, it should be emphasised that, prior to any examination of time-series data within the purview of either the Fokker–Planck or the Kramers–Moyal equation, the Markov property of the data must be accounted for, i.e., a vanishing memory of the increments of the data. This can be examined, for example, via the Chapman–Kolmogorov equality [13].

Summarising our findings, we conclude that our proposed arbitrary-order finite-time corrections considerably help in differentiating one-dimensional purely diffusive processes and jump-diffusion processes, as these accurately show that higher-order KM coefficients

D_{n} (x)

,

n \geq 3

vanish for purely diffusive processes. These arbitrary-order finite-time corrections should now also be considered for N-dimensional stochastic processes. The second-order finite-time corrections for two-dimensional processes were recently examined in reference [22]. Note that the one-dimensional second-order finite-time correction for these KM coefficients was recently addressed in another publication [34]. Here, it is extended to arbitrary order.

5. Implementation: Symbolic Calculations in `Python`

In this section, we show how the main results above can be implemented in available software packages, e.g., kramersmoyal [33] and JumpDiff [34] in Python or Langevin [35] in R, or in any self-made parametric or non-parametric estimator. To facilitate numerical implementations of the higher-order corrections, we include a short Python script that yields the non-derivative corrections to any desired order, with the desired truncation of the power-series expansion.

First, we present Python code to symbolically generate the conditional moments

M_{n} (x^{'}, τ)

as functions of the KM coefficients

D_{n} (x^{'})

, as given in Equation (14). Here, the parameter n indicates the order of the KM coefficients and conditional moments, and the parameter m the order of the correction, with m ≤ n. We utilise Python’s symbolic computation library sympy [42].
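A minimal sketch of such a generator is given here; it is an illustration, not the authors' listing. It assumes that the non-derivative part of Equation (14) can be written as a sum of partial Bell polynomials evaluated at j! D_j, weighted by powers of τ, which is the standard moment–cumulant relation and is consistent with the inverse relations in Equation (16). The function name `moment_from_km` is hypothetical; `sympy.bell(n, k, symbols)` is sympy's partial Bell polynomial.

```python
import sympy as sp

tau = sp.Symbol("tau", positive=True)


def moment_from_km(n, m):
    """Conditional moment M_n in terms of D_1..D_n, keeping powers of tau up to m.

    Only terms without derivatives of the KM coefficients are included
    (assumed form, see the accompanying text).
    """
    D = sp.symbols(f"D1:{n + 1}")  # D1, ..., Dn
    # Arguments of the Bell polynomials: j! * D_j for j = 1..n
    arguments = tuple(sp.factorial(j) * D[j - 1] for j in range(1, n + 1))
    return sp.expand(sum(tau**k * sp.bell(n, k, arguments) for k in range(1, m + 1)))


print(moment_from_km(2, 1))  # first-order approximation of M_2
print(moment_from_km(2, 2))  # second-order approximation of M_2
print(moment_from_km(4, 2))  # second-order approximation of M_4
```

For n = 2 and m = 2 this gives 2 D_2 τ + D_1² τ², i.e., the familiar second-order correction to the second conditional moment when derivatives of the coefficients are neglected.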

To generate the KM coefficients

D_{n} (x^{'})

as functions of the conditional moments, the following must be implemented: [code listing, image Entropy 23 00517 i002]
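The inversion can be sketched as follows. This is an illustrative reconstruction under the assumption that the expressions in Equation (16) follow the cumulant-from-moment pattern with partial Bell polynomials; it is not the original listing, and the function name `km_from_moments` is hypothetical.

```python
import sympy as sp


def km_from_moments(n, m):
    """Return tau * D_n as a polynomial in M_1..M_n.

    Products of up to m conditional moments are kept, so m = n gives the
    full-order expression and m = 1 the first-order estimate M_n / n!.
    """
    M = sp.symbols(f"M1:{n + 1}")  # M1, ..., Mn
    series = sum(
        (-1) ** (k - 1) * sp.factorial(k - 1) * sp.bell(n, k, M)
        for k in range(1, m + 1)
    )
    return sp.expand(series / sp.factorial(n))


for order in (2, 3, 4):
    print(f"tau * D_{order} =", km_from_moments(order, order))
```

Dividing the returned expression by the sampling interval τ yields the corresponding estimate of D_n. With m = n the output reproduces the bracketed terms of Equation (16) for n up to 6.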

6. Conclusions

We have presented a set of arbitrary-order finite-time corrections to the Kramers–Moyal operator, solved by exponentiating the Kramers–Moyal operator, equivalent to van Kampen’s system-size expansion. We expressed the exponential operator as a power series and worked out each element of the series, ultimately combining it in a series representation via the partial Bell polynomials. We obtained a closed form for the set of arbitrary-order finite-time corrections relating the conditional moments to the Kramers–Moyal coefficients. Moreover, by representing the arbitrary-order finite-time corrections with partial Bell polynomials, we derived a reciprocal relation for the conditional moments and the Kramers–Moyal coefficients. This provided a closed-form representation of the Kramers–Moyal coefficients via conditional moments, which is crucial for time-series data estimation. We included two illustrative cases of poorly sampled diffusion and jump-diffusion processes with constant diffusion and constant jumps, demonstrating the suitability of our corrections for a non-parametric estimation of higher-order Kramers–Moyal coefficients. Our corrections approximate the theoretical values with a high degree of accuracy and help to distinguish processes with and without jumps. We are confident that our arbitrary-order finite-time corrections contribute to an improved reconstruction of stochastic evolution equations from empirical time-series data.

Author Contributions

Conceptualisation, L.R.G., K.L. and P.G.L.; methodology, L.R.G. and P.G.L.; software, L.R.G.; validation, L.R.G., D.W., K.L. and P.G.L.; formal analysis, L.R.G. and P.G.L.; investigation, L.R.G. and P.G.L.; writing–original draft preparation, L.R.G., K.L. and P.G.L.; writing–review and editing, L.R.G., D.W., K.L. and P.G.L.; visualisation, L.R.G.; supervision, D.W., K.L. and P.G.L.; funding acquisition, D.W. All authors have read and agreed to the published version of the manuscript.

Funding

L.R.G. and D.W. gratefully acknowledge support from the German Federal Ministry of Education and Research (grant no. 03EK3055B) and the Helmholtz Association (via the joint initiative “Energy System 2050—A Contribution of the Research Field Energy” and the grant “Uncertainty Quantification—From Data to Reliable Knowledge (UQ)” with grant no. ZT-I-0029). This work was performed by L.R.G. as part of the Helmholtz School for Data Science in Life, Earth and Energy (HDS-LEE).

Acknowledgments

The authors thank M. R. R. Tabar, J. Heysel, G. Ansmann, T. Rings, M. Giordano, A. Yurchenko-Tytarenko, and P. Lencastre for valuable discussions on theoretical aspects surrounding the Kramers–Moyal expansion.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Kramers, H.A. Brownian motion in a field of force and the diffusion model of chemical reactions. Physica 1940, 7, 284–304.
2. Moyal, J.E. Stochastic processes and statistical physics. J. R. Stat. Soc. Ser. B (Methodol.) 1949, 11, 150–210.
3. Anvari, M.; Tabar, M.R.R.; Peinke, J.; Lehnertz, K. Disentangling the stochastic behavior of complex time series. Sci. Rep. 2016, 6, 35435.
4. Kurth, J.G.; Rings, T.; Lehnertz, K. Testing jump-diffusion in epileptic brain dynamics: Impact of daily rhythms. Entropy 2021, 23, 309.
5. Hashtroud, A.M.; Mirzahossein, E.; Zarei, F.; Tabar, M.R.R. Jump events in the human heartbeat interval fluctuations. J. Stat. Mech. Theory Exp. 2019, 2019, 083213.
6. Boujo, E.; Noiray, N. Robust identification of harmonic oscillator parameters using the adjoint Fokker–Planck equation. Proc. Math. Phys. Eng. Sci. 2017, 473, 20160894.
7. Anvari, M.; Lohmann, G.; Wächter, M.; Milan, P.; Lorenz, E.; Heinemann, D.; Tabar, M.R.R.; Peinke, J. Short term fluctuations of wind and solar power systems. New J. Phys. 2016, 18, 063027.
8. Lehnertz, K.; Zabawa, L.; Tabar, M.R.R. Characterizing abrupt transitions in stochastic dynamics. New J. Phys. 2018, 20, 113043.
9. Friedrich, J.; Grauer, R. Generalized Description of Intermittency in Turbulence via Stochastic Methods. Atmosphere 2020, 11, 1003.
10. Jannesar, M.; Sadeghi, A.; Meyer, E.; Jafari, G.R. A Langevin equation that governs the irregular stick-slip nano-scale friction. Sci. Rep. 2019, 9, 12505.
11. Paganin, D.M.; Morgan, K.S. X-ray Fokker–Planck equation for paraxial imaging. Sci. Rep. 2019, 9, 17537.
12. Morgan, K.S.; Paganin, D.M. Applying the Fokker–Planck equation to grating-based X-ray phase and dark-field imaging. Sci. Rep. 2019, 9, 17465.
13. Risken, H.; Frank, T. The Fokker–Planck Equation, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 1996.
14. Ragwitz, M.; Kantz, H. Indispensable finite time corrections for Fokker-Planck equations from time series data. Phys. Rev. Lett. 2001, 87, 254501.
15. Friedrich, R.; Renner, C.; Siefert, M.; Peinke, J. Comment on “Indispensable finite time corrections for Fokker-Planck equations from time series data”. Phys. Rev. Lett. 2002, 89, 217.
16. Gottschall, J.; Peinke, J. On the definition and handling of different drift and diffusion estimates. New J. Phys. 2008, 10, 083034.
17. Lade, S.J. Finite sampling interval effects in Kramers–Moyal analysis. Phys. Lett. A 2009, 373, 3705–3709.
18. Anteneodo, C.; Riera, R. Arbitrary-order corrections for finite-time drift and diffusion coefficients. Phys. Rev. E 2009, 80, 031103.
19. Anteneodo, C.; Queirós, S.M.D. Low-sampling-rate Kramers-Moyal coefficients. Phys. Rev. E 2010, 82, 041122.
20. Honisch, C.; Friedrich, R. Estimation of Kramers-Moyal coefficients at low sampling rates. Phys. Rev. E 2011, 83, 066701.
21. Rydin Gorjão, L.; Heysel, J.; Lehnertz, K.; Tabar, M.R.R. Analysis and data-driven reconstruction of bivariate jump-diffusion processes. Phys. Rev. E 2019, 100, 062127.
22. Aslim, E.; Rings, T.; Zabawa, L.; Lehnertz, K. Enhancing the accuracy of a data-driven reconstruction of bivariate jump-diffusion models with corrections for higher orders of the sampling interval. J. Stat. Mech. Theory Exp. 2021, 2021, 033406.
23. van Kampen, N.G. A power series expansion of the master equation. Can. J. Phys. 1961, 39, 551–567.
24. van Kampen, N.G. The expansion of the master equation. Adv. Chem. Phys. 1976, 34, 245–309.
25. van Kampen, N.G. Stochastic Processes in Physics and Chemistry, 3rd ed.; North Holland: Amsterdam, The Netherlands, 2007.
26. Thomas, P.; Grima, R. Approximate probability distributions of the master equation. Phys. Rev. E 2015, 92, 012120.
27. Mvondo-She, Y.; Zoubos, K. On the combinatorics of partition functions in AdS3/LCFT2. J. High Energy Phys. 2019, 2019, 97.
28. Willers, C.; Kamps, O. Non-parametric estimation of a Langevin model driven by correlated noise. arXiv 2021, arXiv:2103.02990.
29. Tabar, M.R.R. Analysis and Data-Based Reconstruction of Complex Nonlinear Dynamical Systems, 1st ed.; Springer International Publishing: Berlin/Heidelberg, Germany, 2019.
30. Li, Y.; Duan, J. A data-driven approach for discovering stochastic dynamical systems with non-Gaussian Lévy noise. Phys. D 2021, 417, 132830.
31. Friedrich, R.; Peinke, J.; Sahimi, M.; Tabar, M.R.R. Approaching complexity by stochastic methods: From biological systems to turbulence. Phys. Rep. 2011, 506, 87–162.
32. Lamouroux, D.; Lehnertz, K. Kernel-based regression of drift and diffusion coefficients of stochastic processes. Phys. Lett. A 2009, 373, 3507–3512.
33. Rydin Gorjão, L.; Meirinhos, F. kramersmoyal: Kramers–Moyal coefficients for stochastic processes. J. Open Source Softw. 2019, 4.
34. Rydin Gorjão, L.; Witthaut, D.; Lind, P.G. JumpDiff: A Python Library for Statistical Inference of Jump-Diffusion Processes in Sets of Measurements. Forthcoming. Available online: https://github.com/LRydin/JumpDiff (accessed on 20 April 2021).
35. Rinn, P.; Lind, P.G.; Wächter, M.; Peinke, J. The Langevin approach: An R package for modeling Markov processes. J. Open Res. Softw. 2016, 4, e34.
36. Prohorov, J.V.; Rozanov, J.A. Probability Theory, 1st ed.; Springer: Berlin/Heidelberg, Germany, 1969.
37. Bell, E.T. Partition polynomials. Ann. Math. 1927, 29, 38–46.
38. Sura, P.; Barsugli, J. A note on estimating drift and diffusion parameters from timeseries. Phys. Lett. A 2002, 305, 304–311.
39. Pawula, R.F. Generalizations and extensions of the Fokker-Planck-Kolmogorov equations. IEEE Trans. Inf. Theory 1967, 13, 33–41.
40. Pawula, R.F. Approximation of the Linear Boltzmann Equation by the Fokker-Planck Equation. Phys. Rev. 1967, 162, 186–188.
41. Mori, H.; Fujisaka, H.; Shigematsu, H. A new expansion of the master equation. Prog. Theor. Phys. 1974, 51, 109–122.
42. Meurer, A.; Smith, C.P.; Paprocki, M.; Čertík, O.; Kirpichev, S.B.; Rocklin, M.; Kumar, A.; Ivanov, S.; Moore, J.K.; Singh, S.; et al. SymPy: Symbolic computing in python. PeerJ Comput. Sci. 2017, 3, e103.

Figure 1. Non-parametrically estimated KM coefficients of an Ornstein–Uhlenbeck process (given by Equation (15) without the jump terms) with drift

a = 1.0

and diffusion

b = 0.5

(

λ = s = 0.0

). The abscissas are re-scaled by the standard deviation

σ_{x}

of x. From left to right are shown the second-, fourth-, and sixth-order KM coefficients

D_{2} (x)

,

D_{4} (x)

, and

D_{6} (x)

. There are no corrections to the drift term

D_{1} (x)

. Note that for

D_{2} (x)

the 2nd-order and the full-order corrections are identical. The impact of solely using the second-order approximation for

D_{6} (x)

is also evident. The coefficients

D_{4} (x)

and

D_{6} (x)

are theoretically zero. In all cases, the improvements to the estimation of the respective KM coefficients

D_{n} (x)

are clear. The grey lines indicate the theoretical values. The numerical integration has a total time of

5 \times 10^{5}

and a time step

Δ t = 0.1

(

5 \times 10^{6}

datapoints) with an Euler–Maruyama scheme [33]. Consistent results are obtained when considering a much finer numerical time step

Δ t

of integration and subsequently down-sampling the data.

Figure 2. Non-parametrically estimated KM coefficients of a jump-diffusion process given by Equation (15) with drift

a = 0.5

, diffusion

b = 0.5

, jump amplitude

s = 0.75

, and jump rate

λ = 0.6

. The abscissas are re-scaled by the standard deviation

σ_{x}

of x. From left to right are shown the second-, fourth-, and sixth-order KM coefficients

D_{2} (x)

,

D_{4} (x)

, and

D_{6} (x)

. There are no corrections to the drift term

D_{1} (x)

. Note that for

D_{2} (x)

the 2nd-order and the full-order corrections are identical. The impact of solely using the second-order approximation for

D_{6} (x)

is also evident. The coefficients

D_{4} (x)

and

D_{6} (x)

are theoretically non-zero and given by Equation (17). In all cases, the improvements to the estimation of the respective KM coefficients

D_{n} (x)

are clear. The grey lines indicate the theoretical values. The numerical integration has a total time of

5 \times 10^{5}

and a time step

Δ t = 0.1

(

5 \times 10^{6}

datapoints) with an Euler–Maruyama scheme [33].

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
