1. Introduction
Classical fractional differentiation/integration operators
,
acting on functions
, where
is a ‘discrete derivative’ with respect to ‘time’
, are defined through the binomial expansion
, viz.:
with the coefficients
Here,
denotes the gamma function
,
, and
,
. Also, see the end of this section for all unexplained notation. The asymptotics
(which follows by application of Stirling’s formula to (
2)) determines the class of functions
g and the summability properties of (
1).
Fractional operators in (
1) play an important role in the theory of discrete-time stochastic processes—in particular, time series (see, e.g., the monographs [
1,
2,
3,
4,
5] and the references therein). The autoregressive fractionally integrated moving-average ARFIMA
process
is defined as a stationary solution of the stochastic difference equation
with white noise (a sequence of standardized uncorrelated random variables (r.v.s))
. For
, the solution of (
4) is obtained by applying the inverse operator, viz.:
Since (
3) implies
(
), (
5) is a well-defined stationary process with zero mean and finite variance. The ARFIMA
process is the basic parametric model in statistical inference for time series with a long memory property (also referred to as long-range dependence) (see [
1,
2,
3,
5,
6] for a discussion of the ARFIMA
and its generalization ARFIMA
models). We note that the ARFIMA
process has an explicit covariance function and the spectral density
which explodes or vanishes at the origin
as
, depending on the sign of
d.
In this paper, we extend fractional operators in (
1) to functions
g on a regular
-dimensional lattice
. Whereas generalization of our construction to irregular lattices or more abstract index sets is an interesting and challenging open problem, our choice of
follows the traditional approach in random field theory, which heavily relies on the Fourier transform and spectral representation. We consider a rather general form of the operator
T:
where
is a random walk on
starting at
with (1-step) probabilities
. We assume that
, i.e., the random walk is non-degenerate at
. Clearly,
, where
are the
j-step probabilities,
with
. Similarly to (
1), we define fractional operators
acting on
by
with coefficients
expressed through the binomial coefficients
and random walk probabilities
.
Let us describe the content and results of this paper in more detail. The main result of
Section 2 is Theorem 1, which provides the sufficient condition
for invertibility
and the square summability of fractional coefficients in (
6), in terms of the characteristic function
(the Fourier transform) of the random walk.
Section 2 also includes a discussion of the asymptotics of (
6) as
, which is important in limit theorems and other applications of fractional integrated random fields. Using classical local limit theorems, Propositions 1 and 2 obtain ‘isotropic’ asymptotics of (
6) for a large class of random walk
, showing that
decay as
; hence,
. The last fact is interpreted as the
long-rangedependence [
3,
4,
7] of the fractionally integrated random field
, defined as a stationary solution of the difference equation,
with white noise on the r.h.s. and it is studied in
Section 3. Corollary 1 obtains conditions for the existence of the stationary solution of (
8) given by the inverse operator
, which is detailed in Examples 1 and 2 for fractional Laplacian and fractional heat operators.
Section 2 and
Section 3 also include a discussion of
tempered fractional operators
and
tempered fractional random fields solving the analogous equation
, which generalize the class of tempered ARFIMA processes [
8] and have
short-rangedependence and a summable covariance function.
Section 4 is devoted to the scaling limits of moving-average random fields on
with coefficients satisfying Assumption 1, which includes ‘isotropic’ fractional coefficients
as a special case. The scaling limits refer to the integrals
of random field
for each
from a class of (test) functions as scaling parameter
. The scaling limits are identified in Corollary 3 as self-similar Gaussian random fields with a Hurst parameter
. We note that limit theorems for random fields with long-range dependence or negative dependence have been studied in many works. The seminal paper [
9] dealt with noncentral limit theorems for Gaussian subordinated fields. Anisotropic scaling limits of linear and subordinated random fields in dimensions
were discussed in [
10,
11,
12,
13,
14,
15,
16] and in the references therein, with particular focus on scaling transition arising under critical anisotropy exponents. Whereas most of the abovementioned works considered partial sums on rectangular domains, [
17] studied the case of irregular summation regions and ‘edge effects’ arising under strong negative dependence. Statistical applications for random fields with long-range dependence were discussed in [
2,
18,
19] and other works.
We expect that this study can be extended in several directions, including anisotropic scaling, infinite variance random fields, and fractional operators in
(see [
20,
21,
22,
23,
24,
25] for discussion and the properties of fractional random fields with the continuous argument
).
Notation. In what follows, C denotes generic positive constants that may be different at different locations. We write and for the weak convergence and equality of probability distributions. Denote by the absolute-value norm on , where is either or and the Euclidean norm on . is the scalar product in . Denote by the vector in with 1 in the jth coordinate and 0’s elsewhere. For , denote by the space of functions for which and by the space of measurable functions for which the p-th power of the absolute value is integrable with respect to the Lebesgue measure on : with the identification of functions , such that almost everywhere (a.e.). Denote by the space of measurable functions for which , with the identification of functions , such that a.e. Write for the indicator function of a set A. Write for the smallest integer greater than or equal to . and .
2. Invertibility and Properties of Fractional Operators
We start with the properties of the binomial coefficients in (
2):
The identity
leads to
and the invertibility relation
The following lemma gives some basic properties of the fractional coefficients
in (
6).
Lemma 1. (i) Let . Then, the series in (6) converges for every and(ii) Let . Then, for every and and Moreover, implies and(iii) Let and . Then, Proof. (i) From (
6) and (
9) we obtain
since
is not possible. On the other hand, for
we have
and
in view of (
9).
(ii) Since
is obvious from (
9), it suffices to show (
12), since it implies
by (
11). We have
where exchanging the order of summation is legitimate as all summands are non-negative. Hence, using
and (
10), we obtain
proving part (ii).
(iii) The convergence of the series in (
13) and the equality follow as in (
12):
Lemma 1 is proved. □
Remark 1. Let . Then, the inequalities are strict: and , if for some j, i.e., is accessible from state . Moreover, if state is transient, i.e., the probability of eventual return to is strictly less than 1, which is equivalent to , then .
The main result of this section is Theorem 1, which provides the necessary and sufficient conditions for the square summability of the fractional coefficients in (
6), in terms of the characteristic function
(see (
7)). Write
for the Fourier transform of a function
. For
introduce the
temperedfractionaloperators
with coefficients
and the Fourier transform
Theorem 1. For , the following conditions are equivalent: Either of these conditions implies Moreover, for , the above conditions (16)–(18) hold with d in place of . Proof. Let
. Firstly, we consider
in (
6). They satisfy
because of (
3) and
with
. Then,
is immediate. Moreover, we have the Fourier transform
, where
satisfies
. We see that
belongs to
.
Now let us prove the implication (
16) ⇒ (
17). We use approximation by the tempered fractional coefficients
in (
15) as
. We ascertain that
a.e. as
. Next, for
,
,
the inequality
, where
becomes
. Using this, we obtain the domination for all
,
,
by a function in
according to (
16). Hence, by the dominated convergence theorem (DCT),
as
holds in
. As a consequence,
,
is a Cauchy sequence in
. By Parseval’s theorem, the inverse Fourier transforms,
are a Cauchy sequence in
, and so
converges in
to some
as
. This
f must be
because
as
for all
. We conclude that
or (
17).
Let us turn to the implication (
17) ⇒ (
16). From (
17) and
for all
it follows that
as
holds in
. By Parseval’s theorem,
,
is a Cauchy sequence in
. It follows that
for some
. We also have
for each
, such that
. Since
(see Lemma 2.3.2(a) in [
26]) we conclude that
a.e., proving (
16).
The above argument also proves (
18). On the one hand,
is the limit of
in
as
because
converges in
to
as
. On the other hand,
in
as
. We conclude that
a.e. Theorem 1 is proved. □
Next, we turn to the asymptotics of the ‘fractional coefficients’
in (
6). The proof uses the local limit theorem in [
26] for random walk probabilities
. Following the latter work, we assume that
For example, if
is symmetric, i.e.,
, and, moreover, has finite support that contains
,
,
, then the random walk satisfies our assumption (
20). The conditions in (
20) imply that the random walk has zero mean
and an invertible covariance matrix
According to the classical (integral) CLT, the normalized sum
approaches a Gaussian distribution on
with density
Lemma 2 ([
26] Theorem 2.3.11).
Under the conditions of (20), there exists , such that For ‘very atypical’ values
we use the following bound ([
26], Proposition 2.1.2): for any
there exists
, such that
Proposition 1. Let satisfy (20). The coefficients in (6) are well-defined for any and satisfywhere Proof. Let us prove (
23). Since
is positive-definite,
,
is a norm. Note that it is equivalent to the Euclidean norm because any two norms are equivalent in finite-dimensional real vector space. In particular, the spectral decomposition
—where
U is an orthogonal matrix whose columns are the real, orthonormal eigenvectors of
,
is the transpose of
U, and
is a diagonal matrix whose entries are the eigenvalues of
with
,
denoting the largest and smallest, respectively—gives
and, similarly,
. Using (
6) for a large
we decompose
, where
To show the first relation in (
24), use (
21). We have
, where, for each
fixed, the main term
and the remainder term
asymptotically behave when
as
and, for some constants
,
Hence, the first relation in (
24) follows, using
. In view of (
3), the same argument also proves the second relation in (
24) for
.
Consider (
24) for
. Split
into two sums over
, where
and
, respectively. In the sum
we also have
, and Lemma 2 entails the bound
for some constants
. Hence,
since the last integral converges for any
d. Finally, by (
22), given a large enough
, there exists
, such that
, which implies
. This proves (
24) and completes the proof of Proposition 1. □
Lemma 2 does not apply to the simple random walk (which is not aperiodic), in which case the local CLT takes a somewhat different form (see [
26], Theorem 2.1.3). The application of the latter result and the argument in the proof of Proposition 1 yields the following result:
Proposition 2. Let , . The coefficients in (6) are well-defined for any and satisfywhere Proposition 1 and Lemma 2 do not apply to random walks with a non-zero mean, as in Example 2 below (fractional heat operator), in which case the fractional coefficients exhibit an anisotropic behavior different from (
23). Such behavior is described in the following proposition. We assume that the underlying random walk factorizes into a deterministic drift by 1 in direction
and a random walk on
, as in Lemma 2:
where
and
is a probability distribution concentrated on
, such that
. Write
for the random walk starting at
with
j-step probabilities
,
, such that
,
. In order to apply Lemma 2, we make a similar assumption to (
20):
and we denote
, the respective covariance matrix. Let
be a positive function on
satisfying the homogeneity property,
. As in Example 2, the fractional coefficients for
in (
25) we write as
Proposition 3. Let (26) hold and . Then,as and We also havewhere is a continuous function on given byfor and equals 0 for . Proof. Consider the following
j-step probabilities of a random walk on
starting at
:
, where
for
,
. Let us estimate these by
, where
is the covariance matrix of the 1-step distribution
,
. Note
. By Lemma 2,
Relation (
28) follows directly from (
3), (
27), and (
30). Relation (
29) is written as
The asymptotics in (
31) is immediate from (
28) for
tending to
∞ as in (
28). The general case of (
31) also follows from (
28), using the continuity of
. For
, the details can be found in [
12] (proof of Proposition 4.1). □
Remark 2. The approximation in (
28) compares with the kernel
of the fractional heat operator
for all
,
, and some
. For
, Ref. [
25] Equation (3.7) has recently derived the analytic form in (
32) of the kernel from the absolute square of its Fourier transform:
which is the implicit definition of this kernel in [
22]. Similarly to derivations in [
25], for
, Equations (3.944.5-6) in the table of integrals [
27] give
yielding (
33).
Finally, the tempered fractional coefficients in (
15) are summable:
for any
and
any random walk
. Assuming the existence of the exponential moment
for some
, (
15) decays exponentially,
for some
. Indeed, Markov’s inequality gives
for any
and large enough
. Moreover,
, proving (
34).
3. Fractionally Integrated Random Fields on
Let
be a white noise; in other words, a sequence of r.v.s with
,
. Given a sequence
with the above noise we can associate a moving-average random field (RF),
with zero mean and covariance
, which depends on
alone and characterizes the dependence between values of
X at distinct points
.
A moving-average RF
X in (
35) will be said to be
long-rangedependent(LRD) if ;
short-rangedependent(SRD) if ;
negativelydependent(ND) if .
The above classification is important in limit theorems and applications of random fields. It is not unanimous; several related but not equivalent classifications of dependence for stochastic processes can be found in [
3,
4,
7,
17] and other works.
Many RF models with discrete arguments are defined through linear difference equations involving white noise [
28]. In this paper, we deal with fractionally integrated RFs
X solving fractional equations on
,
whose solutions are obtained by inverting these operators (see below).
Definition 1. Let and in (6) be well-defined. By the stationary solution of Equation (36) (respectively, (37)) we mean a stationary RF X, such that for each the series in (36) converges in mean square and (36) holds (respectively, the series in (37) converges in mean square and (37) holds). Corollary 1. (i) Let . Then,is a stationary solution of Equation (36) if condition (16) holds (for , (16) is also necessary for the existence of the above X). (ii) Let and (16) hold. Then, X in (38) is LRD. Moreover, it has a non-negative covariance function , and . (iii) Let and (16) hold. Then, X in (38) is ND; moreover, . (iv) Let . Then,is a stationary solution of Equation (36). Moreover, X in (39) is SRD. Furthermore, , .
Proof. (i) Let
.
X in (
38) is well-defined if and only if (
17) holds, which is, therefore, a necessary condition. Let us show that
X in (
38) is a stationary solution of (
36). We use the spectral representation of white noise,
where
is a random complex-valued spectral measure on
with zero mean and variance
. Then,
is written as
see (
18). Then,
follows by (
19) and absolute summability
(see (
11) and (
14)).
Next, let
. Then,
X in (
38) is well-defined and is written as (
41), due to
. We need to show that the series in (
36) converges in mean square towards
if and only if (
16) or (
17) hold. The latter convergence writes as
From (
41),
in view of (
17). This proves part (i).
The ARFIMA(0,
d, 0) Equation (
4) is autoregressive, since the best linear predictor (or conditional expectation in the Gaussian case) of
, given the ‘past’
, is a linear combination
of the ‘past’ observations, due to the fact that
. For spatial equations, as in (
36) or (37), an analogous property given the ‘past’
does not hold, since
as a rule. This issue is important in spatial statistics and has been discussed in the literature (see [
29,
30] and the references therein), distinguishing between ‘simultaneous’ and ‘conditional autoregressive schemes’. A recent work [
31] discusses some conditional autoregressive models with LRD property.
Definition 2. Let X be an RF with for each . We say that X has:
(i) a simultaneous autoregressive representation with coefficients if for each where the series converges in mean square and the r.v.s satisfy . (ii) a conditional autoregressive representation with coefficients if for each where the series converges in mean square and the r.v.s satisfy .
Corollary 2. (i) Let and X be a fractionally integrated RF in (38) and (16) holds. Then, X has a simultaneous autoregressive representation with coefficients , and , ; (ii) Let , X be a fractionally integrated RF in (38) and (16) holds. Then, X has a conditional autoregressive representation with coefficients and where is a complex-valued random measure given in (40) with zero mean and variance and (iii) Let and X be a (tempered) fractionally integrated RF in (39). Then, X has a simultaneous autoregressive representation with and a conditional autoregressive representation with with the same as in part (ii) and
Proof. (i) is obvious from Corollary 1 and (
36),
.
(ii) By (
16),
and
are well-defined,
and
. The orthogonality relation
follows from the spectral representations in (
40) and (
41):
It remains to show (
42), including the convergence of the series. In view of the definition of
, this amounts to showing
or, in spectral terms, to the convergence of the Fourier series
in
. Note
, where the RF
,
, results from application of the inverse operator. Since
has negative dependence (see (
41) and the proof of Corollary 1 (iii)) the covariances
are absolutely summable. Therefore, the Fourier series on the l.h.s. of (
43) converges uniformly in
to
, proving (
43).
Example 1. Fractional Laplacian. The (lattice) Laplace operator on is defined asso that , where is the transition operator of the simple random walk on with equal one-step transition probabilities to the nearest-neighbors . For , the fractional Laplace RF can be defined as a stationary solution of the difference equationwith weak white noise on the r.h.s., written as a moving-average RF: We find that , andfor some and . Hence, condition (16) for (44) translates to In particular, a stationary solution of Equation (44) on exists for all . Finally, recall that (16) is equivalent to condition (17). We could have verified the latter by using Corollary 2, which gives the asymptotics of coefficients in (45). Example 2. Fractional heat operator. For a parameter , we can extend the definition of the (lattice) heat operator on from in [12] to as follows: Thus, corresponds to the random walk on with 1-step distribution . We find that We also find that outside the origin for some since . Therefore,and if . The above result agrees with [12] for , and extends it to the arbitrary , . Example 3. Fractionally integrated time series models (case ). As noted above, the ARFIMA process is a particular case of (38) corresponding to the backward shift or the deterministic random walk . Another fractionally integrated time series model is given in Example 1 and corresponds to the symmetric nearest-neighbor random walk on with probabilities 1/2. It is of interest to compare these two processes and their properties. Let be the corresponding operators, For and , processes and are well-defined; moreover, they are stationary solutions of the respective equations and . The spectral densities of and are given by We see that when the processes and have the same 2nd order properties up to a multiplicative constant, so that in the Gaussian case is a noncausal representation of the ARFIMA.
4. Scaling Limits
As explained in the Introduction, the isotropic scaling limits refer to the limit distribution of the integrals
where
is a given stationary random field (RF) for each
from a class of (test) functions
. We choose the latter class to be
In the following,
X is a linear or moving-average RF on
:
where
are independent identically distributed (i.i.d.) r.v.s, with
,
, and
being deterministic coefficients. Obviously, stationary solution (
38) of Equation (
36) satisfying Corollary 1 is a particular case of linear RF with
. Our limits results assume an ‘isotropic’ behavior of
as
, detailed as follows. Let
denote the class of all continuous functions on
.
Assumption 1. Let be a sequence of real numbers satisfying the following properties:
The class of RFs in (
47) with coefficients satisfying Assumption 1 is related but not limited to the fractionally integrated RFs in (
36) and (
37). Note that the parameter
d is no longer restricted to being in
. By easy observation, Assumption 1 implies the LRD, ND, and SRD properties of
Section 3 in the respective cases
, and
. Following the terminology in time series [
3], the parameter
d in (
48) may be called the
memoryparameter of the linear RF
X in (
47), except that for
the memory parameter is usually defined as
.
In particular, the covariance function
of the linear RF
X in (
47) is written as
or the lattice convolution of
with itself. We will use the notation
for the lattice convolution and
for continuous convolution, viz.:
which is well-defined for any
(respectively, for any
,
).
Proposition 4. Let satisfy Assumption 1 with and some Then,where the (angular) function is given by Proof. The existence and continuity of follow from the finiteness of the integrals
,
. For (
49), it suffices to show that
Let
and
(see (
48)). Then,
. Clearly, (
50) follows from
and
To prove (
51), rewrite
as an integral and change the variable
in it. This leads to
, where
where
Relation (
51) follows once we prove the uniform convergence
. Since
is a compact set and
is continuous, the last relation is implied by the sequentional convergence
for any
and any
convergent to
:
. The proof of (
54) uses the bound
which follows from the boundedness of
and
with
; hence,
. Note
for any
and
according to (
55). Since
does not depend on
and
, Pratt’s lemma [
32] applies to the integral in (
53), resulting in (
54) and (
51). The proof of (
52) is similar and simpler and is omitted. □
The question about the asymptotics of the variance of (
46) arises, assuming the power-law asymptotics of the covariance admitting power-law behavior at large lags, which is tackled in the following proposition:
Proposition 5. (i) For any as (ii) Let satisfywhere and . Then, for any where (iii) Let . Then, for any ,
Proof. (i) Write
for the l.h.s. of (
56). First, let
. Then,
as
. Next, let
; then,
, where
is followed by
+
. Finally, for
we have
, where
follows similarly.
(ii) The convergence of the integral in (
59) follows from that of
in part (i), with
Let
denote the integral on the l.h.s. of (
58). By a change of variables,
where
for any
. Using Pratt’s lemma [
32], it suffices to prove (
58) for
. In the latter case, and with
, we see that
as in the proof of Proposition 4. Thus, (
58) follows from the DCT.
(iii) Let
be the same as in the proof of (ii). For a large
, write
where
, and
, and
. Here,
can be made arbitrarily small uniformly in
by choosing
K large enough. Next,
By the boundedness of
, we see that the integral
a.e. in
, and is bounded in
. Then, since
we conclude
by the DCT. Finally,
, and we can replace the last integral by the r.h.s. of (
60) uniformly in
provided
K is large enough. □
Proposition 5 does not apply to ND covariances satisfying (
57) with negative
. This case is more delicate, since it requires additional regularity conditions of the test functions and the occurrence of ‘edge effects’. A detailed analysis of this issue in dimension
and for indicator (test) functions of rectangles in
can be found in [
16]. Below, we present a result in this direction and sufficient conditions on
when the limits take a similar form to (
58). We introduce a subclass of test functions:
Proposition 6. Let satisfy Assumption 1 with . Then, for any we havewhere Proof. The convergence of the integral on the r.h.s. of (
63) follows from (
61) and the Minkowski integral inequality:
.
The proof of the convergence in (
62) resembles that of (
58). Write
for the integral on the l.h.s. of (
62). Using
we rewrite
,
, and
where the inner integrals tend to those on the r.h.s. of (
63) at each
, such that
,
. The remaining details are similar to (
58) and are omitted. □
Remark 3. The restriction in Proposition 6 is not necessary for (63). Indeed, if satisfies the uniform Lipschitz condition then the integral in (61) converges for , implying . On the other hand, for the indicator functions of a bounded Borel set with a ‘regular’ boundary, we typically have leading to . Relation (
48) entails the existence of the scaling limit
which is a continuous homogeneous function on
: for any
we have
With the limit function in (
64) we associate a Gaussian RF:
where
is a real-valued Gaussian white noise (also called the real-valued Gaussian random measure) with zero mean and where variance
,
is the usual and
the ‘regularized’ convolution. For the indicator test function
of a Borel set
(belonging to
) we see that the latter convolution equals
This paper uses the elementary properties of the white noise integrals in (
66) only. Namely,
is well-defined for each
and has a Gaussian distribution with zero mean and variance
(see, e.g., [
5,
7]), implying that
for each
. The interested reader is referred to [
24] on white noise calculus on the Schwartz space and to [
33] for fractional calculus with respect to fractional Brownian motion. The existence of stochastic integrals in (
66) follows from Propositions 5 and 6. Particularly, the variances
and
agree with (
59) and (
63).
Let
be the Schwartz space of all infinitely differentiable rapidly decreasing functions
, i.e., for each
and each multi-index
,
where
(see, e.g., [
34] (Section 7) for the properties of
and the dual space
of tempered Schwartz distributions). Following [
35], we say that a generalized RF
is
stationary if
and
H-self-similar (
) if
. As noted in Remark 3,
; hence, (
66) is well-defined for any
and represents stationary generalized RFs on
. By the scaling property in (
65) and a change of variables, we see that
; hence, RF
in (
66) is
-self-similar, with
The RF in (
66) appear as scaling limits in the following corollary:
Corollary 3. Let X be a linear RF satisfying Assumption 1 and be defined in (46). Then,where . Proof. Since (
46) writes as a linear form
in i.i.d. r.v.s, we can use the Lindeberg-type condition (see also [
3] (Corollary 4.3.1)). Accordingly, it suffices to show that
holds in each case,
, of the corollary. The behavior of the last variance is detailed in Propositions 5 and 6, and it grows to infinity in each case of
d. On the other hand, the l.h.s. of (
67) does not exceed
, which is bounded in cases
and
. Finally, in case
we see that the l.h.s. of (
67) does not exceed
and (
67) holds, since
. □