Methods of Constructing Asymptotically Efficient Estimates for Parameters of Stationary Time Series

1. Introduction
In applications of mathematical statistics to modern problems of data analysis in natural science and technology, it is often impossible to use the classical observation models in the form of a sequence of independent identically distributed random variables (the i.i.d. model). As a rule, the i.i.d. model does not provide sufficient accuracy of statistical inferences about the unknown parameters of the investigated physical processes distorted by noise, when both the process and the noise are stationary random processes.
Thus, it is important to generalize the classical results of the statistical theory of parameter estimation, developed for the i.i.d. model, in order to apply them to actual practical problems in the analysis of real physical processes.
In modern systems for analyzing physical wave fields, a large number of parameters are simultaneously measured, and many sensors are used to improve the accuracy of the analysis. That is, multidimensional time series are subjected to statistical processing, and vector parameters are estimated as a result of this processing.
For many statistical models of multivariate time series, it is impossible to synthesize statistically efficient estimates $\hat u_n$ of vector parameters $u \in \mathbb{R}^q$, that is, estimates for which the covariance matrices are minimal for any finite size $n$ of observations and are equal to the inverse Fisher information matrix:
$$\operatorname{cov}_u\{\hat u_n\} = I^{-1}_n(u), \qquad (1)$$
where $I_n(u) = \mathbf{E}_u\{\nabla_u \ln p_n(X^n; u)\,\nabla^T_u \ln p_n(X^n; u)\}$; $p_n(x^n; u)$ is the probability density of the observations $X^n = (x_1, \dots, x_n)$.
At the same time, asymptotically efficient (AE) estimates $u^*_n$ can be constructed for a wide class of multivariate time series with interdependent elements possessing a strong mixing property [1]. For AE-estimates, equality (1) is attained asymptotically as $n \to \infty$:
$$\lim_{n\to\infty}\operatorname{cov}_u\{\sqrt n\,(u^*_n - u)\} = I^{-1}(u),$$
where $I(u) = \lim_{n\to\infty} n^{-1} I_n(u)$. They can be found in the class $\mathcal{K}$ of regular estimates $\tilde u_n$ for which the random quantities $\sqrt n\,(\tilde u_n - u)$, $n = 1, 2, \dots$, have limit distributions with finite second moments. This statement is one of the results of the extensive asymptotic theory of statistical inference for random time series, which is most fully presented in [2]. Fundamental results in this theory were obtained in the known publications [3,4,5,6]. In these books, sufficient conditions were established under which AE-estimates exist for many probabilistic models of random time series and continuous processes.
The main condition under which the AE-estimates can be constructed is the local asymptotic normality (LAN) of the likelihood ratio of observations $X^n$ [3]. It means that the likelihood ratio of the observations admits the following asymptotic expansion:
$$\ln\frac{p_n(X^n; u + h/\sqrt n)}{p_n(X^n; u)} = h^T\,\Delta_n(X^n; u) - \frac{1}{2}\,h^T I(u)\,h + r_n(X^n; u, h), \qquad (2)$$
where $h \in \mathbb{R}^q$; $\Delta_n(X^n; u)$, $n = 1, 2, \dots$, is a family of statistics for which the probability distributions tend as $n \to \infty$ to the $q$-dimensional Gaussian distribution with the parameters $(0, I(u))$ uniformly in $u$; $r_n(X^n; u, h) \to 0$ ($n \to \infty$) in $P^n_u$-probability uniformly in $u$ and in $|h| \le H$, where $H > 0$ is any number.
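As a numerical sanity check of expansion (2), the following sketch (ours, not from the paper; the i.i.d. Gaussian location model is chosen only because its LAN remainder term vanishes identically) evaluates both sides of the expansion:

```python
import numpy as np

rng = np.random.default_rng(4)

# Illustrative model (an assumption, not from the paper): i.i.d. N(u, 1).
# For it the LAN expansion (2) holds exactly, with
# Delta_n(u) = n^{-1/2} sum_i (x_i - u) and Fisher information I(u) = 1.
u, h, n = 1.5, 0.7, 1000
x = rng.normal(u, 1.0, n)

def loglik(v):
    return -0.5 * np.sum((x - v) ** 2)   # log-density up to an additive constant

lr = loglik(u + h / np.sqrt(n)) - loglik(u)   # log likelihood ratio
delta = np.sum(x - u) / np.sqrt(n)            # Delta_n(X^n; u)
expansion = h * delta - 0.5 * h ** 2 * 1.0    # h * Delta_n - h^2 I(u) / 2

print(lr, expansion)  # identical here: the remainder r_n vanishes for this model
```

For richer time-series models the remainder $r_n$ is nonzero and only tends to zero in probability, which is exactly what the LAN condition requires.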
Many publications, for example, [
7,
8,
9,
10,
11,
12,
13,
14], have been devoted to proving the LAN property for various probabilistic models of time series other than the i.i.d. model. The results of research in this direction, obtained up to the end of the twentieth century, are summarized in the monograph [
2]. It was shown that the LAN property is inherent in a wide class of multidimensional time series and continuous random processes.
The formulation of the LAN condition (2) largely determined the further development and practical applications of the asymptotic estimation theory. In the well-known monograph [
6], it is shown that under the LAN condition, the maximum likelihood estimate belongs to the class $\mathcal{K}$ of regular statistical estimates and is an AE-estimate.
At the same time, using the decomposition (2) of the likelihood function of observations, new AE-estimates were constructed, which differ from the traditional maximum likelihood estimates and are computationally simpler. An elegant and, in many cases, the most computationally simple method for constructing AE-estimates was proposed in [3,4]. It is based on R. Fisher’s [15] idea of “improving” the quality of some “simple” estimate to the quality of an AE-estimate. In the mentioned publications, L. Le Cam showed that the AE-estimate can be obtained using the equation:
$$u^*_n = \bar u_n + \frac{1}{\sqrt n}\,I^{-1}(\bar u_n)\,\Delta_n(X^n; \bar u_n), \qquad (3)$$
where $\bar u_n$ is an arbitrary $\sqrt n$-consistent estimate of the parameter $u$, that is, an estimate for which the quantities $\sqrt n\,(\bar u_n - u)$, $n = 1, 2, \dots$, have the property: for any $\varepsilon > 0$ there is $L(\varepsilon) > 0$ such that $\sup_n P^n_u\{\sqrt n\,|\bar u_n - u| > L(\varepsilon)\} < \varepsilon$.
Note that Equation (3) defines a whole class of AE-estimates, the quality of which is asymptotically equivalent to the quality of the ML-estimate, since the statistic $\Delta_n(X^n; u)$ in the LAN expansion (2) and the $\sqrt n$-consistent estimate $\bar u_n$ are not unique functions. For this reason, in many practically important cases, formula (3) allows one to obtain AE-estimates, which are computationally much simpler than ML-estimates.
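Le Cam's one-step construction (3) is easy to illustrate numerically. The sketch below is our illustration, not from the paper: the Gaussian AR(1) model, the closed-form score and Fisher information, and the crude half-sample autocorrelation pilot are all assumptions chosen for concreteness.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative model (an assumption): Gaussian AR(1),
# x_t = u * x_{t-1} + e_t, e_t ~ N(0, 1), |u| < 1.
u_true, n = 0.6, 20000
e = rng.normal(size=n)
x = np.empty(n)
x[0] = e[0]
for t in range(1, n):
    x[t] = u_true * x[t - 1] + e[t]

def score(u, x):
    # Conditional log-likelihood score, normalized by sqrt(n):
    # Delta_n(u) = n^{-1/2} * sum_t (x_t - u x_{t-1}) x_{t-1}
    m = len(x)
    return np.sum((x[1:] - u * x[:-1]) * x[:-1]) / np.sqrt(m)

def fisher(u):
    # Per-sample Fisher information of the stationary AR(1): E[x^2] = 1/(1-u^2)
    return 1.0 / (1.0 - u ** 2)

# A crude sqrt(n)-consistent pilot estimate: lag-1 autocorrelation on a half-sample
u_bar = np.corrcoef(x[: n // 2 - 1], x[1 : n // 2])[0, 1]

# Le Cam's one-step improvement, Equation (3):
# u* = u_bar + n^{-1/2} I^{-1}(u_bar) Delta_n(u_bar)
u_star = u_bar + score(u_bar, x) / (fisher(u_bar) * np.sqrt(n))
print(u_bar, u_star)  # both close to 0.6; u_star is the improved estimate
```

The point of (3) is that a single correction step applied to any $\sqrt n$-consistent pilot already attains the asymptotic quality of the ML-estimate.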
2. Construction of M-Estimates for Parameters of Stationary Time Series with Suitable Asymptotic Properties
The AE-estimates have some disadvantages from the point of view of practical applications. First, they can be synthesized only if the probability density of the observations is fully known. In practice, some important details of this density are often not fully defined; only a certain class is known to which this density belongs. Second, the quality of AE-estimates is often sensitive to deviations of the actual density from the assumed one for which they were synthesized. Even a small deviation from the expected density can lead to a significant loss in the accuracy of the AE-estimate.
In the publications [16,17], methods were developed for constructing estimates that are robust to changes in the distribution of observations, and in many applications, such robust estimates are preferable to AE-estimates. A robust estimate $\hat u_n$ is constructed by finding the global maximum of a certain objective function $Q_n(X^n; u)$ (a criterion of estimation quality), which differs from the likelihood function:
$$\hat u_n = \arg\max_{u \in U} Q_n(X^n; u). \qquad (4)$$
In addition to robust estimates, estimates synthesized using Equation (4) arise in other problems of mathematical statistics. Examples include Bayesian estimation problems, estimation problems with interfering (nuisance) parameters, and problems arising in the analysis of natural and economic dynamical systems.
The estimates obtained as the maxima of some objective functions $Q_n(X^n; u)$ were called “M-estimates”. Apart from the books [16,17], they were considered in many other publications, for example, in [18,19]. In most of these publications, the M-estimates were constructed and analyzed for the i.i.d. model of random observations.
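A concrete instance of an M-estimate in the i.i.d. setting can be sketched as follows (our illustration, not from the paper; the contaminated-normal data and Huber's $\rho$ with tuning constant $k = 1.345$ are assumptions chosen for concreteness):

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative data (an assumption): location u = 2 with 5% heavy-tailed outliers.
x = np.concatenate([rng.normal(2.0, 1.0, 950),
                    rng.normal(2.0, 10.0, 50)])

k = 1.345  # standard Huber tuning constant

def psi(t):
    # psi = rho', the bounded influence function of Huber's objective
    return np.clip(t, -k, k)

def m_estimate(x, tol=1e-10):
    # The maximizer of Q_n(u) = -sum_i rho(x_i - u) solves
    # sum_i psi(x_i - u) = 0; the sum is decreasing in u, so
    # bisection finds the root.
    lo, hi = x.min(), x.max()
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if np.sum(psi(x - mid)) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

u_hat = m_estimate(x)
print(u_hat)  # close to 2.0 despite the outliers
```

The bounded $\psi$ is what makes the estimate insensitive to the outlying observations, in contrast to the sample mean.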
The authors are not aware of publications in which the asymptotic properties of M-estimates were studied with a sufficient level of mathematical rigor for multidimensional stationary random time series that have a strong mixing property. The authors are also unaware of publications devoted to the construction of computationally simple estimates that are asymptotically equivalent in quality to M-estimates.
In this paper, we consider an approach to solving these problems from the standpoint of the asymptotic theory of statistical inference [2], which is based on Le Cam’s concept of local asymptotic normality.
We suppose that the random objective function $Q_n(X^n; u)$ is twice differentiable in $P^n_u$-probability with respect to the components of the vector $u = (u_1, \dots, u_q)$; that is, there exist the following family of vector statistics $\nabla Q_n(X^n; u)$ and matrix function $W_n(X^n; u)$:
$$\nabla Q_n(X^n; u) = \Bigl(\frac{\partial Q_n}{\partial u_1}, \dots, \frac{\partial Q_n}{\partial u_q}\Bigr)^T, \qquad W_n(X^n; u) = -\frac{1}{n}\,\Bigl\|\frac{\partial^2 Q_n}{\partial u_i\,\partial u_j}\Bigr\|^q_{i,j=1}. \qquad (5)$$
In this case, the M-estimate (4) is one of the roots of the following equation system with respect to the parameter $u$:
$$\nabla Q_n(X^n; u) = 0. \qquad (6)$$
In this paper, we show how to find the estimate $u^*_n$, which is a root of the equation system (6), and, at the same time, it is a $\sqrt n$-consistent estimate of the parameter $u$. It is proved in Theorem 1 that under certain restrictions, such an estimate $u^*_n$ can be found using the algorithm
$$u^*_n = \bar u_n + \frac{1}{\sqrt n}\,W^{-1}(\bar u_n)\,\Delta_n(X^n; \bar u_n), \qquad (7)$$
where $\Delta_n(X^n; u) = \frac{1}{\sqrt n}\,\nabla Q_n(X^n; u)$; $W(u)$ is the limit in $P^n_u$-probability of the matrix functions $W_n(X^n; u)$ from (5); $\bar u_n$ is any $\sqrt n$-consistent estimate of the parameter $u$.
Conditions are formulated in Theorem 1 on the family of statistics $\Delta_n(X^n; u)$ and the sequence of the matrix functions $W_n(X^n; u)$ that are sufficient for the asymptotic normality of the estimate (7): the distributions of $\sqrt n\,(u^*_n - u)$ tend to $N(0, S(u))$, where the asymptotic covariance matrix $S(u)$ is equal to
$$S(u) = W^{-1}(u)\,B(u)\,W^{-1}(u). \qquad (8)$$
The corollary of Theorem 1 describes a method for constructing another estimate that has the same asymptotic distribution as the estimate (7) but does not require an auxiliary $\sqrt n$-consistent estimate $\bar u_n$.
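The one-step algorithm (7) and the sandwich matrix (8) can be sketched numerically for a scalar case (our illustration, not from the paper; the Huber location model, the contamination level, and the median pilot are assumptions):

```python
import numpy as np

rng = np.random.default_rng(2)

# Illustrative data (an assumption): true location u = 0, 5% outliers.
x = np.concatenate([rng.normal(0.0, 1.0, 4750),
                    rng.normal(0.0, 10.0, 250)])
n, k = len(x), 1.345

psi = lambda t: np.clip(t, -k, k)            # psi = rho' (bounded influence)
dpsi = lambda t: (np.abs(t) <= k) * 1.0      # psi'

u_bar = np.median(x)                         # sqrt(n)-consistent pilot estimate

# Delta_n(u) = n^{-1/2} sum_i psi(x_i - u),  W_n(u) = n^{-1} sum_i psi'(x_i - u)
delta = np.sum(psi(x - u_bar)) / np.sqrt(n)
W = np.mean(dpsi(x - u_bar))

# One-step estimate, Equation (7): u* = u_bar + n^{-1/2} W^{-1} Delta_n(u_bar)
u_star = u_bar + delta / (W * np.sqrt(n))

# Sandwich variance, Equation (8): S = W^{-1} B W^{-1}, with B the mean of psi^2
B = np.mean(psi(x - u_star) ** 2)
S = B / W ** 2
print(u_star, np.sqrt(S / n))  # the estimate and its asymptotic standard error
```

In the vector case $W$ and $B$ become the $q \times q$ matrices of (5) and condition B1, and the scalar division is replaced by matrix inversion.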
Note that the statements of Theorem 1 and the corollary were formulated earlier in [
20]. In our paper, the above statements are proved under more general assumptions, and simpler proofs are given.
Theorem 1. A. There exists a $\sqrt n$-consistent estimate $\bar u_n$ of the parameter $u$.
B. Let the family of statistics $\Delta_n(X^n; u)$, $n = 1, 2, \dots$, and the sequence of positive definite symmetric $q \times q$-matrix functions $W_n(X^n; u)$ satisfy the following constraints:
B1. For each value of the parameter $u$, the sequence of statistics $\Delta_n(X^n; u)$ is asymptotically normal with zero mean and the covariance matrix $B(u)$:
$$\lim_{n\to\infty}\mathcal{L}\{\Delta_n(X^n; u)\} = N(0, B(u)),$$
where $B(u) = \lim_{n\to\infty}\operatorname{cov}_u\{\Delta_n(X^n; u)\}$.
B2. For each value of the parameter $u$, the following asymptotic expansion of the statistic $\Delta_n(X^n; u + h/\sqrt n)$ holds:
$$\Delta_n(X^n; u + h/\sqrt n) = \Delta_n(X^n; u) - W(u)\,h + \gamma_n(X^n; u, h),$$
where $\gamma_n(X^n; u, h) \to 0$ in $P^n_u$-probability as $n \to \infty$ for any $|h| \le H$, $H > 0$; $W(u)$ is a continuous function of $u$. Then the following statement is true:
(T1) For any $\sqrt n$-consistent estimate $\bar u_n$ of the parameter $u$, the statistic
$$u^*_n = \bar u_n + \frac{1}{\sqrt n}\,W^{-1}(\bar u_n)\,\Delta_n(X^n; \bar u_n)$$
is the $\sqrt n$-consistent and asymptotically normal estimate of the parameter $u$ with the moments $(0, S(u))$, where $S(u)$ = $W^{-1}(u)\,B(u)\,W^{-1}(u)$.
Corollary 1. (a) Let, for any $n$, a statistic $\tilde u_n$ be the root of the equation $\Delta_n(X^n; u) = 0$ with respect to the parameter $u$ with probability equal to 1.
(b) Let the statistic $\tilde u_n$ also be a $\sqrt n$-consistent estimate of the parameter $u$. Then the statistic $\sqrt n\,(\tilde u_n - u)$ is asymptotically normal with the moments $(0, S(u))$.
Remark 1. (a) The statement similar to Statement (T1) of Theorem 1 was proved in [3,4] in the case when the objective function $Q_n(X^n; u)$ is the likelihood function of observations having the LAN property (2). In this case, $\Delta_n(X^n; u)$ is the statistic from the expansion (2), the matrix function $W(u) = I(u)$ and $B(u) = I(u)$, where $I(u)$ is the Fisher matrix. It follows from Theorem 1 that in this case
$$S(u) = I^{-1}(u)\,I(u)\,I^{-1}(u) = I^{-1}(u).$$
Consequently, the statistic $\sqrt n\,(u^*_n - u)$ is asymptotically normal with the parameters $(0, I^{-1}(u))$, and hence, $u^*_n$ is the asymptotically efficient estimate of the parameter $u$. (b) It follows from the corollary of Theorem 1 that a statistic $\tilde u_n$, which has the property $\Delta_n(X^n; \tilde u_n) = 0$ with probability equal to one, and at the same time is a $\sqrt n$-consistent estimate of the parameter $u$, is asymptotically normal with the moments $(0, I^{-1}(u))$. Consequently, the statistic $\tilde u_n$ is the asymptotically efficient estimate of the parameter $u$.
Thus, Theorem 1 is, in some sense, an extension of Le Cam’s results to the case of an arbitrary objective function whose gradient satisfies conditions B1, B2 of Theorem 1.
3. Proof of Theorem 1
In the course of proving Theorem 1, we will omit, if it is obvious, the dependence of functional quantities on the observations and sometimes denote their dependence on the parameter u by a subscript.
In these notations, the definition (7) of the estimate $u^*_n$ can be written as
$$u^*_n = \bar u_n + \frac{1}{\sqrt n}\,W^{-1}_{\bar u_n}\,\Delta_n(\bar u_n).$$
Then we can write the following chain of equalities:
$$\sqrt n\,(u^*_n - u) = \sqrt n\,(\bar u_n - u) + W^{-1}_{\bar u_n}\,\Delta_n(\bar u_n) = \bar h_n + W^{-1}_{\bar u_n}\,\Delta_n(u + \bar h_n/\sqrt n), \qquad (9)$$
where $\bar h_n$ = $\sqrt n\,(\bar u_n - u)$. It follows from (9):
$$W_{\bar u_n}\bigl(\sqrt n\,(u^*_n - u) - \bar h_n\bigr) = \Delta_n(u + \bar h_n/\sqrt n). \qquad (10)$$
By denoting $\bar\Delta_n$ = $\Delta_n(u + \bar h_n/\sqrt n)$, we obtain from (10):
$$\bar\Delta_n = W_{\bar u_n}\bigl(\sqrt n\,(u^*_n - u) - \bar h_n\bigr), \qquad (11)$$
where the random quantities $\bar h_n$, $n = 1, 2, \dots$, have the property: for any $\varepsilon > 0$ there is $L(\varepsilon) > 0$ such that $\sup_n P^n_u\{|\bar h_n| > L(\varepsilon)\} < \varepsilon$.
At the same time, from condition B2 of Theorem 1, we obtain:
$$\bar\Delta_n = \Delta_n(u) - W_u\,\bar h_n + \bar\gamma_n, \qquad (12)$$
where $\bar\gamma_n = \gamma_n(u, \bar h_n)$.
Comparing Equations (11) and (12) allows us to prove the following Lemma.
Lemma 1. Under the conditions of Theorem 1, the following convergences take place for any $u$:
(a) $\bar\gamma_n \to 0$ in $P^n_u$-probability as $n \to \infty$; (b) $\sqrt n\,(u^*_n - u) - W^{-1}_u\,\Delta_n(u) \to 0$ in $P^n_u$-probability as $n \to \infty$.
The following statement will be needed below.
Lemma 2. Let some random variables $\zeta_n$ and $\eta_n$ have the properties:
(a) the distributions of $\eta_n$ converge weakly as $n \to \infty$ to a limit distribution $F$; (b) $\zeta_n - \eta_n \to 0$ in probability as $n \to \infty$.
Then the distributions of $\zeta_n$ converge weakly to $F$.
The proof of Lemma 2 is quite simple, and we omit it.
Taking into account Equations (9)–(12) and the statements of Lemmas 1 and 2, we can write the following equalities:
$$\lim_{n\to\infty}\mathcal{L}\{\sqrt n\,(u^*_n - u)\} = \lim_{n\to\infty}\mathcal{L}\{W^{-1}_u\,\Delta_n(u)\},$$
where the existence of the limits follows from conditions B1, B2 of Theorem 1. According to condition B1 of Theorem 1, we have:
$$\lim_{n\to\infty}\mathcal{L}\{\Delta_n(u)\} = N(0, B(u)),$$
where $B(u) = \lim_{n\to\infty}\operatorname{cov}_u\{\Delta_n(u)\}$. Therefore:
$$\lim_{n\to\infty}\mathcal{L}\{\sqrt n\,(u^*_n - u)\} = N(0, S(u)),$$
where $S(u) = W^{-1}_u\,B(u)\,W^{-1}_u$. □
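The conclusion of Theorem 1 can also be checked by simulation. The following sketch (ours, not part of the proof; the scalar Huber location model, the tuning constant, and the median pilot are assumptions) repeats the one-step construction (7) and compares the empirical variance of $\sqrt n\,(u^*_n - u)$ with the sandwich variance $S = W^{-1} B\, W^{-1}$:

```python
import numpy as np

rng = np.random.default_rng(3)

# Monte Carlo check of Theorem 1: sqrt(n)(u*_n - u) ~ N(0, S) approximately,
# with S = W^{-1} B W^{-1}, here for i.i.d. N(0, 1) data and Huber's psi.
n, reps, k = 500, 2000, 1.345
psi = lambda t: np.clip(t, -k, k)

z = np.empty(reps)
for r in range(reps):
    x = rng.normal(0.0, 1.0, n)             # true parameter u = 0
    u_bar = np.median(x)                    # sqrt(n)-consistent pilot estimate
    W_n = np.mean(np.abs(x - u_bar) <= k)   # empirical W: mean of psi'
    u_star = u_bar + np.sum(psi(x - u_bar)) / (W_n * n)   # Equation (7)
    z[r] = np.sqrt(n) * u_star

# Plug-in sandwich variance S0 = B / W^2 from a large reference sample
e = rng.normal(0.0, 1.0, 200000)
W0 = np.mean(np.abs(e) <= k)
B0 = np.mean(psi(e) ** 2)
S0 = B0 / W0 ** 2

print(np.var(z), S0)  # empirical and theoretical variances should be close
```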
4. Proof of Lemma 1
(a) For any $\varepsilon > 0$, $q > 0$ and $L > 0$, we can write the following equation:
$$P^n_u\{|\bar\gamma_n| > q\} = P^n_u\{|\bar\gamma_n| > q,\ |\bar h_n| \le L\} + P^n_u\{|\bar\gamma_n| > q,\ |\bar h_n| > L\}. \qquad (13)$$
Let $P^n_u\{A \mid B\}$ denote the conditional probability of the event $A$ under the condition of the event $B$. Then (13) can be rewritten as:
$$P^n_u\{|\bar\gamma_n| > q\} = P^n_u\{|\bar\gamma_n| > q \mid |\bar h_n| \le L\}\,P^n_u\{|\bar h_n| \le L\} + P^n_u\{|\bar\gamma_n| > q \mid |\bar h_n| > L\}\,P^n_u\{|\bar h_n| > L\}. \qquad (14)$$
According to (11), there is $L(\varepsilon)$ such that $P^n_u\{|\bar h_n| > L(\varepsilon)\} < \varepsilon/2$ for any $n$. It follows then from (14) that for any $n$ and $q > 0$:
$$P^n_u\{|\bar\gamma_n| > q\} \le P^n_u\{|\bar\gamma_n| > q \mid |\bar h_n| \le L(\varepsilon)\} + \varepsilon/2. \qquad (15)$$
According to (12), $\bar\gamma_n = \gamma_n(u, \bar h_n)$, and by condition B2 of Theorem 1, $\gamma_n(u, h) \to 0$ in $P^n_u$-probability uniformly in $|h| \le L(\varepsilon)$; therefore, for any $q > 0$:
$$\lim_{n\to\infty} P^n_u\{|\bar\gamma_n| > q \mid |\bar h_n| \le L(\varepsilon)\} = 0. \qquad (16)$$
It follows from (15), (16) that $\limsup_{n\to\infty} P^n_u\{|\bar\gamma_n| > q\} \le \varepsilon/2$ for any $\varepsilon > 0$.
(b) Since, by (11) and (12),
$$\sqrt n\,(u^*_n - u) - W^{-1}_u\,\Delta_n(u) = (W^{-1}_{\bar u_n} - W^{-1}_u)\,\Delta_n(u) + (E_q - W^{-1}_{\bar u_n} W_u)\,\bar h_n + W^{-1}_{\bar u_n}\,\bar\gamma_n,$$
where $E_q$ is the $q \times q$ identity matrix, to prove statement (b) of Lemma 1, it suffices to check that each term on the right-hand side tends to zero in $P^n_u$-probability. The quantity $\Delta_n(u)$ is bounded in probability by condition B1 of Theorem 1, and $\bar h_n$ is bounded in probability by (11). Since $W_u$ satisfies condition B2 of Theorem 1, it is a continuous function of $u$, and hence, by the consistency of $\bar u_n$, for any $\varepsilon > 0$ there exists $n(\varepsilon)$ such that for all $n > n(\varepsilon)$ the following inequality holds: $P^n_u\{\|W^{-1}_{\bar u_n} - W^{-1}_u\| > \varepsilon\} < \varepsilon$. So, we can conclude that the first two terms tend to zero in $P^n_u$-probability. Since $\bar\gamma_n$ satisfies statement (a) of Lemma 1, the third term also tends to zero in $P^n_u$-probability, which proves statement (b). □