A Modified Multiplicative Thinning-Based INARCH Model: Properties, Saddlepoint Maximum Likelihood Estimation, and Application

Xu, Yue; Li, Qi; Zhu, Fukang

doi:10.3390/e25020207

Open AccessArticle

A Modified Multiplicative Thinning-Based INARCH Model: Properties, Saddlepoint Maximum Likelihood Estimation, and Application

by

Yue Xu

¹,

Qi Li

² and

Fukang Zhu

^1,*

¹

School of Mathematics, Jilin University, Changchun 130012, China

²

College of Mathematics, Changchun Normal University, Changchun 130032, China

^*

Author to whom correspondence should be addressed.

Entropy 2023, 25(2), 207; https://doi.org/10.3390/e25020207

Submission received: 19 December 2022 / Revised: 15 January 2023 / Accepted: 18 January 2023 / Published: 21 January 2023

(This article belongs to the Special Issue Discrete-Valued Time Series)

Download

Browse Figure

Versions Notes

Abstract

:

In this article, we propose a modified multiplicative thinning-based integer-valued autoregressive conditional heteroscedasticity model and use the saddlepoint maximum likelihood estimation (SPMLE) method to estimate parameters. A simulation study is given to show a better performance of the SPMLE. The application of the real data, which is concerned with the number of tick changes by the minute of the euro to the British pound exchange rate, shows the superiority of our modified model and the SPMLE.

Keywords:

INARCH model; saddlepoint approximation; thinning-based model; time series of counts

1. Introduction

In practice, we can often observe a series of integer-valued data that have their own distinguishing characteristics, and many models were proposed for modeling integer-valued time series, such as the integer-valued autoregressive (INAR) process introduced by McKenzie (1985) [1], and Al-Osh and Alzaid (1987) [2]; the integer-valued moving average process proposed by Al-Osh and Alzaid (1988) [3]; the integer-valued autoregressive moving-average model defined by McKenize (1988) [4]; and the integer-valued generalized autoregressive conditional heteroscedasticity (INGARCH) model proposed by Ferland et al. (2006) [5], among others. Here we focus on two kinds of the models above: one is the INAR process, which was introduced as a convenient way to transfer the usual autoregressive structure to a discrete-valued time series, and a p-order model, which is defined as follows:

X_{t} = \sum_{i = 1}^{p} α_{i} \circ X_{t - i} + ε_{t},

where

α_{i} \in [0, 1)

for

i = 1, \dots, p,

and

{ε_{t}}

is a sequence of independent and identically distributed (i.i.d.) non-negative integer-valued random variables with

E (ε_{t}) = μ

and

Var (ε_{t}) = σ_{ε}^{2}

. The binomial thinning operator ∘ is defined by Steutel and Van Harn (1979) [6] as:

α \circ X = \sum_{i = 1}^{X} Y_{i}, if X > 0 and 0 otherwise,

where

Y_{i}

are i.i.d. Bernoulli random variables, independent of X, with a success probability are defined by

α

. This model has been generalized by Qian and Zhu (2022) [7], and Huang et al. (2023) [8], among others.

The other is the INGARCH model which was proposed by Ferland et al. (2006) [5] to model the observations of integer-valued time series which exist heteroscedasticity; this INGARCH

(p, q)

model with a Poisson deviate is defined as:

X_{t} | F_{t - 1} : P (λ_{t}), λ_{t} = α_{0} + \sum_{i = 1}^{p} α_{i} X_{t - i} + \sum_{j - 1}^{q} β_{j} λ_{t - j},

where

α_{0} > 0, α_{i} \geq 0, β_{j} \geq 0, i = 1, \dots, p, j = 1, \dots, q, p \geq 1, q \geq 0

, and

F_{t - 1}

is the

σ

-field generated by

{X_{t - 1}, X_{t - 2}, \dots}

. This model has been generalized by Hu (2016) [9], Liu et al. (2022) [10], and Weiß et al. (2022) [11], among others. Weiß (2018) [12] and Davis et al. (2021) [13] gave recent reviews. According to definitions of INAR and INGARCH models, we noticed that the INAR model is thinning-based, while the INGARCH model is specified by a conditional distribution with a time-varying mean depending on past observations. Combining the thinning-based stochastic equations and the INGARCH model, Aknouche and Scotto (2022) [14] proposed a multiplicative thinning-based INGARCH (MthINGARCH) model to model the integer-valued time series with high overdispersion and persistence. Furthermore, it fits well with heavy-tailed data regardless of the choice of innovation distribution and does not require recourse to complex random coefficient equations. The MthINGARCH model is denoted by:

\{\begin{matrix} X_{t} = λ_{t} ε_{t}, \\ λ_{t} = 1 + ω \circ m + \sum_{i = 1}^{q} α_{i} \circ X_{t - i} + \sum_{j = 1}^{p} β_{j} \circ λ_{t - j}, \end{matrix}

(1)

where the symbol ∘ stands for the binomial thinning operator, and

0 \leq ω \leq 1

,

0 \leq α_{i} < 1

and

0 \leq β_{j} < 1 (i = 1, \dots, q, j = 1, \dots, p)

, m is a fixed positive integer number that was introduced for more flexibility. Since there is no explicit probability mass function for the series

{X_{t}}

, then the traditional maximum likelihood estimation (MLE) cannot be applied to estimate the parameters; therefore, Aknouche and Scotto (2022) [14] used a two-stage weighted least squares estimation instead.

Note that the probability mass function of the random variables cannot be given directly for the likelihood function in some cases; to solve this problem, saddlepoint approximation has been proposed. Daniel (1954) [15] introduced saddlepoint techniques into the statistical field, which have been extended by Field and Ronchetti (1990) [16], Jensen (1995) [17], and Butler (2007) [18]. Saddlepoint techniques have been used successfully in many applications because of the high accuracy with which they can approximate intractable densities and tail probabilities. Pedeli et al. (2015) [19] proposed an alternative approach based on the saddlepoint approximation to log-likelihood, and the saddlepoint maximum likelihood estimation (SPMLE) was used to estimate the parameters of the INAR model, which demonstrates the usefulness of this technique. Thus, through combining the MthINGARCH model of Aknouche and Scotto (2022) [14] and the saddlepoint approximation, we propose a modified multiplicative thinning-based INARCH model for modeling high overdispersion, before applying the saddlepoint method to the estimated parameters. Although the two-stage weighted least squares estimation could be used to estimate the parameters of our modified model, we still adopted the SPMLE as it was still expected to have a better performance than the two-stage weighted least squares estimation in practice. Here, we just consider the INARCH model instead of the INGARCH model because it is difficult and complex to give the conditional cumulant-generating function of random variables for the latter model when applying the saddlepoint approximation.

This article has the following structure. A modified multiplicative thinning-based INARCH model is given, alongside some related properties in Section 2. Moreover, we use the Poisson distribution and geometric distribution for innovations. Section 3 discusses the SPMLE and its asymptotic properties, then simulation studies for both models with SPMLE are also given. A real data example is analyzed with our modified models in Section 4, and comparisons with existing models are made. In-sample and out-of-sample forecasts are used to show the superiority of the SPMLE and our modified model. The conclusion is given in Section 5. Some details of SPMLE and proof of some theorems are presented in the Appendix A.

2. A Multiplicative Thinning-Based INARCH Model

Note that

N = {0, 1, 2, \dots}

and

Z = {\dots, - 1, 0, 1, \dots}

are the set of non-negative integers and integers, respectively. It can be supposed that

{ε_{t}, t \in Z}

is a sequence of i.i.d. random variables with a mean of one and finite variance of

σ^{2}

. The modified multiplicative thinning-based INARCH (denoted by the MthINARCH

(q)

) model, which we deal with in this paper, is defined by

X_{t} = λ_{t} ε_{t}, λ_{t} = ω \circ m + \sum_{i = 1}^{q} α_{i} \circ X_{t - i},

(2)

where

0 < ω \leq 1

,

0 \leq α_{i} < 1, i = 1, \dots, q

, m is a fixed positive integer number. In real applications, we can set m as the upper integer part of the sample mean. It is assumed that the Bernoulli terms corresponding to the binomial variables

ω \circ m

and

α_{i} \circ X_{t - i}

are mutually independent and independent of the sequence

{ε_{t}, t \in Z}

. The reason that we defined the new model in this way can be explained as follows. The additive term 1 in

λ_{t}

and in (1) is unnatural, and is posed to ensure

λ_{t} > 0

, but we can achieve this by adjusting the range of

ω

; therefore, we adopted a simple version of

λ_{t}

in (2).

Now that we discuss the conditional mean and conditional variance of

X_{t}

. Note that

F_{t - 1}

is the

σ

-field generated by

X_{t - 1}, X_{t - 2}, \dots

. For

E (ε_{t}) = 1

, let

μ_{t} : = E (X_{t} | F_{t - 1}) = E (λ_{t} ε_{t} | F_{t - 1}) = E (ε_{t}) E (λ_{t} | F_{t - 1}) = E (λ_{t} | F_{t - 1}) = ω m + \sum_{i = 1}^{q} α_{i} X_{t - i}

. Then we can obtain the conditional variance; first, let

ν_{t} : = Var (λ_{t} | F_{t - 1})

and

σ_{t}^{2} : = Var (X_{t} | F_{t - 1})

. For

E (ε_{t}) = 1, Var (ε_{t}) = σ^{2}

, so

E (ε_{t}^{2}) = σ^{2} + 1

. Therefore,

\begin{matrix} ν_{t} : & = Var (λ_{t} | F_{t - 1}) = ω (1 - ω) m + \sum_{i = 1}^{q} α_{i} (1 - α_{i}) X_{t - i}, \\ σ_{t}^{2} : & = Var (X_{t} | F_{t - 1}) = E (X_{t}^{2} | F_{t - 1}) - {[E (X_{t} | F_{t - 1})]}^{2} = E (λ_{t}^{2} | F_{t - 1}) E (ε_{t}^{2}) - μ_{t}^{2} \\ = [Var (λ_{t} | F_{t - 1}) + {(E (λ_{t} | F_{t - 1}))}^{2}] E (ε_{t}^{2}) - μ_{t}^{2} \\ = (σ^{2} + 1) (ν_{t} + μ_{t}^{2}) - μ_{t}^{2} = (σ^{2} + 1) ν_{t} + σ^{2} μ_{t}^{2} . \end{matrix}

Proposition 1.

The necessary and sufficient condition for the first-order stationarity of

X_{t}

defined in (2) is that all roots of

1 - \sum_{i = 1}^{q} α_{i} z^{i} = 0

should lie outside the unit circle.

Proposition 2.

The necessary and sufficient condition for the second-order stationarity of

X_{t}

defined in (2) is that

(σ^{2} + 1) \sum_{i = 1}^{q} α_{i}^{2} < 1 .

Proofs of Propositions 1 and 2 are similar to the proofs of Theorems 2.1 and 2.2 in Aknouche and Scotto (2022) [14], so we omit the details.

For convenience, we need to specify the distribution of

{ε_{t}}

in (2). First, we let

ε_{t} \sim P (1)

, then

E (ε_{t}) = Var (ε_{t}) = 1

, and this model is denoted by PMthINARCH

(q)

. It is easy to obtain

μ_{t} = ω m + \sum_{i = 1}^{q} α_{i} X_{t - i}, σ_{t}^{2} = 2 ν_{t} + μ_{t}^{2} .

Second, let

ε_{t} \sim G e (p^{*})

. The mean of

ε_{t}

is

(1 - p^{*}) / p^{*} = 1

, so we have

p^{*} = 0.5

and the variance is

Var (ε_{t}) = 2

. This model is denoted by GMthINARCH

(q)

, then we have

μ_{t} = ω m + \sum_{i = 1}^{q} α_{i} X_{t - i}, σ_{t}^{2} = 3 ν_{t} + 2 μ_{t}^{2} .

3. Parameter Estimation

In this section, we will consider the SPMLE and its asymptotic properties, and a simulation study will be conducted to assess the performance of this estimator.

3.1. Saddlepoint Maximum Likelihood Estimation

Let

θ = {(ω, α_{1}, \dots, α_{q})}^{T}

be the unknown parameter vector. Note that according to the condition on

ε_{t}

,

σ^{2}

is no longer an unknown parameter. The maximum likelihood estimator of

θ

was obtained by maximizing the conditional log-likelihood function

l (θ) = \sum_{t = 1}^{n} log P (X_{t} = x_{t} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}),

(3)

giving

\hat{θ} = arg {max}_{θ} l (θ) .

But the above procedure is challenging to implement because it is difficult to give the likelihood function due to the thinning operations.

Now we discuss the SPMLE. The conditional moment generating function of

X_{t}

is

\begin{matrix} E (e^{u X_{t}} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}) & = E (e^{u λ_{t} ε_{t}} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}) \\ = E (e^{u (ω \circ m + \sum_{i = 1}^{q} α_{i} \circ X_{t - i}) ε_{t}} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}) \\ = E (e^{u (ω \circ m) ε_{t}}) \prod_{i = 1}^{q} E (e^{u (α_{i} \circ x_{t - i}) ε_{t}}) . \end{matrix}

Remark 1.

Here we just consider the INARCH model instead of the INGARCH model because for the INGARCH model, the conditional cumulant-generating function of

X_{t}

should be given by

E (e^{u X_{t}} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}) = E (e^{u (ω \circ m + \sum_{i = 1}^{q} α_{i} \circ X_{t - i} + \sum_{j = 1}^{p} β_{j} \circ λ_{t - i}) ε_{t}} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}) .

Notice that

X_{t}

and

λ_{t}

are correlated, it is difficult and complex to show the conditional cumulant-generating function.

Using the binomial theorem

{(a + b)}^{n} = \sum_{k = 0}^{n} C_{n}^{k} a^{n - k} b^{k}

, we have

\begin{matrix} E (e^{u (ω \circ m) ε_{t}}) & = E [E (e^{u (ω \circ m) ε_{t}} | ε_{t})] = E {(ω e^{u ε_{t}} + (1 - ω))}^{m} \\ = E [\sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} e^{u (m - r) ε_{t}}] = \sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} E (e^{u (m - r) ε_{t}}) . \end{matrix}

Similarly, we also have

E (e^{u (α_{i} \circ x_{t - i}) ε_{t}}) = \sum_{r = 0}^{x_{t - i}} C_{x_{t - i}}^{r} {(1 - α_{i})}^{r} α_{i}^{x_{t - i} - r} E (e^{u (x_{t - i} - r) ε_{t}}) .

Therefore, for the PMthINARCH

(q)

model, we have

\begin{matrix} E (e^{u (ω \circ m) ε_{t}}) & = \sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} e^{(e^{u (m - r)} - 1)}, \\ E (e^{u (α_{i} \circ x_{t - i}) ε_{t}}) & = \sum_{r = 0}^{x_{t - i}} C_{x_{t - i}}^{r} {(1 - α_{i})}^{r} α_{i}^{x_{t - i} - r} e^{(e^{u (x_{t - i} - r)} - 1)}, \end{matrix}

while for the GMthINARCH

(q)

model, we have

\begin{matrix} E (e^{u (ω \circ m) ε_{t}}) & = \sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} \frac{1}{2 - e^{u (m - r)}}, \\ E (e^{u (α_{i} \circ x_{t - i}) ε_{t}}) & = \sum_{r = 0}^{x_{t - i}} C_{x_{t - i}}^{r} {(1 - α_{i})}^{r} α_{i}^{x_{t - i} - r} \frac{1}{2 - e^{u (x_{t - i} - r)}} . \end{matrix}

Thus the conditional cumulant-generating function of

X_{t}

is:

\begin{matrix} K_{t} (u) = log [E (e^{u X_{t}} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q})] = log E (e^{u (ω \circ m) ε_{t}}) + \sum_{i = 1}^{q} log E (e^{u (α_{i} \circ x_{t - i}) ε_{t}}) . \end{matrix}

A highly accurate approximation to the conditional mass function of

X_{t}

at

x_{t}

is provided by the saddlepoint approximation:

{\tilde{f}}_{X_{t} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}} (x_{t}) = {(2 π K_{t}^{″} ({\tilde{u}}_{t}))}^{- \frac{1}{2}} exp {K_{t} ({\tilde{u}}_{t}) - {\tilde{u}}_{t} x_{t}},

(4)

where

{\tilde{u}}_{t}

is the unique value of u which satisfies the saddlepoint equation

K_{t}^{'} (u) = x_{t},

with

K_{t}^{'}

and

K_{t}^{″}

represent the first and second order derivatives of

K_{t}

with respect to u. Notice that it is difficult to solve the saddlepoint equation

K_{t}^{'} (u) = x_{t}

analytically; similar to that mentioned in Pedeli et al. (2015) [19], we can use the Newton–Raphson method to solve this equation.

The log-likelihood function (3) can be approximated by summing the logarithms of the corresponding density approximations (4), yielding:

{\tilde{L}}_{n} (θ) = \sum_{t = 1}^{n} {\tilde{l}}_{t} (θ) : = \sum_{t = 1}^{n} log {\tilde{f}}_{X_{t} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}} (x_{t}) .

(5)

The value

θ

maximizing this expression is called the saddlepoint maximum likelihood estimator (SPMLE).

3.2. Asymptotic Properties of the SPMLE

Now we discuss the asymptotic properties of the SPMLE. First we give the first-order Taylor expansion of

K_{t}^{'} (u)

at

u = 0

yields,

K_{t}^{'} (u) = K_{t}^{'} (0) + u K_{t}^{″} (0) + o (u) = μ_{t} (θ) + u σ_{t}^{2} (θ) + o (u),

(6)

where

μ_{t} (θ)

and

σ_{t}^{2} (θ)

are the conditional mean and conditional variance of

X_{t}

. Notice that

{\tilde{u}}_{t}

can be given by

K_{t}^{'} ({\tilde{u}}_{t}) = x_{t}

, so with the Taylor series expansion of

K_{t}^{'} (u)

in (6), we have:

{\tilde{u}}_{t} = \frac{x_{t} - μ_{t} (θ)}{σ_{t}^{2} (θ)} + o (1), t = q + 1, \dots, n .

(7)

Then, we can obtain the second-order Taylor expansion of

K_{t} (u)

at

u = 0

, which is:

K_{t} (u) \approx u K_{t}^{'} (0) + \frac{u^{2}}{2} K_{t}^{″} (0) = u μ_{t} (θ) + \frac{u^{2}}{2} σ_{t}^{2} (θ) .

(8)

Focusing on the exponent of the saddlepoint approximation (4), Equation (8) gives

K_{t} (u) - u x_{t} \approx u (μ_{t} (θ) - x_{t}) + \frac{u^{2}}{2} σ_{t}^{2} (θ) .

Then using Equation (7), we have

K_{t} ({\tilde{u}}_{t}) - {\tilde{u}}_{t} x_{t} \approx - \frac{{[x_{t} - μ_{t} (θ)]}^{2}}{2 σ_{t}^{2} (θ)} .

(9)

Hence, we can derive from (8) and (9) that the first-order saddlepoint approximation to the conditional probability mass function is approximately:

\begin{matrix} {\tilde{f}}_{X_{t} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}} (x_{t}) = {(2 π K_{t}^{″} ({\tilde{u}}_{t}))}^{- \frac{1}{2}} \\ \times exp [- \frac{{(x_{t} - ω m - \sum_{i = 1}^{q} α_{i} x_{t - i})}^{2}}{2 [(σ^{2} + 1) (ω (1 - ω) m + \sum_{i = 1}^{q} α_{i} (1 - α_{i}) x_{t - i}) + σ^{2} {(ω m + \sum_{i = 1}^{q} α_{i} x_{t - i})}^{2}]}] . \end{matrix}

Therefore,

{\tilde{L}}_{n} (θ) = \sum_{t = 1}^{n} {\tilde{l}}_{t} (θ) = \sum_{t = 1}^{n} log {\tilde{f}}_{X_{t} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}} (x_{t})

is the quasi-likelihood function for the estimation of

θ

. To establish the large-sample properties, we have

L_{n} (θ) = \sum_{t = 1}^{n} l_{t} (θ) = \sum_{t = 1}^{n} log f_{X_{t} | X_{t - 1} = x_{t - 1}, \dots, X_{t - q} = x_{t - q}} (x_{t}),

which is the ergodic approximation of

{\tilde{L}}_{n} (θ)

. The first and second derivatives of the quasi-likelihood function are given in the Appendix A. The strong convergence and asymptotic normality for the SPMLE

{\hat{θ}}_{n}

are established in the following theorems.

First of all, the assumptions for Theorems 1 and 2 are listed as follows.

Assumption 1.

The solution of the MthINARCH process is strictly stationary and ergodic.

Assumption 2.

Θ is compact and

θ_{0} \in \overset{°}{Θ}

, where

\overset{°}{Θ}

denotes the interior of Θ. For technical reasons, we assumed the lower and upper values of each component of parameters as

0 < ω_{L} \leq ω \leq ω_{U} \leq 1

and

0 \leq α_{L} \leq α_{i} \leq α_{U} < 1

,

i = 1, \dots, q .

Theorem 1.

Let

{\hat{θ}}_{n}

be a sequence of SPMLEs satisfying

{\hat{θ}}_{n} = arg max_{θ \in Θ} {\tilde{L}}_{n} (θ)

, then under Assumptions 1 and 2,

{\hat{θ}}_{n}

converges to

θ_{0}

almost as surely, as

n \to \infty .

Theorem 2.

Under Assumptions 1 and 2, there exists a sequence of maximizers

{\hat{θ}}_{n}

of

{\tilde{L}}_{n} (θ)

such as that of

n \to \infty

,

\sqrt{n} ({\hat{θ}}_{n} - θ_{0}) \overset{d}{⟶} N (0, Σ^{- 1}),

where

Σ = - E_{θ_{0}} (\frac{\partial^{2} l_{t} (θ_{0})}{\partial θ \partial θ^{T}}),

and Σ is positively definite.

3.3. Simulation Study

In this section, simulation studies of PMthINARCH

(q)

and GMthINARCH

(q)

models for finite sample size are given, where

q = 2 .

Here, we used several combinations to show the performance of SPMLE, and the mean absolute deviation error (MADE)

\frac{1}{s} \sum_{j = 1}^{s} | \hat{θ_{j}} - θ_{j} |

was used as the evaluation criterion; here, s is the number of replications. The sample size is

n = 100, 200, 500

, and the number of replications is

s = 200

. We used the following combinations of

{(ω, α_{1}, α_{2})}^{T}

as the true values to generate the random sample: A1

= {(0.65, 0.4, 0.4)}^{T}

, A2

= {(0.9, 0.5, 0.3)}^{T}

for the PMthINARCH

(2)

model, and B1

= {(0.8, 0.4, 0.4)}^{T}

, B2

= {(0.65, 0.3, 0.5)}^{T}

for the GMthINARCH

(2)

model. Table 1 and Table 2 show the results of these simulations. Notice that as the sample sizes become larger, the MADEs become smaller, and the estimates seem to be close to the true values. Therefore, the SPMLE performs well.

4. A Real Example

Here, we considered the number of tick changes by the minute of the euro to the British pound exchange rate (ExRate for short) on December 12th from 9.00 a.m. to 9.00 p.m. The dataset is available at the website http://www.histdata.com/ (accessed on 17 January 2023). The series comprises of 720 observations with a sample mean of 13.2153 and a sample variance of 224.2498. Obviously, the sample variance is much larger than the sample mean, which shows high overdispersion, and this high overdispersion can also be seen in Figure 1a. Figure 1b,c are the plots of the autocorrelation function (ACF), and the partial autocorrelation function (PACF) means that we know the tick changes are correlated.

We analyzed the data using the PMthINARCH

(3)

model, GMthINARCH

(3)

model, Poisson INAR

(3)

(here denoted by PINAR

(3)

for short) model, and the INARCH

(3)

model. The Poisson INAR model is mentioned in Pedeli et al. (2015) [19], and the SPMLE was used to estimate the parameters. Here, the innovations in the PINAR model were assumed to be Poisson with a mean of one. The INARCH model with a Poisson deviate was proposed by Ferland et al. (2006) [5], and the MLE was used to estimate the parameters. According to Aknouche and Scotto (2022) [14], in real applications, we can set m as the upper integer part of the sample mean. Here the sample mean is 13.2153, so m is set to the value of 14. Table 3 gives the estimates of SPMLE and the values of the Akaike information criterion (AIC) and Bayesian information criterion (BIC). According to Table 3, it is clear to see that the values of AIC and BIC of PMthINARCH

(3)

and GMthINARCH

(3)

are smaller than those of the PINAR

(3)

and INARCH

(3)

models, the values of AIC and BIC of INARCH

(3)

are smaller than those of the PINAR

(3)

model. Moreover, the values of AIC and BIC of PMthINARCH

(3)

are smaller than those of GMthINARCH

(3)

. In summary, the INARCH model performed better than the PINAR model; meanwhile, the PMthINARCH model and GMthINARCH model performed better than the PINAR model and INARCH model.

According to Aknouche and Scotto (2022) [14], the two-stage weighted least squares estimation (2SWLSE) was used to estimate the parameters of the MthINGARCH model. Therefore, to compare the performance of 2SWLSE and SPMLE, and the performance of PMthINARCH, GMthINARCH, and PINAR models, to consider the in-sample and out-of-sample forecasts of these two estimation methods and the three models above, respectively. First, we considered the in-sample forecast. We used all of the observations to estimate the model, and then we could forecast the last 10 observations 711–720, the last 15 observations 706–720, and the last 20 observations 701–720; these three-time horizons of in-sample forecast are denoted by C1, C2, and C3, respectively. Similar to the in-sample forecast process, we also considered the out-of-sample forecast and divided all the observations into three-time horizons: the first one was 1–710 and 711–720, the second one was 1–705 and 706–720, and the third one was 1–700 and 701–720, which are denoted by D1, D2, and D3, respectively.

Here we illustrate the performance of the considered models by comparing the MADEs of each forecast. The MADEs of in-sample forecasts and out-of-sample forecasts for three models with SPMLE are shown in Table 4. The MADEs of the in-sample forecasts and out-of-sample forecasts for the PMthINARCH model with 2SWLSE and SPMLE are shown in Table 5, and the in-sample forecasts and out-of-sample forecasts for the GMthINARCH model with 2SWLSE and SPMLE are shown in Table 6. According to Table 4, the MADEs of PMthINARCH

(3)

and GMthINARCH

(3)

are smaller than those of PINAR

(3)

, Table 5 and Table 6 show that the MADEs of PMthINARCH

(3)

and GMthINARCH

(3)

of SPMLE are smaller than those of 2SWLSE; meanwhile, in these three Tables, the MADEs of in-sample forecasts were smaller than those of out-of-sample forecasts. In summary, the PMthINARCH model and GMthINARCH model were superior to the PINAR model in modeling this real data set, and the PMthINARCH model performed better than the GMthINARCH model. Meanwhile, the performance of SPMLE was better than 2SWLSE for MthINARCH models.

5. Conclusions

In this paper, we modified a multiplicative thinning-based INARCH model. The probability mass function of random variables is provided by saddlepoint approximation. We used the SPMLE to estimate the parameters and obtain the asymptotic distribution of the SPMLE. Moreover, to show the superiority of the MthINARCH models and the SPMLE, we used the PMthINARCH

(q)

process and GMthINARCH

(q)

process for discussion and comparison. The SPMLE performs well in the simulation studies. A real dataset indicates that the PMthINARCH model and the GMthINARCH model are able to describe the overdispersed integer-valued data, and the real data example leads to a superior performance of the MthINARCH models compared with the PINAR and INARCH models. In addition, the results also show a superior performance of SPMLE compared with 2SWLSE.

For further discussion, more research is needed for some aspects. Here we used the Poisson distribution and geometric distribution for

ε_{t}

; however, we could use the negative binomial distribution or some zero-inflated distributions as well. Moreover, we just considered the INARCH model, so the corresponding INGARCH model should be considered as well.

Author Contributions

Conceptualization, F.Z.; methodology, Y.X.; software, Y.X. and Q.L.; validation, Y.X. and Q.L.; formal analysis, Y.X. and Q.L.; investigation, Y.X. and F.Z.; resources, Q.L.; data curation, Y.X. and Q.L.; writing—original draft preparation, Y.X., Q.L. and F.Z.; writing—review and editing, Y.X., Q.L. and F.Z.; visualization, Y.X.; supervision, F.Z.; project administration, F.Z.; funding acquisition, Q.L. and F.Z. All authors have read and agreed to the published version of the manuscript.

Funding

Li’s work is supported by the National Natural Science Foundation of China (No. 12201069), the Natural Science Foundation of Jilin Province (No. 20210101160JC), the Science and Technology Research Project of Education Bureau of Jilin Province (No. JJKH20220820KJ), and Natural Science Foundation Projects of CCNU (CSJJ2022006ZK). Zhu’s work is supported by the National Natural Science Foundation of China (No. 12271206) and the Natural Science Foundation of Jilin Province (No. 20210101143JC).

Data Availability Statement

The dataset is available at the website http://www.histdata.com/ (accessed on 17 January 2023).

Acknowledgments

The authors are very grateful to three reviewers for their constructive suggestions and comments, leading to a substantial improvement in the presentation and contents.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Details of SPMLE

Here, we give the derivatives of

K_{t} (u)

mentioned in Section 3.1 of PMthINARCH

(q)

and GMthINARCH

(q)

. Now we give

K_{t}^{'} (u)

and

K_{t}^{″} (u)

of PMthINARCH

(q)

. In Section 3.1, we have

K_{t} (u) = log E (e^{u (ω \circ m) ε_{t}}) + \sum_{i = 1}^{q} log E (e^{u (α_{i} \circ x_{t - i}) ε_{t}}) = log a_{1} + \sum_{i = 1}^{q} log b_{1},

so the derivatives of

K_{t} (u)

are given by

\begin{matrix} K_{t}^{'} (u) = \frac{c_{1}}{a_{1}} + \sum_{i = 1}^{q} \frac{d_{1}}{b_{1}}, K_{t}^{″} (u) = \frac{e_{1} a_{1} - c_{1}^{2}}{a_{1}^{2}} + \sum_{i = 1}^{q} \frac{f_{1} b_{1} - d_{1}^{2}}{b_{1}^{2}}, \end{matrix}

where

\begin{matrix} a_{1} = \sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} e^{e^{u (m - r)} - 1}, \\ b_{1} = \sum_{r = 0}^{x_{t - i}} C_{x_{t - i}}^{r} {(1 - α_{i})}^{r} α_{i}^{x_{t - i} - r} e^{e^{u (x_{t - i} - r)} - 1}, \\ c_{1} = \sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} e^{u (m - r)} e^{e^{u (m - r)} - 1}, \\ d_{1} = \sum_{r = 0}^{x_{t - i}} C_{x_{t - i}}^{r} {(1 - α_{i})}^{r} α_{i}^{x_{t - i} - r} e^{u (x_{t - i} - r)} e^{e^{u (x_{t - i} - r)} - 1}, \\ e_{1} = \sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} e^{u (m - r)} {(m - r)}^{2} e^{e^{u (m - r)} - 1} [1 + e^{u (m - r)}], \\ f_{1} = \sum_{r = 0}^{x_{t - i}} C_{x_{t - i}}^{r} {(1 - α_{i})}^{r} α_{i}^{x_{t - i} - r} {(x_{t - i} - r)}^{2} e^{u (x_{t - i} - r)} e^{e^{u (x_{t - i} - r)} - 1} [1 + e^{u (x_{t - i} - r)}] . \end{matrix}

Then we give

K_{t}^{'} (u)

and

K_{t}^{″} (u)

of GMthINARCH

(q)

. In Section 3.1, we have

K_{t} (u) = log E (e^{u (ω \circ m) ε_{t}}) + \sum_{i = 1}^{q} log E (e^{u (α_{i} \circ x_{t - i}) ε_{t}}) = log a_{2} + \sum_{i = 1}^{q} log b_{2},

so the derivatives of

K_{t} (u)

are given by

\begin{matrix} K_{t}^{'} (u) = \frac{c_{2}}{a_{2}} + \sum_{t = 1}^{q} \frac{d_{2}}{b_{2}}, K_{t}^{″} (u) = \frac{e_{2} a_{2} - c_{2}^{2}}{a_{2}^{2}} + \sum_{t = 1}^{q} \frac{f_{2} b_{2} - d_{2}^{2}}{b_{2}^{2}}, \end{matrix}

where

\begin{matrix} a_{2} = \sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} \frac{1}{2 - (2 - e^{u (m - r)})}, \\ b_{2} = \sum_{r = 0}^{x_{t - i}} C_{x_{t - i}}^{r} {(1 - α_{i})}^{r} α_{i}^{x_{t - i} - r} \frac{1}{2 - (2 - e^{u (x_{t - i} - r)})}, \\ c_{2} = \frac{1}{4} \sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} (m - r) \frac{e^{u (m - r)}}{{[1 - (1 - \frac{1}{2} e^{u (m - r)})]}^{2}}, \\ d_{2} = \frac{1}{4} \sum_{r = 0}^{x_{t - i}} C_{x_{t - i}}^{r} {(1 - α_{i})}^{r} α_{i}^{x_{t - i} - r} (x_{t - i} - r) \frac{e^{u (x_{t - i} - r)}}{{[1 - (1 - \frac{1}{2} e^{u (x_{t - i} - r)})]}^{2}}, \\ e_{2} = \frac{1}{4} \sum_{r = 0}^{m} C_{m}^{r} {(1 - ω)}^{r} ω^{m - r} {(m - r)}^{2} e^{u (m - r)} \frac{1 + \frac{1}{2} e^{u (m - r)}}{{[1 - (1 - \frac{1}{2} e^{u (m - r)})]}^{3}}, \\ f_{2} = \frac{1}{4} \sum_{r = 0}^{x_{t - i}} C_{x_{t - i}}^{r} {(1 - α_{i})}^{r} α_{i}^{x_{t - i} - r} {(x_{t - i} - r)}^{2} e^{u (x_{t - i} - r)} \frac{1 + \frac{1}{2} e^{u (x_{t - i} - r)}}{{[1 - (1 - \frac{1}{2} e^{u (x_{t - i} - r)})]}^{3}} . \end{matrix}

Appendix A.2. Derivatives of the Quasi-Likelihood Function

The conditional log-quasi-likelihood function

l_{t} (θ)

is continuous on

Θ

: for

1 \leq t \leq n,

\begin{matrix} \frac{\partial l_{t} (θ)}{\partial θ} & = m_{1} \frac{\partial μ_{t} (θ)}{\partial θ} + m_{2} \frac{\partial σ_{t}^{2} (θ)}{\partial θ}, \\ \frac{\partial^{2} l_{t} (θ)}{\partial θ \partial θ^{T}} & = (m_{1} - m_{3}) \frac{\partial^{2} μ_{t} (θ)}{\partial θ \partial θ^{T}} - 2 m_{1} m_{3} \frac{\partial μ_{t} (θ)}{\partial θ} \frac{\partial σ_{t}^{2} (θ)}{\partial θ^{T}} + (m_{2} + \frac{m_{3}^{2}}{2} - m_{1}^{2} m_{3}) \frac{\partial^{2} σ_{t}^{2} (θ)}{\partial θ \partial θ^{T}}, \end{matrix}

where

m_{1} = \frac{X_{t} - μ_{t} (θ)}{σ_{t}^{2} (θ)}, m_{2} = \frac{{(X_{t} - μ_{t} (θ))}^{2} - σ_{t}^{2} (θ)}{2 σ_{t}^{4} (θ)}, m_{3} = \frac{1}{σ_{t}^{2} (θ)} .

Then the first and second derivatives of

μ_{t} (θ)

and

σ_{t}^{2} (θ)

can be easily expressed by

\begin{matrix} \frac{\partial μ_{t} (θ)}{\partial ω} & = m, \frac{\partial μ_{t} (θ)}{\partial α_{i}} = X_{t - i}, \\ \frac{\partial σ_{t}^{2} (θ)}{\partial ω} & = (σ^{2} + 1) (m - 2 ω m) + 2 σ^{2} (m^{2} ω + m \sum_{i = 1}^{q} α_{i} X_{t - i}), \\ \frac{\partial σ_{t}^{2} (θ)}{\partial α_{i}} & = (σ^{2} + 1) (X_{t - i} - 2 α_{i} X_{t - i}) + 2 σ^{2} (m ω X_{t - i} + α_{i} X_{t - i}^{2}), \\ \frac{\partial^{2} μ_{t} (θ)}{\partial ω^{2}} & = 0, \frac{\partial^{2} μ_{t} (θ)}{\partial α_{i}^{2}} = 1, \frac{\partial^{2} μ_{t} (θ)}{\partial ω α_{i}} = 0, \\ \frac{\partial^{2} σ_{t}^{2} (θ)}{\partial ω^{2}} & = - 2 m (σ^{2} + 1) + 2 m^{2} σ^{2}, \frac{\partial^{2} σ_{t}^{2} (θ)}{\partial α_{i}^{2}} = - 2 X_{t - i} (σ^{2} + 1) + 2 X_{t - i}^{2} σ^{2}, \\ \frac{\partial^{2} σ_{t}^{2} (θ)}{\partial ω α_{i}} & = 2 m σ^{2} X_{t - i} . \end{matrix}

Appendix A.3. Proof of Theorem 1

The techniques used here are mainly based on Francq and Zakoïan (2004) [20]. We will establish the following intermediate results:

(i): ${lim}_{n \to \infty} {sup}_{θ \in Θ} |\frac{1}{n} (L_{n} (θ) - {\tilde{L}}_{n} (θ))| = 0 a . s .$
(ii): $E (l_{t} (θ))$ is continuous in $θ$ .
(iii): It exists $t \in Z$ such that $σ_{t}^{2} (θ) = σ_{t}^{2} (θ_{0})$ a.s., then $\Rightarrow θ = θ_{0}$ .
(iv): Any $θ \neq θ_{0}$ has a neighbourhood $V (θ)$ such that

$\underset{n \to \infty}{lim sup} sup_{θ^{*} \in V_{k} (θ) \cap Θ} \frac{1}{n} {\tilde{L}}_{n} (θ^{*}) > E_{θ_{0}} l_{1} (θ_{0}) a . s .$

First we prove (i). Let

a t : = {sup}_{θ \in Θ} | {\tilde{μ}}_{t} (θ) - μ_{t} (θ) |

,

b_{t} : = {sup}_{θ \in Θ} | {\tilde{σ}}_{t}^{2} (θ) - σ_{t}^{2} (θ) | .

Standard arguments from Corollary 2.2 in Aknouche and Francq (2023) [21] show that

a_{t} (1 + X_{t} + {sup}_{θ \in Θ} μ_{t} (θ)) \to 0, a . s .

and

b_{t} (1 + X_{t}^{2} + {sup}_{θ \in Θ} μ_{t}^{2} (θ)) \to 0, a . s ., t \to \infty,

so we obtain the inequality

\begin{matrix} sup_{θ \in Θ} |\frac{1}{n} (L_{n} (θ) - {\tilde{L}}_{n} (θ))| = sup_{θ \in Θ} |\frac{1}{2 n} \sum_{t = 1}^{n} log \frac{{\tilde{σ}}_{t}^{2} (θ)}{σ_{t}^{2} (θ)} + (\frac{{(x_{t} - {\tilde{μ}}_{t})}^{2}}{{\tilde{σ}}_{t}^{2}} - \frac{{(x_{t} - μ_{t} (θ))}^{2}}{σ_{t}^{2}})| \\ \leq sup_{θ \in Θ} |\frac{1}{2 n} \sum_{t = 1}^{n} \frac{{\tilde{σ}}_{t}^{2} (θ) - σ_{t}^{2} (θ)}{σ_{t}^{2} (θ)} + (\frac{{(x_{t} - {\tilde{μ}}_{t} (θ))}^{2}}{{\tilde{σ}}_{t}^{2} (θ)} - \frac{{(x_{t} - μ_{t} (θ))}^{2}}{σ_{t}^{2}})| \\ \leq sup_{θ \in Θ} \frac{1}{2 n} \sum_{t = 1}^{n} \frac{| {\tilde{σ}}_{t}^{2} (θ) - σ_{t}^{2} (θ) |}{σ_{t}^{2} (θ)} + \frac{| {\tilde{μ}}_{t} (θ) - μ_{t} (θ) | | μ_{t} (θ) + {\tilde{μ}}_{t} (θ) - 2 X_{t} |}{{\tilde{σ}}_{t}^{2} (θ)} \\ + \frac{|{\tilde{σ}}_{t}^{2} (θ) - σ_{t}^{2} (θ)| {|X_{t} - μ_{t} (θ)|}^{2}}{σ_{t}^{2} (θ) {\tilde{σ}}_{t}^{2} (θ)} \\ \leq \frac{1}{2 n} \sum_{t = 1}^{n} \frac{2}{σ_{t}^{2} (θ)} a_{t} (1 + X_{t} + sup_{θ \in Θ} μ_{t} (θ)) + \frac{1 + {\tilde{σ}}_{t}^{2} (θ)}{σ_{t}^{2} (θ) {\tilde{σ}}_{t}^{2} (θ)} c_{t} (1 + X_{t}^{2} + sup_{θ \in Θ} μ_{t}^{2} (θ)) . \end{matrix}

The a.s. limit holds because of the Cesàro lemma.

We prove (ii) now. For any

θ \in Θ

, let

V_{η} (θ) = B (θ, η)

be an open ball centered at

θ

with radius

η

,

\begin{matrix} |l_{t} (\tilde{θ}) - l_{t} (θ)| & \leq | σ_{t}^{2} (\tilde{θ}) - σ_{t}^{2} (θ) | |\frac{X_{t}^{2} + μ_{t}^{2} (θ) + σ_{t}^{2} (\tilde{θ})}{σ_{t}^{2} (θ) σ_{t}^{2} (\tilde{θ})}| + \frac{| μ_{t} (\tilde{θ}) - μ_{t} (θ) | | μ_{t} (θ) + μ_{t} (\tilde{θ}) - 2 X_{t} |}{σ_{t}^{2} (\tilde{θ})} . \end{matrix}

Then

\begin{matrix} E (sup_{\tilde{θ \in V_{η} (θ)}} |l_{t} (\tilde{θ}) - l_{t} (θ)|) & \leq ‖ σ_{t}^{2} (\tilde{θ}) - σ_{t}^{2} (θ) ‖_{2} {‖ \frac{X_{t}^{2} + μ_{t}^{2} (θ) + σ_{t}^{2} (\tilde{θ})}{σ_{t}^{2} (θ) σ_{t}^{2} (\tilde{θ})} ‖}_{2} \\ + \frac{‖ μ_{t} (\tilde{θ}) - μ_{t} (θ) ‖_{2} {‖ μ_{t} (θ) + μ_{t} (\tilde{θ}) - 2 X_{t} ‖}_{2}}{σ_{t}^{2} (\tilde{θ})} \to 0, a s η \to 0 . \end{matrix}

Next, we check (iii). By Jensen’s inequality, we have

\begin{matrix} E [l_{t} (θ) - l_{t} (θ_{0})] & = E [E (\frac{1}{2} log \frac{σ_{t}^{2} (θ_{0})}{σ_{t}^{2} (θ)} + \frac{{(x_{t} - μ_{t} (θ_{0}))}^{2}}{2 σ_{t}^{2} (θ_{0})} - \frac{{(x_{t} - μ_{t} (θ))}^{2}}{2 σ_{t}^{2} (θ)} | F_{t - 1})] \\ \leq E [log E (\frac{σ_{t}^{2} (θ_{0})}{σ_{t}^{2} (θ)} | F_{t - 1})] \\ = E (log (1)) = 0 . \end{matrix}

The equality holds if

\frac{σ_{t}^{2} (θ_{0})}{σ_{t}^{2} (θ)} = 1

a.s.

F_{t - 1}

, i.e.,

θ = θ_{0}

.

Then the proof of (iv) is similar to that in the Supplementary Material A.4 in Xu and Zhu (2022) [22]. Here we omit the details.

Appendix A.4. Proof of the Positive Definiteness of Σ

Here, we prove the positive definiteness of

Σ

. By definition of positive definiteness, we need to prove for any

ξ = {(ξ_{0}, ξ_{1}, \dots, ξ_{q})}^{T} \in R^{q + 1},

if

ξ^{T} Σ ξ = 0,

then

ξ = 0

.

\begin{matrix} ξ^{T} Σ ξ & = ξ^{T} E [\frac{1}{2 σ_{t}^{4} (θ_{0})} \frac{\partial σ_{t}^{2} (θ_{0})}{\partial θ} \frac{\partial σ_{t}^{2} (θ_{0})}{\partial θ^{T}} + \frac{1}{σ_{t}^{2} (θ_{0})} \frac{\partial μ_{t} (θ_{0})}{\partial θ} \frac{\partial μ_{t} (θ_{0})}{\partial θ^{T}}] ξ \\ = E [\frac{1}{2 σ_{t}^{4} (θ_{0})} {(ξ^{T} \frac{\partial σ_{t}^{2} (θ_{0})}{\partial θ})}^{2} + \frac{1}{σ_{t}^{2} (θ_{0})} {(ξ^{T} \frac{\partial μ_{t} (θ_{0})}{\partial θ})}^{2}] . \end{matrix}

Suppose the left-hand side is

0,

then under Assumption 1, the expectation in the right-hand side is 0 for any

t \in Z .

Because

σ_{t}^{2} (θ_{0}) > 0,

this expectation is always greater than or equal to

0 .

It equals 0 only when

ξ^{T} \frac{\partial σ_{t}^{2} (θ_{0})}{\partial θ} = 0

and

ξ^{T} \frac{\partial μ_{t} (θ_{0})}{\partial θ} = 0

almost surely. Thus,

ξ^{T} Σ ξ = 0

yields

ξ^{T} \frac{\partial σ_{t}^{2} (θ_{0})}{\partial θ} = 0

and

ξ^{T} \frac{\partial μ_{t} (θ_{0})}{\partial θ} = 0

a.s. for

t \in Z,

and vice versa.

Using vector form of

\frac{\partial σ_{t}^{2} (θ_{0})}{\partial θ}

, we have

{ξ_{a}}^{T} \frac{\partial σ_{t}^{2} (θ_{0})}{\partial θ} = ξ^{T} (\begin{matrix} (σ^{2} + 1) (m - 2 ω m) + 2 σ^{2} (ω m^{2} + m \sum_{i = 1}^{q} α_{i} X_{t - i}) \\ (σ^{2} + 1) (X_{t - 1} - 2 α_{1} X_{t - 1}) + 2 σ^{2} (ω m X_{t - 1} + α_{1} X_{t - 1}^{2}) \\ ⋮ \\ (σ^{2} + 1) (X_{t - q} - 2 α_{q} X_{t - q}) + 2 σ^{2} (ω m X_{t - q} + α_{q} X_{t - q}^{2}) \end{matrix}) .

Suppose the left-hand side is 0 almost surely, then the right-hand side is also 0 almost surely, which can be written as

\begin{matrix} ξ_{0} (σ^{2} + 1) (m - 2 ω m) + 2 σ^{2} ξ_{0} (ω m^{2} + m \sum_{i = 1}^{q} α_{i} X_{t - i}) \\ + ξ_{1} (σ^{2} + 1) (X_{t - 1} - 2 α_{1} X_{t - 1}) + 2 σ^{2} ξ_{1} (ω m X_{t - 1} + α_{1} X_{t - 1}^{2}) + M_{t - 2} = 0 a . s ., \end{matrix}

where

M_{t - 2} = \sum_{k = 2}^{p} ξ_{k} [(σ^{2} + 1) (X_{t - k} - 2 α_{k} X_{t - k}) + 2 σ^{2} (ω m X_{t - k} + α_{k} X_{t - k}^{2})] .

So the coefficients of the above equation must satisfy

\begin{matrix} ξ_{i} (σ^{2} + 1) = 0, 2 σ^{2} ξ_{i} = 0, i = 0, \dots, q . \end{matrix}

For

σ^{2} > 0,

we must have

ξ_{i} = 0, i = 0, \dots, q .

Thus,

ξ = {(ξ_{0}, ξ_{1}, \dots, ξ_{q})}^{T} = 0,

which completes the proof of the positive definiteness of

Σ .

Appendix A.5. Lemmas for the Proof of Theorem 2

Similar to the proof of Theorem 1.2 in Hu (2016) [9], we give some related lemmas for the proof of Theorem 2. According to the derivatives of the quasi-likelihood function, we have

\begin{matrix} \frac{\partial μ_{t} (θ)}{\partial ω} & = m, \\ \frac{\partial σ_{t}^{2} (θ)}{\partial ω} & = (σ^{2} + 1) (m - 2 ω m) + 2 σ^{2} (m^{2} ω + m \sum_{i = 1}^{q} α_{i} X_{t - i}), \\ \leq (σ^{2} + 1) m (1 - 2 ω_{L}) + 2 σ^{2} (m^{2} ω_{U} + m \sum_{i = 1}^{q} α_{U} X_{t - i}), \end{matrix}

thus,

E {(\frac{\partial μ_{t} (θ)}{\partial ω})}^{2} < \infty

and

E {(\frac{\partial σ_{t}^{2} (θ)}{\partial ω})}^{2} < \infty

. Likewise for the other terms of parameters.

Lemma A1.

Under Assumptions 1 and 2, when

n \to \infty

,

\frac{1}{\sqrt{n}} \sum_{t = 1}^{n} \frac{\partial {\tilde{l}}_{t} (θ_{0})}{\partial θ_{i}} \overset{d}{⟶} N (0, Σ), \frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}} \overset{P}{⟶} - Σ .

Proof of Lemma A1.

First, we show that

n^{- 1 / 2} \sum_{t = 1}^{n} |\frac{\partial l_{t} (θ_{0})}{\partial θ_{i}} - \frac{\partial {\tilde{l}}_{t} (θ_{0})}{\partial θ_{i}}| \overset{P}{⟶} 0, n^{- 1} \sum_{t = 1}^{n} |\frac{\partial^{2} l_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}}| \overset{P}{⟶} 0 .

Notice that

{\tilde{μ}}_{t} (θ)

and

{\tilde{σ}}_{t}^{2} (θ)

are stationary approximations of

μ_{t} (θ)

and

σ_{t}^{2} (θ)

, since

X_{t}

is stationary and ergodic, using arguments similar to Proposition 2.1.1 in Straumann (2005) [23], for fixed

θ \in Θ

,

{\tilde{μ}}_{t} (θ)

and

{\tilde{σ}}_{t}^{2} (θ)

,

μ_{t} (θ)

and

σ_{t}^{2} (θ)

are also stationary and ergodic. Hence, similar to the proof of Lemma A2 in Hu and Andrews (2021) [24], it is easy to have

n^{- 1 / 2} \sum_{t = 1}^{n} |\frac{\partial l_{t} (θ_{0})}{\partial θ_{i}} - \frac{\partial {\tilde{l}}_{t} (θ_{0})}{\partial θ_{i}}| \overset{P}{⟶} 0, n^{- 1} \sum_{t = 1}^{n} |\frac{\partial^{2} l_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}}| \overset{P}{⟶} 0 .

Therefore, it suffices to show that

\frac{1}{\sqrt{n}} \sum_{t = 1}^{n} \frac{\partial l_{t} (θ_{0})}{\partial θ} \overset{d}{⟶} N (0, Σ), \frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} l_{t} (θ_{0})}{\partial θ \partial θ^{T}} \overset{P}{⟶} - Σ .

First, we should guarantee that

\begin{matrix} E_{θ_{0}} ∥\frac{\partial l_{t} (θ_{0})}{\partial θ} \frac{\partial l_{t} (θ_{0})}{\partial θ^{T}}∥ < \infty, E_{θ_{0}} ∥\frac{\partial^{2} l_{t} (θ_{0})}{\partial θ \partial θ^{T}}∥ < \infty . \end{matrix}

(A1)

Now we prove the first part of (A1).

\begin{matrix} E_{θ_{0}} {(\frac{\partial l_{t} (θ_{0})}{\partial ω})}^{2} & = E_{θ_{0}} [\frac{1}{2 σ_{t}^{4} (θ_{0})} {(\frac{\partial σ_{t}^{2} (θ_{0})}{\partial ω})}^{2} + \frac{1}{σ_{t}^{2} (θ_{0})} {(\frac{\partial μ_{t} (θ_{0})}{\partial ω})}^{2}] < \infty . \end{matrix}

Similarly, we can prove other terms, thus, the first part of (A1) holds. The proof of the second part of (A1) is similar, here we omit the details.

Under (A1),

\{\frac{\partial l_{t} (θ_{0})}{\partial θ}\}

is a martingale difference sequence with respect to

\{F_{t}\}

, it follows that at

θ = θ_{0}

,

E_{θ_{0}} (\frac{\partial l_{t} (θ_{0})}{\partial θ} | F_{t - 1}) = 0

, so

E_{θ_{0}} (\frac{\partial l_{t} (θ_{0})}{\partial θ}) = 0

. Moreover, we have shown that

Σ = E_{θ_{0}} (\frac{\partial l_{t} (θ_{0})}{\partial θ} \frac{\partial l_{t} (θ_{0})}{\partial θ^{T}})

in Section 3.2. Hence

\frac{1}{\sqrt{n}} \sum_{t = 1}^{n} \frac{\partial {\tilde{l}}_{t} (θ_{0})}{\partial θ} \overset{d}{⟶} N (0, Σ)

holds by the central limit theorem for martingale difference sequence in Billingsley (1961). Similarly, we have

E_{θ_{0}} (\frac{\partial l_{t}^{2} (θ_{0})}{\partial θ \partial θ^{T}}) = - Σ

.

Under Assumption 1,

\frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}} \overset{P}{⟶} - Σ

follows from the ergodic theorem. Thus, Lemma A1 is proved. □

Before showing Lemma A2, we have

{\tilde{T}}_{n} (u) \equiv {\tilde{l}}_{n} (θ_{0} + \frac{u}{\sqrt{n}}) - {\tilde{l}}_{n} (θ_{0}), u \in R^{q + 1},

we use

{\tilde{T}}_{n}

to derive the asymptotic distribution of

{\hat{θ}}_{n}

.

For any

u \in R^{q + 1}

, the Taylor series expansion of

{\tilde{T}}_{n} (u)

at

θ_{0}

is

{\tilde{T}}_{n} (u) = \frac{1}{\sqrt{n}} \sum_{t = 1}^{n} u^{T} \frac{\partial {\tilde{l}}_{t} (θ_{0})}{\partial θ} + \frac{1}{2 n} \sum_{t = 1}^{n} u^{T} \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}} u + \frac{1}{2 n} \sum_{t = 1}^{n} u^{T} [\frac{\partial^{2} {\tilde{l}}_{t} (θ^{*})}{\partial θ \partial θ^{T}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}}] u,

(A2)

where

θ^{*} = θ_{n}^{*} (u)

is on the line segment connecting

θ_{0}

and

θ_{0} + \frac{u}{\sqrt{n}}

. For Euclidean distance

‖ \cdot ‖

and any compact set

K \subset R^{q + 1}

,

{sup}_{u \in K} ‖ θ^{*} - θ_{0} ‖ \to 0

, as

n \to \infty

.

Lemma A2.

Under Assumptions 1 and 2, when

n \to \infty

,

\frac{1}{n} \sum_{t = 1}^{n} [\frac{\partial^{2} {\tilde{l}}_{t} (θ^{*})}{\partial θ \partial θ^{T}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}}] \overset{P}{⟶} 0 .

Proof.

Similar to Lemma A1, for any

1 \leq i, j \leq q + 1

,

\frac{1}{n} \sum_{t = 1}^{n} ∥\frac{\partial^{2} l_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}}∥ \overset{P}{⟶} 0 .

(A3)

Using arguments similar to the proof of Theorem 2.2 of Francq and Zakoïan (2004) [20], it suffices to show

\frac{1}{n} \sum_{t = 1}^{n} [\frac{\partial^{2} l_{t} (θ^{*})}{\partial θ_{i} \partial θ_{j}} - \frac{\partial^{2} l_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}}] \overset{P}{⟶} 0 .

(A4)

By the Taylor series expansion, we have

\frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} l_{t} (θ^{*})}{\partial θ_{i} \partial θ_{j}} = \frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} l_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}} + \frac{1}{n} \sum_{t = 1}^{n} \frac{\partial}{\partial θ_{k}} (\frac{\partial^{2} l_{t} (θ^{* *})}{\partial θ_{i} \partial θ_{j}}) (θ^{*} - θ_{0}),

here

θ^{* *} = θ_{n}^{* *} (u)

is on the line segment connecting

θ_{0}

and

θ^{*}

, such that for any u, we have

‖ θ^{* *} - θ_{0} ‖ \to 0

a . s .,

n \to \infty

.

From (A2),

‖ θ^{*} - θ_{0} ‖ \to 0

a . s

, so

\frac{1}{n} \sum_{t = 1}^{n} \frac{\partial}{\partial θ_{k}} (\frac{\partial^{2} l_{t} (θ^{* *})}{\partial θ_{i} \partial θ_{j}}) (θ^{*} - θ_{0}) \to 0, a . s .

if

\underset{n \to \infty}{lim sup} ∥\frac{1}{n} \sum_{t = 1}^{n} \frac{\partial}{\partial θ_{k}} (\frac{\partial^{2} l_{t} (θ^{* *})}{\partial θ_{i} \partial θ_{j}})∥ < \infty, a . s .

(A5)

Then we have

\frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} l_{t} (θ^{*})}{\partial θ_{i} \partial θ_{j}} \to \frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} l_{t} (θ_{0})}{\partial θ_{i} \partial θ_{j}} a . s .,

so (A4) is proved.

Using arguments similar to the proof of Theorem 2.2 of Francq and Zakoïan (2004) [20], there exists a neighborhood

ν (θ_{0})

, that

E_{θ_{0}} sup_{θ \in ν (θ_{0}) \cap Θ} ∥\frac{\partial}{\partial θ_{k}} (\frac{\partial^{2} l_{t} (θ)}{\partial θ_{i} \partial θ_{j}})∥ < \infty, sup_{θ \in ν (θ_{0})} ∥\frac{1}{n} \sum_{t = 1}^{n} [\frac{\partial^{2} l_{t} (θ)}{\partial θ_{i} \partial θ_{j}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ)}{\partial θ_{i} \partial θ_{j}}]∥ \overset{P}{⟶} 0 .

(A6)

Therefore, by the ergodic theorem, we have

\begin{matrix} \underset{n \to \infty}{lim sup} ∥\frac{1}{n} \sum_{t = 1}^{n} \frac{\partial}{\partial θ_{k}} (\frac{\partial^{2} l_{t} (θ^{* *})}{\partial θ_{i} \partial θ_{j}})∥ & \leq \underset{n \to \infty}{lim sup} \frac{1}{n} \sum_{t = 1}^{n} sup_{θ \in ν (θ_{0}) \cap Θ} ∥\frac{\partial}{\partial θ_{k}} (\frac{\partial^{2} l_{t} (θ)}{\partial θ_{i} \partial θ_{j}})∥ \\ = E_{θ_{0}} sup_{θ \in ν (θ_{0}) \cap Θ} ∥\frac{\partial}{\partial θ_{k}} (\frac{\partial^{2} l_{t} (θ)}{\partial θ_{i} \partial θ_{j}})∥ < \infty, \end{matrix}

so (A5) is proved.

In view of (A3), (A4) and (A6), we obtain Lemma A2. □

Lemma A3.

For any compact set

K \in R^{q + 1}

and any

ε > 0

,

lim_{σ \to 0} \underset{n \to \infty}{lim sup} P (sup_{u, v \in K, ‖ u - v ‖ < σ} |{\tilde{T}}_{n} (u) - {\tilde{T}}_{n} (v)| \geq ε) = 0 .

Proof.

For any

ϵ > 0,

by (A2) we have

\begin{matrix} lim_{δ \to 0} \underset{n \to \infty}{lim sup} P (sup_{u, v \in K, ‖ u - v ‖ < δ} |{\tilde{T}}_{n} (u) - {\tilde{T}}_{n} (v)| \geq ε) \\ \leq lim_{δ \to 0} \underset{n \to \infty}{lim sup} P (sup_{u, v \in K, ‖ u - v ‖ < δ} |\frac{1}{\sqrt{n}} \sum_{t = 1}^{n} {(u - v)}^{T} \frac{\partial {\tilde{l}}_{t} (θ_{0})}{\partial θ}| \geq \frac{ϵ}{3}) \\ + lim_{δ \to 0} \underset{n \to \infty}{lim sup} P (sup_{u, v \in K, ‖ u - v ‖ < δ} |\frac{1}{n} (\sum_{t = 1}^{n} u^{T} \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}} u - \sum_{t = 1}^{n} v^{T} \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}} v)| \geq \frac{2 ϵ}{3}) \\ + lim_{δ \to 0} \underset{n \to \infty}{lim sup} P \{sup_{u, v \in K, ‖ u - v ‖ < δ} |\frac{1}{n} [\sum_{t = 1}^{n} u^{T} (\frac{\partial^{2} {\tilde{l}}_{t} (θ^{*})}{\partial θ \partial θ^{T}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}}) u \\ - \sum_{t = 1}^{n} v^{T} (\frac{\partial^{2} {\tilde{l}}_{t} (θ^{*})}{\partial θ \partial θ^{T}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}}) v]| \geq \frac{2 ϵ}{3}\} . \end{matrix}

Because of Lemmas A1 and A2, we have

\frac{1}{\sqrt{n}} \sum_{t = 1}^{n} \frac{\partial {\tilde{l}}_{t} (θ_{0})}{\partial θ} = O_{p} (1), \frac{1}{n} \sum_{t = 1}^{n} \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}} = O_{p} (1),

\frac{1}{n} \sum_{t = 1}^{n} [\frac{\partial^{2} {\tilde{l}}_{t} (θ^{*})}{\partial θ \partial θ^{T}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}}] = o_{p} (1),

where

O_{p} (1)

and

o_{p} (1)

for vector and matrix means

O_{p} (1)

and

o_{p} (1)

for every elements. By the compactness of

K,

we have

lim_{δ \to 0} \underset{n \to \infty}{lim sup} P (sup_{u, v \in K, ‖ u - v ‖ < δ} |\frac{1}{\sqrt{n}} \sum_{t = 1}^{n} {(u - v)}^{T} \frac{\partial {\tilde{l}}_{t} (θ_{0})}{\partial θ}| \geq \frac{ϵ}{3}) = 0,

lim_{δ \to 0} \underset{n \to \infty}{lim sup} P (sup_{u, v \in K, ‖ u - v ‖ < δ} |\frac{1}{n} (\sum_{t = 1}^{n} u^{T} \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}} u - \sum_{t = 1}^{n} v^{T} \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}} v)| \geq \frac{2 ϵ}{3}) = 0,

\begin{matrix} lim_{δ \to 0} \underset{n \to \infty}{lim sup} P & \{sup_{u, v \in K, ‖ u - v ‖ < δ} |\frac{1}{n} [\sum_{t = 1}^{n} u^{T} (\frac{\partial^{2} {\tilde{l}}_{t} (θ^{*})}{\partial θ \partial θ^{T}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}}) u \\ - \sum_{t = 1}^{n} v^{T} (\frac{\partial^{2} {\tilde{l}}_{t} (θ^{*})}{\partial θ \partial θ^{T}} - \frac{\partial^{2} {\tilde{l}}_{t} (θ_{0})}{\partial θ \partial θ^{T}}) v]| \geq \frac{2 ϵ}{3}\} = 0, \end{matrix}

which completes our proof. □

Appendix A.6. Proof of Theorem 2

Proof.

Let

T (u) = u^{T} N (0, Σ) - \frac{1}{2} u^{T} Σ u

, where N is a multivariate Gaussian random vector with mean 0 and covariance matrix

Σ

. By Lemmas A1 and A2, for any

u \in R^{q + 1}

and

n \to \infty

, the finite dimensional distributions of

{\tilde{T}}_{n}

converge to those of T:

{\tilde{T}}_{n} (u) \to T (u)

.

By Lemma A3, similar to Hu (2016) [9],

{\tilde{T}}_{n} (u)

is tight on the continuous function space

C (K)

for any compact set

K \in R^{q + 1}

. So by Theorem 7.1 in Billingsley (1999) [25],

{\tilde{T}}_{n} (\cdot) \to T (\cdot)

on

C (K)

. From Appendix A.4 and Lemma A1,

Σ

is positive finite and invertible, meanwhile,

T (\cdot)

is concave with the unique maximum

Σ^{- 1} N (0, Σ) = N (0, Σ^{- 1})

.

{\tilde{T}}_{n} (\cdot)

is maximized at

u_{max} = \sqrt{n} ({\hat{θ}}_{n} - θ_{0})

. Thus, the result of Theorem 2 can be proved by the proof of Lemma 2.2 and Remark 1 in Davis et al. (1992) [26]. □

References

McKenzie, E. Some simple models for discrete variate time series. Water Resour. Bull. 1985, 21, 645–650. [Google Scholar] [CrossRef]
Al-Osh, M.A.; Alzaid, A.A. First-order integer-valued autoregressive (INAR(1)) process. J. Time Ser. Anal. 1987, 8, 261–275. [Google Scholar] [CrossRef]
Al-Osh, M.A.; Alzaid, A.A. Integer-valued moving average (INMA) process. Stat. Pap. 1988, 29, 281–300. [Google Scholar] [CrossRef]
McKenzie, E. Some ARMA models for dependent sequences of Poisson counts. Adv. Appl. Probab. 1988, 20, 822–835. [Google Scholar] [CrossRef]
Ferland, R.; Latour, A.; Oraichi, D. Integer-valued GARCH process. J. Time Ser. Anal. 2006, 27, 923–942. [Google Scholar] [CrossRef]
Steutel, F.W.; van Harn, K. Discrete analogues of self-decomposability and stability. Ann. Probab. 1979, 7, 893–899. [Google Scholar] [CrossRef]
Qian, L.; Zhu, F. A new minification integer-valued autoregressive process driven by explanatory variables. Aust. N. Z. J. Stat. 2022, 64, 478–494. [Google Scholar] [CrossRef]
Huang, J.; Zhu, F.; Deng, D. A mixed generalized Poisson INAR model with applications. J. Stat. Comput. Simul. 2023, forthcoming. [Google Scholar] [CrossRef]
Hu, X. Volatility Estimation for Integer-Valued Financial Time Series. Ph.D. Thesis, Northwestern University, Evanston, IL, USA, 2016. [Google Scholar]
Liu, M.; Zhu, F.; Zhu, K. Modeling normalcy-dominant ordinal time series: An application to air quality level. J. Time Ser. Anal. 2022, 43, 460–478. [Google Scholar] [CrossRef]
Weiß, C.H.; Zhu, F.; Hoshiyar, A. Softplus INGARCH models. Stat. Sin. 2022, 32, 1099–1120. [Google Scholar] [CrossRef]
Weiß, C.H. An Introduction to Discrete-Valued Time Series; John Wiley & Sons: Chichester, UK, 2018. [Google Scholar]
Davis, R.A.; Fokianos, K.; Holan, S.H.; Joe, H.; Livsey, J.; Lund, R.; Pipiras, V.; Ravishanker, N. Count time series: A methodological review. J. Am. Stat. Assoc. 2021, 116, 1533–1547. [Google Scholar] [CrossRef]
Aknouche, A.; Scotto, M. A multiplicative Thinning-Based Integer-Valued GARCH Model. Working Paper. 2022. Available online: https://mpra.ub.uni-muenchen.de/112475 (accessed on 17 January 2023).
Daniels, H.E. Saddlepoint approximations in statistics. Ann. Math. Stat. 1954, 25, 631–650. [Google Scholar] [CrossRef]
Field, C.; Ronchetti, E. Small sample asymptotics. In Institute of Mathematical Statistics Lecture Notes—Monograph Series; Institute of Mathematical Statistics: Hayward, CA, USA, 1990. [Google Scholar]
Jensen, J.L. Saddlepoint Approximations; Oxford University Press: Oxford, UK, 1995. [Google Scholar]
Butler, R.W. Saddlepoint Approximations with Applications; Cambridge University Press: Cambridge, UK, 2007. [Google Scholar]
Pedeli, X.; Davison, A.C.; Fokianos, K. Likelihood estimation for the INAR(p) model by saddlepoint approximation. J. Am. Stat. Assoc. 2015, 110, 1229–1238. [Google Scholar] [CrossRef]
Francq, C.; Zakoïan, J.M. Maximum likelihood estimation of pure GARCH and ARMA-GARCH processes. Bernoulli 2004, 10, 605–637. [Google Scholar] [CrossRef]
Aknouche, A.; Francq, C. Two-stage weighted least squares estimator of the conditional mean of observation-driven time series models. J. Econom. 2023. forthcoming. [Google Scholar] [CrossRef]
Xu, Y.; Zhu, F. A new GJR-GARCH model for Z-valued time series. J. Time Ser. Anal. 2022, 43, 490–500. [Google Scholar] [CrossRef]
Straumann, D. Estimation in Conditionally Heteroscedastic Time Series Models; Springer: Berlin, Germany, 2005. [Google Scholar]
Hu, X.; Andrews, B. Integer-valued asymmetric GARCH modeling. J. Time Ser. Anal. 2021, 42, 737–751. [Google Scholar] [CrossRef]
Billingsley, P. Convergence of Probability Measures, 2nd ed.; Wiley: New York, NY, USA, 1999. [Google Scholar]
Davis, R.A.; Knight, K.; Liu, J. M-estimation for autoregressions with infinite variance. Stoch. Process. Their Appl. 1992, 40, 145–180. [Google Scholar] [CrossRef] [Green Version]

Figure 1. (a) The plot of integer-valued series of ExRate. (b) The plot of ACF of observations. (c) The plot of PACF of observations.

Table 1. Mean and MADE of estimates for PMthINARCH

(2)

model with SPMLE.

Table 1. Mean and MADE of estimates for PMthINARCH

(2)

model with SPMLE.

Model				$ω$	$α_{1}$	$α_{2}$
A1	m = 3	n = 100	Mean	0.6069	0.5356	0.3569
		n = 100	MADE	0.3681	0.2866	0.2510
		n = 200	Mean	0.5722	0.5026	0.3952
		n = 200	MADE	0.3557	0.2434	0.2243
		n = 500	Mean	0.6436	0.4888	0.4140
		n = 500	MADE	0.2724	0.1287	0.1005
A2	m = 8	n = 100	Mean	0.7782	0.5076	0.4750
		n = 100	MADE	0.2533	0.2752	0.3007
		n = 200	Mean	0.7935	0.5161	0.4701
		n = 200	MADE	0.2318	0.2527	0.2778
		n = 500	Mean	0.8703	0.5170	0.4677
		n = 500	MADE	0.1752	0.2155	0.2390

Table 2. Mean and MADE of estimates for GMthINARCH

(2)

model with SPMLE.

Table 2. Mean and MADE of estimates for GMthINARCH

(2)

model with SPMLE.

Model				$ω$	$α_{1}$	$α_{2}$
B1	m = 4	n = 100	Mean	0.7821	0.2930	0.2870
		n = 100	MADE	0.1195	0.1499	0.1766
		n = 200	Mean	0.8190	0.3611	0.3185
		n = 200	MADE	0.1121	0.1425	0.1640
		n = 500	Mean	0.8456	0.3610	0.3298
		n = 500	MADE	0.0601	0.1331	0.1414
B2	m = 6	n = 100	Mean	0.4718	0.2086	0.3811
		n = 100	MADE	0.1965	0.1466	0.1463
		n = 200	Mean	0.5186	0.2632	0.5080
		n = 200	MADE	0.1607	0.1198	0.1412
		n = 500	Mean	0.5468	0.2874	0.4896
		n = 500	MADE	0.1415	0.1050	0.0770

Table 3. Estimation results: AIC and BIC values for PMthINARCH

(3)

, GMthINARCH

(3)

, PINAR

(3)

and INARCH

(3)

models.

Table 3. Estimation results: AIC and BIC values for PMthINARCH

(3)

, GMthINARCH

(3)

, PINAR

(3)

and INARCH

(3)

models.

PMthINARCH(3)	$ω$	$α_{1}$	$α_{2}$	$α_{3}$	AIC	BIC
PMthINARCH(3)	0.3242	0.5214	0.1945	0.0842	1395.296	1413.613
GMthINARCH(3)	$ω$	$α_{1}$	$α_{2}$	$α_{3}$	AIC	BIC
GMthINARCH(3)	0.4904	0.2532	0.2155	0.2392	1402.472	1420.789
PINAR(3)	$α_{1}$	$α_{2}$	$α_{3}$		AIC	BIC
PINAR(3)	0.1335	0.4116	0.3901		1572.806	1586.544
INARCH(3)	$ω$	$α_{1}$	$α_{2}$	$α_{3}$	AIC	BIC
INARCH(3)	8.5670	0.1140	0.1379	0.1009	1524.638	1542.955

Table 4. MADEs of in-sample forecasts and out-of-sample forecasts for PMthINARCH

(3)

, GMthINARCH

(3)

, and PINAR

(3)

models with SPMLE.

Table 4. MADEs of in-sample forecasts and out-of-sample forecasts for PMthINARCH

(3)

, GMthINARCH

(3)

, and PINAR

(3)

models with SPMLE.

Methods of Forecast		PMthINARCH	GMthINARCH	PINAR
In-sample	C1	15.30	16.80	17.40
	C2	15.87	17.67	18.40
	C3	16.65	20.70	21.90
Out-of-sample	D1	17.50	17.70	22.50
	D2	19.47	19.80	23.80
	D3	20.50	25.25	27.50

Table 5. MADEs of in-sample forecasts and out-of-sample forecasts for PMthINARCH

(3)

model with SPMLE and 2SWLSE.

Table 5. MADEs of in-sample forecasts and out-of-sample forecasts for PMthINARCH

(3)

model with SPMLE and 2SWLSE.

Methods of Forecast		SPMLE	2SWLSE
In-sample	C1	15.30	16.20
	C2	15.87	17.20
	C3	16.65	18.55
Out-of-sample	D1	17.50	18.60
	D2	19.47	21.67
	D3	20.50	22.70

Table 6. MADEs of in-sample forecasts and out-of-sample forecasts for GMthINARCH

(3)

model with SPMLE and 2SWLSE.

Table 6. MADEs of in-sample forecasts and out-of-sample forecasts for GMthINARCH

(3)

model with SPMLE and 2SWLSE.

Methods of Forecast		SPMLE	2SWLSE
In-sample	C1	16.80	17.20
	C2	17.67	18.07
	C3	20.70	21.05
Out-of-sample	D1	17.70	19.90
	D2	19.80	22.87
	D3	25.25	26.50

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Xu, Y.; Li, Q.; Zhu, F. A Modified Multiplicative Thinning-Based INARCH Model: Properties, Saddlepoint Maximum Likelihood Estimation, and Application. Entropy 2023, 25, 207. https://doi.org/10.3390/e25020207

AMA Style

Xu Y, Li Q, Zhu F. A Modified Multiplicative Thinning-Based INARCH Model: Properties, Saddlepoint Maximum Likelihood Estimation, and Application. Entropy. 2023; 25(2):207. https://doi.org/10.3390/e25020207

Chicago/Turabian Style

Xu, Yue, Qi Li, and Fukang Zhu. 2023. "A Modified Multiplicative Thinning-Based INARCH Model: Properties, Saddlepoint Maximum Likelihood Estimation, and Application" Entropy 25, no. 2: 207. https://doi.org/10.3390/e25020207

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Modified Multiplicative Thinning-Based INARCH Model: Properties, Saddlepoint Maximum Likelihood Estimation, and Application

Abstract

1. Introduction

2. A Multiplicative Thinning-Based INARCH Model

3. Parameter Estimation

3.1. Saddlepoint Maximum Likelihood Estimation

3.2. Asymptotic Properties of the SPMLE

3.3. Simulation Study

4. A Real Example

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Appendix A.1. Details of SPMLE

Appendix A.2. Derivatives of the Quasi-Likelihood Function

Appendix A.3. Proof of Theorem 1

Appendix A.4. Proof of the Positive Definiteness of Σ

Appendix A.5. Lemmas for the Proof of Theorem 2

Appendix A.6. Proof of Theorem 2

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI