1. Introduction
By allowing the equilibrium error to follow a fractionally integrated process, fractional cointegration constitutes a useful extension of classical cointegration. It has received considerable attention in the statistics, finance and econometric literature. There are several notions of (fractional) cointegration for a
p-dimensional time series
(see Engle and Granger (1987) [
1], Johansen (1996) [
2], Flôres and Szafarz (1996) [
3] and Robinson and Yajima (2002) [
4] among others). In the definition studied in Robinson and Yajima (2002) [
4], a
p-vector
is partitioned into several sub-vectors such that elements in each sub-vector have the same integration order. Furthermore,
is said to be (fractionally) cointegrated if a cointegration exists in any of the sub-vectors. Under this setting, partitioning
requires testing for the homogeneity of integration orders of multiple time series, which has attracted much interest. Current procedures usually assume stationarity and invertibility. For example, Heyde and Gay (1993) [
5] and Hosoya (1997) [
6] investigate this problem based on a parametric setting, and Robinson (1995) [
7] and Lobato (1996 and 1999) [
8,
9] study the problem using the semiparametric framework. When cointegration exists or the time series becomes nonstationary, some of these tests become invalid.
Robinson and Yajima (2002) [
4] construct a single-test statistic that is valid in the presence of cointegration for testing the homogeneity of the fractional integration orders of multiple (asymptotically) stationary and invertible time series. They propose estimating the fractional integration order using the local Whittle likelihood method and introduce a user-chosen number to deal with the inversion of an asymptotically singular matrix. Nielsen and Shimotsu (2007) [
10] extend this test statistic to accommodate both (asymptotically) stationary and nonstationary time series by applying the exact local Whittle likelihood method of Shimotsu and Phillips (2005) [
11]. The simulation results in Nielsen and Shimotsu (2007) [
10] show that the test statistic is sensitive to the choice of the user chosen number, which is assumed to satisfy certain conditions. Hualde (2013) [
12] proposes a residual-based test, which covers the nonstationary and noninvertible series, and is valid irrespective of whether cointegration exists. Although this test is developed for a bivariate series, extending it to the multivariate case is non-trivial because multiple comparisons are needed when high-dimensional series are involved. There are two ways to extend the Hualde (2013) [
12] result. The first involves testing the equality of each pair of integration orders, which requires
simple tests for a
p-dimensional series. When
p is large, this test procedure becomes computationally intensive. The second extension is to explore the possibility of a one-step single test, which is pursued here.
In this paper, a residual-based testing procedure for the equality of integration orders of a multiple fractionally integrated process is proposed. The test encompasses both the stationary/nonstationary and invertible/noninvertible situations, and is valid even when the time series is cointegrated. The procedure is computationally feasible because it is a one-step test without inverting ill-conditioned matrices under cointegration. The test can be computed very fast even when dealing with a large p. The test statistic converges to a standard normal distribution under the null hypothesis that all integration orders are equal, and diverges when there are different integration orders.
This paper is organized as follows. In
Section 2, the testing procedure and asymptotic theory are presented. Empirical sizes and powers of the proposed test are given via a Monte Carlo study in
Section 3.
Section 4 concludes the paper.
2. Integration Orders
Consider the following
p-dimensional time series
, with prime denoting transposition and
,
where
is the indicator function,
,
L is the lag operator, and
is a vector of zero mean covariance stationary processes. Note that the series
is nonstationary for
and “asymptotically stationary” for
. By Taylor’s expansion,
, where
. If
α is not a negative integer, then
. When
α is a negative integer, then
for
and
becomes the usual formula of differencing with integer orders. The symbol
is used to represent the Euclidean norm and
means that
converge to a constant or converge in distribution to a random variable as
n goes to ∞.
Assumption 1. Consider the process with . Assume that- 1.1.
;
- 1.2.
are vectors with mean zero, positive definite covariance matrix Ω and for some , where .
- 1.3.
, where is the spectral density matrix of and is the element of .
Assumption 1 is mild because it is satisfied by the usual stationary and invertible autoregressive moving average (ARMA) processes. This is a common assumption for applying the functional limit theorem of Marinucci and Robinson (2000) [
13], and it has appeared in a similar form as Assumptions A–C of Marmol and Velasco (2004) [
14], Assumption A of Hualde (2013) [
12] and Assumption 1 of Wang, Wang and Chan (2015) [
15]. In Particular, the moment condition in Assumption 1.2 is discussed by Johansen and Nielsen (2012) [
16]. As pointed out in Wang, Wang and Chan (2015) [
15], Assumption 1.1 ensures that the limiting process of the partial sum of
has nondegenerated finite-dimensional distributions. Assumption 1.1 implies that
is
,
.
Under Assumption 1, model (
1) means that all
are type-II fractionally integrated processes. Furthermore, based on the fractional cointegration definition given in Robinson and Yajima (2002) [
4], if the integration orders of
are the same and there exists a non-zero linear combination
that is
, then the
p-dimensional time series
is said to be cointegrated. Furthermore, any multiple time series containing
as a sub-vector is also said to be cointegrated.
To test whether all of the are the same, we need to estimate precisely. Thus, the following assumptions are introduced.
Assumption 2. Under both the null and alternative hypotheses,- 2.1.
there exists a positive constant and estimates of , respectively, such thatand there exists , - 2.2.
Letting be an estimate of , then , where stands for the convergence in probability.
Assumption 2 is very mild, as condition (
2) is satisfied if
are optimizers of the corresponding functions over compact sets.
can be estimated by semiparametric methods (see, for example, the log periodogram estimate of Geweke and Porter-Hudak (1983) [
17] studied by Hurvich et al. (1998) [
18] or the narrow-band Gaussian or Whittle estimate introduced by Künsch (1987) [
19] and studied in Robinson (1995) [
7] and Lobato (1999) [
9]. Equation (3) is satisfied by many estimation methods, such as that used in Beran (1995) [
20] and Tanaka (1999) [
21]. As pointed out by Hualde and Velasco (2008) [
22], Equation (3) is satisfied if
is estimated from
using the usual parametric or semiparametric methods. For example, the Whittle pseudo-maximum likelihood estimation proposed by Velasco and Robinson (2000) [
23] satisfies (3). In particular, if a parametric structure is imposed on
, then a
-consistent estimator results by means of a multivariate extension of Robinson (2005) [
24]. Assumption 2.2 is quite common and is satisfied by many classic semiparametric or nonparametric estimates. Actually, a stricter condition on the convergence rate of
(
, with
χ being a positive constant) is used in many articles, such as Hualde and Robinson (2006) [
25], Hualde and Robinson (2010) [
26], Hualde and Velasco (2008) [
22] and Wang (2008) [
27], among others. In particular, Hualde and Robinson (2006) [
25] discuss the convergence rate of some estimates of
f, including a weighted periodogram estimate that satisfies Assumption 2.2. Hualde and Velasco (2008) [
22] point out that the nonparametric estimate of
introduced in their paper satisfies Assumption 2.2. Once
is estimated, the nonparametric estimator of
can be based on the weighted averages of the periodogram of the proxy
, where
.
Let
be a sequence such that
Let
,
and
where
,
and
. Furthermore, for
, let
. Clearly,
is the entire sample space with
.
Defining
and
, we denote:
as the test for
against the alternative
: there exists at least a pair of
such that
.
Theorem 1. Letting Assumptions 1 and 2 hold, is defined in (1), and then under and under , where stands for convergence in distribution as .
Remark 1. Denote the set of indices of the maxima of as , and let be the smallest index of the maxima, that is, . Furthermore, let where is the unit vector that equals one at the -th coordinate and zero otherwise. Then, it is shown in the proof of Theorem 1 that .
Remark 2. The vector can also be set as a vector of constants: , which satisfies . As in probability, with probability 1. However, with unknown, it is not guaranteed that diverges under at a rate as fast as that specified in Theorem 1. Wang (2008) [27] shows that different pre-determined may lead to different divergence rates. Remark 3. As pointed out in Remark 2, the choice of has an influence on the diverging speed of . From the proof of Theorem 1, to get the theoretical diverging speed of as in Theorem 1, define by Equations (4) and (5). Then, when , with being the smallest index of the maxima of . Consequently, the denominator of converges to . Similar to the analysis in Hualde (2013) [12] and Wang, Wang and Chan (2015) [15], it is natural to replace condition (4) by setting , in which case converges to a random limit under . Furthermore, the limits of the numerator and denominator of are dependent, which complicates analysis of the asymptotic distribution of . From the definition of , it is obvious that the power of the proposed test with is superior to that of tests with other choices of . However, when the sample size , the powers of different cases will become the same. In practice, or are two possible choices. In particular, if the parametric method in Hualde and Robinson (2011) [28] is used, , then we can set .
Remark 4. If is cointegrated with , , then would be singular. In this situation, most of the tests in the literature involve the inverse of and become invalid under . However, the proposed test still works in the presence of cointegration. As by Assumption 1, and as mentioned in Remark 1, we have . Furthermore, as shown in Theorem 1, converges to in probability. Then, is positive with probability 1, and remains valid under cointegration.
3. Simulation
To assess the performance of our testing procedure, we conduct two Monte Carlo experiments. For both experiments, we generate
as in (1) with
being a three-dimensional white noise with
,
for
,
=0.5. We compute
parametrically, which means
are estimated as in Hualde and Robinson (2011) [
28] and
is estimated by
.
For the first experiment, using 10,000 replications and 3 different sample sizes
, we compute the proportion of rejecting
for nominal size
, 0.05, and 0.1 with different combinations of
. Letting
, we consider
. To investigate the sensitivity of the choice of
, we present the result for
with
in
Table 1.
First, consider the sizes, that is,
. We observe that for
,
is oversized and the empirical sizes of case
and
are very close to the nominal sizes. As
n increases, the empirical sizes under all scenarios approach the nominal sizes as expected. We also examine the power for
and
. It can be seen that the empirical power increases as
n and
ϕ increase, and that
performs very well for all choices of
. As expected, a smaller
leads to better power, so
has the best power and
has better power than
. As
ϕ increases, the difference decreases substantially, and it is clear that for
, the powers of all
are almost the same. One explanation is that when
ϕ is large enough,
become relatively small compared with
ϕ, leading to the same
. As ARMA models are common in modeling stationary time series, autoregressive fractionally integrated moving averaging (ARFIMA) models constitute a reasonable approximation to
when the parametric method in Hualde and Robinson (2011) [
28] is considered. In practice, if there is insufficient information about the true model, a general ARFIMA
model is entertained first and a model selection procedure based on some information criteria is conducted to choose
and
.
For the second experiment, we conduct a simulation to compare the proposed test
with the test in Nielsen and Shimotsu (2007) [
10]:
where
m is the bandwidth parameter;
is the vector of integration orders of
;
o is the Hadamard product;
is the
-dimensional identity matrix;
, with
ι being the
-vector of ones;
is a positive sequence satisfying certain assumptions;
G is the spectral density matrix of the
δ’th differenced process around the origin; and
D is the diagonal matrix of
G. Using 5,000 replications and 3 different sample sizes
n = 128, 256 and 512, we report the rejection frequencies of
with
, as well as
with bandwidth parameter
and two choices of
, that is
and
in
Table 2. Here,
denotes the largest integer smaller than or equal to
z. The fractional integration order
δ is estimated by the exact local Whittle likelihood for
.
We find that all of the three tests are oversized, and that their empirical powers increase when ϕ increases. However, the empirical powers and empirical sizes of do not change much when the sample size changes from 128 to 512, while those of improve significantly when n increases.
We first compare the simulation results of
with
and
. It is obvious that
is sensitive to the choice of
:
works reasonably well for
and
over-rejects substantially for
. The test
is oversized for both
and
, and
has a better empirical size and
better empirical power. This phenomenon is also reported in Nielsen and Shimotsu (2007) [
10].
We then compare with and find that, for all sample sizes n, has much better empirical sizes than for both and . The empirical power of is not as good as that of when the sample size is relatively small (128 and 256). However, as the sample size increases to 512, the empirical power of becomes superior to that of .