Next Article in Journal
An Empirical Study on the Impact of Basel III Standards on Banks’ Default Risk: The Case of Luxembourg
Next Article in Special Issue
The Solvency II Standard Formula, Linear Geometry, and Diversification
Previous Article in Journal
Modeling NYSE Composite US 100 Index with a Hybrid SOM and MLP-BP Neural Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Power and Size Properties of Cointegration Tests in the Light of High-Frequency Stylized Facts

University of Erlangen-Nürnberg, Lange Gasse 20, 90403 Nürnberg, Germany
*
Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2017, 10(1), 7; https://doi.org/10.3390/jrfm10010007
Submission received: 10 October 2016 / Revised: 12 December 2016 / Accepted: 31 January 2017 / Published: 7 February 2017

Abstract

:
This paper establishes a selection of stylized facts for high-frequency cointegrated processes, based on one-minute-binned transaction data. A methodology is introduced to simulate cointegrated stock pairs, following none, some or all of these stylized facts. AR(1)-GARCH(1,1) and MR(3)-STAR(1)-GARCH(1,1) processes contaminated with reversible and non-reversible jumps are used to model the cointegration relationship. In a Monte Carlo simulation, the power and size properties of ten cointegration tests are assessed. We find that in high-frequency settings typical for stock price data, power is still acceptable, with the exception of strong or very frequent non-reversible jumps. Phillips–Perron and PGFF tests perform best.

1. Introduction

The concept of cointegration has been empirically applied to a wide range of financial and macroeconomic data. In recent years, interest has surged in identifying cointegration relationships in high-frequency financial market data (see, among others, [1,2,3,4,5]). However, it remains unclear whether standard cointegration tests are truly robust against the specifics of high-frequency settings. In our dataset, we find several stylized facts with potential impact on the power and size properties of contemporary cointegration tests. Most notably, the one-minute return data are highly non-normal and exhibit ARCH effects with intraday seasonalities and jumps. Additionally, we find evidence for nonlinear dependencies, even after applying AR(1)-GARCH(1,1) filtration; see Section 2.
Monte Carlo studies of the power and size properties of cointegration tests are no novelty to the literature. Existing approaches may be clustered as follows: The first group uses vector autoregressive (VAR) models with Gaussian innovations as data generating processes (DGPs). These are in line with the assumptions of the commonly-applied Johansen procedure; see [6,7,8]. The objective is to compare power and size properties across a wide range of different cointegration tests. Common references are [9,10,11]. The latter study constitutes the most comprehensive contribution and provides an excellent literature review. The second group analyzes the effect of non-Gaussian innovations on power and size properties. The work in [12] considers Student’s t-distributions with three degrees of freedom, a truncated Cauchy distribution, Gaussian mixtures and others. Furthermore, bivariate ARCH and GARCH processes are also used to model the innovations. The work in [13] analyzes the impact of ARCH innovations from a multivariate Baba-Engle-Kraft-Kroner-ARCH DGP on the trace test. The works in [14,15] follow a similar approach. The third group covers the concept of threshold cointegration. The seminal reference is by [16], who were the first to introduce threshold nonlinearities in cointegration relationships and to analyze the power of conventional and enhanced tests in this setting. Further studies and improved tests follow [17,18].
However, none of these studies has considered the joint impact of high-frequency stylized facts on the power and size properties of contemporary cointegration tests. In this respect, the contribution of this paper is three-fold:
First, a procedure is developed to simulate high-frequency stock prices from actual market data, while retaining most of their idiosyncrasies.
Second, suitable DGPs are considered to simulate cointegration relationships, reflecting different stages of high-frequency stylized facts, i.e., non-normality, GARCH effects, threshold nonlinearities, reversible and non-reversible jumps. Our choice of DGPs is not only driven by the known stylized facts, but also by the economic interpretation of the cointegration relation. Similar approaches are known to the literature (see, e.g., [19,20]), but new in the context of testing. Here, we follow the notion that the cointegration relation reflects the law of one price (LOP); see [21] for the subsequent discussion. The work in [22] states the LOP as the “proposition (...) that two investments with the same payoff in every state of nature must have the same current value.” The work in [23] expands on this concept and argues that “closely integrated markets should assign to similar payoffs prices that are close.” In other words, the cointegration residual ties the price series of two securities together, ensuring that they are close. Whenever they diverge too far, the stationary models describe a relative value arbitrage mechanism as in [21], ensuring subsequent convergence. The latter is referred to as a “weak-form market integration” by [23], meaning a “near-efficient market”. Consequently, we investigate the following models for the cointegration residual in our simulations:
  • In a basic approach, an AR(1) and an AR(1)-GARCH(1,1) model are chosen, representing one stationary (or nonstationary) regime with arbitrage permanently occurring.
  • A sophistication is given by a MR(3)-STAR(1)1-GARCH(1,1) with its different regimes to model the impact of transaction cost on arbitrage: In its middle regime, the cointegration residual truly behaves like a random walk; in this domain, arbitrage is not yet profitable. However, once the cointegration residual ventures into the outer regimes, arbitrage starts to occur.
  • An addition of reversible jumps represents uninformed buying or selling, as described in [24], or in the sense of [25], who associate “transitory” or “observable jumps” in continuous time trading with temporary liquidity shortages.
  • An alternative addition of non-reversible jumps represents idiosyncratic information, affecting only one of the two companies. Such a jump translates into a regime shift, causing further arbitrage to occur, but at a different level. A similar motivation is given in [25], who define “permanent” or “innovation jumps” as regime shifts in the fundamental value of a firm.
Third, ten contemporary cointegration tests are examined in a Monte Carlo simulation with respect to their power and size properties for sample sizes of n = 510 minutes, i.e., one trading day at Xetra. We find that: (1) non-normal innovations and GARCH effects have only a limited impact on the tests; (2) threshold nonlinearities lead to significant distortions with increasing threshold levels and abruptness of regime shifts; and (3) reversible jumps do not harm or even increase the power, and non-reversible jumps strongly deteriorate the power.
The remainder of this paper is organized as follows: Section 2 covers the high-frequency sample provided by Deutsche Börse AG. Section 3 reviews the three-step approach for simulating a cointegrated stock pair and assessing the power and size properties of the different cointegration tests. Section 4 presents the results and discusses the key findings in light of the existing literature. Finally, Section 5 concludes and summarizes directions for further research.

2. Data Sample and Its Stylized Facts

The data applied in this study consist of all transactions of the DAX 30 constituents traded on Xetra in continuous trading throughout the year 2014. Transaction data are aggregated to one-minute bins, so we have more than 3.8 million data points at hand. The total value of all transactions amounts to EUR 910 billion, representing a major share of all trades in the respective stocks. Corporate actions data are available, as well. In order to smooth the data for the deterministic price changes implied by corporate actions (time and magnitude are largely known up front), we sanitize the returns by the approximate amount of their corporate actions, i.e., by setting the largest return in absolute value within a ten-minute window around the exact time of the corporate action to zero. In a first step, we analyze the data in light of well-known stylized facts for financial data. For each of the 249 trading days, we perform a set of tests for each of the 30 constituents. (The Jarque–Bera test, the Broock-Dechert-Scheinkman (BDS) test and the Teräsvirta test are implemented in the R package tseries by [26]. The Box test is part of the R package stats by the [27]. Engle’s ARCH test is implemented in the R package FinTS by [28]. The Barndorff-Nielsen and Shephard (BNS) test of [29] is implemented in the R package highfrequency by [30]. The Tsay test is implemented in the R package TSA by [31]. The Luukkonen test of [32] is implemented in the R package twinkle by [33].) These tests are first run on the raw returns and then on the residuals of adequately-fitted AR(1) and AR(1)-GARCH(1,1) processes - the latter are fitted with the R package rugarch of [34]. Table 1 summarizes the results and depicts the share of tests with p-values less than 0.05 . The raw and the filtered returns are highly non-normal, as indicated by the Jarque–Bera tests. ARCH effects are present in the raw returns, but largely filtered out by the AR(1)-GARCH(1,1) processes. The raw returns are highly nonlinear, as indicated by the BDS test, the Tsay test, the Luukkonen test and the Teräsvirta test. Even after the GARCH filter, there remains evidence of nonlinearity in the data. Volatility does not only show strong GARCH(1,1) effects, but exhibits significant intraday seasonality, as discussed, e.g., in [35]. Specifically, we observe the typical U-shaped pattern across all constituents. The clock time dependency of this seasonality pattern for each individual stock strongly resembles the pattern identified for the DAX future in [36]; see Figure 1.
Furthermore, the BNS test indicates jumps. These findings are in line with the existing literature; see, for example [37] for an account of relevant stylized facts in financial markets.
In a second step, we check if the stylized facts of the cointegration residuals of potentially cointegrated stock pairs differ from the results above. We identify the cointegration relationship with the Johansen trace test at the five percent significance level. This approach is delicate, since one of the objectives of this paper is to examine the power of cointegration tests. However, we do not claim that the price series are really cointegrated, but much rather identify processes that are typically identified as cointegrated by one of the most commonly-used cointegration tests. We use the implementation of the Johansen procedure by [38] to examine all 30 × 29 / 2 = 435 pairs of stocks for each of the 249 full trading days (this amounts to a total of 108,315 tests). First, we use the augmented Dickey–Fuller test of [39] to check if the individual series are I(1) at the one percent significance level. We follow the routine outlined in [40], where first the augmented Dickey–Fuller (ADF) test is deployed on the two nonstationary processes and second the Johansen test. A more conservative approach could be to use the Kwiatkowski–Phillips–Schmidt–Shin (KPSS) test in the first step, with a null hypothesis of stationarity. Rejecting the I(0) null hypothesis of a KPSS test instead of accepting the I(1) null hypothesis of a unit root test may further restrict the number of pairs. If the series are considered I(1), the lag order of the VAR models is selected with the Bayesian information criterion (BIC). Next, the test statistic of the Johansen trace test is calculated, and the p-values are approximated with the routine described in [41] and implemented in R by [42]. A total of 435 × 249 = 108,315 processes are analyzed, and 41,069 are identified as cointegrated at the five percent significance level. Finally, we run the same set of tests to see if the stylized facts of the cointegration residuals differ from the results of Table 1. The results are summarized in Table 2. We see that the cointegration residuals exhibit similar stylized facts as the return data, i.e., non-normality, ARCH effects, jumps and further evidence of nonlinearities.
Since the cointegration residuals are one of the focal points of this paper, we provide further evidence on GARCH effects. We deploy the R package rugarch of [34] to fit AR(1)-GARCH(1,1) models to all identified cointegration relationships, following a standard specification further detailed in Equation (2) with t-distributed innovations. We log all 41,069 × 5 coefficients in a matrix of these dimensions, notably, the AR(1) coefficient ϕ 1 , the intercept α 0 of the GARCH model, the ARCH parameter α 1 , the GARCH parameter β 1 and the degrees of freedom of the t-distribution ν. We add an additional column for the unconditional standard deviation σ G of the process, which can be calculated. Next, we sanitize the coefficient matrix by eliminating the 99th percentile of values of σ G , as well as a handful of estimates with σ G < 0 . The latter cases usually correspond to processes, where the solver does not find a proper optimum. This sanitization eliminates a total of 424 cases, reducing the number of observations to 40,644. In Figure 2, we show a histogram for the above mentioned process parameters and in Table 3 their summary statistics. We see that α 1 is close to zero (mean of 0.064); β 1 is close to one (mean of 0.875); and α 1 + β 1 even closer to one (mean of 0.940), i.e., close to variance nonstationarity. The t-distributed innovations exhibit low degrees of freedom; most of the probability mass is concentrated to the left of ν = 10 . The latter observations are typical for financial data. The unconditional standard deviation is very low; most of the probability mass is left of 0.1, which is in line with the short time period of one-minute returns.
Next, we submit the cointegration residuals to a detailed jump analysis, loosely following the methodology [25] applied to stocks. In particular, we difference each of the cointegration residuals, consisting of n = 510 observations. Then, we estimate the left and right tail index over the differenced series, using Hill’s estimator (compare [43]) with a threshold of five percent, relying on the R package fExtremes. For simplicity’s sake, we assume that the innovations of the differenced cointegration residuals are drawn from a t-distribution whose degrees of freedom are equal to the average of our left and right tail index estimates. We characterize a jump as an innovation outside of the one percent or outside of the 99 percent quantile. It is reversible, if the jump persists for only one period before the process bounces back into its 98 percent confidence interval. Conversely, it is defined as non-reversible if the new level is sustained for at least two periods. Further details can be found in [25], Appendix 6.1. Table 4 reports the descriptive statistics for reversible and non-reversible jumps, with q k % denoting the k-% quantile. We see that, on average, 3.66 reversible and 6.88 non-reversible jumps occur per day in the cointegration residuals. Non-reversible jumps are larger, with a mean of 0.17 EUR as opposed to 0.12 EUR for reversible jumps. Note that we observe more jumps than the theoretical quantiles of the t-distribution would suggest, indicating even fatter tails than estimated. This result is in analogy to [25].

3. Methodology

The suggested methodology to assess the power and size properties of ten cointegration tests is based on three steps. In the first step in Subsection 3.1, we present our approach to simulate high-frequency stock prices while retaining their stylized facts. In the second step in Subsection 3.2, we show how to simulate cointegration residuals, exhibiting different stylized facts. In the third step in Subsection 3.3, we finally present the methodology for simulating the cointegration relationship and, hence, the price of the second stock. Steps 1 to 3 are then repeated in different Monte Carlo settings in Subsection 3.4 to obtain the power and size properties of the tests under different conditions. This univariate design for assessing cointegration tests is loosely based on the setups of [44,45]. The parameter choices common to all Monte Carlo settings are discussed along Subsection 3.1 through Subsection 3.3, and specific choices are detailed in Subsection 3.4.

3.1. Simulation of Stock Prices

For an effective Monte Carlo simulation, we need a methodology to generate artificial high-frequency price series, while retaining their stylized facts as accurately as possible. We suggest a variant of the stationary bootstrap developed by Politis and Romano [46] that we apply across all Monte Carlo settings:
  • Set the return index i equal to one for initiation, i.e., to the first return of the day. Initialize a vector v with length 509 with zeros.
  • Draw one stock s out of the 30 DAX 30 constituents.
  • Draw one day d out of 249 full trading days.
  • Draw a random block length l from a geometric distribution with expected value of four 2.
  • Choose a block of length l, consisting of returns from stock s from day d for indices i , i + 1 , . . . , i + l . Copy these returns in vector v at positions i , i + 1 , . . . , i + l .
  • Update i with i + l + 1 . Go back to Step 1, until vector v consists of 509 returns.
  • Draw a random starting price between 5 and 40 from a uniform distribution. Accumulate the return vector v to a price series.
This procedure has several advantages. First, it ensures that the return series remains stationary; see [46]. Second, since blocks of a random length with an expected value of four are drawn, volatility clusters are partially preserved. Third, considering that the time ordering within a trading day is retained, intraday volatility seasonalities, i.e., the U-shaped pattern, are also reflected in the simulated prices. Fourth, also all further stylized facts can be observed in simulated price paths 3. For example, jumps in the raw input data are drawn with some positive probability corresponding to the actual occurrence of jumps in high-frequency financial market data. Subsuming all of these advantages justifies the ad hoc nature of this nonparametric approach.

3.2. Simulation of Cointegration Residuals

In order to reflect the identified stylized facts, five DGPs are considered for the simulation of the cointegration residuals: a simple AR(1)-process, an AR(1)-GARCH(1,1) process, an MR(3)-STAR(1)-GARCH(1,1) process, an MR(3)-STAR(1)-GARCH(1,1) process with reversible jumps and an MR(3)-STAR(1)-GARCH(1,1) process with non-reversible jumps. In the following, we briefly discuss the relevant statistical properties of each DGP.

3.2.1. Autoregressive Model

Let u t denote the cointegration residual at time t. An AR(1) process of this residual with coefficient ϕ 1 and scale parameter σ G may be defined as:
u t = ϕ 1 u t 1 + σ G ϵ t , with ϵ t N 0 , 1 or ϵ t t 0 , ω = ν 2 ν , ν ,
such that the innovations are either normally distributed with mean zero and standard deviation of one or t-distributed with mean zero, ν degrees of freedom and a shape parameter ω. Following [47], the shape parameter ω is chosen in such a way that the standard deviation of the t-distribution equals one. For this purpose, ν must be strictly greater than two, so for all numerical implementations involved, a lower bound of 2.1 for ν is applied. Clearly, this process is stationary if | ϕ 1 | is less than one. In case the normal distribution is chosen, this process does not reflect any of the high-frequency stylized facts. The t-distribution introduces non-normality. Note that in the AR(1)-case, the cointegrated processes share the same stylized facts, e.g., even jumps would happen simultaneously. This behavior is typical for dual-listed companies; see, for example, [48].

3.2.2. Generalized Autoregressive Conditional Heteroscedasticity Model

The GARCH model of [49] generalizes Equation (1) to an AR(1)-GARCH(1,1) model, accounting for time-dependency in the scale parameter as:
u t = ϕ 1 u t 1 + x t , with x t = σ t ϵ t , σ t 2 = α 0 + α 1 x t 1 2 + β 1 σ t 1 2 , α 0 > 0 , α 1 , β 1 0
where the time-dependency is determined by parameters α 0 , α 1 and β 1 . The innovations ϵ t may again be either normally or t-distributed in the same manner as for the autoregressive model. The mean follows the same stationarity condition as Equation (1). A sufficient condition for strong stationarity for GARCH(1,1) is given if E ( log ( β 1 + α 1 ϵ t 2 ) ) < 0 (compare [50]), whereas the necessary and sufficient condition for weak stationarity is α 1 + β 1 < 1 ; see [49]. The given process is used to model the stylized facts of non-normality and of ARCH effects. Several popular extensions of GARCH, such as Glosten-Jagannathan-Runkle-GARCH ([51]) or exponential-GARCH ([52]), allowing for leverage/asymmetry in conditional volatility, have been neglected for the sake of simplicity and interpretability. The same applies to highly flexible multivariate volatility models, such as the diagonal Baba-Engle-Kraft-Kroner model for multivariate conditional volatilities and co-volatilities; see [53].

3.2.3. Multiple Regime Smooth Transition Autoregressive Model

Based on TAR models developed in [54,55], which first introduced STAR models, which have again been extended by [56] to three regimes, for our purpose, we use a simplified version of the latter model, i.e., a MR(3)-STAR(1) process defined as:
u t = ϕ 1 u t 1 + ϕ 2 u t 1 F 12 γ 1 , c 1 + ϕ 3 u t 1 F 23 γ 2 , c 2 + σ t ϵ t .
Thereby, ϕ i , i [ 1 , 2 , 3 ] are the AR(1) coefficients of the three regimes, and F denotes the logistic transition function as suggested in van Dijk and Franses [56] with smoothness parameter γ and threshold c. σ t follows again Equation (2) with innovations being either normally or t-distributed. With three regimes, threshold nonlinearities and GARCH effects, the model may be denoted as MR(3)-STAR(1)-GARCH(1,1). Its variance follows the same stationarity condition as above. Regarding the mean equation, to the best of our knowledge, no comprehensive stationarity conditions have been developed yet; compare the discussions in [57] or [58]. However, for our purposes, we just discuss simplified parameter constellations, following the ideas of [59] for a multiple-threshold AR(1) model:
Case A: Let ϕ 1 be equal to one, and c 1 < c 2 . If ϕ 2 and ϕ 3 are equal to zero, we wind up having only one regime: a random walk, which clearly is nonstationary.
Case B: If ϕ 1 is an element of [ 0.95 , 0.90 , 0.85 ] , ϕ 2 is an element of [ 0.05 , 0.10 , 0.15 ] , ϕ 3 is an element of [ 0.05 , 0.10 , 0.15 ] and c 1 < c 2 , the middle regime corresponding to ϕ 2 is nonstationary, but the outer regimes become stationary, making the entire process stationary. The latter is due to a mix effect. For values sufficiently 4 less than c 1 , the logistic transition functions F 12 and F 23 converge towards zero, so the lower regime is stationary with a mixed coefficient ϕ L element of [ 0.95 , 0.90 , 0.85 ] . For values sufficiently greater than c 1 , but sufficiently smaller than c 2 , the logistic transition function F 12 converges towards one, but F 23 still converges towards zero. Hence, the mixed coefficient for the middle regime ϕ M is an element of [ 1.00 , 1.00 , 1.00 ] , indicating nonstationarity. For values sufficiently greater than c 2 , F 23 also converges towards one, leading to a mixed coefficient ϕ U equal to [ 0.95 , 0.90 , 0.85 ] . Let us assume that we choose symmetric threshold levels, i.e., c 1 = c 2 , with c 2 > 0 . The farther the thresholds are apart, the larger the nonstationary share of the process becomes. Furthermore, the smaller the parameter γ, the smoother the logistic function and the larger the influence of the lower and upper regimes.
The MR(3)-STAR(1)-GARCH(1,1) can thus model the stylized facts not only of non-normality and ARCH effects, but also nonlinear dependencies. The latter is related to the fact that the MR-STAR model “nests several other nonlinear time-series models (...), for example, an artificial neural network”, as ([56], p. 317) point out.

3.2.4. Multiple Regime Smooth Transition Autoregressive Model with Reversible Jumps

We define reversible jumps as extreme events that are reversed over time after their initial occurrence, meaning they only have a temporary effect on the cointegration relationship. We model them by successively drawing waiting times w i from an exponential distribution with parameter λ w , so that the cumulative sum of the waiting times is less than or equal to n = 510 min (we round to the next full integer). The cumulative sum over the waiting times w i provides the time index t of the current jump. At these time indexes, the innovations σ G ϵ t of Equation (1) and σ t ϵ t of Equations (2) or (3) are replaced by a draw from a Student’s t-distribution defined as:
D w t 0 , ω w = s w m w ν w 2 ν w , ν w ,
where the shape parameter ω w fixes the median of the absolute value of the t-distribution to the desired level m w , i.e., the median jump size ( s w first standardizes it to one). In stationary models, these jumps only have a temporary impact. This model fulfills all of the stylized facts outlined in Section 2, except non-reversible jumps.

3.2.5. Multiple Regime Smooth Transition Autoregressive Model with Non-reversible Jumps

We define non-reversible jumps as extreme events that have a permanent effect on the time series, i.e., a mean shift. We model the jump series j t as a compound Poisson process defined in ([60], p. 87) ff., as:
j t = i = 1 N ( t ) D i , with D i t 0 , ω p = s p m p ν p 2 ν p , ν p ,
where { N ( t ) , t 0 } is a Poisson process with parameter λ P , the expected number of jumps per trading day, and D i is a sequence iid of t-distributed random variables independent from N ( t ) . The shape parameter ω p fixes the median of the absolute value of the t-distribution to the desired level m p , i.e., the median jump size ( s p first standardizes it to one). The cointegration residual u t * is then a superposition of the MR(3)-STAR(1)-GARCH(1,1) model of Equation (3) and the compound Poisson process in Equation (5), i.e.,
u t * = u t + j t .
As a consequence, this model fulfills all of the stylized facts outlined in Section 2, except reversible jumps. Note that the compound Poisson process has an expected value of zero, as well, but its variance increases over time.

3.2.6. Parameter Choices Common to All Monte Carlo Variants

The above models require various parameters that we have introduced in Equations (1) through (5). Generic parameter settings for all Monte Carlo variants with respect to the cointegration residuals are discussed below. In particular, the five GARCH parameters are sampled from actual high-frequency data, i.e., the ARCH parameter α 1 , the GARCH parameter β 1 , the degrees of freedom of the t-distribution ν and the unconditional standard deviation σ G . Specifically, we sample with replacement from the 40,644 sets of sanitized cointegration residuals elaborated in Section 2. Each resampling provides us with five raw GARCH parameters specific to a particular set of cointegration residuals. Following [61], we fuzzify these parameters by adding Gaussian error terms with mean zero and standard deviation equal to the bandwidth determined with a method suggested by [62]. We ensure that the error term does not lead to the violation of the general assumptions about the process parameters, e.g., among others, α 1 + β 1 < 1 or σ G > 0 . This routine allows for effective resampling from realistic GARCH parameters, while introducing additional flexibility for the Monte Carlo simulations through the Gaussian error terms.

3.3. The Cointegration Relationship

Once the prices of the first stock are simulated according to the approach in Subsection 3.1 and the cointegration residual according to the approach in Subsection 3.2, we can finally determine the cointegration relationship. The latter is defined as:
p 2 t = α + β p 1 t + u t ,
where p 2 t is the price of the second stock, p 1 t is the price of the first stock as simulated with the approach outlined in Subsection 3.1 and u t is the cointegration residual as simulated with one of the processes suggested in Subsection 3.2. The coefficient α is the intercept, which is always drawn from a uniform distribution between 2 and 2. The parameter β is the coefficient of cointegration, which is always drawn from a uniform distribution between 1 and 5. The latter parameters are chosen ad hoc, as they have no impact on the power and size properties of cointegration tests.

3.4. Study Design

3.4.1. Cointegration Tests

In order to assess the power and size properties of ten different cointegration tests, we have adapted the R implementation of the package egcm of [45] to our setting. This package provides p-values for ten different unit root tests that are adjusted for cointegration testing. In other words, for all cointegration tests outlined in Table 5, we use the corresponding critical values suitable for a cointegration setting from egcm (see [45]), in order to not over-reject the null hypothesis. All tests are listed in Table 5 together with their implementation. Details about their functionality can be obtained from the references.

3.4.2. Setup of Monte Carlo Simulations

We run six different types of Monte Carlo simulations of price pairs, differing in the setup of the cointegration relationship. The types are chosen to gradually incorporate more of the stylized facts discussed in Section 2. Each simulation is run with 10,000 replications per fixed parameter constellation and a path length of n = 510 , i.e., one trading day. The parameters for these simulation may be distinguished into two groups. The first group is the parameters applied in a common way for all types. The choices for these parameters are outlined in Subsection 3.1, Subsection 3.2 and Subsection 3.3. The second group consists of parameters specific to each simulation type, discussed below, where Table 6 provides an overview.
Type I: An AR(1) process with normally-distributed innovations is used to model the cointegration relationship. We test different values for ϕ 1 . This setting reflects none of the stylized facts and should return largely undisturbed size and power properties for all tests. As mentioned before, ϕ 1 = 1 returns the size of the tests, and ϕ 1 = 0.95 , 0.90 , 0.85 returns the power for different AR(1) coefficients. Hence, in total, four simulations are performed, each with 10,000 replications.
Type II: An AR(1) process with t-distributed innovations is applied in the same configuration as above, thereby reflecting the stylized fact of non-normality and its influence on size and power.
Type III: An AR(1)-GARCH(1,1) process with t-distributed innovations introduces non-normality and ARCH effects to the cointegration relationship. As in the scenarios above, we iterate through the four different values for ϕ 1 , accounting for four simulations.
Type IV: A MR(3)-STAR(1)-GARCH(1,1) model with t-distributed innovations is the workhorse for a cointegration residual exhibiting non-normality, ARCH effects and nonlinear dependencies. Type IV is split into 36 simulations. Our simulations are based on three different, symmetric threshold levels c i * , defined as:
c i * = c i · σ M , where | c i | = 1.00 , 5.00 , 10.00 , σ M = σ G 1 ϕ 2 and i = 1 , 2 .
In other words, the threshold c i * is a multiple c i of the unconditional standard deviation σ M of a hypothetical mixed process, assuming ϕ = 0.95 = const . across all applications 5 , 6. For each of these threshold multiples c i , three different values of γ i are tested, i.e., γ i = 1.00 , 5.00 , 10.00 . For each set of threshold multiple and gamma, we jointly vary ϕ 1 , ϕ 2 and ϕ 3 through a total of four combinations, i.e., ϕ 1 = 1.00 , 0.95 , 0.90 , 0.85 , ϕ 2 = 0.00 , 0.05 , 0.10 , 0.15 and ϕ 3 = ϕ 2 . The latter leads to ϕ L = 1.00 , 0.95 . , 0.90 , 0.85 = ϕ U and ϕ M = 1 . As before, each core simulation has 10,000 replications.
This choice of parameters is also in line with their economic interpretation: The (symmetric) thresholds c 1 * and c 2 * define at which level arbitrage kicks in, and the parameter γ determines how abruptly arbitrage kicks in. Regarding the abruptness of arbitrage kicking in, we can argue for a certain smoothness. Arbitrageurs most likely have different operating costs and different relative-value arbitrage models; for an overview of such strategies, see [73]. This diversity creates fuzziness around the threshold levels, which is reflected in the parameter γ. A value of ten already hinges towards a Heaviside function; a value of one is very smooth. As an ad hoc choice, we opt for a value of five. Regarding the thresholds, we have reason to believe that they would amount to the mean value of the cointegration residual plus or minus the transaction costs of the arbitrageur. The median bid-ask spread for all DAX 30 stocks in 2014 is in the range of four basis points (bps) or approximately 0.02 EUR, taking into account the price levels. Using Equation (8), the unconditional standard deviation of a hypothetical mixed process with σ G = 0.006 (mean value observed in our data in Table 2) and ϕ = 0.95 (median value corresponding to our data as in Table 7) amounts to 0.019 EUR; hence, clearly, the profit from a spread trade, entered at the threshold level of | c i * | = 1 · σ M and held until the mean reversion amounts to 0.019 EUR and approximately covers transaction costs, given that the bid-ask spread has to be crossed only for one of the stocks on average. Considering this thought experiment, it is reasonable to believe that equity markets exhibit arbitrage threshold multiples approximately at | c i | = 1 .
Type V: A MR(3)-STAR(1)-GARCH(1,1) model with t-distributed innovations is extended by reversible jumps. Type V consists of 36 simulations. We set the threshold multiple | c i | = 1 and γ i = 5 . We iterate through three different values for λ w , notably λ w = 2.00 , 3.00 , 8.00 , which correspond to an expected value of two, three or eight jumps per trading day. For each parameter setting of λ w , we test three different values for the scale parameter m w of the t-distributed innovations, notably m w = 0.05 , 0.10 , 0.30 . Both parameters, jump frequency λ w and median jump size m w , are motivated from the descriptive statistics in Table 4, effectively representing a best case (low jump frequency or jump size close to the 25% quantile), a base case (medium jump frequency or size close to the 50% quantile) and a worst case (high jump frequency or size close to the 95% quantile). The degrees of freedom ν w are set to five in order to create a leptokurtic distribution with fat tails. For each combination of λ w and m w , we iterate through three combinations of ϕ j , as in Type IV. This scenario reflects the stylized facts of non-normality, the ARCH effects, nonlinear dependencies and reversible jumps.
Type VI: An MR(3)-STAR(1)-GARCH(1,1) model with t-distributed innovations is extended by non-reversible jumps. Type VI consists of 36 simulations. As before, we set the threshold multiple | c i | = 1 and γ i = 5 . We iterate through three different values for λ p , notably λ p = 4.00 , 6.00 , 12.00 , which correspond to an expected value of four, six or twelve jumps per trading day. For each parameter setting of λ p , we test three different values for the scale parameter m p of the t-distributed innovations for the compound Poisson process, notably m p = 0.05 , 0.10 , 0.45 . Both parameters, jump frequency λ p and median jump size m p , are motivated from the descriptive statistics in Table 4, effectively representing a best case (low jump frequency or jump size close to the 25% quantile), a base case (medium jump frequency or size close to the 50% quantile) and a worst case (high jump frequency or size close to the 95% quantile). These values are also in line with the findings in [25]. Applying a variant of their methodology suitable for cointegration relationships, we find a median number of six non-reversible jumps per day with a median jump size of approximately 0.10 EUR in absolute value. The degrees of freedom ν p are again set to five. For each combination of λ p and m p , we iterate through three combinations of ϕ j , as in Type IV. This scenario reflects the stylized facts of non-normality, the ARCH effects, nonlinear dependencies and non-reversible jumps.
Please note that, like the majority of the other parameters, the mean-reversion coefficients ϕ 1 (Types I and II) and ϕ L = ϕ U (Types III to VI) selected for the power and size analysis are motivated based on our data; compare Table 3. Here, the descriptive statistics for the AR(1)-coefficients estimated from the empirical cointegration residuals are significantly closer to one than the values we apply, i.e., ϕ 1 = ϕ L = ϕ U = 1.00 , 0.95 . , 0.90 , 0.85 . This alleged discrepancy reflects the fact that estimates for AR(1)-coefficients are biased towards the nonstationary case, if a true underlying AR(1)-process is contaminated with non-reversible jumps 7. The fit of our choice of AR(1)-coefficients with the estimates in Table 3 is therefore illustrated by a simple simulation: We run 10,000 replications in order to see how the true coefficients translate to their empirical estimates in our Type VI case for stock market data, i.e., six non-reversible jumps of median size of 0.10 EUR in absolute value. Table 7 reports the results. We see that a mean-reversion coefficient ϕ L = ϕ U of the true DGP translates to an estimate ϕ ^ of 0.956. This value approximately corresponds to the 25% quantile of the empirically-observed data (see Table 3), i.e, the least conservative case. A mean-reversion coefficient of 0.95 corresponds to estimates of 0.971, which is in line with the 50% quantile of the empirically observed data and, thus, defines the base case for stock market data.

4. Results

Henceforth, we use the following acronyms for cointegration tests in the subsequent tables: pp is the Phillips-Perron test, adf the Augmented Dickey-Fuller test, jo-e the Johansen Eigenvalue test, jo-t the Johansen Trace test, ers-p the Elliott-Rothenberg-Stock point optimal test, ers-d the Elliott-Rothenberg-Stock Dickey-Fuller Generalized Least Squares test, sp-r the Schmidt and Phillips rho statistic, hurst the Hurst exponent, bvr Breitung’s variance ratio test, and pgff the Pantula-Gonzalez-Farias and Fuller test. The same acronyms are used in capital letters to refer to these tests in the main text of the results section.

4.1. Results Type I through Type III

Size: The Type I simulation unveils the undisturbed size properties of the cointegration tests. All simulations are run at a significance level of five percent. Since Type I does not violate any assumptions of any of the tests, the size should amount to five percent, as well, as long as the tests are properly calibrated. We see in Table 8 that only very limited size distortions occur for Type I, i.e., the normally-distributed innovations. When running the original routines of [45] with 10,000 replications, we also obtain a very slight size distortion at the level of one percentage point. Whereas [45] uses a standard deviation of one, we simulate the standard deviation as described in Subsection 3.2.6, with a mean value of 0.006 and a first quartile of 0.0003. The latter should not make any difference for the considered cointegration tests, so we attribute the fluctuations around the five percent level to chance alone.
For t-distributed innovations, the size remains unchanged to the second decimal across all tests. Hence, t-distributed innovations have no effect whatsoever on the size of these tests. The latter is expected, given the strong asymptotics for a sample size of n = 510 . For Type III, the size increases on average by 0.01, meaning that GARCH models on the brink of variance nonstationarity, as incurred by high-frequency data, only lead to very limited size distortions. We note at this stage that for types IV through VI, no further significant size distortions beyond those of the AR(1)-GARCH(1,1) process occur; compare Table 10, Table 12 and Table 14. The size properties are an important finding. They suggest that the stylized facts of high-frequency financial data have virtually no impact on the Type I error of these tests.
Power: The Type I simulation shows undisturbed power properties; see Table 9. We see that the power is above 0.80 for all tests for ϕ 1 equal to 0.85. With the AR(1) coefficient moving towards a random walk, the power deteriorates. For ϕ 1 equal to 0.95, the Phillips–Perron (PP) and the Pantula, Gonzalez-Farias and Fuller (PGFF) test still have excellent power of 0.84 and 0.87, respectively. This result confirms the widespread opinion of practitioners that the PP test performs better on financial data; see Alexander [74]. Their reasoning is based on the fact that the PP test allows for dependent errors with heteroscedastic variance by introducing a correction term to the Dickey–Fuller statistic. This correction evidently translates to improved power properties vis-a-vis the augmented Dickey–Fuller (ADF) test. Furthermore, we confirm the findings of [66] in a cointegration setting, who affirm the higher power of their alternative estimators compared to OLS. Along those lines, it makes sense that the ADF test does not perform as well as the PGFF test with a power of 0.64. However, it is surprising to see that the well-established Johansen procedure with the Johansen eigenvalue (JO-E) and the Johansen trace test (JO-T) shows unfavorable power properties at ϕ 1 equal to 0.95. Even though the Johansen test is still comparable to PP and PGFF for ϕ 1 = 0.90 , 0.85 , it significantly deteriorates when moving closer towards nonstationarity, exhibiting powers of 0.56 and 0.52 respectively. As [69] point out, the Elliot–Rothenberg–Stock (ERS-P, ERS-D) tests perform well in small samples and in case the series has an unknown mean or linear trend compared to the standard ADF test. Here, we face large sample sizes of n = 510 minutes, and linear trends are not considered. Hence, the power of these tests vacillates compared to the ADF test. At ϕ 1 equal to 0.95, the ERS tests show slightly better power, but for more stationary values, the ADF test dominates. The Schmidt–Phillips test (SP-R) performs almost equivalently to the ADF, with slight disadvantages at the most critical case of ϕ 1 equal to 0.95. The Hurst exponent and Breitung’s nonparametric variance ratio test (BVR) disappoint.
For Type II, this picture remains unchanged. The t-distributed innovations lead to fluctuations of the power of approximately 0.01 to 0.02, which seem random and go in both directions. The same applies to GARCH effects in Type III. So far, we conclude that non-normality and ARCH effects, as present in high-frequency financial data, still allow for cointegration testing with very respectable size and power properties. We can particularly recommend the PGFF and the PP tests. The very commonly-applied ADF test also exhibits adequate power.

4.2. Results Type IV

Type IV introduces threshold nonlinearities as additional stylized fact. Results on size are given in Table 10 and power in Table 11. Size: As size is stable at around 0.05 to 0.07 for all tests, the analysis may be focused on power.
Power: Overall, we see that the top of the ranking order of the tests remains as in Subsection 4.1, with PGFF and PP performing best, albeit with reduced power. We observe two dominant effects that negatively affect the power. First, increasing threshold levels significantly reduces the power. Take the PP test as an example. A threshold multiple of | c i | = 1.00 leads to a power of 0.78 for ϕ L = ϕ U = 0.95 and γ i = 1.00 . A threshold multiple of | c i | = 5.00 reduces that power to 0.54 and a threshold multiple of | c i | = 10.00 to 0.38. This behavior is easy to understand: With increasing threshold levels, a larger share of the total number of observations falls in the nonstationary middle regime. Hence, it is natural for the cointegration tests to exhibit lower power with the nonstationary share of the process gaining in weight. Speaking in relative terms, the decline in power driven by higher thresholds is approximately similar across all tests, with some exceptions. For example, for γ i = 1.00 , increasing the threshold multiples from ± 1 to ± 10 leads to a deterioration in the power of on average 50 percent for ϕ L = ϕ U = 0.95 , 36 percent for ϕ L = ϕ U = 0.90 and of approximately 25 percent for ϕ L = ϕ U = 0.85 . We see that threshold effects are getting stronger with AR coefficients approaching nonstationarity.
The second effect stems from the parameter γ of the logistic function in the STAR model. Increasing values of γ lead to more abrupt regime shifts and have an adverse effect on the power. The latter can be explained as follows: If γ is small, the logistic function is very smooth and “drags” the stationary behavior far into the otherwise nonstationary middle regime. In other words, the stationary outer regimes gain in weight at the expense of the nonstationary inner regime. The opposite is true for large values of γ. Then, the logistic function approaches a Heaviside function and induces very abrupt regime changes. In that case, the outer regimes lose weight relative to the inner regime. Logically, more realizations fall in the nonstationary inner regime, with a negative effect on the power.
The nonparametric BVR test, that should perform well in light of nonlinear dynamics (see [67]), does not gain the upper hand in this setting, but slightly improves its relative positioning for ϕ L = ϕ U = 0.95 and stronger threshold nonlinearities. PGFF and PP remain at the top of the ranking.

4.3. Results Type V

Type V introduces reversible jumps as additional stylized fact. Results on size are given in Table 12 and power in Table 13. Size: As size varies only between 0.04 and 0.07 for all tests, the analysis may be focused on power.
Power: We see that the power increases with λ w , i.e., the expected number of jumps per 510 min trading day, and m w , i.e., the median jump size in absolute value. For example, the PP test exhibits a power of 0.65 for ϕ L = ϕ U = 0.95 and λ w = 0 , which increases to 0.74 for λ w = 8 and m w = 0.30 , i.e., the most extreme case. This result is not surprising. A reversible jump is defined as an innovation with much higher variance. However, the mechanics of the MR(3)-STAR(1)-GARCH(1,1) process still apply. As such, depending on the sign, there is a given chance that a jump boosts the process from the nonstationary middle regime to one of the stationary outer regimes. Then, mean-reversion occurs, driving the cointegration relationship back towards the middle regime. Of course, the contrary may occur, as well, meaning that the process jumps from one of the stationary outer regimes to the nonstationary inner regime. However, with increasing jump sizes, chances are becoming much higher for jumps occurring from the nonstationary to the stationary regimes than vice versa. The reason is the high median jump size in absolute value compared to the magnitude of the threshold levels. Clearly, we see that reversible jumps enlarge the proportion of observations in the stationary regimes, so it is quite intuitive that the cointegration tests exhibit higher power. We further note that this behavior is similar across all tests, i.e., reversible jumps leave the ranking of Table 11 largely unaffected. With respect to the AR-coefficients, we observe that gain in power is more expressed when moving closer to nonstationarity. However, most importantly, we note that significant gains in power can only be observed for AR(1)-coefficients of 0.95 and the more extreme jump scenarios with median jump sizes of 0.30 and an occurrence of eight jumps per trading day in expectation. The middle scenario with jump sizes of 0.10 and frequencies of 3.0 only shows very marginal gains in power, if any. However, this scenario corresponds to a typical cointegration residual exhibiting reversible jumps; see Table 4. We conclude that reversible jumps do not have an adverse effect on cointegration testing. On the contrary, in the unlikely case that they manifest at high rates and in high magnitude, they actually enhance the power.

4.4. Results Type VI

Results for Type VI are given for size in Table 14 and for power in Table 15. Size: As size varies only between 0.04 and 0.08 for all tests, the analysis may be focused on power.
Power: Type VI introduces non-reversible jumps through a compound Poisson process with expected number of jumps λ p per trading day and median jumps size m p in absolute value. The superposition of the compound Poisson process with the previously discussed STAR process of Equation (3) results in a regime shift of the mean equation with each jump. In other words, each jump technically leads to a rupture of the previous cointegration relationship and the establishment of a new one at another mean level. The shift in mean depends on the size of the jump, which is driven by its variance. From Table 15, we see that the power decreases with increasing expected number of jumps and increasing jump size in absolute value. In the following, we differentiate the results by jump size: Small jumps: Already smaller jumps with median size of 0.05 have an expressed effect on the power, considering that this jump size is already more than 15-times higher than the median size of the average GARCH-innovation of an average cointegration residual 8. For example, the power of the PP test declines from 0.65 to 0.41 for λ p = 6 and to 0.32 for λ p = 12 . Medium jumps: Medium jumps with a median size of 0.10 are even more detrimental. An expected value of six jumps leads to a deterioration in power to 0.28 for the PP test, i.e., a relative slump of 57 percent. This scenario corresponds to a cointegration residual exhibiting typically-sized non-reversible jumps at a typical frequency; see Table 4 for the quantile estimates. Higher jump frequencies further deteriorate the power, which is fair, considering that cointegration relationships only exist for a shorter time until the next mean shift occurs. Large jumps: Large jumps with standard deviation of 0.45 already do significant damage if they occur just four times a day in expectation. We see that power deteriorates by 80 percent across almost all tests compared to the case with no jumps. Increasing the frequency to six or even 12 jumps deteriorates the power to levels close to the Type I errors, rendering the tests ineffective.
These results are paramount, meaning that we are able to detect a (time-varying) cointegration relationship in financial market data with classical tests, even if its mean varies over time either at low frequencies or with only limited shifts in mean. Furthermore, in these settings, relative to the other tests, PGFF and PP are still recommended.

5. Conclusions

We conduct an in-depth analysis of the size and power properties of ten contemporary cointegration tests, given the stylized facts of high-frequency trade data. We make several contributions to the existing literature.
First, we test for different stylized facts on a large database of one-minute return data. In line with the existing literature, we find non-normality, ARCH effects, jumps and evidence of further nonlinearities. The same stylized facts are established for the cointegration residuals processes linking some of the DAX 30 constituents. To our knowledge, the latter analysis has not yet been performed in the literature. It is interesting to see that the identified cointegration residuals that are classified as stationary by the Johansen trace test still exhibit non-normalities, ARCH effects, further nonlinearities, as well as jumps. In particular, we find a median number of three reversible and six non-reversible jumps with median sizes of 0.10 and 0.11 EUR in absolute value. One could have assumed that these stylized facts cancel each other out, just as the nonstationary components. The data prove this to be wrong.
Second, we propose an innovative approach for simulating stock prices following the stationary bootstrap of [46]. This procedure allows for simulating stock prices in a high-frequency setting, while retaining the majority of their stylized facts.
Third, we suggest six different processes for cointegration residuals based on the current literature. These processes accommodate the above-mentioned stylized facts in a staggered approach. To our knowledge, the application of an MR(3)-STAR(1)-GARCH(1,1) model is a novelty in this context. It provides significantly higher flexibility for modeling relative-value arbitrage strategies and may also be an interesting direction for further research in that respect.
Fourth, we have performed Monte Carlo simulations to assess the power and size properties of ten different cointegration tests. We find that both non-normality and GARCH effects have none or only a marginal impact on size and power properties. We conclude that non-normality and ARCH effects, as present in high-frequency financial data, still allow for cointegration testing with very respectable size and power properties.
STAR nonlinearities have an adverse effect on the power, which decreases with rising threshold levels and increasing values for gamma. The latter is driven by the fact that increasing parameter values push a higher share of observations in the nonstationary middle regime. However, with the plausible assumption of relative-value arbitrage kicking in at threshold levels estimated via bid-ask spreads, the power of cointegration tests still remains acceptable.
The effect of jumps differs by their nature: Reversible jumps, which may be driven by uninformed buying or liquidity shortages, do not have an adverse effect on the power or actually increase it. This phenomenon is easily explained by the fact that a reversible jump is just a temporary disruption, which usually pushes the observations in the stationary outer regimes. On the contrary, non-reversible jumps disrupt the power with an increasing rate of occurrence and increasing jump size. Assuming typical jump rates and sizes for our cointegration residuals, power deteriorates to levels of approximately 0.29 for PGFF and 0.28 for PP for AR(1)-coefficients of 0.95.
We conclude that contemporary cointegration tests may be applied with caution in the high-frequency setting of one-minute stock return data. In particular, we recommend pre-evaluating the cointegration relationship with the procedure described in this paper and in [25] to assess the size and magnitude of jumps. Then, our tables provide a sensible orientation, whether cointegration testing has acceptable power. Across all of the above-mentioned stylized facts, the PGFF and the PP tests exhibit the most favorable power properties. These tests generally dominate all other tests in all settings, where PGFF is slightly better than PP.

Acknowledgments

We are grateful to Matthew Clegg, Thomas Fischer, Ingo Klein, Helmut Lütkepohl, Benedikt Mangold, Michael McAleer, Johannes Stübinger, the participants of the CEQURA Conference on Advances in Financial and Insurance Risk Management 2015 and two anonymous referees for many helpful discussions and suggestions on this topic.

Author Contributions

The research method was conceived of by C.K. The data were provided by Deutsche Börse, transferred to the database KDB by K.H. and preprocessed in R by C.K. The analyses were performed by C.K. and reviewed by C.K. and K.H. The paper was initially drafted by C.K. and revised, refined and finalized by C.K. and KH.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. E. Elyasiani, and A.E. Kocagil. “Interdependence and dynamics in currency futures markets: A multivariate analysis of intraday data.” J. Bank. Financ. 25 (2001): 1161–1186. [Google Scholar] [CrossRef]
  2. J. Hasbrouck. “Intraday price formation in U.S. equity index markets.” J. Finance 58 (2003): 2375–2399. [Google Scholar] [CrossRef]
  3. C.L. Dunis, G. Giorgioni, J. Laws, and J. Rudy. Statistical Arbitrage and High-Frequency Data with an Application to Eurostoxx 50 Equities. Working Paper; Liverpool, UK: Liverpool Business School, 2010. [Google Scholar]
  4. P.C. Pati, and P. Rajib. “Intraday return dynamics and volatility spillovers between NSE S&P CNX Nifty stock index and stock index futures.” Appl. Econ. Lett. 18 (2011): 567–574. [Google Scholar]
  5. J. Yang, Z. Yang, and Y. Zhou. “Intraday price discovery and volatility transmission in stock index and stock index futures markets: Evidence from China.” J. Futur. Mark. 32 (2012): 99–121. [Google Scholar] [CrossRef]
  6. S. Johansen. “Statistical analysis of cointegration vectors.” J. Econ. Dyn. Control 12 (1988): 231–254. [Google Scholar] [CrossRef]
  7. S. Johansen, and K. Juselius. “Maximum likelihood estimation and inference on cointegration–With applications to the demand for money.” Oxf. Bull. Econ. Stat. 52 (1990): 169–210. [Google Scholar] [CrossRef]
  8. S. Johansen. “Estimation and hypothesis testing of cointegration vectors in Gaussian vector autoregressive models.” Econometrica 59 (1991): 1551–1580. [Google Scholar] [CrossRef]
  9. J.J. Kremers, N.R. Ericsson, and J.J. Dolado. “The power of cointegration tests.” Oxf. Bull. Econ. Stat. 54 (1992): 325–348. [Google Scholar] [CrossRef]
  10. A.A. Haug. “Tests for cointegration a Monte Carlo comparison.” J. Econom. 71 (1996): 89–115. [Google Scholar] [CrossRef]
  11. K. Hubrich, H. Lütkepohl, and P. Saikkonen. “A review of systems cointegration tests.” Econom. Rev. 20 (2001): 247–318. [Google Scholar] [CrossRef]
  12. H.P. Boswijk, A. Lucas, and N. Taylor. A Comparison of Parametric, Semi-Nonparametric, Adaptive, and Nonparametric Cointegration Tests. Amsterdam, The Netherlands: Tinbergen Institute Discussion Papers, 1999. [Google Scholar]
  13. A. Rahbek, E. Hansen, and J.G. Dennis. ARCH Innovations and Their Impact on Cointegration Rank Testing. Working Paper; Copenhagen, Denmark: Department of Theoretical Statistics, University of Copenhagen, 2002. [Google Scholar]
  14. G. Cavaliere, A. Rahbek, and A.M.R. Taylor. “Testing for co-integration in vector autoregressions with non-stationary volatility.” J. Econom. 158 (2010): 7–24. [Google Scholar] [CrossRef]
  15. G. Cavaliere, A. Rahbek, and A.M.R. Taylor. “Cointegration rank testing under conditional heteroskedasticity.” Econometr. Theor. 26 (2010): 1719–1760. [Google Scholar] [CrossRef]
  16. N.S. Balke, and T.B. Fomby. “Threshold cointegration.” Int. Econ. Rev. 38 (1997): 627–645. [Google Scholar] [CrossRef]
  17. B.E. Hansen, and B. Seo. “Testing for two-regime threshold cointegration in vector error-correction models.” J. Econom. 110 (2002): 293–318. [Google Scholar] [CrossRef]
  18. M. Seo. “Bootstrap testing for the null of no cointegration in a threshold vector error correction model.” J. Econom. 134 (2006): 129–150. [Google Scholar] [CrossRef]
  19. C. Alexander. “Optimal hedging using cointegration.” Philos. Trans. R. Soc. Lond. A Math. Phys. Eng. Sci. 357 (1999): 2039–2058. [Google Scholar] [CrossRef]
  20. C. Alexander, and A. Dimitriu. “Indexing and statistical arbitrage.” J. Portf. Manag. 31 (2005): 50–63. [Google Scholar] [CrossRef]
  21. E. Gatev, W.N. Goetzmann, and K.G. Rouwenhorst. “Pairs trading: Performance of a relative-value arbitrage rule.” Rev. Financial Stud. 19 (2006): 797–827. [Google Scholar] [CrossRef]
  22. J.E. Ingersoll. Theory of Financial Decision Making. Totowa, NJ, USA: Rowman & Littlefield, 1987. [Google Scholar]
  23. Z. Chen, and P.J. Knez. “Measurement of market integration and arbitrage.” Rev. Financial Stud. 8 (1995): 287–325. [Google Scholar] [CrossRef]
  24. S. Andrade, V. Di Pietro, and M. Seasholes. Understanding the Profitability of Pairs Trading. Working Paper; Evanston, IL, USA: UC Berkeley, Northwestern University, 2005. [Google Scholar]
  25. E. Jondeau, J. Lahaye, and M. Rockinger. “Asymmetry in the price impact of trades in an high-frequency microstructure model with jumps.” J. Bank. Finance. 61 (2015): 205–224. [Google Scholar] [CrossRef]
  26. A. Trapletti, K. Hornik, and B. LeBaron. “tseries: Time series analysis and computational finance.” Available online: https://cran.r-project.org/web/packages/tseries/index.html (accessed on 6 February 2017).
  27. R Development Core Team. “R: A language and environment for statistical computing.” Available online: https://www.r-project.org/ (accessed on 6 February 2017).
  28. S. Graves. “FinTS: Companion to Tsay (2005) Analysis of financial time series.” Available online: https://rdrr.io/cran/FinTS/ (accessed on 6 February 2017).
  29. O.E. Barndorff-Nielsen, and N. Shephard. “Econometrics of testing for jumps in financial economics using bipower variation.” J. Financial Econom. 4 (2006): 1–30. [Google Scholar] [CrossRef]
  30. K. Boudt, J. Cornelissen, S. Payseur, G. Nguyen, and M. Schermer. “High frequency: Tools for high frequency data analysis.” Available online: https://rdrr.io/cran/highfrequency/ (accessed on 6 February 2017).
  31. K.S. Chan, and B. Ripley. “TSA: Time series analysis.” Available online: https://rdrr.io/cran/TSA/ (accessed on 6 February 2017).
  32. R. Luukkonen, P. Saikkonen, and T. Teräsvirta. “Testing linearity against smooth transition autoregressive models.” Biometrika 75 (1988): 491–499. [Google Scholar] [CrossRef]
  33. A. Ghalanos. “Twinkle: Dynamic smooth transition ARMAX models.” Available online: http://past.rinfinance.com/agenda/2014/talk/AlexiosGhalanos.pdf (accessed on 6 February 2017).
  34. A. Ghalanos. “Rugarch: Univariate GARCH models.” Available online: https://rdrr.io/cran/rugarch/ (accessed on 6 February 2017).
  35. R.F. Engle, and M.E. Sokalska. “Forecasting intraday volatility in the US equity market: Multiplicative component GARCH.” J. Financial Econom. 10 (2012): 54–83. [Google Scholar] [CrossRef]
  36. K. Herrmann, S. Teis, and W. Yu. Components of Intraday Volatility and Their Prediction at Different Sampling Frequencies with Application to DAX and BUND Futures. IWQW Discussion Paper Series; Erlangen, Germany: University of Erlangen-Nürnberg, 2014. [Google Scholar]
  37. R. Cont. “Empirical properties of asset returns: Stylized facts and statistical issues.” Quantit. Finance 1 (2001): 223–236. [Google Scholar] [CrossRef]
  38. B. Pfaff. Analysis of Integrated and Cointegrated Time Series with R. New York, NY, USA: Springer, 2008. [Google Scholar]
  39. D.A. Dickey, and W.A. Fuller. “Distribution of the estimators for autoregressive time series with a unit root.” J. Am. Stat. Assoc. 74 (1979): 427–431. [Google Scholar] [CrossRef]
  40. R.S. Tsay. Multivariate Time Series Analysis: With R and Financial Applications. Wiley Series in Probability and Statistics; Hoboken, NJ, USA: John Wiley & Sons, 2014. [Google Scholar]
  41. J.A. Doornik. “Approximations to the asymptotic distributions of cointegration tests.” J. Econ. Surv. 12 (1998): 573–593. [Google Scholar] [CrossRef]
  42. F. Yang. “CommonTrend: Extract and plot common trends from a cointegration system. Calculate p-value for Johansen statistics. ” Available online: https://rdrr.io/cran/CommonTrend/ (accessed on 6 February 2017).
  43. B.M. Hill. “A simple general approach to inference about the tail of a distribution.” Ann. Statist. 3 (1975): 1163–1174. [Google Scholar] [CrossRef]
  44. J. Breitung. “Rank tests for nonlinear cointegration.” J. Bus. Econ. Stat. 19 (2001): 331–340. [Google Scholar] [CrossRef]
  45. M. Clegg. “On the persistence of cointegration in pairs trading. ” Available online: https://ssrn.com/abstract=2491201 (accessed on 6 February 2017).
  46. D.N. Politis, and J.P. Romano. “The stationary bootstrap.” J. Am. Stat. Assoc. 89 (1994): 1303–1313. [Google Scholar] [CrossRef]
  47. A. Azzalini, and A. Capitanio. The Skew-Normal and Related Families. Cambridge, UK: Cambridge University Press, 2014. [Google Scholar]
  48. G. Hong, and R. Susmel. Pairs-Trading in the Asian ADR Market. Working Paper; Houston, TX, USA: University of Houston, 2003. [Google Scholar]
  49. T. Bollerslev. “Generalized autoregressive conditional heteroskedasticity.” J. Econom. 31 (1986): 307–327. [Google Scholar] [CrossRef]
  50. D.B. Nelson. “Stationarity and persistence in the GARCH(1,1) model.” Econometr. Theor. 6 (1990): 318–334. [Google Scholar] [CrossRef]
  51. L.R. Glosten, R. Jagannathan, and D.E. Runkle. “On the relation between the expected value and the volatility of the nominal excess return on stocks.” J. Finance 48 (1993): 1779–1801. [Google Scholar] [CrossRef]
  52. D.B. Nelson. “Conditional heteroskedasticity in asset returns: A new approach.” Econometrica 59 (1991): 347–370. [Google Scholar] [CrossRef]
  53. M. McAleer, F. Chan, S. Hoti, and O. Lieberman. “Generalized autoregressive conditional correlation.” Econometr. Theor. 24 (2008): 1554–1583. [Google Scholar] [CrossRef]
  54. H. Tong. “On a threshold model.” In Pattern Recognition and Signal Processing. Nato Advanced Study Institutes: Series E, Applied Sciences; Edited by C.H. Chen. Alphen aan den Rijn, The Netherlands: Sijthoff & Noordhoff, 1978, Volume 29. [Google Scholar]
  55. T. Teräsvirta. “Specification, estimation, and evaluation of smooth transition autoregressive models.” J. Am. Stat. Assoc. 89 (1994): 208–218. [Google Scholar] [CrossRef]
  56. D. van Dijk, and P.H. Franses. “Modeling multiple regimes in the business cycle.” Macroecon. Dyn. 3 (1999): 311–340. [Google Scholar] [CrossRef]
  57. F. Chan, and M. McAleer. “Maximum likelihood estimation of STAR and STAR-GARCH models: Theory and Monte Carlo evidence.” J. Appl. Econom. 17 (2002): 509–534. [Google Scholar] [CrossRef]
  58. F. Chan, and M. McAleer. “Estimating smooth transition autoregressive models with GARCH Errors in the presence of extreme observations and outliers.” Appl. Financial Econ. 13 (2003): 581–592. [Google Scholar] [CrossRef]
  59. K.S. Chan, J.D. Petruccelli, H. Tong, and S.W. Woolford. “A multiple-threshold AR(1) model.” J. Appl. Probab. 22 (1985): 267. [Google Scholar] [CrossRef]
  60. S.M. Ross. Stochastic Processes, 2nd ed. Wiley Series in Probability and Statistics; Probability and Statistics; New York, NY, USA: Wiley, 1996. [Google Scholar]
  61. B.W. Silverman. Density Estimation for Statistics and Data Analysis. London, UK: New York, NY, USA: Chapman and Hall, 1986. [Google Scholar]
  62. S.J. Sheather, and M.C. Jones. “A reliable data-based bandwidth selection method for kernel density estimation.” J. R. Stat. Soc. Ser. B, 1991, 683–690. [Google Scholar]
  63. D.A. Dickey, and W.A. Fuller. “Likelihood ratio statistics for autoregressive time series with a unit root.” Econometrica 49 (1981): 1057–1072. [Google Scholar] [CrossRef]
  64. P. Perron. “Trends and random walks in macroeconomic time series.” J. Econ. Dyn. Control 12 (1988): 297–332. [Google Scholar] [CrossRef]
  65. P.C.B. Phillips, and S. Ouliaris. “Asymptotic properties of residual based tests for cointegration.” Econometrica 58 (1990): 165–193. [Google Scholar] [CrossRef]
  66. S.G. Pantula, G. Gonzalez-Farias, and W.A. Fuller. “A comparison of unit-root test criteria.” J. Bus. Econ. Stat. 12 (1994): 449–459. [Google Scholar] [CrossRef]
  67. J. Breitung. “Nonparametric tests for unit roots and cointegration.” J. Econom. 108 (2002): 343–363. [Google Scholar] [CrossRef]
  68. J. Breitung, and A.M.R. Taylor. “Corrigendum to “Nonparametric tests for unit roots and cointegration” [J. Econom. 108 (2002) 343–363].” J. Econom. 117 (2003): 401–404. [Google Scholar] [CrossRef]
  69. G. Elliott, T.J. Rothenberg, and J.H. Stock. “Efficient tests for an autoregressive unit root.” Econometrica 64 (1996): 813–836. [Google Scholar] [CrossRef]
  70. P. Schmidt, and P.C.B. Phillips. “LM tests for a unit root in the presence of deterministic trends.” Oxf. Bull. Econ. Stat. 54 (1992): 257–287. [Google Scholar] [CrossRef]
  71. G. Cavaliere. “Testing the unit root hypothesis using generalized range statistics.” Econom. J. 4 (2001): 70–88. [Google Scholar] [CrossRef]
  72. D. Wuertz, and M.S. Taqqu. “fArma: ARMA time series modelling.” Available online: https://rdrr.io/cran/fArma/ (accessed on 6 February 2017).
  73. C. Krauss. “Statistical arbitrage pairs trading strategies: Review and outlook.” J. Econ. Surv. Forthcom., 2016. [Google Scholar] [CrossRef]
  74. C. Alexander. Practical Financial Econometrics, Reprinted March 2009 ed. Volume 2—Market Risk Analysis; Chichester, UK: Wiley, 2009. [Google Scholar]
  • 1.Multiple regime smooth transition autoregressive process.
  • 2.A value of four is chosen ad hoc, as a compromise between partially preserving serial dependence in returns or squared returns and sufficient randomization. The latter refers to the fact that setting large block lengths leads to the risk of creating the simulated time series just on the basis of a few selected stocks. A value of four puts more emphasis on introducing a higher level of diversity.
  • 3.We replicate the analyses of Table 1 on bootstrap simulated data in order to see if the bootstrap preserves the stylized facts. It turns out that almost all results are very similar. We mainly observe a decrease in detected ARCH effects, as volatility patterns cannot be perfectly replicated with small block lengths.
  • 4.The point at which F 12 converges to zero depends on the choice of γ.
  • 5.Clearly, the mixed AR coefficient varies with the parameters ϕ 1 , ϕ 2 , ϕ 3 , but for simplicity reasons and better comparability, it was fixed ad hoc at 0.95.
  • 6.This auxiliary metric is used to define fixed thresholds, even in light of potential non-stationarities.
  • 7.To see this effect, estimate the AR(1)-coefficient of two processes: (1) a simple stationary AR(1)-process with coefficient ϕ 1 ; and (2) the same process contaminated by a single large non-reversible jump. The estimate of the latter is biased towards the nonstationary case.
  • 8.For such an estimation, the median of the absolute value of t-distributed innovations with five degrees of freedom and standard deviation of 0.0059 can be calculated as 0.0033. See Table 3 for the relevant parameter values used in this estimation.
Figure 1. Average realized volatility per one minute of the trading day (in CET) for selected stocks in 2014.
Figure 1. Average realized volatility per one minute of the trading day (in CET) for selected stocks in 2014.
Jrfm 10 00007 g001
Figure 2. Distribution of parameter estimates, when fitting AR(1)-GARCH(1,1) models to the 41,069 sets of cointegration residuals between the DAX 30 constituents identified on 249 trading days in 2014.
Figure 2. Distribution of parameter estimates, when fitting AR(1)-GARCH(1,1) models to the 41,069 sets of cointegration residuals between the DAX 30 constituents identified on 249 trading days in 2014.
Jrfm 10 00007 g002
Table 1. Evaluation of minute-binned stock return data of the DAX 30 constituents over the 249 trading days in 2014. Subject to testing are raw returns, the residuals after AR(1)-filtration and the residuals after AR(1)-GARCH(1,1)-filtration. Depicted is the share of tests with a p-value less than 0.05.
Table 1. Evaluation of minute-binned stock return data of the DAX 30 constituents over the 249 trading days in 2014. Subject to testing are raw returns, the residuals after AR(1)-filtration and the residuals after AR(1)-GARCH(1,1)-filtration. Depicted is the share of tests with a p-value less than 0.05.
TypeTestRaw ReturnsAR(1)AR(1)-GARCH(1,1)
Non-normalityJarque–Bera test1.001.001.00
Box test0.570.550.03
ARCH effectsEngle’s ARCH test0.710.690.04
BDS test0.840.760.24
Tsay test0.500.450.23
Luukkonen test0.410.400.08
NonlinearityTeräsvirta test0.370.360.07
JumpsBNS test1.000.991.00
Table 2. Evaluation of 41,069 sets of cointegration residuals between the DAX 30 constituents identified on 249 trading days in 2014. Subject to testing are the raw data, the residuals after AR(1)-filtration and the residuals after AR(1)-GARCH(1,1)-filtration. Depicted is the share of tests with a p-value less than 0.05.
Table 2. Evaluation of 41,069 sets of cointegration residuals between the DAX 30 constituents identified on 249 trading days in 2014. Subject to testing are the raw data, the residuals after AR(1)-filtration and the residuals after AR(1)-GARCH(1,1)-filtration. Depicted is the share of tests with a p-value less than 0.05.
TypeTestRaw DataAR(1)AR(1)-GARCH(1,1)
Non-normalityJarque–Bera test0.890.990.99
Box test1.000.700.13
ARCH effectsEngle’s ARCH test1.000.750.07
BDS test1.000.640.49
Tsay test0.450.540.27
Luukkonen test0.550.380.15
NonlinearityTeräsvirta test0.550.350.16
JumpsBNS test1.000.981.00
Table 3. Summary statistics of parameter estimates, when fitting AR(1)-GARCH(1,1) models to the 41,069 sets of cointegration residuals between the DAX 30 constituents identified on 249 trading days in 2014.
Table 3. Summary statistics of parameter estimates, when fitting AR(1)-GARCH(1,1) models to the 41,069 sets of cointegration residuals between the DAX 30 constituents identified on 249 trading days in 2014.
ϕ 1 α 0 α 1 β 1 α 1 + β 1 ν σ G
Minimum0.751870.000000.000000.000000.000002.100000.00000
1st Quartile0.958200.000010.022440.849150.934383.747720.00029
Median0.973700.000040.044910.921240.968794.628100.00119
Mean0.969040.000160.064140.875420.939565.056120.00585
3rd Quartile0.985160.000120.081260.952680.982775.696190.00312
Maximum1.000000.039820.929030.999000.9990099.993610.27104
Table 4. Summary statistics for reversible and non-reversible jumps detected in the 41,069 sets of cointegration residuals between the DAX 30 constituents identified on 249 trading days in 2014.
Table 4. Summary statistics for reversible and non-reversible jumps detected in the 41,069 sets of cointegration residuals between the DAX 30 constituents identified on 249 trading days in 2014.
Panel A: Number of Jumps per Day q 5 % q 25 % q 50 % Mean q 75 % q 95 %
Reversible jumps0.002.003.003.665.008.00
Non-reversible jumps2.004.006.006.888.0013.00
Panel B: Size of Jumps in EUR q 5 % q 25 % q 50 % Mean q 75 % q 95 %
Reversible jumps0.020.050.100.120.160.31
Non-reversible jumps0.020.060.110.170.190.45
Table 5. Set of cointegration tests applied in this study. Note that critical values for cointegration testing and corresponding p-values for all deployed tests are taken from egcm; see [45].
Table 5. Set of cointegration tests applied in this study. Note that critical values for cointegration testing and corresponding p-values for all deployed tests are taken from egcm; see [45].
Cointegration Test R Implementation
Augmented Dickey–Fuller test[39,63]tseries[26]
Phillips–Perron test[64,65]tseries[26]
Pantula, Gonzalez-Farias and Fuller test[66]egcm[45]
Breitung’s variance ratio test[67,68]egcm[45]
Johansen’s eigenvalue test[6,7,8]urca, vars[38]
Johansen’s trace test[6,7,8]urca, vars[38]
Elliott-Rothenberg-Stock point optimal test[69]urca[38]
Elliott-Rothenberg-Stock Dickey-Fuller Generalized Least Squares test[69]urca[38]
Schmidt and Phillips rho statistic[70]urca[38]
Based on Hurst exponent[71]fArma[72]
Table 6. Parameter settings for the six types of Monte Carlo simulations conducted in this study and described in Subsection 3.4.2.
Table 6. Parameter settings for the six types of Monte Carlo simulations conducted in this study and described in Subsection 3.4.2.
MC TypeType IType IIType III
ProcessAR(1)AR(1)AR(1)-GARCH(1,1)
DistributionNormaltt
ϕ 1 1.000.950.900.851.000.950.900.851.000.950.900.85
MC TypeType IVType VType VI
ProcessSTAR(1)-GARCH(1,1)STAR(1)-GARCH(1,1)STAR(1)-GARCH(1,1)
Regimes333
Jumps-reversiblenon-reversible
Distributionttt
ϕ L 1.000.950.900.851.000.950.900.851.000.950.900.85
ϕ M 1.001.001.001.001.001.001.001.001.001.001.001.00
ϕ U 1.000.950.900.851.000.950.900.851.000.950.900.85
c 1 –1.00–5.00–10.00 –1.00–1.00
c 2 1.005.0010.00 1.001.00
γ 1 1.005.0010.00 5.005.00
γ 2 1.005.0010.00 5.005.00
λ w 2.003.008.00
m w 0.050.100.30
λ p 4.006.0012.00
m p 0.050.100.45
Table 7. Ten thousand Type VI cointegration residuals with path length 510 and given mean-reversion coefficient ϕ are generated and contaminated with six non-reversible jumps ( λ p = 6 ) with median size m p = 0.10 . Depicted are the average estimated mean-reversion coefficients ϕ ^ compared to their true values ϕ.
Table 7. Ten thousand Type VI cointegration residuals with path length 510 and given mean-reversion coefficient ϕ are generated and contaminated with six non-reversible jumps ( λ p = 6 ) with median size m p = 0.10 . Depicted are the average estimated mean-reversion coefficients ϕ ^ compared to their true values ϕ.
ϕ L = ϕ U 0.8500.9000.950
ϕ ^ 0.9560.9630.971
Table 8. Size of cointegration tests for Monte Carlo settings of Type I, Type II and Type III, as described in Subsection 3.4.2.
Table 8. Size of cointegration tests for Monte Carlo settings of Type I, Type II and Type III, as described in Subsection 3.4.2.
Test ϕ 1 Type IType IIType III
pp1.000.060.060.07
adf1.000.050.050.06
jo-e1.000.060.060.06
jo-t1.000.060.060.07
ers-p1.000.050.050.06
ers-d1.000.050.050.05
sp-r1.000.050.050.05
hurst1.000.050.050.06
bvr1.000.050.050.05
pgff1.000.060.060.06
Table 9. Power of cointegration tests for Monte Carlo settings of Type I, Type II and Type III, as described in Subsection 3.4.2.
Table 9. Power of cointegration tests for Monte Carlo settings of Type I, Type II and Type III, as described in Subsection 3.4.2.
Test ϕ 1 Type IType IIType III
pp0.950.840.850.83
pp0.901.001.001.00
pp0.851.001.001.00
adf0.950.640.630.64
adf0.900.980.980.98
adf0.851.001.001.00
jo-e0.950.560.560.58
jo-e0.901.001.000.99
jo-e0.851.001.001.00
jo-t0.950.520.530.54
jo-t0.900.990.990.98
jo-t0.851.001.001.00
ers-p0.950.690.700.70
ers-p0.900.910.910.90
ers-p0.850.970.960.96
ers-d0.950.650.670.68
ers-d0.900.860.860.87
ers-d0.850.920.920.92
sp-r0.950.630.620.62
sp-r0.900.970.970.96
sp-r0.851.001.000.99
hurst0.950.430.440.44
hurst0.900.730.720.73
hurst0.850.840.850.85
bvr0.950.530.530.54
bvr0.900.810.810.81
bvr0.850.910.910.91
pgff0.950.870.890.87
pgff0.901.001.001.00
pgff0.851.001.001.00
Table 10. Size of cointegration tests for Monte Carlo settings of Type IV.
Table 10. Size of cointegration tests for Monte Carlo settings of Type IV.
1.001.001.005.005.005.0010.0010.0010.00 c i
Test ϕ L = ϕ U 1.005.0010.001.005.0010.001.005.0010.00 γ i
pp1.000.070.060.060.060.060.060.070.060.07
adf1.000.060.060.060.060.060.060.060.060.06
jo-e1.000.070.070.070.070.070.070.070.070.07
jo-t1.000.070.060.060.070.060.060.060.070.06
ers-p1.000.050.050.050.050.060.050.050.050.06
ers-d1.000.050.050.050.050.050.050.050.060.05
sp-r1.000.060.050.050.050.060.050.050.050.05
hurst1.000.050.050.050.050.050.050.060.050.05
bvr1.000.050.050.050.050.050.050.050.050.05
pgff1.000.060.060.070.070.070.070.070.060.07
Table 11. Power of cointegration tests for Monte Carlo settings of Type IV, as described in Subsection 3.4.2.
Table 11. Power of cointegration tests for Monte Carlo settings of Type IV, as described in Subsection 3.4.2.
1.001.001.005.005.005.0010.0010.0010.00 c i
Test ϕ L = ϕ U 1.005.0010.001.005.0010.001.005.0010.00 γ i
pp0.950.780.650.590.540.230.160.380.130.10
pp0.900.990.970.950.900.410.270.700.220.13
pp0.851.000.980.970.950.510.340.810.280.16
adf0.950.580.460.420.390.170.130.280.110.09
adf0.900.960.870.810.800.330.200.580.180.11
adf0.851.000.950.930.920.430.270.740.230.13
jo-e0.950.500.380.340.330.130.110.220.090.08
jo-e0.900.970.890.840.800.300.180.550.160.10
jo-e0.851.000.970.950.920.400.250.740.220.11
jo-t0.950.480.360.320.310.130.100.210.090.08
jo-t0.900.960.850.790.760.280.180.520.150.09
jo-t0.850.990.960.940.900.380.230.710.200.11
ers-p0.950.670.580.550.520.240.170.360.130.10
ers-p0.900.880.820.790.780.390.260.610.210.13
ers-p0.850.950.910.880.870.470.310.730.250.16
ers-d0.950.650.570.530.500.230.160.350.130.09
ers-d0.900.860.790.750.740.360.240.600.200.13
ers-d0.850.920.860.830.840.460.300.700.250.14
sp-r0.950.570.470.430.410.180.130.280.100.08
sp-r0.900.940.850.780.780.320.200.570.180.11
sp-r0.850.990.950.910.900.410.260.720.210.13
hurst0.950.410.340.320.310.140.110.220.090.08
hurst0.900.690.610.550.560.250.170.420.150.09
hurst0.850.830.740.690.710.320.220.550.190.11
bvr0.950.510.430.410.390.190.140.290.120.08
bvr0.900.770.700.630.650.310.200.490.170.11
bvr0.850.900.810.760.780.380.250.630.220.12
pgff0.950.810.680.640.580.230.170.390.140.10
pgff0.901.000.970.960.910.420.280.710.230.14
pgff0.851.000.980.970.950.520.350.820.280.17
Table 12. Size of cointegration tests for Monte Carlo settings of Type V.
Table 12. Size of cointegration tests for Monte Carlo settings of Type V.
2.002.002.003.003.003.008.008.008.00 λ w
Test ϕ L = ϕ U 0.050.100.300.050.100.300.050.100.30 m w
pp1.000.060.060.060.060.060.060.070.060.05
adf1.000.060.060.060.060.060.060.060.060.06
jo-e1.000.070.070.070.070.070.070.070.070.07
jo-t1.000.070.070.070.070.060.070.060.070.07
ers-p1.000.050.050.050.050.050.050.050.050.05
ers-d1.000.050.050.050.050.050.050.050.050.05
sp-r1.000.050.050.040.050.050.050.050.050.05
hurst1.000.050.050.060.050.050.060.050.050.06
bvr1.000.040.050.050.050.050.050.050.050.05
pgff1.000.060.060.060.060.060.060.070.060.05
Table 13. Power of cointegration tests for Monte Carlo settings of Type V, as described in Subsection 3.4.2.
Table 13. Power of cointegration tests for Monte Carlo settings of Type V, as described in Subsection 3.4.2.
0.002.002.002.003.003.003.008.008.008.00 λ w
Test ϕ L = ϕ U 0.000.050.100.300.050.100.300.050.100.30 m w
pp0.950.650.650.670.700.650.660.710.660.670.74
pp0.900.970.960.970.970.970.970.970.960.970.98
pp0.850.980.980.980.980.980.980.980.980.980.98
adf0.950.460.470.470.490.470.470.490.470.470.51
adf0.900.870.870.870.890.880.880.900.870.890.91
adf0.850.950.960.960.960.950.950.960.960.960.96
jo-e0.950.380.380.380.390.380.390.390.380.380.43
jo-e0.900.890.890.900.920.890.900.920.900.910.95
jo-e0.850.970.970.970.970.970.970.970.970.970.98
jo-t0.950.360.360.360.370.370.360.370.360.360.39
jo-t0.900.850.850.860.890.860.860.890.860.870.93
jo-t0.850.960.960.960.970.960.960.970.960.970.98
ers-p0.950.580.590.610.650.600.610.660.590.610.68
ers-p0.900.820.830.840.850.830.840.860.830.840.87
ers-p0.850.910.900.900.910.900.910.910.900.910.92
ers-d0.950.570.560.580.620.580.580.640.570.600.65
ers-d0.900.790.800.800.830.790.810.830.800.800.85
ers-d0.850.860.870.870.890.870.870.890.870.880.90
sp-r0.950.470.470.470.500.470.470.500.470.480.51
sp-r0.900.850.850.850.860.840.850.870.840.860.88
sp-r0.850.950.950.940.940.950.940.940.950.950.94
hurst0.950.340.340.340.320.340.330.320.340.340.35
hurst0.900.610.610.600.600.600.600.600.590.600.62
hurst0.850.740.740.740.740.750.740.750.740.740.76
bvr0.950.430.430.430.430.430.430.430.420.440.45
bvr0.900.700.700.690.710.690.690.700.690.700.72
bvr0.850.810.820.820.820.820.820.830.820.830.84
pgff0.950.680.690.680.730.680.690.740.680.700.78
pgff0.900.970.970.970.970.970.970.970.970.970.98
pgff0.850.980.980.980.980.980.980.980.980.980.99
Table 14. Size of cointegration tests for Monte Carlo settings of Type VI (see Subsection 3.4.2).
Table 14. Size of cointegration tests for Monte Carlo settings of Type VI (see Subsection 3.4.2).
4.004.004.006.006.006.0012.0012.0012.00 λ p
Test ϕ L = ϕ U 0.050.100.450.050.100.450.050.100.45 m p
pp1.000.060.060.060.060.060.060.060.050.06
adf1.000.060.060.060.060.060.060.060.060.06
jo-e1.000.070.070.080.070.070.070.070.070.07
jo-t1.000.060.060.070.060.060.070.060.070.07
ers-p1.000.050.050.050.060.050.050.050.050.05
ers-d1.000.050.050.050.050.050.050.050.050.05
sp-r1.000.050.050.050.050.050.040.050.050.04
hurst1.000.050.050.060.050.050.060.050.050.05
bvr1.000.050.050.040.050.050.050.040.050.04
pgff1.000.070.060.050.060.060.060.060.060.05
Table 15. Power of cointegration tests for Monte Carlo settings of Type VI, as described in Subsection 3.4.2.
Table 15. Power of cointegration tests for Monte Carlo settings of Type VI, as described in Subsection 3.4.2.
0.004.004.004.006.006.006.0012.0012.0012.00 λ p
Test ϕ L = ϕ U 0.000.050.100.450.050.100.450.050.100.45 m p
pp0.950.650.450.340.140.410.280.110.320.200.08
pp0.900.970.740.560.190.680.460.130.560.330.08
pp0.850.980.790.620.220.740.520.150.620.370.08
adf0.950.460.320.250.120.280.200.090.230.150.07
adf0.900.870.580.410.160.510.330.110.400.220.07
adf0.850.950.660.470.170.580.380.110.450.250.07
jo-e0.950.380.240.180.110.210.150.090.170.120.07
jo-e0.900.890.550.380.150.480.300.110.360.200.08
jo-e0.850.970.680.480.170.600.380.120.450.240.08
jo-t0.950.360.220.170.110.210.140.080.160.110.07
jo-t0.900.850.530.360.140.450.280.100.330.180.08
jo-t0.850.960.650.460.170.580.360.110.430.220.08
ers-p0.950.580.390.280.120.350.240.080.270.170.06
ers-p0.900.820.560.400.130.500.330.090.390.230.07
ers-p0.850.910.620.450.150.550.370.090.440.260.07
ers-d0.950.570.370.260.110.330.220.080.260.170.06
ers-d0.900.790.520.370.130.470.310.090.360.200.06
ers-d0.850.860.570.400.130.500.340.090.400.220.06
sp-r0.950.470.320.230.090.270.190.070.220.130.05
sp-r0.900.850.530.360.120.460.300.080.340.190.05
sp-r0.850.950.620.400.130.520.310.080.390.200.05
hurst0.950.340.220.160.100.200.130.080.150.100.06
hurst0.900.610.300.210.110.260.170.090.190.110.07
hurst0.850.740.320.220.130.270.170.090.180.110.06
bvr0.950.430.250.190.080.230.150.060.180.110.05
bvr0.900.700.340.220.090.280.180.070.200.110.05
bvr0.850.810.350.230.090.290.170.060.210.110.05
pgff0.950.680.470.350.130.420.290.100.340.200.07
pgff0.900.970.770.580.200.710.510.130.600.350.08
pgff0.850.980.820.650.220.760.560.150.670.420.09

Share and Cite

MDPI and ACS Style

Krauss, C.; Herrmann, K. On the Power and Size Properties of Cointegration Tests in the Light of High-Frequency Stylized Facts. J. Risk Financial Manag. 2017, 10, 7. https://doi.org/10.3390/jrfm10010007

AMA Style

Krauss C, Herrmann K. On the Power and Size Properties of Cointegration Tests in the Light of High-Frequency Stylized Facts. Journal of Risk and Financial Management. 2017; 10(1):7. https://doi.org/10.3390/jrfm10010007

Chicago/Turabian Style

Krauss, Christopher, and Klaus Herrmann. 2017. "On the Power and Size Properties of Cointegration Tests in the Light of High-Frequency Stylized Facts" Journal of Risk and Financial Management 10, no. 1: 7. https://doi.org/10.3390/jrfm10010007

Article Metrics

Back to TopTop