Next Article in Journal
Analysis of Measured Parameters in Relation to the Amount of Fibre in Lightweight Red Ceramic Waste Aggregate Concrete
Next Article in Special Issue
Networks with Complex Weights: Green Function and Power Series
Previous Article in Journal
Global Stabilization of Nonlinear Finite Dimensional System with Dynamic Controller Governed by 1 − d Heat Equation with Neumann Interconnection
Previous Article in Special Issue
Current Trends in Random Walks on Random Lattices
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Simple Out-of-Sample Test of Predictability against the Random Walk Benchmark

1
School of Business, Universidad Adolfo Ibáñez, Peñalolén 8380629, Chile
2
Facultad de Economía y Empresa, Universidad Diego Portales, Huechuraba 8370179, Chile
3
Facultad de Economía y Negocios, Universidad de Talca, Talca 3460000, Chile
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(2), 228; https://doi.org/10.3390/math10020228
Submission received: 2 December 2021 / Revised: 1 January 2022 / Accepted: 5 January 2022 / Published: 12 January 2022
(This article belongs to the Special Issue Latest Advances in Random Walks Dating Back to One Hundred Years)

Abstract

:
We show that a straightforward modification of a trading-based test for predictability displays interesting advantages over the Excess Profitability (EP) test proposed by Anatolyev and Gerco when testing the Driftless Random Walk Hypothesis. Our statistic is called the Straightforward Excess Profitability (SEP) test, and it avoids the calculation of a term that under the null of no predictability should be zero but in practice may be sizable. In addition, our test does not require the strong assumption of independence used to derive the EP test. We claim that dependence is the rule and not the exception. We show via Monte Carlo simulations that the SEP test outperforms the EP test in terms of size and power. Finally, we illustrate the use of our test in an empirical application within the context of the commodity-currencies literature.

1. Introduction

The objective of this paper is to provide a new out-of-sample test of the Random Walk Hypothesis (RWH) for the particular case in which the predictand is a zero mean variable. Leading examples of a setup such as ours include the prediction of high frequency asset returns, for which the economic literature has normally used either a random walk or a driftless random walk as a benchmark to test for predictability. In the context of asset prices, for instance, the driftless random walk posits that today’s price is the best forecast of tomorrow’s price. In other words, this model assumes that future expected returns, given information available today, are zero (strictly speaking, the implication of a zero forecast for future expected returns emerges when a driftless random walk model is assumed for the natural logarithm of the corresponding asset price).
Although extremely simple, this driftless random walk model that generates the trivial zero forecast is of tremendous importance in the financial and economic literature. For instance, in the context of exchange rates, the review by Rossi (2013) [1] identifies the driftless random walk as the most difficult benchmark to beat in forecast competitions. Similarly, Clark and West (2006) [2] derived an out-of-sample test of predictive ability for cases in which a given model is compared to the driftless random walk. Finally, Pincheira and Hardy (2021) [3] explain that part of the problem of beating the driftless random walk relies on its minimum magnitude: The zero forecast is so small that it can outperform a non-zero forecast in terms of Mean Squared Prediction Error despite having zero covariance with the predictand. In addition, driftless random walk is used as a common benchmark in the literature to compare more sophisticated models (see for instance, Meese and Rogoff (1983) [4], Chen, Rogoff and Rossi (2010) [5], Pincheira et al. (2021) [6], Pincheira and Hardy (2018, 2019, 2021) [7,8,9], ref. [2] and many others).
We propose the construction of the Straightforward Excess Profitability (SEP) test. This test is built as a simple modification of the Excess Profitability (EP) test originally developed by Anatolyev and Gerco (2005) [10]. SEP and EP test statistics differ because the former avoids the calculation of a term that under the null of no predictability should be zero but in practice is sizable. In addition, our test does not require the strong assumption of independence used to derive the EP test. We claim that dependence is the rule and not the exception. We also show via Monte Carlo simulations that both size and power of the SEP test are in general better than those of the EP test. Despite these distinctions, the SEP and EP tests share the same rich economic interpretation: The finding of predictability indicates that a given trading strategy provides positive returns on average. Therefore, these two tests depart from the vast literature focused on statistical measures of predictability as Mean Squared Prediction Errors (MSPE), Least Absolute Deviations and Linex loss functions and Direction of Change, among others (see McCracken and West (2002) [11], Chinn and Meese (1995) [12], Cheung, Chinn and Garcia (2005) [13], Patton and Timmermann (2007) [14] and Giacomini and White (2006) [15]).
While the simple RWH is generally rejected when the econometrician engages in conventional in-sample analysis, it is indeed a difficult benchmark to beat when an out-of-sample approach is followed. The seminal paper of [4] is a classic example of this problem in the context of the exchange rate literature. This is sometimes interpreted as indication that in-sample analyses are affected by overfitting or data mining problems and, therefore, should be disregarded. While the conflicting results from in-sample and out-of-sample approaches are not entirely clear, Inoue and Kilian (2005) [16] emphasize that in-sample strategies display higher power relative to out-of-sample analyses. According to this argument, out-of-sample tests of the RWH would fail to reject the null of no predictability mainly due to the low power of these tests. Therefore, it is essential to derive out-of-sample tests of the RWH displaying power improvements with respect to their competitors.
A key point to consider when testing RWH is that traditional statistical methods of comparing predictive accuracy, as those presented by West (1996) [17] and Diebold and Mariano (1995) [18], are not adequate. The reason for this is that the models under analysis are nested (see [17]). McCracken (2007) [19] and Clark and McCracken (2005) [20] show that when comparing nested models, direct application of traditional methods may result in tests of inadequate size. In particular, [2], Clark and West (2007) [21] and Pincheira and West (2016) [22] show that traditional comparisons of MSPE render tests with low power and questionable size when comparing forecasts from nested models.
Two interesting alternatives to construct proper statistical methods to compare predictive accuracy when models are nested are presented by [2,19,21]. In the first paper, an asymptotic distribution of a t-type statistic, comparable to that suggested by [17,18], is derived. While the distribution is nonstandard, tables of asymptotically correct critical values are provided. In the other two papers, refs. [2,21] make an important observation about the null hypothesis of equal MSPE when models are nested. They show that under the null, the sample distribution of the difference in MSPE is not zero. Instead, the sample MSPE from the null model is expected to be smaller than that of the alternative model. They propose an adjusted procedure that considers this finding and demonstrate via simulations that this procedure is well-sized and relatively powerful in small and moderate samples. A third alternative to properly test for model adequacy when models are nested is provided by Chao, Corradi and Swanson (2000) [23]. They provide a test of Granger causality that displays good size and power properties in small samples when applied to the RWH.
Despite these interesting alternatives to properly test RWH, this field of research is still not closed. From a theoretical point of view, we have already mentioned that it is desirable to derive more powerful tests in small and moderate samples and, more generally, to derive tests with better small sample properties. We would also like to have a test with a direct economic interpretation that allows us, for example, to obtain positive returns when using a given trading rule. This is exactly what we perform in this paper. We introduce the SEP test, which has a direct economic interpretation and displays better size and power than the EP test in our simulations.
The rest of the paper is organized as follows: Section 2 introduces the econometric environment; Section 3 shows that the original EP test may be biased under the null hypothesis of a driftless random walk model; Section 4 introduces the new SEP test. Section 5 describes the experimental design and delivers simulation results; Section 6 presents an empirical application within the commodity-currencies literature, and Section 7 provides the conclusion.

2. Econometric Environment

Let us consider the setup introduced by [2] in which two simple models for a scalar stationary and ergodic time series y t + 1 are defined as follows:
Model   1   ( null ) :   y t + 1 = e t + 1
Model   2   ( alternative ) :   y t + 1 = X t + 1 β + e t + 1  
where X t + 1 is a vector of ergodic and stationary exogenous random variables, and e t + 1 is a zero mean martingale difference meaning that E e t + 1 | t = 0 , where F t represents a filtration such that F t is the sigma-field generated by current and past X s and e s .
t = σ X t + 1 ,   e t   ,   X t ,   e t 1 ,   X t 1   ,   e t 2 ,  
If y t + 1 represents log returns of an asset, then Model 1 contains the special case of a driftless random walk model for the natural logarithm of asset prices.
Notice that we are using the index t+1 to denote exogenous variables known at time t. Thus, X t + 1 is a vector containing known variables at time t. Notice also that we will use notations E e t + 1 | F t or E t e t + 1 . The alternative model posits that the conditional expectation of y t + 1 with respect to the filtration F t only depends linearly on the vector X t + 1 and an unknown vector of parameters β .
                                                                              E y t + 1 | F t = X t + 1 β
According to this condition, the null hypothesis may be written in terms of a restriction over the vector of parameters β = 0 against the alternative β 0 .
Ref. [10] provide a very attractive out-of-sample test in a slightly different econometric context but that in principle might be used to test β = 0 against β 0 . They introduce the Excess Profitability (EP) test, which is based on the idea that y t + 1 corresponds to an asset return. Given a forecast y t , t + 1 f built on information available up until time t for the return y t + 1 , they consider the following trading strategy:
  • To buy shares of the asset if the forecast y t , t + 1 f predicts nonnegative returns in the following period; that is to say that y t , t + 1 f 0 ;
  • To sell shares of the asset if the forecast y t , t + 1 f predicts negative returns in the following period; that is to say that y t , t + 1 f < 0 ;
  • The one-period-return of this trading strategy is given by r t + 1 = s i g n y t , t + 1 f y t + 1 , where the following is the case.
    s i g n z = 1           i f           z 0 1       i f         z < 0
Based upon this one-period-return, [10] construct an asymptotically normal statistic under the null hypothesis of conditional mean independence:
A G   null   hypothesis :   E y t + 1 | t = E y t + 1 = μ
where t represents the information set available to the forecaster at time t. This asymptotically normal statistic is given by the following:
EP A T B T V ^ A T B T
where the following is the case:
A T = 1 T t r t + 1
B T = 1 T t s i g n y t , t + 1 f 1 T t y t + 1
and V ^ A T B T represents a consistent estimator of the Variance of A T B T that is computed according to the following expressions.
V ^ E P V ^ A T B T = 4 T 2 p y f ^ 1 p y f ^ t y t + 1 y ¯ 2
p y f ^ = 1 2 1 + 1 T t s i g n y t , t + 1 f
The rationale for using these two expressions above comes from the fact that under the null hypothesis (4) and some strong assumptions of independence (see [10] for details), we have the following.
V A T B T = 4 T 1 T 2 p y f 1 p y f v a r y t + 1
p y f = Pr s i g n y t , t + 1 f = 1
The intuition behind the definition of the EP test relies on the fact that under the null of no predictability, both A T and B T converge in probability to the same value; thus, they are asymptotically equal.
It is tempting to export the EP test to our econometric framework. In fact, our null hypothesis coincides with (4) when we consider the same information sets F t = t and μ = 0 . Nevertheless, when μ = 0 and no additional assumptions of independence are imposed we will show that the core EP statistic may be biased under the null hypothesis when some usual out-of-sample methods are implemented. We observe this next.

3. Bias in the Core EP Statistic

Traditional out-of-sample analyses must split the available sample into two windows: an estimation window and a predictive window. For the sake of simplicity, we will focus on one-step-ahead forecasts, and we will use the notation in [2,17]. If we have a total of T + 1 observations, then the last P will be used for predictions and R = T + 1 P will be used for the initial estimation of the parameters. β t ^ denotes a generic estimate of β with information available until time t. Usually, the updating scheme may be either fixed, rolling or recursive. The fixed scheme is one in which β t ^ is estimated only once using the first R observations. Thus, no updating occurs here. The rolling scheme updates the estimate of β t ^ using always the last R available observations.
The recursive scheme also updates the estimate of β t ^ , but this time using all available information until time t. That is to say that, in the recursive scheme, the estimation sample increases with t. We will work with both the rolling and recursive scheme because they are the most used in empirical applications. It is important to mention that the asymptotic theory in [2] is only compatible with the rolling scheme, yet via simulations the same authors show that their test works well with the recursive scheme, which is also commonly used in applied research.
Let us analyze the behavior of the core EP statistic under the null hypothesis. Under our notation, we will now have a total of P forecasts for evaluation. The EP test should be written according to this notation as follows:
EP A P B P V ^ A P B P
where the following is the case.
A P = 1 P t = R + 1 T + 1 r t
B P = 1 P t = R + 1 T + 1 s i g n ( y t 1 , t f ) 1 P t = R + 1 T + 1 y t
The next proposition shows the behavior of both terms, A P and B P , of the core EP statistic under the null hypothesis.
Proposition 1.
In the econometric framework characterized by models (1) and (2), we have the following under the null hypothesis.
E A P = 0 E B P = 1 P 2 t > j T + 1 E ( s i g n y t 1 , t f y j )
This last term may or may not be zero.
Proof. 
See Proposition 1 in Appendix A. □
Proposition 1 tells us that, under the null hypothesis, the core statistic from the EP test might not be zero when there is dependence between future values of the forecasts and past values of the target variable. A simple example in which E B P 0 is given next.
Let us consider the simple case in which there is only one predictor under the alternative: a constant different from zero. Let us also assume that e t + 1   is not only a zero mean martingale difference sequence but also an independent sequence. Accordingly, here, we assume that the alternative model looks as follows.
Model   2   ( alternative ) :   y t + 1 = c + e t + 1
Under this assumption, the best forecast is given by y t 1 , t f = c 0 . If this constant c is unknown; it needs to be estimated. Let us assume that we use a rolling scheme to estimate c by OLS. Therefore, we have it that our feasible forecast corresponds to the following.
y ^ t 1 , t f = c t ^ = 1 R i = t R t 1 y i ; t = R + 1 , ,   T + 1
In the very particular case when R = 1, we have the following.
y ^ t 1 , t f = c t ^ = y t 1   ;   t = 2 ,   ,   T + 1
Therefore, the following is the case.
E B P = 1 P 2 t > j T + 1 E ( s i g n y ^ t 1 , t f y j = 1 P 2 t > j T + 1 E ( s i g n y t 1 y j = H 0 1 P 2 t > j T + 1 E ( s i g n e t 1 e j = 1 P 2 j = 2 T + 1 E e j s i g n e j = E e j P > 0
In this case, the term under consideration is strictly positive, but it proceeds to zero as the number of forecasts increases.
This is just an example to show that this E B P term might not be zero when the null hypothesis is true. We have used an extreme simplification in the context of a rolling estimation scheme to prove our point. Nevertheless, we should also mention that the term E B P will be zero in some circumstances. For instance, this happens when using the fixed scheme and our alternative model is given by (13). Under these conditions, the forecast y t 1 , t f   is constructed using information contained in F R , and it is never updated again. In this case, we have the following expression of forecasts:
y ^ t 1 , t f = c t ^ = c R ^   for   all   t = R + 1 ,   ,   T + 1
where c R ^ is a function of the data y 1 ,   y 2 ,   ,   y R . A natural candidate for c R ^ is the simple average.
c R ¯ = 1 R j = 1 R y j
Therefore, the following is the case:
E B P = 1 P 2 t > j j = R + 1 T + 1 E ( s i g n y ^ t 1 , t f y j = 1 P 2 t > j j = R + 1 T + 1 E s i g n c R ^ y j = 1 P 2 t > j j = R + 1 T + 1 E E ( s i g n c R ^ y j | F R = 1 P 2 t > j j = R + 1 T + 1 E s i g n c R ^ E y j | F R
but under the null hypothesis, we have the following:
E B P = H 0 1 P 2 t > j j = R + 1 T + 1 E ( s i g n c R ^ E y j | F R ) = 1 P 2 t > j j = R + 1 T + 1 E s i g n c R ^ E e j | F R = 1 P 2 t > j j = R + 1 T + 1 E s i g n c R ^ 0 = 0
and the core test EP statistic is unbiased under the null.
In the next sections, we will observe that the bias of the test statistic may be important in more general examples than the extremely simple one we have described so far. Before that, we show next that for the null hypothesis in (1), we could construct an adjusted EP test, which is unbiased under the null. The idea is simply to remove the B P term from the construction of the test. The new test is called the Straightforward Excess Profitability (SEP) test.

4. The Straightforward Excess Profitability Test (SEP)

Here, we propose a simple modification of the EP test by removing the B P term from the EP test. In other words, we propose the construction of a test for the random walk hypothesis based on the following statistic.
A P = 1 P t = R + 1 T + 1 r t
r t + 1 = s i g n y t , t + 1 f y t + 1
Assuming stationarity and ergodicity for r t we have the following under the null.
H 0 : E r t + 1 = E E r t + 1 | F t = E [ E s i g n y t , t + 1 f e t + 1 | F t ] = E [ s i g n y t , t + 1 f E e t + 1 | F t ] = 0
This is because the following is the case.
s i g n y t , t + 1 f     F t
Under the alternative, we have the following.
H A : E r t + 1 = E E r t + 1 | F t = E [ E s i g n y t , t + 1 f y t + 1 | F t ] E [ E s i g n y t , t + 1 f X t + 1 β + e t + 1 | F t ] = E [ E s i g n y t , t + 1 f X t + 1 β | F t ] + E [ s i g n y t , t + 1 f E e t + 1 | F t ] E [ s i g n y t , t + 1 f X t + 1 β ] = E [ s i g n X t + 1 β t ^ X t + 1 β ]
As long as β t ^ is an adequate estimate of β , then we obtain the following.
H A :   E s i g n X t + 1 β t ^ X t + 1 β E X t + 1 β > 0
Under the null, r t   is a martingale difference sequence; thus, under mild assumptions we could use the central limit theorem for martingale difference sequences (see Hamilton, 1994 [24]) to obtain the following:
P A P = P P t = R + 1 T + 1 r t N 0 , V
where the following is the case.
V = lim P 1 P t = R + 1 T + 1 E [ r t ] 2
We could then construct a consistent estimate of V, which we denote V ^ , to build the following statistic.
S E P P P   t = R + 1 T + 1 r t V ^ N 0 , 1
Such a consistent estimator V   ^ could be given by the following.
V = 1 P t = R + 1 T + 1 r t 2
Considering (16), we will construct a one side test, as under the alternative we expect a positive value for our statistic.
In the next section, we compare size and power properties of EP and SEP tests via simulations. We also include in the comparison the widely cited test developed by [2] in order to have an external and simple benchmark.

5. Monte Carlo Simulations

We use Monte Carlo simulations for three multivariate data-generating processes (DGPs) to evaluate the finite-sample properties of the SEP test. The idea is to compare properties of this test with those of some other tests already available in the literature. We consider two benchmark tests: the traditional EP test and the MSPE-Adjusted test introduced by [2]. This last test is derived from typical comparisons of MSPE between a linear model and the null of a martingale difference sequence. Clark and West claim that the MSPE-Adjusted test has a better size than traditional tests of MSPE comparisons when the models under evaluation are nested. Intuitively, this test shows good size because it does not consider a term that introduces noise into its forecasts by estimating a parameter vector that under the null should be zero. To observe this, we notice that the sample analog of the difference in MSPE between the two models considered in (1) and (2) is given by the following.
Δ M S P E ^ = M S P E ^ 1 M S P E ^ 2 = 1 P t = R T y t + 1 2 1 P t = R T y t + 1 X t + 1 β t ^ 2 = 2 P t = R T y t + 1 X t + 1 β t ^ 1 P t = R T X t + 1 β t ^ 2
Clark and West notice that the second term in the right-hand side introduces a bias that does not vanish as P goes to . They propose to build a test based on the first term in the right-hand side. Their test is given by the following.
M S P E A d j u s t e d :   P 1 / 2 2 P t = R T y t + 1 X t + 1 β t ^ 4 V ^ y t + 1 X t + 1 β t ^
They also show that, under the null hypothesis and mild assumptions, their test is asymptotically normal; thus, the three tests we consider here are asymptotically normal.
We use DGPs based on [2] and [10]. These DGPs are calibrated to match features of exchange rate and asset returns for which the driftless random walk is a plausible null model. Specifically, the first DGP (DGP1) is given by the following process:
DGP1:
y t + 1 = β x t + e t + 1
x t = 0.95 x t 1 + u t ;
e t + 1 = N 0 , σ e 2 ;   u t + 1 = N 0 , σ u 2
with E e t + 1 | F t = 0 ,   E u t + 1 | F t = 0 and σ e 2 = 1 .
DGP1 is calibrated to match exchange rate features based on interest parity; thus, we will have v a r u t = σ u 2 (with σ u = 0.025 ) and c o r r e t , u t = 0 . We set β = 2   in experiments evaluating power and β = 0 in experiments evaluating size. We assume data to be generated from homoscedastic draws of their respective distributions. We are aware that β = 1 is the theoretical implication. Empirical estimations for a number of industrial countries, however, provide estimates around −2 (see [2] for further details).
The next two DGPs are simple processes calibrated to match S&P 500 returns and are identical to those used by [10].
DGP2:
This is a simple first order autoregressive model for S&P returns.
y t = ρ y t 1 + ε t
ε t N 0 , 0.000249
We set ρ = 0.1256 for power results and ρ = 0 for size results.
DGP3:
This DGP is similar to DGP2, but now this process includes a threshold parameter to simulate possible breaks in the data.
y t = c 1 + ρ 1 y t 1 + ε t y t 1 0.01848 c 2 + ρ 2 y t 1 + ε t y t 1 > 0.01848
ε t N 0 , 0 , 000245
We set c 1 = ρ 1 = c 2 = ρ 2 = 0 when analyzing the size of the tests. For power and following [10], we set the following.
c 1 = 0.000844 ;   ρ 1 = 0.2453 ;     c 2 = 0.002679 ;     ρ 2 = 0.0664
For all simulations, we focus only on one-step-ahead forecasts. Estimation always includes a constant term in each regression. We explore the performance of rolling and recursive schemes for a number of sample sizes (T + 1) and decompositions of the sample in the estimation window (size R) and the prediction window (size P, (T + 1 = R + P)). We run simulations for the following sample sizes: R = 25, 50 and 100; P = 25, 50, 100, 300 and 750. The total number of simulations is 5000. We use OLS to obtain estimates of the parameters. The main features arising from the simulations with DGP1, DGP2 and DGP3 are analyzed in the following subsection. Notice that results for DGP 3 are mostly used as a robustness check. We perform this because, for this DGP, we estimate an AR(1) model with drift; thus, the threshold parameter is omitted in the estimation and a misspecification error is present.

5.1. Simulation Results

Table 1 and Table 2 display results of empirical size for the three tests under consideration for DGP1 and DGP2, respectively. Table 3, Table 4 and Table 5 exhibit results of raw power for DGP1 through DGP3, respectively. Notice that we do not report size results for DGP3 since, under the null hypothesis, this DGP is almost identical to DGP2 and size results are, consequently, almost identical as well. DGP3 is estimated as a simple first-order autoregressive process, and under the null hypothesis, this DGP is white noise, but under the alternative, it is a SETAR model.
Each table exhibits our results for both the rolling and the recursive scheme. While Table 1, Table 2, Table 3, Table 4 and Table 5 consider a nominal size of 10%, Table A1, Table A2, Table A3, Table A4 and Table A5 in the Appendix A section consider a nominal size of 5% with similar conclusions. Each panel in Table 1, Table 2, Table 3, Table 4 and Table 5 considers a different length for the initial estimation window R (either R = 25, 50 or 100).
Table 1 and Table 2 show similar results. In general, the original EP seems to be remarkably undersized, while the SEP test has better size properties than both the EP and MSPE-Adjusted tests. For instance, in Table 1, EP exhibits an average empirical size of 3.9% for the recursive scheme, ranging from 2.9% to 4.8% (i.e., the maximum empirical size is less than half the nominal size). In contrast, the SEP test has an average empirical size of 10.1%, ranging from 9.3 to 11.0%. Across the two DGPs, considering the rolling (recursive) scheme: the average empirical size for the EP test is 5.7% (4.3%), while for the SEP and CW tests they are 10.2% (10.3%) and 7.9% (7.0%), respectively.
In terms of power, Table 3, Table 4 and Table 5 suggest that for small P, the SEP test has much more power than the EP test and a little more power than the MSPE-Adjusted test. As the number of forecasts becomes larger, we still observe a better performance of the SEP test over the EP test. Nevertheless, the MSPE-Adjusted test dominates the rest of the tests. The size of the estimation window does not seem to have a significant impact on these conclusions. Nevertheless, it does have an impact on absolute magnitudes: we can observe that with a larger estimation window, all three tests display higher power.
All in all, Table 1, Table 2, Table 3, Table 4 and Table 5 display a similar message: First, the SEP test dominates the EP test both in terms of size and power. Second, the MSPE-Adjusted test has no better empirical size compared to the SEP test. Third, the MSPE-Adjusted test is the best in terms of power when the number of forecasts is moderate or large, and fourth, with a larger estimation window, all three tests display higher power.

5.2. Robustness

Thus far, we have focused on the null hypothesis of a driftless random walk. Yet, in practice, the returns of many assets may have expected values that are close to zero but are not exactly zero. Table A6 in the Appendix A shows descriptive statistics of several monthly returns of commodities and exchange rates. The null hypothesis of zero returns cannot be rejected in any asset at usual significance levels, yet numerically, mean and median returns are slightly different from zero. With this in mind, we explore next the behavior of our proposed SEP test in situations in which we face small deviations from zero mean returns. To that end we consider the following DGP 4 inspired in [9]:
Y t + 1 = c + β X t + ε t + 1 X t + 1 = α + ρ X t + v t + 1
where ε t + 1 and v t + 1 are white noise processes:
Y t + 1 = Δ l o g A l u m i n u m t + 1 X t + 1 = Δ l o g A u s t r a l i a t + 1
and A u s t r a l i a t + 1 denotes the Australian dollar at time t + 1.
By using monthly data from 31 September 1999 to August 2020, we calibrate our parameters as follows.
c = 0.0008 ,   σ ε = 0.058 ,   σ v = 0.036 ,   ρ = 0.0375 ,   α = 0.0004 ,   C o r r ε t ; v t = 0.49
Under the null, we set β = 0 . Under the alternative, we set it to β = 0.27 . Notice also that under the null, the expected value of Y t + 1   is set to c = 0.0008 0 .
Table 6 and Table 7 next show results of size and power for SEP, EP and CW tests in 5000 replications based on the previous DGP. As usual, we report results for different values of R and P and for recursive and rolling windows.
Interestingly, the general picture looks the same as before, and our proposed SEP test is better sized and more powerful than the original EP test. The main difference now occurs for large R and P in which our proposed SEP test is slightly oversized. This is expected, as with a large number of observations our SEP test is subtle enough to detect deviations from zero mean returns.
We also carried out simulations for returns displaying means far from zero. In these conditions, our proposed test is importantly oversized, and from that point of view it would be better to use the original EP test. Other than that, for zero mean returns or close to zero mean returns, our proposed SEP test is preferable.

5.3. Bias Results

In previous sections we proved that the core EP statistic might have an expected value different from zero under the null hypothesis. Table 8 and Table 9 show the median for SEP, EP and MSPE-Adjusted statistics under the null. The idea is to explore whether this bias is present in our simulations. From Table 8 and Table 9, we observe that it is indeed present. In fact, we detect a negative bias for the EP test in all our simulations.
Interestingly, at least under the rolling scheme, this bias decreases (in absolute terms) as the number of forecasts grows larger. The MSPE-Adjusted statistic follows a similar pattern. As mentioned by [2] and Pincheira (2013) [25], this adjusted statistic also has a negative bias under the null hypothesis that decreases as the number of forecasts grows larger, at least under the rolling scheme. Finally, we observe that our proposed SEP test presents a negligible bias with no clear signs, which suggests that the modification we propose here is unbiased under the null hypothesis.
Figure 1 considers the EP and SEP statistics kernel densities using DGP1 with R = 200, P = 750 and a rolling scheme. Notice that this figure clearly illustrates the negative bias of the EP statistic compared to the SEP statistic.
In summary, our simulations show that both the EP test and MSPE-Adjusted test are undersized, while the SEP test presents an empirical size closer to nominal size. In addition, the SEP test presents lower bias under the null and more power than the EP test, although sometimes it is less powerful than the MSPE-Adjusted test.

6. Empirical Illustration

In this section, based on the commodity-currencies literature, we provide an empirical illustration in which we compare the SEP and the EP tests. Following [5], Chen, Rossi and Rogoff (2011) [26]; [7,8,9]; Pincheira, Hardy and Muñoz (2021) [27]; and [6], we evaluate the performance of five exchange rates from major commodity exporters (Australia, Canada, Chile, New Zealand and Norway) when predicting commodity prices. We consider five commodity series: West Texas Intermediate (WTI), heating oil, two-month WTI futures, copper and the London metal exchange index (LMEX). Similarly to [6], we have different data sources.
For the case of fuels, we download our series from the Energy Information Administration (EIA) database. The exchange rates, copper and LMEX are obtained from Thomson Reuters Datastream. In each case, we downloaded daily prices, which we converted to monthly frequencies by sampling from the last observation of each month. The sample period is from October 1999 to December 2019. Notice that the starting point of our database coincides with the time when monetary authorities in Chile decided to pursue a pure flotation regime for the exchange rate.
Our econometric model for one-step-ahead forecasts closely follows specification (1) in Table 1 of [6], page 5:
Δ ln C P t = c + β 1 Δ ln E R t 1 + Δ ln E R t 2 + Δ ln E R t 3 + β 2 Δ ln E R t 4 + Δ ln E R t 5 + Δ ln E R t 6 + ρ Δ ln C P t 1 + ε t
where C P t stands for a generic commodity price, E R t stands for a generic exchange rate, Δ ln ( ) is the logarithmic return and ε t is an error term. Notice that this specification considers up to six lags of exchange rates to predict commodity returns. These lags are aggregated into two quarters, where each one is constructed by adding three subsequent monthly returns. In other words, we consider two different predictors, where each one of them is constructed by adding three monthly returns. Additionally, we consider a constant and an AR(1) component.
We evaluate out-of-sample the null hypothesis of a driftless random walk for the natural logarithm of commodity prices. In terms of our model, the null is H 0 : c = β 1 = β 2 = ρ = 0 . The idea is to evaluate if the information added by a nonzero constant, the exchange rates and the AR(1) component is useful for improving the forecast of a model that simply predicts zero returns (the driftless random walk).
Table 10 next reports the SEP and EP statistics using recursive regressions estimated by OLS, with an initial estimation window of R = 45 observations. Three features of Table 10 are worth mentioning. First, consistent with the negative bias reported in our simulations, the SEP statistic is greater than the EP in 98% of our exercises. Second, SEP rejects the null hypothesis in three more entries than the EP. In particular, the EP rejects the null hypothesis in 48% of our exercises, while the SEP rejects the null in 56% of the entries at usual significance levels. Finally, SEP not only rejects the null more frequently but also at tighter significance levels. Considering a 5% significance level, EP rejects the null in 24% of our exercises, while SEP rejects the null in 36% of our exercises, providing more evidence in favor of the predictability of commodity returns with commodity-currencies.

7. Conclusions

In this paper, we propose a new out-of-sample test of predictability that is constructed as a simple modification of the Excess Profitability (EP) test developed by [10]. Our proposed test is called Straightforward Excess Profitability (SEP) test and avoids the calculation of a term in the EP test that under the null of no predictability should be zero but in practice may be sizable. In addition, our test does not require the strong assumption of independence used to derive the EP test. We claim that dependence is the rule and not the exception.
We use Monte Carlo simulations to explore the small and moderate sample behavior of the SEP test, comparing it with the EP test and the MSPE-Adjusted test derived by [2]. Our simulations make use of three different data generating processes designed to match features of exchange rates and S&P 500 monthly returns, as shown by [2] and [10]. Simulation results show that the SEP test displays better size and more power than the EP test. We link this better behavior to an important bias that is present in the EP test under the null of no predictability. This bias generates an undersized test, which may also explain the lower power of the test. By removing the term that generates the bias, the SEP test displays a superior performance.
Our SEP test also fares well when compared to the MSPE-Adjusted test in terms of empirical size. Nevertheless, the MSPE-Adjusted test sometimes show more power than the SEP test, especially when the number of forecasts is large.
We illustrate the use of the SEP test in an empirical application within the commodity-currencies literature. Using monthly exchange rates of commodity-exporting economies, we find that the SEP statistic tends to be greater than the EP statistic, rejecting the null hypothesis more frequently and at tighter significance levels. These results are consistent with our simulations: compared to the EP, the SEP test has better size properties, more power and less bias.
As a final remark let us notice that the SEP test could also be considered as a “shrinkage-based-test” according to the terminology used in [25]. In this regard, the estimation of an optimal shrinkage factor to improve forecast accuracy is an open question that might be addressed in future research.

Author Contributions

Conceptualization, P.P. and A.B.; data curation, N.H.; formal analysis, P.P., N.H. and A.B.; investigation, P.P., N.H. and A.B.; methodology, P.P. and N.H.; project administration, P.P.; software, P.P. and N.H.; supervision, P.P.; writing—original draft, P.P. and A.B.; writing—review and editing, P.P., N.H. and A.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof. 
Let us take expectations on both A P and B P .
E A P = E 1 P t = R + 1 T + 1 r t = 1 P t = R + 1 T + 1 E r t
E r t = E E ( r t | F t 1 ) = E E s i g n y t 1 f y t | F t 1
E r t = E s i g n ( y t 1 , t f E y t | F t 1 )
E r t = E s i g n ( y t 1 , t f E e t | F t 1 )
E r t = E s i g n ( y t ^ 0 ) = 0
E B P = E 1 P t = R + 1 T + 1 s i g n ( y t 1 , t f ) 1 P t = R + 1 T + 1 y t
E B P = 1 P 2 t , j = R + 1 T + 1 E ( s i g n ( y t 1 , t f ) y j )
E B P = 1 P 2 t = j T + 1 E ( s i g n ( y t 1 , t f ) y j ) + t j T + 1 E ( s i g n ( y t 1 , t f ) y j )
Therefore, the following is the case.
E B P = 1 P 2 t = R + 1 T + 1 E ( r t ) + t < j T + 1 E ( s i g n ( y t 1 , t f ) y j ) + t > j T + 1 E ( s i g n ( y t 1 , t f ) y j )
Under the null hypothesis, we have the following:
E B P = H 0 1 P 2 t = R + 1 T + 1 0 + t < j T + 1 E ( s i g n ( y t 1 , t f ) E ( y j | F j 1 ) + t > j T + 1 E ( s i g n ( y t 1 , t f ) y j )
E B P = H 0 1 P 2 t = R + 1 T + 1 0 + t < j T + 1 0 + t > j T + 1 E ( s i g n ( y t 1 , t f ) y j )
E B P = H 0 1 P 2 t > j T + 1 E ( s i g n ( y t 1 , t f ) y j )
which is the desired result. □
Table A1. Empirical Size DGP1 (Nominal Size of 5%).
Table A1. Empirical Size DGP1 (Nominal Size of 5%).
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.0390.0430.0400.0390.043 CW0.0380.0370.0360.0260.029
SEP0.0560.0510.0500.0480.051 SEP0.0530.0500.0560.0480.045
EP0.0220.0230.0280.0330.041 EP0.0160.0190.0210.0190.015
R = 50 R = 50
CW0.0370.0330.0340.0370.050 CW0.0360.0340.0300.0290.030
SEP0.0530.0500.0500.0510.056 SEP0.0520.0510.0530.0480.051
EP0.0180.0200.0240.0300.039 EP0.0140.0180.0240.0200.017
R = 100 R = 100
CW0.0440.0360.0330.0320.038 CW0.0430.0360.0320.0250.027
SEP0.0590.0500.0510.0460.047 SEP0.0570.0510.0520.0460.049
EP0.0160.0190.0230.0230.030 EP0.0160.0170.0220.0200.020
Notes: CW stands for the [2] test. EP stands for the Excess Profitability test by [10]. SEP is our Straightforward Excess Profitability test. R stands for the length of the initial estimation window, and P is the number of forecasts. Each entry reports the rate of rejections with a nominal size of 5% and 5000 Monte Carlo simulations.
Table A2. Empirical Size DGP2 (Nominal Size of 5%).
Table A2. Empirical Size DGP2 (Nominal Size of 5%).
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
0.0400.0360.0400.0420.044 CW0.0410.0350.0350.0320.033
0.0540.0530.0520.0520.049 SEP0.0520.0510.0520.0500.052
0.0230.0230.0250.0330.038 EP0.0210.0250.0200.0180.020
R = 50 R = 50
0.0440.0330.0420.0360.041 CW0.0410.0360.0410.0290.031
0.0590.0500.0550.0460.051 SEP0.0550.0500.0570.0520.048
0.0280.0220.0240.0250.037 EP0.0240.0210.0210.0220.016
R = 100 R = 100
0.0450.0420.0350.0340.038 CW0.0440.0450.0380.0320.033
0.0510.0510.0540.0460.051 SEP0.0560.0530.0550.0500.052
0.0250.0250.0260.0200.028 EP0.0270.0230.0250.0230.020
Notes: See notes of Table A1.
Table A3. Raw Power DGP1 (Nominal Size of 5%).
Table A3. Raw Power DGP1 (Nominal Size of 5%).
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.0860.1140.1600.2990.532 CW0.0950.1370.2210.5530.923
SEP0.1090.1380.1700.2780.466 SEP0.1160.1530.2350.5160.891
EP0.0330.0550.0920.2110.409 EP0.0340.0580.1230.4100.842
R = 50 R = 50
CW0.1050.1470.2060.4060.716 CW0.1120.1700.2580.6030.944
SEP0.1290.1720.2160.3660.611 SEP0.1280.1730.2570.5420.905
EP0.0350.0630.1100.2750.546 EP0.0330.0640.1460.4290.862
R = 100 R = 100
CW0.1300.1740.2750.5340.837 CW0.1310.1860.3050.6570.956
SEP0.1420.1870.2660.4710.729 SEP0.1470.1930.2880.5810.910
EP0.0420.0760.1510.3660.665 EP0.0410.0860.1670.4860.875
Notes: See notes of Table A1.
Table A4. Raw Power DGP2 (Nominal Size of 5%).
Table A4. Raw Power DGP2 (Nominal Size of 5%).
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.0720.0900.1290.2140.376 CW0.0800.1000.1570.3890.776
SEP0.0900.0930.1220.1700.277 SEP0.0940.1110.1500.3520.719
EP0.0340.0380.0650.1280.242 EP0.0330.0540.0870.2770.660
R = 50 R = 50
CW0.0940.1060.1510.2860.523 CW0.0940.1190.1820.4230.810
SEP0.1010.1100.1560.2290.364 SEP0.1000.1150.1730.3640.733
EP0.0420.0530.0870.1720.317 EP0.0420.0580.1100.2990.683
R = 100 R = 100
CW0.1010.1290.1820.3780.663 CW0.1030.1450.2100.4830.845
SEP0.1080.1360.1680.3030.517 SEP0.1070.1390.1960.4010.763
EP0.0450.0750.1080.2300.447 EP0.0460.0810.1410.3520.722
Notes: See notes of Table A1.
Table A5. Raw Power DGP3 (Nominal Size of 5%).
Table A5. Raw Power DGP3 (Nominal Size of 5%).
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.0950.1190.1720.3230.571 CW0.1060.1430.2220.5560.920
SEP0.1200.1420.1830.3030.502 SEP0.1190.1640.2370.5280.887
EP0.0330.0480.0740.1390.257 EP0.0290.0560.0960.2550.508
R = 50 R = 50
CW0.1190.1510.2210.4300.740 CW0.1270.1710.2640.5960.941
SEP0.1300.1760.2370.3890.642 SEP0.1380.1810.2680.5520.891
EP0.0370.0590.0850.1750.331 EP0.0400.0660.1130.2690.527
R = 100 R = 100
CW0.1440.1790.2740.5460.851 CW0.1460.1970.3080.6600.953
SEP0.1530.1920.2810.4910.775 SEP0.1530.1990.2930.5860.899
EP0.0440.0690.1200.2180.430 EP0.0470.0800.1340.3070.544
Notes: See notes of Table A1.
Table A6. Summary statistics of several monthly returns.
Table A6. Summary statistics of several monthly returns.
ALUMINUMAUSTRALIACANADACHILECOPPERLEADLMEXNEWZEALANDNICKELSOUTHAFRICATIN
Mean0.0008−0.0004−0.00050.00150.00550.00530.0036−0.00090.00320.00400.0049
Median−0.0016−0.0022−0.00060.00020.00720.00840.0051−0.00250.00230.00060.0037
Std. Dev.0.05760.03600.02570.03400.07370.08530.06050.03730.10110.04770.0686
Variance0.00330.00130.00070.00120.00540.00730.00370.00140.01020.00230.0047
t-stat 0.1890−0.1540−0.28600.71100.96900.98300.7510−0.36500.45601.34400.9230
Obs251251251251251251251251251251251

References

  1. Rossi, B. Exchange Rate Predictability. J. Econ. Lit. 2013, 51, 1063–1119. [Google Scholar] [CrossRef] [Green Version]
  2. Clark, T.; West, K. Using Out-of-Sample Mean Squared Prediction Errors to Test the Martingale Difference Hypothesis. J. Econom. 2006, 135, 155–186. [Google Scholar] [CrossRef] [Green Version]
  3. Pincheira, P.; Hardy, N. The Mean Squared Prediction Error Paradox; MPRA Paper 107403; University Library of Munich: Munich, Germany, 2021. [Google Scholar]
  4. Meese, R.; Rogoff, K. Empirical Exchange Rate Models of the Seventies. Do They Fit Out-of-Sample? J. Int. Econ. 1983, 14, 3–24. [Google Scholar] [CrossRef]
  5. Chen, Y.; Rogoff, K.; Rossi, B. Can exchange rates forecast commodity prices? Q. J. Econ. 2010, 125, 1145–1194. [Google Scholar] [CrossRef] [Green Version]
  6. Pincheira, P.; Bentancor, A.; Hardy, N.; Jarsun, N. Forecasting fuel prices with the Chilean exchange rate: Going beyond the commodity currency hypothesis. Energy Econ. 2021, 105802, in press. [Google Scholar] [CrossRef]
  7. Pincheira, P.; Hardy, N. The Predictive Relationship between Exchange Rate Expectations and Base Metal Prices. 2018. Available online: https://ssrn.com/abstract=3263709 (accessed on 1 December 2021).
  8. Pincheira, P.; Hardy, N. Forecasting base metal prices with the Chilean exchange rate. Resour. Policy 2019, 62, 256–281. [Google Scholar] [CrossRef]
  9. Pincheira, P.; Hardy, N. Forecasting aluminum prices with commodity currencies. Resour. Policy 2021, 73, 102066. [Google Scholar] [CrossRef]
  10. Anatolyev, S.; Gerco, A. A Trading Approach to Testing for Predictability. J. Bus. Econ. Stat. 2005, 4, 455–461. [Google Scholar] [CrossRef]
  11. McCracken, M.; West, K. Inference About Predictive Ability. In Companion to Economic Forecasting; Clements, M., Hendry, D., Eds.; Blackwell: Oxford, UK, 2002; pp. 299–321. [Google Scholar]
  12. Chinn, M.; Meese, R. Banking on currency forecasts: How predictable is change in money? J. Int. Econ. 1995, 38, 161–178. [Google Scholar] [CrossRef]
  13. Chinn, M.D.; Cheung, Y.W.; Garcia Pascual, A.I. Empirical Exchange Rate Models of the 1990′s: Are Any Fit to Survive? J. Int. Money Financ. 2005, 24, 1150–1175. [Google Scholar]
  14. Patton, A.; Timmermann, A. Properties of optimal forecasts under asymmetric loss and nonlinearity. J. Econom. 2007, 140, 884–918. [Google Scholar] [CrossRef] [Green Version]
  15. Giacomini, R.; White, H. Tests of Conditional Predictive Ability. Econometrica 2006, 74, 1545–1578. [Google Scholar] [CrossRef] [Green Version]
  16. Inoue, A.; Kilian, L. In-Sample or Out-of-Sample Tests of Predictability: Which One Should We Use? Econom. Rev. 2005, 23, 371–402. [Google Scholar] [CrossRef] [Green Version]
  17. West, K. Asymptotic Inference About Predictive Ability. Econometrica 1996, 64, 1067–1084. [Google Scholar] [CrossRef]
  18. Diebold, F.; Mariano, R. Comparing Predictive Accuracy. J. Bus. Econ. Stat. 1995, 13, 253–263. [Google Scholar]
  19. McCracken, M. Asymptotics for Out of Sample Tests of Granger Causality. J. Econom. 2007, 140, 719–752. [Google Scholar] [CrossRef]
  20. Clark, T.; McCracken, M. Evaluating Direct Multistep Forecasts. Econom. Rev. 2005, 24, 369–404. [Google Scholar] [CrossRef]
  21. Clark, T.; West, K. Approximately Normal Tests for Equal Predictive Accuracy in Nested Models. J. Econom. 2007, 138, 291–311. [Google Scholar] [CrossRef] [Green Version]
  22. Pincheira, P.; West, K.D. A comparison of some out-of-sample tests of predictability in iterated multi-step-ahead forecasts. Res. Econ. 2016, 70, 304–319. [Google Scholar] [CrossRef]
  23. Chao, J.; Corradi, V.; Swanson, N. An Out of Sample Test for Granger Causality. Macroecon. Dyn. 2000, 5, 598–620. [Google Scholar] [CrossRef]
  24. Hamilton, J. Time Series Analysis; University Press: Princeton, NJ, USA, 1994. [Google Scholar]
  25. Pincheira, P. Shrinkage Based Tests of Predictability. J. Forecast. 2013, 32, 289–384. [Google Scholar]
  26. Chen, Y.; Rogoff, K.; Rossi, B. Predicting Agri-Commodity Prices: An Asset Pricing Approach. 2011. Available online: https://ssrn.com/abstract=1616853 (accessed on 1 December 2021).
  27. Pincheira, P.; Hardy, N.; Muñoz, F. “Go Wild for a While!”: A New Test for Forecast Evaluation in Nested Models. Mathematics 2021, 9, 2254. [Google Scholar] [CrossRef]
Figure 1. Kernel Densities of the EP and SEP Statistics. Notes: In Figure 1, we consider DGP1 (under the null hypothesis), with R = 200, P = 750, a rolling scheme and 5000 Monte Carlo simulations.
Figure 1. Kernel Densities of the EP and SEP Statistics. Notes: In Figure 1, we consider DGP1 (under the null hypothesis), with R = 200, P = 750, a rolling scheme and 5000 Monte Carlo simulations.
Mathematics 10 00228 g001
Table 1. Empirical Size DGP1; Nominal size is 10%.
Table 1. Empirical Size DGP1; Nominal size is 10%.
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.0740.0820.0840.0780.090 CW0.0750.0680.0700.0560.063
SEP0.1010.1030.1010.1020.101 SEP0.1020.1000.1050.1040.096
EP0.0450.0520.0590.0750.086 EP0.0350.0370.0460.0430.035
R = 50 R = 50
CW0.0710.0690.0690.0760.089 CW0.0750.0670.0610.0600.065
SEP0.1010.0940.0970.0980.108 SEP0.1000.0950.1060.0930.100
EP0.0360.0420.0490.0640.083 EP0.0290.0390.0480.0360.036
R = 100 R = 100
CW0.0840.0690.0670.0680.076 CW0.0800.0710.0670.0600.060
SEP0.1150.1050.1000.0960.096 SEP0.1100.1060.1020.0980.101
EP0.0350.0400.0510.0540.063 EP0.0310.0370.0440.0460.044
Notes: CW stands for the [2] test. EP stands for Excess Profitability test by [10]. SEP is our Straightforward Excess Profitability test. R stands for the length of the initial estimation window, and P is the number of forecasts. Each entry reports the rate of rejections with a nominal size of 10% and 5000 Monte Carlo simulations.
Table 2. Empirical Size DGP2, Nominal size is 10%.
Table 2. Empirical Size DGP2, Nominal size is 10%.
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.0790.0750.0890.0830.093 CW0.0810.0700.0650.0610.071
SEP0.1070.1020.1080.0990.100 SEP0.1070.1050.1060.1040.102
EP0.0510.0520.0560.0700.082 EP0.0490.0490.0430.0410.040
R = 50 R = 50
CW0.0890.0720.0830.0790.079 CW0.0870.0740.0790.0630.062
SEP0.1080.0990.1060.1050.101 SEP0.1040.1010.1040.1060.099
EP0.0560.0470.0480.0640.073 EP0.0570.0480.0440.0460.038
R = 100 R = 100
CW0.0870.0830.0720.0690.084 CW0.0890.0850.0710.0600.065
SEP0.1010.1030.1060.0940.101 SEP0.1020.1090.1070.1030.102
EP0.0570.0540.0540.0480.062 EP0.0550.0520.0490.0450.045
Notes: See notes of Table 1.
Table 3. Raw Power DGP1 (Nominal Size of 10%).
Table 3. Raw Power DGP1 (Nominal Size of 10%).
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.1410.1880.2440.4130.658 CW0.1520.2070.3180.6700.954
SEP0.1830.2190.2640.4060.598 SEP0.1830.2460.3370.6460.941
EP0.0670.1040.1540.3270.541 EP0.0660.1090.1940.5320.901
R = 50 R = 50
CW0.1750.2280.3030.5250.813 CW0.1850.2570.3610.7030.967
SEP0.2110.2620.3170.4830.726 SEP0.2090.2740.3710.6640.949
EP0.0680.1130.1780.3810.668 EP0.0630.1180.2220.5490.915
R = 100 R = 100
CW0.2060.2720.3840.6410.894 CW0.2180.2870.4180.7540.975
SEP0.2240.2850.3810.5940.821 SEP0.2320.2920.4010.6940.952
EP0.0750.1290.2310.4740.761 EP0.0750.1380.2600.6010.923
Notes: See notes of Table 1.
Table 4. Raw Power DGP2 (Nominal Size of 10%).
Table 4. Raw Power DGP2 (Nominal Size of 10%).
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.1260.1540.2060.3120.506 CW0.1380.1740.2420.5070.845
SEP0.1550.1650.1990.2710.405 SEP0.1610.1860.2450.4790.817
EP0.0650.0820.1200.2140.360 EP0.0630.0950.1530.3940.764
R = 50 R = 50
CW0.1590.1810.2370.4040.647 CW0.1600.1900.2830.5450.876
SEP0.1690.1880.2370.3380.497 SEP0.1660.1980.2700.5050.828
EP0.0830.1010.1440.2610.435 EP0.0800.1140.1810.4230.784
R = 100 R = 100
CW0.1740.2200.2850.4990.769 CW0.1810.2350.3230.6110.907
SEP0.1760.2170.2790.4280.638 SEP0.1810.2260.2980.5370.852
EP0.0940.1370.1830.3320.571 EP0.0970.1430.2170.4660.810
Notes: See notes of Table 1.
Table 5. Raw Power DGP3 (Nominal Size of 10%).
Table 5. Raw Power DGP3 (Nominal Size of 10%).
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.1580.1950.2630.4400.689 CW0.1670.2230.3250.6600.949
SEP0.1860.2300.2740.4170.625 SEP0.1910.2530.3460.6480.936
EP0.0700.0950.1290.2230.373 EP0.0660.1090.1580.3470.598
R = 50 R = 50
CW0.1940.2360.3130.5420.825 CW0.2030.2590.3790.6990.966
SEP0.2160.2620.3340.5030.750 SEP0.2220.2790.3860.6720.941
EP0.0790.1070.1500.2700.456 EP0.0780.1240.1840.3670.614
R = 100 R = 100
CW0.2290.2810.3780.6650.907 CW0.2360.2990.4260.7660.974
SEP0.2380.2890.3950.6150.850 SEP0.2420.3000.4080.7130.944
EP0.0910.1290.1870.3170.553 EP0.0940.1390.2130.4060.625
Notes: See notes of Table 1.
Table 6. Empirical Size DGP4, Nominal size is 10%.
Table 6. Empirical Size DGP4, Nominal size is 10%.
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.0850.0750.0740.0900.094 CW0.0830.0750.0670.0730.073
SEP0.1060.0990.0980.1090.101 SEP0.1000.1050.1060.1160.112
EP0.0540.0450.0500.0770.079 EP0.0480.0440.0460.0370.033
R = 50 R = 50
CW0.0810.0740.0720.0810.094 CW0.0810.0730.0680.0700.069
SEP0.1030.1020.1020.1080.110 SEP0.1030.1020.1000.1050.114
EP0.0530.0530.0440.0630.076 EP0.0500.0520.0490.0400.040
R = 100 R = 100
CW0.1020.0810.0730.0750.086 CW0.0980.0810.0740.0730.073
SEP0.1140.1110.1070.1090.108 SEP0.1110.1100.1060.1110.124
EP0.0640.0570.0480.0500.064 EP0.0610.0530.0500.0450.042
Notes: See notes of Table 1.
Table 7. Raw Power DGP4 (Nominal Size of 10%).
Table 7. Raw Power DGP4 (Nominal Size of 10%).
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
CW0.1790.2110.2890.5120.780 CW0.1830.2540.3760.7550.985
SEP0.1790.2050.2640.4170.628 SEP0.1880.2380.3550.7000.968
EP0.1110.1270.1830.3550.582 EP0.1230.1710.2800.6310.950
R = 50 R = 50
CW0.2000.2540.3700.6370.896 CW0.2110.2930.4450.7970.987
SEP0.2010.2450.3390.5150.768 SEP0.2000.2690.3920.7360.976
EP0.1400.1680.2440.4420.722 EP0.1400.2050.3260.6760.966
R = 100 R = 100
CW0.2510.3230.4450.7430.957 CW0.2540.3370.4900.8440.993
SEP0.2290.2920.4000.6450.891 SEP0.2310.3000.4220.7620.977
EP0.1660.2250.3260.5660.860 EP0.1780.2430.3750.7210.969
Notes: See notes of Table 1.
Table 8. Median of the Tests Statistics under the Null hypothesis: DGP1.
Table 8. Median of the Tests Statistics under the Null hypothesis: DGP1.
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
Median (CW)−0.259−0.209−0.134−0.087−0.074 Median (CW)−0.306−0.283−0.242−0.267−0.264
Median (SEP)−0.029−0.0060.0190.0000.006 Median (SEP)−0.018−0.0330.0200.009−0.035
Median (EP)−0.475−0.377−0.293−0.148−0.101 Median (EP)−0.551−0.532−0.523−0.521−0.591
R = 50 R = 50
Median (CW)−0.259−0.247−0.187−0.171−0.087 Median (CW)−0.317−0.299−0.283−0.307−0.278
Median (SEP)−0.0230.005−0.002−0.0510.010 Median (SEP)−0.027−0.0010.012−0.0100.008
Median (EP)−0.507−0.464−0.394−0.290−0.131 Median (EP)−0.518−0.508−0.485−0.556−0.543
R = 100 R = 100
Median (CW)−0.200−0.276−0.254−0.223−0.144 Median (CW)−0.224−0.290−0.292−0.306−0.289
Median (SEP)0.029−0.028−0.009−0.035−0.020 Median (SEP)0.015−0.019−0.014−0.002−0.015
Median (EP)−0.464−0.420−0.462−0.331−0.241 Median (EP)−0.514−0.470−0.489−0.508−0.544
Table 9. Median of the Tests Statistics under the Null hypothesis: DGP2.
Table 9. Median of the Tests Statistics under the Null hypothesis: DGP2.
Rolling Recursive
P = 25P = 50P = 100P = 300P = 750 P = 25P = 50P = 100P = 300P = 750
R = 25 R = 25
Median (CW)−0.232−0.200−0.142−0.100−0.060 Median (CW)−0.289−0.297−0.303−0.232−0.236
Median (SEP)−0.027−0.0300.003−0.0080.005 Median (SEP)−0.044−0.010−0.0340.0040.016
Median (EP)−0.501−0.409−0.307−0.179−0.119 Median (EP)−0.465−0.485−0.542−0.548−0.594
R = 50 R = 50
Median (CW)−0.211−0.219−0.194−0.099−0.108 Median (CW)−0.250−0.252−0.264−0.296−0.289
Median (SEP)0.0050.0050.0230.0070.002 Median (SEP)−0.0050.0010.047−0.041−0.003
Median (EP)−0.327−0.412−0.402−0.236−0.161 Median (EP)−0.364−0.421−0.467−0.546−0.585
R = 100 R = 100
Median (CW)−0.198−0.209−0.224−0.196−0.129 Median (CW)−0.193−0.243−0.252−0.313−0.275
Median (SEP)−0.0070.001−0.012−0.0400.022 Median (SEP)0.0080.0050.007−0.0150.019
Median (EP)−0.271−0.356−0.436−0.372−0.209 Median (EP)−0.286−0.342−0.405−0.508−0.556
Table 10. Empirical Illustration Forecasting Commodity Prices with Commodity Currencies.
Table 10. Empirical Illustration Forecasting Commodity Prices with Commodity Currencies.
SEP
WTI Heating OilWTIF2LMEXCopper
CLP3.499 ***3.139 ***3.340 ***2.438 ***1.901 **
AUD0.8261.1241.0240.1251.369 *
CAD0.7001.646 **−0.0740.0411.416 *
NZD1.1791.684 **1.515 *−0.0501.392 *
NOK1.690 **2.056 **1.634 *1.580 *1.253
EP
WTI Heating OilWTIF2LMEXCopper
CLP3.520 ***3.083 ***3.334 ***2.355 ***1.722 **
AUD0.7050.9940.896−0.0471.143
CAD0.5571.531 *−0.325−0.1581.167
NZD1.0871.565 *1.453 *−0.3111.161
NOK1.625 *1.972 **1.551 *1.446 *0.996
Notes: CLP, AUD, CAD, NZD and NOK stand for Chilean peso, Australian Dollar, Canadian dollar, New Zealand dollar and Norwegian krone, respectively. * p < 10%, ** p < 5%, *** p < 1%.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Pincheira, P.; Hardy, N.; Bentancor, A. A Simple Out-of-Sample Test of Predictability against the Random Walk Benchmark. Mathematics 2022, 10, 228. https://doi.org/10.3390/math10020228

AMA Style

Pincheira P, Hardy N, Bentancor A. A Simple Out-of-Sample Test of Predictability against the Random Walk Benchmark. Mathematics. 2022; 10(2):228. https://doi.org/10.3390/math10020228

Chicago/Turabian Style

Pincheira, Pablo, Nicolas Hardy, and Andrea Bentancor. 2022. "A Simple Out-of-Sample Test of Predictability against the Random Walk Benchmark" Mathematics 10, no. 2: 228. https://doi.org/10.3390/math10020228

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop