Modeling and Forecasting Realized Portfolio Diversification Benefits

Golosnoy, Vasyl; Hildebrandt, Benno; Köhler, Steffen

doi:10.3390/jrfm12030116

Open AccessArticle

Modeling and Forecasting Realized Portfolio Diversification Benefits

by

Vasyl Golosnoy

^*

,

Benno Hildebrandt

and

Steffen Köhler

Faculty of Management and Economics, Ruhr-Universität Bochum, 44780 Bochum, Germany

^*

Author to whom correspondence should be addressed.

J. Risk Financial Manag. 2019, 12(3), 116; https://doi.org/10.3390/jrfm12030116

Submission received: 17 May 2019 / Revised: 5 July 2019 / Accepted: 9 July 2019 / Published: 11 July 2019

(This article belongs to the Special Issue Panel Data and Factor Models in Empirical Finance)

Download

Browse Figures

Versions Notes

Abstract

:

For a financial portfolio, we suggest a realized measure of diversification benefits, which is based on intraday high-frequency returns. Our measure quantifies volatility reduction, which could be achieved by including an additional asset in the portfolio. In order to make our approach feasible for investors, we also provide time series modeling of both the realized diversification measure and realized portfolio weight. The performance of our approach is evaluated in-sample and out-of-sample. We find out that our approach is helpful for the purpose of portfolio variance minimization.

Keywords:

diversification benefits; HAR models; minimum variance portfolio; realized measures

JEL Classification:

C01; C22; C51; C58; G11

Graphical Abstract

1. Introduction

The mean-variance portfolio selection procedure of Markowitz (1952) remains a theoretical cornerstone of the modern portfolio theory. Empirically, one should not apply the mean-variance portfolio optimization directly, primarily due to the effect of estimation risk in the mean returns; see, e.g., Best and Grauer (1991); Chopra and Ziemba (1993). For this reason, variance minimization approaches leading to the choice of the global minimum variance portfolio (GMVP) (cf. Ledoit and Wolf 2003) or even naive equally-weighted portfolios (cf. DeMiguel et al 2009a, 2009b) often appear to be preferable in practical portfolio selection. Further improvement of the GMVP performance could be gained by imposing constraints on portfolio weights (Jagannathan and Ma 2003), using shrinkage procedures (Golosnoy and Okhrin 2009; Frahm and Memmel 2010), or applying LASSO or other regularization techniques (Callot et al. 2017).

The essential concept of portfolio diversification postulates that non-systematic risks could be substantially reduced by including enough not perfectly-correlated risky assets into the portfolio. Hence, when selecting a portfolio composition, one of the crucial questions is whether the portfolio is already diversified enough or if including additional assets would lead to a further noticeable risk reduction (cf. Evans and Archer 1968). There are various measures of portfolio diversification proposed in the literature; see, e.g., Rudin and Morgan (2006), Bera and Park (2008), Choueifaty and Coignard (2008), Goetzmann and Kumar (2008), or Meucci (2009).

Recently, the work in Frahm and Memmel (2010) suggested to measure diversification as a ratio of the current portfolio variance and the GMVP variance. The work in Frahm and Wiechers (2013) analyzed the properties of this measure and showed that it possesses a convenient economic interpretation. The availability of intraday returns allows computing daily realized variances and covariances of risky asset returns, which are consistent estimators of the daily covariance matrix (Barndorff-Nielsen and Shephard 2004). By analogy, one could also calculate daily realized GMVP weights, which are a function of the realized covariance matrix (cf. Golosnoy et al. 2019). In this paper, we propose a daily realized portfolio diversification measure, which quantifies a portfolio volatility reduction due to inclusion of an additional asset. Our statistic could be seen as a realized extension of the diversification measure of Frahm and Wiechers (2013). We provide its stochastic properties for a given realized covariance matrix estimator.

As investors intend to make portfolio decisions for the next period, it is of interest to predict the next period (day) diversification gains. Hence, we model realized diversification benefits ex-ante and directly in order to obtain the corresponding forecasts. For this purpose, we consider several time series models for our realized diversification measure. Forecasts provided by these models help an investor to decide whether it is reasonable to include or not to include an additional risky asset into the portfolio composition. We found out that the cascade HAR model of Corsi (2009) appears to be mostly suitable in our empirical application both in-sample and out-of-sample, so that it is shown to be reasonable to consider realized diversification statistics.

The rest of the paper is organized as follows: In Section 2, we introduce our realized diversification measure and establish its asymptotic stochastic properties. In Section 3, we propose time series models for diversification in order to make forecasting decisions whether to include additional assets into the portfolio. Then, in Section 4, we provide the empirical study where we estimate the time series models for diversification measures and evaluate the economic relevance of the corresponding diversification forecasts. In Section 5, we conclude the paper, whereas some theoretical results are placed in Appendix A.

2. Realized Measure of Diversification Benefits

2.1. Quantifying Diversification Benefits

Consider a portfolio of risky assets with log-price

p_{p} (τ)

and an additional asset (not included in this portfolio yet) with log-price

p_{a} (τ)

where

τ \in I R_{+}

represents continuous time. Denote the bivariate vector of their log-prices

p (τ) = {(p_{p} (τ), p_{a} (τ))}^{'}

, and assume that

p (τ)

is a Brownian stochastic semimartingale with a spot covariance matrix

Θ (τ)

. The integrated covariance matrix at day t is denoted by

Σ_{t}

with:

\begin{matrix} Σ_{t} = \int_{t - 1}^{t} Θ (τ) d τ = (\begin{matrix} σ_{p, t}^{2} & σ_{p a, t} \\ σ_{p a, t} & σ_{a, t}^{2} \end{matrix}), \end{matrix}

(1)

where

σ_{p, t}^{2}

and

σ_{a, t}^{2}

are the daily variances of the initial portfolio and the additional asset returns, respectively, and

σ_{p a, t}

denotes the corresponding covariance, so that

vech (Σ_{t}) = {(σ_{p, t}^{2}, σ_{p a, t}, σ_{a, t}^{2})}^{'}

. Further, we assume that the matrix

Σ_{t}

is positive definite for all t.

For day t, the log returns on the initial portfolio are

x_{p, t} = p_{p} (t) - p_{p} (t - 1)

and on the asset

x_{a, t} = p_{a} (t) - p_{a} (t - 1)

. Construct a new portfolio with log return

x_{t}

, which is a linear combination of returns on the initial portfolio

x_{p, t}

and on the asset

x_{a, t}

:

\begin{matrix} x_{t} = v_{t} x_{p, t} + (1 - v_{t}) x_{a, t} . \end{matrix}

(2)

The daily variance of the new portfolio return is denoted by

σ_{t}^{2}

.

This problem can be reformulated as a task of constructing a two-asset (the original portfolio and the additional asset) GMVP, where the corresponding GMVP weight is obtained as a solution of the variance minimization task:

\begin{matrix} w_{t} = \underset{v_{t}}{\arg min} [V a r (x_{t}) = V a r (v_{t} x_{p, t} + (1 - v_{t}) x_{a, t})] . \end{matrix}

(3)

The solution of the task (3) is the weight of the original portfolio, which is given by:

\begin{matrix} w_{t} = \frac{σ_{a, t}^{2} - σ_{p a, t}}{σ_{a, t}^{2} + σ_{p, t}^{2} - 2 σ_{p a, t}} . \end{matrix}

(4)

The variance of the new GMVP obtained by combining the original portfolio with the additional asset is given as:

\begin{matrix} σ_{t}^{2} = \frac{σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2}}{σ_{a, t}^{2} + σ_{p, t}^{2} - 2 σ_{p a, t}} . \end{matrix}

(5)

In line with Frahm and Wiechers (2013), we argue that a statistic that measures the distance between

σ_{t}^{2}

and

σ_{p, t}^{2}

is of great interest: when

σ_{p, t}^{2}

is substantially larger than

σ_{t}^{2}

, then there are diversification benefits to achieve by including this additional asset into the portfolio. The diversification benefits could be quantified by the variance ratio

D_{t}

, which is defined as:

\begin{matrix} D_{t} = \frac{σ_{p, t}^{2}}{σ_{t}^{2}} - 1 = \frac{σ_{a, t}^{2} + σ_{p, t}^{2} - 2 σ_{p a, t}}{σ_{a, t}^{2} - σ_{a, t}^{2} ρ_{p a, t}^{2}} - 1 = \frac{σ_{a, t}^{2} ρ_{p a, t}^{2} + σ_{p, t}^{2} - 2 σ_{p a, t}}{σ_{a, t}^{2} (1 - ρ_{p a, t}^{2})} = \frac{{(σ_{p, t}^{2} - σ_{p a, t})}^{2}}{σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2}}, \end{matrix}

(6)

with

ρ_{p a, t} = σ_{p a, t} / (σ_{p, t} σ_{a, t})

. As it holds that

σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2} > 0

due to the positive definiteness of the matrix

Σ_{t}

, the case

σ_{p, t}^{2} = σ_{p a, t}

is excluded as well. Hence,

D_{t} \in (0, + \infty)

, and the value of

D_{t}

close to zero indicates no reasonable diversification effect from including this asset in the existing portfolio.

However, for our purposes, it is more convenient to consider the log measure:

\begin{matrix} L_{t} = \ln D_{t} = \ln (\frac{σ_{p, t}^{2}}{σ_{t}^{2}} - 1) = \ln \frac{{(σ_{p, t}^{2} - σ_{p a, t})}^{2}}{σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2}} . \end{matrix}

(7)

As the log measure

L_{t} \in (- \infty, \infty)

, it appears to be advantageous from the perspective of time series modeling.

2.2. Realized Measures for Diversification

The availability of intraday high-frequency returns provides the possibility to construct precise realized volatility measures. Using them, we introduce the realized diversification measure

{\hat{D}}_{t}

and analyze its asymptotic stochastic properties.

Assume that we observe m intraday log-prices for day t at uniformly-spaced time intervals. Then, the jth intraday return vector is given by:

\begin{matrix} x_{t, j} = p ((t - 1) + j / m) - p ((t - 1) + (j - 1) / m), j = 1, \dots, m . \end{matrix}

(8)

These high-frequency intraday returns appear to be useful for the construction of the realized covariance measures, which are precise nonparametric ex-post estimates of

Σ_{t}

. The most simple realized covariance matrix estimator is given as:

\begin{matrix} {\hat{Σ}}_{t} = \sum_{j = 1}^{m} x_{t, j} x_{t, j}^{'} . \end{matrix}

(9)

Accordingly, for the entries of matrix

Σ_{t}

, we get the realized estimators

{\hat{σ}}_{a, t}^{2} = \sum_{j = 1}^{m} x_{a, t, j}^{2}

,

{\hat{σ}}_{p, t}^{2} = \sum_{j = 1}^{m} x_{p, t, j}^{2}

, and

{\hat{σ}}_{p a, t} = \sum_{j = 1}^{m} x_{p, t, j} x_{a, t, j}

. More advanced estimators, such as the realized kernel or composite realized kernel (cf. Lunde et al. 2016), are proposed in order to provide more elaborated realized volatility estimators.

Given the realized covariance matrix

{\hat{Σ}}_{t}

, the realized diversification measure

{\hat{D}}_{t}

is defined as:

\begin{matrix} {\hat{D}}_{t} = \frac{{({\hat{σ}}_{p, t}^{2} - {\hat{σ}}_{p a, t})}^{2}}{{\hat{σ}}_{a, t}^{2} {\hat{σ}}_{p, t}^{2} - {\hat{σ}}_{p a, t}^{2}} . \end{matrix}

For time series modeling purposes, it appears to be more convenient to consider the measure

{\hat{L}}_{t} = \ln {\hat{D}}_{t}

, as its distribution is not as skewed as those of

{\hat{D}}_{t}

. We formulate the stochastic properties of both

{\hat{D}}_{t}

and

{\hat{L}}_{t}

in the following proposition, which is proven in Appendix A.

Proposition 1.

Consider a realized covariance matrix measure

{\hat{Σ}}_{t}

in (9), which is a consistent estimator of the positive definite covariance matrix

Σ_{t}

for the number of intraperiod returns

m \to \infty

. Then, the realized diversification measure

{\hat{D}}_{t} = {({\hat{σ}}_{p, t} - {\hat{ρ}}_{p a, t} {\hat{σ}}_{a, t})}^{2} / ({\hat{σ}}_{a, t}^{2} {\hat{σ}}_{p, t}^{2} - {\hat{σ}}_{p a, t}^{2})

is a consistent estimator of the diversification benefit

D_{t}

. Moreover, given that

m^{1 / 2} (v e c h ({\hat{Σ}}_{t}) - v e c h (Σ_{t})) \overset{L}{⟶} N (0, Π_{t})

for

m \to \infty

with

\overset{L}{⟶}

indicating convergence in the distribution (law), the estimator

{\hat{D}}_{t}

is asymptotically normally distributed with:

\begin{matrix} m^{1 / 2} ({\hat{D}}_{t} - D_{t}) \overset{L}{⟶} N (0, {(\nabla D_{t})}^{'} Π_{t} \nabla D_{t}) . \end{matrix}

The expression for

Π_{t}

was provided by Barndorff-Nielsen and Shephard (2004); its realized estimator

{\hat{Π}}_{t}

is given in (A1) in Appendix A, whereas the gradient is given by:

\begin{matrix} \nabla D_{t} = \frac{σ_{p, t}^{2} - σ_{p a, t}}{{(σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2})}^{2}} \cdot {(σ_{a, t}^{2} (σ_{p, t}^{2} + σ_{p a, t}) - 2 σ_{p a, t}^{2}, 2 σ_{p, t}^{2} (σ_{p a, t} - σ_{a, t}^{2}), - σ_{p, t}^{2} (σ_{p, t}^{2} - σ_{p a, t}))}^{'} . \end{matrix}

Next, the estimator

{\hat{L}}_{t}

of

L_{t}

is also consistent and asymptotically normally distributed for

m \to \infty

with:

\begin{matrix} m^{1 / 2} ({\hat{L}}_{t} - L_{t}) \overset{L}{⟶} N (0, {(\nabla L_{t})}^{'} Π_{t} \nabla L_{t}), \end{matrix}

(10)

with the gradient given as:

\nabla L_{t} = {(\frac{2}{σ_{p, t}^{2} - σ_{p a, t}} - \frac{σ_{a, t}^{2}}{σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2}}, \frac{2 σ_{p a, t}}{σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2}} - \frac{2}{σ_{p, t}^{2} - σ_{p a, t}}, \frac{- σ_{p, t}^{2}}{σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2}})}^{'} .

A similar asymptotic distribution for the optimal realized weight

{\hat{w}}_{t}

could be obtained as a special case of the Theorem 1 result in Golosnoy et al. (2019). We provide the asymptotic variance of

{\hat{w}}_{t}

in Appendix A, together with the asymptotic covariances

A C o v ({\hat{w}}_{t}, {\hat{D}}_{t})

and

A C o v ({\hat{w}}_{t}, {\hat{L}}_{t})

for

m \to \infty

.

The distributional result in Equation (10) in Proposition 1 would be of particular importance for making tests for the usefulness of the additional asset in the portfolio composition. For example, the null hypotheses

H_{0} : D_{t} = D_{0}

can be tested by means of the statistic:

\begin{matrix} T_{D, t} & = & \sqrt{m} \frac{{\hat{D}}_{t} - D_{0}}{\sqrt{{(\hat{\nabla} D_{t})}^{'} {\hat{Π}}_{t} \hat{\nabla} D_{t}}} \overset{H_{0}}{\sim} N (0, 1) . \end{matrix}

(11)

If this statistic

T_{D, t}

is smaller than the

α

-quantile of the

N (0, 1)

distribution, no significant diversification benefits can be achieved by including the additional asset into the portfolio. The test for

H_{0} : L_{t} = L_{0}

could be conducted by analogy with the test statistic:

\begin{matrix} T_{L, t} & = & \sqrt{m} \frac{{\hat{L}}_{t} - L_{0}}{\sqrt{{(\hat{\nabla} L_{t})}^{'} {\hat{Π}}_{t} \hat{\nabla} L_{t}}} \overset{H_{0}}{\sim} N (0, 1) . \end{matrix}

(12)

To illustrate the results in Proposition 1, we conducted a Monte Carlo simulation study, which was designed as follows. For

T = 10^{4}

days, we drew m intraday returns

x_{t} = {(x_{t, p}, x_{t, a})}^{'}

from a bivariate normal distribution with

x_{t} \sim N (0, m^{- 1} Σ)

and

v e c h (Σ) = {(1, σ_{p a, t}, 1)}^{'}

. We performed this simulation study for

m = 78

corresponding to 5-min intraday sampling and for

m = 390

corresponding to 1-min intraday sampling. We selected the correlation

σ_{p a, t} \in [0, 0.3, 0.5]

. Relying on the results from Proposition 1, we computed (11) and (12) for each

t = 1, \dots, T

and then calculated the sample moments—mean, variance, skewness, kurtosis—as well as applied the Kolmogorov–Smirnov (KS) and Jarque–Bera (JB) test to check whether (11) and (12) followed a standard normal distribution. The values

D_{0}

and

L_{0}

were set to the true value for the given matrix

Σ

.

From the results reported in Table 1, we could observe that the sample moments were much better matched by

T_{L}

compared to

T_{D}

. The p-values of the KS and JB tests indicated that at intraday sampling frequency

m = 78

, the assumption of normality could be rejected in some cases. However, for

m = 390

, both

D_{t}

- and

L_{t}

-based test statistics appeared to be quite close to the null hypothesis of the standard normal distribution.

3. Time Series Model for Diversification Measures

The realized diversification measures

{\hat{D}}_{t}

and

{\hat{L}}_{t}

are observable at the end of period (day) t. However, the investor should decide whether to include or not the additional asset into the portfolio already at

t - 1

. Hence, it is necessary to forecast the diversification measures based on the information set

F_{t - 1}

in order to facilitate the investment decisions. Since the measure

D_{t}

is a function of

σ_{p, t}^{2}

,

σ_{a, t}^{2}

and

σ_{p a, t}

, we expect that both time series of

{\hat{D}}_{t}

and

{\hat{L}}_{t}

would exhibit similar properties, such as slowly decaying autocorrelation functions, which is also supported by the empirical evidence in Section 4. Next, we concentrate on time series modeling of the observable realized measures

{\hat{L}}_{t}

.

To capture the persistency of

D_{t}

, we utilized the Heterogeneous Autoregressive (HAR) model proposed by Corsi (2009), which is a natural choice for modeling log realized volatility series. The HAR models have several advantages for our purposes. First, they can be estimated by a simple application of the Ordinary Least Squares (OLS) methodology. Second, the model is sparsely parameterized so that only a few parameters need to be estimated. Third, the model is able to accommodate easily various shocks due to its cascade nature, which makes the HAR approach rather robust. The HAR model for

{\hat{D}}_{t}

is defined as:

\begin{matrix} {\hat{D}}_{t} = c + β_{d} {\hat{D}}_{t - 1} + β_{w} {\hat{D}}_{t}^{(w)} + β_{m} {\hat{D}}_{t}^{(m)} + ε_{t}, ε_{t} \sim i . i . d . (0, σ_{ε}^{2}), \end{matrix}

(13)

with

{\hat{D}}_{t}^{(w)} = (1 / 5) \cdot \sum_{i = 1}^{5} {\hat{D}}_{t - i}

and

{\hat{D}}_{t}^{(m)} = (1 / 22) \cdot \sum_{i = 1}^{22} {\hat{D}}_{t - i}

. Hence, the HAR model postulates that the current

D_{t}

depends on the previous day

D_{t - 1}

, as well as on averages over the last week and over last month. By analogy, we specify the model for

{\hat{L}}_{t}

:

\begin{matrix} {\hat{L}}_{t} & = & c + β_{d} {\hat{L}}_{t - 1} + β_{w} {\hat{L}}_{t}^{(w)} + β_{m} {\hat{L}}_{t}^{(m)} + ε_{t}, \end{matrix}

(14)

with

{\hat{L}}_{t}^{(w)} = \ln [(1 / 5) \cdot \sum_{i = 1}^{5} {\hat{D}}_{t - i}]

, etc.

For making investment decisions at

t - 1

, it is also of importance to forecast the GMVP weight

w_{t}

based on the information set

F_{t - 1}

. As the time series properties of realized

{\hat{w}}_{t}

are rather similar to those of the realized volatilities, we applied for

{\hat{w}}_{t}

the HAR model as well with:

\begin{matrix} {\hat{w}}_{t} = c + β_{d} {\hat{w}}_{t - 1} + β_{w} {\hat{w}}_{t}^{(w)} + β_{m} {\hat{w}}_{t}^{(m)} + ε_{t}, \end{matrix}

(15)

with

{\hat{w}}_{t}^{(w)} = (1 / 5) \cdot \sum_{i = 1}^{5} {\hat{w}}_{t - i}

, etc. The corresponding one-step-ahead forecasts are denoted as

{\tilde{D}}_{t}

,

{\tilde{L}}_{t}

, and

{\tilde{w}}_{t}

. The HAR-type models in (13)–(15) are estimated by the OLS methodology in Section 4.

4. Empirical Application

We structure our empirical study in the following way. First, we describe the data and construction of the portfolios in Section 4.1. Second, we estimate the time series models and provide the corresponding diagnostics in Section 4.2, where we compare the HAR approaches with simple AR(1) and AR(5) alternatives. Finally, in Section 4.3, we evaluate the performance of our approach by investigating whether our measures are suitable for reducing the portfolio variance from the investors’ perspective.

4.1. Data and Construction of Portfolios

Our sample consisted of 10 stocks listed in Table 1 ranging from 1 February 2001–31 December 2009 with

T = 2242

observations in total. This sample was investigated by Noureldin et al. (2012) and is available through Heber et al. (2009). We used the period until 28 February 2005 with 1022 observations as the in-sample to estimate our models. Table 2 displays the average in-sample and out-of-sample daily realized variances of all assets.

For our purposes, we constructed three portfolios, namely the equally-weighted portfolio with 10 assets denoted as

P_{e w}

, as well as two equally weighted portfolios from five stocks each with the highest and lowest average in-sample daily realized variances, denoted as

P_{high}

and

P_{low}

, respectively. In the out-of-sample period, which was manifested by the subprime mortgage crisis 2007–2009, the average portfolio variance increased compared to the in-sample period by 150.29% for

P_{high}

, by 75.13% for

P_{e w}

, and only by 5.01% for

P_{low}

.

We consider now the following setting: the investor holds the portfolio

P_{low}

and considers the possibility to reduce the portfolio risk by including the portfolio

P_{high}

as an additional asset. For this approach, we computed the realized diversification measures

{\hat{L}}_{t}

and the realized GMVP weights

{\hat{w}}_{t}

, which correspond to the proportion of

P_{low}

in the new portfolio.

In Figure 1 and Figure 2, we provide the autocorrelation function for

{\hat{L}}_{t}

and

{\hat{w}}_{t}

for both the in-sample and out-of-sample. Both measures appeared to be rather persistent, which is also taken into account by time series modeling in Section 4.2.

4.2. Time Series Modeling

For time series modeling of

{\hat{L}}_{t}

and

{\hat{w}}_{t}

, we applied the HAR models as in (14) and (15). Moreover, we considered both the AR(1) and AR(5) models as simple benchmarks for both processes with, e.g., AR(5) for the realized

{\hat{L}}_{t}

, parameterized as:

\begin{matrix} {\hat{L}}_{t} = c + ϕ_{1} {\hat{L}}_{t - 1} + ϕ_{2} {\hat{L}}_{t - 2} + ϕ_{3} {\hat{L}}_{t - 3} + ϕ_{4} {\hat{L}}_{t - 4} + ϕ_{5} {\hat{L}}_{t - 5} + ϵ_{t} . \end{matrix}

All three models were estimated by OLS with the results reported in Table 3.

Almost all model coefficients proved to be significantly different from zero. Considering the

{\bar{R}}^{2}

, the HAR model gave the best fit, followed by the AR(5). Moreover, the AR(5) and HAR both had lower values for AIC and BIC than the AR(1) model. Next, we analyzed the in-sample regression residuals to further check the models’ adequacy. In Figure 3, Figure 4 and Figure 5, we show the Autocorrelation Functions (ACF) of the models’ residuals and their squares.

Based on the ACF plots, we conclude that our HAR and AR(5) modeling removed residual autocorrelation, whereas some autocorrelation remained for the AR(1) approach. Furthermore, there appeared to be no autocorrelation in the squared residuals for all models. Additionally, in Table 4, we provide the results of residual tests, namely the Ljung–Box (LB) test for autocorrelation, the ARCH-LM test for heteroskedasticity, and Shapiro–Wilk (SW) test for the normality assumption.

Supporting the evidence from the ACF plots, the tests failed to reject the null hypotheses of no serial correlation and no ARCH effects for the HAR and AR(5) models. On the other hand, the Ljung–Box test rejected the null “no autocorrelation” for AR(1), indicating that this model does not reflect the underlying dynamics well enough. The normality assumption was clearly rejected for all models.

Next, we estimated the HAR, AR(5), and AR(1) models for the process of realized weights

{\hat{w}}_{t}

, with, e.g., the AR(5) model given as:

\begin{matrix} {\hat{w}}_{t} = c + ϕ_{1} {\hat{w}}_{t - 1} + ϕ_{2} {\hat{w}}_{t - 2} + ϕ_{3} {\hat{w}}_{t - 3} + ϕ_{4} {\hat{w}}_{t - 4} + ϕ_{5} {\hat{w}}_{t - 5} + ϵ_{t} . \end{matrix}

Similar to Table 3, in Table 5, we show the estimation results, whereas the model diagnostics are presented in Table 6. As for the case of

{\hat{L}}_{t}

, the model coefficient for

{\hat{w}}_{t}

were mostly highly significant. At first glance, AR(5) appeared to be preferred by AIC compared to AR(1) and HAR; however, judging by the adjusted

{\bar{R}}^{2}

, the HAR still seemed to be the best specification among the considered models.

In Figure 6, Figure 7 and Figure 8, we show the in-sample residual ACFs. As for

{\hat{L}}_{t}

, in the case of

{\hat{w}}_{t}

, the ACFs for the HAR and AR(5) residuals showed no remaining autocorrelation, whereas for AR(1), there was still some autocorrelation left. The in-sample diagnostic test results are shown in Table 6. The HAR and AR(5) models for

{\hat{w}}_{t}

seemed to pass all the tests, whereas AR(1) residuals showed some residual autocorrelation.

Summarizing our time series modeling, we could conclude that both HAR and AR(5) models seemed to be appropriate for modeling realized diversification benefits

{\hat{L}}_{t}

and realized portfolio weights

{\hat{w}}_{t}

. Next, we conduct out-of-sample analysis in Section 4.3 in order to investigate whether this modeling would be helpful to achieve lower portfolio variances.

4.3. Economic Evaluation

Now we provide the out-of-sample analysis within the following framework. Consider the investor holding the portfolio

P_{low}

and willing to know whether he/she should diversify it further by including the portfolio

P_{high}

as a potential additional asset. Based on the in-sample data, we estimated the time series models both for

{\hat{w}}_{t}

and

{\hat{L}}_{t}

and denote the corresponding one-step-ahead out-of-sample forecasts by

{\tilde{w}}_{t}

and

{\tilde{L}}_{t}

, respectively.

Next, consider that the investor is eager to diversify only if volatility could be reduced at least by a certain amount, for example because of the transaction costs argument. In practice, investors often make decisions by relying not on statistical significance, but on some (naive) empirical criteria; see, e.g., Brandt et al. (2009). In order to resemble this setting, we assumed that the investor seeks to diversify away at least

5 %

of portfolio risk, so that the ratio

σ_{low, t}^{2} / σ_{t}^{2}

must not exceed 0.95. This can be translated into a threshold ℓ for the log diversification measure

L_{t}

with the value

ℓ = \ln (1 / 0.95 - 1) = - 1.28

. Thus, the corresponding decision rule would be to diversify if the forecast

{\tilde{L}}_{t} \geq ℓ

and to stay by the initial portfolio if

{\tilde{L}}_{t} < ℓ

. Then, given the realized measures

{\hat{L}}_{t}

, one could learn in the next period whether this decision was correct or not. The resulting frequencies are visualized using

2 \times 2

decision matrices in Table 7.

Judging only from the percentage of correct predictions,

({\tilde{L}}_{t} < ℓ \cap {\hat{L}}_{t} < ℓ)

and

({\tilde{L}}_{t} \geq ℓ \cap {\hat{L}}_{t} \geq ℓ)

, the HAR model appeared to perform better than both AR(5) and AR(1). Note that the HAR approach is a rather conservative one, as it leads to frequent recommendations not to diversify compared, e.g., with AR(5). To sum up, the HAR produced the most correct predictions and, moreover, resulted in the fewest wrong and costly diversification signals.

As a next step, we incorporated into the decision procedure the forecasted portfolio weight

{\tilde{w}}_{t}

in order to quantify the amount of a possible portfolio variance reduction. The strategy would be as follows: select the diversified portfolio with the forecasted weight

{\tilde{w}}_{t}

in the case of

{\tilde{L}}_{t} > ℓ

, which would lead to the variance

σ_{t}^{2} ({\tilde{w}}_{t})

, or remain by the initial portfolio

P_{low}

in the case of

{\tilde{L}}_{t} < ℓ

with the variance

σ_{low, t}^{2}

. We denote the resulting portfolio variance from this diversification rule as

σ_{t}^{2} ({\tilde{w}}_{t}, {\tilde{L}}_{t})

, as we considered its ratio to the variances from three benchmark approaches:

σ_{t}^{2} ({\hat{w}}_{t})

corresponding to the ex-post GMVP,

σ_{low, t}^{2}

, and

σ_{e w, t}^{2}

for the portfolio with 50% in

P_{high}

and 50% in

P_{low}

. The comparison of different models is provided in Table 8.

The realized GMVP benchmark

σ_{t}^{2} ({\hat{w}}_{t})

provided the lower boundary, so it was reported primarily for comparison purposes. Concerning the portfolio

P_{low}

, the HAR model provided the possibility to reduce its variance by diversifying in more than 43% of days, leading to wrong decisions only in 12.3% of days. Similar evidence was found for the equally-weighted portfolio

P_{e w}

. The results became worse the for AR(5) models and appeared to be really unsatisfactory for the AR(1) approaches, where holding

P_{low}

led to a lower portfolio variance in more than 50% of days.

For a further illustration of our our results, we visualize the time series of portfolio variance ratios with respect to the benchmarks of

P_{low}

and

P_{e w}

. In particular, for the HAR, AR(5), and AR(1) approaches, we report the time series of

σ_{t}^{2} ({\tilde{w}}_{t}, {\tilde{L}}_{t}) / σ_{low, t}^{2}

and

σ_{t}^{2} ({\tilde{w}}_{t}, {\tilde{L}}_{t}) / σ_{e w, t}^{2}

in Figure 9 and Figure 10, respectively.

In Figure 9 for the benchmark

P_{low}

, we observe that the HAR-based approach suggested to diversify only at a comparatively small number of days, whereas most of the time, the ratio was equal to one, i.e., no diversification was recommended. It provided the major correct recommendation before the start of the crisis. The AR(5) suggested very often diversification decisions; however, they appeared to be mostly disadvantageous from the start of the subprime mortgage crisis in the middle of 2007. The AR(1) model provided mostly wrong diversification decisions, especially during the crisis year 2008. Note that the reasons for these false recommendations could be attributed to either

L_{t}

or

w_{t}

forecasting models. Hence, it is apparent that AR(1) is not really suitable for our purposes here.

Different from the case above, in Figure 10, for the benchmark

P_{e w}

, we observe that the HAR model provided reasonable diversification recommendations especially since the crisis began in 2007; however, it was not really useful before the crisis start. Surprisingly, the other two approaches—AR(5) and AR(1)—also performed similarly to the HAR for this equally-weighted portfolio benchmark. We interpreted these findings as evidence that not only the choice of the time series model, but also the choice of the benchmark could determine the success of a portfolio diversification strategy.

5. Conclusions

The availability of intraday returns allows constructing precise realized volatility measures, which should be used for the improvement of risk management procedures. In this paper, we introduced the novel realized measure for portfolio diversification benefits. Our procedure would help to decide whether to include or not to include an additional security into the risky asset portfolio in order to reduce its variance.

After providing the asymptotic properties of our realized diversification measures, we considered several time series models for them such that we formulated a diversification decision rule. The performance of these models was evaluated in the empirical study based on a dataset of 10 risky assets. We found that the HAR time series approach was mostly suitable for out-of-sample prediction of realized diversification measures, as well as in order to forecast the optimal proportion of wealth to invest into the additional asset.

Author Contributions

Conceptualization, methodology, formal analysis, resources, writing, and visualization: V.G., B.H. and S.K. Supervision, project administration, funding acquisition: V.G.

Funding

This research was in part financially supported by the Collaborative Research Center “Statistical modeling of nonlinear dynamic processes” (SFB 823, Teilprojekt A1) of the German Research Foundation (DFG).

Acknowledgments

The authors would like to thank the anonymous referees for providing helpful comments and suggestions which lead to the improvement of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Proof of Proposition 1

To derive the limit distribution of

{\hat{D}}_{t}

, we utilized the Delta method and the results of Barndorff-Nielsen and Shephard (2004), namely:

m^{1 / 2} (v e c h ({\hat{Σ}}_{t}) - v e c h (Σ_{t})) \overset{L}{⟶} N (0, Π_{t}) .

Note that this type of result would also hold for many other realized covariance matrix estimators; see, e.g., Lunde et al. (2016).

To write down the estimator of

Π_{t}

, define

y_{t, j} = v e c h (x_{t, j} x_{t, j}^{'})

, with

v e c h (A)

being the column stacking operator for the lower triangular matrix of the symmetric matrix A; see Lütkepohl (2005). The estimator

{\hat{Π}}_{t}

is then given by:

{\hat{Π}}_{t} = m \cdot [\sum_{j = 1}^{m} y_{t, j} y_{t, j}^{'} - 0.5 \sum_{j = 1}^{m - 1} (y_{t, j} y_{t, j + 1}^{'} + y_{t, j + 1} y_{t, j}^{'})] .

(A1)

Consequently,

{\hat{D}}_{t}

is a function of normally-distributed random variables and, thus, asymptotically normally distributed:

m^{1 / 2} ({\hat{D}}_{t} - D_{t}) \overset{L}{⟶} N (0, {(\nabla D_{t})}^{'} Π_{t} \nabla D_{t}) .

The gradient

\nabla D_{t}

contains the partial derivatives of

D_{t}

with respect to

Σ

:

\nabla D_{t} = {(\frac{\partial D_{t}}{\partial σ_{p, t}^{2}}, \frac{\partial D_{t}}{\partial σ_{p a, t}}, \frac{\partial D_{t}}{\partial σ_{a, t}^{2}})}^{'} .

More precisely, it is given by

\nabla D_{t} = \frac{σ_{p, t}^{2} - σ_{p a, t}}{{(σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2})}^{2}} \cdot {(σ_{a, t}^{2} (σ_{p, t}^{2} + σ_{p a, t}) - 2 σ_{p a, t}^{2}, 2 σ_{p, t}^{2} (σ_{p a, t} - σ_{a, t}^{2}), - σ_{p, t}^{2} (σ_{p, t}^{2} - σ_{p a, t}))}^{'} .

The Delta method is also applied to derive the asymptotic distribution of

{\hat{L}}_{t} = \ln {\hat{D}}_{t}

:

{\hat{L}}_{t} = \ln {\hat{D}}_{t} = \ln (\frac{{\hat{σ}}_{p, t}}{{\hat{σ}}_{t}} - 1) = 2 \ln ({\hat{σ}}_{p, t}^{2} - {\hat{σ}}_{p a, t}) - \ln ({\hat{σ}}_{a, t}^{2} {\hat{σ}}_{p, t}^{2} - {\hat{σ}}_{p a, t}^{2}) .

The realized estimator

{\hat{L}}_{t}

is also asymptotically normally distributed:

m^{1 / 2} ({\hat{L}}_{t} - L_{t}) \overset{L}{⟶} N (0, {(\nabla L_{t})}^{'} Π_{t} \nabla L_{t}), with

\nabla L_{t} = {(\frac{2}{σ_{p, t}^{2} - σ_{p a, t}} - \frac{σ_{a, t}^{2}}{σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2}}, \frac{2 σ_{p a, t}}{σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2}} - \frac{2}{σ_{p, t}^{2} - σ_{p a, t}}, \frac{- σ_{p, t}^{2}}{σ_{a, t}^{2} σ_{p, t}^{2} - σ_{p a, t}^{2}})}^{'} .

Appendix A.2. Further Asymptotic Results

A further application of the Delta method yields the asymptotic distribution of

{\hat{w}}_{t}

, which is a special case of the results in Golosnoy et al. (2019), as well as the asymptotic covariances between

{\hat{w}}_{t}

and

{\hat{D}}_{t}

,

{\hat{w}}_{t}

and

{\hat{L}}_{t}

.

The realized weight of the original portfolio in the GMVP is given as:

\begin{matrix} {\hat{w}}_{t} = \frac{{\hat{σ}}_{a, t}^{2} - {\hat{σ}}_{p a, t}}{{\hat{σ}}_{a, t}^{2} + {\hat{σ}}_{p, t}^{2} - 2 {\hat{σ}}_{p a, t}} . \end{matrix}

It is also a consistent estimator of the true unknown optimal weight

w_{t}

.

As

{\hat{w}}_{t}

is a function of

{\hat{Σ}}_{t}

, its asymptotic distribution is also normal. The gradient

\nabla w_{t}

is given by:

\nabla w_{t} = {(σ_{a, t}^{2} + σ_{p, t}^{2} - 2 σ_{p a, t})}^{- 2} \cdot {(σ_{p a, t} - σ_{a, t}^{2}, σ_{a, t}^{2} - σ_{p, t}^{2}, σ_{p, t}^{2} - σ_{p a, t})}^{'} .

Therefore, the asymptotic distribution of

{\hat{w}}_{t}

for

m \to \infty

is:

m^{1 / 2} ({\hat{w}}_{t} - w_{t}) \overset{L}{⟶} N (0, {(\nabla w_{t})}^{'} Π_{t} \nabla w_{t}),

whereas the asymptotic covariances are given by

A C o v ({\hat{D}}_{t}, {\hat{w}}_{t}) = {(\nabla D_{t})}^{'} Π_{t} \nabla w_{t}

and

A C o v ({\hat{L}}_{t}, {\hat{w}}_{t}) = {(\nabla L_{t})}^{'} Π_{t} \nabla w_{t}

, respectively.

References

Barndorff-Nielsen, Ole E., and Neil Shephard. 2004. Econometric analysis of realized covariation: High frequency based covariance, regression, and correlation in financial economics. Econometrica 72: 885–925. [Google Scholar] [CrossRef]
Bera, Anil K., and Sung Y. Park. 2008. Optimal portfolio diversification using the maximum entropy principle. Econometric Reviews 27: 484–512. [Google Scholar] [CrossRef]
Best, Michael J., and Robert. R. Grauer. 1991. On the sensitivity of mean-variance-efficient portfolios to changes in asset means: Some analytical and computational results. Review of Financial Studies 4: 315–42. [Google Scholar] [CrossRef]
Brandt, Michael W., Pedro Santa-Clara, and Rossen Valkanov. 2009. Parametric portfolio policies: Exploiting characteristics in the cross-section of equity returns. Review of Financial Studies 22: 3411–47. [Google Scholar] [CrossRef]
Callot, Laurent A., Anders B. Kock, and Marcelo C. Medeiros. 2017. Modeling and forecasting large realized covariance matrices and portfolio choice. Journal of Applied Econometrics 32: 140–58. [Google Scholar] [CrossRef]
Chopra, Vijay K., and William T. Ziemba. 1993. The effects of errors in means, variances, and covariances on optimal portfolio choice. Journal of Portfolio Management 19: 6–11. [Google Scholar] [CrossRef]
Choueifaty, Yves, and Yves Coignard. 2008. Toward maximum diversification. Journal of Portfolio Management 35: 40–51. [Google Scholar] [CrossRef]
Corsi, Fulvio. 2009. A simple approximate long-memory model of realized volatility. Journal of Financial Econometrics 7: 174–96. [Google Scholar] [CrossRef]
DeMiguel, Victor, Lorenzo Garlappi, Francisco J. Nogales, and Raman Uppal. 2009a. A generalized approach to portfolio optimization: Improving performance by constraining portfolio norms. Management Science 55: 798–812. [Google Scholar] [CrossRef]
DeMiguel, Victor, Lorenzo Garlappi, and Raman Uppal. 2009b. Optimal versus naive diversification: How inefficient is the 1/N portfolio strategy? Review of Financial Studies 22: 1915–53. [Google Scholar] [CrossRef]
Evans, John L., and Stephen H. Archer. 1968. Diversification and the reduction of dispersion: An empirical analysis. The Journal of Finance 23: 761–67. [Google Scholar]
Frahm, Gabriel, and Christoph Memmel. 2010. Dominating estimators for minimum-variance portfolios. Journal of Econometrics 159: 289–302. [Google Scholar] [CrossRef] [Green Version]
Frahm, Gabriel, and Christof Wiechers. 2013. A diversification measure for portfolios of risky assets. In Advances in Financial Risk Management. Edited by Jonathan A. Batten, Peter MacKay and Niklas Wagner. London: Palgrave Macmillan. [Google Scholar]
Goetzmann, William N., and Alok Kumar. 2008. Equity portfolio diversification. Review of Finance 12: 433–63. [Google Scholar] [CrossRef]
Golosnoy, Vasyl, and Yarema Okhrin. 2009. Flexible shrinkage in portfolio selection. Journal of Economic Dynamics and Control 33: 317–28. [Google Scholar] [CrossRef]
Golosnoy, Vasyl, Wolfgang Schmid, Miriam I. Seifert, and Taras Lazariv. 2019. Statistical inferences for realized portfolio weights. Econometrics and Statistics 11. in press. [Google Scholar]
Heber, Gerd, Asger Lunde, Neil Shephard, and Kevin K. Sheppard. 2009. Oxford-Man Institute’s Realized Library. version 0.2. Oxford: Oxford-Man Institute, University of Oxford. [Google Scholar]
Jagannathan, Ravi, and Tongshu Ma. 2003. Risk reduction in large portfolios: Why imposing the wrong constraints helps. Journal of Finance 58: 1651–83. [Google Scholar] [CrossRef]
Ledoit, Olivier, and Michael Wolf. 2003. Improved estimation of the covariance matrix of stock returns with an application to portfolio selection. Journal of Empirical Finance 10: 603–21. [Google Scholar] [CrossRef]
Lütkepohl, Helmut. 2005. New Introduction to Multiple Time Series Analysis. Berlin/Heidelberg: Springer Science & Business Media. [Google Scholar]
Lunde, Asger, Neil Shephard, and Kevin K. Sheppard. 2016. Econometric analysis of vast covariance matrices using composite realized kernels and their application to portfolio choice. Journal of Business & Economic Statistics 34: 504–18. [Google Scholar]
Markowitz, Harry M. 1952. Portfolio selection. Journal of Finance 7: 77–91. [Google Scholar]
Meucci, Attilio. 2009. Managing diversification. Risk 22: 74–79. [Google Scholar]
Noureldin, Diaa, Neil Shephard, and Kevin K. Sheppard. 2012. Multivariate high-frequency-based volatility (heavy) models. Journal of Applied Econometrics 27: 907–33. [Google Scholar] [CrossRef]
Rudin, Alexander M., and Jonathan S. Morgan. 2006. A portfolio diversification index. Journal of Portfolio Management 32: 81–89. [Google Scholar] [CrossRef]

Figure 1. In-sample: autocorrelation functions of

{\hat{L}}_{t}

(left) and

{\hat{w}}_{t}

(right).

Figure 1. In-sample: autocorrelation functions of

{\hat{L}}_{t}

(left) and

{\hat{w}}_{t}

(right).

Figure 2. Out-of-sample: autocorrelation functions of

{\hat{L}}_{t}

(left) and

{\hat{w}}_{t}

(right).

Figure 2. Out-of-sample: autocorrelation functions of

{\hat{L}}_{t}

(left) and

{\hat{w}}_{t}

(right).

Figure 3.

{\hat{L}}_{t}

, in-sample autocorrelations of HAR residuals (left) and their squares (right).

Figure 3.

{\hat{L}}_{t}

, in-sample autocorrelations of HAR residuals (left) and their squares (right).

Figure 4.

{\hat{L}}_{t}

, in-sample autocorrelations of AR(5) residuals (left) and their squares (right).

Figure 4.

{\hat{L}}_{t}

, in-sample autocorrelations of AR(5) residuals (left) and their squares (right).

Figure 5.

{\hat{L}}_{t}

, in-sample autocorrelations of AR(1) residuals (left) and their squares (right).

Figure 5.

{\hat{L}}_{t}

, in-sample autocorrelations of AR(1) residuals (left) and their squares (right).

Figure 6.

{\hat{w}}_{t}

, in-sample autocorrelations of HAR residuals (left) and their squares (right).

Figure 6.

{\hat{w}}_{t}

, in-sample autocorrelations of HAR residuals (left) and their squares (right).

Figure 7.

{\hat{w}}_{t}

, in-sample autocorrelations of AR(5) residuals (left) and their squares (right).

Figure 7.

{\hat{w}}_{t}

, in-sample autocorrelations of AR(5) residuals (left) and their squares (right).

Figure 8.

{\hat{w}}_{t}

, in-sample autocorrelations of AR(1) residuals (left) and their squares (right).

Figure 8.

{\hat{w}}_{t}

, in-sample autocorrelations of AR(1) residuals (left) and their squares (right).

Figure 9. Ratios

σ_{t}^{2} ({\tilde{w}}_{t}, {\tilde{L}}_{t}) / σ_{low, t}^{2}

for the HAR, AR(5), and AR(1) models, from above to below. Note: the red lines correspond to a ratio of one, indicating equal variances.

Figure 9. Ratios

σ_{t}^{2} ({\tilde{w}}_{t}, {\tilde{L}}_{t}) / σ_{low, t}^{2}

for the HAR, AR(5), and AR(1) models, from above to below. Note: the red lines correspond to a ratio of one, indicating equal variances.

Figure 10. Ratios

σ_{t}^{2} ({\tilde{w}}_{t}, {\tilde{L}}_{t}) / σ_{e w, t}^{2}

for the HAR, AR(5), and AR(1) models, from above to below. Note: the red lines correspond to a ratio of one, indicating equal variances.

Figure 10. Ratios

σ_{t}^{2} ({\tilde{w}}_{t}, {\tilde{L}}_{t}) / σ_{e w, t}^{2}

for the HAR, AR(5), and AR(1) models, from above to below. Note: the red lines correspond to a ratio of one, indicating equal variances.

Table 1. Sample moments and p-values of Kolmogorov–Smirnov and Jarque–Bera tests for normality.

Block A: Monte Carlo Simulation Results for $T_{D}$
m	$σ_{pa, t}$	Mean	Variance	Skewness	Kurtosis	$p_{KS}$	$p_{JB}$
78	0	−0.064	$1.188$	−0.979	$5.374$	$0.051$	0
	0.3	−0.072	$1.133$	−0.855	$4.421$	$0.183$	0
	0.5	−0.098	$1.203$	−0.999	$5.067$	$0.098$	0
390	0	−0.056	1.054	−0.198	$3.113$	$0.576$	$0.029$
	0.3	−0.033	$1.001$	−0.383	$3.290$	$0.458$	0
	0.5	−0.039	$1.004$	−0.419	$3.359$	$0.341$	0
Block B: Monte Carlo Simulation Results for $T_{L}$
$m$	$σ_{pa, t}$	Mean	Variance	Skewness	Kurtosis	$p_{KS}$	$p_{JB}$
78	0	$0.105$	$1.049$	$0.210$	$3.234$	$0.048$	$0.008$
	0.3	$0.122$	$0.998$	$0.403$	$3.288$	$0.070$	0
	0.5	$0.138$	$0.997$	$0.557$	$3.462$	$0.019$	0
390	0	$0.018$	$1.046$	$0.239$	$3.088$	$0.732$	$0.007$
	0.3	$0.051$	$0.971$	$0.141$	$3.066$	$0.357$	$0.175$
	0.5	$0.060$	$0.963$	$0.195$	$3.018$	$0.378$	$0.042$

Note: computed by generating

T = 10^{4}

days with

m = 78

or

m = 390

intraday returns.

Table 2. Average realized daily variances of assets and portfolios.

Company	In-Sample	Out-of-Sample	% Change from In- to Out-of-Sample	Portfolio
Bank of America	1.68	8.63	414.64	$\} P_{high}$
Alcoa	3.32	6.30	89.76
American Express	3.16	5.47	73.23
J.P. Morgan	3.93	6.00	52.69
Exxon	1.73	2.36	36.15
General Electric	2.69	3.62	34.20	$\} P_{low}$
DuPont	2.25	2.76	22.93
IBM	2.05	1.84	−10.49
Microsoft	2.91	2.07	−28.82
Coca Cola	1.68	1.19	−29.18
$P_{e w}$	1.10	1.92	75.13
$P_{low}$	1.21	1.27	5.01
$P_{high}$	1.31	3.28	150.29

Note: in-sample: 1 February 2001–28 February 2005, 1022 obs.; out-of-sample: 1 March 2005–31 December 2009, 1220 obs.

Table 3. Time series model estimates,

{\hat{L}}_{t}

.

Table 3. Time series model estimates,

{\hat{L}}_{t}

.

Parameter	In-Sample			Full Sample
	HAR	AR(5)	AR(1)	HAR	AR(5)	AR(1)
c	$- \underset{(0.1443)}{0.4029}$ ***	$- (0.1067) 0.8895$ ***	$- \underset{(0.0797)}{1.5080}$ ***	$- \underset{(0.1215)}{0.4756}$ ***	$- \underset{(0.0893)}{1.0520}$ ***	$- \underset{(0.0674)}{1.8687}$ ***
$ϕ_{1}$ or $β_{d}$	$\underset{(0.0347)}{0.1039}$ ***	$\underset{(0.0314)}{0.1877}$ ***	$\underset{(0.0299)}{0.2941}$ ***	$\underset{(0.0234)}{0.1174}$ ***	$\underset{(0.0211)}{0.1958}$ ***	$\underset{(0.0201)}{0.3112}$ ***
$ϕ_{2}$		$\underset{(0.0316)}{0.1356}$ ***			$\underset{(0.0214)}{0.1376}$ ***
$ϕ_{3}$		$\underset{(0.0318)}{0.0708}$ **			$\underset{(0.0215)}{0.0845}$ ***
$ϕ_{4}$		$\underset{(0.0317)}{0.1461}$ ***			$\underset{(0.0214)}{0.1005}$ ***
$ϕ_{5}$		${\underset{(0.0314)}{0.0466}}^{}$			$\underset{(0.0211)}{0.0957}$ ***
$β_{w}$	$\underset{(0.0981)}{0.3743}$ ***			$\underset{(0.0675)}{0.4651}$ ***
$β_{m}$	$\underset{(0.1234)}{0.5586}$ ***			$\underset{(0.0852)}{0.4962}$ ***
AIC	$3598.5$	$3677.5$	$3762.0$	$8896.7$	$9002.7$	$9189.6$
BIC	$3623.0$	$3711.9$	$3776.7$	$8925.2$	$9042.7$	$9206.8$
adjusted ${\bar{R}}^{2}$	$0.160$	$0.147$	$0.086$	$0.176$	$0.163$	$0.096$

Standard errors are reported in parentheses; p-values: * <10%, ** <5%, *** <1%.

Table 4.

{\hat{L}}_{t}

, in-sample residual diagnostic test statistics. LB, Ljung–Box; SW, Shapiro–Wilk.

Table 4.

{\hat{L}}_{t}

, in-sample residual diagnostic test statistics. LB, Ljung–Box; SW, Shapiro–Wilk.

	HAR	AR(5)	AR(1)
LB(5)	7.299 (0.199)	3.255 (0.661)	70.890 (6.7 $\times 10^{- 14}$ )
ARCH-LM	1.686 (0.891)	3.866 (0.569)	1.408 (0.923)
SW	0.843 (<2.2 $\times 10^{- 16}$ )	0.844 (<2.2 $\times 10^{- 16}$ )	0.850 (<2.2 $\times 10^{- 16}$ )

The corresponding p-values are reported in parentheses.

Table 5. Time series model estimates of

{\hat{w}}_{t}

.

Table 5. Time series model estimates of

{\hat{w}}_{t}

.

Parameter	In-Sample			Full Sample
Parameter	HAR	AR(5)	AR(1)	HAR	AR(5)	AR(1)
c	$\underset{(0.0304)}{0.1283}$ ***	$\underset{(0.0227)}{0.2012}$ ***	$\underset{(0.0162)}{0.3278}$ ***	$\underset{(0.0062)}{0.0104}$ *	$\underset{(0.0061)}{0.0256}$ ***	$\underset{(0.0066)}{0.0759}$ ***
$ϕ_{1}$ or $β_{d}$	$\underset{(0.0374)}{0.1377}$ ***	$\underset{(0.0315)}{0.2202}$ ***	$\underset{(0.0297)}{0.3179}$ ***	$\underset{(0.0253)}{0.1468}$ ***	$\underset{(0.0210)}{0.2886}$ ***	$\underset{(0.0151)}{0.7014}$ ***
$ϕ_{2}$		$\underset{(0.0320)}{0.1492}$ ***			$\underset{(0.0216)}{0.1961}$ ***
$ϕ_{3}$		$\underset{(0.0323)}{0.0598}$ *			$\underset{(0.0219)}{0.1306}$ ***
$ϕ_{4}$		$\underset{(0.0320)}{0.1297}$ ***			$\underset{(0.0216)}{0.1467}$ ***
$ϕ_{5}$		${\underset{(0.0315)}{0.0212}}^{}$			$\underset{(0.0210)}{0.1353}$ ***
$β_{w}$	$\underset{(0.0732)}{0.2794}$ ***			$\underset{(0.0481)}{0.3692}$ ***
$β_{m}$	$\underset{(0.0854)}{0.3131}$ ***			$\underset{(0.0432)}{0.4376}$ ***
AIC	$- 23.614$	$- 24.731$	$33.051$	$- 441.542$	$- 352.927$	$207.223$
BIC	$0.924$	$9.741$	$47.837$	$- 413.018$	$- 312.940$	$224.366$
adjusted ${\bar{R}}^{2}$	$0.157$	$0.155$	$0.100$	$0.620$	$0.605$	$0.492$

Standard errors are reported in parentheses; p-values: * <10%, ** <5%, *** <1%.

Table 6.

{\hat{w}}_{t}

, in-sample residual diagnostic test statistics.

Table 6.

{\hat{w}}_{t}

, in-sample residual diagnostic test statistics.

	HAR	AR(5)	AR(1)
LB(5)	11.929 (0.036)	2.886 (0.718)	64.117 (1.7 $\times 10^{- 12}$ )
ARCH-LM	8.353 (0.138)	7.347 (0.196)	10.760 (0.056)
SW	0.997 (0.119)	0.997 (0.088)	0.998 (0.163)

The corresponding p-values are reported in parentheses.

Table 7. Number of decisions for

ℓ = - 1.28

, 1220 out-of-sample observations.

Table 7. Number of decisions for

ℓ = - 1.28

, 1220 out-of-sample observations.

HAR	${\hat{L}}_{t} < ℓ$	${\hat{L}}_{t} \geq ℓ$
${\tilde{L}}_{t} < ℓ$
${\tilde{L}}_{t} \geq ℓ$
66.31% correct predictions
AR(5)	${\hat{L}}_{t} < ℓ$	${\hat{L}}_{t} \geq ℓ$
${\tilde{L}}_{t} < ℓ$
${\tilde{L}}_{t} \geq ℓ$
63.69% correct predictions
AR(1)	${\hat{L}}_{t} < ℓ$	${\hat{L}}_{t} \geq ℓ$
${\tilde{L}}_{t} < ℓ$
${\tilde{L}}_{t} \geq ℓ$
59.43% correct predictions

Table 8. Ratios of

σ_{t}^{2} ({\tilde{w}}_{t}, {\tilde{L}}_{t})

to different benchmark variances.

Table 8. Ratios of

σ_{t}^{2} ({\tilde{w}}_{t}, {\tilde{L}}_{t})

to different benchmark variances.

Benchmark	Mean Ratio	Std. Deviation of Ratio	% with >1	% with <1	% with =1
HAR to get ${\tilde{L}}_{t}$ and ${\tilde{w}}_{t}$
$σ_{t}^{2} ({\hat{w}}_{t})$	1.068	0.1029	1	0	-
$σ_{low, t}^{2}$	0.9576	0.0958	0.1230	0.4377	0.4393
$σ_{e w, t}^{2}$	0.817	0.2102	0.1779	0.8221	-
AR(5) to get ${\tilde{L}}_{t}$ and ${\tilde{w}}_{t}$
$σ_{t}^{2} ({\hat{w}}_{t})$	1.0884	0.1303	1	0	-
$σ_{low, t}^{2}$	0.9751	0.1083	0.2992	0.3385	0.3623
$σ_{e w, t}^{2}$	0.8258	0.1963	0.1705	0.8295	-
AR(1) to get ${\tilde{L}}_{t}$ and ${\tilde{w}}_{t}$
$σ_{t}^{2} ({\hat{w}}_{t})$	1.1833	0.264	1	0	-
$σ_{low, t}^{2}$	1.0588	0.2094	0.5025	0.3270	0.1705
$σ_{e w, t}^{2}$	0.8714	0.1454	0.1434	0.8566	-

Note: “% with >1” is % of days where the diversification rule delivers larger portfolio variance than the benchmark.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Golosnoy, V.; Hildebrandt, B.; Köhler, S. Modeling and Forecasting Realized Portfolio Diversification Benefits. J. Risk Financial Manag. 2019, 12, 116. https://doi.org/10.3390/jrfm12030116

AMA Style

Golosnoy V, Hildebrandt B, Köhler S. Modeling and Forecasting Realized Portfolio Diversification Benefits. Journal of Risk and Financial Management. 2019; 12(3):116. https://doi.org/10.3390/jrfm12030116

Chicago/Turabian Style

Golosnoy, Vasyl, Benno Hildebrandt, and Steffen Köhler. 2019. "Modeling and Forecasting Realized Portfolio Diversification Benefits" Journal of Risk and Financial Management 12, no. 3: 116. https://doi.org/10.3390/jrfm12030116

Article Menu

Modeling and Forecasting Realized Portfolio Diversification Benefits

Abstract

1. Introduction

2. Realized Measure of Diversification Benefits

2.1. Quantifying Diversification Benefits

2.2. Realized Measures for Diversification

3. Time Series Model for Diversification Measures

4. Empirical Application

4.1. Data and Construction of Portfolios

4.2. Time Series Modeling

4.3. Economic Evaluation

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A

Appendix A.1. Proof of Proposition 1

Appendix A.2. Further Asymptotic Results

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI