Skip Content
You are currently on the new version of our website. Access the old version .
IJFSInternational Journal of Financial Studies
  • Article
  • Open Access

14 May 2018

Risk-Based Portfolios with Large Dynamic Covariance Matrices

,
and
1
Nomura Asset Management Co., Ltd. 1-12-1 Nihonbashi, Chuo-ku, 103-0027 Tokyo, Japan
2
Graduate School of Business Sciences, University of Tsukuba, 3-29-1 Otsuka, Bunkyo-ku, 112-0012 Tokyo, Japan
3
Department of Risk Engineering, University of Tsukuba, 1-1-1 Tennodai, 305-8577 Tsukuba, Japan
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Asset Pricing and Portfolio Choice

Abstract

In the field of portfolio management, practitioners are focusing increasingly on risk-based portfolios rather than on mean-variance portfolios. Risk-based portfolios are constructed based solely on covariance matrices, and include methods such as minimum variance (MV), risk parity (RP), and maximum diversification (MD). It is well known that the performance of a mean-variance portfolio depends on the accuracy of the estimations of the inputs. However, no studies have examined the relationship between the performance of risk-based portfolios and the estimated accuracy of covariance matrices. In this research, we compare the performance of risk-based portfolios for several estimation methods of covariance matrices in the Japanese stock market. In addition, we propose a highly accurate estimation method called cDCC-NLS, which incorporates nonlinear shrinkage into the cDCC-GARCH model. The results confirm that (1) the cDCC-NLS method shows the best estimation accuracy, (2) the RP and MD do not depend on the estimation accuracy of the covariance matrix, and (3) the MV does depend on the estimation accuracy of the covariance matrix.
JEL Classification:
C13; C32; G11

1. Introduction

In the field of portfolio management, practitioners are focusing increasingly on risk-based portfolios, which use methods such as the minimum variance (MV), risk parity (RP), and maximum diversification (MD) to construct portfolios based solely on covariance matrices. Risk-based portfolios outperform traditional mean-variance portfolios (Markowitz 1952) and market capitalization-weighted portfolio, as shown in various empirical studies on stock selection (De Carvalho et al. 2012) and asset allocation (Chaves et al. 2011). As a result, the number of investment trusts and pension funds designed based on this concept are increasing.
It is well known that the performance of a mean-variance portfolio depends on the accuracy of the estimations of the inputs (Michaud 1989). In contrast, risk-based portfolios do not need to estimate expected returns, which are notoriously unstable and difficult to predict (Merton 1980). This is one of the main motivations for using a risk-based portfolio rather than a mean-variance portfolio. Recently, Ardia et al. (2017) estimated the accuracy of covariance matrices in terms of the weights of several risk-based portfolios. However, to the best of our knowledge, no studies have examined the relation between the performance of risk-based portfolios and the estimated accuracy of covariance matrices.
In this research, we compare the performances of risk-based portfolios for several estimation methods of covariance matrices. In addition, we propose a highly accurate estimation method that combines nonlinear shrinkage and the cDCC-GARCH model, called the cDCC-NLS method.
To estimate the covariance matrix in financial time series, it is necessary consider two important aspects: the cross section and the time series. With regard to the cross section, we have the difficulty of correcting the biases of the sample covariance matrix eigenvalues in a large number of time series. A popular method used to achieve this is the linear shrinkage method (LS) and its successor, the nonlinear shrinkage method (NLS), proposed by Ledoit and Wolf (2012). With regard to the time series aspect, we have to account for volatility clustering and time-varying correlations. To introduce a time-varying volatility and correlation structure, Engle (2002) proposed using the dynamic conditional correlation GARCH (DCC) model. Aielli (2013) modified the DCC model, proposing the corrected DCC-GARCH (cDCC) model. We combine the cDCC model and the NLS method to deliver an improved estimation of the covariance matrix. We compare various (c)DCC models (DCC, cDCC, DCC-LS, cDCC-LS, DCC-NLS, and cDCC-NLS) to estimate the accuracy of each method. To test whether the estimation accuracy of the covariance matrices affects the performance of the risk-based portfolio, we compare the accuracy of each method, including the proposed method. Then, we check whether the improvement affects the performance of risk-based portfolios in the Japanese stock market.
The proposed method (cDCC-NLS) offers the best estimation accuracy of the six methods. Furthermore, we confirm that, unlike MV, RP and MD do not depend on the estimation accuracy of the covariance matrix.
The remainder of the paper is organized as follows. Section 2 gives a brief description of risk-based portfolios and the estimation method using covariance matrices. Section 3 describes the proposed method, which consists of a cDCC-GARCH model and the nonlinear shrinkage method. Section 4 discusses a simulation of a large number of stocks in Japanese markets. Lastly, Section 5 concludes the paper.

3. Combining Nonlinear Shrinkage and the cDCC-GARCH Model

As described above, the estimation of the covariance matrix is problematic in terms of the cross section and the time series. To combine these two aspects, Hafner and Reznikova (2012) proposed an estimation method combining linear shrinkage and the DCC-GARCH model (DCC-LS), and Engle et al. (2017) proposed an estimation method combining nonlinear shrinkage and the DCC-GARCH model (DCC-NLS). We propose a highly accurate estimation method that combines nonlinear shrinkage and the cDCC-GARCH model, called the cDCC-NLS method. To summarize, the cDCC-NLS estimation follows a two-step process:
Step 1
For each asset, fit a univariate GARCH(1,1) model.
Step 2
Estimate the unconditional covariance matrix of the standardized errors using the nonlinear shrinkage method, and then maximize the composite likelihood of the cDCC model.
In the estimation of Step 2, Hafner and Reznikova (2012) used the sample covariance matrix with the linear shrinkage estimator (DCC-LS). Engle et al. (2017) later replaced this with the nonlinear shrinkage estimator (DCC-NLS). We combine nonlinear shrinkage and the cDCC-GARCH model and confirm its advantages in the next section.

4. Simulation Study

In order to check whether the performance of risk-based portfolios depends on the estimation accuracy of the covariance matrices, it is necessary to confirm (1) the estimation accuracy of each method, and (2) the effect of the estimation accuracy on performance.
In this section, we first compare the estimation accuracy of the covariance matrix using various (c)DCC models. We evaluate the covariance matrix estimation accuracy following Engle et al. (2017). Next, we examine the effect of the difference in the estimation accuracy on the performance of the risk-based portfolios.

4.1. Monte Carlo Study

In this section, we perform a Monte Carlo simulation to compare the estimation accuracy of the parameters of various (c)DCC models: the DCC, cDCC, DCC-LS, cDCC-LS, DCC-NLS, and cDCC-NLS models. Engle et al. (2017) defines an error function between the true covariance matrix generated by a (c)DCC model and the estimation. On the other hand, because of the sample estimator, the LS and NLS methods do not handle true covariance matrices at each time point, which means the errors cannot be calculated in the same way as in the (c)DCC model. Therefore, we limit our discussion to the (c)DCC model.
In order to compare our results with those of previous studies (Hafner and Reznikova 2012; Engle et al. 2017), we run a simulation study using the same setup. Let the covariance matrix at each time point of the DCC model, given a certain unconditional covariance and parameters, be a true covariance matrix. We generate random numbers according to the true DCC model, which we then use to estimate the covariance matrix. An error between the estimated covariance matrix and the true covariance matrix is defined as the estimation accuracy.
As in the previous studies, we estimate the unconditional population covariance matrix from the N { 100 , 500 , 1000 } most liquid stocks in the TOPIX index, using 60 months of daily data from January 2005 through December 2009. We then simulate a (c)DCC model using the unconditional population covariance matrix and the parameters a = 0.05 and b = 0.93 . The univariate volatility dynamics are GARCH(1,1) models with identical parameters ( ω i , α i , β i ) = ( 0.01 , 0.05 , 0.9 ) across all stocks i = 1 , , N . For each simulation, we generate an N × T matrix of simulated returns, where the sample size is T = 1250 . We run the simulation study for the same setup, with the exception of the unconditional population covariance matrix. Because we study the Japanese stock market in the next section, we use data on Japanese stocks to define the true unconditional covariance matrix.
First, we check the estimation accuracy of the (c)DCC model parameters a = 0.05 and b = 0.93 . The estimation results for each model are shown in Table 1 and Table 2. Table 1 and Table 2 show the average values and standard deviations of 10 5 / N estimates of each parameter ( a , b ) . There is no downward bias in the parameters described by Engle et al. (2008) or Aielli (2013) in either the estimation method or in the number of stocks. Therefore, it is considered that the difference in each method is the estimation accuracy of the standardized residual sample covariance matrix, which we use for the correlation targeting.
Table 1. The average and standard deviation of the (c)DCC parameter a = 0.05 using six correlation methods targeting dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 .
Table 2. The average and standard deviation of the (c)DCC parameter b = 0.93 using six correlation methods targeting dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 .
Next, we evaluate the accuracy of the covariance matrix estimation using the loss function of Engle et al. (2017). Let Σ t denote the N-dimensional “true” covariance matrix at time t, and Σ ^ t be an estimator of Σ t . The loss function L t ( Σ t ^ , Σ t ) and the average loss L ^ are defined below.
L t ( Σ t ^ , Σ t ) = T r ( Σ t ^ 1 Σ t Σ t ^ 1 ) / N [ T r ( Σ t ^ 1 / N ) ] 2 1 T r ( Σ t 1 / N )
L ^ = 1 T t = 1 T L t
Table 3 shows the results for L ^ , averaged across 10 5 / N Monte Carlo simulations.
Table 3. Average loss for (c)DCC estimators using six correlation methods targeting dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The unit is 10 3 .
We find that the average loss of the cDCC-NLS method is the lowest of the various methods for any number of assets. Therefore, the cDCC-NLS method shows the best estimation accuracy.

4.2. Performance of the Risk-Based Portfolios

In the previous section, we compared the accuracy of the covariance matrix estimation of each method using a Monte Carlo study. Thus, we were able to confirm that the cDCC-NLS method shows the best estimation accuracy. Next, we examine the effect of the difference in the estimation accuracy on the performance of risk-based portfolios. Here, we use the minimum variance (MV), minimum variance without short constraint (MVS), risk parity (RP), and maximum diversification (MD). We estimate the covariance matrices using the various (c)DCC models for the N { 100 , 500 , 1000 } most liquid stocks in the TOPIX index, as before. Each portfolio is re-balanced monthly, and the parameter estimation period of each method is 60 months. The out-of-sample period is from January 2000 to December 2015. In other words, we estimate the weights on the previous 60 months of data from January 2000, and then compute the value of the out-of-sample portfolio for the following month. Next, we roll the window forward by one month and repeat the process until December 2015.
We use an annualized Sharpe ratio to evaluate the performance of each portfolio. We also consider whether the cDCC-NLS (the most accurate method) delivers a higher out-of-sample Sharpe ratio than that of the DCC and the cDCC at a level that is statistically significant. For a given sample size N, a two-sided p-value for the null hypothesis of equal Sharpe ratios is obtained using the pre-whitened HACPW method, as described in (Ledoit and Wolf 2008). Table 4, Table 5, Table 6 and Table 7 show the annualized return, annualized volatility, and annualized Sharpe ratio for each risk-based portfolio.
Table 4. Annualized return, volatility, and Sharpe ratio for an MV portfolio using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
Table 5. Annualized return, volatility, and Sharpe ratio for an MV portfolio without short-sales constraints (MVS) using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
Table 6. Annualized return, volatility, and Sharpe ratio for an RP portfolio using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
Table 7. Annualized return, volatility, and Sharpe ratio for an MD portfolio using six methods for dimensions N { 100 , 500 , 1000 } , with sample size T = 1250 . The out-of-sample period is from January 2000 to December 2015. In the rows labeled Sharpe Ratio, the largest number appears in bold. In the columns labeled DCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: *** denotes significance at the 0.01 level; ** denotes significance at the 0.05 level; * denotes significance at the 0.1 level. In the columns labeled cDCC and cDCC-NLS, significant out-performance of one of the two portfolios over the other (in terms of the Sharpe ratio) is denoted by asterisks: +++ denotes significance at the 0.01 level; ++ denotes significance at the 0.05 level; + denotes significance at the 0.1 level.
All risk-based portfolios show improved performance as the number of stocks N increases. For the RP in Table 6 and the MD in Table 7, we confirm that performance does not depend on the estimation accuracy. There are no statistically significant differences between the DCC and cDCC-NLS or between the cDCC and cDCC-NLS. We further confirm that the weights of the RP and MD are similar across the models until December 2015. We consider that this explains why the performance of the RP and MD are similar.
On the other hand, for the MV in Table 4, the Sharpe ratio of the cDCC-NLS is an improvement over those of the DCC and the cDCC, especially for large numbers of stocks. In addition, the MVS in Table 5, which allows short selling, shows a significant improvement. For the MV and MVS, the cDCC-NLS records the highest Sharpe ratios for any numbers of stocks. According to Scherer (2011), the good performance of the MV portfolio is based on a low-risk anomaly. By being short in high volatility and long in low volatility, low-risk anomalies can be realized. Thus, we think the presence of short sales makes the low-risk anomaly possible. From these results, we conclude the RP and MD do not depend on the estimation accuracy of the covariance matrix, whereas the MV and MVS do depend on the estimation accuracy of the covariance matrix.

5. Conclusions

The contribution of this study is twofold. First, we propose the cDCC-NLS method, a highly accurate estimation method that combines nonlinear shrinkage and the cDCC-GARCH model. This method overcomes the difficulties that arise from the cross-section and time series aspects of estimating covariance matrices. We compared the performance of risk-based portfolios under several estimation methods for covariance matrices: the DCC, cDCC, DCC-LS, DCC-NLS, cDCC-LS, and cDCC-NLS. We performed an empirical analysis using a large number of stocks in Japanese markets and confirm the following results:
  • The cDCC-NLS method shows the best estimation accuracy.
  • The RP and MD do not depend on the estimation accuracy of the covariance matrix.
  • The MV does depend on the estimation accuracy of the covariance matrix.
Our recommendations for investors are as follows. When constructing a minimum variance portfolio, especially one that allows short selling, the covariance matrix should be estimated with good estimation accuracy using the cDCC-NLS method. However, for risk parity and maximum diversification portfolios, the estimation accuracy of the covariance matrix is not so important.
Although not considered in this research, many studies have examined the volatility asymmetry in which volatility tends to show a greater increase after negative returns than it does after positive returns. With regard to the conditional correlation, asymmetry is evident in the correlation between stocks tending to increase after negative returns, as in the case of volatility. In future research, we will investigate the impact on the performance of risk-based portfolios by introducing the asymmetry of volatility and the correlation.
In this study, we limited our scope to the (c)DCC model in order to estimate the error of covariance matrix. However, it is also important to examine the relationship between the performance of a risk-based portfolio and other estimation methods.

Author Contributions

K.N. provided the idea, constructed the hypotheses, analyzed the data and wrote the paper. M.I. performed statistical computations and helped analyzing the data. K.Y. supervised and edited the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Aielli, Gian Piero. 2013. Dynamic conditional correlation: on properties and estimation. Journal of Business & Economic Statistics 31: 282–99. [Google Scholar]
  2. Ardia, David, Guido Bolliger, Kris Boudt, and Jean-Philippe Gagnon-Fleury. 2017. The impact of covariance misspecification in risk-based portfolios. Annals of Operations Research 254: 1–16. [Google Scholar] [CrossRef]
  3. Bollerslev, Tim. 1986. Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics 31: 307–27. [Google Scholar] [CrossRef]
  4. Chaves, Denis, Jason Hsu, Feifei Li, and Omid Shakernia. 2011. Risk parity portfolio vs. other asset allocation heuristic portfolios. The Journal of Investing 20: 108–18. [Google Scholar] [CrossRef]
  5. Choueifaty, Yves, and Yves Coignard. 2008. Toward maximum diversification. The Journal of Portfolio Management 35: 40–51. [Google Scholar] [CrossRef]
  6. Clarke, Roger G., Harindra De Silva, and Steven Thorley. 2006. Minimum-variance portfolios in the us equity market. The Journal of Portfolio Management 33: 10–24. [Google Scholar] [CrossRef]
  7. De Carvalho, Raul Leote, Xiao Lu, and Pierre Moulin. 2012. Demystifying equity risk-based strategies: A simple alpha plus beta description. The Journal of Portfolio Management 38: 56–70. [Google Scholar] [CrossRef]
  8. Engle, Robert. 2009. Anticipating Correlations: A New Paradigm for Risk Management. Princeton: Princeton University Press. [Google Scholar]
  9. Engle, Robert F. 1982. Autoregressive conditional heteroscedasticity with estimates of the variance of united kingdom inflation. Econometrica: Journal of the Econometric Society 50: 987–1007. [Google Scholar] [CrossRef]
  10. Engle, Robert F. 2002. Dynamic conditional correlation: A simple class of multivariate generalized autoregressive conditional heteroskedasticity models. Journal of Business & Economic Statistics 20: 339–50. [Google Scholar]
  11. Engle, Robert F, Olivier Ledoit, and Michael Wolf. 2017. Large dynamic covariance matrices. Journal of Business & Economic Statistics, 1–13. [Google Scholar] [CrossRef]
  12. Engle, Robert F., Neil Shephard, and Kevin Sheppard. 2008. Fitting Vast Dimensional Time-Varying Covariance Models. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=1293629 (accessed on 11 May 2018).
  13. Hafner, Christian M., and Olga Reznikova. 2012. On the estimation of dynamic conditional correlation models. Computational Statistics & Data Analysis 56: 3533–45. [Google Scholar]
  14. Haugen, Robert A., and Nardin L. Baker. 1991. The efficient market inefficiency of capitalization—Weighted stock portfolios. The Journal of Portfolio Management 17: 35–40. [Google Scholar] [CrossRef]
  15. Ledoit, Olivier, and Sandrine Péché. 2011. Eigenvectors of some large sample covariance matrix ensembles. Probability Theory and Related Fields 151: 233–64. [Google Scholar] [CrossRef]
  16. Ledoit, Olivier, and Michael Wolf. 2004. A well-conditioned estimator for large-dimensional covariance matrices. Journal of Multivariate Analysis 88: 365–411. [Google Scholar] [CrossRef]
  17. Ledoit, Oliver, and Michael Wolf. 2008. Robust performance hypothesis testing with the sharpe ratio. Journal of Empirical Finance 15: 850–59. [Google Scholar] [CrossRef]
  18. Ledoit, Olivier, and Michael Wolf. 2012. Nonlinear shrinkage estimation of large-dimensional covariance matrices. The Annals of Statistics 40: 1024–60. [Google Scholar] [CrossRef]
  19. Lindsay, Bruce G. 1988. Composite likelihood methods. Contemporary Mathematics 80: 221–39. [Google Scholar]
  20. Maillard, Sébastien, Thierry Roncalli, and Jérôme Teïletche. 2010. The properties of equally weighted risk contribution portfolios. The Journal of Portfolio Management 36: 60–70. [Google Scholar] [CrossRef]
  21. Markowitz, Harry. 1952. Portfolio selection. The Journal of Finance 7: 77–91. [Google Scholar]
  22. Merton, Robert C. 1980. On estimating the expected return on the market: An exploratory investigation. Journal of Financial Economics 8: 323–61. [Google Scholar] [CrossRef]
  23. Michaud, Richard O. 1989. The markowitz optimization enigma: Is ‘optimized’ optimal? Financial Analysts Journal 45: 31–42. [Google Scholar] [CrossRef]
  24. Qian, Edward. 2005. Risk Parity Portfolios: Efficient Portfolios through True Diversification. Boston: Panagora Asset Management. [Google Scholar]
  25. Scherer, Bernd. 2011. A note on the returns from minimum variance investing. Journal of Empirical Finance 18: 652–60. [Google Scholar] [CrossRef]
  26. Silverstein, Jack W. 1995. Strong convergence of the empirical distribution of eigenvalues of large dimensional random matrices. Journal of Multivariate Analysis 55: 331–39. [Google Scholar] [CrossRef]
1.
The operator ∘ represents the Hadamard product, which is a binary operation that takes two matrices of the same dimensions. For two matrices, A and B of the same dimension, the Hadamard product A B , is a matrix of the same dimension as the operands, with elements given by A B i , j = ( A ) i , j ( B ) i , j .
2.
Note that the parameter vector ϕ = ( a , b ) is the same among all H jt .

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.