Next Article in Journal
Teaching Down to Earth—Service-Learning Methodology for Science Education and Sustainability at the University Level: A Practical Approach
Next Article in Special Issue
A Novel Extended Higher-Order Moment Multi-Factor Framework for Forecasting the Carbon Price: Testing on the Multilayer Long Short-Term Memory Network
Previous Article in Journal
Performance and Resilience of Socially Responsible Investing (SRI) and Conventional Funds during Different Shocks in 2016: Evidence from Japan
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of Stock Returns: Sum-of-the-Parts Method and Economic Constraint Method

College of Mathematics and Statistics, Changsha University of Science and Technology, Changsha 410114, Hunan, China
*
Author to whom correspondence should be addressed.
Sustainability 2020, 12(2), 541; https://doi.org/10.3390/su12020541
Submission received: 29 November 2019 / Revised: 8 January 2020 / Accepted: 8 January 2020 / Published: 10 January 2020
(This article belongs to the Special Issue Forecasting Financial Markets and Financial Crisis)

Abstract

:
Forecasting stock market returns has great significance to asset allocation, risk management, and asset pricing, but stock return prediction is notoriously difficult. In this paper, we combine the sum-of-the-parts (SOP) method and three kinds of economic constraint methods: non-negative economic constraint strategy, momentum of return prediction strategy, and three-sigma strategy to improve prediction performance of stock returns, in which the price-earnings ratio growth rate (gm) is predicted by economic constraint methods. Empirical results suggest that the stock return forecasts by proposed models are both statistically and economically significant. The predictions of proposed models are robust to various robustness tests.

1. Introduction

The prediction of stock returns has always been a concern of scholars and investors, as it affects some fundamentals of capital budgeting and investment processes [1]. Nevertheless, the prediction of stock returns is difficult. Goyal and Welch [2] argue that the out-of-sample prediction performance of predictive models based on a long series of economic variables does not outperform the historical average, although the interest models have a superior in-sample forecasting performance compared with the benchmark. Hence, researchers tend to adopt different predictors such as dividend yield [3], inflation [4], oil price increase [5], and technical indicators [6,7] for predicting stock returns. On the other hand, some researchers aim to gain more accurate stock return forecasts through various methods, for example: non-negative economic constraint (CT) [8], sum-of-parts (SOP) method [9], combination forecasts [10] and [11], the conditional Sharpe ratio [12], momentum of return prediction (MOP) [13], and three sigma rule (three-sigma) [14].
Although Ferreira and Santa-Clara [9] show that sum-of-the-parts (SOP) can improve the accuracy of prediction as a whole and have better out-of-sample results, things do not seem so optimistic when it comes to the predictive power of the price-earnings ratio growth rate (gm). To remedy this situation in the SOP method, we will combine SOP [9] and three kinds of economic constraint methods in Campbell and Thompson [8], Wang et al. [13], and Zhang et al. [14] to obtain stock return forecasting. In detail, SOP model divides the stock returns into three parts. First, each part is predicted separately, and then the predictive value of each part is added up to get the forecast of stock returns. Moreover, in the process of coefficient estimation, a certain constraint can be imposed on its coefficients of the predictive model. Relative to the original SOP method, the influence of 14 economic variables [9] on the SOP component is also taken into account. In other words, we first decompose the stock returns into three different parts: earnings growth rate (ge), dividend-price ratio (dp), and price-earnings ratio growth rate (gm). Concerning dp, we use the true value because of its high persistence. For ge, we construct its 20-year moving average [9] as its predictive value. The forecasting methods of the first two components are the same as those of the previous literature. However, with regard to the third component of gm, we use 14 economic variables to forecast it, instead of assuming that gm is 0, as in simple versions of SOP. It is noteworthy that we limit the parameters when regressing gm and macroeconomic variables in order to improve the performance of prediction.
After obtaining the predictive values of gm with parameter constraints, we adopt three strategies to get the final forecast of gm. The first one is CT strategy [8], that is to say, we only consider the part of gm larger than 0 simply because we prefer to get the positive stock return prediction value. To reduce the impact of extreme value, we use a new three-sigma strategy slightly different from Zhang et al. [14]. When the gm at time t + 1 is beyond the range of three-sigma, we use the value of g ^ m t , otherwise we use the g ^ m t + 1 C T value based on the CT strategy. The third one is the MOP strategy [13]. MOP is the momentum effect, which believes that some attribute of the past period can be extended to the present. The reason why we argue gm may have momentum effect is because it is a part of stock return, and the predictability of stock return considering momentum effect obviously improves according to Wang et al. [13]. Thus, we obtain three gm predictions based on different strategies. We call the method, considering three different strategies constrained, SOP because it can be interpreted as imposing certain constraint on the predictive values of gm.
In the empirical analysis, our sample data spanned from December 1927 to December 2018. Recursive window method was used to generate out-of-sample stock return forecasts in this paper. Following Binsbergen and Koijen [15], Paye [16] and Rapach et al. [17], we employed the monthly out-of-sample R 2 ( R O S 2 ) calculated based on relative percentage decrease of mean squared predictive error (MSPE) to evaluate the performance of prediction and the p-value of Clark and West [18] statistic to check the significance from a statistical perspective. From the out-of-sample stock return forecasts, we can easily find that the SOP model with CT, three-sigma, and MOP constraints increase the average R O S 2 by 0.132%, 0.158%, and 0.401%, respectively, compared with SOP model, which indicates that imposing certain restrictions on the SOP model will indeed improve the predictability of stock returns. What is more, our new three-sigma SOP model yields the largest increase in average out-of-sample R O S 2 .
Regarding economic performance, we further calculated the certainty equivalent return (CER) gain, the difference between the CER for a mean-variance investor with risk coefficient of five, who uses a SOP or constraint-SOP predictive regression forecast of the equity risk premium and the CER gained by utilizing benchmark forecast according to Rapach and Zhou [19]. In univariate analysis, the average CER gain increased from 0.688% of SOP model to 0.799%, 0.891%, and 0.843% when imposing CT, MOP, and three-sigma constraints, respectively. This again supports the evidence that generally combining SOP with three constraints moderately improves out-of-sample economic values. In addition, after using principal component predictive regression to predict price-earnings multiple growth rates, we can obtain significant stock return predictability both in statistical and economic terms, which suggests that incorporating multivariate information is conducive to improve the out-of-sample performance.
The predictability of stock return using the proposed models is robust, even considering reasonable transaction costs or various risk aversion coefficients. This is still the case that the robustness test again supports the advantages of our constraint-SOP models.
The rest of this paper is organized as follows. Section 2 introduces the methodology. We elaborate on two aspects: the form of SOP model and three kinds of constraint-SOP models, and out-of-sample evaluation is also presented. Section 3 reports descriptive statistical analysis on the monthly data. Section 4 discusses the out-of-sample empirical results including statistical and economic performance. Section 5 gives the robustness test. Section 6 concludes this paper.

2. Methodology

2.1. Return Decomposition

Following Ferreira and Santa-Clara [9], we decomposed the stock return into three parts. In addition, we obtained the stock return forecast through adding up the predictive values of each decomposed part.
The stock returns at month t + 1 , R t + 1 can be divided into capital gains, C G t + 1 , and dividend yield, D Y t + 1 :
1 + R t + 1 = 1 + C G t + 1 + D Y t + 1 = P t + 1 P t + D t + 1 P t
where P t + 1 is the stock return at time t + 1 , D t + 1 is dividend per share paid from time t to t + 1 .
Capital gains can be written as:
D Y t + 1 = D t + 1 P t = D t + 1 / E t + 1 P t / E t E t + 1 E t = D P t + 1 ( 1 + G M t + 1 ) ( 1 + G E t + 1 )
where M t + 1 is the price-earnings multiple, G M t + 1 is the price-earnings multiple growth rate, and G E t + 1 denotes the earnings growth rate.
Similarly, the dividend yield can be divided into:
D Y t + 1 = D t + 1 P t = D t + 1 / E t + 1 P t / E t E t + 1 E t = D P t + 1 ( 1 + G M t + 1 ) ( 1 + G E t + 1 )
where D P t + 1 is the dividend/price ratio.
Therefore, Equation (1) can be rewritten substituting capital gain, C G t + 1 , and the dividend yield, D Y t + 1 :
1 + R t + 1 = ( 1 + G M t + 1 ) ( 1 + G E t + 1 ) + D P t + 1 ( 1 + G M t + 1 ) ( 1 + G E t + 1 ) = ( 1 + D P t + 1 ) ( 1 + G E t + 1 ) ( 1 + G M t + 1 )
Finally, we take logarithms of Equation (4) to get the following:
r t + 1 = log ( 1 + R t + 1 ) = d p t + 1 + g e t + 1 + g m t + 1
where lowercase ones are rates in logarithmic form. For instance, g m t + 1 is the log of one plus price-earnings multiple growth rate.

2.2. The Sum-of-the-Parts Method

As shown in Section 2.1, we forecasted separately the components of the stock return from Equation (5):
E t r t + 1 = E t d p t + 1 + E t g e t + 1 + E t g m t + 1
where E t g e t + 1 is the expected earnings growth estimated using a 20-year moving average and regarded as the low-frequency prediction part in line with Binsbergen and Koijen [15]. According to the analysis results of Ferreira and Santa-Clara [9], E t d p t + 1 , the expected dividend-price ratio, is assumed to be a random walk, so it is equivalent to the present true value.
Following the extended version of the SOP method in Ferreira and Santa-Clara [9], we used 14 economic variables to predict the E t g m t + 1 , rather than making it equal to zero as in the simple version of the SOP (see Ferreira and Santa-Clara [9]). The following univariate predictive regression model can be used to predict the price-earnings multiple growth rate:
g ^ m t + 1 = a ^ i + b ^ i x i , t + ε t + 1
where x i , t is the i-th macroeconomic variable at month t . Following Ferreira and Santa-Clara [9] and Connor [20], in order to lessen the influence of model uncertainty on prediction performance, we restricted the intercept and the slope coefficient, instead of directly using a ^ i and b ^ i , estimates of ordinary least squares (OLS) regression. Along with Ferreira and Santa-Clara [9] and Connor [20], we converted the slope coefficient into b i * = t t + 1200 b ^ i and a i * = g ¯ m t b i * x ¯ t . Hence, g ^ m t + 1 refers to the predictive value of log of one plus the price-earnings multiple growth rate with shrinkage in subsequent studies.
In addition, we will incorporate information from multiple variables to predict the price-earnings multiple growth rate based on principal components. Let X t = ( x 1 , t , x 2 , t x 14 , t ) T denote the 14 vectors of the entire set of variables, and let E t = ( e 1 , t ,   e 2 , t   e K , t ) T denote the vector containing the first K principal components extracted from E t (where K < 14). The principal component predictive regression (PC model) is given as follows:
g ^ m t + 1 = α + k = 1 K β k   e k , t + ε t + 1
Principal components parsimoniously incorporate information from a large number of potential predictors in a predictive regression, where the first few principal components identify the key co-movements among the entire set of predictors. Following Neely et al. [5], we also selected the number of principal components as K = 3.
Overall, the SOP method adopted in this paper is:
r ^ t + 1 = α ^ + β ^ d p t + g ¯ e t + g ^ m t
where g ¯ e t denotes the twenty-year moving average of g e up to month t , d p t means the log of one plus the dividend/price ratio at month t , and g ^ m t is obtained by Equations (7) or (8).

2.3. Forecasting with Constrained-SOP Model

In order to improve the prediction accuracy of the log price-earnings multiple growth rate (gm), we will impose reasonable constraints on Models (7) and (8). In this paper, three economic constraints were applied in the process of forecasting.
Campbell and Thompson [8] argue that reasonable investors do not consider a negative stock return forecast. In the same light, we expect a positive price-earnings multiple growth rate. Consequently, the CT forecasts are defined as:
g ^ m i , t + 1 C T = max ( 0 , g ^ m i , t + 1 )
where g ^ m i , t + 1 C T denotes the CT predictive value of gm at month t + 1 .
The second is based on momentum effect. Wang et al. [13] found if past predictability of the benchmark model outperforms, it is more likely to still beat the current model of interest. The past predictability is defined as follows:
p p t ( k ) = I ( ( j = t k j = t 1 ( g m j g ^ m j ) 2 j = t k j = t 1 ( g m j g ¯ m j ) 2 ) < 0 )
where p p t ( k ) equals one when the model of interest performs better than the benchmark. k is the look-back period. It is evident that p p t ( k ) is affected by k . We finally chose k = 3 as the out-of-sample performance, as it is relatively better than the remaining look-back periods. The momentum of predictability (MOP) strategy is as follows:
{ g ^ m t + 1 M O P ( k ) = g ^ m t + 1 ,   i f p p t ( k ) = 1 , g ^ m t + 1 M O P ( k ) = g ¯ m t + 1 ,   i f p p t ( k ) = 0
where g ^ m t + 1 M O P ( k ) denotes the predictive value of gm based on the MOP strategy, and g ¯ m t + 1 is the historical average up to time t . g ^ m t + 1 represents the predictive value of log of one plus the price-earnings multiple growth rate with shrinkage.
The third one is based on the three-sigma strategy. Zhang et al. [14] argued that a rational investment behavior is unlikely to be implemented when the return forecast is too large or small. Moreover, rational investors tend to get a positive forecast. In view of these two points, we put forward the following new three-sigma constraint strategy slightly different from method proposed in Zhang et al. [14].
g ^ m i , t + 1 n e w = { g ^ m i , t ,   i f g ^ m i , t + 1 > g m i , t + 3 σ t g ^ m i , t ,   i f g ^ m i , t + 1 < g m i , t 3 σ t g ^ m i , t + 1 C T , o t h e r w i s e
where g ^ m i , t is the OLS predictive value of log of one plus the price-earnings multiple growth rate with shrinkage, and σ t denotes the standard deviation of log of one plus price-earnings multiple growth rate for month t generated by using an expanding window. g ^ m i , t + 1 n e w is the forecast on account of new three-sigma constraints. g ^ m i , t + 1 C T is the forecast in Model (12), and it usually helps to exclude the impact of outliers.
Overall, we combined the SOP method in Equation (9) and three kinds of constraints to receive more precise forecast results on the stock return.

2.4. Forecast Evaluation

We used the recursive window method to generate out-of-sample forecasts. To be more specific, the full sample of T observations was divided into two parts: M observations for the in-sample part and N observations for the out-of-sample segment.
The first forecast is obtained by regressing { r t } t = 2 M on a constant and { independent variables t } t = 1 M 1 . The second is received by regressing { r t } t = 3 M + 1 on a constant and { independent   variables t } t = 2 M . The progress will not be interrupted until N out-of-sample predictions are obtained.
Following [21,22,23,24,25,26,27,28,29], the forecast quality is evaluated by R o s 2 , which is the percent reduction of MSPE of the given model compared to the benchmark model, given by:
R o s 2 = 1 M S P E m o d e l M S P E b e n c h
where M S P E m o d e l = 1 T M t = M + 1 T ( r t r ^ t ) 2 and M S P E b e n c h = 1 T M t = M + 1 T ( r t r ¯ t ) 2 . r ^ t and r ¯ are the return forecast from the model of interest and historical average forecast, respectively. The historical average benchmark is r ¯ t + 1 = 1 t i = 1 t r t . As we know, a positive R o s 2 value indicates that the predictive regression forecast outperforms the benchmark in terms of MSPE, while a negative value means the opposite.
Moreover, we used Clark and West [19] statistics (i.e., MSFE-adjusted statistics) to further explore the significance of R o s 2 . The null hypothesis is M S P E b e n c h M S P E m o d e l against the upper-tail alternative hypothesis M S P E b e n c h > M S P E m o d e l . The CW statistic is computed by first defining:
f t = ( r t r ¯ t ) 2 ( r t r ^ t ) 2 + ( r ¯ r ^ ) 2
The t-statistic through a regression of f t on a constant is the CW statistic, and its p-value can be used to explore the significance.

3. Data and Summary Statistics

3.1. Stock Return

Considering the availability of the data, we used the return of S&P 500 index (hereinafter referred to as S&P 500 or SP500) as market return. The data can be obtained from https://finance.yahoo.com/, and the stock return is computed as:
r t =   100   × ( l o g ( P t ) l o g ( P t 1 ) )
where P t is the closing price at month t . For the convenience of showing the forecast effect, we multiplied the logarithmic return by 100.

3.2. Macroeconomic Variables

In order to explore the predictability of stock return, we selected 14 monthly macroeconomic variables from Amit Goyal’s home page at http://www.hec.unil.ch/agoyal/. In detail, we included the following variables:
  • stock variance (SVAR), the sum of the squared daily returns on the S&P 500 index;
  • default return spread (DFR), long-term corporate bond return minus the long-term government bond return;
  • long-term yield (LTY), long-term government bond yield;
  • long-term return (LTR), return on long-term government bonds;
  • inflation (INFL), calculated from the CPI (all urban consumers). We used one-month-lagged values because of the delay in CPI releases;
  • term spread (TMS), long-term yield minus the treasury bill rate;
  • treasury bill rate (TBL), interest rate on a three-month treasury bill (secondary market);
  • default yield spread (DFY), difference between Moody’s BAA- and AAA-rated corporate bond yields;
  • net equity expansion (NTIS), ratio of a twelve-month moving sum of net equity issues by New York Stock Exchange (NYSE)-listed stocks to the total end-of-year market capitalization of NYSE stocks
  • dividend/payout ratio (log; DE), log of a twelve-month moving sum of dividends minus the log of a twelve-month moving sum of earnings;
  • earnings/price ratio (log; EP), log of a twelve-month moving sum of earnings on the S&P 500 index minus the log of stock prices;
  • dividend/price ratio (log; DP), log of a twelve-month moving sum of dividends paid on the S&P 500 index minus the log of stock prices (S&P 500 index);
  • dividend yield (log; DY), log of a twelve-month moving sum of dividends minus the log of the lagged stock prices; and
  • book-to-market ratio (BM), Book-to-market value ratio for the Dow Jones Industrial Average.
Table 1 displays summary statistics for stock returns and 14 macroeconomic variables from December 1927 to December 2018 for the full-sample period at a monthly frequency. The average monthly stock return was 0.769%, and the monthly standard deviation was 5.375%. Although the autocorrelation of SP500 was very low, 0.078%, 10 out of 14 economic variables were strongly autocorrelated, as their autocorrelation coefficients were more than 0.9. In terms of the mean, we can see that the mean value of SP500 approximately equaled the sum of means of dp, ge and gm. This also verified the correctness of the formula deduced in the SOP decomposition process.

4. Empirical Performance

4.1. Out-of-Sample Forecasting Performance

Following Ferreira and Santa-Clara [9], we forecasted the stock return combining the SOP method and the CT strategy, the MOP strategy, or the three-sigma strategy, rather than only using different economic variables as for the predictors in the third part.
Table 2 reports prediction statistical results from an out-of-sample standpoint. The R O S 2 and the p-value of Clark and West [18] MSFE-adjusted statistics were used to evaluate whether our models could beat the benchmark. The forecast period ranged from December 1947 through December 2018. Twelve of 14 macroeconomic variables had positive R O S 2 values, which was nearly consistent with the literature (e.g., Ferreira and Santa-Clara [9]). That is to say, the SOP method with multiple growth gm and shrinkage outperformed the historical average method.
Interestingly, after adding CT constraint to the SOP method, we found that 12 predictors generated larger R O S 2 statistics compared to the original SOP method. When we implemented the three-sigma strategy in the SOP predictive model, the vast majority of R O S 2 values significantly matched or even exceeded the counterparts. It is remarkable that all R O S 2 values for three constraint methods were positive, as shown in the third to fifth column, meaning the three constraint methods outperformed the benchmark model. Nevertheless, though the SOP method based on the MOP switching regime failed to outperform the SOP method for all the predictors when the look-back period k was set to 3, the MOP-based SOP method applied to EP and DP regressions improved from −0.44% to 0.129% and from −0.157% to 0.532%, respectively, which indicates that it is still helpful for predictability improvement. The SOP based on three-sigma strategy outperformed the above three SOP methods and yielded the largest R O S 2 , 1.535%, at a 1% significance level. When considering out-of-sample results for the principal component predictive regression forecasts, it was observed that the out-of-sample performance of the principal component predictive regression forecasts manifested a stronger out-of-sample predictive ability than that of the univariate predictive regression model. Meanwhile, for the principal component predictive regression forecasts, the economic constraint SOP models also yielded larger R O S 2 values than the original SOP model.
Overall, the predictive ability of SOP was stronger with the addition of the CT constraint, MOP, and three-sigma strategy. Among them, the three-sigma strategy had the best performance.

4.2. Asset Allocation

In the above subsection, we investigated the out-of-sample performance from a statistical significance perspective. We then quantified the economic value of stock return forecasts based on four kinds of SOP models. Namely, we subsequently computed the certainty equivalent return (CER) and its gain. As in [30,31,32,33,34,35,36,37,38,39], we imagined investor as a risk-averse and a mean-variance utility investor who optimally allocated her wealth across stock index and risk-free bills in light of various stock return forecasts. The weight of the portfolio to equities during month t + 1 is given by
ω t = ( 1 γ ) ( r ^ t + 1 σ ^ t + 1 2 )
where r ^ t + 1 denotes forecasts of the stock return based on four kinds of SOP approaches. We set the risk aversion parameter γ to 5. σ ^ t + 1 2 is the forecast of stock return variance computed from a five-year rolling window of the past returns. We also restricted ω t to be between −0.5 and 1.5 to have over 50% financial leverage.
The portfolio return at month t + 1 is as follows:
R p , t + 1 = ω i , t r t + 1 + R f , t + 1
where R f , t + 1 is the risk-free rate. The CER for portfolio is
C E R p = μ ^ p 1 2 γ σ ^ p 2
where μ ^ p and σ ^ p 2 denote the mean and variance forecasts, respectively. We then calculated CER gain, the difference between the CER achieved through employing the four kinds of SOP models, to forecast stock returns and the CER using the benchmark forecasts. Following Rapach et al. [10], we multiplied the CER gain by 1200. We can thus treat the CER gain as the management fee that an investor would be willing to accept to have access to a trading strategy based on the four kinds of SOP methods, instead of one based on the benchmark forecast.
Table 3 shows the portfolio results of the single economic variable and the principal component predictive regression forecasts. Comparing with the economic performance of the SOP model, we found 11 of 15 variables with CT constraints improved their CER gains. Using the univariate three-sigma SOP model, we obtained the highest CER gains of 224.8 basis points in the out-of-sample, demonstrating the predictability of stock return was economically significant. Moreover, 9 of 14 predictors in the three-sigma SOP models achieved CER gains of over 1%, and 11 of 14 predictors including variables like LTY, DE, and DY improved their CER gains after implementing the CT and three-sigma strategy. In detail, the CER values increased from 1.318% to 1.538% for LTY, from 1.26% to 2.248% for DE, and from 1.671% to 2.024% for DY after implementing the three-sigma strategy. Twelve of 14 predictors in the MOP-SOP model had positive CER gains, and even 8 predictors had more than 1%, although they failed to perform better than the SOP method. When considering economic performance for the principal component predictive regression forecasts, we can see that the CER gain of the principal component predictive regression forecasts was larger than that of the univariate predictive regression model, which is consistent with the out-of-sample prediction performance in Section 4.1. Meanwhile, the CER gain of economic constraint SOP models was larger than that of the original SOP model for the principal component predictive regression forecasts. Generally, the CER gain of the SOP model improved to some extent after adding different restrictions, and most CER gains were greater than 1%, which indicates that the economic performance of the SOP model with restrictions was superior. In a word, the performance of the SOP model with constraints performed better than the historical average model.
When we compute the certainty equivalent return (CER) for a mean-variance utility investor who allocates his or her assets between a stock index and the risk-free asset by using various stock return forecasts, the SOP model and the CT constraint SOP model can promote systematic gains (except for the predictors DP, EP, and NTIS), and the MOP-SOP model and three-sigma SOP model can promote systematic gains (except for the predictors EP and NTIS).

5. Robustness Check

5.1. Investor Risk Aversion Choices

In the earlier part of the article, we assumed that an investor’s relative risk aversion equals 5. However, in fact, the optimal weight may be affected by the investor’s relative risk aversion. Therefore, in order to further examine whether the economic performance of stock return makes a difference according to the investor’s relative risk aversion, it is worthwhile to consider other risk aversion values. Here, we employed alternative risk aversion coefficients of 3 and 4 and re-ran the asset allocation to check the robustness.
Table 4 reports CER gains calculated with alternative risk aversion coefficients. In univariate regression, whether the risk aversion coefficient was 3 or 4, we can see that 11 out of 15 predictors in the CT-SOP model increased CER gains clearly compared with the original SOP model. When γ was 3, it is noteworthy that 11 out of 14 predictors of CER gains of SOP model with three-sigma restriction were higher relative to the original SOP model, while only 7 predictors in SOP models with MOP constraints were better. Besides, when switching to the risk aversion coefficient of 4, it was the same case, which is consistent with analysis results in Section 4.2. At the same time, we can note that CER gain decreased with the increase of risk aversion coefficients. The CER gain of economic constraint SOP models was larger than that of the original SOP model for the principal component predictive regression forecasts if different risk aversion coefficients were selected. The above findings suggest that the superior economic performance of SOP models with constraints is robust.

5.2. Transaction Cost

Moreover, we may have to pay a certain transaction cost in practice, instead of ignoring the transaction cost as in Section 4.2, that is, to make the transaction cost zero. Following prior literature (see e.g., Neely et al. [5]; Zhang et al. [14]), we calculated the certainty equivalent return (CER) gains taking the transaction cost of 50 basis points into account (Balduzzi and Lynch [23]).
Table 5 checks the robustness for considering transaction costs. As pointed out in Zhang et al. [14], the stock return forecasted by the interest model usually fluctuated more than that of the historical average, which results in relatively lower economic values. Compared with Table 4, it can be shown that 10 of the 14 predictors in the CT-SOP model, 10 of the 14 predictors in the MOP-SOP model, and 12 of the 14 predictors in three-sigma SOP model had bigger CER gains than that of the original SOP model. The CER gain of economic constraint SOP models was larger than that of the original SOP model for the principal component predictive regression forecasts, even if transaction fees were considered. In sum, the economic performance of SOP models with economic constraints is robust regardless of risk aversion factors or transaction costs.

6. Conclusions and Implication

This paper combines the SOP method and economic constraint method to predict stock return. Specifically, unlike the ordinary economic restriction model, we do not directly choose between the predictive value of stock return based on the model of interest and the historical average according to some strategy, but we proceed to its component selected following the strategy. For instance, it is enough to use the 20-year moving average to capture its direction for some component like g e because it has a low-frequency predictable part [34], while d p is suitable for being described with random walk. The purpose of combining CT restriction with the SOP method is to fully consider the impact of positive stock returns. To mitigate the impact of the extreme value of its components, we combine a new three-sigma rule with SOP. Using MOP to investigate the component g m , we can obtain that the properties of g m in the past will continue to the present.
Our out-of-sample analysis results show that, in general, the SOP model with restrictions can significantly improve the predictive ability, especially when our new three-sigma restriction is applied to the SOP model. Overall, the economic constraint SOP models can obtain better forecasting results than those of the original SOP model. Moreover, the economic performance of SOP models with economic constraints is robust regardless of risk aversion factors or transaction costs.
Finally, possible implications of our findings are the following: (i) the predictive power can be improved by using economic constraint SOP model; (ii) the principal component predictive regression can further enhance the out-of-sample prediction ability; and (iii) in the investments world, a mean-variance investor can choose an appropriate SOP model to obtain high gains from the market.

Author Contributions

Conceptualization, Z.D.; methodology, Z.D. and H.Z.; software, Z.D. and H.Z.; validation, Z.D.; formal analysis, Z.D.; investigation, Z.D.; resources, Z.D. and H.Z.; data curation, Z.D.; writing-original draft preparation, Z.D. and H.Z.; writing-review and editing Z.D. and H.Z.; visualization, Z.D. and H.Z.; supervision, Z.D. and H.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China, grants 71771030, 71671018, and 11301041; and the Scientific Research Fund of Hunan Provincial Education Department, grant number 19A007.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cochrane, J.H. Presidential Address: Discount Rates. J. Financ. 2011, 66, 1047–1108. [Google Scholar] [CrossRef] [Green Version]
  2. Welch, I.; Goyal, A. A Comprehensive Look at The Empirical Performance of Equity Premium Prediction. Rev. Financ. Stud. 2007, 21, 1455–1508. [Google Scholar] [CrossRef]
  3. Campbell, J.Y.; Yogo, M. Efficient tests of stock return predictability. J. Financ. Econ. 2006, 81, 27–60. [Google Scholar] [CrossRef] [Green Version]
  4. Vuolteenaho, T.; Campbell, J. Inflation Illusion and Stock Prices. Am. Econ. Rev. 2004, 94, 19–23. [Google Scholar]
  5. Neely, C.J.; Rapach, D.E.; Tu, J.; Zhou, G. Forecasting the Equity Risk Premium: The Role of Technical Indicators. Manag. Sci. 2014, 60, 1772–1791. [Google Scholar] [CrossRef] [Green Version]
  6. Lin, Q. Technical analysis and stock return predictability: An aligned approach. J. Financ. Mark. 2018, 38, 103–123. [Google Scholar] [CrossRef]
  7. Wang, Y.; Pan, Z.; Liu, L.; Wu, C. Oil price increases and the predictability of equity premium. J. Bank. Financ. 2019, 102, 43–58. [Google Scholar] [CrossRef]
  8. Campbell, J.; Thompson, S.P. Predicting Excess Stock Returns Out of Sample: Can Anything Beat the Historical Average? Rev. Financ. Stud. 2008, 21, 1509–1531. [Google Scholar] [CrossRef] [Green Version]
  9. Ferreira, M.A.; Santa-Clara, P. Forecasting stock market returns: The sum of the parts is more than the whole. J. Financ. Econ. 2011, 100, 514–537. [Google Scholar] [CrossRef] [Green Version]
  10. Rapach, D.E.; Strauss, J.K.; Zhou, G. Out-of-sample equity premium prediction: Combination forecasts and links to the real economy. Rev. Financ. Stud. 2010, 23, 821–862. [Google Scholar] [CrossRef]
  11. Zhu, X.; Zhu, J. Predicting stock returns: Regime-switching combination approach and economic links. J. Bank. Financ. 2013, 37, 4120–4133. [Google Scholar] [CrossRef]
  12. Pettenuzzo, D.; Timmermann, A.; Valkanov, R. Forecasting stock returns under economic constraints. J. Financ. Econ. 2014, 114, 517–553. [Google Scholar] [CrossRef] [Green Version]
  13. Wang, Y.; Liu, L.; Ma, F.; Diao, X. Momentum of return predictability. J. Empir. Financ. 2018, 45, 141–156. [Google Scholar] [CrossRef]
  14. Zhang, Y.; Wei, Y.; Ma, F.; Yi, Y. Economic constraints and stock return predictability: A new approach. Int. Rev. Financ. Anal. 2019, 63, 1–9. [Google Scholar] [CrossRef]
  15. Binsbergen, J.; Koijen, R. Predictive regressions: A present-value approach. J. Financ. 2010, 65, 1439–1471. [Google Scholar] [CrossRef] [Green Version]
  16. Paye, B.S. ‘Déjà vol’: Predictive regressions for aggregate stock market volatility using macroeconomic variables. J. Financ. Econ. 2012, 106, 527–546. [Google Scholar] [CrossRef] [Green Version]
  17. Rapach, D.E.; Ringgenberg, M.C.; Zhou, G. Short interest and aggregate stock returns. J. Financ. Econ. 2016, 121, 46–65. [Google Scholar] [CrossRef]
  18. Clark, T.E.; West, K.D. Approximately normal tests for equal predictive accuracy in nested models. J. Econ. 2007, 138, 291–311. [Google Scholar] [CrossRef] [Green Version]
  19. Rapach, D.; Zhou, G. Forecasting Stock Returns. In Handbook of Economic Forecasting; Elsevier: Amsterdam, The Netherlands, 2013; pp. 328–383. [Google Scholar]
  20. Connor, G. Sensible Return Forecasting for Portfolio Management. Financ. Anal. J. 1997, 53, 44–51. [Google Scholar] [CrossRef] [Green Version]
  21. Dai, Z.; Zhu, H. Forecasting stock market returns by combining sum-of-the-parts and ensemble empirical mode decomposition. Appl. Econ. 2019. [Google Scholar] [CrossRef]
  22. Dai, Z.; Zhu, H. Stock return predictability from a mixed model perspective. Pac. Basin Financ. J. 2020. forthcoming. [Google Scholar]
  23. Balduzzi, P. Transaction costs and predictability: Some utility cost calculations. J. Financ. Econ. 1999, 52, 47–78. [Google Scholar] [CrossRef] [Green Version]
  24. Claeskens, G.; Magnus, J.; Vasnev, A.; Wang, W. The forecast combination puzzle: A simple theoretical explanation. Int. J. Forecast. 2016, 32, 754–762. [Google Scholar] [CrossRef] [Green Version]
  25. Henkel, S.J.; Martin, J.S.; Nardari, F.; Martin, S. Time-varying short-horizon predictability. J. Financ. Econ. 2011, 99, 560–580. [Google Scholar] [CrossRef]
  26. Kelly, B.T.; Pruitt, S. Market Expectations in the Cross Section of Present Values. J. Financ. 2012, 68, 1721–1756. [Google Scholar] [CrossRef]
  27. Fan, Y.; Zhang, Z.; Zhao, X.; Yin, H. Interaction between Industrial Policy and Stock Price Volatility: Evidence from China’s Power Market Reform. Sustainability 2018, 10, 1719. [Google Scholar] [CrossRef] [Green Version]
  28. Narayan, P.K.; Ahmed, H.A.; Narayan, S. Can investors gain from investing in certain sectors? J. Int. Financ. Mark. Inst. Money 2017, 48, 160–177. [Google Scholar] [CrossRef]
  29. Dangl, T.; Halling, M. Predictive regressions with time-varying coefficients. J. Financ. Econ. 2012, 106, 157–181. [Google Scholar] [CrossRef]
  30. Huang, D.; Jiang, F.; Tu, J.; Zhou, G. Investor sentiment aligned: A powerful predictor of stock returns. Rev. Financ. Stud. 2015, 28, 791–837. [Google Scholar] [CrossRef]
  31. Lin, H.; Wu, C.; Zhou, G. Forecasting Corporate Bond Returns with a Large Set of Predictors: An Iterated Combination Approach. Manag. Sci. 2018, 64, 3971–4470. [Google Scholar] [CrossRef] [Green Version]
  32. Jiang, F.; Lee, J.; Martin, X.; Zhou, G. Manager sentiment and stock returns. J. Financ. Econ. 2019, 132, 126–149. [Google Scholar] [CrossRef]
  33. Dai, Z.; Zhu, H. A modified Hestenes-Stiefel-type derivative-free method for large-scale nonlinear monotone equations. Mathematics 2020, 8, 115. [Google Scholar]
  34. Dai, Z.; Zhu, H.; Wen, F. Two nonparametric approaches to mean absolute deviation portfolio selection model. J. Ind. Manag. Optim. 2019. [Google Scholar] [CrossRef]
  35. DeMiguel, V.; Garlappi, L.; Nogales, F.J.; Uppal, R. A Generalized Approach to Portfolio Optimization: Improving Performance by Constraining Portfolio Norms. Manag. Sci. 2009, 55, 798–812. [Google Scholar] [CrossRef] [Green Version]
  36. Faria, G.; Verona, F. Forecasting stock market returns by summing the frequency-decomposed parts. J. Empir. Financ. 2018, 45, 228–242. [Google Scholar] [CrossRef] [Green Version]
  37. Dai, Z.; Zhou, H.; Huang, S. Stock market volatility predictability: The role of implied volatility. Mathematics 2020, 8, 124. [Google Scholar]
  38. Nonejad, N. Déjàvol oil? Predicting S&P 500 equity premium using crude oil price volatility: Evidence from old and recent time-series data. Int. Rev. Financ. Anal. 2018, 58, 260–270. [Google Scholar]
  39. Yi, Y.; Ma, F.; Huang, D.; Zhang, Y. Internet rate level and stock return predictability. Rev. Financ. Econ. 2019, 37, 506–522. [Google Scholar] [CrossRef]
Table 1. Summary statistics.
Table 1. Summary statistics.
MeanStdMinMedianMaxSkewnessKurtosisAR (1)
Ret0.7695.375−34.5371.19735.560−0.45410.9050.078
gm0.0196.687−72.3820.13153.607−0.60124.2300.273
ge0.4374.320−52.8280.66070.3861.961104.0560.761
dp0.3130.1450.0900.2901.2721.2306.6180.982
BM0.5660.2670.1210.5422.0280.7684.3840.986
DY−3.3790.462−4.531−3.349−1.913−0.2052.5790.993
DP−3.3830.464−4.524−3.358−1.873−0.1782.6000.993
EP−2.7440.418−4.836−2.797−1.775−0.5655.5230.987
DE−0.6400.330−1.244−0.6321.3801.5409.0270.991
INFL0.2460.526−2.0550.2425.8821.17017.4250.489
TMS1.7271.297−3.6501.7704.550−0.2953.2170.960
TBL3.3923.1010.0102.91016.3001.0854.2730.993
DFY1.1250.6930.3200.9005.6402.47811.8250.975
NTIS0.0160.025−0.0580.0170.1771.66911.6140.980
LTR0.4692.457−11.2400.30015.2300.5777.4840.044
LTY5.1192.7931.7504.22014.8201.0843.5920.996
DFR0.0341.373−9.7500.0507.370−0.37510.499−0.117
SVAR0.2880.5750.0070.1267.0955.77946.4080.632
This table reports the summary statistics for the stock return on the SP500 index and the related predictors used in the study. Ret is the market return. gm is the growth in the price-earnings ratio. g e is the growth in earnings. d p is the dividend-price ratio. The remaining are defined as follows: stock return variance (SVAR), default return spread (DFR), long-term bond yield (LTY), long-term bond return (LTR), inflation rate (INFL), term spread (TMS), treasury bill rate (TBL), default yield spread (DFY), net equity expansion (NTIS), log dividend payout ratio (DE), earnings-price ratio (EP), the log dividend-price ratio (DP), log dividend yield (DY), and log book-to-market ratio (BM). AR (1) refers to the first-order autocorrelation. The sample period ranges from December 1927 to December 2018.
Table 2. Out-of-sample forecasting performance.
Table 2. Out-of-sample forecasting performance.
SOPCT-SOPMOP-SOPThree-sigma SOP
BM0.2490.643 **0.571 ***0.661 ***
DY0.39 *0.971 ***0.2340.989 ***
DP−0.1570.2700.532 **0.287
EP−0.4400.41 *0.1290.586 **
DE0.706 **1.358 ***0.3801.535 ***
INFL0.2510.491 **0.0940.615 **
TMS0.797 ***0.91 ***0.804 ***0.93 ***
TBL0.515 **0.84 ***0.501 **0.861 ***
DFY0.57 **0.886 **0.493 **0.962 ***
NTIS0.397 *0.559 **0.38 *0.514 **
LTR1.137 **0.878 **1.015 **0.581 *
LTY0.417 *0.769 ***0.459 **0.790
DFR0.7870.715 **0.6120.877 ***
SVAR1.423 **0.816 ***1.456 **0.808 ***
PC1.406 *1.772 ***1.569 **1.807 ***
This table shows the out-of-sample R-square ( R O S 2 , in percentage) based on four different kinds of SOP models. SOP refers to the original SOP model with multiple growths and shrinkages, while CT-SOP, MOP-SOP, and three-sigma SOP denote the SOP models with Campbell and Thompson [8] economic constraint, momentum of predictability (MOP) strategy, and our new three-sigma constraint approach, respectively. PC means principal component predictive regression forecasts. The return forecasts are generated from a recursive window. In the MOP-SOP model, the look-back period is three. We measured statistical significance using the Clark and West [18] test statistic. ***, **, and * indicate significance at the 1%, 5%, and 10% levels, respectively. The out-of-sample period is December 1947 to December 2018.
Table 3. Portfolio Performance of CER.
Table 3. Portfolio Performance of CER.
SOPCT-SOPMOP-SOPThree-Sigma SOP
BM0.1830.4531.0100.485
DY1.6711.9951.3862.024
DP−0.874−0.4230.8440.393
EP−1.746−1.029−0.407−0.585
DE1.2601.8240.7652.248
INFL0.8130.6890.3601.006
TMS1.4941.5331.5901.576
TBL1.4721.6201.3091.660
DFY0.9061.3640.9181.523
NTIS−1.842−1.629−1.078−1.119
LTR0.6590.0830.4320.886
LTY1.3181.5021.2431.538
DFR2.1771.2321.9281.624
SVAR2.1441.6932.1781.694
PC2.1822.2992.3652.358
This table reports the out-of-sample economic performance of four kinds of SOP models evaluated by certainty equivalent return (CER) gains. The CER gains are calculated as the difference between the CER from the model of interest and that from benchmark forecasts. We multiply CER gains by 1200 to get the annualized values. PC denotes the CER gains based on principal component predictive regression forecasts. SOP refers to the original SOP model with multiple growths and shrinkages, while CT-SOP, MOP-SOP, and three-sigma SOP denote the SOP models with Campbell and Thompson [8] economic constraint, momentum of predictability (MOP) strategy, and our new three-sigma constraint approach, respectively. Risk aversion coefficient is set to 5, and the weight of stock is restricted to values between −0.5 and 1.5. The evaluation period spans from December 1947 to December 2018.
Table 4. Investor risk aversion choices.
Table 4. Investor risk aversion choices.
γ = 3 γ = 4
SOPCT-SOPMOP-SOPThree-Sigma SOPSOPCT-SOPMOP-SOPThree-Sigma SOP
BM0.3000.7521.6830.8050.2270.5661.2620.605
DY2.7903.3232.3053.3702.0912.4931.7312.529
DP−1.460−0.7101.4080.660−1.094−0.5301.0550.493
EP−2.920−1.727−0.675−0.990−2.186−1.291−0.508−0.737
DE2.0933.0281.2733.7331.5722.2750.9552.805
INFL1.3551.1490.6031.6741.0160.8620.4511.256
TMS2.4932.5532.6532.6241.8691.9151.9891.969
TBL2.4462.6952.1802.7621.8372.0231.6362.073
DFY1.5142.2711.5312.5341.1341.7041.1481.902
NTIS−3.068−2.717−1.792−2.866−2.302−2.037−1.346−2.149
LTR1.1140.1490.7271.4600.8300.1080.5431.101
LTY2.1912.4992.0692.5571.6451.8761.5531.920
DFR3.6292.0523.2132.7032.7221.5402.4102.029
SVAR3.5662.8163.6232.8192.6772.1142.7202.116
PC2.3673.1593.5743.2582.0272.62112.7062.6954
This table reports the out-of-sample certainty equivalent return (CER) gains for alternative risk aversion coefficients including 3 and 4. The CER gains are calculated as the difference between the CER from the model of interest and that from benchmark forecasts. We multiply CER gains by 1200 to get the annualized values. PC denotes the CER gains based on principal component predictive regression forecasts. SOP refers to the original SOP model with multiple growths and shrinkages, while CT-SOP, MOP-SOP, and three-sigma SOP denote the SOP models with Campbell and Thompson [8] economic constraint, momentum of predictability (MOP) strategy, and our new three-sigma constraint approach, respectively. The weight of stock is restricted to values between −0.5 and 1.5. The out-of-sample period spans from December 1947 to December 2018.
Table 5. Portfolio results with transaction cost.
Table 5. Portfolio results with transaction cost.
SOPCT-SOPMOP-SOPThree-Sigma SOP
BM0.1630.4201.0030.451
DY1.7411.9971.4152.025
DP−0.914−0.4730.8310.443
EP−1.783−1.090−0.420−0.644
DE1.2811.8150.7742.242
INFL0.8170.6710.3610.989
TMS1.4921.5191.5901.562
TBL1.4841.6141.3161.655
DFY0.9111.3560.9181.516
NTIS−1.883−1.678−1.102−1.768
LTR0.6570.0470.4240.929
LTY1.3241.4931.2481.528
DFR2.1851.2151.9371.610
SVAR2.1431.6792.1831.681
PC2.1232.2872.3682.346
This table reports the certainty equivalent return (CER) gains when supposing a proportional transaction cost of 50 basis points per transaction. The CER gains are calculated as the difference between the CER from the model of interest and that from benchmark forecasts. We multiply CER gains by 1200 to get the annualized values. PC denotes the CER gains based on principal component predictive regression forecasts. SOP refers to the original SOP model with multiple growths and shrinkages, while CT-SOP, MOP-SOP, and three-sigma SOP denote the SOP models with Campbell and Thompson [8] economic constraint, momentum of predictability (MOP) strategy, and our new three-sigma constraint SOP approach, respectively. Risk aversion coefficient is set to 5, and the weight of stock is restricted to values between −0.5 and 1.5. The out-of-sample period spans from December 1947 to December 2018.

Share and Cite

MDPI and ACS Style

Dai, Z.; Zhou, H. Prediction of Stock Returns: Sum-of-the-Parts Method and Economic Constraint Method. Sustainability 2020, 12, 541. https://doi.org/10.3390/su12020541

AMA Style

Dai Z, Zhou H. Prediction of Stock Returns: Sum-of-the-Parts Method and Economic Constraint Method. Sustainability. 2020; 12(2):541. https://doi.org/10.3390/su12020541

Chicago/Turabian Style

Dai, Zhifeng, and Huiting Zhou. 2020. "Prediction of Stock Returns: Sum-of-the-Parts Method and Economic Constraint Method" Sustainability 12, no. 2: 541. https://doi.org/10.3390/su12020541

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop