Next Article in Journal
Is VIX a Contrarian Indicator? On the Positivity of the Conditional Sharpe Ratio
Previous Article in Journal
A Meta-Analysis of Determinants of Success and Failure of Economic Sanctions
Previous Article in Special Issue
Investigating Some Issues Relating to Regime Matching
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Forecasting Asset Returns Using Nelson–Siegel Factors Estimated from the US Yield Curve

Department of Finance, Bocconi University, 20136 Milan, Italy
*
Author to whom correspondence should be addressed.
Econometrics 2025, 13(2), 17; https://doi.org/10.3390/econometrics13020017
Submission received: 1 February 2025 / Revised: 30 March 2025 / Accepted: 8 April 2025 / Published: 11 April 2025
(This article belongs to the Special Issue Advancements in Macroeconometric Modeling and Time Series Analysis)

Abstract

:
This paper explores the hypothesis that the returns of asset classes can be predicted using common, systematic risk factors represented by the level, slope, and curvature of the US interest rate term structure. These are extracted using the Nelson–Siegel model, which effectively captures the three dimensions of the yield curve. To forecast the factors, we applied autoregressive (AR) and vector autoregressive (VAR) models. Using their forecasts, we predict the returns of government and corporate bonds, equities, REITs, and commodity futures. Our predictions were compared against two benchmarks: the historical mean, and an AR(1) model based on past returns. We employed the Diebold–Mariano test and the Model Confidence Set procedure to assess the comparative forecast accuracy. We found that Nelson–Siegel factors had significant predictive power for one-month-ahead returns of bonds, equities, and REITs, but not for commodity futures. However, for 6-month and 12-month-ahead forecasts, neither the AR(1) nor VAR(1) models based on Nelson–Siegel factors outperformed the benchmarks. These results suggest that the Nelson–Siegel factors affect the aggregate stochastic discount factor for pricing all assets traded in the US economy.

1. Introduction

Modeling and forecasting the term structure of interest rates is crucial for policymakers and market participants, as the yield curve reflects market expectations about future monetary policy, output growth, and inflation across a range of time horizons. Additionally, used as a risk-free benchmark, the AA(A)-rated government bond yield curve often represents the floor for borrowing costs in the economy. This paper aims to extend the literature by examining the informational content of the US yield curve beyond the US government bond market. Specifically, we test the out-of-sample explanatory power of the yield curve’s shape—captured through the classical Nelson and Siegel (1987) level, slope, and curvature factors—in forecasting the returns of a range of financial assets besides bonds that, and riskless and not, encompass the equity, real estate, and commodity domains. We accomplish this task by deploying the insights derived from both simple, pairwise Diebold–Mariano (one-sided) tests of equal predictive accuracy and from more systematic model confidence set tests.
There are two primary approaches to fitting the default risk-free yield curve: nonparametric spline-based methods and parametric models. Spline-based approaches excel at providing an accurate in-sample fit, due to their flexibility; however, parsimonious parametric models typically offer superior forecasting performance, achieving an effective balance between model simplicity and predictive accuracy (Nymand-Andersen, 2018). Given that the primary focus of this paper is to test the out-of-sample predictive power of the yield curve’s information content, rather than merely fitting the curve in-sample, we adopt a parsimonious parametric approach. Specifically, we employ the Nelson–Siegel (NS) model, which offers simplicity and interpretability by requiring the estimation of only three (or four, see below for a discussion) parameters to characterize the entire yield curve. According to the specification in Diebold and Li (2006), these parameters can be interpreted as capturing the level, slope, and curvature of the yield curve.
Using data on the US yield curve from January 1990 to December 2020, we assess the forecasting performance of several models during the pseudo out-of-sample period January 2011–December 2020. Our analysis focuses on AR(1) and VAR(1) models applied to forecast the NS model factors at a certain horizon h. Such predicted factors are then used to predict asset returns at the same horizon h. To forecast asset returns, we implement a two-step approach. The predictive accuracy of these models is evaluated against benchmarks, including a historical average and an AR(1) model applied to asset returns. Additionally, we investigate whether the VAR(1) model applied to the NS outputs, and which accounts for interactions among the NS factors, can provide enhanced forecasting performance.
Contrary to Diebold and Li (2006) and the literature on forecasting yields using NS factors that has followed (see, e.g., Christensen et al. (2011); Favero et al. (2012); Fernandes and Vieira (2019); Guidolin and Pedio (2019); Toczydlowska and Peters (2018); Xiang and Zhu (2013); Yu and Zivot (2011) and the references in Diebold and Rudebusch (2013); Duffee (2013)) on more recent data, we find that NS factors do not always consistently outperform a set of benchmarks in the prediction of interest rates, even though the (dynamic) NS approach remains a competitive one. We then investigate the ability of these factors to explain returns across various asset classes, such as equities, corporate bonds, commodities, and REITs. This exploration is motivated by the potential economic significance of the Nelson–Siegel factors, which reflect bond investors’ expectations regarding the economic outlook and may reflect information about macroeconomic conditions. Given their ability to capture such crucial economic insights, these factors may also serve as systematic drivers of the stochastic discount factor (henceforth, SDF) that prices all financial assets.1 Because of these properties, the NS factors may themselves represent predictors of the SDF, particularly in the context of time-varying risk premia influenced by macroeconomic shifts and investor behavior throughout business cycles (Drobetz et al., 2002). In other words, while a naive null hypothesis would imply that the dynamic factors extracted from the yields (i.e., ex ante returns over a holding period) in one market should not help forecast yields or ex post, realized returns in other asset markets, the integration of NS factors into asset pricing models may help reconcile observed return patterns with macroeconomic fundamentals, refining our understanding of the stochastic discount factor and creating cross-asset common predictability linkages that we explore and document in this paper.
Our key empirical result suggests that the Nelson–Siegel factors exhibit a predictive power extending beyond the government bond market, though their performance varies significantly across forecast horizons and asset classes. Notably, these factors outperformed standard benchmarks in forecasting equity and REIT returns, while their predictive ability for commodity futures was more limited. As expected, direct regression on the Nelson–Siegel factors proved effective for forecasting Treasury securities returns. However, the comparison between VAR(1) and AR(1) models for the Nelson–Siegel factors yielded mixed results, with their relative performance depending on the specific forecast horizon and asset class considered.
Although we are not aware of any similar paper in the literature, in some respects our empirical goal is related to Hillebrand et al. (2018), who interpreted the yield curve’s NS factors as special, economically motivated substitutes of standard principal components that efficiently combine the information contained in the raw data, the yield curve. However, the standard NS factors were not “supervised” for a specific forecast target, in that they were constructed using only the predictors and not using a particular forecast target. Therefore, they proposed computing NS factors not of the original data, but of the many alternative forecasts that a range of models may offer, with each of the many forecasts being computed using one predictor at a time. These combined forecasts are shown to outperform classical benchmarks in the out-of-sample (henceforth, OOS) forecasting of monthly US output growth and inflation, especially at longer forecast horizons.
The rest of this paper is organized as follows: Section 2 describes the methodology, Section 3 outlines the data sources, Section 4 presents the key empirical results, and Section 5 concludes.

2. Methodology

This section presents the methodological framework informing the analysis. Section 2.1 introduces the parsimonious parametric model used to fit the US yield curve. Section 2.2 describes the models employed to forecast the yield curve. In Section 2.3, the focus shifts to the models used to predict the financial returns of non-government bond assets. Lastly, Section 2.4 briefly overviews the two statistical tests applied to evaluate the differential forecasting accuracy in this paper.

2.1. Fitting the Nelson–Siegel Model to the US Yield Curve

Our analysis begins by fitting a standard NS model—a parsimonious parametric framework—to the U.S. yield curve. The goal is to extract the level, slope, and curvature factors that capture the key dynamics of the term structure. Using data spanning a January 1990–December 2020 sample, these factors were computed to test whether they contain useful information for forecasting the returns of other assets, hence establishing a potential connection to the SDF(s). The functional form of the NS model for the yield y at time t, with maturity m is
y t ( m ) = β 0 , t + β 1 , t 1 e λ m λ m + β 2 , t 1 e λ m λ m e λ m
To mitigate multicollinearity and facilitate an intuitive interpretation of the factors, we adopt the following specification (Diebold & Li, 2006):
y t ( m ) = β 0 , t + β 1 , t 1 e λ m λ m + β 2 , t e λ m
The two specifications are identical for λ = 0 , when β 0 , t = β 1 , t and β 2 , t = 0 at a given time t. The three potentially time-varying coefficients β 0 , t , β 1 , t , and β 2 , t determine the behavior of the yield curve over the long, short, and medium terms, respectively. In the presence of time variation, if we interpret β 0 , t , β 1 , t , and β 2 , t as factors, it is easy to see that the loading on β 0 , t is constant and equal to one, not decaying to zero with increasing maturity, which makes β 0 , t the factor governing the curve level in the long term. The loading on β 1 , t equals one at zero maturity and decays exponentially as the maturity increases, making β 1 , t the factor associated with short-term dynamics. In contrast, the loading on β 2 , t is zero at a zero maturity, increases initially, and then decays to zero as the maturity grows, making β 2 , t the factor capturing the medium-term behavior of the yield curve.
Interestingly, the three factors β 0 , t , β 1 , t , and β 2 , t can also be interpreted as linear transformations of the classical, latent level, slope, and curvature features of the riskless bond yield curve. Specifically, if we define the level of the yield curve as y ( 120 ) , the slope as y ( 120 ) y ( 3 ) , and the curvature as 2 y ( 24 ) y ( 120 ) + y ( 3 ) , we observe a close correspondence with β 0 , t , β 1 , t , and β 2 , t and β 2 , as illustrated in Figure 1, Figure 2 and Figure 3.
The estimated correlations between the level, slope, and curvature extracted from the observed yield curves and the fitted factors are nearly perfect, with values of 0.973, 0.995, and 0.998 for β 0 , β 1 , and β 2 , respectively. The sample time-series cross-factor correlations are estimated as 0.229 , 0.266 , and 0.620 for β 0 , β 1 , and β 2 , respectively.
The coefficient λ governs the speed of the exponential decay of the loadings of the factors. While this may be estimated along with the factors β 0 , t , β 1 , t , and β 2 , t at any time, we fix it at λ = 0.0609 , as suggested in Diebold and Li (2006). This value maximizes the loading on the medium-term, or curvature, factor β 2 , t at the 30-month maturity.2
At each point in time t, using monthly data, we recursively estimate the NS model in (2) and extract the factors best fitting the cross-section of U.S. Treasury constant maturity rates. These rates, which are available for maturities of 3 months; 6 months; and 1-, 2-, 3-, 5-, 7-, and 10 years, provide a reliable representation of the overall yield curve. By employing a recursive estimation procedure, we thus obtain a 372 × 3 estimated matrix of factors.

2.2. Yield Curve Forecasting

The out-of-sample forecast period used in this paper spans January 2011–December 2020. An expanding window approach is employed, whereby the model estimation at each time t benefits from an additional data point relative to the previous estimation. The forecasting performance of the models is assessed at horizons of 1-, 6-, and 12-month steps. This is done with reference to maturities of 3 months, 1 year, 5 years, and 10 years. Forecasts for the 6 and 12 steps ahead are computed by iterating on the dynamic conditional mean models for the NS factors specified below. Specifically, for each period from t to t + h 1 , the model uses the predicted values as inputs for computing the time t + h forecast, rather than the actual observed values.
The factor prediction models are listed below.

2.2.1. AR(1) Model

We start with an AR(1) process applied to each of the time series of the three NS factors β ^ 1 , t , β ^ 2 , t , and β ^ 3 , t recursively estimated previously for a given maturity and horizon:
y ^ ( t + h | t ) ( m ) = β ^ 1 , t + h | t + β ^ 2 , t + h | t 1 e λ m λ m + β ^ 3 , t + h | t 1 e λ m λ m e λ m
where the model parameters β ^ i , t + h / t are predicted from
β ^ i , t + h | t = α ˜ i , t , h + γ ˜ i , t , h β ^ i , t for i = 1 , 2 , 3
and α ˜ i , t , h , γ ˜ i , t , h are estimated by OLS using data up to time t on the factors obtained in the first step.3
Equation (3) is derived directly from Equation (2), maintaining the refinement proposed by Diebold and Li (2006) to improve estimation stability and interpretability. The key distinction lies in the factor representation, whereby the medium-term component β 2 , t is defined with an alternative decay structure that ensures a better empirical fit without affecting the fundamental properties of the model. The factor β 2 , t , often referred to as the “curvature factor”, exhibits an initial increase before decaying to zero as the maturity grows. While its decay property differs from a strict second-derivative curvature measure, it retains its role in capturing medium-term fluctuations in the yield curve.

2.2.2. Joint VAR(1) Model

We consider a VAR(1) process applied to each of the three NS model factors β 1 , t , β 2 , t , and β 3 , t as estimated previously:
β ^ t + h | t = α ˜ t , h + Γ ˜ t , h β ^ t for i = 1 , 2 , 3
where α ˜ t , h and Γ ˜ t , h are estimated by OLS using data up to time t on the vector of factors estimated in the earlier stage.

2.2.3. Random Walk Model for Yields

We also use as a benchmark a simple random walk model for yields:
y ^ ( t + h | t ) ( m ) = y t ( m )

2.2.4. AR(1) Model for Yields

As a parametric generalization of (6), we study an AR(1) process for yields:
y ^ ( t + h | t ) ( m ) = μ ˜ t , h , m + κ ˜ t , h , m y t ( m )
where μ ˜ t , h , m and κ ˜ t , h , m are estimated by OLS using data up to time t.

2.2.5. Slope Regression Model

Following, for instance, Steeley (2014) and Chen and Niu (2014), we estimate a slope regression model specified as follows:
y ^ ( t + h / t ) ( m ) y t ( m ) = α ˜ ( m ) + γ ˜ ( m ) y t ( m ) y t ( 3 )
where the forecast is generated using a regression of yield changes over the prediction horizon h on a constant and the difference between the yield at maturity m and the 3-month yield. This approach captures the slope of the yield curve across different maturities. However, it is important to note that this model does not allow the prediction of the 3-month yield.

2.2.6. VAR(1) Model for Yields

We extend the AR(1) model by incorporating a VAR(1) process applied to all available yield maturities:
y ^ ( t + h | t ) = μ ˜ t , h + Γ ˜ t , h y t
where the estimation of the matrices of coefficients is straightforward but horizon h-specific.4

2.2.7. VAR(1) Model for Yield Changes

Because the unit root nature of US riskless yields has been debated in the traditional finance literature (see, e.g., Pesando (1979)), we also analyze a VAR(1) model for yield changes:
Δ y ^ t + h | t = a ˜ t , h + A ˜ t , h Δ y ^ t ,
which removes by differentiation any stochastic trends but does not test or exploit the existence of cointegration among interest rates.5

2.2.8. AR(1) Model for Principal Components

In addition, following Diebold and Li (2006), we decompose the time t sample covariance matrix C of the constant maturity rates into C = Q W Q using principal component analysis. As is well known, the eigenvalues and eigenvectors of C are represented by the diagonal matrix W and columns of Q, respectively. We consider the three largest eigenvalues w 1 , w 2 , w 3 and their corresponding eigenvectors q 1 , q 2 , q 3 . Each of the eigenvectors contains a number of elements equal to the number of the available maturities, indexed by m. We then employ univariate AR(1) models to separately forecast each of the principal components h steps ahead:
x ^ i , t + h | t = η ˜ i , t + ζ ˜ i x i , t for i = 1 , 2 , 3 and h = 1 , 6 , 12
where the principal components x i , t are defined as
x i , t = q i y t
and η ˜ i , t and ζ ˜ i are recursively estimated by OLS. The predicted yields h-steps ahead are then obtained from
y ^ ( t + h t ) ( m ) = q 1 ( m ) x ˜ 1 , t + h | t + q 2 ( m ) x ˜ 2 , t + h | t + q 3 ( m ) x ˜ 3 , t + h | t
where q i ( m ) is the element of the eigenvector q i corresponding to maturity m.

2.3. Returns Forecasting

In this section, we introduce simple methods to evaluate the predictive power of the NS factors β 1 , t , β 2 , t , and β 3 , t for returns across various asset classes. In contrast to the riskless bond yield forecasting framework outlined in Section 2.2, this analysis requires us to use a distinct set of models. For corporate and Treasury bonds, equity, REIT, and commodity returns, we evaluate the forecasting performance of the three predictive models. First, we forecast returns using direct, h-step-ahead regression on the NS model factors estimated earlier:6
r ^ i , t + h | t = α ˜ i , t + γ ˜ 1 , i , t β ˜ 1 , t + h | t + γ ˜ 2 , i , t β ˜ 2 , t + h | t + γ ˜ 3 , i , t β ˜ 3 , t + h | t
In this context, the coefficients α ˜ i , t , γ ˜ 1 , i , t , γ ˜ 2 , i , t , and γ ˜ 3 , i , t are estimated through recursive OLS. The forecasts for the factors β ˜ 1 , t + h | t , β ˜ 2 , t + h | t , and β ˜ 3 , t + h | t , which are used as inputs for return predictions, are generated using either an AR(1) or VAR(1) process, as outlined in Section 2.2. The regression model in Equation (14) represents our workhorse model and provides direct information on whether, how, and for which assets/portfolios the NS factors may be structural features of the SDF(s), to the point of allowing us to obtain accurate predictions of the returns of other assets.
As additional benchmarks that fail to depend on the NS fixed income factors, we use a simple AR(1) model applied directly to returns:
r ^ i , t + h | t = α ˜ i , t + γ ˜ i , t r i , t
where the coefficients are estimated according to the usual logic. A third and final benchmark is obtained by imposing the restriction γ i , t = 0 for all portfolios and times, which features a classical simple historical average (HA) model, as featured in recent work on the predictability of asset returns (see, e.g., Rapach et al. (2010), Guidolin et al. (2013)), and the review in Rapach and Zhou (2013)).7

2.4. Measuring and Testing (Equal) Predictive Accuracy

To evaluate the predictive accuracy of alternative models, we apply two statistical tests: Diebold–Mariano (henceforth, DM) tests of pairwise equal predictive accuracy (Diebold & Mariano, 2002) and the Model Confidence Set (henceforth, MCS) procedure to perform tests and a multiple-model assessment (Hansen et al., 2011). Detailed formulas of these tests/algorithms are reported in Appendix A for completeness.
The DM test requires the computation of forecast errors for each model and an assumption on the functional form of the loss measuring the loss of utility to investors derived from the forecast errors. For yields with maturity m or asset returns indexed by i, the forecast error at time t and horizon h is defined as the difference between the forecast and the observed value. The loss functions are then assumed to be of the squared type. Although rather different loss functions might be assumed, the classical squared loss makes sense within the empirical finance literature because of the well-known connections between squared loss and measures of risk-adjusted performance, such as the Sharpe ratio (see Campbell and Thompson (2008)). At this point, the loss differentials between competing models considered in pairs are tested for statistical significance, see Appendix A for details.
The MCS procedure enables the comparison of all competing models within a multivariate framework. Starting with an initial set of models M, the procedure identifies a subset M M that, at a predetermined confidence level, contains the best forecasting models, in the sense that these are not dominated by any other model in the set outside M ) . Dominance is conceptually identified with rejection of the null of equal predictive accuracy (EPA) in favor of a model, in a pairwise comparison. Depending on the relative performance of the models, M may consist of a single model outperforming all others or include a rich set of models, in principle even all of M, when no dominance relationships can be established. Depending on the relative performance of the different models, the procedure sequentially eliminates models from the superior set until the EPA hypothesis is no longer rejected. Appendix A provides details.

2.5. Combination Forecasts With and Without NS Factors

To further evaluate the predictive power of the NS factors, we conduct a robustness analysis by implementing combination forecasts based on individual predictive regression models. Following Rapach et al. (2010), we construct forecasts using a broad set of macro-financial predictors originally introduced by Welch and Goyal (2008). The predictive regression model follows a standard formulation. Our forecasts are generated using an expanding window estimation approach with a six-month horizon forecast.
The predictive variables included in our analysis consist of valuation ratios, market risk indicators, interest rate measures, and credit risk variables. Specifically, we use the following GW predictors:8
  • Dividend–Price Ratio (log), D/P: The logarithmic difference between total dividends paid on the S&P 500 index and the index’s price level, where dividends are aggregated over a rolling one-year period.
  • Dividend Yield (log), D/Y: The log of dividends minus the log of lagged S&P 500 prices.
  • Earnings–Price Ratio (log), E/P: The difference between the logarithm of S&P 500 earnings (measured over a trailing 12-month period) and the log of stock prices.
  • Dividend–Earnings Ratio (log), D/E: Logarithmic gap between dividend payments and earnings on the S&P 500 index.
  • Stock Variance, SVAR: The cumulative sum of squared daily returns on the S&P 500 index over a given month.
  • Book-to-Market Ratio, B/M: The ratio between book value and market capitalization of the Dow Jones Industrial Average firms.
  • Net Equity Issuance, NTIS: The ratio of 12-month cumulative net stock issuance by NYSE-listed firms to their total year-end market value.
  • Treasury Bill Rate, TBL: Yield on three-month U.S. Treasury bills traded in the secondary market.
  • Long-Term Yield, LTY: Yield on long-term government bonds.
  • Long-Term Return, LTR: Realized return from investing in long-term government bonds.
  • Term Spread, TMS: The difference between long-term government bond yields and short-term Treasury bill rates.
  • Default Yield Spread, DFY: The yield spread between BAA-rated and AAA-rated corporate bonds.
  • Default Return Spread, DFR: The difference in returns between long-term corporate and long-term government bonds.
  • Inflation, INFL: Computed using the Consumer Price Index (CPI) for urban consumers. Following Welch and Goyal (2008), inflation is lagged by one month to account for release timing and used as x i , t 1 in the regressions.
To assess the impact of the NS factors, we compare two models: one that includes the NS factors in the combination forecasts and one that excludes them. This approach allows us to isolate the contribution of the NS factors to the predictive accuracy and determine whether they improve the forecasting performance relative to a benchmark set of macro-financial predictors. The out-of-sample period for the analysis spans from January 2011 to December 2020, ensuring consistency with the evaluation of Diebold–Mariano test statistics and Model Confidence Set results.

3. Data Description

For all the assets and portfolios listed below, our sample spans the period January 1990–December 2020, comprising daily US yield curve rates sourced from the US Treasury Department; monthly returns on Treasury bills, notes, and bonds across various maturities; US equity returns spanning 113 distinct portfolios; sector-specific returns for REITs; corporate bond returns from investment-grade issuers in the US; and realized, chained return series for a diverse range of commodity futures.

3.1. US Treasury Yields

We used end-of-month data on the yield curve published by the US Treasury, commonly referred to as Constant Maturity Treasury (CMT) rates. These rates were derived through the interpolation of closing market bids from actively traded on-the-run US Treasury securities in the over-the-counter market. The interpolation employs a cubic spline model with fixed maturities at 1, 2, 3, and 6 months, as well as 1, 2, 3, 5, 7, 10, 20, and 30 years.9 The sample period January 1990–December 2020 was used to construct the monthly yield curve. The analysis was limited to maturities for which the full sample was available, specifically the 3-month, 6-month, and 1-, 2-, 3-, 5-, 7-, and 10-year maturities.10 The maturities used to estimate the NS factors covered a broader range than those utilized for yield curve forecasting. Forecasts were generated for the 3-month, 1-year, 5-year, and 10-year maturities, while the NS factors were estimated using yields from eight distinct maturities.

3.2. US Treasury Returns

We computed monthly returns based on the constant maturity rates published by the US Treasury discussed in the previous section. These returns were derived from holding 3-month, 6-month, and 1-, 2-, 3-, 5-, 7-, and 10-year T-bills and T-notes. The calculations were performed under two different sets of assumptions, depending on the maturity of the securities analyzed.
For coupon Treasury notes with maturities of one year or longer, we assumed the coupon bond was purchased at par at the end of the previous month, with a flat yield curve corresponding to the specified maturity. Monthly returns were then calculated based on changes in the bond’s valuation. Coupons are paid semiannually, on the last day of the month, beginning six months after the purchase date. The total return from holding the security for one month, comprising both the coupon income return and capital appreciation, was calculated as
P ( t 2 , y 2 ( m 1 ) , m 1 ) P ( t 1 , y 1 ( m ) , m ) 1
where P ( t , y ( m ) , m ) is the price of the bond with maturity m (expressed in years of fractions of years) and yield y ( m ) at time t, computed as
P ( t , y ( m ) , m ) = 100 ( 1 + y t ( m ) / 2 ) 2 N 1 + D / S ( 1 + y t ( m ) / 2 ) 1 D / S 1 ( 1 + y t ( m ) / 2 ) 2 N
where D / S indicates the fraction of time remaining until the next coupon payment in terms of coupon frequency.
For maturities shorter than a year, the approximate total return consists only of capital appreciation. In this case, the T-bill does not pay coupons, and at the beginning of each month, a T-bill is purchased at the prior month’s end price. Monthly, realized returns reflect the change in pricing of the bill under the assumption that the yield curve is flat for the maturity considered, which leads to a simple discounting formula for the residual life of the bill.

3.3. US Equity Returns

We studied the predictability of returns on 113 different US-based equity portfolios. These portfolios, made available through Kenneth French’s website,11 included the market portfolio, 17 industry portfolios, and various other portfolios obtained by sorting the CRSP stock universe (which currently includes all NYSE, NASDAQ, and AMEX-listed shares of stock) by a range of measurable stock characteristics: market capitalization (size) and book-to-market (value); bivariate combinations of size with other characteristics, including investments, operating profitability, cashflows, the dividend yield, the earnings-price ratio, and price momentum.

3.4. US REIT Returns

We also analyzed indices for closed-end fund-style returns derived from investments in real estate equity and debt instruments. REITs are companies that own, operate, or finance income-generating real estate across various sectors and regions, allowing investors to gain indirect exposure to the real estate market. These data were sourced from NAREIT (National Association of Real Estate Investment Trusts) and consist of the FTSE NAREIT U.S. Real Estate Index Series. Our analysis focused on monthly total returns, including those for all REITs combined (equity and mortgage), all equity REITs, all mortgage REITs, and six sector-specific REIT indices (office, industrial, retail, lodging and resorts, and self-storage). The total return series represent the sum of income and capital appreciation returns, calculated before taxes and commissions.

3.5. US Corporate Bond Returns

We assessed the predictability of four ICE Bank of America (BofA) US corporate bond series. The series correspond to portfolios of securities carrying investment-grade ratings of AAA, AA, A, and BBB as of the end of each month in the sample.12 The data, sourced from the St. Louis FED, track total returns for US dollar-denominated investment-grade corporate bonds publicly issued in the US domestic market.

3.6. Commodity Returns

Lastly, we examined the predictability of commodity returns across 17 series, encompassing precious metals, industrial metals, agricultural products, and livestock. The commodities were gold, silver, platinum, copper, Brent crude, gasoil, light crude oil, natural gas, cotton, cocoa, coffee, corn, lumber, soybean oil, soybeans, wheat, and live cattle. The monthly data, all sourced from Refinitiv, represent continuous futures contracts, with monthly returns calculated based on changes in the series levels.

3.7. Summary Statistics

The six panels of Table A1 (collected in Appendix B, to save space, labeled A through F) provide information on the US yield curve rates and the five sets of return series outlined in Section 3.1, Section 3.2, Section 3.3, Section 3.4, Section 3.5 and Section 3.6.
Most return series exhibited left skewness, with the exception of returns on commodity futures and on the US Treasury securities. This negative skewness can be attributed to the various market crashes that occurred in our sample. These sample skewness values emphasized the occurrence of extreme negative price events in our sample and need to be accounted for in what follows.
Table 1 presents summary statistics for the first-stage, estimated NS factors. The 1-, 12-, and 30-month sample autocorrelations ( ρ ^ k for k = 1, 12, 30) reported in the table indicate that the level factor, β ^ 0 , is the most persistent, as it seems to befit the level of a stationary time series. Its sample mean indeed agrees with a positive annualized risk-free rate of approximately 4.9 percent. In addition, unsurprisingly, the slope factor was negative on average (−2.21 percent), which captured a term structure that tends to be upward sloping in our sample. Finally, the curvature factor tended to be negative on average, which points to a concave yield curve shape. According to the augmented Dickey–Fuller test, neither the slope nor the level factors exhibited unit roots at any standard confidence level on average. However, the curvature factor, β 2 , was found to possibly display a unit root at both the 5% and 10% confidence levels.13
All three factors only showed significant partial autocorrelations at the first lag, justifying their modeling with an AR(1) and VAR(1) processes as described earlier, in Section 2.2.

4. Empirical Results

4.1. Yield Curve Forecasting

The NS factors were forecast using either the AR(1) or the VAR(1) model, with the latter model able to account for cross-factor serial correlations. These factor forecasts were then applied to compute yield forecasts for the 3-month, 1-year, 5-year, and 10-year maturities, as implied by Equation (2). While eight maturities were employed to estimate the NS factors, only the four listed maturities were predicted. Table 2 presents the results of the Diebold–Mariano test applied to forecast the bond yields.
We compared the predictive accuracy of the AR(1) for the NS factors against seven benchmark models, including the VAR(1) for NS factors; the random walk for the NS factors; predictive slope regressions, AR(1), and VAR(1) for yields; AR(1) for yield changes; and a direct regression on the first three AR(1) principal components. The comparison spanned three forecast horizons (one-, six-, and twelve-step-ahead) and four yield maturities. The results from the DM one-sided test, implemented at a 5% size, show that the null hypothesis of equal predictive accuracy was only rejected for a limited set of combinations, even though the negative statistics always indicated that using AR(1) for the factors predicted more accurately.
In particular, we found that using the AR(1) model for the NS factors outperformed the direct regression on principal components across most maturities and horizons, except for 3-month yields at the six- and twelve-step horizons. It also performed better than the VAR(1) for NS factors for the one-step-ahead forecasts of 3-month yields. In the case of the VAR(1) for yields, the AR(1) for NS factors performed better at a six-month horizon in the case of the 3-month, 5-year, and 10-year yields, and at a twelve-step horizon for 3-month and 10-year yields. The DM test statistics were positive and large when comparing the AR(1) for NS factors to benchmarks like the AR(1) and VAR(1) for yields, slope regression, and random walk, for one- and six-month horizons. However, despite the poor overall performance of the AR(1) model compared to the benchmarks, the AR(1) turned out not to imply a significant difference in predictive power.14
While the DM tests failed to establish the clear outperformance of the AR(1) for NS factors relative to the benchmarks, an interesting pattern emerges in Table 2: the relative forecasting performance improved as the horizon lengthens. This improvement was more evident for shorter maturities (3-month and 1-year yields) than for longer ones. Overall, the AR(1) for NS factors performed relatively well for long horizons and short maturities, as previously noted by Diebold and Li (2006). However, in contrast to their findings, our results do not indicate significant outperformance against the random walk model at conventional confidence levels. This aligns with Duffee (2013), who also found that the AR(1) process for NS factors failed to outperform the random walk in terms of RMSE, using a more recent sample, as we did in our work.
The MCS procedure yielded results similar to those of the Diebold–Mariano test. Table 3 shows that the outcomes were consistent across both the TR and TMax test statistics (see Appendix A for the meaning of these statistics). The finding that the AR(1) applied to the NS factors performed better for short-term maturities and longer forecasting horizons is confirmed by these tests. The AR and VAR models were instead excluded from the superior set (M*) when forecasting one-step-ahead yields, except for the 10-year yield. However, the models for NS factors were included in the superior set for the 6- and 12-step-ahead forecast horizons, ranking among the top three models for various maturities.
The superior set increased in size as the forecast horizon lengthened, indicating a convergence in forecasting accuracy among the models for longer-term forecasts. In particular, the AR(1) model for yields appeared to be one of the best models across all maturities and forecasting horizons. Despite being the most accurate model for one-step-ahead forecasts, the VAR(1) for yields was excluded from the superior set for six-step and 12-step-ahead forecasts.
Additionally, although the TMax and TR test statistics yielded similar results, they differed in the number of models excluded from the superior set. For the six-month-ahead forecasts, the TMax statistic identified a narrower superior set, excluding more models than the TR statistic. This difference arose from the distinct comparison methods employed by the two test statistics: TMax evaluated each model’s predictive accuracy against the average of all other models, while TR compared models pairwise. Consequently, we expected TMax to be more selective in excluding models.
All in all, the evidence presented in Table 2 and Table 3 is consistent with the idea that autoregressive modeling of the NS factors is as accurate as a number of benchmarks. However, at least for horizons of 6 and 12 months, the equal predictive accuracy tests indicate that such models failed to be uniformly dominated by any of the benchmarks, at least with reference to the prediction of the very data from which the NS factors were extracted. This justifies our continued interest in their forecasting performance for asset returns in the reminder of this paper.
Finally, as a robustness check, we compared the combination forecasts following Rapach et al. (2010) for the factors of Welch and Goyal (2008) both with and without the NS factor. Table A4 in Appendix D presents this comparison. This additional analysis allowed us to evaluate the general usefulness of NS predictors in excess asset return forecasting, assessing whether their inclusion enhanced the predictive power of combination models beyond traditional macro-financial factors. The gains were particularly noticeable for short-term maturities, with an increase from 3.24% to 4.77% for 3-month yields and from 3.32% to 4.88% for 1-year yields. This suggests that the NS factor captured valuable information for forecasting short-term interest rates. While the improvement was smaller for longer maturities, such as 5-year and 10-year yields, the inclusion of the NS factor still enhanced the predictive accuracy, albeit to a lesser extent. This pattern aligns with the idea that the NS factor is particularly effective in explaining short-term yield variations, while providing moderate benefits for longer-term forecasts.

4.2. Forecasting Treasury Returns

Table A2 (in Appendix C) shows the results of the Diebold–Mariano test for return series; specifically, Panel A presents the results of the pairwise DM test for the government bond return series.15 The comparison of the predictive performance of the NS factors—themselves predicted from either an AR(1) or a VAR(1) model—with other specifications was performed across three forecasting horizons (one-, six-, and 12-step-ahead) and eight return series from monthly holdings of 3-month, 6-month, and 1-, 2-, 3-, 5-, 7-, and 10-year US Treasuries. The results indicate that we can reject the null hypothesis of equal predictive accuracy for only a limited set of combinations. Specifically, the AR(1) model for NS factors outperformed the AR(1) model for past returns for two maturities (3 and 6 months) and two forecasting horizons (six- and twelve-step-ahead). Additionally, it outperformed the HA model for these two maturities across all three forecasting horizons. However, the AR(1) for NS factors never outperformed the VAR(1) applied to predict NS factors, which turned out to be better suited for forecasting these returns. The VAR(1) performed better for shorter maturities and forecasting horizons, but its performance decreased for longer maturities and forecast horizons. Similarly to bond yields, the forecasting accuracy of the AR(1) NS improved as the forecasting horizon lengthened, particularly when compared to the VAR(1) NS, when their ranking flipped around and once more favored the simpler AR(1) model. However, Table A2 reports a large majority of negative DM test statistics, illustrating that the (predictions of) the NS factors provided serious competition to the AR models and the very random walk for prices, when it comes to forecasting bond returns.
In Table 4, the MCS results, obtained applying the 90% confidence testing level, further confirm the superior performance of the VAR(1) model applied to predict the NS factors compared to the other models. The VAR(1) NS model was always included in the superior set M* across all maturities and forecast horizons, consistently ranking the highest among the four models. The AR(1) NS also performed well but was often excluded from M* due to the VAR(1) model’s superior performance. Nonetheless, the results varied depending on the specific MCS test statistic used, whether TMax or TR. Under TMax, the AR(1) for the NS factors was excluded from the superior set only five times out of 24 tests (across three forecast horizons and eight return series). When we used the TR statistic, the AR(1) was excluded more frequently, six times for one-step-ahead predictions, five times for six-step-ahead, and four times for the 12-month-ahead forecasts. Finally, regarding the comparison of forecast combining models applied to Treasury returns, we observed that the inclusion of the NS factor significantly enhanced the predictive accuracy compared to using only the GW predictors. Specifically, the out-of-sample R O S 2 increased substantially for shorter maturities, such as the 3 M and 6 M Treasury securities, showing a marked improvement that suggests the additional informational value of the NS factor in short-term bond return forecasting. For longer maturities, the impact of the NS factor was more moderate but remained positive for several horizons, indicating its potential to enhance the performance of combined forecasting models.
More importantly for our purposes, both the AR(1) for past returns and the HA benchmarks consistently underperformed compared to the models using NS factor forecasts. These benchmarks were often excluded from the superior set and ranked low when included, especially in comparison to the VAR(1) for NS factors. Therefore, maybe unsurprisingly, predictions of NS factors extracted from Treasury yield data ranked rather highly within a rather classical set of forecast models for Treasury returns, especially for very-short- and very-long-term maturities and for six- and 12-month horizons.

4.3. Forecasting Equity Portfolio Returns

Following the same design used for Treasury returns, the DM test results for equity returns are shown in Panel B of Table A2, in Appendix C. The comparison was made across the three forecast horizons pursued in this paper and with reference to 113 portfolio return series, grouped into eight sub-categories: Fama and French factors portfolios; industry portfolios; various other portfolios sorted on the basis of stock and firm characteristics, as listed in Table A1, Panel C). In the prediction of stock returns one month ahead, the AR(1) model for the NS factors often outperformed the AR(1) applied to past returns and the HA model. Out of 113 equity return series, we rejected the null hypothesis of equal predictive accuracy between the AR(1) NS and the AR(1) for equity returns 52 times, and 49 times vs. the HA model. Such statistics indicate a superior predictive accuracy that largely exceeded the 10 percent we would expect from pure chance. However, this superior performance failed to emerge in the case of six- and 12-step-ahead forecasts, where the AR(1) NS model often underperformed compared to the benchmarks. The VAR(1) NS generally performed better than the AR(1) NS, forecasting more accurately than the latter more often than not. The difference in predictive accuracy between the two models was consistent across all three forecast horizons.
The results of the MCS approach provide a more comprehensive view of a model’s relative performance. A pattern involving both NS factor-based models emerged, consistent with what was reported with reference to the DM test. According to both the TR and TMax statistics, the AR(1) and VAR(1) NS models were the most accurate for one-step-ahead forecasting, rarely being excluded from the superior set. For the one-step-ahead forecast, the AR(1) was included in the superior set with 92% and 81% probabilities for TMax and TR, respectively. In fact, the VAR(1) NS was included with slightly higher probabilities: 93% and 82% for TMax and TR. As seen for US Treasury returns, the TR statistic was more selective, excluding more models from the superior set, and so these findings were to be expected. Moreover, AR(1) and VAR(1) NS turned out to be the best performing models for 53% and 38% of the target series, respectively, being placed as the best or the second best for 83% and 88% of the target series, which is rather an impressive performance. On the contrary, the AR(1) and HA models for returns performed rather poorly and were excluded from the superior set for more than 50% of the return series. Therefore, the short-term predictive power of NS factors for a large majority of the equity portfolio returns appeared to be remarkable and, possibly surprisingly, turned out to exceed what was found in the case of Treasuries.
Finally, the inclusion of NS factors did not significantly enhance the predictive power for equity returns, as it did for yields and government returns. The only notable exception was for Fabricated Products returns, where the inclusion of NS factors led to a visible improvement in predictive performance.

4.4. Forecasting REIT Returns

In contrast, for six- and 12-month-ahead forecasts, both the AR(1) applied directly to returns and the HA outperformed the AR(1) and VAR(1) models based on NS factor predictions. Specifically, the AR(1) for stock returns was included in the superior set with probabilities of 99% and 98% for six-step-ahead forecasts (using TMax and TR statistics, respectively), and 98% in the case of the 12-step-ahead horizon case. The even simpler HA model was included with slightly lower probabilities than the AR(1) but still tended to outperform the NS-based models. Overall, the HA model demonstrated superior accuracy to the AR(1), while the VAR(1) outperformed the AR(1) NS, regardless of the forecast horizon.
The results of DM tests applied to REIT returns are presented in Panel C of Table A3, as usual, across three forecasting horizons and nine return series. Similarly to the findings for equity returns, the AR(1) based on the NS factors outperformed, at the usual one-sided 5% size, the AR(1) estimated on returns and the historical mean models for one-step-ahead forecasts. The AR(1) NS, however, failed to outperform the VAR(1) NS across all REIT return series and forecasting horizons. In the case of the one-step-ahead forecast, the AR(1) NS outperformed the standard AR(1) six times and the HA model three times, out of nine series analyzed. When we compare the performance of the two NS factor-based models, AR(1) outperformed in the case of the month horizon, while VAR(1) came out on top for the 12-step-ahead horizon; we found mixed results for the six-month horizon. As in the case of equity returns, the predictive accuracy of the AR(1) NS deteriorated relative to the benchmarks as the forecast horizon lengthened. Contrary to the equity returns case, where the performance shifted in favor of the AR(1) model on returns and HA for the 12-step-ahead forecasts, the AR(1) NS consistently outperformed the two benchmarks for most REIT indices across all horizons, including the six-step and 12-step horizons.
The findings from the DM tests were reinforced by the results from the MCS procedure, which are presented in Table 4. The AR(1) and VAR(1) driven by NS predictions were rarely excluded from the superior set across all three horizons, irrespective of the test statistic used. Only for one REIT return series and for the one-step-ahead horizon was VAR(1) NS excluded from the superior set, based on the TMax statistic. The classical AR(1) and the HA models were instead frequently excluded from the superior set for the one-step-ahead horizon. Specifically, they were excluded six and seven times, respectively, out of the nine return series considered, regardless of the MCS test statistic used. However, both models were always included in the superior set for 6- and 12-month predictions. Even when they were included in the superior set, these models consistently ranked below the AR(1) and VAR(1) NS across the majority of the nine return series considered. Specifically, for the 6-step and 12-step-ahead horizons, the standard AR(1) never ranked first. The HA model ranked first three times for the 6-step-ahead horizon and once for the 12-step-ahead horizon, according to both MCS test statistics.
The REIT returns were predictable not only at the one-step-ahead horizon but also at the 6-step and 12-month horizons. Both the DM test and the MCS procedure showed a convergence in forecasting performance between the two NS factor-based models and the two benchmarks for the 6- and 12-step-ahead prediction horizons. The forecasts of the dynamics of the yield curve proved to contain more precise information on future REIT returns than for equity returns, indicating a surprisingly high integration between the REIT and U.S. government bond markets. This aligns with the fact that interest rate changes tend to have longer-lasting effects on real estate investments.

4.5. Forecasting Corporate Bond Returns

We analyzed the predictability of four corporate bond return series corresponding to the highest investment grade ratings. In our sample, based on the DM test statistic implemented at a 5% test size level, we failed to reject the null hypothesis of equal predictive accuracy across the three benchmarks assumed against the AR(1) NS factors model, for all forecast horizons (Panel D of Table A2). As observed for both equity and REIT returns (but not for Treasury returns), the forecasting accuracy of the AR(1) NS model decreased as the horizon lengthened, compared to the HA and the AR(1) model applied to corporate bond returns. In contrast, the AR(1) NS model performed better than the VAR(1) NS when forecasting corporate bond returns, although this turned out not to be the case for equity and REIT returns. Overall, while the AR(1) NS tended to lag behind the VAR(1), it outperformed the other two benchmark models.
The results from the MCS procedure, shown in Table 4, Panels A through F, align with those reported for REITs. In most experiments, both the AR(1) and VAR(1) NS models remained inside the superior set across all horizons, for the specific portfolio return series, and regardless of the test statistic used. The only exception concerned the AA-rated corporate bond series, for which the AR(1) NS model happened to be excluded from the superior set in the case of the one-month-ahead predictions, according to the TR statistic. On the contrary, the classical AR(1) and the HA model were typically excluded from the superior set for the one-step-ahead forecasts, but were included for the six- and twelve-month-ahead forecasts. Both models consistently ranked lower than AR(1) and VAR(1) NS, never ranking first across any of the horizons. For the six-step-ahead forecast, the HA model ranked second once and fourth three times, while the standard AR(1) for returns placed third three times and fourth once. For the 12-step-ahead forecasts, the HA model ranked third consistently, with the AR(1) for returns always ranking fourth.
Both the DM test and the MCS procedure confirmed that NS factors can predict U.S. corporate bond returns, though the predictive strength was weaker when compared to REIT returns and less pronounced than for equity returns at the one-step-ahead forecast horizon. Across the four bond return series, we could not reject the null hypothesis of equal predictive accuracy using the DM test. However, for both the 6- and 12-step-ahead predictions, the AR(1) and VAR(1) NS models outperformed the HA and the classical AR(1) and ranked higher according to the MCS procedure. The MCS results were consistent across the different test statistics, with rankings and superior set compositions remaining unchanged under both TMax and TR tests.

4.6. Forecasting Commodity Returns

With reference to the 17 return series for commodity futures, the test statistics presented in Table A2, Panel E, for various comparisons across commodities and three horizons, show that the NS factors performed poorly in predicting commodity futures returns. Similarly to the findings for corporate bond, the AR(1) NS failed to outperform the benchmarks, i.e., the AR(1) for commodity returns and the HA model. On the contrary, AR(1) NS performed similarly to the VAR(1) NS model. As in the case of other asset classes, the forecasting accuracy of the NS-driven models declined as the forecast horizon lengthened, in comparison to the benchmarks. Additionally, no clear trend was observed in the predictive performance across the four commodity subgroups (metals, energy, agricultural, and livestock).
Although this may appear unsurprising in light of the DM tests, according to the MCS procedure, the results of which are presented in Table 4, Panels A through F, forecasting commodity futures returns using NS factors led to partial success at best. The AR(1) and VAR(1) NS models were frequently excluded from the superior set M*; this occurred for approximately 50% of forecasting experiments, for both one- and six-month-ahead forecasts. However, the performance of the two benchmarks was comparable to the HA model, which was indeed often excluded from the superior set as well. The classical AR(1) for returns had the highest predictive accuracy among all models considered, ranking first 14 times for the one-step-ahead horizon and 11 times for the six-step-ahead horizon, out of the 17 series considered. As expected, in the case of the 12-step-ahead horizon, the MCS test excluded the AR(1) and VAR(1) NS from the superior set more frequently than for the shorter prediction horizons. At 12 months, both models were only included in M* once, according to the TMax statistic, and twice according to the TR statistic. When compared to the benchmarks, AR(1) NS outperformed HA for the one- and six-step-ahead forecasts but underperformed for the 12-step-ahead horizon. Therefore, and contrary expectations based on the influence of interest rates on commodity futures prices (Frankel, 2008), both th DM and MCS tests failed to produce significant results in forecasting commodity returns across all horizons.16

5. Discussion and Conclusions

The main goal of this paper was to assess the hypothesis that the three NS factors (level, slope, and curvature) derived from the US yield curve can be used to forecast returns across a variety of financial assets. To test this over-arching hypothesis, we analyzed the predictability of monthly return series covering a range of asset classes, i.e., equity portfolios, REITs, corporate bonds, commodity futures, and US Treasury notes. Given a baseline null hypothesis that, being extracted from yield curve data, the NS factors should contain no (or not much) information for forecasting returns on asset classes that remain essentially different from default risk-free bonds, our results suggest that the three Nelson–Siegel factors do carry some unsuspected predictive power well beyond the government bond market. Although the performance of AR(1) and VAR(1) models incorporating predictions of the NS factors relative to common benchmarks was not always overwhelming and it varied across forecast horizons and asset classes, the results were generally statistically significant. Specifically, we found that, in the case of one-step-ahead forecasts, the AR(1) and VAR(1) NS models frequently outperformed the benchmarks, which included a classical AR(1) model estimated on past returns and the historical mean model (derived from the random walk hypothesis for log-asset prices). However, the predictive accuracy of the AR(1) and VAR(1) NS models tended to decline as the forecast horizon lengthened, as the predictive performance at the 6- and 12-month-ahead horizons turned out to be inferior vs. the short-term horizons. These results indicate that models based on predictions of NS factors may have been more appropriate for short-term forecasting, while the benchmarks became increasingly competitive when applied within medium- to long-term applications.
As one would expect, in our recursive, pseudo OOS experiments, the strongest predictive accuracy was obtained with reference to the prediction of US Treasury returns, when both the AR(1) and VAR(1) NS models performed well across most forecast horizons. In light of the baseline null hypothesis, the results also revealed notable pockets of non-negligible predictive power in the case of corporate bonds, REITs (which are generally linked to the dynamics of interest rates through the mortgage market), and even equities. However, the informational content of the NS factors was weaker in the case of commodity futures returns, where they failed to outperform even simple benchmarks.17
In conclusion, our results underscore the importance of considering the historical dynamics of the yield curve’s level, slope, and curvature when forecasting financial returns, and opens up the possibility that the NS factors expressed by US interest rate data may appear in a structural way in the SDF pricing and hence display predictive power for the returns on all assets. Indeed, the NS factors may be conjectured to describe the yield curve’s dynamics and, in turn, capture shifts in macroeconomic conditions that affect investment opportunities and hence, risk premia. These factors, if sufficiently systematic and pervasive, could be drivers of the SDF, which governs asset pricing through the fundamental condition
P t = E t m t + 1 r t + 1 ,
where m t + 1 is the SDF and r t + 1 is the return on any traded asset. If the NS factors serve as state variables, because they may affect consumption growth, investor sentiment, or financial frictions, they may be embedded into the SDF representation as
m t + 1 = a + b 0 β 0 , t + b 1 β 1 , t + b 2 β 2 , t + ε t + 1
Setting r t + 1 = 1 in Equation (18) implies that P t represents the short-term yield, linking the expectation of m t + 1 directly to the short-term rate. Since this rate is largely determined by the sum of the level and slope factors, it follows that the stochastic discount factor is primarily influenced by β 0 and β 1 , rather than the curvature factor β 2 . This observation suggests that the stronger predictive power observed for financial asset returns likely stems from the combined effect of level and slope factors. Empirically, our results support this interpretation, as models incorporating all three NS factors outperformed those based on individual factors. This indicates that return predictability is not solely driven by curvature effects but rather emerges from broader interactions within the term structure.
This formulation implies that any movement in the yield curve factors propagates into expected returns across asset classes. As argued since Diebold and Li (2006), the level factor ( β 0 ) captures the general risk-free rate and long-term discount rates, the slope factor ( β 1 ) reflects changes in short-term interest rate expectations and monetary policy, and the curvature factor ( β 2 ) accounts for medium-term risk adjustments and liquidity considerations. Consequently, fluctuations in these factors alter discount rates and expected excess returns, influencing not only bonds but also equities, real estate, currencies, and, in principle, alternative assets.
Our findings establish a potential link between NS factors and traditional asset pricing models, such as the Fama–French factor model and the consumption-based asset pricing model (C-CAPM). While the Fama–French framework attributes equity return variations to systematic, aggregate market risk exposure, and the C-CAPM links asset prices to consumption risk, our results suggest that NS factors may similarly encapsulate macroeconomic conditions shaping expected returns across different asset classes. This implies that the NS factors could either complement existing pricing factors and serve as alternative proxies for systematic risk or simply provide instruments that can predict business cycles and hence aggregate asset market valuations. For example, the role of the level factor ( β 0 ) in capturing variations in long-term discount rates aligns with the role played by the market risk premium in the Fama–French model. The slope factor ( β 1 ) is a well-known predictor of business cycle fluctuations, and therefore may help explain the cross-sectional variation in stock returns, particularly in relation to small-cap (SMB) and value (HML) stocks. Meanwhile, the curvature factor ( β 2 ) may be linked to liquidity conditions and short-term risk dynamics, concepts widely explored in liquidity-based asset pricing models (see, e.g., Holmström and Tirole (2001)). From a C-CAPM perspective, the NS factors could act as proxies for time-varying discount rates embedded in the SDF. If yield curve capture conveys shifts in consumption growth expectations and intertemporal risk aversion, then variations in ( β 0 ), ( β 1 ), and ( β 2 ) should help predict excess returns across asset classes, which is consistent with the bulk of our empirical findings.
This predictive ability of NS factors aligns with theoretical constraints on return forecastability. For instance, Potì (2018) established a tight bound on the predictability of portfolio returns using features of the SDF, and showed that out-of-sample predictability is constrained by the correlation between the SDF and the returns of any portfolio. In this framework, the upper bound on return predictability is given by
R O O S 2 ρ 2 ( r t + 1 , m t ) σ 2 ( m t + 1 )
where ρ ( r t + 1 , m t + 1 ) denotes the correlation between the SDF m t + 1 and the returns on some portfolio r t + 1 , while σ 2 ( m t + 1 ) represents the unconditional variance in the SDF. If the NS factors significantly contribute to the predicted variation in m t + 1 , then their ability to forecast asset returns is dictated by their empirical correlation with the pricing kernel.
Our empirical evidence shows that NS factors often exhibit high predictive power for short-term returns, particularly in fixed-income and equity markets, indicating that they effectively capture fluctuations in risk premia and discount rates. The observed return predictability, if within the derived theoretical bounds, would support the idea that NS factors serve as valid proxies for economic conditions embedded in the SDF. By influencing discount rates and expected excess returns across multiple markets, the NS factors may serve as state variables that should be explicitly incorporated into asset pricing models.

Author Contributions

M.G.: conceptualization, formal analysis, writing—review and editing, supervision, funding acquisition, investigation; S.I.: software, writing—original draft preparation, formal analysis, investigation. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by the European Union—NextGenerationEU, in the framework of the GRINS—Growing Resilient, Inclusive and Sustainable project (GRINS PE000018—CUP B43C22000760006). The views and opinions expressed are solely those of the authors and do not necessarily reflect those of the European Union, nor can the European Union be held responsible for them.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Diebold–Mariano and Model Confidence Set Tests

For yields with maturity m, the forecast error for model i at time t and forecast horizon h is defined as
e i , t ( m ) = y ^ i , t + h / t ( m ) y t + h ( m )
For returns of asset j, the forecast error for model i is given by
e j , i , t = r ^ j , i , t + h / t r j , i , t + h
Using the squared forecast error as the loss function, the loss variables for yields and returns are respectively defined as
l i , t ( m ) = e i , t ( m ) 2 , l j , i , t = e j , i , t 2
The loss differential between competing models i and k is defined as
d i k , t ( m ) = l i , t ( m ) l k , t ( m ) , d j , i k , t = l j , i , t l j , k , t
The null hypothesis of equal predictive accuracy between models i and k is tested as follows
H 0 : E [ d i k , t ] = 0 t
against the alternative:
H 1 : E [ d i k , t ] 0 t
Under the assumption that the loss differential series d t is covariance stationary and short in memory, the DM test statistic follows a standard normal distribution:
D M = d ¯ 2 π f ^ 0 ( 0 ) T N ( 0 , 1 )
where d ¯ is the average loss differential. The spectral density f ^ 0 ( 0 ) is estimated as follows:
For h = 1 ,
f ^ 0 ( 0 ) = γ ^ d ( 0 ) 2 π
For h = 6 ,
f ^ 0 ( 0 ) = 1 2 π γ ^ d ( 0 ) + 2 k = 1 5 γ ^ d ( k )
For h = 12 ,
f ^ 0 ( 0 ) = 1 2 π γ ^ d ( 0 ) + 2 k = 1 11 γ ^ d ( k )
where γ ^ d ( k ) is the k-th autocorrelation of the loss differential series d t . The null hypothesis is tested for the models defined in Section 2.2 at a 5% significance level using a one-tailed test. A DM test statistic below 1.645 rejects the null hypothesis.
As regards the MSC procedure, starting from an initial set of models M, the procedure identifies a subset M M that, with a predetermined confidence level, includes the best forecasting model. Let d i j , t represent the loss differential between two models i and j, as defined above. Additionally, define the average loss differential between model i and all other models as
d i j , t = j M d i j , t m 1 for i , j = 1 , , m
The null hypothesis of equal predictive accuracy (EPA) can be tested in two ways. First, we can test that all pairwise loss differentials d i j , t are zero across all models:
H 0 , M : E ( d i j , t ) = 0 for all i , j = 1 , 2 , , m
against the alternative hypothesis that there is at least one non-zero pairwise loss differential:
H 1 , M : E ( d i j , t ) 0 for some i , j = 1 , 2 , , m
Alternatively, we can test that the average loss differentials are zero for all models:
H 0 , M : E ( d i · , t ) = 0 for all i = 1 , 2 , , m
against the alternative hypothesis that at least one average loss differential is non-zero:
H 1 , M : E ( d i · , t ) 0 for some i = 1 , 2 , , m
The following statistics are used to test these hypotheses:
t i j = d ¯ i j σ ^ 2 ( d i j ) and t i · = d ¯ i · σ ^ 2 ( d i · )
where d ¯ i j = t = 1 T d i j , t T and d ¯ i · = j i d ¯ i j m 1 , while σ ^ 2 ( d i j ) and σ ^ 2 ( d i · ) are bootstrapped estimates of the variances σ 2 ( d i j ) and σ 2 ( d i · ) , respectively.
The two test statistics for these hypotheses are
T R , M = max i , j M | t i j | and T max , M = max i M t i ·
These test statistics are non-normally distributed, and their asymptotic distributions are estimated using the bootstrap. The procedure was carried out sequentially, eliminating models one at a time from the superior set until the EPA hypothesis was not rejected.
Models were eliminated according to the following rules:
e R , M = arg max i sup j M t i j and e max , M = arg max i M t i ·
The number of bootstrapped samples was set to 1000, the quadratic loss function was used, and the confidence level was 1 α = 0.9 . The MCS procedure was applied for models forecasting yields (models I to VIII in Section 2.2), for each forecasting horizon (1-step-ahead, 6-step-ahead, and 12-step-ahead), and for each yield analyzed (3 months, 1 year, 5 years, and 10 years). The procedure was also applied for models forecasting returns (models I to IV in Section 2.3), for each of the three forecasting horizons (1-step-ahead, 6-step-ahead, and 12-step-ahead), and for each portfolio analyzed across various asset classes (equity, US Treasury securities, corporate bonds, REITs, and commodity futures contracts).

Appendix B. Summary Statistics

Table A1. Summary statistics for target return series.
Table A1. Summary statistics for target return series.
Panel A: Treasury RatesObservationsMeanMedianMinimumMaximumSt. DevSkewness
3 Months3722.6922.2250.0008.0702.2860.341
6 Months3722.8182.3600.0308.4402.3200.324
1 Year3722.9432.4750.0908.5802.3260.303
2 Years3723.2302.8650.1108.9602.3540.289
3 Years3723.4413.1300.1109.0502.3120.276
5 Years3723.8313.7100.2109.0402.2030.248
7 Years3724.1354.0150.3909.0602.1150.243
10 Years3724.3704.3350.5509.0402.0120.238
Panel B: Treasury Returns         ObservationsMeanMedianMinimumMaximumSt. DevSkewness
3 Months3710.2260.193−0.0170.6980.1950.377
6 Months3710.2420.197−0.0600.8760.2170.556
1 Year3710.4450.205−5.02412.2141.9851.326
2 Years3710.4830.336−6.9579.1102.3270.355
3 Years3710.5050.398−8.3628.5502.4800.065
5 Years3710.5340.485−8.6379.3172.542−0.025
7 Years3710.5510.480−8.6119.8922.481−0.064
10 Years3710.5620.413−8.34611.0542.3800.085
Panel CObservationsMeanMedianMinimumMaximumSt. DevSkewness
Fama and French Factor Portfolios
SMB3700.0920.115−16.72021.1303.1430.633
HML3700.191−0.050−9.05014.1103.0740.761
Mkt3700.8911.385−17.15013.6504.352−0.641
Industry Portfolios
Food3710.9131.020−14.56015.1403.962−0.286
Mines3710.8460.740−32.74022.4507.919−0.284
Oil3710.7010.670−34.81032.8206.2800.003
Textiles3711.0401.280−22.14023.2706.150−0.216
Consumer Durables3710.8081.070−25.77029.2405.777−0.217
Chemicals3710.8971.240−21.93022.3005.886−0.193
Consumer3711.0011.380−10.13015.6304.156−0.179
Construction3711.1371.510−20.20017.9205.871−0.248
Steel3710.6690.710−32.91030.6708.333−0.230
Fabricated Products3711.0331.360−23.03018.8205.473−0.496
Machinery3711.1781.670−28.32019.3206.938−0.514
Cars3711.0970.980−28.43038.7107.1940.536
Transportation3711.0231.510−22.68019.7005.196−0.621
Utilities3710.7991.230−13.02011.7203.965−0.645
Retail Stores3711.0801.010−14.59018.2404.776−0.070
Financials3710.9731.590−22.10017.1005.647−0.623
Other3710.9181.600−17.79015.3104.941−0.489
Size and Value
Ptf 13700.5000.937−27.84538.3217.9820.209
Ptf 23701.0801.454−21.52142.4217.0130.442
Ptf 33700.9821.370−21.92921.0575.740−0.344
Ptf 43701.1591.470−26.78626.3695.615−0.344
Ptf 53701.1661.330−27.82016.3815.801−0.721
Ptf 63700.8751.527−25.77528.1157.024−0.308
Ptf 73701.0451.581−23.11618.7855.775−0.518
Ptf 83701.0421.529−21.25716.0625.191−0.661
Ptf 93700.9741.788−23.77216.1935.295−0.822
Ptf 103701.0251.771−32.14426.0196.333−0.776
Ptf 113700.9291.721−23.09723.1306.485−0.436
Ptf 123701.0911.454−21.50918.9695.335−0.418
Ptf 133700.9781.465−17.84716.7574.879−0.555
Ptf 143701.0771.522−26.85416.9365.168−0.752
Ptf 153701.0551.382−31.17519.0585.885−0.815
Ptf 163701.1161.380−20.84026.0445.816−0.175
Ptf 173701.0541.364−20.16515.8994.873−0.713
Ptf 183700.9311.462−25.32715.5155.010−0.867
Ptf 193701.0021.604−31.42716.2335.025−1.128
Ptf 203700.8991.832−32.80218.3365.830−0.961
Ptf 213701.0131.078−14.77615.3564.437−0.298
Ptf 223700.9111.313−17.04913.6914.168−0.540
Ptf 233700.9311.469−17.30812.5704.246−0.623
Ptf 243700.6051.151−27.16415.8785.025−1.209
Ptf 253700.8461.330−28.40121.4946.218−0.572
Operating Profits and Investments
Ptf 263710.9291.305−25.25827.0486.967−0.085
Ptf 273710.6291.009−25.57215.9985.420−0.599
Ptf 283710.9701.266−31.15218.7135.712−0.764
Ptf 293710.9241.884−26.36618.6206.106−0.711
Operating Profits and Investments
Ptf 303710.4851.201−27.41418.7636.832−0.598
Ptf 313710.9451.098−19.47421.3525.180−0.238
Ptf 323711.0101.486−20.91217.8864.983−0.383
Ptf 333710.8241.244−24.36818.7835.243−0.920
Ptf 343711.0041.255−20.01423.1415.258−0.163
Ptf 353710.8221.189−20.74927.2565.612−0.180
Ptf 363710.9321.393−23.23915.7675.105−0.572
Ptf 373711.0321.269−22.02116.5204.746−0.522
Ptf 383710.9901.433−20.93913.8584.505−0.781
Ptf 393710.9751.296−21.70416.7414.745−0.643
Ptf 403710.8631.700−24.74616.4465.825−0.714
Ptf 413711.2411.599−21.23515.4464.563−0.449
Ptf 423711.0341.227−13.24715.3503.826−0.171
Ptf 433710.9471.201−14.48512.9524.242−0.413
Ptf 443711.0771.569−17.85518.1254.613−0.447
Ptf 453711.1031.342−23.78418.8965.565−0.412
Ptf 463711.1841.534−17.22515.0814.588−0.258
Ptf 473710.9731.292−19.06817.0084.364−0.356
Ptf 483710.9141.142−17.24917.4314.253−0.354
Ptf 493711.1031.233−16.58812.8514.537−0.352
Size and Cash flow
Ptf 503711.1701.349−20.30820.0175.884−0.332
Ptf 513710.9921.690−21.65019.5205.876−0.375
Ptf 523711.1251.720−22.92016.6704.961−0.709
Ptf 533711.1221.690−31.13021.0005.663−0.816
Ptf 543711.0241.390−15.67014.3604.522−0.355
Size and Dividend yield
Ptf 553710.8981.350−15.38012.5704.139−0.573
Ptf 563710.9351.350−23.61015.1904.594−0.816
Ptf 573711.1031.800−20.89015.9305.180−0.646
Ptf 583711.0101.560−20.44016.2104.821−0.633
Ptf 593710.9921.320−23.46021.8404.644−0.608
Size and Price to Earnings ratio
Ptf 603710.8941.210−20.31016.1004.861−0.492
Ptf 613710.9681.230−15.55012.9004.063−0.555
Ptf 623710.8681.030−16.75014.9604.072−0.661
Ptf 633711.0001.700−22.51018.8605.928−0.415
Ptf 643711.0971.660−22.89017.0304.873−0.679
Ptf 653711.1381.640−30.89020.0505.536−0.844
Ptf 663710.9551.230−15.94013.8004.451−0.375
Ptf 673710.9301.250−15.47013.1004.104−0.511
Ptf 683711.0461.580−23.59015.9504.788−0.829
Ptf 693710.5730.610−29.21047.5208.8460.858
Ptf 703710.8971.280−23.55027.5505.758−0.308
Ptf 713711.1371.670−23.43022.4805.190−0.538
Ptf 723711.3331.690−24.34020.7305.157−0.548
Ptf 733711.6062.190−22.83031.8506.619−0.132
Ptf 743710.7370.760−27.73055.8508.7280.607
Ptf 753711.0371.480−25.21033.7406.095−0.156
Ptf 763711.1221.620−22.28018.6705.156−0.607
Ptf 773711.2181.630−23.07017.6705.183−0.565
Ptf 783711.4041.780−25.60030.1606.799−0.132
Ptf 793710.7900.890−27.53044.2808.4380.417
Ptf 803710.9701.440−25.02027.0605.820−0.350
Ptf 813711.0471.350−22.20023.5205.012−0.482
Ptf 823711.0351.490−22.11016.6204.808−0.637
Size and Momentum
Ptf 833711.2971.640−21.46026.3906.215−0.267
Ptf 843710.6490.620−38.49046.4208.5540.133
Ptf 853711.0161.420−23.55028.8805.738−0.161
Ptf 863711.0921.610−22.24017.4804.802−0.580
Ptf 873711.0751.410−19.39012.9804.378−0.719
Ptf 883711.2451.430−22.17025.0805.631−0.338
Ptf 893710.6000.620−31.45030.9507.7200.163
Ptf 903710.8710.930−20.64024.3705.144−0.003
Ptf 913710.8991.130−16.89017.5404.211−0.329
Ptf 923710.9641.210−14.19015.8004.019−0.263
Ptf 933711.1111.630−19.61018.5204.920−0.381
Panel DObservationsMeanMedianMinimumMaximumSt. DevSkewness
Diversified3210.6991.262−31.96039.6876.181−0.575
Industrial3211.2321.528−56.18870.4838.2540.264
Lodging/Resorts3210.5700.590−36.55567.5258.7100.638
Office3210.8951.480−31.79632.4586.077−0.487
Residential3211.0131.362−26.65622.2425.501−0.833
Retail3210.8461.086−42.67843.5166.701−0.985
Self Storage3211.3261.616−22.24421.9285.584−0.364
All REITs3210.8881.194−30.22627.9755.296−0.959
Mortgage REITs3210.6421.595−53.75319.4116.551−2.455
Panel EObservationsMeanMedianMinimumMaximumSt. DevSkewness
AAA3720.5360.586−6.5446.3911.473−0.185
AA3720.5860.565−5.7745.3571.339−0.258
BBB3720.5310.671−10.9966.2861.668−1.575
A3720.5650.649−9.6726.7501.524−1.063
Panel FObservationsMeanMedianMinimumMaximumSt. DevSkewness
Gold3720.5330.115−18.00516.4584.4300.174
Silver3720.8660.235−27.95830.4628.3710.199
Platinum3720.4170.309−42.00424.1126.286−0.854
Copper3720.5820.421−36.15034.0647.1810.044
Brent Crude3720.6600.829−54.98840.0749.540−0.361
Gasoil3720.5570.566−33.61731.1729.286−0.092
Light crude oil3720.7030.924−54.24588.37610.4851.198
Natural Gas3721.1320.512−41.61662.61315.1970.568
Cotton3720.3850.511−36.11624.7498.492−0.222
Cocoa3720.5850.201−28.08334.5658.9210.457
Coffee3720.638−0.727−30.39150.59510.6221.007
Corn3720.5650.081−31.38022.1907.976−0.185
Lumber3721.0910.485−34.03058.41611.2050.674
Soybean Oil3720.4440.242−24.46126.8626.9330.039
Soybeans3720.4850.434−32.86719.5737.010−0.491
Wheat3720.6370.204−25.24842.3358.6080.516
Live Cattle3720.2640.017−25.56540.3425.6520.437
The table contains summary statistics for the sample of monthly data used in the analysis. The sample was from January 1990 to December 2020. Each panel represents a different asset class. Panel A shows descriptive statistics of US Constant Maturity Treasury rates, taken to be representative of the US yield curve, at different maturities. Panel B shows statistics for the returns on US Treasury securities. Panels C, D, E, and F show statistics for equities, REIT, corporate bonds, and commodity returns, respectively. In particular, Panel C is split into eight subsections, showing statistics for the returns on the original Fama–French factor portfolios, the 17 industry portfolios, and various bivariate types on size, book-to-market, investments, operating profitability, cash flows, dividend yield, earnings price ratio, and momentum. Values are reported in percentage points.

Appendix C. Diebold–Mariano Test on Asset Return Series

Table A2. Diebold–Mariano test on return series.
Table A2. Diebold–Mariano test on return series.
Panel A—Diebold–Mariano test comparing AR(1) on NS to listed models
VAR(1) on NSAR(1) for ReturnsHistorical Average
h = 1h = 6h = 12h = 1h = 6h = 12h = 1h = 6h = 12
3 Months                   7.2831.7361.5171.8262.1401.87517.7725.7623.419
6 Months6.7602.7712.077−0.7864.4132.87816.3215.6883.373
1 Year4.8872.4891.0460.940−0.026−0.082−0.331−0.054−0.089
2 Years4.0562.2561.1300.9030.405−0.0260.3840.402−0.021
3 Years3.0591.6280.7860.5290.328−0.1950.3230.325−0.193
5 Years1.3910.374−0.185−0.294−0.041−0.677−0.323−0.045−0.664
7 Years0.559−0.208−0.630−0.595−0.177−0.924−0.810−0.186−0.896
10 Years−0.335−0.749−0.967−0.909−0.143−0.975−1.353−0.162−0.945
Panel B—Diebold–Mariano test comparing AR(1) on NS vs. listed models
VAR(1) on NSAR(1) for ReturnsHistorical Average
h = 1h = 6h = 12h = 1h = 6h = 12h = 1h = 6h = 12
SMB0.126−0.542−1.039−0.2450.006−0.113−1.1370.025−0.059
HML1.3141.3871.107−1.489−1.0392.1762.2471.869−1.048
Mkt0.4721.2181.2201.9222.3331.606−1.3950.7452.549
Food2.6982.5041.7420.9301.6131.6060.9451.6791.659
Mines2.193−1.363−1.3991.0682.3692.0221.1842.3642.030
Oil−1.5731.955−0.6840.1920.4600.935−1.081−0.5191.048
Textiles2.5302.3411.697−0.3681.7771.7350.4421.7591.835
Consumer Durables2.381−1.3430.5450.5921.0171.302−0.0381.0771.366
Chemicals1.4071.4581.039−0.4900.6441.001−0.351−0.6751.085
Steel1.0681.9021.8392.5040.5541.156−0.902−1.129−0.678
Consumer0.826−0.061−0.0940.5921.0171.302−0.0381.0771.366
Construction0.826−0.061−0.0940.5921.0171.302−0.0381.0771.366
Steel1.4741.5250.807−0.4351.6962.370−0.848−1.6622.197
Fabricated Products−1.408−1.357−1.3812.3641.3991.588−0.9571.496−1.670
Machinery1.2631.0270.209−0.5671.3861.364−0.7231.931−1.384
Cars−1.498−1.579−1.417−1.0681.2271.525−1.5661.2491.544
Transportation1.3971.2821.070−1.0661.2091.756−0.6861.2351.904
Utilities−0.712−1.5172.243−2.7751.2771.2542.4441.877−1.087
Retail Stores1.2691.4301.8332.5121.2341.6341.7421.0921.643
Financials2.0852.3022.0810.2101.3041.7240.2931.3221.770
Other−0.728−0.0420.6312.162−0.6071.3002.668−1.4431.034
Ptf 1−1.042−0.875−0.369−1.5160.6552.2063.2510.5372.125
Ptf 2−0.800−0.970−1.4212.9100.7021.6221.624−0.7321.718
Ptf 3−0.656−0.750−0.7692.6380.1260.9811.9430.1701.060
Ptf 4−1.106−1.035−0.7112.329−0.2430.221−0.944−0.2080.267
Ptf 5−1.107−1.126−0.8971.974−0.3050.2341.870−0.2590.292
Ptf 6−1.461−0.863−0.7152.3111.1532.6462.6781.0912.213
Ptf 70.4100.578−0.046−1.5341.4700.9022.1691.3880.773
Ptf 8−0.857−0.637−0.1161.8070.0670.5411.8680.1450.608
Ptf 9−0.988−0.958−0.6912.272−0.4240.112−1.144−0.3750.175
Ptf 10−0.353−0.442−0.5152.196−0.1250.724−1.203−0.047−0.885
Ptf 11−1.282−1.189−1.529−1.4931.1982.7751.8861.2283.369
Ptf 12−1.243−0.5330.0271.7970.6541.309−0.895−0.7151.448
Ptf 13−0.3370.038−0.2772.253−0.0580.7091.7990.0740.939
Ptf 14−0.7420.1960.490−1.4190.3750.770−0.525−0.4380.845
Ptf 150.0460.156−0.1112.848−1.122−0.5571.719−1.055−0.499
Ptf 16−1.201−0.746−0.9971.9780.8543.0212.1920.7903.791
Ptf 17−0.422−0.352−0.0671.7491.4011.5412.7931.2463.655
Ptf 18−0.4540.5680.2061.7640.6781.827−1.3370.8242.230
Ptf 19−0.992−1.519−0.8672.611−0.730−0.0211.821−0.6590.083
Ptf 200.6261.1531.022−1.3590.4530.976−0.5960.4791.059
Ptf 211.4732.7401.9911.6690.3071.7861.6690.1511.706
Ptf 221.7142.3101.4830.1701.4201.812−0.0921.4251.888
Ptf 230.4210.0960.2512.1480.1341.715−1.478−0.2052.009
Ptf 241.2510.6640.471−0.7430.8951.388−0.3680.9301.471
Ptf 251.0970.7680.838−0.5210.8241.4551.8510.8331.509
Ptf 26−1.036−0.979−0.027−1.2001.2021.5942.0651.1481.483
Ptf 27−0.4320.3982.164−0.9170.7011.209−0.6670.7121.280
Ptf 282.1111.7291.234−0.7180.7821.019−0.1040.7981.033
Ptf 29−0.572−1.026−1.5572.5880.3522.8032.962−0.2853.126
Ptf 30−1.391−0.508−0.4701.806−0.693−0.1892.648−0.8420.287
Ptf 310.1100.357−0.0052.0304.420−0.7511.6764.702−0.713
Ptf 32−1.0843.046−1.119−1.280−0.5600.6321.904−0.5190.783
Ptf 33−0.158−0.226−0.373−1.210−0.0880.4002.192−0.1570.329
Ptf 340.2950.8652.266−1.1541.1153.1481.6951.1122.939
Ptf 35−1.4992.7482.188−1.4550.1471.578−1.325−0.0091.453
Ptf 36−0.6790.3791.114−1.3941.4065.6781.9151.4635.460
Ptf 37               0.7140.6150.486−0.812−0.2460.248−0.835−0.1750.323
Ptf 38−0.7580.4550.382−1.8691.2102.1442.5261.0171.870
Ptf 393.1572.7322.0571.3012.0642.2370.9512.0832.283
Ptf 40−0.319−0.968−1.1681.693−0.805−0.0522.180−0.830−0.082
Ptf 410.3920.9420.603−0.6031.0981.149−0.9081.0901.157
Ptf 42−0.727−0.543−0.174−0.9740.5952.5612.0440.6472.729
Ptf 430.6550.1740.421−0.6370.6001.712−1.2660.6621.985
Ptf 44−0.799−0.2730.2392.3000.8763.5702.6450.8843.764
Ptf 45−1.2343.0872.192−0.6440.6664.1352.2370.4974.996
Ptf 463.4012.9682.6021.5902.1072.4231.2522.1672.472
Ptf 473.9014.5424.2201.6322.6094.1201.6462.5994.209
Ptf 480.423−0.0030.1201.8380.8761.3641.8620.9511.551
Ptf 49−0.1910.0740.414−1.1040.8311.5792.7200.8241.669
Ptf 500.8641.1741.042−1.4411.7722.3192.4311.6722.646
Ptf 51−0.816−0.897−1.0522.4460.6522.0131.861−0.7202.253
Ptf 52−0.717−0.617−0.6112.865−0.2310.326−1.236−0.1470.430
Ptf 53−0.1670.094−0.0642.077−0.4040.1791.763−0.3100.343
Ptf 541.0911.9441.8181.8920.2841.6201.8350.1801.602
Ptf 551.8301.9681.374−0.0311.3952.055−0.0171.4022.163
Ptf 560.528−0.106−0.019−1.3140.2171.081−1.004−0.2821.233
Ptf 57−0.742−0.627−0.5212.5140.0730.750−1.2600.1690.909
Ptf 58−1.143−1.136−0.8932.5180.1960.7101.8420.2470.803
Ptf 591.942−1.436−0.6501.6760.5440.743−0.2400.5880.804
Ptf 601.4761.7821.649−0.7161.0991.806−0.5191.1071.867
Ptf 611.2241.0420.967−0.0761.0012.157−0.7600.9932.301
Ptf 621.0420.5450.2041.8450.2270.948−0.6770.2991.075
Ptf 63−0.498−0.570−0.9232.5050.4961.9561.8490.5812.327
Ptf 64−0.762−0.562−0.4912.937−0.0790.564−1.298−0.0080.686
Ptf 65−0.581−0.445−0.4512.253−0.3970.1741.930−0.3350.297
Ptf 661.3242.0131.910−1.6030.8241.848−1.2500.7801.887
Ptf 671.3071.4091.063−0.2530.9902.175−0.6100.9812.323
Ptf 681.1010.8170.638−0.7770.5211.096−0.6550.5681.188
Ptf 69−0.111−0.283−0.3981.8071.0301.4941.9320.9721.371
Ptf 70−1.116−1.208−0.7352.3820.4430.752−0.7740.4430.793
Ptf 71−1.4981.899−1.1782.3230.6211.462−0.5840.6411.559
Ptf 72−0.960−1.210−1.0012.172−0.2680.414−1.021−0.1990.490
Ptf 732.198−1.564−1.096−0.9930.3880.758−0.202−0.4260.801
Ptf 741.7441.6380.936−1.1511.5801.4350.1651.5181.379
Ptf 751.0751.2370.7191.7271.2240.937−0.8691.1690.873
Ptf 76−0.516−0.138−0.2542.4550.2331.672−1.5510.3121.951
Ptf 77−0.949−0.871−0.8242.308−0.1420.4091.901−0.0500.495
Ptf 78−1.170−0.932−1.037−1.0310.1660.830−0.8310.2520.914
Ptf 790.7370.5630.128−1.5761.3341.634−1.3421.2941.475
Ptf 80−0.2050.183−0.0112.0240.0231.8202.325−0.0081.401
Ptf 811.0661.0401.0621.8710.4220.783−0.637−0.4830.881
Ptf 82−1.3420.0490.451−0.2900.4930.730−0.1980.5630.796
Ptf 83−1.5181.0090.683−0.0750.6741.261−0.1950.7451.317
Ptf 84−1.058−0.955−1.1351.7170.6551.4111.8200.7081.592
Ptf 85−0.5150.1470.2192.1040.3071.354−1.5060.3071.562
Ptf 860.6020.3870.536−1.5220.6121.581−1.2610.6741.862
Ptf 870.3640.1600.322−1.5000.5921.9441.749−0.6872.414
Ptf 88−1.199−0.564−0.862−0.5620.6361.475−0.7490.7071.565
Ptf 891.4671.7681.516−1.5801.2982.332−0.6151.2912.392
Ptf 90−0.536−0.3230.1251.9010.0221.0812.532−0.0221.036
Ptf 910.8951.2841.0552.4890.1512.3072.666−0.1432.191
Ptf 920.1700.2180.617−0.5080.2461.8821.812−0.2322.093
Ptf 931.5142.1271.8070.0481.2301.880−0.4361.2571.981
Panel C—Diebold–Mariano test comparing AR(1) on NS vs. listed models
VAR(1) on NSAR(1) for ReturnsHistorical Average
h = 1h = 6h = 12h = 1h = 6h = 12h = 1h = 6h = 12
Diversified−0.669−0.5271.1172.594−0.534−1.2741.973−0.515−1.269
Industrial−0.5760.3701.847−0.3710.3490.390−1.499−0.3530.375
Lodging/Resorts−0.9940.2040.7101.736−0.142−0.3371.824−0.139−0.323
Office−0.885−0.057−0.0642.286−1.595−0.8031.701−1.598−0.774
Residential−0.204−0.1160.4491.676−0.698−1.0582.513−0.666−1.054
Retail1.580−0.668−0.3621.893−0.8831.768−1.178−0.877−1.586
Self Storage−0.3630.9630.036−0.1310.058−0.4572.379−0.063−0.425
All REITs−0.5720.1581.7282.7900.506−0.9741.8240.539−0.916
Mortgage REITs0.2430.8501.4050.9920.9730.2670.0911.1070.290
Panel D—Diebold–Mariano test comparing AR(1) vs. NS to listed models
VAR(1) on NSAR(1) for ReturnsHistorical Average
h = 1h = 6h = 12h = 1h = 6h = 12h = 1h = 6h = 12
AAA0.7200.3570.386−1.169−0.076−0.314−1.117−0.072−0.290
AA1.8320.472−0.236−1.113−0.364−0.850−0.430−0.354−0.829
BBB1.4820.178−0.169−0.9540.012−0.350−0.2170.020−0.302
A1.5050.219−0.3401.884−0.291−0.710−0.484−0.289−0.683
Panel E—Diebold–Mariano test comparing AR(1) on NS vs. listed models
VAR(1) on NSAR(1) for ReturnsHistorical Average
h = 1h = 6h = 12h = 1h = 6h = 12h = 1h = 6h = 12
Gold−1.159−1.5941.8390.7871.8581.5720.6071.6831.570
Silver−0.100−1.614−1.0320.1525.8752.313−0.1363.1182.166
Platinum2.1832.1472.7481.4602.5635.1881.8932.5494.752
Copper−0.346−0.214−0.2830.5152.7122.3931.7472.6052.412
Brent Crude0.7850.171−0.0041.3392.5132.5380.7292.4982.458
Gasoil0.6600.081−0.1041.7451.8922.5670.9071.7252.438
Light crude oil0.7040.036-0.0630.8632.2923.4140.2382.3862.919
Natural Gas0.662−1.2160.1310.205−1.1021.434−0.059−1.5391.604
Cotton−1.218−1.386−0.8571.4021.2833.1141.1581.2013.421
Cocoa0.451−0.007−0.2061.821−0.7630.437−0.362−0.8710.535
Coffee2.8301.829−0.8980.0680.4111.632−0.2950.1041.685
Corn−0.0610.4703.1810.325−0.6471.662−1.077−1.1441.776
Lumber0.1230.5401.536−0.377−0.2132.186−0.795−0.5442.015
Soybean Oil−0.660−0.299−0.3541.1441.3062.0330.2260.5011.983
Soybeans0.0850.3031.3930.191−0.4391.861−0.521−0.6832.062
Wheat0.460−0.752−0.3510.926−0.8482.093−0.802−0.9182.059
Live Cattle0.9230.986−0.0201.3390.7461.7910.4100.5251.882
Panel F—Summary of Diebold–Mariano test results
VAR(1) on NSAR(1) for ReturnsHistorical Average
h = 1h = 6h = 12h = 1h = 6h = 12h = 1h = 6h = 12
Equity45452114940
Treasuries000122222
REITs100601600
Corporate Bonds000100000
Commodities111000000
The table contains results of the Diebold–Mariano test. In particular, the table shows test statistics for the three models (VAR(1) on NS factors, AR(1) on returns, and Historical Average) considered for forecasting returns of US Treasury securities, equities, REIT, corporate bonds, and commodity against an AR(1) on Nelson–Siegel factors, in the pseudo out-of-sample period going from January 2011 to September 2020. Test statistics are shown according to three dimensions: (i) the model used to compute the forecasts, (ii) the returns series considered (one of 151 among the five asset classes analyzed), and (iii) the forecast horizon (one-step-ahead, six-step-ahead, and twelve-step-ahead forecasts). Boldfaced values represent series for which we can reject the null hypothesis of equal predictive accuracy in a 5%-sized, one-tailed test. Panels A to E show DM test statistics for each of the five asset classes considered, while Panel F shows a summary of DM tests by asset class and forecast horizon. Panel F summarizes how many times we can reject the null hypothesis of equal predictive accuracy in a 5%-sized, one-tailed test. Numbers in brackets in Panel F represent the number of series analyzed for each asset class.
Table A3. Root mean squared forecast errors (RMSE) for government bond yields.
Table A3. Root mean squared forecast errors (RMSE) for government bond yields.
ModelRMSE (h = 1)RMSE (h = 6)RMSE (h = 12)
VAR(1) on NS1.880.220.03
AR(1) on Yields9.300.930.44
VAR(1) on Yield Changes14.441.560.77
Random Walk8.071.030.54
Slope Regression12.750.600.84
VAR(1) on Yields10.8821.900.77
Regression on Principal Components177.8924.790.77
Note: The table presents root mean squared forecast errors (RMSEs) for government bond yields at different maturities (3 months, 1 year, 5 years, and 10 years) across three forecast horizons: one-step-ahead (h = 1), six-step-ahead (h = 6), and twelve-step-ahead (h = 12). The RMSEs were computed for different forecasting models, including VAR(1) on Nelson–Siegel (NS) factors, AR(1) on yields, and benchmark methods such as the Random Walk and Principal Component Regression. Lower RMSE values indicate better forecasting accuracy.

Appendix D. Combining Forecast with Goyal and Welch Predictors

Table A4. Combining out-of-sample R O S 2 (%) for 6-month horizon.
Table A4. Combining out-of-sample R O S 2 (%) for 6-month horizon.
Yield CurveTreasury ReturnsEquity Returns
MaturityGW PredictorsGW + NS FactorMaturityGW PredictorsGW + NS FactorSectorGW PredictorsGW + NS Factor
3 M3.24 ***4.77 ***3 M0.71 ***5.77 ***Mkt0.25 **0.13 *
1 Y3.32 ***4.88 ***6 M3.38 ***5.66 ***Food0.250.09
5 Y2.54 ***3.28 ***1 Y0.87 ***0.95 ***Mines0.060.13 *
10 Y2.25 ***3.03 ***2 Y0.140.66 **Oil−0.09−0.11
3 Y0.080.32 *Textiles0.250.06
5 Y−0.011.52 **Consumer Durables0.53 ***0.31 *
7 Y0.040.08Chemicals0.140.09
10 Y0.140.24Consumer0.45 **0.40 **
Construction0.21 *0.14
Steel0.030.01
Fabricated Products0.130.26 *
Machinery0.030.01
Cars0.31 **−0.01
Transportation0.16 *0.01
Utilities−0.09−0.17
Retail Stores0.080.13
Financials0.38 **0.26 *
Other0.52 ***0.53 **
REITsCorporate BondsCommodities
SectorGW PredictorsGW + NS FactorRatingGW PredictorsGW + NS FactorCommodityGW PredictorsGW + NS Factor
Diversified−0.18−0.27AAAA0.120.16Gold0.130.19
Industrial−0.09−0.79AA0.45 *0.55 *Silver0.100.04
Lodging/Resorts0.330.32BBB0.46 *0.33Platinum−0.34−0.39
Office−0.21−0.23A0.85 **0.74 *Copper−0.30−0.33
Residential−0.07−0.23 Brent Crude−0.12−0.21
Retail−0.17−0.07 Gasoil−0.26−0.35
Self Storage−0.19−0.07 Light crude oil−0.170.16
All REITs−0.23−0.04 Natural Gas0.010.20
Mortgage REITs0.15 *0.03 Cotton0.35 **0.09
Cocoa−0.03−0.02
Coffee−0.15−0.18
Corn−0.09−0.17
Lumber0.38 ***0.24 **
Soybean Oil−0.06−0.20
Soybeans−0.09−0.15
Wheat−0.04−0.03
Live Cattle−0.04−0.11
The table reports the out-of-sample R O S 2 (%) for a 6-month horizon, computed as the median of the combination forecasts. The predictive variables included in the combination models consist of various macro-financial indicators. These include valuation ratios such as the dividend–price ratio (D/P), the dividend yield (D/Y), the earnings–price ratio (E/P), and the dividend–payout ratio (D/E). Additionally, measures of market risk and performance such as stock variance (SVAR), the book-to-market ratio (B/M), and net equity expansion (NTIS) are considered. The analysis also incorporates interest rate variables, including the Treasury bill rate (TBL), the long-term yield (LTY), the long-term return (LTR), and the term spread (TMS), along with credit risk indicators such as the default yield spread (DFY) and the default return spread (DFR). Finally, inflation (INFL) is included, following Welch and Goyal (2008), with the predictive regression using the previous month’s inflation rate due to the lag in data availability. The table compares combination forecasts with and without NS factors to assess their impact on predictive accuracy. The out-of-sample period spans from January 2011 to December 20200. Statistical significance for the R O S 2 statistic is based on the p-value for the Clark and West (2007) out-of-sample MSPE-adjusted statistic. The test corresponds to a one-sided hypothesis where the null states that the competing forecasting model has an expected squared prediction error equal to that of the historical average benchmark forecasting model. The alternative hypothesis posits that the competing forecasting model exhibits a lower expected squared prediction error relative to the benchmark. Statistical significance is denoted as follows: for the 10% level, for the 5% level, and for 1%.

Notes

1
The stochastic discount factor, also called the pricing kernel, is a random variable used in asset pricing to discount future payoffs. It ensures that the expected discounted payoffs equal current asset prices. The SDF captures time value, risk preferences, and market conditions.
2
Fixing λ allows us to avoid estimating the baseline NS specification using nonlinear least squares, enhancing the simplicity and robustness of the forecasts obtained from the model. In fact, Diebold et al. (2006) emphasized that NLS estimation can encounter issues such as local minima or convergence failures.
3
In Equation (4) and in all subsequent forecasting models, we use a caret to denote the final predictions (of factors and yields) that we assign an economic meaning to and that we use to assess the forecast accuracy of the different models. A tilde is used to indicate intermediate-step parameter estimates that are instrumental in computing the final predictions.
4
In unreported tests, we tried to perform formal model specification searches concerning the number of VAR lags, and in particular investigated the performance for the yield series of a VAR(2) model, finding a similar or uniformly worse predictive performance.
5
The use of cointegrated VAR models for interest rate forecasting does not consistently outperform other methods. The inclusion of many yields increases model complexity, which can lead to overfitting and poor out-of-sample predictions. Despite improvements like cubic splines and ECM approaches, these models often overlook the fact that bond prices (yields) reflect all available information about future interest rates, limiting their forecasting effectiveness (Duffee, 2013).
6
In the case of Treasury data, even though an obvious (yet non linear) relationship exists between (the changes in ex ante) yields and (ex post, realized) returns, even when the NS factors are used, the forecast results obtained for returns are not presumed to be the same as the ones obtainable for yields.
7
Some literature labels this third predictive benchmark as a ”random walk”, which obviously is at odds with the restriction γ ˜ i , t = 0 applied to Equation (15). The label derives from the fact that restricting θ i , t = 1 in l n P i , t + h | t = θ 0 , i , t + θ 1 , i , t l n P i , t + ϵ i , t + h , one obtains the model in (15) with γ ˜ i , t = 0 .
8
These predictors are available at Goyal’s personal site: https://sites.google.com/view/agoyal145/home?authuser=0, accessed on 20 March 2025.
9
Negative yields, which may arise due to the low interest rate environment or specific technical market factors, such as cash and repurchase agreements in US Treasury markets, were excluded from the CMT calculation. This ensures that the dataset included only positive and economically meaningful yields.
10
Maturities such as the 1-month, 2-month, 20-year, and 30-year constant maturity rates, which were discontinued during the study period, were excluded.
11
12
To qualify for inclusion, bonds had to have an investment-grade rating (based on an average of Moody’s, S&P, and Fitch) and meet specific criteria regarding maturity (a remaining maturity of at least one year at the time of inclusion in the index), a fixed coupon schedule (i.e., zero coupon bonds are excluded), and minimum amount outstanding (USD 250 million). The index was capitalization-weighted based on the outstanding amount of each security at the end of each month.
13
Nonetheless, this is considered partially unrealistic, also in light of the recent results in Kim and Choi (2017).
14
The sample autocorrelation in the forecast errors further supports these findings, with errors from the AR(1) and VAR(1) models for yields being serially correlated for most maturities and horizons, except for longer maturities (3-, 5-, 7-, and 10-year yields) at one-step-ahead horizons.
15
Because the results for MCS test are starker and allow us to draw sharper conclusions, in the main text, we present the tables documenting the findings from the MCS methodology and report in an Appendix those from the simpler, but intuitive DM tests.
16
While the two test statistics used in the MCS procedure, TMax and TR, produced similar results, they differed in the total number of models excluded from the superior set. The TMax statistic tended to exclude fewer models from M* across all forecasting horizons compared to the TR statistic. Using the TMax statistic, we excluded 187, 62, and 181 models from M* across all return series for the one-step-ahead, six-step-ahead, and 12-step-ahead forecasts, respectively. Using the TR statistic, we excluded 229, 76, and 188 models from M* for the same horizons. In the case of the return series, we observed the opposite of what was found for the bond yield series, with the TMax test appearing less selective than the TR statistic. This difference was likely due to the fact that only four models were compared when forecasting returns, whereas up to eight models were compared when forecasting bond yields, making pairwise comparisons based on the TR statistic more powerful than the multivariate comparisons used by the TMax statistic.
17
As a sub-question, we also explored whether an AR(1) or a VAR(1) model based on (predicted) NS factors could be preferable in forecasting returns. Although no definitive conclusion emerged, our analysis suggested that the two types of models did not exhibit equivalent performance across different asset classes and forecast horizons. In overall terms, the VAR(1) NS tended to outperform the AR(1) NS, particularly in the case of Treasury securities, equities, and corporate bonds. However, the AR(1) NS returned a slightly superior performance in the prediction of REIT returns. As far as the forecast horizon is concerned, the AR(1) NS performance relative to the VAR(1) improved as the horizon lengthened, especially for US Treasuries and corporate bonds. Conversely, the VAR(1) model proved more effective in forecasting REIT returns.

References

  1. Campbell, J. Y., & Thompson, S. B. (2008). Predicting excess stock returns out of sample: Can anything beat the historical average? The Review of Financial Studies, 21(4), 1509–1531. [Google Scholar] [CrossRef]
  2. Chen, Y., & Niu, L. (2014). Adaptive dynamic Nelson–Siegel term structure model with applications. Journal of Econometrics, 180(1), 98–115. [Google Scholar] [CrossRef]
  3. Christensen, J. H. E., Diebold, F. X., & Rudebusch, G. D. (2011). The affine arbitrage-free class of Nelson–Siegel term structure models. Journal of Econometrics, 164(1), 4–20. [Google Scholar] [CrossRef]
  4. Clark, T. E., & West, K. D. (2007). Approximately normal tests for equal predictive accuracy in nested models. Journal of Econometrics, 138(1), 291–311. [Google Scholar] [CrossRef]
  5. Diebold, F. X., & Li, C. (2006). Forecasting the term structure of government bond yields. Journal of Econometrics, 130(2), 337–364. [Google Scholar] [CrossRef]
  6. Diebold, F. X., & Mariano, R. S. (2002). Comparing predictive accuracy. Journal of Business & Economic Statistics, 20(1), 134–144. [Google Scholar]
  7. Diebold, F. X., & Rudebusch, G. D. (2013). Yield curve modeling and forecasting: The dynamic Nelson-Siegel approach. Princeton University Press. [Google Scholar]
  8. Diebold, F. X., Rudebusch, G. D., & Aruoba, S. B. (2006). The macroeconomy and the yield curve: A dynamic latent factor approach. Journal of Econometrics, 131(1–2), 309–338. [Google Scholar]
  9. Drobetz, W., Stürmer, S., & Zimmermann, H. (2002). Conditional asset pricing in emerging stock markets. Swiss Journal of Economics and Statistics (SJES), 138(IV), 507–526. [Google Scholar]
  10. Duffee, G. (2013). Forecasting interest rates. In G. Elliott, & A. Timmermann (Eds.), Handbook of economic forecasting (Vol. 2, pp. 385–426). North Holland. [Google Scholar]
  11. Favero, C. A., Niu, L., & Sala, L. (2012). Term structure forecasting: No-arbitrage restrictions versus large information set. Journal of Forecasting, 31(2), 124–156. [Google Scholar] [CrossRef]
  12. Fernandes, M., & Vieira, F. (2019). A dynamic Nelson–Siegel model with forward-looking macroeconomic factors for the yield curve in the US. Journal of Economic Dynamics and Control, 106, 103720. [Google Scholar] [CrossRef]
  13. Frankel, J. A. (2008). The effect of monetary policy on real commodity prices. In Asset prices and monetary policy (pp. 291–333). University of Chicago Press. [Google Scholar]
  14. Guidolin, M., McMillan, D. G., & Wohar, M. E. (2013). Time varying stock return predictability: Evidence from US sectors. Finance Research Letters, 10(1), 34–40. [Google Scholar] [CrossRef]
  15. Guidolin, M., & Pedio, M. (2019). Forecasting and trading monetary policy effects on the riskless yield curve with regime switching Nelson–Siegel models. Journal of Economic Dynamics and Control, 107, 103723. [Google Scholar]
  16. Hansen, P. R., Lunde, A., & Nason, J. M. (2011). The model confidence set. Econometrica, 79(2), 453–497. [Google Scholar] [CrossRef]
  17. Hillebrand, E., Huang, H., Lee, T.-H., & Li, C. (2018). Using the entire yield curve in forecasting output and inflation. Econometrics, 6(3), 40. [Google Scholar] [CrossRef]
  18. Holmström, B., & Tirole, J. (2001). LAPM: A liquidity-based asset pricing model. The Journal of Finance, 56(5), 1837–1867. [Google Scholar] [CrossRef]
  19. Kim, J. H., & Choi, I. (2017). Unit roots in economic and financial time series: A re-evaluation at the decision-based significance levels. Econometrics, 5(3), 41. [Google Scholar] [CrossRef]
  20. Nelson, C. R., & Siegel, A. F. (1987). Parsimonious modeling of yield curves. Journal of Business, 473–489. [Google Scholar] [CrossRef]
  21. Nymand-Andersen, P. (2018). Yield curve modelling and a conceptual framework for estimating yield curves: Evidence from the European Central Bank’s yield curves. In ECB statistics paper. European Central Bank. [Google Scholar]
  22. Pesando, J. E. (1979). On the random walk characteristics of short-and long-term interest rates in an efficient market. Journal of Money, Credit and Banking, 11(4), 457–466. [Google Scholar] [CrossRef]
  23. Potì, V. (2018). A new tight and general bound on return predictability. Economics Letters, 162, 140–145. [Google Scholar]
  24. Rapach, D., Strauss, J., & Zhou, G. (2010). Out-of-sample equity premium prediction: Consistently beating the historical average. Review of Financial Studies, 23(2), 821–862. [Google Scholar] [CrossRef]
  25. Rapach, D., & Zhou, G. (2013). Forecasting stock returns. In Handbook of economic forecasting (Vol. 2, pp. 328–383). Elsevier Science & Technology. [Google Scholar]
  26. Steeley, J. M. (2014). Forecasting the term structure when short-term rates are near zero. Journal of Forecasting, 33(5), 350–363. [Google Scholar] [CrossRef]
  27. Toczydlowska, D., & Peters, G. W. (2018). Financial big data solutions for state space panel regression in interest rate dynamics. Econometrics, 6(3), 34. [Google Scholar] [CrossRef]
  28. Welch, I., & Goyal, A. (2008). A comprehensive look at the empirical performance of equity premium prediction. The Review of Financial Studies, 21(4), 1455–1508. [Google Scholar] [CrossRef]
  29. Xiang, J., & Zhu, X. (2013). A regime-switching Nelson–Siegel term structure model and interest rate forecasts. Journal of Financial Econometrics, 11(3), 522–555. [Google Scholar] [CrossRef]
  30. Yu, W.-C., & Zivot, E. (2011). Forecasting the term structures of Treasury and corporate yields using dynamic Nelson-Siegel models. International Journal of Forecasting, 27(2), 579–591. [Google Scholar] [CrossRef]
Figure 1. US yield curve level and Nelson–Siegel long-term factor. This figure illustrates the long-term level factor ( β 0 , t ) derived from the Nelson–Siegel model. The data span the sample January 1990–December 2020.
Figure 1. US yield curve level and Nelson–Siegel long-term factor. This figure illustrates the long-term level factor ( β 0 , t ) derived from the Nelson–Siegel model. The data span the sample January 1990–December 2020.
Econometrics 13 00017 g001
Figure 2. US yield curve slope and Nelson–Siegel short-term factor. This figure illustrates the short-term slope factor ( β 1 , t ) derived from the Nelson–Siegel model. The data span the sample January 1990–December 2020.
Figure 2. US yield curve slope and Nelson–Siegel short-term factor. This figure illustrates the short-term slope factor ( β 1 , t ) derived from the Nelson–Siegel model. The data span the sample January 1990–December 2020.
Econometrics 13 00017 g002
Figure 3. US yield curve curvature and Nelson–Siegel medium-term factor. This figure illustrates the medium-term curvature factor ( β 2 , t ) from the Nelson–Siegel model. The data span the sample January 1990–December 2020.
Figure 3. US yield curve curvature and Nelson–Siegel medium-term factor. This figure illustrates the medium-term curvature factor ( β 2 , t ) from the Nelson–Siegel model. The data span the sample January 1990–December 2020.
Econometrics 13 00017 g003
Table 1. Summary statistics for the fitted Nelson–Siegel factors.
Table 1. Summary statistics for the fitted Nelson–Siegel factors.
FactorMeanStd. Dev.MinimumMaximum ρ 1 ^ ρ 12 ^ ρ 30 ^ ADF
β 1 ^ 4.8931.9550.7479.2670.9800.7760.616−1.398
β 2 ^ −2.2051.582−5.2140.9800.9730.523−0.163−2.235
β 3 ^ −1.8012.226−6.7964.7040.9540.5820.200−2.984
The table presents summary statistics for the estimated Nelson–Siegel factors, using data for a January 1990–December 2020 sample. The augmented Dickey–Fuller (ADF) test was computed using the Schwarz information criterion (SIC) with up to 16 lags. The critical values to reject the null hypothesis of no unit roots were −3.452, −2.870, and −2.571 at the 1%, 5%, and 10% size levels, respectively.
Table 2. Diebold–Mariano tests concerning alternative forecasts of US treasury rates.
Table 2. Diebold–Mariano tests concerning alternative forecasts of US treasury rates.
Yield Maturityh = 1h = 6h = 12
VAR(1) on NS
3 Months2.052−0.5520.059
1 Y0.8350.1560.210
5 Y−1.5380.6310.249
10 Y0.5010.3890.137
AR(1) on Yields
3 Months2.5040.7330.273
1 Y4.6140.3060.080
5 Y2.8011.4350.926
10 Y1.3351.4241.176
VAR(1) on Yield Changes
3 Months2.6230.430−0.081
1 Y3.4310.111−0.188
5 Y2.1351.3250.839
10 Y2.0621.4241.176
Random Walk
3 Months2.2210.440−0.050
1 Y4.3140.125−0.156
5 Y2.5271.3510.850
10 Y1.5351.4441.188
Slope Regression
3 Monthsn.a.n.a.n.a.
1 Y5.408−0.411−1.364
5 Y2.2680.9310.440
10 Y1.9670.8780.695
VAR(1) on Yields
3 Months3.6924.4333.406
1 Y4.358−1.976−1.057
5 Y2.9633.564−1.825
10 Y1.4587.1653.938
Predictive Regression on Principal Components
3 Months6.097−1.564−0.975
1 Y6.3792.3792.780
5 Y14.8485.8253.533
10 Y20.3287.5572.278
The table shows the results of the Diebold–Mariano tests. In particular, the table shows the DM test statistic for the null of identical predictive accuracy applied to seven models compared to the AR(1) model applied to the Nelson–Siegel factors. The pseudo out-of-sample period spans January 2011–December 2020. Boldfaced values indicate combinations of maturities, horizon, and tested models for which we could reject the null hypothesis of equal predictive accuracy vs. the AR(1) model for NS factors in a 5%-sized, one-tailed test.
Table 3. Model confidence set result for US Treasury rates.
Table 3. Model confidence set result for US Treasury rates.
Panel A—MCS test results for 1-step-ahead forecasts using the TMax test statistic
Rankings Summary
Model3 Mo1 Y5 Y10 YModelExcluded1st to 3rd4th to 6th7th to 8th
AR(1) on NSexcludedexcludedexcluded7AR(1) on NS3--1
VAR(1) on NSexcludedexcludedexcluded6VAR(1) on NS3-11
Slope Regressionexcluded321Slope Regression13--
AR on Yields344excludedAR on Yields-13-
VAR(1) on Yields2114VAR(1) on Yields-31-
VAR(1) on Yield Changes1232VAR(1) on Yield Changes-4--
Principal ComponentsexcludedexcludedexcludedexcludedPrincipal Components4---
Random Walk4553Random Walk-13-
Panel B—MCS test results for 6-step-ahead forecasts using the TMax test statistic
Model3 Mo1 Y5 Y10 YModelExcluded1st to 3rd4th to 6th7th to 8th
AR(1) on NS4556AR(1) on NS--4-
VAR(1) on NS5254VAR(1) on NS-13-
Slope Regressionexcluded643Slope Regression112-
AR on Yields1115AR on Yields-31-
VAR(1) on Yields2114VAR(1) on Yields-31-
VAR(1) on Yield Changes3432VAR(1) on Yield Changes-31-
Principal ComponentsexcludedexcludedexcludedexcludedPrincipal Components4---
Random Walk2321Random Walk-4--
Panel C—MCS test results for 12-step-ahead forecasts using the TMax test statistic
Model3 Mo1 Y5 Y10 YModelExcluded1st to 3rd4th to 6th7th to 8th
AR(1) on NS5266AR(1) on NS-13-
VAR(1) on NS4355VAR(1) on NS-13-
Slope Regressionexcluded643Slope Regression112-
AR on Yields1114AR on Yields-31-
VAR(1) on Yieldsexcluded7excludedexcludedVAR(1) on Yields3--1
VAR(1) on Yield Changes3432VAR(1) on Yield Changes-41-
Principal Components6excludedexcludedexcludedPrincipal Components3-1-
Random Walk2521Random Walk-21-
Panel D—MCS test results for 1-step-ahead forecasts using the TR test statistic
Model3 Mo1 Y5 Y10 YModelExcluded1st to 3rd4th to 6th7th to 8th
AR(1) on NSexcludedexcludedexcluded7AR(1) on NS3--1
VAR(1) on NSexcludedexcludedexcluded6VAR(1) on NS3-1-
Slope Regressionexcluded321Slope Regression13--
AR on Yields3445AR on Yields-22-
VAR(1) on Yields2114VAR(1) on Yields-31-
VAR(1) on Yield Changes1232VAR(1) on Yield Changes-4--
Principal ComponentsexcludedexcludedexcludedexcludedPrincipal Components4---
Random Walk4553Random Walk--4-
Panel E—MCS test results for 6-step-ahead forecasts using the TR test statistic
Model3 Mo1 Y5 Y10 YModelExcluded1st to 3rd4th to 6th7th to 8th
AR(1) on NS4466AR(1) on NS-22-
VAR(1) on NS5554VAR(1) on NS-13-
Slope Regressionexcluded643Slope Regression121-
AR on Yields1115AR on Yields-22-
VAR(1) on Yieldsexcluded7excludedexcludedVAR(1) on Yields3-1-
VAR(1) on Yield Changes3232VAR(1) on Yield Changes-2
Principal Components68excludedexcludedPrincipal Components2-11
Random Walk2321Random Walk-31-
Panel F—MCS test results for 12-step-ahead forecasts using the TR test statistic
Model3 Mo1 Y5 Y10 YModelExcluded1st to 3rd4th to 6th7th to 8th
AR(1) on NS5266AR(1) on NS-22-
VAR(1) on NS4455VAR(1) on NS-22-
Slope Regressionexcluded643Slope Regression112-
AR on Yields1114AR on Yields-31-
VAR(1) on Yieldsexcluded7excludedexcludedVAR(1) on Yields3-1-
VAR(1) on Yield Changes3332VAR(1) on Yield Changes-211
Principal Components6excludedexcludedexcludedPrincipal Components-21-
Random Walk2521Random Walk-21-
The table shows results for the Model Confidence Set tests concerning the eight models considered for forecasting the US Treasury rates on a monthly basis in the pseudo out-of-sample period from January 2011 to December 2020. Panels A through C refer to the results obtained under the TMax test statistic. Panels D through F refer to the results obtained under the TR test statistic. Each panel shows, on the left-hand side, the rankings of the MCS test, where excluded means that the model was not part of the superior set M*; on the right-hand side, the table displays a summary of the MCS tests for the panel, counting model rankings across the four different yield maturities forecast (3 month, 1 year, 5 year, and 10 year). Panels A and D show the MCS test results for one-month-ahead forecasts, Panels B and E show results for six-month-ahead forecasts, and Panels C and F show results for 12-month-ahead forecasts.
Table 4. Model confidence set results for return series.
Table 4. Model confidence set results for return series.
Panel A—MCS test results for 1-step-ahead forecasts using the TMax test statistic (h = 1)
Excl.1st2nd3rd4thExcl.1st2nd3rd4th
Equities (113)
AR(1) on NS96034828%53%30%7%2%
VAR(1) on NS84356427%38%50%4%2%
AR(1) for returns6974171661%6%4%15%14%
Historical Average6137291354%3%6%26%12%
Treasuries (8)
AR(1) on NS1041213%0%50%13%25%
VAR(1) on NS053000%63%38%0%0%
AR(1) for returns1303113%38%0%38%13%
Historical Average3003238%0%0%38%25%
REITs (9)
AR(1) on NS080100%89%0%11%0%
VAR(1) on NS1070111%0%78%0%11%
AR(1) for returns6102067%11%0%22%0%
Historical Average7011078%0%11%11%0%
Corporate Bonds (4)
AR(1) on NS004000%0%100%0%0%
VAR(1) on NS040000%100%0%0%0%
AR(1) for returns3000175%0%0%0%25%
Historical Average1003025%0%0%75%0%
Commodities (17)
AR(1) on NS5136229%6%18%35%12%
VAR(1) on NS5045329%0%24%29%18%
AR(1) for returns21401012%82%0%6%0%
Historical Average5250529%12%29%0%29%
Equities (113)
AR(1) on NS21184353319%16%4%31%31%
VAR(1) on NS14109384212%9%8%34%37%
AR(1) for returns1286311101%25%56%10%9%
Historical Average157371531%50%33%13%3%
Panel B—MCS test results for 6-step-ahead forecasts using the TMax test statistic (h = 6)
Excl.1st2nd3rd4thExcl.1st2nd3rd4th
Treasuries (8)
AR(1) on NS2220225%25%25%0%25%
VAR(1) on NS060020%75%0%0%25%
AR(1) for returns2042025%0%50%25%0%
Historical Average2004225%0%0%50%25%
REITs (9)
AR(1) on NS041040%44%11%0%44%
VAR(1) on NS024300%22%44%33%0%
AR(1) for returns1031411%0%33%11%44%
Historical Average031500%33%11%56%0%
Corporate Bonds (4)
AR(1) on NS003010%0%0%75%0%
VAR(1) on NS040000%100%0%0%0%
AR(1) for returns000220%0%0%50%50%
Historical Average001210%0%25%50%25%
Commodities (17)
AR(1) on NS5245129%12%24%29%6%
VAR(1) on NS8421247%24%12%6%12%
AR(1) for returns0110600%65%0%35%0%
Historical Average5070529%0%41%0%29%
Panel C—MCS test results for 12-step-ahead forecasts using the TMax test statistic (h = 12)
Excl.1st2nd3rd4thExcl.1st2nd3rd4th
Equities (113)
AR(1) on NS6852221660%4%2%19%14%
VAR(1) on NS6233242155%3%3%21%19%
AR(1) for returns25154422%45%48%4%2%
Historical Average35449523%48%43%4%2%
Treasuries (8)
AR(1) on NS2240025%25%50%0%0%
VAR(1) on NS061010%75%13%0%13%
AR(1) for returns2002425%0%0%25%50%
Historical Average2014125%0%13%50%13%
REITs (9)
AR(1) on NS025020%22%56%0%22%
VAR(1) on NS062100%67%22%11%0%
AR(1) for returns2020522%0%22%0%56%
Historical Average2106022%11%0%67%0%
Corporate Bonds (4)
AR(1) on NS004000%0%100%0%0%
VAR(1) on NS040000%100%0%0%0%
AR(1) for returns000040%0%0%0%100%
Historical Average000400%0%0%100%0%
Commodities (17)
AR(1) on NS16001094%0%0%6%0%
VAR(1) on NS16000194%0%0%0%6%
AR(1) for returns3770018%41%41%0%0%
Historical Average1106006%59%35%0%0%
Equities (113)
AR(1) on NS2259225519%52%19%4%4%
VAR(1) on NS2044387418%39%34%6%4%
AR(1) for returns7361318365%5%12%16%3%
Historical Average6425173357%4%22%15%3%
Treasuries (8)
AR(1) on NS6110075%13%13%0%0%
VAR(1) on NS044000%50%50%0%0%
AR(1) for returns2303025%38%0%38%0%
Historical Average2013225%0%13%38%25%
Panel D—MCS test results for 1-step-ahead forecasts using the TR test statistic (h = 1)
Excl.1st2nd3rd4thExcl.1st2nd3rd4th
REITs (9)
AR(1) on NS080100%89%0%11%0%
VAR(1) on NS1052111%0%56%22%11%
AR(1) for returns6120067%11%22%0%0%
Historical Average7011078%0%11%11%0%
Corporate Bonds (4)
AR(1) on NS2010150%0%25%0%25%
VAR(1) on NS040000%100%0%0%0%
AR(1) for returns3001075%0%0%25%0%
Historical Average1000025%0%75%0%0%
Commodities (17)
AR(1) on NS5047129%0%24%41%6%
VAR(1) on NS6153235%6%29%18%12%
AR(1) for returns21401012%82%0%6%0%
Historical Average7230541%12%18%0%29%
Panel E—MCS test results for 6-step-ahead forecasts using the TR test statistic (h = 6)
Excl.1st2nd3rd4thExcl.1st2nd3rd4th
Equities (113)
AR(1) on NS211420283019%12%18%25%27%
VAR(1) on NS111213482910%11%12%42%26%
AR(1) for returns2334116212%29%36%14%19%
Historical Average254381452%48%34%12%4%
Treasuries (8)
AR(1) on NS5210063%25%13%0%0%
VAR(1) on NS060110%75%0%13%13%
AR(1) for returns5021063%0%25%13%0%
Historical Average5001263%0%0%13%25%
REITs (9)
AR(1) on NS040140%44%0%11%44%
VAR(1) on NS026100%22%67%11%0%
AR(1) for returns001350%0%11%33%56%
Historical Average032400%33%22%44%0%
Corporate Bonds (4)
AR(1) on NS003100%0%75%25%0%
VAR(1) on NS040000%100%0%0%0%
AR(1) for returns000310%0%0%75%25%
Historical Average001030%0%25%0%75%
Commodities (17)
AR(1) on NS6353035%18%29%18%0%
VAR(1) on NS8341147%18%24%6%6%
AR(1) for returns0111500%65%6%29%0%
Historical Average11021365%0%12%6%18%
Equities (113)
AR(1) on NS69513131361%4%12%12%12%
VAR(1) on NS663827958%3%7%24%8%
AR(1) for returns245494132%40%43%4%12%
Historical Average36040913%53%35%8%1%
Treasuries (8)
AR(1) on NS4310050%38%13%0%0%
VAR(1) on NS052010%63%25%0%13%
AR(1) for returns5001263%0%0%13%25%
Historical Average5012063%0%13%25%0%
Panel F—MCS test results for 12-step-ahead forecasts using the TR test statistic (h = 12)
Excl.1st2nd3rd4thExcl.1st2nd3rd4th
REITs (9)
AR(1) on NS023040%22%33%0%44%
VAR(1) on NS063000%67%33%0%0%
AR(1) for returns001530%0%11%56%33%
Historical Average012420%11%22%44%22%
Corporate Bonds (4)
AR(1) on NS031000%75%25%0%0%
VAR(1) on NS013000%25%75%0%0%
AR(1) for returns000040%0%0%0%100%
Historical Average000400%0%0%100%0%
Commodities (17)
AR(1) on NS15010188%0%6%0%6%
VAR(1) on NS15001188%0%0%6%6%
AR(1) for returns3761018%41%35%6%0%
Historical Average1106006%59%35%0%0%
This table contains results from the Model Confidence Set tests. In particular, the table shows test results for four models applied to forecast the returns on various asset classes over a pseudo out-of-sample period January 2011–September 2020. Panels A through C refer to results based on the TMax test statistic. Panels D through F refer to results based on the TR test statistic. Each panel is broken down into five sub-panels, each corresponding to a different asset class. Each panel shows summary results in absolute terms (how many times a model ranked 1st, 2nd, 3rd, and 4th or was excluded from the superior set) and in relative terms (out of the total number of return series considered for each asset class, which is shown in brackets). Panels A through D refer to one-step-ahead forecasts, Panels B and E to six-step-ahead predictions, and Panels C and F to 12-step-ahead forecasts.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Guidolin, M.; Ionta, S. Forecasting Asset Returns Using Nelson–Siegel Factors Estimated from the US Yield Curve. Econometrics 2025, 13, 17. https://doi.org/10.3390/econometrics13020017

AMA Style

Guidolin M, Ionta S. Forecasting Asset Returns Using Nelson–Siegel Factors Estimated from the US Yield Curve. Econometrics. 2025; 13(2):17. https://doi.org/10.3390/econometrics13020017

Chicago/Turabian Style

Guidolin, Massimo, and Serena Ionta. 2025. "Forecasting Asset Returns Using Nelson–Siegel Factors Estimated from the US Yield Curve" Econometrics 13, no. 2: 17. https://doi.org/10.3390/econometrics13020017

APA Style

Guidolin, M., & Ionta, S. (2025). Forecasting Asset Returns Using Nelson–Siegel Factors Estimated from the US Yield Curve. Econometrics, 13(2), 17. https://doi.org/10.3390/econometrics13020017

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop