3.2. Methodology
In this paper, we extend the FF5 model of Fama and French [
57] by modeling factor betas as a flexible threshold time-varying process. Specifically, the excess return,
, on industry portfolio
in period
t is defined as:
where
is the excess return on the market portfolio;
,
,
and
are the returns on a zero-cost portfolio of small minus big stocks, high minus low book-to-market value stocks, high minus low operating profitability stocks and low minus high investment stocks, respectively; and
is independently and normally distributed pricing error at time
with potentially heteroskedastic (time-varying) variance
, that is
. In this setting, the intercept
represents the alpha of the portfolio, while the other parameters,
,
,
and
, represent factor exposures associated with the market, size, value, profitability and investment factors, respectively.
For ease of exposition, we introduce a compact notation for the time-varying FF5 model. Defining a
vector at time
as
and
portfolio excess returns as
, we represent the system of equations in Equation (1) with the compact notation as:
where
, with
as an
identity matrix. Time varying alphas (
) and betas for all portfolios are collected into the
vector
and the
vector of white noise pricing errors (shocks)
is distributed as
In Equation (3), denotes a () zero vector and is a time-varying variance-covariance matrix. Here, is a lower triangular matrix with unit diagonal and is a diagonal matrix with time-varying variances on the diagonal; thus, is the logarithm of conditional variance of the th portfolio.
We introduce time-variation into the five-factor model by assuming that the relationship between the factors
and returns
is not constant through time and the time-variation is governed by a law of motion that defines how
evolves over time. As mentioned above, the most typical of such specifications is the random walk TVP model, which assumes that the
th element of
,
,
,
is given by
where
is a white noise innovation with variance
. Thus, Equations (1)–(4) together define a latent state TVP model where the latent states are defined by Equation (4) with the latent state innovation variance given by
. As can be seen, the model subsumes the constant parameter case when
, which would imply
for all
.
Although the FF5-TVP model defined in Equations (1)–(4) is conceptually flexible and looks appealing for modeling factor betas in the presence of structural breaks, i.e., time-varying or conditional betas, it suffers a serious shortcoming in that the model may generate spurious beta changes that can significantly reduce the empirical performance of the model, thus leading to large pricing errors (see, e.g., [
69]). In TVP models wherein a random walk process governs parameter changes, the parameters are programmed to evolve gradually over time, which may rule out adaptation to abrupt changes. Indeed, a random walk parameter process has unit root memory, and it will not forget the past in the absence of any new shock. Accordingly, the FF5-TVP model that imposes a pre-specified random walk parameter structure on the time variation process may be restrictive for modeling asset returns, since financial markets often experience sudden changes driven by market shocks. Loosely speaking, the coefficients in TVP models are assumed to change every time period, whereas the innovation variance
in the state Equation (4) is generally estimated to be small, thus
will be close to
. Thus, one can think of TVP models as models of “many small breaks” which is usually not consistent with return dynamic in financial markets since regime changes occur less frequently and coefficient changes are usually large. In contrast with TVP models, several studies consider structural change models with fewer breaks in parameters, bu.t when parameters do change, the size of the change is unrestricted and is allowed to be large (see, e.g., [
70,
71,
72,
73,
74,
75,
76,
77]).
Against this backdrop, following studies by, e.g., [
70,
71,
74,
75], who built on the previous literature that considers fewer structural breaks with unrestricted coefficient changes, we specify a spike-and-slab mixture distribution for the innovations
of the state Equation (1). Specifically, we specify the state equation innovation as follows:
where
is the time-varying innovation variance given by
In Equation (6),
and
are two distinct state innovation variances. Specifically,
is the spike variance which is set close to zero and
is the slab variance. The state innovation variances are endogenously determined from the data using indicator variables
, which are independent sequence of Bernoulli distributed variables, defined as:
Equations (1)–(7) thus form a mixture innovation model, which is relatively standard in the literature [
70,
71,
72,
75]. This specification also relates to multiple structural change models Pesaran et al. [
77] and Koop and Potter [
75], which build on the former work of Chib [
70]. The mixture innovation model with Bernoulli distributed indicator variables
states that when
equals one, the change in the parameters
is normally distributed with zero mean and variance
. In contrast, when
equals zero, the innovation variance is equal to
, which implies that the parameters have almost no variability, effectively making
constant, that is
. This last case occurs because the innovation variance
is close to zero when
. Therefore, the coefficients in the mixture innovation model alters between constant coefficient state and changing coefficient state with unrestricted magnitude of change in the latter case.
The mixture innovation TVP model has quite appealing features for modeling time varying betas. The difficulty arises from the stochastic nature of indicators
and their joint simulation along with latent states, which in turn can make the model computationally cumbersome. Huber et al. [
78] and Cuaresma et al. [
79] circumvented the computational issue by introducing a deterministic threshold switching mechanism to identify the indicator sequences
, leading to a TTVP model. Analogous to mixture innovation models, the TTVP approach also conditions on the states in order to simulate the indicator sequences
during the Monte Carlo Markov Chain simulation (MCMC), but the
are not directly simulated from their full conditional distribution, and, instead, they are identified from a threshold parameter
defined for each of the
process. Specifically, let
be the draws of
in the
th iteration of the MCMC simulation, conditional on draws of
in the (
)th, that is
iteration and the remaining parameters. Then,
in the
th iteration is defined as
where
denotes the (
)th draw of the threshold parameter
specific to the coefficient
and
. The threshold identification specification for
in Equation (8) makes it conditionally deterministic, freeing the MCMC algorithm from simulating
from its full conditional distribution. According to Equation (8),
is set equal to 1 if the absolute period-on-period change of the
th draw of
exceeds the (
)th draw of the corresponding threshold
. In this case, the innovation variance is set to a large value for
and the motion of betas is in the change state. In contrast, if the absolute period-on-period change of the
th draw of
is smaller than the (
)th draw of the threshold
, then
and the innovation variance is set close to zero, i.e.,
, effectively making
constant with
The model with the threshold specification in Equation (8) can thus be called the threshold mixture innovation model wherein regime shifts are governed by a deterministic law of motion. Compared to the standard threshold models, this model allows each parameter to switch across regimes independently of the state of other parameters. That is, the model does not require all parameters to simultaneously move into or out of a certain regime which allows some parameters to be constant during a given period while others might be changing.
Given that most financial return series display conditional heteroskedasticity, our model can easily incorporate this statistical feature into the modeling framework. Indeed, the specification for
in Equation (3) is quite general with a time-varying variance feature that allows for conditional heteroskedasticity commonly observed for financial returns. Additionally, the model also allows time varying covariances via the matrix
. Conditional heteroskedasticity is commonly modeled using GARCH models. In this paper, we model the conditional volatility using a stochastic volatility (SV) model due to its several attractive features and modeling flexibility in our framework (see, e.g., [
80]). Hence, we specify the logarithm of the variances to follow the first-order stationary SV process:
where white noise error
is normally distributed with zero mean and variance
, i.e.,
,
is the mean parameter and
is the persistence (autoregressive) parameter satisfying the stationarity condition
. For the covariances
, we specify a random walk process with spike-and-slab error variances [
70,
71,
72,
75], as explained above. With the introduction of the conditional heteroskedasticity through the SV process in Equation (9), we label the extended model as TTVP with stochastic volatility or TTVP-SV.