This section aims to provide the specification for the proposed model and the fitting methodology. It will also briefly touch on the materials and methods for conducting the empirical data analysis.
2.1. Model Specification
Time-series models with non-Gaussian error were previously considered, in some detail, by
Li and McLeod (
1988), and earlier in
Lawrance and Lewis (
1980) and
Ledolter (
1979). In this article, specifically, the AR and TAR models are further explored. The AR(p) model is defined as follows:
where
is the random error, assumed to follow a Gamma distribution; thus,
, where the density function is defined as
It should be noted that it is assumed that there is no drift term in the AR model; yet, the drift term could be easily incorporated into the model. The TAR(p) model, similar to that introduced in
Tong (
1978) but with a modification in the random error term, is defined as follows:
where
d ≥ 1 is the lag of the model and
T is the threshold, such that the model is divided into two regimes, according to the observations at
d time periods earlier. The pivot element
determines which regime
falls into, with
falling into the first regime if
is less than or equal to the threshold and into the second regime, otherwise. Each regime follows an AR(p) model, as defined above, with different AR and Gamma parameters.
2.2. Model Estimation
The fitting of the AR(p) model is introduced in this part, followed by the extension of the procedure to the fitting of the TAR(p) model. Both procedures are fitted with the maximum likelihood procedure, as it has been shown that the maximum likelihood estimators (MLE) are consistent for Gamma random error (
Li and McLeod 1988).
The MLE for AR(p) model are derived by
, where
l denotes the log-likelihood function, in the form of
where
is the random error.
To further reduce the dimension of estimation, a profile likelihood method is used. The Gamma parameters
and
are replaced by the MLE of
and
, using the result of
Wilk et al. (
1962) and the approximation
. Thus, the final estimates of
and
are as follows:
where
A stands for the arithmetic mean of the random error and
G is the geometric mean. Thus, the estimation of the model is achieved by estimating
, where
where
and
are estimated by Equation (
7) and, by Equation (
6), thus depend on
.
The Nelder–Mead method, which is a non-gradient optimization method, is proposed to optimize the negative log-likelihood function. Although such a procedure is heuristic and may converge to non-stationary points, its performance is much more stable than traditional gradient methods, such as the Hessian matrix method, which may not be easily calculated (even numerically) given the dependency of the log-likelihood function and as is quite complicated.
Additionally, before simply applying the method and carrying out the optimization, it should be noticed that, as the random error
is assumed to be Gamma, it is required to be greater than zero, which is also evident from the term
in the expression of the log-likelihood function. To reflect this non-negativity constraint, a penalty method is applied and the log-likelihood function becomes:
where
M is some large-enough number.
As the Nelder–Mead method is a heuristic search method, the choice of initial point may greatly affect the result and, thus, the estimation process takes various initial points and returns the result that yields a best fit, using the AIC or BIC. Furthermore, a candidate set of AR order p is given and the procedure searches for the best AR order within the set, again by AIC and BIC. Specifically, in the scope of the simulation study in this report, the initial points for are set uniformly within [0,1] and the initial points for T are set within [, ], where the sample mean of the RV, is the sample variance, and n is a pre-determined number to control the range, here set as 0.5. The step size of is set to be 0.25 and that of T to be 0.05. For empirical data analysis, values of in the ranges [0,0.5] and [0.5,1] are tested, with step size 0.125, and the results showed that the outcome from [0,0.5] almost always dominated that from [0.5,1] and, thus, the range [0,0.5] and step size 0.125 were used for .
The fitting of the TAR(p) model is essentially the same, except that the random errors are classified into two different regimes. Thus, the log-likelihood function is expressed as:
where
are the random errors corresponding to the observations in the first regime,
is the number of observations in the first regime, and
and
the corresponding counterparts in the second regime, respectively.
A final concern regarding the model estimation would be that, for the first few observations, the AR model may not be properly initiated, as there are no earlier observations. Therefore, the sample estimates are essentially estimated by a sample, with the first few observations serving only as the independent variable, but not the dependent variable; that is,
with
n being the truncated size. Additionally, as the AIC and BIC are typically applied on the same sample with the same sample size, to allow for the comparison between models of different AR order and lag, a common truncation of size 10 is applied in the scope of this study, as the AR order and lag investigated did not exceed this reasonably.
As with the process of fitting the AR(p) model, the fitting for TAR(p) searches for the best model of AR order p and lag d, where p and d are given in the pre-determined candidate set and the threshold T.
2.3. Empirical Data Analysis Preparation
The data used in this paper were the consolidated realized volatility data from
Shen et al. (
2018), which are the realized volatilities for 30 stocks traded on the New York Stock Exchange (NYSE).
Graphs of PACF and the corresponding naive 95% confidence bound, proposed by
Quenouille (
1949), were first examined for the stock data, which showed that the PACF of the stocks were mostly significant within a lag of 5 and demonstrated a somewhat cut-off property; thus suggesting the fitting the AR model was potentially a good starting point. Non-linear threshold type AR models were also considered as a supplement to the AR model.
After considering the practical reasonableness of the model and the computational power available, an AR order up to 5 and lag order up to 3 were considered.
The final model for each stock was determined by both considering the AIC and BIC and the associated Ljung–Box test for each criterion. If the model selected by the two criteria differed with a similar goodness of fit, a simpler model was preferred. Otherwise, the model that gave a better goodness of fit result was preferred.
The data set and R code used for the study are available upon request, from either author.