2.1.1. Time Series Models
Time series models regard the systems for prediction as black boxes, without considering the factors which influence the results. Instead, the models are established by fitting several historical data. Trend extrapolation models, exponential smoothing models, grey models, autoregressive models and moving average models are typical time series models, which have been widely used in nearly every prediction scene. Considering the impact of specific events, complex methods may perform better. For example, Djakaria et al. [
23] predicted the passenger demand of Djalaluddin Gorontalo Airport using a multiplicative of Holt-Winters exponential smoothing. Elena et al. [
24] compared the performance of various models including linear trend, quadratic trend, exponential trend, linear exponential smoothing (Holt’s Model), and autoregressive integrated moving average models on APT prediction, and the results showed that linear exponential smoothing model performed best facing the impact of COVID-19, with a level of reliability of 95%. Concerning the large number of research samples, linear trend, quadratic trend, cubic trend, power trend, exponential trend, logarithmic trend, quadratic exponential smoothing, cubic exponential smoothing and grey GM (1,1) models are considered in this paper.
To define the models for evaluation, an APT observation sequence is proposed, where k denotes a time metric.
The parameters of the linear trend model, the quadratic trend model, the cubic trend model, the power trend model, the exponential trend model and the logarithmic trend model by using the least squares method and the models can be described as:
The quadratic exponential smoothing model is based on a single exponential smoothing model, and the single exponential smoothing model can be obtained by:
where
α1 is the smoothing coefficient of the model,
is the single smoothing result at time
t and
is the single smoothing result at time
t + 1. When
t = 1,
is set to be the value of
x0. Then, the quadratic exponential smoothing model can be deduced as:
where
α2 is the smoothing coefficient of the model,
is the single smoothing result at time
t,
is the quadratic smoothing result at time
t and
is quadratic the smoothing result at time
t + 1. When
t = 1,
is set to be the value of
. The predicted result of APT
at time
can be calculated by:
where
and
are the parameters which can be calculated by Equations (5) and (6).
Similarly, the cubic exponential smoothing model is based on the quadratic exponential smoothing model, which is defined as:
where
α3 is the smoothing coefficient of the model,
is the quadratic smoothing result at time
t and
is quadratic the smoothing result at time
t + 1. The predicted result of APT
at time
t +
T can be calculated by:
where
,
and
are the parameters which can be calculated by Equations (9)–(11).
For the grey model GM (1,1), the raw sequence
can be accumulated by
to obtain the accumulated sequence. Then, the accumulated sequence can be used to fit
by:
where
The predicted value of the accumulated sequence can be primarily obtained by:
and the predicted result of APT
at time
t + 1 can be iteratively calculated by:
2.1.2. Causal Models
Causal models establish causal relationships between independent variables. Typical causal models include regression models and elastic coefficient models [
25]. For regression models, a unary linear regression model, a multiple linear regression, a stepwise regression model, a hybrid regression model, an elastic coefficient model and a proposed elastic-like scale model are considered.
To define the models for evaluation, consider there are M correlated variables, the variables can be collected as , where is the value of mth variable at time point t. For APT prediction, the variables are usually the economic indicators of the cities where the airports are located in.
The unary linear regression model can be by fitting:
using the least squares method. Furthermore, the multiple linear regression can be similarly described as:
By combing the regression models with the time series models, can be calculated first and then, and can be obtained.
The stepwise regression model introduces correlated variables step-by-step into the regression model until the model reflects the relationship significantly. The common methods to determine the variables include the forward method and backward method. Taking the forward method as an example, the modelling process is shown as follows.
Step 1: Establishing the unary linear regression models between APT and each variable , which can be obtained by Equation (17). Calculating the F inspection values of each regression coefficient of the models. The F inspection values can be denoted as , and the maximum value can be obtained by . For a given significance level α, the critical value is denoted as . If it satisfies that , the variable is selected as the regression variable and collected into set I1.
Step 2: Establishing the binary sets of with the other variables, which can be denoted as . Building the binary linear regression models between APT and the established binary sets. Calculating the F inspection values of each regression coefficient of the models. The F inspection values can be denoted as , and the maximum value can be obtained by . For a given significance level α, the critical value is denoted as . If it satisfies that , the variable is selected as the regression variable and collected into set I1. Otherwise, the process is stopped.
Step 3: Similarly, considering establishing multiple linear regression models. The variables are selected into I1, until the process is stopped.
A hybrid regression causal model is established by using the correlation analysis method and the unary linear regression model. The correlation between the APT and the variables is first analyzed. The most correlated variable is selected to build the unary linear regression model with APT. The correlation coefficient can be obtained by:
where
is the correlation coefficient of the APT
x and the variable
,
x(
t) is the APT of the studied airport at time
t,
is the mean value of APT,
is the value of
at time
t and
is the mean value of
.
The elastic coefficient models are indirect methods to forecast the results by fitting the correlated factors. An elastic coefficient model can be defined as:
where
Es is the elastic coefficient,
T is the target time point,
q′ is the growth rate of a correlated variable before time
t,
p′ is the growth rate of the APT of an airport before time
t,
p is the growth rate of the APT of an airport before time
T,
q is the growth rate of a correlated variable before time
T,
is the predicting result.
Zhang et al. [
26] analyzed the causal relationship between air transport and economic growth, and the results showed the relationship was bi-directional, especially for the underdeveloped area. For the developed area, air transport only showed a positive effect on economic growth. The relationship can be reflected by the relative value of the APT and that of economic indicators, which can be represented by:
Regarding the relative value RAI as the elastic coefficient, an elastic-like scale model can be established. By fitting the indicators with trend extrapolation models, the predicted values of the indicators can be obtained and then, APT can be predicated.
2.1.4. Analogy-Based Method
The analogy-based method was first used for economic business forecasting in the 1920s [
27], a forecasting process was proposed so that the experts can use the process to conduct analogy. Solvoll et al. [
1] carried out verification on an airport in Norway to compare the performances of elastic models and analogy-based methods, and found that under particular circumstances, analogy-based methods performed better. The employed process is shown below.
Supposing that xit is the APT of the studied airport at time t, xjt is the APT of the analogical airport at time t. The predicted result can be obtained by the following steps.
Step 1: Determining the target predicting time T.
Step 2: Determining the conditions for analogy. For the APT prediction, the analogical condition can be determined by referring to the airport with a similar APT and setting the allowed error θ. The analogical condition can be described as .
Step 3: Filtering the airports which satisfy the analogical condition from a database and all the J satisfied airports are collected as a set D.
Step 4: Obtaining the airports’ APT data from D as a collection .
Step 5: The predicted result can be calculated by: