Forecasting Energy-Related Carbon Dioxide Emissions in Thailand’s Construction Sector by Enriching the LS-ARIMAXi-ECM Model

Sutthichaimethee, Jindamas; Kubaha, Kuskana

doi:10.3390/su10103593

by Jindamas Sutthichaimethee * and Kuskana Kubaha

Division of Energy Management Technology, School of Energy, Environment and Materials, King Mongkut’s University of Technology Thonburi, 126 Pracha Uthit Road, Bang Mod, Thung Khru, Bangkok 10140, Thailand

* Author to whom correspondence should be addressed.

Sustainability 2018, 10(10), 3593; https://doi.org/10.3390/su10103593

Submission received: 28 September 2018 / Revised: 6 October 2018 / Accepted: 8 October 2018 / Published: 9 October 2018

(This article belongs to the Special Issue Modelling and Analysis of Sustainability Related Issues in New Era)

Abstract: Thailand’s development policy focuses on the simultaneous growth of the economy, society, and environment. Long-term goals have been set to improve economic and social well-being while reducing future CO₂ emissions, especially in the construction sector, which is important for national development and is a major generator of greenhouse gases. Achieving national sustainable development requires policy formulation and planning, which in turn require a supporting tool: a long-term forecast of CO₂ emissions from energy consumption, on which complete and accurate policy can be based. This research aims to study and forecast energy-related carbon dioxide emissions in Thailand’s construction sector by applying the long- and short-term auto-regressive (AR), integrated (I), moving average (MA) with exogenous variables (Xi) and error correction mechanism (LS-ARIMAXi-ECM) model. The model is designed to fill the gaps left by the old models. It is constructed from factors that are causal and influential for changes in CO₂ emissions, and both the independent and dependent variables must be stationary at the same level. In addition, the LS-ARIMAXi-ECM model uses co-integration analysis and an error correction mechanism (ECM) in its modeling. The study’s findings reveal that the LS-ARIMAXi $(2, 1, 1, X_{t - 1})$-ECM model is a forecasting model with an appropriate time period (t − i), as justified by the Q-test statistic, and is not a spurious model. It is therefore used to forecast CO₂ emissions for the next 20 years (2019 to 2038). The results show that CO₂ emissions in the construction sector will increase by 37.88%, or 61.09 Mt CO₂ Eq., in 2038. The performance of the LS-ARIMAXi $(2, 1, 1, X_{t - 1})$-ECM model has also been evaluated: it produces a mean absolute percentage error (MAPE) of 1.01% and a root mean square error (RMSE) of 0.93% when compared with the old models. Overall, the results indicate that determining future national sustainable development policies requires an appropriate forecasting model, built upon causal and contextual factors for the relevant sectors, to serve as an important tool for future sustainable planning.

Keywords: long- and short-term; greenhouse gas; LS-ARIMAXi-ECM model; sustainability; economic growth; exogenous variables; CO₂ emissions

1. Introduction

In recent years, Thailand has made a sustained effort to enhance its economic development. As a result, the national economy and the gross domestic product (GDP) have continued to grow [1]. Thailand has been actively promoting exports to gain a larger global market share, particularly in the Chinese market. The country is also improving its tourism industry in order to generate more national revenue. Among other major actions, it has set a clear objective of promoting foreign investment in local industries by offering low tax rates and subsidies in some sectors, in order to attract more investment and raise revenues for its people. According to the Office of the National Economic and Social Development Board (NESDB), Thailand’s economic growth rate has increased, and the social growth rate has risen along with it. However, energy consumption has also been climbing steadily [2,3], and this has driven a continuing rise in greenhouse gas emissions. In particular, greenhouse gas emissions in the industrial sector are projected to grow at a very high rate of 27% with a growth rate of 4.3% (comparing 2017 to 2016). In addition, the construction sector has been emitting CO₂ at an increasing rate.

The CO₂ emission in the construction sector has a 1.6% growth rate (2017/2016) [4]. The construction sector consumes a large amount of energy, causing massive greenhouse gas emissions. In general, it releases 90% of the carbon dioxide and 75% of other greenhouse emissions out of all the greenhouse gases in Thailand [3,4].

Thailand pursues its policy objectives by setting a sustainable development plan as the key to achieve sustainability. This plan aims to develop three main areas simultaneously, ensuring economic, social and environmental growth. In such development, the environmental aspect is very important and challenging as it requires highly effective action plans and positive long-term effects.

Therefore, an effective tool is needed for implementing the plan and for possible future applications. Long-term planning is challenging and complex, as it requires extreme care in all planning phases; if the planning fails, the resulting damage is hard to repair. Hence, the most important tool for such long-term policy planning is a long-term forecasting model. This paper identifies research gaps by reviewing the relevant literature and sets out to fill them. Causal factors are carefully selected in the modelling process so as to make use of both endogenous and exogenous variables, which are characterized as stationary at the same level. The paper conducts a co-integration test at the same level, examines the appropriateness of the period of time (t − i) with respect to both types of variables, and checks whether the model is spurious. Additionally, it extends the forecasting horizon to a long-term prediction of 20 years (2019–2038), and the approach can be applied in other sectors and contexts. The research proceeds as follows:

We analyze the stationarity of the causal variables that influence changes in CO₂ emissions, based on the augmented Dickey and Fuller theory [5]. We select variables that are stationary at the same level under the Sustainable Development Framework, using data from 1990 to 2017.
We take the causal variables that are stationary at the same level and analyze their long-term relationship using the concept from Johansen and Juselius [6].
We apply the variables that are co-integrated at the same level to construct the long- and short-term auto-regressive (AR), integrated (I), moving average (MA) with exogenous variables (Xi) and error correction mechanism (LS-ARIMAXi-ECM) model, comprising endogenous and exogenous variables.
We examine the appropriateness of the period of time (t − i) for the LS-ARIMAXi $(p, d, q, X_{t - i})$-ECM model with Q-testing, and check for spuriousness issues, namely heteroscedasticity, multicollinearity and autocorrelation.
We compare the efficiency of the LS-ARIMAXi $(p, d, q, X_{t - i})$-ECM model with other existing models, including multiple regression, the grey model (GM (1,1)), the grey model-autoregressive integrated moving average (GM-ARIMA) model, the artificial neural network (ANN) model, the autoregressive moving average (ARMA) model, and the autoregressive integrated moving average (ARIMA) model, using MAPE and RMSE as performance measures.
We forecast future CO₂ emissions with the LS-ARIMAXi $(p, d, q, X_{t - i})$-ECM model for the years 2019 to 2038, a 20-year forecasting horizon. The flowchart of the LS-ARIMAXi $(p, d, q, X_{t - i})$-ECM model is shown in Figure 1.
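
The two performance measures named in the comparison step can be sketched as follows. This is a minimal illustration using the standard textbook definitions of MAPE and RMSE; the function names and the sample values are hypothetical and are not taken from the study's data.

```python
import math

def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be non-zero)."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("series must be non-empty and of equal length")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)

def rmse(actual, predicted):
    """Root mean square error, in the units of the series."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("series must be non-empty and of equal length")
    total = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(total / len(actual))

# Hypothetical values, for illustration only (not data from the paper).
actual = [100.0, 110.0, 120.0]
predicted = [98.0, 112.0, 120.0]
print("MAPE (%):", mape(actual, predicted))
print("RMSE:", rmse(actual, predicted))
```

Lower values of either measure indicate a closer fit, which is how the competing models are ranked in the comparison step.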

The remainder of this paper is as follows: Section 2 is a literature review. Section 3 discusses the materials and methods. Section 4 shows the results. Section 5 summarizes the discussion. Section 6 is the conclusion.

2. Literature Review

Developing an energy-forecasting model is a key step in supporting a country's national policy. An efficient and effective model allows policy makers to make better decisions, and many studies have highlighted the significance of forecasting energy consumption and related areas. Ardakani and Ardehali [7] developed optimized regression and ANN models for long-term forecasting (2010 to 2030) of the electrical energy consumption (EEC) of both developing and developed economies, based on different optimized models and historical data types. They found that using historical data on socio-economic indicators produces more accurate EEC forecasts. Azadeh, Ghaderi, Sheikhalishahi and Nokhandan [8] applied two different seasonal ANNs to predict the short-term load in Iran’s electricity market. Their predictions showed a significant correlation between the actual data and the ANN outcomes, and the ANN models outperformed the regression models in terms of MAPE in most cases. Zhao, Zhao and Guo [9] estimated electricity consumption in Inner Mongolia using an integrated Grey model enriched by the Moth-flame optimization (MFO) algorithm with a rolling mechanism (Rolling-MFO-GM (1,1)). Their study shows that such a hybrid model can greatly enhance forecasting performance for annual electricity consumption. Meng, Niu and Sun [10] estimated monthly electric energy in China using feature extraction, and found that the method performed better than traditional approaches in terms of expected risk and forecasting precision. Hasanov, Hunt and Mikayilov [11] established a model to forecast Azerbaijan’s electricity demand in 2025 by applying co-integration and error correction approaches; the demand in 2025 was forecast at between 19.50 and 21 TWh. Khairalla, Ning, AL-Jallad and El-Faroug [12] investigated the stacking multi-learning ensemble (SMLE) model for short-term energy consumption forecasting. Their results demonstrated that the model performed better and more accurately than the other methods discussed in that paper.

Other studies use a variety of methods in different contexts. Chang, Sun and Gu [13] presented a novel quantum harmony search (QHS) algorithm-based discounted mean square forecast error (DMSFE) combination model to forecast energy CO₂ emissions. Their findings supported the validity of the approach and showed that forecasting precision can be enhanced to a certain degree. Zeng, Xu, Wang, Chen and Li [14] examined and forecasted the allocative efficiency of China’s carbon emission allowance financial assets at the provincial level for 2020 using a zero sum gains data envelopment analysis (ZSG-DEA) model. They obtained an efficient allocation scheme for all the provinces and, on that basis, suggested which provinces have to cut their CO₂ emissions. Liang, Niu, Wang and Chen [15] evaluated the security early warning of energy consumption carbon emissions (ECCE) in Hebei Province, China. They constructed an assessment index system according to the pressure-state-response (P-S-R) model and used the variance method and the linearity weighted method to compute the early warning index of ECCE. Their findings show a trend of improvement in the security index from 2015 to 2020, while the security degree and the corresponding alarm are found to be negative. Prakash, Xu, Rajagopal and Noh [16] presented a forecasting technique based on Gaussian process regression (GPR) to estimate energy load, and the results showed that the method was more precise than other forecasting models. Mehedintu, Sterpu and Soava [17] estimated and predicted the share of renewable energy consumption in final energy use within the European Union by 2020. Their analysis used three macroeconomic indicators and five regression models (polynomial, ARIMA), and the findings showed a growing trend in the share. Liang, Niu, Cao and Hong [18] analyzed and modeled China’s electricity demand in terms of carbon emissions. They integrated the Grey relation degree (GRD) and the induced ordered weighted harmonic averaging operator (IOWHA) to construct an optimal hybrid forecasting model based on multiple regression and an extreme learning machine. They concluded that the proposed model performs better than other forecasting models, especially with regard to overall instability. Furthermore, the study revealed that low-carbon economic development will increase the demand for electricity and affect the adjustment of the electricity demand structure. Zhai and Wang [19] predicted carbon emissions demand in India under the balanced economic growth path from 2009 to 2050, using the economy-carbon dynamic model. They projected a cumulative energy demand and carbon emissions demand of 44.65 Gtoe and 36.16 Gt C, respectively. The two demands will peak in 2045 at 1290.74 Mtoe and 1045.98 Mt C, respectively, with maximum values of 0.81 toe and 0.65 t C, respectively. For China, Zeng and Chen [20] developed a low-carbon economy index evaluation system based on the entropy weight method to forecast the allocation ratio of carbon emissions for 2020 and 2030. They projected reasonable allocation ratios for carbon emission allowances during the predicted period. Attaining such allocation ratios can help China in many respects, including economic development, energy conservation and emissions reduction. Zhou, Yu, Guang and Li [21] analyzed and predicted CO₂ emissions in China for the period 2000 to 2014 by implementing the logarithmic mean Divisia index (LMDI) and a genetic algorithm-support vector machine (GA-SVM) model. Their findings reveal that the proposed model performs better than a back propagation neural network (BPNN) model and a single SVM model in forecasting CO₂ emissions. Xu, Gua, Liu and Dai [22] forecasted the final energy consumption of Guangdong Province, China, from 2013 to 2016 using a newly established GM–ARMA model based on an HP filter. The study shows that this model has excellent precision and a higher level of reliability, and it indicates that the study region will face a serious issue concerning energy conservation and emission reductions in the next few years.

Among other forecasting approaches, Zhao, Zhao, Liu, Su and An [23] forecasted wind speed using the self-adaptive auto-regressive integrated moving average chaotic particle swarm optimization (SA-ARIMA-CPSO) approach. The approach builds a self-adaptive (SA) auto-regressive integrated moving average with exogenous variables (ARIMAX) model, optimized with the CPSO algorithm. The experimental results showed that the developed model outperformed the other models. Souza, Christo and Almeida [24] proposed a method using the ARIMA model to locate faults in power transmission lines by analyzing voltage oscillographic signals. Their results were satisfactory in comparison with other techniques in the literature. Farias, Puig, Rangel and Flores [25] forecasted the demand of water distribution networks using a multi-model predictor, the qualitative multi-model predictor plus (QMMP+), and found that it enhances forecasting precision. Chen, Xu and Zhou [26] proposed a hybrid approach combining the variational mode decomposition (VMD) denoising technique with the autoregressive integrated moving average (ARIMA) and GM (1,1) models to predict the remaining useful life (RUL) of a battery. Their experimental results indicate the accuracy of the proposed methods for on-line RUL prediction of lithium-ion batteries. Yang, Park, Choi, Kim, Munkhdalai, Musa and Ryu [27] conducted a comparative study of state-of-the-art techniques. They compared four temporal outbreak detection algorithms: the cumulative sum (CUSUM), the early aberration reporting system (EARS), ARIMA and the Holt–Winters algorithm. The comparison indicates that the EARS C3 method performs better than the other algorithms studied. However, Holt–Winters outperforms the others when the baseline frequency and dispersion parameter values are less than 1.5 and 2, respectively. Kahsai, Nondo, Schaeffer and Gebremedhin [28] investigated the relationship between energy consumption and economic growth in Sub-Saharan Africa using a panel co-integration approach. Their examination explains the interdependence of energy consumption and economic growth in the study region, and the results provide an important basis for formulating sustainable development policies that achieve the efficient allocation of resources.

Xin, Zhou, Yang, Li and Wang [29] proposed a new method that integrates the Kalman filter, ARIMA, and generalized autoregressive conditional heteroskedasticity (GARCH) to predict bridge structure deformation. The study reported a new way of predicting structural behavior based on data processing, laying a basis for bridge health monitoring systems that use sensor data. Li, Yang and Li [30] developed four time-series forecasting techniques, including a metabolism grey model (GM), ARIMA, grey model (GM)-ARIMA and non-linear metabolism grey model (NMGM), to forecast China’s coal power installed capacity for the next 10 years (2017–2026). The predictions give an average annual growth rate of 5.26% over the period, and the average annual newly added installed capacity for 2017–2026 is found to be 74 gigawatts. Kurecic and Kokotovic [31] examined the relevance of political stability for foreign direct investment (FDI) in three panels (small, developed, and instability-threatened economies) using a Granger causality test, a vector autoregressive (VAR) framework and an ARDL model. The study concludes that there is a long-term relationship between political stability and FDI in the panel of small economies, while no such relationship is found in the panels of larger and more developed economies. Li and Su [32] adopted the VAR model to study the dynamic effect of renewable energy consumption on carbon dioxide emissions in the US from 1990 to 2015. They found that the use of renewable energy would greatly help to reduce carbon emissions, yet natural gas consumption would have a negative impact on CO₂ emissions in the early stages. These findings could guide policy makers in developing energy-saving and emission-reduction policies. Dai, Niu and Han [33] proposed adapting the MSFLA-LSSVM model to predict CO₂ emissions in China from 2018 to 2025. They concluded that China’s CO₂ emissions would exhibit a slow growth trend over the next few years, so that the emissions could be effectively controlled in the future, which could start to reduce the greenhouse effect.

Finally, Jiang, Yang and Li [34] carried out a comparative study of forecasting energy demand in India using several methods, namely MGM, ARIMA, MGM-ARIMA, and a back propagation neural network (BP). Based on their predictions, India’s energy demand will potentially increase by 4.75% from 2017 to 2030.

A review of previous research shows that existing works differ in their methods, research methodologies, and analytical results. This research is distinguished by features that existing research has not addressed in terms of modeling, validation, spuriousness testing, and the efficiency and effectiveness of the model for decision-making. In addition, a key feature of this study is that the model can be applied to other sectors according to their particular contexts.

3. Materials and Methods

3.1. Co-Integration Testing and Error Correction Mechanism Model Based on Johansen and Juselius

The co-integration test of Johansen and Juselius [35] is used to model the relationship among at least two variables. Because the test relies on large-sample properties, its results may not be reliable as a reference when the sample is small. In practice, a regression with the variables in one order may indicate that they are co-integrated, while the regression in the reverse order indicates that they are not. This second outcome is undesirable, as the result of a co-integration test should not vary with the ordering of the variables [5,6,35]. The trace statistic is:

$$λ_{trace} (r) = - T \sum_{i = r + 1}^{n} \ln (1 - \hat{λ}_{i}) \tag{1}$$

Equation (1) tests the following hypotheses:

$$H_{0} : r \leq k, \qquad H_{a} : r > k, \quad k = 0, \dots, n$$

The maximum-eigenvalue statistic is:

$$λ_{\max} (r, r + 1) = - T \ln (1 - \hat{λ}_{r + 1}) \tag{2}$$

Equation (2) tests the following hypotheses:

$$H_{0} : r = k, \qquad H_{a} : r = k + 1, \quad k = 0, \dots, n$$

where $\hat{λ}_{i}$ is an estimated value of the characteristic roots (eigenvalues) derived from the estimated matrix $π$, and $T$ is the number of observations. The characteristic roots are obtained from the equation below:

$$| λ S_{pp} - S_{po} S_{oo}^{- 1} S_{op} | = 0 \tag{3}$$

where

$$S_{ij} = T^{- 1} \sum_{t = 1}^{T} R_{it} R_{jt}^{'}, \quad i, j = o, p$$

The residuals $R_{ot}$ and $R_{pt}$ are derived from regressions of $Δ u_{t}$ and $Δ u_{t - p}$ on $Δ u_{t - 1}, \dots, Δ u_{t - p + 1}$, where $x_{t}$ and $y_{t}$ are time series that are stationary at first differences, I(1); $A$ is a constant, and $u_{i}$ is I(0).

The likelihood ratio test statistic for the null hypothesis that the rank of $π$ is less than or equal to $k$, written as $H_{0} : r \leq k$, is:

$$- 2 \ln (Q) = - T \sum_{i = r + 1}^{n} \ln (1 - \hat{λ}_{i}) \tag{4}$$

In this framework, $α$ is an $(n \times r)$ matrix, $β$ is an $(n \times r)$ matrix, and $r$ is the rank of the matrix $π$, where $α$ and $β$ satisfy:

$$π = α β' \tag{5}$$

where $β$ is the matrix of co-integrating vectors, and $α$ is the matrix of speed-of-adjustment parameters.

In estimating the parameters of the ECM for a co-integrated series by maximum likelihood (ML), the process that we consider is driven by a sequence of error terms of dimension $n$, distributed as NID(0, Λ).

The process of co-integration testing according to Johansen is as follows:

Step 1 tests the order of integration of all the variables. The data are plotted to see whether the data-generating process contains a linear time trend, and the variables must be stationary at the same level.

The lag length can be found through a test on a VAR with undifferenced data. We first estimate a vector autoregression with the longest lag length deemed reasonable, and then check whether the lag length can be shortened. For instance, to test the significance of lags 2 to 5, we estimate the following VARs [36]:

$$y_{t} = A_{0} + A_{1} y_{t - 1} + A_{2} y_{t - 2} + A_{3} y_{t - 3} + A_{4} y_{t - 4} + A_{5} y_{t - 5} + u_{1t} \tag{6}$$

$$y_{t} = A_{0} + A_{1} y_{t - 1} + u_{2t} \tag{7}$$

where $y_{t}$ is an $n \times 1$ vector of variables; $A_{0}$ is an $n \times 1$ matrix of intercept terms; $A_{i}$ is an $n \times n$ matrix of coefficients; and $u_{1t}$ and $u_{2t}$ are $n \times 1$ vectors of error terms.

In practice, we estimate Equation (6) with five lags of each variable in each equation, and let $Σ_{5}$ be the variance–covariance matrix of the residuals of Equation (6). We then estimate Equation (7) with only one lag of each variable in each equation, and let $Σ_{1}$ be the variance–covariance matrix of the residuals of Equation (7).

For testing, we use the likelihood ratio test statistic proposed by Sims [37], even though the variables under study are non-stationary. The likelihood ratio statistic is:

$$(T - c) (\ln | Σ_{1} | - \ln | Σ_{5} |) \tag{8}$$

where $T$ is the number of observations; $c$ is the number of parameters in the unrestricted system; $\ln | Σ_{1} |$ is the natural logarithm of the determinant of $Σ_{1}$; and $\ln | Σ_{5} |$ is the natural logarithm of the determinant of $Σ_{5}$.

The test statistic has a $χ^{2}$ distribution with degrees of freedom equal to the number of restricted coefficients. Each $A_{i}$ has $n^{2}$ coefficients, and Equation (7) imposes the restriction $A_{2} = A_{3} = A_{4} = A_{5} = 0$, so the number of restrictions is $4 n^{2}$. Alternatively, Enders (2010) suggested that the lag length $p$ can be chosen using the AIC or SBC.
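
The lag-length test in Equation (8) can be sketched in a few lines. This is only an illustration under stated assumptions: the covariance matrices, `T`, and `c` below are hypothetical values (with `c` taken here as the parameter count per equation of the five-lag system, which is an assumption), and the helper names are not from the paper.

```python
import math

def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-15:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for k in range(col, n):
                a[r][k] -= factor * a[col][k]
    return det

def lag_length_lr(sigma_restricted, sigma_unrestricted, T, c):
    """Equation (8): (T - c) * (ln|Sigma_1| - ln|Sigma_5|)."""
    return (T - c) * (math.log(determinant(sigma_restricted))
                      - math.log(determinant(sigma_unrestricted)))

def restrictions(n, lags_dropped):
    """Degrees of freedom: n^2 coefficients for each dropped lag matrix."""
    return n * n * lags_dropped

# Hypothetical 2x2 residual covariance matrices (illustration only).
sigma_1 = [[1.0, 0.2], [0.2, 1.0]]  # one-lag (restricted) VAR
sigma_5 = [[0.8, 0.1], [0.1, 0.9]]  # five-lag (unrestricted) VAR
# Assumed: T = 28 observations, c = 5 lags x 2 variables + 1 intercept.
stat = lag_length_lr(sigma_1, sigma_5, T=28, c=11)
dof = restrictions(n=2, lags_dropped=4)
print("LR statistic:", stat, "degrees of freedom:", dof)
```

The statistic would then be compared with a chi-square critical value at `dof` degrees of freedom; a value above the critical value rejects the restriction that lags 2 to 5 are zero.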

Step 2 estimates the model and determines the rank of $π$. Ordinary least squares (OLS) is not appropriate for this estimation, because cross-equation restrictions must be imposed on the matrix $π$. We may choose to estimate the model in three different forms: (a) with all elements of $A_{0}$ set to zero, (b) with a drift, or (c) with a constant term in the co-integrating vector, as shown below:

$$Δ y_{t} = A_{0} + π_{1} Δ y_{t - 1} + π y_{t - 2} + ε_{t} \tag{9}$$

where, if an intercept exists in the co-integrating vector, the drift term $A_{0}$ is restricted accordingly. The residuals of the model must then be analyzed. If the errors are found not to be white noise, the lag lengths are too short. Two conditions apply to the residuals: first, the residuals of the long-run equilibrium must be stationary; second, the estimated short-term deviations ($ε_{t}$ in Equation (9)) must be white noise.
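
One common way to check whether residuals behave like white noise is a portmanteau Q statistic. The sketch below uses the Ljung-Box form, which is an assumption on our part: the text does not state which Q statistic is meant. The residual values and function names are hypothetical.

```python
def autocorrelation(x, k):
    """Sample autocorrelation of the series x at lag k."""
    n = len(x)
    mean = sum(x) / n
    denom = sum((v - mean) ** 2 for v in x)
    num = sum((x[t] - mean) * (x[t - k] - mean) for t in range(k, n))
    return num / denom

def ljung_box_q(residuals, max_lag):
    """Ljung-Box statistic: Q = T (T + 2) * sum_{k=1..m} r_k^2 / (T - k)."""
    T = len(residuals)
    return T * (T + 2) * sum(
        autocorrelation(residuals, k) ** 2 / (T - k)
        for k in range(1, max_lag + 1)
    )

# Hypothetical residual series (illustration only, not from the paper).
residuals = [0.3, -0.1, 0.2, -0.4, 0.1, 0.0, -0.2, 0.3, -0.1, -0.1]
q_stat = ljung_box_q(residuals, max_lag=3)
print("Q statistic:", q_stat)
```

A large Q relative to the chi-square critical value suggests that autocorrelation remains in the residuals, which in this context would point to a lag length that is too short.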

Thereafter, the characteristic roots of the matrix $π$ are estimated, and the values of $λ_{\max}$ and $λ_{trace}$ are computed.

To test the hypothesis that the variables are not co-integrated (rank $π$ = 0), two statistical tests are possible, depending on the alternative hypothesis. If we test the null hypothesis that the variables are not co-integrated ($r = 0$) against the alternative of one or more co-integrating vectors ($r > 0$), we use the statistic $λ_{trace} (0)$, as explained below. In the case of Equation (7), assuming $n = 3$, the characteristic roots of the matrix $π$ are $λ_{1}, λ_{2}, λ_{3}$, and:

$$λ_{trace} (0) = - T [\ln (1 - λ_{1}) + \ln (1 - λ_{2}) + \ln (1 - λ_{3})], \qquad λ_{trace} (1) = - T [\ln (1 - λ_{2}) + \ln (1 - λ_{3})] \tag{10}$$

where $λ_{i}$ are the estimated characteristic roots (eigenvalues) of the estimated matrix $π$, ordered as $λ_{1} > λ_{2} > λ_{3} > \dots > λ_{n}$, and $T$ is the number of usable observations. The computed statistics are compared with the critical values of $λ_{trace}$.
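
The trace statistics in Equation (10) are simple to compute once the eigenvalues are known. The sketch below also computes the maximum-eigenvalue statistic in its standard form, $- T \ln (1 - λ_{r + 1})$; the eigenvalues and `T` are hypothetical, and the function names are ours.

```python
import math

def trace_statistic(eigenvalues, T, r):
    """Trace statistic: -T * sum over i = r+1..n of ln(1 - lambda_i)."""
    ordered = sorted(eigenvalues, reverse=True)
    return -T * sum(math.log(1.0 - lam) for lam in ordered[r:])

def max_eigen_statistic(eigenvalues, T, r):
    """Maximum-eigenvalue statistic: -T * ln(1 - lambda_{r+1})."""
    ordered = sorted(eigenvalues, reverse=True)
    return -T * math.log(1.0 - ordered[r])

# Hypothetical eigenvalues for n = 3 and a hypothetical sample size.
eigenvalues = [0.45, 0.20, 0.05]
T = 26
for r in range(len(eigenvalues)):
    print(r,
          round(trace_statistic(eigenvalues, T, r), 3),
          round(max_eigen_statistic(eigenvalues, T, r), 3))
```

Each statistic is compared with its tabulated critical value, working upward from r = 0 until the null hypothesis can no longer be rejected.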

Step 3 analyzes the normalized co-integrating vectors and the speed-of-adjustment coefficients, as demonstrated below:

To test whether $β_{0} = 0$, we impose one restriction on the co-integrating vector and use the likelihood ratio test. The test statistic is distributed as $χ^{2}$ with one degree of freedom. If we cannot reject $H_{0} : β_{0} = 0$, we may need to re-estimate the model without a constant in the co-integrating vector;
Restricting the normalized co-integrating vector so that $β_{2} = - 1$ and $β_{3} = 1$ imposes two restrictions on the co-integrating vector. The likelihood ratio statistic is then distributed as $χ^{2}$ with two degrees of freedom;
To test whether $β = (0, - 1, - 1, 1)$, we impose three restrictions, $β_{0} = 0$, $β_{2} = - 1$ and $β_{3} = 1$ (with $β_{1}$ normalized to −1). The likelihood ratio statistic is distributed as $χ^{2}$ with three degrees of freedom. This type of test is known as a joint restriction test.

Step 4 is a stage called “innovation accounting” (the analysis of impulse responses and variance decompositions), which is a useful tool for evaluating the relationships among the variables. If the correlation among the innovations is very low, an identification problem is unlikely to arise, and different orderings would yield similar impulse responses and variance decompositions. Innovation accounting and causality tests on the error-correction model help to identify a structural model and to answer the question of whether the estimated model is reasonable.

3.2. Long- and Short-Term Auto-Regressive Integrated Moving Average with Exogenous Variables and the Error Correction Mechanism (LS-ARIMAXi–ECM) Model

The LS-ARIMAXi-ECM model is a newly developed model built upon a concept of the ARIMA model with the following conditions: (1) factors used in modelling are both endogenous variables and exogenous variables, and they must be stationary only at the same level; (2) when the first condition is fulfilled, the above factors have to undergo a co-integration test to investigate the long-term relationship of all factors at the same level only; (3) the next step is to build a forecasting model of LS-ARIMAXi-ECM whose construction is structured based on autoregressive (AR), integrated (I), moving average (MA), ECM (t − i) and exogenous variables (X_i), as explained in the next paragraph.

3.2.1. Autoregressive Moving Average ($ARMA (p, q)$) Model

The $ARMA (p, q)$ model is written as [38,39]:

$$X_{t} = α_{0} + α_{1} X_{t - 1} + α_{2} X_{t - 2} + \dots + α_{p} X_{t - p} + ε_{t} - β_{1} ε_{t - 1} - β_{2} ε_{t - 2} - \dots - β_{q} ε_{t - q} \tag{11}$$

where $t = 1, 2, \dots, T$. At time $T$, the $ARMA (p, q)$ model becomes:

$$X_{T} = α_{0} + α_{1} X_{T - 1} + α_{2} X_{T - 2} + \dots + α_{p} X_{T - p} + ε_{T} - β_{1} ε_{T - 1} - β_{2} ε_{T - 2} - \dots - β_{q} ε_{T - q} \tag{12}$$

or it can be written in another form as:

$$α(L) X_{T} = α_{0} + β(L) ε_{T} \tag{13}$$

where $α(L) = 1 - α_{1} L - α_{2} L^{2} - \dots - α_{p} L^{p}$ and $β(L) = 1 - β_{1} L - β_{2} L^{2} - \dots - β_{q} L^{q}$, while the information available at time $T$ is denoted by $I_{T} = \{X_{1}, \dots, X_{T}, ε_{1}, \dots, ε_{T}\}$. Applying Equation (12) at times $T + 1$ and $T + 2$ gives $X_{T + 1}$ and $X_{T + 2}$:

$$\begin{array}{l} X_{T + 1} = α_{0} + α_{1} X_{T} + α_{2} X_{T - 1} + \dots + α_{p} X_{T + (1 - p)} \\ \qquad + ε_{T + 1} - β_{1} ε_{T} - β_{2} ε_{T - 1} - \dots - β_{q} ε_{T + (1 - q)} \end{array} \tag{14}$$

$$\begin{array}{l} X_{T + 2} = α_{0} + α_{1} X_{T + 1} + α_{2} X_{T} + \dots + α_{p} X_{T + (2 - p)} \\ \qquad + ε_{T + 2} - β_{1} ε_{T + 1} - β_{2} ε_{T} - \dots - β_{q} ε_{T + (2 - q)} \end{array} \tag{15}$$

The one- and two-step-ahead forecasts from the $ARMA(p, q)$ model are:

$${\hat{X}}_{T}(1) = E(X_{T + 1} | I_{T}) = α_{0} + α_{1} X_{T} + α_{2} X_{T - 1} + \dots + α_{p} X_{T + (1 - p)} - β_{1} ε_{T} - \dots - β_{q} ε_{T + (1 - q)} \tag{16}$$

$${\hat{X}}_{T}(2) = E(X_{T + 2} | I_{T}) = α_{0} + α_{1} {\hat{X}}_{T}(1) + α_{2} X_{T} + \dots + α_{p} X_{T + (2 - p)} - β_{2} ε_{T} - \dots - β_{q} ε_{T + (2 - q)} \tag{17}$$

In general, the forecast of $X_{T + j}$ can be written as:

$${\hat{X}}_{T}(j) = E(X_{T + j} | I_{T}) \tag{18}$$

$${\hat{X}}_{T}(j) = α_{0} + \sum_{i = 1}^{p} α_{i} {\hat{X}}_{T}(j - i) - \sum_{i = 1}^{q} β_{i} ε_{T}(j - i) \tag{19}$$

where ${\hat{X}}_{T}(j - i) = X_{T + (j - i)}$ when $j - i \leq 0$, and

$$ε_{T}(j - i) = \begin{cases} ε_{T + (j - i)}, & \text{if } j - i \leq 0 \\ 0, & \text{if } j - i > 0 \end{cases}$$

Moreover, following the same reasoning (which can be checked with an $ARMA(1, 1)$ model), when $j \to \infty$ the forecast becomes:

$$\lim_{j \to \infty} {\hat{X}}_{T}(j) = \frac{α_{0}}{1 - α_{1} - \dots - α_{p}} \tag{20}$$

Equation (20) tells us that, as the forecast horizon lengthens, the forecast approaches $\frac{α_{0}}{1 - α_{1} - \dots - α_{p}} = E(X_{t})$, which is the mean of the time series $X_{t}$ in the $ARMA(p, q)$ model. In addition, the $j$-step-ahead forecast error and its variance can be easily obtained by rewriting the $ARMA(p, q)$ model in $MA(\infty)$ form, as explained below.
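The recursion in Equation (19) and the limit in Equation (20) can be illustrated with a short sketch. The function name `arma_forecast` and the numerical values are assumptions made for this example only; the code takes the coefficients as known and is not an estimation routine.

```python
def arma_forecast(x, eps, alpha0, alpha, beta, steps):
    """j-step-ahead ARMA(p, q) forecasts following Equation (19).

    x, eps : observed series and residuals up to time T (most recent last)
    alpha  : [alpha_1, ..., alpha_p];  beta : [beta_1, ..., beta_q]
    """
    T = len(x)
    forecasts = []
    for j in range(1, steps + 1):
        value = alpha0
        for i in range(1, len(alpha) + 1):
            lag = j - i
            # Use the observation when j - i <= 0, else an earlier forecast.
            prev = x[T - 1 + lag] if lag <= 0 else forecasts[lag - 1]
            value += alpha[i - 1] * prev
        for i in range(1, len(beta) + 1):
            lag = j - i
            # Future shocks have expectation zero, so only past ones enter.
            if lag <= 0:
                value -= beta[i - 1] * eps[T - 1 + lag]
        forecasts.append(value)
    return forecasts


# Hypothetical ARMA(1, 1): alpha0 = 1, alpha1 = 0.5, beta1 = 0.3.
path = arma_forecast(x=[2.0, 2.4], eps=[0.1, -0.2],
                     alpha0=1.0, alpha=[0.5], beta=[0.3], steps=50)
long_run_mean = 1.0 / (1.0 - 0.5)  # Equation (20)
print(path[0], path[1], path[-1], long_run_mean)
```

With these illustrative numbers the first forecast is 1 + 0.5·2.4 + 0.3·0.2 = 2.26, and the forecasts then decay geometrically toward the mean of 2.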

Since the time series $X_{t}$ is stationary, it can be rewritten as:

$$X_{T} = \frac{α_{0}}{α(L)} + \frac{β(L)}{α(L)} ε_{T} \tag{21}$$

Here $\frac{α_{0}}{α(L)} = \frac{α_{0}}{1 - α_{1} - \dots - α_{p}} = E(X_{t})$ is the mean, which is constant, while $\frac{β(L)}{α(L)} ε_{T} = \frac{1 - β_{1} L - \dots - β_{q} L^{q}}{1 - α_{1} L - \dots - α_{p} L^{p}} ε_{T}$ depends on $ε_{T}$ and is not constant. We write:

$$\frac{α_{0}}{α(L)} = μ \tag{22}$$

$$\frac{β(L)}{α(L)} = ϕ(L) = 1 + ϕ_{1} L + ϕ_{2} L^{2} + \dots \tag{23}$$

Thus, Equation (21) for the $ARMA(p, q)$ model can be written in the $MA(\infty)$ form as:

$$X_{T} = μ + ϕ(L) ε_{T} \tag{24}$$

We call $ϕ_{i}$ $(i = 1, 2, \dots)$ the impulse response function of the $ARMA$ model. When the time series $X_{T}$ is stationary, $ϕ_{1}, ϕ_{2}, ϕ_{3}, \dots$ decrease rapidly at an exponential rate. Equation (24) can also be used to compute the $j$-step-ahead forecast error and its variance, as described below.

From Equation (24), the time series $X_{T + 1}$, $X_{T + 2}$, and $X_{T + 3}$ can be written as follows:

$$X_{T + 1} = μ + ε_{T + 1} + ϕ_{1} ε_{T} + ϕ_{2} ε_{T - 1} + \dots \tag{25}$$

$$X_{T + 2} = μ + ε_{T + 2} + ϕ_{1} ε_{T + 1} + ϕ_{2} ε_{T} + ϕ_{3} ε_{T - 1} + \dots \tag{26}$$

$$X_{T + 3} = μ + ε_{T + 3} + ϕ_{1} ε_{T + 2} + ϕ_{2} ε_{T + 1} + ϕ_{3} ε_{T} + ϕ_{4} ε_{T - 1} + \dots \tag{27}$$

Then, the 1-, 2-, and 3-step-ahead forecasts are derived as follows:

$${\hat{X}}_{T}(1) = E(X_{T + 1} | I_{T}) = μ + ϕ_{1} ε_{T} + ϕ_{2} ε_{T - 1} + \dots \tag{28}$$

$${\hat{X}}_{T}(2) = E(X_{T + 2} | I_{T}) = μ + ϕ_{2} ε_{T} + ϕ_{3} ε_{T - 1} + \dots \tag{29}$$

$${\hat{X}}_{T}(3) = E(X_{T + 3} | I_{T}) = μ + ϕ_{3} ε_{T} + ϕ_{4} ε_{T - 1} + \dots \tag{30}$$

The corresponding 1-, 2-, and 3-step-ahead forecast errors are:

$$e_{T}(1) = X_{T + 1} - {\hat{X}}_{T}(1) = ε_{T + 1} \tag{31}$$

$$e_{T}(2) = X_{T + 2} - {\hat{X}}_{T}(2) = ε_{T + 2} + ϕ_{1} ε_{T + 1} \tag{32}$$

$$e_{T}(3) = X_{T + 3} - {\hat{X}}_{T}(3) = ε_{T + 3} + ϕ_{1} ε_{T + 2} + ϕ_{2} ε_{T + 1} \tag{33}$$

Their variances are:

$$Var(e_{T}(1)) = σ^{2} \tag{34}$$

$$Var(e_{T}(2)) = (1 + ϕ_{1}^{2}) σ^{2} \tag{35}$$

$$Var(e_{T}(3)) = (1 + ϕ_{1}^{2} + ϕ_{2}^{2}) σ^{2} \tag{36}$$

In general, the $j$-step-ahead forecast error and its variance can be written as follows:

$$e_{T}(j) = ε_{T + j} + ϕ_{1} ε_{T + (j - 1)} + ϕ_{2} ε_{T + (j - 2)} + \dots + ϕ_{j - 1} ε_{T + 1} \tag{37}$$

$$Var(e_{T}(j)) = (1 + ϕ_{1}^{2} + ϕ_{2}^{2} + \dots + ϕ_{j - 1}^{2}) σ^{2} \tag{38}$$
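As a hedged illustration of Equations (23) and (38), the sketch below derives the weights $ϕ_{k}$ by matching powers of $L$ in $α(L) ϕ(L) = β(L)$, using the sign convention of Equation (13), and then evaluates the forecast error variance. The function names and the ARMA(1, 1) coefficients are assumptions chosen for the example.

```python
def phi_weights(alpha, beta, n):
    """Return [phi_0, ..., phi_n] with phi_0 = 1, from alpha(L) * phi(L) = beta(L).

    Matching powers of L gives
        phi_k = -beta_k + sum_{i=1}^{min(k, p)} alpha_i * phi_{k-i},
    where beta_k is taken as 0 for k > q.
    """
    phi = [1.0]
    for k in range(1, n + 1):
        value = -beta[k - 1] if k <= len(beta) else 0.0
        for i in range(1, min(k, len(alpha)) + 1):
            value += alpha[i - 1] * phi[k - i]
        phi.append(value)
    return phi


def forecast_error_variance(alpha, beta, sigma2, j):
    """Variance of the j-step-ahead forecast error, Equation (38)."""
    phi = phi_weights(alpha, beta, j - 1)
    return sigma2 * sum(w * w for w in phi)


# Hypothetical ARMA(1, 1) with alpha1 = 0.5, beta1 = 0.3 and sigma^2 = 1.
weights = phi_weights([0.5], [0.3], 3)
variances = [forecast_error_variance([0.5], [0.3], 1.0, j) for j in (1, 2, 3)]
print(weights, variances)
```

For these values the weights are 1, 0.2, 0.1, 0.05, so the variance grows from 1 to 1.04 to 1.05 times $σ^{2}$, which shows how quickly the extra uncertainty per step shrinks for a stationary series.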

3.2.2. Autoregressive Integrated Moving Average ( $A R I M A (p, d, q)$ ) Model

Non-stationary variables must be converted into stationary variables by differencing before they are used in the model. The resulting model is called $ARIMA(p, d, q)$ and can be illustrated with the first-difference equation below [38,39]:

$$Δ X_{t} = α_{0} + α_{1} Δ X_{t - 1} + ε_{t} \tag{39}$$

where $t = 1, 2, \dots, T$.

When the values $X_{1}, X_{2}, \dots, X_{T}$ (otherwise denoted as $I_{T}$) are known, Equation (39) gives the following forecasts:

$$\begin{array}{rcl} Δ {\hat{X}}_{T + 1} & = & α_{0} + α_{1} Δ X_{T} \\ Δ {\hat{X}}_{T + 2} & = & α_{0} + α_{1} Δ {\hat{X}}_{T + 1} \\ Δ {\hat{X}}_{T + 3} & = & α_{0} + α_{1} Δ {\hat{X}}_{T + 2} \\ & ⋮ & \\ Δ {\hat{X}}_{T + j} & = & α_{0} + α_{1} Δ {\hat{X}}_{T + (j - 1)} \end{array} \tag{40}$$

Equivalently, the $ARIMA(1, 1, 0)$ model can be written in levels as:

$$X_{t} - X_{t - 1} = α_{0} + α_{1} (X_{t - 1} - X_{t - 2}) + ε_{t} \tag{41}$$

$$X_{t} = α_{0} + (α_{1} + 1) X_{t - 1} - α_{1} X_{t - 2} + ε_{t} \tag{42}$$

where $t = 1, 2, \dots, T$.
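A minimal sketch of forecasting with the ARIMA(1, 1, 0) form may help here: the differences are forecast recursively and then accumulated back into levels. The function name and the numbers are hypothetical, and the coefficients are treated as already estimated.

```python
def arima110_forecast(x, alpha0, alpha1, steps):
    """Level forecasts for ARIMA(1, 1, 0): forecast the differences, then cumulate."""
    level = x[-1]
    diff = x[-1] - x[-2]           # last observed first difference
    forecasts = []
    for _ in range(steps):
        diff = alpha0 + alpha1 * diff   # forecast of the next difference
        level = level + diff            # undo the differencing
        forecasts.append(level)
    return forecasts


# Hypothetical data: X_{T-1} = 10, X_T = 13, alpha0 = 1, alpha1 = 0.5.
levels = arima110_forecast([10.0, 13.0], alpha0=1.0, alpha1=0.5, steps=2)

# Cross-check the first step against the level form in Equation (42),
# with the future shock set to its expectation of zero.
check = 1.0 + (0.5 + 1.0) * 13.0 - 0.5 * 10.0
print(levels, check)
```

The first forecast obtained by cumulating the differenced forecast matches the level form of Equation (42), which is a quick way to confirm that the two representations agree.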

Forecasting with $ARIMA(p, 1, q)$ can be applied in the same way. Assume the $ARIMA(p, 1, q)$ model is written as below:

$$Δ X_{t} = α_{0} + α_{1} Δ X_{t - 1} + α_{2} Δ X_{t - 2} + \dots + α_{p} Δ X_{t - p} + ε_{t} - β_{1} ε_{t - 1} - β_{2} ε_{t - 2} - \dots - β_{q} ε_{t - q} \tag{43}$$

$$X_{t} - X_{t - 1} = α_{0} + α_{1} (X_{t - 1} - X_{t - 2}) + α_{2} (X_{t - 2} - X_{t - 3}) + \dots + α_{p} (X_{t - p} - X_{t - p - 1}) + ε_{t} - β_{1} ε_{t - 1} - β_{2} ε_{t - 2} - \dots - β_{q} ε_{t - q} \tag{44}$$

Once the ARIMA model is obtained, we extend it to establish the LS-ARIMAXi-ECM model, as shown in the following.

3.2.3. LS-ARIMAXi-ECM Model

The LS-ARIMAXi-ECM model can be written as below:

$$Δ X_{t} = α_{0} + α_{1} Δ X_{t - 1} + α_{2} Δ X_{t - 2} + \dots + α_{p} Δ X_{t - p} + ε_{t} - β_{1} ε_{t - 1} - β_{2} ε_{t - 2} - \dots - β_{q} ε_{t - q} + \sum_{i = 1}^{p} Y_{t - i} + \sum_{i = 1}^{p} ECM_{t - i} \tag{45}$$

where $\sum_{i = 1}^{p} Y_{t - i}$ denotes the exogenous variables, which are stationary at the same level, and $\sum_{i = 1}^{p} ECM_{t - i}$ denotes the error correction mechanism terms.

The LS-ARIMAXi-ECM model requires testing the appropriateness of the time period (t − i) through Q-test statistics. It must also be assessed for heteroskedasticity, multicollinearity, and autocorrelation to ensure that it is not a spurious model. Once we derive the best model, we evaluate its performance with both the MAPE and RMSE values and compare these values with those of other models to assess the effectiveness of the model for future use.

3.3. Measurement of the Forecasting Performance

Among the many available measures, we use the MAPE and RMSE to compare the forecasting accuracy of each model. They are calculated as follows [38,39]:

$$MAPE = \frac{1}{n} \sum_{i = 1}^{n} \left| \frac{{\hat{y}}_{i} - y_{i}}{y_{i}} \right| \tag{46}$$

$$RMSE = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {({\hat{y}}_{i} - y_{i})}^{2}} \tag{47}$$
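Equations (46) and (47) translate directly into code. The sketch below is illustrative: the function names and the two-point data set are made up, and the MAPE is returned as a fraction, so it should be multiplied by 100 to express it as a percentage.

```python
import math


def mape(actual, predicted):
    """Mean absolute percentage error, Equation (46), as a fraction."""
    n = len(actual)
    return sum(abs((p - a) / a) for a, p in zip(actual, predicted)) / n


def rmse(actual, predicted):
    """Root mean square error, Equation (47), in the units of the data."""
    n = len(actual)
    return math.sqrt(sum((p - a) ** 2 for a, p in zip(actual, predicted)) / n)


# Hypothetical observations and forecasts, for illustration only.
y = [100.0, 200.0]
y_hat = [110.0, 190.0]
print(f"MAPE = {100 * mape(y, y_hat):.2f}%, RMSE = {rmse(y, y_hat):.2f}")
```

Note that the MAPE is undefined when any actual value is zero, which is not a concern for emission levels but matters if the measure is reused on differenced data.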

4. Results

4.1. Screening of Influencing Factors for Model Input

In this paper, we examine the stationarity of the causal factors under the Sustainable Development policy of Thailand. The time series data range from 1990 to 2017 and cover 8 factors: carbon dioxide emissions $(CO_{2})$, per capita GDP $(GDP)$, population growth $(Population)$, urbanization rate $(URT)$, industrial structure $(IST)$, total coal consumption $(CCT)$, oil price $(OP)$, and total exports and imports $(X - E)$.

The test was conducted using the augmented Dickey–Fuller test at level I (0) and at first difference I (1), as illustrated in Table 1.

Table 1 shows that all the factors analyzed in the unit root test are non-stationary at level I (0), that is, insignificant at the 1%, 5%, and 10% levels. They therefore require first differencing, after which all the factors are stationary at first difference I (1), that is, significant at the 1%, 5%, and 10% levels. Next, we test the factors for co-integration using the approach of Johansen and Juselius, as shown in Table 2.
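The unit root decision rule can be made explicit with the values reported in Table 1: the null hypothesis of a unit root is rejected when the Tau statistic lies below (is more negative than) the MacKinnon critical value. The sketch below only re-applies that comparison to the published statistics; it does not re-estimate the tests, and the variable and function names are chosen for illustration.

```python
# MacKinnon critical values and Tau statistics as reported in Table 1.
CRITICAL = {"1%": -4.37, "5%": -3.05, "10%": -2.95}

TAU_LEVEL = {"CO2": -2.21, "GDP": -2.74, "Population": -2.13, "URT": -2.59,
             "IST": -2.77, "CCT": -2.40, "X-E": -2.90, "OP": -2.83}
TAU_FIRST_DIFF = {"CO2": -4.69, "GDP": -5.95, "Population": -4.44, "URT": -5.61,
                  "IST": -5.74, "CCT": -5.45, "X-E": -5.59, "OP": -5.61}


def rejects_unit_root(tau, level):
    """True when the series is judged stationary at the given significance level."""
    return tau < CRITICAL[level]


stationary_at_level = [v for v, tau in TAU_LEVEL.items()
                       if rejects_unit_root(tau, "10%")]
stationary_after_diff = [v for v, tau in TAU_FIRST_DIFF.items()
                         if rejects_unit_root(tau, "1%")]
print(stationary_at_level)     # no variable passes, even at 10%
print(stationary_after_diff)   # every variable passes at 1%
```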

The co-integration test results are shown in Table 2. The trace test statistics are 275.41 and 82.45, and the maximum eigenvalue test statistics are 141.25 and 96.05; all of them are higher than the MacKinnon critical values at the same significance levels. This signifies a long-term relationship among all the variables and supports their use in structuring the LS-ARIMAXi-ECM model.

4.2. Formation of Analysis Modeling with the LS-ARIMAX_i $(p, d, q, X_{t - 1})$ -ECM Model

The LS-ARIMAX_i ($p, d, q, X_{t - i}$)-ECM model is built with the aim of being applicable in different contexts and sectors. Hence, we use the Q-test to find the appropriate time period (t − i) for $p, d, q, X_{t - i}$. The best-fitting periods are embedded in the LS-ARIMAX_i $(2, 1, 1, X_{t - 1})$-ECM model, as shown in Figure 2.

Figure 2 shows that the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model is the best forecasting model because all values of the Q-test statistic at time (t − i) meet the criteria, being insignificant at $α$ = 0.01, $α$ = 0.05, and $α$ = 0.1. Therefore, this model can be used to forecast CO₂ emissions. In addition, the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model reveals the influence, or elasticity, of each independent variable on the change in CO₂ emissions at time (t − i), as illustrated in Table 3.

Table 3 reports the parameters of the LS-ARIMAX_i $(2, 1, 1, X_{t - 1})$-ECM model, which are statistically significant at the 1% and 5% levels. Regarding the goodness of fit of the LS-ARIMAX_i $(2, 1, 1, X_{t - 1})$-ECM model, R-squared is 0.93, which indicates that the independent variables explain up to 93% of the variation in the dependent variable. With a Durbin–Watson statistic of 2.02 and an LM test value of 1.45, the model was found to be free from autocorrelation. The F-statistic is 245.05 (probability is 0.00), which shows that the LS-ARIMAX_i $(2, 1, 1, X_{t - 1})$-ECM model is significant at confidence levels of 99% and 95% and is free from the issue of multicollinearity. The value of the ARCH test is 31.05, which indicates that the model is free from the issue of heteroskedasticity.
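Two of the residual diagnostics mentioned above are simple enough to sketch. The code computes the Durbin–Watson statistic and a Ljung–Box Q-statistic from a list of residuals; it is assumed here that the Q-test in the text refers to the Ljung–Box form, and the residuals are invented purely to make the arithmetic easy to follow.

```python
def durbin_watson(residuals):
    """Durbin-Watson statistic; values near 2 suggest no first-order autocorrelation."""
    num = sum((residuals[t] - residuals[t - 1]) ** 2
              for t in range(1, len(residuals)))
    den = sum(e * e for e in residuals)
    return num / den


def ljung_box_q(residuals, max_lag):
    """Ljung-Box Q = n (n + 2) * sum_{k=1}^{m} r_k^2 / (n - k)."""
    n = len(residuals)
    mean = sum(residuals) / n
    centered = [e - mean for e in residuals]
    denom = sum(e * e for e in centered)
    total = 0.0
    for k in range(1, max_lag + 1):
        # Sample autocorrelation of the residuals at lag k.
        r_k = sum(centered[t] * centered[t - k] for t in range(k, n)) / denom
        total += r_k ** 2 / (n - k)
    return n * (n + 2) * total


# Hypothetical, strongly alternating residuals (negative autocorrelation).
resid = [1.0, -1.0, 1.0, -1.0]
dw = durbin_watson(resid)
q1 = ljung_box_q(resid, max_lag=1)
print(dw, q1)
```

For this toy series the Durbin–Watson value is 3, well above 2, which flags negative autocorrelation; the reported value of 2.02 for the fitted model sits almost exactly at the no-autocorrelation benchmark.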

The findings show that when per capita GDP ($Δ \ln (GDP)_{t - 2}$) at time (t − 2) changes by about 1%, CO₂ emissions ($Δ \ln (CO_{2})_{t}$) change in the same direction by 6.78% at a confidence level of 99%. When population growth ($Δ \ln (Population)_{t - 1}$) changes by about 1%, CO₂ emissions ($Δ \ln (CO_{2})_{t}$) change in the same direction by 2.33% at a confidence level of 95%. When the urbanization rate ($Δ \ln (URT)_{t - 1}$) changes by about 1%, CO₂ emissions ($Δ \ln (CO_{2})_{t}$) change in the same direction by 5.45% at a confidence level of 99%. When the industrial structure ($Δ \ln (IST)_{t - 1}$) changes by about 1%, CO₂ emissions ($Δ \ln (CO_{2})_{t}$) change in the same direction by 4.62% at a confidence level of 99%. When total coal consumption ($Δ \ln (CCT)_{t - 2}$) changes by about 1%, CO₂ emissions ($Δ \ln (CO_{2})_{t}$) change in the same direction by 3.15% at a confidence level of 99%. When total exports and imports ($Δ \ln (X - E)_{t - 2}$) change by about 1%, CO₂ emissions ($Δ \ln (CO_{2})_{t}$) change in the same direction by 6.40% at a confidence level of 99%. Likewise, when the oil price ($Δ \ln (OP)_{t - 3}$) changes by about 1%, CO₂ emissions ($Δ \ln (CO_{2})_{t}$) change in the same direction by 6.55% at a confidence level of 99%. In the case of oil prices, although the oil price has climbed, energy consumption is also increasing, which results in increased CO₂ emissions. This is because oil does not behave as a product that follows the law of demand.

For ${ECM}_{t - 1}$, the coefficient value is −3.87, so the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model adjusts toward the equilibrium at a rate of 3.87%.

We have compared the efficiency of the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model with that of older models by using MAPE and RMSE. The comparison between the new model and the older ones (multiple regression, GM (1,1), ANN, ARMA, ARIMA, and GM-ARIMA) is as follows.

Table 4 shows that the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model has the lowest MAPE and RMSE, equal to 1.01% and 0.93%, respectively. The GM-ARIMA model shows an MAPE and RMSE of 4.27% and 3.45%, respectively. The ARIMA model shows an MAPE and RMSE of 5.38% and 5.85%, respectively, while the ARMA model has an MAPE and RMSE of 10.18% and 11.36%, respectively. The ANN model produces an MAPE and RMSE of 12.55% and 13.65%, respectively, whereas the GM (1,1) model has an MAPE and RMSE of 12.94% and 17.39%, respectively. Lastly, the multiple regression model generates an MAPE and RMSE of 20.05% and 19.49%, respectively. Compared with the other models, the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model is found to be an efficient model that is suitable for future long-term forecasting.
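The comparison can be reproduced by ranking the models on the figures quoted in the paragraph above. This is only a bookkeeping sketch: the dictionary simply restates the reported MAPE and RMSE percentages, and the variable names are chosen for illustration.

```python
# (MAPE %, RMSE %) as reported in the text for Table 4.
SCORES = {
    "LS-ARIMAXi-ECM": (1.01, 0.93),
    "GM-ARIMA": (4.27, 3.45),
    "ARIMA": (5.38, 5.85),
    "ARMA": (10.18, 11.36),
    "ANN": (12.55, 13.65),
    "GM (1,1)": (12.94, 17.39),
    "Multiple regression": (20.05, 19.49),
}

# Rank by each criterion separately; lower is better for both.
by_mape = sorted(SCORES, key=lambda name: SCORES[name][0])
by_rmse = sorted(SCORES, key=lambda name: SCORES[name][1])

for rank, name in enumerate(by_mape, start=1):
    mape_pct, rmse_pct = SCORES[name]
    print(f"{rank}. {name}: MAPE {mape_pct:.2f}%, RMSE {rmse_pct:.2f}%")
```

The two criteria agree on the top four models and disagree only near the bottom, where multiple regression has the highest MAPE and RMSE overall.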

4.3. CO₂ Emission Forecasting Based on the LS-ARIMAXi $(2, 1, 1, X_{t - 1})$ -ECM Model

Once the most suitable forecasting model, the LS-ARIMAX_i $(2, 1, 1, X_{t - 1})$-ECM model, is obtained, we use it to predict and estimate the carbon dioxide emissions in Thailand’s construction sector for a duration of 20 years (2019–2038), as shown in Figure 3.

Figure 3 shows that CO₂ emissions in Thailand’s construction sector will increase over the next 20 years, from 2019 to 2038, with a growth rate of 37.88%. In 2019, the CO₂ emissions are projected to be 43.31 Mt CO₂ Eq, followed by a continuous increase. By 2038, the CO₂ emissions are forecast to be 59.72 Mt CO₂ Eq. These results indicate that the construction sector emits CO₂ continuously, resulting in a continuous rise in greenhouse gas emissions.
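The growth figure can be checked from the two endpoint values quoted above. The sketch also derives an implied average annual growth rate, which is an added illustration under the assumption of 19 annual steps between 2019 and 2038; it is not a number reported in the study.

```python
start_year, end_year = 2019, 2038
start_value, end_value = 43.31, 59.72   # Mt CO2 Eq, as reported in the text

# Total growth over the forecast horizon, in percent.
total_growth_pct = (end_value - start_value) / start_value * 100

# Implied compound annual growth rate (assumption: 19 annual steps).
years = end_year - start_year
annual_growth_pct = ((end_value / start_value) ** (1 / years) - 1) * 100

print(f"Total growth: {total_growth_pct:.2f}%")
print(f"Implied average annual growth: {annual_growth_pct:.2f}%")
```

The total comes out at roughly 37.9%, which agrees with the reported 37.88% up to rounding of the endpoint values.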

5. Discussion

The result of this study is the establishment of the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model, which is built and used to forecast CO₂ emissions in the construction sector in Thailand for 20 years in total (2019–2038). For this model, only causal factors that are stationary at the same level are selected, so the model is not spurious. The model efficiency is evaluated by comparing its performance with that of older models, namely multiple regression, the grey model (GM (1,1)), ANN, ARMA, ARIMA, and GM-ARIMA. The evaluation reaffirms that the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model is more efficient and more appropriate for long-term prediction than the other existing models. The established model projects a continuous increase in CO₂ emissions at a growth rate of 37.88% over 2019–2038, from 43.31 Mt CO₂ Eq in 2019 to 59.72 Mt CO₂ Eq in 2038. This suggests that Thailand has to take serious action in policy planning as well as in follow-up evaluations in the construction sector. In the meantime, Thailand has to develop other sustainability policies in line with existing policies. This study differs from previous studies in that it builds a new LS-ARIMAXi-ECM model based on the concept of the ARIMA model coupled with co-integration testing. The LS-ARIMAXi-ECM model is built with advanced statistics. Only the causal yet exogenous factors are integrated, while an error correction mechanism has been incorporated to clearly determine the magnitude of equilibrium adjustment in both the short and long term. The unique feature of the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model is that it can be applied to other sectors and areas. The model is not spurious, as it is free from heteroskedasticity, multicollinearity, and autocorrelation. This allows the model to determine the magnitude of change in CO₂ emissions more accurately than other existing models. Hence, it supports decision-making and long-term planning in Thailand in the future.

The review of the literature shows that this study is related to past research on model applications in CO₂ emission forecasting. Zhao, Zhao, and Guo [9] used GM (1,1) optimized by MFO with a rolling mechanism to forecast the electricity consumption of Inner Mongolia; Chang, Sun, and Gu [13] forecast energy CO₂ emissions using a quantum harmony search algorithm-based DMSFE combination model; Zeng, Xu, Wang, Chen, and Li [14] forecast the allocative efficiency of carbon emission allowance financial assets in China at the provincial level in 2020; Liang, Niu, Wang, and Chen [15] carried out an assessment analysis and forecast for the security early warning of energy consumption carbon emissions in Hebei Province, China; Li, Yang, and Li [30] forecast China’s coal power installed capacity using a comparison of MGM, ARIMA, GM-ARIMA, and NMGM models.

However, this paper differs from the existing literature in terms of its modeling process, application capability, appropriateness assessment of the time period, prediction quality, and usage. This research aims to forecast energy-related carbon dioxide emissions in Thailand’s construction sector for 20 years (2019–2038), and the forecast is constructed based on advanced research methodologies, high-quality statistics, and a detailed research process. In the past, many studies have focused on research findings rather than the research process, and some errors and potential risks occurred as a result. In our view, this study is better and more efficient than previous studies in the field. It also responds to a long-term need for a model whose capacity is improved for future application in different contexts.

In the selection of software for use in this research, we decided to use the EVIEWS 9.2 software as a research tool to optimize the advanced statistics effectively. As for those who are interested in the software, EVIEWS can be downloaded in a student version at no cost or license fee, or you may choose other software as you see fit.

Regarding the limitations of this study, some factors of the sustainable development policy are not taken into account, including oil prices. This is because the Thai government has a policy to ensure diesel prices, and that is a major factor affecting energy consumption in Thailand. With government interference, the price of diesel fuel does not fluctuate in line with market mechanisms. Due to this phenomenon, this study is not able to include that factor, as it does not determine the real magnitude of the change in diesel prices on CO₂ emissions.

6. Conclusions

This paper has developed and established the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$) model for forecasting the future trends of CO₂ emissions in the construction sector of Thailand for the next 20 years (2019–2038). This model is able to effectively and efficiently support sustainable development policy planning in Thailand. Most importantly, it can reduce errors in planning so as to avoid the mistakes of the past. The model is developed through careful research methods with rigorous statistical use of data. We have chosen 8 variables from the causal factors: carbon dioxide emissions $(CO_{2})$, per capita GDP $(GDP)$, population growth $(Population)$, urbanization rate $(URT)$, industrial structure $(IST)$, total coal consumption $(CCT)$, oil price $(OP)$, and total exports and imports $(X - E)$. All of the variables are assessed by the unit root test at the first difference and analyzed using the co-integration test, resulting in the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$) model. Additionally, they are tested for a proper time period (t − i). The model is found to be free from heteroskedasticity, multicollinearity, and autocorrelation; it is therefore a suitable model for forecasting CO₂ emissions in Thailand’s construction sector, and it is available for future applications in other sectors and contexts, both in Thailand and in other countries.

One remaining aspect to reflect upon is that the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$) model has considered not only the stationary causal factors on the same level, but also proportion and relationship analysis. In addition to this, each factor of the model is analyzed based on rationality and the structure equation model (SEM) so as to increase its utilization and optimization for future research and policy planning.

As a recommendation for applying this research, the model should be adapted for the context of each sector and area. In particular, factors have to be stationary and influential over dependent variables. At the same time, they must have a co-integration at the same level to avoid the model being spurious and to decrease errors. Also, they must undergo an assessment of the appropriateness of their time period in order to produce the most accurate prediction result.

Author Contributions

J.S. and K.K. were involved in the data collection and preprocessing phase, model constructing, empirical research, results analysis and discussion, and manuscript preparation. All authors have approved the submitted manuscript.

Funding

This research received no external funding.

Acknowledgments

This work was performed with the approval of King Mongkut’s University of Technology Thonburi.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Office of the National Economic and Social Development Board (NESDB). Available online: http://www.nesdb.go.th/nesdb_en/more_news.php?cid=154&filename=index (accessed on 1 August 2018).
2. National Statistic Office Ministry of Information and Communication Technology. Available online: http://web.nso.go.th/index.htm (accessed on 2 August 2018).
3. Department of Alternative Energy Development and Efficiency. Available online: http://www.dede.go.th/ewtadmin/ewt/dede_web/ewt_news.php?nid=47140 (accessed on 3 August 2018).
4. Thailand Greenhouse Gas Management Organization (Public Organization). Available online: http://www.tgo.or.th/2015/thai/content.php?s1=7&s2=16&sub3=sub3 (accessed on 3 August 2018).
5. Dickey, D.A.; Fuller, W.A. Likelihood ratio statistics for autoregressive time series with a unit root. Econometrica 1981, 49, 1057–1072.
6. Johansen, S.; Juselius, K. Maximum likelihood estimation and inference on cointegration with applications to the demand for money. Oxford Bull. Econ. Stat. 1990, 52, 169–210.
7. Ardakani, F.J.; Ardehali, M.M. Long-term electrical energy consumption forecasting for developing and developed economies based on different optimized models and historical data types. Energy 2014, 65, 452–461.
8. Azadeh, A.; Ghaderi, S.F.; Sheikhalishahi, M.; Nokhandan, B.P. Optimization of short load forecasting in electricity market of Iran using artificial neural networks. Opt. Eng. 2014, 15, 485–508.
9. Zhao, H.; Zhao, H.; Guo, S. Using GM (1,1) Optimized by MFO with rolling mechanism to forecast the electricity consumption of Inner Mongolia. Appl. Sci. 2016, 6, 20.
10. Meng, M.; Niu, D.; Sun, W. Forecasting monthly electric energy consumption using feature extraction. Energies 2011, 4, 1495–1507.
11. Hasanov, F.J.; Hunt, L.C.; Mikayilov, C.I. Modeling and forecasting electricity demand in Azerbaijan using cointegration techniques. Energies 2016, 9, 1045.
12. Khairalla, M.A.; Ning, X.; AL-Jallad, N.T.; El-Faroug, M.O. Short-term forecasting for energy consumption through stacking heterogeneous ensemble learning model. Energies 2018, 11, 1605.
13. Chang, H.; Sun, W.; Gu, X. Forecasting energy CO₂ emissions using a quantum harmony search algorithm-based DMSFE combination model. Energies 2013, 6, 1456–1477.
14. Zeng, S.; Xu, Y.; Wang, L.; Chen, J.; Li, Q. Forecasting the allocative efficiency of carbon emission allowance financial assets in China at the provincial level in 2020. Energies 2016, 9, 329.
15. Liang, Y.; Niu, D.; Wang, H.; Chen, H. Assessment analysis and forecasting for security early warning of energy consumption carbon emissions in Hebei Province, China. Energies 2017, 10, 391.
16. Prakash, A.K.; Xu, S.; Rajagopal, R.; Noh, H.Y. Robust Building energy load forecasting using physically-based kernel models. Energies 2018, 11, 862.
17. Mehedintu, A.; Sterpu, M.; Soava, G. Estimation and forecasts for the share of renewable energy consumption in final energy consumption by 2020 in the European Union. Sustainability 2018, 10, 1515.
18. Liang, Y.; Niu, D.; Cao, Y.; Hong, W.C. Analysis and modeling for China’s electricity demand forecasting using a hybrid method based on multiple regression and extreme learning machine: A view from carbon emission. Energies 2016, 9, 941.
19. Zhai, S.; Wang, Z. The prediction of carbon emissions demands in India under the balance economic growth path. Smart Grid Renew. Energy 2012, 3, 186–193.
20. Zeng, S.; Chen, J. Forecasting the allocation ratio of carbon emission allowance currency for 2020 and 2030 in China. Sustainability 2016, 8, 650.
21. Zhou, J.; Yu, X.; Guang, F.; Li, W. Analyzing and predicting CO₂ emissions in China based on the LMDI and GA-SVM model. Pol. J. Environ. Stud. 2018, 27, 927–938.
22. Xu, W.; Gu, R.; Liu, Y.; Dai, Y. Forecasting energy consumption using a new GM–ARMA model based on HP filter: The case of Guangdong Province of China. Econ. Modell. 2015, 45, 127–135.
23. Zhao, E.; Zhao, J.; Liu, L.; Su, Z.; An, N. Hybrid wind speed prediction based on a self-adaptive ARIMAX model with an exogenous WRF simulation. Energies 2016, 9, 7.
24. Souza, D.; Christo, E.; Almeida, A. Location of faults in power transmission lines using the ARIMA method. Energies 2017, 10, 1596.
25. Farias, R.L.; Puig, V.; Rangel, H.R.; Flores, J.J. Multi-model prediction for demand forecast in water distribution networks. Energies 2018, 11, 660.
26. Chen, L.; Xu, L.; Zhou, Y. Novel approach for lithium-ion battery on-line remaining useful life prediction based on permutation entropy. Energies 2018, 11, 820.
27. Yang, E.; Park, H.W.; Choi, Y.H.; Kim, J.; Munkhdalai, L.; Musa, I.; Ryu, K.H. A simulation-based study on the comparison of statistical and time series forecasting methods for early detection of infectious disease outbreaks. Int. J. Environ. Res. Public Health 2018, 15, 2178.
28. Kahsai, M.S.; Nondo, C.; Schaeffer, P.V.; Gebremedhin, T.G. Does level of income matter in the energy consumption and GDP Nexus: Evidence from Sub-Saharan African countries. Energy Econ. 2012, 34, 739–746.
29. Xin, J.; Zhou, J.; Yang, S.X.; Li, X.; Wang, Y. Bridge structure deformation prediction based on GNSS data using Kalman-ARIMA-GARCH model. Sensors 2018, 18, 298.
30. Li, S.; Yang, X.; Li, R. Forecasting China’s coal power installed capacity: A comparison of MGM, ARIMA, GM-ARIMA, and NMGM models. Sustainability 2018, 10, 506.
31. Kurecic, P.; Kokotovic, F. The relevance of political stability on FDI: A VAR analysis and ARDL models for selected small, developed, and instability threatened economies. Economies 2017, 5, 22.
32. Li, R.; Su, M. The role of natural gas and renewable energy in curbing carbon emission: Case study of the United States. Sustainability 2017, 9, 600.
33. Dai, S.; Niu, D.; Han, Y. Forecasting of Energy-Related CO₂ emission in China based on GM (1,1) and Least Squared Support Leaping Vector Machine Optimized by Modified Shuffled Frog Leaping Algorithm for Sustainability. Sustainability 2018, 10, 958.
34. Jiang, F.; Yang, X.; Li, S. Comparison of Forecasting India’s energy demand using an MGM, ARIMA model, MGM-ARIMA model, and BP Neural Network model. Sustainability 2018, 10, 2225.
35. Johansen, S. Likelihood-Based Inference in Cointegrated Vector Autoregressive Models; Oxford University Press: New York, NY, USA, 1995.
36. MacKinnon, J. Critical Values for Cointegration Test in Long-Run Economic Relationships; Engle, R., Granger, C., Eds.; Oxford University Press: Oxford, UK, 1991.
37. Sims, C.A. Macroeconomics and Reality. Econometrica 1980, 48, 1–48.
38. Enders, W. Applied Econometrics Time Series; Wiley Series in Probability and Statistics; University of Alabama: Tuscaloosa, AL, USA, 2010.
39. Harvey, A.C. Forecasting, Structural Time Series Models and the Kalman Filter; Cambridge University Press: Cambridge, UK, 1989.

Figure 1. The flowchart of the long- and short-term auto-regressive (AR), integrated (I), moving average (MA) with exogenous variables (Xi) and error correction mechanism (LS-ARIMAXi ($p, d, q, X_{t - i}$)-ECM) model.

Figure 2. The correlogram of the residual error of the LS-ARIMAX_i ($2, 1, 1, X_{t - i}$)-ECM model.

Note: AC is the autocorrelation coefficient, and PAC is the partial autocorrelation coefficient.

Figure 3. The forecasting results of CO₂ emission from 2019 to 2038 in Thailand’s construction sector.

Table 1. Unit root test at level I(0) and first difference I(1).

Tau Test at Level I(0)		Tau Test at First Difference I(1)		MacKinnon Critical Value
Variables	Value	Variables	Value	1%	5%	10%
$\ln(CO_{2})$	−2.21	$Δ \ln(CO_{2})$	−4.69 ***	−4.37	−3.05	−2.95
$\ln(GDP)$	−2.74	$Δ \ln(GDP)$	−5.95 ***	−4.37	−3.05	−2.95
$\ln(Population)$	−2.13	$Δ \ln(Population)$	−4.44 ***	−4.37	−3.05	−2.95
$\ln(URT)$	−2.59	$Δ \ln(URT)$	−5.61 ***	−4.37	−3.05	−2.95
$\ln(IST)$	−2.77	$Δ \ln(IST)$	−5.74 ***	−4.37	−3.05	−2.95
$\ln(CCT)$	−2.40	$Δ \ln(CCT)$	−5.45 ***	−4.37	−3.05	−2.95
$\ln(X - E)$	−2.90	$Δ \ln(X - E)$	−5.59 ***	−4.37	−3.05	−2.95
$\ln(OP)$	−2.83	$Δ \ln(OP)$	−5.61 ***	−4.37	−3.05	−2.95

Note: *** denotes significance at $α = 0.01$, based on comparing the Tau test statistic with the MacKinnon critical value; $Δ$ is the first difference, and $\ln$ is the natural logarithm.
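
The comparison described in the note can be sketched in a few lines of Python. This is only an illustration of the decision rule (reject the unit-root hypothesis when the Tau statistic is more negative than the critical value), using the values printed in Table 1. The function names `rejects_unit_root` and `integration_order` are hypothetical helpers, not part of the original study.

```python
# Illustrative sketch: Tau statistics and MacKinnon critical values are
# copied from Table 1; the helper names below are assumptions.
CRITICAL = {"1%": -4.37, "5%": -3.05, "10%": -2.95}

# variable -> (Tau at level, Tau at first difference)
TAU = {
    "ln(CO2)": (-2.21, -4.69),
    "ln(GDP)": (-2.74, -5.95),
    "ln(Population)": (-2.13, -4.44),
    "ln(URT)": (-2.59, -5.61),
    "ln(IST)": (-2.77, -5.74),
    "ln(CCT)": (-2.40, -5.45),
    "ln(X-E)": (-2.90, -5.59),
    "ln(OP)": (-2.83, -5.61),
}


def rejects_unit_root(tau, level="1%"):
    """True when the Tau statistic is more negative than the critical value."""
    return tau < CRITICAL[level]


def integration_order(tau_level, tau_diff, level="1%"):
    """Return 0 if stationary at level, 1 if stationary after one difference,
    and None if neither test rejects the unit-root hypothesis."""
    if rejects_unit_root(tau_level, level):
        return 0
    if rejects_unit_root(tau_diff, level):
        return 1
    return None


orders = {name: integration_order(*pair) for name, pair in TAU.items()}
for name, order in orders.items():
    print(f"{name}: I({order})")
```

With the Table 1 values, no series rejects the unit-root hypothesis at level, and every series rejects it after first differencing, which is what the *** markers in the table indicate.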

Table 2. Co-integration test based on the Johansen and Juselius approach.

				1%	5%
$Δ \ln(CO_{2})$ , $Δ \ln(GDP)$ , $Δ \ln(Population)$ , $Δ \ln(URT)$ , $Δ \ln(IST)$ , $Δ \ln(CCT)$ , $Δ \ln(X - E)$ , $Δ \ln(OP)$	None ***	275.41	141.25	20.15	15.05	I(1)
	At Most 1 ***	82.45	96.05	5.25	3.40	I(1)

*** denotes significance at α = 0.01.

Table 3. The results of the LS-ARIMAX_i ( $2, 1, 1, X_{t - i}$ )-ECM model.

Independent Variables	Dependent Variable $Δ \ln(CO_{2})_{t}$
$Δ \ln(CO_{2})_{t - 1}$	2.01 **
$Δ \ln(CO_{2})_{t - 2}$	3.75 ***
$MA(1)$	2.72 ***
$Δ \ln(GDP)_{t - 2}$	6.78 ***
$Δ \ln(Population)_{t - 1}$	2.33 **
$Δ \ln(URT)_{t - 1}$	5.45 ***
$Δ \ln(IST)_{t - 1}$	4.62 ***
$Δ \ln(CCT)_{t - 2}$	3.15 ***
$Δ \ln(X - E)_{t - 2}$	6.40 ***
$Δ \ln(OP)_{t - 3}$	6.55 ***
$ECM_{t - 1}$	−3.87 ***

Note: In the above, $Δ \ln(CO_{2})_{t - 1}$ and $Δ \ln(CO_{2})_{t - 2}$ are the autoregressive terms, $MA$ is the moving average term, *** denotes significance at $α = 0.01$, ** denotes significance at $α = 0.05$, R-squared is 0.94, adjusted R-squared is 0.93, the Durbin–Watson statistic is 2.02, the F-statistic is 245.05 (probability is 0.00), the ARCH test is 31.05 (probability is 0.1), the LM test is 1.45 (probability is 0.11), and the chi-square test represents the significance.

Table 4. Forecasting performance of the compared models. MAPE: mean absolute percentage error; RMSE: root mean square error.

Forecasting Model	MAPE (%)	RMSE (%)
Multiple Regression model	20.05	19.49
Grey model (GM (1,1))	12.94	17.39
Artificial Neural Network (ANN) model	12.55	13.65
Autoregressive Moving Average (ARMA) model	10.18	11.36
Autoregressive Integrated Moving Average (ARIMA) model	5.38	5.85
GM-ARIMA Model	4.27	3.45
LS-ARIMAX_i ( $2, 1, 1, X_{t - i}$ )-ECM	1.01	0.93
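
For readers who want to reproduce this kind of comparison, the two error measures in Table 4 can be computed as in the minimal sketch below. It uses the standard textbook definitions of MAPE and RMSE; the exact normalisation behind the "RMSE (%)" column is not restated here, so treat the `rmse` function as an assumption. The sample numbers are hypothetical and are not data from the study.

```python
import math


def mape(actual, forecast):
    """Mean absolute percentage error, in percent.
    Assumes no actual value is zero."""
    if len(actual) != len(forecast) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs((a - f) / a) for a, f in zip(actual, forecast))
    return 100.0 * total / len(actual)


def rmse(actual, forecast):
    """Root mean square error, in the same units as the inputs."""
    if len(actual) != len(forecast) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum((a - f) ** 2 for a, f in zip(actual, forecast))
    return math.sqrt(total / len(actual))


# Hypothetical observed and forecast values, for illustration only.
observed = [100.0, 200.0]
predicted = [110.0, 190.0]

print(f"MAPE = {mape(observed, predicted):.2f}%")
print(f"RMSE = {rmse(observed, predicted):.2f}")
```

Lower values on both measures indicate a closer fit, which is the basis on which Table 4 ranks the models.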

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

