Next Article in Journal
The Economic Valuation of Change in the Quality of Rural Tourism Resources: Choice Experiment Approaches
Previous Article in Journal
Building Customer Loyalty in Rural Destinations as a Pre-Condition of Sustainable Competitiveness
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Forecasting of Energy-Related CO2 Emissions in China Based on GM(1,1) and Least Squares Support Vector Machine Optimized by Modified Shuffled Frog Leaping Algorithm for Sustainability

1
School of Economics and Management, North China Electric Power University, Beijing 102206, China
2
Beijing Key Laboratory of New Energy and Low-Carbon Development, North China Electric Power University, Beijing 102206, China
*
Author to whom correspondence should be addressed.
Sustainability 2018, 10(4), 958; https://doi.org/10.3390/su10040958
Submission received: 26 February 2018 / Revised: 22 March 2018 / Accepted: 25 March 2018 / Published: 26 March 2018

Abstract

:
Presently, China is the largest CO2 emitting country in the world, which accounts for 28% of the CO2 emissions globally. China’s CO2 emission reduction has a direct impact on global trends. Therefore, accurate forecasting of CO2 emissions is crucial to China’s emission reduction policy formulating and global action on climate change. In order to forecast the CO2 emissions in China accurately, considering population, the CO2 emission forecasting model using GM(1,1) (Grey Model) and least squares support vector machine (LSSVM) optimized by the modified shuffled frog leaping algorithm (MSFLA) (MSFLA-LSSVM) is put forward in this paper. First of all, considering population, per capita GDP, urbanization rate, industrial structure, energy consumption structure, energy intensity, total coal consumption, carbon emission intensity, total imports and exports and other influencing factors of CO2 emissions, the main driving factors are screened according to the sorting of grey correlation degrees to realize feature dimension reduction. Then, the GM(1,1) model is used to forecast the main influencing factors of CO2 emissions. Finally, taking the forecasting value of the CO2 emissions influencing factors as the model input, the MSFLA-LSSVM model is adopted to forecast the CO2 emissions in China from 2018 to 2025.

1. Introduction

With the continuous progress of human society and global industrialization, the consumption of fossil fuels is getting faster and faster, resulting in serious environmental problems, such as the greenhouse effect. CO2 is one of the main gases that cause the greenhouse effect. Carbon capture and storage is the process of capturing and storing the CO2 in industrial production by various means, such as cryogenic carbon capture [1,2], chemical absorption [3], oxyfuel combustion [4] and so on. Carbon capture technologies that result in significant reduction in energy-related CO2 emissions, specifically from coal-fired power plants, can relieve the greenhouse effect. The economic efficiency and feasibility of these carbon capture technologies are affected by CO2 emission forecasting. Presently, China is the largest CO2 emitting country in the world, which accounts for 28% of the CO2 emissions globally. China’s CO2 emission reduction has a direct impact on global trends. Therefore, accurate forecasting of CO2 emissions is crucial to China’s emission reduction policy formulating and global action on climate change.
Currently, scholars have put forward various models for CO2 emission forecasting, such as multiple linear regression [5,6,7], system dynamics [8,9,10] and grey model [11,12]. Matjafri and Lim [13] proposed the forecasting model to predict the CO2 emissions in Malaysia. They removed irrelevant data by the best subset method and built the prediction model of multi linear regression. Zhong [14] forecasted CO2 emissions and energy demand with the system dynamics approach through mode analyzing of energy consumption and population and economic development and proposed suggestions for energy development. The grey forecasting model was used by Lin et al. [15] to estimate CO2 emissions from 2010 to 2012 in Taiwan. The forecasting results showed that the mean residual error of the GM(1,1) was less than 10%, and CO2 emissions in Taiwan over the next 3 years would increase.
Recently, with the continuous development of artificial intelligence technology, more and more intelligent forecasting models are applied to the field of prediction, which is also suitable for the forecasting of CO2 emissions. The neural network model [16,17,18] and support vector machine [19,20] are two widely used artificial intelligence forecasting models. Gallo et al. [21] put forward the artificial neural network model which had a more flexible system to predict short-term CO2 emissions instead of classical methods. Combined with the rough set and the grey system model and support vector machine, Zhou and Zhang [22] proposed the model to forecast the CO2 emissions in China using the data of the population, gross domestic product (GDP) and total energy consumption.
Compared with the neural network algorithm, SVM has superior generalization, which can better refrain from the local optimum by parameter optimization. The least squares support vector machine (LSSVM) [23,24,25] is an improved algorithm based on equality constraint and least squares value function for standard SVM. LSSVM uses the quadratic loss function to transform the secondary optimization of SVM algorithm into the solution of linear equation, which has better convergence accuracy and a faster training speed. Therefore, LSSVM is used to forecast CO2 emissions in this article. At present, for the parameter optimization of LSSVM, researchers have proposed various optimization algorithms, such as PSO (particle swarm optimization) [26,27], GA (genetic algorithm) [28,29], and ABC (artificial bee colony) [30,31,32]. Wang et al. [33] proposed a short-term load prediction model based on LSSVM optimized by improved parallel PSO to improve the accuracy of load forecasting. Wen et al. [34] proposed the LSSVM model with genetic algorithm to forecast landslide displacement and used a case study to prove that the GA-LSSVM model was effective for the prediction of landslide displacement. Mustaffa and Yusof [35] put forward the LSSVM model to forecast gold price, and the artificial bee colony intelligent technology was used to obtain the ideal value of parameters of LSSVM. Through empirical analysis, it was proved that the ABC-LSSVM model was a promising approach for financial forecasting. In this paper, shuffled frog leaping algorithm (SFLA) is used to realize the optimization of LSSVM. SFLA simulates the behavior of information exchange in each frog group for food searching. It is a new heuristic colony evolution algorithm, which has efficient calculated performance and excellent global search ability [36,37,38,39].
In this article, for forecasting the CO2 emissions in China more accurately, a novel forecasting model is proposed based on the main driving factors of CO2 emissions. The major innovations of this paper are as follows:
(1)
In this paper, the CO2 emissions forecasting model based on GM(1,1) and LSSVM optimized by MSFLA (MSFLA-LSSVM) are put forward. First of all, the GM(1,1) model is used to forecast the main influencing factors of CO2 emissions. Then, the MSFLA-LSSVM model is adopted to forecast the CO2 emissions taking the forecasting value of the CO2 emissions influencing factors as the model input. Finally, through empirical analysis, it is verified that the MSFLA-LSSVM model has strong generalization ability and robustness for CO2 emission forecasting and the forecasting accuracy of MSFLA-LSSVM is better than that of SFLA-LSSVM, LSSVM and BP (back propagation) neural network models, which can achieve good forecasting results.
(2)
The forecasting accuracy of CO2 emissions is affected by many factors. Considering population, per capita GDP, urbanization rate, industrial structure, energy consumption structure, energy intensity, total coal consumption, carbon emissions intensity, total imports and exports and other influencing factors of CO2 emissions, the main driving factors of CO2 emissions are screened as the model input according to the sorting of grey relational degrees to realize feature dimension reduction.
The main structure and contents of this article are as follows: the second section introduces the forecasting model of GM(1,1) and LSSVM optimized by MSFLA (MSFLA-LSSVM). The third section carries out empirical analysis to prove the practicality and validity of the proposed model for CO2 emissions forecasting, and forecasts the CO2 emissions in China from 2018 to 2025. The fourth section summarizes the full text.

2. The Forecasting Model

2.1. GM(1,1)

The model of GM(1,1) is the most commonly applied grey model, which consists of a first order differential equation containing only one variable. The model is simple to calculate and has obvious advantages for the forecasting of small sample data with irregular distribution [11,12]. The specific mathematical model of GM(1,1) is as follows:
Set x ( 0 ) as the original sequence:
x ( 0 ) = [ x ( 0 ) ( 1 ) , x ( 0 ) ( 2 ) , , x ( 0 ) ( n ) ] .
The accumulated generating operation of x ( 0 ) is made, and the sequence x ( 1 ) is obtained:
x ( 1 ) = [ x ( 1 ) ( 1 ) , x ( 1 ) ( 2 ) , , x ( 1 ) ( n ) ] .
where x ( 1 ) ( k ) = i = 1 k x ( 0 ) ( i ) , k = 1 , 2 , , n . x ( 1 ) ( k ) satisfies the following first order linear differential equation model:
d x ( 1 ) d t + a x ( 1 ) = u .
In the equation, a and u are parameters to be estimated. According to the definition of derivative:
d x ( 1 ) d t = lim Δ t o x ( 1 ) ( t + Δ t ) x ( 1 ) ( t ) Δ t .
If expressed in the discrete form, the differential term can be written as:
Δ x ( 1 ) Δ t = x ( 1 ) ( k + 1 ) x ( 1 ) ( k ) k + 1 k = x ( 1 ) ( k + 1 ) x ( 1 ) ( k ) = α ( 1 ) [ x ( 1 ) ( k + 1 ) ] = x ( 0 ) ( k + 1 ) .
where x ( 1 ) takes the mean of time k and k + 1 , that is 1 2 [ x ( 1 ) ( k + 1 ) + x ( 1 ) ( k ) ] . Therefore, Equation (5) can be rewritten as:
α ( 1 ) [ x ( 1 ) ( k + 1 ) ] + 1 2 a [ x ( 1 ) ( k + 1 ) + x ( 1 ) ( k ) ] = u .
The following equation can be derived:
k = 1 , x ( 0 ) ( 2 ) + 1 2 a [ x ( 1 ) ( 1 ) + x ( 1 ) ( 2 ) ] = u , k = 2 , x ( 0 ) ( 3 ) + 1 2 a [ x ( 1 ) ( 2 ) + x ( 1 ) ( 3 ) ] = u , k = n 1 , x ( 0 ) ( n ) + 1 2 a [ x ( 1 ) ( n ) + x ( 1 ) ( n 1 ) ] = u .
The above results are expressed in matrix form:
( x ( 0 ) ( 2 ) x ( 0 ) ( 3 ) x ( 0 ) ( n ) ) = ( 1 2 [ x ( 1 ) ( 1 ) + x ( 1 ) ( 2 ) ] 1 1 2 [ x ( 1 ) ( 2 ) + x ( 1 ) ( 3 ) ] 1 1 2 [ x ( 1 ) ( n 1 ) + x ( 1 ) ( n ) ] 1 ) ( a u ) .
Equation (8) can be written as:
Y n = B A , Y n = ( x ( 0 ) ( 2 ) x ( 0 ) ( 3 ) x ( 0 ) ( n ) ) , A = ( a u ) , B = ( 1 2 [ x ( 1 ) ( 1 ) + x ( 1 ) ( 2 ) ] 1 1 2 [ x ( 1 ) ( 2 ) + x ( 1 ) ( 3 ) ] 1 1 2 [ x ( 1 ) ( n 1 ) + x ( 1 ) ( n ) ] 1 ) .
Y n and B are known quantities. A is the parameter to be estimated. According to the least squares criterion, Equation (9) can be rewritten as:
Y n = B A ^ + E .
where E is an error term. Let min Y n B A ^ 2 = min ( Y n B A ^ ) T ( Y n B A ^ ) , the following equation can be obtained according to the matrix derivation equation:
A ^ = ( B T B ) 1 B T Y n = ( a ^ u ^ ) .
The parameters in the original differential equation are replaced by a ^ and u ^ :
d x ( 1 ) d t + a ^ x ( 1 ) = u ^ .
x ( 1 ) ( t + 1 ) can be solved by the above equation:
x ( 1 ) ( t + 1 ) = [ x ( 1 ) ( 1 ) u ^ a ^ ] e a ^ t + u ^ a ^ .
x ( 1 ) ( 1 ) = x ( 0 ) ( 1 ) is known. Express Equation (13) as the discrete form and Equation (14) is obtained:
x ( 1 ) ( k + 1 ) = [ x ( 0 ) ( 1 ) u ^ a ^ ] e a ^ k + u ^ a ^ ( k = 0 , 1 , 2 ) .
Equations (13) and (14) are the time response functions of GM(1,1). The cumulative reduction of Equation (14) is made, and the gray forecasting model of the original sequence x ( 0 ) is obtained:
x ^ ( 0 ) ( k + 1 ) = x ^ ( 1 ) ( k + 1 ) x ^ ( 1 ) ( k ) = ( 1 e a ^ ) ( x ( 0 ) ( 1 ) u ^ a ^ ) e a ^ k ( k = 0 , 1 , 2 ) .

2.2. LSSVM

Let N be the sample number and m be the dimension of sample space. For training samples ( x i , y i ) , i = 1 , 2 , , N , there is a nonlinear mapping φ ( x ) , which can map the samples to high dimensional linear space:
f ( x ) = ω φ ( x ) + b .
According to the principle of structural risk minimization, the above problem can be equivalent to quadratic programming problem as follows:
min ω , b , ε J ( ω , ε ) = 1 2 ω T ω + 1 2 ξ i = 1 N ε i 2 . s . t .   y i = ω φ ( x ) + b + ε i , i = 1 , 2 , , N .
In the equation, J ( ω , ε ) is structural risk. ξ is penalty coefficient. ε i is allowable error and ω T ω controls the model generalization ability. The Lagrange method is adopted to solve the optimization problem of Equation (17). According to Karush–Kuhn–Tucker optimization conditions, the following can be obtained:
[ 0 1 N T 1 N θ + 1 ξ I N ] [ b a ] = [ 0 y ] .
In the equation, y = [ y 1 , y 2 , , y N ] T , 1 N = [ 1 , 1 , , 1 ] T , 1 N is unit matrix. The kernel function is defined according to the Mercer condition:
K ( x i , x j ) = φ ( x i ) T φ ( x j ) .
The output of the least squares support vector machine can be obtained:
f ( x ) = i = 1 N α i K ( x , x i ) + b .

2.3. Modified Shuffled Frog Leaping Algorithm

2.3.1. SFLA

The mathematical modeling process of SFLA is as follows:
Generate frog populations S randomly. The frog number in the population is N , S = ( X 1 , X 2 , , X N ) . If the dimension of the candidate solution X i is t , the j candidate solution can be expressed as X j = ( x j 1 , x j 2 , , x j t ) , 0 j N . The fitness value f ( X i ) of each frog is calculated, and frog individuals (candidate solutions) are ranked in descending order according to fitness values. For population, the sub-populations number is m , the number of frogs in each sub-population is K , and N = m K is satisfied. The first candidate solution is assigned to the first sub-population, the second candidate solution is assigned to the second sub-population, and so on. For each sub-population, the optimal and worst candidate solutions of sub-populations are recorded as X b and X ω respectively, and the optimal candidate solution in the population is recorded as X m . The sub-population search is to update X ω of each sub-population. The update equation is as follows:
D = r a n d ( X b X ω ) .
X n e w = X ω + D .
D min D D max .
In the Equation (21), r a n d is the random number in [0, 1]; D min and D max are the minimum and maximum distances that allow the frog to move respectively. If X n e w is better than X ω , replace X ω with X n e w , otherwise replace X b in Equation (21) with X m . If the new solution is still not better than X ω , a random X n e w is needed to replace X ω .
Repeat the above steps until the maximum number of search times within the sub-population is reached. When solutions within all sub-populations are updated, sub-population division and sub-population search are carried out again. Repeat the whole process above until reaching the maximum number of iterations.

2.3.2. MSFLA

For SFLA, using the balance strategies of global information exchange and local depth search can avoid falling into local extremes in the early stage of evolution. However, it is easy to fall into local optimum in the later stage of evolution. In order to solve the above problem, this paper determines whether the population has a precocious convergence based on the idea of population fitness variance. The global optimum of the precocious convergent population is slightly perturbed so that it can jump out of the local optimum.
Define population fitness variance as follows:
u 2 = i = 1 n ( f i f a v g f ) 2 .
In the equation, u 2 is population fitness variance of sub-population; n is the number of sub-populations; f i is the fitness value of frog i ; f a v g is the average fitness value of the current population; f is normalization factor and is calculated as follows:
f = { max | f i f a v g | ,   max | f i f a v g | > 1 , 1 ,   max | f i f a v g | 1 .
Let L = | u i 2 u i 1 2 | ( i > 1 ) , when L > a , the population falls into the local optimal. Among them, u i 2 is the fitness variance of i generation sub-population, and u i 1 2 is the fitness variance of i 1 generation sub-population.
When the population falls into local optimal, the global optimal solution of the population is slightly perturbed for jumping out of the local optimum. The perturbation equation is as follows:
X m * = X m + X m P ( r a n d ( ) 0.5 ) .
In the equation, X m is the global optimal solution of the population, P is perturbation coefficient, r a n d ( ) is the random number in [0, 1].
In this case, the range of X m * is ( 1 0.5 P ) X m < X m * < ( 1 + 0.5 P ) X m . The range not only ensures that the algorithm is not distorted due to excessive perturbation, but also makes the current population jump out of the local optimum to some extent.

2.4. MSFLA-LSSVM

In this article, we choose RBF (radial basis function) as the kernel function of the LSSVM model. Parameters to be optimized in this model are regularization parameters and the kernel function width. MSFLA is used to optimize the above two parameters. The specific optimization process is as follows:
Step 1:
Collect and preprocess data;
Step 2:
Set parameters of the algorithm and initialize the population;
Step 3:
Calculate and sort fitness values of individuals, perform the sub-population division operation;
Step 4:
Determine the optimal solution, the worst solution and the global optimal solution of each sub-population. According to the update strategy, the worst frog individuals in sub-populations are updated repeatedly until the maximum of iterations of the sub-population is reached;
Step 5:
Calculate the population fitness variance and determine whether the algorithm is lost in the local optimum. If the population falls into the local optimum, perturb and update the global optimal solution;
Step 6:
Judge the termination condition of the algorithm. When the global maximum of iterations is reached, the calculation is terminated and the optimal solution is output; otherwise, mix all frog individuals and turn to Step 3;
Step 7:
The optimal solution is substituted into the LSSVM model for forecasting.

2.5. The Forecasting Model Based on GM(1,1) and Least Squares Support Vector Machine Optimized by Modified Shuffled Frog Leaping Algorithm

The forecasting accuracy of CO2 emissions is affected by many factors. In order to forecast CO2 emissions accurately, this paper takes population, per capita GDP, urbanization rate, industrial structure (second industry added value ratio), energy consumption structure, energy intensity, total coal consumption, carbon emissions intensity, total imports and exports as alternative inputs of the model. On this basis, the forecasting model using GM(1,1) and MSFLA-LSSVM is used to forecast CO2 emissions. The steps are as follows:
(1) Data collection and pretreatment
Collect sample data containing CO2 emissions, population, per capita GDP, urbanization rate, industrial structure, energy consumption structure, energy intensity, total coal consumption, carbon emissions intensity and total imports and exports. Then, nondimensional operation of the data is performed, and grey relational degrees between various influencing factors and CO2 emissions are calculated. According to the sorting of grey relational degrees, input indexes of model are screened to realize feature dimension reduction.
(2) Influencing factors forecasting based on the GM(1,1) model
Based on the historical data collected, the GM(1,1) model is used to forecast influencing factors that have been screened out for CO2 emissions.
(3) CO2 emissions forecasting based on MSFLA-LSSVM
The forecasting values of influencing factors of CO2 emissions are used as model inputs, and then the MSFLA-LSSVM model is used to forecast CO2 emissions.
The flow of the forecasting model of GM(1,1) and MSFLA-LSSVM is shown in Figure 1:

3. Empirical Analysis

3.1. Screening of Influencing Factors for Model Input

In this paper, we use the proposed model to forecast CO2 emissions in China. The data of the population, per capita GDP, urbanization rate, industrial structure, energy consumption structure, energy intensity, total coal consumption, carbon emissions intensity, total imports and exports and other influencing factors of CO2 emissions from 1990 to 2016 in China are collected as the candidates for the model input (the data source is the World Bank Database and China Statistical Yearbook).
Because of the excessive influencing factors, we use the grey relational degree to screen the influencing factors for model input to realize feature dimension reduction. The calculation results of the grey relational degrees between the various influencing factors and the CO2 emissions are shown in Table 1.
According to Table 1, we choose the four factors whose grey relational degree is greater than 0.8 as the CO2 emissions forecasting model input. They are per capita GDP, urbanization rate, total coal consumption and total imports and exports. The model output is the CO2 emissions.
The data of the CO2 emissions, per capita GDP, urbanization rate, total coal consumption and total imports and exports from 1990 to 2016 in China are shown in Figure 2.

3.2. Forecasting Effect Test for MSFLA-LSSVM Model

We take the data from 1990 to 2009 as the training set, and the test set is the data from 2010 to 2016. The MSFLA-LSSVM model is used to forecast the CO2 emissions. The model parameters settings are as follows: the frog population is 300; the sub-population is 30; the sub-population search number is 10; the regularization parameter search range of LSSVM is [0.1, 200] and the RBF kernel parameter is [0.01, 20]; the maximum number of iterations is 200. The forecasting results and the residuals are shown in Figure 3.
The relative error of each prediction point is shown in Table 2.
From Figure 3 and Table 2, it can be concluded that using the MSFLA-LSSVM model to forecast CO2 emissions in China can obtain good forecasting results, and the forecasting curve has a good fitting degree with the actual curve.
In order to prove the validity and superiority of the MSFLA-LSSVM model further, three models (SFLA-LSSVM, LSSVM and BP neural network) are selected to make a comparison with the MSFLA-LSSVM model and study the prediction results of the same sample. The comparison results, the relative errors and the boxplot of relative errors are shown in Figure 4, Figure 5 and Figure 6, respectively.
Figure 4 shows the fitting degree between the CO2 emissions curve forecasted by different models and the actual curve of CO2 emissions. Figure 5 shows the relative errors of each model for CO2 emission forecasting. The boxplot in Figure 6 shows the minimum, first quartile, the median, third quartile and the maximum of the relative errors in each model. From Figure 5 and Figure 6, it can be seen that the MSFLA-LSSVM model has the minimum relative error, followed by the SFLA-LSSVM model and the relative error of BP is maximum.
In order to evaluate the forecasting effect of each model more objectively, M A P E (mean absolute percentage error), R M S E (root mean square error) and M A E (mean absolute error) are applied to compare the forecasting accuracy of each model. The calculation equations are shown as follows:
M A P E = 1 n i = 1 n | y ^ i y i y i | .
R M S E = 1 n i = 1 n ( y ^ i y i ) 2 .
M A E = 1 n i = 1 n | y ^ i y i | .
where y ^ i is the predicted value; y i is the actual value; n is the sample size.
The calculation results of M A P E , R M S E and M A E for different models are shown in Table 3.
From Table 2, it can be concluded that M A P E , R M S E and M A E of the MSFLA-LSSVM model is the smallest in all models, reaching 1.1165%, 0.7013, and 0.5425, respectively. Next is the SFLA-LSSVM model, the M A P E , R M S E and M A E are 3.7209%, 2.1616, and 1.8467, respectively. The M A P E , R M S E and M A E of BP model is the largest, reaching 7.5416%, 4.9479, and 3.9981 respectively. Besides this, the LSSVM model has a better forecasting effect than the BP model. In a word, the evaluation results of three indexes for four models are basically the same. And the prediction accuracy is ranked as follows: MSFLA-LSSVM > SFLA-LSSVM > LSSVM>BP. It can be concluded that the forecasting accuracy of MSFLA-LSSVM model is obviously higher than that of other models, and it is effective and practical for CO2 emissions forecasting.

3.3. CO2 Emissions Forecasting Based on GM(1,1) and MSFLA-LSSVM Model

Based on the collected historical data, we use GM(1,1) model to forecast CO2 emissions influencing factors of per capita GDP, urbanization rate, total coal consumption and total imports and exports from 2018 to 2025 in China. Since the GM(1,1) model has a higher forecasting accuracy for small samples, we choose the data from 2010 to 2016 as the forecasting sample instead of the entirety of the data.
Before using the GM(1,1) model, we first do the dimensionless processing of the original data according to the Equation (30).
y i = x i min ( x i ) max ( x i ) min ( x i ) .
Then, the GM(1,1) model is applied to forecast the per capita GDP, urbanization rate, total coal consumption and total imports and exports from 2018 to 2025 in China. The forecasting results are shown in Figure 7.
The predicted data above are used as the MSFLA-LSSVM model input to forecast the CO2 emissions from 2018 to 2025 in China. The specific forecasting results are shown in Figure 8 and Figure 9.
As you can see in Figure 8 and Figure 9, China’s CO2 emissions from 2018 to 2025 will take on a slow growth trend. China’s CO2 emissions increased rapidly from 2002 to 2012. However, after 2012, the CO2 emissions grew slowly and got into the platform period. With the strengthening of the policies on CO2 emissions reduction in China, it is foreseeable that CO2 emissions will be effectively controlled and the greenhouse effect will be relieved in the future.

4. Conclusions

In order to forecast CO2 emissions in China accurately, considering population, the CO2 emission forecasting model using GM(1,1) and LSSVM optimized by MSFLA (MSFLA-LSSVM) is put forward in this paper. First of all, considering population, per capita GDP, urbanization rate, industrial structure, energy consumption structure, energy intensity, total coal consumption, carbon emissions intensity, total imports and exports and other influencing factors of CO2 emissions, the main driving factors are screened according to the sorting of grey correlation degrees to realize feature dimension reduction. Then, the GM(1,1) model is used to forecast the main influencing factors of CO2 emissions. Finally, taking the forecasting value of the CO2 emissions influencing factors as the model input, the MSFLA-LSSVM model is adopted to forecast the CO2 emissions in China from 2018 to 2025.
According to the forecasting results of the CO2 emissions from 2018 to 2025 in China, it can be seen that China’s CO2 emissions will take on a slow growth trend in the next few years. With the strengthening of the policy on CO2 emissions reduction in China, China’s CO2 emissions will be effectively controlled in the future, and then the greenhouse effect will be relieved.

Acknowledgments

1. The paper is supported by Natural Science Foundation of China (Project No. 71471059). 2. The paper is supported by “the Fundamental Research Funds for the Central Universities (2018ZD14)”. 3. The paper is supported by “the 111 Project (B18021)”.

Author Contributions

In this research activity, all the authors were involved in the data collection and preprocessing phase, model constructing, empirical research, results analysis and discussion, and manuscript preparation. All authors have approved the submitted manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Safdarnejad, S.M.; Hedengren, J.D.; Baxter, L.L. Plant-level dynamic optimization of Cryogenic Carbon Capture with conventional and renewable power sources. Appl. Energy 2015, 149, 354–366. [Google Scholar] [CrossRef]
  2. Safdarnejad, S.M.; Hedengren, J.D.; Baxter, L.L. Dynamic optimization of a hybrid system of energy-storing cryogenic carbon capture and a baseline power generation unit. Appl. Energy 2016, 172, 66–79. [Google Scholar] [CrossRef]
  3. Gopan, A.; Kumfer, B.M.; Phillips, J.; Thimsen, D.; Smith, R.; Axelbaum, R.L. Process design and performance analysis of a Staged, Pressurized Oxy-Combustion (SPOC) power plant for carbon capture. Appl. Energy 2014, 125, 179–188. [Google Scholar] [CrossRef]
  4. Cohen, S.M.; Rochelle, G.T.; Webber, M.E. Optimizing post-combustion CO2, capture in response to volatile electricity prices. Int. J. Greenh. Gas Control 2012, 8, 180–195. [Google Scholar] [CrossRef]
  5. Ismail, Z.; Yahaya, A.; Shabri, A. Forecasting Gold Prices Using Multiple Linear Regression Method. Am. J. Appl. Sci. 2009, 6, 1509–1514. [Google Scholar] [CrossRef]
  6. Liu, D.; Bai, X.; Meng, J. Multiple linear regression forecasting model of total food yield in China based on forward selection variables method. J. Northeast Agric. Univ. 2010, 41, 24–128. [Google Scholar]
  7. Sehgal, V.; Tiwari, M.K.; Chatterjee, C. Wavelet Bootstrap Multiple Linear Regression Based Hybrid Modeling for Daily River Discharge Forecasting. Water Resour. Manag. 2014, 28, 2793–2811. [Google Scholar] [CrossRef]
  8. Ming, Z.; Liu, D.; Kaiyan, D.; Song, X.; Yulong, L.; Haiyan, Z. Pre-integrated Forecasting Method Research of Urban Electricity Consumption Based on System Dynamics and Econometric Model. J. Appl. Sci. 2013, 13, 4732–4737. [Google Scholar] [CrossRef]
  9. Dyson, B.; Chang, N.B. Forecasting municipal solid waste generation in a fast-growing urban region with system dynamics modeling. Waste Manag. 2005, 25, 669–679. [Google Scholar] [CrossRef]
  10. Venkatesan, A.K.; Ahmad, S.; Johnson, W.; Batista, J.R. Systems dynamic model to forecast salinity load to the Colorado River due to urbanization within the Las Vegas Valley. Sci. Total Environ. 2011, 409, 2616–2625. [Google Scholar] [CrossRef]
  11. Hsu, C.C.; Chen, C.Y. Applications of improved grey prediction model for power demand forecasting. Energy Convers. Manag. 2003, 44, 2241–2249. [Google Scholar] [CrossRef]
  12. Lee, Y.S.; Tong, L.I. Forecasting energy consumption using a grey model improved by incorporating genetic programming. Energy Convers. Manag. 2011, 52, 147–152. [Google Scholar] [CrossRef]
  13. Matjafri, M.Z.; Lim, H.S. Prediction models for CO2 emission in Malaysia using best subsets regression and multi-linear regression. SPIE Proc. 2015, 9638, 12. [Google Scholar] [CrossRef]
  14. Zhong, Q. Prediction of energy consumption and CO2 emission by system dynamics approach. Chin. J. Eco-Agric. 2008, 16, 1043–1047. [Google Scholar] [CrossRef]
  15. Lin, C.S.; Liou, F.M.; Huang, C.P. Grey forecasting model for CO2 emissions: A Taiwan study. Adv. Mater. Res. 2011, 88, 3816–3820. [Google Scholar] [CrossRef]
  16. Huang, D.Z.; Gong, R.X.; Gong, S. Prediction of Wind Power by Chaos and BP Artificial Neural Networks Approach Based on Genetic Algorithm. J. Electr. Eng. Technol. 2015, 10, 41–46. [Google Scholar] [CrossRef]
  17. Bin, H.; Zu, Y.X.; Zhang, C. A Forecasting Method of Short-Term Electric Power Load Based on BP Neural Network. Appl. Mech. Mater. 2014, 538, 247–250. [Google Scholar] [CrossRef]
  18. Narayanakumar, S.; Raja, K. A BP Artificial Neural Network Model for Earthquake Magnitude Prediction in Himalayas, India. Circuits Syst. 2016, 7, 3456–3468. [Google Scholar] [CrossRef]
  19. Yang, J.F.; Cheng, H.Z. Application of SVM to power system short-term load forecast. Electr. Power Autom. Equip. 2004, 24, 30–32. [Google Scholar]
  20. Hong, W.C. Electric load forecasting by support vector model. Appl. Math. Model. 2009, 33, 2444–2454. [Google Scholar] [CrossRef]
  21. Gallo, C.; Contò, F.; Fiore, M. A Neural Network Model for Forecasting CO2 Emission. AGRIS Econ. Inform. 2014, 6, 31. [Google Scholar]
  22. Zhou, J.G.; Zhang, X.G. Projections about Chinese CO2 emissions based on rough sets and gray support vector machine. China Environ. Sci. 2013, 33, 2157–2163. [Google Scholar]
  23. De Giorgi, M.G.; Malvoni, M.; Congedo, P.M. Comparison of strategies for multi-step ahead photovoltaic power forecasting models based on hybrid group method of data handling networks and least square support vector machine. Energy 2016, 107, 360–373. [Google Scholar] [CrossRef]
  24. Li, X.; Wang, X.; Zheng, Y.H.; Li, L.X.; Zhou, L.D.; Sheng, X.K. Short-Term Wind Power Forecasting Based on Least-Square Support Vector Machine (LSSVM). Appl. Mech. Mater. 2013, 448, 1825–1828. [Google Scholar] [CrossRef]
  25. Zhao, H.; Guo, S.; Zhao, H. Energy-Related CO2 Emissions Forecasting Using an Improved LSSVM Model Optimized by Whale Optimization Algorithm. Energies 2017, 10, 874. [Google Scholar] [CrossRef]
  26. Yang, W.; Li, Q. Survey on Particle Swarm Optimization Algorithm. Eng. Sci. 2004, 6, 87–94. [Google Scholar]
  27. Trelea, I.C. The particle swarm optimization algorithm: Convergence analysis and parameter selection. Inform. Process. Lett. 2016, 85, 317–325. [Google Scholar] [CrossRef]
  28. Zwickl, D.J. Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion. Diss. Theses Gradworks 2006, 3, 257–260. [Google Scholar]
  29. Deb, K. An efficient constraint handling method for genetic algorithms. Comput. Meth. Appl. Mech. Eng. 2000, 186, 311–338. [Google Scholar] [CrossRef]
  30. Karaboga, D.; Basturk, B. A powerful and efficient algorithm for numerical function optimization: Artificial bee colony (ABC) algorithm. J. Glob. Optim. 2007, 39, 459–471. [Google Scholar] [CrossRef]
  31. Karaboga, D.; Basturk, B. On the performance of artificial bee colony (ABC) algorithm. Appl. Soft Comput. 2008, 8, 687–697. [Google Scholar] [CrossRef]
  32. Akay, B.; Karaboga, D. A modified Artificial Bee Colony algorithm for real-parameter optimization. Inform. Sciences 2012, 192, 120–142. [Google Scholar] [CrossRef]
  33. Wang, B.Y.; Wang, D.Y.; Zhang, S.M. A Short-Term Distributed Load Forecasting Algorithm Based on Spark and IPPSO_LSSVM. Appl. Mech. Mater. 2015, 713–715, 1385–1388. [Google Scholar]
  34. Wen, T.; Tang, H.; Wang, Y.; Lin, C.Y.; Xiong, C.R. Landslide displacement prediction using the GA-LSSVM model and time series analysis: A case study of Three Gorges Reservoir, China. Nat. Hazard Earth Syst. Sci. 2017, 17, 2181–2198. [Google Scholar] [CrossRef]
  35. Mustaffa, Z.; Yusof, Y. Optimizing LSSVM using ABC for non-volatile financial prediction. Aust. J. Basic Appl. Sci. 2011, 7, 549. [Google Scholar]
  36. Eusuff, M.M.; Lansey, K.E. Optimization of Water Distribution Network Design Using the Shuffled Frog Leaping Algorithm. J. Water Resour. Plan. Manag. 2003, 129, 210–225. [Google Scholar] [CrossRef]
  37. Eusuff, M.; Lansey, K.; Pasha, F. Shuffled frog-leaping algorithm: A memetic meta-heuristic for discrete optimization. Eng. Optim. 2006, 38, 129–154. [Google Scholar] [CrossRef]
  38. Zhao, Z.; Xu, Q.; Jia, M. Improved shuffled frog leaping algorithm-based BP neural network and its application in bearing early fault diagnosis. Neural Comput. Appl. 2016, 27, 375–385. [Google Scholar] [CrossRef]
  39. Pan, Q.K.; Wang, L.; Gao, L.; Li, J. An effective shuffled frog-leaping algorithm for lot-streaming flow shop scheduling problem. Int. J. Adv. Manuf. Technol. 2011, 52, 699–713. [Google Scholar] [CrossRef]
Figure 1. The flow of the forecasting model.
Figure 1. The flow of the forecasting model.
Sustainability 10 00958 g001
Figure 2. The sample data of the CO2 emissions and the selected influencing factors.
Figure 2. The sample data of the CO2 emissions and the selected influencing factors.
Sustainability 10 00958 g002
Figure 3. The forecasting result and residual error.
Figure 3. The forecasting result and residual error.
Sustainability 10 00958 g003
Figure 4. Comparison of forecasting results.
Figure 4. Comparison of forecasting results.
Sustainability 10 00958 g004
Figure 5. The relative errors of four models: (a) modified shuffled frog leaping algorithm (MSFLA)–least squares support vector machine (LSSVM); (b) shuffled frog leaping algorithm (SFLA)-LSSVM; (c) LSSVM; (d) BP (back propagation).
Figure 5. The relative errors of four models: (a) modified shuffled frog leaping algorithm (MSFLA)–least squares support vector machine (LSSVM); (b) shuffled frog leaping algorithm (SFLA)-LSSVM; (c) LSSVM; (d) BP (back propagation).
Sustainability 10 00958 g005aSustainability 10 00958 g005b
Figure 6. The boxplot for four models.
Figure 6. The boxplot for four models.
Sustainability 10 00958 g006
Figure 7. The forecasting results of per capita GDP, urbanization rate, total coal consumption and total imports and exports.
Figure 7. The forecasting results of per capita GDP, urbanization rate, total coal consumption and total imports and exports.
Sustainability 10 00958 g007
Figure 8. The forecasting results of CO2 emissions.
Figure 8. The forecasting results of CO2 emissions.
Sustainability 10 00958 g008
Figure 9. The forecasting results of CO2 emissions from 2018 to 2025 in China.
Figure 9. The forecasting results of CO2 emissions from 2018 to 2025 in China.
Sustainability 10 00958 g009
Table 1. The calculation results of the grey relational degrees.
Table 1. The calculation results of the grey relational degrees.
Influencing FactorGrey Relational Degree
Population0.7752
Per capita GDP0.8218
Urbanization rate0.8516
Industrial structure0.7584
Energy consumption structure0.7513
Energy intensity0.6546
Total coal consumption0.9631
Carbon emissions intensity0.6517
Total imports and exports0.8116
Table 2. The relative error.
Table 2. The relative error.
YearActual Value
(100 Million Tons)
Forecasting Value
(100 Million Tons)
RE (%)
199024.4225.76525.4901
199125.6626.18832.0751
199226.9026.78780.4341
199328.7927.76063.5654
199430.5830.43190.4920
199533.2033.56661.0954
199634.6334.64150.0306
199734.7034.07941.7745
199833.2433.31700.2214
199933.1833.66071.4469
200034.0534.14770.2816
200134.8835.17700.8640
200238.5038.02181.2489
200345.4045.16820.5197
200452.3452.16430.3270
200558.9760.30212.2597
200665.2964.44341.3010
200770.3170.33980.0452
200875.5375.22290.4075
200980.0179.65720.4411
201087.7688.68341.0517
201197.3497.33470.0007
2012100.2999.88690.3977
2013102.58101.36351.1859
2014102.92101.90120.9892
2015101.38102.14960.7591
2016100.87102.32121.4387
Table 3. The calculation results.
Table 3. The calculation results.
ModelMAPE (%)RMSE (100 Million Tons)MAE (100 Million Tons)
MSFLA-LSSVM1.11650.70130.5425
SFLA-LSSVM3.72092.16161.8467
LSSVM5.97403.15152.8041
BP7.54164.94793.9981

Share and Cite

MDPI and ACS Style

Dai, S.; Niu, D.; Han, Y. Forecasting of Energy-Related CO2 Emissions in China Based on GM(1,1) and Least Squares Support Vector Machine Optimized by Modified Shuffled Frog Leaping Algorithm for Sustainability. Sustainability 2018, 10, 958. https://doi.org/10.3390/su10040958

AMA Style

Dai S, Niu D, Han Y. Forecasting of Energy-Related CO2 Emissions in China Based on GM(1,1) and Least Squares Support Vector Machine Optimized by Modified Shuffled Frog Leaping Algorithm for Sustainability. Sustainability. 2018; 10(4):958. https://doi.org/10.3390/su10040958

Chicago/Turabian Style

Dai, Shuyu, Dongxiao Niu, and Yaru Han. 2018. "Forecasting of Energy-Related CO2 Emissions in China Based on GM(1,1) and Least Squares Support Vector Machine Optimized by Modified Shuffled Frog Leaping Algorithm for Sustainability" Sustainability 10, no. 4: 958. https://doi.org/10.3390/su10040958

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop