Next Article in Journal
Climate and Biodiversity Credentials for Australian Grass-Fed Beef: A Review of Standards, Certification and Assurance Schemes
Previous Article in Journal
Optimal Allocation of Photovoltaic Distributed Generations in Radial Distribution Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Coupling LSTM and CNN Neural Networks for Accurate Carbon Emission Prediction in 30 Chinese Provinces

1
College of Science, North China University of Technology, Beijing 100144, China
2
School of Economics and Management, North China University of Technology, Beijing 100144, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Sustainability 2023, 15(18), 13934; https://doi.org/10.3390/su151813934
Submission received: 18 August 2023 / Revised: 9 September 2023 / Accepted: 12 September 2023 / Published: 20 September 2023

Abstract

:
Global warming is a major environmental issue facing humanity, and the resulting climate change has severely affected the environment and daily lives of people. China attaches great importance to and actively responds to climate change issues. In order to achieve the “dual carbon” goal, it is necessary to clearly define the emission reduction path and scientifically predict future carbon emissions, which is the basis for setting emission reduction targets. To ensure the accuracy of data, this study applies the emission coefficient method to calculate the carbon emissions from the energy consumption in 30 provinces, regions, and cities in China from 1997 to 2021. Considering the spatial correlation between different regions in China, we propose a new machine learning prediction model that incorporates spatial weighting, namely, an LSTM-CNN combination model with spatial weighting. The spatial weighting explains the spatial correlation and the combined model is used to analyze the carbon emissions in the 30 provinces, regions, and cities of China from 2022 to 2035 under different scenarios. The results show that the LSTM-CNN combination model with four convolutional layers performs the best. Compared with other models, this model has the best predictive performance, with an MAE of 8.0169, an RMSE of 11.1505, and an R 2 of 0.9661 on the test set. Based on different scenario predictions, it is found that most cities can achieve carbon peaking before 2030. Some cities need to adjust their development rates based on their specific circumstances in order to achieve carbon peaking as early as possible. This study provides a research direction for deep learning time series forecasting and proposes a new predictive method for carbon emission forecasting.

1. Introduction

Global climate change is a significant environmental issue facing humanity, and the resulting climate change has had a severe impact on the environment and daily lives of people. Research has found that the substantial increase in emissions of greenhouse gases, such as carbon dioxide, caused by human activities is a major factor in global warming [1]. In order to mitigate the trend of global warming, the United Nations has successively formulated the United Nations Framework Convention on Climate Change (1992) and the Kyoto Protocol (1997), which provide legally binding quantitative carbon reduction and emission limitation targets for the contracting parties. The Paris Agreement (2015) made arrangements for global actions to address climate change after 2020.
China is a developing country with rapid urbanization and industrialization, and it is also the world’s largest manufacturing nation, known as the “world’s factory”. Currently, China is still in a phase of rapid economic development, which relies heavily on energy consumption. In the coming years, energy consumption and carbon emissions are expected to continue to increase. Resolving the contradiction between economic development and rising carbon emissions is an urgent issue that the Chinese government needs to address. In 2020, China proposed the “dual carbon” target, which aims to peak carbon dioxide emissions before 2030 and strive to achieve carbon neutrality by 2060. Achieving carbon peaking before 2030 is a major challenge that China is currently facing. With its vast territory, China has significant regional disparities in resource endowment, population size, economic development level, and industrial structure, leading to variations in energy structure across different regions. The levels of development and the structure of carbon emissions also differ across regions, requiring tailored and reasonable carbon reduction policies. According to the “2006 IPCC Guidelines for National Greenhouse Gas Inventories” and the “Guidelines for Provincial-Level Greenhouse Gas Inventories (Trial)” compiled by China in 2011, energy activities are usually the most important sector in greenhouse gas emission inventories. In developed countries, energy activities generally contribute over 90% of carbon dioxide emissions and 75% of total greenhouse gas emissions. Energy activities are also significant sources of greenhouse gas emissions in China.
To achieve China’s “dual carbon” goal, it is essential to clarify the emission reduction pathway and scientifically predict future carbon emissions. This article first applies the emission coefficient method to calculate the carbon emissions from energy consumption in 30 provinces and municipalities in China from 1997 to 2021. Considering the spatial correlation among different regions in China, we propose a new machine learning prediction model that incorporates spatial weights, namely, an LSTM-CNN combination model with spatial weights. We conduct a data ablation experiment using historical data (from 1997 to 2021) to test the predictive capability of the model. Then, we use the aforementioned combination model to analyze the scenario of carbon emissions in 30 provinces and municipalities in China from 2022 to 2035 and provide differentiated carbon emission reduction policy recommendations for different regions.
In this paper, the combined prediction model of LSTM-CNN is constructed to predict China’s carbon emission, with innovation in the following three main areas:
(1)
The existing literature generally predicts a certain city or a certain region, but this paper studies the prediction in 30 provinces and cities in China.
(2)
In the existing literature, the spatial autocorrelation between regions is not considered in the prediction of regional carbon emission. In this paper, the spatial autocorrelation weights are introduced to the input variables.
(3)
In a large number of papers, machine learning models are used to predict the carbon emissions inside samples. On this basis, this paper uses the trained models to predict the carbon peaks under different scenarios.

2. Related Works

The existing research focuses on predicting carbon emissions in individual provinces, cities, and regions. Zhang et al. [2] proposed a machine learning method using BPNN to predict carbon emissions in urban blocks. Yan et al. [3] used a convolutional neural network model to predict carbon emissions in the preliminary design stage of residential buildings in Beijing, which could effectively reflect changes in carbon emissions. The authors believed that predictions for the midterm and late stages should also be included to compare the generalization of the model. Su et al. [4] predicted carbon emissions in large-scale green buildings in Dalian using data-driven models to explore energy consumption patterns in commercial buildings. Combination models are widely favored to improve prediction accuracy. Zhang et al. [5] used ANN-CA and IWOA-LSTM models to explore the correlation between carbon emissions and surface temperature in Wuhan. Chen et al. [6] studied greenhouse gas emissions on the Chengdu Metro Line 18 by establishing a WOA-DELM model. The authors suggested reporting the greenhouse gas situation on other lines to enhance the model’s generalizability. Wang et al. [7] proposed a method that combined machine learning with DDF to estimate the carbon emission efficiency and reduction potential in Chinese cities. Sun et al. [8] fitted carbon emission data for 281 cities in China using the sparrow optimization neural network algorithm. Yan et al. [3] proposed a real-time actionable carbon emission prediction method based on a CNN and predicted a typical northern case, Beijing. The authors suggested extending carbon emission predictions to all provinces and cities nationwide for internal comparisons.
In addition to predicting individual provinces and regions, carbon emission research in a specific spatial area has been effective in recent years. Zhao et al. [9] proposed a combined model of SSA-LSTM and applied it to carbon emission forecasting in the Yellow River basin. The results showed an increasing trend in carbon emissions in the Yellow River basin with significant interprovincial differences. The authors suggested that in addition to predicting this particular region, factors such as spatial autocorrelation among variables within the region should also be considered. Qin et al. [10] used machine learning methods such as panel data and random forest to determine the factors influencing CO2 emissions and found that areas with higher levels of economic development in the eastern region had the highest CO2 emissions. The authors suggested that causal inference methods could be introduced to validate the causal relationship between influencing factors and carbon emissions in order to identify more significant influencing factors. Aryai et al. [11] proposed a PSO-ERT regression model for predicting the emission intensity in Australia’s regional electricity market. Sarwar et al. [12] selected a suitable model for predicting electricity prices and carbon emissions in the eastern region of Saudi Arabia. Xu et al. [13] conducted a study on carbon sequestration in mangroves based on the coastal areas south of the Yangtze River in China, using the RF and gradient boost models. The authors suggested that in addition to predicting a specific region, other regions should also be considered in order to compare the forecast results and examine the generalizability of the models.
Machine learning methods have shown good performance in energy and carbon emission prediction and analysis, both for in-sample fitting and out-of-sample prediction. Han et al. [14] proposed an ISM-ELM model for energy and carbon emission prediction and analysis. Anthony et al. [15] introduced a tool called Carbontracker for tracking and predicting the energy and carbon footprints of training deep learning models. This tool reports the energy and carbon footprints of model development and training together with performance indicators. In addition to optimizing the models, we can also preprocess the raw data to achieve better fitting and prediction results. Han et al. [14] proposed an improved integrated structure model based extreme learning machine (ISM-ELM) for energy and carbon emission analysis and prediction. This model not only denoised and reduced the dimensionality of the raw data, reducing training time and errors of ELM prediction models, but also exhibited a robustness and effectiveness in terms of model accuracy and training time. Furthermore, a research trend was sparked by comparing the results of various models and identifying the best model. To compare the effects of multiple machine learning models, Zhao et al. [16] conducted a comparative analysis of the main carbon emission prediction models, including backpropagation neural network, support vector machine, long short-term memory neural network, random forest, and extreme learning machine. Based on this analysis, they proposed research ideas for future machine learning-based carbon emission prediction models. Considering the characteristics of carbon emission data, Yang et al. [17] proposed a combined carbon emission ensemble prediction model that integrated a singular spectrum analysis and a variational mode decomposition. Nadirgil et al. [18] developed 48 hybrid machine learning models using a complete ensemble empirical mode decomposition and compared their performance. The authors suggested that carbon peak prediction scenarios under different conditions should be introduced in the out-of-sample prediction to compare the peak time between different regions in the future. Zhang et al. [19] proposed a comprehensive carbon emission prediction model called SSA-FAGM-SVR for predicting the carbon emissions of the G20 countries in the next 10 years.
The reason why this study adopted the construction of an LSTM-CNN model is that the CNN-LSTM model has shown significant performance in fitting and predicting time series data, especially in areas such as gold prices [20], stock prices [21], and residential energy consumption [22]. Therefore, compared to some classical time series models, the LSTM-CNN model not only has better in-sample fitting performance but also a higher prediction accuracy for out-of-sample data. The choice of this combined model allows us to take advantage of the LSTM’s ability to fit and predict time series data and the CNN’s ability to extract more comprehensive features from the data.

3. Methodology

3.1. STIRPAT

The STIRPAT model [23] is an extension of the IPAT model by Dietz and Rosa, designed as a scalable and stochastic environmental impact assessment model. Its standard form is shown in Equation (1):
I = a P b A c T d e ,
In the equation, I, P, A, and T represent the environmental impact, population, affluence, and technological level, respectively; a is a model coefficient; b, c, and d are estimated coefficients; and e is the error term.
Taking the logarithm of both sides of the equation:
ln I = ln a + b ln P + c ln A + d ln T + ln e ,

3.2. Global Moran’s Index

The global Moran index primarily reflects the overall spatial autocorrelation of a certain indicator in different regions. The specific calculation formula is shown in Equation (3):
I m = n i = 1 n j = 1 n w i j ( x i X ¯ ) ( x j X ¯ ) i = 1 n j = 1 n w i j i = 1 n ( x i X ¯ ) ,
In the equation, I m is the global Moran index, n represents the total number of provinces or regions, x i , x j represent the carbon emissions of individual provinces i or regions j, X ¯ is the average carbon emissions of all provinces or regions, w i j represents the element (i,j) of the spatial weight matrix W, and the spatial weight matrix W is based on the Rook distance matrix derived from adjacency relationships. The significance of the value I in the equation is assessed using a value Z test:
Z = I m E ( I m ) V a r ( I m ) ,
where E ( I m ) represents the expected value of Moran’s I, and V a r ( I m ) represents the variance of Moran’s I.

3.3. Spatial Durbin Model

The spatial Durbin model is an extended form of the spatial lag model and the spatial lag of the independent variable model, simultaneously considering the autocorrelation of both the dependent and independent variables.
y = λ W 1 y + X β + W 2 X δ + ϵ ,
In the equation, W 1 , W 2 represents the spatial weight matrix, W 1 y is the spatial lag of the dependent variable y, W 2 X is the spatial lag of the independent variable X, λ is the spatial autocorrelation coefficient, and ϵ is the error term. The equation can be transformed into:
y = ( E λ W 1 ) 1 ( X β + W 2 X δ + ϵ ) ,
In the equation, E represents the identity matrix.

3.4. CNN

A CNN [20] has a strong capability for processing spatial grid data, primarily composed of convolutional layers, pooling layers, and fully connected layers. The convolutional layers are mainly responsible for feature extraction; the pooling layers perform dimensionality reduction and sampling without compromising the recognition results; and the fully connected layers connect all neurons and output results through hidden layers. The specific flowchart of a simple CNN is shown in Figure 1.

3.5. LSTM

LSTM [24], which stands for long short-term memory, is a specialized form of recurrent neural networks (RNNs). It is primarily used to address the issues of vanishing or exploding gradients that occur during training long time sequences. Compared to traditional RNNs, an LSTM network provides better analytical results for long time sequence dependencies. The internal structure of the LSTM network is illustrated in Figure 2.

3.6. Gaussian Noise

In order to prevent overfitting during the model training process, this study adopted a noise injection approach on the original data for adversarial training. This was achieved by using the random.gauss ( μ , σ ) function from the random library, where the parameter mu represents the mean of the Gaussian distribution and the parameter sigma represents the standard deviation of the Gaussian distribution.

3.7. Model Construction

Taking into account the spatial correlation of carbon emissions, this study incorporated spatial weights into the model. LSTM performs well in handling time series data, while a CNN is effective in feature extraction from spatial grid data. A combined LSTM-CNN model was constructed based on LSTM and CNN models to predict carbon emission intensity at the provincial level in China. The model structure is illustrated in Figure 3.
The model consists of an input layer, a feature extraction layer, and an output layer. In the input layer, data are processed with spatial weights and transformed into tensors suitable for neural network training. The feature extraction layer includes LSTM and convolutional layers, with the convolutional layer comprising a one-dimensional convolutional neural network layer, an activation layer, and a pooling layer. The output layer consists of a fully connected layer that outputs the prediction results.

4. Data Source and Characteristics Analysis

4.1. Data Source

The data utilized in this study primarily included energy data, carbon emission factors, and socio-economic data. Specifically, energy data consisted of energy consumption; carbon emission factors encompassed net calorific value, carbon content per unit of heat value, and carbon oxidation rate; socio-economic data mainly comprised regional GDP, total population, energy intensity, industrial structure, and urbanization rate for various provinces in China.
Energy data were sourced from the “China Energy Statistical Yearbook” (1997–2021), and missing data were filled using interpolation methods.
Carbon emission factors were drawn from the “Guidelines for Compilation of Provincial Greenhouse Gas Inventories (Trial)” (2011), “China Energy Statistical Yearbook”, “2006 IPCC National Greenhouse Gas Inventories Guidelines”, and “Data Descriptor: China CO2 emission accounts 1997–2015”. The calculation methods and data sources for the socio-economic data are outlined in Table 1.

4.2. Carbon Emission Accounting

This study employed the emission coefficient method proposed by the IPCC to estimate carbon emissions. Due to the lack of publicly available data or significant missing data on energy consumption for the Tibet Autonomous Region, Hong Kong Special Administrative Region, Macau Special Administrative Region, and Taiwan Province, this study calculated carbon emissions for the 30 provinces and municipalities in China excluding the aforementioned regions from 1997 to 2021. The reference method outlined in the “2006 IPCC National Greenhouse Gas Inventories Guidelines” is based on the apparent consumption of various fossil fuels, net calorific value, carbon content per unit of heat value for different fuel types, and carbon oxidation rate in major equipment for burning various fuels. It comprehensively calculates these parameters and deducts non-energy-related carbon sequestration of fossil fuels to derive the estimation. The specific calculation formula is shown in Equation (7):
C E = i ( A D i · N C V i · C C i · C O F i · 44 12 ) ,
In the equation, CE represents the calculated carbon emissions; A D i stands for the final consumption of fossil fuels i; N C V i denotes the net calorific value of fossil fuels i; C C i represents the carbon content per unit of heat value for fossil fuels i; C O F i signifies the carbon oxidation rate of fossil fuels i; and 44 12 represents the ratio of molecular weights between carbon dioxide and carbon.

4.3. Spatial Self-Correlation Analysis

Table 2 shows the global Moran index of carbon emissions of 30 provinces, autonomous regions, and municipalities in China from 1997 to 2020.
Except for the year 1997 and the years 2016–2020, the global Moran I index for the other years passed the significance test, indicating a spatial autocorrelation in carbon emissions among various provinces in China. Moreover, Moran’s I index values were all greater than zero, suggesting a positive spatial correlation in carbon emissions across the country. Provinces with higher (lower) carbon emissions are relatively adjacent to those with higher (lower) carbon emissions.
Given the spatial autocorrelation in carbon emissions among different provinces in China and in combination with the STIRPAT model, the following model was established:
y t = n = t 3 t 1 ( W y n + λ W y n + ln Z n , 1 β 1 + ln Z n , 2 β 2 + ln Z n , 3 β 3 + ln Z n , 4 β 4 + ln Z n , 5 β 5 + W ln Z n , 1 δ 1 + W ln Z n , 2 δ 2 + W ln Z n , 3 δ 3 + W ln Z n , 4 δ 4 + W ln Z n , 5 δ 5 + ϵ ) ,
In the equation, λ represents the spatial autocorrelation coefficient, W stands for the spatial weight matrix, y t represents carbon emissions in the ith year, Z n , 1 is the population at the end of year n after feature extraction by a neural network for 30 provinces in China, Z n , 2 is the per capita GDP for 30 provinces in China in year n after feature extraction by a neural network, Z n , 3 is the energy intensity for 30 provinces in China in year n after feature extraction by a neural network, Z n , 4 is the industrial structure for 30 provinces in China in year n after feature extraction by a neural network, Z n , 5 is the urbanization rate for 30 provinces in China in year n after feature extraction by a neural network, and β 1 , β 2 , β 3 , β 4 , β 5 , δ 1 , δ 2 , δ 3 , δ 4 , and δ 5 represent the regression coefficients of respective variables.

4.4. Carbon Emission Trend Analysis of Provinces and Cities in China

The 30 provinces and municipalities in China are divided into three regions: North, Central, and South, each containing 10 provinces and municipalities. The carbon emissions’ variations in the northern provinces and municipalities of China from 1997 to 2021 are shown in Figure 4; the carbon emissions’ variations in the central region are shown in Figure 5; and the carbon emissions’ variations in the southern region are shown in Figure 6. In Figure 4, “Shan1Xi” represents Shanxi Province, and “Shan3Xi” represents Shaanxi Province. From Figure 4, it can be observed that Hebei Province has the highest carbon emissions. From Figure 5, it can be seen that Shandong Province has the highest carbon emissions. From Figure 6, it can be observed that Guangdong Province has the highest carbon emissions.

4.5. Variable Correlation Analysis

In order to explore the linear relationship between carbon emissions y and the selected five original independent variables X 1 X 5 , the calculation of the correlation coefficients between variables was adopted to reflect the degree of linear correlation between variables, as shown in Figure 7. From the perspective of linear relationships with carbon emissions y, a strong linear relationship was observed between carbon emissions and the year-end population X 1 , reaching a correlation of 0.646. However, the linear relationships with per capita GDP X 2 , industrial structure X 4 and urbanization rate X 5 were relatively weak, with correlations of 0.339, 0.297, and 0.235, respectively. Additionally, there existed a weak negative correlation between carbon emissions y and energy intensity X 3 , with a correlation coefficient of −0.311.

5. Empirical Analysis

5.1. Datasets and Preprocessing

The experimental data consisted of previously calculated carbon emissions and their relevant influencing factors, including carbon emissions, year-end population, per capita GDP, energy intensity, industrial structure, and urbanization rate. The dataset underwent preprocessing by adding Gaussian noise with a mean of zero and a standard deviation of 0.05. The data were then divided into windows of a time step of three, resulting in 630 sets of data. The first 70% of the data were used as the training set, while the remaining 30% were designated as the testing set, thus creating the initial dataset.

5.2. Parameter Setting

The experimental parameter settings were as follows: In the model parameters, a unidirectional LSTM network was used with a hidden layer output dimension of 32 and a stack of three layers. In the CNN convolutional layers, the linear activation function used was ReLU, and the convolutional kernel size was three. In the fully connected layers, the linear activation function used was also ReLU. The input and output dimensions of each convolutional layer and fully connected layer are shown in the following table, Table 3.
For the setting of hyperparameters in the model, we refer to the relevant conclusions and pre-experimental rules from Bengio Y et al. [25], and through continuous optimization adjustments, the final settings were as follows: the loss function was set to MSE (mean squared error); the optimization algorithm was AdamW, with a learning rate of 0.0001 and weight decay of 0.001; the number of iterations was set to 50,000.

5.3. Evaluation Criteria

In this paper, three types of metrics—mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R-squared)—were chosen as evaluation criteria to measure errors. The calculation methods are shown in Equations (9)–(11):
M A E = 1 n i = 1 n y i ^ y i ,
R M S E = 1 n i = 1 n ( y i y i ^ ) 2 ,
R 2 = i = 1 n ( y i ^ y ¯ ) 2 i = 1 n ( y i y ¯ ) 2 ,
In the equation, y i ^ represents the predicted value, y i represents the true value, and y ¯ represents the mean of y i . The ranges of MAE and RMSE are [ 0 , + ) , where smaller values indicate smaller errors and a better accuracy of the model. The range of R-squared is [ 0 , 1 ] , where larger values indicate smaller errors and a better accuracy of the model.

5.4. Results of the Analysis

5.4.1. The Relationship between the Number of Convolution Layers and the Training Results

In order to explore the relationship between the training performance of LSTM-CNN and different numbers of convolutional layers, we conducted 50,000 iterations of training on convolutional networks ranging from 1 to 10 layers. The results are shown in Table 4. To exclude the interference of other influencing factors on the model training results, we ensured that the hyperparameters of these 10 models were set according to Section 5.2. In addition, our models were experimentally conducted in the environment of Google Colab using a Tesla T4.
From the table, it can be seen that in the training set, when the number of convolutional layers reaches two, the MAE reaches its minimum value of 9.3253, and the RMSE reaches its minimum value of 14.3121, with a maximum value of 0.9846. In the test set, when the number of convolutional layers reaches four, the MAE reaches its minimum value of 8.0169, and the RMSE reaches its minimum value of 11.1505, with a maximum value of 0.9661. Therefore, it can be considered that the LSTM-CNN model performs optimally when the number of convolutional layers reaches four.

5.4.2. Ablation Experiment

To verify the relative effectiveness of each part of the LSTM-CNN model, we conducted ablation experiments and the results are shown in Table 5.
From the table, it can be seen that for the model without spatial weights, in the training set, the LSTM model achieved a minimum MAE of 10.4812, and the CNN-LSTM model achieved a minimum RMSE of 15.1564, with a maximum value of 0.9827. In the testing set, the CNN-LSTM model achieved a minimum MAE of 8.3908, and the RMSE reached a minimum value of 11.6647, with a maximum value of 0.9629. Figure 8 shows that the CNN-LSTM model had the best fitting performance for the real data without considering spatial weights in both the training and testing sets. This indicates that in the model without weights, the CNN-LSTM model had the best fitting effect on the data.
For the model with spatial weights, in the training set, the CNN-LSTM model achieved a minimum MAE of 8.3815 and the RMSE reached a minimum value of 12.3646, with a maximum value of 0.9885. In the testing set, the LSTM-CNN model achieved the minimum MAE of 7.8500 and the RMSE reached the minimum value of 10.6967, with a maximum value of 0.9688. Figure 9 shows that the LSTM-CNN model had the best fitting performance for the real data considering spatial weights in both the training and testing sets. By comparing the results, it can be concluded that after adding spatial weights, the model’s performance was improved in both the testing and training sets. Therefore, this ablation experiment verified that each part of the model had its own contribution to the model fitting.

6. Scenario Analysis

6.1. Scenario Setting

Based on the “14th Five-Year Plan and Long-Term Goals for 2035 of the People’s Republic of China’s National Economic and Social Development” and the relevant policy documents of “14th Five-Year Plans” for various provinces and regions, this paper sets the growth rates of year-end population, per capita GDP, energy intensity, industrial structure, and urbanization rate, with three levels of rates: low, medium, and high. Among them, the medium rate was set to meet the minimum requirements of government documents for each indicator. The growth rate settings for each indicator are provided in Table A1. Four carbon emission scenarios were established: the baseline scenario, low-economic-development scenario, and high-economic-development scenario, as detailed in Table 6.
Baseline scenario: The year-end population was set based on the annual average growth rate in different periods for each province and city, while the rest of the influencing factors were determined according to the relevant policy documents of each province’s “14th Five-Year Plan”, meeting the minimum development requirements. This scenario can to a certain extent represent the development situation of China’s 30 provinces and regions from 2021 to 2035.
Industrial structure optimizing scenario: The industrial structure was set to develop at a high rate, while the change rates of the other influencing factors remained the same as in the baseline scenario. This scenario aimed to explore the impact of optimizing the industrial structure further under the existing economic development model on carbon emissions for different regions.
Extensive scenario: The year-end population was the same as in the baseline scenario, while per capita GDP and urbanization rate were set to develop at a high rate, and energy intensity and industrial structure were set to develop at a low rate. This scenario aimed to study the impact of a strong economic development with less focus on carbon emissions in various regions on carbon emissions.
Emission reduction scenario: Energy intensity developed at a high rate, while the change rates of other influencing factors were the same as in the baseline scenario. This model aimed to study the impact of intensifying energy policy implementation and actively implementing energy conservation and emission reduction measures under the current economic development model in various regions on carbon emissions.

6.2. Carbon Emission Prediction

Using the LSTM-CNN model constructed earlier, a scenario analysis of carbon emissions at the provincial level in China was conducted. The predicted results are shown in Figure 10, and specific peak time details are provided in Table 7.
Different provinces have varying development speeds, technological levels, and development strategies, leading to differences in the timing of carbon peaking under different scenarios. It can be observed that, before 2035, Shanxi Province does not reach its peak in both the baseline and extensive development scenarios. Zhejiang Province does not reach its peak in the extensive development scenario. Fujian Province and Guizhou Province do not reach their peaks in the baseline scenario. Qinghai Province does not reach its peak in both the baseline and optimized industrial structure scenarios. Other provinces reach their peaks under different development scenarios.
Comparing the carbon peaking times of various cities under different scenarios, Beijing reaches its peak earliest in both the extensive development and energy-saving and emission reduction scenarios, achieving carbon peaking in 2025. Tianjin, Fujian, Guizhou, and Qinghai provinces reach their peaks the earliest in the extensive development scenario, achieving carbon peaking in 2025, 2028, 2026, and 2029, respectively. Hebei and Liaoning provinces reach their peaks in the same years under all four scenarios, with Hebei achieving carbon peaking in 2023 and Liaoning in 2024. Shanxi, Anhui, Hubei, and Ningxia provinces reach their peaks the earliest in the energy-saving and emission reduction scenario, achieving carbon peaking in 2024. Inner Mongolia Autonomous Region and Chongqing reach their peaks the earliest in the baseline, extensive development, and energy-saving and emission reduction scenarios, with Inner Mongolia achieving carbon peaking in 2024 and Chongqing in 2029. Jilin and Hainan provinces reach their peaks the earliest in the optimized industrial structure, extensive development, and energy-saving and emission reduction scenarios, with Jilin achieving carbon peaking in 2025 and Hainan in 2022.
Heilongjiang, Jiangsu, Zhejiang, and Xinjiang reach their peaks the earliest in the baseline and optimized industrial structure scenarios, achieving carbon peaking in 2026, 2023, 2024, and 2024, respectively. Shanghai, Jiangxi, Henan, Guangdong, and Gansu provinces reach their peaks the earliest in the optimized industrial structure scenario, achieving carbon peaking in 2024, 2029, 2024, 2024, and 2028, respectively. Shandong, Guangxi Zhuang Autonomous Region, Sichuan, and Shaanxi provinces reach their peaks the earliest in the baseline scenario, achieving carbon peaking in 2023, 2026, 2023, and 2027, respectively. Hunan province reaches its peaks the earliest in the baseline and extensive development scenarios, achieving carbon peaking in 2024. Yunnan province reaches its peaks the earliest in the optimized industrial structure and extensive development scenarios, achieving carbon peaking in 2032.
Based on the four development scenarios, an analysis was conducted on the 30 provinces and regions in China. If there were scenarios where the same city reached its peak at the same time, priority was given to economic development speed. In the baseline scenario, Shandong Province, Guangxi Zhuang Autonomous Region, Sichuan Province, and Shaanxi Province reach their peaks the earliest. This indicates that the effects of optimizing industrial structure, extensive development, and energy-saving and emission reduction scenarios on carbon reduction are not significant in these cities. Among them, Shandong Province and Sichuan Province can achieve carbon peaking before 2025 while maintaining their current development speed. Guangxi Zhuang Autonomous Region and Shaanxi Province can achieve carbon peaking before 2030 while maintaining their current development speed.
In the scenario of optimizing industrial structure, Heilongjiang Province, Shanghai, Jiangsu Province, Zhejiang Province, Jiangxi Province, Henan Province, Guangdong Province, Gansu Province, and Xinjiang Uygur Autonomous Region reach their peaks the earliest. This indicates that these cities are more suitable for the different development rates set in the industrial structure optimization scenario. Among them, Shanghai, Jiangsu Province, Zhejiang Province, Henan Province, Guangdong Province, and Xinjiang Uygur Autonomous Region have a rapid development of low-carbon technologies. By maintaining the development speed of the industrial structure optimization scenario, they can achieve carbon peaking before 2025. On the other hand, Heilongjiang Province, Jiangxi Province, and Gansu Province have a slower development of low-carbon technologies. By maintaining the development speed of the industrial structure optimization scenario, they can achieve carbon peaking before 2030.
In the extensive scenario, Beijing, Tianjin, Hebei Province, Inner Mongolia Autonomous Region, Liaoning Province, Jilin Province, Fujian Province, Hunan Province, Hainan Province, Chongqing, Guizhou Province, Yunnan Province, and Qinghai Province achieve carbon peaking the earliest. Among them, Beijing, Tianjin, Jilin Province, and Hunan Province have a rapid development of low-carbon technologies. By following the different development rates set in the extensive scenario, they can not only achieve carbon peaking before 2025 but also ensure maximum economic development. Hainan Province has a distinct industrial structure with a lower energy consumption, resulting in lower carbon emissions. Following the different development rates set in the extensive scenario, Hainan Province can achieve carbon peaking in 2022 while maximizing economic development. Hebei Province, Inner Mongolia Autonomous Region, and Liaoning Province have similar or identical carbon peaking times across the four scenarios. Due to differing development strategies and economic structures from other cities, they are major coal or industrial-heavy provinces, with closely intertwined economic development and carbon emissions. Their slow transformation of industrial structure is notable.
Fujian Province, Guizhou Province, and Qinghai Province have a slower development of low-carbon technologies and relatively lower economic development levels. They primarily rely on high-carbon energy products such as coal. To achieve the different development rates set in the extensive scenario, these cities must increase the use of clean energy, reduce the proportion of coal usage, and promote the development of low-carbon technologies to ensure carbon peaking before 2030. The carbon emissions of Chongqing are significantly influenced by surrounding cities. Dominated by the manufacturing industry and characterized by a large urban–rural development gap, Chongqing’s carbon emissions show an increasing trend. Following the different development rates set in the extensive scenario, Chongqing can achieve carbon peaking in 2029. Although Yunnan Province reaches carbon peaking the earliest in the extensive scenario, it cannot achieve carbon peaking before 2030. It is evident that Yunnan Province also reaches carbon peaking earliest in the scenario where industrial structure is optimized. To advance the carbon peaking time, Yunnan Province could intensify efforts in industrial structural transformation and accelerate the development of low-carbon technologies.
In the energy-saving and emission reduction scenario, Shanxi Province, Anhui Province, Hubei Province, and Ningxia Province achieve carbon peaking the earliest. This indicates that the aforementioned cities are more suitable for promoting low-carbon lifestyles, increasing the proportion of clean-energy usage, and accelerating the development of low-carbon technologies. By following the different development rates set in the energy-saving and emission reduction scenario, maintaining the pace of development in this scenario, these cities can achieve carbon peaking before 2025.

7. Conclusions and Future Research

7.1. Conclusions

This article calculated the carbon emissions from the energy consumption in 30 provinces, regions, and municipalities in China from 1997 to 2021 and analyzed the trends in carbon emissions in each province. From 1997 to 2021, China’s carbon emissions from energy consumption showed an overall upward trend, increasing from 2051.507 million tons to 5738.955 million tons. Among different provinces, Hebei, Shandong, Jiangsu, Liaoning, Guangdong, and Inner Mongolia Autonomous Region had higher carbon emissions, while Hainan, Qinghai, Ningxia, Beijing, and Gansu had relatively lower carbon emissions. The trends in carbon emissions varied among different provinces.
Moran’s index was used to analyze the spatial autocorrelation of carbon emissions among provinces. There was spatial autocorrelation in carbon emissions among provinces in China, and Moran’s index values were all greater than zero, indicating a positive spatial correlation in carbon emissions. Provinces with higher (lower) carbon emissions were relatively adjacent to provinces with higher (lower) carbon emissions.
A new machine learning prediction model incorporating spatial weights, called the LSTM-CNN model with spatial weights, was proposed in this article and applied to carbon emission forecasting in 30 provinces, regions, and municipalities in China. By adjusting parameters, it was found that the model performed optimally with four convolutional layers. Through ablation experiments, it was found that each part of the model contributed relatively independently. Compared to other models, this model achieved an MAE of 8.0169, RMSE of 11.1505, and R2 score of 0.9661 on the test set, which were higher than those of a single LSTM model, a single CNN model, and an LSTM-CNN model without spatial weights. This indicates that incorporating spatial weights in the model and using a combination of LSTM and CNN models can improve the prediction accuracy.
Four different development scenarios were set to analyze the carbon emissions in 30 provinces, regions, and municipalities in China from 2022 to 2035. Local governments can formulate corresponding carbon emission reduction policies based on the peaking situations under different development scenarios. The results showed that except for Yunnan province, all other 29 provinces, regions, and municipalities could achieve carbon peaking before 2030 under suitable development scenarios. Specifically, in the baseline scenario, Shandong, Guangxi Zhuang Autonomous Region, Sichuan, and Shaanxi would reach their peaks the earliest. In the scenario of industrial structure optimization, Heilongjiang, Shanghai, Jiangsu, Zhejiang, Jiangxi, Henan, Guangdong, Gansu, and Xinjiang Uygur Autonomous Region would reached their peaks earliest. In the extensive scenario, Beijing, Tianjin, Hebei, Inner Mongolia Autonomous Region, Liaoning, Jilin, Fujian, Hunan, Hainan, Chongqing, Guizhou, Yunnan, and Qinghai would reach their peaks the earliest. In the energy conservation and emission reduction scenario, Shanxi, Anhui, Hubei, and Ningxia would reach their peaks the earliest.

7.2. Policy Recommendations

For different cities, it is recommended that Shandong Province, Guangxi Zhuang Autonomous Region, Sichuan Province, and Shaanxi Province develop at different rates as set in the baseline scenario. The decarbonization effects are not satisfactory in the optimized industrial structure, extensive development, and energy-saving and emission reduction scenarios in the above-mentioned cities. It is recommended that Heilongjiang Province, Shanghai, Jiangsu Province, Zhejiang Province, Jiangxi Province, Henan Province, Guangdong Province, Gansu Province, and Xinjiang Uyghur Autonomous Region develop at different rates as set in the scenario where industrial structure is optimized. Compared with the other three scenarios, accelerating the transformation of industrial structure in the above-mentioned cities can advance the carbon peaking time. It is recommended that Shanxi Province, Anhui Province, Hubei Province, and Ningxia Province develop at different rates as set in the energy-saving and emission reduction scenario. The above-mentioned cities are suitable for promoting low-carbon lifestyles, increasing the proportion of clean-energy use, and accelerating the development of low-carbon technologies.
It is recommended that Beijing, Tianjin, Jilin Province, Hunan Province, Hainan Province, and Chongqing develop at different rates as set in the extensive development scenario. Except for Chongqing, the development of low-carbon technologies is rapid in these cities, and maintaining a high economic development speed has little impact on the timing of carbon peaking. The carbon emissions of Chongqing are greatly influenced by surrounding cities. Due to different development strategies and economic structures from other cities, Hebei Province, Inner Mongolia Autonomous Region, Liaoning Province, Fujian Province, Guizhou Province, and Qinghai Province need to increase the use of clean energy, reduce the proportion of coal consumption, and strengthen the development of low-carbon technologies in order to develop at different rates as set in the extensive development scenario.
If Yunnan Province wants to advance its timing of carbon peaking, it can consider intensifying the transformation of industrial structure and accelerating the development of low-carbon technologies.

7.3. Insufficiency and Future Prospects

For the limitations and improvements of the model encountered in the research process, this paper provides the following two aspects:
(1) This paper applied deep learning algorithms to predict carbon emissions. The internal mechanism of deep learning algorithms is very complex, and the judgment and inference results of the model are often difficult to explain. This means the model can only be used for predicting carbon emissions but not for analyzing the influencing factors of carbon emissions.
(2) The independent variables in the model are all socio-economic data. In the scenario analysis, it was necessary to first set the rate of change of each independent variable, predict the independent variables, and then predict the carbon emissions. The subjective setting of the rate of change of independent variables was relatively strong, and the longer the prediction time, the greater the deviation between the predicted data and the real data. Based on the development plan released by the country, this paper cautiously set the rate of change of each independent variable to minimize data deviation.
For future work, we could use feature extraction networks with stronger exploratory capabilities, such as Transformer [26] and GNN models [27], for the input variable part of the model. We can also introduce causal inference [28] and other methods to improve the prediction effect of carbon emissions based on influencing factors, and explore the causal relationship between influencing factors. Finally, the selection of original data can also contribute to the improvement and enhancement of the final prediction effect.

Author Contributions

Data curation, B.C.; formal analysis, Z.H.; funding acquisition, L.X.; investigation, B.C.; project administration, L.X.; software, Z.H. and B.C.; supervision, L.X. and Z.G.; validation, Z.H. and B.C.; visualization, Z.H.; writing—original draft, Z.H. and B.C.; writing—review and editing, J.W. and L.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the National Social Science Foundation of China under Grant 20BTJ046.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset is available on request.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

LSTMLong short-term memory
CNNConvolutional neural network
MAEMean absolute error
RMSERoot-mean-square error

Appendix A

Table A1. Setting of provincial influencing factor change rates in different scenarios.
Table A1. Setting of provincial influencing factor change rates in different scenarios.
Province/CityYearDevelopment RateAnnual Total Population Change RatePer Capita GDP Change RateEnergy Intensity Change RateIndustrial Structure Change RateUrbanization Rate Change Rate
Beijing2021–2025Low−0.04764.6−3.4−0.70.1
Beijing2021–2025Medium−0.04765.1−3−0.60.2
Beijing2021–2025High−0.04765.6−2.6−0.50.3
Beijing2026–2030Low−0.04764−3.1−0.40.05
Beijing2026–2030Medium−0.04764.5−2.7−0.30.1
Beijing2026–2030High−0.04765−2.3−0.20.2
Beijing2031–2035Low−0.04763.6−2.9−0.20
Beijing2031–2035Medium−0.04764.1−2.5−0.10.05
Beijing2031–2035High−0.04764.6−2.1−0.050.1
Tianjing2021–2025Low0.95244.5−3.4−40.15
Tianjing2021–2025Medium0.95245−3−3.50.25
Tianjing2021–2025High0.95245.5−2.6−30.35
Tianjing2026–2030Low0.95244−3.1−4.50.1
Tianjing2026–2030Medium0.95244.5−2.7−40.2
Tianjing2026–2030High0.95245−2.3−3.50.3
Tianjing2031–2035Low0.95243.8−2.9−4.80.05
Tianjing2031–2035Medium0.95244.2−2.5−4.30.15
Tianjing2031–2035High0.95244.7−2.1−3.80.25
Hebei2021–2025Low0.215.3−3.4−3.31.1
Hebei2021–2025Medium0.215.8−3−2.81.6
Hebei2021–2025High0.216.3−2.6−2.32.1
Hebei2026–2030Low0.214.7−3.1−3.51
Hebei2026–2030Medium0.215.2−2.7−31.5
Hebei2026–2030High0.215.7−2.3−2.52
Hebei2031–2035Low0.214.3−2.9−3.70.9
Hebei2031–2035Medium0.214.8−2.5−3.21.4
Hebei2031–2035High0.215.3−2.1−2.71.9
Shan1xi2021–2025Low−0.0937.6−3.5−1.51.2
Shan1xi2021–2025Medium−0.0938.1−3.1−11.7
Shan1xi2021–2025High−0.0938.6−2.7−0.52.2
Shan1xi2026–2030Low−0.0937−3.4−21.1
Shan1xi2026–2030Medium−0.0937.5−2.8−1.51.6
Shan1xi2026–2030High−0.0938−2.4−12.1
Shan1xi2031–2035Low−0.0936.6−2.9−2.51
Shan1xi2031–2035Medium−0.0937.1−2.5−21.5
Shan1xi2031–2035High−0.0937.6−2.1−1.52
Neimenggu2021–2025Low−0.47395−3.6−0.40.5
Neimenggu2021–2025Medium−0.47395.5−3.2−0.31
Neimenggu2021–2025High−0.47396−2.8−0.21.5
Neimenggu2026–2030Low−0.47394.5−3.2−0.30.4
Neimenggu2026–2030Medium−0.47395−2.8−0.20.9
Neimenggu2026–2030High−0.47395.5−2.4−0.11.4
Neimenggu2031–2035Low−0.47394−2.9−0.20.3
Neimenggu2031–2035Medium−0.47394.5−2.5−0.10.8
Neimenggu2031–2035High−0.47395−2.101.3
Liaoning2021–2025Low0.85634.6−3.5−20.6
Liaoning2021–2025Medium0.85635.1−3.1−1.51.1
Liaoning2021–2025High0.85635.6−2.7−11.6
Liaoning2026–2030Low0.85634−3.4−1.50.5
Liaoning2026–2030Medium0.85634.5−2.8−11
Liaoning2026–2030High0.85635−2.4−0.51.5
Liaoning2031–2035Low0.85633.6−2.910.4
Liaoning2031–2035Medium0.85634.1−2.5−0.50.9
Liaoning2031–2035High0.85634.6−2.101.4
Jiling2021–2025Low−2.29368.5−3.5−2.50.7
Jiling2021–2025Medium−2.29369−3.1−20.8
Jiling2021–2025High−2.29369.5−2.7−1.50.9
Jiling2026–2030Low−2.29369.5−3.4−30.6
Jiling2026–2030Medium−2.293610−2.8−2.50.7
Jiling2026–2030High−2.293610.5−2.4−20.8
Jiling2031–2035Low−2.293610.5−2.9−3.50.5
Jiling2031–2035Medium−2.293611−2.5−30.6
Jiling2031–2035High−2.293611.5−2.1−2.50.7
Heilongjiang2021–2025Low−1.40196.5−3.5−3.50.15
Heilongjiang2021–2025Medium−1.40197−3.1−30.16
Heilongjiang2021–2025High−1.40197.5−2.7−2.50.17
Heilongjiang2026–2030Low−1.40196−3.4−2.50.14
Heilongjiang2026–2030Medium−1.40196.5−2.8−20.15
Heilongjiang2026–2030High−1.40197−2.4−1.50.16
Heilongjiang2031–2035Low−1.40195.5−2.9−1.50.13
Heilongjiang2031–2035Medium−1.40196−2.5−10.14
Heilongjiang2031–2035High−1.40196.5−2.1−0.50.15
Shanghai2021–2025Low0.09534.4−3.4−2.60.1
Shanghai2021–2025Medium0.09534.9−3−2.10.2
Shanghai2021–2025High0.09535.4−2.6−1.60.3
Shanghai2026–2030Low0.09533.9−3.1−20.05
Shanghai2026–2030Medium0.09534.4−2.7−1.50.1
Shanghai2026–2030High0.09534.9−2.3−10.2
Shanghai2031–2035Low0.09533.4−2.9−1.40
Shanghai2031–2035Medium0.09533.9−2.5−0.90.05
Shanghai2031–2035High0.09534.4−2.1−0.40.1
Jiangsu2021–2025Low1.05363.9−3.4−1.90.35
Jiangsu2021–2025Medium1.05364.4−3−1.40.45
Jiangsu2021–2025High1.05364.9−2.6−0.90.55
Jiangsu2026–2030Low1.05363.4−3.1−1.80.3
Jiangsu2026–2030Medium1.05363.9−2.7−1.30.4
Jiangsu2026–2030High1.05364.4−2.3−0.80.5
Jiangsu2031–2035Low1.05362.9−2.9−1.70.25
Jiangsu2031–2035Medium1.05363.4−2.5−1.20.35
Jiangsu2031–2035High1.05363.9−2.1−0.70.45
Zhejiang2021–2025Low0.09494.9−3.5−40.7
Zhejiang2021–2025Medium0.09495.4−3.1−3.50.8
Zhejiang2021–2025High0.09495.9−2.7−30.9
Zhejiang2026–2030Low0.09494.4−3.4−4.50.6
Zhejiang2026–2030Medium0.09494.9−2.8−40.7
Zhejiang2026–2030High0.09495.4−2.4−3.50.8
Zhejiang2031–2035Low0.09493.9−2.9−50.5
Zhejiang2031–2035Medium0.09494.4−2.5−4.50.6
Zhejiang2031–2035High0.09494.9−2.1−40.7
Anhui2021–2025Low−1.11427.2−3.4−31.2
Anhui2021–2025Medium−1.11427.7−3−2.51.3
Anhui2021–2025High−1.11428.2−2.6−21.4
Anhui2026–2030Low−1.11426.7−3.1−2.51.1
Anhui2026–2030Medium−1.11427.2−2.7−21.2
Anhui2026–2030High−1.11427.7−2.3−1.51.3
Anhui2031–2035Low−1.11426.2−2.9−21
Anhui2031–2035Medium−1.11426.7−2.5−1.51.1
Anhui2031–2035High−1.11427.2−2.1−11.2
Fujian2021–2025Low0.85394.9−3.5−30.7
Fujian2021–2025Medium0.85395.4−3.1−2.50.8
Fujian2021–2025High0.85395.9−2.7−20.9
Fujian2026–2030Low0.85394.4−3.4−2.50.6
Fujian2026–2030Medium0.85394.9−2.8−20.7
Fujian2026–2030High0.85395.4−2.4−1.50.8
Fujian2031–2035Low0.85393.9−2.9−20.5
Fujian2031–2035Medium0.85394.4−2.5−1.50.6
Fujian2031–2035High0.85394.9−2.1−10.7
Jiangxi2021–2025Low1.90484.5−3.5−31.1
Jiangxi2021–2025Medium1.90485−3.1−2.51.2
Jiangxi2021–2025High1.90485.5−2.7−21.3
Jiangxi2026–2030Low1.90485.9−3.4−2.51
Jiangxi2026–2030Medium1.90486.4−2.8−21.1
Jiangxi2026–2030High1.90486.9−2.4−1.51.2
Jiangxi2031–2035Low1.90487.3−2.9−20.9
Jiangxi2031–2035Medium1.90487.8−2.5−1.51
Jiangxi2031–2035High1.90488.3−2.1−11.1
Shandong2021–2025Low0.5724.4−3.8−3.20.6
Shandong2021–2025Medium0.5724.9−3.4−2.70.7
Shandong2021–2025High0.5725.4−3−2.20.8
Shandong2026–2030Low0.5723.9−3.4−2.70.5
Shandong2026–2030Medium0.5724.4−3−2.20.6
Shandong2026–2030High0.5724.9−2.6−1.70.7
Shandong2031–2035Low0.5723.4−3.1−2.20.4
Shandong2031–2035Medium0.5723.9−2.7−1.70.5
Shandong2031–2035High0.5724.4−2.3−1.20.6
Henan2021–2025Low0.37885.1−3.5−4.51.5
Henan2021–2025Medium0.37885.6−3.1−41.6
Henan2021–2025High0.37886.1−2.7−3.51.7
Henan2026–2030Low0.37886.1−3.4−41.4
Henan2026–2030Medium0.37886.6−2.8−3.51.5
Henan2026–2030High0.37887.1−2.4−31.6
Henan2031–2035Low0.37887.1−2.9−3.51.3
Henan2031–2035Medium0.37887.6−2.5−31.4
Henan2031–2035High0.37888.1−2.1−2.51.5
Hubei2021–2025Low0.47175.5−3.5−50.6
Hubei2021–2025Medium0.47176−3.1−4.50.7
Hubei2021–2025High0.47176.5−2.7−40.8
Hubei2026–2030Low0.47175−3.4−4.50.5
Hubei2026–2030Medium0.47175.5−2.8−40.6
Hubei2026–2030High0.47176−2.4−3.50.7
Hubei2031–2035Low0.47174.5−2.9−40.4
Hubei2031–2035Medium0.47175−2.5−3.50.5
Hubei2031–2035High0.47175.5−2.1−30.6
Hunan2021–2025Low0.09445.4−3.4−2.91.4
Hunan2021–2025Medium0.09445.9−3−2.41.5
Hunan2021–2025High0.09446.4−2.6−1.91.6
Hunan2026–2030Low0.09444.9−3.1−2.41.3
Hunan2026–2030Medium0.09445.4−2.7−1.91.4
Hunan2026–2030High0.09445.9−2.3−1.41.5
Hunan2031–2035Low0.09444.4−2.9−1.91.2
Hunan2031–2035Medium0.09444.9−2.5−1.41.3
Hunan2031–2035High0.09445.4−2.1−0.91.4
Guangdong2021–2025Low0.96153.5−3.5−2.90.9
Guangdong2021–2025Medium0.96154−3.1−2.41
Guangdong2021–2025High0.96154.5−2.7−1.91.1
Guangdong2026–2030Low0.96153−3.4−2.40.8
Guangdong2026–2030Medium0.96153.5−2.8−1.90.9
Guangdong2026–2030High0.96154−2.4−1.41
Guangdong2031–2035Low0.96152.5−2.9−1.90.7
Guangdong2031–2035Medium0.96153−2.5−1.40.8
Guangdong2031–2035High0.96153.5−2.1−0.90.9
Guangxi2021–2025Low0.94795−3.4−3.51
Guangxi2021–2025Medium0.94795.5−2.8−31.1
Guangxi2021–2025High0.94796−2.4−2.51.2
Guangxi2026–2030Low0.94794.5−2.9−30.9
Guangxi2026–2030Medium0.94795−2.5−2.51
Guangxi2026–2030High0.94795.5−2.1−21.1
Guangxi2031–2035Low0.94794−2.7−2.50.8
Guangxi2031–2035Medium0.94794.5−2.3−20.9
Guangxi2031–2035High0.94795−1.9−1.51
Hainan2021–2025Low1.1968.2−3.4−41.5
Hainan2021–2025Medium1.1968.7−2.8−3.51.6
Hainan2021–2025High1.1969.2−2.4−31.7
Hainan2026–2030Low1.1967.7−2.9−1.51.4
Hainan2026–2030Medium1.1968.2−2.5−11.5
Hainan2026–2030High1.1968.7−2.1−0.51.6
Hainan2031–2035Low1.1967.2−2.7−11.3
Hainan2031–2035Medium1.1967.7−2.3−0.51.4
Hainan2031–2035High1.1968.2−1.901.5
Chongqing2021–2025Low0.37885.1−3.4−2.90.9
Chongqing2021–2025Medium0.37885.6−3−2.41
Chongqing2021–2025High0.37886.1−2.6−1.91.1
Chongqing2026–2030Low0.37884.6−3.1−2.40.8
Chongqing2026–2030Medium0.37885.1−2.7−1.90.9
Chongqing2026–2030High0.37885.6−2.3−1.41
Chongqing2031–2035Low0.37884.1−2.9−1.90.7
Chongqing2031–2035Medium0.37884.6−2.5−1.40.8
Chongqing2031–2035High0.37885.1−2.1−0.90.9
Sichuan2021–2025Low0.37885.1−3.4−3.41.1
Sichuan2021–2025Medium0.37885.6−3−2.91.2
Sichuan2021–2025High0.37886.1−2.6−2.41.3
Sichuan2026–2030Low0.37884.6−3.1−2.91
Sichuan2026–2030Medium0.37885.1−2.7−2.41.1
Sichuan2026–2030High0.37885.6−2.3−1.91.2
Sichuan2031–2035Low0.37884.1−2.9−2.40.9
Sichuan2031–2035Medium0.37884.6−2.5−1.91
Sichuan2031–2035High0.37885.1−2.1−1.41.1
Guizhou2021–2025Low−1.3828−3.4−1.81.7
Guizhou2021–2025Medium−1.3828.5−2.8−1.31.8
Guizhou2021–2025High−1.3829−2.4−0.81.9
Guizhou2026–2030Low−1.3827.5−2.9−1.41.6
Guizhou2026–2030Medium−1.3828−2.5−0.91.7
Guizhou2026–2030High−1.3828.5−2.1−0.41.8
Guizhou2031–2035Low−1.3827−2.7−11.5
Guizhou2031–2035Medium−1.3827.5−2.3−0.51.6
Guizhou2031–2035High−1.3828−1.901.7
Yunnan2021–2025Low0.46736.5−3.4−1.83.2
Yunnan2021–2025Medium0.46737−2.8−1.33.7
Yunnan2021–2025High0.46737.5−2.4−0.84.2
Yunnan2026–2030Low0.46736−2.9−1.42.7
Yunnan2026–2030Medium0.46736.5−2.5−0.93.2
Yunnan2026–2030High0.46737−2.1−0.43.7
Yunnan2031–2035Low0.46735.5−2.7−12.2
Yunnan2031–2035Medium0.46736−2.3−0.52.7
Yunnan2031–2035High0.46736.5−1.903.2
Shan3xi2021–2025Low−0.46956−3.5−30.7
Shan3xi2021–2025Medium−0.46956.5−2.9−2.50.8
Shan3xi2021–2025High−0.46957−2.5−20.9
Shan3xi2026–2030Low−0.46955.5−3−2.50.6
Shan3xi2026–2030Medium−0.46956−2.6−20.7
Shan3xi2026–2030High−0.46956.5−2.2−1.50.8
Shan3xi2031–2035Low−0.46955−2.8−20.5
Shan3xi2031–2035Medium−0.46955.5−2.4−1.50.6
Shan3xi2031–2035High−0.46956−2−10.7
Gansu2021–2025Low−0.18746.2−3.3−3.52.1
Gansu2021–2025Medium−0.18746.7−2.7−32.2
Gansu2021–2025High−0.18747.2−2.3−2.52.3
Gansu2026–2030Low−0.18745.7−2.8−2.52
Gansu2026–2030Medium−0.18746.2−2.4−22.1
Gansu2026–2030High−0.18746.7−2−1.52.2
Gansu2031–2035Low−0.18745.2−2.6−1.51.9
Gansu2031–2035Medium−0.18745.7−2.2−12
Gansu2031–2035High−0.18746.2−1.8−0.52.1
Qinghai2021–2025Low0.47624.5−3.3−0.60.6
Qinghai2021–2025Medium0.47625−2.7−0.10.7
Qinghai2021–2025High0.47625.5−2.30.40.8
Qinghai2026–2030Low0.47624−2.8−1.10.5
Qinghai2026–2030Medium0.47624.5−2.4−0.60.6
Qinghai2026–2030High0.47625−2−0.10.7
Qinghai2031–2035Low0.47623.5−2.6−1.60.4
Qinghai2031–2035Medium0.47624−2.2−1.10.5
Qinghai2031–2035High0.47624.5−1.8−0.60.6
Ningxia2021–2025Low0.37885.1−3.6−0.80.1
Ningxia2021–2025Medium0.37885.6−3.2−0.30.2
Ningxia2021–2025High0.37886.1−2.80.20.3
Ningxia2026–2030Low0.37884.6−3.2−1.30.2
Ningxia2026–2030Medium0.37885.1−2.8−0.80.3
Ningxia2026–2030High0.37885.6−2.4−0.30.4
Ningxia2031–2035Low0.37884.1−2.9−1.80.3
Ningxia2031–2035Medium0.37884.6−2.5−1.30.4
Ningxia2031–2035High0.37885.1−2.1−0.80.5
Xingjiang2021–2025Low1.13425.3−3.7−21.1
Xingjiang2021–2025Medium1.13425.8−3.3−1.51.2
Xingjiang2021–2025High1.13426.3−2.9−11.3
Xingjiang2026–2030Low1.13424.8−3.3−1.51
Xingjiang2026–2030Medium1.13425.3−2.9−11.1
Xingjiang2026–2030High1.13425.8−2.5−0.51.2
Xingjiang2031–2035Low1.13424.3−3−10.9
Xingjiang2031–2035Medium1.13424.8−2.6−0.51
Xingjiang2031–2035High1.13425.3−2.201.1

References

  1. Qin, D.H.; Thomas, S. Highlights of the IPCC Fifth Assessment Report Working Group I report. ACCR 2014, 10, 1–6. [Google Scholar]
  2. Zhang, X.; Yan, F.; Liu, H.; Qiao, Z. Towards low carbon cities: A machine learning method for predicting urban blocks carbon emissions (UBCE) based on built environment factors (BEF) in Changxing City, China. Sustain. Cities Soc. 2021, 69, 102875. [Google Scholar] [CrossRef]
  3. Yan, S.R.; Zhang, Y.X.; Sun, H.D.; Wang, A.P. A real-time operational carbon emission prediction method for the early design stage of residential units based on a convolutional neural network: A case study in Beijing, China. J. Build. Eng. 2023, 75, 106994. [Google Scholar] [CrossRef]
  4. Su, Y.; Cheng, H.; Wang, Z.; Yan, J.; Miao, Z.; Gong, A. Analysis and prediction of carbon emission in the large green commercial building: A case study in Dalian, China. J. Build. Eng. 2023, 68, 106147. [Google Scholar] [CrossRef]
  5. Zhang, M.; Kafy, A.A.; Xiao, P.; Han, S.; Zou, S.; Saha, M.; Zhang, C.; Tan, S. Impact of urban expansion on land surface temperature and carbon emissions using machine learning algorithms in Wuhan, China. Urban Clim. 2023, 47, 101347. [Google Scholar] [CrossRef]
  6. Chen, Z.; Guo, Y.; Guo, C. Prediction of GHG emissions from Chengdu Metro in the construction stage based on WOA-DELM. Tunn. Undergr. Space Technol. 2023, 139, 105235. [Google Scholar] [CrossRef]
  7. Wang, A.; Hu, S.; Li, J. Using machine learning to model technological heterogeneity in carbon emission efficiency evaluation: The case of China’s cities. Energ. Econ. 2022, 114, 106238. [Google Scholar] [CrossRef]
  8. Sun, Q.; Chen, H.; Long, R.; Zhang, J.; Yang, M.; Huang, H.; Ma, W.; Wang, Y. Can Chinese cities reach their carbon peaks on time? Scenario analysis based on machine learning and LMDI decomposition. Appl. Energy 2023, 347, 121427. [Google Scholar] [CrossRef]
  9. Zhao, J.; Kou, L.; Wang, H.; He, X.; Xiong, Z.; Liu, C.; Cui, H. Carbon emission prediction model and analysis in the Yellow River basin based on a machine learning method. Sustainability 2022, 14, 6153. [Google Scholar] [CrossRef]
  10. Qin, J.; Gong, N. The estimation of the carbon dioxide emission and driving factors in China based on machine learning methods. Sustain. Prod. Consump. 2022, 33, 218–229. [Google Scholar] [CrossRef]
  11. Aryai, V.; Goldsworthy, M. Day ahead carbon emission forecasting of the regional National Electricity Market using machine learning methods. Eng. Appl. Artif. Intel. 2023, 123, 106314. [Google Scholar] [CrossRef]
  12. Sarwar, S.; Aziz, G.; Tiwari, A.K. Implication of machine learning techniques to forecast the electricity price and carbon emission: Evidence from a hot region. Geosci. Front. 2023, 2023, 101647. [Google Scholar] [CrossRef]
  13. Xu, M.; Sun, C.; Zhan, Y.; Liu, Y. Impact and prediction of pollutant on mangrove and carbon stocks: A machine learning study based on urban remote sensing data. Geosci. Front. 2023, 2023, 101665. [Google Scholar] [CrossRef]
  14. Han, Y.; Zhu, Q.; Geng, Z.; Xu, Y. Energy and carbon emissions analysis and prediction of complex petrochemical systems based on an improved extreme learning machine integrated interpretative structural model. Appl. Therm. Eng. 2017, 115, 280–291. [Google Scholar] [CrossRef]
  15. Anthony, L.F.W.; Kanding, B.; Selvan, R. Carbontracker: Tracking and predicting the carbon footprint of training deep learning models. arXiv 2020, arXiv:2007.03051. [Google Scholar]
  16. Zhao, Y.; Liu, R.; Liu, Z.; Liu, L.; Wang, J.; Liu, W. A Review of Macroscopic Carbon Emission Prediction Model Based on Machine Learning. Sustainability 2023, 15, 6876. [Google Scholar] [CrossRef]
  17. Yang, H.; Wang, M.; Li, G. A combined prediction model based on secondary decomposition and intelligence optimization for carbon emission. Appl. Math. Model. 2023, 121, 484–505. [Google Scholar] [CrossRef]
  18. Nadirgil, O. Carbon price prediction using multiple hybrid machine learning models optimized by genetic algorithm. J. Environ. Manag. 2023, 342, 118061. [Google Scholar] [CrossRef]
  19. Zhang, Y.; Li, X.; Zhang, Y. A novel integrated optimization model for carbon emission prediction: A case study on the group of 20. J. Environ. Manag. 2023, 344, 118422. [Google Scholar] [CrossRef]
  20. Bai, S.; Kolter, J.Z.; Koltun, V. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv 2018, arXiv:1803.01271. [Google Scholar]
  21. Livieris, I.E.; Pintelas, E.; Pintelas, P. A CNN–LSTM model for gold price time-series forecasting. Neural Comput. Appl. 2020, 32, 17351–17360. [Google Scholar] [CrossRef]
  22. Lu, W.; Li, J.; Li, Y.; Sun, A.; Wang, J. A CNN-LSTM-based model to forecast stock prices. Complexity 2020, 2020, 1–10. [Google Scholar] [CrossRef]
  23. Shahbaz, M.; Loganathan, N.; Muzaffar, A.T.; Ahmed, K.; Jabran, M.A. How urbanization affects CO2 emissions in Malaysia? The application of STIRPAT model. Renew. Sustain. Energy Rev. 2016, 57, 83–93. [Google Scholar] [CrossRef]
  24. Bulut, M. Hydroelectric Generation Forecasting with Long Short Term Memory (LSTM) Based Deep Learning Model for Turkey. arXiv 2021, arXiv:2109.09013. [Google Scholar]
  25. Bengio, Y. Practical recommendations for gradient-based training of deep architectures. arXiv 2012, arXiv:1206.5533. [Google Scholar]
  26. Wu, N.; Green, B.; Ben, X.; O’Banion, S. Deep transformer models for time series forecasting: The influenza prevalence case. arXiv 2020, arXiv:2001.08317. [Google Scholar]
  27. Kim, T.; Kim, J.; Tae, Y.; Park, C.; Cho, J.-H.; Choo, J. Reversible instance normalization for accurate time-series forecasting against distribution shift. In Proceedings of the Tenth International Conference on Learning Representations, ICLR 2022, Virtual, 25–29 April 2022. [Google Scholar]
  28. Hu, Y.; Jia, X.; Tomizuka, M.; Zhan, W. Causal-based time series domain generalization for vehicle intention prediction. In Proceedings of the 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, 23–27 May 2022; pp. 7806–7813. [Google Scholar]
Figure 1. Flowchart of CNN.
Figure 1. Flowchart of CNN.
Sustainability 15 13934 g001
Figure 2. LSTM network’s internal structure diagram.
Figure 2. LSTM network’s internal structure diagram.
Sustainability 15 13934 g002
Figure 3. LSTM-CNN model structure.
Figure 3. LSTM-CNN model structure.
Sustainability 15 13934 g003
Figure 4. Carbon emissions by provinces in China (Northern) from 1997 to 2021.
Figure 4. Carbon emissions by provinces in China (Northern) from 1997 to 2021.
Sustainability 15 13934 g004
Figure 5. Carbon emissions by provinces in China (Central China) from 1997 to 2021.
Figure 5. Carbon emissions by provinces in China (Central China) from 1997 to 2021.
Sustainability 15 13934 g005
Figure 6. Carbon emissions by provinces in China (Southern) from 1997 to 2021.
Figure 6. Carbon emissions by provinces in China (Southern) from 1997 to 2021.
Sustainability 15 13934 g006
Figure 7. Heat map of correlation between carbon emission y and independent variables.
Figure 7. Heat map of correlation between carbon emission y and independent variables.
Sustainability 15 13934 g007
Figure 8. The ablation experiment results in LSTM-CNN Model (no weighting).
Figure 8. The ablation experiment results in LSTM-CNN Model (no weighting).
Sustainability 15 13934 g008
Figure 9. The ablation experiment results in LSTM-CNN model (weighting).
Figure 9. The ablation experiment results in LSTM-CNN model (weighting).
Sustainability 15 13934 g009
Figure 10. Forecast of carbon emissions by provinces in China during 2022–2035.
Figure 10. Forecast of carbon emissions by provinces in China during 2022–2035.
Sustainability 15 13934 g010
Table 1. Socio-economic data.
Table 1. Socio-economic data.
The IndexCalculation MethodConnotationUnitTimeData Source
Total population at year endYear-end population by regionPopulationThousands of peopleFrom 1997 to 2020China Population and Employment Statistical Yearbook (1997–2021)
Per capita GDPGross regional product/year-end population of subregioneconomyYuanFrom 1997 to 2020National Bureau of Statistics
Energy intensityTotal energy consumption/GDPTechnologyTen thousand tons of standard coal/CNY 100 millionFrom 1997 to 2020China Energy Statistical Yearbook (1997–2019), Statistical Yearbook of Provinces and Municipalities (2020–2021), Statistical Yearbook of Ningxia (2001)
Industrial structureValue added of secondary industry/gross regional productTechnology%From 1997 to 2020China Statistical Yearbook (1997–2021)
Urbanization rateUrban population/year-end population by regionUrbanization level%From 1997 to 2020China Statistical Yearbook (2005–2021), China Population and Employment Statistical Yearbook (1997–2004)
Table 2. Global Moran’s index.
Table 2. Global Moran’s index.
YearMoran’s IZ-Valuep-Value
19970.1724821.5787830.070
19980.2039771.8554810.040
19990.2255062.1328960.023
20000.2283012.2068870.021
20010.2604592.3925980.018
20020.2540422.3010150.023
20030.1849191.8027180.046
20040.2443782.3874700.013
20050.2729782.6762800.007
20060.2838782.7118530.009
20070.2844802.7721280.008
20080.2466672.3844510.012
20090.2306762.1007690.027
20100.2263142.3195270.018
20110.2221902.0763780.024
20120.1840551.8766830.041
20130.1887101.8798290.042
20140.1959531.9798890.035
20150.1711371.7476870.054
20160.1538561.5902290.078
20170.1188521.2370580.113
20180.1245161.3719060.092
20190.1332531.4114850.082
20200.1511971.5702060.080
Table 3. Parameter Settings.
Table 3. Parameter Settings.
LayerInput ChannelOutput Channel
Convolution layer 1364
Convolution layer 26464
Convolution layer 364128
Convolution layer 4128128
Convolution layer 5–10128128
Fully connected layer 1307232
Fully connected layer 23216
Fully connected layer 3161
Table 4. Training results of LSTM-CNN model under different convolution layers.
Table 4. Training results of LSTM-CNN model under different convolution layers.
Number of CNN Convolution LayersTraining Set (MAE)Training Set (RMSE)Training Set (R-Squared)Testing Set (MAE)Testing Set (RMSE)Testing Set (R-Squared)
112.651817.39160.97729.529112.88540.9548
29.325314.31210.98468.448811.72710.9625
39.462614.43170.98439.003712.05010.9605
410.138215.29880.98248.016911.15050.9661
519.164625.02270.952911.445014.81900.9402
610.922216.76700.97888.797912.24850.9591
713.172119.21770.97228.612712.13950.9599
89.874515.34620.98239.059312.69990.9561
916.880325.68810.950310.084713.70420.9489
109.755314.95350.98328.307111.44210.9643
Table 5. Ablation results of LSTM-CNN model.
Table 5. Ablation results of LSTM-CNN model.
ModelTraining Set (MAE)Training Set (RMSE)Training Set (R-Squared)Testing Set (MAE)Testing Set (RMSE)Testing Set (R-Squared)
LSTM (no weighting)10.481215.16350.98279.151211.51550.9639
CNN (no weighting)23.864832.99290.918114.682421.28490.8766
CNN-LSTM (no weighting)10.695915.15640.98278.390811.66470.9629
LSTM-CNN (no weighting)15.468622.35430.96249.149712.69010.9561
LSTM (weighting)8.694413.40540.98658.460711.56440.9636
CNN (weighting)7.888712.00490.98929.761913.99770.9466
CNN-LSTM (weighting)8.381512.36460.988510.242812.96170.9543
LSTM-CNN (weighting)10.230115.72710.98147.850010.69670.9688
Table 6. Scenario setting of carbon emission.
Table 6. Scenario setting of carbon emission.
ScenarioYear-End PopulationPer Capita GDPEnergy IntensityIndustrial StructureUrbanization Rate
BaselineIntermediateIntermediateIntermediateIntermediateIntermediate
Industrial structure optimizingIntermediateIntermediateIntermediateHighIntermediate
ExtensiveIntermediateHighLowLowHigh
Emission reductionIntermediateIntermediateHighIntermediateIntermediate
Table 7. Prediction of carbon peak time.
Table 7. Prediction of carbon peak time.
Province/CityBaselineIndustrial Structure OptimizingExtensiveEmission Reduction
BeiJing2031202720252025
TianJing2027202720252028
HeBei2023202320232023
Shan1XiUnderpeak2025Underpeak2024
NeiMengGu2024202520242024
LiaoNing2024202420242024
JiLing2031202520252025
HeiLongJiang2026202620322027
ShangHai2027202420262025
JiangSu2023202320242024
ZheJiang20242024Underpeak2027
AnHui2026202520252024
FuJianUnderpeak202920282033
JiangXi2030202920342031
ShanDong2023202420282024
HeNan2033202420262027
HuBei2026203020262025
HuNan2024203320242027
GuangDong2025202420272025
GuangXi2026202720282033
HaiNan2023202220222022
ChongQing2029203420292029
SiChuan2023202420312024
GuiZhouUnderpeak202720262028
YunNan2033203220322034
Shan3Xi2027203120322028
GanSu2032202820292032
QingHaiUnderpeakUnderpeak20292031
NingXia2029202620302025
XingJiang2024202420272026
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Han, Z.; Cui, B.; Xu, L.; Wang, J.; Guo, Z. Coupling LSTM and CNN Neural Networks for Accurate Carbon Emission Prediction in 30 Chinese Provinces. Sustainability 2023, 15, 13934. https://doi.org/10.3390/su151813934

AMA Style

Han Z, Cui B, Xu L, Wang J, Guo Z. Coupling LSTM and CNN Neural Networks for Accurate Carbon Emission Prediction in 30 Chinese Provinces. Sustainability. 2023; 15(18):13934. https://doi.org/10.3390/su151813934

Chicago/Turabian Style

Han, Zhonghua, Bingwei Cui, Liwen Xu, Jianwen Wang, and Zhengquan Guo. 2023. "Coupling LSTM and CNN Neural Networks for Accurate Carbon Emission Prediction in 30 Chinese Provinces" Sustainability 15, no. 18: 13934. https://doi.org/10.3390/su151813934

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop