Carbon emissions are a major issue of global concern, and in order to address the problem of global warming caused by greenhouse gas emissions, China has put forward a clear “dual-carbon” target. Accurately grasping the trend of carbon emissions may provide a rational basis for the country to formulate a reasonable emission reduction plan, which will be vital if the country is to achieve its dual-carbon target. Relying on an integrated energy system and aiming to minimize the total amount of carbon emissions to achieve carbon peaking and carbon neutrality is a major strategic decision made by integrating the international and domestic situations, and it is of great significance to the realization of the “dual carbon” goal.
Carbon emission forecasting has been studied at national and international levels using different methods, and these experts have yielded spectacular results. Luo Bixiong [
1] and others used a hybrid measurement model for predicting carbon emissions in the energy sector to optimize the installed power structure and energy consumption structure to achieve the goals of minimizing these structures, making the structures cleaner, and minimizing carbon emissions. Zhang Shiqiang [
2] et al. used the path analysis method to show that by optimizing the energy structure and expanding the enterprise scale, inhibiting effects on carbon emissions are achieved. Lv Yan [
3] and others dynamically simulated a LSTM Model for carbon emission prediction using scenario analysis to anticipate the future trend of carbon emissions from the construction industry in Xinjiang and to derive the recommendations for emission reductions.
Liu et al. [
4] studied the factors affecting carbon emissions from transport based on relevant data from 30 provinces from 2005 to 2019 and aggregated multiple machine learning algorithms into different prediction models. Chen Chuanmin [
5] and others constructed a framework for analyzing the carbon emissions of grid enterprises based on the LEAP model in order to quantify the emission reduction contribution rate of each factor during the operation of the grid enterprises; for the correlation of each factor, a two-level scenario analysis model consisting of integrated scenarios and sub-scenarios was designed and constructed by applying the scenario analysis method and the comparative analysis method at the same time. Zhou Cheng [
6] et al. used the decomposition–integration strategy, which by decreasing the computation of the original EC prediction problem’s complications and non-linearities can effectively improve the prediction performance. By combining the advantages of trend decomposition, empirical modal decomposition, and wavelet decomposition, a new three-layer differential evolutionary prediction method is proposed. Yang [
7] proposed a machine learning-based urban carbon emission prediction method in the context of big data in his article, and determined that the optimal algorithm was Random Forest—which was chosen to predict urban carbon emissions—by comparing the advantages and disadvantages of each algorithm. Wei [
8] and others combined the Tapio de-coupling model with the STIRPAT model and estimated its efficacy using the 2000–2020 panel data of Henan Province. The relationship between the economic development of agriculture and animal husbandry and carbon emissions, as well as related factors were examined. Yue [
9] and others constructed a GCA-GRNN-DOA carbon emission prediction model in order to find an effective carbon emission prediction method so as to establish corresponding emission reduction measures. Yan [
10] et al. improved the results of the forecast in the case of the joint use of multiple algorithmic models and proposed a carbon emission influencing decomposition model based on LMDI for the carbon emissions, and a prediction model of carbon emissions on the basis of the EEMD-BSO-GPR model. Yu [
11] et al. constructed a community carbon emission sample database based on the statistical emission factors of the power system of the North China Power Grid. A community carbon emission early-warning system was designed by training an SVR model to predict electricity carbon emissions and optimize the SVR model using GA. Wei [
12] et al. used a Tapio decoupling model to explore the connection between carbon emissions and the economy in Henan Province; a STIRPAT extension model and the ridge regression model to find out the factor influencing carbon emission in Henan Province; and they also obtained a carbon emission forecasting model. Wang [
13] et al. programmed a Lagrange interpolation algorithm through MATLAB to forecast the carbon emission of the economic growth of Beijing city and provided decision support for the government. An “inverted U” relationship was derived between economic growth and energy consumption and carbon emissions. Chai Tew Ang [
14] et al. used an integrated modeling tool consisting of a time series ARIMA model to predict the total CO
2 emissions from 2009 to 2020 in a study of Malaysia’s energy consumption and transportation CO
2 emissions. Ref. [
14] et al. used a comprehensive modeling tool consisting of a time series ARIMA model to predict the total carbon dioxide emissions from 2009 to 2020 for energy consumption, and transportation carbon dioxide emissions forecasting in Malaysia. Wang [
15] et al. proposed a two-stage prediction method based on Support Vector Regression, Random Forest, Ridge Regression, and Artificial Neural Networks for carbon dioxide forecasting, and compared it with a single prediction method for carbon dioxide emissions forecasting. Peng [
16] proposed a model-based method for predicting short-term carbon emissions from green buildings. The IPCC method was used to find the interacting elements of carbon emissions of green buildings, and the interacting elements were classified according to the importance of the interacting elements. The model was then applied to calculate the short-term CO
2 emissions during the construction phase and the whole process of implementing a green building, and the IPAT model was developed to disintegrate the CO
2 emissions as products of dissimilar elements.
As the problem of carbon emission is becoming more and more serious, the demand for the accurate prediction of carbon emission is becoming more and more urgent. Carbon emissions are closely related to the development of society, but the current research generally focuses on several aspects, such as economy and energy, to study the correlation between typical influencing factors and carbon emissions, and there is a lack of research on the mining of carbon emission influencing factors at a deeper level. At the same time, the mining of carbon emission influencing factors needs to deal with large-scale data. This paper applies the path analysis method to analyze the carbon emission influencing factors and derives the influencing relationship between the factors and the corresponding path. The establishment of a carbon emission prediction model in the context of large-scale data, using the parameters of a multiple regression carbon emission prediction model, is achieved by the small-batch gradient descent algorithm. The following are the new innovations in this paper: