Next Article in Journal
Meaningful Gamification in Ecotourism: A Study on Fostering Awareness for Positive Ecotourism Behavior
Previous Article in Journal
Understanding Resilience and Sustainability in Organizations: A Cutting-Edge Framework and the Research Agenda
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Study on Transportation Carbon Emissions in Tibet: Measurement, Prediction Model Development, and Analysis

1
School of Engineering, Tibet University, Lhasa 850000, China
2
Plateau Major Infrastructure Smart Construction and Resilience Safety Technology Innovation Center, Lhasa 850000, China
3
Intelligent Transportation System Research Center, Southeast University, Nanjing 211189, China
4
Meteorological Information and Network Center of Xizang Autonomous Region, Lhasa 850000, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Sustainability 2024, 16(19), 8419; https://doi.org/10.3390/su16198419
Submission received: 26 August 2024 / Revised: 23 September 2024 / Accepted: 24 September 2024 / Published: 27 September 2024

Abstract

:
In recent years, the socio-economic development in the Tibet region of China has experienced substantial growth. However, transportation increasingly strains the region’s fragile ecological environment. Most studies overlook the accurate measurement and analysis of factors influencing traffic carbon emissions in Tibet due to data scarcity. To address this, this paper applies an improved traffic carbon emissions model, using transportation turnover data to estimate emissions in Tibet from 2008 to 2020. Simultaneously, the estimated traffic carbon emissions in Tibet served as the predicted variable, and various machine learning algorithms, including Radial Basis Function Support Vector Machine (RBF-SVM), eXtreme Gradient Boosting (XGBoost), Random Forest, and Gradient Boosting Decision Tree (GBDT) are employed to conduct an initial comparison of the constructed prediction models using three-fold cross-validation and multiple evaluation metrics. The best-performing model undergoes further optimization using Grid Search (GS) and Real-coded Genetic Algorithm (RGA). Finally, the central difference method and Local Interpretable Model-Agnostic Explanation (LIME) algorithm are used for local sensitivity and interpretability analyses on twelve core variables. The results assess each variable’s contribution to the model’s output, enabling a comprehensive analysis of their impact on Tibet’s traffic carbon emissions. The findings demonstrate a significant upward trend in Tibet’s traffic carbon emissions, with road transportation and civil aviation being the main contributors. The RBF-SVM algorithm is most suitable for predicting traffic carbon emissions in this region. After GS optimization, the model’s R2 value exceeded 0.99, indicating high predictive accuracy and stability. Key factors influencing traffic carbon emissions in Tibet include civilian vehicle numbers, transportation land-use area, transportation output value, urban green coverage areas, per capita GDP, and built-up area. This paper provides a systematic framework and empirical support for measuring, predicting, and analyzing factors influencing traffic carbon emissions in Tibet. It employs innovative measurement methods, optimized machine learning models, and detailed sensitivity and interpretability analyses. The results can guide regional carbon reduction targets and promote green sustainable development.

1. Introduction

A report from the World Resources Institute reveals that transportation is responsible for 24% of global CO2 emissions, making it a significant contributor to the world’s carbon dioxide levels [1]. Transportation plays a vital role in daily life and work, significantly advancing socio-economic development. However, this rapid growth has imposed immense environmental pressure. As a primary source of CO2 emissions, its impact on global climate change and environmental pollution is undeniable. Therefore, ensuring green and sustainable growth while advancing transportation development has become essential, drawing significant attention from policymakers and researchers [2]. Governments are striving to achieve “carbon neutrality” and “carbon peaking” goals to mitigate the complex effects of climate change. Addressing the relationship between transportation growth and carbon emissions is not only of practical significance but also key to achieving global sustainable development goals.
On 28 August 2020, at the Seventh Central Symposium on Tibet Work, Chinese President Xi Jinping emphasized, “Protecting the ecological environment of the Qinghai-Tibet Plateau is the greatest contribution to the survival and development of the Chinese nation”. The Tibet Autonomous Region, known as the “Third Pole of the Earth”, plays a crucial role in China’s ecological civilization efforts, with particularly vital ecological functions. However, economic and social development has led to rising CO2 emissions, posing a growing threat to the plateau’s ecosystem [3]. Although industry and construction in Tibet are developing slowly, transportation is expected to continue growing rapidly in the foreseeable future. Therefore, effectively controlling and reducing transportation-related CO2 emissions has become an urgent priority.
This study aims to address Tibet’s transportation carbon emission prediction and analysis issues, not only tackling the region’s pressing environmental concerns but also contributing to the global framework of climate change mitigation by developing localized low-carbon transportation strategies. The urgency of this research lies in its potential to provide critical decision-making support and a theoretical basis for local governments in setting carbon reduction targets and formulating energy-saving and emission reduction policies. We employ carbon emission estimation methods suitable for Tibet’s unique environmental conditions, combining quantitative analysis with region-specific prediction research. This approach will provide more precise scientific evidence for future transportation policies, enhancing the effectiveness of carbon reduction efforts in the transportation sector.
The remainder is given as follows: Section 2 comprehensively reviews the literature and identifies the research direction and contributions of this paper, aiming to improve the prediction and analysis of Tibet’s transportation carbon emissions through model optimization and variable selection. Section 3 refines the transportation carbon emissions measurement model proposed by the UNWTO and combines it with Tibet’s transportation turnover data to estimate emissions from 2008 to 2020. Section 4 selects twelve core variables across five dimensions, adds two control variables based on a literature review and Tibet’s characteristics, constructs and evaluates multiple machine learning models to identify the most suitable algorithm for predicting Tibet’s transportation carbon emissions, and further optimizes the selected algorithm to achieve the highest prediction accuracy and stability. Section 5 uses the central difference method for local sensitivity analysis of the twelve core variables based on the model selected in Section 4 and then conducts interpretability analysis using the LIME algorithm. The contribution of each variable to the model output is analyzed based on the results of both methods. Section 6 comprehensively analyzes the impact of various variables on Tibet’s transportation carbon emissions based on the results of the variable contribution analysis in Section 5. Section 7 concludes this paper.

2. Literature Review

Transportation production occurs during the circulation process, aimed at the spatial movement of people or goods. As a result, its carbon emissions are characterized by dispersed sources and wide coverage. The United Nations World Tourism Organization (UNWTO) [4] proposed a model for calculating transportation carbon emissions, where the passenger turnover for each mode is multiplied by the corresponding emission factor to determine the carbon emissions per mode. Following this, Cai et al. [5] utilized the Intergovernmental Panel on Climate Change (IPCC)-recommended mobile source accounting method and energy emission factors to estimate the CO2 emissions in China’s transportation sector at both national and regional levels in 2007. Because directly measuring these emissions is difficult, indirect estimation based on other statistics is often necessary.
In transportation carbon emissions prediction, most studies rely on machine learning models. For example, Zhu et al. [6] were among the early scholars to construct a carbon emissions prediction model for China’s transportation industry based on the Support Vector Regression (SVR) model, introducing a scenario analysis method to estimate future emissions. Subsequently, Ağbulut [7] applied various machine learning algorithms, including Random Forest, Support Vector Machine (SVM), and eXtreme Gradient Boosting (XGBoost), to predict Turkey’s transportation-related energy demand and CO2 emissions, expanding the application of machine learning in transportation carbon emission forecasting. More recently, Ma et al. [8] further advanced the field by integrating Gradient Boosting Decision Tree (GBDT) and Multiscale Geographically Weighted Regression (MGWR) models to explore the spatial heterogeneity of accessibility’s impact on multimodal transportation carbon emissions in urban agglomerations. This approach not only provides a deeper analysis of spatial factors but also highlights the growing importance of incorporating spatial variations into carbon emission predictions. In some studies, traditional prediction models are still utilized. Gao et al. [9] utilized traditional prediction models like time series and regression analysis to construct a multi-model comparison framework, evaluating various prediction models for urban transportation carbon emissions. Similarly, Li et al. [10] applied a time series prediction model to forecast the carbon peak time of China’s transportation sector. These studies illustrate the continued use of traditional models while showing the theoretical progression toward more complex methodologies. Although these studies contribute valuable insights, unresolved issues remain regarding the applicability of traditional models under diverse conditions, paving the way for more advanced approaches like machine learning. Recently, deep learning techniques have been increasingly applied to transportation carbon emissions prediction. For instance, Jiao et al. [11] predicted transportation carbon emissions using a prediction model based on Convolutional Neural Networks (CNNs). Similarly, Zhao et al. [12] employed the Least Absolute Shrinkage and Selection Operator (Lasso) and Backpropagation Neural Network (BPNN) algorithms to predict the carbon peak pathway in Henan Province. These studies reflect a shift towards more complex predictive models, improving prediction accuracy through the integration of deep learning. However, challenges remain regarding the accurate prediction of carbon emissions with small-scale datasets.
Recent studies have increasingly focused on enhancing and optimizing machine learning models to improve prediction accuracy in transportation carbon emissions. Wei et al. [13] employed Random Forest and Extreme Learning Machine, utilizing a moth–flame optimization model to forecast CO2 emissions in Hebei Province. They demonstrated that optimization techniques could significantly enhance prediction accuracy, though their work raised questions about the scalability of these models to different regions. Subsequently, Wang et al. [14] combined Variational Mode Decomposition (VMD) and Salp Swarm Algorithm (SSA) with the Least Squares Support Vector Machine (LSSVM) model to forecast transportation carbon emissions. However, the computational complexity involved in these hybrid models presents challenges for real-time applications. Huo et al. [15] developed an SVM prediction model, optimizing its parameters using a Genetic Algorithm (GA). While the model showed high accuracy in predicting carbon emissions using data from Jiangsu Province, concerns remain about its generalizability to other regions with varying economic and environmental factors. Liu et al. [16] optimized machine learning models using Grid Search and evaluated them with four indicators. Ultimately, they used the selected XGBoost model to analyze the factors influencing transportation carbon emissions in 30 Chinese provinces from 2005 to 2019.
Various methods and approaches exist for studying transportation carbon emissions. Zhao et al. [17] employed Decision Tree algorithms and Monte Carlo simulations to analyze transportation carbon emissions, contributing a new methodology in this field. Jiang et al. [18] employed panel quantile analysis and quantile regression models using provincial data to assess the impact of public transportation on carbon emissions. Wang et al. [19] employed scenario and cost–benefit analysis algorithms to construct a model evaluating the emission reduction potential of China’s transportation industry. Hou et al. [20] constructed an integrated regression model and incorporated Environmental Kuznets Curve (EKC) and Tapio Decoupling Effect (Tapio) models to analyze the relationship between vehicle ownership, economic growth, and environmental pressure in Chongqing’s transportation industry. Hussain et al. [21] utilized regression analysis to evaluate the impact of economic development, income inequality, transportation, and environmental expenditure on transportation carbon emissions. Tian et al. [22] applied a Logarithmic Mean Divisia Index (LMDI) and regression analysis to assess carbon emission levels and intensities in China’s transportation industry and various modes. These studies highlight regional disparities in carbon emissions and the significant role of economic growth.
Research on regional transportation carbon emissions has grown increasingly popular. At the regional level, Wang et al. [23] applied LMDI and decoupling analysis to discuss the driving factors and decoupling effects of transportation carbon emissions in the western region of China. Subsequently, Wang et al. [24] employed a multiple regression analysis and LMDI to discuss the driving factors for carbon emission reduction in the Silk Road Economic Belt’s transportation sector, reflecting an evolution in analytical approaches towards incorporating more complex methodologies. Liu et al. [25] applied the LMDI decomposition method to analyze the measurement and driving factors of tourism transportation carbon emissions in East China. Building on this, Li et al. [26] employed a Geographic Information System (GIS) and regression analysis to construct a spatiotemporal evolution analysis model, examining the spatiotemporal evolution and influencing factors of tourism transportation carbon emissions in the provinces along the Grand Canal Cultural Belt, representing an innovation in integrating spatial analysis with carbon emission studies. At the provincial level, He et al. [27] applied LMDI to decompose the driving factors of carbon emissions from transportation energy consumption in Jiangsu Province. Bian et al. [28] introduced a quantitative approach by employing multiple regression analysis to examine the factors influencing transportation carbon emissions in Qinghai. Liu et al. [29] applied the LMDI decomposition method to investigate the driving factors of transportation carbon emissions in Inner Mongolia. Finally, Huang et al. [30] employed spatial autocorrelation analysis and regression analysis to construct a spatiotemporal analysis model, discussing the spatiotemporal evolution characteristics and driving factors of urban transportation carbon emissions in Jiangsu Province, representing a key development in linking spatial and emissions data. These studies underscore the significant role of regional economic development in driving emissions. However, concerns exist regarding the transferability of these models to regions with differing socio-economic structures, and it remains challenging to account for the impact of emerging technologies.
Research on transportation carbon emissions in Tibet is relatively limited and primarily relies on traditional models, often lacking comprehensive and in-depth analysis. Liu et al. [31] conducted a case study using LMDI and Driving forces–Pressure–State–Impact–Response (DPSIR) models, combined with transportation data from Tibet, to explore the driving factors and decoupling effects of transportation carbon emissions in Tibet, laying the foundation for further research in this underexplored area. Gong et al. [32] constructed a Partial Least Squares Regression (PLSR) model to analyze the factors influencing transportation carbon emissions in Lhasa. While these studies offer important insights, they reveal gaps in modeling and raise concerns about the generalizability of traditional models to such a unique region. Controversies remain regarding the transferability of models developed for other regions to Tibet, given its distinct geographic and socio-economic factors. Summary of the literature review can be found in Table 1.

2.1. The Focus of This Study

Transportation carbon emissions in the Tibet region differ from other areas, exhibiting unique characteristics. First, as a popular tourist destination, Tibet’s rapidly growing tourism industry has become a key driver of the local economy [3]. However, this growth has also resulted in significant mobile source carbon emissions, particularly from tourists renting cars for self-driving tours, further exacerbating transportation carbon emissions. Additionally, the rapid development of Tibet’s tertiary industry in recent years has significantly impacted transportation carbon emissions, with its output value accounting for 50.1% of the total by 2020. This growing environmental pressure highlights the urgent need to develop more accurate prediction models that account for the unique characteristics of the region, particularly given the rapid expansion of its tourism and service sectors. Addressing this issue is critical not only for achieving carbon reduction goals but also for maintaining Tibet’s ecological balance.
Previous studies on Tibet’s transportation carbon emissions have often relied on traditional prediction models without fully considering the compatibility between models and data. For example, Gong et al. [32] utilized only PLSR to construct a prediction model for Lhasa’s transportation carbon emissions, without fully considering data–model compatibility and employing appropriate methods to analyze input variables. Other studies have not fully considered influencing factors. For example, Liu et al. [31] analyzed only the basic variables required for the Kaya equation—GDP, energy consumption, total population, and equivalent turnover—without comprehensively analyzing the factors influencing transportation carbon emissions in Tibet. These gaps in previous research underscore the need for more tailored models that can accurately reflect the distinct socio-economic and geographical characteristics of the region. Adoption of machine learning models is driven by their ability to handle complex datasets and capture intricate relationships among variables. This approach provides a more effective way to account for the diverse factors influencing transportation carbon emissions in Tibet, such as tourism dynamics and industry growth.
Based on these considerations, this paper selects input variables tailored to the characteristics of Tibet, filters and optimizes various machine learning models to enhance feature extraction, and ultimately conducts local sensitivity and interpretability analyses on twelve core variables. The contributions of these variables to the model output are analyzed to comprehensively assess their impact on Tibet’s transportation carbon emissions. This methodological choice enables a more accurate assessment of the region’s transportation carbon emissions, addressing the limitations of traditional models.

2.2. Contributions

This paper addresses the unique challenges of predicting and analyzing transportation carbon emissions in Tibet by enhancing model accuracy and interpretability. The key contributions are as follows.
First, the transportation carbon emissions estimation model is refined for Tibet, where energy consumption data are missing. By adapting the UNWTO’s calculation model, a more accurate estimation for Tibet is achieved, addressing data gaps and providing a robust method under such conditions.
Second, prediction models are screened and optimized, emphasizing data–model compatibility. Machine learning algorithms—including RBF-SVM, XGBoost, Random Forest, and GBDT—are evaluated. The most suitable model is identified through three-fold cross-validation and performance metrics. Feature extraction limitations are addressed by optimizing the models and applying Partial Least Squares (PLS) to refine input variables, improving accuracy and stability. This method surpasses earlier research that overlooked data compatibility and variable selection, offering a more reliable prediction method and providing insights for policy development and sustainable transportation planning in Tibet.
Finally, local sensitivity and interpretability analyses are conducted on factors influencing transportation carbon emissions in Tibet. This addresses the insufficient analysis of key influencing factors in previous studies. By selecting variables related to the tertiary industry and tourism—along with commonly identified factors—models that better reflect Tibet’s unique characteristics are developed. The central difference method and the Local Interpretable Model-Agnostic Explanation (LIME) algorithm analyze the contribution of variables to the model output, offering a comprehensive assessment of their impact on emissions. This framework provides empirical support for analyzing factors influencing Tibet’s transportation carbon emissions, aiding regional carbon reduction efforts and sustainable development.

3. Transportation Carbon Emissions Measurement

3.1. Measurement Method

According to the 2006 IPCC Guidelines for National Greenhouse Gas Inventories [33], carbon emissions from mobile sources like transportation can be measured using two methods: the “top-down” approach, based on transportation energy consumption, and the “bottom-up” approach, based on the vehicle type, ownership, mileage, and fuel consumption per kilometer [5]. In practice, obtaining energy consumption statistics for Tibet’s transportation sector is challenging. Therefore, this paper adopts the transportation carbon emissions measurement model proposed by the United Nations World Tourism Organization (UNWTO) [4]. The specific model is presented in Equation (1).
C = i C i = i K i × Q i
where, i represents the i th type of passenger transportation mode; C represents the total carbon emissions from passenger transportation (gCO2); C i represents the carbon emissions from the i th type of passenger transportation mode (gCO2); K i represents the carbon emission factor for the i th type of passenger transportation mode (gCO2/passenger-kilometer); and Q i represents the passenger turnover for the i th type of passenger transportation mode (passenger-kilometers).
Building on the aforementioned formula, the carbon emissions measurement model for transportation in Tibet is derived by incorporating the calculation of freight transportation emissions alongside passenger transportation emissions. The resulting model is presented in Equation (2).
C = C p + C f = i = 1 3 C p i + j = 1 4 C f j = i = 1 3 K p i × Q p i + j = 1 4 K f j × Q f j
where, i represents the i th type of passenger transportation mode (with i = 1, 2, 3 corresponding to rail, road, and civil aviation passenger transport, respectively); j represents the j th type of freight transportation mode (with j = 1, 2, 3, 4 corresponding to rail, road, civil aviation, and pipeline freight transport, respectively); C represents the total carbon emissions from transportation in Tibet (gCO2); C p represents the total carbon emissions from passenger transportation in Tibet (gCO2); C f represents the total carbon emissions from freight transportation in Tibet (gCO2); C p i represents the carbon emissions from the i th type of passenger transportation mode in Tibet (gCO2); C f j represents the carbon emissions from the j th mode of freight transportation in Tibet (gCO2); K p i represents the emission factor for the i th mode of passenger transportation (gCO2 per passenger-kilometer); K f j represents the emission factor for the j th mode of freight transportation (gCO2 per passenger-kilometer); Q p i represents the passenger turnover for the i th mode of transportation in Tibet (person-kilometers); and Q f j represents the freight turnover for the j th mode of transportation in Tibet (ton-kilometers).

3.2. Measurement Data

This section estimates Tibet’s transportation carbon emissions across different modes, using turnover data from railway, highway, civil aviation, and pipeline transportation. The turnover data for Tibet’s transportation modes from 2008 to 2020 were sourced from the 2021 Tibet Statistical Yearbook. Due to incomplete statistics on civil aviation turnover in the Yearbook, data from 2008 to 2010 are missing. The detailed data can be found in Table 2.
Based on the research by Tian et al. [10,19,22], the annual carbon emission factors for road, rail, and civil aviation passenger transport are determined. The civil aviation freight transport carbon emission factor is derived from the civil aviation passenger transport factor and the conversion coefficient between passenger and freight turnover, using a conversion ratio of 0.072 for Chinese flights. The standard coal carbon dioxide emission factor, published by the Intergovernmental Panel on Climate Change (IPCC), is 1.9003 tCO2/t. Combined with Tibet’s pipeline transport energy efficiency data [31], the carbon emission factors for pipeline transport can be calculated annually. The emission factors for various transport modes can be found in Table 3.
As Li et al. [10] did not calculate carbon emission factors for every year between 2005 and 2020, some years lack data. To address this, we employ cubic spline interpolation with a 95% confidence level to estimate the carbon emission factors for various transportation modes from 2005 to 2020. Cubic spline interpolation is a widely used numerical method. It fits cubic polynomials within each interval, ensuring that the function values and their first and second derivatives are continuous at each data point. This continuity forms a “spline curve” for interpolation. Additionally, a 95% confidence interval was calculated for the interpolation results. This interval provides a range within which the true values are likely to fall, offering greater reliability and robustness to the estimates. The results can be found in Table 4.
The median of the final selected interval is used as the interpolation result. The complete interpolation results can be found in Table 5.

3.3. Measurement Results and Analysis

Using the measurement method outlined in Section 2.1 and the data presented in Section 2.2, the carbon emissions from transportation in Tibet for the years 2008 to 2020 have been estimated. The results can be found in Table 6.
From 2008 to 2020, except for fluctuations in pipeline transportation, carbon emissions from rail, road, and civil aviation transportation mirrored the overall trend of increasing transportation emissions in Tibet. In passenger transportation, civil aviation generated the highest CO2 emissions and also exhibited the fastest growth, increasing from 15.54 thousand tons in 2011 to 884.45 thousand tons in 2020, a staggering 56.91-fold increase. In freight transportation, road transport was the dominant contributor to carbon emissions, growing from 7401.43 thousand tons in 2006 to 12,892.83 thousand tons in 2020, a 1.74-fold increase, accounting for an average of 93.35% of freight emissions. Road freight was responsible for an average of 62.29% of total transportation emissions, driving the significant increase in Tibet’s transportation carbon emissions over the past decade. Although total emissions briefly declined in 2011 and 2012 due to the rise of civil aviation, which alleviated some pressure on road transport, the limited capacity of civil aviation led to a continued rise in emissions after 2012. This trend suggests that Tibet’s logistics industry heavily relies on road transport, while the tourism industry depends on civil aviation, both of which are major contributors to the region’s carbon emissions. Therefore, it is crucial to enhance carbon reduction accountability in Tibet’s logistics industry and develop more effective and targeted carbon reduction strategies.

4. Prediction Model Construction and Improvement

4.1. Selection of Input Variables

Tibet’s transportation carbon emissions are influenced by the development of various sectors. Following the principle of data availability, this paper identifies input variables from both external and internal factors.
The transportation industry itself is the direct factor contributing to transportation carbon emissions. Its development not only enhances efficiency but also drives economic activities, leading to increased emissions. The rise in vehicle numbers further exacerbates transportation carbon emissions [20]. Public transportation systems can mitigate carbon emissions by enhancing vehicle efficiency, reducing fuel carbon content, and decreasing vehicle mileage. At a sufficient scale, these systems significantly reduce emissions, although smaller scales may initially increase emissions. However, with further expansion and optimization, public transportation can substantially reduce carbon emissions [18]. This section, therefore, identifies internal factors affecting transportation carbon emissions from the perspectives of transportation and public transportation development in Tibet. Based on a literature review [7,14,23,24,29] and Tibet’s specific conditions, the selected variables representing transportation development are the number of civilian vehicles, transportation land-use area, and transportation output value. Public transportation development is represented by the total public transport passenger volume and the number of buses and electric vehicles per 10,000 people.
Socio-economic development is closely tied to the transportation industry and indirectly influences transportation carbon emissions. As socio-economic development progresses, energy consumption—particularly from fossil-fuel-dependent transportation modes—increases, leading to higher carbon emissions [21]. Based on a literature review [7,25,26,30,31] and considering Tibet’s characteristics, this paper identifies three external factors influencing Tibet’s transportation carbon emissions: socio-economic development, tourism development, and tertiary industry development. Tourism revenue and the number of tourists represent tourism development; total retail sales of consumer goods and the added value of the tertiary industry represent tertiary industry development; urban green coverage areas, per capita GDP, and built-up area represent socio-economic development. To reduce model endogeneity, passenger turnover and freight turnover are introduced as control variables. Details of the selected input variables can be found in Table 7.
Annual data for the number of civilian vehicles, tourism revenue, number of tourists, total retail sales of consumer goods, the added value of the tertiary industry, urban green coverage areas, population, per capita GDP, built-up area, passenger turnover, and freight turnover are sourced from the 2021 Tibet Statistical Yearbook. Data on transportation land-use area are sourced from the 2009–2021 China Land and Resources Statistical Yearbook. Annual data on the added value of transportation are sourced from the 2009–2021 China Tertiary Industry Statistical Yearbook. Annual data on the built-up area of urban regions in Tibet, total public transport passenger volume, and the number of buses and electric vehicles per 10,000 people are sourced from the 2009–2021 China Statistical Yearbook. Annual data on urban green coverage areas are sourced from the 2009–2021 China Environmental Statistical Yearbook. The annual values for each variable can be found in Table 8.

4.2. Data Preprocessing

Before model training, the raw data must be preprocessed. As the dataset contains no missing values, missing value processing is unnecessary. Common data preprocessing methods include Min–Max normalization, Z-score standardization, and Sigmoid normalization. This paper employs the Z-score standardization. Z-score standardization, also known as zero-mean standardization, is primarily used to convert data with different units to the same scale, ensuring comparability in statistical analysis and machine learning algorithms. The function expression is presented in Equation (3).
z i = x i μ i σ i
where, z i represents the data of the i th variable after standardization; x i represents the original data of the i th variable; μ i represents the mean value of the i th variable; and σ i represents the standard deviation of the i th variable.

4.3. Model Construction

Based on a literature review [6,7,8,9,10,11,12,13,14,15,16], commonly used machine learning algorithms for predicting transportation carbon emissions include Random Forest, Gradient Boosting Decision Tree (GBDT), Decision Tree, eXtreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), Linear Regression, and Backpropagation Neural Network (BPNN). Different datasets are suited to different algorithms. Given numerous input variables in this paper, and the possibility that not all have a linear relationship with Tibet’s transportation carbon emissions, Linear Regression is deemed unsuitable for this dataset. Additionally, due to the small dataset size, using neural network algorithms could result in overfitting. Therefore, this paper conducts a preliminary selection among the remaining five algorithms. Using Python 3.11.9, a three-fold cross-validation is performed, with the R2 result used as the evaluation criterion to select the algorithm that best suits this dataset. The suitability of the five machine learning algorithms for this dataset is presented in Figure 1.
The length of the boxplot indicates the model’s stability; a shorter length suggests a more concentrated distribution of cross-validation results, reflecting better stability. The median line (green line) in the boxplot represents the mean of the R2 distribution; a higher median indicates a greater mean R2, suggesting better model accuracy. The Decision Tree model has the poorest stability and accuracy, while the Linear-SVM and Polynomial-SVM models exhibit relatively poor stability. Overall, the Radial Basis Function Support Vector Machine (RBF-SVM), XGBoost, Random Forest, and GBDT algorithms are better suited for this dataset, with RBF-SVM and Random Forest performing the best. Therefore, this paper constructs transportation carbon emissions prediction models for Tibet using the RBF-SVM, XGBoost, Random Forest, and GBDT algorithms, naming them Model_rs, Model_xg, Model_rf, and Model_gb, respectively. To prevent overfitting and reduce random errors, ten repetitions of three-fold cross-validation are conducted. Grid Search (GS) was employed to find the optimal hyperparameters, ensuring that the constructed prediction models do not overfit and maintain good generalization ability.
RBF-SVM, a supervised learning algorithm grounded in statistical learning theory, is widely used in regression prediction tasks. The core concept of SVM is to map all variables into a high-dimensional feature space, identify a hyperplane that effectively separates sample points, and maximize the distance between them, thereby establishing a robust predictive model [34]. RBF-SVM combines the Radial Basis Function (RBF) Kernel with SVM.
XGBoost is an efficient and flexible GBDT algorithm that falls under the Boosting ensemble learning category, implemented based on the Gradient Boosting framework. Its core principle is to iteratively add new decision trees to correct errors from previous trees, with each tree learning the residuals of the current model, thus continuously improving predictive accuracy [35].
Random Forest is an ensemble learning algorithm that constructs numerous decision trees, each generating a prediction during training. The final output is determined by majority vote or the average of these results [36].
GBDT is another ensemble learning algorithm. Its core concept involves iteratively training multiple decision trees to model complex nonlinear relationships, with the outcome being the sum of weak learners [8].
The libraries used for model construction, the Grid Search settings, and their optimized parameters are listed in Table 9.

4.4. Model Evaluation

4.4.1. Model Comparison

After the models were established, their fitting performance on the training set and predictive performance on the test set were compared. The model comparisons are presented in Figure 2 and Figure 3.
The figures above show that on the test set, Model_rs and Model_gb exhibit relatively similar and good fitting performance, followed by Model_rf, while Model_xg performs poorly. On the training set, the prediction results of Model_rs, Model_rf, and Model_gb are significantly better than those of Model_xg. Overall, Model_rs and Model_gb achieved superior prediction results.

4.4.2. Evaluation Metrics

This paper utilizes four of the most commonly used metrics from the literature [15,16] to evaluate the performance of Model_rs, Model_xg, Model_rf, and Model_gb. These metrics include Mean Squared Error (MSE), Mean Absolute Error (MAE), Coefficient of Determination (R2), and Mean Absolute Percentage Error (MAPE). The MSE value ranges from 0 to infinity, where values closer to 0 indicate more stable predictions. The MAE value also ranges from 0 to infinity, where values closer to 0 indicate better model prediction performance. The R2 value ranges from 0 to 1. On the training set, values closer to 1 indicate a better goodness of fit; on the test set, values closer to 1 indicate superior prediction performance. MAPEs value closer to 0 indicate better prediction performance. Typically, MAPE ≤ 10% indicates “high accuracy”, 10% < MAPE ≤ 20% indicates “good accuracy”, 20% < MAPE ≤ 50% indicates “reasonable accuracy”, and MAPE > 50% indicates “inaccuracy” [7]. The formulas for these metrics are presented in Equations (4)–(7).
M S E = 1 n i = 1 n y i y i 2
M A E = 1 n i = 1 n y i y i
R 2 = 1 i = 1 n y i y i 2 i = 1 n y ¯ i y i 2
M A P E = 100 % n i = 1 n y ^ i y i y i
where, n represents the number of samples, i.e., the total number of samples in the dataset; y i represents the actual value of the i th sample, which is the true value in the dataset; y i represents the predicted value of the i th sample, obtained from the model prediction; and y ¯ i represents the mean value of the i th sample.

4.4.3. Model Evaluation Results

After establishing the models, the MSE, MAE, R2, and MAPE were calculated for each model on the test set. The average values of these metrics are used as evaluation indicators for the predictive models. The evaluation results for each model can be found in Table 10.
Based on the results, it can be seen that Model_rs has the smallest MSE, indicating the best prediction stability. The MSE values of Model_rf and Model_gb are quite close, both yielding good results, and all three models significantly outperform Model_xg. Model_gb achieved the lowest MAE value, indicating the best prediction performance. The MAE values of Model_gb, Model_rs, and Model_rf are similar, while Model_xg has the highest MAE value, indicating a relatively poor prediction performance compared to the other three models. The R2 values of Model_rs, Model_rf, and Model_gb all exceed 0.91, indicating that these models demonstrate superior predictive performance for Tibet’s transportation carbon emissions, with Model_rs demonstrating the best results. In contrast, Model_xg’s R2 value of 0.8020 indicates it is less suitable for predicting Tibet’s transportation carbon emissions compared to the other three models. All four models have MAPE values below 10%, indicating high accuracy. Although Model_xg has a higher MAPE value, it remains below 10%. Considering all four metrics, Model_rs demonstrates superior prediction performance.
In summary, based on the presented analysis, the RBF-SVM algorithm is the most suitable for constructing a prediction model for Tibet’s transportation carbon emissions for the dataset in this paper. This finding aligns with the research results of Song et al. [37]. Therefore, the RBF-SVM algorithm is selected for subsequent research and analysis in this paper. The Random Forest and GBDT algorithms also performed well on this dataset and are suitable for constructing a prediction model for Tibet’s transportation carbon emissions, whereas the performance of the XGBoost algorithm requires improvement.

4.5. Model Improvement

4.5.1. Initial Selection of Improved Model

The dataset in this paper includes numerous input variables, potentially causing multicollinearity. Removing one or more variables could destabilize the explanatory model, leading to inaccurate results. Therefore, Partial Least Squares (PLS) is applied to decompose and reconstruct the input variables. PLS decomposes all 14 variables, extracts key components, and forms new independent principal components, thus mitigating the adverse effects of multicollinearity during modeling.
Accurate prediction with the RBF-SVM algorithm depends on adjusting two parameters: the regularization parameter and the kernel function coefficient. Common optimization algorithms for identifying optimal prediction model parameters include GS [16], Genetic Algorithm (GA), and Ant Colony Optimization (ACO) [15]. Therefore, this paper uses the deviation between predicted and actual values on the test set as the objective function, optimizing the regularization parameter and kernel function coefficient in the RBF-SVM algorithm. Using Python, the parameters of the model are optimized based on GS, Real-coded Genetic Algorithm (RGA), and ACO. To avoid errors due to randomness, ten repetitions of three-fold cross-validation are conducted, with the average R2 value on the test set used as the evaluation metric. If the R2 value does not increase significantly with the number of components, the corresponding number of components is considered optimal. The R2 values of the optimized models across different principal components are presented in Figure 4.
As shown in Figure 4, the R2 value of the ACO-optimized model is unstable, indicating a lack of model stability. The GS-optimized and RGA-optimized models both show a clear upward trend, with R2 values stabilizing after the fifth principal component, indicating that the original model is effectively explained at this point. Therefore, this paper selects five principal components (denoted as U 1 , U 2 , U 3 , U 4 , U 5 ) for subsequent modeling. The formulas for the principal components are presented in Equations (8)–(12).
U 1 = 0.0821 x 1 + 0.663 x 2 + 0.0804 x 3 + 0.0563 x 4 + 0.0247 x 5 + 0.0778 x 6 + 0.0811 x 7 + 0.0803 x 8 + 0.0637 x 9 + 0.0810 x 10 + 0.0747 x 11 + 0.0809 x 12 + 0.0771 c 1 + 0.0793 c 2
U 2 = 0.1282 x 1 + 0.1772 x 2 + 0.1097 x 3 0.0281 x 4 + 0.1086 x 5 + 0.0802 x 6 + 0.1012 x 7 + 0.0859 x 8 0.0266 x 9 + 0.0996 x 10 + 0.0366 x 11 + 0.0934 x 12 + 0.0960 c 1 + 0.1007 c 2
U 3 = 0.1281 x 1 + 0.2136 x 2 + 0.0503 x 3 + 0.0918 x 4 + 0.0776 x 5 + 0.0863 x 6 + 0.0673 x 7 + 0.0373 x 8 0.3285 x 9 + 0.0753 x 10 0.1262 x 11 + 0.0868 x 12 + 0.2667 c 1 + 0.4100 c 2
U 4 = 0.1341 x 1 + 0.2944 x 2 0.0053 x 3 + 0.0769 x 4 + 0.0131 x 5 + 0.0515 x 6 + 0.0448 x 7 + 0.0083 x 8 0.3690 x 9 + 0.0771 x 10 0.1664 x 11 + 0.0897 x 12 + 0.2653 c 1 + 0.5457 c 2
U 5 = 0.1098 x 1 + 0.3189 x 2 0.1449 x 3 + 0.0470 x 4 + 0.0797 x 5 + 0.0289 x 6 + 0.0106 x 7 0.0482 x 8 0.3046 x 9 + 0.0653 x 10 0.1922 x 11 + 0.0929 x 12 + 0.2814 c 1 + 0.8244 c 2
The coefficients of these five principal components are distinct and uncorrelated with each other. Thus, we transformed the fourteen correlated variables into five independent variables, reducing the dimensionality of the data. These five new variables were then used for model selection in Section 4.5.2.

4.5.2. Final Selection of the Improved Model

As discussed in Section 4.5.1, both GS and RGA showed good results in the initial optimization of model parameters. This paper further utilizes these two algorithms to optimize the RBF-SVM model, establishing Model_gs_rs and Model_rga_rs.
Grid Search systematically traverses predefined combinations of hyperparameters to identify the best model parameters [38]. The basic process is as follows:
  • Step 1. Parameter Grid: Set the parameters to be searched and their value ranges.
  • Step 2. Model Training and Validation: Train the model and perform cross-validation for each parameter combination in the grid.
  • Step 3. Select Best Parameters: Select the parameter combination with the best performance based on cross-validation metrics.
Real-coded Genetic Algorithm is an optimization algorithm that simulates natural selection and genetic mechanisms. Unlike traditional binary-coded genetic algorithms, RGA employs real number encoding, making it more effective for continuous variable optimization [39]. The basic process is as follows:
  • Step 1. Initialization: Randomly generate a set of real-number candidate solutions.
  • Step 2. Fitness Evaluation: Evaluate the quality of each candidate solution using a predefined fitness function.
  • Step 3. Selection: Apply a fitness-based selection strategy to choose superior candidate solutions as parents for the next generation.
  • Step 4. Crossover: Generate new candidate solutions by exchanging parts of the variables between parent solutions.
  • Step 5. Mutation: Randomly modify the genes of some candidate solutions to enhance diversity.
  • Step 6. Replacement: Replace the old generation with the new generation and iterate until the termination condition is met.
The data from 2008 to 2020, after extracting five principal components, were used for model construction with three-fold cross-validation. The parameters selected for the optimization algorithms of the two models and the optimized RBF-SVM parameters can be found in Table 11.
The four metrics from Section 4.4.2.—MSE, MAE, R2, and MAPE—are employed to evaluate Model_gs_rs and Model_rga_rs. The prediction performance of each model is presented in Figure 5 and Figure 6, and the accuracy comparison of the models can be found in Table 12.
The figures above show that on the test set, Model_rga_rs has a better fitting effect in the first half, while in the second half, Model_gs_rs achieves better results, with Model_rga_rs showing larger deviations. Overall, Model_gs_rs demonstrates a better fitting effect on the test set. On the training set, the prediction results of Model_gs_rs are significantly better than those of Model_rga_rs. In summary, the GS optimization of the RBF-SVM algorithm yields better results compared to RGA.
The table shows that Model_gs_rs has a smaller MSE value than Model_rga_rs, indicating better prediction stability. The MAE value of Model_gs_rs is also lower, reflecting superior prediction performance. The R2 value of Model_gs_rs exceeds 0.99, achieving an ideal prediction result for Tibet’s transportation carbon emissions, while Model_rga_rs also performs well with an R2 value above 0.98. Both models have MAPE values below 10%, indicating high prediction accuracy. Overall, Model_gs_rs demonstrates the best prediction performance. Therefore, after applying PLS, the RBF-SVM algorithm optimized by GS is identified as the most suitable for constructing a prediction model for Tibet’s transportation carbon emissions with this dataset, making Model_gs_rs the preferred choice for further research and analysis.

5. Analyzing the Contribution of Variables to Model Output

5.1. Local Sensitivity Analysis

Local sensitivity analysis evaluates the impact of individual variables on model output by applying small perturbations to the input variables and observing the resulting output changes [40]. The finite difference method approximates the derivative using a finite difference expression (Equation (14)) based on the derivative definition (Equation (13)). Perturbing input variables slightly and observing output changes, the model’s sensitivity to each variable can be estimated [41]. There are three types of finite difference methods: forward difference, backward difference, and central difference. Central difference, which accounts for changes around the point x, is typically more accurate than forward or backward difference. Thus, the central difference method is used in this section, with its formula presented in Equation (15).
f x lim Δ x 0 f x + Δ x f x Δ x
In practical applications, a very small number h is used to approximate Δ x , leading to a finite difference approximation:
f x f x + h f x h
S i = f x 1 , x 2 , x 3 , x i + h , x n 2 h
To evaluate the impact of the 12 core variables on the model output, a local sensitivity analysis is performed on Model_gs_rs using the central difference method. Each core variable is perturbed with a step size of 0.00001, and the resulting changes are input into the model to observe their effect on the final output. The impact of each variable on the model output is presented in Figure 7.
In the figure, each bar corresponds to a variable, where darker colors signify a greater influence on the model output. The variables are ranked in descending order of their impact on the model output as follows: x 2 , x 9 , x 11   x 3 , x 1 , x 12 , x 10 , x 4 , x 5 , x 8 , x 6 , and x 7 .

5.2. Interpretability Analysis

There are many classifications of interpretability methods, including those based on applicable models, based on timing, and based on individual and global perspectives. This paper focuses on using post-hoc interpretability methods to explain the model. LIME (Local Interpretable Model-Agnostic Explanation) is a model-agnostic local surrogate model used to explain the predictions of black-box models [42]. The concept behind LIME is straightforward: train a local surrogate model to explain each prediction. The process works as follows:
  • Step 1. Generate multiple perturbed samples around the data point to be explained.
  • Step 2. Use the original model to predict these perturbed samples.
  • Step 3. Train an interpretable model using these perturbed samples and their prediction results.
  • Step 4. The coefficients of the surrogate model indicate the importance of each feature to the original model’s prediction.
To obtain interpretable model parameters, an objective function for optimization must be constructed for the algorithm. The objective function of the LIME algorithm is presented in Equation (16).
ξ x = arg min g G L f , g , x + Ω x
where f represents the complex model that needs to be explained; G represents the potential interpretable model, such as a linear model, Decision Tree, etc.; g represents the interpretable simple model; x represents the example data; x represents the proximity of the example data to the perturbed data; L f , g , x represents the degree of local unfaithfulness of the model; and Ω x represents the complexity of the model g .
In this section, Model_gs_rs is the complex model being explained, and a linear regression model is selected as the interpretable simple model. Twelve core variables were analyzed using the LIME algorithm to assess their impact on Model_gs_rs. The analysis was conducted using Python programming, and all other parameters and configurations were left at their default settings in Python. The results of the interpretability analysis based on the LIME algorithm are presented in Figure 8.
In the figure, each bar represents a variable, with blue indicating a positive correlation (+) with carbon emissions and green indicating a negative correlation (−). Darker colors denote stronger correlations. The variables are ranked by importance from highest to lowest as follows: x 10 (+), x 12 (+), x 2 (−), x 1 (−), x 9 (−), x 3 (+), x 11 (−), x 5 (−), x 4 (−), x 8 (+), x 6 (−), and x 7 (−).
Based on the feature importance analysis and the sensitivity analysis, variables x 4 through x 8 have a smaller contribution to the model output. This indicates that total public transport passenger volume, the number of buses and electric vehicles per 10,000 people, tourism revenue, and the number of tourists have less impact on Tibet’s transportation carbon emissions. Conversely, variables x 1 through x 3 and x 9 through x 12 make a more significant contribution to the model output, indicating that number of civilian vehicles, transportation land-use area, transportation output value, urban green coverage areas, per capita GDP, and built-up area have a greater impact on carbon emissions. Additionally, transportation output value, total retail sales of consumer goods, urban green coverage areas, and built-up area positively impact carbon emissions, whereas the remaining variables have a negative impact. From a dimensional perspective, the level of transportation and socio-economic development have a greater influence on carbon emissions.

6. Discussion on Variables Affecting Transportation Carbon Emissions

6.1. Transportation Development Level

The transportation development significantly impacts carbon emissions in Tibet, making it a crucial dimension directly influencing these emissions. The corresponding factors, including transportation land-use area, transportation output value, and the number of civilian vehicles, all strongly influence carbon emissions in Tibet’s transportation sector. The transportation land-use area has a significant impact on reducing carbon emissions. A larger transportation land area can provide more roads and transportation facilities, optimize the transportation network, reduce traffic congestion [43], and consequently lower carbon emissions. The number of civilian vehicles also has a significant impact on reducing carbon emissions. There is an inverted “U” relationship between the number of civilian vehicles and carbon emissions. When the number of civilian vehicles reaches a high level, low-carbon vehicles, such as hybrid and electric vehicles gradually dominate the market. As low- or zero-emission vehicles become more prevalent, overall carbon emissions decrease, despite an increase in vehicle numbers, because these vehicles emit significantly less carbon than traditional internal combustion engine vehicles [20]. In recent years, the rate of electric vehicle purchases in Tibet has grown significantly, outpacing that of gasoline vehicles. Low-carbon vehicles now dominate the market [44], contributing to the descending segment of the inverted “U” curve in Tibet’s transportation carbon emissions. Furthermore, the rapid increase in civilian vehicles has prompted the government and relevant departments to improve transportation infrastructure, including more efficient road systems and optimized traffic signals. These measures have reduced congestion, improved vehicle efficiency [45], and thus lowered fuel consumption and carbon emissions. The transportation output value has a certain impact on promoting carbon emissions in Tibet’s transportation sector. The transportation industry’s output value directly reflects the development level of the sector. An increase in output value is often accompanied by higher energy consumption, as frequent and intensive transportation activities lead to more fuel use [46], thereby increasing carbon emissions.

6.2. Public Transportation Development Level

Although public transportation development in Tibet contributes to emission reductions, its impact is relatively small. The primary reason is Tibet’s slow economic development, leading to limited investment in public transportation, which remains insufficient to meet people’s travel needs [47]. Additionally, Tibet’s complex geography and scattered population make it difficult for public transit systems to cover extensive areas [48]. Consequently, public transportation in Tibet has not fulfilled its potential to reduce emissions.

6.3. Tourism Development Level

The development of tourism in Tibet has a relatively small impact on reducing transportation carbon emissions. Tibet’s tourism industry is highly seasonal, with peak periods in summer and autumn, and fewer visitors during other seasons. This seasonality causes uneven transportation carbon emissions throughout the year [49], with minimal impact on the annual total emissions. Additionally, the development of Tibet’s tourism has optimized transportation resources and improved infrastructure, though these improvements are often limited to popular tourist areas [3]. Moreover, the number of tourists indirectly reflects those driving their own or rented vehicles [50]; thus, the carbon emissions from these activities contribute relatively little to the overall carbon emissions in Tibet. Consequently, tourism development has a relatively small impact on reducing transportation carbon emissions in Tibet.

6.4. Tertiary Industry Development Level

The development of the tertiary industry significantly impacts Tibet’s transportation carbon emissions, with its output value making a particularly notable contribution to emission reductions. The output value of the tertiary industry directly reflects the development level of Tibet’s tertiary sector. As information technology spreads in Tibet, many tertiary industry activities can be conducted remotely or online, reducing travel needs and thus lowering transportation carbon emissions [51]. The development of the retail industry generates substantial logistics and supply chain activity, leading to increased transportation carbon emissions.

6.5. Socio-Economic Development Level

Socio-economic development also has a significant impact on Tibet’s transportation carbon emissions, serving as a major dimension indirectly influencing these emissions. The urban green coverage and built-up areas within this dimension substantially reduce Tibet’s transportation carbon emissions. Both factors measure the level of urban development in Tibet and indirectly reflect social development. Larger green coverage and built-up areas in Tibet indicate higher urban development, which in turn improves people’s living standards and makes travel more convenient [52]. Additionally, increased green spaces enhance urban attractiveness, prompting more people to drive to these areas for recreation [53]. These factors contribute to higher transportation energy consumption and, consequently, increased carbon emissions. Per capita GDP has a certain impact on reducing carbon emissions, as it directly reflects the level of economic development. As Tibet’s economy grows, the proportion of the tertiary industry increases. Compared to industry and agriculture, the service sector has lower carbon intensity and relies more on information technology and remote work, reducing the need for travel and associated emissions. Furthermore, Tibet’s economic development has heightened public environmental awareness, increasing emphasis on conservation and encouraging more people to choose low-carbon travel methods [54].

7. Conclusions

This paper begins by refining the UNWTO’s transportation carbon emissions model, applying it to estimate Tibet’s transportation carbon emissions from 2008 to 2020 using transportation turnover data. Next, based on the literature review and considering the characteristics of Tibet, the dimensions and factors influencing Tibet’s carbon emissions were identified to create input variables and construct a prediction model for Tibet’s transportation carbon emissions. The model was then refined to identify the most suitable prediction algorithm. Following feature extraction using the PLS method, various parameter optimization techniques were applied to develop new models, with the most accurate and stable model being selected. Finally, sensitivity and interpretability analyses were conducted using the central difference method and LIME algorithm to assess the influence of each variable on the model output, allowing for a comprehensive analysis of their impact on Tibet’s transportation carbon emissions. The key findings of this paper are as follows:
(1)
The total carbon emissions from Tibet’s transportation sector show a significant upward trend. Notably, emissions from civil aviation passenger transport see the most dramatic rise, surging from 15.54 thousand tons in 2011 to 884.45 thousand tons in 2020, a 56.91-fold increase. Road freight remains the largest contributor, responsible for 62.29% of the transportation carbon emissions in Tibet.
(2)
The model construction reveals that the RBF-SVM algorithm is the most suitable for predicting Tibet’s transportation carbon emissions. After optimizing RBF-SVM using GS and RGA, the enhanced model (Model_gs_rs) achieves superior performance, with an R2 value exceeding 0.99, significantly enhancing the prediction accuracy.
(3)
Regarding the impact on Tibet’s transportation carbon emissions, factors like total public transport passenger volume, the number of buses and electric vehicles per 10,000 people, tourism revenue, and the number of tourists have minimal influence. In contrast, the number of civilian vehicles, transportation land-use area, transportation output value, urban green coverage areas, per capita GDP, and built-up area exert a more significant impact. Dimensionally, the levels of transportation development and socio-economic development are the most critical determinants of carbon emissions.
The data used in this paper span a relatively short period, limiting the ability to fully capture long-term trends and shifts in Tibet’s transportation carbon emissions. To address this limitation, an extended data collection period should be employed in future research, allowing for a more comprehensive analysis of long-term changes. Additionally, although Chinese transportation carbon emission factors were used to estimate emissions in Tibet, factors such as altitude and environmental conditions could introduce potential estimation errors. Moreover, interpolation issues may arise due to large gaps in carbon emission factor data. This is particularly true when using cubic spline interpolation for certain years, which could result in inaccuracies when trends are complex or nonlinear. To minimize these interpolation uncertainties, more continuous and detailed carbon emission factor data should be collected in future studies. Finally, although traditional machine learning algorithms, which involve complex parameter optimization and high computational demand, were primarily used in this paper, future research could benefit from exploring more advanced and efficient prediction models, such as deep learning algorithms. These models offer better prediction accuracy and computational efficiency, thus enhancing both model performance and resource optimization.

Author Contributions

Conceptualization, W.B., G.C. and K.Z.; methodology, W.B., G.C., K.Z. and Y.W.; software, K.Z. and J.Z.; validation, G.C., J.Z., C.Y. and M.C.; formal analysis, K.Z., G.C. and W.D.; investigation, M.C. and W.D.; resources, W.B., G.C., C.Y. and M.C.; data curation, W.B., K.Z., Y.W. and M.C.; writing—original draft preparation, W.B., K.Z., G.C., Y.W., J.Z. and W.D.; writing—review and editing, W.B., K.Z., G.C., Y.W., C.Y. and M.C.; visualization, J.Z. and W.D.; supervision, W.B., G.C. and C.Y.; project administration, W.B., G.C. and C.Y.; funding acquisition, W.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the 2023 Central Government Support Fund for Reform and Development of Local Universities (Second Batch)—Research on Major Engineering Problems and Energy Planning & Construction of Sichuan-Tibet Railway Corridor under the Background of New Engineering, grant number 00061146, the Key Research and Development Program of Tibet Autonomous Region—Research on Key Technologies for Traffic Organization in Tibet, grant number XZ202303ZY0005G, and the Regional College Students Innovation and Entrepreneurship Training Program, Tibet University, grant number S202410694012.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Acknowledgments

The authors cannot thank enough the Plateau Major Infrastructure Smart Construction and Resilience Safety Technology Innovation Center for the computational support provided.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Wang, S.; Ge, M. Everything You Need to Know about the Fastest-Growing Source of Global Emissions: Transport. World Resources Institute 16 October 2019. Available online: https://www.wri.org/insights/everything-you-need-know-about-fastest-growing-source-global-emissions-transport#:~:text=1.%20How%20big%20a%20problem%20are%20emissions%20from (accessed on 18 October 2023).
  2. Jelti, F.; Allouhi, A.; Tabet Aoul, K.A. Transition Paths towards a Sustainable Transportation System: A Literature Review. Sustainability 2023, 15, 15457. [Google Scholar] [CrossRef]
  3. Wang, J.; Yi, L.; Chen, L.; Hou, Y.; Zhang, Q.; Yang, X. Coupling and Coordination between Tourism, the Environment and Carbon Emissions in the Tibetan Plateau. Sustainability 2024, 16, 3657. [Google Scholar] [CrossRef]
  4. World Tourism Organization; United Nations Environment Programme. Climate Change and Tourism: Responding to Global Challenges; World Tourism Organization: Madrid, Spain, 2008; pp. 139–143. [Google Scholar]
  5. Cai, B.; Yang, W.; Cao, D.; Liu, L.; Zhou, Y.; Zhang, Z. Estimates of China’s National and Regional Transport Sector CO2, Emissions in 2007. Energy Policy 2012, 41, 474–483. [Google Scholar] [CrossRef]
  6. Zhu, C.; Wang, M.; Du, W. Prediction on peak values of carbon dioxide emissions from the Chinese transportation industry based on the SVR model and scenario analysis. J. Adv. Transp. 2020, 2020, 8848149. [Google Scholar] [CrossRef]
  7. Ağbulut, Ü. Forecasting of transportation-related energy demand and CO2 emissions in Turkey with different machine learning algorithms. Sustain. Prod. Consum. 2022, 1, 141–157. [Google Scholar] [CrossRef]
  8. Ma, S.; Chen, X.; Yang, L.; Zhao, Y.; Zeng, Y. Impacts of Accessibility on Sspatial Heterogeneity of Multimodal Transportation Carbon Emissions in an Urban Agglomeration. J. Transp. Syst. Eng. Inf. Technol. 2024, 24, 64–74. [Google Scholar]
  9. Gao, J.; Huang, W.; Jiang, H. Comparison of Multiple Forecast Models of Urban Traffic Carbon Emissions. J. Chongqing Jiaotong Univ. (Nat. Sci.) 2020, 39, 33–39. [Google Scholar]
  10. Li, N.; Chen, S.; Liang, X.; Tian, P. Prediction of Transportation Industry Carbon Peak in China. J. Transp. Syst. Eng. Inf. Technol. 2024, 24, 2–13+54. [Google Scholar]
  11. Jiao, L.; Liu, Y.; Wu, Y.; Huo, X. Transportation Carbon Emission Prediction Based on Convolutional Neural Network. Railw. Transp. Econ. 2024, 1–9. [Google Scholar]
  12. Zhao, J.; Li, J.; Wang, P.; Hou, G. A Study on Carbon Peaking Paths in Henan, China Based on Lasso Regression-BP Neural Network Model. Environ. Eng. 2022, 40, 151–156+164. [Google Scholar]
  13. Wei, S.; Wang, Y.; Zhang, C. Forecasting CO2 emissions in Hebei, China, through moth-flame optimization based on the random forest and extreme learning machine. Environ. Sci. Pollut. Res. 2018, 25, 28985–28997. [Google Scholar] [CrossRef] [PubMed]
  14. Wang, Q.; Wang, J.; Zhu, C.; Hao, F. Carbon Emission Prediction of Transportation Industry Based on VMD and SSA-LSSVM. Environ. Eng. 2023, 41, 124–132. [Google Scholar]
  15. Huo, Z.; Zha, X.; Lu, M.; Ma, T.; Lu, Z. Prediction of carbon emission of the transportation sector in Jiangsu province-regression prediction model based on GA-SVM. Sustainability 2023, 15, 3631. [Google Scholar] [CrossRef]
  16. Liu, H.; Hu, D. Construction and Analysis of Machine Learning Based Transportation Carbon Emission Prediction Model. Environ. Eng. 2024, 45, 3421–3432. [Google Scholar]
  17. Zhao, G.; Gao, Y.; Wang, Z. A Transportation Carbon Emission Analysis Method Based on Decision Tree and Monte Carlo Simulation. Highway 2015, 60, 191–195. (In Chinese) [Google Scholar]
  18. Jiang, Y.; Zhou, Z.; Liu, C. The impact of public transportation on carbon emissions: A panel quantile analysis based on Chinese provincial data. Environ. Sci. Pollut. Res. 2019, 26, 4000–4012. [Google Scholar] [CrossRef] [PubMed]
  19. Wang, J.; Yan, Y.; Huang, Q.; Song, Y. Analysis of Carbon Emission Reduction Potential of China’s Transportation. Sci. Technol. Manag. Res. 2021, 41, 200–210. [Google Scholar]
  20. Hou, L.; Wang, Y.; Zheng, Y.; Zhang, A. The Impact of Vehicle Ownership on Carbon Emissions in the Transportation Sector. Sustainability 2022, 14, 12657. [Google Scholar] [CrossRef]
  21. Hussain, Z.; Khan, M.K.; Shaheen, W.A. Effect of economic development, income inequality, transportation, and environmental expenditures on transport emissions: Evidence from OECD countries. Environ. Sci. Pollut. Res. 2022, 29, 56642–56657. [Google Scholar] [CrossRef]
  22. Tian, P.; Mao, B.; Tong, R.; Zhang, H.; Zhou, Q. Analysis of carbon emission level and intensity of China’s transportation industry and different transportation modes. Clim. Change Res. 2023, 19, 347–356. [Google Scholar]
  23. Wang, Z.; Li, J.; Peng, B.; Xiang, W. Driving Factors and Decoupling Effect Analysis of Transportation Carbon Emissions in Western China. Environ. Eng. 2023, 41, 213–222. [Google Scholar]
  24. Wang, C.; Wu, L. Factors Driving the Carbon Emission Reduction in Transport Along the Silk Road Economic Belt: An Analysis from the Perspective of “Double Carbon”. J. Arid Land Resour. Environ. 2024, 38, 9–19. [Google Scholar]
  25. Liu, J.; Liu, Y.; Huang, L. Measurement and Driving Factor Analysis of Carbon Emissions from Tourism Transportation in East China. Res. Environ. Sci. 2024, 37, 626–636. [Google Scholar]
  26. Li, J.; Li, M.; Yuan, M.; Li, M. Study on the Spatial and Temporal Evolution and Influencing Factors of Tourism Traffic Carbon Emissions in Provinces along the Grand Canal Cultural Belt. Ecol. Econ. 2024, 40, 145–152+173. [Google Scholar]
  27. He, M.; Xie, J.; Wu, X.; Luo, J. Factors Decomposition of Carbon Emissions from Transportation Energy Consumption in Jiangsu Province Based on LMDI. Math. Pract. Theory 2015, 45, 16–22. [Google Scholar]
  28. Bian, L.; Ji, M. Research on Influencing Factors and Prediction of Transportation Carbon Emissions in Qinghai. Ecol. Econ. 2019, 35, 35–39+100. [Google Scholar]
  29. Liu, X.; Wu, P.; Wang, H. Analysis of Driving Factors of Transport Carbon Emissions in Inner Mongolia Based on LMDI Model. Transp. Energy Conserv. Environ. Prot. 2024, 20, 89–93+100. [Google Scholar]
  30. Huang, L.; Liu, J.; Liu, Y.; Ju, X.; Wang, P. Spatial and temporal evolution characteristics and driving factors of urban traffic carbon emissions in Jiangsu Province. Environ. Ecol. 2024, 6, 1–8. [Google Scholar]
  31. Liu, Y.; Wu, M.; Mu, R. Carbon Emission Measurement, Factor Decomposition and Low Carbonization Strategies of Transportation Industry in Tibet. J. Tibet Univ. 2021, 36, 126–133. [Google Scholar]
  32. Gong, J.; Hu, J.; Wang, X. Analysis of the Influencing Factors of Carbon Emissions from Urban Transportation. Automob. Technol. 2024, 49, 170–174. [Google Scholar]
  33. Eggleston, S.; Buendia, L.; Miwa, K.; Ngara, T.; Tanabe, K. 2006 IPCC Guidelines for National Greenhouse Gas Inventories; Institute for Global Environmental Strategies: Hayama, Japan, 2006. [Google Scholar]
  34. Su, X.; An, J.; Zhang, Y. Support vector machine regression forecasting of O3 concentrations based on wavelet transformation. China Environ. Sci. 2019, 39, 3719–3726. [Google Scholar]
  35. Zhao, N.; Lu, Y. Remote-sensing estimation of near-surface ozone concentration based on XGBoost. Acta Sci. Circumstantiae 2022, 42, 95–108. [Google Scholar]
  36. Montes, C.; Kapelan, Z.; Saldarriaga, J. Predicting non-deposition sediment transport in sewer pipes using Random forest. Water Res. 2021, 189, 116639. [Google Scholar] [CrossRef] [PubMed]
  37. Song, Y.; Shu, Q.; Jin, C.; Zheng, S.; Luo, L. Regional Transport Carbon Emission Forecasting and Peak Carbon Pathway Planning in China. Environ. Sci. 2024, 1–18. [Google Scholar] [CrossRef]
  38. Shekar, B.H.; Dagnew, G. Grid Search-Based Hyperparameter Tuning and Classification of Microarray Cancer Data. In Proceedings of the 2019 Second International Conference on Advanced Computational and Communication Paradigms (ICACCP), Majitar, East Sikkim, India, 25–28 February 2019; pp. 1–8. [Google Scholar]
  39. Alibrahim, H.; Ludwig, S.A. Hyperparameter optimization: Comparing genetic algorithm against grid search and bayesian optimization. In Proceedings of the 2021 IEEE Congress on Evolutionary Computation (CEC), Krakow, Poland, 28 June–1 July 2021; pp. 1551–1559. [Google Scholar]
  40. Gustafson, P.; Srinivasan, C.; Wasserman, L. Local sensitivity analysis. Bayesian Stat. 1996, 5, 197–210. [Google Scholar]
  41. Tortorelli, D.A.; Michaleris, P. Design sensitivity analysis: Overview and review. Inverse Probl. Eng. 1994, 1, 71–105. [Google Scholar] [CrossRef]
  42. Ribeiro, M.T.; Singh, S.; Guestrin, C. “Why should i trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 1135–1144. [Google Scholar]
  43. Bao, Z.; Ou, Y.; Chen, S.; Wang, T. Land use impacts on traffic congestion patterns: A tale of a Northwestern Chinese City. Land 2022, 11, 2295. [Google Scholar] [CrossRef]
  44. Palden, N.; Da, Q. Electric Vehicle Sales Increase in Tibet as Infrastructure Expanded. The China Daily Global, 18 October 2023. Available online: https://usa.chinadaily.com.cn/a/202310/18/WS652f1cefa31090682a5e91a0.html (accessed on 18 October 2023).
  45. Li, H.; Luo, N. Will improvements in transportation infrastructure help reduce urban carbon emissions?—Motor vehicles as transmission channels. Environ. Sci. Pollut. Res. 2022, 29, 38175–38185. [Google Scholar] [CrossRef]
  46. Zhang, Y.; Kong, Y.; Quan, J.; Wang, Q.; Zhang, Y.; Zhang, Y. Scenario analysis of energy consumption and related emissions in the transportation industry—A case study of Shaanxi Province. Environ. Sci. Pollut. Res. 2024, 31, 26052–26075. [Google Scholar] [CrossRef] [PubMed]
  47. Yang, X.; Qiu, X.; Fang, Y.; Xu, Y.; Zhu, F. Spatial variation of the relationship between transport accessibility and the level of economic development in Qinghai-Tibet Plateau, China. J. Mt. Sci. 2019, 16, 1883–1900. [Google Scholar] [CrossRef]
  48. Xu, J.; Liu, J.; Xu, Y.; Pei, T. Visualization and analysis of local and distant population flows on the Qinghai-Tibet Plateau using crowd-sourced data. J. Geogr. Sci. 2021, 31, 231–244. [Google Scholar] [CrossRef]
  49. Sang, S.; He, W.; Ni, M. How off-season tourism promotion affects seasonal destinations? A multi-stakeholder perspective in Tibet. Tour. Rev. 2021, 76, 229–240. [Google Scholar]
  50. Wang, B.; Yang, Z.; Han, F.; Shi, H. Car Tourism in Xinjiang: The Mediation Effect of Perceived Value and Tourist Satisfaction on the Relationship between Destination Image and Loyalty. Sustainability 2017, 9, 22. [Google Scholar] [CrossRef]
  51. Zheng, Y.; Wang, S.; Liu, L.; Aloisi, J.; Zhao, J. Impacts of remote work on vehicle miles traveled and transit ridership in the USA. Nat. Cities 2024, 1, 346–358. [Google Scholar] [CrossRef]
  52. Jabbar, M.; Yusoff, M.M.; Shafie, A. Assessing the role of urban green spaces for human well-being: A systematic review. GeoJournal 2022, 87, 4405–4423. [Google Scholar] [CrossRef] [PubMed]
  53. Lombardía, A.; Gómez-Villarino, M.T. Green infrastructure in cities for the achievement of the un sustainable development goals: A systematic review. Urban Ecosyst. 2023, 26, 1693–1707. [Google Scholar] [CrossRef]
  54. Ma, H.; Chen, W.; Ma, H.; Yang, H. Influence of publicity and education and environmental values on the green consumption behavior of urban residents in Tibet. Int. J. Environ. Res. Public Health 2021, 18, 10808. [Google Scholar] [CrossRef]
Figure 1. R2 scores of each model from three-fold cross-validation.
Figure 1. R2 scores of each model from three-fold cross-validation.
Sustainability 16 08419 g001
Figure 2. Comparison of fitted values and actual values on the training set for the four models.
Figure 2. Comparison of fitted values and actual values on the training set for the four models.
Sustainability 16 08419 g002
Figure 3. Comparison of predicted values and actual values on the test set for the four models.
Figure 3. Comparison of predicted values and actual values on the test set for the four models.
Sustainability 16 08419 g003
Figure 4. R2 scores of the three optimized models across different principal components.
Figure 4. R2 scores of the three optimized models across different principal components.
Sustainability 16 08419 g004
Figure 5. Comparison of fitted values and actual values on the training set for Model_gs_rs and Model_rga_rs.
Figure 5. Comparison of fitted values and actual values on the training set for Model_gs_rs and Model_rga_rs.
Sustainability 16 08419 g005
Figure 6. Comparison of predicted values and actual values on the test set for Model_gs_rs and Model_rga_rs.
Figure 6. Comparison of predicted values and actual values on the test set for Model_gs_rs and Model_rga_rs.
Sustainability 16 08419 g006
Figure 7. Impact of core variables on model output using the central difference method.
Figure 7. Impact of core variables on model output using the central difference method.
Sustainability 16 08419 g007
Figure 8. Feature importance of variables using LIME mean values.
Figure 8. Feature importance of variables using LIME mean values.
Sustainability 16 08419 g008
Table 1. Summary of the literature review on transportation carbon emissions.
Table 1. Summary of the literature review on transportation carbon emissions.
LiteratureResearch ContentModel and AlgorithmResearch Factors
Ağbulut [7]Prediction of Turkey’s transportation energy demand and CO2 emissionsRandom Forest, SVM, XGBoost, etc.GDP, per capita income, total population, urbanization rate, energy consumption, energy prices, number of vehicles, road length, vehicle efficiency
Gao et al. [9]Driving factors for transportation carbon emission reduction in the Silk Road Economic BeltLMDIConverted transportation turnover, total energy consumption, gross transportation output, regional GDP, population size
Wang et al. [14]Transportation carbon emissions predictionIntegrated VMD and SSA-LSSVMGDP, energy consumption, number of vehicles, population size
Wang et al. [23]Driving factors and decoupling effects of transportation carbon emissions in the western regionLMDIEnergy consumption in the transportation sector, equivalent turnover, added value of the transportation sector, GDP, resident population
Liu et al. [25]Measurement and driving factors of tourism transportation carbon emissions in East ChinaLMDIAdded value of the tertiary industry, regional GDP, tourism revenue, number of tourists, number of scenic spots, passenger turnover, passenger volume, energy consumption in transportation, scale of tourism
Li et al. [26]Spatiotemporal evolution and influencing factors of tourism transportation carbon emissions in the provinces along the Grand Canal Cultural BeltGIS and regression analysisRegional GDP, number of inbound tourists, added value of the tertiary industry, energy consumption per passenger turnover, number of libraries, number of museums, number of community cultural centers
Liu et al. [29]Driving factors of transportation carbon emissions in Inner MongoliaLMDIEnergy consumption in transportation, total energy consumption in transportation, total transportation output, population size
Huang et al. [30]Spatiotemporal evolution characteristics and driving factors of urban transportation carbon emissions in Jiangsu ProvinceSpatial autocorrelation analysis and regression analysisAdded value of the transportation sector, regional GDP, resident population, total urban bus passenger volume, built-up area, urban green coverage areas
Liu et al. [31]Driving factors and decoupling effects of transportation carbon emissions in TibetLMDI and DPSIRGDP, energy consumption, total population, equivalent turnover
Gong et al. [32]Influencing factors of transportation carbon emissions in LhasaPLSRTotal population, GDP, vehicle ownership, passenger turnover, freight turnover, public transport passenger volume, number of tourists
This paperMeasurement, prediction model development, and analysis of transportation carbon emissions in TibetGS-RBF-SVM, central difference method, and LIME algorithmNumber of civilian vehicles, transportation land-use area, transportation output, total public transport passenger volume, number of buses and electric vehicles per 10,000 people, tourism revenue, number of tourists, total retail sales of consumer goods, added value of the tertiary industry, urban green coverage areas, per capita GDP, built-up area
Table 2. Turnover by various transportation modes in Tibet from 2008 to 2020.
Table 2. Turnover by various transportation modes in Tibet from 2008 to 2020.
YearPassenger Turnover (100 Million Passenger–Kilometers)Freight Turnover (100 Million Ton–Kilometers)
RailRoadCivil AviationRailRoadCivil AviationPipeline
20086.2424.14/6.725.4/1.15
20098.2521.73/9.9326.56/1.54
20109.4422.77/11.9727.1/1.32
201110.3822.51.9912.9227.10.011.52
201210.2523.27.8818.327.870.041.62
201311.431.0316.121.45860.081.5
201412.3432.7820.5322.94860.091.6
201514.0924.229.4122.0196.10.150.9
201616.0423.7441.3828.3694.50.250.8
201718.126.6661.1729.19105.820.41.3
201818.8627.9778.2833.22116.840.551.18
201918.0827.2385.339.91114.470.651.12
202012.3414.6158.3339.8116.730.431.22
Table 3. Carbon emission factors for various transportation modes from 2005 to 2020.
Table 3. Carbon emission factors for various transportation modes from 2005 to 2020.
YearPassenger Transport (gCO2/Passenger–Kilometer)Freight Transport (gCO2/Ton–Kilometer)
RailRoadCivil AviationRailRoadCivil AviationPipeline
200510.78106.83141.2210.78235.311835.8628.95
201010.61120.15120.2410.61264.641563.1228.95
201412.4246.39112.0612.42102.171456.7828.95
201513.6847.23106.6313.68104.021386.1928.95
201615.1143.13109.715.11951426.128.95
201714.8943.89106.6614.8996.681386.5828.95
201814.0341.67105.9314.0391.781377.0928.95
201913.5351.4995.4713.53113.421241.1128.95
202017.1750.15158.2217.17110.452056.8628.95
Table 4. Interpolated carbon emission factors with 95% confidence intervals for various transportation modes (2006–2009 and 2011–2013).
Table 4. Interpolated carbon emission factors with 95% confidence intervals for various transportation modes (2006–2009 and 2011–2013).
YearPassenger Transport (gCO2/Passenger–Kilometer)Freight Transport (gCO2/Ton–Kilometer)
RailRoadCivil AviationRailRoadCivil AviationPipeline
2006[7.87, 13.49][66.79, 169.67][122.09, 149.17][7.87, 13.49][147.12, 373.72][1587.22, 1939.26]28.95
2007[7.79, 13.41][76.01, 178.89][116.86, 143.94][7.79, 13.41][167.42, 394.02][1519.12, 1871.16]28.95
2008[7.74, 13.36][76.01, 178.89][116.86, 143.94][7.74, 13.36][167.42, 394.02][1519.12, 1871.16]28.95
2009[7.74, 13.36][79.15, 182.03][108.81, 135.89][7.74, 13.36][174.34, 400.94][1519.12, 1871.16]28.95
2011[7.95, 13.57][48.90, 151.78][106.00, 133.08][7.95, 13.57][107.70, 334.30][1378.06, 1730.10]28.95
2012[8.26, 13.88][25.26, 128.14][105.46, 132.54][8.26, 13.88][55.64, 282.24][1371.04, 1723.08]28.95
2013[8.78, 14.40][25.26, 128.14][103.50, 130.58][8.78, 14.40][10.82, 237.42][1345.46, 1697.50]28.95
Table 5. Carbon emission factors for various transportation modes from 2005 to 2020 after interpolation.
Table 5. Carbon emission factors for various transportation modes from 2005 to 2020 after interpolation.
YearPassenger Transport (gCO2/Passenger–Kilometer)Freight Transport (gCO2/Ton–Kilometer)
RailRoadCivil AviationRailRoadCivil AviationPipeline
200510.78106.83141.2210.78235.311835.8628.95
200610.68118.23135.6310.68260.421763.2428.95
200710.60127.45130.4010.60280.721695.1428.95
200810.55132.30125.8510.55291.391636.0828.95
200910.55130.59122.3510.55287.641590.5728.95
201010.61120.15120.2410.61264.641563.1228.95
201110.76100.34119.5410.76221.001554.0828.95
201211.0776.70119.0011.07168.941547.0628.95
201311.5956.35117.0411.59124.121521.4828.95
201412.4246.39112.0612.42102.171456.7828.95
201513.6847.23106.6313.68104.021386.1928.95
201615.1143.13109.7015.1195.001426.1028.95
201714.8943.89106.6614.8996.681386.5828.95
201814.0341.67105.9314.0391.781377.0928.95
201913.5351.4995.4713.53113.421241.1128.95
202017.1750.15158.2217.17110.452056.8628.95
Table 6. Carbon emissions from various transportation modes in Tibet from 2008 to 2020.
Table 6. Carbon emissions from various transportation modes in Tibet from 2008 to 2020.
YearPassenger Transport (Thousand Tons CO2)Freight Transport (Thousand Tons CO2)Total Emissions (Thousand Tons CO2)
RailRoadCivil AviationRailRoadCivil AviationPipeline
200865.833193.62/70.687401.43/33.291076.49
200987.022837.75/104.747639.70/44.581071.38
2010100.162735.82/127.007171.74/38.211017.29
2011111.712257.55237.89139.045988.9815.5444.00879.47
2012113.421779.50937.75202.504708.3661.8846.90785.03
2013132.181748.641884.29248.7010,674.06121.7243.431485.30
2014153.261520.662300.59284.918786.62131.1146.321322.35
2015192.751142.973135.99301.109996.32207.9326.061500.31
2016242.361023.914539.39428.528977.50356.5323.161559.14
2017269.511170.116524.39434.6410,230.68554.6337.641922.16
2018264.611165.518292.20466.0810,723.58757.4034.162170.35
2019244.621402.078143.59539.9812,983.19806.7232.422415.26
2020211.88732.699228.97683.3712,892.83884.4535.322466.95
Table 7. Input variables and their corresponding indicators for each dimension of Tibet’s transportation carbon emissions.
Table 7. Input variables and their corresponding indicators for each dimension of Tibet’s transportation carbon emissions.
Internal/External FactorsDimensionInput VariableUnitCode
Internal factorsTransportation development levelNumber of civilian vehicles10,000 vehicles x 1
Transportation land-use areakm2 x 2
Transportation output value100 million CNY x 3
Public transportation development levelTotal public transport passenger volume10,000 persons x 4
Number of buses and electric vehicles per 10,000 peoplevehicles x 5
External factorsTourism development levelTourism revenue100 million CNY x 6
Number of tourists10,000 persons x 7
Tertiary industry development levelTotal retail sales of consumer goods100 million CNY x 8
Added value of tertiary industry100 million CNY x 9
Socio-economic development levelUrban green coverage areaskm2 x 10
Per capita GDP10,000 CNY x 11
Built-up areakm2 x 12
Control variables Passenger turnover100 million passenger-kilometers c 1
Freight turnover100 million ton-kilometers c 2
Table 8. Input variable values from 2008 to 2020.
Table 8. Input variable values from 2008 to 2020.
Variablesx1x2x3x4x5x6x7x8x9x10x11x12c1c2
Year
200819.30240.0019.906861.0012.7622.59224.64145.47229.8422.001.3779.0030.3833.25
200919.86333.4721.195618.0012.6055.99561.06178.89255.0926.671.5281.3029.9838.03
201022.03336.2022.125295.9020.9171.44685.14217.94293.0527.781.7284.9032.2140.39
201125.78347.1323.957251.009.0297.06869.76270.42346.1431.572.0189.7034.8741.55
201227.38348.6026.237139.008.59126.481058.39318.39408.7843.982.28119.7041.3347.83
201331.30355.4729.218001.007.70165.181291.06371.48476.2142.632.62120.3058.53109.03
201433.47366.6030.808380.008.43204.001553.14422.75541.2956.302.93126.3065.65110.63
201537.45375.2731.769128.009.05281.922017.53477.07607.5863.343.18144.5067.70119.16
201640.46418.6731.268688.006.20330.752315.94539.05672.0772.703.50145.2081.16123.91
201744.68421.6034.089120.0010.43379.372561.43618.84761.2454.123.92147.60105.93136.71
201858.24786.3547.709615.0012.97490.143368.73711.76837.3363.864.41163.70125.10151.79
201963.611666.0047.809257.007.62559.284012.15773.40924.0164.154.75164.40130.61156.15
202070.212754.4045.707588.008.30366.423505.01745.78953.8466.565.23168.4085.28158.18
Table 9. Libraries and optimized parameters used in constructing the four models.
Table 9. Libraries and optimized parameters used in constructing the four models.
ModelPython LibraryGrid Search ParametersParameters
Model_rsMachine learning library: Scikit-learnRegularization parameter: [0.01, 0.1, 1, 10, 100, 1000]
Kernel coefficient: [0.0001, 0.001, 0.01, 0.1, 1]
Regularization parameter: 1000
Kernel coefficient: 0.001
Model_xgMachine learning library: Scikit-learn and XGBoostNumber of trees: [100, 200, 300]
Maximum tree depth: [3, 5, 7]
Learning rate: [0.01, 0.05, 0.1]
Number of trees: 300
Maximum tree depth: 3
Learning rate: 0.05
Model_rfMachine learning library: Scikit-learnNumber of trees: [100, 200, 500]
Maximum tree depth: [10, 20]
Minimum samples per leaf node: [1, 2, 4]
Number of trees: 200
Maximum tree depth: 10
Minimum samples per leaf node: 2
Model_gbMachine learning library: Scikit-learnNumber of trees: [100, 150, 200]
Maximum tree depth: [1, 2, 3]
Minimum samples per leaf node: [1, 2, 5]
Learning rate: [0.01, 0.05, 0.1]
Number of trees: 200
Maximum tree depth: 3
Minimum samples per leaf node: 1
Learning rate: 0.05
Table 10. Evaluation results of the four models across four metrics.
Table 10. Evaluation results of the four models across four metrics.
MetricModel
Model_rsModel_xgModel_rfModel_gb
MSE0.06290.13010.08210.0754
MAE0.21170.25770.21820.2002
R20.93500.80200.91340.9118
MAPE1.04271.89691.09421.2280
Table 11. Parameters for the optimization algorithms in Model_gs_rs and Model_rga_rs, along with the optimized RBF-SVM parameters.
Table 11. Parameters for the optimization algorithms in Model_gs_rs and Model_rga_rs, along with the optimized RBF-SVM parameters.
ModelPython LibraryOptimization Algorithm ParametersRBF-SVM Parameters
Model_gs_rsMachine learning library: Scikit-learnRegularization parameter: [0.01, 0.1, 1, 10, 100, 1000]
Kernel coefficient: [0.0001, 0.001, 0.01, 0.1, 1]
Regularization Parameter: 1000
Kernel coefficient: 0.001
Model_rga_rsMachine learning library: Scikit-learn and DEAPSelection: Grid Search
Crossover: Uniform Crossover
Fitness function: Evaluate
Population size: 50
Generations: 40+
Crossover rate: 0.5
Mutation rate: 0.2
Regularization Parameter: 42.76
Kernel coefficient: 0.02
Table 12. Evaluation results of each optimized model across four metrics.
Table 12. Evaluation results of each optimized model across four metrics.
MetricModel
Model_gs_rsModel_rga_rs
MSE0.00890.0220
MAE0.09070.1200
R20.99240.9835
MAPE0.29900.1198
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bo, W.; Zhao, K.; Cheng, G.; Wang, Y.; Zhang, J.; Cheng, M.; Yang, C.; Da, W. Study on Transportation Carbon Emissions in Tibet: Measurement, Prediction Model Development, and Analysis. Sustainability 2024, 16, 8419. https://doi.org/10.3390/su16198419

AMA Style

Bo W, Zhao K, Cheng G, Wang Y, Zhang J, Cheng M, Yang C, Da W. Study on Transportation Carbon Emissions in Tibet: Measurement, Prediction Model Development, and Analysis. Sustainability. 2024; 16(19):8419. https://doi.org/10.3390/su16198419

Chicago/Turabian Style

Bo, Wu, Kunming Zhao, Gang Cheng, Yaping Wang, Jiazhe Zhang, Mingkai Cheng, Can Yang, and Wa Da. 2024. "Study on Transportation Carbon Emissions in Tibet: Measurement, Prediction Model Development, and Analysis" Sustainability 16, no. 19: 8419. https://doi.org/10.3390/su16198419

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop