Next Article in Journal
The Guidance of Public Value in China’s Historic Environment: Research on Regeneration Strategies Using Taiyuan’s Bell Tower Street as an Example
Previous Article in Journal
How to Use Evidence Rules Reasonably to Resolve Land Disputes: Analysis of Typical Land Dispute Cases from China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Approach for Mapping Ecotourism Suitability Using Machine Learning: A Case Study of Zhangjiajie, China

School of Geography and Ocean Science, Nanjing University, Nanjing 210023, China
*
Authors to whom correspondence should be addressed.
Land 2024, 13(8), 1188; https://doi.org/10.3390/land13081188 (registering DOI)
Submission received: 25 June 2024 / Revised: 26 July 2024 / Accepted: 30 July 2024 / Published: 1 August 2024

Abstract

:
The assessment of ecotourism suitability is crucial for sustainable regional development and is seen as an effective strategy to achieve both environmental protection and economic growth. One of the key challenges in land research is effectively identifying potential ecotourism resources while balancing regional protection and development. This study mapped the suitability of ecotourism in Zhangjiajie, China, using a combination of various geospatial data sources and four machine-learning techniques. Additionally, an indicator system was developed, covering the ecological environment, geological geomorphology, socioeconomics, and resource availability. The prediction results for suitability classified the area into four categories: highly suitable, moderately suitable, marginally suitable, and unsuitable; based on the ensemble results generated by the four algorithms, these categories accounted for 19.34%, 28.78%, 23.87%, and 28.01% of the total area, respectively. This study’s findings illustrate the spatial distribution of ecotourism suitability in Zhangjiajie, providing valuable insights for identifying potential ecotourism resources as well as informing regional planning and policy-making.

1. Introduction

Tourism is one of the fastest-growing sectors worldwide [1,2] and plays a vital role in national economies. However, its growth has led to serious environmental problems and increased socioeconomic costs. The most significant issue is the environmental damage caused by tourism development, especially in ecologically sensitive and fragile natural areas [2]. Therefore, it is crucial to adapt the tourism industry to the demands of the new era and promote sustainable tourism development. Globally, there is an increasing emphasis on ecotourism as a new tourism model that balances economic development and environmental protection, in accordance with the United Nations Sustainable Development Goals (SDGs) [3].
The relationship between ecotourism and the UN SDGs can facilitate sustainable regional development [4,5]. Ecotourism, defined by the International Ecotourism Society (TIES) as ‘responsible travel to natural areas that conserves the environment, sustains the wellbeing of the local people, and provides a means to promote sustainable development’ [4], emphasizing the moderation of natural and humanistic tourism activities under the premise of environmental protection [6].
Ecotourism helps minimize damage to and the pollution of the natural environment by offering green, low-carbon, and recycled tourism products and services. This promotes harmonious coexistence between humans and nature [7]. The development of ecotourism not only protects biodiversity and ecological balance but also provides residents with more employment opportunities and economic benefits and promotes the sustainable development of the economy [5]. Ecotourism is the most promising form of tourism for the future because of its focus on conservation, communities, and interpretation [8], which can increase local economic income while safeguarding the environment [9]. According to UN Tourism, global tourism revenue will reach USD 3.3 trillion in 2023, accounting for 3% of global GDP.
Assessing an area’s suitability for ecotourism is an important first step in developing ecotourism resources. It is essential to use scientific rigor and accuracy to achieve sustainable development. Suitability analysis involves evaluating how well a specific land use type is suited to a particular region [10]. Ecotourism suitability evaluation takes into account various aspects, such as the natural environment [11] as well as socioeconomic [12], resource availability [6], and cultural connotations [13]. Multicriteria approaches that integrate multiple sources of spatial data and corresponding GIS spatial analysis techniques have been widely used to assess the ecotourism potential of various locations [9]. In terms of research methods, most existing ecotourism suitability evaluations have relied on GIS and remote sensing (RS) [1,3,10,14,15,16], analytic hierarchy process (AHP) [12,15], multiple criteria decision analysis (MCDA) [1,3,14,16], multiple-attribute decision making (MADM) [9,12], strengths, weaknesses, opportunities, and threats (SWOT) [6], principal component analysis (PCA) [9], and Delphi [17,18], among others. These methods typically involve determining factor weights for comprehensive consideration and integrated evaluation of various influencing factors.
When assessing regional ecotourism potential, previous studies have combined various methods with GIS, the most common combination being GIS with AHP [1,11]. For instance, the scholarly literature [15] used fuzzy AHP (F-AHP) alongside GIS to determine feature weights, while the scholarly literature [12] used AHP and ordered weighted averaging (OWA) for MADM modeling and a generic grading scale in a GIS environment to assess suitability across the study area. Research frameworks that combine GIS and MCDA for ecotourism suitability assessments have also been widely used [3]. For example, the scholarly literature [16] combined GIS and MCDA to assess the suitability of ecotourism in mountainous landscapes [16]. Some researchers have combined these methods with other analytical methods. For example, the scholarly literature [6] integrated SWOT with MCDM to assess the potential of ecotourism areas with natural and cultural features.
The scholarly literature [9] used PCA combined with clustering algorithms to identify potential sites for ecotourism development. They revealed a significant contribution of waterbody variability to ecotourism development. The scholarly literature [17] used the fuzzy Delphi method and expert consultation to establish a cross-dimensional indicator system for the evaluation of sustainable ecotourism, offering valuable insights into the complex dynamics of ecotourism in the Belize region. Integrating GIS has become a key method for evaluating the suitability of regional ecotourism. For example, the scholarly literature [14] employed a GIS overlay to create ecotourism suitability maps, and their classification method provided important insights. Some researchers have also integrated spatial information into interactive platforms. For example, the scholarly literature [5] developed a geospatial decision-support system, which serves as an interactive platform for planners, government agencies, and tourists. However, such platforms have limitations in providing relevant geospatial information and are not conducive to mapping ecotourism suitability.
These methods have significantly contributed to the field of ecotourism suitability assessment; however, the potential lack of objectivity in factor weighting of the current evaluation system may result in various contradictions and challenges in ecotourism planning, implementation, and management. During the planning phase, the subjective or unilateral determination of certain key factor weights may overemphasize economic benefits at the expense of ecological conservation [19], which could adversely impact the long-term health and sustainability of ecotourism, irreparably harming local ecosystems. In the implementation phase, non-objective factor identification may result in inefficient resource allocation. Biases in factor inputs can result in resource wastage and misallocation during the construction of ecotourism facilities, thereby reducing the overall benefits of ecotourism [20]. During the management phase, conflicts of interest and social contradictions may arise because of the involvement of multiple stakeholders, including the government, local residents, businesses, and tourists [12,20]. For example, some residents may lose traditional livelihoods or living environments due to the development of ecotourism, whereas some enterprises may cause social dissatisfaction due to the excessive benefits they gain.
Artificial intelligence (AI) technology has facilitated the development of novel approaches and methodologies for ecotourism suitability evaluations, thereby offering substantial benefits in the processing and analysis of ecotourism data. The scholarly literature [21] leveraged machine learning and social media big data to identify patterns of cultural ecosystem services and tourist emotions in Hong Kong’s coastal areas. The scholarly literature [22] leverages social big data to effectively elucidate spatial access patterns and hotspots pertaining to nature tourism, thereby furnishing novel insights into the management of protected areas and augmenting participatory methodologies in practice. However, despite their effectiveness in analyzing macro-level causes, predicting ecotourism market trends, and exploring tourists’ emotional patterns, these methods fail to provide a conclusive map of ecotourism suitability that meets the demand for regional planning, development, and conservation. In this study, to address the challenges and issues faced, we used a machine-learning-based framework to assess the suitability of ecotourism. This framework considered the ecological environment, geology and landforms, humanities and society, and resource availability as the main factors influencing ecotourism suitability mapping. This allowed the evaluation process to gather information, recognize patterns, establish regularities, and make precise predictions based on a wide range of geographical features. By combining AI technology with real-world research, our aim was to achieve a comprehensive and accurate assessment of ecotourism suitability.

2. Study Area and Data

2.1. Study Area

Zhangjiajie, situated in the northwestern part of Hunan Province, China, boasts a unique geographical location with a range of 109°40′–111°20′ E longitude and 28°52′–29°48′ N (Figure 1). The region predominantly features mountainous terrain and is globally acclaimed for its distinctive Zhangjiajie landforms, including the peak sandstone forest landforms. The area experiences a humid subtropical monsoon climate characterized by ample precipitation and simultaneous rain and heat, fostering lush vegetation growth and a diverse range of flora and fauna. Zhangjiajie is home to several rare wildlife species and, as an internationally recognized ecotourism destination, is graced with the distinctions of being a World Geopark, a World Natural Heritage site, and a National Forest Park. The exceptional natural scenery and cultural resources attract a substantial annual influx of domestic and international tourists. In 2023 alone, the city welcomed 36.274 million visitors, generating a total tourism revenue of RMB 51.46 billion, accounting for 83.8% of the region’s gross product.

2.2. Data Source

This study used multiple sources of geospatial data to evaluate ecotourism suitability, as outlined in Table 1. The dataset includes a wide range of geospatial information. Electronic distance measurement (EDM) data, primarily used for analyzing digital terrain, accurately represent the area’s topographic variations. RS data can be used to calculate ecological indices such as the NDVI, providing insights into the area’s ecological characteristics. The nighttime light data show a strong correlation with regional economic development, offering valuable information on the economic aspects of the study area. Meteorological data, including temperature and precipitation, help characterize the region’s climate conditions and are particularly important for understanding the local climate. Geospatial data sources have varying resolutions and applications. To address this, we combined these data sources into a 30 × 30 m raster format for further modeling and analysis.
It is important to emphasize that ensuring the reliability of our model’s training required precise geographic location data for ecotourism attractions. These data were sourced from the esteemed Third Survey and Mapping Institute of Hunan Province. By meticulously extracting feature values for each data point, we ensured that this information effectively contributed to the machine learning model’s training process, thereby enhancing the credibility of our predictive analytics.

3. Methods

3.1. Framework

This study introduces a research framework for assessing the suitability of ecotourism using various geospatial data sources and four machine learning algorithms (Figure 2). The framework consisted of three main components: data acquisition and preprocessing, model training, and model evaluation and prediction. The data acquisition and preprocessing phases, outlined in Section 2.2 and Section 3.3, involved consolidating geospatial data from different sources and creating feature matrices for subsequent modeling. Model training, outlined in Section 4.2, included processing the sample points and training the machine learning model. The data were used to train the machine learning model by extracting the values of each raster layer from the sample points. Model evaluation and prediction are detailed in Section 4.2 and Section 4.3. The results include a comprehensive assessment of the trained machine learning model to ensure its robustness and generalizability. Finally, the prediction results of the four machine learning models were obtained and visualized by inputting all the data into the model. To mitigate the risk of relying on a single model, the predictions from the four machine learning models were combined to produce a final prediction.
Central to ecotourism is the principle of sustainable development. Therefore, constructing an assessment framework for the ecotourism suitability requires a comprehensive consideration of the multifaceted impacts of tourism activities on the local ecological environment. Key factors include soil and water conservation potential and the NDVI, which reflect the ecological environment’s carrying capacity, as well as limiting factors such as slope and relief. These elements are crucial in shaping the evolution of ecotourism. This approach highlights the importance of not only evaluating the economic benefits of ecotourism but also ensuring environmental protection and fostering positive impacts on the local community. Ecotourism aims to mitigate environmental degradation by promoting a harmonious relationship between tourism and the natural world. Our framework utilizes suitability probabilities applied to individual raster cells, offering a high-resolution approach to local ecotourism planning and avoiding overly broad planning strategies. This method clearly distinguishes between conservation and development areas, helping to further minimize the environmental impacts and damage caused by tourism activities.

3.2. Machine Learning

3.2.1. Random Forest (RF)

The RF algorithm, an ensemble learning technique that leverages decision trees to produce accurate predictions [25], integrates the results of multiple decision trees and assigns the category with the highest number of votes as the final output.
f x = a r g m a x Y i = 1 k   I ( h i ( x , θ i ) = Y )
where f x is the final classification result, h i ( x ,   θ i ) is the decision trees, θ i is a random variable following independent homogeneous distribution, I .   is the indicator function, and Y is the objective variable.
The RF algorithm benefits from the bagging technique, which improves prediction accuracy and increases model diversity using random sampling. RF randomly selects a subset of features for splitting, which introduces an element of randomness that reduces the correlation between features and mitigates the risk of overfitting.

3.2.2. Xtreme Gradient Boosting (XGBoost)

XGBoost is a machine learning algorithm that employs gradient boosting trees. This model was developed by Tianqi Chen [26]. It iteratively adds new decision trees to optimize the objective function. Each added tree focuses on reducing the prediction residuals of the previous tree, thereby improving the overall prediction accuracy. The objective function of XGBoost consists of a loss function and regularization term.
L ϕ = i = 1 I l y ^ i , y i + k = 1 K Ω f k
where L ϕ is the objective function, I is the number of training samples, i is the i sample in the sample set, K is the number of boosted trees, k is the k tree in the boosted tree, l y ^ i , y i is the loss function that measures the difference between the true label y i and predicted label y i ^ , and Ω f k is the regularity term used to control the model complexity to prevent overfitting and improve the model generalization ability.
Ω f k = γ T + 1 2 λ | | w | | 2
where Ω f k is the regularization strength, T is the total number of leaf nodes, γ and λ are the regularization parameters, and w is the leaf node weight vector.
XGBoost uses a second-order Taylor expansion of the loss function to accurately approximate the true loss function.
L ~ t = i = 1 I g i f t x i + 1 2 h i f t 2 x i + Ω f k
where L ~ t is the optimized objective function, and g i = y ^ i t 1   l y i , y ^ t 1 and h i = y ^ i t 1 2   l y i , y ^ t 1 are the first- and second-order gradient statistics of the loss function, respectively.
The XGBoost algorithm demonstrates outstanding predictive performance, scalability, interpretability, and support for parallel computing. It also automatically handles missing values. However, it is essential to fine-tune the algorithm parameters to prevent overfitting and underfitting.

3.2.3. Support Vector Machine (SVM)

The SVM algorithm is a popular supervised machine learning technique that classifies data by identifying an optimal hyperplane [27]. When the data are linearly separable, the problem can be transformed into an optimal solution by solving the Lagrange duality; however, when the data are not linearly separable, a kernel function is introduced to map them to a higher-dimensional space, making them linearly separable.
f x = w T Φ x + b
where f x is the distance from the sample x to the hyperplane, w is the feature weight vector, w T is the transpose of the weight vector w , Φ x is the feature space transformation function, and b is the bias term.
SVM is well equipped to handle high-dimensional data and can effectively handle outliers and noise. Additionally, it has strong generalization capabilities because of its ability to convert nonlinear problems into linear ones using kernel functions. However, its performance can be sensitive to parameters such as the kernel and penalty coefficient C. Thus, it is essential to use techniques like cross-validation and grid search to identify the best parameters during experimentation.

3.2.4. Adaptive Boosting (AdaBoost)

The AdaBoost algorithm, originally proposed by Freund and Schapire in 1997 [28], is an iterative approach that combines multiple weak classifiers into a single robust classifier. Its underlying principle is to adjust the weights assigned to both samples and weak classifiers, enabling the classifiers to progressively focus on instances that are difficult to classify, thereby enhancing the overall accuracy of the classification task [29].
f ( x ) = s i g n n = 1 N   α n G n ( x )
where f ( x ) is the trained strong learner, N is the number of weak learners, α n is the weight corresponding to the n weak learner, and G n ( x ) is the n weak learner.
α n = 1 2 l o g 1 e n e n
The AdaBoost algorithm exhibits robustness, adaptability, and generalizability while maintaining its capacity for effective data fitting even in scenarios with numerous features and intricate data distributions.

3.3. Feature Processing

In our study, we utilized scientific rigor, reliability, and accessible data to classify the factors affecting the assessment of regional ecotourism suitability into four components: ecological environment, geology and landforms, humanities and society, and resource availability. These components were derived from previous studies, which provided valuable insights for our research. Each component was then classified into five distinct characteristics for analysis, as outlined in Table 2.
Table 2. Feature building and information on relevant details.
Table 2. Feature building and information on relevant details.
FeatureProcessingDescriptionReference
Ecological environment
(Figure 3)
NDVI N D V I = R e d G r e e n R e d + G r e e n R e d   and   G r e e n denote the red and green bands in remote sensing data, respectively.[9]
TemperatureKriging interpolation and maskingAffects the variability of natural landscapes.[2]
PrecipitationKriging interpolation and maskingAffects vegetation ecology and scenic safety.[30]
HumidityKirchhoff transformAffects vegetation growth and distribution, which directly affect landscape ecology.[10]
Land useResample and maskingAffects the scale and quality of ecotourism.[10]
Geology and landforms
(Figure 4)
GeologyResample and maskingReflects the region’s geological environment.[31]
ElevationResampleReflects surface altitude conditions.[13]
SlopeSurface analysisReflects surface microstructure.[13]
AspectSurface analysisReflects surface mountain ecology.[32]
Relief R = H m a x H m i n H m a x   and   H m i n denote the maximum and minimum elevation per unit area, respectively.[15]
Humanities and society
(Figure 5)
PopulationResample and maskingInfluences the layout of ecotourism development and resource allocation.[33]
Nighttime light intensity (NILI)Resample and maskingIt is highly correlated with economic development.[31]
Road network density (ROND) D r = L r S r L r   and   S r denote the road lengths and unit sizes, respectively.[34]
Distance from roads (DISO)Euclidean distanceUsed to measure regional transport accessibility.[30]
Distance from settlements (DISS)Euclidean distanceIt serves as a crucial support node and stakeholder in the advancement of ecotourism.[2]
Resource availability
(Figure 6)
Aesthetic landscape value (ALSV)Kriging interpolation and maskingComprehensively reflects the region’s natural and human landscapes.[35]
BiodiversityKriging interpolation and maskingA prerequisite for the development of ecotourism.[35]
Soil conservation capacity (SOCC)Kriging interpolation and maskingIts strength directly affects the sustainable development of ecotourism.[32,36]
Forest ageResample and maskingReflects vegetated landscape resources.[24]
Distance from river (DISI)Euclidean distanceProvides water and ecological support.[9]

4. Results

4.1. Feature Correlation Analysis

We created a correlation heatmap (Figure 7) to visualize the connection between different geographic features. The correlation between precipitation and elevation was remarkably strong, with a value of 0.87. This indicated that the topography of the Zhangjiajie Mountain Area significantly influences rainfall patterns. In the summer, the southeastern monsoon brings moisture from the Pacific Ocean, leading to topographic rain, which results in abundant water for vegetation growth, promotes biodiversity, and enhances the scenic beauty of the region. The correlation between slope and relief was 0.91, which aligns with the fundamental geographic principles. Biodiversity and ALSV had the highest correlation (0.94), suggesting that areas with rich biodiversity tend to have higher ALSV values.
To apply the geographic features to ecotourism suitability assessment modeling, we integrated the variance threshold and mutual information methods for feature selection. The variance threshold test revealed no redundant features with a variance of zero. Subsequently, we utilized the mutual information method to analyze the relationship between the features and labels and showed that all the features exhibited a positive mutual information estimate, suggesting their relevance to the labels. Based on both analyses, we decided to retain all the features for subsequent modeling, as they contribute to preserving the original information of the data while also ensuring efficient modeling.

4.2. Model Evaluation

A thorough evaluation is necessary during the modeling process to ensure the model’s generalizability and robustness. In this study, we selected five evaluation metrics, namely, accuracy, precision, recall, F1, and area under the curve (AUC), to assess the model’s classification performance under ten-fold cross-validation (Table 3). Furthermore, we visualized the confusion matrix and ROC curves (Figure 8 and Figure 9) to provide a comprehensive understanding of the model’s performance.
As shown in Table 3, XGBoost exhibited superior performance compared to the remaining three types of models under 10-fold cross-validation. AdaBoost and RF followed in second and third place, respectively. In addition, SVM displayed a notable performance gap compared with the other three model types.
To visualize the model’s classification performance, we created a confusion matrix, as shown in Figure 8. XGBoost exhibited the best performance, followed by RF. Although the overall error rate of AdaBoost was slightly higher than that of SVM, the difference was not substantial.
Based on the results of the ROC curve and AUC calculations presented in Figure 9, XGBoost had the best performance, followed by AdaBoost, RF, and SVM. However, the differences between the models were not significant, especially for RF and AdaBoost, whose AUC values differed by only 0.0012. All four models performed well in predicting spatial classifications for the test data features.

4.3. Mapping Results of the Models

The input data were individually fed into four different pre-trained models, which produced the predictions shown in Figure 10. While these models showed a high level of agreement, certain areas had some differences. This suggested that while machine learning techniques can effectively model real-world training data, they may produce varied predictive outcomes in different regions due to differences in the underlying algorithms.
The evaluation results of the models (Table 3 and Figure 9) and the spatial distribution characteristics (Figure 10) indicated that the spatial variability in the model fit was consistent with the actual growth patterns of ecotourism in the region. These four model categories provided insights into the untapped potential of regional ecotourism, which can be visualized using geospatial visualization techniques. Notably, XGBoost exhibited the highest predictive power, followed closely by RF, whereas SVM fell short of the other three types of models, which can possibly be attributed to the fact that tree-based models possess greater learning capacities and are, therefore, better equipped to capture intricate spatial nuances in the data.

4.4. Comprehensive Results

To visualize the predictive performance of the models, two representative regions, namely, the urban area of Zhangjiajie and the Wulingyuan District (an ecotourism hotspot), were selected for comparative analysis (Figure 11).
A comparison of the predictions of the four models in the two selected regions revealed that the urban area of Zhangjiajie (Figure 11a-1–a-4) was not conducive to ecotourism development owing to the prevalence of human activity, with urban construction land and industrial and agricultural land holding leading positions (as indicated by the red color in Figure 11). However, the southern part of the urban area exhibited high suitability for ecotourism development (indicated by the green color in Figure 11), consistent with the current state of ecotourism development in the area. This region encompasses the Tianmen Mountain National Forest Park, which boasts a range of popular tourist destinations, such as Tianmen Cave, Ghost Valley Trail, and Glass Viewing Platform. The area is characterized by mountainous terrain, lush vegetation, rivers, and water systems, with diverse natural scenery, a significant elevation variation, and steep cliffs. The region’s rich ecotourism resources contributed to the formation of the current spatial distribution pattern of ecotourism, which is supported by our predictive findings.
Within the Wulingyuan District, the predictions of the four models exhibited a high degree of consistency, as depicted in Figure 11b-1–b-4. The area is generally well suited for ecotourism, which can possibly be attributed to the unique natural and cultural tourism resources in the region, as well as the protection provided by Zhangjiajie National Forest Park. The region is home to the renowned ecotourism destination, the Wulingyuan Scenic Spot, which features natural wonders such as quartz sandstone peak forest landforms, as well as cultural attractions such as air corridors and the setting of the classic Chinese novel Journey to the West. The spatial distribution of ecotourism attractions in the area aligns closely with the model predictions, further supporting the reliability of the predictive results obtained using the machine learning approach.

4.5. Result Verification

To mitigate the risk of relying on a single model, we computed the average prediction across the four models (Figure 12).
The zones with high suitability were mostly located in the central region of Wulingyuan, southern and southeastern regions of Yongding, northwestern, western, and southwestern regions of Cili, and northern, northwestern, and eastern regions of Sangzhi. These areas were selected to coordinate regional ecotourism resources, including Zhangjiajie National Forest Park, Tianzishan Nature Reserve, Suoxiyu Nature Reserve, and Yangjiajie Nature Reserve. In the southern part of Yongding (south of the urban area of Zhangjiajie), several patches of highly suitable areas were separated from the core urban area. This is related to two factors, first of which is the presence of Tianmen Mountain National Forest Park (Figure 13a), which features a combination of a natural landscape, biological resources, and humanistic history and is rich in ecotourism resources. Second, this area is a transitional zone between the basin topography of the urban area of Zhangjiajie and the southern Tianmen Mountain, resulting in different levels of suitability with clear boundaries. The highly suitable zones of Cili are concentrated primarily in the northwestern (Figure 13h), western, and southwestern regions, which feature mountainous terrain, high vegetation cover, and strong soil and water conservation capacity, coupled with a unique topography and outstanding aesthetic landscape value. In Sangzhi, the distribution patterns of highly suitable zones were similar.
The unsuitable zones were mainly found in the western and northwestern parts of Wulingyuan, northern section of Yongding, central, eastern, and southwestern regions of Cili, and southwestern part of Sangzhi. These areas are relatively small in Wulingyuan and are mostly located in urban areas and settlements on the outskirts of the Wulingyuan scenic area (Figure 14a). They are dominated by residential buildings, water conservation land, and permanent basic farmland, which do not support ecotourism development. The northern part of Yongding is part of the urban area of Zhangjiajie and is characterized by urban land use with a significant amount of farmland on the periphery, including permanent basic farmland essential for food security in China (Figure 14c). Similarly, in Cili, the unsuitable zones are larger due to the area’s topography, resulting in a flat topography that has developed into residential areas and farmland with limited ecotourism resources (Figure 14f,g). In Sangzhi, the unsuitable zones exhibited a northeast–southwest-oriented strip-like pattern influenced by the unique topography created by the local geological environment (Figure 14b).
Zones with moderate suitability exhibited a distribution pattern similar to that of the highly suitable zones, being primarily concentrated around their periphery. For example, the small mountain range in the southern part of the Wulingyuan urban area, Siduping Township and Tianle Mountain Village in Yongding, Wujiadai and Rhinoceros Villages in the southwestern part of Cili, and Linxiyuan and Niudongkou Villages in the northern part of Sangzhi fell into this category. These areas are located at the interface between human-intensive areas and mountainous regions. They offer both protective functions and development potential, with high vegetation cover and excellent soil and water conservation capacity. They also boast unique flora and fauna, making them well suited for future ecotourism activities. The marginally suitable zones, on the other hand, are primarily located in the transition zone between moderately suitable zones and unsuitable zones. These areas are characterized by high human activity and fragile ecosystems and should, therefore, be protected.

4.6. Contributions of Features

Machine learning algorithms rely on the features constructed to learn during the training phase. In tree models, the importance of a feature is determined by the frequency with which it is chosen as a split point or the degree of information gain resulting from the split. In this study, we used the feature importance attributes of the model to calculate and average the feature importance rankings for the RF, XGBoost, and AdaBoost tree models (Figure 15).
According to the model’s average ranking results for feature importance, temperature, precipitation, and soil conservation capacity were the top three features, accounting for 15.44%, 13.69%, and 13.83% of all the features, respectively. These three features are crucial in determining the ecological environment, which in turn affects ecotourism development. Temperature and precipitation are the key factors influencing the attractiveness of ecotourism destinations. Zhangjiajie’s moderate altitude and mountainous terrain create a comfortable environment for tourists, especially during the summer months when the region experiences a large influx of visitors seeking relief from the heat.
Furthermore, precipitation has a significant impact on the regional ecology and feasibility of ecotourism initiatives. Adequate precipitation is essential for vegetation growth and the development of unique ecological landscapes. Soil conservation directly affects a region’s environmental carrying capacity, which is crucial for sustainable ecotourism. Improved conservation capacity minimizes the negative effects of tourism on the environment. Additionally, factors such as relief height and forest age play significant roles, accounting for 10.82% and 9.11%, respectively. Increased relief height contributes to the formation of unique geological features, supporting the development of ecotourism resources with exceptional scenic value. Meanwhile, a higher forest age indicates greater potential for ecotourism resources focused on tall vegetation, offering visitors the opportunity to immerse themselves in nature and explore plant diversity.
The aforementioned findings are substantiated by evidence drawn from existing research [10,13,15,30,31,32,33,34,35,36] and collectively underpin the theoretical framework of the present study. Compared to previous findings, the rankings of the feature importance derived from the machine learning techniques are fundamentally distinct because they are based on the inherent attributes of the analyzed sample data, without relying on human intervention to predetermine factor weights. In contrast, the majority of existing studies employ methodologies such as the expert scoring approach or AHP [15] to assign factor weights prospectively, which culminates in a composite analysis to produce the final ecotourism suitability map. Consequently, the assignment of feature weights in traditional analytical frameworks is susceptible to researcher subjectivity. In contrast, machine learning methodologies, in which are based on data-driven insights, provide a more objective and impartial assessment.

5. Discussion

5.1. Model’s Fractional Anisotropy

Although data-driven machine learning can provide accurate predictions, it requires a substantial number of high-quality samples to perform effectively. Different training samples can lead to variations in models, emphasizing the importance of reliable training data to minimize such differences and improve the model’s strength and applicability. In the present study, we used the training samples from the Professional Surveying and Mapping Department of the Third Surveying and Mapping Institute of Hunan Province to ensure data reliability and reduce model divergence caused by inaccurate training samples. It is crucial to note that a model’s performance is significantly influenced by its parameters, which can also contribute to model divergence. To address these concerns and improve the model’s reliability, we controlled the parameters while maintaining the quality of the training samples. In our pursuit of higher accuracy, we utilized learning curves, grid search, cross-validation, and regularization techniques to prevent overfitting and improve the model’s applicability.

5.2. Impact of Land Policy

The development of ecotourism is closely related to local land policies that regulate it. These policies form a connected system that influences ecotourism. While restrictive land policies may hinder ecotourism to some extent, this is not inherently contradictory because ecotourism is based on environmental conservation. Different countries have different perspectives on land development, leading to variations in land policy formulations. Therefore, assessing the suitability of ecotourism must be based on an understanding of national and local land policy frameworks.
The evaluation’s findings will have a significant impact on the development of ecotourism policies, directly influencing land use planning. This can be observed through several factors. First, the results of the evaluation will guide the formulation of policies by identifying areas suitable for ecotourism and areas that require special protection, leading to the implementation of more precise and effective land policies such as land use restrictions, protection measures, and ecological compensation mechanisms. For example, at the turn of the century, the local jurisdiction implemented a land management strategy involving the conversion of farmlands back into forested areas. This initiative, known as the ‘returning farmland to forest’ policy, achieved two key objectives. It facilitated the rehabilitation of a degraded ecological environment and instituted an ecological compensation mechanism designed to provide financial support to local farmers, thus fostering a more sustainable relationship between human activities and the environment. Second, the evaluation results can be utilized to improve land use planning, especially in ecologically sensitive areas. By considering the needs of ecotourism activities, land use can be organized in a way that allows tourism to coexist harmoniously with the natural environment. Additionally, targeted land and soil remediation plans should be developed to address illegal land use, thereby restoring and enhancing land quality and productivity. Furthermore, the evaluation contributes to sustainable regional development by promoting scientific planning, the rational use of land resources, and the healthy development of ecotourism, leading to the coordinated development of regional environmental protection and economic growth. For instance, the locale’s extant land use policy, characterized by a dual emphasis on conservation and sustainable development, aims to balance the stringent protection of the ecological environment with the thoughtful expansion of eco-tourism activities. This approach not only preserves the natural habitat but also improves the economic wellbeing of the local population through the promotion of environmentally friendly tourism initiatives. Lastly, the evaluation results can enhance the efficiency of land policy implementation by clarifying policy objectives and standards, thus reducing subjectivity and arbitrariness in the policy implementation process [4]. This provides a basis for monitoring and evaluating policy implementation to ensure that policies are effectively implemented to achieve the desired outcomes. For example, the China Spatial Planning Observation Network (CSPON) provides real-time monitoring and early warning systems for the implementation of both national and regional spatial plans. This advancement significantly enhances the scientific rigor and effectiveness of spatial planning and management across the country.

5.3. Applied Range and Limitations

This study used machine learning and data mining to objectively evaluate the suitability of regional ecotourism. We used accurate and scientific sample data, which resulted in excellent prediction outcomes. Our approach effectively overcomes the limitations of traditional subjective methods and allows for independent research on ecotourism suitability within the study area. However, the effectiveness of our method relies on having a sufficient number of labeled samples, as machine learning models require a certain number of training samples to function optimally. Therefore, the practical application of our method may be limited by the availability of labeled data.

5.4. Future Work

Our experiment confirmed that using machine learning to evaluate ecotourism suitability can effectively identify and map potential ecotourism areas. This method provides more objective evaluation results because it does not rely on the manual determination of factor weights. However, it is crucial to obtain effective training samples with labels for this approach to be effective. In cases where training samples in the study area are insufficient or lacking, the method may have limitations at the algorithmic level. In such situations, future research should explore incorporating transfer learning to reuse models or samples from different domains for a more comprehensive assessment of ecotourism suitability.
Furthermore, a global-scale evaluation of ecotourism suitability remains to be conducted due to variations in the sample data and regional characteristics. In the future, it will be necessary to break down national and regional barriers and integrate land policies, regional cultures, and unique tourism resources for a more comprehensive evaluation.

6. Conclusions

This empirical study conducted in Zhangjiajie, China, integrated multisource geospatial data and machine learning techniques to evaluate the suitability of ecotourism in the region. Our findings demonstrated the effectiveness of using machine learning methods for regional ecotourism suitability evaluation, as they can achieve objective evaluation without human interference in determining factor weights. Furthermore, this study demonstrated the inherent limitations of the traditional method of human interference, which can be addressed by the proposed data-driven approach.
The AUC of the four models (XGBoost, AdaBoost, RF, and SVM) for the training data were 0.9884, 0.9660, 0.9648, and 0.9567, respectively. XGBoost performed the best, followed by AdaBoost and RF, whereas SVM performed the worst. To avoid relying on a single model, the prediction results were averaged. In the combined prediction results, highly suitable, moderately suitable, marginally suitable, and unsuitable areas for ecotourism accounted for 19.34%, 28.78%, 23.87%, and 28.01% of the total area, respectively.
The cartographic representation of the predicted outcomes delineates the spatial configuration of preexisting ecotourism destinations and accurately identifies prospective ecotourism resources. Though currently unexploited as ecotourism attractions, these locations showcase exceptional scenic and landscape appeal and hold significant developmental potential in the future. They frequently face constraints imposed by transportation infrastructure, topography, tourism-related amenities, and policies yet to be formulated; however, this presents an opportunity rather than a hindrance. Such conditions allow for enhanced ecological preservation—a fundamental requisite for the sustainability of ecotourism. Moreover, certain areas previously considered suitable for development have since been deemed inappropriate due to their ecological sensitivity and vulnerability. Consequently, the government has instituted a comprehensive suite of land use policies—including, but not limited to, ecological protection red lines, safeguards for drinking water sources, the delineation of permanent basic farmland, and restrictions on urban expansion—to conserve these regions. In the long term, these measures stand to confer immense benefits.
Future research should explore training samples and model selection. In regions with insufficient training samples or zero samples, transfer learning is a viable option as it enables model transfer and reuse even in the absence of training samples or zero samples in the target domain, thereby reducing the reliance on training samples in traditional machine learning methods.

Author Contributions

Conceptualization, Q.H. and C.Z.; methodology, Q.H., Y.M., and C.Z.; software, Q.H. and S.H.; validation, M.L. and C.Z.; formal analysis, M.L. and. Q.H.; investigation, Y.M. and. Q.H.; resources, M.L. and C.Z.; data curation, Q.H. and S.H.; writing—original draft preparation, Q.H.; writing—review and editing, Q.H., M.L., and C.Z.; visualization, Q.H., S.H., and Y.M.; supervision, M.L. and C.Z.; project administration, C.Z.; funding acquisition, C.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research & Development Program of China (grant number 2022YFC3800802), National Natural Science Foundation of China (grant number 42271414), and Natural Science Foundation of Jiangsu Province (grant number BK20231410).

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding authors.

Acknowledgments

We would like to thank the editors and reviewers of Land for their valuable feedback and suggestions that helped to improve the quality of our manuscript. We are also grateful to the members of our research team who contributed to the collection and analysis of the data used in this study. Finally, we thank the corresponding author for providing us with the necessary resources and support throughout the research process.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Gigović, L.; Pamučar, D.; Lukić, D.; Marković, S. GIS-Fuzzy DEMATEL MCDA model for the evaluation of the sites for ecotourism development: A case study of “Dunavski ključ” region, Serbia. Land Use Policy 2016, 58, 348–365. [Google Scholar] [CrossRef]
  2. Kaymaz, Ç.K.; Çakır, Ç.; Birinci, S.; Kızılkan, Y. GIS-Fuzzy DEMATEL MCDA model in the evaluation of the areas for ecotourism development: A case study of “Uzundere”, Erzurum-Turkey. Appl. Geogr. 2021, 136, 102577. [Google Scholar] [CrossRef]
  3. Ronizi, S.R.A.; Mokarram, M.; Negahban, S. Utilizing multi-criteria decision to determine the best location for the ecotourism in the east and central of Fars province, Iran. Land Use Policy 2020, 99, 105095. [Google Scholar] [CrossRef]
  4. Yee, J.Y.; Loc, H.H.; Le Poh, Y.; Vo-Thanh, T.; Park, E. Socio-geographical evaluation of ecosystem services in an ecotourism destination: PGIS application in Tram Chim National Park, Vietnam. J. Environ. Manag. 2021, 291, 112656. [Google Scholar] [CrossRef]
  5. Mileti, F.A.; Miranda, P.; Langella, G.; Pacciarelli, M.; De Michele, C.; Manna, P.; Bancheri, M.; Terribile, F. A geospatial decision support system for ecotourism: A case study in the Campania region of Italy. Land Use Policy 2022, 118, 106131. [Google Scholar] [CrossRef]
  6. Tajer, E.; Demir, S. Ecotourism strategy of UNESCO city in Iran: Applying a new quantitative method integrated with BWM. J. Clean. Prod. 2022, 376, 134284. [Google Scholar] [CrossRef]
  7. Hafezi, F.; Bijani, M.; Gholamrezai, S.; Savari, M.; Panzer-Krause, S. Towards sustainable community-based ecotourism: A qualitative content analysis. Sci. Total Environ. 2023, 891, 164411. [Google Scholar] [CrossRef]
  8. Brightsmith, D.J.; Stronza, A.; Holle, K. Ecotourism, conservation biology, and volunteer tourism: A mutually beneficial triumvirate. Biol. Conserv. 2008, 141, 2832–2842. [Google Scholar] [CrossRef]
  9. Santarem, F.; Campos, J.C.; Pereira, P.; Hamidou, D.; Saarinen, J.; Brito, J.C. Using multivariate statistics to assess ecotourism potential of water-bodies: A case-study in Mauritania. Tour. Manag. 2018, 67, 34–46. [Google Scholar] [CrossRef]
  10. Ayhan, Ç.K.; Taşlı, T.C.Z.; Özkök, F.; Tatlı, H. Land use suitability analysis of rural tourism activities: Yenice, Turkey. Tour. Manag. 2020, 76, 103949. [Google Scholar] [CrossRef]
  11. Dhami, I.; Deng, J.; Burns, R.C.; Pierskalla, C. Identifying and mapping forest-based ecotourism areas in West Virginia–Incorporating visitors’ preferences. Tour. Manag. 2014, 42, 165–176. [Google Scholar] [CrossRef]
  12. Jeong, J.S.; García-Moruno, L.; Hernández-Blanco, J.; Jaraíz-Cabanillas, F.J. An operational method to supporting siting decisions for sustainable rural second home planning in ecotourism sites. Land Use Policy 2014, 41, 550–560. [Google Scholar] [CrossRef]
  13. Ghasemi, M.; Charrahy, Z.; González-García, A. Mapping cultural ecosystem services provision: An integrated model of recreation and ecotourism opportunities. Land Use Policy 2023, 131, 106732. [Google Scholar] [CrossRef]
  14. Ambecha, A.B.; Melka, G.A.; Gemeda, D.O. Ecotourism site suitability evaluation using geospatial technologies: A case of Andiracha district, Ethiopia. Spat. Inf. Res. 2020, 28, 559–568. [Google Scholar] [CrossRef]
  15. Zabihi, H.; Alizadeh, M.; Wolf, I.D.; Karami, M.; Ahmad, A.; Salamian, H. A GIS-based fuzzy-analytic hierarchy process (F-AHP) for ecotourism suitability decision making: A case study of Babol in Iran. Tour. Manag. Perspect. 2020, 36, 100726. [Google Scholar] [CrossRef]
  16. Fadafan, F.K.; Soffianian, A.; Pourmanafi, S.; Morgan, M. Assessing ecotourism in a mountainous landscape using GIS–MCDA approaches. Appl. Geogr. 2022, 147, 102743. [Google Scholar] [CrossRef]
  17. Ruano, M.; Huang, C.-Y.; Nguyen, P.-H.; Nguyen, L.-A.T.; Le, H.-Q.; Tran, L.-C. Enhancing sustainability in Belize’s ecotourism sector: A fuzzy Delphi and fuzzy DEMATEL Investigation of Key Indicators. Mathematics 2023, 11, 2816. [Google Scholar] [CrossRef]
  18. Li, Y.; Zhang, S.; Han, J.; Zhao, Y.; Han, Q.; Wu, L.; Wang, X.; Qiu, Z.; Zou, T.; Fan, C. A study of the temporal and spatial variations in the suitability of the environment in Chinese cities for tourism and in strategies for optimizing the environment. Int. J. Digit. Earth 2022, 15, 527–552. [Google Scholar] [CrossRef]
  19. Thompson, B.S. Ecotourism anywhere? The lure of ecotourism and the need to scrutinize the potential competitiveness of ecotourism developments. Tour. Manag. 2022, 92, 104568. [Google Scholar] [CrossRef]
  20. Lee, J.-H. Conflict mapping toward ecotourism facility foundation using spatial Q methodology. Tour. Manag. 2019, 72, 69–77. [Google Scholar] [CrossRef]
  21. Cao, H.J.; Wang, M.; Su, S.L.; Kang, M.J. Explicit quantification of coastal cultural ecosystem services: A novel approach based on the content and sentimental analysis of social media. Ecol. Indic. 2022, 137, 108756. [Google Scholar] [CrossRef]
  22. Kim, Y.; Kim, C.-K.; Lee, D.K.; Lee, H.-W.; Andrada, R.I.T. Quantifying nature-based tourism in protected areas in developing countries by using social big data. Tour. Manag. 2019, 72, 249–256. [Google Scholar] [CrossRef]
  23. Zhang, X.; Zhao, T.; Xu, H.; Liu, W.; Wang, J.; Chen, X.; Liu, L. GLC_FCS30D: The first global 30-m land-cover dynamic monitoring product with a fine classification system from 1985 to 2022 using dense time-series Landsat imagery and continuous change-detection method. Earth Syst. Sci. Data Discuss. 2023, 16, 1353–1381. [Google Scholar] [CrossRef]
  24. Cheng, K.; Chen, Y.; Xiang, T.; Yang, H.; Liu, W.; Ren, Y.; Guan, H.; Hu, T.; Ma, Q.; Guo, Q. 2020 forest age map for China with 30 m resolution. Earth Syst. Sci. Data Discuss. 2023, 16, 803–819. [Google Scholar] [CrossRef]
  25. Bai, J.; Li, Y.; Li, J.; Yang, X.; Jiang, Y.; Xia, S.-T. Multinomial random forest. Pattern Recognit. 2022, 122, 108331. [Google Scholar] [CrossRef]
  26. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  27. Tavakkoli Piralilou, S.; Einali, G.; Ghorbanzadeh, O.; Nachappa, T.G.; Gholamnia, K.; Blaschke, T.; Ghamisi, P. A Google Earth Engine approach for wildfire susceptibility prediction fusion with remote sensing data of different spatial resolutions. Remote Sens. 2022, 14, 672. [Google Scholar] [CrossRef]
  28. Freund, Y.; Schapire, R.E. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 1997, 55, 119–139. [Google Scholar] [CrossRef]
  29. Xing, H.-J.; Liu, W.-T.; Wang, X.-Z. Bounded exponential loss function based AdaBoost ensemble of OCSVMs. Pattern Recognit. 2024, 148, 110191. [Google Scholar] [CrossRef]
  30. Guo, J.; Rong, Y.; Zhu, J.; Yan, Y.; Du, J.; Zheng, L.; Zhao, Y. Industrial development zoning with dual objectives of spatial development suitability and ecosystem service value a case study in Xiaonanhai Hydropower Station basin. Ecol. Indic. 2024, 158, 111522. [Google Scholar] [CrossRef]
  31. Chung, L.-W.; Chao, C.-L.; Liao, M.-C.; Kao, C.-W.; Lu, D.-J. Effectiveness and appropriateness of core areas in an integrated protected area: A case study of Ecological Protected Areas in Yangmingshan National Park, Taiwan. Environ. Sci. Policy 2023, 145, 175–187. [Google Scholar] [CrossRef]
  32. Qu, Y.; Wang, S.; Tian, Y.; Jiang, G.; Zhou, T.; Meng, L. Territorial spatial planning for regional high-quality development–An analytical framework for the identification, mediation and transmission of potential land utilization conflicts in the Yellow River Delta. Land Use Policy 2023, 125, 106462. [Google Scholar]
  33. Sobhani, P.; Esmaeilzadeh, H.; Wolf, I.D.; Deljouei, A.; Marcu, M.V.; Sadeghi, S.M.M. Evaluating the ecological security of ecotourism in protected area based on the DPSIR model. Ecol. Indic. 2023, 155, 110957. [Google Scholar] [CrossRef]
  34. Wang, L.; Huang, L.; Cao, W.; Zhai, J.; Fan, J. Assessing grassland cultural ecosystem services supply and demand for promoting the sustainable realization of grassland cultural values. Sci. Total Environ. 2024, 912, 169255. [Google Scholar] [CrossRef] [PubMed]
  35. Qiu, Y.; Zhou, A.; Li, M.; Guo, Y.; Ma, C. Territorial spatial usage regulation based on resources endowment and sustainable development: A case of Wuhan, China. J. Clean. Prod. 2023, 385, 135771. [Google Scholar] [CrossRef]
  36. Li, J.; He, H.; Chen, L.; Sun, R. Long-time Series Dataset of Soil Conservation Capacity Preventing Water Erosion in China (1992–2019). Earth Syst. Sci. Data Discuss. 2022, 2022, 1–24. [Google Scholar]
Figure 1. Geographic distribution map of the study area.
Figure 1. Geographic distribution map of the study area.
Land 13 01188 g001
Figure 2. Framework for ecotourism suitability assessment using machine learning.
Figure 2. Framework for ecotourism suitability assessment using machine learning.
Land 13 01188 g002
Figure 3. Ecological environment features: (a) NDVI, (b) Temperature, (c) Precipitation, (d) Humidity, (e) Land use.
Figure 3. Ecological environment features: (a) NDVI, (b) Temperature, (c) Precipitation, (d) Humidity, (e) Land use.
Land 13 01188 g003
Figure 4. Geology and landform features: (a) Geology, (b) Elevation, (c) Slope, (d) Aspect, (e) Relief.
Figure 4. Geology and landform features: (a) Geology, (b) Elevation, (c) Slope, (d) Aspect, (e) Relief.
Land 13 01188 g004
Figure 5. Humanities and society features: (a) Population, (b) NILI, (c) ROND, (d) DISO, (e) DISS.
Figure 5. Humanities and society features: (a) Population, (b) NILI, (c) ROND, (d) DISO, (e) DISS.
Land 13 01188 g005
Figure 6. Resource availability features: (a) ALSV, (b) Biodiversity, (c) SOCC, (d) Forest age, (e) DISI.
Figure 6. Resource availability features: (a) ALSV, (b) Biodiversity, (c) SOCC, (d) Forest age, (e) DISI.
Land 13 01188 g006
Figure 7. Heat map of feature correlations.
Figure 7. Heat map of feature correlations.
Land 13 01188 g007
Figure 8. Confusion matrix: (a) RF, (b) SVM, (c) XGBoost, (d) AdaBoost.
Figure 8. Confusion matrix: (a) RF, (b) SVM, (c) XGBoost, (d) AdaBoost.
Land 13 01188 g008
Figure 9. ROC curves of different models.
Figure 9. ROC curves of different models.
Land 13 01188 g009
Figure 10. Model prediction results: (a) RF, (b) SVM, (c) XGBoost, (d) AdaBoost.
Figure 10. Model prediction results: (a) RF, (b) SVM, (c) XGBoost, (d) AdaBoost.
Land 13 01188 g010
Figure 11. Prediction results of different machine learning models: (a) remote sensing image of Zhangjiajie downtown: (a-1) RF, (a-2) SVM, (a-3) XGBoost, (a-4) AdaBoost; (b) remote sensing image of Wulingyuan District: (b-1) RF, (b-2) SVM, (b-3) XGBoost, (b-4) AdaBoost.
Figure 11. Prediction results of different machine learning models: (a) remote sensing image of Zhangjiajie downtown: (a-1) RF, (a-2) SVM, (a-3) XGBoost, (a-4) AdaBoost; (b) remote sensing image of Wulingyuan District: (b-1) RF, (b-2) SVM, (b-3) XGBoost, (b-4) AdaBoost.
Land 13 01188 g011
Figure 12. Average of four machine learning model prediction results.
Figure 12. Average of four machine learning model prediction results.
Land 13 01188 g012
Figure 13. Typical regional validation of integrated forecast results. Existing ecotourism scenic spot prediction results with photographs: (a) Tianmen Mountain, (b) Wulingyuan, (c) Baofeng Lake, (d) Zhangjiajie Grand Canyon; potential ecotourism resource prediction results with photographs: (e) Smaller Tianmen Mountain, (f) Fairy Streams, (g) Guanyin Mountain, and (h) South Beach Prairie Park.
Figure 13. Typical regional validation of integrated forecast results. Existing ecotourism scenic spot prediction results with photographs: (a) Tianmen Mountain, (b) Wulingyuan, (c) Baofeng Lake, (d) Zhangjiajie Grand Canyon; potential ecotourism resource prediction results with photographs: (e) Smaller Tianmen Mountain, (f) Fairy Streams, (g) Guanyin Mountain, and (h) South Beach Prairie Park.
Land 13 01188 g013
Figure 14. Unsuitable and marginally suitable zones prediction results: (a) Wulingyuan urban area, (b) strips of settlements and mountains in Sangzhi County, (c) farmland and villages on the outskirts of Yongding District, (d) water conservation area in Goat Creek, (e) Cili County urban area, (f) Yanbodu township urban area, (g) sporadic villages and towns along provincial road 306 and permanent basic farmland, and (h) cultivated land and settlements along the Lishui riverbanks.
Figure 14. Unsuitable and marginally suitable zones prediction results: (a) Wulingyuan urban area, (b) strips of settlements and mountains in Sangzhi County, (c) farmland and villages on the outskirts of Yongding District, (d) water conservation area in Goat Creek, (e) Cili County urban area, (f) Yanbodu township urban area, (g) sporadic villages and towns along provincial road 306 and permanent basic farmland, and (h) cultivated land and settlements along the Lishui riverbanks.
Land 13 01188 g014
Figure 15. Feature importance ranking results.
Figure 15. Feature importance ranking results.
Land 13 01188 g015
Table 1. Data source and details.
Table 1. Data source and details.
DataYearResolutionSourceDescription
DEM202330 × 30 mhttps://www.usgs.gov/ (accessed on 15 April 2023)Basic data for terrain analysis.
Landsat 8202330 × 30 mhttps://www.usgs.gov/ (accessed on 8 March 2024)Used to calculate remote sensing indices such as the NDVI.
Geology1957–19951:200,000https://www.cgs.gov.cn/ (accessed on 12 March 2024)Used to characterize the regional geological environment.
Nighttime light2019130 × 130 mhttp://59.175.109.173:8888/index.html (accessed on 13 March 2024)The resolution of other data sources was too low; however, the Luojia1-01 data were only updated to 2019.
Meteorology1980–20221 × 1 kmhttp://loess.geodata.cn/data/ (accessed on 14 March 2024)Used to calculate the multi-year average temperature and precipitation.
Land use [23]202230 × 30 mhttps://doi.org/10.5281/zenodo.8239305 (accessed on 16 March 2024)The status of land use is a prerequisite and basis for ecotourism planning.
OSM2023-http://www.openstreetmap.org/ (accessed on 1 May 2024)Road data were extracted to calculate features such as road network density.
WorldPop2020100 × 100 mhttps://hub.worldpop.org/ (accessed on 17 March 2024)Global high-resolution population distribution dataset.
POI2023-https://lbs.amap.com/ (accessed on 31 December 2023)Crawled using Python web crawling techniques.
Ecological value2000–20201 × 1 kmhttps://www.resdc.cn/ (accessed on 18 March 2024)Five years of data (2000, 2005, 2010, 2015, and 2020) were averaged.
Forest age [24]202030 × 30 mhttps://doi.org/10.5281/zenodo.8354262 (accessed on 20 March 2024)To obtain the 2023 data, a value of 3 was added to the existing raster data.
Table 3. Evaluation and comparison of model performance metrics through ten-fold cross-validation.
Table 3. Evaluation and comparison of model performance metrics through ten-fold cross-validation.
MetricsAccuracyPrecisionRecallF1AUC
RF0.89370.89590.87570.88280.9610
SVM0.87170.86780.86140.86260.9355
XGBoost0.92070.91760.91730.91540.9721
AdaBoost0.89380.87520.91300.89090.9603
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Huang, Q.; Zhou, C.; Li, M.; Ma, Y.; Hua, S. An Approach for Mapping Ecotourism Suitability Using Machine Learning: A Case Study of Zhangjiajie, China. Land 2024, 13, 1188. https://doi.org/10.3390/land13081188

AMA Style

Huang Q, Zhou C, Li M, Ma Y, Hua S. An Approach for Mapping Ecotourism Suitability Using Machine Learning: A Case Study of Zhangjiajie, China. Land. 2024; 13(8):1188. https://doi.org/10.3390/land13081188

Chicago/Turabian Style

Huang, Qin, Chen Zhou, Manchun Li, Yu Ma, and Song Hua. 2024. "An Approach for Mapping Ecotourism Suitability Using Machine Learning: A Case Study of Zhangjiajie, China" Land 13, no. 8: 1188. https://doi.org/10.3390/land13081188

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop