Next Article in Journal
Genetic Parameters and Genotype–Environment Interactions in Paulownia Clonal Tests in Temperate and Subtropical Regions of China
Previous Article in Journal
Partitioning of Ambrosia Beetle Diversity on Teak Plantations in Java, Sumbawa, and Sulawesi Islands
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Different Modelling Approaches to Determine Suitable Areas for Conserving Egg-Cone Pine (Pinus oocarpa Schiede) Plus Trees in the Central Part of Mexico

by
Martin Enrique Romero-Sanchez
*,
Mario Valerio Velasco-Garcia
,
Ramiro Perez-Miranda
,
Efrain Velasco-Bautista
and
Antonio Gonzalez-Hernandez
Forest Ecosystems Enhancement and Conservation, National Institute of Forestry, Agriculture and Livestock Research, Coyocan, Mexico City 04010, Mexico
*
Author to whom correspondence should be addressed.
Forests 2022, 13(12), 2112; https://doi.org/10.3390/f13122112
Submission received: 4 December 2022 / Revised: 4 December 2022 / Accepted: 6 December 2022 / Published: 9 December 2022
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)

Abstract

:
Various spatial modelling methods and tools have been used in ecology and biogeography. The application of these options serves a dual function: first, they offer information about the potential distribution of species to understand the richness and diversity of unassessed areas. Second, spatial modelling methods employ these predictions to select relevant sites to determine natural conservation areas. In this study, we compared three methods for modelling the spatial distribution of Egg-cone Pine (Pinus oocarpa Schiede), an important non-timber pine in Mexico. The final goal is to estimate suitable areas for the conservation and reproduction of superior individuals (plus trees) of P. oocarpa as a conservation strategy outside the known distribution since this species possesses a high ecological and economic value. The model used were a generalised linear model (GLM) as a parametric regression method, random forest (RF) as a machine-learning method, and the MaxEnt model, a standard procedure, implemented using the Kuenm R package. The results suggest that the models used performed well since the AUROC was between 0.95 and 0.98 in all cases. MaxEnt and random forest approaches provided more conservative predictions for the distribution of suitable areas of plus trees of P. oocarpa than the generalised linear model, but the random forest algorithm achieved the best performance. The results of the study allowed the determination of ex situ conservation areas for P. oocarpa plus trees outside of their known distribution.

1. Introduction

Forest species have a specific role within the terrestrial ecosystem. They provide goods and services for human communities, sequester carbon from the atmosphere, provide shelter and food to other species, and support biodiversity at different scales [1,2]. However, forests have faced challenges due to anthropogenic factors, and lately, climate change has been considered another factor that pressures the conservation of forest species [3]. The importance of forest conservation strategies takes relevance since they face the task of protecting ecological processes required for the protection of biodiversity over time [4] and, simultaneously, ensuring the provision of ecosystem services to people [5].
Mexico possesses 49 out of the 120 known species of pines [6]; however, most are restricted to specific habitats or suffer some pressure caused by anthropogenic or environmental conditions [7]. Pines in the Mexican territory are found as pure pine forests, pine–oak, and oak–pine forest communities, occurring at various elevations, climates, and exposures [8], representing the ecological biodiversity of the Mexican landscape. In addition to the anthropogenic pressure to which forest species are subject due to their value (economic and social, among others), extreme climatic factors (global warming) represent risk factors for the natural distribution of species [9]. Climate change will likely change species’ range, composition, and abundance in different plant communities [10]. Therefore, special attention should be given to those pine species that require consideration due to their economic, ecological, or social value and their conditions and requirements when defining conservation strategies.
The National Forestry Commission of Mexico states that Pinus oocarpa Schiede ex Schltdl. is arguably the most adaptable to extreme conditions [11], and it is considered a highly adaptative species [12] because of its altitudinal pattern of genetic differentiation of growth [13]. This species is the most resin-producing pine in Mexico [14], and yet, there are no research programmes to propose strategies to increase and make production more efficient. Forest genetic improvement strategies are crucial for enhancing and conserving resin forest species [12]. Genetic improvement should be understood as a long-term process that starts with the base population (e.g., natural stands) to subsequently generate the selection of individuals with superior morphological characteristics, propagation, and improvement of the selected population, where selection and genetic testing are essential [15,16].
In the last five years, a forest genetic improvement program with P. oocarpa began in Mexico to increase resin production based on the selection of “plus trees”. To identify suitable areas for conservation ex-situ of these “plus trees” as a function of the ecological and climatic requirements of the pine and the site’s characteristics, it became essential to examine suitable areas for establishing plantations of P. oocarpa plus trees as a prominent research area in Mexico, especially using different modelling approaches. This study proposes spatial modelling for determining Pinus oocarpa plus tree natural distribution and areas as a tool for defining conservation strategies [17]. We based our analysis on the advantages that ecological niche models combined with spatial modelling and analytical tools (i.e., GIS) offer for ecological and biogeography purposes.
Since conservation strategies must rely on a solid basis, it is essential to determine areas where P. oocarpa trees with superior characteristics (plus trees) are distributed based on spatial modelling and niche ecological models. To ensure the best outcome, we compared three popular niche ecological methods for determining the possible spatial distribution of superior individuals of P. oocarpa. The models used were the generalised linear model (GLM) in its logistic regression form, as a parametric regression method, random forest (RF) as machine-learning method, and the MaxEnt model, as standard procedure. The final goal is to determine suitable areas for the conservation and reproduction of individuals of P. oocarpa that are morphologically superior (selected population) compared to the rest of the population. Our study hypothesised that the models would perform well, but each would define a different area to conserve and reproduce P. oocarpa plus trees.

2. Materials and Methods

2.1. Study Area

The study area was located within temperate forested regions in the central part of Mexico, composed mainly of the genus Pinus [8,18] but including fir, cedar, junipers, and other types of vegetation associated with coniferous trees. Mexican coniferous temperate forests are located primarily on high altitudes over mountainous zones originated by volcanos [8,19] (Figure 1).
The natural distribution of the sampling sites of trees with superior characteristics (resin production) of Pinus oocarpa Schiede is extended to most of the country (15 out of 32 states) in a variety of agroclimatic conditions (Table 1). Since the distribution of the species is too extensive, we focused our study on the central part of Mexico (Figure 1), specifically in the states of Mexico, Oaxaca, and Michoacan.

2.2. Data

2.2.1. Field Data

We conducted field surveys to collect data on the presence of P. oocarpa in the states of Oaxaca, Mexico, and Michoacán through visits to natural populations (Figure 1). We focused on resin-producing plus trees, which were labelled as optimal candidates for conservation/genetic purposes. A plus tree is a tree that is phenotypically outstanding in one or more traits of economic interest, in this case, for resin production. The objective of selecting plus trees is to use them as parents in breeding and production populations.
We recorded geographic coordinates (Datum WGS84), dasometric information (e.g., diameter at breast height, age, height), and other environmental variables at every location. Later, data on the presence of Pinus oocarpa trees were employed to model suitable areas for the distribution of the plus trees.
Since we only record presence data, we generated pseudo-absence observations assuming the same probability of occurrence of P. oocarpa [20]. Therefore, we use the “sdm” package [20] in the R software environment [21] to generate and use the “pseudo-absence” observations to compensate for the lack of “actual absence” data [22,23]. For analysis and modelling purposes, all data were divided for training (70%) and validation (30%) of the models.

2.2.2. Geographic Data

We used land cover and vegetation type maps at the national level at a scale of 1:250,000 (25-hectare minimum mapping unit geographic data) produced and published by the National Institute of Statistics and Geography of Mexico (INEGI, 2017). Using this layer (the sixth version of the land cover map), we isolated temperate forests where P. oocarpa is distributed and provided a spatial and temporal context for the analysis. Later, we create slope and aspect layers from a national digital elevation model (DEM) raster dataset with approximately 90 m pixel size for further analysis.

2.2.3. Climatic Layers

The climatic data layers used for modelling in this study were updated high-resolution (90 m pixel size) climatic surfaces for Mexico for the average monthly climate period 1910–2009 (https://rmets.onlinelibrary.wiley.com/doi/full/10.1002/joc.3848, accessed on 22 September of 2021). The layers correspond to monthly precipitation values, daily maximum and minimum temperature, and other bioclimatic variables derived from the monthly precipitation and temperature values [24] (Table 2).

2.3. Description of the Models Used

2.3.1. Generalised Linear Model (GLM): Logistic Regression

GLMs are mathematical extensions of linear models that do not force data into unnatural scales, allowing for non-linearity and non-constant variance structures in the data [25,26]. GLM consists of three components. First is the random component, the response variable, and probability distribution. Second is the systematic component, which represents the model’s predictors (X variables). The third is the link function, which links the random and systematic components. GLMs are considered parametric models because a probability distribution is specified for the response variable and, therefore, for the error terms from the model [27]. One significant application of GLMs in biology is to model binary response variables (e.g., presence/absence, alive/dead) in logistic regression, where the predictors can be either continuous and/or categorical. Logistic regression was selected for modelling the possible spatial distribution of the P. oocarpa plus trees.
In logistic regression, when the response variable is binary (i.e., categorical with two levels, zero or one), we model (x), the probability that Y equals one for a given value of X. The usual model we fit such data with is the logistic regression model, a nonlinear model with a sigmoidal shape. The change in the probability that Y equals one for a given change in X is most significant for values of X near the middle of its range rather than for values at the extremes. The error terms from the logistic model are not normally distributed; because the response variable is binary, the error terms have a binomial distribution. Then, the logistic regression model is Y = E Y + ε , and E Y = π is expressed as [28,29]:
E Y = π = exp X β 1 + exp X β
where E Y is the response variable taking only values of 1 and 0 with probabilities π and 1 − π , respectively.

2.3.2. Random Forest

The random forest algorithm is a machine learning method that can be defined as an ensemble of multiple decision trees at training time. This modelling method uses bagging (bootstrap aggregation) to combine many trees. Bagging involves taking many bootstrap samples from the training data, fitting a tree to each sample, and making an average prediction over all fitted trees [30]. In this study, we used the random forest method’s occurrence probabilities for P. oocarpa [31].
We implemented the random forest method by following the typical steps [32]: (i) the samples are bootstrapped from the original dataset to generate multiple sets (ntree) of training data; (ii) unpruned regression trees are created with the bootstrapped samples, and in each node of the tree, a subset of variables is selected randomly to define the split (mtry), and the best split is chosen; and (iii) predictions are made by averaging the predictions of the ntree regression trees. The random forest method was implemented inside the R environment using the random forest package [33].

2.3.3. MaxEnt

Maximum entropy ecological niche modelling (MaxEnt) [34] predicts species occurrences by applying a machine learning technique called maximum entropy modelling. From a set of environmental datasets and georeferenced points, the model estimates a probability distribution where each grid cell has the expected suitability of conditions for the species of interest. This study implemented the MaxEnt model employing the Kuenm package [35] in the R statistical software environment [21]. Kuenm is an R package for the detailed development of ecological niche models using MaxEnt. In essence, Kuenm automates essential calibration and evaluation steps using the Maxent modelling algorithm, systematises model calibration, creates final models and their transfers and evaluations, and assesses extrapolation risks.

2.3.4. Model’s Validation

We calculated all models’ sensitivity, specificity, and concordance indices as measures of goodness of fit. These metrics were calculated on the validation sample separated for validation purposes and independent from the training set. Later, we estimated the overall accuracy metric from the confusion matrix of each model. Additionally, we used all models’ calculated receiver operating characteristic (ROC) curves and the true skill statistic (TSS) for comparison purposes. Next, these two metrics were used to evaluate model performance according to previous studies [36,37,38]. It is worth noting that TSS has become popular due to its simplicity (sensitivity + specificity − 1) but robust and intuitive measure of the performance of species distribution models [38].

2.4. Identifying Areas for Conservation Purposes (Ex Situ) of the Species

After evaluation, we applied the models to prepare maps of suitable areas for P. oocarpa. First, we classified the different probability categories from each model to obtain a binary map (suitable, not suitable) based on the premise that all pixels with values above 0.56 were reclassified as suitable and pixels below 0.56 as non-suitable for the establishment of the species (Table 3).
Here we assumed that the three models used in this study estimate probability of occurrence of P. oocarpa plus trees. Once we produced binary maps (0′s and 1′s), we used the vegetation and land cover map provided by INEGI to isolate the categories “grasslands”, “secondary vegetation of temperate forests”, “no visible vegetation”, and “Agriculture” and used them as potential zones for the establishment of conservation areas of the species outside of natural pine forests. All the analysis and spatial processes were made inside the software Arc View® version 10.8 environment.

3. Results

After the field campaign, we identified and selected 285 individuals of P. oocarpa that demonstrated superior characteristics compared to the rest of the population. It is worth noting that the biggest tree was in the state of Michoacan (36.7 m height and 130 cm of diameter at breast height); according to Table 4, Michoacan possesses the trees with the best dimensions, followed by Oaxaca and Mexico State.
The spatial distribution of the P. oocarpa plus trees was characterised by their preference for high altitudes in the Trans-Mexican Volcanic Belt (above 1400 masl) for Mexico State and Michoacan and moderate slopes (10 to 40%). However, for Oaxaca’s case, the distribution range varies from 860 to 1390 masl and steep slopes (25 to 90%) (Figure 2).

3.1. Generalised Linear Model (GLM) and Logistic Regression Modelling

The generalised linear model family selected logistic regression to predict the potential distribution of P. oocarpa. Logistic regression presumes the probability distribution as the response variable; therefore, the error terms from the fitted model are effectively explained by the chosen random component [39]. Here the best model fitted used climatic variables related to precipitation (bio12, bio13, bio17, bio19, bio14), temperature (bio02 and bio04), and topography (slope) (Table 4).
The model predicted a probability of occurrence where values close to 1 represent areas where the species has better chances of being present or established. The area with a probability higher than 56%, as was stated before, was classified as suitable for the development of plus trees of P. oocarpa. Those areas were located at high elevations and well distributed across the study area (Figure 3).

3.2. Random Forests

We used a bagging procedure for averaging the outputs of different CARTs (classification and regression trees) for the random forest modelling process. Bagging stands for “bootstrap aggregation”. An essential feature of random forests is the out-of-bag samples, which means that the prediction/fit for a specific data point is only derived from averaging trees that did not include this data point during tree growing [31]. Thus, the output of random forests is essentially cross-validated. Random forests estimate variable importance by a permutation procedure, which measures for each variable the drop in mean accuracy when this variable is permutated. For our case, the variable importance can be seen in Figure 3; Bio04, Bio05, Bio13, and Bio16 were the most important for modelling (Figure 4).
The procedure used for the random forest model was a regression, using a total of 250 trees, and was able to explain 92.5% of the variability with a mean of squared residuals of 0.018. The final regression tree (Figure 5).
Finally, we applied the regression model to predict the spatial distribution of P. oocarpa plus trees and created a raster file using the function “writeRaster” within “R” software. Later the raster was exported to ArcGIS, and we classified all pixels above 0.56 (56% of probability) as suitable areas for the development of the trees (Figure 6).

3.3. MaxEnt

A total of 434 candidate models were analysed, with parameters reflecting all combinations of 14 regularisation multiplier settings (variables), 31 feature class combinations, and one distinct set of environmental variables. Model performance was evaluated based on statistical significance (partial ROC), omission rates (OR), and the Akaike information criterion corrected for small sample sizes (AICc) (Table 5).
The distribution model generated by MaxEnt for P. oocarpa was different concerning the scope of the prediction area; however, the model generated AUC values close to 1, which is supported by a fastness statistic. AUC values of 0.5 and close to 1 indicate that the generated model is better than the prediction. Table 6 summarises the performance statistics of applying the MaxEnt model using the Kuenm package to estimate the potential spatial distribution of plus trees of P. oocarpa. Table 7 shows the performance of the best model selected.
Figure 7 shows the position of the selected models in the distribution of all candidate models in terms of statistical significance, omission rates, and AICc values.
The MaxEnt model assigns the increase in gain to the environmental variables that are occupied, converts values to percentages, and creates a test to estimate the most critical variables in the model (jackknife), by using the Kuenm package. According to the results, the temperature was the most crucial factor in determining suitable areas for the species. Temperature seasonality, minimum temperature of the coldest month, mean temperature of the driest quarter, and precipitation of the wettest quarter accounted for more than 80% of the contribution of the predictive model (Table 8). It is worth noting that four variables related to temperature comprised about 70% of the contributions.
The potential spatial distribution generated for plus trees of P. oocarpa is shown in Figure 8, where the higher probabilities of the presence of the pine are in well-conservated forest areas and elevations over 2000 m above sea level (Figure 8).

3.4. Validation of the Models

In this work, the AUC value was taken as a primary measure of model performance since this metric provides a reliable and fast way to assess the performance of the distribution models [40]. The total potential distribution generated with the different approaches used in this work for plus trees of P. oocarpa indicates reliable modelling since the AUC was very high for all cases: 0.93, 0.99, and 0.97, for logistic regression, random forests, and MaxEnt, respectively (Figure 9).
As can be seen in Figure 9, the best performance was achieved using the random forest modelling approach. According to previous studies, random forests have been evidenced to be a machine learning algorithm that outperforms other methods [36,41,42]. Table 9 shows the performance of the models; it is worth noting that although accuracy was adequate in all cases, the random forest modelling demonstrated higher values for AUC, TSS, and accuracy.

3.5. Identifying Areas for Conservation Purposes (Ex Situ) of the Species

The land cover map distributed by INEGI has around 72 categories representing the types of vegetation present in the country. We performed discrimination of all the categories listed to focus on those potentially eligible for the establishment of conservation areas ex situ (plantations) of P. oocarpa plus trees. The categories “grasslands”, “secondary vegetation from temperate forests”, and “Agriculture” from the land cover map were selected as a target for this purpose (Table 10).
After evaluation, we applied the models to prepare maps of suitable areas for P. oocarpa, considering the above land categories. In all cases, according to the validation metrics used, the models demonstrated good agreement regarding accuracy; however, it is worth noting that the models estimated different amounts of a suitable surface for each one of the land use categories used. The spatial distribution of the suitable areas for the development of plus trees of P. oocarpa is shown in Figure 10.

4. Discussion

Numerous spatial modelling methods and tools have been developed recently, although they have mainly been used in ecology and biogeography for conservation purposes, predicting future impacts and determining suitable areas for species [42,43]. The use of these alternatives serves a dual function: firstly, they provide insights into the distribution potential of species to understand the richness and diversity of areas with little or no information. Secondly, they use these predictions to select sites of interest for determining biological conservation areas or establishing species where they do not naturally occur [17,44]. Spatial modelling to determine the natural and suitable ranges of the forest species of interest has been used to define strategies for conservation, planning, and management of the species in the face of climate change. For this purpose, spatial modelling and mainly ecological niche models, together with other analytical tools (e.g., geographic information systems, GIS), have been used to assess species’ climatic requirements and to model their distribution under current and future conditions [40,41,45].
In this study, we used the advantages of the ecological niche models combined with geographic tools to define areas suitable for establishing P. oocarpa plus trees according to their natural and potential distribution. The evaluation of suitable areas for plantation establishment remains a prominent area of research in Mexico, especially using different spatial modelling approaches, such as logistic regression, decision trees, multi-criteria analysis, and artificial intelligence algorithms. Using these tools is critical in identifying impacts on resin pine trees, considered species with a high adaptive capacity [8]; however, these species’ growth tolerance and survival intervals have not been described in the necessary detail. Therefore, it is considered essential to estimate suitable areas for ex-situ conservation based on the species ecological requirements and the site’s characteristics. These factors, mainly climatic, condition the growth and resin production [46], so it is important to determine them and use them in the planning processes to establish plantations.
Conservation strategies for species of interest include collecting, storing, and conservating germplasm in botanical gardens, germplasm banks, and plant tissue culture laboratories in Mexico [46]. P. oocarpa may have a high adaptive capacity. It is crucial to determine the tolerance intervals of the populations for good growth and survival rate. Therefore, it is essential to generate the potential zoning, a function of the species ecological requirements and the site’s characteristics, factors that condition the trees, and their production levels [47,48]. The delimitation of areas by productive potential allows for the efficient use of resources [49,50].

4.1. Implications and Conservation Strategies

Although it is considered that widely distributed species such as P. oocarpa may have a high adaptive capacity, it is essential to determine the environmental tolerance intervals of the populations for good growth and survival rate. For example, P. oocarpa seedlings from populations originating at lower altitudes tend to grow larger than seedlings originating from populations at higher altitudes [51]. Therefore, it is essential to generate the potential zoning, a function of the species ecological requirements and the site’s characteristics, factors that condition the trees, and their production levels [47,48]. The delimitation of areas by productive potential allows for the efficient use of resources [49,50].
Our results suggest that the best conditions for the optimal development of P. oocarpa are related to temperature seasonality (Bio04) and precipitation of the wettest season (Bio13), the first variable in all models and the former in two. Similarly, the growth of P. oocarpa seedlings in Michoacán, Mexico was related to the annual humidity index, annual temperature, precipitation, summer humidity index, and degree days of the source populations [51]. This highlights the importance of defining areas for the conservation and reproduction of plus trees of P. oocarpa through the modelling of potential distribution in this study. Likewise, shared garden and field trials are necessary for a second phase to define the geographical limits to establish conservation and reproduction plantations without sacrificing adaptation [13,51,52,53].
It is also important to highlight that the potential distribution modelled agrees with the ecological and climatic requirements of the species since P. oocarpa grows at altitudes between 700 and 2000 m above sea level, with regular dry seasons and severe drought (total rainfall of 500 mm) [11,12,47]. Since we are focused on plus trees, the ecological and climatic requirements were narrowed, as the results demonstrated, and the potential distribution was set in regions where this species grows naturally, where there are six months with an average rainfall of less than 50 mm. Additionally, all suitable areas are where the species grow naturally and are in eroded soils derived from ancient volcanic material and containing a large amount of quartz. It has been stated that the optimum temperature for the growth of this species is between 13 and 23 °C [47]. However, it can withstand certain minimum temperatures of 0 °C and maximum temperatures of 45 °C, and P. oocarpa can withstand sporadic frosts. Precipitation in the pine habitat is around 1300 mm per year, with a minimum of 500 and 2600 mm per year [14,54].
In Mexico, conservation strategies for species of interest are varied, including Protected Natural Areas (ANPs), ecological planning, Management Units for the Conservation of Wildlife (UMAs), payment for environmental services, and community forest management [N], as well as the collection, storage, and conservation of germplasm in botanical gardens, germplasm banks, and plant tissue culture laboratories in Mexico [55]. The P. oocarpa plus trees sampled in this study are not located in ANPs. The plus trees in Oaxaca are more than 60 km from the Lagunas de Chacahua National Park, with a difference of more than 500 m in elevation. The plus trees in the State of Mexico are 5.7 km from an ANP (Forest Protection Zone, the constitutive lands of the basins of the Valle de Bravo, Malacatepec, Tilostoc, and Temascaltepec rivers), but with a difference of 300 m in elevation. Meanwhile, the trees in Michoacán are more than 10 km from the Zicuirán-Infiernillo Biosphere Reserve, with more than 800 m of altitude difference. Due to the above, these trees’ in situ conservation strategy in PNAs is not an alternative.
Although in situ conservation is essential to renew genetic diversity and face future environmental changes, ex situ conservation is operationally convenient for short-term results [56]. Regarding the ex situ conservation of plus trees of P. oocarpa in PNA, of the 159,672.06 ha estimated with the RandomForest model as surface suitable for conservation, only 15.08 ha is located within the “Sierra de Huautla Biosphere Reserve”, 73.8 ha in the “Forest Protection Zone the constitutive lands of the basins of the Valle de Bravo, Malacatepec, Tilostoc and Temascaltepec rivers”, and 1251 ha in the “Zicuirán-Infiernillo Biosphere Reserve”. The preceding shows that only 0.84% of the area located within a PNA is suitable for conservation for plus trees of P. oocarpa; compare to sites that can be established plantations exclusively dedicated to ex situ conservation. This shows that the PNA network in Mexico does not adequately protect Pinus [57] species, so in situ and ex situ conservation activities must be proposed for each species [9].
Among the viable alternatives for the in situ conservation of P. oocarpa plus trees is the establishment of Management Units for the Conservation of Wildlife (UMAs), in which owners can make sustainable use of the habitat by harvesting seeds and, at the same time, work on the conservation of the species [17,55]. It is suggested that the main ex situ conservation strategy be the establishment of conservation plantations in the areas determined in this study. However, to mitigate the future effect of global climate change, conservation and production plantations, with plus tree germplasm of P. oocarpa, should be established at altitudes higher than their natural origin, up to 150 m [51].
Another ex situ conservation strategy is establishing plantations for resin production with plus tree germplasm of P. oocarpa in the suitable areas determined in this study. These production plantations must be linked to a genetic improvement program [15,16]. A forest genetic improvement program begins with the selection of the base population (natural stands) to obtain the selected population (plus trees). It continues with the propagation and breeding populations to generate the first improved generation, where genetic tests play a fundamental role [15,16]. The selected population comprises plus trees located in natural stands in the study area of this work; the care and maintenance of these trees are a viable in situ conservation strategy through UMAs. The population of propagation and crossing is constituted by the seed orchards, the product of the recurrent additional selection; in this case, three seed thefts were established with plus tree seeds in suitable areas determined in this study. These seed orchards constitute an essential in situ conservation strategy for P. oocarpa plus resin trees. Genetic tests such as progeny tests, which should be established in sites where it is required to test the performance of P. oocarpa plus trees, will constitute an ex situ conservation strategy [16].
For P. oocarpa, there is an altitudinal pattern of genetic growth differentiation [51], a significant linear relationship between latitude and longitude of populations with survival and frost tolerance [52], genetic groups correlated with geography, two centres of diversity [58], and interfamily variation in growth [13]. The preceding demonstrates the importance of conducting detailed studies to conserve P. oocarpa plus trees. Among the actions that will contribute to the conservation of P. oocarpa plus trees is the study of the reproductive potential (seed production and germination), the quality of plants, the chemical quality of resin, and the study of genetic diversity. Several of the above studies are underway to improve and conserve plus resin trees in Mexico. Although some natural populations of P. oocarpa are high [13,59], it is necessary to evaluate the genetic diversity of selected trees to generate guidelines for conservation and genetic improvement [16]. Superior phenotypes should be evaluated using progeny testing in the areas determined in this study to estimate genetic variation, genetic gain, heritability, and genetic correlations between various traits, which will allow the identification of genotypically superior families [15,16,53]. To evaluate these genetic parameters related to resin production, trees need to grow to a diameter sufficient to allow resin production.
To establish seed orchards, one must start with selecting individuals in natural stands, which may imply low genetic diversity; however, this depends on population size and other factors such as the number of parents in the orchard, seed orchard design, fecundity, and pollen contamination. Although seeds from seed orchards are used in large regions, very few studies have compared the genetic diversity in the orchard concerning the genetic diversity of the regions where they are planted [60]. Genetic variability between altitudinal gradients and between different natural populations of P. oocarpa has been assessed utilising isoenzymes [59], DNA markers [61], and quantitative morphological data [51]. However, there are no known studies on the genetic variability in high-quality seed orchards.

4.2. Model Selection and Implications

Ecological niche modelling as a set of analytical tools has been used in many different applications, including conservation strategies [17], climate change impacts [62], and forest insect outbreaks [63], among other applications in ecology and biogeography [64]. In the last couple of years, ecological niche modelling based on machine learning tools has gained popularity in determining forest species’ land suitability or potential spatial distribution [44,65,66]. Machine learning algorithms are relatively more flexible and accurate and require less time than traditional methods. In the last couple of years, machine learning methods have been proven to be more adequate for noisy or sparse data than many well-known methods [32].
Although computational improvements and programming efficiency facilitate the use of machine learning algorithms in ecological niche modelling, methods such as MaxEnt are still in use due to their simplicity of application and the flexibility of adaptation to several situations [22,23,24]. One good example is the Kuenm R package [38], which uses R and MaxEnt to enable detailed model calibration and selection, final model creation, and extrapolation risk analysis. In this study, the Kuenm R package results demonstrated promising results (AUC = 0.97, TSS = 0.76 and accuracy = 0.81) compared to those obtained using random forests (AUC = 0.99, TSS = 0.93 and accuracy = 0.96). In both cases, model selection, calibration, and validation are easy to follow, allowing the user to choose one method.
One significant advantage of using ML, e.g., random forests, is that there is no limitation on the number of predictor variables used compared to the traditional approaches. More traditional approaches, such as general linear and occupancy, depend on technical expertise to meet statistical assumptions and produce unbiased output [31,67], which makes machine learning methods even more attractive to use. Another advantage of the random forest model is its strength in efficiently recognising data patterns, lack of assumptions related to the properties of the data, user-friendly parameters, and ability to address the interactions between predictive variables [30] flexibly. Additionally, random forests modelling has been recognised to be solid for training with small sample sizes [32]. We confirmed the advantages and better performance of machine learning methods, such as random forest, over traditional methods as demonstrated by our results according to the comparison metrics used in this study.

4.3. Final Considerations

In this study, we determined the primary potential distribution and determined areas of possible interest to establish plus trees of P. oocarpa. It is essential to highlight that the spatial distribution and number of observations collected and retrieved from the diverse databases were essential factors in the performance of all models and the ability to predict the spatial distribution of the pine accurately. Although it was not part of this study, it would be worth performing an analysis by region, including other crucial variables such as distance to roads or population density, as they are considered factors in the vulnerability of Mexican forests [59]. It is important to highlight that land cover change affects suitable species distribution areas.
In this paper, we reported three methods for modelling the spatial distribution of modelling P. oocarpa, an important non-timber pine in Mexico. According to the results, we could estimate suitable areas for the conservation and reproduction of superior individuals (plus trees) of P. oocarpa as a conservation strategy outside the known distribution since this species possesses a high ecological and economic value. We had a limited number of observations from the field due to the specificity of the study and the morphological characteristics of the trees we were looking for; however, we considered the dataset to be enough and that it fulfilled the study’s objectives.
Although species distribution models (e.g., MaxEnt algorithm) can fit with limited presence-only data [64], random forests was considered a better option as it presented the best performance according to the metrics used compared to MaxEnt and logistic regression [16,65]. Still, some issues need to be considered before selecting the appropriate modelling method: the clarification of the niche concept, sampled data, parametrisation strategies, model selection, predictor contribution, and evaluation strategies [66]. That being said, our results demonstrated good agreement with the performance of all models according to the validation process.
Finally, we would like to highlight that the novelty of the work we presented in this paper is significant to forest planners and decision-makers because it provides evidence of the use of spatial modelling to define conservation areas for specific target species. The authors’ consideration for future research endeavours concerns the addition of the impacts of climate change on species distribution modelling to deliver more complete and accurate results for the operation of conservation plans and the genetic improvement programmes for Pinus oocarpa.

5. Conclusions

The models presented good performance; each estimated varied surfaces for the conservation and reproduction of P. oocarpa plus trees.
The random forest model performed better, providing a more conservative prediction. This study’s definition of suitable areas for conservation and reproduction contributes to defining ex situ conservation strategies for P. oocarpa plus trees.
Our findings agreed with previous reports that machine learning algorithms have superior performance compared to traditional ecological niche and spatial distribution modelling methods.

Author Contributions

Conceptualization, M.E.R.-S., M.V.V.-G., R.P.-M. and A.G.-H.; methodology, M.E.R.-S., R.P.-M. and A.G.-H.; formal analysis, M.E.R.-S. and E.V.-B.; data curation, M.V.V.-G.; writing—original draft preparation, M.E.R.-S., M.V.V.-G., R.P.-M., A.G.-H. and E.V.-B.; writing—review and editing, M.V.V.-G. and E.V.-B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Institute of Forestry, Agriculture, and Livestock Research, Mexico, through research grant No. 1218634780.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Acknowledgments

This study is part of the project “Selección fenotípica y establecimiento de Huertos Semilleros Sexuales de Pinus oocarpa para producción de resina” funded by the National Institute of Forestry, Agriculture, and Livestock Research of Mexico. The authors are deeply grateful for the comments and suggestions during the review process, which were helpful in improving the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hein, J.; Guarin, A.; Frommé, E.; Pauw, P. Deforestation and the Paris climate agreement: An assessment of REDD + in the national climate action plans. For. Policy Econ. 2018, 90, 7–11. [Google Scholar] [CrossRef]
  2. Türkeş, M.; Musaoğlu, N.; Özcan, O. Assessing the vulnerability of a forest ecosystem to climate change and variability in the western Mediterranean sub-region of Turkey: Future evaluation. J. For. Res. 2018, 29, 1177–1186. [Google Scholar] [CrossRef]
  3. Choe, H.; Thorne, J.H. Integrating climate change and land use impacts to explore forest conservation policy. Forests 2017, 8, 321. [Google Scholar] [CrossRef] [Green Version]
  4. Cazorla, B.P.; Cabello, J.; Peñas, J.; Garcillán, P.P.; Reyes, A.; Alcaraz-Segura, D. Incorporating Ecosystem Functional Diversity into Geographic Conservation Priorities Using Remotely Sensed Ecosystem Functional Types. Ecosystems 2021, 24, 548–564. [Google Scholar] [CrossRef]
  5. Mendoza-Ponce, A.; Corona-Núñez, R.; Kraxner, F.; Leduc, S.; Patrizio, P. Identifying effects of land use cover changes and climate change on terrestrial ecosystems and carbon stocks in Mexico. Glob. Environ. Change 2018, 53, 12–23. [Google Scholar] [CrossRef] [Green Version]
  6. Gernandt, D.S.; Pérez-De La Rosa, J.A. Biodiversidad de Pinophyta (coníferas) en México. Rev. Mex. Biodivers. 2014, 85, 126–133. [Google Scholar] [CrossRef] [Green Version]
  7. Escobar-Flores, J.G.; Lopez-Sanchez, C.A.; Sandoval, S.; Marquez-Linares, M.A.; Wehenkel, C. Predicting Pinus monophylla forest cover in the Baja California Desert by remote sensing. PeerJ 2018, 6, e4603. [Google Scholar] [CrossRef] [Green Version]
  8. Rzedowski, J. Vegetacion de Mexico, 1st ed.; Comision Nacional para el Uso y Conocimiento de la Biodiversidad: Mexico City, Mexico, 2006. [Google Scholar]
  9. Sáenz-Romero, C.; Rehfeldt, G.E.; Duval, P.; Lindig-Cisneros, R.A. Abies religiosa habitat prediction in climatic change scenarios and implications for monarch butterfly conservation in Mexico. For. Ecol. Manag. 2012, 275, 98–106. [Google Scholar] [CrossRef]
  10. Matsui, T.; Nakao, K.; Higa, M.; Tsuyama, I.; Kominami, Y.; Yagihashi, T.; Koide, D.; Tanaka, N. Potential impact of climate change on canopy tree species composition of cool-temperate forests in Japan using a multivariate classification tree model. Ecol. Res. 2018, 33, 289–302. [Google Scholar] [CrossRef]
  11. CONAFOR. La Produccion de Resina de Pino en Mexico; Guadalajara, Mexico, 2013; Volume 14. [Google Scholar]
  12. Flores, A.; Climent, J.; Pando, V.; López-Upton, J.; Alía, R. Intraspecific Variation in Pines from the Trans-Mexican Volcanic Belt Grown under Two Watering Regimes: Implications for Management of Genetic Resources. Forests 2018, 9, 71. [Google Scholar] [CrossRef]
  13. Fabián-Plesníková, I.; Sáenz-Romero, C.; Cruz de León, J.; Martínez-Trujillo, M.; Sánchez Vargas, N.M. Parámetros genéticos de caracteres de crecimiento en un ensayo de progenies de Pinus oocarpa. Madera y Bosques 2020, 26, e2632014. [Google Scholar] [CrossRef]
  14. Reyes-Ramos, A.; de León, J.C.; Martínez-Palacios, A.; Lobit, P.C.M.; Ambríz-Parra, J.E.; Sánchez-Vargas, N.M. Ecological and dendrometric characters in which influence resin production of pinus oocarpa of michoacán, mexico. Madera y Bosques 2019, 25, e2511414. [Google Scholar] [CrossRef]
  15. Zobel, B.; Talbert, J. Applied Forest Tree Improvement; John Wiley & Sons, Ltd.: New York, NY, USA, 1988. [Google Scholar]
  16. White, T.; Adams, W.; Neale, D. Forest Genetics; CABI Publishiung Series: New York, NY, USA, 2007; ISBN 0851993486. [Google Scholar]
  17. Romero-Sanchez, M.; Perez-Miranda, R.; Gonzalez-Hernandez, A.; Velasco-Garcia, M.; Velasco-Bautista, E.; Flores, A. Current and Potential Spatial Distribution of Six Endangered Pine Species of Mexico: Towards a Conservation Strategy. Forests 2018, 9, 767. [Google Scholar] [CrossRef] [Green Version]
  18. Prieto-Amparán, J.A.; Villarreal-Guerrero, F.; Martínez-Salvador, M.; Manjarrez-Domínguez, C.; Vázquez-Quintero, G.; Pinedo-Alvarez, A. Spatial near future modeling of land use and land cover changes in the temperate forests of Mexico. PeerJ 2019, 2019, 6617. [Google Scholar] [CrossRef] [Green Version]
  19. Gebhardt, S.; Wehrmann, T.; Ruiz, M.; Maeda, P.; Bishop, J.; Schramm, M.; Kopeinig, R.; Cartus, O.; Kellndorfer, J.; Ressl, R.; et al. MAD-MEX: Automatic Wall-to-Wall Land Cover Monitoring for the Mexican REDD-MRV Program Using All Landsat Data. Remote Sens. 2014, 6, 3923–3943. [Google Scholar] [CrossRef] [Green Version]
  20. Naimi, B.; Araújo, M.B. sdm: A reproducible and extensible R platform for species distribution modelling. Ecography 2016, 39, 368–375. [Google Scholar] [CrossRef] [Green Version]
  21. R Core Team. R: A Languaje and Environment for Statistical Computing; R Foundation: Vienna, Austria, 2020. [Google Scholar]
  22. Barbet-Massin, M.; Jiguet, F.; Albert, C.H.; Thuiller, W. Selecting pseudo-absences for species distribution models: How, where and how many? Methods Ecol. Evol. 2012, 3, 327–338. [Google Scholar] [CrossRef]
  23. Renteria, J.L.; Darin, G.M.S.; Grosholz, E.D. Assessing the risk of plant species invasion under different climate change scenarios in California. Invasive Plant Sci. Manag. 2021, 14, 172–182. [Google Scholar] [CrossRef]
  24. Cuervo-Robayo, A.P.; Téllez-Valdés, O.; Gómez-Albores, M.A.; Venegas-Barrera, C.S.; Manjarrez, J.; Martínez-Meyer, E. An update of high-resolution monthly climate surfaces for Mexico. Int. J. Climatol. 2014, 34, 2427–2437. [Google Scholar] [CrossRef]
  25. Hallgren, W.; Santana, F.; Low-Choy, S.; Zhao, Y.; Mackey, B. Species distribution models can be highly sensitive to algorithm configuration. Ecol. Modell. 2019, 408, 108719. [Google Scholar] [CrossRef]
  26. Sillero, N.; dos Santos, R.; Teodoro, A.C.; Carretero, M.A. Ecological niche models improve home range estimations. J. Zool. 2021, 313, 145–157. [Google Scholar] [CrossRef]
  27. Saha, S.; Saha, M.; Mukherjee, K.; Arabameri, A.; Ngo, P.T.T.; Paul, G.C. Predicting the deforestation probability using the binary logistic regression, random forest, ensemble rotational forest, REPTree: A case study at the Gumani River Basin, India. Sci. Total Environ. 2020, 730, 139197. [Google Scholar] [CrossRef] [PubMed]
  28. Agresti, A. An Introduction to Categorical Data Analysis; JohnWiley & Sons, Inc.: Hoboken, NJ, USA, 2007. [Google Scholar]
  29. Quinn, G.P.; Keough, M.J. Experimental Design and Data Analysis for Biologists; Cambridge University Press: Cambridge, UK, 2002; ISBN 9780521811286. [Google Scholar]
  30. Valavi, R.; Guillera-Arroita, G.; Lahoz-Monfort, J.J.; Elith, J. Predictive performance of presence-only species distribution models: A benchmark study with reproducible code. Ecol. Monogr. 2022, 92, 1486. [Google Scholar] [CrossRef]
  31. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  32. Luan, J.; Zhang, C.; Xu, B.; Xue, Y.; Ren, Y. The predictive performances of random forest models with limited sample size and different species traits. Fish. Res. 2020, 227, 105534. [Google Scholar] [CrossRef]
  33. Liam, A.; Wiener, M. Classification and Regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
  34. Phillips, S.J.; Dudík, M. Modeling of species distribution with Maxent: New extensions and a comprehensive evalutation. Ecograpy 2008, 31, 161–175. [Google Scholar] [CrossRef]
  35. Cobos, M.E.; Townsend Peterson, A.; Barve, N.; Osorio-Olvera, L. Kuenm: An R package for detailed development of ecological niche models using Maxent. PeerJ 2019, 2019, 6281. [Google Scholar] [CrossRef] [Green Version]
  36. Mi, C.; Huettmann, F.; Guo, Y.; Han, X.; Wen, L. Why choose Random Forest to predict rare species distribution with few samples in large undersampled areas? Three Asian crane species models provide supporting evidence. PeerJ 2017, 5, e2849. [Google Scholar] [CrossRef] [Green Version]
  37. Hijmans, R.J.; Graham, C.H. The ability of climate envelope models to predict the effect of climate change on species distributions. Glob. Chang. Biol. 2006, 12, 2272–2281. [Google Scholar] [CrossRef]
  38. Allouche, O.; Tsoar, A.; Kadmon, R. Assessing the accuracy of species distribution models: Prevalence, kappa and the true skill statistic (TSS). J. Appl. Ecol. 2006, 43, 1223–1232. [Google Scholar] [CrossRef]
  39. Santini, L.; Benítez-López, A.; Maiorano, L.; Čengić, M.; Huijbregts, M.A.J. Assessing the reliability of species distribution projections in climate change research. Divers. Distrib. 2021, 27, 1035–1050. [Google Scholar] [CrossRef]
  40. Phillips, S.J.; Anderson, R.P.; Schapire, R.E. Maximum entropy modeling of species geographic distributions. Ecol. Modell. 2006, 190, 231–259. [Google Scholar] [CrossRef] [Green Version]
  41. Paquit, J.C.; Pampolina, N.M.; Tiburan, C.L.; Manalo, M.M.Q. Maxent modeling of the habitat distribution of the critically endangered Pterocarpus indicus Willd. forma indicus In Mindanao, Philippines. J. Biodivers. Environ. Sci. 2017, 10, 2222–3045. [Google Scholar]
  42. Ray, D.; Behera, M.D.; Jacob, J. Evaluating Ecological Niche Models: A Comparison Between Maxent and GARP for Predicting Distribution of Hevea brasiliensis in India. Proc. Natl. Acad. Sci. India Sect. B Biol. Sci. 2017, 88, 1337–1343. [Google Scholar] [CrossRef]
  43. Guisan, A.; Zimmermann, N.E. Predictive habitat distribution models in ecology. Ecol. Modell. 2000, 135, 147–186. [Google Scholar] [CrossRef]
  44. Cheng, Y.; Tjaden, N.B.; Jaeschke, A.; Thomas, S.M.; Beierkuhnlein, C. Using centroids of spatial units in ecological niche modelling: Effects on model performance in the context of environmental data grain size. Glob. Ecol. Biogeogr. 2021, 30, 611–621. [Google Scholar] [CrossRef]
  45. Martínez-Sifuentes, A.R.; Villanueva-Díaz, J.; Manzanilla-Quiñones, U.; Becerra-López, J.L.; Hernández-Herrera, J.A.; Estrada-ávalos, J.; Velázquez-Pérez, A.H. Spatial modeling of the ecological niche of pinus greggii engelm. (pinaceae): A species conservation proposal in mexico under climatic change scenarios. iForest 2020, 13, 426–434. [Google Scholar] [CrossRef]
  46. Lombardero, M.J.; Ayres, M.P.; Lorio, P.L., Jr.; Ruel, J.J. Environmental effects on constitutive and inducible resin defences of Pinus taeda. Ecol. Lett. 2000, 3, 329–339. [Google Scholar] [CrossRef] [Green Version]
  47. López-Hernández, M.I.; Cerano-Paredes, J.; Valencia-Manzo, S.; Cornejo-Oviedo, E.H.; Villanueva-Díaz, J.; Cervantes-Martínez, R.; Esquivel-Arriaga, G. Respuesta del crecimiento de Pinus oocarpa Schiede a variables climáticas en el noroeste de Lagunas de Montebello, Chiapas, México. Rev. Biol. Trop. 2018, 66, 1580–1596. [Google Scholar] [CrossRef] [Green Version]
  48. Egloff, P. Tapping Pinus Oocarpa. Assessing Drivers of Resin Yield in Natural Stands of Pinus Oocarpa. Master’s Thesis, Wageningen University & Research, Wageningen, The Netherlands, 2020; pp. 1–29. [Google Scholar]
  49. Ahmad, F.; Uddin, M.M.; Goparaju, L.; Dhyani, S.K.; Oli, B.N.; Rizvi, J. Tree suitability modeling and mapping in Nepal: A geospatial approach to scaling agroforestry. Model. Earth Syst. Environ. 2021, 7, 169–179. [Google Scholar] [CrossRef]
  50. Rattis, L.; Dobrovolski, R.; Talebi, M.; Loyola, R. Geographic range-scale assessment of species conservation status: A framework linking species and landscape features. Perspect. Ecol. Conserv. 2018, 16, 97–104. [Google Scholar] [CrossRef]
  51. Sáenz-Romero, C.; Guzmán-Reyna, R.R.; Rehfeldt, G.E. Altitudinal genetic variation among Pinus oocarpa populations in Michoacán, Mexico. Implications for seed zoning, conservation, tree breeding and global warming. For. Ecol. Manage. 2006, 229, 340–350. [Google Scholar] [CrossRef]
  52. de Waal, L.; Mitchell, R.G.; Hodge, G.R.; Chirwa, P.W. The use of field and artificial freezing studies to assess frost tolerance in natural populations of Pinus oocarpa. South. For. J. For. Sci. 2018, 80, 195–208. [Google Scholar] [CrossRef]
  53. Fabián-Plesníková, I.; Sáenz-Romero, C.; Cruz-De-León, J.; Martínez-Trujillo, M.; Sánchez-Vargas, N.M.; Terrazas, T. Heritability and characteristics of resin ducts in Pinus oocarpa stems in Michoacán, Mexico. IAWA J. 2021, 42, 258–278. [Google Scholar] [CrossRef]
  54. Gomez-Mendoza, L.; Arriaga, L. Modeling the Effect of Climate Change on the Distribution of Oak and Pine Species of Mexico. Conserv. Biol. 2007, 21, 1545–1555. [Google Scholar] [CrossRef]
  55. Pisanty, I.; Urquiza-Haas, E.; Vargas-Mena y Amezcua, A.; Ruiz-González, S.P.; Urquiza-Haas, T.; García-Méndez, G. Instrumentos de conservación in situ en México: Logros y retos. In Capital Natural de México; Sarukhán, J., Pisanty, I., Eds.; CONABIO: Mexico City, Mexico, 2016; pp. 245–302. ISBN 978-607-8328-69-7. [Google Scholar]
  56. Yang, R.C.; Yeh, F.C. 94Z/00002 Genetic consequences of in situ and ex situ conservation of forest trees. Biol. Conserv. 1995, 71, 211. [Google Scholar] [CrossRef]
  57. Gutiérrez, J.A.; Duivenvoorden, J.F. Can we expect to protect threatened species in protected areas? A case study of the genus Pinus in Mexico. Rev. Mex. Biodivers. 2010, 81, 875–882. [Google Scholar]
  58. Dvorak, W.S.; Potter, K.M.; Hipkins, V.D.; Hodge, G.R. Genetic Diversity and Gene Exchange in Pinus oocarpa, a Mesoamerican Pine with Resistance to the Pitch Canker Fungus (Fusarium circinatum). Int. J. Plant Sci. 2009, 170, 609–626. [Google Scholar] [CrossRef] [Green Version]
  59. Sáenz-Romero, C.; Tapia-Olivares, B.L. Pinus oocarpa isoenzymatic variation along an altitudinal gradient in Michoacán, México. Silvae Genet. 2003, 52, 237–240. [Google Scholar]
  60. Sønstebø, J.H.; Tollefsrud, M.M.; Myking, T.; Steffenrem, A.; Nilsen, A.E.; Edvardsen, Ø.M.; Johnskås, O.R.; El-Kassaby, Y.A. Genetic diversity of Norway spruce (Picea abies (L.) Karst.) seed orchard crops: Effects of number of parents, seed year, and pollen contamination. For. Ecol. Manag. 2018, 411, 132–141. [Google Scholar] [CrossRef]
  61. Díaz, V.; Muñiz, L.M.; Ferrer, E. Random amplified polymorphic DNA and amplified fragment length polymorphism assessment of genetic variation in Nicaraguan populations of Pinus oocarpa. Mol. Ecol. 2001, 10, 2593–2603. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  62. Joyce, D.G.; Rehfeldt, G.E. Climatic niche, ecological genetics, and impact of climate change on eastern white pine (Pinus strobus L.): Guidelines for land managers. For. Ecol. Manag. 2013, 295, 173–192. [Google Scholar] [CrossRef] [Green Version]
  63. González-Hernández, A.; Morales-Villafaña, R.; Romero-Sánchez, M.E.; Islas-Trejo, B.; Pérez-Miranda, R. Modelling potential distribution of a pine bark beetle in Mexican temperate forests using forecast data and spatial analysis tools. J. For. Res. 2020, 31, 649–659. [Google Scholar] [CrossRef]
  64. Melo-Merino, S.M.; Reyes-Bonilla, H.; Lira-Noriega, A. Ecological niche models and species distribution models in marine environments: A literature review and spatial analysis of evidence. Ecol. Modell. 2020, 415, 108837. [Google Scholar] [CrossRef]
  65. Zhang, L.; Huettmann, F.; Zhang, X.; Liu, S.; Sun, P.; Yu, Z.; Mi, C. The use of classification and regression algorithms using the random forests method with presence-only data to model species’ distribution. MethodsX 2019, 6, 2281–2292. [Google Scholar] [CrossRef]
  66. Talukdar, S.; Singha, P.; Mahato, S.; Pal, S.; Liou, Y.A.; Rahman, A. Land-use land-cover classification by machine learning classifiers for satellite observations-A review. Remote Sens. 2020, 12, 1135. [Google Scholar] [CrossRef] [Green Version]
  67. Evans, J.S.; Cushman, S.A. Gradient modeling of conifer species using random forests. Landsc. Ecol. 2009, 24, 673–683. [Google Scholar] [CrossRef]
Figure 1. Study area.
Figure 1. Study area.
Forests 13 02112 g001
Figure 2. Spatial distribution of the Pinus oocarpa plus trees selected.
Figure 2. Spatial distribution of the Pinus oocarpa plus trees selected.
Forests 13 02112 g002
Figure 3. Potential spatial distribution of Pinus oocarpa using logistic regression.
Figure 3. Potential spatial distribution of Pinus oocarpa using logistic regression.
Forests 13 02112 g003
Figure 4. Variable importance in random forest model.
Figure 4. Variable importance in random forest model.
Forests 13 02112 g004
Figure 5. Regression tree for Pinus oocarpa using random forests.
Figure 5. Regression tree for Pinus oocarpa using random forests.
Forests 13 02112 g005
Figure 6. Potential spatial distribution of Pinus oocarpa using random forests.
Figure 6. Potential spatial distribution of Pinus oocarpa using random forests.
Forests 13 02112 g006
Figure 7. Distribution of all models, non-statistically significant models, and selected models.
Figure 7. Distribution of all models, non-statistically significant models, and selected models.
Forests 13 02112 g007
Figure 8. Potential spatial distribution of P. oocarpa using MaxEnt.
Figure 8. Potential spatial distribution of P. oocarpa using MaxEnt.
Forests 13 02112 g008
Figure 9. AUC curves of Pinus oocarpa using logistic regression, random forest, and MaxEnt.
Figure 9. AUC curves of Pinus oocarpa using logistic regression, random forest, and MaxEnt.
Forests 13 02112 g009
Figure 10. Spatial distribution of suitable areas for developing plus trees of Pinus oocarpa.
Figure 10. Spatial distribution of suitable areas for developing plus trees of Pinus oocarpa.
Forests 13 02112 g010
Table 1. Characteristics of the Pinus oocarpa reported in the literature *.
Table 1. Characteristics of the Pinus oocarpa reported in the literature *.
Height (m)Diameter at Breast Height (cm)Altitude (masl)Temp (°C)Precipitation (mm)
12–1840–75Min 1000
Max. 2400 Mean 1800
Min. 3
Max. 35 Mean 19
Min. 650
Max. 2600 Mean 1300
*: National Forestry Commission.
Table 2. Climatic and environmental variables used in this study.
Table 2. Climatic and environmental variables used in this study.
Environmental VariablesUnitsVariable Code
Annual mean temperature°CBio1
Mean diurnal range (mean of monthly (max temp–min temp))°CBio2
Isothermality (BIO2/BIO7) (multiply by 100)°CBio3
Temperature seasonality (standard deviation multiplied by 100)%Bio4
Max temperature of the warmer month°CBio5
Min temperature of the coldest month°CBio6
Temperature annual range (Bio5-Bio6)°CBio7
Mean temperature of wettest quarter°CBio8
Mean temperature of driest quarter°CBio9
Mean temperature of warmest quarter°CBio10
Mean temperature of coldest quarter°CBio11
Total precipitationMmBio12
Precipitation of the wettest seasonmmBio13
Precipitation of the driest seasonmmBio14
Precipitation seasonality (coefficient of variation)%Bio15
Precipitation of the wettest quartermmBio16
Precipitation of the driest quartermmBio17
Precipitation of warmer quartermmBio18
Precipitation of the coldest quartermmBio19
AltitudemaslAltitude
AspectDegrees (°)Aspect
SlopeDegrees (°)Slope
Table 3. Parameters to classify land suitability of Pinus oocarpa trees.
Table 3. Parameters to classify land suitability of Pinus oocarpa trees.
Probability of OccurrenceCategoryNew Pixel Value
0.0 to 0.1Non-suitable0
0.12 to 0.25Non-suitable0
0.26 to 0.40Non-suitable0
0.40 to 0.55Non-suitable0
0.56 to 1 Suitable1
Table 4. Characteristics of the Pinus oocarpa trees used in the study.
Table 4. Characteristics of the Pinus oocarpa trees used in the study.
LocationMean Height (SD)Mean Diameter at Breast Height (SD)Age (SD)Observations of Presence
Mexico State19.05 (4.98)44.68 (12.11)71 (20)100
Michoacan21.19 (5.47)59.76 (15.75)62 (20)99
Oaxaca19.86 (4.96)47.04 (10.33)63 (23)86
SD: standard deviation.
Table 5. Parameters for modelling potential distribution using the GML approach.
Table 5. Parameters for modelling potential distribution using the GML approach.
ModelEstimatesStandard Errorp Value
π ^ = l o g e π ^ 1 π ^ = β 0 + β 1 b i o 12 + β 2 b i o 13 + β 3 b i o 17 + β 4 s l o p e + β 5 B i o 02 + β 6 b i o 04 + β 7 b i o 19 + β 8 b i o 14 2.6049
0.0435
−0.1623
−0.3899
−0.2800
−0.6139
0.0138
0.0980
3.5611
0.0050
0.0195
0.0926
0.0640
0.2743
0.0110
0.02747
0.4644
<0.0001
<0.0001
<0.0001
<0.0001
0.025
0.2077
<0.0001
Table 6. General statistics of models that met distinct criteria.
Table 6. General statistics of models that met distinct criteria.
CriteriaNumber of Models
All candidate models434
Statistically significant models434
Models meeting omission rate criteria108
Models meeting AICc criteria8
Statistically significant models meeting omission rate criteria108
Statistically significant models meeting AICc criteria8
Statistically significant models meeting omission rate and AICc criteria1
Table 7. Performance statistics for the best models selected.
Table 7. Performance statistics for the best models selected.
ModelMean AUC RatioPartial ROCOmission Rate at 5%AICcW_AICcParameters
Pinus oocarpa1.9400.0497394.2120.94724
Table 8. Top predictors and their percentage contribution reported from the Maxent model for Pinus oocarpa.
Table 8. Top predictors and their percentage contribution reported from the Maxent model for Pinus oocarpa.
Predictor VariableContribution (%)
Temperature seasonality (Bio 4)20.50
Min temperature of the coldest month (Bio 6)18.16
Mean temperature of the driest quarter (Bio09)15.44
Precipitation of wettest quarter (Bio16)13.21
Precipitation seasonality (Bio 15)8.38
Precipitation of the wettest season (Bio 13)5.60
Table 9. Validation metrics used for comparison purposes.
Table 9. Validation metrics used for comparison purposes.
ModelAUCTSSAccuracy
Logistic0.930.670.83
Random forest0.990.930.96
MaxEnt0.970.760.81
Table 10. Land suitability for the establishment of plus trees of P. oocarpa.
Table 10. Land suitability for the establishment of plus trees of P. oocarpa.
Suitable Area (ha)
Land UseLogisticRandom ForestMax Ent
Agriculture85,497.9325,446.158689.68
Grasslands139,452.829,205.3612,729.15
No vegetation220.32150.660
Secondary vegetation576,968.7104,869.8939,681.9
Total802,139.8159,672.0661,100.73
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Romero-Sanchez, M.E.; Velasco-Garcia, M.V.; Perez-Miranda, R.; Velasco-Bautista, E.; Gonzalez-Hernandez, A. Different Modelling Approaches to Determine Suitable Areas for Conserving Egg-Cone Pine (Pinus oocarpa Schiede) Plus Trees in the Central Part of Mexico. Forests 2022, 13, 2112. https://doi.org/10.3390/f13122112

AMA Style

Romero-Sanchez ME, Velasco-Garcia MV, Perez-Miranda R, Velasco-Bautista E, Gonzalez-Hernandez A. Different Modelling Approaches to Determine Suitable Areas for Conserving Egg-Cone Pine (Pinus oocarpa Schiede) Plus Trees in the Central Part of Mexico. Forests. 2022; 13(12):2112. https://doi.org/10.3390/f13122112

Chicago/Turabian Style

Romero-Sanchez, Martin Enrique, Mario Valerio Velasco-Garcia, Ramiro Perez-Miranda, Efrain Velasco-Bautista, and Antonio Gonzalez-Hernandez. 2022. "Different Modelling Approaches to Determine Suitable Areas for Conserving Egg-Cone Pine (Pinus oocarpa Schiede) Plus Trees in the Central Part of Mexico" Forests 13, no. 12: 2112. https://doi.org/10.3390/f13122112

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop