1. Introduction
Chinese fir (
Cunninghamia lanceolata (Lamb.) Hook.), a diploid species (2n = 2x = 22), is an important fast-growing timber tree in southern China [
1]. The species occupies approximately 25% of plantations in subtropical areas of southern China [
2], and the wood of the Chinese fir is widely used in building, decoration, and furniture production [
3]. Chinese fir forests provide considerable ecological benefits, including carbon sequestration, soil conservation, and increasing groundwater storage [
4,
5].
Karst landscapes represent an important facet of the Earth’s geodiversity, and the karst geo-ecosystems are fragile environments. Guangxi province, with an area of 2.38 × 10
5 km
2, is home to one of the largest karst areas in China, and the distribution area of carbonate rock accounts for 41.57% of the total area [
6]. The areas are rocky desertification, which is characterized by soil erosion and progressive degradation and is associated with very low land productivity [
7]. Chinese fir cannot grow in karst areas which would fragment the distribution habitat of the species.
The effects of habitat fragmentation can cause severe consequences on trees, such as reduced effective population size, increased inbreeding, loss of genetic diversity, and a discontinuous pattern of genetic variation, mainly due to the limited gene flow [
8,
9]. The central–marginal hypothesis states that marginal populations show higher differentiation and less genetic variation than central populations [
10]. Marginal populations are often small, spatially isolated, and occur in different or extreme habitats. Marginal populations have long caught the attention of conservation biologists because they provide insights into marked genetic differentiation from populations and the roles of adaptation [
11]. Current marginal populations may become future evolutionary units, especially faced with the problem of ongoing global climatic changes [
12]. Guangxi is the southernmost province in China where Chinese fir grows. Marginal populations of Chinese fir identified in the area may be of great significance for the evolutionary process and special conservation interest for the species.
Successful conservation, management, and utilization strategies for a species require an accurate assessment of the genetic variation of the population. Here, we present the first survey of genetic variation of Chinese fir in sites around the karst landform in Guangxi, China, with novel and highly polymorphic SSR markers. We hypothesize that there is an increasing genetic differentiation of Chinese fir from the north to the south of the karst landform. A second hypothesis is that a marginal population of Chinese fir exists in the south of the karst landform in Guangxi. The main goals are to (a) develop neutral EST-SSR markers with low scoring error rates for Chinese fir, (b) assess the levels of genetic diversity and differentiation among sampling localities around the karst landform, and (c) identify marginal populations of Chinese fir around the landform in the southernmost province of China.
2. Methods
2.1. Plant Sample Collection
Chinese fir is a coniferous and indigenous tree species in subtropical southern China (
Figure 1). The species is monoecious and predominantly outcrossing. From October to December 2019, 330 individuals of Chinese fir were collected from 11 counties (
Table 1) around the karst area in Guangxi, with 30 trees being collected from each county. The age of each individual was greater than 30 years, and the distances between trees were at least 50 m. Tree age was estimated by counting growth rings of increment core sampled from breast height of Chinese firs. For each county, every group of 10 plants was considered a repeat, and the interval between three repeats was greater than 10 km. The geographical distribution of the 11 sampling localities is shown in
Figure 2. Fresh leaves without diseases and insect pests were picked and immediately transported back to the laboratory in liquid nitrogen and stored at −80 °C until use.
2.2. DNA Extraction and SSR Genotyping
Total DNA of Chinese fir leaves was extracted using a Rapid Plant Genomic DNA Isolation Kit (Sangon Biotech, Shanghai, China) according to the manufacturer’s protocol. The purity and integrity of the DNA were detected using 1% agarose gel. A Nanodrop-2000 ultramicro spectrophotometer (Thermo Fisher Scientific, Wilmington, NC, USA) was used to detect the concentration of DNA. The DNA concentration was adjusted to 20 ng μL−1 with deionized water and stored at −20 °C.
The forward primers of 22 microsatellite markers were synthesized and labeled, respectively, with FAM, HEX, TAMRA, and ROX (RuiBiotech, Beijing, China). Single PCR reactions were performed over all the genotypes with 22 microsatellites. Polymerase chain reaction was performed on an Analytik Jena Biometra Tone 96G (Jena, Germany). The amplification system consisted of 5 μL 2× taqplus mix (5 U/μL, Takara, Dalian, China), 0.3 μL each of the forward (0.5 μM) and reverse primers (0.5 μM), 2 μL DNA template, and 2.4 μL deionized water. The annealing temperature was 55 °C for 40 cycles, and the amplified products were stored at 4 °C.
The amplified products were divided into five sets according to microsatellite markers. Set A includes SSR1(FAM), SSR2(HEX), SSR3(TAMRA), and SSR4(ROX). Set B includes SSR5(FAM), SSR6(HEX), SSR7(TAMRA), and SSR8(ROX). Set C includes SSR9(FAM), SSR10(HEX), SSR11(TAMRA), and SSR12(ROX). Set D includes SSR13(FAM), SSR14(HEX), SSR15(TAMRA), and SSR16(ROX). Set E includes SSR17(FAM), SSR18(HEX), and SSR19(TAMRA). Set F includes SSR20(FAM), SSR21(HEX), and SSR22(TAMRA). The amplified products of each set from the same genotype were mixed for capillary electrophoresis. Capillary electrophoresis was performed in ABI 3730xl (Applied Biosystems, Foster City, CA, USA), and the fragment size of each sampling locality was read using genemarker 2.2.0 software (SoftGenetics, State College, PA, USA).
2.3. SSR Loci Development
RNA was extracted from the roots, stems, and leaves of the Chinese fir and sequenced using Illumina Hiseq TM2000. The transcriptome sequence splicing software Trinity was used to splice transcriptome sequences [
13]. SSR loci were identified using MISA software [
14], and SSR primers were designed using Primer3 [
15]. Primer3 options were default. Twenty-two novel SSR loci with the highest polymorphism and low null allele frequencies were selected from 1000 SSR loci. Identification method of the loci was described [
16]. Twenty-two pairs of primers are shown in
Table 2.
MICRO-CHECKER 2.2.3 was used to check for allelic dropout and scoring errors [1+]. The Ewens–Watterson test for neutrality and Hardy–Weinberg equilibrium (HWE) of SSR markers were conducted by POPGENE 1.32 [
17] to test whether the loci were under selection pressures and the level for deviations from HWE. Linkage disequilibrium (LD) tests between pairs of microsatellite loci were performed in each sampling locality using the Arlequin 3.1 [
18]. Then, the Bonferroni correction for multiple tests [
19] was applied.
2.4. Genetic Diversity
Based on SSRs profiles, the polymorphism information content (PIC) was calculated using Cervus 3.0.7 [
20]. The number of alleles (
Na), number of effective alleles (
Ne), observed heterozygosity (
Ho), expected heterozygosity (
He), number of private alleles (
Np), Shannon’s diversity index (
I), frequency of null alleles (
FNA), and inbreeding coefficient (
FIS) were calculated using GenAlEx6.5 [
21,
22]. The 95% confidence interval of the
FIS was calculated from 5000 bootstrap re-samplings using the package bootES of the software R to test whether
FIS was significantly different from zero [
23].
2.5. Isolation-by-Distance
To analyze which sampling localities have a greater impact on IBD, the correlation between the matrix of genetic distances [FST/(1 − FST)] and the matrix of geographic distances (Km) in Chinese fir among the different sampling localities were analyzed with Mantel tests (999 permutations) run in GenAIEx6.5.
2.6. Genetic Structure
The population genetic structure of Chinese fir in the study area was investigated with STRUCTURE version 2.3.1 software [
24]. The parameters were set as follows: group number
K, 1–10; length of burn-in period and MCMC value, 100,000 and 200,000 times; admixture and correlated allele frequencies models, respectively; and each
K value was repeated 20 times. The results were uploaded to STRUCTURE Harvester [
25], and the optimum
K value was obtained. STRUCTURE graphical bar plot was generated with DISTRUCT program to show membership coefficients [
26].
Isolation by distance pattern can cause the algorithm of STRUCTURE to overestimate the number of genetic clusters for increased genetic differentiation among individuals with geographical distance [
27]. Pairwise population differentiation coefficient
FST and
RST between pairs of sampling localities were computed using SPAGeDi 1.3 to explore the relationship of genetic differentiation among sampling localities [
28]. If
RST is >
FST, then there is phylogeographic signal. Jost’s
D values (
Dest), Nei’s genetic distance (
D), and gene flow (
Nm) were calculated using GenAlEx6.5. To estimate the genetic differences between and within sampling localities, an analysis of molecular variance (AMOVA) was performed using Arlequin 3.1 software. In addition, NTSYS pc 2.10e [
29] software was used to draw the dendrogram with Nei’s genetic distance based on the unweighted group average method (UPGMA). To further confirm genetic discontinuities, structure, and cluster analysis, principal coordinate analysis (PCoA) was performed using GenAlEx6.5.
4. Discussion
Highly variable microsatellite loci are a useful tool for studying genetic diversity and genetic structure [
30,
31]. However, stuttering, large-allele dropout, and null alleles are three typical sources of scoring errors capable of biasing biological conclusions with microsatellite data [
32]. In addition, a small percentage of EST-SSR loci could be under selective pressure [
33]. Because population genetics applications are based on neutral assumptions, SSR loci under selective pressure should be excluded from analyses [
34]. Neutral markers could reflect population processes, including patterns of recent gene flow and historical connectivity, as well as changes in population size [
35]. A locus with a polymorphism information content (PIC) value over 0.5 could be regarded as highly polymorphic [
36]. The frequency of null alleles below 8% was found to have little effect on population differentiation [
37]. In this study, we developed 22 novel SSR markers, and all the loci were suitable for population analyses.
Karst areas occupy approximately 15% of continental terrains [
38]. Southwestern China, where karst areas cover nearly 4.26 × 10
5 km
2, contains the largest and most well-developed karst region in the world [
6,
39]. Guangxi province is home to one of the largest karst areas in China. The effects of habitat fragmentation may lead to a discontinuous pattern of genetic variation and reduce the genetic diversity of the populations, mainly due to the limited gene flow [
9]. The genetic diversity of Chinese firs in RS was higher than that of the other sampling localities. RS population is one of the superior provenances of the species with a fast-growing phenotype in China [
40]. ND had the highest rates of gene flow with other sampling localities. No significant
FST differentiation was detected in pairwise comparisons among RS, ND, and TE sampling localities, which could be the consequence of high levels of gene flow homogenizing RS, ND, and TE sampling localities. The three sampling localities are speculated to be the central source of gene flow and genetic diversity for Chinese firs around the karst landform.
Marginal populations occur near the outer boundary of the geographic range and in atypical habitats of the species [
41]. By reason of genetic drift and natural selection, marginal populations will diverge from central populations [
41]. Compared to central populations, marginal populations show reduced neutral genetic diversity and increased genetic differentiation [
10]. CW and PB counties are located near the southern tip of China. Pairwise differentiation based on
FST and
Dest values confirmed the results analyzed by PCoA and NJ tree that CW had the most genetic differentiation in all sampling localities, and PB came second. With higher differentiation and less genetic variation than in the central population, populations of CW and PB were identified to be the marginal populations in the south of the karst landform.
Tree species dynamics at range edges are frequently driven by climatic factors [
42]. Many low-latitude populations of cold-adapted plant species will be negatively affected by global warming [
43]. Cold-adapted populations at low-latitude range edges are important because they may harbor the combination of alleles that foster persistence in a warmer climate [
44]. PB county is located in the south of the Tropic of Cancer, near the southern tip of China, with the maximum temperature for Chinese fir growth. The climate of CW and PB is similar to that of the nearby area, where the annual highest temperature is about 40 °C [
45]. Chinese firs are not resistant to overly high summer temperatures [
46]. Under a warmer climate, genetic diversity would be subjected to selective pressure [
47]. Evolutionary adaptation is an important way for natural populations to cope with threats, including extreme climate [
48]. Because neutral molecular variation rarely predicts adaptive and quantitative genetic variation, it is common to fail distinguishing between differentiation due to drift and differentiation due to adaptation to different environmental conditions [
49,
50]. The association between environmental gradients and genotypes could be analyzed by landscape genomic approach in the future to identify potential loci from Chinese fir underlying extreme climate adaptation.
The ability of a species to adapt to peripheral habitats in marginal populations plays a major role in the evolution of species ranges and adaptation in the face of environmental change [
10,
41]. Marginal populations are worthy of special conservation interest [
12,
51,
52]. Large-scale afforestation and reforestation of other tree species have occurred in Guangxi [
53]. Habitat loss has been most extensive for CW and PB populations. Translocations are among the most powerful tools for biodiversity conservation [
54]. Augmentation (movement of individuals into a population of conspecifics) is recognized as one of the three different types of translocations by the International Union for Conservation of Nature [
55]. Augmentation can be beneficial to increasing population size [
56]. By promoting the adaptive potential for evolutionary change, individuals mixing from several source populations will enhance the establishment and persistence of the translocated populations [
57,
58]. To preserve the genetic diversity of CW and PB populations of Chinese fir, we propose the following measures. First, core germplasm individuals from CW and PB could be collected, as previous research demonstrated [
59]. In the second place, considering that locally adapted populations are most likely to establish and persist under similar ecological conditions, it is proposed that the core germplasm individuals could be freely released into nature reserves in nearby counties such as JX. In the end, a mixture of individuals from several gene pools in the north of the karst landform could be freely released into nature reserves of CW and PB.