Next Article in Journal
Nutritional Strategies for Childhood Obesity Prevention
Next Article in Special Issue
First Record of the Alien Species Procambarus virginalis Lyko, 2017 in Fresh Waters of Sardinia and Insight into Its Genetic Variability
Previous Article in Journal
Effect of the Chinese Herbal Medicine SS-1 on a Sjögren’s Syndrome-Like Disease in Mice
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Microsatellite Loci Reveal Genetic Diversity of Asian Callery Pear (Pyrus calleryana) in the Species Native Range and in the North American Cultivars

1
Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN 37996, USA
2
Department of Plant Sciences, University of Tennessee, Knoxville, TN 37996, USA
3
Department of Forestry and Environmental Conservation, Clemson University, Clemson, SC 29634, USA
*
Author to whom correspondence should be addressed.
Life 2021, 11(6), 531; https://doi.org/10.3390/life11060531
Submission received: 5 May 2021 / Revised: 30 May 2021 / Accepted: 4 June 2021 / Published: 7 June 2021
(This article belongs to the Special Issue Biological Invasions and Biodiversity)

Abstract

:
Pyrus calleryana Decne. (Callery pear) includes cultivars that in the United States are popular ornamentals in commercial and residential landscapes. Last few decades, this species has increasingly naturalized across portions of the eastern and southern US. However, the mechanisms behind this plant’s spread are not well understood. The genetic relationship of present-day P. calleryana trees with their Asian P. calleryana forebears (native trees from China, Japan, and Korea) and the original specimens of US cultivars are unknown. We developed and used 18 microsatellite markers to analyze 147 Pyrus source samples and to articulate the status of genetic diversity within Asian P. calleryana and US cultivars. We hypothesized that Asian P. calleryana specimens and US cultivars would be genetically diverse and would show genetic relatedness. Our data revealed high genetic diversity, high gene flow, and presence of population structure in P. calleryana, potentially relating to the highly invasive capability of this species. Strong evidence for genetic relatedness between Asian P. calleryana specimens and US cultivars was also demonstrated. Our data suggest the source for P. calleryana that have become naturalized in US was China. These results will help understand the genetic complexity of invasive P. calleryana when developing management for escaped populations: In follow-up studies, we use the gSSRs developed here to analyze P. calleryana escape populations from across US.

Graphical Abstract

1. Introduction

Genetic diversity and population structure of any species are important factors for determining the long-term survival of the species and their adaptation to environmental changes [1]. The population genetic profile of a species can inform about the origins of sub-populations [2,3], which provides greater understanding about the introduction history of the tree and its cultivars. Such knowledge can be used to formulate an effective management plan. Assessment of the existing genetic variation of any species requires widespread and intensive sampling from both native and introduced ranges [4].
Pyrus calleryana Decne., Callery pear, is a species of pear tree native to eastern and southern China, Taiwan, Korea, and Japan [5]. This deciduous tree often has a conical to rounded crown and flowers as early as three years of age [5,6]. Pyrus calleryana is a popular ornamental tree well-known for its early spring flowers, robust growth, and fall color display [7]. ‘Bradford’ is the most widely planted and the most commonly known Callery pear cultivar in the United States (USA) [8,9]. Pyrus calleryana is diploid with a haploid chromosome number of 17, 2n = 34 [6] and flow cytometry estimated genome size of 588 Mbp/1C [10]. Flowers from the cultivars of P. calleryana are self-incompatible and cultivars are routinely vegetatively propagated by grafting. The trees can produce viable, long-lived seeds [11], when flowers are cross-pollinated by other compatible Pyrus species, compatible rootstock-based pollen, or other P. calleryana cultivar [12]. When this occurs, the seeds can germinate and establish when dispersed in favorable environments [9]. In addition, P. calleryana adapts well to soils of variable pH (acidic to alkaline soil) and trees are tolerant of drought [7]. Several traits, including gametophytic self-incompatibility, pathogen and herbivore resistance, and tolerance of various abiotic stresses have contributed to the spread and persistence of P. calleryana in a variety of environments [12]. Although the underlying mechanisms and processes that have enabled the broad spread and invasiveness of the species are not well understood, intraspecific hybridization among the genetically distinct cultivars and interspecific hybridization with the other escaped Pyrus species could be among the possible reasons behind expansion of P. calleryana populations [7,13]. Additionally, several insects promote cross-pollination [8,14]. The fruits on fertile naturalized trees become secondary foods for birds that then disperse seeds into distant areas, and contribute to the spread of the species [15].
Major imports of P. calleryana seeds to the USA were made from East Asia in 1917 through 1920 and that germplasm was used to breed with European pear, P. communis L. Selections were performed based on resistance against the fire blight pathogen, Erwinia amylovora Burrill [12]. The hardiness of P. calleryana lent itself to the development and release of several intraspecific hybrid cultivars making it popular among local gardeners and landscapers. Soon, P. calleryana appeared in and naturalized almost all habitats throughout the USA [8,16,17,18,19]. Naturalized P. calleryana trees can be found in 33 states [16,17] and due to expanding populations of naturalized trees, P. calleryana has been listed, or watch-listed, as an invasive species in many USA states [9,12,20]. Pyrus calleryana can potentially become one of the most problematic invasive species in the USA [21]. Hence, there is need for understanding genetic contributions to invasiveness as a means to develop effective management plans and strategies that can mitigate or limit the spread of P. calleryana. To address this knowledge gap, we need to know more about the biology and genetics of this invasive species.
The assessment of the molecular population genetics preferably using co-dominant markers is necessary for understanding the dynamics of adaptation and the spread of any species [22]. The use of simple sequence repeats (microsatellites; SSRs) in population genetics and phylogenetics provides a more reliable interpretation of genetic diversity than other arbitrary markers such as random amplified polymorphic DNAs (RAPDs) [23,24]. SSRs, highly informative and multi-allele genetic markers, are experimentally reproducible and transferable among closely related species [25]. The presence of multiple alleles at an SSR locus makes SSRs more informative than other molecular markers, including SNPs [26].
Most of our knowledge on P. calleryana originates within its native range in Asia, where P. calleryana naturally occurs as scattered individuals in the wild to the point of being considered a threatened species [14,27]. Most of these studies have focused on identification and characterization of cultivars or Pyrus species using different types of DNA markers such as SSRs [28,29,30,31], amplified fragment length polymorphisms (AFLPs) [32], RAPDs [33], restriction fragment length polymorphisms (RFLPs) [34], and chloroplast DNA (cpDNA) [35]. Few studies, however, have investigated the genetic diversity and population structure of P. calleryana. Nuclear SSRs (nSSRs) and cpDNA were used in a study of genetic diversity of P. calleryana in Zhejiang province of China [36], which revealed the geographic distance as the major factor in shaping the population structure of P. calleryana. Additionally, an examination of the genetic diversity of P. calleryana var. dimorphophylla in the Tokai district of central Japan [7,27,37] found complex genetic structure of the species, probably originating from artificial propagation and introgression with other Pyrus species. There are limited studies investigating the genetic diversity and population structure of the invasive USA P. calleryana escapes, and this issue urgently needs further investigation [7,37].
We assessed the genetic diversity and population structure of P. calleryana from both native (i.e., Asian) and introduced ranges (the USA-introduced cultivars). We hypothesized that there would be high genetic diversity present within Asian P. calleryana populations, and that they would be genetically related to early USA-naturalized cultivars of P. calleryana (owing to the origin of the species) in terms of genetic distance and their genetic clustering. Specifically, we aimed to (a) develop novel microsatellite markers for P. calleryana, (b) test the cross-species amplification of the developed genomic short sequence repeats (gSSRs) to other Pyrus and Malus species (a related outgroup, of economic importance), and (c) use the most informative of these markers to evaluate the genetic diversity and genetic relatedness among Asian P. calleryana specimens and early USA-naturalized cultivars of P. calleryana.

2. Materials and Methods

2.1. Sample Collection

Leaf and flower bud samples of pears (Pyrus) and the original USA cultivar selections of P. calleryana were obtained from 10 herbaria and arboreta in the USA. In total, 147 samples (including 80 samples of P. calleryana and 67 samples of other Pyr and Malus species) were received. Respective permissions or licenses for limited destructive sampling were granted by all these institutions, in accordance with their internal regulations. The geographical coordinates, the country of origin, and the year of collection were recorded for each sample and whenever possible for historical specimens [38]. The collection was reduced to 90 specimens due to the unreliable/inconsistent amplification of samples. Out of those 90 samples, 57 were P. calleryana specimens (36 specimens of Asian P. calleryana and 21 samples of 7 USA commercial cultivars of P. calleryana) and 33 samples represented 14 different species of Pyrus and 2 samples of Malus rockii (Supplementary Table S1). These 33 Pyrus and Malus species samples were used to evaluate the potential for cross-amplification among gSSRs.

2.2. gDNA Extraction

Approximately 100 mg of dried leaf samples or fresh flower buds were taken from each specimen and homogenized using a Bead mill 24 (Fisher Scientific, Pittsburgh, PA, USA). Samples were homogenized four times and kept frozen in liquid nitrogen for at least 5 min between each homogenization step in order to improve the tissue homogenization. The gDNA was extracted using EZNA DNA DS Mini Kit (Omega Bio-Tek, Norcross, GA, USA) protocol. Nanodrop (Thermo Fisher Scientific, Wilmington, DE, USA) was used to evaluate the purity and measure the concentration of the isolated gDNA. Re-extraction was attempted following the CTAB protocol [39] for those samples that did not produce good quality and quantity of gDNA using EZNA DNA DS Mini Kit. gDNAs was re-extracted wherever enough of plant sample material was available.

2.3. Microsatellite Primers and Genotyping Conditions

Genomic SSRs (gSSRs) were developed by using closely related pear (Pyrus × bretschneideri Rehder) whole genome sequence data (GenBank number: JH994112) [40]. The development of gSSRs involved several steps including: (i) genome assembly using MaSuRCA [41]; and (ii) SSR mining using GMATA [42]. The gSSRs of interest with a motif length of 2 to 6 bp and a minimum of 5 repeats were retained. Finally, 18 to 22 bp gSSR primers were designed using Primer3 [43] with 45 to 55% GC content, 58 to 62 °C melting temperature, and 100 to 500 bp of expected product sizes. The developed gSSRs were used for this P. calleryana study (IDT, Coralville, IA, USA; Supplementary Table S2). Genomic locations of the gSSRs used in this study were analyzed in six available genomes of related Prunus spp. using BLAST [44] with default settings, owing to their relatively higher quality (contiguity, N score, and annotation) than the genome used for their development. Genome with most gSSRs mapping to it was used to visualize their locations. Additionally, for cross-species amplification analysis, gDNA samples of the other Pyrus species that failed to amplify with the gSSRs were further tested with Internal transcribed spacer (ITS) primers [45] and ribosomal protein S16 (rps16/cpDNA03) primers [46]. This was done to confirm that the failure of gSSR-PCR was not due to inferior quality of gDNA used. Both ITS (nuclear) and rps16 (plastidic) primers were used in our study for more effective DNA quality controls.
Polymerase chain reaction (PCR) amplifications were performed in a 10 μL reaction mixture consisting of 4 ng gDNA, 1 μM final concentration of each primer, 5 μL of 2 × GoTaq® DNA Polymerase (Promega, Madison, WI, USA), and 0.5 μL of dimethyl sulfoxide (DMSO). In order to validate the data, the gDNA extracted from the P. calleryana herbarium specimen (Arnold Arboretum, catalog number: 156,119) was used as a positive control, and sterile distilled water as a negative control, for each of 18 gSSRs tested. PCR amplification was performed using the touch-down protocol to ensure the specificity of the amplified fragments [47] using the following thermal profile: initial denaturation at 94 °C for 3 min., followed by 10 cycles of denaturation at 94 °C for 30 s, annealing at 65 °C for 30 s with a touch-down of 0.7 °C/cycle and an extension at 72 °C for 30 s then, followed by 30 cycles of denaturation at 94 °C for 30 s, annealing at 58 °C for 30 s and an extension at 72 °C for 30 s, with a final extension of 72 °C for 4 min.
QIAxcel Capillary Electrophoresis System (QIAGEN, Germantown, Maryland, USA) was used to visualize the P. calleryana PCR products and determine allelic sizes using a 15/600 bp alignment marker and 25 to 500 bp DNA size marker. Allele sizing was performed for each of the 147 gDNA samples against each of the 18 gSSR markers. The PCR reaction was repeated twice for the samples that did not amplify, which were then declared missing data if still failed to amplify. Samples with missing data in more than 9 of 18 loci were discarded from the analyses resulting in the final dataset of 57 P. calleryana samples (Supplementary Table S3 and data not shown).
For cross-species amplification test, PCR amplification in the gDNA samples of the other Pyrus and Malus species was also performed using this PCR reaction mixture composition and following the touch-down protocols detailed above. In addition, PCR amplification for the samples that failed to amplify for each gSSR was further attempted using ITS [45] and rps16 primers [46] using a 10 μL reaction mixture: 10 ng gDNA, 5 μM final concentration of each primer (ITS or rps16, respectively), and 5 μL of AccuStart II PCR SuperMix (Quantabio, Beverly, MA, USA). Pyrus calleryana (Carnegie Herbarium, catalog number: 396078) DNA was used as a positive control and sterile distilled water was used as a negative control.
The optimized thermal profile for PCR with ITS1 and ITS4 primers included an initial denaturation at 94 °C for 2 min, 40 cycles of 95 °C for 30 s, 60 °C for 1 min, 72 °C for 90 s, and the final extension at 72 °C for 7 min. The thermal cycle for touchdown PCR involving rps16 primers included the following settings: 95 °C for 4 min, 95 °C for 20 s, 58 °C for 30 s (−0.3 °C/cycle), (72 °C for 90 s) for 10 cycles; (95 °C for 20 s, 55 °C for 30 s, 72 °C for 90 s) for 35 times, and 72 °C for 5 min. The products of the PCR reactions were electrophoresed using 2% w/v agarose gels with ethidium bromide stain at 100 V/cm2 for 1 h, visualized under UV light using UVP GelStudio PLUS (Analytikjena, Upland, CA, USA) and documented using VisionWorks 8.22.18309.10577 (Analytikjena, Upland, CA, USA).

2.4. Data Analysis

The raw allele sizes were binned into statistically identical allelic classes using FlexiBin excel macro [48]. The binned allelic data were used for further analyses. For each gSSRs, the P. calleryana dataset was transformed using the motif lengths to represent the number of repeats rather than the PCR allele sizes using PGDSpider [49] version 2.1.1.5 (University of Bern, Bern, Switzerland). Clone correction was completed using poppr [24] version 2.8.5 (Oregon State University, Corvallis, OR, USA) in RStudio version 1.2.5033 (RStudio, Boston, MA, USA) using R [50] 3.6.2 (R Foundation, Vienna, Austria). There were no clones in the dataset (total n = 57). Based on their country of origin, the samples were divided into four population groups, denoted “China” (n = 20), “Japan” (n = 11), “Korea” (n = 5), and “USA” (n = 21).

2.5. Population Genetics of P. calleryana

2.5.1. Genetic Diversity

The genetic diversity indices across the 18 gSSRs and 4 population groups were calculated using R with packages poppr version 2.8.5 and hierfstat [51] version 0.04-22, and SPAGeDi [52] version 1.5 (Free University of Brussels, Brussels, Belgium). For each gSSR marker, the number of alleles amplified (N), observed heterozygosity (Ho), expected heterozygosity (He), Jost’s differentiation estimate (Dest), Stoddard and Taylor index (G), and allelic richness (Ar) were calculated using hierfstat. SPAGeDi was used to estimate the presence of private alleles (Pa), as well as hierarchical fixation indices including inbreeding coefficients (FIS, RIS), allele fixation index (FST), and RST (FST analogue based on allele size) [53,54]. Gene flow (Nm = ¼ × [(1/FST) − 1]) among populations was estimated using GenAlEx [55] version 6.5 (Rutgers University, New Brunswick, NJ, USA).
In order to assess the contribution of mutation rate to the population structure, SPAGeDi performed the permutation tests to determine the phylogenetic distance between individuals using 10,000 permutations among alleles within each locus and to determine the significance of inbreeding coefficients using 10,000 permutations among gene copies [56]. For many samples, including many of the historical specimens, Global Positioning System (GPS) coordinates were not available, shrinking the sample subset further, and consequently creating a possible bias in our study. The entire P. calleryana dataset (n = 57) was used to determine the phylogeographical signal within populations as RST does not depend on GPS coordinates, whereas, only the individuals with known GPS coordinates of origin (total n = 28; “China”, “Japan”, “Korea” groups) were used to determine the phylogeographical signal among populations.
Analysis of Molecular Variance (AMOVA) was performed using the package poppr with 1000 permutations in order to evaluate the molecular variance partitioning within and among the population groups. Linkage Disequilibrium (LD) of the used gSSRs was determined in poppr, with significance assessed using 1000 permutations. The pairwise index of association ( r ¯ d) was calculated using permutation approach to assess whether the loci were linked and to ensure that the observed pattern of LD is not due to a single pair of loci [57].

2.5.2. Population Structure

Mantel Test for Isolation by Distance

Samples with known GPS coordinates (n = 28; Supplementary Figure S1) were analyzed using Mantel and partial Mantel tests implemented in the package MASS [58] version 7.3-50 (CSIRO, Cleveland, Australia) using 1000 permutations for the assessment of tests significance. These tests estimate isolation by distance (IBD) to determine the correlation between the genetic and the geographical distance matrices of the individuals. The Mantel test was also standardized using the year of sampling (partial Mantel test), to additionally assess the effect of mutation across time. Further, Mantel correlogram tests were performed to further examine the underlying correlative relationship using the packages ade4 [59] version 1.7-13 (Université Claude Bernard Lyon 1, Lyon, France) and vegan [60] version 2.5-3 (University of Helsinki, Helsinki, Finland) using α = 0.05.

Bayesian Clustering Using Structure and DAPC

Structure [61] version 2.3.4 (Stanford University, Stanford, CA, USA) was used to analyze the population structure of the P. calleryana dataset utilizing a Bayesian clustering algorithm. Genetic clusters among the P. calleryana individuals were inferred using 30 independent Monte Carlo Markov Chains (MCMC) with 250,000 generations of burn-in period and 750,000 MCMC steps in the actual runs for each used number of clusters (K = 1 to 10). Structure results were then visualized with PopHelper [62] version 1.0.10 (Uppsala University, Uppsala, Sweden) using the Evanno’s method [63]. ObStruct [64] version 1.0 (University of Auckland, Auckland, New Zealand) was used to determine the correlation of population structure of inferred ancestral profiles to that of predefined/sampled populations. The program uses the ad hoc R2 statistics whose values range from 0 (recent divergence of predefined populations or a lot of migration/admixture between populations) to 1 (strong diversification and/or population structure) [64]. One-tailed t-tests were used to accrue the significance of the differences when consecutively removing each one of the pre-defined populations or the inferred clusters, to assess this group’s impact on R2.
The population structure of the P. calleryana dataset was also analyzed using model-free multivariate clustering approach, Discriminant Analysis of Principal Components (DAPC) using the package adegenet [65] version 2.1.1 (Université de Lyon, Lyon, France). The DAPC analysis was optimized and cross-checked utilizing 1000 permutations over Principal Component Analysis (PCA) range from 2 to 109 (total number of alleles – 1). The analysis was further confirmed using a dendrogram of unrooted neighbor-joining tree of pairwise genetic distances [66] among P. calleryana originating populations. A separate DAPC analysis was also performed for USA cultivars only.

3. Results

3.1. gSSR Development and Selection

After gSSR mining with GMATA and execution of additional in-house scripts, a total of 115,838 SSRs were discovered from three Pyrus × bretschneideri draft genomes. Marker polymorphism was determined based on allelic variation within each genome or across the three genome assemblies. SSR mining and primer design resulted in 105,557 SSRs with acceptable primers and of these, 90,987 were dinucleotide, 10,913 were trinucleotide, and 2891 were tetranucleotide repeats. Primers between 59 to 60 °C with <1 °C difference between forward and reverse primers melting temperatures resulted in the discovery of 15,269 single motifs monomorphic SSRs along with 306 single motifs polymorphic SSRs. Only single motifs polymorphic SSRs were considered for the further analysis. In the single motifs polymorphic SSRs collection, AG was the most frequent dinucleotide motif and AAG was the most frequent trinucleotide motif (Supplementary Figure S2a). For the study, 50 gSSRs were selected and tested using gDNA samples from three locally escaped, naturalizing Callery pear trees (Third Creek Greenway, Knoxville, TN, USA). Based on preliminary data and the resulting amplification robustness, polymorphic character, and agreement with the expected product sizes, 40 gSSRs were selected for further evaluations. Of these, based on the initial assessment, 18 gSSRs with high discriminating power for multi-locus genotypes of P. calleryana gDNA were included in the subsequent analyses (Supplementary Table S2). Genomic locations of those 18 gSSRs were analyzed using the six available high-quality genomes of related Prunus spp. Among those, 17 gSSR mapped to Prunus dulcis (Mill.) D.A.Webb genome GCF_902201215.1 [67] (Supplementary Figure S2b).

3.2. Cross-Amplification

Cross-amplification was performed using 18 gSSRs (Supplementary Table S4). Within the genus Pyrus, all the gSSRs amplified in three Pyrus species: P. communis, P. longipes/cossonii, and P. pyrifolia. Likewise, the gSSRs cross-amplified at high rates in P. pashia (94%) and P. amygdaliformis (89%). The developed gSSRs performed well in Malus rockii (67%). Additionally, the gDNA samples that failed to amplify in P. gharbiana, P. korshinskyi, P. regelii, and P. × hybrid (‘Bartlett’ × P. salicifolia) samples using our gSSRs were successfully amplified using both ITS and rps16 primers (Supplementary Figure S3).

3.3. Population Genetics of P. calleryana

3.3.1. Genetic Diversity

All 57 individual P. calleryana samples represented unique MLGs and their genotypic data were used for analyses. Throughout the dataset, there was about 10% missing data (Table 1). The locus and population with the highest missing data were PyC012 with 26%, and “China” with 15% missing data, respectively. The dataset deviated from Hardy–Weinberg equilibrium (HWE; Supplementary Figure S4), which is a possible result from the relatively low sample number, from specimens that were sampled across a broad time range (approximately 1912 to 2019 AD), as well as from wide geographical origins.
Among the four populations, the average Shannon-Wiener Index of MLG diversity (H) was 4.04, indicating high genetic diversity in the genotypic dataset (Table 1). The average effective number of alleles in the four populations was 5, ranging from 2 in “Korea” to 6 in “China”. This indicated high genetic diversity and variations of P. calleryana populations. Similarly, the overall Ar of 3.79 varied from 2.48 in “Korea” to 4.61 in “China”. A total of 88 private alleles were found in the P. calleryana dataset, with “China” population having the most of private alleles (n = 37).
The gSSRs had high power in discriminating the MLGs requiring only 8 gSSRs to capture all of the MLGs present (Supplementary Figure S5). The calculation of a small range of pairwise values of LD ( r ¯ d = 0 to 0.4, p-value = 0.113) indicates no linked loci and suggests distribution of gSSRs across the genome (Supplementary Figure S6).
Across the dataset, an average of about 12 alleles per locus (ranging from 5 to 20) were detected (Table 2). The mean allelic richness (Ar) calculated in the locus-wise manner was 4.70, ranging from 2.74 (PyC050) to 6.22 (PyC031), suggesting a high long-term adaptability and persistence potential of the P. calleryana populations. There was a moderate observed heterozygosity across 18 gSSRs overall (Ho = 0.34) ranging from 0.03 (PyC050) to 0.93 (PyC038). In addition, there was a high expected overall heterozygosity (He = 0.81) across all loci ranging from 0.41 (PyC050) to 0.92 (PyC031 and PyC017). The overall Ho < He implied the presence of population structure within our P. calleryana collection. Furthermore, the dataset indicated high gene flow across all gSSRs with an overall value of 1.79.
For the assessment of phylogeographic signals within the P. calleryana dataset, SPAGeDi was implemented, using 10,000 permutations among alleles within each locus. The mean permuted RST over all loci was not statistically different from FST in accordance with the expectations of this test, and the observed RST was bigger than the observed FST (Pobs > exp = 0) indicating the presence of phylogeographic signal within populations (data not shown). Furthermore, the slope test of pairwise RST was evaluated in both linear and logarithmic forms to assess the phylogeographic signal among populations. From this slope test, we found no evidence of phylogeographic signal among populations (Pobs > exp = 0.95). Only the samples with available GPS coordinates (n = 28) were used for the slope test, thus creating a possible bias in this analysis.
AMOVA was used to investigate the partitioning of the molecular variance. It suggested a low proportion of total molecular variance among populations (6.2%), with more than half of the total molecular variance partitioned within individuals (63.5%) (Supplementary Table S5). Significance of the test result (p < 0.001) indicated the existence of population structure within the genotyped collection of P. calleryana.

3.3.2. Population Structure

Mantel Test for Isolation by Distance

Several independent analyses were performed to determine the population structure in the P. calleryana collection. Isolation by distance analysis using the Mantel test yielded no evidence of correlation between genetic and geographic distances among the analyzed P. calleryana individuals of Asian origins (total n = 28; Supplementary Figure S1; Mantel statistic (r) = 0.04, p-value = 0.30) (Figure 1). The partial Mantel test (standardized geographic distance matrix by the year of sample collection) did not change that result (Mantel statistic using year of sample collection (r’) = 0.04, p-value = 0.33). Additionally, there was a non-linear relationship of the genetic and geographic distances across the space (Figure 1b). The amplitude of the Mantel’s r scores in the correlogram was low (−0.1 to 0.05), indicating a low impact of spatial distancing on the population structure of P. calleryana dataset.

Bayesian Clustering Using Structure and DAPC

Bayesian clustering using Structure indicated two genetically distinct clusters (ΔK = 2) in the genotyped P. calleryana dataset (Figure 2a,b). The admixture of both the inferred clusters was observed for all four population groups. Additionally, four genetically distinct clusters (ΔK = 4) of the genotyped P. calleryana dataset (Figure 2c) showed the admixture of all four inferred clusters in all population groups except “Korea”. Negligible variability among 30 independent Markov chains of Structure was detected. The overall R2 between predefined and inferred clusters of the P. calleryana collection (when K = 2) was 0.42 ± 0.07 suggesting moderate divergence among the predefined populations and Structure-derived genetic clusters within the dataset. The information on the contribution of sampled and inferred populations to the observed structure of P. calleryana was derived by changes to R2 using iterative successive removal of the pre-defined populations and the inferred clusters using ObStruct (Supplementary Table S6). Only minor changes in R2 index value were evident when the predefined populations or the inferred clusters were removed sequentially. Removal of the “China” population caused decrease (p-value = 0.32) in R2 indicating that this population contributed the most to the population structure from among those analyzed. Additionally, the removal of the “Japan” population caused increase (p-value = 0.20) in R2 which suggests that this population was of mixed ancestries and contributed the least to the population structure. Furthermore, removal of the inferred clusters resulted in no major changes in R2 indicating no major contribution of the inferred clusters to the structure within the data. In addition to this, the overall R2 between predefined and inferred clusters of the P. calleryana dataset (when K = 4) was 0.41 ± 0.02, again with no major changes when sequentially removing the pre-attributed or the inferred groups (Supplementary Table S6).
Using DAPC, a multivariate analysis, the P. calleryana dataset showed the clustering different from Structure. DAPC indicated 4 clusters for the P. calleryana dataset. The DAPC result indicated the possibility of the “China” population being ancestral to the bulk of the species with diverged “Japan” and “Korea” populations (Figure 3). This was also supported by the genetic distance dendrogram, and in congruence with the results of the Bayesian clustering. In addition to this, DAPC analysis of USA cultivars alone showed that the cultivars named same from the same or different source institutions were not identical (Supplementary Figure S7).

4. Discussion

We investigated the genetic diversity of P. calleryana in the collection of original non-cultivated Asian specimens and early developed USA cultivars. Available historical records and our results support the hypothesis for high genetic diversity within Asian pear specimens and their genetic relatedness with the early developed USA cultivars. Sample collection from herbaria and arboreta demonstrate their great value for research studies such as ours, or when sampling in native environment is hindered. We found high levels of genetic diversity within P. calleryana populations supporting the fact that wild populations of forest tree species maintain high levels of genetic diversity [73]. Our findings on P. calleryana diversity are also supported by other studies of woody forest trees: North America-native Cercis canadensis [74] and Asia-native Cornus kousa [75].
We developed and used 18 gSSRs discriminating individual MLGs of P. calleryana. Despite availability of the SSRs published by others [36], we chose to develop new markers, as is standard in our laboratories [25,75]. This approach increases the pool of available markers and provides a higher level of confirmation for the results accrued by others. Our novel gSSRs also cross-amplified to conserved sequences of DNA extracted from several Pyrus species. However, these gSSRs did not perform well, with fewer gSSRs amplifying informative sequences from the more distantly related Malus species. Wide genetic variation among Pyrus species exists, and the failure of gSSRs to cross-amplify some Pyrus species could be due to speciation and changing genomic landscape [36]. Those non-amplified Pyrus species are related to P. bretschneideri/P. calleryana but they are highly admixed likely due to inter-species hybridization and genetic admixture common in Pyrus [76]. Nevertheless, the overall cross-amplification potential of our gSSRs makes these novel markers useful for future studies of at least several species of Pyrus. Owing to limited resources we did not confirm the gSSR transfer to other species used here; PCR product size agreement provides clues that our cross-amplification was indeed successful. The gSSRs can be sequenced as needed for each species undergoing analyses.
The genetic diversity, allelic richness, and gene flow patterns of any plant species are related to their ecology and evolution [77,78,79]. In an outcrossing and widespread species, the genetic diversity mainly exists within populations [80]. Our study revealed a high level of genetic diversity (He = 0.81) and allelic richness, and the presence of population structure in P. calleryana. A similarly high level of genetic diversity was found in P. calleryana (He = 0.639) using 14 nSSRs to study 77 individuals [36] and in Ambrosia artemisiifolia (He = 0.776) using 13 gSSRs and 13 expressed sequence tag SSRs (EST-SSRs) to study 321 individuals [81]. We found high genetic diversity for P. calleryana populations compared to other invasive species such as southwestern Puerto Rico’s invasive tree [82], Albizia lebbeck with He = 0.27, or southeastern US invasive species [83], Pueraria lobata with He = 0.25.
We recorded high gene flow among P. calleryana populations across the species’ native range. Irrespective of the geographical barriers, high gene flow could be a possible reason for this species’ invasiveness because gene flow promotes evolution through the spread of new genes or mixture of genes throughout the range of species [84]. The high gene flow rate observed here is consistent with other invasive species [82,85] such as A. lebbeck and Fallopia species. Such a high gene flow rate promotes exchange of genes among populations especially in self-incompatible species by providing seeds for more population growth and colonization of new habitats [82]. As an outcrossing species, P. calleryana has been able to maintain population structure with high genetic diversity. The individuals of a particular cultivar have the same self-incompatibility genotype and cannot produce fruits. However, if the rootstock is allowed to sprout then that rootstock can cross with genetically different scion resulting in fruit set with mixed cultivar types [12]. Additionally, our study found a low impact of spatial distance in population structure of P. calleryana dataset which could support the seeds or pollen as the far-reaching mechanism of dispersal.
In P. calleryana, flowers are indiscriminately visited by various generalist pollinator species and the pollen is often carried among neighboring cultivars, resulting in intraspecific hybridization [8]. Seeds can be dispersed over long distances as a result of frequent frugivorous animal activities [34]. In a previous study, most of the P. calleryana seedlings were found with almost no genetic similarity to nearby mature trees, implying that those seedlings might have originated from foraging birds’ defecation [12]. In addition, there could be an intraspecific hybridization among the cultivars and an interspecific hybridization with the other escaped Pyrus species [13], as implied by our cross-amplification data. These characteristics may help P. calleryana maintain high genetic diversity and high gene flow rate, aiding in the continued spread of the species by providing environment-specific genotypes needed to adapt to a varied environmental condition [82].
High genetic differentiation found in our P. calleryana collection suggested the existence of population structure. Pears are expected to undergo random mating, but an unexpected positive result for the inbreeding coefficient (FIS) values was found in our study indicating the probability of alleles coming from a common ancestor. A similar positive FIS result was found in a study in P. calleryana in China [36], where the species is considered threatened. One of the possible explanations for such positive FIS result could be that the sampled individuals might have experienced some human interferences, such as selection, propagation, and intentional transportation resulting in the escape of cultivation in the USA. Other explanations could be the insect pollination with limited pollen flow creating local population structure, or the selection of loci under negative selection by chance, where any mutation would be lethal. Studies testing this phenomenon in the escaped P. calleryana populations are underway.
Our study did not indicate geographic distance as the major factor contributing to the genetic structure of the populations. A non-significant correlation between genetic and geographic distance could be a result of the small size of our P. calleryana dataset collection. Our result is unlike the Mantel test result obtained for wild P. calleryana from China [36]. In addition to this, our study partitioned 57 individuals from 4 populations into 2 major genetic clusters. However, considering the biology of P. calleryana, DAPC result, unrooted neighbor-joining tree, and bias of Structure towards k = 2 [86,87], we assumed k = 4 as the best clustering for our P. calleryana dataset. Almost all P. calleryana individuals throughout all populations showed extensive genetic admixture, in agreement with the detected high gene flow among populations. The individuals from “China” and “USA” population groups had relatively higher assignment probability to the major genetic clusters. Furthermore, genetic distances of “China” and “USA” populations placed them close to each other. Thus, our results support the historical import records of P. calleryana from China to the USA [12] with the “China” population being ancestral to the “USA” population.
The historical events suggest the development of P. calleryana cultivars using P. calleryana as a common rootstock. We expected at least some clones within the “USA” P. calleryana individuals as such cultivars were multiplicated clonally. However, no clones within the “USA” P. calleryana individuals were found, as each “USA” P. calleryana individual was unique with no shared MLGs within this population. Our study also found no identical cultivars although named the same from the source institutions. In such case, ‘Trinity’ cultivar of one source is genetically different from ‘Trinity’ cultivar of another source. Such a great genetic composition and diversity with no clones in our “USA” P. calleryana individuals signifies the great invasive potential of the species, and alarms about the level of the previously noted cultivar mislabeling and mishandling [7,88]. Our result is in stark contrast to those of others, who used 2 SSRs from P. pyrifolia and 7 from Malus × domestica on 14 P. calleryana cultivars [7]. They detected clonal MLGs in agreement with the cultivar description, confirming genetic identity of ‘Chanticleer’, ‘Cleveland Select’, and ‘Stonehill’- in accord with their history, and origin from the same source [7].
The high level of genetic diversity within the investigated P. calleryana collection suggests the high potential of the species for evolving resilience and ability to thrive in a variety of environmental conditions. In addition, the high gene flow among P. calleryana populations creates the complex genetic admixture, adding more challenge to their control. For the development of measures for improved control, it is necessary to study in detail the genetic composition and diversity, and to infer the evolutionary potential of the species across the introduced ranges. Considering the ultimate goal of well-informed control measures, we are already taking the next steps in analyses of the naturalized P. calleryana: the fine-scale study of P. calleryana in localized area of the USA using the gSSRs developed here, and the broad-scale study of P. calleryana across the USA using reduced-representation genotyping-by-sequencing. Thus, P. calleryana populations across the USA provide a great prospect for further research and study, to better understand the spread mechanism of this invasive species.

5. Conclusions and Outlook

Our study has highlighted factors in P. calleryana native populations that are likely to be contributing to its invasive capability within introduced range in the USA. This tree species has been able to maintain high genetic diversity, which has helped it survive and adapt under diverse environmental, physical, and geographical conditions. The genetic complexity of the species adds complications to efforts at managing naturalized populations and mitigating further spread. The evidenced mishandling of the cultivars likely contributed to enabling the species spread in the naturalized range. In order to limit the cultivar mishandling, we suggest nursery professionals to verify their cultivars with the original ones or to maintain only one/few properly identified cultivars per station, and to consider the high genetic diversity of P. calleryana in large scale production of P. calleryana saplings. Further molecular and remote imagery research is underway to better understand the mechanisms behind the observed spread of P. calleryana across the USA.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/life11060531/s1, Supplementary Supplementary Figure S1. Map of geographic origins for the Asian samples of P. calleryana with known details (n = 28). Locations for several specimens are obscured due to the marker size and the overlap. Map generated using GoogleEarth version 7.3.3.7786 (Google LLC, Mountain View, CA, USA), with surface distance of 300 km marked. Supplementary Figure S2. Frequency of di-, tri-, tetra-, and hexa-nucleotide motif repeats disovered in 306 single motifs polymorphic gSSRs collection. The indicated motifs and their redundant iterations are grouped together. Insert: Frequency of di-, tri-, and tetra-nucleotide motif repeat gSSRs in the discovered 105,557 gSSRs collection; bp = base pair. Supplementary Figure S3. Gel image of cross-species amplification using ITS and rps16, ran on the same gel. (a) The uppermost row in gel image represents the amplification done using ITS primers [45]; (b) The lowermost row in gel image represents the amplification done using rps16 primers [46]. Order of samples in (a) and (b) is identical, and fro\m left to right includes: DNA ladder (DNA molecular marker of 100 bp; BIONEER, Catalog No.:D-1030; Oakland, California, USA); positive control (P. calleryana; PC_A_019); negative control (water); P. gharbiana (20_PC_AO_29 and 20_PC_AO_33); P. korshinskyi (20_PC_AO_14 and 20_PC_AO_15); P. regelii (20_PC_AO_16 and 20_PC_AO_21); P. hybrid (‘Bartlett’ × P. salicifolia; 20_PC_AO_08). The expected size of PCR products for ITS was 678 bp and for rps16 was 911 bp. Supplementary Figure S4. Hardy–Weinberg Equilibrium observed for samples included within the P. calleryana dataset. The rows represent loci and columns represent populations. The legend represents the probability of loci to follow HWE. The loci in pink are suspected of not being in HWE with P ≤ 0.05. Supplementary Figure S5. Genotype Accumulation Curve (GAC) for samples included within the P. calleryana dataset. It represents the number of MLG detected (Y-axis) in relation to the number of loci (X-axis) used for genotyping. Supplementary Figure S6. Pairwise linkage disequilibrium (LD) among the 18 gSSRs for samples included within the P. calleryana dataset. It is expressed as standardized index of association ( r ¯ d). Legend values coincide with hues that explain the strength of the linkage calculated between each pair of the markers. Supplementary Figure S7. DAPC of P. calleryana USA cultivars from different source institutions. DAPC for determining the molecular variance partitioning projected using 15 Principal Components cross-checked and optimized with 1000 permutations. The two respective axes are indicated by the alleles explaining the most of variance within the sampled population. Each different color represents the different source institutions. Each colored dot represents P. calleryana USA cultivar individuals. The names in the figure indicate the following P. calleryana cultivars: Ari01: ‘Aristocrat’; Br01-Br08: ‘Bradford’; Ca01: ‘Cambridge’; Ch01 and Ch02: ‘Chanticleer’; Hm01 and Hm02: ‘Holmford’; Re01: ‘Redspire’; Tr01 and Tr02: ‘Trinity’; and US034 and US035: Unknown cultivars. Table S1. Callery pear samples used for the study, Supplementary Table S2. gSSRs used for the study of P. calleryana, Supplementary Table S3. Genotyping dataset used for the study (binned and converted to repeat number, Supplementary Table S4. Cross-species amplification of the studied gSSRs, Supplementary Table S5. Analysis of Molecular Variance of P. calleryana dataset, Supplementary Table S6. R2 value using Obstruct).

Author Contributions

M.N., D.H., R.N.T., W.E.K., S.L.B., and D.R.C. generated the idea of this experiment and contacted the herbaria and arboreta for the samples. B.A.O. and R.D.K. worked on the pipelines for developing gSSRs. S.S. performed the experiments. S.S. and M.N. contributed to data analysis. All authors contributed to the result interpretation, manuscript writing, and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the United States Department of Agriculture—Agricultural Research Service grant (NACA 58-6062-6) to Trigiano to investigate woody ornamental genomics and Student/Faculty Research Award (SFRA), University of Tennessee Knoxville: Sapkota/Nowicki 2019. The funding agencies had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Funding for open access to this research was provided by University of Tennessee’s Open Publishing Support Fund.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data generated during this study are available in the Supplementary Information.

Acknowledgments

We gratefully acknowledge the herbaria and arboreta (Illinois Natural History Survey Herbarium and Arboretum, Harvard University Herbaria, Morton Arboretum, Carnegie Museum of Natural History Herbarium, Morris Arboretum, US National Arboretum, Arnold Arboretum of Harvard University, Bartlett Tree Research Laboratories, Idaho Museum of Natural History, National Clonal Germplasm Repository-Corvallis) for providing us with the samples of specimens analyzed in this study. Any trade names used in this study are for identification purposes only and do not imply their endorsement by the authors or the research funding entities.

Conflicts of Interest

Our experimental research and field studies on P. calleryana, including the collection of plant material, complied fully with relevant institutional, national, and international guidelines and legislation. No conflicts of interest exist for each and every Author.

References

  1. Taberlet, P.; Fumagalli, L.; Wust-Saucy, A.G.; Cosson, J.F. Comparative phylogeography and postglacial colonization routes in Europe. Mol. Ecol. 1998, 7, 453–464. [Google Scholar] [CrossRef] [PubMed]
  2. Bossdorf, O.; Auge, H.; Lafuma, L.; Rogers, W.E.; Siemann, E.; Prati, D. Phenotypic and genetic differentiation between native and introduced plant populations. Oecologia 2005, 144, 1–11. [Google Scholar] [CrossRef] [PubMed]
  3. Goolsby, J.A.; de Barro, P.J.; Makinson, J.R.; Pemberton, R.W.; Hartley, D.M.; Frohlich, D.R. Matching the origin of an invasive weed for selection of a herbivore haplotype for a biological control programme. Mol. Ecol. 2006, 15, 287–297. [Google Scholar] [CrossRef] [PubMed]
  4. Schierenbeck, K.A.; Ainouche, M.L. Conceptual Ecology and Invasion Biology: Reciprocal Approaches to Nature; Springer: Amsterdam, The Netherlands, 2006; pp. 193–221. [Google Scholar]
  5. Bell, R.; Zimmerman, R. Combining ability analysis of juvenile period in pear. Hort. Sci. 1990, 25, 1425–1427. [Google Scholar] [CrossRef] [Green Version]
  6. Cuizhi, G.; Spongberg, S. Pyrus. Flora China 2003, 9, 173–179. [Google Scholar]
  7. Culley, T.M.; Hardiman, N.A. The role of intraspecific hybridization in the evolution of invasiveness: A case study of the ornamental pear tree Pyrus calleryana. Biol. Invasions 2009, 11, 1107–1119. [Google Scholar] [CrossRef]
  8. Culley, T.M. The rise and fall of the ornamental Callery pear tree. Arnoldia 2017, 74, 2–11. [Google Scholar]
  9. Swearingen, J.; Reshetiloff, K.; Slattery, B.; Zwicker, S. Plant Invaders of Mid-Atlantic Natural Areas. 2002021-06-05. Available online: http://www.maipc.org/PlantInvadersMidAtlanticNaturalAreas5thEdition.pdf (accessed on 4 June 2021).
  10. Dickson, E.; Arumuganathan, K.; Kresovich, S.; Doyle, J. Nuclear DNA content variation within the Rosaceae. Am. J. Bot. 1992, 79, 1081–1086. [Google Scholar] [CrossRef] [Green Version]
  11. Serota, T.; Culley, T. Seed germination and seedling survival of invasive Callery Pear (Pyrus calleryana Decene.) 11 years after fruit collection. Castanea 2019, 84, 47–52. [Google Scholar] [CrossRef]
  12. Culley, T.M.; Hardiman, N.A. The beginning of a new invasive plant: A history of the ornamental Callery pear in the United States. BioScience 2007, 57, 956–964. [Google Scholar] [CrossRef] [Green Version]
  13. Vincent, M.A. On the spread and current distribution of Pyrus calleryana in the United States. Castanea 2005, 70, 20–32. [Google Scholar] [CrossRef] [Green Version]
  14. Makimura, F.; Tsuruta, M.; Yamasaki, K.; Mukai, Y. Visitors to flowers of the endangered Callery pear, Pyrus calleryana Decne. var. dimorphopheylla (Makino) Koidz.; in the Tado population, Mie Prefecture. Jpn. J. Conserv. Ecol. 2015, 20, 197–202. [Google Scholar]
  15. Reichard, S.H.; Hamilton, C.W. Predicting invasions of woody plants introduced into North America: Predicción de invasiones de plantas leñosas introducidas a Norteamérica. Conserv. Biol. 1997, 11, 193–203. [Google Scholar] [CrossRef]
  16. EDDMapS. Early Detection; Distribution Mapping System; The University of Georgia–Center for Invasive Species and Ecosystem Health: Tifton, GA, USA, 2019. [Google Scholar]
  17. GBIF. GBIF Occurrence of Pyrus Calleryana. Available online: https://www.gbif.org/species/5363119 (accessed on 29 May 2021).
  18. Nesom, G.L. Callery pear (Pyrus calleryana—Rosaceae) naturalized in North Carolina. Rhodora 2000, 102, 361–364. [Google Scholar]
  19. White, J.; McClain, W.E.; Ebinger, J.E. Naturalized callery pear (Pyrus calleryana Decne.) in Illinois. Trans. Ill. State Acad. Sci. 2005, 98, 123–130. [Google Scholar]
  20. Randall, R.P. A Global Compendium of Weeds; R.G. and F.J. Richardson: Victoria, Australia, 2002. [Google Scholar]
  21. Warrix, A.R.; Marshall, J.M. Callery pear (Pyrus calleryana) response to fire in a managed prairie ecosystem. Invasive Plant Sci. Manag. 2018, 11, 27–32. [Google Scholar] [CrossRef]
  22. Purugganan, M.D. The molecular population genetics of regulatory genes. Mol. Ecol. 2000, 9, 1451–1461. [Google Scholar] [CrossRef] [Green Version]
  23. Hadziabdic, D.; Wadl, P.A.; Vito, L.M.; Boggess, S.L.; Scheffler, B.E.; Windham, M.T.; Trigiano, R.N. Development and characterization of sixteen microsatellite loci for Geosmithia morbida, the causal agent of thousand canker disease in black walnut (Juglans nigra). Conserv. Genet. Resour. 2012, 4, 287–289. [Google Scholar] [CrossRef]
  24. Kamvar, Z.N.; Tabima, J.F.; Grünwald, N.J. Poppr: An R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ 2014, 2, e281. [Google Scholar] [CrossRef] [Green Version]
  25. Nowicki, M.; Zhao, Y.; Boggess, S.L.; Fluess, H.; Paya-Milans, M.; Staton, M.E.; Houston, L.C.; Hadziabdic, D.; Trigiano, R.N. Taraxacum kok-saghyz (rubber dandelion) genomic microsatellite loci reveal modest genetic diversity and cross-amplify broadly to related species. Sci. Rep. 2019, 9, 1–17. [Google Scholar] [CrossRef]
  26. Vieira, M.L.C.; Santini, L.; Diniz, A.L.; Munhoz, C.d.F. Microsatellite markers: What they mean and why they are so useful. Genet. Mol. Biol. 2016, 39, 312–328. [Google Scholar] [CrossRef] [PubMed]
  27. Kato, S.; Imai, A.; Rie, N.; Mukai, Y. Population genetic structure in a threatened tree, Pyrus calleryana var. dimorphophylla revealed by chloroplast DNA and nuclear SSR locus polymorphisms. Conserv. Genet. Resour. 2013, 14, 983–996. [Google Scholar] [CrossRef]
  28. Bao, L.; Chen, K.; Zhang, D.; Cao, Y.T.; Yamamoto, T.; Teng, Y. Genetic diversity and similarity of pear (Pyrus, L.) cultivars native to East Asia revealed by SSR (simple sequence repeat) markers. Genet. Resour. Crop Evol. 2007, 54, 959–971. [Google Scholar] [CrossRef]
  29. Yamamoto, T.; Kimura, T.; Sawamura, Y.; Kotobuki, K.; Ban, Y.; Hayashi, T.; Matsuta, N. SSRs isolated from apple can identify polymorphism and genetic diversity in pear. Theor. Appl. Genet. 2001, 102, 865–870. [Google Scholar] [CrossRef]
  30. Yamamoto, T.; Kimura, T.; Sawamura, Y.; Manabe, T.; Kotobuki, K.; Hayashi, T.; Ban, Y.; Matsuta, N. Simple sequence repeats for genetic analysis in pear. Euphytica 2002, 124, 129. [Google Scholar] [CrossRef]
  31. Yamamoto, T.; Kimura, T.; Shoda, M.; Ban, Y.; Hayashi, T.; Matsuta, N. Development of microsatellite markers in the Japanese pear (Pyrus pyrifolia Nakai). Mol. Ecol. Resour. 2002, 2, 14–16. [Google Scholar] [CrossRef]
  32. Bao, L.; Chen, K.; Zhang, D.; Li, X.; Teng, Y. An assessment of genetic variability and relationships within Asian pears based on AFLP (amplified fragment length polymorphism) markers. Sci. Hortic. 2008, 116, 374–380. [Google Scholar] [CrossRef]
  33. Yuanwen Teng, Y.; Tanabe, K.; Tamura, F.; Itai, A. Genetic relationships of pear cultivars in Xinjiang, China, as measured by RAPD markers. J. Hortic. Sci. Biotechnol. 2001, 76, 771–779. [Google Scholar] [CrossRef]
  34. Iketani, H.; Manabe, T.; Matsuta, N.; Akihama, T.; Hayashi, T. Incongruence between RFLPs of chloroplast DNA and morphological classification in east Asian pear (Pyrus spp.). Genet. Resour. Crop Evol. 1998, 45, 533–539. [Google Scholar] [CrossRef]
  35. Kimura, T.; Iketani, H.; Kotobuki, K.; Matsuta, N.; Ban, Y.; Hayashi, T.; Yamamoto, T. Genetic characterization of pear varieties revealed by chloroplast DNA sequences. J. Hortic. Sci. Biotechnol. 2003, 78, 241–247. [Google Scholar] [CrossRef]
  36. Liu, J.; Zheng, X.; Potter, D.; Hu, C.; Teng, Y. Genetic diversity and population structure of Pyrus calleryana (Rosaceae) in Zhejiang province, China. Biochem. Syst. Ecol. 2012, 45, 69–78. [Google Scholar] [CrossRef]
  37. Lalk, S.; Hartshorn, J.; Coyle, D.R. Invasive woody plants and their effects on arthropods in the United States: Challenges and opportunities. Ann. Entomol. Soc. Am. 2021, 114, 192–205. [Google Scholar] [CrossRef]
  38. Kim, H.; Chang, K.S.; Chang, C.-S. EH Wilson’s expedition to Korea from 1917 to 1919: Resolving place names of his collections. J. Jpn. Bot. 2010, 85, 99–117. [Google Scholar]
  39. Doyle, J.J.; Doyle, J.L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 1987, 19, 11–15. [Google Scholar]
  40. Wu, J.; Wang, Z.; Shi, Z.; Zhang, S.; Ming, R.; Zhu, S.; Khan, M.A.; Tao, S.; Korban, S.S.; Wang, H.; et al. The genome of the pear (Pyrus bretschneideri Rehd.). Genome Res. 2013, 23, 396–408. [Google Scholar] [CrossRef] [Green Version]
  41. Zimin, A.V.; Marcais, G.; Puiu, D.; Roberts, M.; Salzberg, S.L.; Yorke, J.A. The MaSuRCA genome assembler. Bioinformatics 2013, 29, 2669–2677. [Google Scholar] [CrossRef] [Green Version]
  42. Wang, X.; Wang, L. GMATA: An integrated software package for genome-scale SSR mining, marker development and viewing. Front. Plant Sci. 2016, 7, 1350. [Google Scholar] [CrossRef] [Green Version]
  43. Untergasser, A.; Cutcutache, I.; Koressaar, T.; Ye, J.; Faircloth, B.C.; Remm, M.; Rozen, S.G. Primer3—New capabilities and interfaces. Nucleic Acids Res. 2012, 40, e115. [Google Scholar] [CrossRef] [Green Version]
  44. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  45. White, T.J.; Bruns, T.; Lee, S.; Taylor, J. Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics. PCR Protoc. Guide Methods Appl. 1990, 18, 315–322. [Google Scholar]
  46. Nowicki, M.; Boggess, S.L.; Saxton, A.M.; Hadziabdic, D.; Xiang, Q.-Y.J.; Molnar, T.; Huff, M.L.; Staton, M.E.; Zhao, Y.; Trigiano, R.N. Haplotyping of Cornus florida and C. kousa chloroplasts: Insights into species-level differences and patterns of plastic DNA variation in cultivars. PLoS ONE 2018, 13, e0205407. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Korbie, D.J.; Mattick, J.S. Touchdown PCR for increased specificity and sensitivity in PCR amplification. Nat. Protoc. 2008, 3, 1452. [Google Scholar] [CrossRef] [PubMed]
  48. Amos, W.; Hoffman, I.; Frodsham, A.; Zhang, L.; Best, S.; Hill, A.V.S. Automated binning of microsatellite alleles: Problems and solutions. Mol. Ecol. Notes 2007, 7, 10–14. [Google Scholar] [CrossRef]
  49. Lischer, H.E.; Excoffier, L. PGDSpider: An automated data conversion tool for connecting population genetics and genomics programs. Bioinformatics 2012, 28, 298–299. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2013. [Google Scholar]
  51. Goudet, J. Hierfstat, a package for R to compute and test hierarchical F-statistics. Mol. Ecol. Notes 2005, 5, 184–186. [Google Scholar] [CrossRef] [Green Version]
  52. Hardy, O.; Vekemans, X. SPAGeDi 1.5. A Program for Spatial Pattern Analysis of Genetic Diversity; Program Manual; Universite Libre de Bruxelles: Brussels, Belgium, 2015. [Google Scholar]
  53. Hardy, O.J.; Vekemans, X. SPAGeDi: A versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol. Ecol. Notes 2002, 2, 618–620. [Google Scholar] [CrossRef] [Green Version]
  54. Slatkin, M. A measure of population subdivision based on microsatellite allele frequencies. Genetics 1995, 139, 457–462. [Google Scholar] [CrossRef]
  55. Peakall, R.; Smouse, P.E. GENALEX 6: Genetic analysis in Excel. Population genetic software for teaching and research. Mol. Ecol. Notes 2006, 6, 288–295. [Google Scholar] [CrossRef]
  56. Pons, O.; Petit, R. Measwring and testing genetic differentiation with ordered versus unordered alleles. Genetics 1996, 144, 1237–1245. [Google Scholar] [CrossRef]
  57. Agapow, P.M.; Burt, A. Indices of multilocus linkage disequilibrium. Mol. Ecol. Notes 2001, 1, 101–102. [Google Scholar] [CrossRef]
  58. Ripley, B.; Venables, B.; Bates, D.M.; Hornik, K.; Gebhardt, A. Package ‘mass’. Cran R 2013, 538, 113–120. [Google Scholar]
  59. Dray, S.; Dufour, A. The ade4 package: Implementing the duality diagram for ecologists. J. Stat. Softw. 2007, 22, 1–20. [Google Scholar] [CrossRef] [Green Version]
  60. Oksanen, J.; Blanchet, F.G.; Kindt, R.; Legendre, P. Package ‘vegan’: Community Ecology Package, version 2.0-10. Cran R 2013, 2, 1–295. [Google Scholar]
  61. Pritchard, J.K.; Stephens, M.; Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 2000, 155, 945–959. [Google Scholar] [CrossRef]
  62. Francis, R.M. Pophelper: An R package and web app to analyse and visualize population structure. Mol. Ecol. Resour. 2017, 17, 27–32. [Google Scholar] [CrossRef] [Green Version]
  63. Evanno, G.; Regnaut, S.; Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol. Ecol. 2005, 14, 2611–2620. [Google Scholar] [CrossRef] [Green Version]
  64. Gayevskiy, V.; Klaere, S.; Knight, S.; Goddard, M.R. ObStruct: A method to objectively analyse factors driving population structure using Bayesian ancestry profiles. PLoS ONE 2014, 9, e85196. [Google Scholar]
  65. Jombart, T. adegenet: A R package for the multivariate analysis of genetic markers. Bioinformatics 2008, 24, 1403–1405. [Google Scholar] [CrossRef] [Green Version]
  66. Nei, M. Estimation of average heterozygosity and genetic distance from a small number of individuals. Genetics 1978, 89, 583–590. [Google Scholar] [CrossRef]
  67. Alioto, T. Transposons played a major role in the diversification between the closely related almond and peach genomes: Results from the almond genome sequence. Plant J. 2020, 101, 455–472. [Google Scholar] [CrossRef] [Green Version]
  68. Nielsen, R.; Tarpy, D.R.; Reeve, H.K. Estimating effective paternity number in social insects and the effective number of alleles in a population. Mol. Ecol. 2003, 12, 3157–3164. [Google Scholar] [CrossRef]
  69. Shannon, C.E. A mathematical theory of communication. ACM SIGMOBILE Mob. Comput. Commun. Rev. 2001, 5, 3–55. [Google Scholar] [CrossRef]
  70. Stoddart, J.A.; Taylor, J.F. Genotypic diversity: Estimation and prediction in samples. Genetics 1988, 118, 705–711. [Google Scholar] [CrossRef]
  71. Simpson, E.H. Measurement of diversity. Nature 1949, 163, 688. [Google Scholar] [CrossRef]
  72. Jost, L. GST and its relatives do not measure differentiation. Mol. Ecol. 2008, 17, 4015–4026. [Google Scholar] [CrossRef]
  73. Hamrick, J.L.; Godt, M.W. Effects of life history traits on genetic diversity in plant species. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 1996, 351, 1291–1298. [Google Scholar]
  74. Ony, M.A.; Nowicki, M.; Boggess, S.l.; Klingeman, W.E.; Zobel, J.M.; Trigiano, R.N.; Hadziabdic, D. Habitat fragmentation influences genetic diversity and differentiation: Fine-scale population structure of Cercis canadensis (eastern redbud). Ecol. Evol. 2020, 10, 3655–3670. [Google Scholar] [CrossRef] [Green Version]
  75. Nowicki, M.; Houston, L.C.; Boggess, S.L.; Aiello, A.S.; Paya-MIlans, M.; Staton, M.E.; Hayashida, M.; Yamanaka, M.; Eda, S.; Trigiano, R.N. Species diversity and phylogeography of Cornus kousa (Asian dogwood) captured by genomic and genic microsatellites. Ecol. Evol. 2020, 10, 8299–8312. [Google Scholar] [CrossRef]
  76. Wu, J.; Wang, Y.; Xu, J.; Korban, S.S.; Fei, Z.; Tao, S.; Ming, R.; Tai, S.; Khan, A.M.; Postman, J.D.; et al. Diversification and independent domestication of Asian and European pears. Genome Biol. 2018, 19, 1–16. [Google Scholar] [CrossRef]
  77. Caballero, A.; Rodríguez-Ramilo, S.T.; Avila, V.; Fernández, J. Management of genetic diversity of subdivided populations in conservation programmes. Conserv. Genet. 2010, 11, 409–419. [Google Scholar] [CrossRef]
  78. Leimu, R.; Mutikainen, P.; Koricheva, J.; Fischer, M. How general are positive relationships between plant population size, fitness and genetic variation? J. Ecol. 2006, 94, 942–952. [Google Scholar] [CrossRef]
  79. Sakai, A.K.; Allendorf, F.W.; Holt, J.S.; Lodge, D.M.; Molofsky, J.; With, K.A.; Baughman, S.; Cabin, R.J.; Cohen, J.E.; Ellstrand, N.C.; et al. The population biology of invasive species. Annu. Rev. Ecol. Syst. 2001, 32, 305–332. [Google Scholar] [CrossRef] [Green Version]
  80. Hamrick, J.L.; Godt, M.J.W.; Sherman-Broyles, S.L. Factors Influencing Levels of Genetic Diversity in Woody Plant Species. In Population Genetics of Forest Trees; Springer: Berlin/Heidelberg, Germany, 1992; pp. 95–124. [Google Scholar]
  81. Meyer, L.; Causse, R.; Pernin, F.; Scalone, R.; Bailly, G.; Chauvel, B.; Delye, C.; Le Corre, V. New gSSR and EST-SSR markers reveal high genetic diversity in the invasive plant Ambrosia artemisiifolia L. and can be transferred to other invasive Ambrosia species. PLoS ONE 2017, 12, e0176197. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  82. Dunphy, B.; Hamrick, J. Gene flow among established Puerto Rican populations of the exotic tree species, Albizia lebbeck. Heredity 2005, 94, 418–425. [Google Scholar] [CrossRef]
  83. Pappert, R.A.; Hamrick, J.; Donovan, L.A. Genetic variation in Pueraria lobata (Fabaceae), an introduced, clonal, invasive plant of the southeastern United States. Am. J. Bot. 2000, 87, 1240–1245. [Google Scholar] [CrossRef]
  84. Slatkin, M. Gene flow and the geographic structure of natural populations. Science 1987, 236, 787–792. [Google Scholar] [CrossRef]
  85. Gaskin, J.F.; Schwarzlander, M.; Grevstad, F.S.; Haverhals, M.A.; Bourchier, R.S.; Miller, T.W. Extreme differences in population structure and genetic diversity for three invasive congeners: Knotweeds in western North America. Biol. Invasions 2014, 16, 2127–2136. [Google Scholar] [CrossRef]
  86. Cullingham, C.I.; Miller, J.M.; Peery, R.M.; Dupuis, J.R.; Malenfant, R.M.; Gorrell, J.C.; Janes, J.K. Confidently identifying the correct K value using the ΔK method: When does K = 2? Mol. Ecol. 2020, 29, 862–869. [Google Scholar] [CrossRef]
  87. Lombaert, E.; Guillemaud, T.; Deleury, E. Biases of STRUCTURE software when exploring introduction routes of invasive species. Heredity 2018, 120, 485–499. [Google Scholar] [CrossRef] [Green Version]
  88. Santamour, F.S., Jr.; Demuth, P. Identification of Callery pear cultivars by peroxidase isozyme patterns. J. Hered. 1980, 71, 447–449. [Google Scholar] [CrossRef]
Figure 1. Mantel test of Pyrus calleryana dataset. Mantel test (a) with Mantel correlogram (b) for isolation-by-distance analyses of samples within the P. calleryana dataset. The correlation between geographic and genetic distance for the dataset was determined using 1000 permutations at α = 0.05. Distance class indices (in 100 s of km) indicate that the maximum linear distance between samples was 3600 km. Significance (α = 0.05) is reported for the Mantel index (r) and the Mantel index standardized by the year of sampling (r’).
Figure 1. Mantel test of Pyrus calleryana dataset. Mantel test (a) with Mantel correlogram (b) for isolation-by-distance analyses of samples within the P. calleryana dataset. The correlation between geographic and genetic distance for the dataset was determined using 1000 permutations at α = 0.05. Distance class indices (in 100 s of km) indicate that the maximum linear distance between samples was 3600 km. Significance (α = 0.05) is reported for the Mantel index (r) and the Mantel index standardized by the year of sampling (r’).
Life 11 00531 g001
Figure 2. Structure Bayesian clustering of Pyrus calleryana dataset. Structure Bayesian clustering analyzed with (a) the Evanno method and visualized using (b) 2 genetic clusters or (c) 4 genetic clusters. Each vertical bar represents an individual sample and the bar color indicates the probability of an individual to get assigned to one of the identified clusters.
Figure 2. Structure Bayesian clustering of Pyrus calleryana dataset. Structure Bayesian clustering analyzed with (a) the Evanno method and visualized using (b) 2 genetic clusters or (c) 4 genetic clusters. Each vertical bar represents an individual sample and the bar color indicates the probability of an individual to get assigned to one of the identified clusters.
Life 11 00531 g002
Figure 3. DAPC of Pyrus calleryana dataset. DAPC for determining the molecular variance partitioning projected using 15 Principal Components cross-checked and optimized with 1000 permutations. Eigenvalues (Insert bar graph, bottom right) represent the factor by which eigenvectors are scaled, which expresses the spatial relationship among populations at different spatial scales. The two respective axes are indicated by the alleles explaining the most of variance within the sampled population. Insert genetic distance tree, top right: Unrooted neighbor-joining tree of pairwise genetic distances (FST) [66] among the sampled P. calleryana specimens.
Figure 3. DAPC of Pyrus calleryana dataset. DAPC for determining the molecular variance partitioning projected using 15 Principal Components cross-checked and optimized with 1000 permutations. Eigenvalues (Insert bar graph, bottom right) represent the factor by which eigenvectors are scaled, which expresses the spatial relationship among populations at different spatial scales. The two respective axes are indicated by the alleles explaining the most of variance within the sampled population. Insert genetic distance tree, top right: Unrooted neighbor-joining tree of pairwise genetic distances (FST) [66] among the sampled P. calleryana specimens.
Life 11 00531 g003
Table 1. Genetic diversity indices of P. calleryana dataset on the basis of population groups using 18 gSSR loci.
Table 1. Genetic diversity indices of P. calleryana dataset on the basis of population groups using 18 gSSR loci.
PopulationN% Missing# AllelesNAeHGFISArλHeHo r ¯ dPa
China2015.288.066.403.00200.636 ***4.610.950.800.300.10 ns37
Japan115.565.724.742.40110.511 ***4.020.910.740.370.16 ***19
Korea57.782.562.411.6150.569 ***2.480.800.510.240.03 ns10
USA217.947.224.623.04210.397 ***4.060.950.720.440.11 ns22
Total5710.0411.784.544.04570.552 ***3.790.980.800.360.12 ns88
N: Number of individual samples used; % Missing: percent of genotypes missing; # Alleles: Number of alleles present in the given population group (calculated using SPAGeDi); NAe: Effective number of alleles present in the given population group [68]; H: Shannon-Wiener Index of MLG diversity [69]; G: Stoddart and Taylor’s Index of MLG diversity [70]; FIS: Individual inbreeding coefficient (*** = p < 0.0001); Ar: Allelic richness giving the expected number of alleles among 8 gene copies; λ: Simpson’s Index [71]; He: Nei’s unbiased gene diversity [66]; Ho: Observed heterozygosity; r ¯ d: Standardized index of association taking into account for the number of loci sampled [24]; Pa: Number of private alleles in each population group. Significance was assessed by using 10,000 randomizations of gene copies among all individuals, *** = p < 0.0001 and ns = p > 0.05 at 10,000 permutations.
Table 2. Population genetics indices of P. calleryana dataset on the basis of microsatellite loci.
Table 2. Population genetics indices of P. calleryana dataset on the basis of microsatellite loci.
SSR LocusNo.% MissingArHoHeDestGFSTFISRSTRISNm
PyC0061012.283.890.130.720.203.530.08 ns0.74 ****0.18 *−0.08 ns5.58
PyC008163.514.860.360.870.827.360.16 ****0.38 ****0.47 ****0.69 ***0.85
PyC0091512.285.390.250.85−0.146.13−0.02 ns0.72 ****0.18 *0.70 ****NA
PyC0121326.324.730.080.890.788.340.14 ***0.85 ****0.17 ns0.85 ****1.07
PyC013140.004.340.430.840.505.860.11 ***0.46 ****0.11 *0.36 *1.23
PyC014147.025.450.210.860.726.840.16 ****0.70 ****0.50 ****0.67 ****0.94
PyC015185.265.400.120.870.857.030.20 ****0.85 ****0.74 ****0.87 ****0.67
PyC0172021.056.480.600.920.4610.800.06 **0.19 **0.18 *0.37 *3.34
PyC018168.775.890.650.910.5110.180.05 **0.17 **−0.04 ns0.09 ns3.20
PyC020130.004.890.390.800.204.820.03 ns0.49 ****0.07 ns0.34 *5.50
PyC031208.776.220.840.920.5711.760.04 *0.13 *0.03 ns−0.38 **6.86
PyC032619.304.240.090.740.583.770.22 ****0.86 ****0.37 ***0.73 ****0.93
PyC035131.754.770.410.850.316.540.03 ns0.49 ****0.14 *0.17 ns7.91
PyC03860.004.060.930.76−0.044.12−0.01 ns0.23 ***0 ns−0.75 ****NA
PyC041814.044.080.290.810.045.040.01 ns0.65 ****0.08 ns0.22 ****7.04
PyC042715.793.570.060.810.705.080.23 ****0.89 ****0.62 ****0.82 ****0.61
PyC047612.283.640.180.760.363.970.10 *0.61 ****0.19 *0.75 ****2.36
PyC050512.282.740.030.410.071.680.10 ns0.90 ****0.09 ns0.92 ****2.74
Average12.210.044.700.340.810.426.270.09 ****0.52 ****0.33 ****0.31 ****1.79
No.: Number of alleles detected; % Missing: % of samples that failed to amplify in the given locus; Ar: Allelic richness; Ho: Observed heterozygosity (Frequency of heterozygous individuals per locus averaged over the number of sampled loci); He: Expected heterozygosity (Nei’s unbiased gene diversity [66]); Dest: Jost’s differentiation estimate [72]; G: Stoddard and Taylor index [70]; FST: a measure of sub-population genetic structure; FIS: a measure of deviations from Hardy–Weinberg equilibrium in terms of heterozygote deficiency if <0 or homozygote excess if >0; RST and RIS: Analogues of FST and FIS based on allele sizes [54]; Nm: Gene flow estimated as Nm = ¼ × [(1/FST) − 1]. Significance of the dataset was assessed by 10,000 randomization of gene copies among all individuals using **** = p < 0.0001; *** = p < 0.001; ** = p < 0.01; * = p < 0.05; ns = p > 0.05. NA: Not applicable as a negative value does not represent any gene flow data.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sapkota, S.; Boggess, S.L.; Trigiano, R.N.; Klingeman, W.E.; Hadziabdic, D.; Coyle, D.R.; Olukolu, B.A.; Kuster, R.D.; Nowicki, M. Microsatellite Loci Reveal Genetic Diversity of Asian Callery Pear (Pyrus calleryana) in the Species Native Range and in the North American Cultivars. Life 2021, 11, 531. https://doi.org/10.3390/life11060531

AMA Style

Sapkota S, Boggess SL, Trigiano RN, Klingeman WE, Hadziabdic D, Coyle DR, Olukolu BA, Kuster RD, Nowicki M. Microsatellite Loci Reveal Genetic Diversity of Asian Callery Pear (Pyrus calleryana) in the Species Native Range and in the North American Cultivars. Life. 2021; 11(6):531. https://doi.org/10.3390/life11060531

Chicago/Turabian Style

Sapkota, Shiwani, Sarah L. Boggess, Robert N. Trigiano, William E. Klingeman, Denita Hadziabdic, David R. Coyle, Bode A. Olukolu, Ryan D. Kuster, and Marcin Nowicki. 2021. "Microsatellite Loci Reveal Genetic Diversity of Asian Callery Pear (Pyrus calleryana) in the Species Native Range and in the North American Cultivars" Life 11, no. 6: 531. https://doi.org/10.3390/life11060531

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop