Next Article in Journal
Drosophila Interspecific Hybridization Causes a Deregulation of the piRNA Pathway Genes
Next Article in Special Issue
Serendipitous In Situ Conservation of Faba Bean Landraces in Tunisia: A Case Study
Previous Article in Journal
Evolutionary Dynamics of the POTE Gene Family in Human and Nonhuman Primates
Previous Article in Special Issue
Genetic Analysis of QTL for Resistance to Maize Lethal Necrosis in Multiple Mapping Populations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Characteristics of Microsatellites Mined from Transcriptome Data and the Development of Novel Markers in Paeonia lactiflora

1
College of Landscape Architecture, Beijing Forestry University, Beijing 100083, China
2
Management Office of Caozhou Peony Garden, Heze 274000, Shandong, China
3
Beijing Key Laboratory of Ornamental Plants Germplasm Innovation & Molecular Breeding, Beijing Forestry University, Beijing 100083, China
4
National Engineering Research Center for Floriculture, Beijing Forestry University, Beijing 100083, China
5
Beijing Laboratory of Urban and Rural Ecological Environment, Beijing Forestry University, Beijing 100083, China
*
Author to whom correspondence should be addressed.
Genes 2020, 11(2), 214; https://doi.org/10.3390/genes11020214
Submission received: 28 January 2020 / Revised: 17 February 2020 / Accepted: 17 February 2020 / Published: 19 February 2020
(This article belongs to the Special Issue Genetic Diversity Assessment and Marker-Assisted Selection in Crops)

Abstract

:
The insufficient number of available simple sequence repeats (SSRs) inhibits genetic research on and molecular breeding of Paeonia lactiflora, a flowering crop with great economic value. The objective of this study was to develop SSRs for P. lactiflora with Illumina RNA sequencing and assess the role of SSRs in gene regulation. The results showed that dinucleotides with AG/CT repeats were the most abundant type of repeat motif in P. lactiflora and were preferentially distributed in untranslated regions. Significant differences in SSR size were observed among motif types and locations. A large number of unigenes containing SSRs participated in catalytic activity, metabolic processes and cellular processes, and 28.16% of all transcription factors and 21.74% of hub genes for inflorescence stem straightness were found to contain SSRs. Successful amplification was achieved with 89.05% of 960 pairs of SSR primers, 55.83% of which were polymorphic, and most of the 46 tested primers had a high level of transferability to the genus Paeonia. Principal component and cluster dendrogram analyses produced results consistent with known genealogical relationships. This study provides a set of SSRs with abundant information for future accession identification, marker-trait association and molecular assisted breeding in P. lactiflora.

1. Introduction

Herbaceous peony, which has many varieties with distinct flower types and colors, provides great commercial benefits in the form of cut flowers and potted plants. It has a long juvenile period before flowering, which slows the development of new cultivars with specific and stable characteristics by traditional hybridization breeding [1]. Based on appropriate DNA markers, molecular-assisted breeding can be employed to select a target genotype and detect whether hybrids have the expected trait at an early stage; thus, it improves breeding efficiency and accuracy and saves time, labor and material resources [2]. The molecular breeding of herbaceous peony is not currently well developed due to a lack of foundational research; hence a large number of highly polymorphic and stable molecular markers of herbaceous peony should be developed to further identify associations with target traits.
Microsatellites, also known as simple sequence repeats (SSRs), are widely used for plant fingerprinting, genetic diversity assessment and association analysis between target traits and quantitative trait loci (QTLs) [3,4,5] due to their abundance in the genome, high polymorphism, codominant inheritance and good reproducibility [6,7]. SSRs can be developed from DNA or complementary DNA (cDNA) reverse transcribed from RNA [8]. In four previous studies on herbaceous peony, there were fewer than 16 SSR primers in each SSR-enriched genomic library or magnetic bead enrichment dataset [9,10,11]. Previous researchers synthesized 384 pairs of SSR primers from barcoded Illumina sequencing libraries of several species from the genus Paeonia; the researchers utilized 12 pairs of these SSRs and nine other SSR pairs to successfully identify 93 genotypes of Paeonia [12]. The numbers and repeat motif types of expressed sequence tag SSRs (EST-SSRs) from two sets of transcriptome data were previously reported by bioinformatics analysis, but additional PCR experiments or further validation were not performed [13,14]. Moreover, based on the transferability of SSRs among congeners of dicotyledonous plants [15], several primers from Paeonia were selected and used to successfully identify cultivars of Paeonia lactiflora [16]. Therefore, neither the number nor the application range of SSRs in herbaceous peony was not sufficient in these previous studies.
SSRs within genes or ESTs are more likely than genic SSRs obtained from SSR-enriched libraries or random DNA sequences to be effectively linked to target traits [17]. In Populus tomentosa, the genic SSRs selected from candidate genes related to wood formation were successfully used in family-based linkage mapping [18]. Twenty-four SSR primers of Pisum sativum were successfully mapped to several existing linkage groups [19]. Similar methods were also used in raspberry, in which SSRs were associated with several developmental traits [20]. For Paeonia rockii, SSR markers were used to perform association mapping, and 2.68–23.97% of flowering trait variance was explained [21]. Moreover, a genetic linkage map of tree peony covering five linkage groups was constructed by 124 EST-SSR primers [22]. Thus, development SSRs from herbaceous peony transcriptome may be effective for future use. Further studies showed the genomic distribution of SSRs is nonrandom. SSRs in genes may influence gene transcription or translation and gene activity [6,23], and recent studies showed a higher abundance of SSRs in response to environmental stress [24]. The polymorphism levels and potential functions of SSRs differ among the 5′ untranslated region (5′ UTR), the 3′ UTR and coding sequences (CDs) are different; SSRs in 5′ UTR may affect transcription or translation, SSRs in CDS may inactivate or activate genes, or truncate proteins, and SSRs in 3′ UTR may cause silencing or slippage [25]. However, no such information has been reported in herbaceous peony, which is not conducive to developing desirable SSR markers.
In this study, we mined SSRs from herbaceous peony transcriptome data and analyzed the distribution and location of the SSRs and the function of unigenes containing them. Initially, a total of 960 pairs of SSR primers were developed and amplified in eight cultivars from a core collection to initially validate the polymorphism level. Then, 46 pairs of primers were used to analyze transferability among nine species in Paeonia. Finally, we constructed a phylogenetic tree containing seven species and 24 varieties. This study provides a number of efficient and informative SSR primers for future molecular-assisted breeding of herbaceous peony.

2. Materials and Methods

2.1. SSR Identification, Annotation from Transcriptome Data and SSR Primer Design

All the SSR sequences used in this study were obtained from transcriptome data from inflorescence stems of P. lactiflora ‘Da Fugui’ and ‘Chui Touhong’ at five developmental stages (i.e., stages representing every seven days from stem elongation to flowering). The transcriptome data have been deposited in the Sequence Read Archive (SRA) database as described previously (accession number: PRJNA528693) [26,27] and were assembled by Trinity 3.0. MISA-web (http://pgrc.ipk-gatersleben.de/misa/) was used to search for SSRs in the unigenes [28]. The parameters were set as follows: dinucleotide (Di-) repeats had to be repeated at least 6 times, trinucleotide (Tri-) repeats had to be repeated at least five times, tetranucleotide (Tetra-) repeats had to be repeated at least four times, pentanucleotide (Penta-) repeats had to be repeated at least four times, and hexanucleotide (Hexa-) repeats had to be repeated at least four times. Interruptions were set to 100 to merge two SSR sequences into one SSR when the distance was shorter than 100 bp. Notably, mononucleotide repeats were not analyzed in this study. To identify possible SSR functions for future use, unigenes that contained SSRs were mapped to terms in the Gene Ontology (GO) database, gene numbers were calculated for each term, and the Kyoto Encyclopedia of Genes and Genomes (KEGG) database was used for pathway analysis. TransDecoder v3.0.1 (http://transdecoder.github.io/) was used to identify candidate coding regions, dividing transcript sequences into 5′ UTR, CDS and 3′ UTR sections. Primer 3 (http://bioinfo.ut.ee/primer3) [29] was used to design primers on both sides of the microsatellite sequences, following previous product size, primer length, GC content and annealing temperature principles [8].

2.2. Plant Materials

To evaluate the specificity and polymorphism of primers, fresh young leaves of P. lactiflora ‘Qihua Lushuang’, ‘Jinxing Shanshuo’, ‘Lian Tai’, ‘Fu Shi’, ‘Da Fugui’, ‘Dongfang Shaonu’, ‘Yangfei Chuyu’ and ‘Hong Fushi’ were obtained from Caozhou Peony Garden, Heze, Shandong Province, China, in April 2019. To evaluate transferability, fresh leaves of seven species of Paeonia were collected from different habitats in China: i.e., P. lactiflora was collected from Xilin Gol, Inner Mongolia (115°13′–117°06′ E, 43°02′–44°52′ N), Paeonia emodi and Paeonia sterniana were collected from Tibet (84°35′–86°20′ E, 28°3′–29°3′ N), Paeonia obovata was collected from Pingquan, Hebei Province (118°21′–119°15′ E, 40°24′–40°40′ N), Paeonia anomala was collected from Altay city (85°31′–91°04′ E, 45°00′–49°10′ N), Paeonia intermedia was collected from Yumin, Xinjiang Province (82°12′–83°30′ E, 45°24′–46°3′ N), and Paeonia veitchii was collected from Lanzhou, Gansu Province (103°40′ E, 36°03′ N). Twenty-four cultivars of P. lactiflora used for phylogenetic analysis were also collected from Caozhou Peony Garden. These leaves were bagged with silica gel and transported to Beijing, ground to powder with liquid nitrogen and stored at −80 °C in the laboratory of the National Engineering Research Center for Floriculture, Beijing, China. A DNAsecure plant kit (TIANGEN Biotech, Co., Ltd., Beijing, China) was used for DNA extraction. The quality and quantity of total DNA were estimated by a NanoDrop 2000 spectrophotometer (Thermo Scientific, Waltham, MA, USA). DNA was diluted to 30 ng/μL in preparation for polymerase chain reaction (PCR).

2.3. SSR Primer Evaluation in Eight Cultivars

A total of 960 pairs of primers were selected for synthesis (Ruibiotech, Co., Ltd., Beijing, China). To improve the efficiency of primer fluorescence labeling, the thermocycler amplification protocol was conducted in two rounds. First, the primers synthesized for the DNA of the eight cultivars were used for amplification. The 10 μL PCR mixture consisted of 0.1 μL of 10 μmol/μL forward primer containing the M13(-21) tail at its 5′ end and reverse primer, 1 μL of 30 ng/μL DNA, 5 μL of 0.1 U/μL 2×Taq PCR MasterMix (containing 0.05 units/μL Taq DNA polymerase (recombinant), 4 mM MgCl2 and 0.4 mM dNTPs, Aidlab Biotechnologies Co., Ltd., Beijing, China) and 3.8 μL of ddH2O. After an initial denaturation step of 95 °C for 5 min, 20 cycles of 95 °C for 30 s, 60 °C for 30 s and 72 °C for 30 s, as well as extension at 72 °C for 10 min, were performed. Second, to efficiently and economically analyze the length of PCR products, fluorescently labeled (i.e., FAM, HEX, TAMRA or ROX) M13(-21) universal primers were added to the PCR mix [30]. The 10 μL PCR mixture contained 0.15 μL of 10 μmol/μL M13(-21) universal primer and reverse primer, 2 μL of the PCR product from the first round, 5 μL of 0.1 U/μL 2×Taq PCR MasterMix and 2.7 μL of ddH2O. In the thermocycler, amplification was performed at 95 °C for 5 min and followed by 35 cycles of 95 °C for 30 s, 52 °C for 30 s and 72 °C for 30 s, as well as extension at 72 °C for 10 min. After 3% agarose gel electrophoresis, the amplified loci of the final PCR product were detected by a 3730xl DNA Analyzer with 96 capillaries (Applied Biosystems, Foster City, CA, USA) and sized with GS500 LIZ. The amplified loci were analyzed by GeneMarker V2.2.0.

2.4. Phylogenetic Analysis of Seven Species of Paeonia and 24 Cultivars of P. lactiflora

As shown in Table 1, the 46 forward primers showing the most abundant polymorphic loci were resynthesized by adding a fluorescent label to the 5′ tail. The 10 μL PCR mixture consisted of 0.2 μL of 10 μmol/μL forward primer and reverse primer, 1 μL of 30 ng/μL DNA template, 5 μL of 0.1 U/μL 2×Taq PCR MasterMix and 3.6 μL of ddH2O. PCR was performed at 95 °C for 5 min, followed by 35 cycles of 94 °C for 30 s, annealing at an appropriate temperature (as shown in Table S3) for 30 s, and 72 °C for 30 s and a final extension at 72 °C for 7 min.
Polymorphism information content (PIC) was calculated by the Microsatellite Toolkit according to the methods of a previous study [31]. GenAlEx 6.51b2 was used for genetic analysis and principal coordinates analysis (PCoA), and parameters including the number of alleles (Na), tests for Hardy-Weinberg equilibrium, the number of effective alleles (Ne), Shannon’s information index (I), heterozygosity (Ho), heterozygosity (He) and PIC were calculated [32,33]. The frequency of null alleles at loci were estimated by maximum likelihood method [34]. A dendrogram of the accessions of Paeonia was generated according to Bruvo’s distance with 1000 bootstrap replicates by the R package poppr [35].

3. Results

3.1. Numbers and Distribution of SSRs in Transcriptome Data

A total of 122,670 unigenes with a total length of 1.06E+08 bp were searched by MISA-web, and 10,468 SSRs (including 825 compound formations) were found. These SSRs were distributed among 8837 unigenes (7.20%), 1321 of which contained more than one SSR. As shown in Figure 1A, Di- repeats were the most abundant (63.52%) type of repeat motif, followed by Tri- repeats (22.70%), Tetra- repeats (7.73%) and all the other types of repeat motifs (6.05%). Di- repeats were of four types, namely, AG/CT (36.55%), AT/AT (14.78%), AC/GT (11.97%) and CG/CG (0.23%). The number of each type of Di- repeat (except CG/CG) exceeded the number of SSRs. TransDecoder analysis showed that these SSRs involved 4482 CDSs and 3958 unigenes. As shown in Figure 1B, the positions of many motifs were not putative, and in known positions of unigenes, different motifs exhibited distinct preferences. Di- repeats were mostly located in the 5′ UTR, followed by the 3′ UTR. In CDSs, Tri- repeats were the most abundant motif, and Tetra- repeats were mostly located in the 3′ UTR and 5′ UTR.
SSR size was analyzed as shown in Figures S1 and S2 and Table S1. For each type of repeat motif, the smallest SSRs were the most abundant, with sizes of 12 for Di- repeats, 15 for Tri- repeats, 16 for Tetra- repeats, 20 for Penta- repeats and 24 for Hexa- repeats, all of which had distinct discrete values with the increase in repeat motifs. Different types of repeat motifs exhibited significantly distinct sizes according to pairwise comparisons (Wilcoxon rank sum test, with Bonferroni adjustment). For all the SSRs, the most common size was 12 for Di- repeats (frequency of 1894), 15 for Tri- repeats (frequency of 1309) and 14 for Di- repeats (frequency of 1200). For each location (including the unknown positions) of SSRs, a large number of discrete values were observed to have high repetitions. Significant differences in the sizes of SSRs among the 5′ UTR, 3′ UTR and CDSs appeared according to pairwise comparisons (Wilcoxon rank sum test, with Bonferroni adjustment). Only the size of SSRs in the 5′ UTR differed from that in multiple regions. The size of SSRs in unknown regions was not obviously different from that in the 3′ UTR and multiple regions.

3.2. Annotation of the Unigenes with SSRs

To understand the potential functions of the unigenes containing SSRs, we classified these unigenes using GO annotation. A total of 8837 unigenes were blasted against a protein database and annotated with GO terms. The results showed that 8522 unigenes (96.44%) were involved in three functional groups and 42 putative processes or functions, as shown in Figure 2. These unigenes participated in 17 types of biological processes, 11 types of molecular functions and 14 types of cellular components. Catalytic activity (972, 11.40% of the total blasted genes), metabolic process (959, 11.25%) and cellular process (930, 10.91%) were the three most abundant terms for putative gene functions. Fewer than ten unigenes were involved in each of the two types of biological processes, six types of molecular functions and four types of cellular components, and rhythmic process (1), transcription factor (TF) activity and protein binding (1) and extracellular region part (1) were the terms assigned the fewest unigenes containing SSRs.
To further categorize the unigenes, KEGG annotation was used. As shown in Figure 3, unigenes with SSRs were involved in 126 pathways, which were divided into five classes and 18 subclasses. In total, unigenes participating in metabolism were most abundant. At the subclass level, the global and overview pathway (38.22%) had the largest number of unigenes, followed by the transcription pathway (11.44%), and the carbohydrate metabolism pathway (9.79%). The top five unigenes belonged to global and overview subgroups, and they were involved in metabolic pathways (15.38%), biosynthesis of secondary metabolites (8.63%), biosynthesis of antibiotics (4.11%), microbial metabolism in diverse environments (3.98%) and carbon metabolism (2.73%).
As shown in Table S2, the TFs containing SSRs accounted for 28.16% of the total TFs (1218) in the transcriptome of herbaceous peony, including 51 kinds of TF families (e.g., ERF, MYB, MYB related and ARF). Regarding the hub genes (46) for inflorescence stem straightness, as described in a previous study [27], 21.74% of the genes contained SSRs and were involved in lignin monolignol biosynthesis (4CL1, CCoAOMT2, HST and CAD2), xylan synthesis and metabolic process (IRX-15 L), auxin signaling transduction (IAA26, IAA31 and SAUR20) and lateral organ boundary domain TF (LBD15 and LBD36) pathways.

3.3. Initial Amplification of SSR Primers

Primer3 was used to design primers for the 9643 SSRs. Appropriate primers could not be designed for 2369 of these SSRs due to short or missing flanking sequences. A total of 7274 pairs of primers were designed, 3721 of which were able to identify CDSs. We further selected 960 pairs of primers considering SSR types and locations for synthesis and used them for amplification in eight distinct cultivars of P. lactiflora. As shown in Figure S3, 89.05% of the primers resulted in successful amplification in these cultivars, and 55.83% (i.e., 62.72% of the total with successful amplification) of the SSR marker primers had polymorphic amplification products. The total number of polymorphic loci decreased with an increasing number of alleles per locus. Appropriately 30% of the primers amplified only two or three types of products, and almost 16% of the primers amplified more than five alleles in the eight DNA templates.

3.4. Polymorphism in P. lactiflora

The 46 primers (listed in Table S3) with the most abundant amplified loci were used to reveal the information and transferability of the primers, as shown in Table 1.
Forty-four pairs of primers were amplified in the accessions; however, TA564 (172–201 bp) and TA566 (167–185 bp) were successful in only 26 and 25 accessions, respectively. The product size range varied among accessions. The products of T852 (209 bp) and TA144 (100 bp) had the maximum size difference among accessions, and T163 (14 bp), T237 (14 bp) and TA464 (14 bp) presented the smallest size differences among accessions. A total of 472 different alleles (Na) were amplified. The Na ranged from 6 to 16, with an average value of 10.26 ± 0.36; the Ne ranged from 2.47 to 9.96, with an average value of 5.26 ± 0.21. I varied from 1.39 to 2.41, with an average value of 1.88. The ranges of Ho and He were 0.13–0.97 (the average was 0.67) and 0.59–0.90 (the average was 0.80), respectively. SSR typing data of seven locus followed Hardy-Weinberg equilibrium, and the frequency of null alleles in other locus were 0.000–0.391 (the average was 0.11). The PIC ranged from 0.57–0.89 with an average value of 0.77.

3.5. Transferability of SSR Markers among Paeonia Species

PCoA of 31 accessions was conducted according to the amplified alleles, as shown in Figure 4A. Eigen values by axis and sample eigen vectors are shown in Table S4. A total of 30 dimensions were extracted, and dimension 1 (10.7%) and dimension 2 (7.1%) represented 17.8% of the total information. The points representing the 24 cultivars of P. lactiflora were close to each other and separated from points of the other seven species. Of these species, P. lactiflora was nearest to the 24 cultivars in the PCoA plot, and P. intermedia was significantly separated from all the accessions. A cluster dendrogram was drawn according to Bruvo’s distances calculated by the amplified alleles, as shown in Figure 4B. All the accessions were divided into two groups, and the species P. lactiflora and its cultivars were tightly clustered and separated from the other Paeonia species, which was consistent with the results of PCoA. Furthermore, at a height of 0.85, P. obovata and P. emodi were further clustered and separate from the other four species. Among the cultivars of P. lactiflora, ‘Yinxian Xiuhongpao’ had the maximum distance from the other cultivars and was separated at a height of 0.65.

4. Discussion

Next-generation sequencing makes it possible to develop microsatellites efficiently and inexpensively [8]. In this study, we identified a total of 10,468 SSRs, covering 7.20% of the transcripts assembled using our transcriptome data. This method was significantly more convenient and effective than the use of SSR-enriched genomic libraries or magnetic bead enrichment, which were the primary methods used in previous herbaceous peony studies [9,10,11,36,37]. The coverage of SSRs in ESTs reported in the present study was higher than that reported in five cereals (average of 3.2%) [38] and similar (6.6%) to that generated by Trinity for P. lactiflora ‘Hang Baishao’ [13]. The distribution of motif types varies among plants. The most frequent motif type for Parrotia subaequalis was Di- [39], while the most frequent motif type for Lychnis kiusiana and Dendrocalamus hamiltonii was Tri- [40,41]. In this study, Di- repeats were the most abundant motif (mononucleotides were not considered) in P. lactiflora, and AG/CT accounted for 36.55% of all SSRs, followed by AT/AT (14.78%) and AC/GT (11.97%). The observation that AG/CT was the most frequent repeat was consistent with the finding of a previous study in the genus Paeonia regardless of assemble methods [13], while the proportions of TC/GA and AC/GT repeats significantly differed [14,42]. Differences in the transcriptomic SSR motifs can explain the relatively low transferability of SSR primers (approximately 26%) from the genus Paeonia to P. lactiflora [16].
SSR size varied significantly among unigene locations and motif types in this study. Furthermore, our results suggested distinct preferences in the distribution of motifs among different gene parts; in annotated positions, a large number of Di- repeat motifs were distributed in the 5′ UTR and 3′ UTR, and most of the Tri- repeat motifs were distributed in CDSs. Polymorphism level is affected by location; notably, a large proportion of SSRs in the 3′ UTR were polymorphic in Hordeum vulgare [43]. Further studies in grape showed that the most polymorphic SSR position differed at three levels, that is, among cultivars, among cultivars and species, and among species and genera [25]. As shown in Figure S3, in the initial screening of 960 SSR primers in eight cultivars, we found that at the cultivar level, SSRs from the 5′ UTR (64.05%) were the most polymorphic, followed by those from the 3′ UTR (60.61%). This result suggested that the polymorphism level of SSR locations was related to species.
SSRs from the transcribed sequence may be directly related to phenotypic variation and thus related to functional trait. SSR alleles associated with biotic or abiotic stress, such as heat, cold, salt and resistant to multiple diseases have been reported [44,45,46,47]. SSRs from specific organ are likely to associate with corresponding morphological traits; SSRs obtained from flower bud transcriptome in P. rockii have been demonstrated significantly associating with flower colors and shapes [21]. In this study, SSRs were investigated from transcriptome that was obtained from two cultivars with distinct straightness of inflorescence stem, and a large quantity of unigenes with SSRs were annotated with the catalytic activity, metabolic process and cellular process terms. Furthermore, 28.16% of all TFs and 21.74% of the hub genes for inflorescence stem straightness contained SSRs. We speculate that these SSRs likely associated with straightness characteristics of the herbaceous peony inflorescence stem, while association analysis and QTL mapping are needed in further study.
In our experiment, 89.05% of 960 pairs of primers were validated by PCR, and 55.83% of the primers were polymorphic, which was approximately the percentage (59.90%) previously reported in peony [10], higher than the percentage (36.67%) in Amentotaxus argotaenia [48], and lower than the percentage for SSR markers (77.2%) generated from the soybean genome [49]. These amplification differences may be due to the number of individuals used for amplification or locus mutations (e.g., insertions, deletions and translocations) among species or cultivars [50]. To identify the reason, more individuals should be subjected to PCR amplification, and cloning experiments and sequencing should be carried out.
The mean Na in this study was 10.26, and Ho and He were 0.67 and 0.80, respectively, which were higher than the Ho and He reported in previous studies on tree peony and herbaceous peony [9,10,51]. The mean PIC value was 0.77, showing a high level of high informativeness [52]; compared to mean PIC value (0.4149–0.678) revealed by previous SSR development [9,10,36,37], our results were significantly higher, suggesting our contribution for future effective genetic analysis or QTL mapping of Paeonia with fewer SSRs. Previous studies suggested that the presence of null alleles is common, and it has influence on evaluation of genetic diversity of population, even causes misunderstanding in parentage analysis [53,54]. Literature showed that the frequencies of null alleles were almost fewer than 0.40 and most of them were fewer than 0.20 [55]. Our results showed the frequencies of null alleles of 22 SSRs were less than 0.08 (or no presence), and only that of four SSRs were between 0.20 and 0.40. The future use of these SSRs should carefully consider the influence of null alleles according to research objective and choose the appropriate SSRs.
EST-SSRs developed for one species can be transferred to related species, with transferability varying depending on the plant and SSR source. In Magnolia wufengensis and Elymus sibiricus, the transferability of EST-SSRs to related species was 50–68.1% and 49.1%, respectively. In SSRs from candidate genes of Oryza sativa, transferability ranged from 70.37% to 77.78% according to different complexes [56]. In this study, 52.17% of the 46 pairs of SSR primers selected from the initial screening could be completely transferred to seven species of the genus Paeonia, 39.13% of the pairs had high transferability (six or five of seven accessions were successfully amplified), and 4.35% of the pairs had moderate transferability and could be amplified in four of seven species.
The diversity of dimensions extracted from PCoA and the low explanatory power of one dimension suggested that the genetic background of the Paeonia accessions involved in this study varied greatly. Combining the PCoA plot and the dendrogram of 31 accessions, the genetic relationships between these accessions were almost consistent with their recognized morphological classification [57,58]. These SSR markers can be used in genetic variance analysis and to initially evaluate the value of breeding parents selected according to genetic distance in the genus Paeonia.

5. Conclusions

In this study, a large quantity of informative SSRs were conveniently identified from transcriptome data of P. lactiflora, and the distribution and location of motifs were defined. SSR containing genes associated with TFs and inflorescence stem straightness were identified, providing a foundation for future marker-trait association research. To the best of our knowledge, this is the first study to comprehensively reveal the characteristics and functional annotations of EST-SSRs in P. lactiflora. In future studies, more herbaceous peony accessions should be tested to further evaluate the polymorphism of markers, and more functional markers potentially associated with traits should be developed to advance the molecular breeding of P. lactiflora.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/11/2/214/s1, Figure S1: Size distribution of SSRs with different motifs and different unigene positions. (A) SSR size of different motifs. Considering the data coverage and to improve the readability of the graph, the figure shows 99% of the data. The distribution of all data is shown in Figure S2. Size differences between motifs were analyzed with pairwise comparisons using the Wilcoxon rank sum test (after Bonferroni correction), and all the outputs were less than 2.2 × 10−16. (B) Sizes of SSRs in different unigene positions. The figure shows 98% of the data. The distribution of all data is shown in Table S1. Size differences between positions were compared using the same method as in (A), and the figure shows combinations with p-values less than 0.05, Figure S2: Size distribution of each type of repeat motif in all data, Figure S3: Polymorphism rate of primers designed for different positions of SSRs in unigenes, Table S1: Numbers of SSRs with distinct sizes in different regions, Table S2: Count of SSRs in TFs, Table S3: Sequence, position, repeat motif, annotation, annealing temperature and amplified product size of 46 SSR primers, Table S4: Eigen values by axis and sample eigen vectors.

Author Contributions

Conceptualization, Y.W. and Y.L.; methodology, Y.W. and M.Z.; software, Y.W.; validation, Y.W., M.Z. and Y.Z.; formal analysis, Y.W.; investigation, A.H. and Y.Z.; resources, A.H. and Y.Z.; data curation, M.Z.; writing—original draft preparation, Y.W.; writing—review and editing, M.Z., A.H., Y.Z., and Y.L.; visualization, Y.W.; supervision, Y.L.; project administration, Y.L.; funding acquisition, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Beijing Municipal Science & Technology Commission, grant number D161100001916004.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kamenetsky, R.; Dole, J. Herbaceous peony (Paeonia): Genetics, physiology and cut flower production. Floric. Ornam. Biotechnol. 2012, 6, 62–77. [Google Scholar]
  2. Collard, B.C.; Mackill, D.J. Marker-assisted selection: An approach for precision plant breeding in the twenty-first century. Philos. Trans. R. Soc. B Biol. Sci. 2008, 363, 557–572. [Google Scholar] [CrossRef] [Green Version]
  3. Agarwal, M.; Shrivastava, N.; Padh, H. Advances in molecular marker techniques and their applications in plant sciences. Plant Cell Rep. 2008, 27, 617–631. [Google Scholar] [CrossRef]
  4. Bhattarai, G.; Mehlenbacher, S.A. In silico development and characterization of tri-nucleotide simple sequence repeat markers in hazelnut (Corylus avellana L.). PLoS ONE 2017, 12, e0178061. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Nie, G.; Tang, L.; Zhang, Y.; Huang, L.; Ma, X.; Cao, X.; Pan, L.; Zhang, X.; Zhang, X. Development of SSR markers based on transcriptome sequencing and association analysis with drought tolerance in perennial grass Miscanthus from China. Front. Plant Sci. 2017, 8, 801. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Varshney, R.K.; Graner, A.; Sorrells, M.E. Genic microsatellite markers in plants: Features and applications. Trends Biotechnol. 2005, 23, 48–55. [Google Scholar] [CrossRef] [PubMed]
  7. Liu, S.; An, Y.; Li, F.; Li, S.; Liu, L.; Zhou, Q.; Zhao, S.; Wei, C. Genome-wide identification of simple sequence repeats and development of polymorphic SSR markers for genetic studies in tea plant (Camellia sinensis). Mol. Breed. 2018, 38, 59. [Google Scholar] [CrossRef]
  8. Taheri, S.; Abdullah, T.L.; Yusop, M.R.; Hanafi, M.M.; Sahebi, M.; Azizi, P.; Shamshiri, R.R. Mining and development of novel SSR markers using next generation sequencing (NGS) data in plants. Molecules 2018, 23, 399. [Google Scholar] [CrossRef] [Green Version]
  9. Cheng, Y.; Kim, C.-H.; Shin, D.-I.; Kim, S.-M.; Koo, H.-M.; Park, Y.-J. Development of simple sequence repeat (SSR) markers to study diversity in the herbaceous peony (Paeonia lactiflora). J. Med. Plants Res. 2011, 5, 6744–6751. [Google Scholar] [CrossRef]
  10. Li, L.; Cheng, F.Y.; Zhang, Q.X. Microsatellite markers for the Chinese herbaceous peony Paeonia lactiflora (Paeoniaceae). Am. J. Bot. 2011, 98, e16–e18. [Google Scholar] [CrossRef]
  11. Sun, J.; Yuan, J.; Wang, B.; Pan, J.; Zhang, D. Development and characterization of 10 microsatellite loci in Paeonia lactiflora (Paeoniaceae). Am. J. Bot. 2011, 98, e242–e243. [Google Scholar] [CrossRef] [PubMed]
  12. Gilmore, B.; Bassil, N.; Nyberg, A.; Knaus, B.; Smith, D.; Barney, D.L.; Hummer, K. Microsatellite marker development in peony using next generation sequencing. J. Am. Soc. Hortic. Sci. 2013, 138, 64–74. [Google Scholar] [CrossRef] [Green Version]
  13. Zhang, J.; Wu, Y.; Li, D.; Wang, G.; Li, X.; Xia, Y. Transcriptomic analysis of the underground renewal buds during dormancy transition and release in ‘Hangbaishao’Peony (Paeonia lactiflora). PLoS ONE 2015, 10, e0119118. [Google Scholar]
  14. Ma, Y.; Cui, J.; Lu, X.; Zhang, L.; Chen, Z.; Fei, R.; Sun, X. Transcriptome analysis of two different developmental stages of Paeonia lactiflora seeds. Int. J. Genom. 2017. [Google Scholar] [CrossRef] [Green Version]
  15. Barbara, T.; PALMA-SILVA, C.; Paggi, G.M.; Bered, F.; Fay, M.F.; Lexer, C. Cross-species transfer of nuclear microsatellite markers: Potential and limitations. Mol. Ecol. 2007, 16, 3759–3767. [Google Scholar] [CrossRef]
  16. Zhang, J.; Liu, A.; Zhang, S.; Xie, Y.; Yan, L. Using the SSR with fluorescent labeling to establish SSR molecular ID code for cultivars of the Chinese herbaceous peony. J. Beijing For. Univ. 2016, 38, 101–109. [Google Scholar]
  17. Dutta, S.; Kumawat, G.; Singh, B.P.; Gupta, D.K.; Singh, S.; Dogra, V.; Gaikwad, K.; Sharma, T.R.; Raje, R.S.; Bandhopadhya, T.K. Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [Cajanus cajan (L.) Millspaugh]. BMC Plant Biol. 2011, 11, 17. [Google Scholar] [CrossRef] [Green Version]
  18. Du, Q.; Gong, C.; Pan, W.; Zhang, D. Development and application of microsatellites in candidate genes related to wood properties in the Chinese white poplar (Populus tomentosa Carr.). DNA Res. 2012, 20, 31–44. [Google Scholar] [CrossRef] [Green Version]
  19. Mishra, R.K.; Gangadhar, B.H.; Nookaraju, A.; Kumar, S.; Park, S.W. Development of EST-derived SSR markers in pea (Pisum sativum) and their potential utility for genetic mapping and transferability. Plant Breed. 2012, 131, 118–124. [Google Scholar] [CrossRef]
  20. Woodhead, M.; McCallum, S.; Smith, K.; Cardle, L.; Mazzitelli, L.; Graham, J. Identification, characterisation and mapping of simple sequence repeat (SSR) markers from raspberry root and bud ESTs. Mol. Breed. 2008, 22, 555–563. [Google Scholar] [CrossRef]
  21. Wu, J.; Cheng, F.; Cai, C.; Zhong, Y.; Jie, X. Association mapping for floral traits in cultivated Paeonia rockii based on SSR markers. Mol. Genet. Genom. 2017, 292, 187–200. [Google Scholar] [CrossRef] [PubMed]
  22. Guo, Q.; Guo, L.-L.; Zhang, L.; Zhang, L.-X.; Ma, H.-L.; Guo, D.-L.; Hou, X.-G. Construction of a genetic linkage map in tree peony (Paeonia Sect. Moutan) using simple sequence repeat (SSR) markers. Sci. Hortic. 2017, 219, 294–301. [Google Scholar]
  23. Li, Y.C.; Korol, A.B.; Fahima, T.; Beiles, A.; Nevo, E. Microsatellites: Genomic distribution, putative functions and mutational mechanisms: A review. Mol. Ecol. 2002, 11, 2453–2465. [Google Scholar] [CrossRef] [PubMed]
  24. Srivastava, S.; Avvaru, A.K.; Sowpati, D.T.; Mishra, R.K. Patterns of microsatellite distribution across eukaryotic genomes. BMC Genom. 2019, 20, 153. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Scott, K.D.; Eggler, P.; Seaton, G.; Rossetto, M.; Ablett, E.M.; Lee, L.S.; Henry, R. Analysis of SSRs derived from grape ESTs. Theor. Appl. Genet. 2000, 100, 723–726. [Google Scholar] [CrossRef]
  26. Wan, Y.; Hong, A.; Zhang, Y.; Liu, Y. Selection and validation of reference genes of Paeonia lactiflora in growth development and light stress. Physiol. Mol. Biol. Plants 2019, 25, 1097. [Google Scholar] [CrossRef]
  27. Wan, Y.; Zhang, M.; Hong, A.; Lan, X.; Yang, H.; Liu, Y. Transcriptome and weighted correlation network analyses provide insights into inflorescence stem straightness in Paeonia lactiflora. Plant Mol. Biol. 2020, 102, 239–252. [Google Scholar] [CrossRef]
  28. Beier, S.; Thiel, T.; Münch, T.; Scholz, U.; Mascher, M. MISA-web: A web server for microsatellite prediction. Bioinformatics 2017, 33, 2583–2585. [Google Scholar] [CrossRef] [Green Version]
  29. Untergasser, A.; Cutcutache, I.; Koressaar, T.; Ye, J.; Faircloth, B.C.; Remm, M.; Rozen, S.G. Primer3—New capabilities and interfaces. Nucleic Acids Res. 2012, 40, e115. [Google Scholar] [CrossRef] [Green Version]
  30. Schuelke, M. An economic method for the fluorescent labeling of PCR fragments. Nat. Biotechnol. 2000, 18, 233. [Google Scholar] [CrossRef]
  31. Lovin, D.D.; Washington, K.O.; deBruyn, B.; Hemme, R.R.; Mori, A.; Epstein, S.R.; Harker, B.W.; Streit, T.G.; Severson, D.W. Genome-based polymorphic microsatellite development and validation in the mosquito Aedes aegypti and application to population genetics in Haiti. BMC Genom. 2009, 10, 590. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Peakall, R.; Smouse, P.E. Genalex 6: Genetic analysis in Excel. Population genetic software for teaching and research. Mol. Ecol. Notes 2006, 6, 288–295. [Google Scholar] [CrossRef]
  33. Peakall, R.; Smouse, P.E. GenAlEx 6.5: Genetic analysis in Excel. Population genetic software for teaching and research—An update. Bioinformatics 2012, 28, 2537–2539. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Kalinowski, S.T.; Taper, M.L. Maximum likelihood estimation of the frequency of null alleles at microsatellite loci. Conserv. Genet. 2006, 7, 991–995. [Google Scholar] [CrossRef]
  35. Kamvar, Z.N.; Tabima, J.F.; Grünwald, N.J. Poppr: An R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ 2014, 2, e281. [Google Scholar] [CrossRef] [Green Version]
  36. Yu, H.-P.; Cheng, F.-Y.; Zhong, Y.; Cai, C.-F.; Wu, J.; Cui, H.-L. Development of simple sequence repeat (SSR) markers from Paeonia ostii to study the genetic relationships among tree peonies (Paeoniaceae). Sci. Hortic. 2013, 164, 58–64. [Google Scholar] [CrossRef]
  37. Ji, L.; da Silva, J.A.T.; Zhang, J.; Tang, Z.; Yu, X. Development and application of 15 novel polymorphic microsatellite markers for sect. Paeonia (Paeonia L.). Biochem. Syst. Ecol. 2014, 54, 257–266. [Google Scholar] [CrossRef]
  38. Kantety, R.V.; La Rota, M.; Matthews, D.E.; Sorrells, M.E. Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat. Plant Mol. Biol. 2002, 48, 501–510. [Google Scholar] [CrossRef]
  39. Zhang, Y.; Zhang, M.; Hu, Y.; Zhuang, X.; Xu, W.; Li, P.; Wang, Z. Mining and characterization of novel EST-SSR markers of Parrotia subaequalis (Hamamelidaceae) from the first Illumina-based transcriptome datasets. PLoS ONE 2019, 14, e0215874. [Google Scholar] [CrossRef]
  40. Park, S.; Son, S.; Shin, M.; Fujii, N.; Hoshino, T.; Park, S. Transcriptome-wide mining, characterization, and development of microsatellite markers in Lychnis kiusiana (Caryophyllaceae). BMC Plant Biol. 2019, 19, 12. [Google Scholar] [CrossRef] [Green Version]
  41. Bhandawat, A.; Sharma, V.; Singh, P.; Seth, R.; Nag, A.; Kaur, J.; Sharma, R.K. Discovery and utilization of EST-SSR marker resource for genetic diversity and population structure analyses of a subtropical bamboo, Dendrocalamus hamiltonii. Biochem. Genet. 2019, 57, 652–672. [Google Scholar] [CrossRef] [PubMed]
  42. Wu, J.; Cai, C.; Cheng, F.; Cui, H.; Zhou, H. Characterisation and development of EST-SSR markers in tree peony using transcriptome sequences. Mol. Breed. 2014, 34, 1853–1866. [Google Scholar] [CrossRef]
  43. Thiel, T.; Michalek, W.; Varshney, R.; Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 2003, 106, 411–422. [Google Scholar] [CrossRef] [PubMed]
  44. Sun, X.; Du, Z.; Ren, J.; Amombo, E.; Hu, T.; Fu, J. Association of SSR markers with functional traits from heat stress in diverse tall fescue accessions. BMC Plant Biol. 2015, 15, 116. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Xiao, Y.; Zhou, L.; Xia, W.; Mason, A.S.; Yang, Y.; Ma, Z.; Peng, M. Exploiting transcriptome data for the development and characterization of gene-based SSR markers related to cold tolerance in oil palm (Elaeis guineensis). BMC Plant Biol. 2014, 14, 384. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Bosamia, T.C.; Mishra, G.P.; Thankappan, R.; Dobaria, J.R. Novel and stress relevant EST derived SSR markers developed and validated in peanut. PLoS ONE 2015, 10, e0129127. [Google Scholar] [CrossRef] [Green Version]
  47. Molla, K.A.; Debnath, A.B.; Ganie, S.A.; Mondal, T.K. Identification and analysis of novel salt responsive candidate gene based SSRs (cgSSRs) from rice (Oryza sativa L.). BMC Plant Biol. 2015, 15, 122. [Google Scholar] [CrossRef] [Green Version]
  48. Ruan, X.; Wang, Z.; Su, Y.; Wang, T. Characterization and application of EST-SSR markers developed from the transcriptome of Amentotaxus argotaenia (Taxaceae), a relict vulnerable conifer. Front. Genet. 2019, 10, 1014. [Google Scholar] [CrossRef]
  49. Song, Q.; Jia, G.; Zhu, Y.; Grant, D.; Nelson, R.T.; Hwang, E.-Y.; Hyten, D.L.; Cregan, P.B. Abundance of SSR motifs and development of candidate polymorphic SSR markers (BARCSOYSSR_1. 0) in soybean. Crop Sci. 2010, 50, 1950–1960. [Google Scholar] [CrossRef] [Green Version]
  50. Zhao, H.; Yang, L.; Peng, Z.; Sun, H.; Yue, X.; Lou, Y.; Dong, L.; Wang, L.; Gao, Z. Developing genome-wide microsatellite markers of bamboo and their applications on molecular marker assisted taxonomy for accessions in the genus Phyllostachys. Sci. Rep. 2015, 5, 8018. [Google Scholar] [CrossRef]
  51. Gao, Z.; Wu, J.; Liu, Z.; Wang, L.; Ren, H.; Shu, Q. Rapid microsatellite development for tree peony and its implications. BMC Genom. 2013, 14, 886. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Botstein, D.; White, R.L.; Skolnick, M.; Davis, R.W. Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am. J. Hum. Genet. 1980, 32, 314. [Google Scholar] [PubMed]
  53. Chapuis, M.-P.; Estoup, A. Microsatellite null alleles and estimation of population differentiation. Mol. Biol. Evol. 2007, 24, 621–631. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Oddou-Muratorio, S.; Vendramin, G.G.; Buiteveld, J.; Fady, B. Population estimators or progeny tests: What is the best method to assess null allele frequencies at SSR loci? Conserv. Genet. 2009, 10, 1343. [Google Scholar] [CrossRef]
  55. Dakin, E.; Avise, J. Microsatellite null alleles in parentage analysis. Heredity 2004, 93, 504–509. [Google Scholar] [CrossRef]
  56. Molla, K.A.; Azharudheen, T.P.M.; Ray, S.; Sarkar, S.; Swain, A.; Chakraborti, M.; Vijayan, J.; Singh, O.N.; Baig, M.J.; Mukherjee, A.K. Novel biotic stress responsive candidate gene based SSR (cgSSR) markers from rice. Euphytica 2019, 215, 17. [Google Scholar] [CrossRef]
  57. Hong, D.; Pan, K. A taxonomic revision of the Paeonia anomala complex (Paeoniaceae). Ann. Mo. Bot. Gard. 2004, 91, 87–98. [Google Scholar]
  58. Hong, D. Peonies of the World: Taxonomy and Phytogeography; Royal Botanic Gardens: Edinburgh, UK, 2010; pp. 33–44. [Google Scholar]
Figure 1. Distribution of different repeat motifs and positions of simple sequence repeats (SSRs) in unigenes. (A) Proportion and distribution of each type of motif in dinucleotide (Di-), trinucleotide (Tri-), tetranucleotide (Tetra-) and other (i.e., Penta-, Hexa- and compound) repeats. In the legend, ‘other Tri-’ consists of ACG/CGT (0.26%), ACT/AGT (0.26%) and CCG/CGG (0.42%), and ‘Other Tetra-’ consists of 26 types of Tetra- repeats, the most abundant of which are AATC/ATTG (0.22%) and AGGG/CCCT (0.22%). (B) Abundances of six motifs in different unigene positions. Two types of SSRs were located in multiple regions. One of these SSRs was located across two 5′ UTRs, a coding sequences (CDS) and a 3′ UTR; another was located in multiple CDSs in one unigene. The SSR locations differed from each other. Unknown refers to the SSRs without matching locations.
Figure 1. Distribution of different repeat motifs and positions of simple sequence repeats (SSRs) in unigenes. (A) Proportion and distribution of each type of motif in dinucleotide (Di-), trinucleotide (Tri-), tetranucleotide (Tetra-) and other (i.e., Penta-, Hexa- and compound) repeats. In the legend, ‘other Tri-’ consists of ACG/CGT (0.26%), ACT/AGT (0.26%) and CCG/CGG (0.42%), and ‘Other Tetra-’ consists of 26 types of Tetra- repeats, the most abundant of which are AATC/ATTG (0.22%) and AGGG/CCCT (0.22%). (B) Abundances of six motifs in different unigene positions. Two types of SSRs were located in multiple regions. One of these SSRs was located across two 5′ UTRs, a coding sequences (CDS) and a 3′ UTR; another was located in multiple CDSs in one unigene. The SSR locations differed from each other. Unknown refers to the SSRs without matching locations.
Genes 11 00214 g001
Figure 2. Gene Ontology (GO) analysis of unigenes containing SSRs. The lighter color of each bar represents the number of unigenes without matching coding sequences.
Figure 2. Gene Ontology (GO) analysis of unigenes containing SSRs. The lighter color of each bar represents the number of unigenes without matching coding sequences.
Genes 11 00214 g002
Figure 3. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway annotation of unigenes with embedded SSRs. The numbers outside the circle represent the cumulative number of unigenes beginning at zero and moving in a clockwise manner.
Figure 3. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway annotation of unigenes with embedded SSRs. The numbers outside the circle represent the cumulative number of unigenes beginning at zero and moving in a clockwise manner.
Genes 11 00214 g003
Figure 4. Principal coordinates analysis (PCoA) of amplified loci (A) and dendrogram generated by Bruvo’s distances (B) of 31 accessions, including seven species and 24 cultivars of Paeonia lactiflora. The cultivar names are abbreviated with capitalized letters in (A), and their full names are shown in (B). The UPGMA tree was produced with 1000 bootstrap replicates, and the node values greater than 50 are shown in the tree.
Figure 4. Principal coordinates analysis (PCoA) of amplified loci (A) and dendrogram generated by Bruvo’s distances (B) of 31 accessions, including seven species and 24 cultivars of Paeonia lactiflora. The cultivar names are abbreviated with capitalized letters in (A), and their full names are shown in (B). The UPGMA tree was produced with 1000 bootstrap replicates, and the node values greater than 50 are shown in the tree.
Genes 11 00214 g004
Table 1. Size range of amplification products, sample size (N), the frequency of null allele at locus (Null allele), number of different alleles (Na), number of effective alleles (Ne), Shannon’s information index (I), observed heterozygosity (Ho), expected heterozygosity (He), fixation index (F) and polymorphism information content (PIC)./means we did not calculated the frequency of null allele at the locus because there was no significant difference in the observed and expected value in this locus and it followed Hardy-Weinberg equilibrium (Chi-Square tests, p < 0.05).
Table 1. Size range of amplification products, sample size (N), the frequency of null allele at locus (Null allele), number of different alleles (Na), number of effective alleles (Ne), Shannon’s information index (I), observed heterozygosity (Ho), expected heterozygosity (He), fixation index (F) and polymorphism information content (PIC)./means we did not calculated the frequency of null allele at the locus because there was no significant difference in the observed and expected value in this locus and it followed Hardy-Weinberg equilibrium (Chi-Square tests, p < 0.05).
LocusRepeat MotifSize Range (bp)NNaNeIHoHeNull alleleFPIC
T125(TC)9128–17031157.842.290.770.870.0730.110.86
T163(TA)8146–1603183.961.630.650.750.0850.140.71
T237(CT)1095–1093184.591.760.680.780.0950.130.75
T241(GA)15127–15731126.772.110.810.85/0.050.84
C160(AT)9ctcctt(CTC)5211–2272984.581.790.660.780.1170.160.76
TA564(TA)8172–2012695.431.900.770.820.0580.060.79
T317(CT)10157–1833095.941.920.830.83/0.000.81
S024(AAT)16161–20131105.601.940.770.820.0420.060.80
T040(GA)7228–2463175.021.730.680.800.1060.150.77
T179(AT)7235–26931104.931.830.710.800.0700.110.77
T192(CT)8212–2303184.501.750.680.780.0930.130.75
T300(AT)10218–2383194.451.800.480.780.1710.380.75
TA673(CA)18190–28831156.742.200.810.850.0580.050.84
T210(GA)10253–28531124.441.860.810.770.044−0.040.75
TA038(CT)9259–29931136.742.180.130.850.3910.850.84
S033(TAT)7137–18131102.471.390.420.590.1230.290.57
T304(GA)10147–1633094.751.770.630.790.1440.200.76
T863(AG)10140–16031106.262.030.740.840.0700.120.82
TA028(AC)6101–1233183.881.690.480.740.1770.350.72
T239(CT)9185–22131112.981.590.580.660.0690.130.64
TA144(CT)6186–2863194.001.640.680.75/0.100.71
T852(AT)8158–36731135.952.060.350.830.2680.570.81
F106(TATG)5200–29031135.022.030.550.800.1450.320.78
TA082(AG)6242–28030136.002.030.730.830.0840.120.81
TA079(AT)8232–2642982.931.450.520.660.1610.210.62
T356(AG)9247–2693094.761.810.330.790.2960.580.76
TA133(AT)8258–2862894.281.720.750.770.0950.020.73
SA010(AAT)5264–2943194.981.850.710.800.0790.110.77
W75(CTCAC)5257–2933184.651.700.650.790.0920.180.75
T205(TC)9275–30131136.942.180.710.86/0.170.84
S853(TTG)5158–1862984.671.710.690.79/0.120.75
T859(AG)19166–20029117.612.150.900.870.000-0.030.85
TA566(GA)6167–1852574.831.710.800.79/−0.010.76
TA695(AC)15164–1803083.001.490.570.670.1190.150.64
S237(CTG)8193–21931139.962.410.740.900.0810.180.89
TA464(TC)8200–2143085.561.850.730.820.0620.110.80
T160(CT)10205–22730105.471.920.700.82/0.140.80
TA134(GA)16142–22429135.432.090.930.820.019−0.140.80
SA061(GAA)7257–2772963.431.460.380.710.2810.460.67
TA074(CT)8260–30630135.792.030.970.830.000−0.170.81
TA704(TC)17237–2773075.191.730.700.810.1110.130.78
TA610(TC)8257–29128116.252.050.680.840.1170.190.82
S025(ATT)7268–35029135.142.020.550.810.2000.320.79
TA022(TC)6276–31830166.872.230.900.850.000−0.050.84
TA086(AG)10393–41931105.621.940.740.820.0770.100.80
TA700(AG)9278–31229135.762.050.620.830.1550.250.81
Mean 30.0710.265.261.880.670.800.110.160.77
SE 0.200.360.210.030.020.010.010.030.01

Share and Cite

MDPI and ACS Style

Wan, Y.; Zhang, M.; Hong, A.; Zhang, Y.; Liu, Y. Characteristics of Microsatellites Mined from Transcriptome Data and the Development of Novel Markers in Paeonia lactiflora. Genes 2020, 11, 214. https://doi.org/10.3390/genes11020214

AMA Style

Wan Y, Zhang M, Hong A, Zhang Y, Liu Y. Characteristics of Microsatellites Mined from Transcriptome Data and the Development of Novel Markers in Paeonia lactiflora. Genes. 2020; 11(2):214. https://doi.org/10.3390/genes11020214

Chicago/Turabian Style

Wan, Yingling, Min Zhang, Aiying Hong, Yixuan Zhang, and Yan Liu. 2020. "Characteristics of Microsatellites Mined from Transcriptome Data and the Development of Novel Markers in Paeonia lactiflora" Genes 11, no. 2: 214. https://doi.org/10.3390/genes11020214

APA Style

Wan, Y., Zhang, M., Hong, A., Zhang, Y., & Liu, Y. (2020). Characteristics of Microsatellites Mined from Transcriptome Data and the Development of Novel Markers in Paeonia lactiflora. Genes, 11(2), 214. https://doi.org/10.3390/genes11020214

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop