Next Article in Journal
Confinement Effects on Chemical Equilibria: Pentacyano(Pyrazine)Ferrate(II) Stability Changes within Nanosized Droplets of Water
Previous Article in Journal
Quality Assessment of Kumu Injection, a Traditional Chinese Medicine Preparation, Using HPLC Combined with Chemometric Methods and Qualitative and Quantitative Analysis of Multiple Alkaloids by Single Marker
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Chloroplast Genome of the Folk Medicine and Vegetable Plant Talinum paniculatum (Jacq.) Gaertn.: Gene Organization, Comparative and Phylogenetic Analysis

State Key Laboratory of Food Nutrition and Safety, Key Laboratory of Food Nutrition and Safety, Ministry of Education of China, College of Food Engineering and Biotechnology, Tianjin University of Science & Technology, No. 29, 13th Street, TEDA, Tianjin 300457, China
*
Author to whom correspondence should be addressed.
Molecules 2018, 23(4), 857; https://doi.org/10.3390/molecules23040857
Submission received: 8 March 2018 / Revised: 6 April 2018 / Accepted: 6 April 2018 / Published: 9 April 2018

Abstract

:
The complete chloroplast (cp) genome of Talinum paniculatum (Caryophyllale), a source of pharmaceutical efficacy similar to ginseng, and a widely distributed and planted edible vegetable, were sequenced and analyzed. The cp genome size of T. paniculatum is 156,929 bp, with a pair of inverted repeats (IRs) of 25,751 bp separated by a large single copy (LSC) region of 86,898 bp and a small single copy (SSC) region of 18,529 bp. The genome contains 83 protein-coding genes, 37 transfer RNA (tRNA) genes, eight ribosomal RNA (rRNA) genes and four pseudogenes. Fifty one (51) repeat units and ninety two (92) simple sequence repeats (SSRs) were found in the genome. The pseudogene rpl23 (Ribosomal protein L23) was insert AATT than other Caryophyllale species by sequence alignment, which located in IRs region. The gene of trnK-UUU (tRNA-Lys) and rpl16 (Ribosomal protein L16) have larger introns in T. paniculatum, and the existence of matK (maturase K) genes, which usually located in the introns of trnK-UUU, rich sequence divergence in Caryophyllale. Complete cp genome comparison with other eight Caryophyllales species indicated that the differences between T. paniculatum and P. oleracea were very slight, and the most highly divergent regions occurred in intergenic spacers. Comparisons of IR boundaries among nine Caryophyllales species showed that T. paniculatum have larger IRs region and the contraction is relatively slight. The phylogenetic analysis among 35 Caryophyllales species and two outgroup species revealed that T. paniculatum and P. oleracea do not belong to the same family. All these results give good opportunities for future identification, barcoding of Talinum species, understanding the evolutionary mode of Caryophyllale cp genome and molecular breeding of T. paniculatum with high pharmaceutical efficacy.

1. Introduction

Chloroplasts are the metabolic centers of our world due to the crucial role of converting sunlight into energy and releasing oxygen. Chloroplast genome encodes many key proteins that are involved in photosynthesis and other important metabolic processes for plant interactions with their environment, such as drought, salt and light [1,2,3,4]. The availability of over 2400 sequenced chloroplast genomes (http://www.ncbi.nlm.nih.gov/genomes/) gave us insights to understand the plant biology diversity, evolution, climatic adaptation, DNA barcoding and expression high-value agricultural or biomedical products by genetic engineering [1,2,5,6,7,8,9].
Java ginseng (Talinum paniculatum (Jacq.) Gaertn.), one of the most popular medicinal plants, belongs to the genus Talinum Adans. in the Portulacaceae family of the Caryophyllales. It is widely used for many ailments, including cancer, diabetes, hepatic disorders, leishmaniasis and reproductive disorders due to its high concentrations of alkaloids, flavonoids, tannins, steroids, and triterpenes [10,11,12,13,14]. The root of T. paniculatum has lots of valuable components, which are similar to those in Panax [15]. Meanwhile, the leaves of T. paniculatum are widely used as a green leafy vegetable for human consumption in many parts of South America, Africa and Asia.
Like Portulaca oleracea, T. paniculatum has high adaptability to drought, salinity, and low nutrient conditions [16]. Few reports have been published on the genetic diversity of chloroplast DNA from the Portulacaceae. The relationship of Portulaca to Anacampserotaceae, Cactaceae Juss, and Talinaceae is still uncertain [17,18,19,20]. The Portulacaeae are composed only a single genus of Portulace [21]. Molecular barcodes based on the chloroplast genome have shown great potential for species discrimination, especially between closely related taxa, but to date, chloroplast genome sequences of only two genus of the Portulacaceae have been analyzed, which are Carnegiea gigantean and Portulaca oleracea [22,23,24]. At both the species and population levels, the complete chloroplast genome sequence might enhance our ability to explore reliable barcoding for accurate plant identification [6,7,25].
Here, we report for the first time the complete cp genome sequence of T. paniculatum. Meanwhile, the gene structure and organization of the cp genome were analyzed. Besides, the cp genome sequences was compared with other genus of Caryophyllales. We also present the results of phylogenetic analyses of DNA sequences for 48 protein-coding genes from T. paniculatum, 35 Caryophyllales cp genomes and two outgroups.

2. Results

2.1. Features of the T. paniculatum cp Genome

The chloroplast genome of T. paniculatum, a typical quadripartite structure, is 156,929 bp in size, including a LSC region of 86,898 bp ranging from trnH-GUG to rps19 (Ribosomal protein S19), a SSC region of 18,529 bp from ndhF (NAD(P)H dehydrogenase) to ycf1 (hypothetical protein 1 gene), a pair of IR regions of 25,751 bp from rps19 to pseudogene ycf1 and ranging from ycf1 to rps19, respectively (Figure 1 and Table 1). The GC content of cp genome in the cp total, LSC (large single copy), SSC (small single copy), IRA (inverted repeat A region) and IRB (inverted repeat B region) is 36.79%, 34.6%, 30.5%, 42.7% and 42.7%, respectively (Table 1), which is similar to the cp genome of other Caryophyllales [23,24] and the higher GC content in IR regions. A higher AT representation at the third codon position and second codon position was significant, which is one of the common characteristics of the chloroplast genome [26,27,28,29]. The overall cp genome of T. paniculatum is made up of 49.98% protein-coding regions, 5.76% rRNAs, 1.78% tRNAs and 50.02% non-coding regions.
In total, there are 113 unique functional genes and four pseudogenes in the chloroplast gene of T. paniculatum (Table 1 and Table 2). Among 113 functional genes, 79 protein-coding genes, 30 distinct tRNAs and four distinct rRNAs were contained (Table 2). Remarkably, because all rRNA, seven tRNA, four protein-coding genes and one pseudogene are replicated in the IR regions, the total genes in Figure 1 and Table 2 are 132. Fifty six protein-coding and 22 tRNA genes are located in LSC region, but only 12 protein-coding genes and one tRNA are located in the SSC region. Meanwhile, all eight rRNA are sited in IR regions, which has been found to be common in most cp genomes [23,30]. The ycf1 gene is located at the IR and SSC boundary region, the rps19 gene was located at the IR and LSC boundary region. Pseudogenes of rpl23, rps19, ycf1 are all located in IR regions.
Among the 113 unique genes in T. paniculatum cp genome, 17 genes contain introns, including six tRNA genes and 11 protein-coding genes. Most of them only contain one intron, while like to other angiosperms, ycf3, clpP and rps12 have two introns (Table 3). Twelve genes with introns are located in the LSC, four genes with introns are located in IR regions, and only one gene with intron is located in the SSC region. Interestingly, the 3′ exon of rps12 is duplicated in IRs, but its 5′ exon shared by the 3′ exon was located in the LSC region. Consistent with many research results, the matK gene was inserted into the intron of trnK-UUU, so the intron of trnK-UUU became so maximal [23,30]. Comparing these 17 introns with P. oleracea, most of them is shorter in T. paniculatum, whereas the introns of petB, petD are larger, and rpoC1 intron is the same size [23].

2.2. Codon Usage of the T. paniculatum cp Genome

As shown in Table 4, total 26,146 codons involved in the protein-coding in T. paniculatum. Among 26,146 codons, the amino acids used most frequently were leucine and isoleucine, which encode in 2780 (10.63%) and 2258 (8.63%) codons, respectively. While only 303 (1.50%) codons encode cysteine as the least frequent universal amino acid (Table 4).
The relative synonymous codon usage (RSCU) has been divided into four models, including lack of bias (RSCU < 1.0), low bias (1.0 < RSCU< 1.2), moderately biased (1.2 < RSCU< 1.3) and highly biased (RSCU > 1.3) [30,31]. As shown in Table 4, there are 32 lack of bias codons with values < 1.0, except tryptophan and methionine, four low bias codons, four moderately biased codons and 21 highly biased codons, respectively. The TAA stop codon was found to be preferred. Meanwhile, due to 29 preferred synonymous codons (RSCU > 1.0) end with A or T, the third position of CDS contains 70.5% A or T bases, a significantly higher percentage than in the 2nd position (62.1%) or the 1st position (54.4%) (Table 1 and Table 4). The results showed that the bias was significant in usage of synonymous codons except tryptophan and methionine in T. paniculatum as well as A/T ending rich in cp genome. These results revealed that the RSCU and codon usage exsited biased codon usage, which was consistent with previous reports in cp genomes of higher plants [32,33].

2.3. Repeats Structure and SSR in T. paniculatum cp Genome

Analysis of the repeat structure analysis using REPuter detected a total of 51 repeat sequences, including 20 forward repeats, 21 palindromic repeats, one complement repeats and nine reverse repeats in the cp genome of T. paniculatum (Table 5). The repeats range from 30 to 61 bp in length and are mostly located in the intergenic spacer (IGS) and intron sequences. Twenty five repeats are located in exons of matK, trnS-GGA, trnG-UCC, psaB, psaA and ycf2 genes, respectively. A comparative analysis revealed that 30–39 bp were the most repetitive in cp genome of seven species in Caryophyllales (Figure 2). Complement repeats were rare, as only T. paniculatum and C. gigantea contain them [24]. The reverse repeats of T. paniculatum are the most in the seven cp genomes.
There are 92 simple sequence repeats (SSRs) in the T. paniculatum chloroplast genome, the majority of which were mononuclear repeats (68). Twelve dinucleotide repeats, five trinucleotide repeats and seven pentanucleotide repeats were also found in T. paniculatum cp genome (Table 6, Figure 3). Whereas, only seven SSRs located in the CDSs (Table 7), including four mononucleotide, one dinucleotide and two pentanucleotide SSRs. The results indicate that all of the SSRs in CDs are located in the LSC region and most of them are AT repeats, which is consistent with the view that SSRs in cp genomes contribute to AT richness [32].

2.4. matK and rpl23 Diversity Analysis

The chloroplast genes matK and rpl23 of T. paniculatum were compared with those of nine other Caryophyllaceae species. The matK sequence is often used in DNA barcoding, phylogenetic and evolutionary studies [34]. The sequences of matK genes of chloroplast DNA in T. paniculatum, P. oleracea, S. conica, S. chalcedonica, S. europaea, S. bigelovii, A. githago, H. ammodendro and S. oleracea show significant diversity by alignment analysis (Figure 4), while most of the matk (T. paniculatum, P. oleracea, S. conica, S. chalcedonica, S. europaea, S. bigelovii, A. githaggo, H. ammodendro and S. oleracea) were inserted into the introns of trnK-UUU [35]. Oddly, the matK gene is a pseudogene in S. conica, which has the shortest length [36]. The multiple alignment result revealed that the matK gene of T. paniculatum was relatively similar to that of P. oleracea, except for an AATT insert and eight single nucleotide polymorphisms (SNPs) (Figure 4). The matK genes from S. chalcedonica and A. githago displayed relatively high sequence similarity, due to the fact they both belong to the Sileneae in the Caryophyllaceae family [37]. The matK in S. bigelovii and S. europaea had high sequence similarity too, because they belong to the same family of Chenopodiaceae (Figure 4). These results revealed that the closer the relationship between species, the higher the sequence similarity of matK. Hence, consistent with many research results, the matK gene was one of the best choices for DNA barcoding, phylogenetic and evolutionary analysis [38,39,40].
Ribosomal protein L23 (rpl23) is a protein component of the 60S large ribosomal subunit and is also a negative regulator of cellular apoptosis in animals [41]. The alignment of rpl23 was carried out in T. paniculatum, P. oleracea, C. longiscapa, S. conica, H. ammodendron, S. bigelovii, S. europaea (Figure 5). The results revealed that rpl23 gene of T. paniculatum was a pseudogene as in H. ammodendron, S. bigelovii and S. europaea. The rpl23 gene lacked the CTTGACACCAAAGA sequence in H. ammodendron, S. bigelovii and S. europaea. However, the rpl23 of T. paniculatum had AATT inserted. Interestingly, the rpl23 was not presented in A. githago, S. chalcedonica and A. hypochondriacus [37]. The rpl23 were normal genec in P. oleracea and C. longiscapa, which had some SNPs between them [42].

2.5. Comparative Analysis of the Chloroplast Genomes of the Caryophyllales

The genome sequence of T. paniculatum was compared with eight species of Caryophyllales using the mVISTA software (Figure 6). T. paniculatum had the biggest cp genome with the biggest IR region (25,751 bp), while C. gigantea had the smallest cp genome with one IR region loss [24]. The length differences among them were mainly caused by deletions in the non-coding regions. Meanwhile, the results showed that the non-coding region variation was significantly higher than that of the coding regions, and the IR region was more conserved than the LSC and SSC regions [43,44]. The matK, accD, ndhF, infA, trnS-GGA, trnT-UGU, trnL-UAA, trnG-UCC, ycf2, ycf1, rpl23, ccsA, ndhG and rps19 genes were the most divergent coding regions. The introns of ycf3, ndhA, rpl16, clpP, petB, trnK-UUU, rpoC, pet D were relatively highly divergent, too.

2.6. IR Expansion and Contraction

IR contraction and expansion of T. paniculatum was analyzed by comparing the LSC/IRb/SSC/IRa boundary regions with seven families in the Caryophyllales (Figure 7). Among them, the chloroplast genome of T. paniculatum was the longest. The ndhF gene of T. paniculatum did not cross the IRb region and the SSC region, compared to P. oleracea, A. githago, A. hypochndriacus and S. oleracea, however it was the largest IR region among them (Figure 7). The rps19 gene in the T. paniculatum cp genome was shifted by 117 bp from LSC to IRb at the LSC/IRb border, which was the smallest shift length compared to P. oleracea, A. githago, H. ammodendron, S. oleracea and S. bigelovii (Figure 7). Differently from other LSC/IRb border, the rps19 gene of S. chalcedonica was entirely located in the LSC region. Unlike most LSC/IRa borders, the trnH gene of S. bigelovii was located in the LSC region, and the trnH of T. paniculatum was located in the LSC region as most of them. Oddly, a significant bigger IR size but the smallest rps19 and ycf1 length in the IR region were found in T. paniculatum. This phenomenon may be for two reasons: on one hand, pseudogenes exist, and on the other hand, there is a low occurrence of contraction by the fragment deletions in the intergenic regions in T. paniculatum [32].

2.7. Phylogenetic Analysis

The cp genome sequence is a useful resource for studying the taxonomy in the Angiosperm clade, and for analyzing evolutionary relationships within families. Here, to obtain a reasonable phylogenetic status of T. paniculatum, we performed multiple sequence alignments of cp genome protein coding genes. A total of 35 complete cp genomes of Caryophyllales and two outgroup species were subjected to phylogenetic analysis based on a 48-gene data matrix, which used the MP and ML methods. MP analysis resulted in a single tree with a length of 24,669, a consistency index (CI) of 0.6111, and a retention index (RI) of 0.7786 (Figure 8). Bootstrap analysis showed that 28 out of the 33 nodes had bootstrap values >95%.
As we all know, the specific relationships within the Poltulaca and Talinum remain obscure due to their complex past evolutionary histories. Recently, according to molecular and morphological evidence, the Anacampserotaceae, Basellaceae, Cactaceae, Didiereaceae, Portulacaceae, and Talinaceae were classified into four single independent families [17,18,19,20]. The Portulacaeae are now comprised only a single Portulace genus [21]. However, few reports have been published on the genetic diversity of chloroplast DNA from the Anacampserotaceae, Basellaceae, Cactaceae, Didiereaceae and Portulacaceae. From Figure 8, the cp genome protein coding genes of the phylogenetic trees show that P. oleracea and T. paniculatum do not belong to the same family, although the above data shows that there are many similarities between P. oleracea and T. paniculatum, which is consistent with the recent classification studies [20,45,46]. Hence, whole cp genome or the key segments of evolutionary variation should be used for phylogenetic study.

3. Discussion

We report a genome sequence of T. paniculatum, which provides an important resource for studying the evolution of the Caryophyllales and the molecular breeding of T. paniculatum with high pharmaceutical efficacy. Despite the fact that the chloroplast genomes of Angiosperms are well-conserved in the genomic structure in terms of gene order and number, length variations of the whole chloroplast genome sequences and LSC, SSC and IR regions, the IR expansion and contraction occur frequently. The results reported here are congruent with the recent studies which showed that the trnH-GUG gene was situated in the LSC region in some species of Caryophyllales, while the SSC/IRA border extends into the ycf1 with subsequent formation of a ycf1 pseudogene [47,48]. Boundary expansion and contraction between the single copy and IR boundary regions lead to sequence variation, which might be a base of plant lineages [49].
In this study, we analyzed codon usage frequency and RSCU in the T. paniculatum. As previously reported, leucine and isoleucine are the more commonly seen amino acids in the cp genomes of Angiosperms [33,50,51,52,53]. Likewise, like in earlier studies about repeats and SSRs, mononucleotide repeats are more abundant with A/T repeats, which is consistent with AT richness in Angiosperm chloroplast genomes [54,55,56]. Complement repeats were rarely found in T. paniculatum and C. gigantea [24]. Meanwhile, all of the SSRs in CDs locate in the LSC region [32]. These cp SSR markers could be a resource for molecular-marker-assisted selection breeding for T. paniculatum for production of high levels of biologically active compounds.
Here, we compared the matK and rpl23 sequences of several species of Caryophyllales. The results revealed that the pseudogene rpl23 of T. paniculatum had inserted AATT, which is different from other Caryophyllales. The gene of matK is often used to identity the relationship between species as a barcoding marker [57]. Most reports have revealed that the closer the relationship between species, the higher the sequence similarity of matK. matK represents one of the best choices for DNA barcoding, phylogenetic and evolutionary analysis [38,39,40]. However, though T. paniculatum has a high matK gene similarity to P. oleracea, they belong to different families, hence, multiple marks should be used for barcoding.

4. Conclusions

In this study, the complete cp genome of T. paniculatum was reported and analyzed for the first time. T. paniculatum is one of the key traditional Chinese medicines used against cancer, diabetes, hepatic disorders, leishmaniasis and reproductive disorders and is also an edible vegetable. Comparing the cp genomes of T. paniculatum with other Caryophyllale species, the cp genome of T. paniculatum is the largest IRs, but has the smallest rps19 and ycf1 length in the IR border, most likely due to low occurrence of contraction by the fragment deletions in the intergenic regions. The pseudogene of rpl23 was inserted by AATT, and trnK-UUU and rpl16 have larger introns than other Caryophyllale species. The matK genes show rich divergence. All these results provide good opportunities for future barcoding molecular marker development. Our phylogenetic analysis showed that T. paniculatum and P. oleracea don’t belong to the same family. This information will be useful for the phylogenetic study of T. paniculatum, and might also contribute to the genetics and breeding of T. paniculatum.

5. Materials and Methods

5.1. DNA Sequencing and Genome Assembly

Total DNA of T. paniculatum was obtained from approximately 100 g of fresh leaves using the CTAB method [58]. Quality of the DNA was evaluated by measuring A260 using a Nanodrop2000 spectrometer (Thermo Fisher Scientific, Waltham, MA, USA). Then, the DNA was sheared to fragments of 300~500 bp. Paired-end libraries were prepared with the TruSeqTM DNA sample Prep Kit and the TruSeq PE Cluster Kit. The genome was then sequenced using the HiSeq4000 platform (Illumina Inc., San Diego, CA, USA). The assembly of the cp genome of T. paniculatum was first carried out through the error correction and production of initial contigs using the GS FLX De Novo Assembler Software (Newbler V2.6). PCR amplification and Sanger sequencing were performed to verify the four junction regions between the IRs and the LSC/SSC. The final cp genome of T. paniculatum was submitted to GenBank with the accession number MG710385.

5.2. Gene Annotation and Codon Usage Analysis

The cp genome was annotated by manual corrections using BLAST and DOGMA [59]. The tRNAscan-SE [60] was used to identify the tRNA genes. OGDRAW [61] was used to draw the circular genome map. MEGA5 were used for revealing the characteristics of the variations in synonymous codon usage [62]. The relative synonymous codon usage values (RSCU), codon usage and GC content were also determined by MEGA5.

5.3. Repeat Structure and Single Sequence Repeats (SSRs) Analysis

Analysis of tandem repeats with more than 30 bp and a minimum of 90% sequence (forward, palindromic, reverse and complement) and single sequence repeats (SSRs) was identified by REPuter [63] and MISA respectively, with the same parameters as described in Ni et al. [43].

5.4. Comparative Genome Analysis of the T. paniculatum with Eight cp Genomes of Caryophyllales

Comparison of the overall cp genome of T. paniculatum with eight cp genomes of Caryophyllales were performed by mVISTA [64,65], using the annotation of T. paniculatum as a reference.

5.5. Phylogenetic Analysis

A total of 37 complete cp genome sequences were downloaded from the NCBI Organelle Genome Resources database (http://www.ncbi.nlm.nih.gov/genomes/). For the phylogenetic analysis, a set of 48 protein-coding genes that were common in the 37 analyzed genomes, was used. Maximum parsimony (MP) analysis was performed with PAUP*4.0b10 [66], using a heuristic search combined with the random addition of 1000 replicates and tree bisection-reconnection (TBR) branch swapping, in the Multrees option. Bootstrap analysis was also performed with 1000 replicates and TBR branch swapping. Rehmannia chingii and Lindenbergia philippensis were set as outgroups.

Acknowledgments

This work was supported by grants from the Natural Science Foundation of Tianjin (17JCZDJC34300) and Key Projects in the National Science and Technology Pillar Program during the Twelve Five-year Plan Period of China (2015BAD16B02).

Author Contributions

X.L. conceived and designed the experiments; Y.L. and H.Y. performed the experiments; X.L., Y.L. and B.Z. analysed the data; X.L. and Y.L. wrote the paper. All authors read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cazzonelli, C.I. Carotenoids in nature: Insights from plants and beyond. Funct. Plant Biol. 2011, 38, 833–847. [Google Scholar] [CrossRef]
  2. Daniell, H.; Lin, C.; Yu, M.; Chang, W. Chloroplast genomes: Diversity, evolution, and applications in genetic engineering. Genome Biol. 2016. [Google Scholar] [CrossRef] [PubMed]
  3. Bobik, K.; Burch-Smith, T.M. Chloroplast signaling within, between and beyond cells. Front. Plant Sci. 2015. [Google Scholar] [CrossRef] [PubMed]
  4. Bohne, A.; Schwarz, C.; Schottkowski, M.; Lidschreiber, M.; Piotrowski, M.; Zerges, W.; Nickelsen, J. Reciprocal Regulation of Protein Synthesis and Carbon Metabolism for Thylakoid Membrane Biogenesis. PLoS Biol. 2013. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Song, Y.; Chen, Y.; Lv, J.; Xu, J.; Zhu, S.; Li, M.; Chen, N. Development of Chloroplast Genomic Resources for Oryza Species Discrimination. Front. Plant Sci. 2017, 8, 1854. [Google Scholar] [CrossRef] [PubMed]
  6. Kang, Y.; Deng, Z.; Zang, R.; Long, W. DNA barcoding analysis and phylogenetic relationships of tree species in tropical cloud forests. Sci. Rep. UK 2017. [Google Scholar] [CrossRef] [PubMed]
  7. Baczkiewicz, A.; Szczecinska, M.; Sawicki, J.; Stebel, A.; Buczkowska, K. DNA barcoding, ecology and geography of the cryptic species of Aneura pinguis and their relationships with Aneura maxima and Aneura mirabilis (Metzgeriales, Marchantiophyta). PLoS ONE 2017, 12, e0188837. [Google Scholar] [CrossRef] [PubMed]
  8. Jansen, R.K.; Cai, Z.; Raubeson, L.A.; Daniell, H.; DePamphilis, C.W.; Leebens-Mack, J.; Mueller, K.F.; Guisinger-Bellian, M.; Haberle, R.C.; Hansen, A.K.; et al. Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proc. Natl. Acad. Sci. USA 2007, 104, 19369–19374. [Google Scholar] [CrossRef] [PubMed]
  9. Su, J.; Zhu, L.; Sherman, A.; Wang, X.; Lin, S.; Kamesh, A.; Norikane, J.H.; Streatfield, S.J.; Herzog, R.W.; Daniell, H. Low cost industrial production of coagulation factor IX bioencapsulated in lettuce cells for oral tolerance induction in hemophilia B. Biomaterials 2015, 70, 84–93. [Google Scholar] [CrossRef] [PubMed]
  10. Sukwan, C.; Wray, S.; Kupittayanant, S. The effects of Ginseng Java root extract on uterine contractility in nonpregnant rats. Physiol. Rep. 2014, 2, e12230. [Google Scholar] [CrossRef] [PubMed]
  11. Jung, M.; Park, M.; Lee, H.C.; Kang, Y.H.; Kang, E.S.; Kim, S.K. Antidiabetic agents from medicinal plants. Curr. Med. Chem. 2006, 13, 1203–1218. [Google Scholar] [CrossRef] [PubMed]
  12. Odonne, G.; Bourdy, G.; Castillo, D.; Estevez, Y.; Lancha-Tangoa, A.; Alban-Castillo, J.; Deharo, E.; Rojas, R.; Stien, D.; Sauvain, M. Ta’ta’, Huayani: Perception of leishmaniasis and evaluation of medicinal plants used by the Chayahuita in Peru. Part II. J. Ethnopharmacol. 2009, 126, 149–158. [Google Scholar] [CrossRef] [PubMed]
  13. Dos Reis, L.F.C.; Cerdeira, C.D.; De Paula, B.F.; Da Silva, J.J.; Coelho, L.F.L.; Silva, M.A.; Marques, V.B.B.; Chavasco, J.K.; Alves-Da-Silva, G. Chemical Characterization and Evaluation of Antibacterial, Antifungal, Antimycobacterial, and Cytotoxic Activities of Talinum paniculatum. Rev. Inst. Med. Trop. SP 2015, 57, 397–405. [Google Scholar] [CrossRef] [PubMed]
  14. Thanamool, C.; Papirom, P.; Chanlun, S.; Kupittayanant, S. Talinum paniculatum (Jacq.) Gertn: A medicinal plant with potential estrogenic activity in ovariectomized rats. Int. J. Pharm. Pharm. Sci. 2013, 5, 478–485. [Google Scholar]
  15. Shaw, P.C.; But, P.P. Authentication of Panax species and their adulterants by random-primed polymerase chain reaction. Planta Med. 1995, 61, 466–469. [Google Scholar] [CrossRef] [PubMed]
  16. Assaha, D.V.M.; Mekawy, A.M.M.; Liu, L.; Noori, M.S.; Kokulan, K.S.; Ueda, A.; Nagaoka, T.; Saneoka, H. Na+ Retention in the Root is a Key Adaptive Mechanism to Low and High Salinity in the Glycophyte, Talinum paniculatum (Jacq.) Gaertn. (Portulacaceae). J. Agron. Crop Sci. 2017, 203, 56–67. [Google Scholar] [CrossRef]
  17. Nyffeler, R. The closest relatives of cacti: Insights from phylogenetic analyses of chloroplast and mitochondrial sequences with special emphasis on relationships in the tribe Anacampseroteae. Am. J. Bot. 2007, 94, 89–101. [Google Scholar] [CrossRef] [PubMed]
  18. Nyffeler, R.; Eggli, U. Disintegrating Portulacaceae: A New Familial Classification of the Suborder Portulacineae (Caryophyllales) Based on Molecular and Morphological Data. Taxon 2010, 59, 227–240. [Google Scholar]
  19. Nyananyo, B.L. Taxonomic studies in the genus Portulaca L. (Portulacaceae). Feddes Repert. 1987, 98, 399–402. [Google Scholar] [CrossRef]
  20. Ocampo, G.; Columbus, J.T. Molecular phylogenetics, historical biogeography, and chromosome number evolution of Portulaca (Portulacaceae). Mol. Phylogenet. Evol. 2012, 63, 97–112. [Google Scholar] [CrossRef] [PubMed]
  21. Christenhusz, M.J.M.; Byng, J.W. The number of known plant species in the world and its annual increase. Phytotaxa 2016, 261, 201–217. [Google Scholar] [CrossRef]
  22. Atichokudomchai, N.; Varavinit, S. Characterization and utilization of acid-modified cross-linked Tapioca starch in pharmaceutical tablets. Carbohyd. Polym. 2003, 53, 263–270. [Google Scholar] [CrossRef]
  23. Liu, X.; Yang, H.; Zhao, J.; Zhou, B.; Li, T.; Xiang, B. The complete chloroplast genome sequence of the folk medicinal and vegetable plant purslane (Portulaca oleracea L.). J. Hortic. Sci. Biotechnol. 2017, 92, 1–10. [Google Scholar] [CrossRef]
  24. Sanderson, M.J.; Copetti, D.; Burquez, A.; Bustamante, E.; Charboneau, J.L.M.; Eguiarte, L.E.; Kumar, S.; Lee, H.O.; Lee, J.; McMahon, M.; et al. Exceptional reduction of the plastid genome of saguaro cactus (Carnegiea gigantea): Loss of the ndh gene suite and inverted repeat. Am. J. Bot. 2015, 102, 1115–1127. [Google Scholar] [CrossRef] [PubMed]
  25. Li, X.; Yang, Y.; Henry, R.J.; Rossetto, M.; Wang, Y.; Chen, S. Plant DNA barcoding: From gene to genome. Biol. Rev. Camb. Philos. Soc. 2015, 90, 157–166. [Google Scholar] [CrossRef] [PubMed]
  26. Morton, B.R. Codon use and the rate of divergence of land plant chloroplast genes. Mol. Biol. Evol. 1994, 11, 231–238. [Google Scholar] [PubMed]
  27. Muse, S.V.; Gaut, B.S. Comparing patterns of nucleotide substitution rates among chloroplast loci using the relative ratio test. Genetics 1997, 146, 393–399. [Google Scholar] [PubMed]
  28. Muse, S.V. Examining rates and patterns of nucleotide substitution in plants. Plant Mol. Biol. 2000, 42, 25–43. [Google Scholar] [CrossRef] [PubMed]
  29. Wakeley, J. Substitution-rate variation among sites and the estimation of transition bias. Mol. Biol. Evol. 1994, 11, 436–442. [Google Scholar] [PubMed]
  30. Zhou, J.; Cui, Y.; Chen, X.; Li, Y.; Xu, Z.; Duan, B.; Li, Y.; Song, J.; Yao, H. Complete Chloroplast Genomes of Papaver rhoeas and Papaver orientale: Molecular Structures, Comparative Analysis, and Phylogenetic Analysis. Molecules 2018, 23, 437. [Google Scholar] [CrossRef] [PubMed]
  31. Zuo, L.; Shang, A.; Zhang, S.; Yu, X.; Ren, Y.; Yang, M.; Wang, J. The first complete chloroplast genome sequences of Ulmus species by de novo sequencing: Genome comparative and taxonomic position analysis. PLoS ONE 2017. [Google Scholar] [CrossRef] [PubMed]
  32. Jian, H.; Zhang, Y.; Yan, H.; Qiu, X.; Wang, Q.; Li, S.; Zhang, S. The Complete Chloroplast Genome of a Key Ancestor of Modern Roses, Rosa chinensis var. spontanea, and a Comparison with Congeneric Species. Molecules 2018, 23, 389. [Google Scholar] [CrossRef] [PubMed]
  33. Li, Z.; Saina, J.K.; Gichira, A.W.; Kyalo, C.M.; Wang, Q.; Chen, J. Comparative Genomics of the Balsaminaceae Sister Genera Hydrocera triflora and Impatiens pinfanensis. Int. J. Mol. Sci. 2018, 19, 319. [Google Scholar] [CrossRef] [PubMed]
  34. Carbonell-Caballero, J.; Alonso, R.; Ibanez, V.; Terol, J.; Talon, M.; Dopazo, J. A Phylogenetic Analysis of 34 Chloroplast Genomes Elucidates the Relationships between Wild and Domestic Species within the Genus Citrus. Mol. Biol. Evol. 2015, 32, 2015–2035. [Google Scholar] [CrossRef] [PubMed]
  35. Zoschke, R.; Nakamura, M.; Liere, K.; Sugiura, M.; Boerner, T.; Schmitz-Linneweber, C. An organellar maturase associates with multiple group II introns. Proc. Natl. Acad. Sci. USA 2010, 107, 3245–3250. [Google Scholar] [CrossRef] [PubMed]
  36. Sloan, D.B.; Alverson, A.J.; Wu, M.; Palmer, J.D.; Taylor, D.R. Recent Acceleration of Plastid Sequence and Structural Evolution Coincides with Extreme Mitochondrial Divergence in the Angiosperm Genus Silene. Genome Biol. Evol. 2012, 4, 294–306. [Google Scholar] [CrossRef] [PubMed]
  37. Sloan, D.B.; Triant, D.A.; Forrester, N.J.; Bergner, L.M.; Wu, M.; Taylor, D.R. A recurring syndrome of accelerated plastid genome evolution in the angiosperm tribe Sileneae (Caryophyllaceae). Mol. Phylogenet. Evol. 2014, 72, 82–89. [Google Scholar] [CrossRef] [PubMed]
  38. Bafeel, S.O.; Arif, I.A.; Bakir, M.A.; Khan, H.A.; Al Farhan, A.H.; Al Homaidan, A.A.; Ahamed, A.; Thomas, J. Comparative evaluation of PCR success with universal primers of maturase K (matK) and ribulose-1,5-bisphosphate carboxylase oxygenase large subunit (rbcL) for barcoding of some arid plants. Plant Omics 2011, 4, 195–198. [Google Scholar]
  39. Yu, J.; Xue, J.; Zhou, S. New universal matK primers for DNA barcoding angiosperms. J. Syst. Evol. 2011, 49, 176–181. [Google Scholar] [CrossRef]
  40. Heckenhauer, J.; Barfuss, M.H.J.; Samuel, R. Universal Multiplexable Matk Primers for Dna Barcoding of Angiosperms. Appl. Plant Sci. 2016, 4. [Google Scholar] [CrossRef] [PubMed]
  41. Qi, Y.; Li, X.; Chang, C.; Xu, F.; He, Q.; Zhao, Y.; Wu, L. Ribosomal protein L23 negatively regulates cellular apoptosis via the RPL23/Miz-1/c-Myc circuit in higher-risk myelodysplastic syndrome. Sci. Rep. 2017, 7, 2323. [Google Scholar] [CrossRef] [PubMed]
  42. Stoll, A.; Harpke, D.; Schutte, C.; Stefanczyk, N.; Brandt, R.; Blattner, F.R.; Quandt, D. Development of microsatellite markers and assembly of the plastid genome in Cistanthe longiscapa (Montiaceae) based on low-coverage whole genome sequencing. PLoS ONE 2017, 12, e178402. [Google Scholar] [CrossRef] [PubMed]
  43. Ni, L.; Zhao, Z.; Xu, H.; Chen, S.; Dorje, G. The complete chloroplast genome of Gentiana straminea (Gentianaceae), an endemic species to the Sino-Himalayan subregion. Gene 2016, 577, 281–288. [Google Scholar] [CrossRef] [PubMed]
  44. Khakhlova, O.; Bock, R. Elimination of deleterious mutations in plastid genomes by gene conversion. Plant J. 2006, 46, 85–94. [Google Scholar] [CrossRef] [PubMed]
  45. Veselova, T.D.; Dzhalilova, K.K.; Remizowa, M.V.; Timonin, A.C. Embryology of Talinum paniculatum (Jacq.) Gaertn. and T-triangulare (Jacq.) Willd. (Portulacaceae s.l., Caryophyllales). Wulfenia 2012, 19, 107–129. [Google Scholar]
  46. Steyn, E.; Smith, G.F. Portulacaceae-Talinum paniculatum, a naturalized weed in South Africa. Bothalia 2001, 31, 195–197. [Google Scholar]
  47. Lee, M.; Park, J.; Lee, H.; Sohn, S.; Lee, J. Complete chloroplast genomic sequence of Citrus platymamma determined by combined analysis of Sanger and NGS data. Hortic. Environ. Biotechnol. 2015, 56, 704–711. [Google Scholar] [CrossRef]
  48. Su, H.; Hogenhout, S.A.; Al-Sadi, A.M.; Kuo, C. Complete Chloroplast Genome Sequence of Omani Lime (Citrus aurantiifolia) and Comparative Analysis within the Rosids. PLoS ONE 2014. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Wang, W.; Messing, J. High-Throughput Sequencing of Three Lemnoideae (Duckweeds) Chloroplast Genomes from Total DNA. PLoS ONE 2011. [Google Scholar] [CrossRef] [PubMed]
  50. Yang, Y.; Zhu, J.; Feng, L.; Zhou, T.; Bei, G.; Yang, J.; Zhao, G. Plastid Genome Comparative and Phylogenetic Analyses of the Key Genera in Fagaceae: Highlighting the Effect of Codon Composition Bias in Phylogenetic Inference. Front. Plant Sci. 2018, 9. [Google Scholar] [CrossRef] [PubMed]
  51. Guo, S.; Guo, L.; Zhao, W.; Xu, J.; Li, Y.; Zhang, X.; Shen, X.; Wu, M.; Hou, X. Complete Chloroplast Genome Sequence and Phylogenetic Analysis of Paeonia ostii. Molecules 2018, 23, 246. [Google Scholar] [CrossRef] [PubMed]
  52. Wang, W.; Yu, H.; Wang, J.; Lei, W.; Gao, J.; Qiu, X.; Wang, J. The Complete Chloroplast Genome Sequences of the Medicinal Plant Forsythia suspensa (Oleaceae). Int. J. Mol. Sci. 2017, 18, 2288. [Google Scholar] [CrossRef] [PubMed]
  53. Shen, X.; Wu, M.; Liao, B.; Liu, Z.; Bai, R.; Xiao, S.; Li, X.; Zhang, B.; Xu, J.; Chen, S. Complete Chloroplast Genome Sequence and Phylogenetic Analysis of the Medicinal Plant Artemisia annua. Molecules 2017, 22, 1330. [Google Scholar] [CrossRef] [PubMed]
  54. Saina, J.K.; Gichira, A.W.; Li, Z.; Hu, G.; Wang, Q.; Liao, K. The complete chloroplast genome sequence of Dodonaea viscosa: Comparative and phylogenetic analyses. Genetica 2018, 146, 101–113. [Google Scholar] [CrossRef] [PubMed]
  55. Zhou, J.; Chen, X.; Cui, Y.; Sun, W.; Li, Y.; Wang, Y.; Song, J.; Yao, H. Molecular Structure and Phylogenetic Analyses of Complete Chloroplast Genomes of Two Aristolochia Medicinal Species. Int. J. Mol. Sci. 2017, 18, 1839. [Google Scholar] [CrossRef] [PubMed]
  56. Zhou, T.; Chen, C.; Wei, Y.; Chang, Y.; Bai, G.; Li, Z.; Kanwal, N.; Zhao, G. Comparative Transcriptome and Chloroplast Genome Analyses of Two Related Dipteronia Species. Front. Plant. Sci. 2016, 7. [Google Scholar] [CrossRef] [PubMed]
  57. Abugalieva, S.; Volkova, L.; Genievskaya, Y.; Ivaschenko, A.; Kotukhov, Y.; Sakauova, G.; Turuspekov, Y. Taxonomic assessment of Allium species from Kazakhstan based on ITS and matK markers. BMC Plant Biol. 2017, 172. [Google Scholar] [CrossRef] [PubMed]
  58. Allen, G.C.; Flores-Vergara, M.A.; Krasynanski, S.; Kumar, S.; Thompson, W.F. A modified protocol for rapid DNA isolation from plant tissues using cetyltrimethylammonium bromide. Nat. Protoc. 2006, 1, 2320–2325. [Google Scholar] [CrossRef] [PubMed]
  59. Wyman, S.K.; Jansen, R.K.; Boore, J.L. Automatic annotation of organellar genomes with DOGMA. Bioinformatics 2004, 20, 3252–3255. [Google Scholar] [CrossRef] [PubMed]
  60. Schattner, P.; Brooks, A.N.; Lowe, T.M. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucl. Acids Res. 2005, 33, W686–W689. [Google Scholar] [CrossRef] [PubMed]
  61. Lohse, M.; Drechsel, O.; Bock, R. OrganellarGenomeDRAW (OGDRAW): A tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr. Genet. 2007, 52, 267–274. [Google Scholar] [CrossRef] [PubMed]
  62. Tamura, K.; Peterson, D.; Peterson, N.; Stecher, G.; Nei, M.; Kumar, S. MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 2011, 28, 2731–2739. [Google Scholar] [CrossRef] [PubMed]
  63. Kurtz, S.; Choudhuri, J.V.; Ohlebusch, E.; Schleiermacher, C.; Stoye, J.; Giegerich, R. REPuter: The manifold applications of repeat analysis on a genomic scale. Nucl. Acids Res. 2001, 29, 4633–4642. [Google Scholar] [CrossRef] [PubMed]
  64. Kurtz, S.; Phillippy, A.; Delcher, A.L.; Smoot, M.; Shumway, M.; Antonescu, C.; Salzberg, S.L. Versatile and open software for comparing large genomes. Genome Biol. 2004, 5, R12. [Google Scholar] [CrossRef] [PubMed]
  65. Frazer, K.A.; Pachter, L.; Poliakov, A.; Rubin, E.M.; Dubchak, I. VISTA: Computational tools for comparative genomics. Nucl. Acids Res. 2004, 32, W273–W279. [Google Scholar] [CrossRef] [PubMed]
  66. Swofford, D.L. PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods); Version 4.0b10; Sinauer Associates: Sunderland, MA, USA, 2003. [Google Scholar]
Sample Availability: Samples of the T. paniculatum are available from the authors.
Figure 1. The complete chloroplast genome map of Talinum paniculatum (Jacp) Gaertn. Genes are color-coded based on functional group. Genes drawn inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. The genome orientation are the orange arrow.
Figure 1. The complete chloroplast genome map of Talinum paniculatum (Jacp) Gaertn. Genes are color-coded based on functional group. Genes drawn inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. The genome orientation are the orange arrow.
Molecules 23 00857 g001
Figure 2. Repeat sequences in seven chloroplast genomes of Caryophyllales. REPuter was used to identify repeat sequences with length ≥30 bp and sequence identify ≥90% in the chloroplast genomes. F, P, R and C indicate the repeat types forward, palindrome, reverse and complement, respectively.
Figure 2. Repeat sequences in seven chloroplast genomes of Caryophyllales. REPuter was used to identify repeat sequences with length ≥30 bp and sequence identify ≥90% in the chloroplast genomes. F, P, R and C indicate the repeat types forward, palindrome, reverse and complement, respectively.
Molecules 23 00857 g002
Figure 3. Distribution of SSRs present in seven chloroplast genomes of Caryophyllales.
Figure 3. Distribution of SSRs present in seven chloroplast genomes of Caryophyllales.
Molecules 23 00857 g003
Figure 4. Sequence alignment of matK among nine species cp genome in Caryophyllales. As the matK gene are too long, only the sequences with greater variation were shown here.
Figure 4. Sequence alignment of matK among nine species cp genome in Caryophyllales. As the matK gene are too long, only the sequences with greater variation were shown here.
Molecules 23 00857 g004
Figure 5. The gene of rpl23 sequence alignment among six Caryophyllales species.
Figure 5. The gene of rpl23 sequence alignment among six Caryophyllales species.
Molecules 23 00857 g005
Figure 6. Complete chloroplast genome sequence comparison of eight species using mVISTA, with T. paniculatum as a reference. The horizontal axis corresponds to the coordinates within the chloroplast genome. The vertical scale represents the identity percentage. The grey lines and the arrows show the genes with their orientation and position. CNS: conserved noncoding sequences.
Figure 6. Complete chloroplast genome sequence comparison of eight species using mVISTA, with T. paniculatum as a reference. The horizontal axis corresponds to the coordinates within the chloroplast genome. The vertical scale represents the identity percentage. The grey lines and the arrows show the genes with their orientation and position. CNS: conserved noncoding sequences.
Molecules 23 00857 g006
Figure 7. Comparison of the borders of the LSC, SSC and IR regions in nine Caryophyllales species. #indicates that the gene is a pseudogene.
Figure 7. Comparison of the borders of the LSC, SSC and IR regions in nine Caryophyllales species. #indicates that the gene is a pseudogene.
Molecules 23 00857 g007
Figure 8. Phylogenetic tree of the 35 species in Caryophyllales using maximum parsimony (MP) and tree bisection-reconnection (TBR) analysis based on 48 protein-coding genes using a non-partitioning scheme. The phylogenetic tree was drawn using Cistanche deserticola and Rehmannia chingiis as outgroup.
Figure 8. Phylogenetic tree of the 35 species in Caryophyllales using maximum parsimony (MP) and tree bisection-reconnection (TBR) analysis based on 48 protein-coding genes using a non-partitioning scheme. The phylogenetic tree was drawn using Cistanche deserticola and Rehmannia chingiis as outgroup.
Molecules 23 00857 g008
Table 1. Chloroplast genome composition of Talinum paniculatum (Jacp) Gaertn.
Table 1. Chloroplast genome composition of Talinum paniculatum (Jacp) Gaertn.
RegionSize (bp)T(U) (%)C (%)A (%)G (%)GenesProtein-Coding GenestRNA GenesrRNA Genes
LSC86,89833.217.832.216.88356220
SSC18,52934.615.934.914.6121210
IRA25,75129.022.328.220.419774
IRB25,75128.220.429.022.319874
Total156,92931.819.030.718.512883 (4)37 (7)8 (4)
CDS78,43831.517.630.820.1
1st position26,14623.718.830.726.7
2nd position26,14632.720.229.417.8
3rd position26,14638.113.732.415.7
CDS: protein-coding regions. The numbers in brackets represent the number of repeated genes.
Table 2. Genes of the Talinum paniculatum (Jacp) Gaertn.
Table 2. Genes of the Talinum paniculatum (Jacp) Gaertn.
Group of GenesName of GeneNo.
Photosytem IpsaA, psaB, psaC, psaI, psaJ5
Photosytem IIpsbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ15
Cytochrome b/f complexpetA, petB, petD, petG, petL, petN6
ATP systematpA, atpB, atpF, atpF, atpH, atpI6
NADH dehydrogenasendhA, ndhB *, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK12
RuBisCO large subnitrbcL1
RNA polymeraserpoA, rpoB, rpoC1, rpoC24
Ribosomal proteins (SSU)rps2, rps3, rps4, rps7 *, rps8, rps11, rps12 *, rps14, rps15, rps16, rps18, rps1914
Ribosomal proteins (LSU)rpl2, rpl14, rpl16, rpl19, rpl20, rpl22, rpl32, rpl33, rpl369
Miscellaneous proteinsaccD, cemA, ccsA, clpP, infA, matK6
Hypothetical chloroplast reading frames (ycf)ycf1, ycf2 *, ycf3, ycf45
Transfer RNAstrnA-UGC *, trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnG-UCC, trnG-GCC, trnH-GUG, trnI-CAU *, trnIGAU *, trnK-UUU, trnL-UAA, trnL-CAA *, trnL-UAG, trnM-CAU, trnN-GUU *, trnP-UGG, trnQ-UUG, trnR-UCU, trnRACG *, trnS-GCU, trnS-UGA, trnS-GGA, trnT-GGU, trnT-UGU, trnV-UAC, trnV-GAC *, trnW-CCA, trnY-GUA, trnfM-CAU37
Ribosomal RNAsrrn4.5 *, rrn5 *, rrn16 *, rrn23 *8
Pseudogenerpl23 *, rps19, ycf14
Total132
* indicates a duplicated gene.
Table 3. The intron-containing genes in the T. paniculatum cp genome and the lengths of the exons and introns.
Table 3. The intron-containing genes in the T. paniculatum cp genome and the lengths of the exons and introns.
No.GeneLocationExonI (bp)IntronI (bp)ExonII (bp)IntronII (bp)ExonIII (bp)
1trnK-UUULSC35250237
2rps16LSC20286741
3trnG-UCCLSC2370748
4atpFLSC410745145
5rpoC1LSC1611794432
6ycf3LSC153773229769125
7trnL-UAALSC3759950
8trnV-UACLSC3558638
9clpPLSC22959029189471
10petBLSC6768642
11petDLSC8792475
12rpl16LSC39911029
13ndhB *IR756668777
14rps12 *IR114-23253326
15trnI-GAU *IR3794735
16trnA-UGC *IR3881835
17ndhASSC5391087553
* indicates duplicated gene.
Table 4. Codon usage in the T. paniculatum cp genome.
Table 4. Codon usage in the T. paniculatum cp genome.
Amino AcidCodonCountRSCUAmino AcidCodonCountRSCUAmino AcidCodonCountRSCUAmino AcidCodonCountRSCU
PheUUU(F)9751.3SerUCU(S)5561.67TyrUAU(Y)7791.61StopUGA(*)200.72
PheUUC(F)5270.7SerUCC(S)3310.99TyrUAC(Y)1900.39TrpUGG(W)4601
LeuUUA(L)8541.84SerUCA(S)4081.22StopUAA(*)461.66AlaGCU(A)6151.75
LeuUUG(L)5541.2SerUCG(S)1870.56StopUAG(*)170.61AlaGCC(A)2340.66
LeuCUU(L)6011.3SerAGU(S)3931.18HisCAU(H)4481.49AlaGCA(A)4161.18
LeuCUC(L)1790.39SerAGC(S)1270.38HisCAC(H)1550.51AlaGCG(A)1440.41
LeuCUA(L)4080.88ProCCU(P)4261.6GlnCAA(Q)7291.54ArgCGU(R)3601.36
LeuCUG(L)1840.4ProCCC(P)2000.75GlnCAG(Q)2160.46ArgCGC(R)920.35
IleAUU(I)11301.5ProCCA(P)2991.12AsnAAU(N)9601.53ArgCGA(R)3661.38
IleAUC(I)4100.54ProCCG(P)1430.54AsnAAC(N)2970.47ArgCGG(R)1200.45
IleAUA(I)7180.95ThrACU(T)5311.61LysAAA(K)10611.5ArgAGA(R)4751.79
MetAUG(M)6071ThrACC(T)2500.76LysAAG(K)3570.5ArgAGG(R)1750.66
ValGUU(V)5141.48ThrACA(T)4021.22AspGAU(D)8831.66GlyGGU(G)5521.26
ValGUC(V)1600.46ThrACG(T)1340.41AspGAC(D)1840.34GlyGGC(G)1900.43
ValGUA(V)5221.5CysUGU(C)2351.55GluGAA(E)10411.53GlyGGA(G)7121.62
ValGUG(V)1940.56CysUGC(C)680.45GluGAG(E)3230.47GlyGGG(G)3020.69
RSCU: Relative synonymous codon usage. RSCU > 1 are highlighted in bold. * indicates stop codon.
Table 5. Repeat sequences distribution in the T. paniculatum (Jacp) Gaertn chloroplast genome.
Table 5. Repeat sequences distribution in the T. paniculatum (Jacp) Gaertn chloroplast genome.
No.Size (bp)TypeRepeat 1 StartRepeat 1 LocationRepeat 2 StartRepeat 2 LocationLocation
135F3144matK6441IGS (rps16, trnQ-UUG)LSC
230P4153IGS (trnK-UUU (exon), rps16)4211IGS (trnK-UUU (exon), rps16)LSC
330R4578IGS (trnK-UUU (exon), rps16)4581IGS (trnK-UUU (exon), rps16)LSC
430C4581IGS (trnK-UUU (exon), rps16)4582IGS (trnK-UUU (exon), rps16)LSC
530R6862IGS (trnQ-UUG, psbK)8268IGS (trnS-GCU, trnG-UCC)LSC
630F7743IGS (psbI, trnS-GCU)7770IGS (psbI, trnS-GCU)LSC
732F7896IGS (psbI, trnS-GCU)36,017IGS (psbC, trnS-UGA)LSC
830P7898IGS (psbI, trnS-GCU)46,268trnS-GGALSC
937R8258IGS (trnS-GCU, trnG-UCC)8261IGS (trnS-GCU, trnG-UCC)LSC
1037R8258IGS (trnS-GCU, trnG-UCC)8264IGS (trnS-GCU, trnG-UCC)LSC
1135F8258IGS (trnS-GCU, trnG-UCC)8277IGS (trnS-GCU, trnG-UCC)LSC
1235R8266IGS (trnS-GCU, trnG-UCC)8277IGS (trnS-GCU, trnG-UCC)LSC
1334R8258IGS (trnS-GCU, trnG-UCC)8261IGS (trnS-GCU, trnG-UCC)LSC
1433F8261IGS (trnS-GCU, trnG-UCC)8283IGS (trnS-GCU, trnG-UCC)LSC
1532R8263IGS (trnS-GCU, trnG-UCC)8280IGS (trnS-GCU, trnG-UCC)LSC
1631F8261IGS (trnS-GCU, trnG-UCC)8264IGS (trnS-GCU, trnG-UCC)LSC
1731R8261IGS (trnS-GCU, trnG-UCC)8280IGS (trnS-GCU, trnG-UCC)LSC
1831R8267IGS (trnS-GCU, trnG-UCC)29,873IGS (psbM, trnD-GUC)LSC
1930P8267IGS (trnS-GCU, trnG-UCC)62,668IGS (ycf4, cemA)LSC
2030P8280IGS (trnS-GCU, trnG-UCC)31,428IGS (trnE-UUC, trnT-GGU)LSC
2131F9566trnG-UCC37,057trnG-GCCLSC
2230P36,019IGS (psbC, trnS-UGA)46,268trnS-GGALSC
2330F39,314psaB41,538psaALSC
2442F44,540ycf3 (intronII)123,558ndhA (intron)LSC, SSC
2539F44,543ycf3 (intronII)100,738IGS (rps12, trnV-GAC)LSC, IRb
2639P44,543ycf3 (intronII)143,050IGS (trnV-GAC, rps12)LSC, IRa
2730F44,555ycf3 (intronII)100,750IGS (rps12, trnV-GAC)LSC, IRb
2830P44,555ycf3 (intronII)143,047IGS (trnV-GAC, rps12)LSC, IRa
2940P76,849IGS (psbT, psbN)76,849IGS (psbT, psbN)LSC
3030P84,344IGS (trnS-GCU, trnG-UCC)84,346IGS (trnS-GCU, trnG-UCC)LSC
3161F93,517ycf293,535ycf2IRb
3261P93,517ycf2150,231ycf2IRb, IRa
3361P93,535ycf2150,249ycf2IRb, IRa
3461F150,231ycf2150,249ycf2IRa
3552F93,526ycf293,544ycf2IRb
3652P93,526ycf2150,231ycf2IRb, IRa
3752P93,544ycf2150,249ycf2IRb, IRa
3834F93,526ycf293,562ycf2IRb
3934P93,526ycf2150,231ycf2IRb, IRa
4034P93,562ycf2150,267ycf2IRb, IRa
4143F93,517ycf293,533ycf2IRb
4243P93,517ycf2150,231ycf2IRb, IRa
4343P93,553ycf2150,267ycf2IRb, IRa
4443F150,231ycf2150,267ycf2IRa
4540F100,738IGS (rps12, trnV-GAC)123,561ndhA (intron)IRb, SSC
4634F109,506IGS (rrn4.5, rrn5)109,538IGS (rrn4.5, rrn5)IRb
4734F109,506IGS (rrn4.5, rrn5)134,255IGS (rrn5, rrn4.5)IRb, IRa
4834P109,538IGS (rrn4.5, rrn5)134,287IGS (rrn5, rrn4.5)IRb, IRa
4934P134,255IGS (rrn5, rrn4.5)134,287IGS (rrn5, rrn4.5)IRa
5038P118,646IGS (ccsA, ndhD)118,646IGS (ccsA, ndhD)SSC
5140P123,561ndhA (intron)143,049IGS(trnV-GAC, rps12)SSC, IRa
F: forward repeat; P: palindrome (inverted) repeat; R: reverse repeat; C: complement repeat. IGS: intergenic spacer.
Table 6. Frequency of simple sequence repeats in the T. paniculatum chloroplast genome.
Table 6. Frequency of simple sequence repeats in the T. paniculatum chloroplast genome.
Length Unit101011121314151617181920Total
A1515381131133
T1414473211133
C111
G111
AG111
AT4437
TA22114
AAT111
ATA11
TTA22
TAT11
AGGT11
ATGG11
AATT11
CTAC11
TTTC11
TAAT11
GGAA11
Table 7. Simple sequence repeats in the CDSs of the T. paniculatum chloroplast genome.
Table 7. Simple sequence repeats in the CDSs of the T. paniculatum chloroplast genome.
No.TypeMotifSizeStartEndLocationRegion
1P1(A)10104756trnH-UGGLSC
2P1(A)1010637646psbALSC
3P1(A)111121042114matKLSC
4P1(A)121239423953trnK-UUU (intron)LSC
5P2(AT)510755764psbALSC
6P4(AATT)31239743985trnK-UUU (intron)LSC
7P4(CCAT)3125465trnH-UGGLSC

Share and Cite

MDPI and ACS Style

Liu, X.; Li, Y.; Yang, H.; Zhou, B. Chloroplast Genome of the Folk Medicine and Vegetable Plant Talinum paniculatum (Jacq.) Gaertn.: Gene Organization, Comparative and Phylogenetic Analysis. Molecules 2018, 23, 857. https://doi.org/10.3390/molecules23040857

AMA Style

Liu X, Li Y, Yang H, Zhou B. Chloroplast Genome of the Folk Medicine and Vegetable Plant Talinum paniculatum (Jacq.) Gaertn.: Gene Organization, Comparative and Phylogenetic Analysis. Molecules. 2018; 23(4):857. https://doi.org/10.3390/molecules23040857

Chicago/Turabian Style

Liu, Xia, Yuan Li, Hongyuan Yang, and Boyang Zhou. 2018. "Chloroplast Genome of the Folk Medicine and Vegetable Plant Talinum paniculatum (Jacq.) Gaertn.: Gene Organization, Comparative and Phylogenetic Analysis" Molecules 23, no. 4: 857. https://doi.org/10.3390/molecules23040857

Article Metrics

Back to TopTop