Article

Deciphering the Plastome and Molecular Identities of Six Medicinal “Doukou” Species

1 Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
2 Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
3 State Key Laboratory for Conservation and Utilization of Bio-Resources, Research Center of Perennial Rice Engineering and Technology, School of Agriculture, Yunnan University, Kunming 650201, China
4 School of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
5 CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650091, China
* Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(16), 9005; https://doi.org/10.3390/ijms25169005
Submission received: 4 June 2024 / Revised: 1 August 2024 / Accepted: 16 August 2024 / Published: 19 August 2024
(This article belongs to the Section Molecular Biology)

Abstract

The genus Amomum includes over 111 species, 6 of which are widely utilized as medicinal plants and have already undergone taxonomic revision. Due to their morphological similarities, the presence of counterfeit and substandard products remains a challenge. Accurate plant identification is, therefore, essential to address these issues. This study utilized 11 newly sequenced samples and extensive NCBI data to perform molecular identification of the six medicinal “Doukou” species. The plastomes of these species exhibited a typical quadripartite structure with a conserved gene content. However, independent variation shifts of the SC/IR boundaries existed between and within species. The comprehensive set of genetic sequences, including ITS, ITS1, ITS2, complete plastomes, matK, rbcL, psbA-trnH, and ycf1, showed varying discrimination of the six “Doukou” species based on both distance and phylogenetic tree methods. Among these, the ITS, ITS1, and complete plastome sequences demonstrated the highest identification success rate (3/6), followed by ycf1 (2/6), and then ITS2, matK, and psbA-trnH (1/6). In contrast, rbcL failed to identify any species. This research established a basis for a reliable molecular identification method for medicinal “Doukou” plants to protect wild plant resources, promote the sustainable use of medicinal plants, and restrict the exploitation of these resources.

1. Introduction

Species identification is crucial in the fields of biology and ecology [1] and serves as the basis for ecological research, enabling the understanding of species richness and biodiversity [2]. It informs conservation efforts through the identification of endangered, invasive, and keystone species [3]. Additionally, it plays a crucial role in predicting and preventing infectious disease outbreaks by identifying potential disease hosts and transmitters among wild animal species [4]. In food production industries, species identification ensures authenticity, quality, and safety, preventing fraud and the circulation of substandard products [5]. In criminal and forensic cases, it aids in identifying the origin of wildlife products [6]. Traditional methods of species identification, relying on morphological characteristics, have limitations in discriminating taxa with limited morphological differences or complex phylogenetic history. DNA barcoding technology has emerged as an effective advancement to overcome these challenges [7].
DNA barcoding is a molecular technique that identifies biological species by examining distinct DNA segments, utilizing variations in short DNA sequences to provide rapid and reliable species identification [5,8,9,10,11,12]. The concept of DNA barcoding was first proposed by Paul Hebert, who suggested using a small, highly conserved genetic sequence called the “ribosomal RNA gene region” to identify species [5]. Initially, DNA barcoding was widely used in animals, where the mitochondrial gene encoding cytochrome c oxidase I (COI) has a high species differentiation potential, especially in insects, birds, and fish [13,14]. Therefore, the COI gene has become the preferred choice for universal DNA barcoding in animals due to its high level of accuracy in species identification [15]. However, in plant mitochondrial genomes, the COI gene shows a high degree of conservation and is not suitable as a DNA barcode [16]. In addition, complex evolutionary events such as hybridization, polyploidization, and incomplete lineage sorting are more common in plants than in animals, further increasing the difficulty of screening fragments suitable for DNA barcoding [17]. Currently, the internationally recognized universal plant DNA barcodes comprise four regions: ITS (internal transcribed spacer, i.e., internal transcribed spacer 1-5.8S-internal transcribed spacer 2), matK, rbcL, and psbA-trnH [18]. The selection of these gene regions considers the genetic diversity and evolutionary history of the plant kingdom to enhance the effectiveness and usefulness of plant DNA barcodes. However, these fragments have limitations. As an alternative, ultra-barcoding using the complete plastomes for plant species identification has been proposed [19]. Although discerning closely related species using DNA barcoding can pose challenges, this technique is promising for distinguishing species that are morphologically indistinguishable but genetically distinct [20].
Amomum Roxb. is the second-largest genus in the Zingiberaceae Martinov family after Alpinia and includes approximately 111 [21] to 150 [22,23] species distributed in tropical Asia and Australia, particularly in Southeast Asia, such as India, Malaysia, and Indonesia [23]. In China, Amomum comprises 39 species (29 endemic and 1 introduced) [23], mainly distributed across the Fujian, Guangdong, Guangxi, Guizhou, Yunnan, and Tibet provinces [22]. Among these, six species are listed in the Chinese Pharmacopoeia [24]. These species, originally classified within the genus Amomum, have undergone taxonomic revision [25]. These encompass (1) Lanxangia tsaoko (Crevost & Lemarié) M. F. Newman & Škorničk (synonym: A. tsaoko Crevost et Lemarie), (2) Wurfbainia compacta (Sol. ex Maton) Škorničk. & A. D. Poulsen (synonyms: A. compactum Sol. ex Maton and Zingiber compactum (Sol. ex Maton) Stokes), (3) W. longiligularis (T. L. Wu) Škorničk. & A. D. Poulsen (synonym: A. longiligulare T. L. Wu), (4) W. vera (Blackw.) Škorničk. & A. D. Poulsen (synonyms: A. krervanh Pierre ex Gagnep. and A. verum Blackw.), (5) W. villosa (Lour.) Škorničk. & A. D. Poulsen (synonyms: A. villosum Lour., Cardamomum villosum (Lour.) Kuntze, and Z. villosum (Lour.) Stokes), and (6) W. villosa var. xanthioides (Wall. ex Baker) Škorničk. & A. D. Poulsen (synonyms: A. xanthioides Wall. ex Baker, A. villosum var. xanthioides (Wall. ex Bak.) T. L. Wu & S. J. Chen, and C. xanthioides Wall. ex Kuntze) [25]. They exhibit a diverse range of traits and applications. For instance, W. compacta is a widely used culinary spice, and its fruits, leaves, and seeds have a wide range of pharmacological activities in traditional medicine, including antifungal, antibacterial, antioxidant, gastroprotective, anti-inflammatory, immunomodulatory, anticancer, and antiasthmatic effects, as well as use in the treatment of acute renal failure [26]. The fruits of W. vera have shown antibacterial activity [27]. The active ingredients in W. longiligularis and W. villosa var. xanthioides have antibacterial activity [28,29]. In addition, the powerful antioxidant properties of W. villosa var. xanthioides have been used in the treatment of non-alcoholic fatty liver disease (NAFLD) and non-alcoholic steatohepatitis (NASH) [30]. L. tsaoko has been found to contain antifungal active substances [31] and antioxidant ingredients [32], indicating its potential medicinal properties; recent studies suggested that it can relieve constipation and could be a promising candidate for developing laxatives [33]. The total flavonoids extracted from W. villosa have shown potential for developing new drugs to treat gastric cancer [34]. Chemical components found in the seeds of W. villosa can enhance cellular antioxidant activity [35]. Additionally, Chen et al. (2018) have confirmed the potential beneficial effects of W. villosa in treating inflammatory bowel disease [36]. Li et al. (2016) demonstrated that the fresh stems and leaves of W. villosa can be used as high-quality feed for cattle, sheep, and other grass-eating livestock [37]. Despite these benefits, the morphological similarities among these species make them difficult to distinguish and easy to confuse. Therefore, molecular identification through DNA barcoding is crucial for accurately identifying Amomum species.
Recent studies have utilized various universal barcodes, with the ITS sequence being approximately 500–700 bp long. While the length of the ITS sequence is relatively conserved, its sequence exhibits significant variability, making it useful for species differentiation [38]. The sequencing and analysis of ITS are rapid and cost-effective compared to traditional morphological classification methods. Additionally, an extensive repository of ITS sequences is available in public databases, providing researchers with a wealth of reference resources that facilitate convenient species identification and classification. The GenBank database at the National Center for Biotechnology Information (NCBI) hosts an extensive collection of ITS sequences for Amomum and its taxonomic synonyms. As of 11 April 2024, it includes 572 sequences representing 159 species. This vast dataset is a valuable resource for this study, providing comprehensive and diverse information. Selvaraj et al. (2012) revealed that ITS and ITS1 are successful DNA barcodes for differentiating Boerhavia diffusa Linnaeus from counterfeit medicinal plants [39]. The ITS2 (internal transcribed spacer 2) region has been utilized for the identification of medicinal plants and their closely related species [40] in the Polygonaceae A. L. Jussieu family [41] and the Dendrobium Sw. genus [42]. ITS2 has been demonstrated to be the most promising universal DNA barcode for the Zingiberaceae family [43]. The complete plastome, matK, and rbcL sequences have been shown to effectively distinguish W. compacta, W. longiligularis, and W. villosa [44]. Notably, matK and the psbA-trnH intergenic spacer exhibited high identification efficiency for L. tsaoko and other Amomum species [45]. The most efficient barcodes for the molecular identification of Amomum are ITS [46,47,48], ITS1 [49,50], and ITS2 [51,52,53].
These findings have demonstrated the promising potential application of DNA barcoding in species identification and classification within Amomum. DNA barcoding can facilitate accurate identification and classification of different Amomum species, which helps to understand their diversity and evolutionary relationships, and it is an effective tool that guides methods for Amomum’s protection, sustainable utilization, and medicinal value research.
In this study, we employed a combination of newly sequenced data and additional data from the NCBI database, including (1) ITS, (2) ITS1, (3) ITS2, (4) complete plastomes, (5) matK, (6) rbcL, (7) psbA-trnH, and (8) ycf1 to facilitate the evaluation and precise identification of six medicinal plants within the Amomum genus. Using DNA barcoding, we evaluated the value of barcodes in identifying different Amomum medicinal species, thereby reducing the potential errors associated with traditional morphological methods. Our findings will enhance the sustainable management and conservation of Amomum resources, thereby facilitating industrial growth and quality control. Ultimately, this will lead to substantial scientific and societal advantages.

2. Results

2.1. Plastome Structural Variation, Sequence Divergences, and Hypervariable Regions

All 41 individuals from the 6 examined “Doukou” species exhibited a quadripartite structure (Figure 1) and showed limited intraspecific variation in plastome size (Table S1). The complete plastomes ranged in size from 162,678 to 164,332 bp. The lengths of the large single-copy (LSC), small single-copy (SSC), and inverted repeat (IR) regions ranged from 87,632 to 89,067 bp, 14,895 to 15,754 bp, and 29,642 to 29,971 bp, respectively (Table S1). There was a slight variation in the total GC content, which ranged from 36.0% to 36.4% (Table S1). However, the GC content was higher in the IR regions (41.0–41.2%) than in the LSC (33.7–34.1%) and SSC (29.6–30.3%) regions (Table S1). The “Doukou” plastomes were highly conserved and encoded between 121 and 133 genes, including 82 to 87 protein-coding genes, 8 rRNA genes, and 30 to 38 tRNA genes (Table S1).
We compared the contraction and expansion of the IR regions at the four junctions between the two IRs (IRa and IRb) and the two single-copy regions (LSC and SSC) among the six “Doukou” species (Figure 2). The LSC/IRb boundary was embedded in the rpl22-rps19 region (except for W. compacta YWB91902-1 and W. vera YWB91901-1, in which it was located directly in the rpl22 gene); the IRb/SSC and SSC/IRa boundaries were within the ycf1 gene; the IRa/LSC boundary was in the rps19-psbA region. The boundary shifts exhibited independent variations both between and within species.
The nucleotide diversity (π) values were calculated with DnaSP v.5 [54] to assess the level of divergence across different regions within the complete plastomes of the six “Doukou” species and their taxonomic synonyms. The average π value was 0.00469. The π values ranged from 0 to 0.02354 across the plastomes, and the most hypervariable region was ycf1 (Figure 3).

2.2. Sequence Characteristics

The matrix characteristics of ITS, ITS1, ITS2, complete plastomes, matK, rbcL, psbA-trnH, and ycf1 of six medicinal “Doukou” species and their taxonomic synonyms are listed in Table 1. ITS2 had the highest percentage of variable sites, but complete plastomes had the most variable sites. The same was true for singleton sites (Table 1). ITS1 had the highest percentage of parsimony-informative sites (Table 1).
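The site categories reported in Table 1 can be illustrated with a minimal sketch. It assumes the usual definitions (a variable site carries at least two character states; a parsimony-informative site carries at least two states that each occur in at least two sequences; a singleton site is variable but not parsimony-informative) and ignores gaps and ambiguous bases, which may differ from the settings of the software used to build Table 1. The function name and the toy alignment are hypothetical.

```python
from collections import Counter


def site_statistics(alignment):
    """Count variable, singleton, and parsimony-informative columns.

    `alignment` is a list of equal-length aligned sequences (strings).
    Gaps and ambiguous bases are ignored (an assumption of this sketch).
    """
    if not alignment:
        raise ValueError("alignment is empty")
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    variable = singleton = informative = 0
    for column in zip(*alignment):
        counts = Counter(b for b in map(str.upper, column) if b in "ACGT")
        if len(counts) < 2:
            continue  # invariant (or uninformative) column
        variable += 1
        # Informative: at least two states, each present in >= 2 sequences.
        if sum(1 for n in counts.values() if n >= 2) >= 2:
            informative += 1
        else:
            singleton += 1

    return {
        "length": length,
        "variable": variable,
        "singleton": singleton,
        "parsimony_informative": informative,
        "percent_variable": 100.0 * variable / length,
    }


# Toy alignment (hypothetical data, not from this study).
toy = ["ACGTA", "ACGTA", "ATGCA", "ATGCC"]
stats = site_statistics(toy)
```

A short marker can therefore show a high percentage of variable sites while a long one, such as a complete plastome, still contains more variable sites in absolute terms.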

2.3. Distance Based Species Discrimination

Analyses of intra- and interspecific Kimura 2-parameter (K2P) distances identified varying barcoding gaps among the six medicinal “Doukou” species and their taxonomic synonyms across different datasets. In the barcoding gap analysis, the ITS1 and complete plastome barcodes exhibited the highest discriminatory power, successfully identifying 50% of the species (3/6; Table S2; Figure 4). The ITS and ycf1 barcodes were the next most effective, identifying 33% of the species (2/6; Table S2; Figure 4). The ITS2 and psbA-trnH barcodes could only identify one species each, accounting for 17% of the species (1/6; Table S2; Figure 4). The matK and rbcL barcodes could not identify any species (Table S2; Figure 4). In the ABGD analysis, ITS and ITS1 performed best (3/6; 50%; Table S3), followed by ycf1 (2/6; 33%; Table S3), while the other five performed the worst (1/6; 17%; Table S3). The number of generated OTUs varied across the ABGD analysis with the different prior intraspecific divergence in the initial and recursive partitions (Table S3).

2.4. Tree Based Species Discrimination

In the ITS dataset, due to the abundance of sequences for W. villosa (synonyms: A. villosum, C. villosum, and Z. villosum), maximum likelihood (ML) and Bayesian inference (BI) trees were initially constructed for all individuals (Figures S1 and S2). Subsequently, three individuals from the W. villosa (A. villosum, C. villosum, and Z. villosum) branch of the ML tree were chosen for the construction of subsequent ITS, ITS1, and ITS2 trees. Similarly, in the matK (Figures S3 and S4) and rbcL (Figures S5 and S6) datasets, three individuals were selected from the same branch in the ML tree for the construction of subsequent matK and rbcL trees. In both cases, these individuals were chosen from the top, middle, and bottom of the branch to represent the full range of genetic diversity.
The ML and BI topologies derived from seven of the eight datasets for the six species were congruent in showing which species were monophyletic (Figure 5, Figure 6 and Figures S7–S18), except the ITS1 dataset, which differed from the others (Figure 7 and Figure S19). Across all datasets, including ITS, ITS1, ITS2, complete plastome, matK, rbcL, psbA-trnH, and ycf1, L. tsaoko and all individuals of its taxonomic synonyms formed a monophyletic group, demonstrating the successful identification of L. tsaoko (Figure 5, Figure 6, Figure 7 and Figures S7–S19). Similarly, W. compacta, along with all individuals of its taxonomic synonyms, formed a monophyletic group in the ITS, ITS1, complete plastome, and ycf1 datasets (Figure 5, Figure 6, Figure 7 and Figures S7, S10, S17 and S18). In the ITS, ITS1, and complete plastome datasets, individuals of W. vera and its taxonomic synonyms formed a monophyletic group (Figure 5, Figure 6, Figure 7 and Figures S7, S10 and S19). Overall, the ITS, ITS1, and complete plastome datasets can successfully identify L. tsaoko, W. compacta, and W. vera (3/6; Figure 5, Figure 6, Figure 7 and Figures S7 and S10); ycf1 can successfully identify L. tsaoko and W. compacta (2/6; Figures S17 and S18); the ITS2, matK, and psbA-trnH datasets can successfully identify L. tsaoko (1/6; Figures S8, S9 and S11–S14); the rbcL dataset cannot identify any species (Figure 8 and Figures S15 and S16). However, W. longiligularis, W. villosa, W. villosa var. xanthioides, and the individuals of their taxonomic synonyms did not form monophyletic groups in the four datasets (Figure 5, Figure 6, Figure 7 and Figures S7–S19).

3. Discussion

3.1. Plastome Characteristics and DNA Barcode Performance

The plastomes of “Doukou” were highly conserved and exhibited a typical quadripartite structure, a characteristic shared with nine species within the subfamily Zingiberoideae [55], Zingiber Boehm. [56], various species of Curcuma L. [57], and other photosynthetic angiosperms [58,59,60]. In the six medicinal “Doukou” species, the maximum possible species discrimination was 3/6 because W. longiligularis, W. villosa, and W. villosa var. xanthioides were non-monophyletic for ITS, ITS1, plastome, and plastome-standard barcodes (Figure 5, Figure 6, Figure 7 and Figures S7–S19).
Taxon-specific markers present a feasible alternative that balances the costs of comprehensive super-barcodes, such as whole plastomes, against the limited genetic variability often found in standard barcodes. For the six medicinal “Doukou” species, we identified the ycf1 gene as the most significant mutational hotspot, with a π value of 0.02354 (Figure 3), similar to other members of the Zingiberaceae family [46,57]. This was consistent with ycf1 having a higher identification rate than the matK, rbcL, and psbA-trnH barcodes (Figure 8 and Figures S17 and S18). The four conventional barcodes (ITS2, matK, rbcL, and psbA-trnH) were each able to reliably identify at most a single species, so the ycf1 gene region could serve as a viable alternative for species identification for the revised Amomum species. Given the financial demands of complete plastome sequencing, this gene region can offer a cost-effective and efficient method for population genetic research on Amomum. Additionally, this approach aids in the development of a growing database for taxon-specific barcodes.

3.2. Performance Comparison of Species Delimitation Methods

Consistent with previous research [46,47,48], species delimitation outcomes vary with the data and methodologies applied. Among the methods evaluated (ABGD, BG, BI, and ML), ML stood out as the most effective for species identification, closely followed by BI, as illustrated in Figure 8. Additionally, the topological structures produced by ML and BI are largely similar, suggesting that these methods consistently achieve the highest identification rates (Figure 8). While the identification rates for ABGD and BG differ, ABGD generally outperforms BG (Figure 8), leading to a method ranking of ML > BI > ABGD > BG. Given the demonstrated robustness and efficiency of ML and BI in this study, these methods are recommended as the preferred approaches for species delimitation in DNA barcode-based identification, particularly when employing super-barcodes.

3.3. DNA Barcoding in Six Medicinal “Doukou” Plants

Previous studies have indicated that the standard barcodes are not sufficient for the identification of several medicinal plants within the genus Amomum [44,48,49,50,51,53,61]. The complete plastomes have demonstrated a strong capability to differentiate species of Amomum [44]. The results of this study have further validated these findings. The ability to distinguish species of Amomum is enhanced by the length of the complete plastome sequence, which is approximately 160,000 bp, and its inclusion of many informative sites. However, the sequencing and analysis of the complete plastomes are considerably more expensive and resource-intensive than those of short fragments such as ITS sequences. ITS sequencing is more cost-effective and demands fewer computational resources for analysis. Despite its relatively short length of approximately 600 bp, the informative sites within the ITS region can accurately distinguish among L. tsaoko, W. compacta, and W. vera, similar to the capabilities of ITS1. ITS2 can only successfully identify L. tsaoko. Although ITS2 contains the highest proportion of variable sites, the complete plastomes hold the largest number of variable sites (Table 1).
Previous studies have shown that the identification rate of ITS/ITS1/ITS2 is higher than that of plastome fragments [51,62,63]. This may be because the plastome only contains maternal genetic information [63], while ITS/ITS1/ITS2 contain richer biparental genetic information [64]. ITS sequences typically have multiple copies, which may increase ITS variability and improve the accuracy of species identification. Conversely, plastome fragments may only have a single copy, limiting their identification capabilities in some taxa. Notably, some taxa may contain hybrids or show hybridization, leading to difficulties in species identification using plastome fragments. In this case, ITS sequences may better reflect the genetic differences between such species, thereby improving the identification rate.
In the eight datasets, some individuals of non-target species were placed within the monophyletic groups of target species (Figure 5, Figure 6, Figure 7 and Figures S1–S17). This might be due to errors in species identification in the source records, given that the NCBI database has an extensive range of sources. Previously, several studies relied solely on distance for species identification. However, subsequent research has indicated that the barcode gap may be an artifact of under-sampled taxonomic groups [65]. Therefore, when carrying out species identification, it is important to incorporate extensive analyses. The relationship between the minimum interspecific distance and the maximum intraspecific distance among the six species, along with the consistency between the ABGD grouping results and the tree results, provides strong evidence to support the species identification and classification of these species.

3.4. NCBI Database as a Resource and ITS vs. ITS1 Identification Rate

The NCBI database has provided comprehensive biological and biomedical information [66], offering a vast collection of genetic sequences, gene expression data, protein structures, and scientific literature. Its user-friendly interface and open-access policy promote global scientific collaboration. However, challenges include navigating through the extensive data and ensuring the quality and accuracy of the information due to varying submissions from researchers and institutions. This research was conducted based on a large amount of NCBI data, and reliable results were obtained. The NCBI database provides great convenience for research.
The ITS region has been proposed as a standard DNA barcode marker in fungi [67] and plants [68]. In our study, the identification rate of ITS1 was higher than that of ITS2, which aligned with the view that ITS1 is a better barcode than ITS2 in eukaryotes [69]. Despite the evaluation of ITS1 and ITS2 as meta-barcode markers for fungi [70], their identification efficacy as DNA barcode markers varies across different taxa. In this study, the individuals used across the ITS, ITS1, and ITS2 datasets are consistent. Therefore, this research serves as a reference, suggesting that the ITS1 dataset might be considered first in practical applications when the experimental individuals are identical.
Although ITS is significantly longer than ITS1, there is no noticeable difference in the difficulty of amplification between the two. Even though the ITS dataset had more variable sites than ITS1, this did not necessarily mean that it surpassed ITS1 in identification rate. Notably, the percentage of variable sites within the ITS1 sequence is higher than that within the entire ITS sequence. Many factors can affect the identification rate, including the presence of key variable sites that can significantly distinguish species, the amount of recognizable feature information within the dataset, and the size of the differences in the species being identified. In these aspects, the ITS1 dataset may be superior to ITS and, thus, have a higher identification rate.
The ITS1 sequence is approximately 100–200 bp in length. In most cases, it can be easily and cost-effectively acquired through sequencing of polymerase chain reaction (PCR) amplicons. Furthermore, abundant ITS sequences of the genus Amomum and its taxonomic synonyms can be directly extracted from the NCBI database. The multiple analyses in this study mutually verified that ITS1 has the highest identification rate. This suggests that in future identification of these six medicinal “Doukou” plants, ITS1 should be considered first.

4. Materials and Methods

4.1. Taxon Sampling

Based on the phylogenetic relationships of the genus Amomum established by Boer et al. [25], we selected close relatives of six target species for our study. We consulted this study to identify the accepted names and synonyms for these six target species and their closely related species. Our data collection and analysis focused on these target species, their close relatives, and the taxonomic synonyms associated with both groups. We sampled 11 individuals from the Wurfbainia and Lanxangia genera (Table 2), as well as numerous individuals represented by ITS, complete plastome, matK, rbcL, psbA-trnH, and ycf1 sequences from both Wurfbainia and Lanxangia and their taxonomic synonyms available on NCBI (Table S4). To download the second-generation sequencing data for these groups from the NCBI database, we used the prefetch tool in SRA Toolkit v.3.1.0 (https://github.com/ncbi/sra-tools, accessed on 4 March 2024). The cut-off date for downloading data from NCBI was 11 April 2024. Details of the sequenced samples are listed in Table 2, and all the information was uploaded to the NCBI GenBank database. Alpinia nigra (Gaertn.) Burtt (MF076960) and Alpinia galanga (L.) Willd. (AF478715) were chosen as outgroups for constructing the matrices of ITS, ITS1, and ITS2 sequences. For the complete plastome, matK, rbcL, psbA-trnH, and ycf1 matrices, A. nigra (MK940826) and A. galanga (MK940825) were selected as outgroups. The selection of outgroups was based on Gong et al. [53]. We downloaded 232, 31, 138, 224, 53, and 31 sequences of ITS/ITS1/ITS2, complete plastome, matK, rbcL, psbA-trnH, and ycf1, respectively, from NCBI (Table S4). Numerous sequences of W. villosa (synonyms: A. villosum, C. villosum, and Z. villosum) were recovered for the ITS, matK, and rbcL datasets. Initially, we constructed phylogenetic trees using all available data and subsequently selected three individuals from the W. villosa clade within the tree based on genetic distance for further analysis.

4.2. DNA Extraction, Sequencing, Assembly, and Annotation

We extracted total DNA from 0.2 g of gel-dried leaves and herbarium samples using the modified 4 × CTAB method [71]. The quality of DNA was assessed using 1% agarose gel electrophoresis and a NanoDrop® ND-1000 spectrophotometer. We constructed a DNA library (300–500 bp) using the NEBNext Ultra II DNA library prep kit for Illumina and performed paired-end sequencing (2 × 150 bp; sequencing strategy PE150) on the DNBSEQ-T7 high-throughput platform, generating no less than 3 Gb of data. To convert SRA files downloaded from NCBI into FASTQ format, we used fasterq-dump from SRA Toolkit v.3.1.0 (https://github.com/ncbi/sra-tools, accessed on 4 March 2024). Then, we compressed the ‘fastq’ files into the ‘fastq.gz’ format suitable for GetOrganelle assembly using the open-source tool pigz v.2.2.5 (https://zlib.net/pigz/, accessed on 4 March 2024).
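The download, conversion, and compression steps described above could be chained roughly as in the sketch below. It only builds the command lines and does not run them. The accession, output directory, and thread count are hypothetical placeholders, and the exact options used in this study are not stated in the text, so the flags shown (prefetch --output-directory, fasterq-dump --split-files/--outdir/--threads, pigz -p) are assumptions about one reasonable invocation.

```python
from pathlib import Path


def sra_to_fastq_gz_commands(accession, outdir="reads", threads=4):
    """Build (not run) commands: prefetch -> fasterq-dump -> pigz.

    All argument values are illustrative; the study does not report
    which options were used with these tools.
    """
    out = Path(outdir)
    # fasterq-dump names paired-end output <accession>_1.fastq / _2.fastq
    # when --split-files is used.
    fastq_1 = out / f"{accession}_1.fastq"
    fastq_2 = out / f"{accession}_2.fastq"
    return [
        # 1. Download the SRA archive into <outdir>/<accession>/.
        ["prefetch", accession, "--output-directory", str(out)],
        # 2. Convert the archive to FASTQ, one file per read mate.
        ["fasterq-dump", str(out / accession), "--split-files",
         "--outdir", str(out), "--threads", str(threads)],
        # 3. Compress to .fastq.gz in parallel for GetOrganelle input.
        ["pigz", "-p", str(threads), str(fastq_1), str(fastq_2)],
    ]


# "SRR0000000" is a placeholder accession, not one used in this study.
commands = sra_to_fastq_gz_commands("SRR0000000")
```

Each command list can be passed to subprocess.run(cmd, check=True) on a system where SRA Toolkit and pigz are installed.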
The ITS sequence, spanning approximately 600–700 bp, was first assembled utilizing GetOrganelle v.1.7.5.3 [72]. Following assembly, the resultant FASTG file and the reference from A. sericeum Roxb. (KY438097.1) were aligned using the Map function in Geneious v.9.0.2 [73] to prepare the sequence for annotation. Subsequently, annotation was performed through Geneious v.9.0.2 [73] with the reference to acquire the ITS sequence. ITS1/ITS2 sequences were then extracted based on annotation information using Geneious v.9.0.2 [73].
Plastome assembly and annotation followed the protocol described by Li et al. [74]. The clean data obtained from high-throughput sequencing were directly assembled using GetOrganelle v.1.7.5.3 [72], and the complete circular plastid genome was automatically generated. In cases where the circular structure could not be obtained, results were visually inspected using Bandage v.0.8.1 [75]. Subsequently, reliable plastid genome contigs or scaffolds were identified by manually removing non-target contigs from the ‘fastg’ file. The selected sequences were manually edited and spliced to obtain a complete plastid genome. Annotation of the plastid genome was performed using Geneious v.9.0.2 [73], with the published genome of A. krervanh (NC_036935.1) as the reference, and then corrected against open reading frames (ORFs). The matK, rbcL, psbA-trnH, and ycf1 sequences were extracted using Geneious v.9.0.2 [73] based on annotation information.
The ITS, ITS1, ITS2, complete plastome, matK, rbcL, psbA-trnH, and ycf1 matrices were constructed by aligning the sequences using the MAFFT multiple alignment plugin in Geneious v.9.0.2 [73]. All annotated sequences were uploaded to GenBank, and accession numbers were assigned (Table 2).

4.3. Data Analysis

4.3.1. Plastome Structural Variation, Divergence, and Mutational Hotspot Analyses

In this study, we examined 41 plastomes from six medicinal “Doukou” species and their taxonomic synonyms, focusing on genome size, GC content, and gene content (protein-coding genes, tRNAs, and rRNAs). We used Geneious v.9.0.2 [73] for comparative analyses of the expansion and contraction of the inverted repeats (IRs) at the four junctions of these plastomes, with visualization by IRscope [76]. Furthermore, we performed a sliding window analysis in DnaSP v.5 [54], with a step size of 200 bp and a window length of 600 bp, and identified the three most variable regions. To complement these findings, we constructed a physical circular map of the plastome using OGDRAW v.1.3.1 [77].
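The sliding window scan can be sketched as follows, using the window length (600 bp) and step size (200 bp) given above. This is a simplified illustration rather than a reimplementation of DnaSP: π is computed here as the mean proportion of differing sites over all sequence pairs, and sites where either sequence has a gap or ambiguous base are skipped pair by pair, which is an assumption that may not match DnaSP's handling. Function names and the toy data are hypothetical.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Mean proportion of differing sites over all pairs of sequences."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("at least two sequences are required")
    total = 0.0
    for a, b in pairs:
        compared = diffs = 0
        for x, y in zip(a.upper(), b.upper()):
            if x in "ACGT" and y in "ACGT":  # skip gaps/ambiguities
                compared += 1
                diffs += x != y
        total += diffs / compared if compared else 0.0
    return total / len(pairs)


def sliding_window_pi(alignment, window=600, step=200):
    """Return (window_start, pi) for each window along the alignment."""
    length = len(alignment[0])
    results = []
    for start in range(0, max(length - window, 0) + 1, step):
        chunk = [seq[start:start + window] for seq in alignment]
        results.append((start, nucleotide_diversity(chunk)))
    return results


# Toy alignment with a small window so the result is easy to check by hand.
toy = ["AAAAAAAA", "AAAAAAAT", "AAAATAAT"]
scan = sliding_window_pi(toy, window=4, step=2)
most_variable_start = max(scan, key=lambda item: item[1])[0]
```

Sorting the windows by π and taking the highest values gives the candidate hypervariable regions, which is how a region such as ycf1 would stand out in a plot like Figure 3.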

4.3.2. Sequence-Based Analyses

Distance-based analyses used matrices generated from a subset of individuals, comprising the target species and their close relatives, selected from all individuals of the genera Wurfbainia and Lanxangia (and their taxonomic synonyms) that were used for tree construction, following Boer et al. [25]. Two primary species delimitation approaches were employed: barcoding gaps (BG) [78] and automatic barcode gap discovery (ABGD) [79]. To test for barcoding gaps in each dataset (ITS, ITS1, ITS2, complete plastome, matK, rbcL, psbA-trnH, and ycf1), we calculated pairwise distances in MEGA-11 [80] under the Kimura 2-parameter (K2P) model. A scatter plot of the minimum interspecific distance against the maximum intraspecific distance for the six species and their taxonomic synonyms was used to identify barcoding gaps. A species was considered accurately identified when its minimum interspecific distance was larger than its maximum intraspecific distance [81]. The ABGD analysis was run on the online platform (https://bioinfo.mnhn.fr/abi/public/abgd/, accessed on 4 March 2024) with three distance models: Jukes–Cantor (JC69), Kimura (K80) TS/TV 2.0, and simple distance. The parameters were Pmin = 0.001, Pmax = 0.1, Steps = 10, X = 1.5, and Nb bins = 20. Among the partitions obtained, the best partition was taken to be the one that most closely matched the delimitation of nominal species.
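The barcoding gap criterion can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the MEGA-11 procedure: the function names and the toy sequences are hypothetical, sites with gaps or ambiguous bases are ignored pair by pair, and the K2P distance uses the standard formula d = -0.5 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of transitions and transversions.

```python
import math
from itertools import combinations

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
VALID_BASES = PURINES | PYRIMIDINES


def k2p_distance(a, b):
    """Kimura 2-parameter distance between two aligned sequences."""
    n = transitions = transversions = 0
    for x, y in zip(a.upper(), b.upper()):
        if x not in VALID_BASES or y not in VALID_BASES:
            continue  # skip gaps and ambiguous bases (sketch assumption)
        n += 1
        if x == y:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if n == 0:
        raise ValueError("no comparable sites")
    p, q = transitions / n, transversions / n
    # Raises a math domain error for saturated pairs, where K2P is undefined.
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


def barcoding_gap(seqs_by_species):
    """Map species -> (max intraspecific, min interspecific, gap present)."""
    summary = {}
    for species, seqs in seqs_by_species.items():
        intra = [k2p_distance(a, b) for a, b in combinations(seqs, 2)]
        inter = [
            k2p_distance(a, b)
            for a in seqs
            for other, other_seqs in seqs_by_species.items()
            if other != species
            for b in other_seqs
        ]
        max_intra = max(intra, default=0.0)
        min_inter = min(inter, default=math.inf)
        summary[species] = (max_intra, min_inter, min_inter > max_intra)
    return summary


if __name__ == "__main__":
    # Hypothetical toy alignment; species labels are placeholders.
    toy = {
        "species_X": ["ACGTACGTACGTACGTACGT", "ACGTACGTACGTACGTACGC"],
        "species_Y": ["ACGTACGTACGTACGTAAAA", "ACGTACGTACGTACGTAAAA"],
    }
    for species, (intra, inter, gap) in barcoding_gap(toy).items():
        print(species, round(intra, 4), round(inter, 4), gap)
```

A species with a single sampled individual has no intraspecific comparison, so the sketch assigns it a maximum intraspecific distance of zero; such cases need to be interpreted with care.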

4.3.3. Phylogenetic Tree-Based Analyses

We constructed phylogenetic trees using maximum likelihood (ML) and Bayesian inference (BI) methods from eight datasets: (1) ITS, (2) ITS1, (3) ITS2, (4) complete plastome, (5) matK, (6) rbcL, (7) psbA-trnH, and (8) ycf1 sequences. The sequence matrices of each dataset were aligned using MAFFT as implemented in Geneious v.9.0.2 [73]. The ML trees were constructed in RAxML v.8.2.11 [82] under the GTRGAMMAI model with 1000 rapid bootstrap replicates. BI analyses were run in MrBayes v.3.2.7 [83] for 1,000,000 generations under the best-fit model selected by jModelTest v.2.1.7 [84] according to the Akaike information criterion (AIC). Phylogenetic trees were then visualized with tvBOT v.3.0 [85]. Identification was considered successful when all individuals of the same species and its synonyms clustered into a single clade.
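The tree-based success criterion amounts to a monophyly test, which can be sketched as follows. This is an illustrative assumption about how the check could be automated, not the authors' workflow: the tree is a rooted topology written as nested tuples, the function names and sample labels are hypothetical, and the `species_of` mapping is assumed to assign synonyms to their accepted species name beforehand.

```python
def leaf_sets(tree):
    """Return (all leaves under this node, leaf sets of every clade below it)."""
    if isinstance(tree, str):
        single = frozenset([tree])
        return single, [single]
    all_leaves = frozenset()
    clades = []
    for child in tree:
        leaves, sub_clades = leaf_sets(child)
        all_leaves |= leaves
        clades.extend(sub_clades)
    clades.append(all_leaves)
    return all_leaves, clades


def monophyletic_species(tree, species_of):
    """Map species -> True if its samples form exactly one clade of the tree."""
    leaves, clades = leaf_sets(tree)
    clade_set = set(clades)
    members = {}
    for leaf in leaves:
        members.setdefault(species_of[leaf], set()).add(leaf)
    return {sp: frozenset(group) in clade_set for sp, group in members.items()}


if __name__ == "__main__":
    # Hypothetical rooted topology: samples of A and C each form a clade,
    # while the two samples of B are split across the tree.
    toy_tree = ((("a1", "a2"), "b1"), ("b2", ("c1", "c2")))
    toy_species = {"a1": "A", "a2": "A", "b1": "B", "b2": "B",
                   "c1": "C", "c2": "C"}
    print(monophyletic_species(toy_tree, toy_species))
```

A species represented by a single sample is trivially reported as monophyletic, and the result depends on how the tree is rooted, so both points would need attention in a real analysis.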

5. Conclusions

In this study, we examined structural variation in plastomes and assessed the effectiveness of both standard and super DNA barcodes for species identification, focusing on intraspecific and interspecific variability within six medicinal “Doukou” species. Molecular identification of these Amomum species was carried out using a wide range of genetic markers, including ITS, ITS1, ITS2, complete plastome, matK, rbcL, psbA-trnH, and ycf1 sequences. Among these markers, ITS, ITS1, and the complete plastome were effective in identifying L. tsaoko, W. compacta, and W. vera. The ycf1 barcode identified L. tsaoko and W. compacta, whereas ITS2, matK, and psbA-trnH identified only L. tsaoko, and rbcL failed to distinguish any of the species. Overall, ITS, ITS1, and the complete plastome performed best, followed by ycf1, and then ITS2, matK, and psbA-trnH, while rbcL performed worst, with too few variable sites to discriminate any species. Considering factors such as cost-efficiency, ITS1 emerges as the most recommended marker for molecular identification within the genus Amomum. The methodologies used here for the molecular identification of the six medicinal “Doukou” species form a basis for the conservation of wild plant resources, the rational utilization of medicinal plants, and the prevention of resource misappropriation. This study provides essential molecular tools for the precise identification of species, thereby enhancing our understanding of the botanical and pharmacological aspects of “Doukou” medicinal plants.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms25169005/s1.

Author Contributions

J.-B.Y. conceived the project and designed the research; Y.Z. carried out data analysis and wrote the manuscript with input from all co-authors; A.K. corrected draft syntax; Z.-P.L. and L.X. handled all the figures and reviewed the article. All authors have read and agreed to the published version of the manuscript.

Funding

The study was supported by the Obtaining Super Barcodes of Important Wild Plants in Gaoligong Mountain (Grant No. 2021FY100204) to JBY.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets presented in this study can be accessed at NCBI GenBank; the list of accessions can be found in Table 2 and Table S4.

Acknowledgments

The authors are grateful to the iFlora High Performance Computing Center of Germplasm Bank of Wild Species for providing a stable and fast computing environment and the Germplasm Bank of Wild Species for facilitating the laboratory work. Thanks to the NCBI database for providing us with a large amount of data for analysis. We thank Wen-Bin Yu (Xishuangbanna Tropical Botanical Garden, CAS) for kindly providing the samples. We also thank Jing Yang, Zheng-Shan He, Chun-Yan Lin, Ji-Xiong Yang, Wen-Bin Yuan, and other supporting staff from the Molecular Biology Experiment Center of GBOWS. We would like to express our gratitude to Associate Xiang-Qin Yu and Nan Shu from the CAS Key Laboratory for Plant Diversity and Biogeography of East Asia for their invaluable assistance in addressing the reviewer’s comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bickford, D.; Lohman, D.J.; Sodhi, N.S.; Ng, P.K.L.; Meier, R.; Winker, K.; Ingram, K.K.; Das, I. Cryptic species as a window on diversity and conservation. Trends Ecol. Evol. 2006, 22, 148–155. [Google Scholar] [CrossRef]
  2. Gotelli, N.J.; Colwell, R.K. Quantifying biodiversity: Procedures and pitfalls in the measurement and comparison of species richness. Ecol. Lett. 2001, 4, 379–391. [Google Scholar] [CrossRef]
  3. Soulé, M.E.; Wilcox, B.A. Conservation Biology. An Evolutionary-Ecological Perspective; Addison-Wesle: London, UK, 1980; p. 395. [Google Scholar]
  4. Smith, K.F.; Behrens, M.; Schloegel, L.M.; Marano, N.; Burgiel, S.; Daszak, P. Reducing the risks of the wildlife trade. Science 2009, 324, 594–595. [Google Scholar] [CrossRef]
  5. Hebert, P.D.; Cywinska, A.; Ball, S.L.; DeWaard, J.R. Biological identifications through DNA barcodes. Proc. R. Soc. Lond. B Biol. Sci. 2003, 270, 313–321. [Google Scholar] [CrossRef]
  6. Linacre, A.; Gusmão, L.; Hecht, W.; Hellmann, A.P.; Mayr, W.R.; Parson, W.; Prinz, M.; Schneider, P.M.; Morling, N. ISFG: Recommendations regarding the use of non-human (animal) DNA in forensic genetic investigations. Forensic Sci. Int. Genet. 2011, 5, 501–505. [Google Scholar] [CrossRef] [PubMed]
  7. Kress, W.J.; Erickson, D.L. DNA barcodes: Genes, genomics, and bioinformatics. Proc. Natl. Acad. Sci. USA 2008, 105, 2761–2762. [Google Scholar] [CrossRef]
  8. Hebert, P.D.; Gregory, T.R. The promise of DNA barcoding for taxonomy. Syst. Biol. 2005, 54, 852–859. [Google Scholar] [CrossRef] [PubMed]
  9. Kress, W.J.; Erickson, D.L. A two-locus global DNA barcode for land plants: The coding rbcL gene complements the non-coding trnH-psbA spacer region. PLoS ONE 2007, 2, e508. [Google Scholar] [CrossRef]
  10. Ford, C.S.; Ayres, K.L.; Toomey, N.; Haider, N.; Van Alphen Stahl, J.; Kelly, L.J.; Wikström, N.; Hollingsworth, P.M.; Duff, R.J.; Hoot, S.B.; et al. Selection of candidate coding DNA barcoding regions for use on land plants. Bot. J. Linn. Soc. 2009, 159, 1–11. [Google Scholar] [CrossRef]
  11. CBOL Plant Working Group; Hollingsworth, P.M.; Forrest, L.L.; Spouge, J.L.; Hajibabaei, M.; Ratnasingham, S.; van der Bank, M.; Chase, M.W.; Cowan, R.S.; Erickson, D.L.; et al. A DNA barcode for land plants. Proc. Natl. Acad. Sci. USA 2009, 106, 12794–12797. [Google Scholar]
  12. Hollingsworth, P.M.; Graham, S.W.; Little, D.P. Choosing and using a plant DNA barcode. PLoS ONE 2011, 6, e19254. [Google Scholar] [CrossRef] [PubMed]
  13. Hebert, P.D.; Stoeckle, M.Y.; Zemlak, T.S.; Francis, C.M. Identification of birds through DNA barcodes. PLoS Biol. 2004, 2, e312. [Google Scholar] [CrossRef] [PubMed]
  14. Ward, R.D.; Holmes, B.H.; O’Hara, T.D. DNA barcoding discriminates echinoderm species. Mol. Ecol. Resour. 2008, 8, 1202–1211. [Google Scholar] [CrossRef] [PubMed]
  15. Yoo, H.S.; Eah, J.; Kim, J.S.; Kim, Y.; Min, M.; Paek, W.K.; Lee, H.; Kim, C. DNA barcoding Korean birds. Mol. Cells 2006, 22, 323–327. [Google Scholar] [CrossRef] [PubMed]
  16. Kress, W.J.; Wurdack, K.J.; Zimmer, E.A.; Weigt, L.A.; Janzen, D.H. Use of DNA barcodes to identify flowering plants. Proc. Natl. Acad. Sci. USA 2005, 102, 8369–8374. [Google Scholar] [CrossRef] [PubMed]
  17. Kress, W.J.; Erickson, D.L.; Jones, F.A.; Swenson, N.G.; Perez, R.; Sanjur, O.; Bermingham, E. Plant DNA barcodes and a community phylogeny of a tropical forest dynamics plot in Panama. Proc. Natl. Acad. Sci. USA 2009, 106, 18621–18626. [Google Scholar] [CrossRef] [PubMed]
  18. Hollingsworth, P.M.; Li, D.Z.; van der Bank, M.; Twyford, A. Telling plant species apart with DNA: From barcodes to genomes. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2016, 371, 20150338. [Google Scholar] [CrossRef] [PubMed]
  19. Kane, N.C.; Cronk, Q. Botany without borders: Barcoding in focus. Mol. Ecol. 2008, 17, 5175–5176. [Google Scholar] [CrossRef] [PubMed]
  20. Mishra, P.; Kumar, A.; Nagireddy, A.; Mani, D.N.; Shukla, A.K.; Tiwari, R.; Sundaresan, V. DNA barcoding: An efficient tool to overcome authentication challenges in the herbal market. Plant Biotechnol. J. 2016, 14, 8–21. [Google Scholar] [CrossRef] [PubMed]
  21. Plants of the World Online Kew Science. Available online: https://powo.science.kew.org/taxon/urn:lsid:ipni.org:names:327296-2 (accessed on 22 February 2024).
  22. Xu, H.Z. Amomum Roxb. In Flora of China; Xu, H.Z., Ed.; Science Press: Beijing, China; Missouri Botanical Garden Press: St. Louis, MO, USA, 1981; Volume 16, pp. 110–135. [Google Scholar]
  23. Yao, J.Y. Amomum Roxb. In Flora of China; Yao, J.Y., Ed.; Science Press: Beijing, China; Missouri Botanical Garden Press: St. Louis, MO, USA, 2000; Volume 24, pp. 347–356. [Google Scholar]
  24. China Pharmacopoeia Commission. Pharmacopoeia of the People’s Republic of China: Part One; China Medical Science and Technology Press: Beijing, China, 2020; pp. 175–264. [Google Scholar]
  25. Boer, H.D.; Newman, M.; Poulsen, A.D.; Droop, A.J.; Fér, T.; Thu Hiền, L.T.; Hlavatá, K.; Lamxay, V.; Richardson, J.E.; Steffen, K.; et al. Convergent morphology in Alpinieae (Zingiberaceae): Recircumscribing Amomum as a monophyletic genus. Taxon 2018, 67, 6–36. [Google Scholar] [CrossRef]
  26. Alkandahri, M.Y.; Shafirany, M.Z.; Rusdin, A.; Agustina, L.S.; Pangaribuan, F.; Fitrianti, F.; Farhamzah; Kusumawati, A.H.; Sugiharta, S.; Arfania, M.; et al. Amomum compactum: A review of pharmacological studies. Plant Cell Biotechnol. Mol. Biol. 2021, 22, 61–69. [Google Scholar]
  27. Diao, W.R.; Zhang, L.L.; Feng, S.S.; Xu, J.G. Chemical composition, antibacterial activity, and mechanism of action of the essential oil from Amomum kravanh. J. Food Prot. 2014, 77, 1740–1746. [Google Scholar] [CrossRef] [PubMed]
  28. Thinh, B.B.; Chac, L.D.; Hanh, D.H.; Korneeva, A.A.; Hung, N.; Igoli, J.O. Effect of extraction method on yield, chemical composition and antimicrobial activity of essential oil from the fruits of Amomum villosum var. xanthioides. J. Essent. Oil-Bear. Plants 2022, 25, 28–37. [Google Scholar] [CrossRef]
  29. Chau, L.T.M.; Thang, T.D.; Huong, L.T.; Ogunwande, I.A. Constituents of Essential Oils from Amomum longiligulare from Vietnam. Chem. Nat. Compd. 2015, 51, 1181–1183. [Google Scholar] [CrossRef]
  30. Cho, J.H.; Lee, J.S.; Kim, H.G.; Lee, H.W.; Fang, Z.; Kwon, H.H.; Kim, D.W.; Lee, C.M.; Jeong, J.W. Ethyl acetate fraction of Amomum villosum var. xanthioides attenuates hepatic endoplasmic reticulum stress-induced non-alcoholic steatohepatitis via improvement of antioxidant capacities. Antioxidants 2021, 10, 998. [Google Scholar] [CrossRef] [PubMed]
  31. Moon, S.S.; Lee, J.Y.; Cho, S.C. Isotsaokoin, an antifungal agent from Amomum tsao-ko. J. Nat. Prod. 2004, 67, 889–891. [Google Scholar] [CrossRef] [PubMed]
  32. Martin, T.S.; Kikuzaki, H.; Hisamoto, M.; Nakatani, N. Constituents of Amomum tsao-ko and their radical scavenging and antioxidant activities. J. Am. Oil Chem. Soc. 2000, 77, 667–673. [Google Scholar] [CrossRef]
  33. Hu, Y.; Gao, X.; Zhao, Y.; Liu, S.; Luo, K.; Fu, X.; Li, J.; Sheng, J.; Tian, Y.; Fan, Y. Flavonoids in Amomum tsaoko crevost et lemarie ameliorate loperamide-induced constipation in mice by regulating gut microbiota and related metabolites. Int. J. Mol. Sci. 2023, 24, 7191. [Google Scholar] [CrossRef] [PubMed]
  34. Yue, J.; Zhang, S.; Zheng, B.; Raza, F.; Luo, Z.; Li, X.; Zhang, Y.; Nie, Q.; Qiu, M. Efficacy and mechanism of active fractions in fruit of Amomum villosum Lour. for gastric cancer. J. Cancer 2021, 12, 5991–5998. [Google Scholar] [CrossRef] [PubMed]
  35. Zhang, D.; Li, S.; Xiong, Q.; Jiang, C.; Lai, X. Extraction, characterization and biological activities of polysaccharides from Amomum villosum. Carbohydr. Polym. 2013, 95, 114–122. [Google Scholar] [CrossRef] [PubMed]
  36. Chen, Z.; Ni, W.; Yang, C.; Zhang, T.; Lu, S.; Zhao, R.; Mao, X.; Yu, J. Therapeutic effect of Amomum villosum on inflammatory bowel disease in rats. Front. Pharmacol. 2018, 9, 639. [Google Scholar] [CrossRef] [PubMed]
  37. Li, Q.X.; Gao, G.; Ye, K.X.; Liu, J.Y.; Huang, B.Z.; Wang, A.K.; Wang, X.; Yang, G.R. Evaluation on feeding value of stems and leaves in fructus amomi. Anim. Husb. Vet. Med. 2016, 48, 61–63. [Google Scholar]
  38. Baldwin, B.G.; Sanderson, M.J.; Porter, J.M.; Wojciechowski, M.F.; Campbell, C.S.; Donoghue, M. The ITS region of nuclear ribosomal DNA: A valuable source of evidence on angiosperm phylogeny. Ann. Mo. Bot. Gard. 1995, 82, 247–277. [Google Scholar] [CrossRef]
  39. Selvaraj, D.; Shanmughanandhan, D.; Sarma, R.K.; Joseph, J.C.; Srinivasan, R.V.; Ramalingam, S. DNA barcode ITS effectively distinguishes the medicinal plant Boerhavia diffusa from its adulterants. Genom. Proteom. Bioinform. 2012, 10, 364–367. [Google Scholar] [CrossRef]
  40. Chen, S.; Yao, H.; Han, J.; Liu, C.; Song, J.; Shi, L.; Zhu, Y.; Ma, X.; Gao, T.; Pang, X. Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species. PLoS ONE 2010, 5, e8613. [Google Scholar] [CrossRef] [PubMed]
  41. Youngbae, S.; Kim, S.; Park, C.W. A phylogenetic study of Polygonum sect. Tovara (Polygonaceae) based on ITS sequences of nuclear ribosomal DNA. J. Plant Biol. 1997, 40, 47–52. [Google Scholar] [CrossRef]
  42. Yao, H.; Song, J.Y.; Ma, X.Y.; Liu, C.; Li, Y.; Xu, H.X.; Han, J.P.; Duan, L.S.; Chen, S.L. Identification of Dendrobium species by a candidate DNA barcode sequence: The chloroplast psbA-trnH intergenic region. Planta Med. 2009, 75, 667–669. [Google Scholar] [CrossRef] [PubMed]
  43. Nagaraj, S.; Girenahalli, R.; Tavareakere Venkataravanappa, J.; Kumar, P.; Subbarayappa, K.; Nanjappa, L. DNA based identification of species of Zingiberaceae family plants using Bar-Hrm analysis. Int. J. Res. Anal. Rev. 2019, 6, 289–294. [Google Scholar]
  44. Cui, Y.; Chen, X.; Nie, L.; Sun, W.; Hu, H.; Lin, Y.; Li, H.; Zheng, X.; Song, J.; Yao, H. Comparison and phylogenetic analysis of chloroplast genomes of three medicinal and edible Amomum species. Int. J. Mol. Sci. 2019, 20, 4040. [Google Scholar] [CrossRef]
  45. Hu, Y.F.; Zhang, X.M.; Shi, N.X.; Yang, Z.Q. DNA barcoding sequence analysis of Amomum tsao-ko germplasm resources in Yunnan province. Chin. Tradit. Herb. Drugs 2019, 50, 6091–6097. [Google Scholar]
  46. Amenu, S.G.; Wei, N.; Wu, L.; Oyebanji, O.; Hu, G.W.; Zhou, Y.D.; Wang, Q. Phylogenomic and comparative analyses of Coffeeae alliance (Rubiaceae): Deep insights into phylogenetic relationships and plastome evolution. BMC Plant Biol. 2022, 22, 88. [Google Scholar] [CrossRef] [PubMed]
  47. Zhang, L.; Huang, Y.W.; Huang, J.L.; Ya, J.D.; Zhe, M.Q.; Zeng, C.X.; Zhang, Z.R.; Zhang, S.B.; Li, D.Z.; Li, H.T. DNA barcoding of Cymbidium by genome skimming: Call for next-generation nuclear barcodes. Mol. Ecol. Resour. 2023, 23, 424–439. [Google Scholar] [CrossRef]
  48. Segersäll, M. DNA Barcoding of Commercialized Plants; An Examination of Amomum (Zingiberaceae) in South-East Asia. Master’s Thesis, Uppsala University, Uppsala, Sweden, 2011. [Google Scholar]
  49. Leung, F.C.C.; Huang, Q.; Duan, Z.; Yang, J.; Ma, X.; Zhan, R.; Xu, H.; Chen, W. SNP typing for germplasm identification of Amomum villosum Lour. based on DNA barcoding markers. PLoS ONE 2014, 9, e114940. [Google Scholar]
  50. Gong, L.; Zhang, D.; Ding, X.; Huang, J.; Guan, W.; Qiu, X.; Huang, Z. DNA barcode reference library construction and genetic diversity and structure analysis of Amomum villosum Lour. (Zingiberaceae) populations in Guangdong province. PeerJ 2021, 9, e12325. [Google Scholar] [CrossRef] [PubMed]
  51. Shi, L.; Song, J.; Chen, S.; Yao, H.; Han, J. Identification of Amomum (Zingiberaceae) through DNA Barcodes. World Sci. Technol. Mod. Tradit. Chin. Med. 2010, 12, 473–479. [Google Scholar]
  52. Han, J.P.; Li, M.N.; Shi, L.C.; Yao, H.; Song, J.Y. ITS2 sequence identification of cardamom and its adulterants. Glob. Tradit. Chin. Med. 2011, 4, 99–102. [Google Scholar]
  53. Gong, L.; Ding, X.; Guan, W.; Zhang, D.; Zhang, J.; Bai, J.; Xu, W.; Huang, J.; Qiu, X.; Zheng, X.; et al. Comparative chloroplast genome analyses of Amomum: Insights into evolutionary history and species identification. BMC Plant Biol. 2022, 22, 520. [Google Scholar] [CrossRef]
  54. Librado, P.; Rozas, J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 2009, 25, 1451–1452. [Google Scholar] [CrossRef] [PubMed]
  55. Li, D.M.; Li, J.; Wang, D.R.; Xu, Y.C.; Zhu, G.F. Molecular evolution of chloroplast genomes in subfamily Zingiberoideae (Zingiberaceae). BMC Plant Biol. 2021, 21, 558. [Google Scholar] [CrossRef] [PubMed]
  56. Jiang, D.Z.; Cai, X.D.; Gong, M.; Xia, M.Q.; Xing, H.T.; Dong, S.S.; Tian, S.M.; Li, J.L.; Lin, J.Y.; Liu, Y.Q. Complete chloroplast genomes provide insights into evolution and phylogeny of Zingiber (Zingiberaceae). BMC Genom. 2023, 24, 30. [Google Scholar]
  57. Liang, H.; Zhang, Y.; Deng, J.B.; Gao, G.; Ding, C.B.; Zhang, L.; Yang, R.W. The complete chloroplast genome sequences of 14 Curcuma species: Insights into genome evolution and phylogenetic relationships within Zingiberales. Front. Genet. 2020, 11, 802. [Google Scholar] [CrossRef] [PubMed]
  58. Yang, Q.; Fu, G.F.; Wu, Z.Q.; Li, L.; Zhao, J.L.; Li, Q.J. Chloroplast genome evolution in four montane Zingiberaceae taxa in China. Front. Plant Sci. 2022, 12, 774482. [Google Scholar] [CrossRef]
  59. Li, D.M.; Zhu, G.F.; Xu, Y.C.; Ye, Y.J.; Liu, J.M. Complete chloroplast genomes of three medicinal Alpinia species: Genome organization, comparative analyses and phylogenetic relationships in family Zingiberaceae. Plants 2020, 9, 286. [Google Scholar] [CrossRef] [PubMed]
  60. Liu, J.; Milne, R.I.; Möller, M.; Zhu, G.F.; Ye, L.J.; Luo, Y.H.; Yang, J.B.; Wambulwa, M.C.; Wang, C.N.; Li, D.Z. Integrating a comprehensive DNA barcode reference library with a global map of yews (Taxus L.) for forensic identification. Mol. Ecol. Resour. 2018, 18, 1115–1131. [Google Scholar] [CrossRef]
  61. Zhai, E.A.; Mi, W.J.; Cui, Y.; Hong, W.F.; Wang, Y.S.; Guo, X.Y.; Zou, H.Q.; Yan, Y.H. Comparative study of morphological identification and DNA barcoding for the authentication of medicinal Fructus Amomi. China J. Chin. Mater. Med. 2022, 47, 4600–4608. [Google Scholar]
  62. Sone, M.; Zhu, S.; Cheng, X.; Ketphanh, S.; Swe, S.; Tun, T.L.; Kawano, N.; Kawahara, N.; Komatsu, K. Genetic diversity of Amomum xanthioides and its related species from Southeast Asia and China. J. Nat. Med. 2021, 75, 798–812. [Google Scholar] [CrossRef]
  63. Kuroiwa, T.; Kawano, S.; Nishibayashi, S.; Sato, C. Epifluorescent microscopic evidence for maternal inheritance of chloroplast DNA. Nature 1982, 298, 481–483. [Google Scholar] [CrossRef]
  64. Smith, S.E. Plant Breeding Reviews; Timber Press: Portland, OR, USA, 1989; Volume 6. [Google Scholar]
  65. Wiemers, M.; Fiedler, K. Does the DNA barcoding gap exist?—A case study in blue butterflies (Lepidoptera: Lycaenidae). Front. Zool. 2007, 4, 8. [Google Scholar] [CrossRef]
  66. Wheeler, D.L.; Barrett, T.; Benson, D.A.; Bryant, S.H.; Canese, K.; Chetvernin, V.; Church, D.M.; DiCuccio, M.; Edgar, R.; Federhen, S. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2007, 35, D5–D12. [Google Scholar] [CrossRef]
  67. Schoch, C.L.; Seifert, K.A.; Huhndorf, S.; Robert, V.; Spouge, J.L.; Levesque, C.A.; Chen, W.; Fungal Barcoding Consortium. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proc. Natl. Acad. Sci. USA 2012, 109, 6241–6246. [Google Scholar] [CrossRef]
  68. China Plant BOL Group 1; Li, D.Z.; Gao, L.M.; Li, H.T.; Wang, H.; Ge, X.J.; Liu, J.Q.; Chen, Z.D.; Zhou, S.L.; Chen, S.L. Comparative analysis of a large dataset indicates that internal transcribed spacer (ITS) should be incorporated into the core barcode for seed plants. Proc. Natl. Acad. Sci. USA 2011, 108, 19641–19646. [Google Scholar]
  69. Wang, X.C.; Liu, C.; Huang, L.; Bengtsson-Palme, J.; Chen, H.; Zhang, J.H.; Cai, D.; Li, J.Q. ITS1: A DNA barcode better than ITS2 in eukaryotes? Mol. Ecol. Resour. 2015, 15, 573–586. [Google Scholar] [CrossRef] [PubMed]
  70. Blaalid, R.; Kumar, S.; Nilsson, R.H.; Abarenkov, K.; Kirk, P.; Kauserud, H. ITS 1 versus ITS 2 as DNA metabarcodes for fungi. Mol. Ecol. Resour. 2013, 13, 218–224. [Google Scholar] [CrossRef] [PubMed]
  71. Doyle, J.J.; Doyle, J.L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 1987, 19, 11–15. [Google Scholar]
  72. Jin, J.J.; Yu, W.B.; Yang, J.B.; Song, Y.; DePamphilis, C.W.; Yi, T.S.; Li, D.Z. GetOrganelle: A fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 2020, 21, 241. [Google Scholar] [CrossRef] [PubMed]
  73. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef]
  74. Li, R.Z.; Cai, J.; Yang, J.B.; Zhang, Z.R.; Li, D.Z.; Yu, W.B. Plastid phylogenomics resolving phylogenetic placement and genera phylogeny of Sterculioideae (Malvaceae s. l.). Guihaia 2022, 42, 25–38. [Google Scholar]
  75. Wick, R.R.; Schultz, M.B.; Zobel, J.; Holt, K.E. Bandage: Interactive visualization of de novo genome assemblies. Bioinformatics 2015, 31, 3350–3352. [Google Scholar] [CrossRef] [PubMed]
  76. Amiryousefi, A.; Hyvönen, J.; Poczai, P. IRscope: An online program to visualize the junction sites of chloroplast genomes. Bioinformatics 2018, 34, 3030–3031. [Google Scholar] [CrossRef] [PubMed]
  77. Lohse, M.; Drechsel, O.; Bock, R. OrganellarGenomeDRAW (OGDRAW): A tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr. Genet. 2007, 52, 267–274. [Google Scholar] [CrossRef] [PubMed]
  78. Čandek, K.; Kuntner, M. DNA barcoding gap: Reliable species identification over morphological and geographical scales. Mol. Ecol. Resour. 2015, 15, 268–277. [Google Scholar] [CrossRef] [PubMed]
  79. Puillandre, N.; Lambert, A.; Brouillet, S.; Achaz, G. ABGD, Automatic Barcode Gap Discovery for primary species delimitation. Mol. Ecol. 2012, 21, 1864–1877. [Google Scholar] [CrossRef] [PubMed]
  80. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547. [Google Scholar] [CrossRef] [PubMed]
  81. Collins, R.; Cruickshank, R. The seven deadly sins of DNA barcoding. Mol. Ecol. Resour. 2013, 13, 969–975. [Google Scholar] [CrossRef] [PubMed]
  82. Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef] [PubMed]
  83. Ronquist, F.; Teslenko, M.; van der Mark, P.; Ayres, D.L.; Darling, A.; Hohna, S.; Larget, B.; Liu, L.; Suchard, M.A.; Huelsenbeck, J.P. MrBayes 3.2: Efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 2012, 61, 539–542. [Google Scholar] [CrossRef] [PubMed]
  84. Posada, D. jModelTest: Phylogenetic model averaging. Mol. Biol. Evol. 2008, 25, 1253–1256. [Google Scholar] [CrossRef] [PubMed]
  85. Xie, J.; Chen, Y.; Cai, G.; Cai, R.; Hu, Z.; Wang, H. Tree Visualization By One Table (tvBOT): A web application for visualizing, modifying and annotating phylogenetic trees. Nucleic Acids Res. 2023, 51, W587–W592. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Plastome gene map of Wurfbainia compacta YWB91902-2 showing the typical structure organization in “Doukou” plastomes. Genes inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. Genes in different functional groups are color-coded. The small and large single-copy regions (SSC and LSC) and inverted repeat (IRa and IRb) regions are noted in the inner circle.
Figure 2. Comparison of the borders of the LSC, SSC, and IR regions among six plastomes of Amomum. Abbreviations: JLB—Junction of large single copy and inverted repeat B; JSB—Junction of small single copy and inverted repeat B; JSA—Junction of small single copy and inverted repeat A; JLA—Junction of large single copy and inverted repeat A.
Figure 3. The variable sites in the homologous regions of 41 plastomes of “Doukou” species and their taxonomic synonyms. The y-axis represents the nucleotide diversity (Pi), and the x-axis indicates the nucleotide midpoints.
Figure 4. Scatter plot of barcoding gap analysis of the eight datasets across the six medicinal “Doukou” species and their taxonomic synonyms. The y-axis represents the genetic divergence, with the plots above the blue line of best fit representing successfully delimited species and those along and below the line representing the overlap. “CP” represents complete plastome.
Figure 5. The phylogenetic tree was reconstructed using the maximum likelihood (ML) method with the ITS dataset of six medicinal “Doukou” species and their taxonomic synonyms. The numbers at nodes indicate bootstrap values.
Figure 6. The phylogenetic tree was reconstructed using the maximum likelihood (ML) method with the complete plastome dataset of six medicinal “Doukou” species and their taxonomic synonyms. The numbers at nodes indicate bootstrap values.
Figure 7. The phylogenetic tree was reconstructed using the maximum likelihood (ML) method with the ITS1 dataset of six medicinal “Doukou” species and their taxonomic synonyms. The numbers at nodes indicate bootstrap values.
Figure 8. The species discrimination success for candidate barcodes of six medicinal “Doukou” plants across different delimitation methods. The success rate is the number of species successfully delimited to species in the different DNA markers. “CP” represents complete plastome.
Table 1. Comparison of characteristics of eight datasets in six medicinal “Doukou” plants.

| Dataset | No. of Samples | Aligned Length (bp) | No. of Variable Sites (% Divergence) | No. of Parsimony Informative Sites (% Divergence) | GC Content (%) | No. of Conserved Sites (% Divergence) | No. of Singleton Sites (% Divergence) |
|---|---|---|---|---|---|---|---|
| ITS | 65 | 609 | 164 (26.9) | 120 (19.7) | 56.2 | 422 (69.3) | 42 (6.9) |
| ITS1 | 65 | 194 | 70 (36.1) | 59 (30.4) | 56.5 | 111 (57.2) | 11 (5.7) |
| ITS2 | 65 | 222 | 83 (37.4) | 58 (26.1) | 60.1 | 130 (58.6) | 23 (10.4) |
| Complete plastomes | 44 | 168,519 | 5299 (3.1) | 3280 (1.9) | 36.1 | 161,202 (95.7) | 1980 (1.2) |
| matK | 82 | 716 | 44 (6.1) | 28 (3.9) | 28.7 | 672 (93.9) | 16 (2.2) |
| rbcL | 61 | 490 | 12 (2.4) | 9 (1.8) | 43.2 | 478 (97.6) | 3 (0.6) |
| psbA-trnH | 66 | 804 | 65 (8.1) | 35 (4.4) | 29.2 | 690 (85.8) | 30 (3.7) |
| ycf1 | 44 | 7090 | 162 (2.3) | 91 (1.3) | 30.9 | 6832 (96.4) | 67 (0.9) |
Table 2. Detailed collection information of the newly sequenced Wurfbainia and Lanxangia species.

| Sample Number | Species | Country | Province | Region | Locality | GenBank Accession No. (ITS/ITS1/ITS2) | GenBank Accession No. (CP/matK/rbcL/psbA-trnH) |
|---|---|---|---|---|---|---|---|
| YWB91902-1 | Wurfbainia compacta | China | Yunnan | Xishuangbanna Dai Autonomous Prefecture | Mengla County | OR801269 | PP826179 |
| YWB91902-2 | Wurfbainia compacta | China | Yunnan | Xishuangbanna Dai Autonomous Prefecture | Mengla County | OR801270 | PP826180 |
| YWB91901-1 | Wurfbainia vera | China | Yunnan | Xishuangbanna Dai Autonomous Prefecture | Mengla County | OR801267 | PP826177 |
| YWB91901-2 | Wurfbainia vera | China | Yunnan | Xishuangbanna Dai Autonomous Prefecture | Mengla County | OR801268 | PP826178 |
| S07964 | Lanxangia paratsaoko | China | Yunnan | Honghe Hani and Yi Autonomous Prefecture | Yuanyang County | OR801266 | PP826176 |
| S00918 | Wurfbainia villosa | China | Guangxi | Fangchenggang City | Shangsi County | OR801265 | PP826175 |
| B190333 | Wurfbainia villosa | China | Yunnan | Kunming City | Xishan District | OR801256 | PP826171 |
| B190623 | Wurfbainia villosa | China | Yunnan | Kunming City | Xishan District | OR801257 | PP826172 |
| B190641 | Wurfbainia villosa | China | Yunnan | Kunming City | Xishan District | OR801258 | PP826173 |
| YWS1-25-1 | Wurfbainia villosa | China | Yunnan | Xishuangbanna Dai Autonomous Prefecture | Mengla County | OR801271 | PP826181 |
| YWS1-25-5 | Wurfbainia villosa | China | Yunnan | Xishuangbanna Dai Autonomous Prefecture | Jinghong City | OR801272 | PP853448 |
Note: “CP” represents complete plastome.

Share and Cite

MDPI and ACS Style

Zhao, Y.; Kipkoech, A.; Li, Z.-P.; Xu, L.; Yang, J.-B. Deciphering the Plastome and Molecular Identities of Six Medicinal “Doukou” Species. Int. J. Mol. Sci. 2024, 25, 9005. https://doi.org/10.3390/ijms25169005
