Next Article in Journal
Ecosystem Size and Functional Group Relative Abundance Drive Stream Community Body Size Structure
Previous Article in Journal
Spatial Heterogeneity-Based Explainable Machine Learning Methods—Modeling the Relationship Between Yellowfin Tuna Fishery Resources and the Environment in the Pacific Ocean
Previous Article in Special Issue
Environmental DNA Metabarcoding as a Tool for Fast Fish Assessment in Post-Cleanup Activities: Example from Two Urban Lakes in Zagreb, Croatia
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

DNA Barcode Reference Library and Undetected Diversity of Fish Species in the Yuanjiang River, China

1
Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou 510380, China
2
Ocean College, Fujian Agriculture and Forestry University, Fuzhou 350002, China
3
Fisheries and Life Science College, Shanghai Ocean University, Shanghai 201306, China
4
Fisheries Administration of Qiubei, Wenshan 663220, China
5
Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Science, Wuhan 430223, China
6
Key Laboratory of Aquatic Animal Immune Technology of Guangdong Province, Guangzhou 510380, China
7
Guangzhou Scientific Observing and Experimental Station of National Fisheries Resources and Environment, Guangzhou 510380, China
*
Authors to whom correspondence should be addressed.
Fishes 2025, 10(8), 418; https://doi.org/10.3390/fishes10080418
Submission received: 15 July 2025 / Revised: 14 August 2025 / Accepted: 14 August 2025 / Published: 20 August 2025

Abstract

The Yuanjiang River, situated in the upper reaches of the Red River, is a crucial component of a biodiversity hotspot in the mountains of southwestern China, supporting a high diversity of fish species. Nevertheless, systematic research on fish diversity in the Yuanjiang River is scarce, scattered, and outdated. In our study, we produced 764 DNA barcodes belonging to 64 fish morphospecies to evaluate fish diversity in the Yuanjiang River. Barcoding gap analysis and DNA-based delimitation approaches achieved a high identification success rate (>93%), indicating that DNA barcoding is a practical approach for delimiting fish in the Yuanjiang River. However, four species were characterized by high levels of intraspecific divergence, generating multiple clades and/or molecular operational taxonomic units (MOTUs), suggesting that these species might comprise undetected species. Meanwhile, two closely related species within the genus Schistura, i.e., S. callichroma and S. caudofurca, cannot be delimited by the DNA barcoding technique, which is indicative of recent speciation. In summary, this study established a reliable DNA barcode reference library for fish species in the Yuanjiang River and revealed previously unknown fish diversity.
Key Contribution: This study establishes the first comprehensive DNA barcode reference library for Yuanjiang River fish, comprising 764 barcodes from 64 species, enabling highly accurate (>93%) species identification. In addition, hidden diversity observed within four species suggests potential cryptic species.

1. Introduction

The Red River is an internationally important river that flows through China and Vietnam. The upper reach of the Red River—the Yuanjiang River—is an essential component of the biodiversity hotspot in the mountains of southwestern China, which harbors rich fish diversity and endemic fish species due to its complex geological history and dramatic topographical variations [1,2,3,4]. A total of 82 known fish species have been documented in the Yuanjiang River throughout history, and numerous species are endemic and even endangered, such as Bagarius rutilus and Poropuntius krempfi [1,2]. Furthermore, more and more invasive species such as Oreochromis niloticus, Cirrhinus mrigala, and Pterygoplichthys pardalis were documented in this river during our field surveys between 2023 and 2024. Meanwhile, several new species distributed in this river have been described over the last several years [5,6], indicating that unexplored fish diversity remains to be discovered. To better manage and protect fish resources in the Yuanjiang River, summarizing the current fish diversity is a necessary step.
Nevertheless, systematic research on fish diversity in the Yuanjiang River is rare, scattered, and outdated. For example, Zhou et al. (1999) compared β diversity among three branches within the Yuanjiang River based on field surveys conducted in the 1990s [7]. Liu et al. (2025) evaluated the length–weight and length–length relationships of twenty-three species of freshwater fish in the Yuanjiang River [8]. Other relevant reports were only found in ichthyography books and published research that focused on species descriptions [1,2]. Considering the timeliness and variability of the data on fish diversity in the Yuanjiang River, it is highly necessary to carry out investigations and research on fish diversity in this river.
Understanding detailed fish diversity in a river is crucial for utilization and conservation [9,10,11]. To date, most taxonomic descriptions of species have mainly been via morphological characterizations. Nevertheless, misidentifications often occurred due to phenotypic plasticity, cryptic speciation, and/or differing life stages [12,13,14,15,16,17,18]. DNA barcoding is a popular molecular approach for rapidly and accurately discriminating animal species, which has been shown to be highly successful in identifying fish, regardless of life stage [19,20,21,22,23,24]. Additionally, this technique is widely used for discovering cryptic species and novel species [25,26,27,28,29,30]. As a consequence, DNA barcoding enables accurate species identification and can detect morphologically cryptic species, as well as aid in the search for new species.
In this study, we employed DNA barcoding to investigate the fish diversity in the Yuanjiang River, using samples collected during four field surveys between 2023 and 2024. Our main goals were (1) to establish a DNA barcoding library for the ichthyofauna in the Yuanjiang River, (2) to shed new light on fish diversity via sequence-based delimitation methods, and (3) to reveal discrepancies in species observed between DNA barcodes and morphological features.

2. Materials and Methods

2.1. Sample Collection

During field surveys conducted in the Yuanjiang River between 2023 and 2024, fish specimens were randomly sampled using gill nets, throw nets, and fish cages. Species identification followed morphological criteria outlined in two monographs: “The Fishes of Yunnan, China” [1] and the “Checklist of Fishes of Yunnan” [2]. A total of 64 morphologically delimitated species were collected across 18 sampling sites spanning the mainstem and tributaries (Figure 1; Table S1). To construct a DNA barcoding library for Yuanjiang River fishes, 764 specimens representing all 64 species were selected for genetic analysis (Figure 1; Table S1). We created the sampling map using ArcGIS 10.2 and then modified it in Microsoft Office 2017. Fin clips or muscle tissues were clipped and conserved in absolute ethyl alcohol for DNA extraction in the laboratory. All the tissue samples used for genomic DNA extraction were deposited in the Freshwater Fish Museum at the Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences.

2.2. Genomic Extraction and Sequencing

Total genomic DNA was extracted using the Axygen DNA Extraction Kit, following the manufacturer’s instructions. We then checked DNA quality using 1% agarose gel electrophoresis. The standard barcoding fragment (~648 bp from the 5′ end of the mitochondrial COI gene) was selected as a DNA barcoding marker. We utilized the fish universal primers FishF1/FishF2 and FishR1/FishR2 to amplify and sequence the target locus [31].
The PCR reactions were performed in a total volume of 30 µL, containing the following components: 10 μL of 2 × PCR buffer, 2 μL of dNTPs (2.5 mM each), 0.2 µL of Taq DNA polymerase (2.0 U), 1 μL of template DNA, 1 μL of each primer (10 pmol), and 15 μL of ddH2O. The PCR conditions for the two fish universal primers contained initial denaturation at 95 °C for 5 min, 30 cycles at 94 °C for 30 s, 54 °C for 30 s, and 72 °C for 1 min, and a final extension at 72 °C for 10 min. The PCR products with high quality were sequenced bi-directionally on an ABI 3730XL DNA system (Perkin-Elmer Applied Biosystems, Foster City, CA, USA) to decrease the occurrence of sequencing mistakes. We checked and assembled the contigs using the SEQMEN in the Lasergene package (DNASTAR, Inc., Madison, WI, USA). The sequences were aligned and trimmed using the MEGA version X [32] and were submitted to the GenBank database under the accession numbers PV878687–PV879450 (Table S1).

2.3. Genetic Distance Calculations

Firstly, we calculated Kimura 2-parameter (K2P) [33] pairwise genetic distances within and between species. Secondly, we conducted a barcoding gap analysis by making species-level comparisons between maximum intraspecific genetic distances and minimum distances to the nearest neighbor [34]. Thirdly, we constructed a neighbor-joining (NJ) tree using K2P genetic distances with the “pairwise deletion” option and 1000 bootstrap replicates. All abovementioned analyses were performed using MEGA version X [32].

2.4. Sequence-Based Species Delimitation

Three species delimitation approaches were utilized to group specimens into molecular operational taxonomic units (MOTUs). Firstly, we performed automatic barcode gap discovery analysis (ABGD) using the default value for the relative gap width (X = 1.5) and K2P genetic distance [35] in ABGD program. Secondly, the version of the Poisson tree process (PTP) model in multiple rates was implemented on the web server (https://mcmc-mptp.h-its.org/; accessed on 5 May 2025 [36,37]) using a maximum likelihood (ML) tree constructed with the software RAxML-VI-HPC [38]. The GTR + I + R model selected in MrModelTEST [39] was employed for ML analyses. Thirdly, Generalized Mixed Yule Coalescent (GMYC) analyses [40] using single- and multiple-threshold values were also performed on a web server (http://species.h-its.org/gmyc/; accessed on 10 May 2025). A fully resolved ultrametric gene tree was inferred as input for the GMYC analysis using BEAST version 1.8.2 [41] as a Bayesian tree. A birth–death tree prior model and a GTR + I + R substitution model were employed in Bayesian reconstructions with haplotype sequences. We opted for an uncorrelated, relaxed lognormal clock model, estimating the rate from the data and ucld-mean parameters. A uniform prior with lower and upper boundaries of 0 and 10 was set. The remaining settings were set as the default parameters. Fifty million generations with 1000 sampling frequencies were used in the Monte Carlo Markov Chain (MCMC) run. Effective sample sizes (ESSs) for all parameters were assessed in the software Tracer v1.5 [42]. The final trees were summarized in a maximum credibility tree using TreeAnnotator version 1.8.2, with 50% of the total trees discarded [41]. A majority-rule consensus was adopted from the aforementioned three algorithms for the delimitation scheme.

2.5. Cryptic Diversity Analyses

Regarding morphological species that comprise multiple clades and/or MOTUs with higher genetic distances, the construction of an NJ tree for visual inspection was carried out using K2P distances, with 1000 bootstrap replicates. In addition, the mean K2P genetic divergence between observed clades was also calculated. The abovementioned analyses were performed using MEGA version X [32].

3. Results

3.1. Sequence Information

We successfully obtained 764 DNA barcodes that represent 64 morphospecies, 52 genera, 15 families, and 5 orders (Table S1). On average, 11.94 sequences were obtained per morphospecies, with a range of 1 to 42 (11 morphospecies were represented by singletons). The length of the barcoding sequences after aligning and trimming reached 632 base pairs (bp). The absence of any deletions, insertions, or stop codons in the amplified sequences suggests that all sequences constitute functional mitochondrial COI sequences.

3.2. Genetic Distance Calculation

A hierarchical increase in mean genetic distance was obtained, ranging from within species (0.47 ± 0.03%) to within congeners (4.48 ± 0.29%), as measured by the K2P distance (Table 1). Of the 53 species with more than two specimens, 49 had an intraspecific distance of less than 2%. Nevertheless, four species, i.e., Hemiculter leucisculus, Misgurnus anguillicaudatus, Paramisgurnus dabryanus, and Rhinogobius honghensis, exhibited relatively high intraspecific divergence (>2%). Among 53 species with more than two specimens, barcode gap analysis showed that 52 species had barcoding gaps with the minimum interspecific distance to the nearest neighbor larger than the maximum intraspecific distance (Figure 2). Spearman correlation analysis revealed that the intraspecific sequence divergence is not significantly correlated with sample size (R = 0.177, p = 0.206). The NJ trees (Figure 3) yielded 68 clades from 64 morphospecies (including the singleton species), suggesting that several morphospecies might form multiple clades. Four species, i.e., Hemiculter leucisculus, Opsariichthys bidens, Misgurnus anguillicaudatus, and Paramisgurnus dabryanus, yielded two clusters from the NJ trees. In addition, a visual inspection of the NJ tree revealed that 60 morphospecies formed concordant clusters of DNA barcodes.

3.3. Results of Sequence-Based Species Delimitation

Three delimitation approaches yielded discordant results (Figure 4). The ABGD analyses conducted with the K2P models resolved 65 MOTUs. Among the sixty-five MOTUs, we detected that Misgurnus anguillicaudatus and Opsariichthys bidens comprised two MOTUs. In addition, the ABGD analyses supported two Schistura species, namely S. callichroma and S. caudofurca, that shared a MOTU (Figure 4). The remaining MOTUs were found to be concordant with morphological identification (Figure 4).
The Poisson tree process (mPTP) obtained 52 MOTUs, which was fewer than the number of MOTUs yielded by ABGD analyses (Figure 4). Only 45 MOTUs were consistent with morphological results. mPTP analyses yielded conservative outcomes than other approaches. For example, mPTP clustered five morphologically distinct Schistura species into one MOTU. In addition, six species belonging to five genera, i.e., Pseudohemiculter dispar, Anabarilius sp., Chanodichthys dabryi, Culter alburnus, Toxabramis houdemeri, and Hemiculter leucisculus, were combined into one MOTU (Figure 4). A similar pattern was also seen in three species belonging to three genera, i.e., Discogobio yunnanensis, Ageneiogarra imberba, and Placocheilus caudofasciatus (Figure 4).
Regarding GMYC analyses, a total of 80 MOTUs were identified. GMYC analyses revealed that 11 morphological species had 2–4 MOTUs (Figure 4), resulting in more MOTUs than the other two approaches.
A majority-rule consensus established the final consensus, which obtained 65 MOTUs. In sum, sixty species were unambiguously identified by the delimitation analyses, two species (Opsariichthys bidens and Misgurnus anguillicaudatus) comprised two MOTUs, and two closely related species (Schistura callichroma and S. caudofurca) were found to display mixed genealogies with one MOTU (Figure 4).

3.4. Cryptic Diversity

Two clades were detected in four species, i.e., Hemiculter leucisculus, Opsariichthys bidens, Paramisgurnus dabryanus, and Misgurnus anguillicaudatus. Furthermore, higher genetic divergence between the two clades of each species was examined, with values ranging from 2.70% (Paramisgurnus dabryanus) to 10.08% (Misgurnus anguillicaudatus) (Figure 5).

4. Discussion

This is the first study to present a complete DNA barcode reference library for fish species in the Yuanjiang River in China. The library comprises 64 morphospecies and covers most of the river section including the main stem and tributaries (Figure 1). Of the 64 species, 54 are historically documented, accounting for approximately 65.85% of the reported species in this river [2]. In addition, our study collected six exotic species and five unnamed species. Barcoding analyses revealed that 60 morphospecies (93.75% of those analyzed) could be unambiguously identified, providing a simple identification system for the fish species in this river. However, the high intraspecific divergences observed for two species, as well as the failure to delimit two closely related species, demonstrate the need for further taxonomic research on these species. Our findings provide robust evidence that DNA barcoding is a reliable tool for refining extant taxonomic classifications and uncovering cryptic diversity within the ichthyofauna of the Yuanjiang River.

4.1. DNA Barcode Reference Library of Fish Species in the Yuanjiang River

The corrected mean K2P intraspecific distance of 0.47% (SE = 0.0003) calculated for the fish species in the Yuanjiang River is slightly greater than the values that reported in the several previous fish barcoding studies involving Chinese rivers, such as the Nujiang River of China section (0.41%), the middle of the Yangtze River (0.36%) and the middle and lower Pearl River (0.32%) and the Irtysh River (0.30%) [18,43,44,45]. The relatively higher level of intraspecific divergence observed in the Yuanjiang River can be explained by several species with high intraspecific genetic distance. We found that the mean intraspecific genetic distances of fifteen morphospecies (28.30% of the species analyzed, for which two specimens were available) exceeded 1.00%.
The barcoding gap analysis, which included 53 species with more than two samples, revealed that the mean nearest-neighbor distances (12.38%) were, on average, ~26-fold higher than intraspecific distances (0.47%), indicating that the present library is useful for identifying Yuanjiang River fish. Furthermore, we demonstrated the presence of a barcode gap in 52 species, with the exception of Schistura callichroma (Figure 2). This study also revealed that the interspecific divergence (3.77%) between S. callichroma and S. caudofurca was slightly lower than the maximum intraspecific divergence (3.94%). Additionally, while NJ tree analysis produced monophyletic clades for S. callichroma and S. caudofurca, Bayesian trees revealed that the two species were combined into a single phylogenetic clade (Figure 3 and Figure 4). These findings suggest that the two species diverged recently and share a close phylogenetic relationship. To clarify the relationships between the two species, future studies should conduct population-level analyses using larger sample sizes and nuclear loci.

4.2. Cryptic Diversity

The most salient finding was that four species (Hemiculter leucisculus, Opsariichthys bidens, Paramisgurnus dabryanus, and Misgurnus anguillicaudatus) generated two clades from the NJ topology, which suggested that these species have undetected genetic diversity. Hemiculter leucisculus is a widespread species in East Asia, with undetected diversity demonstrated in different drainages. For instance, Chen et al. (2017, 2021, 2022) found that H. leucisculus populations in the Pearl River generated two deep mitochondrial clades with higher genetic distance [18,46,47]. The upper Yuanjiang River is situated near two major rivers: the Pearl River and the Lancang River. Furthermore, the lower section of the Yuanjiang River in Vietnam is near the Hainan Island drainages. These can be connected due to the drop in sea level triggered by glaciation during the Late Pleistocene [48,49,50]. Collectively, the two H. leucisculus clades that occurred in the Yuanjiang River were considered a normal phenomenon. The same situation can be used to explain the pattern observed in Opsariichthys bidens populations. In addition, Opsariichthys bidens is a small, benthic taxon that was found to have an unexpected genetic diversity within the same drainage area, which could be because of limited dispersal ability, geographic separation, and geological events [51,52,53].
For Paramisgurnus dabryanus, two clades observed could be interpreted by their introductions from different sources. This species is an economically important taxon cultivated in many regions, including southwestern China. Introduction from various genetic resources can trigger multiple clades. Moreover, hybridization with other species may cause P. dabryanus to produce multiple clades, due to the introduction of mitochondrial genomes from other species [54]. Similar cases can be seen in other rivers in China, such as the Nujiang River and the lower Pearl River [18,43]. It was not unexpected that two deep clades were observed in the pond loach (Misgurnus anguillicaudatus) in an independent river. For example, Zhong et al. (2019) argued that Misgurnus anguillicaudatus populations both in the Pearl River and the Yangtze River had four mitochondrial clades with high genetic divergence among clades [55].

4.3. Implications for the Yuanjiang River Ichthyofauna

The present study provides the most comprehensive geographical, taxonomic, and molecular sampling of the Yuanjiang River to date, with several novel findings that contradict current taxonomic knowledge. The most challenging cases pertained to shallow interspecific divergence between two Schistura species, as well as the detection of multiple, highly divergent clades within four species. Given that close relationships between the two Schistura species may be the result of a recent divergence, using additional molecular loci with higher substitution rates—such as the control region or nuclear genome dataset—would certainly help to distinguish such cases more effectively. Furthermore, the presence of multiple clades/MOTUs in four species may be attributable to the presence of subtle, overlooked morphological differences, cryptic diversity, and unrecognized speciation events [46]. However, the taxonomic status of these cryptic clades, as delineated by the mitochondrial barcoding marker, is not robust. Further assessments of morphological divergence and phylogenetic placement should be performed using additional nuclear loci.
Five morphologically distinct species belonging to four genera—Anabarilius, Schistura, Rhinogobius, and Mastacembelus—cannot be assigned to exact species based on currently available publications. This observation suggests that many species in the Yuanjiang River remain undescribed because of a lack of systematic research on this river. Additionally, the ichthyofauna of the Yuanjiang River combines species from Vietnam and southwestern China, suggesting a complex composition of fish species. Collectively, to enhance our understanding of fish diversity in the Yuanjiang River, more field surveys and taxonomic research involving the ichthyofauna of the Yuanjiang River should be conducted in the future.

5. Conclusions

In the current study, a DNA barcode reference library was constructed, comprising 64 fish species from the Yuanjiang River in China, based on high-density sampling. The relatively high identification success rate (93.75%) demonstrates the reliability of DNA barcoding in automatically delimiting fish in the Yuanjiang River. The newly detected mitochondrial clades in four species necessitate a substantial effort to better determine the diversity of these groups. Nevertheless, we also confirmed the limitation of DNA barcoding when it comes to the automated identification of closely related species. Furthermore, five taxa cannot be delimited to the species level using morphological characteristics only, indicating that more taxonomic studies should be conducted for the fish in this river. Given that approximately 20% of species were not included in the current study, a more comprehensive barcode library containing unsampled species requires updating.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/fishes10080418/s1, Table S1: Detailed information including collection data and GenBank accession numbers for sequences used in this study.

Author Contributions

W.C., J.L., Y.L., X.L. and M.L. conceived this study; X.S., C.K., C.H., H.D. and H.Y. conducted the experiments; X.S. and W.C. analyzed the data and drafted the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Finance Special Fund of the Chinese Ministry of Agriculture and Rural Affairs of the People’s Republic of China (fishery resources and environment survey in the key water areas of southwestern China).

Institutional Review Board Statement

The study was approved by the Laboratory Animal Ethics Committee of Pearl River Fisheries Research Institute (Nos. LAEC-PRFRI-2023-04-03).

Data Availability Statement

The sequences are available from the NCBI (https://dataview.ncbi.nlm.nih.gov; accessed on 1 July 2025) under the accession numbers of PV878687–PV879450.

Conflicts of Interest

The authors declare no competing interests.

References

  1. Chu, X.L.; Chen, Y.R. The Fishes of Yunnan, China; Sciences Press: Beijing, China, 1989. [Google Scholar]
  2. Chen, X.Y. Checklist of Fishes of Yunnan. Zoo. Res. 2013, 34, 281–343. [Google Scholar]
  3. Schoenbohm, L.M.; Whipple, K.; Burchfiel, B.; Chen, L. Geomorphic constraints on surface uplift, exhumation, and plateau growth in the Red River region, Yunnan Province, China. Geol. Soc. Am. Bull. 2004, 116, 895–909. [Google Scholar] [CrossRef]
  4. Allen, C.; Gillespie, A.; Yuan, H.; Sieh, K.E.; Buchun, Z.; Chengnan, Z. Red River and associated faults, Yunnan Province, China: Quaternary geology, slip rates, and seismic hazard. Geol. Soc. Am. Bull. 1984, 95, 686–700. [Google Scholar]
  5. Li, X.; Yang, B.; Guo, Y.; Zhou, W. Three new species of Homatula (Teleostei: Nemacheilidae) from Yunnan, China, with comments on habitat conservation. PLoS ONE 2022, 17, e0276846. [Google Scholar] [CrossRef]
  6. Cao, J.; Song, S.; Yi, W.; Xiang, H.; Yang, H.; Li, J.; Jiang, W. Two new cavefish species of Triplophysa (Cypriniformes: Nemacheilidae) from the karst subterranean rivers of Yunnan, China. Zool. J. Linn. Soc. 2025, 204, zlaf033. [Google Scholar] [CrossRef]
  7. Zhou, W.; Liu, J.H.; Ye, X.M. A comparison of fish β diversity among three branches of Yuanjiang River System, Yunnan. Zool. Res. 1999, 20, 111–117. [Google Scholar]
  8. Liu, Y.; He, C.; Liu, M.; Chen, W.; Xia, Y.; Li, Y.; Zhu, S.; Li, J.; Li, X. Length-weight and length-length relations of twenty-three freshwater fish species (Actinopterygii) from the Yuanjiang River, Yunnan Province, China. Acta Ichthyol. Piscat. 2025, 55, 83–87. [Google Scholar] [CrossRef]
  9. Aguirre, W.E.; Alvarez-Mieles, G.; Anaguano-Yancha, F.; Burgos Morán, R.; Cucalón, R.V.; Escobar-Camacho, D.; Jácome-Negrete, I.; Jiménez Prado, P.; Laaz, E.; Miranda-Troya, K.; et al. Conservation threats and future prospects for the freshwater fishes of Ecuador: A hotspot of Neotropical fish diversity. J. Fish Biol. 2021, 99, 1158–1189. [Google Scholar] [CrossRef]
  10. Hare, J.A.; Borggaard, D.L.; Alexander, M.A.; Bailey, M.M.; Bowden, A.A.; Damon-Randall, K.; Didden, J.T.; Hasselman, D.J.; Kerns, T.; McCrary, R.; et al. A review of river herring science in support of species conservation and ecosystem restoration. Mar. Coast. Fish. 2021, 13, 627–664. [Google Scholar] [CrossRef]
  11. Ayoola, V.B.; Idoko, I.P.; Eromonsei, S.O.; Afolabi, O.; Apampa, A.R.; Oyebanji, O.S. The role of big data and AI in enhancing biodiversity conservation and resource management in the USA. World J. Adv. Res. Rev. 2024, 23, 1851–1873. [Google Scholar] [CrossRef]
  12. Tang, Q.; Deng, L.; Luo, Q.; Duan, Q.; Wang, X.; Zhang, R. DNA barcoding of fish species diversity in Guizhou, China. Diversity 2023, 15, 203. [Google Scholar] [CrossRef]
  13. Marnis, H.; Syahputra, K.; Darmawan, J.; Febrianti, D.; Tahapari, E.; Larashati, S.; Iswanto, B.; Primanita, E.P.H.; Syaifudin, M.; Subangkit, A.T. DNA barcoding of fish diversity from Batanghari River, Jambi, Indonesia. Fish. Aquat. Sci. 2024, 27, 87–99. [Google Scholar] [CrossRef]
  14. Panprommin, D.; Soontornprasit, K.; Tuncharoen, S.; Pithakpol, S.; Kannika, K.; Wongta, K. DNA barcoding for fish species identification and diversity assessment in the Mae Tam reservoir, Thailand. Fish. Aquat. Sci. 2023, 26, 548–557. [Google Scholar] [CrossRef]
  15. Shan, B.; Liu, Y.; Yang, C.; Zhao, Y.; Zhang, G.; Wu, Q.; Sun, D. DNA barcoding of fish in mischief reef—Fish diversity of a reef fish community from Nansha Islands. Front. Mar. Sci. 2021, 7, 618954. [Google Scholar] [CrossRef]
  16. Chan, W.W.R.; Chang, J.J.M.; Tan, C.Z.; Ng, J.X.; Ng, M.H.C.; Jaafar, Z.; Huang, D. Eyeing DNA barcoding for species identification of fish larvae. J. Fish Biol. 2024, 105, 1784–1799. [Google Scholar] [CrossRef]
  17. Marrone, F.; Fontaneto, D.; Naselli-Flores, L. Cryptic diversity, nichedisplacement and our poor understanding of taxonomy and ecology of aquatic microorganisms. Hydrobiologia 2023, 850, 1221–1236. [Google Scholar] [CrossRef]
  18. Chen, W.; Li, C.; Yang, J.; Zhu, S.; Li, J.; Li, Y.; Li, X. Temporal species-level composition of larvae resources in the lower Pearl River drainage and implications for species’ reproductive cycles. Gene 2021, 776, 145351. [Google Scholar] [CrossRef]
  19. Tsoupas, A.; Papavasileiou, S.; Minoudi, S.; Gkagkavouzis, K.; Petriki, O.; Bobori, D.; Sapounidis, A.; Koutrakis, E.; Leonardos, I.; Karaiskou, N.; et al. DNA barcoding identification of Greek freshwater fishes. PLoS ONE 2022, 17, e0263118. [Google Scholar] [CrossRef]
  20. Modeel, S.; Negi, R.K.; Sharma, M.; Dolkar, P.; Yadav, S.; Siwach, S.; Yadav, P.; Negi, T. A comprehensive DNA barcoding of Indian freshwater fishes of the Indus River system. Beas. Sci. Rep. 2024, 14, 2763. [Google Scholar] [CrossRef]
  21. Munguia-Vega, A.; Terrazas-Tapia, R.; Dominguez-Contreras, J.F.; Reyna-Fabian, M.; Zapata-Morales, P. DNA barcoding reveals global and local influences on patterns of mislabeling and substitution in the trade of fish in Mexico. PLoS ONE 2022, 17, e0265960. [Google Scholar] [CrossRef]
  22. Afrand, M.; Sourinejad, I.; Shahdadi, A.; Vera, M. DNA barcoding for identification and discovery of fish species in the protected mangroves of Hormozgan, Iran. Estuar. Coast. 2024, 47, 865–879. [Google Scholar] [CrossRef]
  23. Habib, K.A.; Neogi, A.K.; Rahman, M.; Oh, J.; Lee, Y.H.; Kim, C.G. DNA barcoding of brackish and marine water fishes and shellfishes of Sundarbans, the world’s largest mangrove ecosystem. PLoS ONE 2021, 16, e0255110. [Google Scholar] [CrossRef]
  24. Muhala, V.; Guimarães-Costa, A.; Macate, I.E.; Rabelo, L.P.; Bessa-Silva, A.R.; Watanabe, L.; Dos Santos, G.D.; Sambora, L.; Vallinoto, M.; Sampaio, I. DNA barcoding for the assessment of marine and coastal fish diversity from the Coast of Mozambique. PLoS ONE 2024, 19, e0293345. [Google Scholar] [CrossRef]
  25. Yin, Y.; Yao, L.F.; Hu, Y.; Shao, Z.K.; Hong, X.Y.; Hebert, P.D.; Xue, X.F. DNA barcoding uncovers cryptic diversity in minute herbivorous mites (Acari, Eriophyoidea). Mol. Ecol. Resour. 2022, 22, 1986–1998. [Google Scholar] [CrossRef]
  26. Costa, A.C.P.; Garcia, T.M.; Tavares, T.C.L.; dos Santos Rocha, R.; Maggioni, R.; Soares, M.O. DNA barcoding reveals cryptic diversity of fish eggs in Equatorial Southwestern Atlantic. Reg. Stud. Mar. Sci. 2023, 65, 103069. [Google Scholar] [CrossRef]
  27. Teramura, A.; Koeda, K.; Matsuo, A.; Sato, M.P.; Senou, H.; Ho, H.C.; Suyama, Y.; Kikuchi, K.; Hirase, S. Assessing the effectiveness of DNA barcoding for exploring hidden genetic diversity in deep-sea fishes. Mar. Ecol. Prog. Ser. 2022, 701, 83–98. [Google Scholar] [CrossRef]
  28. Hasan, M.; Kambayashi, C.; Anik, Z.H.; Islam, M.S. Cryptic biodiversity of freshwater fish species in Bangladesh. PLoS ONE 2025, 20, e0318982. [Google Scholar] [CrossRef] [PubMed]
  29. Nascimento, M.H.S.; Aragão, D.G.; Silva, J.L.N.; Lima, R.C.; Birindelli, J.L.O.; Fraga, E.C.; Barros, M.C. The DNA barcode reveals cryptic diversity and a new record for the genus Leporinus (Characiformes, Anostomidae) in the hydrographic basins of central northern Brazil. PeerJ 2023, 11, e15184. [Google Scholar] [CrossRef]
  30. Chac, L.; Thinh, B. Species identification through DNA barcoding and its applications: A review. Biol. Bull. 2023, 50, 1143–1156. [Google Scholar] [CrossRef]
  31. Ward, R.D.; Zemlak, T.S.; Innes, B.H.; Last, P.R.; Hebert, P.D. DNA barcoding Australia’s fish species. Philos. T. R. Soc. B 2005, 360, 1847–1857. [Google Scholar] [CrossRef] [PubMed]
  32. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K.J. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef]
  33. Kimura, M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 1980, 16, 111–120. [Google Scholar] [CrossRef] [PubMed]
  34. Meyer, C.P.; Paulay, G. DNA barcoding: Error rates based on comprehensive sampling. PLoS Biol. 2005, 3, e422. [Google Scholar] [CrossRef]
  35. Puillandre, N.; Lambert, A.; Brouillet, S.; Achaz, G. ABGD, Automatic Barcode Gap Discovery for primary species delimitation. Mol. Ecol. 2012, 21, 1864–1877. [Google Scholar] [CrossRef]
  36. Kapli, P.; Lutteropp, S.; Zhang, J.; Kobert, K.; Pavlidis, P.; Stamatakis, A.; Flouri, T. Multi-rate Poisson tree processes for single-locus species delimitation under maximum likelihood and Markov chain Monte Carlo. Bioinformatics 2017, 33, 1630–1638. [Google Scholar] [CrossRef]
  37. Zhang, J.; Kapli, P.; Pavlidis, P.; Stamatakis, A. A general species delimitation method with applications to phylogenetic placements. Bioinformatics 2013, 29, 2869–2876. [Google Scholar] [CrossRef] [PubMed]
  38. Stamatakis, A. RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 2006, 22, 2688–2690. [Google Scholar] [CrossRef]
  39. Nylander, J.A.A. (Ed.) MrModeltest v2. Program Distributed by the Author; Evolutionary Biology Centre, Uppsala University: Uppsala, Sweden, 2004. [Google Scholar]
  40. Fujisawa, T.; Barraclough, T.G. Delimiting species using single-locus data and the Generalized Mixed Yule Coalescent approach: A revised method and evaluation on simulated data sets. Syst. Biol. 2013, 62, 707–724. [Google Scholar] [CrossRef] [PubMed]
  41. Drummond, A.J.; Rambaut, A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol. Biol. 2007, 7, 214. [Google Scholar] [CrossRef]
  42. Tracer, version 1.4; University of Edinburgh: Edinburgh, UK, 2009. Available online: http://beast.bio.ed.ac.uk/Tracer (accessed on 2 February 2025).
  43. Chen, W.; Ma, X.; Shen, Y.; Mao, Y.; He, S. The fish diversity in the upper reaches of the Salween River, Nujiang River, revealed by DNA barcoding. Sci. Rep. 2015, 5, 17437. [Google Scholar] [CrossRef]
  44. Shen, Y.; Guan, L.; Wang, D.; Gan, X. DNA barcoding and evaluation of genetic diversity in Cyprinidae fish in the midstream of the Yangtze River. Ecol. Evol. 2016, 6, 2702–2713. [Google Scholar] [CrossRef] [PubMed]
  45. Yang, T.; Meng, W.; Zhang, R.; Gao, T.; Cai, L.; Hai, S.; Zhou, Q. DNA barcoding of fishes in Irtysh River China. Russ. J. Genet. 2016, 52, 969–976. [Google Scholar] [CrossRef]
  46. Chen, W.; Hubert, N.; Li, Y.; Xiang, D.; Cai, X.; Zhu, S.; Yang, J.; Zhou, C.; Li, X.; Li, J. Large-scale DNA barcoding of the subfamily Culterinae (Cypriniformes: Xenocyprididae) in East Asia unveils a geographical scale effect, taxonomic warnings and cryptic diversity. Mol. Ecol. 2022, 31, 3871–3887. [Google Scholar] [CrossRef] [PubMed]
  47. Chen, W.; Zhong, Z.; Dai, W.; Fan, Q.; He, S. Phylogeographic structure, cryptic speciation and demographic history of the sharpbelly (Hemiculter leucisculus), a freshwater habitat generalist from southern China. BMC Evol. Biol. 2017, 17, 216. [Google Scholar] [CrossRef]
  48. Liu, Z.; Trentesaux, A.; Clemens, S.C.; Colin, C.; Wang, P.; Huang, B.; Boulay, S. Clay mineral assemblages in the northern South China Sea: Implications for East Asian monsoon evolution over the past 2 million years. Mar. Geol. 2003, 201, 133–146. [Google Scholar] [CrossRef]
  49. Zong, Y.; Yim, W.-S.; Yu, F.; Huang, G. Late Quaternary environmental changes in the Pearl River mouth region, China. Quatern. Int. 2009, 206, 35–45. [Google Scholar] [CrossRef]
  50. Wang, P.; Li, Q. The South China Sea: Paleoceanography and Sedimentology; Springer: Dordrecht, The Netherlands, 2009. [Google Scholar]
  51. Perdices, A.; Sayanda, D.; Coelho, M.M. Mitochondrial diversity of Opsariichthys bidens (Teleostei, Cyprinidae) in three Chinese drainages. Mol. Phylogenet. Evol. 2005, 37, 920–927. [Google Scholar] [CrossRef] [PubMed]
  52. Li, G.Y.; Wang, X.Z.; Zhao, Y.H.; Zhang, J.; Zhang, C.G.; He, S.P. Speciation and Phylogeography of Opsariichthys bidens (Pisces: Cypriniformes: Cyprinidae) in China: Analysis of the Cytochrome b Gene of mtDNA from Diverse Populations. Zool. Stud. 2009, 48, 569–583. [Google Scholar]
  53. Berrebi, P.; Retif, X.; Fang, F.; Zhang, C.G. Population structure and systematics of Opsariichthys bidens (Osteichthyes: Cyprinidae) in south-east China using a new nuclear marker: The introns (EPIC-PCR). Biol. J. Linn. Soc. 2006, 87, 155–166. [Google Scholar] [CrossRef]
  54. You, C.H.; Yu, X.M.; Tong, J.O. Detection of hybridization between two loach species (Paramisgurnus dabryanus and Misgurnus anguillicaudatus) in wild populations. Environ. Biol. Fishes 2009, 86, 65–71. [Google Scholar] [CrossRef]
  55. Zhong, J.; Yi, S.; Ma, L.; Wang, W. Evolution and phylogeography analysis of diploid and polyploid Misgurnus anguillicaudatus populations across China. P. Roy. Soc. B-Biol. Sci. 2019, 286, 20190076. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Map of sampling locations in the present study. Green dots indicate sampling sites and the abbreviations of the sampling sites refer to Table S1.
Figure 1. Map of sampling locations in the present study. Green dots indicate sampling sites and the abbreviations of the sampling sites refer to Table S1.
Fishes 10 00418 g001
Figure 2. The maximum intraspecific distance is compared with the nearest-neighbor distance for fish in the Yuanjiang River. Species with more than two sequences are presented in Table S1. Green dots indicate maximum intraspecific distance and the nearest-neighbor distance for each species. The species fall above the 1:1 line, indicating the presence of a barcode gap.
Figure 2. The maximum intraspecific distance is compared with the nearest-neighbor distance for fish in the Yuanjiang River. Species with more than two sequences are presented in Table S1. Green dots indicate maximum intraspecific distance and the nearest-neighbor distance for each species. The species fall above the 1:1 line, indicating the presence of a barcode gap.
Fishes 10 00418 g002
Figure 3. Neighbor-joining tree of overall barcoding sequences based on the K2P model.
Figure 3. Neighbor-joining tree of overall barcoding sequences based on the K2P model.
Fishes 10 00418 g003
Figure 4. DNA-based species delimitation of 64 morphological species (green) in this study. MOTU delimitation schemes collected from the automatic barcode gap discovery analysis (ABGD), Poisson tree process (PTP) model in multiple rates (mPTP), and Generalized Mixed Yule Coalescent (GMYC) algorithms (orange), including the consensus delimitation scheme (light blue). A Bayesian tree conducted in BEAST was used to summarize delimitation results.
Figure 4. DNA-based species delimitation of 64 morphological species (green) in this study. MOTU delimitation schemes collected from the automatic barcode gap discovery analysis (ABGD), Poisson tree process (PTP) model in multiple rates (mPTP), and Generalized Mixed Yule Coalescent (GMYC) algorithms (orange), including the consensus delimitation scheme (light blue). A Bayesian tree conducted in BEAST was used to summarize delimitation results.
Fishes 10 00418 g004
Figure 5. Neighbor-joining trees with mean inter-clade distances of the four species based on the K2P model. (a) Hemiculter leucisculus, (b) Opsariichthys bidens, (c) Misgurnus anguillicaudatus, and (d) Paramisgurnus dabryanus.
Figure 5. Neighbor-joining trees with mean inter-clade distances of the four species based on the K2P model. (a) Hemiculter leucisculus, (b) Opsariichthys bidens, (c) Misgurnus anguillicaudatus, and (d) Paramisgurnus dabryanus.
Fishes 10 00418 g005
Table 1. Mean values, ranges, and standard deviations of genetic distances within and between species.
Table 1. Mean values, ranges, and standard deviations of genetic distances within and between species.
Mean%SEMin%Max%
Within species0.470.0003010.22
Within genus4.480.00323.3823.64
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shi, X.; Kou, C.; He, C.; Deng, H.; Yang, H.; Li, X.; Liu, M.; Liu, Y.; Li, J.; Chen, W. DNA Barcode Reference Library and Undetected Diversity of Fish Species in the Yuanjiang River, China. Fishes 2025, 10, 418. https://doi.org/10.3390/fishes10080418

AMA Style

Shi X, Kou C, He C, Deng H, Yang H, Li X, Liu M, Liu Y, Li J, Chen W. DNA Barcode Reference Library and Undetected Diversity of Fish Species in the Yuanjiang River, China. Fishes. 2025; 10(8):418. https://doi.org/10.3390/fishes10080418

Chicago/Turabian Style

Shi, Xian, Chunni Kou, Chengdong He, Hong Deng, Hongfu Yang, Xinhui Li, Mingdian Liu, Yaqiu Liu, Jie Li, and Weitao Chen. 2025. "DNA Barcode Reference Library and Undetected Diversity of Fish Species in the Yuanjiang River, China" Fishes 10, no. 8: 418. https://doi.org/10.3390/fishes10080418

APA Style

Shi, X., Kou, C., He, C., Deng, H., Yang, H., Li, X., Liu, M., Liu, Y., Li, J., & Chen, W. (2025). DNA Barcode Reference Library and Undetected Diversity of Fish Species in the Yuanjiang River, China. Fishes, 10(8), 418. https://doi.org/10.3390/fishes10080418

Article Metrics

Back to TopTop