Next Article in Journal
Bioinformatics Approach to mTOR Signaling Pathway-Associated Genes and Cancer Etiopathogenesis
Previous Article in Journal
Rare Duplication in the RYR1 Gene Causing Malignant Hyperthermia and Clinical Variability
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genetic Diversity and Population Structure of Farmed Longfin Batfish (Platax teira) in the South China Sea

1
College of Fisheries and Life Science, Shanghai Ocean University, Shanghai 201306, China
2
Sanya Tropical Fisheries Research Institute, Sanya 572018, China
3
Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization, Ministry of Agriculture and Rural Affairs, South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou 510300, China
4
Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry, Guangzhou 510300, China
*
Author to whom correspondence should be addressed.
Genes 2025, 16(11), 1254; https://doi.org/10.3390/genes16111254 (registering DOI)
Submission received: 4 September 2025 / Revised: 13 October 2025 / Accepted: 16 October 2025 / Published: 24 October 2025
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

Background: Longfin batfish (Platax teira) is an important economic species in southern China. In recent years, its wild population has significantly declined due to overfishing. Around 2015, breakthroughs in the artificial large-scale seedling technology for P. teira have promoted the growth of its aquaculture scale in regions such as Hainan and Guangdong. Methods: To study the genetic diversity, inbreeding status, and population structure of the current P. teira farming populations in China, we performed whole-genome resequencing technology and high-density SNP markers to analyze the genetics of four main farming populations. A total of 109 individuals from four populations (NA, ZP, XL, and XC) were sequenced, identifying 5,384,029 high-quality SNPs. Results: The results showed that the nucleotide diversity (π) of each population ranged from 0.00155 to 0.00165 and observed heterozygosity (Ho) ranged from 0.253 to 0.282, which indicated low levels of genetic diversity. The results of the ROH analysis show significant inbreeding in the NA population. Genetic differentiation analysis revealed that the genetic differentiation among NA, XC, and ZP populations was relatively low (FST = 0.021–0.029). Conclusions: NA, XC, and ZP populations likely share a common origin of their fry stocks. Based on a phylogenetic tree, principal component analysis (PCA), and population structure analysis, the four populations were divided into four genetic groups. This study is the first analysis of the genetic diversity and population structure of P. teira farming populations in China, laying the foundation for the establishment of a base breeding population and the implementation of genetic improvement programs.

1. Introduction

P. teira (Forsskål, 1775), commonly known as the longfin batfish, belongs to the class Osteichthyes, order Perciformes, family Ephippidae, and genus P. teira. It is widely distributed across the Indo-Pacific region, including the Arabian Sea, Indonesia, and the Sea of Japan, and it is commonly found in the South China Sea and coastal waters of Taiwan [1]. In recent years, P. teira has gained increasing attention in China’s culture industry, particularly in Guangdong and Hainan Provinces, owing to its rapid growth rate, high ornamental and nutritional value, and excellent environmental adaptability. The successful establishment of P. teira as a new aquaculture candidate aligns with the strategic need of the Chinese aquaculture industry to diversify farmed species, enhance economic resilience, and promote sustainable mariculture.
Despite its growing importance in aquaculture, genetic studies on P. teira remain limited. To date, research has focused mainly on resource distribution [2], embryonic and larval development [3,4,5], muscle nutritional composition [6], and aquaculture techniques [7], while no studies have yet investigated its nuclear genome-level genetic diversity or population structure [8].
Genetic variation is fundamental to biological evolution. However, the expansion of aquaculture often involves repeated use of limited broodstock and frequent seed translocation, which may lead to inbreeding and a loss of genetic variation [9,10]. Following the onset of inbreeding, a population’s genetic variation can diminish considerably across a few generations, and restoring this lost diversity often presents a significant challenge [11]; this can negatively affect disease resistance and growth performance.
Single-nucleotide polymorphisms (SNPs) have emerged as the molecular marker of choice in population genetics studies due to their high genetic stability and lower genotyping error rates [12]. The application of single-nucleotide polymorphisms enables highly accurate and resource-effective investigations, which are fundamentally important for conducting advanced studies in genome-scale population evolution [13]. SNPs enable efficient and high-resolution analyses of genetic variation, population structure, and inbreeding across the genome. Numerous studies have successfully applied these markers to marine fish. For instance, in European sea bass (Dicentrarchus labrax), researchers analyzed the genetic structure of wild and cultured populations using SNP markers, revealing the genetic impact of aquaculture practices and providing valuable data for broodstock management [14]. Similarly, studies in groupers (Epinephelus coioides) identified genetic bottlenecks and declines in diversity in hatchery stocks, emphasizing the need for genetic monitoring [15]. Whole-genome resequencing (WGS) has emerged as a powerful tool to address these challenges, enabling high-resolution assessment of genetic diversity, inbreeding, and population structure in aquaculture species. In Nile tilapia (Oreochromis niloticus), WGS has revealed signatures of domestication selection and genetic erosion in farmed populations compared to their wild counterparts [16]. Similarly, in rainbow trout (Oncorhynchus mykiss), the use of WGS provided novel insights into the genetic diversity and structure of Sweden’s three main farmed rainbow trout populations [17]. However, a critical knowledge gap remains regarding the genome-wide genetic diversity and population structure of farmed P. teira populations. To date, no study has applied whole-genome resequencing to assess the genetic status of its cultured stocks.
In this research, we conducted whole-genome resequencing on 109 individual P. teira collected from four separate cultured populations across southern China. Our objectives were to (i) assess the genetic diversity within each population, (ii) evaluate genomic inbreeding through runs of homozygosity (ROH) analysis, and (iii) evaluate genetic differentiation and population structure among populations. Furthermore, this study serves as a valuable reference for genome-assisted breeding strategies in P. teira and related species.

2. Materials and Methods

2.1. Sampling and Data Collection

In this study, a total of 109 longfin batfish were collected from four distinct cultured groups on Hainan Island and Guangdong Province, China (Table 1, Figure 1). Each population was named according to its collection site, with all four originating from principal breeding farms located in the South China Sea, including NA (Nanao, Shenzhen, China), ZP (Zhapo, Yangjiang, China), XL (Xilian, Zhanjiang, China), and XC (Xincun, Lingshui, China). Fins were collected from each population in a random sampling procedure. These fins were stored in 75% alcohol at −20 °C for DNA extraction, with each group’s fins being stored separately. DNA was extracted using TIANamp Marine Animals DNA Kit (Tiangen, Beijing, China) following the manufacturer’s instructions [18]. DNA quality was determined via agarose gel electrophoresis and an Agilent 4200 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). Sequencing of PE libraries (2 × 150 bp) was performed on the MGI-2000/MGI-T7 platform (The Beijing Genomics Institute, Qingdao, China). All experiments in this study were approved by the Animal Care and Use Committee of the South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences (no. SCSFRI96-253), and performed according to the regulations and guidelines established by this committee.

2.2. Whole-Genome Resequencing and Variant Calling

Adaptor sequences and low-quality bases were filtered out from the raw reads using Fastp (v0.20.0, -n 10 -q 20 -u 40). The clean reads were mapped to the P. teira reference genome [19] using BWA v0.7.15 [20]. Mapping results were then converted into BAM format and sorted with SAMtools v1.3.1 [21]. The SNPs were filtered using the following criteria: (i) SNPs with QD < 2.0, FS > 60.0, MQ < 40.0, MQRankSum < −12.5, and ReadPosRankSum < −8.0; (ii) variant missing rate less than 0.1 and a per-genotype depth filter of >4 to exclude low-confidence genotype calls; (iii) minor allele frequency (MAF) > 0.01; and (iv) linkage disequilibrium pruning, performed using PLINK v1.9 [22]. Using a sliding window of 100 kb with a 10 kb step, we annotated and predicted the functional effects of the filtered SNPs based on the genome annotation file with snpEff v5.0 [23].

2.3. Genetic Diversity and Linkage Disequilibrium

Prior to analysis, each genotype with markers was assembled head to tail and missing sites were replaced by “-”. The parameters of genetic diversity were analyzed by VCFTOOLS v0.1.16 [24], including the expected heterozygosity (He), polymorphism information content (PIC), observed heterozygosity (Ho), nucleotide diversity (π), and Hardy–Weinberg equilibrium p-value (HW-P) (p < 0.05 indicates Hardy–Weinberg disequilibrium). We compared linkage disequilibrium (LD) patterns among different groups and LD decay was measured by calculating correlation coefficients (r2) for all pairs of SNPs within 500 kb using PopLDdecay v3.41 [25].

2.4. Genome-Wide Detection of ROH and Estimation of Inbreeding Coefficients

The PLINK v1.9 software command –homozyg was used for the detection of ROH. The parameters for ROH identification were set as follows, based on recommended practices in the literature [26]: –homozyg-kb 100 –homozyg-window-missing 5 –homozyg-window-threshold 0.05 –homozyg-window-het 3 –homozyg-window-snp 50 –homozyg-snp 50 –homozyg-density 50 –homozyg-gap 1000. The package R CMplot [27] was employed to generate visual representations of the results. The identified ROHs were classified into five distinct size categories as follows: 0.1 to 0.3 Mb, 0.3 to 0.6 Mb, 0.6 to 1 Mb, 1 to 2 Mb, and >2 Mb [28]. The total number of ROH for each animal, the length of ROH per population per chromosome, and the number of ROH per length category were calculated.
We employed the two following approaches to estimate inbreeding coefficients: FROH, derived from runs of homozygosity, and FHOM, calculated from the comparison of observed versus expected homozygosity. FROH estimates were obtained using the R package detect RUNS v0.9.6 [29]. FROH was defined based on the proportion of the total length of the genome that is within ROH, using the total length of the P. teira reference genome (697.98 Mb) [30]. The individual inbreeding coefficient FHOM was estimated using PLINK v1.9 with the –het option (plink –bfile dataset –het –out output_prefix).

2.5. Population Structure Analysis

The genetic differentiation coefficient (FST) between geographical groups was calculated using GENEPOP v.4.5 [31]. The Reynolds’ genetic distance (DR) between geographical groups was estimated by FST value according to DR = −ln (1 − FST). The neighbor-joining (NJ) tree was constructed and visualized using Mega11 software and an identity-by-state (IBS) kinship matrix, and the phylogenetic tree was further refined using the online tool iTOL (https://itol.embl.de/). Principal component analysis (PCA) based on genome-wide SNPs was conducted with PLINK, while population structure was further examined using ADMIXTURE v1.3.0 [32]. The likelihood of ancestral kinship (K) from 2 to 6 was tested using all SNPs. The IBS matrix was calculated by VCF2PCACluster V1.40 [33]. Ten different seeds were selected for 10 repeated analyses, and pong [34] was used to cluster the results 10 times according to the cross-validation error to determine the optimal K value.

3. Results

3.1. Overview of Whole-Genome Resequencing and Variant Detection

Whole-genome resequencing was conducted on 109 fish, generating a total of 10.68 billion reads. Of these, 99.2% successfully aligned to the reference genome using BWA, achieving an average sequencing depth of 19.5× (ranging from 13.0× to 29.3×), providing sufficient coverage for reliable downstream analyses. Following stringent quality filtering, we identified 5,384,029 SNPs across the 109 samples, with the majority residing in intronic (40.66%) and intergenic (30.74%) regions (Table 2). An examination of SNP distribution across the P. teira genome revealed that individual chromosomes contained between 144,120 SNPs (chromosome 5) and 275,445 SNPs (chromosome 11) (Figure 2).

3.2. Genetic Diversity and Linkage Disequilibrium Analysis

We calculated the genetic diversity parameters for four P. teira populations based on the selected SNP loci. As shown in Table 3, Hardy–Weinberg equilibrium values (HW-P), observed heterozygosity (Ho), expected heterozygosity (He), polymorphism information content (PIC), and nucleotide diversity (π) were determined. Hardy–Weinberg equilibrium testing indicated that all farmed populations had HW-P values above 0.05, suggesting that genetic equilibrium was largely preserved across the P. teira populations. The expected heterozygosity (He) values varied from 0.242 in ZP to 0.257 in NA, and the observed heterozygosity (Ho) ranged from 0.253 (ZP) to 0.282 (XL). (PIC) measures the level of polymorphism at each SNP site, ranging from 0.160 (XL) to 0.172 (ZP). (π) was consistently low across populations (0.00155–0.00165), with the highest value observed in XC (π = 0.00165) and the lowest in XL (π = 0.00155). As shown in (Figure 3), all populations exhibited some level of linkage disequilibrium, with relatively similar decay rates. The average pairwise correlation coefficient (r2) was higher in XL than in other reference populations.

3.3. Genomic Distribution of ROH and Inbreeding Coefficients

As shown in Table 4, NA exhibited the highest total number of ROH (3522), significantly more than ZP (1900), XC (2000), and XL (1264). All populations showed the highest proportion of short ROH (0.1–0.3 Mb; 68.3–84.3%), with XL displaying the greatest predominance of short ROH (84.3%). Notably, NA had a significantly higher proportion of long ROH segments (>0.6 Mb; 14.1%) compared to XC (6.1%) and XL (5.6%). Particularly striking was the >2 Mb ROH fraction in population NA (1.3%), while XC showed only 0.2%. These results suggest that population NA may have experienced more recent inbreeding events. The sum number of the ROH of segments and the total length of ROH in Mb per fish are shown in Figure 4. Most individuals in NA had the highest number and longest length of ROH. In contrast, the lowest number of ROH and shortest length of ROH were observed in individuals of XL.
FROH values varied from 0.0172 to 0.0594 (Table 3), with the NA population showing the highest inbreeding levels and the XL population exhibiting the lowest. Consistent with these results, FHOM values followed a similar pattern across the analyzed populations (Figure 5).

3.4. Population Structure

To evaluate the degree of genetic difference between populations, fixation index (FST) values and genetic distances were estimated (Table 5). The highest divergence was observed between XL and NA (FST = 0.065) and the lowest between ZP and XC (FST = 0.021). PCA results (Figure 6a) show that ZP, XC, and NA were divided into three clusters with overlap. The XL distribution appears independent. In the phylogenetic tree (Figure 6b), individuals from the XC, ZP, and NA populations were extensively intermingled, forming a large cluster with poor bootstrap support at internal nodes. This pattern, indicative of high genetic similarity and recent shared ancestry, aligns with our field surveys confirming that ZP and NA initially sourced their broodstock from the XC population. Conversely, the XL population formed a distinct, well-supported monophyletic clade, confirming its independent genetic lineage. This genetic divergence was corroborated by the PCA, where XL formed a separate cluster along PC1. The Delta K results indicated that the optimal number of genetic clusters representing the most similar ancestral populations was at K = 4. The cross-validation error (CV error) reached its minimum at this value (Figure 6c), aligning with the phylogenetic tree analysis results. Moreover, the genetic structure inferred (Figure 6d) closely corresponded with the phylogenetic relationships and showed strong concordance with the clustering patterns observed in the PCA plot.

4. Discussion

The South China Sea represents the main aquaculture region for P. teira. To investigate the genetic diversity, population structure, and inbreeding status of the major cultured populations in Guangdong and Hainan Provinces, this study performed whole-genome resequencing on four populations (NA, ZP, XL, XC) from southern China. By utilizing high-density SNP markers, we provide a comprehensive assessment of the species’ genetic status.
All four populations exhibited low levels of genetic diversity, as indicated by the observed heterozygosity (Ho: 0.253–0.282) and nucleotide diversity (π: 0.00155–0.00165); π values below 0.005 reflect low nucleotide diversity. Similarly, the polymorphism information content (PIC) ranged from 0.160 to 0.172 across the populations. As values below 0.25 are indicative of low polymorphism, these results collectively confirm reduced genetic variation in all four cultured populations. These results indicate that the farmed populations in this study may have been degraded to varying degrees, and effective supplementation of farmed populations are urgently needed. Low genetic diversity has also been observed in other cultured species, such as the Coilia nasus [35]. In all populations, Ho was higher than He (Ho > He). This phenomenon has also been observed in other farmed marine fish species, such as sea bream (Sparus aurata) and European seabass (D. labrax) [36]. In a finite population, it is expected that there will be random differences between the allele frequencies between both sexes, and this generates an excess of Ho with respect to those expected with He [37]. The relatively slow decay of LD in these farmed populations was most likely caused by inbreeding within each strain, although population structure may also have contributed to this. However, the potential cause of the high linkage disequilibrium observed in XL and NA might be attributed to the lower coverage depth.
The number and length of runs of homozygosity (ROH) were markedly higher in NA compared to the other populations, and the FROH values were also highest in NA (up to 0.0594), indicating strong inbreeding. ROH are reliable indicators of both historical and recent inbreeding [38]. The presence of long ROH (>2 Mb) in NA suggests recent inbreeding events, likely resulting from repeated use of close relatives or a small breeding nucleus. High levels of inbreeding and ROH accumulation have been reported in other aquaculture species, such as westslope cutthroat trout (Oncorhynchus lewisi) [39]; the XL population exhibited the lowest FROH and FHOM values, with shorter and fewer ROH segments, suggesting lower inbreeding levels and better genetic status. This population may serve as a valuable genetic resource for future selective breeding.
An analysis of the genetic differentiation index revealed a certain degree of genetic differentiation among the populations. The genetic differentiation coefficient (FST = 0.065) and genetic distance (DR = 0.067) between the NA and XL populations were the highest, indicating significant genetic differences between these two populations. FST values between 0.05 and 0.15 represent moderate differentiation, except for NA and XL and XL and ZP populations (FST = 0.051); other pairwise comparisons showed moderate genetic differentiation, with FST values ranging from 0.021 to 0.039. Among them, genetic differentiation among XC, ZP, and NA were particularly low (FST values: 0.021, 0.029, and 0.026, respectively). The observed admixture patterns may reflect historical exchanges of fry among farming sites in Hainan Province; however, the exact sources remain uncertain and require further validation. According to our previous investigation, the initial breeding stock of XC originated from the wild population in the South China Sea near the Hainan Island of China. Similar trends have been reported in cultured populations of other fish (O. niloticus) [40]. In this study, genetic differentiation among three populations was particularly low, probably due to their origin from one lake. The assignment of XC, NA, and ZP in the same cluster according to both PCA and population structure analysis provides further support for the aforementioned hypothesis. In contrast, XL was relatively independent.
This study revealed the urgent need for improved genetic management in P. teira aquaculture. The low genetic diversity, moderate differentiation, and uneven inbreeding levels raise concerns about long-term sustainability, particularly in populations such as NA. To enhance genetic health and support future breeding programs, we recommend the following: (i) increasing broodstock size and ensure periodic introduction of wild individuals or genetically diverse stocks; (ii) establishing regional broodstock management protocols to avoid inbreeding and genetic drift; and (iii) although XL and ZP showed relatively higher genetic diversity, which makes them promising candidates for breeding programs, further validation with larger sample sizes and inclusion of wild populations is necessary before making definitive recommendations. Genomic-assisted breeding strategies, as applied in Oreochromis, Salmo, and Sparus species [41], can be adapted for P. teira to accelerate genetic improvement and maintain population resilience under intensive farming. Despite the insights provided, this study has some limitations. First, the sample size per population was limited, which may affect the accuracy of diversity and structure estimates. Second, only cultured populations were analyzed; the inclusion of wild populations would provide a more comprehensive view of domestication effects. Future research should incorporate wild populations, long-term temporal samples, and integrate phenotypic data to link genomic patterns with economically important traits.

5. Conclusions

This study provides a genome-wide evaluation of the genetic diversity, population structure, and inbreeding levels of four P. teira farmed populations from southern China. Our findings revealed generally low genetic diversity and varying degrees of inbreeding across the groups. Notably, the XL population showed relative genetic independence, while the other three populations (XC, ZP, NA) exhibited significant genetic admixture, which may be attributed to their origin—partial fry stocks were likely derived from Xincun Village in Lingshui City. The NA population showed signs of recent inbreeding, evidenced by a higher number and longer stretches of homozygous regions, while the XL population maintained relatively greater genetic variability and lower inbreeding risk. It should be noted that our sampling did not include wild populations and therefore the extent to which farming practices alone contributed to the reduced diversity cannot be conclusively determined. These results highlighted the need for improved genetic management in P. teira aquaculture. Strategies such as expanding broodstock sources, incorporating wild genetic material, and implementing regular genomic monitoring should be considered to maintain genetic health and breeding efficiency. By integrating genomic tools into selective breeding programs, the long-term sustainability and productivity of P. teira aquaculture can be better secured, contributing to both economic development and conservation of genetic resources in the South China Sea region.

Author Contributions

All authors contributed to the study conception and design. Material preparation, data collection, and analysis were performed by D.Z., Y.G., B.L., H.G., K.Z., N.Z., T.Z. and L.X. The first draft of the manuscript was written by Y.G. and all authors commented on previous versions of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Hainan Province Science and Technology Special Fund (ZDYF2024XDNY231), the Central Public-Interest Scientific Institution Basal Research Fund, CAFS (no. 2023TD33), and the Seed Industry Revitalization Project of Special Fund for Rural Revitalization Strategy in Guangdong Province (2022SBH00002).

Institutional Review Board Statement

The Committee of the South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences (no. SCSFRI96-263) approved the animal protocols, and all experiments were performed under the applicable standards. The approval for this study was granted on 14 May 2023.

Informed Consent Statement

This study did not involve humans.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors have no relevant financial or non-financial interests to disclose.

References

  1. Guo, H.; Liu, M.; Gao, J.; Zhu, K.; Liu, B.; Guo, L.; Zhang, N.; Sun, J.; Zeng, C.; Yang, J.; et al. Development of vertebral column and appendicular skeleton in larvae and juveniles of (Platax teira). South China Fish. Sci. 2022, 18, 93–99. [Google Scholar] [CrossRef]
  2. Rousset, F. Genetic differentiation and estimation of gene flow from F-statistics under isolation by distance. Genetics 1997, 145, 1219–1228. [Google Scholar] [CrossRef] [PubMed]
  3. Leu, M.-Y.; Tai, K.-Y.; Meng, P.-J.; Tang, C.-H.; Wang, P.-H.; Tew, K.S. Embryonic, larval and juvenile development of the longfin batfish, Platax teira (Forsskål, 1775) under controlled conditions with special regard to mitigate cannibalism for larviculture. Aquaculture 2018, 493, 204–213. [Google Scholar] [CrossRef]
  4. Liu, M.; Guo, H.; Gao, J.; Zhu, K.; Liu, B.; Guo, L.; Zhang, N.; Yang, J.; Liu, B.; Zhang, D. Embryonic development and morphological characteristics of larvae and juvenile of (Platax teira). South China Fish. Sci. 2022, 18, 103–111. [Google Scholar] [CrossRef]
  5. Liu, M.-J.; Gao, J.; Guo, H.-Y.; Zhu, K.-C.; Liu, B.-S.; Zhang, N.; Sun, J.-H.; Zhang, D.-C. Transcriptomics Reveal the Effects of Breeding Temperature on Growth and Metabolism in the Early Developmental Stage of Platax teira. Biology 2023, 12, 1161. [Google Scholar] [CrossRef]
  6. Liu, B.; Guo, H.-Y.; Zhu, K.-C.; Liu, B.-S.; Guo, L.; Zhang, N.; Jiang, S.-G.; Zhang, D.-C. Nutritional compositions in different parts of muscle in the longfin batfish, Platax teira (Forsskål, 1775). J. Appl. Anim. Res. 2019, 47, 403–407. [Google Scholar] [CrossRef]
  7. Chiu, P.-S.; Chu, Y.-T.; Huang, C.-H.; Ho, S.-W.; Huang, J.-W.; Yeh, S.-L. Effects of stocking density on growth performance, survival and size heterogeneity of juvenile longfin batfish Platax teira. Aquac. Res. 2020, 51, 5269–5272. [Google Scholar] [CrossRef]
  8. Li, S.; Xie, Z.; Chen, P.; Tang, J.; Tang, L.; Chen, H.; Wang, D.; Zhang, Y.; Lin, H. The complete mitochondrial genome of the Platax teira (Osteichthyes: Ephippidae). Mitochondrial DNA Part A DNA Mapp. Seq. Anal. 2016, 27, 796–797. [Google Scholar] [CrossRef]
  9. An, H.S.; Byun, S.G.; Kim, Y.C.; Lee, J.W.; Myeong, J.I. Wild and hatchery populations of Korean starry flounder (Platichthys stellatus) compared using microsatellite DNA markers. Int. J. Mol. Sci. 2011, 12, 9189–9202. [Google Scholar] [CrossRef]
  10. Hughes, A.R.; Inouye, B.D.; Johnson, M.T.; Underwood, N.; Vellend, M. Ecological consequences of genetic diversity. Ecol. Lett. 2008, 11, 609–623. [Google Scholar] [CrossRef]
  11. Lind, C.E.; Evans, B.S.; Knauer, J.; Taylor, J.J.U.; Jerry, D.R. Decreased genetic diversity and a reduced effective population size in cultured silver-lipped pearl oysters (Pinctada maxima). Aquaculture 2009, 286, 12–19. [Google Scholar] [CrossRef]
  12. Dong, Y.; Li, Q.; Zhong, X.; Kong, L. Development of Gene-derived SNP Markers and Their Application for the Assessment of Genetic Diversity in Wild and Cultured Populations in Sea Cucumber, Apostichopus japonicus. J. World Aquac. Soc. 2016, 47, 873–888. [Google Scholar] [CrossRef]
  13. Yu, D.; Wang, H.; Gu, W.; Qin, T.; Zheng, H. Genetic diversity and population structure of popcorn germplasm resources using genome-wide SNPs through genotyping-by-sequencing. Genet. Resour. Crop Evol. 2021, 68, 2379–2389. [Google Scholar] [CrossRef]
  14. Vandeputte, M.; Gagnaire, P.A.; Allal, F. The European sea bass: A key marine fish model in the wild and in aquaculture. Anim. Genet. 2019, 50, 195–206. [Google Scholar] [CrossRef] [PubMed]
  15. Wang, L.; Meng, Z.; Liu, X.; Zhang, Y.; Lin, H. Genetic Diversity an A program for annotating ad Differentiation of the Orange-Spotted Grouper (Epinephelus coioides) Between and Within Cultured Stocks and Wild Populations Inferred from Microsatellite DNA Analysis. Int. J. Mol. Sci. 2011, 12, 4378–4394. [Google Scholar] [CrossRef]
  16. Cádiz, M.I.; López, M.E.; Díaz-Domínguez, D.; Cáceres, G.; Yoshida, G.M.; Gomez-Uchida, D.; Yáñez, J.M. Whole genome re-sequencing reveals recent signatures of selection in three strains of farmed Nile tilapia (Oreochromis niloticus). Sci. Rep. 2020, 10, 11514. [Google Scholar] [CrossRef]
  17. Longo, A.; Kurta, K.; Vanhala, T.; Jeuthe, H.; de Koning, D.-J.; Palaiokostas, C. Genetic diversity patterns in farmed rainbow trout (Oncorhynchus mykiss) populations using genome-wide SNP and haplotype data. Anim. Genet. 2024, 55, 87–98. [Google Scholar] [CrossRef]
  18. Tiangen Biotech. TIANamp Marine Animals DNA Kit Handbook; Tiangen Biotech (Beijing) Co., Ltd.: Beijing, China; Available online: https://www.tiangen.com/product/ (accessed on 15 October 2025).
  19. Ouyang, Y.; Pan, J.; Xian, L.; Liu, B.; Guo, H.; Zhu, T.; Zhang, N.; Zhu, K.; Zhang, D. Chromosome-level genome and characteristic analysis of (Platax teira). South China Fish. Sci. 2024, 20, 31–42. [Google Scholar] [CrossRef]
  20. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef] [PubMed]
  21. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef]
  22. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef]
  23. Cingolani, P.; Platts, A.; Wang, L.L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.J.; Lu, X.; Ruden, D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 2012, 6, 80–92. [Google Scholar] [CrossRef]
  24. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef] [PubMed]
  25. Zhang, C.; Dong, S.S.; Xu, J.Y.; He, W.M.; Yang, T.L. PopLDdecay: A fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 2019, 35, 1786–1788. [Google Scholar] [CrossRef] [PubMed]
  26. Ceballos, F.C.; Hazelhurst, S.; Ramsay, M. Assessing runs of Homozygosity: A comparison of SNP Array and whole genome sequence low coverage data. BMC Genom. 2018, 19, 106. [Google Scholar] [CrossRef] [PubMed]
  27. Yin, L.; Zhang, H.; Tang, Z.; Xu, J.; Yin, D.; Zhang, Z.; Yuan, X.; Zhu, M.; Zhao, S.; Li, X.; et al. rMVP: A Memory-Efficient, Visualization-Enhanced, and Parallel-Accelerated Tool for Genome-Wide Association Study. Genom. Proteom. Bioinform. 2021, 19, 619–628. [Google Scholar] [CrossRef]
  28. Sun, X.; Guo, J.; Li, R.; Zhang, H.; Zhang, Y.; Liu, G.E.; Emu, Q.; Zhang, H. Whole-Genome Resequencing Reveals Genetic Diversity and Wool Trait-Related Genes in Liangshan Semi-Fine-Wool Sheep. Animals 2024, 14, 444. [Google Scholar] [CrossRef]
  29. Biscarini, F.; Cozzi, P.; Gaspa, G.; Marras, G. detectRUNS: An R Package to Detect Runs of Homozygosity and Heterozygosity in Diploid Genomes; R foundation: Vienna, Austria, 2019. [Google Scholar]
  30. McQuillan, R.; Leutenegger, A.L.; Abdel-Rahman, R.; Franklin, C.S.; Pericic, M.; Barac-Lauc, L.; Smolej-Narancic, N.; Janicijevic, B.; Polasek, O.; Tenesa, A.; et al. Runs of homozygosity in European populations. Am. J. Hum. Genet. 2008, 83, 359–372. [Google Scholar] [CrossRef]
  31. Rousset, F. genepop’007: A complete re-implementation of the genepop software for Windows and Linux. Mol. Ecol. Resour. 2008, 8, 103–106. [Google Scholar] [CrossRef]
  32. Alexander, D.H.; Novembre, J.; Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009, 19, 1655–1664. [Google Scholar] [CrossRef] [PubMed]
  33. He, W.; Xu, L.; Wang, J.X.; Yue, Z.; Jing, Y.; Tai, S.; Yang, J.; Fang, X. VCF2PCACluster: A simple, fast and memory-efficient tool forprincipal component analysis of tens of millions of SNPs. BMC Bioinform. 2024, 25, 173. [Google Scholar] [CrossRef]
  34. Behr, A.A.; Liu, K.Z.; Liu-Fang, G.; Nakka, P.; Ramachandran, S. pong: Fast analysis and visualization of latent clusters in population genetic data. Bioinformatics 2016, 32, 2817–2823. [Google Scholar] [CrossRef] [PubMed]
  35. Zhang, S.; Xu, Z.; Luo, L.; Gu, S.; Hu, Z.; Wan, S.; Gao, Z. Genetic Diversity and Population Structure of Coilia nasus Revealed by 2b-RAD Sequencing. Biology 2023, 12, 600. [Google Scholar] [CrossRef]
  36. Villanueva, B.; Fernández, A.; Peiró-Pastor, R.; Peñaloza, C.; Houston, R.D.; Sonesson, A.K.; Tsigenopoulos, C.S.; Bargelloni, L.; Gamsız, K.; Karahan, B.; et al. Population structure and genetic variability in wild and farmed Mediterranean populations of gilthead seabream and European seabass inferred from a 60K combined species SNP array. Aquac. Rep. 2022, 24, 101145. [Google Scholar] [CrossRef]
  37. Caballero, A. Quantitative Genetics; Cambridge University Press: Cambridge, UK, 2020. [Google Scholar]
  38. Purfield, D.C.; Berry, D.P.; McParland, S.; Bradley, D.G. Runs of homozygosity and population history in cattle. BMC Genet. 2012, 13, 70. [Google Scholar] [CrossRef] [PubMed]
  39. Bell, D.A.; Carim, K.J.; Kovach, R.; Eby, L.A.; Barfoot, C.; Painter, S.; Lodmell, A.; Amish, S.J.; Smith, S.; Rosenthal, L.; et al. Genomic Insights Into Inbreeding and Adaptive Divergence of Trout Populations to Inform Genetic Rescue. Evol. Appl. 2025, 18, e70090. [Google Scholar] [CrossRef]
  40. Kajungiro, R.A.; Palaiokostas, C.; Pinto, F.A.L.; Mmochi, A.J.; Mtolera, M.; Houston, R.D.; de Koning, D.J. Population Structure and Genetic Diversity of Nile Tilapia (Oreochromis niloticus) Strains Cultured in Tanzania. Front. Genet. 2019, 10, 1269. [Google Scholar] [CrossRef]
  41. Houston, R.D.; Bean, T.P.; Macqueen, D.J.; Gundappa, M.K.; Jin, Y.H.; Jenkins, T.L.; Selly, S.L.C.; Martin, S.A.M.; Stevens, J.R.; Santos, E.M.; et al. Harnessing genomics to fast-track genetic improvement in aquaculture. Nat. Rev. Genet. 2020, 21, 389–409. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Map showing the sampling locations of the four cultured P. teira populations in this study. NA: Nanao, Guangdong; ZP: Zhapo, Guangdong; XL: Xilian, Guangdong; XC: Xincun, Hainan.
Figure 1. Map showing the sampling locations of the four cultured P. teira populations in this study. NA: Nanao, Guangdong; ZP: Zhapo, Guangdong; XL: Xilian, Guangdong; XC: Xincun, Hainan.
Genes 16 01254 g001
Figure 2. The genetic information of P. teira. The identification of high-quality SNPs and their distribution across 24 chromosomes of P. teira. The gradient colors from green to red indicate an increase in SNP density.
Figure 2. The genetic information of P. teira. The identification of high-quality SNPs and their distribution across 24 chromosomes of P. teira. The gradient colors from green to red indicate an increase in SNP density.
Genes 16 01254 g002
Figure 3. Average LD decay rates across all 24 P. teira chromosomes. The y-axis represents the average pairwise correlation coefficient (r2), while the x-axis represents the physical distance between SNPs in kilobytes (Kb). Each colored line represents the LD decay pattern of a distinct population.
Figure 3. Average LD decay rates across all 24 P. teira chromosomes. The y-axis represents the average pairwise correlation coefficient (r2), while the x-axis represents the physical distance between SNPs in kilobytes (Kb). Each colored line represents the LD decay pattern of a distinct population.
Genes 16 01254 g003
Figure 4. Sum number of homozygosity (ROH) of segment and total length of ROH in Mb per fish of four populations.
Figure 4. Sum number of homozygosity (ROH) of segment and total length of ROH in Mb per fish of four populations.
Genes 16 01254 g004
Figure 5. Inbreeding coefficient in four populations. (a) Inbreeding coefficient based on ROH. (b) Inbreeding coefficient based on observed and expected homozygosity.
Figure 5. Inbreeding coefficient in four populations. (a) Inbreeding coefficient based on ROH. (b) Inbreeding coefficient based on observed and expected homozygosity.
Genes 16 01254 g005
Figure 6. Analysis of population genetic structure. (a) The principal component analysis of 109 P. teira samples from 4 populations; (b) A tree was generated using the neighbor-joining method with 1000 bootstrap replicates on 109 P. teira samples from 4 populations. Different colors represent different populations. (c) Cross-validation errors in admixture analysis. X-axis represents K-values and y-axis represents corresponding cross-validation errors. Error was lowest at K = 4. (d) Population structure. Colors in each fragment represent the proportion of K = 2–6 ancestral populations assigned to individual genomes. Each accession is represented by a bar, and the length of each colored segment in the bar represents the proportion contributed by that ancestral population.
Figure 6. Analysis of population genetic structure. (a) The principal component analysis of 109 P. teira samples from 4 populations; (b) A tree was generated using the neighbor-joining method with 1000 bootstrap replicates on 109 P. teira samples from 4 populations. Different colors represent different populations. (c) Cross-validation errors in admixture analysis. X-axis represents K-values and y-axis represents corresponding cross-validation errors. Error was lowest at K = 4. (d) Population structure. Colors in each fragment represent the proportion of K = 2–6 ancestral populations assigned to individual genomes. Each accession is represented by a bar, and the length of each colored segment in the bar represents the proportion contributed by that ancestral population.
Genes 16 01254 g006
Table 1. Information on P. teira samples. Population names correspond to their collection sites, comprising NA, ZP, XL, and XC.
Table 1. Information on P. teira samples. Population names correspond to their collection sites, comprising NA, ZP, XL, and XC.
LocationAbbreviationCoordinateSample Size
Nanao, ShenzhenNA114.31° E, 22.33° N30
Zhapo, YangjiangZP111.80° E, 21.57° N30
Xilian, ZhanjiangXL109.90° E, 20.42° N25
Xincun, LingshuiXC109.97° E, 18.40° N24
Table 2. SNP annotation by genomic region.
Table 2. SNP annotation by genomic region.
Genomic RegionCountPercentage (%)
Intergenic1,655,03630.74
Upstream762,89114.17
Intro2,189,17840.66
Exon207,6373.86
Splice19,9680.37
Downstream549,31910.20
Total5,384,029100
Table 3. Genetic diversity analysis of the four populations.
Table 3. Genetic diversity analysis of the four populations.
PopulationSNP (Num)HoHePICHardy–Weinberg (HWP)πFROHFHOM
NA4,225,6990.2690.2570.1640.77440.001580.0590.003
ZP4,667,1550.2530.2420.1720.78760.001640.028−0.030
XL4,224,0800.2820.2520.1600.77090.001550.017−0.034
XC4,625,2290.2540.2450.1710.80250.001650.029−0.052
Table 4. Summary statistics of ROH in four P. teira populations.
Table 4. Summary statistics of ROH in four P. teira populations.
CountProportion (%)
Insert Size (MB)NAXCXLZPNAXCXLZP
0.1–0.3 MB240616151065147368.380.884.377.5
0.3–0.6 MB62026312822617.613.210.111.9
0.6–1 MB2548340887.24.23.24.6
1–2 MB1953524885.51.81.94.6
>2 MB4747251.30.20.61.3
Total3522200012641900
Table 5. Genetic differentiation FST value (below diagonal) and genetic distance (above diagonal) among four P. teira populations.
Table 5. Genetic differentiation FST value (below diagonal) and genetic distance (above diagonal) among four P. teira populations.
PopulationNAZPXLXC
NA-0.0260.0670.029
ZP0.026-0.0520.021
XL0.0650.051-0.040
XC0.0290.0210.039-
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gao, Y.; Liu, B.; Guo, H.; Zhu, K.; Xian, L.; Zhang, N.; Zhu, T.; Zhang, D. Genetic Diversity and Population Structure of Farmed Longfin Batfish (Platax teira) in the South China Sea. Genes 2025, 16, 1254. https://doi.org/10.3390/genes16111254

AMA Style

Gao Y, Liu B, Guo H, Zhu K, Xian L, Zhang N, Zhu T, Zhang D. Genetic Diversity and Population Structure of Farmed Longfin Batfish (Platax teira) in the South China Sea. Genes. 2025; 16(11):1254. https://doi.org/10.3390/genes16111254

Chicago/Turabian Style

Gao, Yayang, Baosuo Liu, Huayang Guo, Kecheng Zhu, Lin Xian, Nan Zhang, Tengfei Zhu, and Dianchang Zhang. 2025. "Genetic Diversity and Population Structure of Farmed Longfin Batfish (Platax teira) in the South China Sea" Genes 16, no. 11: 1254. https://doi.org/10.3390/genes16111254

APA Style

Gao, Y., Liu, B., Guo, H., Zhu, K., Xian, L., Zhang, N., Zhu, T., & Zhang, D. (2025). Genetic Diversity and Population Structure of Farmed Longfin Batfish (Platax teira) in the South China Sea. Genes, 16(11), 1254. https://doi.org/10.3390/genes16111254

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop