Next Article in Journal
Protective Effects of 3′-Epilutein and 3′-Oxolutein against Glutamate-Induced Neuronal Damage
Previous Article in Journal
Comparative Biochemical, Structural, and Functional Analysis of Recombinant Phospholipases D from Three Loxosceles Spider Venoms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assembly of Genome and Resequencing Provide Insights into Genetic Differentiation between Parents of Hulong Hybrid Grouper (Epinephelus fuscoguttatus× E. lanceolatus ♂)

1
State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Provincial Key Laboratory for Aquatic Economic Animals, Life Sciences School, Sun Yat-sen University, Guangzhou 510275, China
2
Key Laboratory of Tropical Marine Fish Germplasm Innovation and Utilization, Ministry of Agriculture, Sanya 572025, China
3
Key Laboratory of Tropical Aquatic Germplasm of Hainan Province, Sanya 572025, China
4
Southern Laboratory of Ocean Science and Engineering, Zhuhai 519000, China
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(15), 12007; https://doi.org/10.3390/ijms241512007
Submission received: 7 June 2023 / Revised: 22 July 2023 / Accepted: 25 July 2023 / Published: 26 July 2023
(This article belongs to the Section Molecular Genetics and Genomics)

Abstract

:
The Hulong hybrid grouper was bred from the brown-marbled grouper (Epinephelus fuscoguttatus) and the giant grouper (E. lanceolatus) ♂, combining the advantageous traits of both parents. Possessing an excellent performance, this hybrid’s cultivation promotes the development of the grouper industry. Its male parent, the giant grouper, possesses the fastest growth and the largest body size among all coral-reef-dwelling fish. This species is not only an economically important species in marine aquaculture, but it is also an ideal male parent in the interspecific crossing of grouper species. In the present study, a high-quality chromosome-level genome of the giant grouper was constructed with a total length of 1.06 Gb, consisting of 24 chromosomes and 69 scaffolds. To analyze the genetic differences between the parents of the Hulong hybrid grouper, the structural variations (SVs) between both parental genomes were detected, and a total of 46,643 SVs were obtained. High-quality SNPs were identified from resequencing data. There were significant differences between the two genomes, and the average FST reached 0.685. A total of 234 highly differentiated regions were detected with an FST > 0.9. The protein-coding genes involved in SVs and highly differentiated regions were significantly enriched in metabolic pathways, including fatty metabolism, carbohydrate metabolism, amino acid metabolism and the TCA cycle. These genes may be related to the differences in feeding preferences and the ability to digest carbohydrates between the two grouper species under natural conditions. In addition, protein-coding genes related to the cell cycle and p53-signaling pathway were also detected. These genes may play important roles in the regulation of body size and growth performance. This research provides genomic resources for further breeding works and evolutionary analyses.

1. Introduction

Groupers (Epinephelidae, perciformes) are coral-reef-dwelling fish species that are mostly distributed in the tropical and temperate marine areas. Groupers include more than 160 species in 16 genera [1], and, among them, approximately 47 grouper species have been cultured [2]. Groupers are popular in the markets of South and Southeast Asia due to their high nutrient value, tender meat and delicious taste. Due to the incomplete reproductive isolation and various grouper species, crossbreeding is the most common tool in the breeding of groupers, and many new hybrids have been obtained by means of artificial insemination. Among the hybrids, brown-marbled grouper (Epinephelus fuscoguttatus) ♀ × giant grouper (E. lanceolatus) ♂ is the most successful hybrid grouper. It possesses an excellent growth performance and strong stress resistance, and it greatly promotes the development of the grouper industry [3]. At present, the annual output of this hybrid accounts for more than 70% of the grouper market in China [4].
The giant grouper is the male parent of the Hulong hybrid grouper, and it is the largest grouper species in the world. Its length can reach 2.7 m and its weight can reach 400 kg [5]. The giant grouper has been widely applied as the male parent in other interspecific crossings of grouper species due to its excellent growth traits. To date, at least five hybrids, including kelp grouper (E. moara) ♀ × giant grouper ♂ [6], orange-spotted grouper (E. coioides) ♀ × giant grouper ♂ [7], brown-marbled grouper ♀ × giant grouper ♂ [8], humpback grouper (Cromileptes altivelis) ♀ × giant grouper ♂ [9] and red-spotted grouper (E. akaara) ♀ × giant grouper ♂ [10], have been reported. The growth traits of these hybrid groupers are all significantly higher than those of their female parents. Obviously, the genome of the giant grouper plays an important role in the growth advantage of hybrid groupers. The giant grouper and brown-marbled grouper are the parents of the Hulong hybrid grouper; however, there are significant differences in their sizes: the brown-marbled grouper weighs 4~12 kg, while the giant grouper can reach 400 kg. Analyses of the growth difference between the two grouper species contribute to understanding the genetic basis for growth advantages in giant groupers. Recent research reported some genetic analyses of the brown-marbled grouper and the giant grouper; high levels of genome synteny were revealed, and their divergence time was 10.49–22.50 Myr, which are considered pivotal factors for the success of hybrid groupers [11]. An F2 segregation population of the Hulong hybrid grouper was applied to analyze the genetic basis of the growth advantage, and several genes involved in energy metabolism, development, stress tolerance, the immune response and visual perception were found to be associated with growth traits in the juvenile stage of the giant grouper [12].
In order to provide useful genetic information for the conservation and breeding of the giant grouper, a high-quality chromosome-level genome is essential. Although two genomes of the giant grouper have been assembled recently [4,13], there were some flaws in them. For Zhou et al.’s assembled genome, they only applied Illumina sequencing data, the contig N50 value was only 119.9 kb and the number of assembled scaffolds was 4207. The numerous gaps and fragments in the giant grouper genome destroy the integrity and lower the correctness of the genome, and hamper efforts at locating trait-related genes and markers. In Wang et al.’s assembled genome, the pseudochromosomes of the giant grouper were constructed based on the genetic linkage map of another grouper (orange-spotted grouper), where only 39.41% and 88.62% of the scaffolds and bases, respectively, could be anchored to pseudochromosomes. Thus, the genome of the giant grouper needs to be further improved.
In the present study, first, we assembled a high-quality chromosome-level giant grouper genome using the PacBio platform and Hi-C technology, and we predicted the gene structures; second, genomic differences between the giant grouper and the brown-marbled grouper were detected. These genetic differences were related to feeding preferences, the ability to digest carbohydrates and the cell cycle. This research provides a better genome and insights into the genetic differentiation between the brown-marbled grouper and giant grouper.

2. Results

2.1. Genome Assembly

The genome characteristics of the giant grouper were predicted using k-mer methods with Illumina data. A total of 169.44 Gb of high-quality data with Q20 > 96.99% and Q30 > 92.27% were obtained (Table S1). The heterozygosity and genome size were approximately 0.07% and 1.08 Gb, respectively. The repeat-sequence ratio and GC ratio were 26.08% and 41.20%, respectively (Figure S2).
A total of 134.42 Gb of PacBio-sequencing clean data were obtained, and the average length and N50 of the reads were 18.55 Kb and 29.14 Kb, respectively (Table S1). The assembled genome of the giant grouper was 1.06 Gb in length with a contig N50 of 24.33 Mb and 170 contigs. The integrality of the assembled genome was verified using BWA, CEGMA and BUSCO. A total of 99.31% of the Illumina clean reads were mapped onto the assembled genome, and, among them, 98.21% were properly mapped (Table S2). The CEGMA results show that 438 (95.63%) CEGs were present in the assembled genome, as well as 188 (75.81%) highly conserved CEGs (Table S3). The results from the BUSCO alignment showed that the assembled genome contained 4463 (97.36%) complete BUSCOs, among which 4357 (95.05%) were single-copy BUSCOs (Table S4).

2.2. Pseudochromosome Construction

A total of 1.06 Gb (99.96%) and 145 (81.46%) contigs were anchored into 24 chromosomes using Hi-C technology (Table S5 and Figure 1). Among the anchored contigs, 133 (91.72%) were anchored in the proper order. Ultimately, a total of 69 scaffolds were obtained, and N50 and N90 were 45,206,398 bp and 40,025,893 bp in length, respectively.

2.3. Genome Annotation

Non-coding RNAs, including micro-RNAs (miRNAs), ribosomal RNAs (rRNAs) and transfer RNAs (tRNAs), were predicted based on the Rfam database. In total, 835 miRNAs, 190 rRNAs and 1480 tRNAs were obtained. In addition, 1186 pseudogenes were predicted using GeneWise software [14].
In the giant grouper genome, a total of 1,000,756 repeat sequences were predicted with 322,857,005 bp, which accounted for 30.34% of the whole genome. Terminal inverted repeats (TIRs) were the largest group in the repeat sequences, and they included 480,765 sequences and accounted for 17.42% of the genome. We also detected 3074 simple-sequence repeats (SSRs), which accounted for 0.09% of the genome (Table S6).
Gene structures were predicted using ab initio, homologous sequencing and RNA-seq. Ultimately, 28,313 genes (Table S7 and Figure S3) were predicted, with an average gene length of 16,339.58 bp and an average exon length of 2650.08 bp (Table S8). A total of 27,107 (95.74%) genes were annotated based on annotation databases: 27,057 (95.56%) of them were mapped into the NR database, while 26,692 (94.27%) were mapped into the TrEMBL database (Table S9).

2.4. Analysis of Differences in Genomic Structure between the Brown-Marbled Grouper and Giant Grouper

A total of 46,643 structural variations were detected between the brown-marbled grouper and giant grouper genomes, including 4539 insertions with a total length of 1,661,757 bp, 3577 deletions with a total length of 1,035,628 bp, 171 tandem expansions with a total length of 460,709 bp, 5 tandem contractions with a total length of 6247 bp, 22,188 repeat expansions with a total length of 31,273,681 bp and 16,163 repeat contractions with a total length of 17,907,672 bp (Table S10). The variant-size histogram of the structural variations in the genomes shows that the sizes of the sequences are inversely proportional to their mutation ratios (Figure S4).
Based on annotations using Annovar software, deletions and insertions were mainly located in introns and intergenic regions (3648 and 3619, respectively). A total of 37 deletions and insertions were located in exons. Repeat mutations were mainly located in intergenic regions and exons (15,382 and 14,913, respectively). Tandem expansions and contractions were mainly located in intergenic regions (Figure 2).
The protein-coding genes that possessed repeat mutations in exons were enriched based on the KEGG database. These protein-coding genes were significantly enriched into metabolic pathways (Figure 3A), including lipid metabolism (glycerophospholipid metabolism, fatty acid metabolism and biosynthesis), amino acid metabolism (histidine, glutamate, alanine, aspartate and glutamate metabolism), glycometabolism (insulin secretion, fructose and mannose metabolism, N-glycan metabolism) and the TCA cycle (Figure 3B). In addition, some genes involved in the lysosome, HIF-1 signaling pathway, glycan metabolism and fructose and mannose metabolism were also enriched.

2.5. Analysis of Genetic Differentiation between the Brown-Marbled Grouper and Giant Grouper

A total of 3,611,851 SNPs were detected between the giant grouper and brown-marbled grouper genomes. The density of the SNPs was 293.59 bp/SNP. The SNP density of Chromosome 24 was the highest, reaching 233.94 bp/SNP, whereas the SNP density of Chromosome 18 was the lowest, at 332.41 bp/SNP. A genetic differentiation analysis between the two groupers was carried out. There were significant differences between the two genomes, and the average FST reached 0.685. The average FST of Chromosome 5 was the lowest, with a value of 0.674, whereas the average FST of Chromosome 2 was the highest, reaching 0.701. FST values were lower at both ends of each chromosome compared to the middle parts. A total of 234 regions with high genetic differentiation were detected (FST > 0.9, Figure 4). The genes located in these regions were annotated using Annovar, and 981 protein-coding genes were detected. Based on the KEGG enrichment analysis, numerous genes were enriched in metabolism pathways, such as starch and sucrose metabolism, fructose and mannose metabolism, galactose metabolism, the TCA cycle, pyruvate metabolism, the insulin-signaling pathway, glycolysis/gluconeogenesis and the AMPK-signaling pathway. In addition, some genes were involved in the cell cycle, the regulation of the actin cytoskeleton, the p53-signaling pathway and apoptosis (Figure 5).

3. Discussion

The giant grouper is the largest coral-reef-dwelling fish, and it has been widely used for crossbreeding as the male parent. The Hulong hybrid grouper is one of the most successful hybrids, bred from the brown-marbled grouper and giant grouper. It shows the advantageous traits of both parents, and it greatly promotes the development of the grouper industry. Here, a high-quality chromosome-level genome of the giant grouper was assembled, and the genomic differences between the parents of the Hulong hybrid grouper were analyzed.

3.1. Genome Assembly and Evolution Analyses

The chromosomal genome of the giant grouper was constructed using Illumina and PacBio sequencing and Hi-C technology. The assembled genome is 1.06 Gb in length, which is slightly smaller than Wang’s result (1.128 Gb) [4] and Zhou’s result (1.08 Gb) [13]. Compared to the other two published genomes, the new genome showed a higher quality [4,13]. For example, the contig length N50 of the new genome (24.33 Mb) is far higher than those of the other two genomes (119.9 kb and 1469.4 kb, respectively); the contig number of the new genome (170) is far less than those of the other two genomes (27,630 and 3207, respectively); the scaffold number of the new genome (69) is also far less than those of the other two published genomes (4207 and 3187, respectively); and the chromosomal anchor of the new genome is better than those of the other two published genomes. A high mapping ratio (98.21%) of Illumina clean reads against the assembled genome was observed, 438 (95.63%) CEGs and 188 (75.81%) highly conserved CEGs were present in the assembled genome, 4463 (97.36%) complete BUSCOs were identified and 4357 (95.05%) of them were single-copy BUSCOs. These results indicate the high quality of the genome. Recently, the brown-marbled grouper genome was published, which provided essential resources for the analysis of the genetic differences between the parents of the Hulong hybrid grouper.

3.2. Analysis of Differences in the Genomic Structure between the Brown-Marbled Grouper and Giant Grouper

Through interspecific collinearity analysis, a significant genome synteny between the brown-marbled grouper and giant grouper was observed, while the divergence time was relatively late [11], and lots of SVs between the brown-marbled grouper and giant grouper genomes were detected. Variations were mainly distributed in introns and intergenic regions, but numerous repeat mutations were located in exons. Recent reports have shown that repeats are closely related to gene expression. For example, thousands of repeats showing regulatory effects for protein-coding genes were identified and demonstrated in Caenorhabditis elegans [15]. A repeat mutation located in the cold-induced autoinflammatory syndrome 1 (CIAS1) gene was associated with hypertension [16]. From our results, the protein-coding genes involved in repeat mutations were found to be involved in multiple metabolic pathways, including lipid metabolism (glycerophospholipid metabolism, fatty acid metabolism and biosynthesis), amino acid metabolism (histidine, glutamate, alanine, aspartate and glutamate metabolism), glycometabolism (insulin secretion, fructose and mannose metabolism, N-glycan metabolism) and the TCA cycle. These pathways are crucial to cellular and organismal homeostasis and play important roles in energy utilization and storage and signal transduction. Fatty acids, amino acids and carbohydrates are necessary energy substances for animals, and differences in the metabolisms between the two species may result in different levels of efficiency in the assimilation of energy. The tricarboxylic acid (TCA) cycle is a critical route for oxidative phosphorylation, which is the main source of cellular energy and participates in many metabolic pathways in organisms [17]. The mutations may influence the efficiency of ATP production. Glycerophospholipids are the most abundant phospholipids in the body, and they not only constitute biological membranes, but are also components of bile and membrane surfactants. The participation of glycerophospholipids in membrane protein recognition and signal transduction is associated with many traits and diseases, such as diabetes, obesity, neurological diseases, fatty liver and stress resistance [18]. These variations in genome sequences may give rise to differences in multiple phenotypes.

3.3. Analysis of Genetic Differentiation between the Brown-Marbled Grouper and Giant Grouper

Recent research has shown that the divergence time of E. lanceolatus and E. fuscoguttatus is 10.49–22.50 Myr, and their genomes possessed a significant synteny [11]. However, there was a significant differentiation between the two genomes based on SNPs: the average FST reached 0.685. The regions with an FST > 0.9 were considered as demonstrating high genetic differentiation, and a total of 234 regions and 981 protein-coding genes were detected of this type. Based on the KEGG analysis, these genes were enriched in starch and sucrose metabolism, fructose and mannose metabolism, galactose metabolism, the TCA cycle, pyruvate metabolism, the insulin-signaling pathway, glycolysis/gluconeogenesis and the AMPK-signaling pathway. Numerous SNPs were located in genes related to carbohydrate metabolism. These SNPs may relate to the differences in feeding preferences and the ability to digest carbohydrates between the two grouper species under natural conditions. In a recent study, researchers explored the genetic basis of species differentiation between wolves and dogs, and they mentioned that key genes in the starch–sucrose-metabolism pathway and fat-metabolism pathway of dogs were selected to adapt to starch-based grain food in order to adapt to the farming environment of human beings in the process of evolution or domestication [19]. These genetic differences in glucose metabolism will be an important resource in future breeding works, which will be used to improve multiple production traits, such as growth, the artificial-feed-acclimation efficiency, the feed-conversion efficiency and the stocking density. Some protein-coding genes were related to the cell cycle and p53-signaling pathways, and these pathways are closely related to the regulation of the cell cycle and proliferation [20,21]. In teleost species, genes involved in pathways of the cell cycle and cell proliferation play important roles in the regulation of growth and development [22]. In rainbow trout, quantitative trait loci (QTLs) associated with bodyweight gain were detected using genome-wide association (GWA) analysis, and some genes involved in bodyweight gain were enriched in the cell cycle [23]. In broilers (Gallus gallus), researchers discovered that genes involved in the p53-signaling pathway are associated with the growth performance [24]. These genetic differences may be associated with the differences in body size between the brown-marbled grouper and giant grouper.

4. Materials and Methods

4.1. Sample Collection, Library Construction and Sequencing for Giant Grouper Genome Assembly

A giant grouper, E. lanceolatus (Figure S1), with a bodyweight of 227.0 g and a total length of 23.8 cm, was collected from Hainan Chenhai Aquatic Co., Ltd. (Hainan, China). The fish was immediately dissected after anesthesia with MS-222. White dorsal muscle tissue was sampled and immediately stored in liquid nitrogen, which was used for genomic DNA sequencing and Hi-C library construction. Moreover, ten tissues, including skin, muscle, liver, kidney, brain, intestine, fat, spleen, heart and gill tissues, were collected, immersed in RNAlater™ Stabilization Solution (Invitrogen, Carlsbad, CA, USA) and then stored at −80 °C for RNA isolation.
Total DNA was extracted from white muscle tissue with a TIANamp Marine Animals DNA Kit (Tiangen Biotech Co., Ltd., Beijing, China). The quality and quantity of the total DNA were determined via NanoDrop 2000 (Thermo Fisher Scientific Inc., Waltham, MA, USA). A paired-end sequencing library with an insert length of 350 bp was constructed using a TruSeq Nano DNA LT Library Preparation Kit (Illumina, San Diego, CA, USA). The obtained library was then sequenced using the Illumina HiSeq X Ten platform.
Genome DNA was broken into fragments using Covaris and recycled using AMpure PB beads (Pacific Biosciences, Menlo Park, CA, USA). A SMRTbell library was constructed using the SMRTbell Template Prep Kit (Pacific Biosciences, Menlo Park, CA, USA), according to the manufacturer’s instructions, and was sequenced using the PacBio Bioscience Sequel platform (Pacific Biosciences, Menlo Park, CA, USA).
Muscle samples were fixed with fresh paraformaldehyde, and then DNA–protein bonds were created. The Mbo I restriction enzyme was used to digest the DNA, and the overhanging 5′ ends of the DNA fragments were repaired with a biotinylated residue. The fragments close to each other in the nucleus during fixation were ligated, and the denatured proteins were removed. The Hi-C fragments were further sheared via sonication, and then pulled down with streptavidin beads. The library was sequenced on the Illumina HiSeq X Ten platform with PE150.
The total RNA of the 10 tissues (approximately 80 mg each) was extracted using RNAiso reagents (Takara, Dalian, China), following the manufacturer’s instructions. The quantity and quality of the RNA samples were determined using a microplate spectrophotometer (BioTek Company, Winooski, VT, USA), and electrophoresis was conducted using 1% agarose gel. The total RNA of the 10 tissues was mixed at equal amounts to generate a mixed RNA pool. RNA-seq libraries were prepared using the NEBNext UltraTM RNA Library Prep Kit (NEB, Ipswich, MA, USA), following the manufacturer’s protocols. The libraries’ qualities and quantities were measured using an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). Finally, the libraries were sequenced using the Illumina-Hiseq 2000 platform with PE-150.

4.2. Genome Assembly

Raw sequencing reads were filtered using Trimmomatic version 0.38 [25] to remove low-quality data. After filtering, the clean data were used to estimate the genome size and heterozygosity using k-mer methods [26] with 21-mer.
For the PacBio-sequencing data, after removing short polymerase reads (less than 500 bp) and reads with a barcode adapter, clean data were corrected and primarily assembled using Canu version 1.8 [27]. Wtdbg was used to assemble the genome by constructing a fuzzy Bruijn graph https://github.com/ruanjue/wtdbg (accessed on 31 July 2021). Quickmerge [28] was used to merge the assemblies produced using Canu and Wtdbg, and to produce a more contiguous assembly. In simple terms, contigs from Canu were used as input queries, contigs from Wtdbg were used as references and the two types of contigs were aligned through MUMmer version 4.0 [29]. The assembled genome was polished using Pilion version 1.22 [30]. For the purposes of estimating the genome completeness, Illumina data were mapped back to the assembled genome to calculate the mapping rate using BWA v0.7.17 [31]. The genome’s integrality was verified based on the Core Eukaryotic Gene Mapping Approach (CEGMA) [32] and the Benchmarking Universal Single-Copy Ortholog (BUSCO) [33] database.

4.3. Pseudochromosome Construction

The sequencing data of Hi-C were filtered to remove low-quality reads using Fastp version 0.12.6 [34]. The clean reads were aligned to the draft genome of the giant grouper using bowtie2 version 2.3.2 [35] with an end-to-end model and very sensitive parameters. The draft genome was broken into 50 Kb fragments and reassembled with the correct cluster, order and orientation of the contigs using Lachesis [36].

4.4. Genome Annotation

Repetitive regions of the giant grouper genome were identified via de novo and homology prediction. Transposable elements (TEs) were identified using LTR Finder version 1.05 (http://tlife.fudan.edu.cn/tlife/ltr_finder/) [37], RepeatScout v1.0.5 (http://www.repeatmasker.org) and PILER v1.0 [38]. The TEs were classified and annotated using PASTEClassifier version 1.0 using the TEdenovo pipeline [39]. Coding sequences were removed from the predicted repeat sequences through alignment to the SwissProt database using blastx with an e-value < 1 × 10−4, identity > 30, coverage > 30% and length > 90 bp. RepeatMasker v4.0.5 [40] was used to identify repeats based on the RepBase library version 19.06 [41] of known transposable elements (TEs).
Non-coding RNAs, including micro RNAs (miRNAs), ribosomal RNAs (rRNAs) and transfer RNAs (tRNAs), were predicted. tRNAs were predicted using tRNAscan-SE v1.3.1 [42]. microRNAs and rRNAs were predicted using Blast+ version 2.2.31 with an e-value < 1 × 10−5 based on the Rfam database [43]. miRNAs were predicted using Infenal version 1.1.1 [44] based on miRBase version 21 [45].
Protein-coding genes were predicted using three methods: de novo prediction, homologous-sequence prediction and RNA-seq-assisted methods. For de novo prediction, the genome without repeat regions was applied to generate gene structures using Genscan [46], Augustus version 2.4 [47], GlimmerHMM version 3.0.4 [48], GeneID version 1.4 [49] and SNAP [50]. Four fish species’ genomes and their annotation files, including tilapia (Oreochromis niloticus, GCF_001858045.2), zebra fish (Danio rerio, GCF_000002035.6), large yellow croaker (Larimichthys crocea, GCF_000972845.2) and Atlantic salmon (Salmo salar, GCF_000233375.1), were downloaded. The genome of the giant grouper was aligned to these genomes using GeMoMa [51] to obtain the exact exon, intron and splice sites [51]. RNA-seq data from ten tissues were aligned to the genome using HISAT2 version 2.0.4 [52], and assembled using Stringtie version 1.2.3 [53]. Open reading frames (ORFs) were predicted using PASA version 2.0.2 [54], TransDecoder version 2.0 https://github.com/TransDecoder/TransDecoder (accessed on 31 July 2021) and GeneMarkS-T version 5.1 [55]. The genes predicted using the three methods were merged using EVM http://evidencemodeler.sourceforge.net/ (accessed on 31 July 2021) [56].
The putative functions of predicted genes were annotated using aligned predicted genes against the NR [57], KOG [58], GO [59], KEGG [60] and TrEMBL [61] databases using Blast+ version 2.2.31.

4.5. Analysis of Genomic Structure between Brown-Marbled Grouper and Giant Grouper

Genomic structural variations, including deletion, insertion, tandem expansion and contraction and repeat expansion and contraction, were detected using nucmer in MUMmer 4.0.0 [62] and Assemblytics [63]. The giant grouper genome was aligned against the brown-marbled grouper genome [11], which served as a reference. The genes involved in variations were annotated using Annovar [64], with a parameter of neargene 2000.

4.6. Analysis of Genetic Differentiation

A total of 24 tail fins of 12 brown-marbled groupers (E. fuscoguttatus) and 12 giant groupers (E. lanceolatus) were used for genomic DNA isolation. Total DNA was extracted using a TIANamp Marine Animals DNA Kit (Tiangen Biotech Co., Ltd., Beijing, China). The quality and quantity of the total DNA were determined using a NanoDrop 2000 (Thermo Fisher Scientific Inc., Waltham, MA, USA). Paired-end sequencing libraries with an insert length of 350 bp were constructed using the Illumina Truseq DNA PCR free Library Preparation Kit (Illumina, San Diego, CA, USA). The obtained libraries were then sequenced using the Illumina HiSeq X Ten platform.
The raw data were filtered and trimmed using Trimmomatic [25]. Clean reads from each sample were aligned to the assembled genome using BWA. SNPs were called using BCFtools (version 1.10.2) [65] with mpileup, call and filter programs (https://github.com/samtools/bcftools). SNPs with a depth of <8, minor allele frequency (MAF) of <0.05, >20% missing positions, a minimum quality of <30 and a number of alleles of >2 were removed using VCFtools (version 0.1.16) [66]. The FST values were calculated using VCFtools with parameters including a window size of 100 Kb and an FST window step of 10 Kb.

Supplementary Materials

The supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms241512007/s1.

Author Contributions

Y.Y., Z.M. and X.L. designed the study. Y.Y., L.Z., T.W. and L.W. collected the samples. Y.Y., J.X. and T.W. performed the laboratory work. Y.Y., X.W. and Z.M. performed the analyses. J.X. and X.W. contributed technical assistance. Y.Y., Z.M. and X.L. drafted the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (U22A20531, 32273132), China Agriculture Research System of MOF and MARA (CARS-47), National Key Research and Development Program (2018YFD0900203) and Project of Hainan Academician Team Innovation Center (YSPTZX202122).

Institutional Review Board Statement

The animal study protocol was approved by the Institutional Animal Care and Use Committee (IACUC), Sun Yat-Sen University, Approval No. is SYSU-IACUC-2022-B0131.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data of the genome assembly were deposited in the Genome Sequence Archive (GSA): the project access number is PRJCA003546. The access numbers of the Illumina data for the survey, gene structure prediction and Hi-C are CRX172207, CRX166582 and CRX166581, respectively, and the access number of the PacBio data is CRX166583. The Illumina data of the resequencing of the brown-marbled grouper and giant grouper were deposited in the GSA with project access number PRJCA016709 and data access number CRA010914.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhuang, X.; Qu, M.; Zhang, X.; Ding, S. A Comprehensive Description and Evolutionary Analysis of 22 Grouper (Perciformes, Epinephelidae) Mitochondrial Genomes with Emphasis on Two Novel Genome Organizations. PLoS ONE 2013, 8, e73561. [Google Scholar] [CrossRef] [Green Version]
  2. Rimmer, M.A.; Glamuzina, B. A review of grouper (Family Serranidae: Subfamily Epinephelinae) aquaculture from a sustainability science perspective. Rev. Aquac. 2019, 11, 58–87. [Google Scholar] [CrossRef] [Green Version]
  3. Yang, Y.; Wu, L.; Wu, X.; Li, B.; Huang, W.; Weng, Z.; Lin, Z.; Song, L.; Guo, Y.; Meng, Z.; et al. Identification of Candidate Growth-Related SNPs and Genes Using GWAS in Brown-Marbled Grouper (Epinephelus fuscoguttatus). Mar. Biotechnol. 2020, 22, 153–166. [Google Scholar] [CrossRef]
  4. Wang, D.; Chen, X.; Zhang, X.; Li, J.; Yi, Y.; Bian, C.; Shi, Q.; Lin, H.; Li, S.; Zhang, Y.; et al. Whole Genome Sequencing of the Giant Grouper (Epinephelus lanceolatus) and High-Throughput Screening of Putative Antimicrobial Peptide Genes. Mar. Drugs 2019, 17, 503. [Google Scholar] [CrossRef] [Green Version]
  5. Williams, K.C. A review of feeding practices and nutritional requirements of postlarval groupers. Aquaculture 2009, 292, 141–152. [Google Scholar] [CrossRef]
  6. Chen, Z.F.; Tian, Y.S.; Wang, P.F.; Tang, J.; Liu, J.C.; Ma, W.H.; Li, W.S.; Wang, X.M.; Zhai, J.M. Embryonic and larval development of a hybrid between kelp grouper Epinephelus moara female x giant grouper E.lanceolatus male using cryopreserved sperm. Aquac. Res. 2018, 49, 1407–1413. [Google Scholar] [CrossRef]
  7. Tang, Z.J.; Chen, J.X.; Tang, L.; Chen, X.; Li, S.S.; Liu, Y.; Zhang, Y.; Lin, H.R.; Zhao, M. The complete mitochondrial genome of the hybrid grouper Epinephelus coioides female x Epinephelus akaara male with phylogenetic consideration. Mitochondrial DNA Part B Resour. 2017, 2, 31–32. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Sun, Y.; Guo, C.Y.; Wang, D.D.; Li, X.F.; Xiao, L.; Zhang, X.; You, X.; Shi, Q.; Hu, G.J.; Fang, C.; et al. Transcriptome analysis reveals the molecular mechanisms underlying growth superiority in a novel grouper hybrid (Epinephelus fuscogutatus female x E. lanceolatus male). BMC Genet. 2016, 17, 10. [Google Scholar] [CrossRef] [Green Version]
  9. Chen, J.X.; Ye, Z.F.; Yu, Z.S.; Wang, J.; Li, P.S.; Chen, X.; Liu, Y.; Zhang, Y. The complete mitochondrial genome of the hybrid grouper (Cromileptes altivelis female x Epinephelus lanceolatus male) with phylogenetic consideration. Mitochondrial DNA Part B Resour. 2017, 2, 171–172. [Google Scholar] [CrossRef] [Green Version]
  10. Guo, Y.; Li, W.P.; Liu, Y.L.; Li, P.S.; Wei, R.P.; Liu, Y.; Zhang, Y.; Xiao, L. The complete mitochondrial genome of the hybrid grouper Epinephelus akaara female x Epinephelus lanceolatus male. Mitochondrial DNA Part B Resour. 2018, 3, 599–600. [Google Scholar] [CrossRef] [Green Version]
  11. Yang, Y.; Wang, T.; Chen, J.; Wu, L.; Wu, X.; Zhang, W.; Luo, J.; Xia, J.; Meng, Z.; Liu, X. Whole-genome sequencing of brown-marbled grouper (Epinephelus fuscoguttatus) provides insights into adaptive evolution and growth differences. Mol. Ecol. Resour. 2022, 22, 711–723. [Google Scholar] [CrossRef] [PubMed]
  12. Yang, Y.; Wang, T.; Chen, J.; Wu, X.; Wu, L.; Zhang, W.; Luo, J.; Xia, J.; Meng, Z.; Liu, X. First construction of interspecific backcross grouper and genome-wide identification of their genetic variants associated with early growth. Aquaculture 2021, 545, 12. [Google Scholar] [CrossRef]
  13. Zhou, Q.; Gao, H.; Zhang, Y.; Fan, G.; Xu, H.; Zhai, J.; Xu, W.; Chen, Z.; Zhang, H.; Liu, S.; et al. A chromosome-level genome assembly of the giant grouper (Epinephelus lanceolatus) provides insights into its innate immunity and rapid growth. Mol. Ecol. Resour. 2019, 19, 1322–1332. [Google Scholar] [CrossRef]
  14. Birney, E.; Clamp, M.; Durbin, R. GeneWise and Genomewise. Genome Res. 2004, 14, 988–995. [Google Scholar] [CrossRef] [Green Version]
  15. Zhang, G.; Andersen, E.C. Interplay Between Polymorphic Short Tandem Repeats and Gene Expression Variation in Caenorhabditis elegans. Mol. Biol. Evol. 2023, 40, msad067. [Google Scholar] [CrossRef]
  16. Omi, T.; Kumada, M.; Kamesaki, T.; Okuda, H.; Munkhtulga, L.; Yanagisawa, Y.; Utsumi, N.; Gotoh, T.; Hata, A.; Soma, M.; et al. An intronic variable number of tandem repeat polymorphisms of the cold-induced autoinflammatory syndrome 1 (CIAS1) gene modifies gene expression and is associated with essential hypertension. Eur. J. Hum. Genet. 2006, 14, 1295–1305. [Google Scholar] [CrossRef] [Green Version]
  17. Kang, W.; Suzuki, M.; Saito, T.; Miyado, K. Emerging Role of TCA Cycle-Related Enzymes in Human Diseases. Int. J. Mol. Sci. 2021, 22, 13057. [Google Scholar] [CrossRef] [PubMed]
  18. Jiang, C.; Li, D.; Chen, L.; Liu, Y.; Zhao, Y.; Mei, G.; Tang, Y.; Yang, Y.; Yao, P.; Gao, C. Quercetin ameliorated cardiac injury via reducing inflammatory actions and the glycerophospholipid metabolism dysregulation in a diabetic cardiomyopathy mouse model. Food Funct. 2022, 13, 7847–7856. [Google Scholar] [CrossRef] [PubMed]
  19. Axelsson, E.; Ratnakumar, A.; Arendt, M.L.; Maqbool, K.; Webster, M.T.; Perloski, M.; Liberg, O.; Arnemo, J.M.; Hedhammar, A.; Lindblad-Toh, K. The genomic signature of dog domestication reveals adaptation to a starch-rich diet. Nature 2013, 495, 360–364. [Google Scholar] [CrossRef] [PubMed]
  20. Levine, A.J. p53, the cellular gatekeeper for growth and division. Cell 1997, 88, 323–331. [Google Scholar] [CrossRef] [Green Version]
  21. Guillouf, C.; Rosselli, F.; Krishnaraju, K.; Moustacchi, E.; Hoffman, B.; Liebermann, D.A. p53 involvement in control of G2 exit of the cell cycle: Role in DNA damage-induced apoptosis. Oncogene 1995, 10, 2263–2270. [Google Scholar] [PubMed]
  22. Johnston, I.A.; Bower, N.I.; Macqueen, D.J. Growth and the regulation of myotomal muscle mass in teleost fish. J. Exp. Biol. 2011, 214, 1617–1628. [Google Scholar] [CrossRef] [Green Version]
  23. Ali, A.; Al-Tobasei, R.; Lourenco, D.; Leeds, T.; Kenney, B.; Salem, M. Genome-wide identification of loci associated with growth in rainbow trout. BMC Genom. 2020, 21, 209. [Google Scholar] [CrossRef] [Green Version]
  24. Guo, Y.; Huang, H.; Zhang, Z.; Ma, Y.; Li, J.; Tang, H.; Ma, H.; Li, Z.; Li, W.; Liu, X.; et al. Genome-wide association study identifies SNPs for growth performance and serum indicators in Valgus-varus deformity broilers (Gallus gallus) using ddGBS sequencing. BMC Genom. 2022, 23, 26. [Google Scholar] [CrossRef]
  25. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Liu, B.; Shi, Y.; Yuan, J.; Galaxy, Y.; Zhang, H.; Li, N.; Li, Z.; Chen, Y.; Mu, D.; Parkin, I. Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects. Quant. Biol. 2013, 35, 62–67. [Google Scholar]
  27. Koren, S.; Walenz, B.P.; Berlin, K.; Miller, J.R.; Bergman, N.H.; Phillippy, A.M. Canu: Scalable and accurate long-read assembly via adaptive κ-mer weighting and repeat separation. Genome Res. 2017, 27, 722–736. [Google Scholar] [CrossRef] [Green Version]
  28. Chakraborty, M.; Baldwin-Brown, J.G.; Long, A.D.; Emerson, J.J. Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage. Nucleic Acids Res. 2016, 44, e147. [Google Scholar] [CrossRef] [Green Version]
  29. Kurtz, S.; Phillippy, A.; Delcher, A.L.; Smoot, M.; Shumway, M.; Antonescu, C.; Salzberg, S.L. Versatile and open software for comparing large genomes. Genome Biol. 2004, 5, R12. [Google Scholar]
  30. Walker, B.J.; Abeel, T.; Shea, T.; Priest, M.; Abouelliel, A.; Sakthikumar, S.; Cuomo, C.A.; Zeng, Q.; Wortman, J.; Young, S.K.; et al. Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 2014, 9, e0112963. [Google Scholar] [CrossRef]
  31. Li, H.; Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 2010, 26, 589–595. [Google Scholar] [CrossRef] [Green Version]
  32. Parra, G.; Bradnam, K.; Korf, I. CEGMA: A pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 2007, 23, 1061–1067. [Google Scholar] [CrossRef] [Green Version]
  33. Simão, F.A.; Waterhouse, R.M.; Ioannidis, P.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 2015, 31, 3210–3212. [Google Scholar] [CrossRef] [Green Version]
  34. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. Fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
  35. Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef] [Green Version]
  36. Burton, J.N.; Adey, A.; Patwardhan, R.P.; Qiu, R.; Kitzman, J.O.; Shendure, J. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nat. Biotechnol. 2013, 31, 1119–1125. [Google Scholar] [CrossRef] [PubMed]
  37. Xu, L.; Zhang, Y.; Su, Y.; Liu, L.; Yang, J.; Zhu, Y.; Li, C. Structure and evolution of full-length LTR retrotransposons in rice genome. Plant Syst. Evol. 2010, 287, 19–28. [Google Scholar] [CrossRef]
  38. Edgar, R.C.; Myers, E.W. PILER: Identification and classification of genomic repeats. Bioinformatics 2005, 21, i152–i158. [Google Scholar] [CrossRef] [PubMed]
  39. Hoede, C.; Arnoux, S.; Moisset, M.; Chaumier, T.; Inizan, O.; Jamilloux, V.; Quesneville, H. PASTEC: An automatic transposable element classification tool. PLoS ONE 2014, 9, e91929. [Google Scholar] [CrossRef] [Green Version]
  40. Tarailo-Graovac, M.; Chen, N. Using Repeat Masker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 2009, 25, 1–14. [Google Scholar] [CrossRef]
  41. Bao, W.; Kojima, K.K.; Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 2015, 6, 11. [Google Scholar] [CrossRef] [Green Version]
  42. Lowe, T.M.; Eddy, S.R. TRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1996, 25, 955–964. [Google Scholar] [CrossRef] [PubMed]
  43. Lagesen, K.; Hallin, P.; Rødland, E.A.; Staerfeldt, H.H.; Rognes, T.; Ussery, D.W. RNAmmer: Consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007, 35, 3100–3108. [Google Scholar] [CrossRef] [PubMed]
  44. Nawrocki, E.P.; Eddy, S.R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 2013, 29, 2933–2935. [Google Scholar] [CrossRef] [Green Version]
  45. Kozomara, A.; Birgaoanu, M.; Griffiths-Jones, S. MiRBase: From microRNA sequences to function. Nucleic Acids Res. 2019, 47, D155–D162. [Google Scholar] [CrossRef]
  46. Burge, C.; Karlin, S. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 1997, 268, 78–94. [Google Scholar] [CrossRef] [Green Version]
  47. Stanke, M.; Keller, O.; Gunduz, I.; Hayes, A.; Waack, S.; Morgenstern, B. AUGUSTUS: A b initio prediction of alternative transcripts. Nucleic Acids Res. 2006, 34, W435–W439. [Google Scholar] [CrossRef] [Green Version]
  48. Majoros, W.H.; Pertea, M.; Salzberg, S.L. TigrScan and GlimmerHMM: Two open source ab initio eukaryotic gene-finders. Bioinformatics 2004, 20, 2878–2879. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Alioto, T.; Blanco, E.; Parra, G.; Guigó, R. Using geneid to Identify Genes. Curr. Protoc. Bioinform. 2018, 64, e56. [Google Scholar] [CrossRef]
  50. Korf, I. Gene finding in novel genomes. BMC Bioinform. 2004, 5, 59. [Google Scholar] [CrossRef] [Green Version]
  51. Slater, G.S.C.; Birney, E. Automated generation of heuristics for biological sequence comparison. BMC Bioinform. 2005, 6, 31. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Kim, D.; Langmead, B.; Salzberg, S.L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 2015, 12, 357–360. [Google Scholar] [CrossRef] [Green Version]
  53. Trapnell, C.; Roberts, A.; Goff, L.; Pertea, G.; Kim, D.; Kelley, D.R.; Pimentel, H.; Salzberg, S.L.; Rinn, J.L.; Pachter, L. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012, 7, 562–578. [Google Scholar] [CrossRef] [Green Version]
  54. Campbell, M.A.; Haas, B.J.; Hamilton, J.P.; Mount, S.M.; Robin, C.R. Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. BMC Genom. 2006, 7, 327. [Google Scholar] [CrossRef] [Green Version]
  55. Tang, S.; Lomsadze, A.; Borodovsky, M. Identification of protein coding regions in RNA transcripts. Nucleic Acids Res. 2015, 43, 18. [Google Scholar] [CrossRef] [Green Version]
  56. Haas, B.J.; Salzberg, S.L.; Zhu, W.; Pertea, M.; Allen, J.E.; Orvis, J.; White, O.; Robin, C.R.; Wortman, J.R. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 2008, 9, R7. [Google Scholar] [CrossRef] [Green Version]
  57. Marchler-Bauer, A.; Lu, S.; Anderson, J.B.; Chitsaz, F.; Derbyshire, M.K.; DeWeese-Scott, C.; Fong, J.H.; Geer, L.Y.; Geer, R.C.; Gonzales, N.R.; et al. CDD: A Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res. 2011, 39, D225–D229. [Google Scholar] [CrossRef] [Green Version]
  58. Koonin, E.V.; Fedorova, N.D.; Jackson, J.D.; Jacobs, A.R.; Krylov, D.M.; Makarova, K.S.; Mazumder, R.; Mekhedov, S.L.; Nikolskaya, A.N.; Rao, B.S.; et al. A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biol. 2004, 5, R7. [Google Scholar] [CrossRef] [Green Version]
  59. Dimmer, E.C.; Huntley, R.P.; Alam-Faruque, Y.; Sawford, T.; O’Donovan, C.; Martin, M.J.; Bely, B.; Browne, P.; Chan, W.M.; Eberhardt, R.; et al. The UniProt-GO Annotation database in 2011. Nucleic Acids Res. 2012, 40, D565–D570. [Google Scholar] [CrossRef]
  60. Kanehisa, M.; Sato, Y.; Kawashima, M.; Furumichi, M.; Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016, 44, D457–D462. [Google Scholar] [CrossRef] [Green Version]
  61. Boeckmann, B.; Bairoch, A.; Apweiler, R.; Blatter, M.C.; Estreicher, A.; Gasteiger, E.; Martin, M.J.; Michoud, K.; O’Donovan, C.; Phan, I.; et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003, 31, 365–370. [Google Scholar] [CrossRef]
  62. Marçais, G.; Delcher, A.L.; Phillippy, A.M.; Coston, R.; Salzberg, S.L.; Zimin, A. MUMmer4: A fast and versatile genome alignment system. PLoS Comput. Biol. 2018, 14, e1005944. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  63. Nattestad, M.; Schatz, M.C. Assemblytics: A web analytics tool for the detection of variants from an assembly. Bioinformatics 2016, 32, 3021–3023. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Wang, K.; Li, M.; Hakonarson, H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010, 38, e164. [Google Scholar] [CrossRef]
  65. Narasimhan, V.; Danecek, P.; Scally, A.; Xue, Y.; Tyler-Smith, C.; Durbin, R. BCFtools/RoH: A hidden Markov model approach for detecting autozygosity from next-generation sequencing data. Bioinformatics 2016, 32, 1749–1751. [Google Scholar] [CrossRef] [Green Version]
  66. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Heatmap of interaction of contigs for giant grouper genome assemblies. A deeper color indicates a higher interaction level of contigs. Red indicates a high interaction level and white indicates a low interaction level.
Figure 1. Heatmap of interaction of contigs for giant grouper genome assemblies. A deeper color indicates a higher interaction level of contigs. Red indicates a high interaction level and white indicates a low interaction level.
Ijms 24 12007 g001
Figure 2. Distribution of structural variation in genomes between brown-marbled grouper and giant grouper: (A) annotation of insertion and deletion; (B) annotation of repeat expansion and contraction; (C) annotation of tandem expansion and contraction; (D) annotation of genomic inversion.
Figure 2. Distribution of structural variation in genomes between brown-marbled grouper and giant grouper: (A) annotation of insertion and deletion; (B) annotation of repeat expansion and contraction; (C) annotation of tandem expansion and contraction; (D) annotation of genomic inversion.
Ijms 24 12007 g002
Figure 3. KEGG enrichment of protein-coding genes containing repeat mutations between brown-marbled grouper and giant grouper genomes. (A) Bubble pattern of KEGG enrichment analysis. The sizes of the bubbles indicate the numbers of genes, purple indicates a high enrichment level and orange indicates a low enrichment level. (B) Metabolism pathway network of protein-coding genes containing repeat mutations. The size of the bubble indicates the gene’s number of pathways, and the color of the bubble indicates the enrichment degree of the pathways.
Figure 3. KEGG enrichment of protein-coding genes containing repeat mutations between brown-marbled grouper and giant grouper genomes. (A) Bubble pattern of KEGG enrichment analysis. The sizes of the bubbles indicate the numbers of genes, purple indicates a high enrichment level and orange indicates a low enrichment level. (B) Metabolism pathway network of protein-coding genes containing repeat mutations. The size of the bubble indicates the gene’s number of pathways, and the color of the bubble indicates the enrichment degree of the pathways.
Ijms 24 12007 g003
Figure 4. Manhattan plot of FST values of SNPs between giant grouper and brown-marbled grouper genomes.
Figure 4. Manhattan plot of FST values of SNPs between giant grouper and brown-marbled grouper genomes.
Ijms 24 12007 g004
Figure 5. KEGG enrichment of protein-coding genes involved in highly differentiated regions between brown-marbled grouper and giant grouper genomes. (A) Gradient of KEGG enrichment. The sizes of bubbles indicate the numbers of genes, purple indicates a high enrichment level and orange indicates a low enrichment level. (B) Metabolism pathway network. The size of the spot indicates the gene’s number of pathways.
Figure 5. KEGG enrichment of protein-coding genes involved in highly differentiated regions between brown-marbled grouper and giant grouper genomes. (A) Gradient of KEGG enrichment. The sizes of bubbles indicate the numbers of genes, purple indicates a high enrichment level and orange indicates a low enrichment level. (B) Metabolism pathway network. The size of the spot indicates the gene’s number of pathways.
Ijms 24 12007 g005
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, Y.; Zeng, L.; Wang, T.; Wu, L.; Wu, X.; Xia, J.; Meng, Z.; Liu, X. Assembly of Genome and Resequencing Provide Insights into Genetic Differentiation between Parents of Hulong Hybrid Grouper (Epinephelus fuscoguttatus× E. lanceolatus ♂). Int. J. Mol. Sci. 2023, 24, 12007. https://doi.org/10.3390/ijms241512007

AMA Style

Yang Y, Zeng L, Wang T, Wu L, Wu X, Xia J, Meng Z, Liu X. Assembly of Genome and Resequencing Provide Insights into Genetic Differentiation between Parents of Hulong Hybrid Grouper (Epinephelus fuscoguttatus× E. lanceolatus ♂). International Journal of Molecular Sciences. 2023; 24(15):12007. https://doi.org/10.3390/ijms241512007

Chicago/Turabian Style

Yang, Yang, Leilei Zeng, Tong Wang, Lina Wu, Xi Wu, Junhong Xia, Zining Meng, and Xiaochun Liu. 2023. "Assembly of Genome and Resequencing Provide Insights into Genetic Differentiation between Parents of Hulong Hybrid Grouper (Epinephelus fuscoguttatus× E. lanceolatus ♂)" International Journal of Molecular Sciences 24, no. 15: 12007. https://doi.org/10.3390/ijms241512007

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop