Next Article in Journal
Molecular Phylogenetics and Historical Biogeography of Subtribe Ecliptinae (Asteraceae, Heliantheae)
Previous Article in Journal
An Assessment of Vegetation Changes in the Three-River Headwaters Region, China: Integrating NDVI and Its Spatial Heterogeneity
Previous Article in Special Issue
Cotton Pectate Lyase GhPEL48_Dt Promotes Fiber Initiation Mediated by Histone Acetylation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Genome-Wide Alternative Splicing Analysis of Gossypium arboreum and Gossypium raimondii During Fiber Development

1
College of Life Sciences, Wuhan University, Wuhan 430072, China
2
Institute for Advanced Studies, Wuhan University, Wuhan 430072, China
3
Hubei Hongshan Laboratory, Wuhan 430072, China
4
TaiKang Center for Life and Medical Sciences, Wuhan University, Wuhan 430072, China
*
Author to whom correspondence should be addressed.
Plants 2024, 13(19), 2816; https://doi.org/10.3390/plants13192816
Submission received: 23 August 2024 / Revised: 2 October 2024 / Accepted: 6 October 2024 / Published: 8 October 2024
(This article belongs to the Special Issue Molecular Insights into Cotton Fiber Gene Regulation)

Abstract

:
Alternative splicing (AS) is a crucial post-transcriptional regulatory mechanism that contributes to proteome complexity and versatility in different plant species. However, detailed AS exploration in diploid cotton during fiber development has not been reported. In this study, we comparatively analyzed G. arboreum and G. raimondii AS events during fiber development using transcriptome data and identified 9690 and 7617 AS events that were distributed in 6483 and 4859 genes, respectively. G. arboreum had more AS genes and AS events than G. raimondii, and most AS genes were distributed at both ends of all 13 chromosomes in both diploid cotton species. Four major AS types, including IR, SE, A3SS, and A5SS, were all experimentally validated through RT-PCR assays. G. arboreum and G. raimondii had only 1888 AS genes in common, accounting for one-third and one-half of the total number of AS genes, respectively. Furthermore, we found a lysine-specific demethylase coding gene with a different AS mechanism in G. arboreum and G. raimondii, in which AS isoforms lacked part of a key conserved domain. Our findings may provide new directions for the discovery of functional genes involved in cotton species differentiation.

1. Introduction

Alternative splicing (AS) is a crucial post-transcriptional regulatory mechanism in eukaryotes that allows a single gene to produce multiple mRNA isoforms, which significantly contributes to transcriptome diversity and proteome complexity [1,2]. Since alternative splicing was first demonstrated in 1977 [3,4], along with technological advances and the popularity of high-throughput sequencing, AS has been comprehensively analyzed in many species on a genome-wide scale [5,6]. Over 90 and 61% of multi-exonic genes are alternatively spliced in humans [7] and Arabidopsis thaliana [8,9,10], and many isoforms produced by alternative splicing exhibit tissue or condition specificity [11], which has been linked to many human diseases [12,13]. However, not all AS events are functional; some isoforms produced by alternative splicing may contain a termination codon leading to degradation via the nonsense-mediated decay (NMD) pathway [14,15].
In plants, AS plays a crucial role in metabolism [16], temperature response, disease defense [17], immunity response [18], and osmotic stress response [19] and is regulated by histone modifications [20,21]. Most studies have identified and analyzed AS at the genomic level and found some functional AS genes, while others explained how alternative splicing contributes to plant development. For example, a wheat heat shock transcription factor, TaHSFA6e, produced a 14-amino acid peptide by AS on its C-terminal under heat stress, which enhanced the transcriptional activity of three downstream heat shock protein 70 (TaHSP70) genes [22]. In Camellia sinensis, when jasmonic acid (JA) was present, AS transcripts and CsJAZ1 full-length transcripts interacted and formed heterodimers that stabilized the CsJAZ1-CsMYC2 complexes, thereby repressing the transcription of four genes that act late in the flavan-3-ol biosynthetic pathway [23]. The AS transcript of MaMYB16 was up-regulated during banana fruit ripening, competitively combined and formed non-functional heterodimers with full-length transcripts, decreased binding capacity with MaDREB2, and facilitated the activation of ripening-related genes, thereby promoting fruit ripening [24]. In Oryza sativa, RLI1 alternative splicing produced a protein isoform without coiled-coil (CC) domains, enabling it to activate broader target genes that regulated brassinolide (BL) biosynthesis and signaling [25].
Cotton is an important economic crop worldwide and is used to produce both natural textile fiber and cottonseed oil [26,27]. Cotton is also an excellent model for studying genome polyploidization [28], cell elongation, and cell wall biosynthesis [29,30]. The G. arboretum (A2) and G. raimondii (D5) genomes, which are potential diploid ancestors of cultivated allotetraploid cotton species (Gossypium hirsutum and Gossypium barbadense), have been successfully sequenced and assembled [31,32]. Gossypium arboreum is an important cultivated diploid cotton species [33], whereas G. raimondii does not produce spinnable fiber. Although the G. arboretum genome is almost twice the size of G. raimondii [34], the number of annotated genes is similar, and >80% are orthologous [32]. Genome-wide association studies have identified many candidate genes associated with fiber traits [35,36,37,38,39]. Genome-wide analyses of the AS genes and AS events have been performed in G. raimondii [40], G. arboretum [30], G. hirsutum [41], and G. barbadense [42]. However, few systematic studies have revealed AS during cotton fiber development, and whether there are functional isoforms that regulate fiber development is unknown.
To visually compare fiber-associated AS, we collected transcriptomic data on the fiber developmental stages of cultivated diploid G. arboreum and G. raimondii (without spinnable fiber). Then, we conducted a comparative analysis of AS events and genes and chromosomal distribution and homologues in G. arboreum and G. raimondii. Histone modification-related AS genes were further analyzed, and a potential AS gene associated with fiber development was found. Our findings increase our knowledge of AS in different diploid cotton species and provide a new platform for studying the cotton fiber development mechanism.

2. Materials and Methods

2.1. Plant Materials

G. arboretum (Shixiya1) and G. raimondii (D5-3) were grown in a greenhouse at Wuhan University, Hubei, China. Then, 0, 5, 10, and 15 DPA (day post-anthesis) fibers and ovules were collected from G. arboreum and G. raimondii, immediately frozen in liquid nitrogen, and stored at −80 °C until RNA extraction.

2.2. RNA Extraction, cDNA Library Preparation, and RNA-seq

Fibers and ovules of the same weight from 0, 5, 10, and 15 DPA were mixed together and ground to powder with a grinder. Total RNA was extracted using a plant total RNA extraction kit (TIANGEN, DP441); after the removal of genomic DNA and ribosomal RNA, the mRNA was broken through the NEBNext® RNA Fragmentation Buffer into 200–300 nt. The first strand of cDNA was then generated using random hexamer primers. Libraries were sequenced via Illumina platforms NovaSeq PE150. The low-quality reads and adapters were removed using Trimmomatic (version 0.36). The clean sequencing data of G. arboreum and G. raimondii have been uploaded to the NCBI Small Read Archive (SRA) and are available under the following accession numbers: SRR29754175 and SRR29754174.

2.3. Identification of AS Events

Genome sequences and annotations of G. arboreum (CRI) and G. raimondii (HAU) were downloaded from https://www.cottongen.org (accessed on 5 August 2023). Tophat (version 2.1.1) and Cufflinks (version 2.2.1) software [43] were used to align and assemble all of the paired-end clean reads. Transcripts with fragments per kilobase of exon per million fragments mapped (FPKM) <0.1 or ratios <10% of each gene were discarded [40,44]. Final transcript annotations were compared and merged with the reference genome using the cuffcompare tool (in Cufflink). We then used ASTALAVISTA v4.0 software [45] to identify the AS events. Four major AS types (IR, A3SS, A5SS and ES) were recognized, and the remainder were collectively grouped as Complex.

2.4. The Distribution of AS Genes and AS Events

To visualize the distribution of AS events and AS genes identified in G. arboreum and G. raimondii across their chromosomes, we calculated the density of genes and events based on their locations and visualized them using the Circos tool [46].

2.5. RT-PCR Verification of AS Events

Total RNA extracted from G. arboreum and G. raimondii were used to synthesize cDNA using a 1st Strand cDNA synthesis kit (Vazyme, R212-01) after the removal of genomic DNA. For each selected gene, primers were designed on the upstream and downstream of the splice site (Table S3). RT-PCR reactions were performed under the following conditions: 95 °C for 5 min, followed by 30 cycles of 95 °C for 30 s, 57 °C for 30 s, and 72 °C for 60 s.

2.6. Homologous Gene Analysis and Conserved Domains Search

To investigate the homologous genes between G. arboreum and G. raimondii, we used a bidirectional blast based on protein sequences. To compare the different AS genes between G. arboreum and G. raimondii, we translated different transcripts into protein sequences for further analysis. We used the InterPro database (https://www.ebi.ac.uk/interpro/ (accessed on 5 May 2024)) to find conserved domains in protein sequences. The protein structure was predicted using alphafold (https://www.alphafold.ebi.ac.uk/ (accessed on 7 May 2024)).

3. Results

3.1. Statistical Analysis of AS Genes and AS Events

Cufflink and ASTALAVISTA were used to assemble the transcripts, estimate transcriptional expression, and identify AS events. In total, 27,598 and 28,123 annotated genes were expressed in G. arboreum and G. raimondii, which produced 44,711 and 41,563 gene isoforms (Table 1). Of all the genes expressed in G. arboreum and G. raimondii, 22,732 and 23,729 genes were multiexon. In our analysis, 9690 and 7617 AS events were identified in G. arboreum and G. raimondii and distributed in 6483 and 4859 genes (Tables S1 and S2).
Of the AS events identified, there were four major types, including retained introns (IRs), skipped exons (SEs), alternative 3′ splicing sites (A3SSs), and alternative 5′ splicing sites (A5SSs) accounting for 88.48 and 88.09% in G. arboreum and G. raimondii, respectively (Figure 1B). The remaining AS events were collectively grouped as “Complex”. In G. raimondii, 2453 IR events (32.2% of total events) from 1955 genes were identified, implying that most AS events were of this type (Figure 1A,C). In G. arboreum, 3039 A3SS events (31.36% of total events) from 2640 genes were the most abundant type. G. arboreum had more AS genes and AS events than G. raimondii.
All of the AS genes had at least one AS event. G. arboreum had more genes with one to five different AS events than G. raimondii, G. raimondii had more genes with over five AS events than G. arboreum (Figure 1D). An uncharacterized AS gene (Grai_02G007450) in G. raimondii had the most AS events, producing 14 from six different transcripts (Table S2). G. arboreum and G. raimondii AS genes produced 1.49 and 1.57 splicing events on average.

3.2. Experimental Validation of Different AS Events Identified in G. arboreum

To validate AS events, four different AS event types in G. arboreum were randomly selected and verified by RT-PCR. The gene-structure models and amplified products represented different AS event types (IR, Figure 2A; SE, Figure 2B; A3SS, Figure 2C; A5SS, Figure 2D). As expected, two amplified products corresponding to the primary transcript and AS transcript were observed.

3.3. Distribution of AS Genes and AS Events on Chromosomes

To visualize the distribution of AS genes and AS events across G. arboreum and G. raimondii chromosomes, AS genes and AS event density were calculated. The G. arboreum A05 chromosome and G. raimondii D05 chromosome had the most AS genes and AS events, with 705 (1049 AS events) and 546 (852 AS events) AS genes, respectively (Figure 3 and Figure S2). Meanwhile, the G. arboreum A02 chromosome and G. raimondii D02 chromosome had the fewest AS genes, with 273 (395 AS events) and 262 (441 AS events) AS genes, respectively, and the G. raimondii D01 chromosome had the least AS events at 427. Generally, the distribution of AS genes was similar to expressed genes. Interestingly, almost all of the AS genes were mainly distributed at both ends of chromosomes, which was more obvious in G. arboreum (Figure 3).

3.4. G. arboreum and G. raimondii AS Gene Differences

G. arboreum and G. raimondii had almost the same number of genes, and most were homologous. To compare their genes, we used protein sequences to identify ortholog genes. We detected 27,598 and 28,123 expressed genes in G. arboreum and G. raimondii, respectively, from transcriptome data containing 24,983 pairs of G. arboreum and G. raimondii homologous genes (Figure 4A). Only 1888 AS genes were present in both G. arboreum and G. raimondii, 327 AS genes in G. arboreum and 397 AS genes in G. raimondii without expression in homologous genes. The remaining 4268 and 2574 AS genes were identified only in G. arboreum or G. raimondii (Figure 4B).
The AS gene exon numbers and lengths of the four majority AS events were statistically analyzed. IR genes had more exons than other expressed genes and were more pronounced in G. arboretum (Figure S1A). The SE skipping lengths were similar in the two cotton species (Figure S1D), while the retained intron lengths in G. raimondii were longer than those in G. arboretum (Figure S1B). A3SS lengths were mainly below 40 bp, and A5SS lengths were more widely distributed (Figure S1C).

3.5. A specific Transcription Factor Coding Gene JMJ25 Possesses a Distinct AS Mechanism in Both Gossypium Varieties

The gene’s protein sequence determines gene function, with those genes producing different transcripts through alternative splicing also creating different protein isoforms, which may have different functions. By comparing AS events between G. arboreum and G. raimondii homologous genes, we found a transcription factor jumonji (Jmjc) domain-containing protein coding gene named JMJ25 (lysine-specific demethylase) with a different AS mechanism. In G. arboreum and G. raimondii, the gene produced three and two transcripts through alternative splicing (Figure 5A), which can translate into different proteins. All of the AS events were validated using RT-PCR, and different amplification product sizes were observed (Figure 5B). The results of the conserved domains showed that the JMJ25 gene had three conserved domains, and all the AS sites in G. arboreum and G. raimondii were in the Jmjc-domain (Figure 5C), where Jmjc functions in a histone demethylation mechanism. The protein structure showed that the AS site was located in an alpha helix consisting of 10 amino acids at both ends (Figure 5D).

4. Discussion

High-throughput sequencing and high-quality genomes enable AS identification on a genome-wide scale. Although the total gene number in the genome and total number of genes expressed during fiber development were similar in the two diploid ancestors, G. arboreum and G. raimondii, AS genes and AS events identified in G. arboreum were greater than in G. raimondii. When comparing AS in homologous gene pairs between G. arboreum and G. raimondii, we found that over 88% of the expressed genes were the same, while > 61% of the AS genes were different. The same trends were observed in cultivated allotetraploid cotton (Gossypium barbadense) [42], so it appears that these differences were inherited from diploid ancestors and retained after polyploidization. G. raimondii and G. arboreum have similar genes but their fiber quality is very different. G. arboreum may produce more transcript isoforms by AS from a limited number of genes, increasing the diversity of gene transcription and the complexity of the proteome. In this study, the different AS genes between G. arboreum and G. raimondii may have played an important role in the development of fiber cells, especially in the stages of fiber initiation and elongation.
Among all of the AS events we identified in G. arboreum and G. raimondii, A3SS and IR were the most abundant types, accounting for 31.3 and 32.2%, respectively. The overall content of different AS types varied substantially between species; for example, IR is the most prevalent in plants and fungi, whereas ES is the most common in vertebrates [7] (38% in zebrafish, 42% in humans). These differences may be caused by transposable element (TE) insertions inducing changes in the branch point site distribution, which are important for IR [40]. During the development of cotton fiber cells, AS may produce alternative stop codons in mature mRNA sequences, which can be further processed or degraded via the NMD pathway [47,48] as required, ensuring efficient gene regulation in the process of fiber.
Gene expression involves transcription and translation, so the extent to which mRNA levels influence protein abundance and the effects in cases where this dependency breaks down remain topics of intense debate [49]. Normally, we determine whether a gene is up-regulated or down-regulated based on the level of transcription. There are so many dynamically changing genes during fiber development, especially during primary and secondary cell wall development [35,50]. However, few studies have reported that AS can change the encoded protein structure and function in cotton species. Comparing the differences in alternative splicing between two diploid cotton plants may provide new insights into the mechanisms of cotton fiber development, especially those upstream transcription factors and histone regulatory factors.
Histone modification is an important mechanism that mediates gene expression, growth, and development in plants and animals [8,9,10]. There are some modifications, such as Lys 4 trimethylation (H3K4me3) and histone H3 Lys K9 acetylation (H3K9ac), which are euchromatic marks that are often associated with active transcription, while other modifications, such as H3K9me2 and H3K27me3, are known as heterochromatic marks and are related to gene repression [49]. Moreover, histone modifications affect splicing outcomes by influencing the recruitment of splicing regulators via a chromatin-binding protein [21]. Additionally, H3K36me3 plays an important role in regulating AS and plant responses to high temperatures [20]. In allotetraploid cotton (Gossypium hirsutum), H3K4me3 levels are unequal in homologous gene pairs between A and D subgenomes [51]. We further compared the differences between G. arboreum and G. raimondii AS genes, especially transcription factors and including those related to histone modification. We found a JMJ25 (lysine-specific demethylase) gene with a different AS mechanism in G. arboreum and G. raimondii whereby all transcripts can translate into proteins, and AS transcripts may have different protein activities that affect histone methylation levels. In Arabidopsis (Arabidopsis thaliana), the homologue JMJ24 is a nuclear-localized Jmjc domain-containing protein that appears to regulate basal levels of transcription of silenced loci in part by controlling methylation in heterochromatic regions. Other homologues, JMJ30 and JMJ32, mediate histone demethylation at the FLC site, constituting a balanced mechanism that controls flowering at elevated temperatures to prevent premature flowering [52]. In this study, we found that AS variants of JMJ25 can change the activity of demethylases and may regulate the expression of downstream-associated genes, which suggested that JMJ25 may be a potential gene associated with fiber development.

5. Conclusions

We used fiber development transcriptomic data to identify AS events in G. arboreum and G. raimondii at the genome level. The number of AS events in G. arboreum was significantly higher than G. raimondii. Furthermore, <39% of AS genes were identified in both G. arboreum and G. raimondii. There was a greater difference in the G. arboreum and G. raimondii transcriptome due to the presence of AS. Alternative splicing of lysine-specific demethylase JMJ25 produced at least three- and two-protein isoforms with histone-modified demethylase activity that varied in G. arboreum and G. raimondii during fiber development.
G. arboreum and G. raimondii have similar genes but distinct differences in fiber traits. These ASs reflect an important transcriptional regulatory mechanism and potentially expand proteome diversity. The AS variants, especially those that differed between G. arboreum and G. raimondii, provide a new direction for studying the cotton fiber development mechanism.

Supplementary Materials

The following supporting information can be downloaded at this website: https://www.mdpi.com/article/10.3390/plants13192816/s1, Figure S1. Exon number and length of four basic alternative splicing events, including (A) the exon number of AS genes and expressed genes, the AS length of (B) IR, of (C) A3SS and A5SS, and of (D) SE (*** p < 0.001, by Student’s t-test); Figure S2. Statistics of the AS genes and AS events in the chromosomes identified in G. arboreum and G. raimondii, including (A) the AS genes and (B) the AS events; Table S1. AS genes and AS events identified in G. arboretum; Table S2. AS genes and AS events identified in G. raimondii; Table S3. Primers used for AS validation.

Author Contributions

Y.Z. and J.H. participated in the design of the study. J.H. performed the experiments, the data analysis, and wrote the manuscript. Y.Z., J.H. and X.W. revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (31830057 and 32200286) and partly supported by the China Postdoctoral Science Foundation (2022TQ0240 and 2022M722470).

Data Availability Statement

The transcriptome data of G. arboreum and G. raimondii has been uploaded to https://www.ncbi.nlm.nih.gov/sra/ (accessed on 4 July 2024) under the following accession numbers: SRR29754175 and SRR29754174.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Baralle, F.E.; Giudice, J. Alternative splicing as a regulator of development and tissue identity. Nat. Rev. Mol. Cell Biol. 2017, 18, 437–451. [Google Scholar] [CrossRef] [PubMed]
  2. Marasco, L.E.; Kornblihtt, A.R. The physiology of alternative splicing. Nat. Rev. Mol. Cell Biol. 2023, 24, 242–254. [Google Scholar] [CrossRef] [PubMed]
  3. Chow, L.T.; Gelinas, R.E.; Broker, T.R.; Roberts, R.J. An amazing sequence arrangement at the 5′ ends of adenovirus 2 messenger RNA. Cell 1977, 12, 1–8. [Google Scholar] [CrossRef] [PubMed]
  4. Berget, S.M.; Moore, C.; Sharp, P.A. Spliced segments at the 5′ terminus of adenovirus 2 late mRNA. Proc. Natl. Acad. Sci. USA 1977, 74, 3171–3175. [Google Scholar] [CrossRef]
  5. Dong, C.; He, F.; Berkowitz, O.; Liu, J.; Cao, P.; Tang, M.; Shi, H.; Wang, W.; Li, Q.; Shen, Z.; et al. Alternative splicing plays a critical role in maintaining mineral nutrient homeostasis in rice (Oryza sativa). Plant Cell 2018, 30, 2267–2285. [Google Scholar] [CrossRef]
  6. Shen, Y.; Zhou, Z.; Wang, Z.; Li, W.; Fang, C.; Wu, M.; Ma, Y.; Liu, T.; Kong, L.A.; Peng, D.L.; et al. Global dissection of alternative splicing in paleopolyploid soybean. Plant Cell 2014, 26, 996–1008. [Google Scholar] [CrossRef]
  7. Stamm, S.; Zhu, J.; Nakai, K.; Stoilov, P.; Stoss, O.; Zhang, M.Q. An alternative-exon database and its statistical analysis. DNA Cell Biol. 2000, 19, 739–756. [Google Scholar] [CrossRef]
  8. Berger, S.L. The complex language of chromatin regulation during transcription. Nature 2007, 447, 407–412. [Google Scholar] [CrossRef]
  9. Liu, C.; Lu, F.; Cui, X.; Cao, X. Histone methylation in higher plants. Annu. Rev. Plant Biol. 2010, 61, 395–420. [Google Scholar] [CrossRef]
  10. Li, B.; Carey, M.; Workman, J.L. The role of chromatin during transcription. Cell 2007, 128, 707–719. [Google Scholar] [CrossRef]
  11. Kalsotra, A.; Cooper, T.A. Functional consequences of developmentally regulated alternative splicing. Nat. Rev. Genet. 2011, 12, 715–729. [Google Scholar] [CrossRef] [PubMed]
  12. Arnold, E.S.; Ling, S.C.; Huelga, S.C.; Lagier-Tourenne, C.; Polymenidou, M.; Ditsworth, D.; Kordasiewicz, H.B.; McAlonis-Downes, M.; Platoshyn, O.; Parone, P.A.; et al. ALS-linked TDP-43 mutations produce aberrant RNA splicing and adult-onset motor neuron disease without aggregation or loss of nuclear TDP-43. Proc. Natl. Acad. Sci. USA 2013, 110, E736–E745. [Google Scholar] [CrossRef] [PubMed]
  13. David, C.J.; Chen, M.; Assanah, M.; Canoll, P.; Manley, J.L. HnRNP proteins controlled by c-Myc deregulate pyruvate kinase mRNA splicing in cancer. Nature 2010, 463, 364–368. [Google Scholar] [CrossRef] [PubMed]
  14. Kurosaki, T.; Maquat, L.E. Nonsense-mediated mRNA decay in humans at a glance. J. Cell Sci. 2016, 129, 461–467. [Google Scholar] [CrossRef]
  15. Filichkin, S.A.; Mockler, T.C. Unproductive alternative splicing and nonsense mRNAs: A widespread phenomenon among plant circadian clock genes. Biol. Direct 2012, 7, 20. [Google Scholar] [CrossRef]
  16. Lam, P.Y.; Wang, L.; Lo, C.; Zhu, F.Y. Alternative splicing and its roles in plant metabolism. Int. J. Mol. Sci. 2022, 23, 7355. [Google Scholar] [CrossRef]
  17. Zhang, H.; Mao, R.; Wang, Y.; Zhang, L.; Wang, C.; Lv, S.; Liu, X.; Wang, Y.; Ji, W. Transcriptome-wide alternative splicing modulation during plant-pathogen interactions in wheat. Plant Sci. 2019, 288, 110160. [Google Scholar] [CrossRef]
  18. Kufel, J.; Diachenko, N.; Golisz, A. Alternative splicing as a key player in the fine-tuning of the immunity response in Arabidopsis. Mol. Plant Pathol. 2022, 23, 1226–1238. [Google Scholar] [CrossRef]
  19. Thatcher, S.R.; Danilevskaya, O.N.; Meng, X.; Beatty, M.; Zastrow-Hayes, G.; Harris, C.; Van Allen, B.; Habben, J.; Li, B. Genome-Wide analysis of alternative splicing during development and drought stress in Maize. Plant Physiol. 2016, 170, 586–599. [Google Scholar] [CrossRef]
  20. Pajoro, A.; Severing, E.; Angenent, G.C.; Immink, R. Histone H3 lysine 36 methylation affects temperature-induced alternative splicing and flowering in plants. Genome Biol. 2017, 18, 102. [Google Scholar] [CrossRef]
  21. Luco, R.F.; Pan, Q.; Tominaga, K.; Blencowe, B.J.; Pereira-Smith, O.M.; Misteli, T. Regulation of alternative splicing by histone modifications. Science 2010, 327, 996–1000. [Google Scholar] [CrossRef]
  22. Wen, J.; Qin, Z.; Sun, L.; Zhang, Y.; Wang, D.; Peng, H.; Yao, Y.; Hu, Z.; Ni, Z.; Sun, Q.; et al. Alternative splicing of TaHSFA6e modulates heat shock protein-mediated translational regulation in response to heat stress in wheat. New Phytol. 2023, 239, 2235–2247. [Google Scholar] [CrossRef] [PubMed]
  23. Zhu, J.; Yan, X.; Liu, S.; Xia, X.; An, Y.; Xu, Q.; Zhao, S.; Liu, L.; Guo, R.; Zhang, Z.; et al. Alternative splicing of CsJAZ1 negatively regulates flavan-3-ol biosynthesis in tea plants. Plant J. 2022, 110, 243–261. [Google Scholar] [CrossRef] [PubMed]
  24. Jiang, G.; Zhang, D.; Li, Z.; Liang, H.; Deng, R.; Su, X.; Jiang, Y.; Duan, X. Alternative splicing of MaMYB16L regulates starch degradation in banana fruit during ripening. J. Integr. Plant Biol. 2021, 63, 1341–1352. [Google Scholar] [CrossRef] [PubMed]
  25. Guo, M.; Zhang, Y.; Jia, X.; Wang, X.; Zhang, Y.; Liu, J.; Yang, Q.; Ruan, W.; Yi, K. Alternative splicing of REGULATOR OF LEAF INCLINATION 1 modulates phosphate starvation signaling and growth in plants. Plant Cell 2022, 34, 3319–3338. [Google Scholar] [CrossRef]
  26. Hu, Y.; Chen, J.; Fang, L.; Zhang, Z.; Ma, W.; Niu, Y.; Ju, L.; Deng, J.; Zhao, T.; Lian, J.; et al. Gossypium barbadense and Gossypium hirsutum genomes provide insights into the origin and evolution of allotetraploid cotton. Nat. Genet. 2019, 51, 739–748. [Google Scholar] [CrossRef]
  27. Alam, B.; Liu, R.; Gong, J.; Li, J.; Yan, H.; Ge, Q.; Xiao, X.; Pan, J.; Shang, H.; Shi, Y.; et al. Hub Genes in stable QTLs Orchestrate the accumulation of cottonseed oil in upland cotton via catalyzing key steps of lipid-related pathways. Int. J. Mol. Sci. 2023, 24, 16595. [Google Scholar] [CrossRef]
  28. Zhu, Y. The post-genomics era of cotton. Sci. China Life Sci. 2016, 59, 109–111. [Google Scholar] [CrossRef]
  29. Qin, Y.M.; Zhu, Y.X. How cotton fibers elongate: A tale of linear cell-growth mode. Curr Opin Plant Biol 2011, 14, 106–111. [Google Scholar] [CrossRef]
  30. Wang, K.; Wang, D.; Zheng, X.; Qin, A.; Zhou, J.; Guo, B.; Chen, Y.; Wen, X.; Ye, W.; Zhou, Y.; et al. Multi-strategic RNA-seq analysis reveals a high-resolution transcriptional landscape in cotton. Nat. Commun. 2019, 10, 4714. [Google Scholar] [CrossRef]
  31. Wang, K.; Wang, Z.; Li, F.; Ye, W.; Wang, J.; Song, G.; Yue, Z.; Cong, L.; Shang, H.; Zhu, S.; et al. The draft genome of a diploid cotton Gossypium raimondii. Nat. Genet. 2012, 44, 1098–1103. [Google Scholar] [CrossRef] [PubMed]
  32. Li, F.; Fan, G.; Wang, K.; Sun, F.; Yuan, Y.; Song, G.; Li, Q.; Ma, Z.; Lu, C.; Zou, C.; et al. Genome sequence of the cultivated cotton Gossypium arboreum. Nat. Genet. 2014, 46, 567–572. [Google Scholar] [CrossRef] [PubMed]
  33. Liu, X.; Moncuquet, P.; Zhu, Q.H.; Stiller, W.; Zhang, Z.; Wilson, I. Genetic identification and transcriptome analysis of lintless and fuzzless traits in Gossypium arboreum L. Int. J. Mol. Sci. 2020, 21, 1675. [Google Scholar] [CrossRef] [PubMed]
  34. Hendrix, B.; Stewart, J.M. Estimation of the nuclear DNA content of gossypium species. Ann. Bot. 2005, 95, 789–797. [Google Scholar] [CrossRef]
  35. Li, F.; Fan, G.; Lu, C.; Xiao, G.; Zou, C.; Kohel, R.J.; Ma, Z.; Shang, H.; Ma, X.; Wu, J.; et al. Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution. Nat. Biotechnol. 2015, 33, 524–530. [Google Scholar] [CrossRef]
  36. Wang, M.; Tu, L.; Lin, M.; Lin, Z.; Wang, P.; Yang, Q.; Ye, Z.; Shen, C.; Li, J.; Zhang, L.; et al. Asymmetric subgenome selection and cis-regulatory divergence during cotton domestication. Nat. Genet. 2017, 49, 579–587. [Google Scholar] [CrossRef]
  37. Fang, L.; Wang, Q.; Hu, Y.; Jia, Y.; Chen, J.; Liu, B.; Zhang, Z.; Guan, X.; Chen, S.; Zhou, B.; et al. Genomic analyses in cotton identify signatures of selection and loci associated with fiber quality and yield traits. Nat. Genet. 2017, 49, 1089–1098. [Google Scholar] [CrossRef]
  38. Sun, Z.; Wang, X.; Liu, Z.; Gu, Q.; Zhang, Y.; Li, Z.; Ke, H.; Yang, J.; Wu, J.; Wu, L.; et al. Genome-wide association study discovered genetic variation and candidate genes of fibre quality traits in Gossypium hirsutum L. Plant Biotechnol. J. 2017, 15, 982–996. [Google Scholar] [CrossRef]
  39. Li, X.; Huang, G.; Zhou, Y.; Wang, K.; Zhu, Y. GhATL68b Regulates cotton fiber cell development by ubiquitinating the enzyme required for beta-oxidation of polyunsaturated fatty acids. Plant Commun. 2024, 101003. [Google Scholar] [CrossRef]
  40. Li, Q.; Xiao, G.; Zhu, Y.X. Single-nucleotide resolution mapping of the Gossypium raimondii transcriptome reveals a new mechanism for alternative splicing of introns. Mol. Plant 2014, 7, 829–840. [Google Scholar] [CrossRef]
  41. Zheng, J.; Wen, S.; Yu, Z.; Luo, K.; Rong, J.; Ding, M. Alternative splicing during fiber development in G. hirsutum. Int. J. Mol. Sci. 2023, 24, 11812. [Google Scholar] [CrossRef] [PubMed]
  42. Wang, M.; Wang, P.; Liang, F.; Ye, Z.; Li, J.; Shen, C.; Pei, L.; Wang, F.; Hu, J.; Tu, L.; et al. A global survey of alternative splicing in allopolyploid cotton: Landscape, complexity and regulation. New Phytol. 2018, 217, 163–178. [Google Scholar] [CrossRef] [PubMed]
  43. Trapnell, C.; Williams, B.A.; Pertea, G.; Mortazavi, A.; Kwan, G.; van Baren, M.J.; Salzberg, S.L.; Wold, B.J.; Pachter, L. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 2010, 28, 511–515. [Google Scholar] [CrossRef] [PubMed]
  44. He, F.; Wang, W.; Rutter, W.B.; Jordan, K.W.; Ren, J.; Taagen, E.; DeWitt, N.; Sehgal, D.; Sukumaran, S.; Dreisigacker, S.; et al. Genomic variants affecting homoeologous gene expression dosage contribute to agronomic trait variation in allopolyploid wheat. Nat. Commun. 2022, 13, 826. [Google Scholar] [CrossRef] [PubMed]
  45. Foissac, S.; Sammeth, M. Analysis of alternative splicing events in custom gene datasets by AStalavista. Methods Mol. Biol. 2015, 1269, 379–392. [Google Scholar] [CrossRef]
  46. Krzywinski, M.; Schein, J.; Birol, I.; Connors, J.; Gascoyne, R.; Horsman, D.; Jones, S.J.; Marra, M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009, 19, 1639–1645. [Google Scholar] [CrossRef]
  47. Gohring, J.; Jacak, J.; Barta, A. Imaging of endogenous messenger RNA splice variants in living cells reveals nuclear retention of transcripts inaccessible to nonsense-mediated decay in Arabidopsis. Plant Cell 2014, 26, 754–764. [Google Scholar] [CrossRef]
  48. Hartmann, L.; Wiessner, T.; Wachter, A. Subcellular Compartmentation of alternatively spliced transcripts defines SERINE/ARGININE-RICH PROTEIN30 expression. Plant Physiol. 2018, 176, 2886–2903. [Google Scholar] [CrossRef]
  49. Buccitelli, C.; Selbach, M. mRNAs, proteins and the emerging principles of gene expression control. Nat. Rev. Genet. 2020, 21, 630–644. [Google Scholar] [CrossRef]
  50. Wen, X.; Zhai, Y.; Zhang, L.; Chen, Y.; Zhu, Z.; Chen, G.; Wang, K.; Zhu, Y. Molecular studies of cellulose synthase supercomplex from cotton fiber reveal its unique biochemical properties. Sci. China Life Sci. 2022, 65, 1776–1793. [Google Scholar] [CrossRef]
  51. Zheng, D.; Ye, W.; Song, Q.; Han, F.; Zhang, T.; Chen, Z.J. Histone modifications define expression bias of homoeologous genomes in allotetraploid cotton. Plant Physiol. 2016, 172, 1760–1771. [Google Scholar] [CrossRef] [PubMed]
  52. Gan, E.S.; Xu, Y.; Wong, J.Y.; Goh, J.G.; Sun, B.; Wee, W.Y.; Huang, J.; Ito, T. Jumonji demethylases moderate precocious flowering at elevated temperature via regulation of FLC in Arabidopsis. Nat. Commun. 2014, 5, 5098. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Statistics for different AS types and AS genes identified in G. arboreum and G. raimondii. Overview of different types of AS events: (A) number, (B) frequency, (C) gene number identified in G. arboreum and G. raimondii, (D) gene number distribution with different AS events.
Figure 1. Statistics for different AS types and AS genes identified in G. arboreum and G. raimondii. Overview of different types of AS events: (A) number, (B) frequency, (C) gene number identified in G. arboreum and G. raimondii, (D) gene number distribution with different AS events.
Plants 13 02816 g001
Figure 2. Experimental validation of the 4 main AS types in G. arboreum: (A) IR, (B) SE, (C) A3SS, and (D) A5SS. Black rectangles in the gene-structure models denote constitutive exons and red rectangles denote alternatively spliced exons; short lines under the gene-structure models identify the mapped reads, and dotted lines are genomic sequences that are not present in the RNA-seq data set. The gels show different-sized transcripts (PCR products) from the same primer pairs with cDNA templates; – represents the reverse transcriptase-negative PCR control.
Figure 2. Experimental validation of the 4 main AS types in G. arboreum: (A) IR, (B) SE, (C) A3SS, and (D) A5SS. Black rectangles in the gene-structure models denote constitutive exons and red rectangles denote alternatively spliced exons; short lines under the gene-structure models identify the mapped reads, and dotted lines are genomic sequences that are not present in the RNA-seq data set. The gels show different-sized transcripts (PCR products) from the same primer pairs with cDNA templates; – represents the reverse transcriptase-negative PCR control.
Plants 13 02816 g002
Figure 3. Distribution of AS genes and AS events across the G. arboreum (A genome) and G. raimondii (D genome) chromosomes. The densities of (A) all expressed genes, (B) AS genes, and (C) AS events.
Figure 3. Distribution of AS genes and AS events across the G. arboreum (A genome) and G. raimondii (D genome) chromosomes. The densities of (A) all expressed genes, (B) AS genes, and (C) AS events.
Plants 13 02816 g003
Figure 4. Venn diagram depicting the number of the homologous genes in G. arboreum and G. raimondii: (A) expressed genes in RNA-seq and (B) all expressed genes and AS genes from G. arboreum and G. raimondii.
Figure 4. Venn diagram depicting the number of the homologous genes in G. arboreum and G. raimondii: (A) expressed genes in RNA-seq and (B) all expressed genes and AS genes from G. arboreum and G. raimondii.
Plants 13 02816 g004
Figure 5. Different AS mechanisms of homologous genes between G. arboreum and G. raimondii. (A) represents the gene structure of the primary transcript and the AS transcript in G. arboreum (Ga13G1604) and G. raimondii (Grai_13G017400). The black boxes represent the exons; the black lines represent the introns; the dotted lines indicate the splicing results compared with the primary transcript; the red boxes represent the AS sites; the red arrows represent the forward primers and the reverse primers, and the red numbers represent the lengths of the forward primer to the reverse primer. (B) represents the RT-PCR validation of AS events; the gel bands show the DNA markers and the PCR results in G. arboreum and G. raimondii, which are amplified by the same primer, with its size (bp) indicated at the right. The conserved domains (C) and protein structure (D) of JMJ25 are also shown.
Figure 5. Different AS mechanisms of homologous genes between G. arboreum and G. raimondii. (A) represents the gene structure of the primary transcript and the AS transcript in G. arboreum (Ga13G1604) and G. raimondii (Grai_13G017400). The black boxes represent the exons; the black lines represent the introns; the dotted lines indicate the splicing results compared with the primary transcript; the red boxes represent the AS sites; the red arrows represent the forward primers and the reverse primers, and the red numbers represent the lengths of the forward primer to the reverse primer. (B) represents the RT-PCR validation of AS events; the gel bands show the DNA markers and the PCR results in G. arboreum and G. raimondii, which are amplified by the same primer, with its size (bp) indicated at the right. The conserved domains (C) and protein structure (D) of JMJ25 are also shown.
Plants 13 02816 g005
Table 1. AS events and genes identified in G. arboreum and G.raimondii.
Table 1. AS events and genes identified in G. arboreum and G.raimondii.
Expressed GenesGene IsoformsExpressed
Multi-Exonic Genes
AS GenesAS Events
G. arboreum27,59844,71122,73264839690
G. raimondii28,12341,56323,72948597617
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hao, J.; Wen, X.; Zhu, Y. A Genome-Wide Alternative Splicing Analysis of Gossypium arboreum and Gossypium raimondii During Fiber Development. Plants 2024, 13, 2816. https://doi.org/10.3390/plants13192816

AMA Style

Hao J, Wen X, Zhu Y. A Genome-Wide Alternative Splicing Analysis of Gossypium arboreum and Gossypium raimondii During Fiber Development. Plants. 2024; 13(19):2816. https://doi.org/10.3390/plants13192816

Chicago/Turabian Style

Hao, Jianfeng, Xingpeng Wen, and Yuxian Zhu. 2024. "A Genome-Wide Alternative Splicing Analysis of Gossypium arboreum and Gossypium raimondii During Fiber Development" Plants 13, no. 19: 2816. https://doi.org/10.3390/plants13192816

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop