Article

Genome-Wide Identification and Characterization of UTR-Introns of Citrus sinensis

1
College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
2
Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2020, 21(9), 3088; https://doi.org/10.3390/ijms21093088
Submission received: 10 March 2020 / Revised: 18 April 2020 / Accepted: 23 April 2020 / Published: 27 April 2020
(This article belongs to the Section Molecular Biology)

Abstract

Introns exist not only in coding sequences (CDSs) but also in untranslated regions (UTRs) of a gene. Recent studies in animals and model plants such as Arabidopsis have revealed that UTR-introns (UIs) are widely present in most genomes and are involved in the regulation of gene expression or RNA stability. In the present study, we identified introns in both the 5′UTRs (5UIs) and 3′UTRs (3UIs) of sweet orange genes, investigated their size and nucleotide distribution characteristics, and explored the distribution of cis-elements in the UI sequences. The functional categories of genes with predicted UIs were further analyzed using GO, KEGG, and PageMan enrichment. In addition, the organ-dependent splicing and abundance of selected UI-containing genes in root, leaf, and stem were experimentally determined. In total, we identified 825 5UI- and 570 3UI-containing transcripts, corresponding to 617 and 469 genes, respectively. Among them, 74 genes contain both 5UI and 3UI. Nucleotide distribution analysis showed that 5UI distribution is biased toward both ends of the 5′UTR, while 3UI distribution is biased toward the start of the 3′UTR. Cis-element analysis revealed that 5UI and 3UI sequences were rich in promoter- and enhancer-related elements, indicating that UIs might regulate gene expression through them. Functional enrichment analysis revealed that genes containing 5UI are significantly enriched in the RNA transport pathway, while genes containing 3UI are significantly enriched in the spliceosome pathway. Notably, many pentatricopeptide repeat-containing protein genes and disease resistance genes were found to contain 3UIs. RT-PCR confirmed the existence of UIs in the eight selected gene transcripts, and alternative splicing events were found in some of them. Meanwhile, qRT-PCR showed that UIs were differentially expressed among organs, and significant correlations were found between the expression of some genes and that of their UIs; for example, the expression of VPS28 and its 3UI was significantly negatively correlated. This is the first genome-wide report on UIs in sweet orange, and it provides evidence for further understanding the role of UIs in gene expression regulation.

1. Introduction

The existence of introns in genes was first discovered in the late 1970s [1]. Introns have since been recognized as a common feature of all eukaryotic genomes. In the past, an intron was regarded as a gene structure that is removed from transcripts by a complex molecular machine called the spliceosome [2,3,4]. Recently, however, intron retention (a form of alternative splicing, AS) in transcripts has also been widely identified, suggesting that introns are not only components of gDNA but may also play roles in the regulation of gene expression or even in gene function.
Introns are important for the gene expression process and are thought to go through five phases before the formation of mature mRNA: genomic intron, transcribed intron, spliced intron, excised intron and exon-junction complex (EJC)-harboring transcript [4]. The presence and removal of introns in gene transcripts affect almost every step of gene expression, including transcription and polyadenylation, mRNA export, localization, translation and attenuation [5,6]. Leon et al. [7] reported that the maize Adh1 intron 1 enhanced gene expression in the legume species by about 10-fold. The Sh1 intron 1 enhances chimeric gene expression in rice and maize protoplasts approximately 100-fold [8]. These findings revealed that introns in transcripts can have a positive effect on gene expression, i.e., intron-mediated enhancement (IME) [9,10,11].
In eukaryotes, mature mRNAs have a tripartite structure consisting of a 5′-untranslated region (5′UTR), a coding region (CDS) and a 3′-untranslated region (3′UTR). It has been widely proven that UTRs play crucial roles in the control of mRNA translation efficiency, stability and subcellular localization [12]. There are also introns in the 5′ and 3′UTRs of many protein-coding genes. The 5′UTR intron (5UI) is usually longer than the intron within the CDS and may affect gene expression, mRNA stability or mRNA export [13]. Reports have shown that, like CDS introns close to the 5′ end of a gene, introns located in the 5′UTR also often exert positive effects on gene expression by affecting the basal promoter activity [14,15,16,17,18]. In the study of Norris et al. [19], the 5UIs of Arabidopsis thaliana polyubiquitin genes (UBQ3, UBQ10, and UBQ11) were identified as quantitative determinants of chimeric gene expression. Grant et al. [16] reported that the 5UI might contribute to the high activity of the soybean polyubiquitin promoter. Sivamani and Qu [20] reported that the rice polyubiquitin gene (rubi3) promoter containing the 5UI conferred approximately 20-fold higher GUS gene expression than the intron-less promoter in bombarded rice suspension cells. Chaubet-Gigot et al. [21] demonstrated that the 5UIs of Arabidopsis replacement histone H3 genes, together with their endogenous promoters, could produce higher expression of the H3 genes. Kamo et al. [15] also revealed that transgenic Gladiolus (monocot) and Arabidopsis (dicot) plants carrying the Gladiolus polyubiquitin promoter (GUBQ1) containing the 5UI showed a higher GUS expression level than transgenic plants carrying the GUBQ1 promoter without it.
Unlike 5UI, 3UI was thought to usually downregulate gene expression levels [22,23] through a nonsense-mediated decay (NMD) pathway [24,25]. The NMD inhibition affects a much higher proportion of 3UI containing gene transcripts than non-3UI containing gene transcripts, and the most significantly enriched NMD-affected gene transcripts are those that encode RNA-binding proteins [23,26,27]. The transcripts containing 3UIs were previously thought to be nonfunctional because of their rapid degradation through NMD. Recently, however, some 3UI containing transcripts were also proved to be functional [23].
The genome-wide identification of UIs can provide information for understanding gene expression regulation and the alternative splicing network of a species at the whole-genome level. Cenik et al. [28] studied the UI distribution in the human genome and found that highly expressed genes often had a short 5UI, indicating that the 5UI in the human genome enhanced the expression of certain genes in a length-dependent manner. Hong et al. [29] analyzed the intron size, abundance, and distribution in A. thaliana, Drosophila melanogaster, human, and mouse UTRs, and found that 5UIs were approximately twice as large as introns in CDSs and 3UIs, and that there was a sharp drop in intron size at the 5′UTR-CDS boundary. Chung et al. [30] analyzed the UI distribution in A. thaliana genes and found that 5UIs from Arabidopsis were more proximal to the UTR end or closer to the transcriptional start site. Moreover, they also found that the 5UI in the EF1a-A3 gene could enhance its expression in a size-dependent manner.
Sweet orange (Citrus sinensis) is an important fruit with high nutritional and economic value. It accounts for approximately 60% of citrus production for both fresh fruit and processed juice consumption (FAO, http://faostat.fao.org/default.aspx). Its genome has been published and the UTR information was well addressed [31], which makes it suitable for the genome-wide identification of UIs. Based on the C. sinensis genome, we identified and characterized the distribution of introns, including 5UIs, 3UIs, and introns in CDSs. The presence of UIs in several genes was verified using RT-PCR, and their expression patterns in different organs were investigated using qRT-PCR. To our knowledge, this is the first report of horticultural plant UIs at the genome-wide level.

2. Results and Discussion

2.1. UTR Introns in the Citrus sinensis Genome

In total, 29,655 protein-coding genes were identified from the sweet orange genome, of which 15,823 (53.36%) contained both 5′UTR and 3′UTR, 1093 (3.69%) contained only a 5′UTR, 1585 (5.34%) contained only a 3′UTR, and 11,154 (37.61%) contained neither. In Arabidopsis, more genes were found to lack 5′UTR annotation than 3′UTR annotation [30], which is similar to the result of our present study. To compare the intron density in UTRs and CDSs, we normalized the intron density to the average number of introns per nucleotide of each gene transcript sequence. The intron density followed the order CDS > 5′UTR > 3′UTR, with values of 2.98 × 10⁻³, 1.42 × 10⁻⁴ and 6.36 × 10⁻⁵, respectively (Table 1). The intron density in 5′UTRs is ~2.2 times that in 3′UTRs and only ~4.8% of that in CDSs. The intron densities in the UTRs and CDSs of Arabidopsis gene transcripts follow the same order, but are much higher [30]. Notably, the intron density in the Arabidopsis 5′UTRs (~1.6 × 10⁻³, about 60% of CDSs) was >10 times higher than that of sweet orange, indicating that 5UI regulation of gene expression is more frequent in Arabidopsis. The high 5′UTR intron density might be due to the fact that the Arabidopsis genome is better sequenced and annotated, with fewer unknown sequences (Ns). In both sweet orange and Arabidopsis, the 3′UTR intron density is the lowest (the 3UI density is only), suggesting that plants may utilize an NMD pathway similar to that of mammals [30,32,33].
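The density comparison above can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the three reported values are assigned to the regions in the stated order CDS > 5′UTR > 3′UTR; the function name `intron_density` and the dictionary keys are illustrative and not taken from the original analysis.

```python
# Reported intron densities (introns per nucleotide). The assignment of values
# to regions is an assumption based on the stated order CDS > 5'UTR > 3'UTR.
density = {"CDS": 2.98e-3, "5UTR": 1.42e-4, "3UTR": 6.36e-5}


def intron_density(n_introns, total_nt):
    """Return introns per nucleotide for one region class."""
    if total_nt <= 0:
        raise ValueError("total_nt must be positive")
    return n_introns / total_nt


# Ratios quoted in the text: 5'UTR vs 3'UTR, and 5'UTR as a fraction of CDS.
ratio_5_to_3 = density["5UTR"] / density["3UTR"]
frac_5_of_cds = density["5UTR"] / density["CDS"]

print(f"5'UTR / 3'UTR density: {ratio_5_to_3:.2f}")
print(f"5'UTR as share of CDS density: {frac_5_of_cds:.1%}")
```

With these inputs the two ratios come out at roughly 2.2 and 4.8%, matching the figures given in the paragraph.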
A total of 965 5UIs and 745 3UIs were identified from 825 5′UTR- and 570 3′UTR-containing gene transcripts, corresponding to 617 and 469 genes, respectively (Supplemental Materials Table S1). Among these UI-containing gene transcripts (UI-Ts), 77 (74 genes) were found to contain both 5UI and 3UI. Around 86.43% of the 5UI-Ts and 78.60% of the 3UI-Ts contain only one 5UI or 3UI (Table 2). The proportion of UI-Ts with two or more UIs dropped greatly, which is consistent with previous reports in other organisms [28,29,34]. Only 10.91% of the 5UI-Ts contain two 5UIs, and 15.79% of the 3UI-Ts contain two 3UIs. The percentages of UI-Ts containing more than three 5UIs or 3UIs were both less than 4%. Notably, we also identified some UI-Ts with several UIs. For example, an A20/AN1-like zinc finger family protein gene (Cs7g06380.1) and an unknown gene (orange1.1t06039.1) had five 5UIs; a zinc finger protein gene (Cs2g17870.1) and another unknown gene (Cs5g25765.1) had four 5UIs; two NB-ARC domain-containing disease resistance protein genes (Cs1g15550.1, Cs1g17000.1) had six 3UIs; four pentatricopeptide repeat (PPR)-containing protein genes (Cs2g11780.1, orange1.1t01460.1, orange1.1t03486.1,) and an NB-ARC domain-containing disease resistance protein gene (Cs1g16990.1) had five 3UIs; and eight unknown genes (Cs2g08495.1, Cs3g12715.1, orange1.1t05956.1, orange1.1t03536.1), one S-phase kinase-associated protein 1 gene (Cs3g10260.1), one pentatricopeptide repeat (PPR)-containing protein gene (Cs5g26550.1), one NB-ARC domain-containing disease resistance protein gene (Cs5g28645.2) and one tetratricopeptide repeat (TPR)-like superfamily protein gene (Cs7g15390.1) had four 3UIs. Interestingly, by searching the expression data of sweet orange genes in the genome database, we found that UI-Ts containing multiple 3UIs tend to show low expression levels (average FPKM < 4).
We further mapped the UI-Ts to the sweet orange chromosomes (Figure 1). The results showed that the UI-Ts were unevenly distributed across the sweet orange chromosomes. Chromosome 2 showed the highest 5UI-T proportion (16.79%) and the highest density (5.26 × 10⁻⁶), while chromosome 1 showed the highest 3UI-T proportion (13.83%) and the highest density (3.58 × 10⁻⁶) (Table 3). Chromosome 9 contained the fewest 5UI-Ts and 3UI-Ts, accounting for 4.96% and 3.62%, respectively (Table 3). The 5UI-T densities in chromosomes 2, 3, 4, 5, 6, 7, and 9 were all higher than the 3UI-T densities. Notably, the 5UI-T density is about 2.75 times higher than the 3UI-T density in chromosome 2, and 2.57 times higher in chromosome Moreover, the UI-T distribution within the same chromosome is also uneven. For example, significantly more 5UI-Ts were found in the 3′ ends of chromosomes 3, 5, and 6 and in the 5′ end of chromosome 4, and significantly more 3UI-Ts in the 3′ ends of chromosomes 1, 2, 5, and 8.
We further calculated the UI distribution in the sweet orange chromosomes. As with the UI-T distribution, the highest 5UI density was found in chromosome 2 and the highest 3UI density in chromosome 1, and the lowest 3UI density (2.38 × 10⁻⁶) was found in chromosome 9 (Table 3). The 5UI densities in chromosome 2 and chromosome 6 were both ~2 times the 3UI density. However, unlike the 5UI-T distribution, the lowest 5UI density was found in chromosome 8, which might be due to differences in chromosome length.
Intron number and size distribution analysis showed that the number and length of introns within 5′UTRs, CDSs and 3′UTRs varied (Figure 2A). The 5UIs and 3UIs have very similar average lengths (5′UTR: n = 965, mean = 587.5 nucleotides, median = 450 nucleotides, LQ = 168 nucleotides, UQ = 836 nucleotides, SD = 548.3 nucleotides; 3′UTR: n = 745, mean = 563.5 nucleotides, median = 335 nucleotides, LQ = 139 nucleotides, UQ = 730 nucleotides, SD = 979.6 nucleotides). Both average lengths were greater than that of the CDS introns (n = 148,831, mean = 343.2 nucleotides, median = 171 nucleotides, LQ = 102 nucleotides, UQ = 441 nucleotides, SD = 454.9 nucleotides).
Comparison of the intron size distributions within 5′UTRs, 3′UTRs, and CDSs shows that short introns make up a large share of all three classes, which is similar to the result in Arabidopsis [30]. Very short introns (<50 nucleotides) were most abundant in the 3′UTRs, followed by the 5′UTRs, and were very rare in CDSs. The relative frequency of introns ranging from 100 to 300 nucleotides in length was significantly higher in CDSs than in 5UIs and 3UIs. Introns longer than 300 nucleotides were more frequent among UIs than among CDS introns, indicating that CDS intron length is more conserved.
Figure 2B,C respectively represent the distribution of intron positions within the 5′UTRs and 3′UTRs, relative to the beginning and end of the corresponding UTRs. It appears that 5UIs are preferentially located close to the ends of 5′UTRs (near the translation initiation site of genes), although the preference for locations close to the beginning of the 5′UTR is also relatively high. The splicing-dependent mRNA-protein complex (mRNP) component, which has a positive influence on rapid export and translation of newly synthesized mRNA, is also deposited as close as possible to the 5′ end of the mRNA [35]. Studies have also shown that a 5UI leads to a large accumulation of EJCs, which interact with the translation initiation site and result in IME [36,37]. The proximity of 5UIs to the end of the 5′UTR (which is also close to the translation start site and the 5′ end of the mRNA) is consistent with the regulatory role of 5UIs in gene translation [30].
It was reported that an EJC downstream of the stop codon should persist and stimulate NMD [38]. In our present study, we found that 3UIs are more frequently located at the beginning of 3′UTRs, i.e., close to the stop codon. The proximity of a 3UI to the stop codon might cause the stop codon to be recognized as premature, thereby triggering NMD [39].
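The summary statistics reported above (n, mean, median, LQ, UQ, SD) can be reproduced for any list of intron lengths with the standard library. This is only a sketch: the toy lengths are invented for illustration, and the quartile method and the use of the sample standard deviation are assumptions, since the text does not state which conventions were used.

```python
import statistics


def summarize(lengths):
    """Summarize intron lengths (in nucleotides) in the style of the text."""
    # statistics.quantiles with n=4 returns the three quartile cut points;
    # its default "exclusive" method is an assumption about the convention.
    lq, _, uq = statistics.quantiles(lengths, n=4)
    return {
        "n": len(lengths),
        "mean": statistics.mean(lengths),
        "median": statistics.median(lengths),
        "LQ": lq,
        "UQ": uq,
        "SD": statistics.stdev(lengths),  # sample SD (assumption)
    }


# Hypothetical intron lengths, not data from the study.
toy_lengths = [80, 120, 200, 450, 900, 1500]
summary = summarize(toy_lengths)
for key, value in summary.items():
    print(f"{key}: {value:.1f}")
```

A right-skewed length distribution, as in the toy data, gives a mean well above the median, which is the same pattern seen in the reported 5UI, 3UI and CDS intron values.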
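One way to express the position of an intron within a UTR is as the fraction of spliced UTR sequence that lies upstream of the intron. The sketch below assumes this definition, which may differ from the one used for Figure 2B,C; the function name and the coordinates are hypothetical, and the segments are taken to be half-open intervals listed in transcript orientation.

```python
def intron_relative_positions(utr_segments):
    """Return the relative position (0 to 1) of each intron in a UTR.

    utr_segments: exonic pieces of one UTR as (start, end) half-open
    intervals in transcript order. Intron i lies between piece i and
    piece i + 1. A position near 0 is close to the UTR start and a
    position near 1 is close to the UTR end.
    """
    lengths = [end - start for start, end in utr_segments]
    if any(length <= 0 for length in lengths):
        raise ValueError("every UTR segment must have positive length")
    total = sum(lengths)
    positions = []
    upstream = 0
    for seg_len in lengths[:-1]:  # no intron follows the last piece
        upstream += seg_len
        positions.append(upstream / total)
    return positions


# Hypothetical 3'UTR: 20 nt of exon, one intron, then 80 nt of exon.
print(intron_relative_positions([(0, 20), (520, 600)]))
```

Under this definition, a 3UI close to the stop codon gets a value near 0, and a 5UI close to the translation initiation site gets a value near 1.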
Intron removal is influenced by many splicing signals and factors [40]. The splice sites (SS) at the exon-intron junctions, including the 5′ donor site and the 3′ acceptor site, are usually conserved. However, in some instances the SS alternate, resulting in AS events [41]. By comparing the nucleotide preferences surrounding the splicing junctions using sequence logos [42], we identified the nucleotide bias around the donor and acceptor sites of 5′UTR, CDS and 3′UTR introns. Figure 3A,B respectively show the aggregation of nucleotides around the splice donor (GT) and splice acceptor (AG) junctions in 5′UTRs and 3′UTRs. About 97.92% of the 5UIs had the splice site pair GT-AG, followed by GC-AG (1.45%), AT-AC (0.20%), CT-AC (0.20%), CT-GG (0.10%), and TG-AT (0.10%). Of the 3UIs, 97.18% had the splice site pair GT-AG, followed by GC-AG (2.14%), AT-AC (0.26%), GT-TG (0.13%), TA-AG (0.13%), and TT-AG (0.13%). The splice site pair categories of sweet orange UIs were similar to the findings in Arabidopsis and mammalian genomes [30,43]. The sequence logos showed that both the UTR introns and the CDS introns exhibit an A/T-rich element at both donor sites and acceptor sites. The early recognition of introns is thought to be mediated by UA-binding proteins [40]. Thus, it was suspected that the A/T-rich element might contribute to intron recognition. It was reported that the U-rich sequence in the 5UI can bind to an RNA-binding protein, which can promote the translation initiation of uAUG by interacting with transcription initiation factors [44,45,46,47]. This suggests that the A/T-rich element in the 5UI contributes to gene translation initiation. Additionally, Chung et al. [30] identified a significant C-rich region near the donor site of Arabidopsis 5UIs, which was predicted to be necessary for the spliceosomal recognition of introns within non-coding sequences. Although a C-rich region (ranging from +5 to +25 bases after the intron start) can also be found in sweet orange 5UIs, the frequency of C is much lower.
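The splice site pair percentages above come from tallying the first and last two nucleotides of each intron. A minimal sketch of such a tally follows; the function names and toy sequences are illustrative, and the sequences are assumed to be given on the transcribed strand.

```python
from collections import Counter


def splice_pair(intron_seq):
    """Return the donor-acceptor dinucleotide pair, e.g. 'GT-AG'."""
    seq = intron_seq.upper()
    if len(seq) < 4:
        raise ValueError("intron too short to hold both splice sites")
    return f"{seq[:2]}-{seq[-2:]}"


def pair_frequencies(introns):
    """Return the percentage of introns carrying each splice site pair."""
    counts = Counter(splice_pair(seq) for seq in introns)
    total = sum(counts.values())
    # most_common() orders pairs from most to least frequent
    return {pair: 100.0 * n / total for pair, n in counts.most_common()}


# Hypothetical intron sequences, not taken from the sweet orange genome.
toy_introns = ["GTAAGTTTTCAG", "GTATGCTTGCAG", "GCAAGTTTTTAG", "gtaagatttcag"]
print(pair_frequencies(toy_introns))
```

Applied to the 965 5UIs and 745 3UIs, a tally of this kind would yield the GT-AG, GC-AG and rarer pair percentages listed in the text.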

2.2. Functional Implications of Genes with UTR Introns

To explore the functional characteristics of 5UI-Ts and 3UI-Ts, GO enrichment analysis was performed. In terms of biological processes, 107 and 7 GO terms were significantly enriched for 5UI-Ts and 3UI-Ts, respectively (Supplemental Materials Tables S2 and S3). For the 5UI-Ts, genes involved in “translation”, “peptide biosynthetic process”, “metabolic process”, and “gene expression” accounted for a large share, while 3UI-Ts were mainly involved in “defense response”, “response to stress”, “response to stimulus”, “histone methylation”, “peptidyl-lysine methylation”, “histone lysine methylation”, and “peptidyl-lysine modification”. In terms of cellular component, 41 GO terms, including “organelle”, “ribosome”, “intracellular ribonucleoprotein complex”, “intracellular part”, “cell part”, and “cell”, were significantly enriched for 5UI-Ts. For the 3UI-Ts, just 4 terms, i.e., “exocyst”, “cell cortex”, “cell cortex part”, and “cytoplasmic region”, were significantly enriched. In terms of molecular function, 45 and 12 GO terms were significantly enriched for 5UI-Ts and 3UI-Ts, respectively. For the 5UI-Ts, genes were mainly involved in “protein binding”, “zinc ion binding”, “methionine adenosyltransferase activity”, “glycylpeptide N-tetradecanoyltransferase activity”, “ribonuclease inhibitor activity”, “myristoyltransferase activity”, “structural constituent of ribosome”, “translation factor activity”, “RNA binding”, and so on. For the 3UI-Ts, genes were found to be involved in “ADP binding”, “protein binding”, “binding”, “histone binding” and “translation factor activity, RNA binding”, and so on. The GO enrichment results indicated that the 5UI-Ts were highly correlated with gene expression, while many 3UI-Ts were mainly involved in stress responses. Moreover, the main functions of 5UI-Ts and 3UI-Ts differed, and their roles in different cell parts varied.
Using KEGG enrichment analysis, we investigated the pathway enrichment of both 5UI-Ts and 3UI-Ts (Supplemental Materials Tables S4 and S5). It has been confirmed that the presence or absence of a 5UI can determine the mRNA export mechanism [23,48]. Consistently, the only significantly enriched pathway for 5UI-Ts is “RNA transport”. Genes such as translation initiation factor protein EIF1 (Cs4g10310.1, Cs4g10310.2), EIF1A (Cs4g10310.1, Cs4g10310.2) and EIF5 (Cs3g18950.1, Cs6g18000.1, Cs6g18000.2), protein transport proteins (SEC13, Cs1g15600.1 and Cs2g28780.1), nonsense-mediated mRNA decay protein 3 (NMD3, Cs6g17980.1), and Ran GTPase-activating protein 1 (RANGAP1) (Cs9g06440.1, Cs9g06440.2, and Cs9g06440.3) were identified as RNA transport related. Notably, EIF1 and EIF5 have been revealed to function in stimulating the assembly of the translation initiation complex by interacting with the 40S ribosomal subunit [49]. SEC13, as a component of the nuclear pore complex (NPC) and the COPII coat, has been shown to be required for efficient mRNA export from the nucleus to the cytoplasm and for correct nuclear pore biogenesis and distribution [50]. NMD3, however, was found to be involved in NMD of mRNAs containing premature stop codons [51]. RANGAP1 converts cytoplasmic GTP-bound RAN to GDP-bound RAN, which is essential for RAN-mediated nuclear import and export [52]. From these reports, it can be concluded that these RNA transport related genes are all involved in gene expression regulation. The existence of UIs in these genes indicates that UIs play roles in regulating gene expression.
For 3UI-Ts, only the “spliceosome” pathway showed significant enrichment in the KEGG pathway enrichment analysis. The significant enrichment of spliceosome related genes was consistent with the fact that 3UI-Ts would be rapidly degraded through NMD [23,24,25]. The spliceosome related 3UI-Ts included one small nuclear ribonucleoprotein B and B’ gene transcript (SNRPB, Cs2g10375.1), four heterogeneous nuclear ribonucleoprotein A1/A3 gene transcripts (HNRNPA1, Cs6g16060.1, Cs7g25330.1, Cs7g25330.2 and Cs7g25330.3), one ATP-dependent RNA helicase DDX46/PRP5 gene transcript (DDX46/PRP5, orange1.1t03258.1), two U4/U6.U5 tri-snRNP-associated protein 3 gene transcripts (SNRNP27, Cs4g03590.1, and Cs4g03590.2), and two pre-mRNA-splicing factor gene transcripts (ISY1, Cs2g25120.1 and Cs2g25120.2). Their roles in pre-mRNA splicing have been well demonstrated [52,53,54,55,56,57], which supports the function of 3UIs in regulating gene expression.
We further applied PageMan to show the pathway enrichment using the genes corresponding to the 5UI-Ts, the 3UI-Ts and the UI-Ts with both 5UI and 3UI (617, 469 and 74 genes, respectively). PageMan enrichment analysis showed that, for both the 5UI- and 3UI-containing genes, most genes were categorized into the “not assigned” pathway (Supplemental Materials Tables S6 and S7).
For the 5UI-containing genes, 30 or more genes were found to be involved in each of “protein” (92/617, 14.91%), “RNA” (86/617, 13.94%), “RNA. regulation of transcription” (76/617, 12.32%), “protein. degradation” (50/617, 8.10%), “protein. degradation. ubiquitin” (46/617, 7.46%), “protein. degradation. ubiquitin. E3” (38/617, 6.16%) and “signalling” (30/617, 4.86%) (Table 4). Studies have indicated that the 5UI of AtMHX may make a special contribution to translation efficiency [58]. A promoter with a 5UI can confer higher gene expression and product accumulation than a promoter without it [14,15,16]. The PageMan pathway enrichment result shows that the 5UI-containing genes are mainly involved in gene expression, including both gene transcription and gene translation, suggesting that 5UIs function at both the transcriptional and post-transcriptional levels.
For the 3UI-containing genes, however, pathways involving more than 30 genes include “not assigned. no ontology. pentatricopeptide (PPR) repeat-containing protein” (61/469, 13.01%), “stress” (44/469, 9.38%), “stress. biotic” (41/469, 8.74%), “RNA” (39/469, 8.32%), “stress. biotic. PR-proteins” (37/469, 7.89%), “protein” (31/469, 6.61%) and “RNA. regulation of transcription” (31/469, 6.61%). Notably, among these 3UI-containing genes, 61 (13.01%) were pentatricopeptide repeat-containing protein (PPRP) genes (Table 4). Proteins containing PPR motifs play a role in transcription, RNA processing, splicing, stability, editing, and translation [59]. PPRP is thought to be the main mediator of post-transcriptional regulation of organelles [60]. Most PPRPs are localized in mitochondria or chloroplasts. Dahan and Mireau [61] reported that mitochondrial PPRPs act by preventing translation or accumulation of mitochondrial transcripts, and their gene products can induce cytoplasmic male sterility mutants to embryonic developmental defects and cytoplasmic male sterility. In addition, PPRPs have been identified to play a role in translational and post-translational processes in response to biotic and abiotic stresses [62], suggesting that 3UIs play important roles in regulating the expression of stress responsive genes. Consistently, 41 (8.74%) of the 3UI-containing genes were disease resistance (R) genes, and some R genes (such as Cs1g15550.1, Cs1g17000.1 and Cs1g16990.1) were found to contain multiple introns. Plant R genes are involved in pathogen recognition and subsequent activation of innate immune responses [63]. The abundance of disease resistance genes again supports the regulatory role of 3UIs in the expression of stress responsive genes.
The genes containing both 5UI and 3UI were mainly involved in “RNA” (12/74, 16.22%), “RNA. regulation of transcription” (10/74, 13.51%), “protein” (9/74, 12.16%), “not assigned. no ontology. Pentatricopeptide repeat (PPR)-containing protein” (7/74, 9.46%) and “signaling” (5/74, 6.76%) (Supplemental Materials Table S8).

2.3. UTR Introns and Transcriptional Enhancers

Transcriptional enhancers have been identified in intron sequences by computational methods [64,65,66]. Cenik et al. [28] found that genes with regulatory roles are particularly enriched with 5UIs. Therefore, UIs can be considered important cis-regulatory elements that regulate gene expression at multiple levels. In this study, we predicted the cis-acting elements in UI sequences. In total, 39,893 and 28,702 cis-acting elements were identified in 5UIs and 3UIs, respectively. Among these cis-acting elements, “core promoter elements around -30 of transcription start” and “common cis-acting element in promoter and enhancer regions” make up the largest part, and more than 88% of 5UIs and 3UIs contain these elements.
It has been found that introns present in the 5′UTR often lead to increased expression of transgenes [64,65,66,67]. Grant et al. [16] found that the synthetic 5′ UTR intron fragments of the Glycine max polyubiquitin (Gmubi) gene placed downstream of the 35S core promoter could enhance the expression of transgene, suggesting that these fragments can function as promoter regulatory elements and contribute to increased expression. The existence of promoter elements in UIs suggested that UIs function similarly to promoters in regulating gene expression. Besides, “light responsive element”, “cis-acting regulatory elements essential for the anaerobic induction”, and many phytohormone responsive elements were also widely found in UI sequences (Supplemental data Tables S9 and S10), suggesting that UIs might function in the light or phytohormone responses.

2.4. Experimental Validation of 5UI and 3UI Splicing

By using leaf gDNA and cDNAs of root, leaf and stem as templates, PCR reactions were performed to verify the UI existence in UTRs. The results proved the existence of introns in the UTRs and intron retention events were found in some UI-Ts (Figure 4).
Two 3UIs, one 5UI and one 3UI were annotated in the PPR gene (Cs6g01290.1) 3′UTR, the VPS28 gene (Cs2g06750.1) 5′UTR, and the R gene (Cs5g21990.1) 3′UTR, respectively. Our RT-PCR result showed that the cDNA sequences did not contain these introns in any of the three organs, which means they were all removed during the formation of mature mRNA. For the LTP gene (Cs5g09070.2), which contains one 3UI, intron retention was found in all the organs. Moreover, one 5UI, one 5UI and one 3UI were annotated in the PPR gene (Cs6g01290.1) 5′UTR, the DUF247 gene (Cs2g24990.1) 5′UTR, and the GRAS gene (Cs8g18700.1) 3′UTR, respectively. However, RT-PCR amplified two bands from cDNA. Of the two bands, one was the same as the gDNA sequence and the other was shorter than the gDNA sequence. The length of the removed sequence was the same as the length of the UI. This indicates that, for these three UTRs, intron retention occurred in some gene transcripts.
The VPS28 gene also contained one 3UI. According to the RT-PCR result, the 3′UTR amplified from root cDNA was the same length as the gDNA sequence. The 3′UTR amplified from stem cDNA, however, was shorter than the gDNA, and the missing length was the same as the length of the 3UI. The result indicates that splicing of the VPS28 3UI differed between these two organs. Moreover, no band was amplified from the leaf cDNA, suggesting that the expression of the VPS28 3′UTR in leaf was so low that it could not be successfully amplified by RT-PCR.
Two 5UIs, one of 231 bp and the other of 168 bp, were annotated in the EIN3 gene (Cs2g29100.1). Using RT-PCR, we amplified three bands from the cDNAs of the three organs. The largest band was the same length as the gDNA; the middle band was shorter than the gDNA by the length of the small intron, while the smallest band was shorter by the length of the large intron.
There are three 3UIs annotated in the TPR gene (Cs8g15200.1), with lengths of 1006 bp, 208 bp and 158 bp, respectively. Using RT-PCR, we amplified two target bands: one was 1164 bp (the length of the largest 3UI plus the smallest 3UI) shorter than the gDNA, and the other was 1372 bp (the total length of all three 3UIs) shorter than the gDNA.
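The band-size reasoning used for EIN3 and TPR can be made explicit by listing how much shorter than the gDNA amplicon a cDNA amplicon becomes for every combination of spliced-out introns. The sketch below uses the intron lengths given in the text; the labels and the function name are illustrative only.

```python
from itertools import combinations


def spliced_size_shifts(intron_lengths):
    """Map each non-empty set of spliced-out introns to the bp lost vs gDNA."""
    shifts = {}
    names = sorted(intron_lengths)
    for k in range(1, len(names) + 1):
        for combo in combinations(names, k):
            shifts[frozenset(combo)] = sum(intron_lengths[n] for n in combo)
    return shifts


# Intron lengths (bp) from the text; the labels are hypothetical.
tpr_3uis = {"1006bp": 1006, "208bp": 208, "158bp": 158}
ein3_5uis = {"231bp": 231, "168bp": 168}

tpr_shifts = spliced_size_shifts(tpr_3uis)
ein3_shifts = spliced_size_shifts(ein3_5uis)

for removed, shift in sorted(tpr_shifts.items(), key=lambda kv: kv[1]):
    print(f"spliced {sorted(removed)}: {shift} bp shorter than gDNA")
```

For TPR, the two observed bands correspond to the combinations that remove 1164 bp and 1372 bp; the other five combinations were not reported as bands.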
To further study the expression of UIs in different organs and to investigate the correlation between the expression of UIs and their corresponding genes, qRT-PCR analysis of the 8 selected genes and all the UIs annotated in them in three sweet orange organs, i.e., root, stem and leaf, was performed (Figure 5). The qRT-PCR results revealed expression of all the UIs in all the three organs, which is different from the RT-PCR results. This might be caused by the sensitivity difference between the two techniques.
PPR gene contained one 5UI and two 3UIs, the gene’s expression in leaf was significantly higher than in the root, the expression of 3UI-1 in leaf and stem was very significantly lower than in the root, the expression of 3UI-2 in stem was significantly lower than in the root. VPS28 gene contained one 5UI and 3UI, the gene’s expression in leaf was significantly higher than in the root, while the expression of 3UI in leaf is significantly lower than in the root. DUF247 gene contains one 5UI, the gene’s expression in leaf and stem was respectively very significantly and significantly higher than in the root, while the 5UI expression in stem is very significantly lower than in the root. EIN3 gene contains two 5UIs, the gene’s expression in leaf is very significantly higher than in the root. The expression of 5UI-1in both leaf and stem were significantly higher than in the root. The expression of 5UI-2 in leaf was significantly higher than in the root. LTP gene contains one 3UI, the gene’s expression in leaf was significantly lower than in the root, and the 3UI’s expression significantly lower than in the root. GRAS gene contains one 3UI, the expression of the gene and 3UI showed no significant difference among the three organs. The R gene contains one 3UI, the gene’s expression showed no significant difference among the three organs, but the expression of its 3UI in leaf and stem was significantly higher than in the root. TPR gene contains three 3UIs, the gene’s expression in leaf and stem was significantly higher than in the root, and the expression of 3UI-1 in leaves significantly higher than in the root, but other two UIs showed no significant difference in the three organs.
It has been reported that 5UIs function in enhancing gene expression, whereas 3UIs usually lead to degradation of the mRNA of the corresponding gene [13,68,69,70,71]. Consistent with this, we found that the expression of VPS28 and that of its 3UI were significantly negatively correlated. Although no significant correlation was identified, the correlation coefficients between the expression of EIN3 and its two 5UIs were positive. However, we also found that the expression of the R gene and that of its 3UI were significantly positively correlated. In addition, the correlation coefficients between the expression of LTP and its 3UI and between TPR and its three 3UIs were positive, whereas those between PPR and its 5UI, VPS28 and its 5UI, and DUF247 and its 5UI were negative. This might be due to the complex nature of gene expression regulation, which involves many sequence elements, noncoding RNAs and other factors [72,73,74]. Thus, the detailed regulatory role of UIs in gene expression needs further experimental study.
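As an illustration of how a correlation coefficient between the expression of a gene and that of its UI can be computed, the following is a minimal sketch. It assumes a Pearson correlation, which the text does not specify, and the expression values are hypothetical placeholders rather than measured data. The function name `pearson_r` is also an assumption.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares of the centered values
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)

# Hypothetical relative expression values (root, stem, leaf); NOT measured data
gene_expr = [1.0, 1.4, 2.6]
ui_expr = [1.0, 0.7, 0.3]

r = pearson_r(gene_expr, ui_expr)
print(f"r = {r:.3f}")  # a negative r means the gene and its UI vary in opposite directions
```

With only three organs per gene, such a coefficient rests on very few points, so the accompanying significance test matters more than the sign alone.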

3. Materials and Methods

3.1. Genome-Wide Identification of Sweet Orange UIs

Genome data of C. sinensis were downloaded from the Citrus sinensis annotation project (http://citrus.hzau.edu.cn/orange/download/index.php/) [31,75]. Based on the genome annotation data, introns in CDSs, 5′UTRs and 3′UTRs were extracted. Because many genes undergo alternative splicing, intron retention in any transcript was excluded in order to identify all the UTR introns. Statistical analysis of UI density, position preference, length and nucleotide composition was performed using Perl, and figures were drawn using ggplot2.
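The extraction step described above can be sketched as follows. This is not the Perl code used in the study. It is a minimal Python illustration that assumes the exon coordinates and the CDS boundaries of one transcript have already been parsed from the annotation file. The function name `classify_introns`, its parameters and the example coordinates are hypothetical, and the intron-retention filter is not implemented here.

```python
def classify_introns(exons, cds_start, cds_end, strand="+"):
    """Classify the introns of one transcript as 5UI, CDS intron, or 3UI.

    exons: list of (start, end) genomic coordinates, 1-based and inclusive.
    cds_start, cds_end: leftmost and rightmost genomic coordinates of the CDS.
    strand: "+" or "-"; on the minus strand the 5' and 3' sides are swapped.
    """
    exons = sorted(exons)
    result = []
    # An intron is the gap between two consecutive exons
    for (_, left_end), (right_start, _) in zip(exons, exons[1:]):
        intron = (left_end + 1, right_start - 1)
        if intron[1] < cds_start:
            # Intron lies entirely to the left of the CDS
            kind = "5UI" if strand == "+" else "3UI"
        elif intron[0] > cds_end:
            # Intron lies entirely to the right of the CDS
            kind = "3UI" if strand == "+" else "5UI"
        else:
            kind = "CDS"
        result.append((intron, kind))
    return result

# Hypothetical transcript with four exons and a CDS spanning 350..550
example_exons = [(100, 200), (300, 400), (500, 600), (700, 800)]
plus = classify_introns(example_exons, 350, 550, strand="+")
minus = classify_introns(example_exons, 350, 550, strand="-")
print(plus)   # [((201, 299), '5UI'), ((401, 499), 'CDS'), ((601, 699), '3UI')]
print(minus)  # [((201, 299), '3UI'), ((401, 499), 'CDS'), ((601, 699), '5UI')]
```

Applying such a function to every annotated transcript and tallying the results per chromosome would give the counts needed for the density and position statistics.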

3.2. Enrichment Analysis of Genes Containing UIs

To characterize the functions of the genes containing UIs, we conducted GO and KEGG pathway enrichment analyses of the 5UI-containing and 3UI-containing genes using the Dynamic GO Enrichment Analysis online tool (https://www.omicshare.com/tools/Home/Report/goenrich) and the Dynamic KEGG Enrichment Analysis online tool (https://www.omicshare.com/tools/Home/report/koenrich), respectively. Moreover, to better display the pathway enrichment results, MapMan and its embedded PageMan software were used [76].
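The statistical test applied by the online enrichment tools is not stated here. A common choice for GO and KEGG enrichment is the one-sided hypergeometric test, and the sketch below illustrates that test under this assumption. The function name `hypergeom_enrichment_p` is hypothetical, and the background size, pathway size and hit count in the example are placeholders. Only the figure of 617 5UI-containing genes comes from this study.

```python
from math import comb

def hypergeom_enrichment_p(total_genes, term_genes, sample_size, sample_hits):
    """One-sided enrichment p-value P(X >= sample_hits).

    X follows a hypergeometric distribution: sample_size genes are drawn
    without replacement from total_genes, of which term_genes carry the
    annotation (GO term or KEGG pathway) being tested.
    """
    upper = min(term_genes, sample_size)
    denom = comb(total_genes, sample_size)
    tail = sum(
        comb(term_genes, k) * comb(total_genes - term_genes, sample_size - k)
        for k in range(sample_hits, upper + 1)
    )
    return tail / denom

# Placeholder numbers: 20000 background genes, 100 genes in the pathway,
# 617 genes in the UI-containing set, 12 of which fall in the pathway
p_value = hypergeom_enrichment_p(20000, 100, 617, 12)
print(f"p = {p_value:.3g}")
```

In practice the p-values of all tested terms would also be corrected for multiple testing before enriched terms are called.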

3.3. Cis-Acting Element Prediction of UI Sequences

Intron regulatory elements play important roles in regulating gene expression [16]. In this study, the cis-acting elements in UIs were analyzed and retrieved using PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) to show the distribution of cis-acting elements in UI sequences.

3.4. UI Structure Verification

To verify the UI structure (existence and expression pattern) in the identified UI-containing genes, eight representative genes, including a Pentatricopeptide repeat superfamily protein (PPR, Cs6g01290.1), a Vacuolar protein sorting-associated 28 gene (VPS28, Cs2g06750.1), an ethylene-insensitive3 (EIN3, Cs2g29100.1), a domain of unknown function 247 gene (DUF247, Cs2g24990.1), a GRAS transcription factor (GRAS, Cs8g18700.1), a Tetratricopeptide repeat-like superfamily protein (TPR, Cs8g15200.1), a Disease resistance protein (R, Cs5g21990.1), and a Lipid transfer protein (LTP, Cs5g09070.2), were selected and subjected to RT-PCR and qRT-PCR analysis. Among these genes, EIN3 and DUF247 were predicted to contain 5UIs; GRAS, TPR, R, and LTP were predicted to contain only 3UIs; and PPR and VPS28 were predicted to contain both 5UIs and 3UIs.
Leaf, stem and root samples of four-month-old Valencia sweet orange (Citrus sinensis cv. Valencia) were collected, pre-cooled in liquid nitrogen and then stored at −80 °C for subsequent gDNA and RNA extraction. For DNA isolation, the E.Z.N.A.® HP Plant DNA kit (Omega, Norcross, GA, USA) was used. For total RNA isolation, the total RNA extraction kit (TIANGEN, Beijing, China) was used. After RNA integrity and quality were checked using 1% agarose gel electrophoresis and spectrophotometry, high-quality RNA samples were reverse-transcribed into cDNA using the First Strand cDNA Synthesis Kit (Thermo Scientific, Wilmington, DE, USA). The results of the validation showed that none of the UIs of some genes were successfully cloned. The extracted gDNA and the reverse-transcribed cDNA of different tissues were used as templates for RT-PCR verification. Primers used for UI structure verification were designed according to the UTR sequences of each UI-containing gene transcript so as to amplify sequences containing all the possible UIs. Primer information is listed in Supplemental Materials Table S1.
To study the expression level of UIs in different organs of sweet orange, qRT-PCR was performed on the UIs of the eight selected genes using the same RNA as for UI verification. cDNA for qRT-PCR was synthesized from the RNA samples using the Hifair® Ⅱ 1st Strand cDNA Synthesis SuperMix kit. Reactions were carried out on the LightCycler 480 Real-time PCR system in a final volume of 20 µL containing 1 µL of cDNA, 10 µL of Hieff® qPCR SYBR® Green Master Mix (No Rox/Low Rox/High Rox), 0.8 µL each of the forward and the reverse primers (2 µM), and 7.4 µL of sterile water. The thermocycler was programmed as follows: 95 °C for 5 min, followed by 40 cycles of 95 °C for 10 s, 54–58 °C for 20 s and 72 °C for 20 s. Relative expression was calculated by the 2^(−ΔΔCt) method [77] with the β-actin gene as internal control [78]. Data processing and differential significance analysis were performed using Excel 2016 and SPSS 17. Information on the primers used for qRT-PCR analysis is also shown in Table S11.
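The relative quantification step can be illustrated with a short sketch of the 2^(−ΔΔCt) calculation [77]. The Ct values below are hypothetical placeholders, the function name `relative_expression` is an assumption, and treating root as the calibrator sample is also an assumption made only for this example.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_calibrator, ct_ref_calibrator):
    """Relative expression by the 2^(-ddCt) method.

    dCt  = Ct(target) - Ct(reference gene), computed for each sample
    ddCt = dCt(sample of interest) - dCt(calibrator sample)
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_calibrator = ct_target_calibrator - ct_ref_calibrator
    return 2 ** -(delta_sample - delta_calibrator)

# Hypothetical Ct values; the reference gene stands in for beta-actin
leaf_vs_root = relative_expression(
    ct_target_sample=24.0, ct_ref_sample=18.0,          # leaf
    ct_target_calibrator=26.0, ct_ref_calibrator=18.0,  # root (calibrator)
)
print(leaf_vs_root)  # 4.0: dCt is 6 in leaf and 8 in root, so ddCt = -2
```

The method assumes that the target and the reference amplify with similar, near-100% efficiency, so a twofold change corresponds to one cycle.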

4. Conclusions

In this study, we identified a total of 965 5UI sequences and 745 3UI sequences from 825 5UI-Ts (corresponding to 617 genes) and 570 3UI-Ts (469 genes). Among these UI-Ts, 77 (74 genes) contain both 5UI and 3UI. The density of 5UIs and 3UIs was highest in chromosome 2 and chromosome 1, respectively, and were both the lowest in chromosome On chromosome 2 and chromosome 6, the density of 5UIs was found to be significantly greater than that of 3UIs. The average lengths of 5UIs and 3UIs were similar, and both were larger than that of CDS introns. 5UIs were biased towards both ends of the 5′UTR, especially the end proximal to the start codon, and 3UIs were biased towards the start of the 3′UTR. Both the UTR introns and the CDS introns are A/T-rich, a feature considered to favor efficient intron splicing [40,79]. Function enrichment analysis revealed that genes containing 5UIs were significantly enriched in the RNA transport pathway, which supports a role for 5UIs in mRNA export [23]. Genes containing 3UIs were significantly enriched in the spliceosome pathway, suggesting that 3UIs are widely involved in the splicing of pre-mRNA or in NMD [69,80,81]. Notably, many PPRPs and R genes were identified as UI-containing, suggesting that UIs may contribute to the expression regulation of these two gene families and play roles in plant responses to biotic and abiotic stresses during translation and post-translational processes [62,82,83]. The regulatory role of UIs in the expression of genes containing UIs, especially the R genes, can be further studied. Moreover, many promoter-enhancing related elements were identified in the UIs. The existence of these sequences suggests that UIs have regulatory roles in gene expression. Additionally, the expression of UIs in gene transcripts was confirmed using RT-PCR and qRT-PCR. To our knowledge, this is the first report on the genome-wide identification and characterization of UIs in a horticultural plant.
The results obtained from this study could provide evidence for further understanding the role of UIs in regulating gene expression.

Supplementary Materials

The following are available online at https://www.mdpi.com/1422-0067/21/9/3088/s1, Table S1: Information of the 5UIs and 3UIs identified in the sweet orange (Citrus sinensis) genome and the gene transcripts containing them. Table S2: Gene Ontology (GO) enrichment analysis result of 5UI-Ts. Table S3: Gene Ontology (GO) enrichment analysis result of 3UI-Ts. Table S4: KEGG pathway enrichment analysis result of 5UI-Ts. Table S5: KEGG pathway enrichment analysis result of 3UI-Ts. Table S6: PageMan pathway enrichment analysis of the 617 genes containing 5UI. Table S7: PageMan pathway enrichment analysis of the 469 genes containing 3UI. Table S8: PageMan pathway enrichment analysis of the 74 genes containing both 5UI and 3UI. Table S9: Information of the identified cis-acting elements in both 5UI sequences. Table S10: Information of the identified Cis-acting elements in both 3UI sequences. Table S11: Information of the 5UIs and 3UIs identified in the sweet orange (Citrus sinensis) genome and the gene transcripts containing them.

Author Contributions

C.C. designed the work; X.S., C.C., G.Z., and B.W. performed the experiments; X.S., C.C., R.A.M., and B.W. wrote the paper; C.C. and X.S. analyzed the data; J.W., N.T., J.L., F.L., J.C. (Jialan Chen), J.C. (Jingru Che), and Y.G. helped to prepare the plant materials. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Fujian Provincial Science and Technology Program (2017J01615), the Natural Science Funds for Distinguished Young Scholar of the Fujian Agriculture and Forestry University (xjq201721), the Fujian Provincial College Students’ Innovation and Entrepreneurship Project (201910389050), the Guangdong Provincial Science and Technology Programs (2019B030316005, 2018B020202009), and the Construction of Plateau Discipline of Fujian Province (102/71201801101).

Acknowledgments

The authors would like to thank Zhenhua Zhuang and Chengdu Life Baseline Company for their assistance in bioinformatics analysis.

Conflicts of Interest

The authors declare no conflict of interest.

Data Availability

All the data generated or analyzed during this study are included in this published article and its Supplemental data.

Abbreviations

CDS: Coding sequence
UTR: Untranslated region
UI: UTR intron
RT-PCR: Reverse transcription PCR
qRT-PCR: Quantitative real time PCR
EJC: Exon-junction complex
IME: Intron-mediated enhancement
NMD: Nonsense-mediated decay
NPC: Nuclear pore complex
UI-Ts: UI-containing gene transcripts
AS: Alternative splicing
SS: Splice sites

References

  1. Sambrook, J. Adenovirus amazes at Cold Spring Harbor. Nature 1977, 268, 101–104.
  2. Roy, S.W.; Gilbert, W. The evolution of spliceosomal introns: Patterns, puzzles and progress. Nat. Rev. Genet. 2006, 7, 211–221.
  3. Wahl, M.C.; Will, C.L.; Luhrmann, R. The spliceosome: Design principles of a dynamic RNP machine. Cell 2009, 136, 701–718.
  4. Chorev, M.; Carmel, L. The function of introns. Front. Genet. 2012, 3, 55.
  5. Le Hir, H.; Nott, A.; Moore, M.J. How introns influence and enhance eukaryotic gene expression. Trends Biochem. Sci. 2003, 28, 215–220.
  6. Moore, M.J.; Proudfoot, N.J. Pre-mRNA processing reaches back to transcription and ahead to translation. Cell 2009, 136, 688–700.
  7. Leon, P.; Planckaert, F.; Walbot, V. Transient Gene Expression in Protoplasts of Phaseolus vulgaris Isolated from a Cell Suspension Culture. Plant Physiol. 1991, 95, 968–972.
  8. Maas, C.; Laufs, J.; Grant, S.; Korfhage, C.; Werr, W. The combination of a novel stimulatory element in the first exon of the maize Shrunken-1 gene with the following intron 1 enhances reporter gene expression up to 1000-fold. Plant Mol. Biol. 1991, 16, 199–207.
  9. Mascarenhas, D.; Mettler, I.J.; Pierce, D.A.; Lowe, H.W. Intron-mediated enhancement of heterologous gene expression in maize. Plant Mol. Biol. 1990, 15, 913–920.
  10. Luehrsen, K.R.; Walbot, V. Intron enhancement of gene expression and the splicing efficiency of introns in maize cells. Mol. Gen. Genet. 1991, 225, 81–93.
  11. Akua, T.; Berezin, I.; Shaul, O. The leader intron of AtMHX can elicit, in the absence of splicing, low-level intron-mediated enhancement that depends on the internal intron sequence. BMC Plant Biol. 2010, 10, 93.
  12. Cao, Y.; Wang, Y.; Li, Y.; Yang, J.; Ma, L. The Arabidopsis AGAMOUS 5′-UTR represses downstream gene translation. Sci. China Tehnol. SC 2019, 62, 272–275.
  13. Bradnam, K.R.; Korf, I. Longer first introns are a general property of eukaryotic gene structure. PLoS ONE 2008, 3, e3093.
  14. Samadder, P.; Sivamani, E.; Lu, J.L.; Li, X.G.; Qu, R.D. Transcriptional and post-transcriptional enhancement of gene expression by the 5′ UTR intron of rice rubi3 gene in transgenic rice cells. Mol. Genet. Genom. 2008, 279, 429–439.
  15. Kamo, K.; Kim, A.Y.; Park, S.H.; Joung, Y.H. The 5′ UTR-intron of the Gladiolus polyubiquitin promoter GUBQ1 enhances translation efficiency in Gladiolus and Arabidopsis. BMC Plant Biol. 2012, 12, 79.
  16. Grant, T.N.L.; De La Torre, C.M.; Zhang, N.; Finer, J.J. Synthetic introns help identify sequences in the 5′ UTR intron of the Glycine max polyubiquitin (Gmubi) promoter that give increased promoter activity. Planta 2017, 245, 849–860.
  17. Laxa, M.; Muller, K.; Lange, N.; Doering, L.; Pruscha, J.T.; Peterhansel, C. The 5′UTR Intron of Arabidopsis GGT1 Aminotransferase Enhances Promoter Activity by Recruiting RNA Polymerase II. Plant Physiol. 2016, 172, 313–327.
  18. Gallegos, J.E.; Rose, A.B. An intron-derived motif strongly increases gene expression from transcribed sequences through a splicing independent mechanism in Arabidopsis thaliana. Sci. Rep. 2019, 9, 13777.
  19. Norris, S.R.; Meyer, S.E.; Callis, J. The intron of Arabidopsis thaliana polyubiquitin genes is conserved in location and is a quantitative determinant of chimeric gene expression. Plant Mol. Biol. 1993, 21, 895–906.
  20. Sivamani, E.; Qu, R. Expression enhancement of a rice polyubiquitin gene promoter. Plant Mol. Biol. 2006, 60, 225–239.
  21. Chaubet-Gigot, N.; Kapros, T.; Flenet, M.; Kahn, K.; Gigot, C.; Waterborg, J.H. Tissue-dependent enhancement of transgene expression by introns of replacement histone H3 genes of Arabidopsis. Plant Mol. Biol. 2001, 45, 17–30.
  22. Barrett, L.W.; Fletcher, S.; Wilton, S.D. Regulation of eukaryotic gene expression by the untranslated gene regions and other non-coding elements. Cell Mol. Life Sci. 2012, 69, 3613–3634.
  23. Bicknell, A.A.; Cenik, C.; Chua, H.N.; Roth, F.P.; Moore, M.J. Introns in UTRs: Why we should stop ignoring them. Bioessays 2012, 34, 1025–1034.
  24. Zhang, J.; Sun, X.; Qian, Y.; Maquat, L.E. Intron function in the nonsense-mediated decay of beta-globin mRNA: Indications that pre-mRNA splicing in the nucleus can influence mRNA translation in the cytoplasm. RNA 1998, 4, 801–815.
  25. Chang, Y.F.; Imam, J.S.; Wilkinson, M.F. The nonsense-mediated decay RNA surveillance pathway. Annu. Rev. Biochem. 2007, 76, 51–74.
  26. McIlwain, D.R.; Pan, Q.; Reilly, P.T.; Elia, A.J.; McCracken, S.; Wakeham, A.C.; Itie-Youten, A.; Blencowe, B.J.; Mak, T.W. Smg1 is required for embryogenesis and regulates diverse genes via alternative splicing coupled to nonsense-mediated mRNA decay. Proc. Natl. Acad. Sci. USA 2010, 107, 12186–12191.
  27. Saltzman, A.L.; Pan, Q.; Blencowe, B.J. Regulation of alternative splicing by the core spliceosomal machinery. Genes Dev. 2011, 25, 373–384.
  28. Cenik, C.; Derti, A.; Mellor, J.C.; Berriz, G.F.; Roth, F.P. Genome-wide functional analysis of human 5′ untranslated region introns. Genome Biol. 2010, 11, R29.
  29. Hong, X.; Scofield, D.G.; Lynch, M. Intron size, abundance, and distribution within untranslated regions of genes. Mol. Biol. Evol. 2006, 23, 2392–2404.
  30. Chung, B.Y.W.; Simons, C.; Firth, A.E.; Brown, C.M.; Hellens, R.P. Effect of 5′ UTR introns on gene expression in Arabidopsis thaliana. BMC Genom. 2006, 7, 120.
  31. Xu, Q.; Chen, L.L.; Ruan, X.A.; Chen, D.J.; Zhu, A.D.; Chen, C.L.; Bertrand, D.; Jiao, W.B.; Hao, B.H.; Lyon, M.P.; et al. The draft genome of sweet orange (Citrus sinensis). Nat. Genet. 2013, 45, U59–U92.
  32. Nagy, E.; Maquat, L.E. A rule for termination-codon position within intron-containing genes: When nonsense affects RNA abundance. Trends Biochem. Sci. 1998, 23, 198–199.
  33. Lejeune, F.; Maquat, L.E. Mechanistic links between nonsense-mediated mRNA decay and pre-mRNA splicing in mammalian cells. Curr. Opin. Cell Biol. 2005, 17, 309–315.
  34. Pesole, G.; Mignone, F.; Gissi, C.; Grillo, G.; Licciulli, F.; Liuni, S. Structural and functional features of eukaryotic mRNA untranslated regions. Gene 2001, 276, 73–81.
  35. Nott, A.; Meislin, S.H.; Moore, M.J. A quantitative analysis of intron effects on mammalian gene expression. RNA 2003, 9, 607–617.
  36. Wiegand, H.L.; Lu, S.; Cullen, B.R. Exon junction complexes mediate the enhancing effect of splicing on mRNA expression. Proc. Natl. Acad. Sci. USA 2003, 100, 11327–11332.
  37. Nott, A.; Le Hir, H.; Moore, M.J. Splicing enhances translation in mammalian cells: An additional function of the exon junction complex. Genes Dev. 2004, 18, 210–222.
  38. Singh, G.; Rebbapragada, I.; Lykke-Andersen, J. A competition between stimulators and antagonists of Upf complex recruitment governs human nonsense-mediated mRNA decay. PLoS Biol. 2008, 6, 860–871.
  39. Maquat, L.E. Nonsense-mediated mRNA decay: Splicing, translation and mRNP dynamics. Nat. Rev. Mol. Cell Biol. 2004, 5, 89–99.
  40. Brown, J.W.; Simpson, C.G.; Thow, G.; Clark, G.P.; Jennings, S.N.; Medina-Escobar, N.; Haupt, S.; Chapman, S.C.; Oparka, K.J. Splicing signals and factors in plant intron removal. Biochem. Soc. Trans. 2002, 30, 146–149.
  41. Ling, L.; Oltean, S. Modulators of alternative splicing as novel therapeutics in cancer. Int. J. Mol. Med. 2017, 40, S23.
  42. Crooks, G.E.; Hon, G.; Chandonia, J.M.; Brenner, S.E. WebLogo: A sequence logo generator. Genome Res. 2004, 14, 1188–1190.
  43. Burset, M.; Seledtsov, I.A.; Solovyev, V.V. Analysis of canonical and non-canonical splice sites in mammalian genomes. Nucleic Acids Res. 2000, 28, 4364–4375.
  44. Merendino, L.; Guth, S.; Bilbao, D.; Martinez, C.; Valcarcel, J. Inhibition of msl-2 splicing by Sex-lethal reveals interaction between U2AF35 and the 3′ splice site AG. Nature 1999, 402, 838–841.
  45. Forch, P.; Merendino, L.; Martinez, C.; Valcarcel, J. Modulation of msl-2 5′ splice site recognition by Sex-lethal. RNA 2001, 7, 1185–1191.
  46. Clancy, M.; Hannah, L.C. Splicing of the maize Sh1 first intron is essential for enhancement of gene expression, and a T-rich motif increases expression without affecting splicing. Plant Physiol. 2002, 130, 918–929.
  47. Araujo, P.R.; Yoon, K.; Ko, D.; Smith, A.D.; Qiao, M.; Suresh, U.; Burns, S.C.; Penalva, L.O. Before It Gets Started: Regulating Translation at the 5′ UTR. Comp. Funct. Genom. 2012, 2012, 475731.
  48. Cenik, C.; Chua, H.N.; Zhang, H.; Tarnawsky, S.P.; Akef, A.; Derti, A.; Tasan, M.; Moore, M.J.; Palazzo, A.F.; Roth, F.P. Genome analysis reveals interplay between 5′UTR introns and nuclear mRNA export for secretory and mitochondrial genes. PLoS Genet. 2011, 7, e1001366.
  49. Merrick, W.C.; Hershey, J.W. The pathway and mechanism of eukaryotic protein synthesis. Cold Spring Harb. Monogr. Arch. 1996, 30, 31–69.
  50. Siniossoglou, S.; Wimmer, C.; Rieger, M.; Doye, V.; Tekotte, H.; Weise, C.; Emig, S.; Segref, A.; Hurt, E.C. A novel complex of nucleoporins, which includes Sec13p and a Sec13p homolog, is essential for normal nuclear pores. Cell 1996, 84, 265–275.
  51. Mitchell, S.F.; Jain, S.; She, M.; Parker, R. Global analysis of yeast mRNPs. Nat. Struct. Mol. Biol. 2013, 20, 127–133.
  52. Bischoff, F.R.; Klebe, C.; Kretschmer, J.; Wittinghofer, A.; Ponstingl, H. RanGAP1 induces GTPase activity of nuclear Ras-related Ran. Proc. Natl. Acad. Sci. USA 1994, 91, 2587–2591.
  53. Dix, I.; Russell, C.; Yehuda, S.B.; Kupiec, M.; Beggs, J.D. The identification and characterization of a novel splicing protein, Isy1p, of Saccharomyces cerevisiae. RNA 1999, 5, 360–368.
  54. Will, C.L.; Urlaub, H.; Achsel, T.; Gentzel, M.; Wilm, M.; Luhrmann, R. Characterization of novel SF3b and 17S U2 snRNP proteins, including a human Prp5p homologue and an SF3b DEAD-box protein. EMBO J. 2002, 21, 4978–4988.
  55. Pillai, R.S.; Grimmler, M.; Meister, G.; Will, C.L.; Luhrmann, R.; Fischer, U.; Schumperli, D. Unique Sm core structure of U7 snRNPs: Assembly by a specialized SMN complex and the role of a new component, Lsm11, in histone RNA processing. Genes Dev. 2003, 17, 2321–2333.
  56. He, Y.; Smith, R. Nuclear functions of heterogeneous nuclear ribonucleoproteins A/B. Cell Mol. Life Sci. 2009, 66, 1239–1256.
  57. Agafonov, D.E.; Kastner, B.; Dybkov, O.; Hofele, R.V.; Liu, W.T.; Urlaub, H.; Luhrmann, R.; Stark, H. Molecular architecture of the human U4/U6.U5 tri-snRNP. Science 2016, 351, 1416–1420.
  58. Akua, T.; Shaul, O. The Arabidopsis thaliana MHX gene includes an intronic element that boosts translation when localized in a 5′ UTR intron. J. Exp. Bot. 2013, 64, 4255–4270.
  59. Manna, S. An overview of pentatricopeptide repeat proteins and their applications. Biochimie 2015, 113, 93–99.
  60. Small, I.D.; Rackham, O.; Filipovska, A. Organelle transcriptomes: Products of a deconstructed genome. Curr. Opin. Microbiol. 2013, 16, 652–658.
  61. Dahan, J.; Mireau, H. The Rf and Rf-like PPR in higher plants, a fast-evolving subclass of PPR genes. RNA Biol. 2013, 10, 1469–1476.
  62. Xing, H.; Fu, X.; Yang, C.; Tang, X.; Guo, L.; Li, C.; Xu, C.; Luo, K. Genome-wide investigation of pentatricopeptide repeat gene family in poplar and their expression analysis in response to biotic and abiotic stresses. Sci. Rep. 2018, 8, 2817.
  63. Riedl, S.J.; Li, W.; Chao, Y.; Schwarzenbacher, R.; Shi, Y. Structure of the apoptotic protease-activating factor 1 bound to ADP. Nature 2005, 434, 926–933.
  64. Lu, J.; Sivamani, E.; Azhakanandam, K.; Samadder, P.; Li, X.; Qu, R. Gene expression enhancement mediated by the 5′ UTR intron of the rice rubi3 gene varied remarkably among tissues in transgenic rice plants. Mol. Genet. Genom. 2008, 279, 563–572.
  65. Ibraheem, O.; Botha, C.E.; Bradley, G. In silico analysis of cis-acting regulatory elements in 5′ regulatory regions of sucrose transporter gene families in rice (Oryza sativa Japonica) and Arabidopsis thaliana. Comput. Biol. Chem. 2010, 34, 268–283.
  66. Rose, A.B.; Emami, S.; Bradnam, K.; Korf, I. Evidence for a DNA-Based Mechanism of Intron-Mediated Enhancement. Front. Plant Sci. 2011, 2, 98.
  67. Ji, Z.; Lee, J.Y.; Pan, Z.; Jiang, B.; Tian, B. Progressive lengthening of 3′ untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development. Proc. Natl. Acad. Sci. USA 2009, 106, 7028–7033.
  68. Lewis, B.P.; Green, R.E.; Brenner, S.E. Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans. Proc. Natl. Acad. Sci. USA 2003, 100, 189–192.
  69. Ni, J.Z.; Grate, L.; Donohue, J.P.; Preston, C.; Nobida, N.; O’Brien, G.; Shiue, L.; Clark, T.A.; Blume, J.E.; Ares, M., Jr. Ultraconserved elements are associated with homeostatic control of splicing regulators by alternative splicing and nonsense-mediated decay. Genes Dev. 2007, 21, 708–718.
  70. Bruno, I.G.; Karam, R.; Huang, L.; Bhardwaj, A.; Lou, C.H.; Shum, E.Y.; Song, H.W.; Corbett, M.A.; Gifford, W.D.; Gecz, J.; et al. Identification of a microRNA that activates gene expression by repressing nonsense-mediated RNA decay. Mol. Cell 2011, 42, 500–510.
  71. Hoshida, H.; Kondo, M.; Kobayashi, T.; Yarimizu, T.; Akada, R. 5′-UTR introns enhance protein expression in the yeast Saccharomyces cerevisiae. Appl. Microbiol. Biotechnol. 2017, 101, 241–251.
  72. Bianchi, M.; Crinelli, R.; Giacomini, E.; Carloni, E.; Magnani, M. A potent enhancer element in the 5′-UTR intron is crucial for transcriptional regulation of the human ubiquitin C gene. Gene 2009, 448, 88–101.
  73. Beaulieu, E.; Green, L.; Elsby, L.; Alourfi, Z.; Morand, E.F.; Ray, D.W.; Donn, R. Identification of a novel cell type-specific intronic enhancer of macrophage migration inhibitory factor (MIF) and its regulation by mithramycin. Clin. Exp. Immunol. 2011, 163, 178–188.
  74. Rearick, D.; Prakash, A.; McSweeny, A.; Shepard, S.S.; Fedorova, L.; Fedorov, A. Critical association of ncRNA with introns. Nucleic Acids Res. 2011, 39, 2357–2366.
  75. Wu, G.A.; Prochnik, S.; Jenkins, J.; Salse, J.; Hellsten, U.; Murat, F.; Perrier, X.; Ruiz, M.; Scalabrin, S.; Terol, J.; et al. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication. Nat. Biotechnol. 2014, 32, 656.
  76. Thimm, O.; Blasing, O.; Gibon, Y.; Nagel, A.; Meyer, S.; Kruger, P.; Selbig, J.; Muller, L.A.; Rhee, S.Y.; Stitt, M. MAPMAN: A user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J. 2004, 37, 914–939.
  77. Livak, K.J.; Schmittgen, T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 2001, 25, 402–408.
  78. Cheng, C.Z.; Yang, J.W.; Yan, H.B.; Bei, X.J.; Zhang, Y.Y.; Lu, Z.M.; Zhong, G.Y. Expressing p20 hairpin RNA of Citrus tristeza virus confers Citrus aurantium with tolerance/resistance against stem pitting and seedling yellow CTV strains. J. Integr. Agr. 2015, 14, 1767–1777.
  79. Simpson, C.G.; Jennings, S.N.; Clark, G.P.; Thow, G.; Brown, J.W. Dual functionality of a plant U-rich intronic sequence element. Plant J. 2004, 37, 82–91.
  80. Sureau, A.; Gattoni, R.; Dooghe, Y.; Stevenin, J.; Soret, J. SC35 autoregulates its expression by promoting splicing events that destabilize its mRNAs. EMBO J. 2001, 20, 1785–1796.
  81. Lareau, L.F.; Inada, M.; Green, R.E.; Wengrod, J.C.; Brenner, S.E. Unproductive splicing of SR genes associated with highly conserved and ultraconserved DNA elements. Nature 2007, 446, 926–929.
  82. Martin, G.B. Functional analysis of plant disease resistance genes and their downstream effectors. Curr. Opin. Plant Biol. 1999, 2, 273–279.
  83. Innes, R.W. Genetic dissection of R gene signal transduction pathways. Curr. Opin. Plant Biol. 1998, 1, 299–304.
Figure 1. Chromosome localization results of gene transcripts containing 5′UTR intron (5UI) (A) and 3′UTR intron (3UI) (B) in Citrus sinensis. Chr: chromosome, cM: centiMorgan.
Figure 2. Length distributions of 5′UTR, 3′UTR and CDS introns (A) and position distribution of 5′UTR introns (B) and 3′UTR introns (C) relative to the beginning and end of the associated UTRs. For Figure 2A, the horizontal axis represents the size of the intron, and the vertical axis represents the proportion of the introns of different sizes. For Figure 2B,C, (a) Blue bars represent the observed positions of 5′UTR introns relative to the beginning of the 5′UTR and of 3′UTR introns relative to the beginning of the 3′UTR (the stop codon-proximal end). Light blue bars represent the observed positions of 5UIs relative to the end of the 5′UTR (i.e., the start codon ATG) and of 3UIs relative to the end of the 3′UTR. (b) Sierra of UIs relative to the end of the UTR and the end of the UTR with color diversity.
Figure 3. Sequence logos showing the nucleotide bias around the donor site (A) and acceptor site (B) of 5′UTR, CDS, and 3′UTR introns. The x-axis refers to the base at the beginning of the intron, and the letter height reflects the nucleotide bias at each position. Only 5 exonic nucleotides and 25 intronic nucleotides at the donor site and only 2 exonic nucleotides and 25 intronic nucleotides at the acceptor site are included in the sequence logos, because the nucleotide usage outside of these regions is not significantly different from the background level.
Figure 4. Electrophoresis detection results of the UTR introns (UIs) in untranslated regions (UTRs) of the eight selected genes. 1–4 represent the PCR products obtained using leaf genomic DNA (gDNA), root complementary DNA (cDNA), leaf cDNA, and stem cDNA as template, respectively. M: DL5000 Marker. The UTR structure is available on the GSDS online website (http://gsds.cbi.pku.edu.cn/), with blue for exons and black for introns in UTR. PPR: Pentatricopeptide repeat superfamily protein (Cs6g01290.1), VPS28: Vacuolar protein sorting-associated 28 (Cs2g06750.1), EIN3: Ethylene-insensitive 3 (Cs2g29100.1), DUF247: domain of unknown function 247 gene (Cs2g24990.1), GRAS: GRAS transcription factors (Cs8g18700.1), TPR: Tetratricopeptide repeat-like superfamily protein (Cs8g15200.1), R: Disease resistance protein (Cs5g21990.1) and LTP: Lipid transfer protein (Cs5g09070.2).
Figure 5. Relative expression of genes and their UTR introns (UIs) in Citrus sinensis root, leaf, and stem. * and ** indicate a significant difference (p < 0.05) and a highly significant difference (p < 0.01), respectively, compared with root. 5UI (intron in the 5′UTR), 3UI (intron in the 3′UTR), and CDS (coding sequence) denote the relative expression levels of the genes containing these structures. PPR: Pentatricopeptide repeat superfamily protein (Cs6g01290.1), VPS28: Vacuolar protein sorting-associated 28 (Cs2g06750.1), EIN3: Ethylene-insensitive 3 (Cs2g29100.1), DUF247: domain of unknown function 247 gene (Cs2g24990.1), GRAS: GRAS transcription factor (Cs8g18700.1), TPR: Tetratricopeptide repeat-like superfamily protein (Cs8g15200.1), R: Disease resistance protein (Cs5g21990.1), and LTP: Lipid transfer protein (Cs5g09070.2).
Table 1. Statistics of 5′ untranslated regions (5′UTRs), coding sequences (CDSs), and 3′UTRs in Citrus sinensis.
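| Region | Number of Sequences | Sequences with Introns | Total Bases (Genomic) | Intron/Sequence | Number of Introns/Nucleotide (mRNA) |
|---|---|---|---|---|---|
| 5′UTR | 16,916 | 617 | 6.8 × 10^6 | 0.06 | 1.42 × 10^−4 |
| CDS | 23,394 | 17,897 | 3.8 × 10^7 | 4.81 | 2.98 × 10^−3 |
| 3′UTR | 17,408 | 469 | 1.2 × 10^7 | 0.04 | 6.36 × 10^−5 |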
Table 2. Number of UTR introns (UIs) in 5UI- and 3UI-containing gene transcripts (abbreviated as 5UI-Ts and 3UI-Ts, respectively). 1 UI to 6 UIs indicate that 1 to 6 UIs are present in the UTR. In total, 825 gene transcripts contain 5UIs and 570 gene transcripts contain 3UIs. NS: not shown.
| UI Number | 5UI-Ts Number/Percentage (Gene ID) | 3UI-Ts Number/Percentage (Gene ID) |
|---|---|---|
| 1 UI | 713/86.43% (NS) | 448/78.60% (NS) |
| 2 UIs | 90/10.91% (NS) | 90/15.79% (NS) |
| 3 UIs | 18/2.18% (Cs1g06160.1, Cs2g16080.1, Cs2g03700.1, Cs2g01050.1, Cs3g12240.1, Cs3g19040.1, Cs4g10860.1, Cs5g07860.1, Cs6g06255.1, Cs7g12410.1, Cs9g09475.1, orange1.1t05830.1, orange1.1t01413.1, orange1.1t02679.1, orange1.1t02923.1, orange1.1t04234.1, orange1.1t05909.1, orange1.1t06043.1) | 18/3.16% (Cs1g09404.1, Cs1g10030.1, Cs3g02530.1, Cs3g21100.1, Cs3g25390.1, Cs4g03945.1, Cs6g08820.1, Cs7g04620.1, Cs7g09600.1, Cs8g11445.1, Cs8g11845.1, Cs8g15200.1, orange1.1t02481.1, orange1.1t05875.1, orange1.1t03487.1, orange1.1t06018.1, orange1.1t05916.1, orange1.1t05924.1) |
| 4 UIs | 2/0.24% (Cs2g17870.1, Cs5g25765.1) | 8/1.40% (Cs2g08495.1, Cs3g10260.1, Cs3g12715.1, Cs5g26550.1, Cs5g28645.2, Cs7g15390.1, orange1.1t05956.1, orange1.1t03536.1) |
| 5 UIs | 2/0.24% (orange1.1t06039.1, Cs7g06380.1) | 4/0.70% (Cs2g11780.1, orange1.1t01460.1, orange1.1t03486.1, Cs1g16990.1) |
| 6 UIs | 0 | 2/0.35% (Cs1g15550.1, Cs1g17000.1) |
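The percentages in Table 2 appear to be each transcript count divided by the total number of 5UI-Ts (825) or 3UI-Ts (570). A minimal Python sketch of that calculation follows; the variable and function names are illustrative and do not come from the paper, and the counts are copied from the table.

```python
# Transcript counts per number of UIs, copied from Table 2.
counts_5ui = {1: 713, 2: 90, 3: 18, 4: 2, 5: 2, 6: 0}
counts_3ui = {1: 448, 2: 90, 3: 18, 4: 8, 5: 4, 6: 2}


def shares(counts):
    """Return the total count and each class's share in percent (2 decimals)."""
    total = sum(counts.values())
    return total, {k: round(100 * v / total, 2) for k, v in counts.items()}


total_5, pct_5 = shares(counts_5ui)
total_3, pct_3 = shares(counts_3ui)

print(total_5, pct_5)
print(total_3, pct_3)
```

The totals reproduce the 825 and 570 transcripts stated in the caption, and the computed shares should agree with the tabulated percentages to within rounding.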
Table 3. Distribution of UTR introns (UIs) and UI-containing transcripts (UI-Ts) across the Citrus sinensis chromosomes. 5UI-T: gene transcript containing a 5UI; 3UI-T: gene transcript containing a 3UI. Chr: chromosome.
| Chr No. | 5UI Numbers/Percentage | 5UI Density | 3UI Numbers/Percentage | 3UI Density | 5UI-T Number/Percentage | 5UI-T Density | 3UI-T Number/Percentage | 3UI-T Density |
|---|---|---|---|---|---|---|---|---|
| chr1 | 81 (8.39%) | 2.81 × 10^−6 | 103 (13.83%) | 3.58 × 10^−6 | 70 (8.49%) | 2.43 × 10^−6 | 72 (12.6%) | 2.5 × 10^−6 |
| chr2 | 162 (16.79%) | 5.26 × 10^−6 | 65 (8.73%) | 2.11 × 10^−6 | 137 (16.61%) | 4.45 × 10^−6 | 50 (8.77%) | 1.62 × 10^−6 |
| chr3 | 87 (9.02%) | 3.03 × 10^−6 | 83 (11.14%) | 2.89 × 10^−6 | 73 (8.85%) | 2.54 × 10^−6 | 61 (10.7%) | 2.13 × 10^−6 |
| chr4 | 75 (7.77%) | 3.75 × 10^−6 | 60 (8.05%) | 3.00 × 10^−6 | 66 (8.00%) | 3.3 × 10^−6 | 48 (8.42%) | 2.40 × 10^−6 |
| chr5 | 100 (10.36%) | 2.76 × 10^−6 | 85 (11.41%) | 2.35 × 10^−6 | 84 (10.18%) | 2.32 × 10^−6 | 71 (12.46%) | 1.96 × 10^−6 |
| chr6 | 96 (9.95%) | 4.53 × 10^−6 | 39 (5.24%) | 1.84 × 10^−6 | 85 (10.30%) | 4.01 × 10^−6 | 33 (5.79%) | 1.56 × 10^−6 |
| chr7 | 97 (10.05%) | 3.01 × 10^−6 | 78 (10.47%) | 2.42 × 10^−6 | 85 (10.30%) | 2.64 × 10^−6 | 62 (10.88%) | 1.93 × 10^−6 |
| chr8 | 54 (5.59%) | 2.38 × 10^−6 | 57 (7.65%) | 2.51 × 10^−6 | 48 (5.82%) | 2.11 × 10^−6 | 43 (7.54%) | 1.89 × 10^−6 |
| chr9 | 48 (4.97%) | 2.59 × 10^−6 | 27 (3.62%) | 1.46 × 10^−6 | 43 (5.21%) | 2.32 × 10^−6 | 23 (4.04%) | 1.24 × 10^−6 |
| chrUn | 165 (17.01%) | - | 148 (19.86%) | - | 134 (16.24%) | - | 107 (18.77%) | - |
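As a consistency check, the per-chromosome UI counts in Table 3 can be summed and compared with the "Intron/Sequence" column of Table 1. The Python sketch below assumes that "Intron/Sequence" means total introns divided by the number of UTR sequences; that definition is an assumption, not something the tables state, and the variable names are illustrative.

```python
# Per-chromosome UI counts copied from Table 3 (chr1..chr9, then chrUn).
ui5 = [81, 162, 87, 75, 100, 96, 97, 54, 48, 165]
ui3 = [103, 65, 83, 60, 85, 39, 78, 57, 27, 148]

# Number of 5'UTR and 3'UTR sequences copied from Table 1.
n_5utr, n_3utr = 16916, 17408

total_5ui = sum(ui5)
total_3ui = sum(ui3)

# Assumption: "Intron/Sequence" = total introns / number of UTR sequences.
per_seq_5 = round(total_5ui / n_5utr, 2)
per_seq_3 = round(total_3ui / n_3utr, 2)

# Share of chr2 among all 5UIs, for comparison with the 16.79% in Table 3.
chr2_share_5ui = round(100 * ui5[1] / total_5ui, 2)

print(total_5ui, total_3ui, per_seq_5, per_seq_3, chr2_share_5ui)
```

Under this assumption the sketch gives 965 5UIs and 745 3UIs in total, and the resulting ratios round to the 0.06 and 0.04 reported in Table 1.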
Table 4. Number and length of 5′UTR introns (5UIs) and 3′UTR introns (3UIs) in UI-containing pentatricopeptide repeat-containing protein (PPRP) and disease resistance (R) genes.
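| Gene Family | Gene ID | 3UI Number and Length (bp) | 5UI Number and Length (bp) |
|---|---|---|---|
| PPR | Cs4g02090.2 | 1 (150) | - |
| PPR | Cs4g03660.1 | 1 (119) | - |
| PPR | Cs4g07420.1 | 1 (1165) | - |
| PPR | Cs4g13530.1 | 2 (366, 98) | - |
| PPR | Cs4g13560.1 | 2 (710, 98) | - |
| PPR | Cs4g20340.4 | 1 (334) | - |
| PPR | Cs4g20340.1 | - | 1 (576) |
| PPR | Cs4g20340.2 | - | 1 (143) |
| PPR | Cs2g05520.1 | 1 (754) | - |
| PPR | Cs2g07840.2 | 1 (486) | - |
| PPR | Cs2g09470.2 | 1 (1044) | - |
| PPR | Cs2g11780.1 | 5 (614, 1389, 178, 567, 78) | - |
| PPR | Cs2g13460.1 | 1 (93) | - |
| PPR | Cs2g19190.1 | 1 (93) | - |
| PPR | Cs2g19710.1 | 2 (113, 442) | - |
| PPR | Cs2g27580.1 | 1 (1233) | - |
| PPR | Cs5g03910.1 | 1 (108) | 1 (147) |
| PPR | Cs5g04860.1 | 1 (422) | - |
| PPR | Cs5g08440.2 | 1 (670) | - |
| PPR | Cs5g17240.1 | 1 (98) | - |
| PPR | Cs5g26200.2 | 1 (771) | - |
| PPR | Cs5g26550.1 | 4 (314, 152, 720, 78) | - |
| PPR | Cs5g34090.1 | 1 (513) | 1 (114) |
| PPR | Cs7g04230.1 | 1 (212) | - |
| PPR | Cs7g04980.1 | 1 (234) | - |
| PPR | Cs7g09600.1 | 3 (781, 330, 112) | - |
| PPR | Cs7g10230.1 | 2 (278, 91) | 1 (490) |
| PPR | Cs7g13700.2 | 1 (949) | - |
| PPR | Cs7g15390.1 | 4 (661, 120, 811, 136) | - |
| PPR | Cs3g02530.2 | 1 (707) | - |
| PPR | Cs3g09780.2 | 1 (103) | - |
| PPR | Cs3g10260.1 | 4 (161, 862, 194, 186) | - |
| PPR | Cs3g11640.1 | 2 (1094, 97) | - |
| PPR | Cs3g19210.1 | 2 (334, 140) | - |
| PPR | Cs3g20090.1 | 2 (506, 107) | 1 (666) |
| PPR | Cs3g20090.2 | - | 1 (462) |
| PPR | Cs3g20480.1 | 2 (220, 166) | - |
| PPR | Cs3g24370.1 | 2 (104, 668) | - |
| PPR | Cs3g25390.1 | 3 (109, 158, 105) | - |
| PPR | Cs6g01290.1 | 2 (280, 132) | 1 (93) |
| PPR | Cs6g07760.1 | 2 (102, 211) | - |
| PPR | Cs6g08820.2 | 1 (107) | - |
| PPR | Cs6g11340.2 | 1 (270) | - |
| PPR | Cs6g11530.1 | 1 (909) | - |
| PPR | Cs6g11910.2 | 1 (388) | - |
| PPR | Cs1g10030.4 | 1 (194) | - |
| PPR | Cs1g10310.2 | 1 (1699) | - |
| PPR | Cs1g12770.2 | 1 (997) | - |
| PPR | Cs1g12780.1 | 2 (89, 306) | - |
| PPR | Cs1g24360.1 | 1 (1265) | - |
| PPR | Cs1g26320.1 | 1 (531) | - |
| PPR | Cs8g15200.1 | 3 (158, 1006, 208) | - |
| PPR | Cs8g18540.1 | 1 (134) | - |
| PPR | Cs9g01900.1 | 1 (99) | - |
| PPR | Cs9g03060.1 | 1 (468) | - |
| PPR | Cs9g17260.1 | 1 (1131) | - |
| PPR | orange1.1t00940.1 | 2 (725, 181) | - |
| PPR | orange1.1t01460.1 | 5 (431, 636, 473, 134, 97) | 1 (268) |
| PPR | orange1.1t01541.1 | 2 (554, 509) | - |
| PPR | orange1.1t04277.2 | 1 (366) | - |
| PPR | orange1.1t04409.1 | 2 (343, 301) | - |
| PPR | Cs4g03945.1 | 3 (633, 89, 99) | - |
| PPR | Cs4g11335.1 | 1 (295) | - |
| PPR | Cs9g14456.1 | 1 (91) | - |
| R | Cs4g07730.1 | 2 (92, 142) | - |
| R | Cs4g07730.2 | 2 (87, 138) | - |
| R | Cs4g10830.1 | 1 (363) | 1 (93) |
| R | Cs2g19600.2 | 1 (238) | - |
| R | Cs2g30590.1 | 2 (138, 605) | - |
| R | Cs5g20470.1 | 1 (89) | - |
| R | Cs5g21990.1 | 1 (131) | - |
| R | Cs5g22710.1 | 1 (140) | - |
| R | Cs5g28770.1 | 2 (239, 685) | - |
| R | Cs5g29510.1 | 1 (145) | - |
| R | Cs7g02220.1 | 1 (82) | - |
| R | Cs3g13340.1 | 2 (187, 175) | - |
| R | Cs3g13390.1 | 2 (180, 130) | - |
| R | Cs1g06720.1 | 1 (163) | - |
| R | Cs1g08080.2 | 1 (99) | - |
| R | Cs1g11430.1 | 2 (291, 1488) | - |
| R | Cs1g12140.1 | 1 (482) | - |
| R | Cs1g14030.1 | 1 (171) | - |
| R | Cs1g14090.1 | 1 (327) | - |
| R | Cs1g14120.1 | 1 (400) | - |
| R | Cs1g15550.1 | 6 (408, 159, 163, 293, 120, 101) | - |
| R | Cs1g16990.1 | 5 (174, 104, 82, 71, 95) | - |
| R | Cs1g17000.1 | 6 (332, 96, 357, 268, 161, 144) | - |
| R | Cs1g18380.2 | 1 (663) | - |
| R | Cs9g18740.1 | 1 (178) | - |
| R | orange1.1t01926.1 | 1 (163) | - |
| R | orange1.1t02481.1 | 3 (137, 210, 362) | - |
| R | orange1.1t02498.1 | 1 (290) | - |
| R | orange1.1t02751.1 | 1 (401) | 1 (347) |
| R | orange1.1t02917.1 | 1 (138) | - |
| R | orange1.1t02924.1 | 1 (140) | - |
| R | orange1.1t03486.1 | 5 (169, 195, 93, 377, 127) | - |
| R | orange1.1t03487.3 | 1 (571) | - |
| R | orange1.1t03742.2 | 1 (140) | - |
| R | orange1.1t04592.1 | 1 (238) | - |
| R | Cs2g30865.1 | 1 (303) | - |
| R | Cs1g09404.1 | 3 (133, 117, 473) | - |
| R | orange1.1t05891.1 | 1 (4655) | - |
| R | Cs4g17710.1 | 1 (712) | - |
| R | Cs4g08050.1 | 1 (282) | - |
| R | Cs4g08050.2 | 1 (274) | - |
| R | Cs4g08110.2 | 1 (277) | - |
| R | Cs6g19070.1 | 1 (751) | - |
| R | Cs1g14090.2 | 1 (321) | - |
| R | Cs1g14090.3 | 1 (321) | - |
| R | orange1.1t03332.1 | 1 (133) | - |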

Shi, X.; Wu, J.; Mensah, R.A.; Tian, N.; Liu, J.; Liu, F.; Chen, J.; Che, J.; Guo, Y.; Wu, B.; et al. Genome-Wide Identification and Characterization of UTR-Introns of Citrus sinensis. Int. J. Mol. Sci. 2020, 21, 3088. https://doi.org/10.3390/ijms21093088
