Next Article in Journal
Blue Biotechnology: Marine Bacteria Bioproducts
Next Article in Special Issue
Targeted Integration of siRNA against Porcine Cytomegalovirus (PCMV) Enhances the Resistance of Porcine Cells to PCMV
Previous Article in Journal
Genomic and Phenotypic Analysis of Salmonella enterica Bacteriophages Identifies Two Novel Phage Species
Previous Article in Special Issue
Who Is the Intermediate Host of RNA Viruses? A Study Focusing on SARS-CoV-2 and Poliovirus
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comprehensive Genomics Investigation of Neboviruses Reveals Distinct Codon Usage Patterns and Host Specificity

1
Biotechnology Research Center, Technology Innovation Institute, Masdar City, Abu Dhabi P.O. Box 9639, United Arab Emirates
2
Diagnostics and Vaccines Group, ICAR—National Institute of High Security Animal Diseases, Bhopal 462021, Madhya Pradesh, India
3
Maximum Containment Facility, ICMR—National Institute of Virology, Pune 411001, Maharashtra, India
4
Department of Animal Sciences, Washington State University, Pullman, WA 99163, USA
5
Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee 247667, Uttarakhand, India
6
College of Animal Biotechnology, Guru Angad Dev Veterinary and Animal Science University, Ludhiana 141004, Punjab, India
*
Authors to whom correspondence should be addressed.
Microorganisms 2024, 12(4), 696; https://doi.org/10.3390/microorganisms12040696
Submission received: 6 March 2024 / Revised: 24 March 2024 / Accepted: 26 March 2024 / Published: 29 March 2024
(This article belongs to the Special Issue Diversity and Pathogenesis of Common Human and Animal Viruses)

Abstract

:
Neboviruses (NeVs) from the Caliciviridae family have been linked to enteric diseases in bovines and have been detected worldwide. As viruses rely entirely on the cellular machinery of the host for replication, their ability to thrive in a specific host is greatly impacted by the specific codon usage preferences. Here, we systematically analyzed the codon usage bias in NeVs to explore the genetic and evolutionary patterns. Relative Synonymous Codon Usage and Effective Number of Codon analyses indicated a marginally lower codon usage bias in NeVs, predominantly influenced by the nucleotide compositional constraints. Nonetheless, NeVs showed a higher codon usage bias for codons containing G/C at the third codon position. The neutrality plot analysis revealed natural selection as the primary factor that shaped the codon usage bias in both the VP1 (82%) and VP2 (57%) genes of NeVs. Furthermore, the NeVs showed a highly comparable codon usage pattern to bovines, as reflected through Codon Adaptation Index and Relative Codon Deoptimization Index analyses. Notably, yak NeVs showed considerably different nucleotide compositional constraints and mutational pressure compared to bovine NeVs, which appear to be predominantly host-driven. This study sheds light on the genetic mechanism driving NeVs’ adaptability, evolution, and fitness to their host species.

1. Introduction

In the intricate world of virology, understanding the genetic- and protein-level makeup of viruses is crucial for unraveling their evolutionary patterns, host adaptation behavior, transmission potential, and impact on animal as well as human health [1,2,3]. Among the diverse array of enteric viruses that affect animal populations, Caliciviruses emerge as intriguing viruses that have piqued the interest of scientists and veterinarians alike [4,5,6,7,8]. With their potential to cause gastrointestinal disturbances in animals, Caliciviruses have surfaced as a significant concern for both animal health and the agriculture industry [9,10,11,12,13].
The members of the Caliciviridae family are highly diverse in genetic makeup and the family has been classified into eleven genera, viz. Lagovirus, Vesivirus, Nebovirus, Sapovirus, Norovirus, Bavovirus, Minovirus, Nacovirus, Recovirus, Salovirus, and Valivirus [10,12,13,14,15]. The members of the family are responsible for causing a spectrum of diseases in various animals and humans, such as respiratory diseases, hemorrhagic diseases, and gastroenteritis [16,17,18,19,20,21]. Nonetheless, among the enteric viruses, Neboviruses (NeVs), which primarily cause calf diarrhea, have created more interest among the research community because of their huge genetic diversity and widespread geographical distribution. Till now, NeVs have been identified in more than 14 countries [4,9,14,15,22,23,24,25,26,27]. Furthermore, our previous study confirmed the presence of NeVs in the Indian bovine population and whole-genome characterization revealed a highly divergent NeV strain [27]. Despite the huge genetic diversity and widespread geographical distribution of NeVs, a detailed dissection of factors governing the NeVs’ evolution, particularly host-driven evolutionary pressures, as well as their codon usage preferences, has not been performed so far.
Codon optimization is a molecular biology technique used to enhance the expression of a gene in a particular host organism by altering the nucleotide sequence of the gene without changing the encoded amino acid sequence [2,3,8]. Viruses rely on the host cell’s machinery to replicate and produce viral proteins. Codon optimization aims to adapt viral genes to the codon usage patterns of the host organism, thereby facilitating efficient translation and protein production. Codon optimization involves replacing rare codons in the viral genome with synonymous codons that are more frequently used by the host organism to ensure efficient translation of the viral RNA in the host cell [2,5,8,20]. By adapting the viral genome to the host’s codon usage preferences, codon optimization can significantly increase the expression levels of viral proteins within host cells. Further, viruses with codon-optimized genomes are better equipped to exploit host cellular machinery for replication and propagation, enhancing fitness for more efficient viral spread and pathogenesis. Codon optimization can facilitate the adaptation of viruses to new host species. Overall, codon optimization is a powerful tool in viral research, enabling scientists to tailor viral genomes for efficient replication in specific host organisms, thereby advancing our understanding of viral pathogenesis and aiding the development of vaccines and antiviral therapies [2,8,16,20,22].
In fact, viruses undergo evolutionary pressures driven by their dependence on the host’s cellular machinery for survival. Consequently, they exhibit evolutionary signatures reflective of the host’s microcellular environment [3,28,29]. These distinctive evolutionary imprints contribute to the genetic and biochemical adaptation of viruses, enabling them to effectively circulate within their hosts, particularly in clinical and reservoir hosts [29]. Moreover, these viral evolutionary signatures have been shown to impact various biological processes in their hosts [2,3,30]. Investigating and understanding these signatures is crucial for gaining insights into the evolutionary mechanisms of synonymous codon usage and host-adapted evolution.
Therefore, in this study, we employed a panel of genetic tools, such as the Effective Number of Codons (ENC), neutrality and PR2-bias plots, Relative Codon Deoptimization Index (RCDI), Relative Synonymous Codon Usage (RSCU), and Codon Adaptation Index (CAI) to systematically analyze evolutionary signatures governing the host adaptation in highly divergent NeVs. We also looked for these signatures across the different proteins encoded by NeVs.

2. Materials and Methods

2.1. Whole-Genome NeV Dataset and Phylogenetic Analysis

The complete genomes of all the NeVs (n = 16) available in the National Center for Biotechnology Information (NCBI) were downloaded, the list of which is summarized in Table 1. Additional metadata related to the assembly of the complete genomes accounted for in Table 1 are provided in Supplementary Table S1. The representative sequences for each of the eleven genera within the Caliciviridae family were also extracted from the NCBI and then aligned using MAFFT v.7.475. [31]. The LG-Gamma substitution model, incorporating empirical amino acid frequencies (LG + G + F), was identified as the most suitable amino acid substitution model for the dataset, determined by the Bayesian Information Criterion (BIC). Subsequently, phylogenetic trees were constructed using Maximum Likelihood (ML) inference and ultrafast bootstrap with 1000 replicates, employing IQ tree v2.1.2 [32]. The multiple sequence alignment of the VP1 and VP2 protein sequences was further used for identifying conserved and non-conserved regions. The nucleotide sequences corresponding to conserved and non-conserved amino acid residues were extracted from the genomic sequences of the different NeVs. These nucleotide sequences were used for performing codon usage bias in conserved and non-conserved regions in NeVs.

2.2. Nucleotide Composition Analysis

The VP1 and VP2 genes of NeVs were analyzed in terms of their nucleotide composition, total GC content, GC content at the first, second, and third codon positions (G1, G2, G3), and third synonymous codon position (T3s, C3s, A3s, and G3s) using the standalone version of codonW [33] and the utilities in EMBOSS [34]. The potential compositional bias in the VP1 and VP2 genes of NeVs was investigated and compared between other NeVs.

2.3. Effective Number of Codons (ENC) and ENC-GC3 Plot Analysis

The Effective Number of Codons (ENC) is a simplistic estimation of codon usage that varies from 20 to 61. An ENC value of 20 suggests the usage of only single codons for each amino acid, while a value of 61 indicates the equal contribution of all synonymous codons in coding corresponding amino acids [35,36,37]. A lower value of ENC (i.e., ENC < 40) specifies a strong codon usage bias in viruses [3,28].
A comparison of observed (oENC) and expected (eENC) ENC distribution at different GC3 values helps in estimating the role of selection pressure in shaping codon usage [35,36,37]. The expected ENC (eENC) was calculated using Equation (1).
e E N C = 2 + S + 29 2 S 2 2 S + 1
The values for eENC were calculated at different GC3 contents (S), and the values can vary from 0 to 1. The observed ENC (oENC) was calculated by implementing the standalone version of codonW [33] with the help of simple shell scripts.

2.4. Neutrality Plot (Neutral Evolution) and PR2-Bias Plot

In absence of any external pressure, mutations at the first, second, or third codon positions are expected to be equally likely. However, mutations at the first and second codon positions may lead to changes in amino acids (non-synonymous mutations), while third-codon-position mutations mostly result in the same amino acids (synonymous mutations). The neutrality plot analysis was implemented to examine the mutation–selection equilibrium in determining codon usage bias [38,39,40,41,42,43,44]. The average GC content at the first and second positions (GC12) was plotted against the average GC content at the third codon position (GC3) to compare the impacts of natural selection and mutation pressure on codon usage of protein-coding sequences. The required values for neutrality plot analysis were either already calculated in Section 2.3 or derived from these values using simple shell scripts. The values of GC12 and GC3 for the VP1 and VP2 genes of different NeVs were plotted to approximate the neutral evolution of the genes, where the slope of the regression line represents the magnitude of mutation pressure (slope = 0 suggests no effect of mutational pressure) and natural selection (slope = 1 suggests complete neutrality), while the regression coefficient against GC3 represents the mutation–selection equilibrium coefficient [45,46,47,48].
It has been demonstrated that the AT and GC appeared in pairs at the third codon positions under the influence of natural selection in codon usage [43,48]. To investigate this further, the GC bias (G3/(G3 + C3)) and AT bias (A3/(A3 + T3)) were calculated for the VP1 and VP2 genes of the NeVs. The values for G3, C3, A3, and T3 were calculated in Section 2.3. An unequal distribution of GC and AT compositions in protein-coding genes suggests the role of mutation pressure in codon usage, which can be further analyzed by scrutinizing the directionality of GC and AT bias [39,42,49,50,51,52]. Therefore, we employed Parity Rule 2 (PR2)-bias plots to investigate the GC and AT bias in the VP1 and VP2 genes of NeVs.

2.5. Relative Synonymous Codon Usage (RSCU)

The estimation of synonymous codon usage for each codon is investigated with the help of Relative Synonymous Codon Usage (RSCU) analysis. The RSCU values for a codon equaling unity (RSCU = 1) indicate an absence of any codon usage bias, while the higher values (RSCU > 1) suggest the presence of positive bias (preferred codon), and lower values (RSCU < 1) suggest the presence of negative bias (non-preferred codon) [45,49,53,54,55,56,57]. The RSCU values for 59 codons were calculated with the help of CAIcal [58] and codonW [33].

2.6. Codon Adaptation Index (CAI)

The Codon Adaptation Index (CAI) helps in the estimation of the relative adaptation of a viral gene to the codon usage of its host. The CAI values vary from 0 to 1, where a higher value suggests the usage of the most preferred codons among the host and virus. It is a generally agreed parameter for quantifying the degree of resemblances between codon usage of a gene and a reference dataset to investigate synonymous codon usage. Also, an estimate of gene expressivity, an examination of the factors governing synonymous codon usage, and an investigation of horizontal gene transfer have been performed with the assistance of CAI values [53,58,59,60,61,62]. Therefore, we employed CAI to investigate the codon usage preferences of NeVs in relation to seven host species, viz. Bos taurus, Bubalus bubalis, Capra hircus, Ovis aries, Equus caballus, Equus asinus, and Camelus dromedarius. The codon usage of the hosts was extracted from the Codon Usage Database [63], whereas the codon usage of NeVs was calculated by codonW [33] and CAIcal [58]. The frequencies of tRNAs in different hosts were retrieved from the GtRNAdb database (http://gtrnadb.ucsc.edu, accessed on 22 March 2024). Notably, the tRNA frequencies were available for only three hosts (out of the 7 different hosts considered in this study), viz. Bos taurus, Equus caballus, and Ovis aries.

2.7. Relative Codon Deoptimization Index (RCDI)

The RCDI provides an inference to virus–host phylogenetic relationships and helps in deducing the possible host range of a virus. Also, a high RCDI reflects the possibility of expression of a gene in dormant stages and the presence of virus at a low replication rate in the hosts [5,46,49,64]. The ratio of observed RCDI (oRCDI) to the eRCDI offers a degree of virus–host adaptability, where a ratio close to 1 or higher represents better adaptability as compared to lower values. The synergistic effect of codon usage on gene expression and the plausible coevolution of the virus and its host genomes may also be indicated with the help of the RCDI of different genes for different hosts [2,5,28,29,44,46,57,64].
The Relative Codon Deoptimization Index (RCDI) values of the VP1 and VP2 genes of NeVs were calculated for an approximation of their similarity in codon frequencies against the codon frequencies of different hosts. The expected RCDI (eRCDI) was derived by using randomly generated sequences with the same G + C and amino acid compositions as the VP1 and VP2 genes in different NeVs. The observed RCDI (oRCDI) was directly calculated from the VP1 and VP2 gene sequences. The oRCDI and eRCDI were calculated by using the RCDI/eRCDI web server, available at http://genomes.urv.cat/CAIcal/RCDI, accessed on 22 March 2024 [64].

2.8. Recombination Analysis of NeVs

To investigate the recombination events among different strains of NeVs, RDP (version 4.1) was used [65,66]. The detection and characterization of recombination events in RDP (version 4.1) were accomplished by a simultaneous implementation of a diverse range of recombination detection methods that could deliver the desired efficiency despite the absence of a user-defined set of non-recombinant reference sequences. The multiple sequence alignment of all NeVs’ whole-genome sequences was performed by using MUSCLE [67] and this alignment was used as an input for recombination analysis in RDP (version 4.1) by selecting seven available methods for recombination detection, i.e., RDP [65], GeneConv [18,68], Chimera [69], MaxChi [70], 3Seq [71], BootScan [72], and SiScan [73]. Since the available genome sequences for NeVs are restricted (n = 16), all the sequences were used for performing recombination analysis. The highest acceptable p-value setting differs for the different methods. As recommended by the developers of RDP, the highest acceptable p-value 0.05 was used with multiple comparison correction. Further, the boundaries of breakpoints were determined by using the BURT method in the immediate vicinity of the identified recombination events along with implementing internal and external references.

3. Results

3.1. Nucleotide Composition and Codon Usage Bias Are Significantly Different in NeVs Infecting Yaks than Those of Bovines

The compositions of different nucleotides at the third synonymous codon position in NeVs are shown in Figure 1 for VP1 and VP2 genes. The composition analysis of the VP1 and VP2 genes of different NeVs indicated a GC-rich nature of the VP1 and VP2 genes. Notably, all the codons of each open reading frame were analyzed in the present study. The VP1 gene of different NeVs showed a higher (0.74 ± 0.043) GC content at the third synonymous codon positions (G3s + C3s) as compared to the VP2 gene (0.66 ± 0.057). Notably, the NeVs infecting yaks carried a lower GC content for both VP1 (0.65 ± 0.004) and VP2 (0.53 ± 0.032) at the third synonymous codon positions as compared to the NeVs infecting bovines (VP1 = 0.76 ± 0.037, and VP2 = 0.68 ± 0.029). Furthermore, an inverse pattern was observed for the AT composition at the third synonymous codon positions (A3s + T3s) of bovine-infecting NeVs. However, NeVs infecting yaks were observed to be AT rich at the third synonymous codon positions (VP1 = 0.55 ± 0.004, and VP2 = 0.64 ± 0.031) as compared to NeVs infecting bovines (VP1 = 0.42 ± 0.027, and VP2 = 0.47 ± 0.032). Further, we analyzed the codon usage patterns for conserved and non-conserved regions of the VP1 and VP2 genes of NeVs. Figure S1 summarizes the overall codon usage in conserved and non-conserved regions of the VP1 and VP2 genes of NeVs. Additionally, Tables S2 and S3 provide codon usage in conserved and non-conserved regions of the VP1 and VP2 genes. It is observed that the conserved regions preferred 11 A/T-ending and 16 G/C-ending codons while non-conserved regions preferred 19 A/T- and 15 G/C-ending codons.
The ENC values for the VP1 gene of different NeVs ranged from 52 to 56, with an ENC average of 52.8 ± 1.51. However, the ENC values for the VP2 gene showed a more diverse range (47.5 to 57.5), averaging 53.7 (±2.71). The ENC values for both the VP1 and VP2 genes of bovine NeVs are lower (52.7 ± 1.870) than those of yak NeVs (56.6 ± 0.639). The observed ENC values for different NeVs are provided in Supplementary Table S4. The higher ENC values in yak NeVs suggest a negligible codon usage bias as compared to bovine NeVs.

3.2. Both Mutational Pressure and Natural Selection Govern the Codon Usage Patterns Differently across the NeV Genes as Well as Host Species

Next, we estimated the impact of mutation pressure and natural selection on the codon usage bias in both VP1 and VP2 genes of NeVs using an ENC-GC3 plot. The observed ENCs (oENCs) and GC3s for VP1 and VP2 genes are plotted with the standard curve for eENCs (Figure 2A). In Figure 2A, the eENC curve denotes that the contribution of only mutation pressure dictates codon usage. The observed ENC values for both the VP1 and VP2 genes of NeVs are positioned away from the eENC standard curve, indicating the role of both mutational pressure and selection pressure in governing codon usage. Notably, the distribution of VP1 genes of NeVs is comparatively closer to the eENC standard curve than that of the VP2 gene, suggesting the differential magnitude of influence of mutational pressure on them [43,56]. Furthermore, the deviations of oENC from eENC at different values of GC3s (ΔENC = eENCGC3s − oENCGC3s) of the VP1 and VP2 genes of NeVs are calculated to compare the role of mutational pressure in shaping codon usage (Table S4).
A higher deviation is suggestive of less impact of mutational pressure than other factors and vice versa [29,35,36,37,45,49]. It is observed that the average deviation in eENC and oENC values is lower in the VP1 gene (4.0 ± 0.54) as compared to the VP2 gene (5.47 ± 2.44) of NeVs. However, it is important to note that this deviation is very high in the VP2 gene of BoNeV/YLA-2/2017/CHN/MH718886 (ΔENC = 11.2), BoNeV/M3641/2011/HUN/JX018212 (ΔENC = 9.1), and BoNeV/LZB-1/2017/CHN/MG599036 (ΔENC = 8.0), reflecting a more prominent role of natural selection. Additionally, both the VP1 and VP2 genes of yak NeVs clustered away from the bovine NeVs. A higher deviation is suggestive of less impact of mutational pressure than other factors and vice versa [29,35,36,37,45,49]. It is observed that the average deviation in eENC and oENC values is lower in the VP1 gene (4.0 ± 0.54) as compared to the VP2 gene (5.47 ± 2.44) of NeVs. However, it is important to note that this deviation is very high in the VP2 gene of BoNeV/YLA-2/2017/CHN/MH718886 (ΔENC = 11.2), BoNeV/M3641/2011/HUN/JX018212 (ΔENC = 9.1), and BoNeV/LZB-1/2017/CHN/MG599036 (ΔENC = 8.0), reflecting a more prominent role of natural selection. Additionally, both the VP1 and VP2 genes of yak NeVs clustered away from the bovine NeVs.
The neutrality plot analysis showed that the magnitude of mutational pressure In shaping the codon usage in VP1 genes of NeVs is 18% (R2 = 0.41, p < 0.01). However, in the case of VP2 genes of NeVs, a substantial contribution of mutational pressure (43%) was noted (R2 = 0.41, p < 0.01) (Figure 2B). It is also worth mentioning that the VP1 and VP2 genes of yak NeVs have considerably different compositions of GC12 and GC3 as compared to their counterparts isolated from bovines and clustered differently in the neutrality plot (Figure 2B). A narrow distribution of GC compositions reflected the dominance of natural selection over mutational pressure, while a significant correlation between GC12 and GC3 compositions signified the impact of mutational pressure at all the codon positions.
The role of mutation pressure in codon usage is further explored by scrutinizing the directionality of GC and AT bias through the distribution of GC and AT compositions in the VP1 and VP2 genes of NeVs [39,42,49,50,51,52]. The GC and AT biases in the VP2 genes of yak NeVs are quite different from those of bovine NeVs, which is in concordance with the neutrality plot (Figure 2C). The wide distribution of GC and AT bias in the VP1 and VP2 genes of NeVs suggests the differential mutational pressure’s contribution in shaping codon usage [47,52,74].

3.3. Most Common Codons across NeVs Have Abundant G/C at the Third Position

The RSCU analysis of NeVs showed that all the 13 most abundant codons common across NeVs have G/C at the third position (AAG, ACC, AUC, CAC, CCC, CGC, CUC, CUG, GAG, GCC, GGC, GUG, UCC). However, the most preferred codons among the bovine NeVs include an additional five codons (CAG, GAC, GGG, UAC, UUC). In the case of yak NeVs, the total number of preferred codons increased to 30, of which 12 codons have A/U at the third codon position. Overall, the RSCU analysis revealed that NeVs have a higher codon usage bias for the codons with G/C at the third codon position as compared to the A/U-ended codons. A correlation analysis of RSCU among the bovine NeVs indicated a highly similar codon usage pattern (average R2 = 0.75 ± 0.008, p < 0.001). However, the codon usage pattern in yak NeVs was found to deviate from the bovine NeVs (average R2 = 0.51 ± 0.003, p < 0.001). A heatmap representation of the RSCU analysis of 59 codons (excluding UGG, AUG, UAG, UAA, and UGA) of NeVs and their comparison with different host species are depicted in Figure 3. Although NeVs have been detected in bovines and yak species, other host species were also included in this analysis to understand their comparative codon usage similarity.
Furthermore, the correlation analysis between the RSCU of different NeVs and seven host species (Bos taurus, Equus caballus, Equus asinus, Camelus dromedarius, Bubalus bubalis, Ovis aries, Capra hircus) revealed that bovine NeVs have a higher preference for Bos taurus, indicating the practical utility of implementing RSCU analysis. A summary of the correlation analysis of RSCUs between different NeVs and potential hosts is provided in Supplementary Table S5 and common preferred codons (out of 59 codons) in different strains of NeVs and different hosts in Supplementary Table S6.
The host specificity of NeVs is further explored in terms of the number of preferred codons that are shared by the host species and virus, where a higher number of preferred codons common in the host and virus suggests a better host adaptability [41,54,57,75,76,77,78]. For instance, the BoNeV/Mukti/2016/IND/MN241817 is observed to share 21 preferred codons with Bos taurus and Equus caballus. Likewise, for BoNev/Kirklareli/2012/TUR/NC_030793, 23 preferred codons were shared with Bos taurus.

3.4. Codon Adaptation Index Showed Marginal Variations among NeVs

The CAI of NeVs is the highest for Bubalus bubalis (0.71 ± 0.009 for VP1, 0.69 ± 0.027 for VP2), followed by Bos taurus (0.70 ± 0.009 for VP1, 0.68 ± 0.026 for VP2). However, the CAI is the lowest for Camelus dromedarius (0.65 ± 0.011 for VP1, 0.63 ± 0.024 for VP2). The overall CAI for NeVs is observed to be on the higher side and well suited to the bovines. However, the variations in the CAI among NeVs are found to be marginal, including the yak NeVs. The CAIs of different NeVs for VP1 and VP2 genes are provided in Supplementary Tables S7 and S8, respectively.
It may be inferred that the CAI values of the VP1 and VP2 genes of different NeVs for different hosts were observed to be high and did not deviate significantly. A controlled expression of viral proteins and replicative stability in different viruses is accomplished either by using optimized codon usage or by escaping the immune system of hosts [49,50,56,70]. In the case of NeVs, the higher values of CAI suggest the usage of optimal codons, indicating their replication stability in hosts. Among the different hosts, the VP1 and VP2 genes of NeVs reflect a slightly better adaptability to Bos taurus, and Bubalus bubalis as compared to other hosts.

3.5. Viral Adaptability to Different Hosts through RCDI

Notably, the ratio of the observed RCDI to the expected RCDI is the highest for both the VP1 and VP2 genes of strain Mukti (accession id—MN241817) for Bos taurus (0.79 and 0.91), followed by Equus callabus (0.77 and 0.89). Nonetheless, the RCDI ratios for VP1 genes of other NeVs are also high for Bos taurus (0.78 ± 0.02) as compared to other hosts (Capra hircus 0.76 ± 0.06; Equus callabus 0.75 ± 0.02; Bubalus bubalis 0.74 ± 0.02; Camelus dromedarius 0.72 ± 0.02; Ovis aries 0.72 ± 0.01; Equss asinus 0.72 ± 0.01), as depicted in Figure 4. For VP2 genes, a similar trend of RCDI ratios was observed (Figure 4). The high RCDI ratios suggest a slightly better adaptability and specificity of NeVs to Bos taurus as compared to other hosts. The observed RCDI values and expected RCDI values for both the VP1 and VP2 genes of NeVs in relation to different host species are provided in Supplementary Tables S9 and S10, respectively.

3.6. Isoacceptor tRNA Pool Analysis of Different Hosts

Different codons that represent distinct amino acid residues are bound by distinct isoacceptor tRNAs of different tRNA species. Translation selection is determined by whether the majority of codons that NeVs prefer are recognized by the majority of isoacceptor tRNAs in the host. The most prevalent isoacceptor tRNAs identify the favored codons, demonstrating the impact of translational selection on codon usage. Understanding how neboviruses adapt to the tRNA pool of various hosts is crucial since translation is the primary mechanism in any virus life cycle. As a result, we examined the codons that the hosts’ tRNA pool liked [28]. When examining the synonymous codon families and the tRNA anticodon, it becomes apparent that, with the exception of Trp and Met, among nine two-fold synonymous codon families in NeVs, seven (Asn, Asp, Cys, Gln, Glu, His, Lys, Phe, and Tyr) exhibit optimal codon–anticodon usage (Supplementary Table S11). Specifically, the preferred codons within these families are associated with the most frequently occurring tRNA isotypes in bovine cells, indicating an optimal alignment of codon and anticodon pairs. In the rest of the synonymous codon families (Ala, Arg, Ile, Leu, Pro, Ser, Thr, Val), except Gly, NeVs exhibit non-optimal codon–anticodon usage. In contrast, two YAK NeVs carried distinct non-optimal codon–anticodon usage for Phe and Gln, and optimal for Ser and Val. These results highlight that NeVs have evolved to use combined codon–anticodon usage (optimal and non-optimal) of different host species’ tRNA pools.

3.7. Phylogenetic Analysis of NeVs Revealed Two Distinct Clusters

The genomic sequences may be used to deduce the evolutionary relationships among different organisms and these relationships may be represented in the form of phylogenetic trees. With the evolution and divergence of organisms, mutations are accumulated in the nucleotide sequences that may be compared via multiple sequence alignment to trace the evolutionary history [3,79,80,81,82]. The phylogenetic analysis of NeVs showed that NeVs formed two distinct clusters, where one cluster comprised Mukti, Kirklareli, and other NeVs recently detected in China in the year 2021, whereas the second cluster consisted of NeV strains from China, Japan, and Hungary, including the representative NeV strains (Figure 5). Additionally, to understand the identity and non-identity groups according to the amino acid translated, we created a pairwise homology plot for the VP1 and VP2 genes of neboviruses (Figure S2), which revealed a similar pattern of relationships to that indicated by phylogeny analysis.

3.8. Recombination Analysis Indicated Four Potential Recombination Events in NeVs

RNA viruses are known to adapt to respective hosts in a more efficient manner as compared to DNA viruses due to their high mutation rates [83,84,85]. The favorable mutations are believed to occur because of errors in gene replication that enable the viruses to acclimatize to diverse environmental selection pressures. Recombination events generally integrate distantly related or unrelated sequences and pose a challenge for accurate reconstruction of the sequence-based phylogeny [7,84].
Four events of recombination were observed among the NeVs, where at least two out of the seven methods detected recombination events. For instance, only two recombination events involving ‘BoNeV/LZB-1/2017/CHN/MG599036’ and ‘BoNev/Newbury1/2005/UK/NC_007916’ were detected using all methods. An overview of detected recombination events, corresponding recombinants, major parents, minor parents, and methods showing positive and negative symbols for the presence or absence of recombination is provided in Table 2. Also, the genomic positions of major and minor parents for different recombination events are summarized in Supplementary Table S12.

4. Discussion

The investigation of codon usage patterns in genomes has garnered significant attention due to its implications for understanding the molecular basis of gene expression, evolution, protein synthesis, and host adaptation [16,44]. Synonymous codon usage in different gene-coding regions is a consequence of codon usage bias and is governed through specific selection pressures such as natural selection, mutation pressure, and nucleotide compositional constraints [2,3,86]. Among these factors, the selective pressure from the host cells is opined to be very crucial in viruses [29,57]. The obligate intracellular parasitic nature of viruses subjugates them on the host cell mechanism for gene expression, and a comprehensive codon usage analysis of viral genomes may help in exploring the viral fitness and specificity to various primary and reservoir hosts [2,5,29,44,86]. However, the factors influencing the evolution of codon usage in NeVs, along with their preferences in codon usage in comparison with hosts, have not been explored to date. Therefore, in this study, we presented a comprehensive approach to systematically examine genome-wide codon usage patterns in NeVs. This examination extended to exploring these distinctive signatures in the various proteins encoded by NeVs.
To investigate the influence of nucleotide compositional constraints and mutational pressure on shaping synonymous codon usage bias, we extracted the complete genomes of 16 NeVs. It is important to acknowledge that biases or errors might be introduced by different sequencing technologies. Consequently, additional information about sequence depth could greatly enhance the interpretation of the results from assembled genomes. While the NCBI repository for complete genomes encourages submitters to provide such information as metadata, unfortunately, it is not available for the 16 NeVs, neither through metadata nor in the corresponding research articles. It is worth noting that some RNA viruses employ a strategy of population diversity to adapt to various host species, resulting in a spectrum of genotypes with differing frequencies across hosts. Hence, sequence depth emerges as a critical parameter for genetic analyses of viral genomic sequences. We performed an ENC plot analysis to investigate how the nucleotide compositional constraint and mutational pressure contribute to shaping the bias in synonymous codon usage within NeV genomes. This analysis revealed that the yak NeVs detected in China deviated significantly from other NeVs, indicating the profound influence of GC3s in compositional bias in yak NeVs. However, all the NeVs lay below the expected ENC curve, implying that mutation pressure is not the only primary factor governing the codon bias in NeVs; other factors also contribute to shaping the codon usage bias in NeVs. Therefore, we conducted a neutrality plot analysis to unravel the influence of crucial factors, namely natural selection and mutation pressure, on the bias observed in codon usage. Natural selection primarily dictated the codon usage shaping in both the VP1 (82%) and VP2 (57%) genes of NeVs. However, neutrality plot analysis also suggested that yak NeVs possess a considerably different composition of GC12 and GC3 as compared to their counterparts detected from bovines and cluster differently. This differential nucleotide compositional constraint and mutational pressure observed in yak NeVs seems to be primarily host-driven. The host-driven factors influencing the codon usage bias in viruses have been demonstrated in various previous studies [3,29,86].
The codon usage bias in multiple RNA viruses, such as Equine Influenza virus (ENC = 52.09) [28]; H5N1 Influenza virus (ENC = 50.91) [87]; Chikungunya Virus (ENC = 55.56) [88]; Nipah virus (ENC = 51.06) [89]; West Nile Virus (ENC = 53.81) [90]; and Foot-and-Mouth Disease Virus (ENC = 51.42) [91] showed that RNA viruses prefer lower codon bias, which is in concordance with our study (ENC = 52.7). The possible explanation of this lowered codon bias is that it provides an opportunity for RNA viruses to adapt to multiple hosts, whose microcellular environments offer completely different tRNA pools [28].
Furthermore, to elucidate the degree of preference for A/U- and G/C-ended codons and to assess the patterns of synonymous codon usage, we calculated the Relative Synonymous Codon Usage (RSCU) values for each codon across 16 strains of NeVs. These values were then compared among various host species. It is evident from RSCU analysis that NeVs possess a higher codon usage bias for the codons with G/C at the third codon position as compared to the A/U-ended codons. This contrasts with previous studies, where different RNA viruses prefer A- or U-ended codons [2,3,28,29,86,87,92]. To compare the preferences in codon usage between NeVs and various host species, we combine RSCU analysis with the Codon Adaptation Index (CAI) and Relative Codon Deoptimization Index (RCDI) analyses. These analyses clearly showed that NeVs possess codon usage patterns more similar to bovines.
Codon usage in viruses is also affected by the translation selection (tRNA abundance along with their modifications) in a particular host species [93], both of which can be quantified by nanopore RNA sequencing (nano tRNA-seq) [94]. Translational selection also influences the codon usage in viruses because the viruses show preferences for the codons that are identified by the most prevalent isoacceptor tRNAs in the host to ensure efficient translation of their proteins [28]. This way, viruses try to adapt to the tRNA pool of the host. Given that translation selection plays a pivotal role in the life cycle of any virus, it becomes crucial to understand how different NeVs adapt to the pool of different host species’ tRNAs. When examining the synonymous codon families and the tRNA anticodon, it becomes apparent that, with the exception of Trp and Met, among nine two-fold synonymous codon families in NeVs, seven (Asn, Asp, Cys, Gln, Glu, His, Lys, Phe, and Tyr) exhibit optimal codon–anticodon usage (Supplementary Information). Specifically, the preferred codons within these families are associated with the most frequently occurring tRNA isotypes in bovine cells, indicating an optimal alignment of codon and anticodon pairs. In the rest of the synonymous codon families (Ala, Arg, Ile, Leu, Pro, Ser, Thr, Val), except Gly, NeVs exhibit non-optimal codon–anticodon usage. In contrast, two yak NeVs carried distinct non-optimal codon–anticodon usage for Phe and Gln, and optimal for Ser and Val. These results highlight that NeVs have evolved to use combined codon–anticodon usage (optimal and non-optimal) of different host species’ tRNA pools.
Epigenetic modifications in viruses can influence codon usage by affecting the accessibility of viral genomes, regulating viral gene expression, and responding to host immune pressures [95]. Understanding these interactions is crucial for unraveling the molecular mechanisms underlying viral pathogenesis and evolution; however, studies linking specific epigenetic modifications to codon usage in viruses are limited.
This study convincingly illustrates that the intricate patterns of genome-wide codon usage in NeVs are shaped by a complex interplay of determinant factors, including mutation pressure, natural selection, nucleotide compositional constraints, and other unidentified factors. The codon usage bias in NeVs was found to be relatively weaker, primarily influenced by mutation pressure and nucleotide compositional constraints. Overall, the findings of this study contribute to a deeper understanding of the factors involved in shaping codon usage in NeVs and their fitness in relation to their host.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/microorganisms12040696/s1, Figure S1. A depiction of an overall codon usage in conserved and non-conserved regions of VP1 and VP2 genes of NeVs. Figure S2. A heatmap representation of pairwise homology of VP1 and VP2 genes of different NeVs. The lower triangle represents VP1 genes while the upper triangle represents VP2 genes. Table S1. A summary of metadata of complete genome assembly of different NeVs. Table S2. A summary of codon usage (per thousand nucleotides) for the conserved regions of VP1 and VP2 genes of different NeVs. Table S3. A summary of codon usage per thousand nucleotides for the non-conserved regions of VP1 and VP2 genes of NeVs. Table S4. A summary of Effective Number of Codons (ENC) in VP1 and VP2 gene of different NeVs. The GC composition at 3rd synonymous codon position (GC3s), observed ENC (oENC), expected ENC (eENC) and deviation in oENC and eENC (ΔENC) are summarized. Table S5. A summary of correlation analysis (R2) of RSCU between different NeVs and potential hosts. Table S6. A summary of common preferred codons (out of 59 codons) in different strains of NeVs and different hosts. Table S7. A summary of codon adaptation indices of VP1 gene of NeVs corresponding to different host. Table S8. A summary of codon adaptation indices of VP2 gene of NeVs corresponding to different host. Table S9. A summary of observed RCDI (oRCDI) values and expected RCDI (eRCDI) values for VP1 genes of different NeVs corresponding to different hosts. The expected RCDI (eRCDI) values for all the hosts are provided in parenthesis after their names. Table S10. A summary of observed RCDI (oRCDI) values and expected RCDI (eRCDI) values for VP2 genes of different Neboviruses corresponding to different hosts. The expected RCDI (eRCDI) values for all the hosts are provided in parenthesis after their names. Table S11. A summary of frequencies of tRNA genes in different hosts (Bos taurus, Equus caballus, and Ovis aries) for most preferentially used codons in NeVs. Table S12. A summary of recombination events observed from multiple sequence alignment of complete genome sequences of NeVs.

Author Contributions

Conceptualization, R.K., N.K. and Y.S.M.; Data curation, R.K., N.K., S.S., A.S. and S.T.; Formal analysis, R.K., N.K. and S.S.; Funding acquisition, R.K. and T.L.; Investigation, R.K.; Methodology, R.K. and N.K.; Project administration, R.K. and Y.S.M.; Resources, R.K., T.L. and Y.S.M.; Supervision, Y.S.M.; Validation, A.S.; Visualization, P.Y., S.S., A.S.-A., S.T. and T.L.; Writing—original draft preparation, R.K. and N.K.; Writing—review and editing, R.K., N.K., P.Y., S.S., A.S.-A., A.S., S.T., T.L. and Y.S.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article and Supplementary Materials; further inquiries can be directed to the corresponding authors.

Acknowledgments

We are extremely grateful to the ICAR-Education Division, New Delhi, for the financial assistance with the ICAR National Fellowship to YSM and staff of the state animal husbandry departments for their constant help with the sampling process. We are thankful to the Indian Veterinary Research Institute for providing research facilities. RK and TL are thankful to the Biotechnology Research Center, Technology Innovation Institute, Abu Dhabi, UAE, for its support and resources.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kumar, N.; Kaushik, R.; Singh, A.; Uversky, V.N.; Zhang, K.Y.J.; Sahu, U.; Bhatia, S.; Sanyal, A. Bayesian Molecular Dating Analyses Combined with Mutational Profiling Suggest an Independent Origin and Evolution of SARS-CoV-2 Omicron BA.1 and BA.2 Sub-Lineages. Viruses 2022, 14, 2764. [Google Scholar] [CrossRef] [PubMed]
  2. Kumar, N.; Kaushik, R.; Tennakoon, C.; Uversky, V.N.; Longhi, S.; Zhang, K.Y.J.; Bhatia, S. Insights into the evolutionary forces that shape the codon usage in the viral genome segments encoding intrinsically disordered protein regions. Brief. Bioinform. 2021, 22, bbab145. [Google Scholar] [CrossRef] [PubMed]
  3. Malik, Y.S.; Ansari, M.I.; Kattoor, J.J.; Kaushik, R.; Sircar, S.; Subbaiyan, A.; Tiwari, R.; Dhama, K.; Ghosh, S.; Tomar, S.; et al. Evolutionary and codon usage preference insights into spike glycoprotein of SARS-CoV-2. Brief. Bioinform. 2021, 22, 1006–1022. [Google Scholar] [CrossRef] [PubMed]
  4. Alkan, F.; Karayel, İ.; Catella, C.; Bodnar, L.; Lanave, G.; Bányai, K.; Di Martino, B.; Decaro, N.; Buonavoglia, C.; Martella, V. Identification of a Bovine Enteric Calicivirus, Kırklareli Virus, Distantly Related to Neboviruses, in Calves with Enteritis in Turkey. J. Clin. Microbiol. 2015, 53, 3614–3617. [Google Scholar] [CrossRef] [PubMed]
  5. Castells, M.; Victoria, M.; Colina, R.; Musto, H.; Cristina, J. Genome-wide analysis of codon usage bias in Bovine Coronavirus. Virol. J. 2017, 14, 115. [Google Scholar] [CrossRef] [PubMed]
  6. Di Felice, E.; Mauroy, A.; Pozzo, F.D.; Thiry, D.; Ceci, C.; Di Martino, B.; Marsilio, F.; Thiry, E. Bovine noroviruses: A missing component of calf diarrhoea diagnosis. Vet. J. 2016, 207, 53–62. [Google Scholar] [CrossRef] [PubMed]
  7. Koonin, E.V.; Dolja, V.V.; Krupovic, M.; Varsani, A.; Wolf, Y.I.; Yutin, N.; Zerbini, F.M.; Kuhn, J.H. Global Organization and Proposed Megataxonomy of the Virus World. Microbiol. Mol. Biol. Rev. 2020, 84, 2. [Google Scholar] [CrossRef] [PubMed]
  8. Lopes, A.M.; Dalton, K.P.; Magalhães, M.J.; Parra, F.; Esteves, P.J.; Holmes, E.C.; Abrantes, J. Full genomic analysis of new variant rabbit hemorrhagic disease virus revealed multiple recombination events. J. Gen. Virol. 2015, 96, 1309–1319. [Google Scholar] [CrossRef]
  9. Candido, M.; Alencar, A.L.; Almeida-Queiroz, S.R.; Buzinaro, M.G.; Munin, F.S.; Godoy, S.H.; Livonesi, M.C.; Fernandes, A.M.; Sousa, R.L. First detection and molecular characterization of Nebovirus in Brazil. Epidemiol. Infect. 2016, 144, 1876–1878. [Google Scholar] [CrossRef]
  10. Le Pendu, J.; Abrantes, J.; Bertagnoli, S.; Guitton, J.S.; Le Gall-Reculé, G.; Lopes, A.M.; Marchandeau, S.; Alda, F.; Almeida, T.; Célio, A.P.; et al. Proposal for a unified classification system and nomenclature of lagoviruses. J. Gen. Virol. 2017, 98, 1658–1666. [Google Scholar] [CrossRef]
  11. Martella, V.; Pinto, P.; Lorusso, E.; Di Martino, B.; Wang, Q.; Larocca, V.; Cavalli, A.; Camero, M.; Decaro, N.; Bányai, K.; et al. Detection and Full-Length Genome Characterization of Novel Canine Vesiviruses. Emerg. Infect. Dis. 2015, 21, 1433–1436. [Google Scholar] [CrossRef] [PubMed]
  12. Oka, T.; Stoltzfus, G.T.; Zhu, C.; Jung, K.; Wang, Q.; Saif, L.J. Attempts to grow human noroviruses, a sapovirus, and a bovine norovirus in vitro. PLoS ONE 2018, 13, e0178157. [Google Scholar] [CrossRef] [PubMed]
  13. Oka, T.; Wang, Q.; Katayama, K.; Saif, L.J. Comprehensive review of human sapoviruses. Clin. Microbiol. Rev. 2015, 28, 32–53. [Google Scholar] [CrossRef] [PubMed]
  14. Guo, Z.; He, Q.; Zhang, B.; Yue, H.; Tang, C. Detection and molecular characteristics of neboviruses in dairy cows in China. J. Gen. Virol. 2019, 100, 35–45. [Google Scholar] [CrossRef] [PubMed]
  15. Turan, T.; Işıdan, H.; Atasoy, M.O.; Irehan, B. Detection and Molecular Analysis of Bovine Enteric Norovirus and Nebovirus in Turkey. J. Vet. Res. 2018, 62, 129–135. [Google Scholar] [CrossRef] [PubMed]
  16. Alexaki, A.; Kames, J.; Holcomb, D.D.; Athey, J.; Santana-Quintero, L.V.; Lam, P.V.N.; Hamasaki-Katagiri, N.; Osipova, E.; Simonyan, V.; Bar, H.; et al. Codon and Codon-Pair Usage Tables (CoCoPUTs): Facilitating Genetic Variation Analyses and Recombinant Gene Design. J. Mol. Biol. 2019, 431, 2434–2441. [Google Scholar] [CrossRef]
  17. Edwards, J.F.; Yedloutschnig, R.J.; Dardiri, A.H.; Callis, J.J. Vesicular exanthema of swine virus: Isolation and serotyping of field samples. Can. J. Vet. Res. 1987, 51, 358–362. [Google Scholar]
  18. Padidam, M.; Sawyer, S.; Fauquet, C.M. Possible emergence of new geminiviruses by frequent recombination. Virology 1999, 265, 218–225. [Google Scholar] [CrossRef]
  19. Pedersen, N.C.; Elliott, J.B.; Glasgow, A.; Poland, A.; Keel, K. An isolated epizootic of hemorrhagic-like fever in cats caused by a novel and highly virulent strain of feline calicivirus. Vet. Microbiol. 2000, 73, 281–300. [Google Scholar] [CrossRef]
  20. Smiley, J.R.; Chang, K.O.; Hayes, J.; Vinjé, J.; Saif, L.J. Characterization of an enteropathogenic bovine calicivirus representing a potentially new calicivirus genus. J. Virol. 2002, 76, 10089–10098. [Google Scholar] [CrossRef]
  21. Smith, A.W.; Iversen, P.L.; Skilling, D.E.; Stein, D.A.; Bok, K.; Matson, D.O. Vesivirus viremia and seroprevalence in humans. J. Med. Virol. 2006, 78, 693–701. [Google Scholar] [CrossRef]
  22. Cho, E.H.; Soliman, M.; Alfajaro, M.M.; Kim, J.Y.; Seo, J.Y.; Park, J.G.; Kim, D.S.; Baek, Y.B.; Kang, M.I.; Park, S.I.; et al. Bovine Nebovirus Interacts with a Wide Spectrum of Histo-Blood Group Antigens. J. Virol. 2018, 92, 9. [Google Scholar] [CrossRef] [PubMed]
  23. Diez-Valcarce, M.; Castro, C.J.; Marine, R.L.; Halasa, N.; Mayta, H.; Saito, M.; Tsaknaridis, L.; Pan, C.Y.; Bucardo, F.; Becker-Dreps, S.; et al. Genetic diversity of human sapovirus across the Americas. J. Clin. Virol. 2018, 104, 65–72. [Google Scholar] [CrossRef] [PubMed]
  24. Guo, Z.; He, Q.; Yue, H.; Zhang, B.; Tang, C. First detection of Nebovirus and Norovirus from cattle in China. Arch. Virol. 2018, 163, 475–478. [Google Scholar] [CrossRef] [PubMed]
  25. Park, S.I.; Jeong, C.; Park, S.J.; Kim, H.H.; Jeong, Y.J.; Hyun, B.H.; Chun, Y.H.; Kang, M.I.; Cho, K.O. Molecular detection and characterization of unclassified bovine enteric caliciviruses in South Korea. Vet. Microbiol. 2008, 130, 371–379. [Google Scholar] [CrossRef]
  26. Pourasgari, F.; Kaplon, J.; Sanchooli, A.; Fremy, C.; Karimi-Naghlani, S.; Otarod, V.; Ambert-Balay, K.; Mojgani, N.; Pothier, P. Molecular prevalence of bovine noroviruses and neboviruses in newborn calves in Iran. Arch. Virol. 2018, 163, 1271–1277. [Google Scholar] [CrossRef]
  27. Kumar, N.; Kaushik, R.; Yadav, P.; Sircar, S.; Shete-Aich, A.; Singh, A.; Malik, Y.S. A novel highly divergent enteric calicivirus in a bovine calf, India. bioRxiv 2023. [Google Scholar] [CrossRef]
  28. Kumar, N.; Bera, B.C.; Greenbaum, B.D.; Bhatia, S.; Sood, R.; Selvaraj, P.; Anand, T.; Tripathi, B.N.; Virmani, N. Revelation of Influencing Factors in Overall Codon Usage Bias of Equine Influenza Viruses. PLoS ONE 2016, 11, e0154376. [Google Scholar] [CrossRef]
  29. Kumar, N.; Kulkarni, D.D.; Lee, B.; Kaushik, R.; Bhatia, S.; Sood, R.; Pateriya, A.K.; Bhat, S.; Singh, V.P. Evolution of Codon Usage Bias in Henipaviruses Is Governed by Natural Selection and Is Host-Specific. Viruses 2018, 10, 604. [Google Scholar] [CrossRef]
  30. Mordstein, C.; Cano, L.; Morales, A.C.; Young, B.; Ho, A.T.; Rice, A.M.; Liss, M.; Hurst, L.D.; Kudla, G. Transcription, mRNA Export, and Immune Evasion Shape the Codon Usage of Viruses. Genome Biol. Evol. 2021, 13, evab106. [Google Scholar] [CrossRef]
  31. Katoh, K.; Rozewicki, J.; Yamada, K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2019, 20, 1160–1166. [Google Scholar] [CrossRef] [PubMed]
  32. Nguyen, L.T.; Schmidt, H.A.; von Haeseler, A.; Minh, B.Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef]
  33. Thioulouse, J.; Chevenet, F. NetMul, a World-Wide Web user interface for multivariate analysis software. Comput. Stat. Data Anal. 1996, 21, 369–372. [Google Scholar] [CrossRef]
  34. Rice, P.; Longden, I.; Bleasby, A. EMBOSS: The European Molecular Biology Open Software Suite. Trends Genet. 2000, 16, 276–277. [Google Scholar] [CrossRef] [PubMed]
  35. Fuglsang, A. The ‘effective number of codons’ revisited. Biochem. Biophys. Res. Commun. 2004, 317, 957–964. [Google Scholar] [CrossRef]
  36. Fuglsang, A. Accounting for background nucleotide composition when measuring codon usage bias: Brilliant idea, difficult in practice. Mol. Biol. Evol. 2006, 23, 1345–1347. [Google Scholar] [CrossRef] [PubMed]
  37. Wright, F. The ‘effective number of codons’ used in a gene. Gene 1990, 87, 23–29. [Google Scholar] [CrossRef] [PubMed]
  38. Archetti, M. Codon usage bias and mutation constraints reduce the level of error minimization of the genetic code. J. Mol. Evol. 2004, 59, 258–266. [Google Scholar] [CrossRef] [PubMed]
  39. Bulmer, M. The selection-mutation-drift theory of synonymous codon usage. Genetics 1991, 129, 897–907. [Google Scholar] [CrossRef]
  40. Chen, S.L.; Lee, W.; Hottes, A.K.; Shapiro, L.; McAdams, H.H. Codon usage between genomes is constrained by genome-wide mutational processes. Proc. Natl. Acad. Sci. USA 2004, 101, 3480–3485. [Google Scholar] [CrossRef]
  41. Cristina, J.; Moreno, P.; Moratorio, G.; Musto, H. Genome-wide analysis of codon usage bias in Ebolavirus. Virus Res. 2015, 196, 87–93. [Google Scholar] [CrossRef] [PubMed]
  42. dos Reis, M.; Savva, R.; Wernisch, L. Solving the riddle of codon usage preferences: A test for translational selection. Nucleic Acids Res. 2004, 32, 5036–5044. [Google Scholar] [CrossRef] [PubMed]
  43. Knight, R.D.; Freeland, S.J.; Landweber, L.F. A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes. Genome Biol. 2001, 2, Research0010. [Google Scholar] [CrossRef] [PubMed]
  44. Komar, A.A. The Yin and Yang of codon usage. Hum. Mol. Genet. 2016, 25, R77–R85. [Google Scholar] [CrossRef] [PubMed]
  45. Hershberg, R.; Petrov, D.A. Selection on codon bias. Annu. Rev. Genet. 2008, 42, 287–299. [Google Scholar] [CrossRef]
  46. Kunec, D.; Osterrieder, N. Codon Pair Bias Is a Direct Consequence of Dinucleotide Bias. Cell Rep. 2016, 14, 55–67. [Google Scholar] [CrossRef] [PubMed]
  47. Lehmann, J.; Libchaber, A. Degeneracy of the genetic code and stability of the base pair at the second position of the anticodon. RNA 2008, 14, 1264–1269. [Google Scholar] [CrossRef] [PubMed]
  48. Piovesan, A.; Pelleri, M.C.; Antonaros, F.; Strippoli, P.; Caracausi, M.; Vitale, L. On the length, weight and GC content of the human genome. BMC Res. Notes 2019, 12, 106. [Google Scholar] [CrossRef] [PubMed]
  49. Comeron, J.M.; Aguadé, M. An evaluation of measures of synonymous codon usage bias. J. Mol. Evol. 1998, 47, 268–274. [Google Scholar] [CrossRef]
  50. Sueoka, N. Directional mutation pressure and neutral molecular evolution. Proc. Natl. Acad. Sci. USA 1988, 85, 2653–2657. [Google Scholar] [CrossRef]
  51. Sueoka, N. Translation-coupled violation of Parity Rule 2 in human genes is not the cause of heterogeneity of the DNA G+C content of third codon position. Gene 1999, 238, 53–58. [Google Scholar] [CrossRef] [PubMed]
  52. Sueoka, N. Near homogeneity of PR2-bias fingerprints in the human genome and their implications in phylogenetic analyses. J. Mol. Evol. 2001, 53, 469–476. [Google Scholar] [CrossRef]
  53. Gustafsson, C.; Govindarajan, S.; Minshull, J. Codon bias and heterologous protein expression. Trends Biotechnol. 2004, 22, 346–353. [Google Scholar] [CrossRef]
  54. Shabalina, S.A.; Spiridonov, N.A.; Kashina, A. Sounds of silence: Synonymous nucleotides as a key to biological regulation and complexity. Nucleic Acids Res. 2013, 41, 2073–2094. [Google Scholar] [CrossRef] [PubMed]
  55. Sharp, P.M.; Li, W.H. An evolutionary perspective on synonymous codon usage in unicellular organisms. J. Mol. Evol. 1986, 24, 28–38. [Google Scholar] [CrossRef] [PubMed]
  56. Sharp, P.M.; Stenico, M.; Peden, J.F.; Lloyd, A.T. Codon usage: Mutational bias, translational selection, or both? Biochem. Soc. Trans. 1993, 21, 835–841. [Google Scholar] [CrossRef] [PubMed]
  57. Wang, H.; Liu, S.; Zhang, B.; Wei, W. Analysis of Synonymous Codon Usage Bias of Zika Virus and Its Adaption to the Hosts. PLoS ONE 2016, 11, e0166260. [Google Scholar] [CrossRef] [PubMed]
  58. Puigbò, P.; Bravo, I.G.; Garcia-Vallve, S. CAIcal: A combined set of tools to assess codon usage adaptation. Biol. Direct 2008, 3, 38. [Google Scholar] [CrossRef] [PubMed]
  59. Belalov, I.S.; Lukashev, A.N. Causes and implications of codon usage bias in RNA viruses. PLoS ONE 2013, 8, e56642. [Google Scholar] [CrossRef]
  60. Carbone, A.; Zinovyev, A.; Képès, F. Codon adaptation index as a measure of dominating codon bias. Bioinformatics 2003, 19, 2005–2015. [Google Scholar] [CrossRef]
  61. Dittmar, K.A.; Goodenbour, J.M.; Pan, T. Tissue-specific differences in human transfer RNA expression. PLoS Genet. 2006, 2, e221. [Google Scholar] [CrossRef] [PubMed]
  62. Sharp, P.M.; Li, W.H. The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987, 15, 1281–1295. [Google Scholar] [CrossRef] [PubMed]
  63. Nakamura, Y.; Gojobori, T.; Ikemura, T. Codon usage tabulated from international DNA sequence databases: Status for the year 2000. Nucleic Acids Res. 2000, 28, 292. [Google Scholar] [CrossRef]
  64. Puigbò, P.; Aragonès, L.; Garcia-Vallvé, S. RCDI/eRCDI: A web-server to estimate codon usage deoptimization. BMC Res. Notes 2010, 3, 87. [Google Scholar] [CrossRef]
  65. Martin, D.; Rybicki, E. RDP: Detection of recombination amongst aligned sequences. Bioinformatics 2000, 16, 562–563. [Google Scholar] [CrossRef]
  66. Martin, D.P.; Murrell, B.; Golden, M.; Khoosal, A.; Muhire, B. RDP4: Detection and analysis of recombination patterns in virus genomes. Virus Evol. 2015, 1, vev003. [Google Scholar] [CrossRef]
  67. Löytynoja, A.; Goldman, N. An algorithm for progressive multiple alignment of sequences with insertions. Proc. Natl. Acad. Sci. USA 2005, 102, 10557–10562. [Google Scholar] [CrossRef]
  68. Sawyer, S. Statistical tests for detecting gene conversion. Mol. Biol. Evol. 1989, 6, 526–538. [Google Scholar] [CrossRef] [PubMed]
  69. Posada, D.; Crandall, K.A. Evaluation of methods for detecting recombination from DNA sequences: Computer simulations. Proc. Natl. Acad. Sci. USA 2001, 98, 13757–13762. [Google Scholar] [CrossRef]
  70. Smith, D.W. Problems of translating heterologous genes in expression systems: The role of tRNA. Biotechnol. Prog. 1996, 12, 417–422. [Google Scholar] [CrossRef]
  71. Boni, M.F.; Posada, D.; Feldman, M.W. An exact nonparametric method for inferring mosaic structure in sequence triplets. Genetics 2007, 176, 1035–1047. [Google Scholar] [CrossRef] [PubMed]
  72. Salminen, M.O.; Carr, J.K.; Burke, D.S.; McCutchan, F.E. Identification of breakpoints in intergenotypic recombinants of HIV type 1 by bootscanning. AIDS Res. Hum. Retroviruses 1995, 11, 1423–1425. [Google Scholar] [CrossRef] [PubMed]
  73. Gibbs, M.J.; Armstrong, J.S.; Gibbs, A.J. Sister-scanning: A Monte Carlo procedure for assessing signals in recombinant sequences. Bioinformatics 2000, 16, 573–582. [Google Scholar] [CrossRef] [PubMed]
  74. Dietel, A.K.; Merker, H.; Kaltenpoth, M.; Kost, C. Selective advantages favour high genomic AT-contents in intracellular elements. PLoS Genet. 2019, 15, e1007778. [Google Scholar] [CrossRef] [PubMed]
  75. Butt, A.M.; Nasrullah, I.; Qamar, R.; Tong, Y. Evolution of codon usage in Zika virus genomes is host and vector specific. Emerg. Microbes Infect. 2016, 5, e107. [Google Scholar] [CrossRef] [PubMed]
  76. Gu, W.; Zhou, T.; Ma, J.; Sun, X.; Lu, Z. Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales. Virus Res. 2004, 101, 155–161. [Google Scholar] [CrossRef]
  77. Lara-Ramírez, E.E.; Salazar, M.I.; López-López Mde, J.; Salas-Benito, J.S.; Sánchez-Varela, A.; Guo, X. Large-scale genomic analysis of codon usage in dengue virus and evaluation of its phylogenetic dependence. BioMed Res. Int. 2014, 2014, 851425. [Google Scholar] [CrossRef]
  78. Zhou, H.; Wang, H.; Huang, L.F.; Naylor, M.; Clifford, P. Heterogeneity in codon usages of sobemovirus genes. Arch. Virol. 2005, 150, 1591–1605. [Google Scholar] [CrossRef]
  79. Castoe, T.A.; de Koning, A.P.; Kim, H.M.; Gu, W.; Noonan, B.P.; Naylor, G.; Jiang, Z.J.; Parkinson, C.L.; Pollock, D.D. Evidence for an ancient adaptive episode of convergent molecular evolution. Proc. Natl. Acad. Sci. USA 2009, 106, 8986–8991. [Google Scholar] [CrossRef]
  80. Kitchen, A.; Shackelton, L.A.; Holmes, E.C. Family level phylogenies reveal modes of macroevolution in RNA viruses. Proc. Natl. Acad. Sci. USA 2011, 108, 238–243. [Google Scholar] [CrossRef]
  81. Malik, Y.S.; Kumar, N.; Sharma, K.; Dhama, K.; Shabbir, M.Z.; Ganesh, B.; Kobayashi, N.; Banyai, K. Epidemiology, phylogeny, and evolution of emerging enteric Picobirnaviruses of animal origin and their relationship to human strains. BioMed Res. Int. 2014, 2014, 780752. [Google Scholar] [CrossRef] [PubMed]
  82. Pompei, S.; Loreto, V.; Tria, F. Phylogenetic properties of RNA viruses. PLoS ONE 2012, 7, e44849. [Google Scholar] [CrossRef] [PubMed]
  83. Rivailler, P.; Mao, N.; Zhu, Z.; Xu, W. Recombination analysis of Human mastadenovirus C whole genomes. Sci. Rep. 2019, 9, 2182. [Google Scholar] [CrossRef] [PubMed]
  84. Simon-Loriere, E.; Holmes, E.C. Why do RNA viruses recombine? Nat. Rev. Microbiol. 2011, 9, 617–626. [Google Scholar] [CrossRef] [PubMed]
  85. Worobey, M.; Holmes, E.C. Evolutionary aspects of recombination in RNA viruses. J. Gen. Virol. 1999, 80 Pt 10, 2535–2543. [Google Scholar] [CrossRef] [PubMed]
  86. Kumar, N.; Kaushik, R.; Tennakoon, C.; Uversky, V.N.; Mishra, A.; Sood, R.; Srivastava, P.; Tripathi, M.; Zhang, K.Y.J.; Bhatia, S. Evolutionary Signatures Governing the Codon Usage Bias in Coronaviruses and Their Implications for Viruses Infecting Various Bat Species. Viruses 2021, 13, 1847. [Google Scholar] [CrossRef] [PubMed]
  87. Zhou, T.; Gu, W.; Ma, J.; Sun, X.; Lu, Z. Analysis of synonymous codon usage in H5N1 virus and other influenza A viruses. Biosystems 2005, 81, 77–86. [Google Scholar] [CrossRef] [PubMed]
  88. Butt, A.M.; Nasrullah, I.; Tong, Y. Genome-wide analysis of codon usage and influencing factors in chikungunya viruses. PLoS ONE 2014, 9, e90905. [Google Scholar] [CrossRef] [PubMed]
  89. Khandia, R.; Singhal, S.; Kumar, U.; Ansari, A.; Tiwari, R.; Dhama, K.; Das, J.; Munjal, A.; Singh, R.K. Analysis of Nipah Virus Codon Usage and Adaptation to Hosts. Front. Microbiol. 2019, 10, 886. [Google Scholar] [CrossRef]
  90. Moratorio, G.; Iriarte, A.; Moreno, P.; Musto, H.; Cristina, J. A detailed comparative analysis on the overall codon usage patterns in West Nile virus. Infect. Genet. Evol. 2013, 14, 396–400. [Google Scholar] [CrossRef]
  91. Zhong, J.; Li, Y.; Zhao, S.; Liu, S.; Zhang, Z. Mutation pressure shapes codon usage in the GC-Rich genome of foot-and-mouth disease virus. Virus Genes 2007, 35, 767–776. [Google Scholar] [CrossRef] [PubMed]
  92. Goñi, N.; Iriarte, A.; Comas, V.; Soñora, M.; Moreno, P.; Moratorio, G.; Musto, H.; Cristina, J. Pandemic influenza A virus codon usage revisited: Biases, adaptation and implications for vaccine strain development. Virol. J. 2012, 9, 263. [Google Scholar] [CrossRef] [PubMed]
  93. Masuda, I.; Yamaki, Y.; Detroja, R.; Tagore, S.; Moore, H.; Maharjan, S.; Nakano, Y.; Christian, T.; Matsubara, R.; Lowe, T.M.; et al. tRNA methylation resolves codon usage bias at the limit of cell viability. Cell Rep. 2022, 41, 111539. [Google Scholar] [CrossRef] [PubMed]
  94. Lucas, M.C.; Pryszcz, L.P.; Medina, R.; Milenkovic, I.; Camacho, N.; Marchand, V.; Motorin, Y.; Ribas de Pouplana, L.; Novoa, E.M. Quantitative analysis of tRNA abundance and modifications by nanopore RNA sequencing. Nat. Biotechnol. 2024, 42, 72–86. [Google Scholar] [CrossRef]
  95. Tsai, K.; Cullen, B.R. Epigenetic and epitranscriptomic regulation of viral replication. Nature reviews. Microbiology 2020, 18, 559–570. [Google Scholar] [CrossRef]
Figure 1. Composition analysis of VP1 and VP2 genes of different NeVs at the 3rd synonymous codon positions. Since the compositions are at the 3rd synonymous codon positions, these need not necessarily sum up to one.
Figure 1. Composition analysis of VP1 and VP2 genes of different NeVs at the 3rd synonymous codon positions. Since the compositions are at the 3rd synonymous codon positions, these need not necessarily sum up to one.
Microorganisms 12 00696 g001
Figure 2. (A) ENC-GC3 plot for the VP1 and VP2 genes, where the circle- and diamond-shaped data points account for VP1 and VP2 genes, respectively. The grey and red colors represent the NeVs isolated from bovines and yaks, respectively. (B) Neutrality plot for the VP1 (red) and VP2 (blue) genes of different NeVs. The regression coefficient indicates an 18% and 43% effect of mutation pressure in shaping the codon usage in VP1 and VP2 genes of NeVs, respectively. (C) Parity Rule 2 (PR2)-bias plot for VP1 and VP2 genes of different NeVs, where deviation from equal GC and AT bias in the VP1 and VP2 genes suggests the contribution of mutational pressure in codon usage in NeVs. The values highlighted with red circles represent the NeVs isolated from yaks.
Figure 2. (A) ENC-GC3 plot for the VP1 and VP2 genes, where the circle- and diamond-shaped data points account for VP1 and VP2 genes, respectively. The grey and red colors represent the NeVs isolated from bovines and yaks, respectively. (B) Neutrality plot for the VP1 (red) and VP2 (blue) genes of different NeVs. The regression coefficient indicates an 18% and 43% effect of mutation pressure in shaping the codon usage in VP1 and VP2 genes of NeVs, respectively. (C) Parity Rule 2 (PR2)-bias plot for VP1 and VP2 genes of different NeVs, where deviation from equal GC and AT bias in the VP1 and VP2 genes suggests the contribution of mutational pressure in codon usage in NeVs. The values highlighted with red circles represent the NeVs isolated from yaks.
Microorganisms 12 00696 g002
Figure 3. A heatmap representation of RSCU analysis of 59 codons (excluding UGG, AUG, UAG, UAA, and UGA) of NeVs and different host species.
Figure 3. A heatmap representation of RSCU analysis of 59 codons (excluding UGG, AUG, UAG, UAA, and UGA) of NeVs and different host species.
Microorganisms 12 00696 g003
Figure 4. A heatmap depiction of ratios between observed and expected Relative Codon Deoptimization Index (oRCDI and eRCDI) of VP1 and VP2 genes of different NeVs with respect to different hosts, where the higher ratio indicates better viral adaptability to the corresponding host(s).
Figure 4. A heatmap depiction of ratios between observed and expected Relative Codon Deoptimization Index (oRCDI and eRCDI) of VP1 and VP2 genes of different NeVs with respect to different hosts, where the higher ratio indicates better viral adaptability to the corresponding host(s).
Microorganisms 12 00696 g004
Figure 5. A phylogenetic tree depicting genetic relationships of NeVs within the Caliciviridae family.
Figure 5. A phylogenetic tree depicting genetic relationships of NeVs within the Caliciviridae family.
Microorganisms 12 00696 g005
Table 1. A summary of different strains of NeVs available on the NCBI resource portal as complete genomes. The GenBank identifiers, genome size in base pairs, start and end positions of ORF1 and ORF2 on the genome, and country of isolation of the viruses are provided. For the sequencing methods, NA represents the NeVs for which the information about method is neither available though public resources nor through corresponding research articles.
Table 1. A summary of different strains of NeVs available on the NCBI resource portal as complete genomes. The GenBank identifiers, genome size in base pairs, start and end positions of ORF1 and ORF2 on the genome, and country of isolation of the viruses are provided. For the sequencing methods, NA represents the NeVs for which the information about method is neither available though public resources nor through corresponding research articles.
NeVsAccession IDMethodSize (bp)Source
(PMID)
Origin
BoNeV/Nebraska/2002/USA/NC_004064NC_004064NA745312239283USA
BoNeV/TCG-14/2003/JPN/NC_006875NC_006875NA7453NCBIJapan
BoNev/Newbury1/2005/UK/NC_007916NC_007916NA745416574184UK
BoNeV/M3641/2011/HUN/JX018212JX018212Sanger7453NCBIHungary
BoNev/Kirklareli/2012/TUR/NC_030793NC_030793Sanger748426292294Turkey
BoNeV/Mukti/2016/IND/MN241817MN241817Sanger7484NCBIIndia
BoNeV/LZB-1/2017/CHN/MG599036MG599036Sanger745331500726China
YAKNeV/NRG-17/2017/CHN/MK452013MK452013Sanger745931500726China
BoNeV/YLA-2/2017/CHN/MH718886MH718886Sanger745330444471China
YAKNeV/HY1-2/2018/CHN/MK452012MK452012Sanger746031500726China
BoNeV/11/2021/CHN/OP272996OP272996Illumina748538108282China
BoNeV/11/2021/CHN/OP272997OP272997Illumina745338108282China
BoNeV/11/2021/CHN/OP272998OP272998Illumina746238108282China
BoNeV/11/2021/CHN/OP272999OP272999Illumina746638108282China
BoNeV/11/2021/CHN/OP273000OP273000Illumina746338108282China
Table 2. A summary of recombination events observed from multiple sequence alignment of complete genome sequences of NeVs. The NeVs for which at least two of the selected methods (out of seven) detected the occurrence (symbolized with ‘+’) of recombination are shown here. The non-occurrence/detection is symbolized with ‘-’. The asterisk (*) in some of the major and minor parents indicates the sequence used to infer the unknown major or minor parent. The abbreviations for the labels in the first row are provided in the footnote to the table.
Table 2. A summary of recombination events observed from multiple sequence alignment of complete genome sequences of NeVs. The NeVs for which at least two of the selected methods (out of seven) detected the occurrence (symbolized with ‘+’) of recombination are shown here. The non-occurrence/detection is symbolized with ‘-’. The asterisk (*) in some of the major and minor parents indicates the sequence used to infer the unknown major or minor parent. The abbreviations for the labels in the first row are provided in the footnote to the table.
REsRecombMajor ParentMinor ParentRDPGCnBScMChChmSiS3SQ
1MG599036JX018212NC_007916 *+++++++
2NC_007916MH718886 *NC_004064+++++++
3NC_004064NC_007916MH718886-----++
4JX018212MH718886NC_004064-----++
Abbreviations—REs: recombination events; Recom: recombinant; GCn: GeneConv; BSc: BootScan; MCh: MaxChi; Chm: chimera; SiS: SiScan; 3SQ: 3Seq.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kaushik, R.; Kumar, N.; Yadav, P.; Sircar, S.; Shete-Aich, A.; Singh, A.; Tomar, S.; Launey, T.; Malik, Y.S. Comprehensive Genomics Investigation of Neboviruses Reveals Distinct Codon Usage Patterns and Host Specificity. Microorganisms 2024, 12, 696. https://doi.org/10.3390/microorganisms12040696

AMA Style

Kaushik R, Kumar N, Yadav P, Sircar S, Shete-Aich A, Singh A, Tomar S, Launey T, Malik YS. Comprehensive Genomics Investigation of Neboviruses Reveals Distinct Codon Usage Patterns and Host Specificity. Microorganisms. 2024; 12(4):696. https://doi.org/10.3390/microorganisms12040696

Chicago/Turabian Style

Kaushik, Rahul, Naveen Kumar, Pragya Yadav, Shubhankar Sircar, Anita Shete-Aich, Ankur Singh, Shailly Tomar, Thomas Launey, and Yashpal Singh Malik. 2024. "Comprehensive Genomics Investigation of Neboviruses Reveals Distinct Codon Usage Patterns and Host Specificity" Microorganisms 12, no. 4: 696. https://doi.org/10.3390/microorganisms12040696

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop