Next Article in Journal
14-year-old Schoolchildren can Consent to Get Vaccinated in Tyrol, Austria: What do They know about Diseases and Vaccinations?
Previous Article in Journal
Multiple Genome Wide Association Mapping Models Identify Quantitative Trait Nucleotides for Brown Planthopper (Nilaparvata lugens) Resistance in MAGIC Indica Population of Rice
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Whole Genome In-Silico Analysis of South African G1P[8] Rotavirus Strains before and after Vaccine Introduction over a Period of 14 Years

1
Next Generation Sequencing Unit and Division of Virology, Faculty of Health Sciences, University of the Free State, Bloemfontein 9300, South Africa
2
Diarrheal Pathogens Research Unit, Sefako Makgatho Health Sciences University, Medunsa 0204, South Africa
3
South African Medical Research Council, Pretoria 0001, South Africa
4
College of Medicine, Department of Biomedical Sciences, Faculty of Biomedical Sciences and Health Professions, University of Malawi, Private Bag 360, Chichiri, Blantyre 3, Malawi
5
Enteric and Diarrheal Diseases, Global Health, Bill & Melinda Gates Foundation, P.O. Box 23350, Seattle, WA 98109, USA
6
Faculty of Health Sciences, University of Buea, P.O. Box 63, Buea, Cameroon
7
Noguchi Memorial Institute for Medical Research, University of Ghana, P.O. Box LG581, Legon, Ghana
8
Center for Global Vaccine Research, Institute of Infection, Liverpool L697BE, UK
9
Veterinary and Ecological Sciences, University of Liverpool, Liverpool L697BE, UK
10
Malawi-Liverpool-Wellcome Trust Clinical Research Program, Department of Medical Laboratory Sciences, College of Medicine, University of Malawi, Blantyre 312225, Malawi
*
Author to whom correspondence should be addressed.
Vaccines 2020, 8(4), 609; https://doi.org/10.3390/vaccines8040609
Submission received: 7 September 2020 / Revised: 8 October 2020 / Accepted: 13 October 2020 / Published: 14 October 2020
(This article belongs to the Section Vaccines against Tropical and other Infectious Diseases)

Abstract

:
Rotavirus G1P[8] strains account for more than half of the group A rotavirus (RVA) infections in children under five years of age, globally. A total of 103 stool samples previously characterized as G1P[8] and collected seven years before and seven years after introducing the Rotarix® vaccine in South Africa were processed for whole-genome sequencing. All the strains analyzed had a Wa-like constellation (G1-P[8]-I1-R1-C1-M1-A1-N1-T1-E1-H1). South African pre- and post-vaccine G1 strains were clustered in G1 lineage-I and II while the majority (84.2%) of the P[8] strains were grouped in P[8] lineage-III. Several amino acid sites across ten gene segments with the exception of VP7 were under positive selective pressure. Except for the N147D substitution in the antigenic site of eight post-vaccine G1 strains when compared to both Rotarix® and pre-vaccine strains, most of the amino acid substitutions in the antigenic regions of post-vaccine G1P[8] strains were already present during the pre-vaccine period. Therefore, Rotarix® did not appear to have an impact on the amino acid differences in the antigenic regions of South African post-vaccine G1P[8] strains. However, continued whole-genome surveillance of RVA strains to decipher genetic changes in the post-vaccine period remains imperative.

1. Introduction

Group A rotavirus (RVA) is the major causative agent of acute gastroenteritis (AGE) in children under five years [1]. RVA-induced diarrhea is responsible for approximately 125,000 childhood mortality cases worldwide [2]. RVA is classified in the Reoviridae family and comprises 11 dsRNA gene segments that encode six structural proteins (VP1-VP4, VP6, and VP7) and five/six non-structural proteins (NSP1-NSP5/6) [1]. A binary classification scheme based on the properties of the outer capsid proteins, VP7 and VP4, has been universally used to classify RVA strains [3]. However, to fully describe rotavirus strains, the binary classification system was expanded by incorporating the other nine genome segments [4].
In the whole-genome classification scheme, nucleotide percentage similarity cut-off values of all the eleven viral gene segments, as recommended by the Rotavirus Classification Working Group (RCWG), are used to determine a genotypic scheme Gx-P[x]-Ix-Rx-Cx-Mx-Ax-Nx-Tx-Ex-Hx designating VP7-VP4-VP6-VP1-VP2-VP3-NSP1-NSP2-NSP3-NSP4-NSP5/6, respectively [5]. The majority of RVA strains are assigned into three genogroups: Wa-like (I1-R1-C1-M1-A1-N1-T1-E1-H1), DS-1-like (I2-R2-C2-M2-A2-N2-T2-E2-H2), and a relatively minor group, AU-1-like (I3-R3-C3-M3-A3-N3-T3-E3-H3) [4]. G1P[8], G3P[8], G4P[8], G9P[8], and G12P[8] typically have the Wa-like genotype constellation, while G2P[4], G8P[4] and G8P[6] usually have the DS-1-like genotype constellation [6].
G1P[8] is among the most predominant and medically important RVA strain, globally [7]. In Africa, G1P[8] accounts for approximately 29% of all the circulating RVA strains [7]. The G1 and P[8] gene segments sub-cluster into lineages whose emergence is attributed to various mechanisms of genetic diversity common in RNA viruses such as genetic mutation, recombination, and reassortment [8]. Distinct lineages for both genotypes G1 and P[8] collected from different geographical regions have been described in the literature [9,10,11,12,13].
In order to alleviate RVA disease burden, four vaccines: RotaTeq® (Merck & Co., West Point, PA, USA); Rotarix® (GlaxoSmithKline, Rixenstart, Belgium); ROTAVAC® (Bharat Biotech, Hyderabad, India); and Rotasiil® (Serum Institute of India, Pune, India) have been pre-qualified by the World Health Organization (WHO) for global use [14]. Rotarix® contains an attenuated human G1P[8] RVA strain [15], while RotaTeq® is composed of five human-bovine reassortant strains (G1P[5], G2P[5], G3P[5], G4P[5], and G6P[8]) [16]. ROTAVAC® contains a G9P[11] strain [17], while Rotasiil® is a pentavalent human-bovine reassortant vaccine comprising five reassortant strains containing human VP7, representing the G1, G2, G3, G4 and G9 genotypes [18].
South Africa was the first African country to introduce the monovalent RVA vaccine, Rotarix®, into its Expanded Program on Immunization (EPI) in September 2009 [19]. In the first year, after the vaccine was introduced, RVA infections indicated by laboratory confirmed results and hospitalizations were reduced significantly by approximately 58% [20]. After the introduction of Rotarix® in South Africa, the prevalence of non-G1P[8] strains (such as G2P[4], G2P[6], G9P[8], G12P[8] and G8P[4]) that were not incorporated in the monovalent G1P[8] vaccine increased significantly [21]. Notably, at Dr. George Mukhari Hospital, a key RVA surveillance site, no G1P[8] strains were reported in 2012 [21], whereas this was a predominant strain during the pre-vaccine period [22]. The enormous genetic and antigenic diversity within RVA [8,23] and the recent emergence of novel strains [24,25,26] emphasize the need to monitor the impact of RVA vaccines on the genetic and antigenic landscape of RVA circulating in the population.
The impact of RVA vaccination in Sub-Saharan Africa has been substantial [27] and it is essential to continuously assess the long-term impact of vaccination on circulating RVA strains. Thus, there is a need for whole-genome longitudinal surveillance studies in South Africa to decipher potential RVA vaccine-induced strain changes. In this study, we investigated the impact of RVA vaccine introduction in South Africa on the most common human RVA strain, G1P[8]. This study is the first large-scale genomic analysis of human RVA collected seven years before and seven years after the introduction of the RVA vaccine in South Africa.

2. Materials and Methods

2.1. Ethics Approval

The diarrheal stool samples were collected as a routine diagnostic clinical specimen when the parents brought their child to a health facility for clinical management, requiring no written informed consent. As part of the WHO-coordinated RVA surveillance network, the archived RVA-positive specimens were anonymized and utilized for strain characterization under a Technical Service Agreement and a Materials Transfer Agreement (MTA) to the WHO/AFRO Regional Reference Laboratory (WHO-RRL) based at Sefako Makgatho Health Sciences University (SMU), Pretoria, South Africa. The WHO Research Ethics Review Committee granted an exemption activity, noting that the study procedures were part of routine hospital-based RVA surveillance. The samples were transferred for whole-genome sequencing at the University of the Free State-Next Generation Sequencing (UFS-NGS) Unit through a MTA (SMU-UFS.1). The Health Sciences Research Ethics Committee (HSREC) of the UFS, Bloemfontein, South Africa, approved the study under ethics number UFS-HSD2018/0510/3107.

2.2. Strain Description

Rotavirus positive stool samples (n = 103) previously characterized as G1P[8] were sourced from the archival storage of the Diarrheal Pathogens Research Unit (DPRU), WHO-RRL in Pretoria, South Africa. The samples were distributed as follows: 2002 (n = 14), 2003 (n = 7), 2004 (n = 14), 2005 (n = 6), 2006 (n = 22), 2007 (n = 7), 2008 (n = 9), 2009 (n = 13), 2010 (n = 1), 2013 (n = 1), 2014 (n = 5), 2015 (n = 3), and 2017 (n = 1). In addition to the 103 samples, an additional 68 whole-genome sequences for G1P[8] strains collected from South Africa were extracted from the GenBank database [28].

2.3. Extraction and Purification of Double-Stranded RNA

A fecal suspension was prepared by adding approximately 100 mg stool sample into 200 µL Phosphate Buffered Saline (PBS) solution, 0.01 M, pH 7.2 (Sigma-Aldrich®, St Louis, MO, USA) with subsequent extraction of viral RNA as previously described [29] albeit with some modifications. Briefly, the modifications included the volume (900 µL TRIzol™-LS: 300 µL stool sample suspension), the incubation period of dsRNA enrichment (24 h), centrifugation speeds (20,000× g), and the staining reagent (PronaSafe, Condalab, Camberley, UK), as captured in the UFS-NGS unit extraction Standard Operating Procedure (SOP). The extracted RNA was purified using the MinElute PCR Purification Kit by following the manufacturers’ instructions (Qiagen, Hilden, Germany).

2.4. Complementary DNA (Cdna) Synthesis

cDNA was synthesized from the extracted viral RNA using the Maxima H Minus Double-Stranded Synthesis Kit and protocol (Thermo Fischer Scientific, Waltham, MA, USA) with some modifications (UFS-NGS Unit SOP). Briefly, for first-strand cDNA synthesis, a denaturing step of the purified extracted RNA at 95 °C for 5 min in Multigene Optimax thermocycler (Labnet, Edison, NJ, USA) was followed by the addition of 1 µL of Random Hexamer primer, 100 µM. The reaction mixture was incubated in a thermocycler at 65 °C for 5 min. Afterward, 5 µL volume of the 4× First-Strand Reaction Mix and 1 µL of First Strand Enzyme Mix was added and the reaction mixture was incubated in a thermocycler pre-programmed as follows: 10 min at 25 °C, 120 min at 50 °C, and 5 min at 85 °C. For the second-strand cDNA synthesis step, a 55 µL volume of Nuclease-Free water, 20 µL of 5× Second Strand Reaction Mix, and 5 µL of Second Strand Enzyme Mix was added and incubated in the thermal cycler at 16 °C for 60 min. A 6 µL volume of EDTA, pH 8.9 was added followed by 10 µL of RNase I. The synthesized cDNA was purified using MSB® Spin PCRapace Kit by following the manufacturer’s protocol (Stratec Molecular, Berlin, Germany).

2.5. DNA Library Preparation and Whole-Genome Sequencing

The DNA libraries were prepared by utilizing the Nextera® XT DNA Library Preparation Kit (Illumina, San Diego, CA, USA). Quantitative and qualitative assessment of DNA was performed using Qubit 3.0 fluorometer (Invitrogen, Carlsbad, CA, USA) and Agilent 2100 BioAnalyzer® (Agilent Technologies, Waldbronn, Germany), respectively, by following the manufacturer’s instructions. The DNA library and the PhiX Control v3 library (Illumina, San Diego, CA, USA) were normalized to 8 pM and 20 pM concentrations, respectively. A volume of 600 µL pooled denatured DNA library spiked with 20% PhiX Control v3 library was loaded into a MiSeq® Reagent Kit V3 for paired-end nucleotide sequencing (301 × 2) on a MiSeq® sequencer (Illumina, San Diego, CA, USA) at the University of the Free State-Next Generation Sequencing (UFS-NGS) Unit, Bloemfontein, South Africa.

2.6. Genome Assembly

The trimming of Illumina read ends and subsequent genome assembly was performed using a suite of tools embedded in Geneious Prime® software, version 2020.1.1 [30]. Complementary RVA assembly was also performed using an in-house developed genome assembly pipeline and CLC Genomics Workbench 12 (https://www.qiagenbioinformatics.com/).

2.7. Generation of Whole-Genome Constellations

The genotype of each gene segment was determined using the RVA determination tool in the Virus Pathogen Database and Analysis Resource (ViPR) [31] to generate the full genome constellations for each RVA strain.

2.8. Phylogenetic Analyses

Alignments and comparative analysis of the full-length sequences for each gene segment was performed as described previously [24]. Multiple sequence alignments were performed using the MAFFT package in Geneious Prime 2020 [30]. Duplicated sequences in the alignments were identified utilizing ElimDupes (https://www.hiv.lanl.gov/content/sequence/elimdupesv2/elimdupes.html). The best evolutionary models for each gene segment were estimated using the DNA Model Test program in MEGA 6 to guide in the construction of maximum-likelihood trees with 1000 bootstrap replicates [32].

2.9. Selection Pressure and Recombination Analysis

Analysis of natural selection in RVA genome segments was done using the suite from the DataMonkey Webserver [33]: Fixed-effects Likelihood (FEL) [34], Fast Unconstrained Bayesian Approximation for Inferring Selection (FUBAR) [35] and mixed-effects model of episodic selection (MEME) [36]. Amino acid sites undergoing positive selection were identified and tabulated. Analysis of genetic recombination was performed using Genetic Algorithm for Recombination Detection (GARD) [34].

2.10. Protein Modeling

The RVA protein structures were modeled using SWISS-MODEL with an initial template search [37]. The templates were selected from the SWISS-MODEL Template Library (SMTL) and their respective resolution values for the analyzed genes were as follows: VP7 (3 fmg.1, 3.40 Å) and VP4 (2 dwr.1, 2.50 Å). The evaluation of stereochemical quality parameters of the generated structures was performed using the Structure Assessment feature in the SWISS-MODEL server [37] and VERIFY3D [38]. Image visualization and analysis was performed in PyMol software [39].

2.11. In Silico Analysis of Effect of Mutation(s) on Protein Stability

The FoldX plugin [40] integrated in the YASARA platform [41] was used to predict the stability effect of mutation(s) in a 3D structure. FoldX estimates the stability effect of a mutation empirically whereby the stability (ΔG) of a protein is defined by the free energy, which is expressed in kcal/mol. In this study, G is the difference of free energy between the Rotarix® (vaccine) strain or pre-vaccine strain and mutant strain. The energy with positive value is regarded to destabilize the structure, while a mutation with a negative value is regarded to stabilize the structure. Free energy change of ±0.5 kcal/moL is regarded as statistically significant for the stabilizing/destabilizing effect [40].

3. Results

3.1. Whole-Genome Constellation Determination

The genotype constellation for the 103 South African G1P[8] strains (92 pre- and 11 post-vaccine) sequenced in this study and corresponding 68 strains (56 pre- and 12 post-vaccine) acquired from the GenBank database as reference strains was typical G1-P[8]-I1-R1-C1-M1-A1-N1-T1-E1-H1 (Supplementary Materials 1 (S1)). The sizes of the full-length genome segments one to eleven and their respective open reading frames (ORFs) were determined (S1). All the gene sequences in this study were submitted in the NCBI GenBank database under accession numbers MT854335-MT855467.

3.2. Phylogenetic Analyses

3.2.1. Phylogenetic Analyses of VP7 and VP4

Phylogenetic trees for each of the 11 gene segments were constructed. For VP7 and VP4, well-known lineage designations were utilized [9,11]. The VP7 phylogenetic tree comprised the VP7 gene sequences of the South African strains together with VP7 gene sequences of the reference strains from the seven established VP7 G1 genotype lineages [9]. The 171 G1 South African sequences utilized in this study were segregated into two main lineages: G1-lineage I and lineage II (Figure 1). G1-lineage I comprised 76 pre-vaccine G1 strains and 12 post-vaccine G1 strains, while G1-lineage II comprised 72 pre-vaccine G1 strains and 11 post-vaccine G1strains (Figure 1). Strain RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU2250/2013/G1P[8] had ≥99.9% nucleotide identity with Rotarix® in all its structural and nonstructural genes (S2) and clustered alongside cognate Rotarix® genes in all the 11 phylogenetic trees. South African pre- and post-vaccine G1 strains were highly identical to each other and similarly distant from the Rotarix® strain (Table 1).
The VP4 phylogenetic tree comprised nucleotide sequences of the South African RVA strains and those of the reference strains from the four established VP4 P[8] genotype lineages (I to IV) [11] (Figure 2). The P[8] strains for South African rotaviruses segregated into three evolutionary lineages, P[8]-lineage I, III, and IV (Figure 2). Lineage III comprised a mixture of 144 pre-vaccine and 21 post-vaccine P[8] strains, while lineage IV comprised five pre-vaccine strains (Figure 2). The P[8] strains identified pre- and post-vaccination were highly identical to each other and similarly distant from the Rotarix® strain (Table 1).

3.2.2. Phylogenetic Analysis of VP1–VP3, VP6, and NSP1–NSP5

South African VP1–VP3, VP6, and NSP1–NSP5 were clustered into two main lineages, one of the lineages comprising of the Rotarix® genome segments (Figures S1–S9 in S4). South African strains identified pre- and post-vaccination were highly identical to each other as well as similarly distant from the vaccine strain (Table 1).

3.3. Protein Modeling and Amino Acid Analysis

3.3.1. Comparative Analysis of Neutralizing Antigenic VP7 Epitopes of South African G1 Strains and Rotarix® Strains

The VP7 protein contains two defined neutralization antigenic epitopes spanning across 29 amino acid residues: 7–1 (7–1a and 7–1b) and 7–2 regions [42]. A comparison of the amino acid residues in the antigenic epitopes of wild type South African G1 strains with cognate sites in the Rotarix® strain was performed (Figure S1 in S5). Five amino acid differences appeared in both pre- and post-vaccine strains, four amino acids occurred before vaccine introduction while two amino acid differences, N147D (in eight strains) and T242A (in one strain), appeared only during the post-vaccine period (Figure S1 in S5; Table S1 in S6). We performed protein modeling analysis on the post-vaccine strains, which showed amino acid differences relative to both the pre-vaccine strains and Rotarix® strain. Therefore, for VP7, we selected strain RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU4357/2015/G1P[8] as a representative strain of the eight G1 post-vaccine strains that had N147D amino acid substitution (Figure S1 in S5). Additionally, we selected strain RVA/Human-wt/ZAF/MRC-DPRU1544/2010/G1P[8], the only post-vaccine strain exhibiting a T242A amino acid difference (Figure S1 in S5). The VP7 protein structures of the selected G1 strains were superposed with the Rotarix® VP7 structure to assess differences in the structural conformation. The VP7 structure of strain RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU4357/2015/G1P[8] when superposed with the Rotarix® VP7 had a root mean square deviation (RMSD) value of 0.020 Å (Table 2; Figure 3A). The RMSD value closer to zero suggests high structural homology [43]. Replacement of asparagine with aspartate (N147D) significantly destabilized the protein structure as the free energy change was +0.527 kcal/mol [40] (Table 2; Figure 3A). Apart from the VP7 protein structure of Rotarix®, we performed protein modeling analysis using the protein structures of five randomly selected pre-vaccine study strains (Figure S1 in S7). Similar significant free energy change trends for the destabilizing effect were observed and ranged from +0.506 to +0.579 kcal/mol (Figure S1 in S7). The VP7 structure of strain RVA/Human-wt/ZAF/MRC-DPRU1544/2010/G1P[8] had an RMSD value of 0.012 Å when superposed with Rotarix® VP7 (Table 2; Figure 3B). Threonine, which is a neutral hydrophilic amino acid, was replaced with alanine, an aliphatic hydrophobic amino acid, showing a change in polarity. However, the replacing amino acid (alanine) does not significantly alter the stability of the protein at this epitope (−0.076 kcal/mol) [40]. Protein modeling analysis with VP7 protein structures from five randomly selected pre-vaccine study strains showed similar free energy change trends ranging from −0.070 to −0.078 kcal/mol (Figure S2 in S7).

3.3.2. Comparative Analysis of VP7 Cytotoxic T Lymphocyte Epitopes of South Africa With Rotarix® Vaccine Strain

The two established VP7 cytotoxic T-cell lymphocyte epitopes at positions 16–28 and 40–52 [44] of the VP7 of South African G1P[8] strains were compared with cognate regions of Rotarix® VP7. Five amino acid differences appeared before Rotarix® introduction, while two amino acid substitutions (A43V in two strains and A46T in four strains) were found after vaccine introduction (Table S1 in S6). Comparative analysis of the South African post-vaccine G1 genotypes outside the known epitope regions with cognate regions in G1 of Rotarix® identified three amino acid differences (S37I, A68D, and E222K) (S8). The replacing amino acid at positions S37I, A68D, and E222K resulted in a change in the polarity of the amino acid residue (Table S2 in S6).

3.3.3. Comparative Analysis of Neutralizing Antigenic Epitopes in VP4 Genes of South African P[8] Strains and Rotarix® Vaccine Strain

The VP4 protein comprises the VP5* and VP8* regions. The VP8* region contains four (8–1 to 8–4) neutralizing antigenic epitopes, while VP5* has five (5–1 to 5–5) [45]. The differences between the P[8] study strains and Rotarix® P[8] component were mostly contained in the VP8* epitopes 8–1, 8–3, and 8–4 (Figure S2 in S5). Seven amino acid differences were identified in both pre- and post-vaccine strains, seven amino acid substitutions during pre-vaccination introduction and two amino acid substitutions, T88I (in one strain) and N89S (in one strain) were identified after vaccine introduction (Figure S2 in S5; Table S3 in S6). We performed protein modeling analysis on the post-vaccine strains that showed amino acid differences relative to both the pre-vaccine strains and Rotarix® strain and these were RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU74/2014/G1P[8] and RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU83/2011/G1P[8] (Figure S2 in S5). The VP4 structure of strain RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU74/2014/G1P[8] aligned with the RMSD value of RMSD 0.048 when superposed with the Rotarix® VP4 (Table 3; Figure 4A). Replacement with the isoleucine at position T88I altered the polarity of the residue from polar to nonpolar without affecting the charge [46]. This change in polarity may alter the physical properties of the protein. The mutation did not significantly impact the stability of the protein (−0.297 kcal/mol) (Table 3; Figure 4A). Further free energy change analysis with VP4 structures from five randomly selected pre-vaccine strains demonstrated similar free energy change trends that ranged from −0.504 to −0.322 kcal/mol (Figure S3 in S7).The VP4 structure of strain RVA/Human-wt/ZAF/MRC-DPRU83/2011/G1P[8] had an RMSD value of 0.049 when superposed with Rotarix® VP4, indicating significant alignment. Asparagine is a polar, neutrally charged amino acid, and the replacing amino acid, serine, is also polar with a neutral charge [46]. The replacing amino acid, serine, significantly destabilized the protein structure with +1.166 kcal/mol energy change (Table 3: Figure 4B).When protein modeling analysis was performed with VP4 structures of five randomly selected pre-vaccine P[8] strains, significant free energy change values ranging from +1.128 to +1.352 kcal/mol were also observed, indicating the amino acid substitution had a destabilizing effect on the protein structure (Figure S4 in S7).

3.3.4. Analysis of the VP4 and VP6 Non-Neutralizing Regions

Comparative analysis of the South African post-vaccine P[8] strain outside the known epitope regions with cognate regions in pre-vaccine strains identified five amino acid differences that were conservative in nature (Table S4 in S6). Analysis of the amino acid differences between the VP6 gene sequences of South African pre- and post-vaccine strains identified an amino acid difference Y353H in a 2017 post-vaccine strain, RVA/Human-wt/ZAF/MRC-DPRU15948/2017/G1P[8] (S8). The amino acid mutation Y353H occurred in a conserved region of the VP6 that has been associated with trimerization and single-shelled particle formation [47] (Table S5 in S6).

3.3.5. Analysis of Amino Acid Differences in VP1–VP3 and NSP1–NSP5 Amino Acid Sequences

The number of amino acid differences that were identified after Rotarix® vaccine introduction in South African G1P[8] VP1–VP3 and NSP1–NSP5 amino acid sequences were 12, 6, 7, 11, 3, 4, 5, and 2, respectively (Tables S6–S13 in S6). Most of the amino acid substitutions were conservative (46).

3.3.6. Analyses of Selection Pressure and Recombination

Selection pressure analysis by Mixed-Effects Model of Evolution (MEME), Fast, Unconstrained Bayesian AppRoximation for inferring selection (FUBAR), and Fixed-Effects Likelihood (FEL) showed that most of the codon sites in the 11 gene segments were undergoing purifying selection. With the exception of the VP7 gene segment, several amino acid sites in the rest of the gene segments were identified to be under positive selective pressure (Table 4) Amino acid sites 7 and 245 were identified to be under positive selective pressure by all three analysis tools used (Table 4). GARD found no significant evidence of recombination in all gene segments.

4. Discussion

The present study reports whole-genome analysis of G1P[8] RVA strains collected seven years before and seven years after the introduction of the Rotarix® vaccine in South Africa. Whole-genome analysis of South African G1P[8] strains showed that they possessed a Wa-like genotype constellation. These findings are comparable to what has been observed in several countries [11,48,49,50,51]. The prevalent association of G1P[8] genotypes with the Wa-like genetic background has been hypothesized to be due to epidemiological fitness [52].
Of the seven previously described G1 lineages [9], South African pre- and post-vaccine G1 strains in this study were segregated into G1-lineage I and G1-lineage II. Similar observations have been reported in several other studies [11,51,53], highlighting the likelihood of the epidemiological fitness of G1 strains in these two lineages, which has led to their prevalent circulation over time. Of the four P[8] lineages that have been established previously [11], the majority (84.2%) of the South African P[8] strains in this study clustered in P[8] lineage III, an observation comparable to other studies [51,53,54,55]. Additionally, the South African P[8] strains were observed in P[8] lineage I and P[8] lineage IV (also designated as the OP354-like lineage) [56], an observation reported in a study of Indian P[8] strains [57]. Both pre-and post-vaccine strains were found present in this P[8] lineage III, underscoring its global predominance in circulation [9,56,57,58,59,60].
Amino acid differences in the neutralization epitopes of VP7 compared to the cognate region in Rotarix® identified at positions N94S, S123N, K291R, and M217T appeared in both pre- and post-vaccine strains and have been previously reported in Belgium and Brazil [51,61]. Since these amino acid substitutions were present even before the introduction of the vaccine, they can be attributed to the natural RVA evolutionary processes. Amino acid differences in N147D occurred in eight post-vaccine G1 strains resulted in a change from neutral to negatively charged amino acid residue, while T24A occurred in one post-vaccine G1 strain, which resulted in a change from hydrophilic to hydrophobic [46]. The change in charge may affect the protein’s chemical properties, while the shift in polarity suggests possible inaccessibility of the epitope as it becomes more hydrophobic [46]. Differences in VP4 amino acids between the study strains and Rotarix® were observed in epitopes 8–1 and 8–3, similar to those already reported in Belgium, Brazil, and Tunisia [51,61,62]. The polarity changes (hydrophilic to hydrophobic) observed at position T88I only in a post-vaccine strain may affect antibody binding at this region of the 8–4 epitope as the resulting epitope becomes relatively inaccessible due to the hydrophobic effect [46]. The loss of glycosylation site at position N89S only observed in the post-vaccine period in one strain may alter the function of the protein at the 8–4 epitope [63].
To gain further insight into the impact of amino acid substitution occurring in the neutralization epitopes of VP7 and VP4 on the stability of the protein structures, we performed folding free energy change analysis. A protein’s folding free energy change is an essential aspect of the protein’s stability with a direct association with the protein’s function [64]. The N147D substitution in VP7 and N89S substitution in VP8* resulted in a predicted destabilizing effect on the protein structure. Due to this destabilization effect on the protein structure, as indicated by free energy change analysis, we hypothesize that these amino acid substitutions observed during the post-vaccine era may not be favored in the long-term to enhance viral fitness. We did not observe consistently occurring amino acid substitutions in the 11 gene segments in both the antigenic and non-antigenic regions throughout the post-vaccine period when contrasted to the pre-vaccine period. However, there were yearly observations such as L167M (in VP1), R44K (in VP2), I234V (in VP3), V112I (in VP4), T120I (in NSP1) and V73I (in NSP4), which occurred in all eight 2010 strains. Amino acid substitutions: K164R (in VP2), A411V (in VP3), A46T (in VP7), I341V (in NSP1), V254P (in NSP2), and D157E (in NSP5) of the four out of five 2011 strains and K96R (in VP1), V459I (in VP3), V600L (in VP4), and A259T (in NSP1) out of all the eight 2014 and 2015 G1P[8] strains.
Selection pressure analysis demonstrated that most of the codon sites in the 11 genome segments of South African G1P[8] strains were undergoing purifying selection, probably to purge deleterious polymorphisms that arise due to the inherent error-prone nature of the RNA polymerase enzyme [1]. Amino acid sites 7 and 245 in VP3 were identified to be under positive selection by all three selection pressure analysis methods and site 245 fell within the guanine-N7-methyltransferase (N7-MTpase) domain suggested to catalyze methylation during capping of nascent rotavirus transcripts [64]. Methylation capping by VP3 of RVA serves as a critical strategy to evade host antiviral innate immune response [65].

5. Conclusions

The study shows that there were no consistently conserved amino acid substitutions occurring throughout during post-RVA vaccine period in the antigenic and non-antigenic regions of South African G1P[8] strains as most of the amino acid substitutions found in the post-vaccine period were already present during the pre-vaccine period. Therefore, Rotarix® did not appear to have an impact on the genetic changes of South African G1P[8] post-vaccine strains. However, continued long-term whole-genome surveillance to monitor any consistently occurring genetic changes during the post-vaccine period, which may hint at vaccine selective pressure, remains essential. This study was limited mainly by the significant differences in the sample size between the pre- and post-vaccine period, although the significant decline in G1P[8] strains in South Africa during post-vaccination period has been previously reported [21].

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-393X/8/4/609/s1, Supplementary data 1(S1): Whole genome constellation analysis containing Table S1: Whole genotype constellations of 103 sequenced South African G1P[8] rotavirus strains and 68 South African G1P[8] strains acquired from GenBank. Supplementary data 2 (S2): Identity matrices analysis for VP1, VP2, VP3, VP4, VP6, NSP1, NSP2, NSP3, NSP4, and NSP5 nucleotide and deduced amino acid identities among strains calculated by distance matrices using p-distance algorithm in MEGA 6. Supplementary data 3 (S3): Duplicate sequences in the alignments identified using ElimDupes. Supplementary data 4 (S4): Additional phylograms containing Figure S1: Maximum likelihood phylogenetic tree based on the full length sequences of the genome segment 1 encoding VP1 protein, Figure S2: Maximum likelihood phylogenetic tree based on the full length sequences of the genome segment 2 encoding VP2 protein, Figure S3: Maximum likelihood phylogenetic tree based on the full length sequences of the genome segment 3 encoding VP3 protein, Figure S4: Maximum likelihood phylogenetic tree based on the full length sequences of the genome segment 6 encoding VP6 protein, Figure S5: Maximum likelihood phylogenetic tree based on the full length sequences of the genome segment 6 encoding NSP1 protein, Figure S6: Maximum likelihood phylogenetic tree based on the full length sequences of the genome segment 8 encoding NSP2 protein, Figure S7: Maximum likelihood phylogenetic tree based on the full length sequences of the genome segment 7 encoding NSP3 protein, Figure S8: Maximum likelihood phylogenetic tree based on the full length sequences of the genome segment 10 encoding NSP4 protein, Figure S9: Maximum likelihood phylogenetic tree based on the full length sequences of the genome segment 11 encoding NSP5 protein. Supplementary data 5 (S5): Neutralization epitope analysis in VP4 and VP7 containing Figure S1: Alignment of VP7 neutralizing and cytotoxic T-cell lymphocytic epitopes residues of Rotarix® strain against wild type South African G1P[8] strains, Figure S2: Alignment of antigenic residues in VP4 between the P[8] component of Rotarix® and wild type P[8] strains. Supplementary data 6 (S6): Additional tables containing Table S1: Amino acid differences in the VP7 neutralization and cytotoxic T-cell lymphocyte (CTL) epitope regions between Rotarix® and South African G1 strains, Table S2: Amino acid differences in South African post-vaccine VP7 Cytotoxic T-cell Lymphocyte epitope and non-epitope regions, Table S3: Amino acid differences in the VP4 neutralization epitope regions between Rotarix® and South African P[4] strains, Table S4: Amino acid differences in South African post-vaccine P[8] RVA strains, Table S5: Amino acid differences in South African post-vaccine I1 RVA strains, Table S6: Amino acid differences in South African post-vaccine R1 RVA strains, Table S7: Amino acid differences in South African post-vaccine C1 RVA strains, Table S8: Amino acid differences in South African post-vaccine M1 RVA strains, Table S9: Amino acid differences in South African post-vaccine A1 RVA strains, Table S10: Amino acid differences in South African post-vaccine N1 RVA strains, Table S11: Amino acid differences in South African post-vaccine T1 RVA strains, Table S12: Amino acid differences in South African post-vaccine E1 RVA strains, Table S13: Amino acid differences in South African post-vaccine H1 RVA strains. Supplementary data 7(S7): Additional protein modeling analysis containing Figure S1: The VP7 structure models showing the N147D amino acid difference, Figure S2: The VP7 structure models showing the T242A amino acid difference, Figure S3: The VP4 structure models showing the T88I amino acid difference, Figure S4: The VP4 structure models showing the N89S amino acid difference. Supplementary data 8 (S8): Whole genome amino acid analysis, Supplementary data 9: Selection pressure analysis.

Author Contributions

M.M.N., K.C.J., F.E.D., and V.N.N. conceptualized the main project. P.N.M., M.T.M., and M.M.N. performed the laboratory experiments. M.M.N., P.N.M., M.J.M., and M.L.S. facilitated the sample resources. I.P. performed laboratory training on rotavirus extraction. Formal analysis was done by P.N.M. and M.M.N. Data curation was performed by P.N.M., M.T.M., and M.M.N. Data analysis was done by P.N.M., M.M.N., B.K., and M.D.E. Writing of the original draft was performed by P.N.M. Review of the drafts was performed by P.N.M., M.T.M., M.L.S., M.J.M., I.P., M.D.E., B.K., A.D.S., C.D.K., V.N.N., F.E.D., K.C.J. and M.M.N. Supervision, funding acquisition, and project administration was performed by M.M.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was principally funded by a grant awarded to M.M.N., K.C.J., F.E.D., and V.N.N. by Bill and Melinda Gates Foundation (BMGF-OPP1180423_2017). Other funding grants awarded to M.M.N. that funded this research include the South African Medical Research Council (SAMRC) through the Self-Initiated Research grant (SIR), Poliomyelitis Research Foundation (PRF-19/16), and National Research Foundation (NRF-120814). K.C.J. was supported by a Wellcome Training Fellowship, number 201945/Z/16/Z.

Acknowledgments

We would like to acknowledge the assistance in retrieving the archival stool samples by Khutso Mothapo, Kebareng Rakau, and Nonkululeko Magagula at the WHO-RRL in Pretoria, South Africa. Assistance in performing laboratory work by Sebotsana Rasebotsa, Lesedi Mosime, Gilmore Pambuka, Emmanuel Ayodeji, and Teboho Mooko. Guidance by Felicity Burt as the Ph.D co-Supervisor of P.N.M. is also acknowledged. The provision of technical ICT support by Stephanus Riekert is duly acknowledged.

Conflicts of Interest

The authors declare no conflict of interest in this work.

References

  1. Estes, M.K.; Greenberg, H.B. Rotaviruses. In Fields Virology, 6th ed.; Knipe, D.M., Howley, P.M., Eds.; Wolters Kluwer Health; Lippincott Williams & Wilkins: Philadelphia, PA, USA, 2013; pp. 1347–1401. [Google Scholar]
  2. Troeger, C.; Khalil, I.A.; Rao, P.C.; Cao, S.; Blacker, B.F.; Ahmed, T.; Armah, G.; Bines, J.E.; Brewer, T.G.; Colombara, D.V.; et al. Rotavirus vaccination and the global burden of rotavirus diarrhea among children younger than 5 years. JAMA Pediatr. 2018, 172, 958–965. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Estes, M.K.; Kapikian, A. Rotaviruses. In Fields Virology; Knipe, D., Griffin, D., Lamb, R., Martin, M., Roizman, B., Straus, S., Eds.; Wolters Kluwer Health; Lippincott, Williams and Wilkins: Philadelphia, PA, USA, 2007; pp. 1917–1975. [Google Scholar]
  4. Matthijnssens, J.; Ciarlet, M.; McDonald, S.M.; Attoui, H.; Bányai, K.; Brister, J.R.; Buesa, J.; Esona, M.D.; Estes, M.K.; Gentsch, J.R.; et al. Uniformity of rotavirus strain nomenclature proposed by the Rotavirus Classification Working Group (RCWG). Arch. Virol. 2011, 156, 1397–1413. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Matthijnssens, J.; Ciarlet, M.; Rahman, M.; Attoui, H.; Bányai, K.; Estes, M.K.; Gentsch, J.R.; Iturriza-Gómara, M.; Kirkwood, C.D.; Martella, V.; et al. Recommendations for the classification of group A rotaviruses using all 11 genomic RNA segments. Arch. Virol. 2008, 153, 1621–1629. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Matthijnssens, J.; Van Ranst, M. Genotype constellation and evolution of group a rotaviruses infecting humans. Curr. Opin. Virol. 2012, 2, 426–433. [Google Scholar] [CrossRef] [PubMed]
  7. Dóró, R.; László, B.; Martella, V.; Leshem, E.; Gentsch, J.; Parashar, U.; Bányai, K. Review of global rotavirus strain prevalence data from six years post vaccine licensure surveillance: Is there evidence of strain selection from vaccine pressure? Infect. Genet. Evol. 2014, 28, 446–461. [Google Scholar] [CrossRef] [PubMed]
  8. Kirkwood, C.D. Genetic and antigenic diversity of human rotaviruses: Potential impact on vaccination programs. J. Infect. Dis. 2010, 202, S43–S48. [Google Scholar] [CrossRef]
  9. Arista, S.; Giammanco, G.M.; De Grazia, S.; Ramirez, S.; Biundo, C.L.; Colomba, C.; Cascio, A.; Martella, V. Heterogeneity and temporal dynamics of evolution of G1 human rotaviruses in a settled population. J. Virol. 2006, 80, 10724–10733. [Google Scholar] [CrossRef] [Green Version]
  10. Bányai, K.; Gentsch, J.R.; Martella, V.; Bogdán, Á.; Havasi, V.; Kisfali, P.; Szabó, A.; Mihály, I.; Molnár, P.; Melegh, B.; et al. Trends in the epidemiology of human G1P [8] rotaviruses: A Hungarian study. J. Infect. Dis. 2009, 200 (Suppl. 1), S222–S227. [Google Scholar] [CrossRef]
  11. Le, V.P.; Chung, Y.C.; Kim, K.; Chung, S.I.; Lim, I.; Kim, W. Genetic variation of prevalent G1P [8] human rotaviruses in South Korea. J. Med. Virol. 2010, 82, 886–896. [Google Scholar] [CrossRef]
  12. Bucardo, F.; Rippinger, C.M.; Svensson, L.; Patton, J.T. Vaccine-derived NSP2 segment in rotaviruses from vaccinated children with gastroenteritis in Nicaragua. Infect. Genet. Evol. 2012, 12, 1282–1294. [Google Scholar] [CrossRef] [Green Version]
  13. Cho, M.K.; Jheong, W.H.; Lee, S.G.; Park, C.J.; Jung, K.H.; Paik, S.Y. Full genomic analysis of a human rotavirus G1P [8] strain isolated in South Korea. J. Med. Virol. 2013, 85, 157–170. [Google Scholar] [CrossRef] [PubMed]
  14. World Health Organization. WHO Prequalifies New Rotavirus Vaccine. Available online: https://www.who.int/medicines/news/2018/prequalified_new-rotavirus_vaccine/en/ (accessed on 21 January 2020).
  15. Bernstein, D.I.; Sack, D.A.; Rothstein, E.; Reisinger, K.; Smith, V.E.; O’Sullivan, D.; Spriggs, D.R.; Ward, R.L. Efficacy of live, attenuated, human rotavirus vaccine 89–12 in infants: A randomised placebo-controlled trial. Lancet 1999, 354, 287–290. [Google Scholar] [CrossRef]
  16. Ciarlet, M.; Schödel, F. Development of a rotavirus vaccine: Clinical safety, immunogenicity, and efficacy of the pentavalent rotavirus vaccine, RotaTeq®. Vaccine 2009, 27, G72–G81. [Google Scholar] [CrossRef] [PubMed]
  17. Bhandari, N.; Rongsen-Chandola, T.; Bavdekar, A.; John, J.; Antony, K.; Taneja, S.; Goyal, N.; Kawade, A.; Kang, G.; Rathore, S.S.; et al. Efficacy of a monovalent human-bovine (116E) rotavirus vaccine in Indian infants: A randomised, double-blind, placebo-controlled trial. Lancet 2014, 383, 2136–2143. [Google Scholar] [CrossRef] [Green Version]
  18. Zade, J.K.; Kulkarni, P.S.; Desai, S.A.; Sabale, R.N.; Naik, S.P.; Dhere, R.M. Bovine rotavirus pentavalent vaccine development in India. Vaccine 2014, 32, A124–A128. [Google Scholar] [CrossRef] [Green Version]
  19. World Health Organization. Rotavirus Vaccination, Meeting of the immunization Strategic Advisory Group of Experts, April 2009. Wkly. Epidemiol. Rec. 2009, 84, 220–236. [Google Scholar]
  20. Msimang, V.M.; Page, N.; Groome, M.J.; Moyes, J.; Cortese, M.M.; Seheri, M.; Kahn, K.; Chagan, M.; Madhi, S.A.; Cohen, C. Impact of rotavirus vaccine on childhood diarrheal hospitalization after introduction into the South African public immunization program. Pediatr. Infect. Dis. J. 2013, 32, 1359–1364. [Google Scholar] [CrossRef]
  21. Page, N.A.; Seheri, L.M.; Groome, M.J.; Moyes, J.; Walaza, S.; Mphahlele, J.; Kahn, K.; Kapongo, C.N.; Zar, H.J.; Tempia, S.; et al. Temporal association of rotavirus vaccination and genotype circulation in South Africa: Observations from 2002 to 2014. Vaccine 2018, 36, 7231–7237. [Google Scholar] [CrossRef]
  22. Steele, A.D.; Peenze, I.; De Beer, M.C.; Pager, C.T.; Yeats, J.; Potgieter, N.; Ramsaroop, U.; Page, N.A.; Mitchell, J.O.; Geyer, A.; et al. Anticipating rotavirus vaccines: Epidemiology and surveillance of rotavirus in South Africa. Vaccine 2003, 21, 354–360. [Google Scholar] [CrossRef]
  23. Hoxie, I.; Dennehy, J.J. Intragenic recombination influences rotavirus diversity and evolution. Virus Evol. 2020, 6, vez059. [Google Scholar] [CrossRef]
  24. Esona, M.D.; Roy, S.; Rungsrisuriyachai, K.; Gautam, R.; Hermelijn, S.; Rey-Benito, G.; Bowen, M.D. Molecular characterization of a human G20P [28] rotavirus a strain with multiple genes related to bat rotaviruses. Infect. Genet. Evol. 2018, 57, 166–170. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Hoa-Tran, T.N.; Nakagomi, T.; Vu, H.M.; Nguyen, T.T.T.; Takemura, T.; Hasebe, F.; Dao, A.T.H.; Anh, P.H.Q.; Nguyen, A.T.; Dang, A.D.; et al. Detection of three independently-generated DS-1-like G9P [8] reassortant rotavirus A strains during the G9P [8] dominance in Vietnam, 2016–2018. Infect. Genet. Evol. 2020, 80, 104194. [Google Scholar] [CrossRef] [PubMed]
  26. Maringa, W.M.; Mwangi, P.N.; Simwaka, J.; Mpabalwani, E.M.; Mwenda, J.M.; Peenze, I.; Esona, M.D.; Mphahlele, M.J.; Seheri, M.L.; Nyaga, M.M. Molecular Characterisation of a Rare Reassortant Porcine-Like G5P[6] Rotavirus Strain Detected in an Unvaccinated Child in Kasama, Zambia. Pathogens 2020, 9, 663. [Google Scholar] [CrossRef]
  27. Shah, M.P.; Tate, J.E.; Mwenda, J.M.; Steele, A.D.; Parashar, U.D. Estimated reductions in hospitalizations and deaths from childhood diarrhea following implementation of rotavirus vaccination in Africa. Expert Rev. Vaccines 2017, 16, 987–995. [Google Scholar] [CrossRef]
  28. Benson, D.A.; Clark, K.; Karsch-Mizrachi, I.; Lipman, D.J.; Ostell, J.; Sayers, E.W. 655 GenBank. Nucleic Acids Res 2015, 43, 30–35. [Google Scholar] [CrossRef] [Green Version]
  29. Potgieter, A.C.; Page, N.A.; Liebenberg, J.; Wright, I.M.; Landt, O.; Van Dijk, A.A. Improved strategies for sequence-independent amplification and sequencing of viral double-stranded RNA genomes. J. Gen. Virol. 2009, 90, 1423–1432. [Google Scholar] [CrossRef]
  30. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef] [PubMed]
  31. Pickett, B.E.; Sadat, E.L.; Zhang, Y.; Noronha, J.M.; Squires, R.B.; Hunt, V.; Liu, M.; Kumar, S.; Zaremba, S.; Gu, Z.; et al. ViPR: An open bioinformatics database and analysis resource for virology research. Nucleic Acids Res. 2012, 40, D593–D598. [Google Scholar] [CrossRef]
  32. Tamura, K.; Stecher, G.; Peterson, D.; Filipski, A.; Kumar, S. MEGA6: Molecular evolutionary genetics analysis version 6.0. Mol. Biol. Evol. 2013, 30, 2725–2729. [Google Scholar] [CrossRef] [Green Version]
  33. Weaver, S.; Shank, S.D.; Spielman, S.J.; Li, M.; Muse, S.V.; Kosakovsky Pond, S.L. Datamonkey 2.0: A modern web application for characterizing selective and other evolutionary processes. Mol. Biol. Evol. 2018, 35, 773–777. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Kosakovsky Pond, S.L.; Frost, S.D. Not so different after all: A comparison of methods for detecting amino acid sites under selection. Mol. Biol. Evol. 2005, 22, 1208–1222. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Murrell, B.; Moola, S.; Mabona, A.; Weighill, T.; Sheward, D.; Kosakovsky Pond, S.L.; Scheffler, K. FUBAR: A fast, unconstrained bayesian approximation for inferring selection. Mol. Biol. Evol. 2013, 30, 1196–1205. [Google Scholar] [CrossRef] [Green Version]
  36. Murrell, B.; Wertheim, J.O.; Moola, S.; Weighill, T.; Scheffler, K.; Pond, S.L.K. Detecting individual sites subject to episodic diversifying selection. PLoS Genet. 2012, 8, e1002764. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Waterhouse, A.; Bertoni, M.; Bienert, S.; Studer, G.; Tauriello, G.; Gumienny, R.; Heer, F.T.; de Beer, T.A.P.; Rempfer, C.; Bordoli, L.; et al. SWISS-MODEL: Homology modelling of protein structures and complexes. Nucleic Acids Res. 2018, 46, W296–W303. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Eisenberg, D.; Lüthy, R.; Bowie, J.U. VERIFY3D: Assessment of protein models with three-dimensional profiles. In Methods in Enzymology; Academic Press: Cambridge, MA, USA, 1997; Volume 277, pp. 396–404. [Google Scholar]
  39. DeLano, W.L. Pymol: An open-source molecular graphics tool. CCP4 Newsl. Protein Crystallogr. 2002, 40, 82–92. [Google Scholar]
  40. Van Durme, J.; Delgado, J.; Stricher, F.; Serrano, L.; Schymkowitz, J.; Rousseau, F. A graphical interface for the FoldX forcefield. Bioinformatics 2011, 27, 1711–1712. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Krieger, E.; Koraimann, G.; Vriend, G. Increasing the precision of comparative models with YASARA NOVA—A self-parameterizing force field. Proteins Struct. Funct. Bioinform. 2002, 47, 393–402. [Google Scholar] [CrossRef] [PubMed]
  42. Aoki, S.T.; Settembre, E.C.; Trask, S.D.; Greenberg, H.B.; Harrison, S.C.; Dormitzer, P.R. Structure of rotavirus outer-layer protein VP7 bound with a neutralizing Fab. Science 2009, 324, 1444–1447. [Google Scholar] [CrossRef] [Green Version]
  43. Maiorov, V.N.; Crippen, G.M. Significance of root-mean-square deviation in comparing three-dimensional structures of globular proteins. J. Mol. Biol. 1994, 235, 625–634. [Google Scholar] [CrossRef] [Green Version]
  44. Morozova, O.V.; Sashina, T.A.; Fomina, S.G.; Novikova, N.A. Comparative characteristics of the VP7 and VP4 antigenic epitopes of the rotaviruses circulating in Russia (Nizhny Novgorod) and the Rotarix and RotaTeq vaccines. Arch. Virol. 2015, 160, 1693–1703. [Google Scholar] [CrossRef]
  45. Dormitzer, P.R.; Sun, Z.Y.J.; Wagner, G.; Harrison, S.C. The rhesus rotavirus VP4 sialic acid binding domain has a galectin fold with a novel carbohydrate binding site. EMBO J. 2002, 21, 885–897. [Google Scholar] [CrossRef] [PubMed]
  46. Betts, M.J.; Russell, R.B. Amino acid properties and consequences of substitutions. Bioinform. Genet. 2003, 317, 289. [Google Scholar]
  47. Tosser, G.; Labbe, M.; Bremont, M.; Cohen, J. Expression of the major capsid protein VP6 of group C rotavirus and synthesis of chimeric single-shelled particles by using recombinant baculoviruses. J. Virol. 1992, 66, 5825–5831. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Rahman, M.; Matthijnssens, J.; Saiada, F.; Hassan, Z.; Heylen, E.; Azim, T.; Van Ranst, M. Complete genomic analysis of a Bangladeshi G1P [8] rotavirus strain detected in 2003 reveals a close evolutionary relationship with contemporary human Wa-like strains. Infect. Genet. Evol. 2010, 10, 746–754. [Google Scholar] [CrossRef] [PubMed]
  49. Bányai, K.; László, B.; Duque, J.; Steele, A.D.; Nelson, E.A.S.; Gentsch, J.R.; Parashar, U.D. Systematic review of regional and temporal trends in global rotavirus strain diversity in the pre rotavirus vaccine era: Insights for understanding the impact of rotavirus vaccination programs. Vaccine 2012, 30, A122–A130. [Google Scholar] [CrossRef] [PubMed]
  50. Shintani, T.; Ghosh, S.; Wang, Y.H.; Zhou, X.; Zhou, D.J.; Kobayashi, N. Whole genomic analysis of human G1P [8] rotavirus strains from different age groups in China. Viruses 2012, 4, 1289–1304. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  51. Santos, F.S.; Junior, E.S.; Guerra, S.D.F.D.S.; Lobo, P.S.; Junior, E.P.; Lima, A.B.F.; Vinente, C.B.G.; Chagas, E.H.N.; Justino, M.C.A.; Linhares, A.D.C.; et al. G1P [8] Rotavirus in children with severe diarrhea in the post-vaccine introduction era in Brazil: Evidence of reassortments and structural modifications of the antigenic VP7 and VP4 regions. Infect. Genet. Evol. 2019, 69, 255–266. [Google Scholar] [CrossRef] [PubMed]
  52. Ghosh, S.; Kobayashi, N. Whole-genomic analysis of rotavirus strains: Current status and future prospects. Future Microbiol. 2011, 6, 1049–1065. [Google Scholar] [CrossRef]
  53. da Silva, M.F.M.; Rose, T.L.; Gómez, M.M.; Carvalho-Costa, F.A.; Fialho, A.M.; de Assis, R.M.S.; de Andrade, J.D.S.R.; de Mello Volotão, E.; Leite, J.P.G. G1P [8] species A rotavirus over 27 years–pre-and post-vaccination eras–in Brazil: Full genomic constellation analysis and no evidence for selection pressure by Rotarix® vaccine. Infect. Genet. Evol. 2015, 30, 206–218. [Google Scholar] [CrossRef] [Green Version]
  54. Dulgheroff, A.C.B.; Silva, G.A.V.D.; Naveca, F.G.; Oliveira, A.G.D.; Domingues, A.L.D.S. Diversity of group A rotavirus genes detected in the Triângulo Mineiro region, Minas Gerais, Brazil. Braz. J. Microbiol. 2016, 47, 731–740. [Google Scholar] [CrossRef] [Green Version]
  55. Almeida, T.N.V.; de Sousa, T.T.; da Silva, R.A.; Fiaccadori, F.S.; Souza, M.; Badr, K.R.; de Paula Cardoso, D.D.D. Phylogenetic analysis of G1P [8] and G12P [8] rotavirus A samples obtained in the pre-and post-vaccine periods, and molecular modeling of VP4 and VP7 proteins. Acta Trop. 2017, 173, 153–159. [Google Scholar] [CrossRef] [PubMed]
  56. Cunliffe, N.A.; Gondwe, J.S.; Graham, S.M.; Thindwa, B.D.M.; Dove, W.; Broadhead, R.L.; Molyneux, M.E.; Hart, C.A. Rotavirus strain diversity in Blantyre, Malawi, from 1997 to 1999. J. Clin. Microbiol. 2001, 39, 836–843. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Tatte, V.S.; Chitambar, D.S.D. Intragenotypic diversity in the VP4 encoding genes of rotavirus strains circulating in adolescent and adult cases of acute gastroenteritis in Pune, Western India: 1993 to 1996 and 2004 to 2007. J. Gen. Mol. Virol. 2011, 3, 53–62. [Google Scholar]
  58. Gouvea, V.; Lima, R.C.; Linhares, R.E.; Clark, H.F.; Nosawa, C.M.; Santos, N. Identification of two lineages (WA-like and F45-like) within the major rotavirus genotype P [8]. Virus Res. 1999, 59, 141–147. [Google Scholar] [CrossRef]
  59. Ansaldi, F.; Pastorino, B.; Valle, L.; Durando, P.; Sticchi, L.; Tucci, P.; Biasci, P.; Lai, P.; Gasparini, R.; Icardi, G. Molecular characterization of a new variant of rotavirus P [8] G9 predominant in a sentinel-based survey in central Italy. J. Clin. Microbiol. 2007, 45, 1011–1015. [Google Scholar] [CrossRef] [Green Version]
  60. Espinola, E.E.; Amarilla, A.; Arbiza, J.; Parra, G.I. Sequence and phylogenetic analysis of the VP4 gene of human rotaviruses isolated in Paraguay. Arch. Virol. 2008, 153, 1067–1073. [Google Scholar] [CrossRef]
  61. Zeller, M.; Patton, J.T.; Heylen, E.; De Coster, S.; Ciarlet, M.; Van Ranst, M.; Matthijnssens, J. Genetic analyses reveal differences in the VP7 and VP4 antigenic epitopes between human rotaviruses circulating in Belgium and rotaviruses in Rotarix and RotaTeq. J. Clin. Microbiol. 2012, 50, 966–976. [Google Scholar] [CrossRef] [Green Version]
  62. Fredj, M.B.H.; BenHamida-Rebaï, M.; Heylen, E.; Zeller, M.; Moussa, A.; Kacem, S.; Van Ranst, M.; Matthijnssens, J.; Trabelsi, A. Sequence and phylogenetic analyses of human rotavirus strains: Comparison of VP7 and VP8∗ antigenic epitopes between Tunisian and vaccine strains before national rotavirus vaccine introduction. Infect. Genet. Evol. 2013, 18, 132–144. [Google Scholar] [CrossRef]
  63. Caust, J.; Dyall-Smith, M.L.; Lazdins, I.; Holmes, I.H. Glycosylation, an important modifier of rotavirus antigenicity. Arch. Virol. 1987, 96, 123–134. [Google Scholar] [CrossRef]
  64. Zhang, Z.; Wang, L.; Gao, Y.; Zhang, J.; Zhenirovskyy, M.; Alexov, E. Predicting folding free energy changes upon single point mutations. Bioinformatics 2012, 28, 664–671. [Google Scholar] [CrossRef]
  65. Ogden, K.M.; Snyder, M.J.; Dennis, A.F.; Patton, J.T. Predicted structure and domain organization of rotavirus capping enzyme and innate immune antagonist VP3. J. Virol. 2014, 88, 9072–9085. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Maximum likelihood phylogenetic tree based on the full-length nucleotide sequences of genome segment 9 encoding the VP7 protein. The T92 + G evolutionary model was used for phylogenetic inference. South African pre-vaccine G1 strains are highlighted in bold-face while post-vaccine strains are highlighted in bold-red. Adjacent to some sequences is indicated a plus (+) sign followed by the number of identical sequences (S3). The number in brackets denotes the number of compressed strains. Lineages are indicated in roman numerals. Only bootstrap values ≥70% are shown adjacent to each branch node. Scale bar indicates the number of nucleotide substitutions per site.
Figure 1. Maximum likelihood phylogenetic tree based on the full-length nucleotide sequences of genome segment 9 encoding the VP7 protein. The T92 + G evolutionary model was used for phylogenetic inference. South African pre-vaccine G1 strains are highlighted in bold-face while post-vaccine strains are highlighted in bold-red. Adjacent to some sequences is indicated a plus (+) sign followed by the number of identical sequences (S3). The number in brackets denotes the number of compressed strains. Lineages are indicated in roman numerals. Only bootstrap values ≥70% are shown adjacent to each branch node. Scale bar indicates the number of nucleotide substitutions per site.
Vaccines 08 00609 g001
Figure 2. Maximum likelihood phylogenetic tree based on the full-length nucleotide sequences of genome segment 4 encoding VP4 protein. The GTR+G+I evolutionary model was used for phylogenetic inference. African pre-vaccine P[8] strains are highlighted in bold-face while post-vaccine strains are highlighted in bold-red. Adjacent to some sequences is indicated a plus (+) sign followed by the number of identical sequences (S3). The number in brackets denotes the number of compressed strains. Lineages are indicated in roman numerals. Only bootstrap values ≥ 70% are shown adjacent to each branch node. Scale bar indicates the number of nucleotide substitutions per site.
Figure 2. Maximum likelihood phylogenetic tree based on the full-length nucleotide sequences of genome segment 4 encoding VP4 protein. The GTR+G+I evolutionary model was used for phylogenetic inference. African pre-vaccine P[8] strains are highlighted in bold-face while post-vaccine strains are highlighted in bold-red. Adjacent to some sequences is indicated a plus (+) sign followed by the number of identical sequences (S3). The number in brackets denotes the number of compressed strains. Lineages are indicated in roman numerals. Only bootstrap values ≥ 70% are shown adjacent to each branch node. Scale bar indicates the number of nucleotide substitutions per site.
Vaccines 08 00609 g002
Figure 3. Structure models showing the mutation sites N147D (A) and T242A (B), respectively, that were only present in post-vaccine G1P[8] strains when superposed with VP7 of Rotarix®. The structure of the study strain is in green, while the Rotarix® structure is in deep-teal. The amino acid residues are represented in dark gray and red to examine whether the replacing amino acid alters the conformation of the protein structure. The amino acid residue highlighted in dark gray represents the study strain while that in firebrick red represents the Rotarix® strain. RMSD is the superposition value where the value of zero indicates absolute similarity. The stability of the protein after mutation was measured in kcal/mol, whereby free energy change of ±0.5 kcal/moL was regarded as significant for the stabilizing/destabilizing effect.
Figure 3. Structure models showing the mutation sites N147D (A) and T242A (B), respectively, that were only present in post-vaccine G1P[8] strains when superposed with VP7 of Rotarix®. The structure of the study strain is in green, while the Rotarix® structure is in deep-teal. The amino acid residues are represented in dark gray and red to examine whether the replacing amino acid alters the conformation of the protein structure. The amino acid residue highlighted in dark gray represents the study strain while that in firebrick red represents the Rotarix® strain. RMSD is the superposition value where the value of zero indicates absolute similarity. The stability of the protein after mutation was measured in kcal/mol, whereby free energy change of ±0.5 kcal/moL was regarded as significant for the stabilizing/destabilizing effect.
Vaccines 08 00609 g003
Figure 4. VP4 structure models showing mutations at sites T88I (A) and N89S (B), respectively, that were only seen in some post-vaccine strains when superposed with VP4 of Rotarix®. The structure of the study strain is in green while the Rotarix® structure is in deep-teal. The amino acid residues are represented in dark gray and red to examine whether the replacing amino acid alters the conformation of the protein structure. The amino acid residue highlighted in dark gray represents the study strain while the firebrick red represents the Rotarix® strain. The stability of the protein after mutation was measured in kcal/mol, whereby the folding energy change of ±0.5 kcal/mol is regarded as statistically significant for either stabilizing (−)/destabilizing (+) effect.
Figure 4. VP4 structure models showing mutations at sites T88I (A) and N89S (B), respectively, that were only seen in some post-vaccine strains when superposed with VP4 of Rotarix®. The structure of the study strain is in green while the Rotarix® structure is in deep-teal. The amino acid residues are represented in dark gray and red to examine whether the replacing amino acid alters the conformation of the protein structure. The amino acid residue highlighted in dark gray represents the study strain while the firebrick red represents the Rotarix® strain. The stability of the protein after mutation was measured in kcal/mol, whereby the folding energy change of ±0.5 kcal/mol is regarded as statistically significant for either stabilizing (−)/destabilizing (+) effect.
Vaccines 08 00609 g004
Table 1. Nucleotide identity analysis between South African pre- and post-vaccine G1P[8] strains, pre-vaccine G1P[8] strains with Rotarix®, and post-vaccine G1P[8] strains with Rotarix® strain.
Table 1. Nucleotide identity analysis between South African pre- and post-vaccine G1P[8] strains, pre-vaccine G1P[8] strains with Rotarix®, and post-vaccine G1P[8] strains with Rotarix® strain.
Gene Segments/Nucleotide Identity Values in PercentageVP7VP4VP6VP1VP2VP3NSP1NSP2NSP3NSP4NSP5
Comparison between pre- and post-vaccine G1P[8] strains 92.1–10087.8–99.589.0–99.493.7–99.092.9–99.689.4–99.183.5–99.589.1–99.794.2–10091.0–10092.2–99.8
Comparison between pre-vaccine G1P[8] strains and Rotarix® strain92.7–10089.6–91.188.2–98.794.5–99.193.1–99.091.2–98.683.6–10088.3–90.995.3–10091.4–98.992.2–98.8
Comparison between post-vaccine strains and Rotarix® strain93.3–10089.9–99.988.9–10094.5–10093.0–10090.9–99.984.0–99.989.7–10097.1–10091.0–10092.9–100
South African pre- and post-vaccine G1P[8] strains were compared with each other and also compared with the Rotarix® strain. The nucleotide identity values were calculated using the p-distance algorithm in MEGA 6 software (S2).
Table 2. Possible effects of amino acid mutations in South African post-vaccine G1 neutralization epitopes.
Table 2. Possible effects of amino acid mutations in South African post-vaccine G1 neutralization epitopes.
Strain Used for the Protein ModelingMutationNo. of Post-Vaccine Strain(s) with the MutationRegionAmino Acid Property ChangeSuperposition Value (RMSD)Free Energy Change
(kcal/mol)
Possible Effect
RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU4357/2015/G1P[8]N147D87–2 epitopeHydrophilic to hydrophilic;
Neutral to negative charge
0.020 Å+0.527 The change in charge may alter the biochemical properties of the epitope. The mutation significantly destabilizes the structure of the protein.
RVA/Human-wt/ZAF/MRC-DPRU1544/2010/G1P[8]T242A17–1b epitopeHydrophilic to hydrophobic;
Neutral to Neutral charge
0.012 Å−0.076 The change in polarity may alter the physicochemical property of the epitope
Relative mean square deviation (RMSD) is the superposition value where value of zero indicates absolute similarity. The stability of the protein after mutation was measured in kcal/mol, whereby free energy change of ±0.5 kcal/moL was regarded as significant for either stabilizing/destabilizing effect. Minus (−) value is indicative of stabilizing effect while positive (+) value is indicative of destabilizing effect.
Table 3. Possible effects of amino acid mutations in South African post-vaccine VP4 epitopes.
Table 3. Possible effects of amino acid mutations in South African post-vaccine VP4 epitopes.
Strain Used for Protein ModelingMutationNo. of Strain(s) with the MutationRegionAmino ACID Property ChangeSuperimposition Value (RMSD)Free Energy Change (kcal/mol)Possible Effect
RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU74/2014/G1P[8]T88I18–4 epitopeHydrophilic to hydrophobic;
Neutral charge to neutral charge
0.048 Å−0.297The change in polarity may alter the physicochemical properties of the protein. No significant impact on the stability of the protein structure
RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU83/2011/G1P[8]N89S18–4 epitopeHydrophilic to hydrophilic;
Neutral charge to Neutral charge
0.049 Å+1.166The loss of the glycosylation site may alter the chemical properties of the protein. The mutation destabilized the protein structure.
Relative mean square deviation (RMSD) is the superimposition value where the value of zero indicates absolute similarity. The stability of the protein after mutation was measured in kcal/moL, whereby the folding energy change of ±0.5 kcal/mol is considered statistically significant for either stabilizing (−)/destabilizing (+) effect.
Table 4. Positively selected sites as identified by FEL, FUBAR, and MEME analysis.
Table 4. Positively selected sites as identified by FEL, FUBAR, and MEME analysis.
MethodAmino Acid Sites in the Gene Segments under Positive Selection
VP1VP2VP3VP4VP6VP7NSP1NSP2NSP3NSP4NSP5
MEME2, 3, 5, 89, 243, 357, 733,
909,
1082
5837, 118, 137,
245, 310, 728
23, 28, 44, 90, 169, 201, 576, 578, 668, 773,
774, 775
85, 238-14, 32, 90,
132, 154,
175, 181,
225, 253,
261, 267,
293, 392,
396, 398,
404, 405,
416, 425
55, 258, 31520425, 1683
FUBAR67, 35712, 367, 245----75308--
FEL3-7, 245--------
Amino acid sites that were identified to be under positive selection by FEL, MEME, and FUBAR. Amino acid sites in bold and underlined were identified by all the three methods. For MEME and FEL, statistical significance was assessed at p ≤ 0.1 while for FUBAR, it was assessed at posterior probability ≥0.9. The dash (–) sign indicates no site was identified. Additional analyses reports are provided in S9.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Mwangi, P.N.; Mogotsi, M.T.; Seheri, M.L.; Mphahlele, M.J.; Peenze, I.; Esona, M.D.; Kumwenda, B.; Steele, A.D.; Kirkwood, C.D.; Ndze, V.N.; et al. Whole Genome In-Silico Analysis of South African G1P[8] Rotavirus Strains before and after Vaccine Introduction over a Period of 14 Years. Vaccines 2020, 8, 609. https://doi.org/10.3390/vaccines8040609

AMA Style

Mwangi PN, Mogotsi MT, Seheri ML, Mphahlele MJ, Peenze I, Esona MD, Kumwenda B, Steele AD, Kirkwood CD, Ndze VN, et al. Whole Genome In-Silico Analysis of South African G1P[8] Rotavirus Strains before and after Vaccine Introduction over a Period of 14 Years. Vaccines. 2020; 8(4):609. https://doi.org/10.3390/vaccines8040609

Chicago/Turabian Style

Mwangi, Peter N., Milton T. Mogotsi, Mapaseka L. Seheri, M. Jeffrey Mphahlele, Ina Peenze, Mathew D. Esona, Benjamin Kumwenda, A. Duncan Steele, Carl D. Kirkwood, Valantine N. Ndze, and et al. 2020. "Whole Genome In-Silico Analysis of South African G1P[8] Rotavirus Strains before and after Vaccine Introduction over a Period of 14 Years" Vaccines 8, no. 4: 609. https://doi.org/10.3390/vaccines8040609

APA Style

Mwangi, P. N., Mogotsi, M. T., Seheri, M. L., Mphahlele, M. J., Peenze, I., Esona, M. D., Kumwenda, B., Steele, A. D., Kirkwood, C. D., Ndze, V. N., Dennis, F. E., Jere, K. C., & Nyaga, M. M. (2020). Whole Genome In-Silico Analysis of South African G1P[8] Rotavirus Strains before and after Vaccine Introduction over a Period of 14 Years. Vaccines, 8(4), 609. https://doi.org/10.3390/vaccines8040609

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop