Next Article in Journal
First Detection of SARS-CoV-2 Delta (B.1.617.2) Variant of Concern in a Dog with Clinical Signs in Spain
Next Article in Special Issue
Detailed Analyses of Molecular Interactions between Favipiravir and RNA Viruses In Silico
Previous Article in Journal
Spatiotemporal Associations and Molecular Evolution of Highly Pathogenic Avian Influenza A H7N9 Virus in China from 2017 to 2021
Previous Article in Special Issue
Constraint of Base Pairing on HDV Genome Evolution
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Detailed Evolutionary Analyses of the F Gene in the Respiratory Syncytial Virus Subgroup A

1
Gunma Prefectural Institute of Public Health and Environmental Sciences, Maebashi-shi 371-0052, Japan
2
Department of Health Science, Gunma Paz University Graduate School, Takasaki-shi 370-0006, Japan
3
Department of Respiratory Medicine, Kyorin University School of Medicine, Mitaka-shi 181-8611, Japan
4
Division of Nursing Science, Hiroshima University, Hiroshima-shi 734-8551, Japan
5
Department of Pediatrics, Sapporo Medical University School of Medicine, Sapporo-shi 060-8543, Japan
6
Department of Microbiology, Yokohama City University School of Medicine, Yokohama-shi 236-0004, Japan
7
Department of Virology, National Institute of Infectious Diseases, Musashimurayama-shi 208-0011, Japan
8
Department of Pediatrics, Tokyo Medical University, Shinjuku-ku 160-0023, Japan
*
Author to whom correspondence should be addressed.
Viruses 2021, 13(12), 2525; https://doi.org/10.3390/v13122525
Submission received: 8 November 2021 / Revised: 6 December 2021 / Accepted: 13 December 2021 / Published: 15 December 2021
(This article belongs to the Special Issue RNA Viruses: Structure, Adaptation, and Evolution)

Abstract

:
We performed evolution, phylodynamics, and reinfection-related antigenicity analyses of respiratory syncytial virus subgroup A (RSV-A) fusion (F) gene in globally collected strains (1465 strains) using authentic bioinformatics methods. The time-scaled evolutionary tree using the Bayesian Markov chain Monte Carlo method estimated that a common ancestor of the RSV-A, RSV-B, and bovine-RSV diverged at around 450 years ago, and RSV-A and RSV-B diverged around 250 years ago. Finally, the RSV-A F gene formed eight genotypes (GA1-GA7 and NA1) over the last 80 years. Phylodynamics of RSV-A F gene, including all genotype strains, increased twice in the 1990s and 2010s, while patterns of each RSV-A genotype were different. Phylogenetic distance analysis suggested that the genetic distances of the strains were relatively short (less than 0.05). No positive selection sites were estimated, while many negative selection sites were found. Moreover, the F protein 3D structure mapping and conformational epitope analysis implied that the conformational epitopes did not correspond to the neutralizing antibody binding sites of the F protein. These results suggested that the RSV-A F gene is relatively conserved, and mismatches between conformational epitopes and neutralizing antibody binding sites of the F protein are responsible for the virus reinfection.

1. Introduction

The respiratory syncytial virus (RSV) belongs to the genus Orthopneumovirus and the family Pneumoviridae, and causes respiratory illness in humans [1]. Particularly, the agent is responsible for severe bronchitis, bronchiolitis, and pneumonia in early infants [2,3,4]. Moreover, primary infection of the virus in infants may frequently show wheezy lower respiratory infections [5]. Epidemiological data suggest that all infants with RSV are infected by the age of 2 years [5]. Therefore, the infection caused by the virus is a major disease burden in infants and the elderly [6,7].
RSV has two structural proteins: fusion proteins (F) and attachment glycoprotein (G) on the virion surface. Among these, the F protein plays important roles in infection for the virus through the TLR4 (an inert immunity-related ligand) on the host cells and acts as a major antigen [1]. A pharmaceutical production of monoclonal antibody (palivizumab) is used as a preventive drug against RSV infection [1,8]. This drug can bind to the F protein resulting in the neutralization of the virus. Therefore, the F protein is a target molecule for vaccine development [9].
There are two F protein forms, i.e., pre- and post-fusion types [1]. Previous reports show that the prefusion type undergoes conformational changes resulting in formation of the post-fusion type [10,11]. Furthermore, a previous report indicates that the prefusion type shows a stronger antigenicity than the post-fusion type [12]. Moreover, it is suggested that neutralization of the RSV in vitro is observed when neutralizing antibodies bind to the prefusion type [11]. Although RSV reinfections occur throughout life [1], relationships between RSV antigenicity and reinfection remain unclear [13].
Phylogenetically and genetically, RSV is classified into two subgroups (RSV-A and B) and many genotypes. Previous reports suggested that RSV-A is a major prevalent subgroup, including eight genotypes [14]. Other reports also showed that some genotypes of RSV-A caused epidemics in various areas [15,16,17,18]. However, the phylodynamics of RSV-A are not exactly known.
Various in silico techniques combined with bioinformatics have been developed, and these allow us to understand detailed viral evolution [19]. In the present study, we presented evolution, phylodynamics, and reinfection-related antigenicity data of RSV-A F gene globally collected strains.

2. Materials and Methods

2.1. Strains Used in This Study

We collected full-length nucleotide sequences of the F gene of RSV-A from GenBank (https://www.ncbi.nlm.nih.gov/genbank/) to analyze the molecular evolution on 30 June 2020. In total, we obtained sequences of 4256 RSV strains. First, we selected subgroup A strains, then we omitted strains with ambiguous sequences and sequences that were not accompanied by information on the detection year and region. We also omitted 100% nucleotide sequence similarity in the same detection year and country. As a result, we used a total of 1465 strains in the analysis. All strains used in this study are shown in Supplementary Table S1.

2.2. Phylogenetic Analysis and Estimation of Evolutionary Rate

We used the Bayesian Markov chain Monte Carlo (MCMC) method in BEAST package v2.4.8, as previously described, to construct a phylogenic tree and estimate the evolutionary rate of the RSV-A F gene [20,21,22]. In this phylogenetic analysis, we used all the RSV-A strains (1465 strains) and two reference strains, CH18537 (a prototype RSV-B strain, accession no. JX198143) and RB94 (a prototype bovine-RSV strain, accession no. D00953). We used the jModelTest 2.1.10 programs [23] to select an appropriate substitution model, and Standard_TIM2 was selected. We performed path sampling to determine the best of four clock models (strict clock, relaxed clock exponential, relaxed clock log-normal, and random local clock) and two tree prior models (coalescent constant population and coalescent exponential population), using the Path sampler implemented in BEAST. A strict clock and the coalescent exponential population were selected. The MCMC chains consisted of 100,000,000 steps with sampling every 5000 steps. Tracer v1.7 was used to confirm the convergence of all parameters (effective sample size values above 200) [24]. After discarding the 10% burn-in, phylogenetic trees were constructed with TreeAnnotator v2.4.8 and illustrated by FigTree v1.4.0. In addition, the rates of molecular evolution were also calculated by suitable models selected for each dataset as described above.

2.3. Bayesian Skyline Plot Analyses

Bayesian skyline plot (BSP) analyses were performed using BEAST v2.4.8 to analyze the effective population size of the RSV-A strains and each genotype [20]. The best substitution and clock models were selected as described above. The Bayesian skyline plots were visualized with 95% highest probability density (HPD) using Tracer v1.7 [24].

2.4. Similarity Plot Analyses and Calculation of the Phylogenetic Distances

We calculated the nucleotide similarity of each sequence using SimPlot program 3.5.1 to clarify the relationships among the aligned nucleotide sequences of the RSV-A F gene [25]. The Long strain (a prototype RSV-A strain, accession no. JX198112) was used as the query sequence. The similarity was calculated using the Kimura 2-parameter method with a window size of 200 nucleotides and a step size of 20 nucleotides.
We constructed a phylogenetic tree of all RSV-A strains based on the maximum likelihood (ML) method using MEGA7 software to estimate the phylogenetic distance [26]. We used the jModelTest 2.1.10 programs to determine the best substitution model. Subsequently, the phylogenetic distance of the ML tree was calculated using the Patristic program [27].

2.5. Selective Pressure Analyses

We tested using Datamonkey (http://www.datamonkey.org/) whether sites in the F gene were under positive or negative selection as previously described on 30 June 2020 [21]. Datamonkey has an upper limit on the number of computable sequences, so it is necessary to reduce this to 500. First, the sequences with 100% amino acid sequence similarity were deleted from the dataset (552 sequences), and 52 sequences were randomly selected and deleted from the sequences with a difference of 1 amino acid to obtain 500 sequences. We used four different methods: the single likelihood ancestor (SLAC) method, the fixed effects likelihood (FEL) method, the internal fixed effects likelihood (IFEL) method, and the mixed-effects model of evolution (MEME). Significance level was p < 0.05.

2.6. Prediction of Conformational B-Cell Epitope and Amino Acid Substitution Sites by Mapping on the Structure of the RSV-A F Protein

Structural models of the prefusion F protein of RSV-A were constructed for representative strains from each genotype (prototype, Long strain, JX198112; GA1, RSVA/Homo sapiens/USA/78I-004A-01-01/1977 strain, KU316106; GA2, HRSV/Yokohama.JPN/V13835/1996 strain, LC337817; GA3, HRSV/Yokohama.JPN/V10831/ 1992 strain, LC337812; GA4, RSVA/Homo sapiens/USA/81E-078-01/1977 strain, KU316149; GA5, RSVA/Homo sapiens/USA/MCRSV_259/1990 strain, MG642055; GA6, RSVA/Homo sapiens/USA/MCRSV_226/1982 strain, MG642063; GA7, RSVA/Homo sapiens/USA/84I-220A-01-01/1984 strain, KU316110; NA1, Kilifi_10028_12_ RSVA_2003 strain, KP317955) using MODELLER v9.20 [28]. The templates for homology modeling were based on the crystal structure of the protein (Protein Data Bank accession ID: 6EAD). Constructed models were minimized using GROMOS96 [29] implementation in the Swiss PDB Viewer v4.1 [30] and then evaluated by Ramachandran plots produced with Coot [31]. We analyzed conformational epitopes of the constructed models using DiscoTope 2.0 [32], BEpro [33], ElliPro [34], EPCES [35], and EPSVR [36] with cut-off values of −3.7 (DiscoTope 2.0), 1.3 (BEpro), 0.5 (ElliPro), and 70 (EPCES, EPSVR). The accuracy of the analyses was also supported among the consensus sites predicted by more than four of the five methods, and regions with close residues over two of the sites on the trimeric structure models were determined as conformational epitopes. We also estimated the amino acid substitution of the representative strains of each genotype from the prototype strain. Finally, we mapped predicted B-cell epitopes and amino acid substitution sites in each genotype and palivizumab epitopes on the models using Chimera v1.13.1 [37].

3. Results

3.1. Phylogenetic and Evolutionary Analyses of the RSV-A F Gene

At first, we performed a phylogenetic analysis using the MCMC method (Figure 1) for evaluating time scale evolution of the RSV-A F gene. As shown in Figure 1, human-RSV and bovine-RSV diverged from their common ancestor in 1563 (mean; 95% HPD, 1504–1624). RSV-A and -B diverged in 1766 (mean; 95% HPD, 1734–1794). RSV-A was classified into eight genotypes, with the most recently diverged genotype, NA1, accounting for 74.5% (1092/1465 strains) of the total. The respective divergence times and strain numbers of each genotype in RSV-A are shown in Table 1.
Secondly, we estimated the evolutionary rate of the RSV-A F gene. The evolutionary rate of the entire RSV-A was calculated to be 7.69 × 10−4 substitutions/site/year (95% HPD, 7.10–8.29 × 10−4 substitutions/site/year). The fastest evolutionary rate by genotype was 8.34 × 10−4 substitutions/site/year (95% HPD, 6.43 × 10−4–1.04 × 10−3 substitutions/site/year) for GA2, and the slowest was 4.48 × 10−4 substitutions/site/year (95% HPD, 2.79 × 10−4–6.24 × 10−4 substitutions/site/year) for GA1 (Supplementary Table S2). As noted, the genotype GA6 was not examined due to small strain numbers (six strains).

3.2. BSP Analyses

We analyzed the phylodynamics of the present RSV-A strains using the Bayesian skyline plots method (Figure 2a–h). First, the effective population size of the present strains increased by two steps at around 2000 and 2010, respectively. These may reflect an increase in the strain numbers and diverged years of the genotypes GA5 and NA1 in the phylogenetic tree. The phylodynamics of each genotype exception of GA6 were also reflected in these factors.

3.3. Similarity Analysis and Phylogenetic Distances

Using SimPlot analysis, we examined the nucleotide similarity of the F gene of RSV-A (Figure 3). The results showed that the similarity of the entire RSV-A was high (>92%). Furthermore, we performed a phylogenetic distance analysis for the RSV-A F gene (Figure 4). Phylogenetic distance for the entire RSV-A was 0.024 ± 0.021 (mean ± SD). The most far apart phylogenetic distance was 0.017 ± 0.007 for GA3 by RSV-A genotypes; conversely, the closest was 0.006 ± 0.004 for GA1 (Supplementary Table S3).

3.4. Selective Pressure Analyses

We analyzed positive and negative selection sites of the F protein to determine selective pressure against the host. Unfortunately, there were no positive selection sites common to all four methods when calculated by SLAC, FEL, IFEL, and MEME. On the other hand, the negative selection sites were 165 for SLAC, 242 for FEL, and 176 for IFEL, and 137 sites were identified as common to all three methods.

3.5. Mapping of Amino Acid Substitution Sites and Conformational B-Cell Epitopes on the Structure of the RSV-A F Protein

Structural models of the Long strain and representative strains of each genotype (NA1, GA1, GA2) were constructed (Figure 5). Since the amino acid sequences in the range included in the structural model of the representative strains of GA2, GA3, GA4, GA5, GA6, and GA7 were the same, only GA2 is shown. Then, amino acid substitutions corresponding to the prototype strain are shown in green. Three substitutions, “S101P,” “R213S,” and “V384I,” were detected in the amino acid sequence in the range included in the structural model of GA2 (Figure 5c). Four substitutions, “S101P,” “R213S,” “E356D,” and “V384I,” were found in the range of amino acid sequences included in the GA1 structural model (Figure 5b). E356D was not shown because it was almost hidden as an inner region of the protein structure. Likewise, four substitutions, “S101P,” “R213S,” “N276S,” and “V384I,” were present in the amino acid sequence in the range included in the structural model of NA1 (Figure 5d). We estimated the conformational B-cell epitope of RSV-A F protein (Table 2). Two epitopes were found in chain A, B, and C, respectively. Residues aa65~68 were in DIII of F2, and residues aa209~211 were in HRA. Both were located at the top of the structure protein models (colored in cyan).

4. Discussion

Molecular epidemiology of RSV infection based on the RSV-A F gene sequences has been studied in many reports [21,38,39,40,41]. However, most of these studies were domestic [35,36,37,38], while evolution, phylodynamics, and reinfection-related antigenicity of the virus based on the F gene are not exactly known. Therefore, we performed detailed evolutionary analyses of the RSV F gene using various in silico techniques combined with authentic bioinformatics to elucidate the evolution of the RSV-A F gene globally collected strains (1465 strains).
First, we constructed a time-scaled phylogenetic tree using MCMC methods (Figure 1). As a result of this tree, we demonstrated that the common ancestor of RSV-A, RSV-B, and bovine-RSV diverged around 450 years ago (1560s), and RSV-A and RSV-B diverged around 250 years ago (1760s). Moreover, the present RSV-A strains formed eight genotypes (GA1-GA7 and NA1) over 80 years. Interestingly, from the 1940s to the 1990s, seven genotypes (GA1-7) simultaneously emerged. Of these, off-springs of six genotypes (GA1, GA3-7) disappeared. As a result, an off-spring of the genotype GA2, i.e., genotype NA1, became a major prevalent genotype over the last 20 years. This genotype rapidly became dominant in the present phylogenetic tree. Between these values and the previous data, our report is different [21]. These may be partially due to the strain numbers and MCMC conditions [20]. Furthermore, we also estimated the evolutionary rates of the present strains. The evolutionary rate of the RSV-A F gene was estimated as 7.69 × 10−4 substitutions/site/year (s/s/y) (95% HPD, 7.10–8.29 × 10−4 s/s/y). This value was similar to the F gene of human respiro-virus type 3 [42]. Moreover, the rate was slower than that of the attachment glycoprotein (G) gene (another major antigen) [21].
Next, we assessed the phylodynamics of each genotype RSV-A F gene (excepting GA6), using the BSP method, during the past years. As a result, the genome population of genotypes GA1-3, GA5, and GA7 transiently increased from the 1980s to the 1990s. Moreover, genotype NA1 diverged from GA2 in 1994, and this became a recent prevalent genotype. Thus, these results regarding our phylodynamics data and epidemiological data may be compatible.
Next, to estimate the genetic divergence of the present strains, we calculated nucleotide identities and phylogenetic distances (Figure 4). The nucleotide identities among the strains were over 92%, and the ranges of phylogenetic distances were very narrow [0.024 ± 0.021 (mean ± SD)]. Thus, the results indicated that the F gene in these strains was highly conservative.
We also performed selective pressure analyses in the F protein genes among the strains. As a result, no selective pressure was estimated, while many negative selection sites (137 sites) were found. In general, positive selection sites may be shown to escape from the selective pressure in the host (i.e., immune system), while negative selection sites may act to prevent the protein deterioration [43]. These results suggested that the F protein was not sensitive to the selective pressure by the host, and amino acid substitutions of the proteins may maintain infectivity for the host cells.
Previous reports showed that the neutralization antibody (NT-Ab), including palivizumab (a monoclonal antibody to prevent RSV infection), could bind to the prefusion form of the F protein resulting in the prevention of RSV infection [11]. Conformational epitopes may act to produce the NT-Ab in the host [44]. Therefore, it is essential to estimate comparing conformational epitopes of viral antigens and NT-Ab binding sites [44]. If an incompatibility is seen in them, infections may recur. Therefore, we estimated comparing conformational epitopes and NT-Ab binding sites (palivizumab binding sites) in the prefusion type of the F protein (Figure 5 and Table 2). As a result, conformational epitopes did not correspond to the NT-Ab binding sites in the prefusion type of the protein. The results suggested that this mismatch may be partially responsible for RSV reinfection and another virus such as human respiro-virus type 3 [42]. Furthermore, an amino acid substitution of N276S in the palivizumab binding sites may reduce this drug effect. Our results showed that almost all strains (94.4%) of the recent prevalent genotype (NA1) had the N276S. Thus, most prevalent RSV-A strains had partial resistance to the palivizumab [45]. Presently, palivizumab is only available to prevent severe RSV infection in infants with underlying conditions such as congenital heart/lung diseases, low birth infants, and Down’s syndrome [46]. Continuous virus surveillance regarding the RSV-A F gene sequences may be needed to monitor drug sensitivity against RSV-A. Moreover, the NT-Ab binds to prefusion F protein and can inhibit infection to the host cell. In contrast, the antibody-bound post-fusion F protein has no affect [12]. Our previous report showed relationships between NT-Ab binding sites and conformational epitopes in the post-fusion F protein [21]. Thus, the current new data may be more precise than the previous data [21]. Moreover, there is regional bias of sequence availability in GenBank. This is a limitation of the present study.

5. Conclusions

We performed detailed evolutionary analyses of respiratory syncytial virus subgroup A (RSV-A) fusion (F) gene in the globally collected strains using various bioinformatics methods. A time-scaled evolutionary tree showed that a common ancestor of RSV-A, RSV-B, and bovine-RSV diverged around 450 years ago, and RSV-A and RSV-B diverged around 250 years ago. The RSV-A F gene formed eight genotypes (GA1-GA7 and NA1) during the last 80 years. Phylodynamics of RSV-A F gene including all genotype strains increased twice tin the 1990s and 2010s, while patterns of each RSV-A genotype were distinct. Phylogenetic distance analysis suggested that the genetic distances of the strains were short. No positive selection sites were found, while many negative selection sites were estimated. Moreover, the F protein 3D structure and conformational epitope analyses showed that the conformational epitopes did not correspond to the NT-Ab binding sites of the F protein. These data implied that the RSV-A F gene is relatively conserved. Mismatches between conformational epitopes and NT-Ab binding sites of the F protein may be responsible for virus reinfection.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/v13122525/s1, Table S1: Strains used in this study, Table S2: Evolutionary rates for each genotype, Table S3: Phylogenetic distances for each genotype.

Author Contributions

Conceptualization, H.K. (Hirokazu Kimura) and N.S.; methodology, M.S. (Mariko Saito), H.T., T.S. (Tatsuya Shirai), K.O., T.S. (Toshiyuki Sugai), T.T. and Y.H; formal analysis, M.S. (Mariko Saito), S.S. and H.K. (Hirokazu Kimura); data curation, H.K. (Hirokazu Kimura), M.T., H.K. (Hisashi Kawashima), A.R. and N.S.; writing—original draft preparation, M.S. (Mariko Saito), H.K. (Hirokazu Kimura) and M.S. (Mitsuru Sada); writing—review and editing, H.K. (Hirokazu Kimura) and Y.H. visualization, M.S. (Mariko Saito), S.S.; funding acquisition, H.K. (Hirokazu Kimura) and H.K (Hisashi Kawashima). All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Japan Agency for Medical Research and Development, AMED (https://www.amed.go.jp/, accessed on 8 November 2021) under Grant Number JP21fk0108119. The funders had no role in study design, data collection, analysis, decision to publish, or manuscript preparation.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study, in the collection, analyses, or interpretation of data, in the writing of the manuscript, or in the decision to publish the results.

References

  1. Collins, P.L.; Karron, R.A. Respiratory syncytial virus and matapneumovirus. In Fields Virology, 6th ed.; Knipe, D.M., Howley, P.M., Cohen, J.I., Griffin, D.E., Lamb, R.A., Martin, M.A., Racaniello, V.D., Roizman, B., Eds.; Lippincott Williams & Wilkins: Philadelphia, PA, USA, 2013; Volume 1, pp. 1086–1123. [Google Scholar]
  2. Leung, A.K.; Kellner, J.D.; Davies, H.D. Respiratory syncytial virus bronchiolitis. J. Natl. Med. 2005, 97, 1708–1713. [Google Scholar]
  3. Shay, D.K.; Holman, R.C.; Newman, R.D.; Liu, L.L.; Stout, J.W.; Anderson, L.J. Bronchiolitis-associated hospitalizations among US children, 1980-1996. JAMA 1999, 282, 1440–1446. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Yorita, K.L.; Holman, R.C.; Steiner, C.A.; Effler, P.V.; Miyamura, J.; Forbes, S.; Anderson, L.J.; Balaraman, V. Severe bronchiolitis and respiratory syncytial virus among young children in Hawaii. Pediatr. Infect. Dis. J. 2007, 26, 1081–1088. [Google Scholar] [CrossRef] [PubMed]
  5. Glezen, W.P.; Taber, L.H.; Frank, A.L.; Kasel, J.A. Risk of primary infection and reinfection with respiratory syncytial virus. Am. J. Dis. Child. 1986, 140, 543–546. [Google Scholar] [CrossRef]
  6. Branche, A.R.; Falsey, A.R. Respiratory syncytial virus infection in older adults: An under-recognized problem. Drugs Aging 2015, 32, 261–269. [Google Scholar] [CrossRef] [PubMed]
  7. Lee, N.; Lui, G.C.; Wong, K.T.; Li, T.C.; Tse, E.C.; Chan, J.Y.; Yu, J.; Wong, S.S.; Choi, K.W.; Wong, R.Y.; et al. High morbidity and mortality in adults hospitalized for respiratory syncytial virus infections. Clin. Infect. Dis. 2013, 57, 1069–1077. [Google Scholar] [CrossRef]
  8. Johnson, S.; Oliver, C.; Prince, G.A.; Hemming, V.G.; Pfarr, D.S.; Wang, S.C.; Dormitzer, M.; O’Grady, J.; Koenig, S.; Tamura, J.K.; et al. Development of a humanized monoclonal antibody (MEDI-493) with potent in vitro and in vivo activity against respiratory syncytial virus. J. Infect. Dis. 1997, 176, 1215–1224. [Google Scholar] [CrossRef] [Green Version]
  9. Griffiths, C.; Grews, S.J.; Marchant, D.J. Respiratory Syncytial Virus: Infection, Detection, and New Options for Prevention and Treatment. Clin. Microbiol. Rev. 2017, 30, 277–319. [Google Scholar] [CrossRef] [Green Version]
  10. Swanson, K.A.; Settembre, E.C.; Shaw, C.A.; Dey, A.K.; Rappuoli, R.; Mandl, C.W.; Dormitzer, P.R.; Carfi, A. Structural basis for immunization with postfusion respiratory syncytial virus fusion F glycoprotein (RSV F) to elicit high neutralizing antibody titers. Proc. Natl. Acad. Sci. USA 2011, 108, 9619–9624. [Google Scholar] [CrossRef] [Green Version]
  11. McLellan, J.S.; Chen, M.; Leung, S.; Graepel, K.W.; Du, X.; Yang, Y.; Zhou, T.; Baxa, U.; Yasuda, E.; Beaumont, T.; et al. Structure of RSV fusion glycoprotein trimer bound to a prefusion-specific neutralizing antibody. Science 2013, 340, 1113–1117. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Killikelly, A.; Kanekiyo, M.; Graham, B. Pre-fusion F is absent on the surface of formalin-inactivated respiratory syncytial virus. Sci. Rep. 2016, 6, 34108. [Google Scholar] [CrossRef] [PubMed]
  13. Ascough, S.; Paterson, S.; Chiu, C. Induction and subversion of human protective immunity: Contrasting influenza and respiratory syncytial virus. Front. Immunol. 2018, 9, 323. [Google Scholar] [CrossRef] [Green Version]
  14. Muñoz-Escalante, J.C.; Comas-García, A.; Bernal-Silva, S.; Robles-Espinoza, C.D.; Gómez-Leal, G.; Noyola, D.E. Respiratory syncytial virus A genotype classification based on systematic intergenotypic and intragenotypic sequence analysis. Sci. Rep. 2019, 9, 20097. [Google Scholar] [CrossRef] [PubMed]
  15. Chen, X.; Xu, B.; Guo, J.; Li, C.; An, S.; Zhou, Y.; Chen, A.; Deng, L.; Fu, Z.; Zhu, Y.; et al. Genetic variations in the fusion protein of respiratory syncytial virus isolated from children hospitalized with community-acquired pneumonia in China. Sci. Rep. 2018, 8, 4491. [Google Scholar] [CrossRef] [PubMed]
  16. Di Giallonardo, F.; Kok, J.; Fernandez, M.; Carter, I.; Geoghegan, J.L.; Dwyer, D.E.; Holmes, E.C.; Eden, J.S. Evolution of human respiratory syncytial virus (RSV) over multiple seasons in New South Wales, Australia. Viruses 2018, 10, 476. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Otieno, J.R.; Kamau, E.M.; Oketch, J.W.; Ngoi, J.M.; Gichuki, A.M.; Binter, S.; Otieno, G.P.; Ngama, M.; Agoti, C.N.; Cane, P.A.; et al. Whole genome analysis of local Kenyan and global sequences unravels the epidemiological and molecular evolutionary dynamics of RSV genotype ON1 strains. Virus Evol. 2018, 4, vey027. [Google Scholar] [CrossRef] [Green Version]
  18. Yassine, H.M.; Sohail, M.U.; Younes, N.; Nasrallah, G.K. Systematic Review of the Respiratory Syncytial Virus (RSV) Prevalence, Genotype Distribution, and Seasonality in Children from the Middle East and North Africa (MENA) Region. Microorganisms 2020, 8, 713. [Google Scholar] [CrossRef]
  19. Pappas, N.; Roux, S.; Hölzer, M.; Lamkiewicz, K.; Mock, F.; Marz, M.; Dutilh, B.E. Virus bioinformatics. In Encyclopedia of Virology, 4th ed.; Academic Press: Cambridge, MA, USA, 2021; pp. 124–132. [Google Scholar]
  20. Bouckaert, R.; Heled, J.; Kühnert, D.; Vaughan, T.; Wu, C.H.; Xie, D.; Suchard, M.A.; Rambaut, A.; Drummond, A.J. BEAST 2: A software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 2014, 10, e1003537. [Google Scholar] [CrossRef] [Green Version]
  21. Kimura, H.; Nagasawa, K.; Tsukagoshi, H.; Matsushima, Y.; Fujita, K.; Yoshida, L.M.; Tanaka, R.; Ishii, H.; Shimojo, N.; Kuroda, M.; et al. Molecular evolution of the fusion protein gene in human respiratory syncytial virus subgroup A. Infect. Genet. Evol. 2016, 43, 398–406. [Google Scholar] [CrossRef] [PubMed]
  22. Kimura, H.; Nagasawa, K.; Kimura, R.; Tsukagoshi, H.; Matsushima, Y.; Fujita, K.; Hirano, E.; Ishiwada, N.; Misaki, T.; Oishi, K.; et al. Molecular evolution of the fusion protein (F) gene in human respiratory syncytial virus subgroup B. Infect. Genet. Evol. 2017, 52, 1–9. [Google Scholar] [CrossRef] [PubMed]
  23. Darriba, D.; Taboada, G.L.; Doallo, R.; Posada, D. jModelTest 2: More models, new heuristics and parallel computing. Nat. Methods 2012, 9, 772. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Rambaut, A.; Drummond, A.J.; Xie, D.; Baele, G.; Suchard, M.A. Posterior summarisation in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 2018, 67, 901–904. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Lole, K.S.; Bollinger, R.C.; Paranjape, R.S.; Gadkari, D.; Kulkarni, S.S.; Novak, N.G.; Ingersoll, R.; Sheppard, H.W.; Ray, S.C. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. J. Virol. 1999, 73, 152–160. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Kumar, S.; Stecher, G.; Tamura, K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 2016, 33, 1870–1874. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Fourment, M.; Gibbs, M.J. PATRISTIC: A program for calculating patristic distances and graphically comparing the components of genetic change. BMC Evol. Biol. 2006, 6, 1. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Webb, B.; Sali, A. Protein structure modeling with MODELLER. Methods Mol. Biol. 2014, 1137, 1–15. [Google Scholar]
  29. Scott, W.R.P.; Hünenberger, P.H.; Tironi, I.G.; Mark, A.E.; Billeter, S.R.; Fennen, J.; Torda, A.E.; Huber, T.; Krüger, P.; van Gunsteren, W.F. The GROMOS biomolecular simulation program package. J. Phys. Chem. A 1999, 103, 3596–3607. [Google Scholar] [CrossRef]
  30. Guex, N.; Peitsch, M.C. SWISS-MODEL and the Swiss-PdbViewer: An environment for comparative protein modeling. Electrophoresis 1997, 18, 2714–2723. [Google Scholar] [CrossRef]
  31. Emsley, P.; Lohkamp, B.; Scott, W.G.; Cowtan, K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 2010, 66, 486–501. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Kringelum, J.V.; Lundegaard, C.; Lund, O.; Nielsen, M. Reliable B cell epitope predictions: Impacts of method development and improved benchmarking. PLoS Comput. Biol. 2012, 8, e1002829. [Google Scholar] [CrossRef]
  33. Sweredoski, M.J.; Baldi, P. PEPITO: Improved discontinuous B-cell epitope prediction using multiple distance thresholds and half sphere exposure. Bioinformatics 2008, 24, 1459–1460. [Google Scholar] [CrossRef] [Green Version]
  34. Ponomarenko, J.; Bui, H.H.; Fusseder, N.; Bourne, P.E.; Sette, A.; Peters, B. ElliPro: A new structure-based tool for the prediction of antibody epitopes. BMC Bioinform. 2008, 9, 514. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Liang, S.; Liu, S.; Zhang, C.; Zhou, Y. A simple reference state makes a significant improvement in near-native selections from structurally refined docking decoys. Proteins 2007, 69, 244–253. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Liang, S.; Zheng, D.; Standley, D.M.; Yao, B.; Zacharias, M.; Zhang, C. EPSVR and EPMeta: Prediction of antigenic epitopes using support vector regression and multiple server results. BMC Bioinform. 2010, 11, 381. [Google Scholar] [CrossRef] [Green Version]
  37. Pettersen, E.F.; Goddard, T.D.; Huang, C.C.; Couch, G.S.; Greenblatt, D.M.; Meng, E.C.; Ferrin, T.E. UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 2004, 25, 1605–1612. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Agenbach, E.; Tiemessen, C.T.; Venter, M. Amino acid variation within the fusion protein of respiratory syncytial virus subtype A and B strains during annual epidemics in South Africa. Virus Genes 2005, 30, 267–278. [Google Scholar] [CrossRef] [PubMed]
  39. Chi, H.; Liu, H.F.; Weng, L.C.; Wang, N.Y.; Chiu, N.C.; Lai, M.J.; Lin, Y.C.; Chiu, Y.Y.; Hsieh, W.S.; Huang, L.M. Molecular epidemiology and phylodynamics of the human respiratory syncytial virus fusion protein in northern Taiwan. PLoS ONE 2013, 8, e64012. [Google Scholar]
  40. Gaunt, E.R.; Jansen, R.R.; Poovorawan, Y.; Templeton, K.E.; Toms, G.L.; Simmonds, P. Molecular epidemiology and evolution of human respiratory syncytial virus and human metapneumovirus. PLoS ONE 2011, 6, e17427. [Google Scholar] [CrossRef] [PubMed]
  41. Tapia, L.I.; Shaw, C.A.; Aideyan, L.O.; Jewell, A.M.; Dawson, B.C.; Haq, T.R.; Piedra, P.A. Gene sequence variability of the three surface proteins of human respiratory syncytial virus (HRSV) in Texas. PLoS ONE 2014, 9, e90786. [Google Scholar]
  42. Aso, J.; Kimura, H.; Ishii, H.; Saraya, T.; Kurai, D.; Matsushima, Y.; Nagasawa, K.; Ryo, A.; Takizawa, H. Molecular evolution of the fusion protein (F) gene in human respirovirus 3. Front. Microbiol. 2020, 10, 3054. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Holmes, E.C. Virus evolution. In Fields Virology, 6th ed.; Knipe, D.M., Howley, P.M., Cohen, J.I., Griffin, D.E., Lamb, R.A., Martin, M.A., Racaniello, V.D., Roizman, B., Eds.; Lippincott Williams & Wilkins: Philadelphia, PA, USA, 2013; Volume 1, pp. 286–313. [Google Scholar]
  44. Sharon, J.; Rynkiewicz, M.J.; Lu, Z.; Yang, C.Y. Discovery of protective B-cell epitopes for development of antimicrobial vaccines and antibody therapeutics. Immunology 2014, 142, 1–23. [Google Scholar] [CrossRef] [PubMed]
  45. Adams, O.; Bonzel, L.; Kovacevic, A.; Mayatepek, E.; Hoehn, T.; Vogel, M. Palivizumab-resistant human respiratory syncytial virus infection in infancy. Clin. Infect. Dis. 2010, 51, 185–188. [Google Scholar] [CrossRef] [PubMed]
  46. Wegzyn, C.; Toh, L.K.; Notario, G.; Biguenet, S.; Unnebrink, K.; Park, C.; Makari, D.; Norton, M. Safety and effectiveness of palivizumab in children at high risk of serious disease due to respiratory syncytial virus infection: A systematic review. Infect. Dis. Ther. 2014, 3, 133–158. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Time-scaled phylogenetic tree for the RSV F gene constructed using Bayesian Markov chain Monte Carlo method. The scale bar represents time (years). Blue bars indicate the 95% highest probability density (HPD) for a branched year.
Figure 1. Time-scaled phylogenetic tree for the RSV F gene constructed using Bayesian Markov chain Monte Carlo method. The scale bar represents time (years). Blue bars indicate the 95% highest probability density (HPD) for a branched year.
Viruses 13 02525 g001
Figure 2. Bayesian skyline plots for the RSV-A F gene. Each panel illustrates the phylodynamics of all RSV-A strains (a), genotype GA1 (b), genotype GA2 (c), genotype GA3 (d), genotype GA4 (e), genotype GA5 (f), genotype GA7 (g), and genotype NA1 (h). The y-axis and x-axis indicate the effective population size and the time in years, respectively. The thick blue line shows the median value over time. 95% HPD intervals are represented in a thin blue line. As noted, the genotype GA6 was not examined due to small strain numbers (six strains).
Figure 2. Bayesian skyline plots for the RSV-A F gene. Each panel illustrates the phylodynamics of all RSV-A strains (a), genotype GA1 (b), genotype GA2 (c), genotype GA3 (d), genotype GA4 (e), genotype GA5 (f), genotype GA7 (g), and genotype NA1 (h). The y-axis and x-axis indicate the effective population size and the time in years, respectively. The thick blue line shows the median value over time. 95% HPD intervals are represented in a thin blue line. As noted, the genotype GA6 was not examined due to small strain numbers (six strains).
Viruses 13 02525 g002aViruses 13 02525 g002b
Figure 3. Similarity plot analysis of the F gene across all RSV-A strains. Nucleotide similarity to the prototype strain (Long strain, GenBank accession no. JX198112) was calculated using SimPlot analysis. Nucleotide position numbers correspond to the F gene in the prototype strain. The cleavage sites and the positions of each F1 and F2 subunits are shown below the graph [10]. SP, signal peptide; DI-DIII, domains I-III; HRA-HRC, heptad repeat A-C; p27, excised peptide; FP, fusion peptide; TM, transmembrane anchor; Tail, cytoplasmic tail.
Figure 3. Similarity plot analysis of the F gene across all RSV-A strains. Nucleotide similarity to the prototype strain (Long strain, GenBank accession no. JX198112) was calculated using SimPlot analysis. Nucleotide position numbers correspond to the F gene in the prototype strain. The cleavage sites and the positions of each F1 and F2 subunits are shown below the graph [10]. SP, signal peptide; DI-DIII, domains I-III; HRA-HRC, heptad repeat A-C; p27, excised peptide; FP, fusion peptide; TM, transmembrane anchor; Tail, cytoplasmic tail.
Viruses 13 02525 g003
Figure 4. Distribution of the phylogenetic distances between the full-length sequences of the F gene of all RSV-A strains. The y-axis represents the number of sequence pairs corresponding to each distance. The x-axis shows phylogenetic distances.
Figure 4. Distribution of the phylogenetic distances between the full-length sequences of the F gene of all RSV-A strains. The y-axis represents the number of sequence pairs corresponding to each distance. The x-axis shows phylogenetic distances.
Viruses 13 02525 g004
Figure 5. Structural models of the prefusion F protein of Long strain (a), genotype GA1 (b), genotype GA2 (c), and genotype NA1 (d). Chains of the trimeric structures are colored in light gray (chain A), dim gray (chain B), and black (chain C). Amino acid substitutions sites of chain A for each variant strain relative to the prototype strain are shown in green, and palivizumab epitopes are shown in red. The predicted conformational epitopes were indicated by cyan.
Figure 5. Structural models of the prefusion F protein of Long strain (a), genotype GA1 (b), genotype GA2 (c), and genotype NA1 (d). Chains of the trimeric structures are colored in light gray (chain A), dim gray (chain B), and black (chain C). Amino acid substitutions sites of chain A for each variant strain relative to the prototype strain are shown in green, and palivizumab epitopes are shown in red. The predicted conformational epitopes were indicated by cyan.
Viruses 13 02525 g005
Table 1. The divergence year of RSV-A genotypes.
Table 1. The divergence year of RSV-A genotypes.
VirusGenotypeDiverged Year (95%HPD)Strain Numbers
RSV-AGA11943 (1937–1948)35
GA41970 (1967–1973)17
GA51975 (1973–1977)174
GA61977 (1974–1979)6
GA71977 (1974–1979)34
GA31979 (1978–1981)28
GA21988 (1987–1991)77
NA11994 (1994–1997)1092
RSV-B1766 (1734–1794)1
Bovine-RSV1563 (1504–1624)1
Table 2. Predicted conformational epitopes and amino acid residues corresponding to the prototype strain (Long strain).
Table 2. Predicted conformational epitopes and amino acid residues corresponding to the prototype strain (Long strain).
Chain ABC
Residue 656667682102116566676820921021165666768209210211
Prototype strainKENKQSKENKKQSKENKKQS
GenotypeNA1....................
GA1....................
GA2....................
GA3....................
GA4....................
GA5....................
GA6....................
GA7....................
Shaded regions reflect the putative epitopes of each genotype. Amino acid residues of each genotype identical to the prototype strain are indicated by dots.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Saito, M.; Tsukagoshi, H.; Sada, M.; Sunagawa, S.; Shirai, T.; Okayama, K.; Sugai, T.; Tsugawa, T.; Hayashi, Y.; Ryo, A.; et al. Detailed Evolutionary Analyses of the F Gene in the Respiratory Syncytial Virus Subgroup A. Viruses 2021, 13, 2525. https://doi.org/10.3390/v13122525

AMA Style

Saito M, Tsukagoshi H, Sada M, Sunagawa S, Shirai T, Okayama K, Sugai T, Tsugawa T, Hayashi Y, Ryo A, et al. Detailed Evolutionary Analyses of the F Gene in the Respiratory Syncytial Virus Subgroup A. Viruses. 2021; 13(12):2525. https://doi.org/10.3390/v13122525

Chicago/Turabian Style

Saito, Mariko, Hiroyuki Tsukagoshi, Mitsuru Sada, Soyoka Sunagawa, Tatsuya Shirai, Kaori Okayama, Toshiyuki Sugai, Takeshi Tsugawa, Yuriko Hayashi, Akihide Ryo, and et al. 2021. "Detailed Evolutionary Analyses of the F Gene in the Respiratory Syncytial Virus Subgroup A" Viruses 13, no. 12: 2525. https://doi.org/10.3390/v13122525

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop