Next Article in Journal
Genetic Diversity of the LTR Region of Polish SRLVs and Its Impact on the Transcriptional Activity of Viral Promoters
Previous Article in Journal
The COVID-19 Pandemic—A Potential Role for Antivirals in Mitigating Pandemics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparative Surface Electrostatics and Normal Mode Analysis of High and Low Pathogenic H7N7 Avian Influenza Viruses

Synthetic Biology and Biotechnology Unit, Department of Biology, University of Padua, Via Ugo Bassi, 58/B, 35131 Padua, Italy
*
Author to whom correspondence should be addressed.
Viruses 2023, 15(2), 305; https://doi.org/10.3390/v15020305
Submission received: 21 December 2022 / Revised: 18 January 2023 / Accepted: 19 January 2023 / Published: 21 January 2023
(This article belongs to the Section Animal Viruses)

Abstract

:
Influenza A viruses are rarely symptomatic in wild birds, while representing a higher threat to poultry and mammals, where they can cause a variety of symptoms, including death. H5 and H7 subtypes of influenza viruses are of particular interest because of their pathogenic potential and reported capacity to spread from poultry to mammals, including humans. The identification of molecular fingerprints for pathogenicity can help surveillance and early warning systems, which are crucial to prevention and protection from such potentially pandemic agents. In the past decade, comparative analysis of the surface features of hemagglutinin, the main protein antigen in influenza viruses, identified electrostatic fingerprints in the evolution and spreading of H5 and H9 subtypes. Electrostatic variation among viruses from avian or mammalian hosts was also associated with host jump. Recent findings of fingerprints associated with low and highly pathogenic H5N1 viruses, obtained by means of comparative electrostatics and normal modes analysis, prompted us to check whether such fingerprints can also be found in the H7 subtype. Indeed, evidence presented in this work showed that also in H7N7, hemagglutinin proteins from low and highly pathogenic strains present differences in surface electrostatics, while no meaningful variation was found in normal modes.

1. Introduction

Influenza A viruses can infect both birds and mammals, representing an economic threat to poultry farming, where millions of birds are routinely culled to block transmission and, less frequently, dangerous risk to human health. The worst (historically recorded) pandemic event in humans was the H1N1 influenza of 1918 (Spanish Flu: 20–100 million deaths worldwide, depending on estimates), while the H2N2 (1957) and the H3N2 (1968) pandemics caused 1 million deaths, i.e., two/threefold the baseline yearly death toll associated with seasonal influenza [1,2,3,4]. “Highly pathogenic avian influenza” (HPAI) and “low pathogenic avian influenza” (LPAI) were first defined to classify flu viruses in 1981, in the 1st International Symposium on Avian Influenza [5]. In 1997, a poultry to human host jump occurred with H5N1 in Hong Kong [6].
HPAI are found only among H5 and H7 subtypes, and these strains arise from LPAI ones; no naturally transmitted virus of the other subtypes has a mortality rate high enough to meet the classification criteria of highly pathogenic [1]. Zoonotic infections of humans by H5 and H7 viruses can result in pneumonia, acute respiratory distress syndrome (ARDS) and even death [7,8]. However, if an H5/H7 virus were to evolve the ability for direct, human-to-human airborne transmission, this could lead to devastating pandemic events like the Spanish flu [9], and this makes it extremely important to identify molecular fingerprints underlying host jump and pathogenicity shift.
The two major capsid proteins, hemagglutinin (HA) and neuraminidase, play a major role in the interaction of avian influenza (AI) viruses and host cells, as HA binds sialic acid (SA) as a functional receptor, and HA linkage to either α2,3-linked SA (avian) or α2,6-linked SA (mammalian/human) is a well-known host determinant [10,11,12].
However, sequence-based prediction of SA-binding efficacy is difficult, because even a few mutations may change HA specificity. Neuraminidase can also influence host specificity and respiratory-droplet transmissibility [13]. HA is also crucial to AI pathogenicity; however, other surface proteins such as neuraminidase and M1 also play a role, and even variation of internal proteins such as the NP can influence the pathogenicity [14]. Indeed, the pathogenic potential of a virus is a complex matter, as demonstrated by a study on rebuilding of the 1918 Spanish flu: full virulence potential could be achieved only with all of the original eight gene segments together, and any replacement lowered the virulence in mice [15].
A molecular signature for the LPAI to HPAI shift is an increased number of basic amino acids in the HA polybasic cleavage site or polybasic motif [16]. Changes in the surface features of HA have been reported to play an important role in the evolution and spreading of both H5 and H9 subtype clades [17,18], as well as in modulation of AI host specificity [19,20,21,22], while their influence on pathogenic shift is still unclear.
The recent finding in H5N1 of surface fingerprints specific to LPAI or HPAI [23] prompted us to extend the investigation and check whether such pathogenicity-related fingerprints are restricted to the H5 subtype, or can also be found in H7, possibly representing a pathogenicity hallmark of AI viruses. Comparative analysis of surface electrostatics has been able to identify fingerprints in clade evolution [17,18], as well as to suggest profiles in host specificity [24]. When surface electrostatics was also applied for the comparative analysis of LPAI and HPAI in H5N1, once again it was able to suggest fingerprints; moreover, normal mode analysis (NMA) provided further information, as fluctuation profiles unveiled an intriguing LPAI/HPAI difference at the 110-helix region [23].
Results from this work seem to confirm in H7 subtype that LPAI and HPAI show differences in the electrostatic profiles, while no meaningful variation was unveiled by normal modes analysis.

2. Materials and Methods

2.1. Structure Modeling, Refinement, Assessment, and Comparison

HA protein sequences were obtained from the NIAID Influenza Research Database (IRD) [25], which is available online [26], and subsequently screened for keeping only non-redundant sequences.
The target protein sequences were modeled by homology modeling using MODELLER [27,28] on the best available structure templates. Two models were built for each sequence; then, the GA341 score [29], which uses percentage of sequence identity between template and model as parameter, and the discrete optimized protein energy (DOPE) score [30], which assesses the energy function, were calculated for each model in order to assess its quality. Due to the very high identity between all templates and models, the lowest DOPE score was selected as a discriminant in choosing the best model.
Refinement of the structures was performed through SCWRL4 [31], which integrates a rotamer library, a frequency-dependent energy function based on rotamer frequencies and a repulsive steric energy term, and a graph decomposition in order to calculate the optimal side-chain packing. Finally, each model quality was checked via the QMEAN server [32] using the QMEAN6 composite scoring function [33]. Refined protein structures were viewed using the UCSF Chimera 1.16 software [34].
PDBeFold [35] was used to perform structural alignment through rigid superposition and to calculate root mean square deviation (RMSD), identity, and Q-score between structures.

2.2. Electrostatic Potential Analysis

Comparative analysis of electrostatic potentials was performed with the Adaptive Poisson–Boltzmann Solver (APBS) [36] at the APBS-PDB2PQR software suite web server [37]. PDB2PQR automates preparation of structures for subsequent analysis, adding atomic charge and radius parameters to PDB data and estimating titration states and protonation. APBS solves the equations of continuum electrostatics for biomolecular assemblages, calculating electrostatic and solvation properties, through the implicit solvent model Poisson–Boltzmann (PB), which provides a global solution suited to visualization and structural analyses. Parameters were chosen according to the previous literature [17,18,23].
Spatial distribution of the electrostatic potential was calculated at physiological ionic strength (I) = 150 mM, assuming +1/−1 charges for the counter-ions. Partial charges and van der Waals radii were assigned via PDB2PQR according to the PARSE force field [38], choosing an interior ε p = 2 for proteins and ε s = 78 for solvent. Temperature was set at T = 298.15 K. Probe radius for dielectric surface and ion accessibility surface were set to be r = 1.4 Å and r = 2.0 Å. Isopotential contours were plotted at ±1 kBT/e and viewed using UCSF Chimera [34].
Electrostatic distance (ED) matrices were calculated by protein interaction property similarity analysis (PIPSA) [39] with Hodgkin indexes at the webPIPSA server [40]. PIPSA provides calculation of the protein electrostatic potentials, calculates similarity indices for all pairs of proteins based on the electrostatic similarity, and converts the similarity indices into electrostatic “distances” that are visualized through heat maps and epograms (tree-like diagrams based on electrostatic differences). The ED between two molecules a and b is calculated with the following equation:
E D a , b = 2 2 S I a , b    
where the Hodgkin similarity index (SI) is provided in turn by:
S I a , b = 2 p a , p b p a , p a + p b , p b
where (pa, pb), (pa, pa), (pb, pb) are the scalar products of the electrostatic potentials over the region where the potentials are compared. Visualization of the clusters and generation of the trees were either provided by webPIPSA or performed by the statistical program R [41], which is based on webPIPSA calculations.

2.3. Normal Modes Analysis (NMA)

Protein structures used as templates for modeling were retrieved from the Protein Data Bank (PDB) at the RCSB PDB website [42]. Structures used for modeling target H7N7 HA sequences were PDB structures 4DJ6 (resolution 2.61 Å), from strain A/Netherlands/219/2003 (H7N7), and, at the beginning of this work, 6N5A (resolution 3.30 Å), from strain A/equine/NY/49/73 (H7N7), which was then replaced by 7T1V as an HA structure from the same strain, but with improved resolution (2.05 Å).
High confidence homology modeling was performed as illustrated above; however, five models were created for each protein sequence, and a very thorough optimization with the variable target function method (VTFM) and slow molecular dynamics (MD) and simulated annealing (SA) were performed, completing a total of 300 iterations of VTFM optimization. Model refining with SCWRL4 and quality checking through QMEAN were performed as previous methodological set-ups.
Single and comparative NMA analyses were performed with WebNM@ [43], which uses the coarse-grained elastic network model (ENM) of a pdb file to calculate low-frequency normal modes of single pre-aligned protein structures or sets thereof. ENM is a reliable and computationally cost-effective way to analyze protein flexibility and dynamics as well as to highlight differences between protein structures that normal comparison, such as the RMSD, may overlook.
The protein is represented as a string of C α atoms, and the interaction between two atoms is described by the pair potential U i j , as shown in the following equation:
U i j R = k R i j 0 ( R i j R i j 0 2 )
where ri and rj are the positions of residues i and j in the current conformation of the protein, the superscript 0 denotes the equilibrium conformation, and k is the force constant for the spring connecting residues i and j.
In single mode, dynamic cross-correlation matrices (DCCMs) are calculated, which help identify correlated and anticorrelated protein motions. DCCMs are two-dimensional matrices with a value range from 1 for completely correlated motions to −1 for anticorrelated motion. Coupling between two C α atoms i and j in the DCCM is given by the following equation:
C i j = m = 1 M X m i X m j m = 1 M 1 y m X m i X m i   m = 1 M 1 y m X m j X m j  
where X m and y m represent the eigenvectors and eigenvalues of the m t h normal mode. Default settings for WebNM@ were used according to previous work [23].
Dynamic similarity was also investigated through the Bhattacharyya coefficient (BC) and root mean square inner product (RMSIP). BC values range from 0 to 1, with 1 being the maximum overlap between the collective dynamics of the aligned proteins. RMSIP is a measure of the similarity between two sets of modes, calculated from the C α atom fluctuations for the lowest normal modes, as follows:
R M S I P = 1 n i = 1 n j = 1 n X i Y i 2
RMSIP ranges from 0 to 1, with 1 being the maximum similarity in C α fluctuations. WebNM@ further provides a graph of normalized C α fluctuations calculated as the sum of the C α atom displacements in each mode, weighted by the inverse of their corresponding eigenvalues. WebNM@ utilizes the first 200 modes to calculate the graph, from which flexible protein regions can be inferred by inspecting the peaks of the graph.

3. Results

3.1. Preliminary Comparison of a Mixed Dataset of H7N7 Hemagglutinin Proteins

A total of 247 H7N7 HA sequences were collected from UniProtKB [44], the Influenza Research database [26], and GISAID [45]. LPAI sequences were massively overrepresented with a total of 233 sequences, while only 14 HPAI sequences could be found, of which only 12 were from different outbreaks. Redundant sequences and sequences from the same outbreak were not utilized in the same set.
Structural models were obtained via homology modeling with high confidence, due to the high identity value between template and targets, with an average identity of 95.0% between targets and 4DJ6 and of 83.9% between targets and 6N5A/7T1V. Models were further refined, finally showing no atom clashes. QMEAN analysis showed extremely high-quality values for all refined models, with the GLOBAL quality score ranging between 0.86 and 0.91, with an uncertainty of 0.06.
A preliminary analysis of root mean square deviation (RMSD) was performed with PDBeFOLD, which provides structural alignments, calculates RMSD, and identities percentages. Two sub-datasets were used, namely low and highly pathogenic strains, for comparison.
Sequence identities, RMSD and Q-scores calculated for the HA receptor binding domain (RBD) from HPAI and LPAI H7N7 viruses are reported in Table 1 and Table 2.
RMSD and identity analysis were performed to show that, in accordance with previous research, eventual fingerprints specific to low or high pathogenicity are based on very minor sequence/structure changes and surface features, rather than on relevant structural differences.
An RMSD value under 1 Å is commonly considered a cut-off for different structures [46], and RMSD values in Figure 1 and Figure 2 are largely under 1 Å and very close to 0, indicating a very strong structural conservation. This is not surprising, while being in agreement with a similarly very high sequence identity. Since the RBD is the most variable part of HA, even higher RMSD values were obtained when comparing the whole HA, whose stalk sequence is largely identical and less subjected to mutations. Overall RMSD was 0.1365 for structure 4DJ6, with an overall Q-score of 0.9979; overall RMSD for structure 6N59 was 0.1314, with an overall Q-score of 0.9981.

3.2. Surface Electrostatics of HA RBDs from HPAI and LPAI Strains of H7N7

According to previous work [17,18,23,24], a comparative electrostatic analysis of 20 representative RBD structures (from 10 LPAI and 10 HPAI H7N7 viruses) was performed. The spatial distribution of the electrostatic potential was calculated at I = 150 mM, and +1/−1 charges were assumed for counter-ions. Partial charges and van der Waals radii were assigned with PDB2PQR, then the APBS webserver was used to carry out calculations with the Poisson–Boltzman (PB) method. Electrostatic distance (ED) was calculated at the WebPIPSA (Protein Interaction Property Similarity Analysis) server, obtaining, for each protein, a matrix of pairwise similarity indices visualized through a heatmap.
Calculations were performed on mixed model structures (6N5A for HPAI and 4DJ6 for LPAI strains) and on single model structures (each structure modeled with 6N5A and 4DJ6) using both Hodgkin and Carbo indexes. The heatmap for structures modeled with single template 4DJ6 is presented in Figure 1 (15), while the heatmap for mixed models is presented in Figure 2 (16).
Both heatmaps in Figure 1 and Figure 2 were built using the Hodgkin index, as no difference was found when using the Carbo index, and hence, the corresponding heatmaps are not shown.
In a previous study, intriguing differences between HA RBDs from high and low pathogenicity H5N1 strains were found, both in terms of total net charge and isocontour [23]. The difference in net charge among HPAI and LPAI H5N1 viruses was not just dependent on the positively charged residues at the HA polybasic cleavage site of HPAI, as this region is not part of the RBD, and hence, was excluded from the models. The vestigial esterase subdomain (VED) and the 110-helix region were found to show the most evident difference between the two subgroups, and this difference in the 110-helix was confirmed at the fluctuation profile level, using normal mode analysis [23].
Figure 3 highlights differences in the electrostatic isocontours of eight representative H7N7 strains: four HPAI and four LPAI. Four 90° stepwise orientations are shown, as differences may occur on each different “side” of the RBD.
The VED region, framed by an open circle (see 0° view) for comparison to H5 subtype, does not show in H7N7 an LPAI-HPAI difference as evident as that found in H5N1. In the framed VED subregion, both H7 LPAI and HPAI have a mix of negative and positive charges, like the H5N1 LPAI.
However, two LPAI to HPAI mutations, K122E and K182N, result in the evident negativization/ depositivization of a large RBD surface (framed in the figure by an open rectangle, see 180° view), clearly becoming more negative (red) in HPAI. Such a fingerprint is differentially conserved in H7N7 viruses, both in the eight representative HPAI and LPAI depicted in Figure 3, and in the other HPAI and LPAI H7 viruses that are not shown.
Intriguingly, mutation K122E, where charge shifts from +1 to −1, occurs at the most C-terminal residue of the 110-helix, with further depositivization being favored by close mutation K182N.

3.3. Normal Mode Analysis

NMA of HA RBDs from LPAI and HPAI H7N7 strains was performed using WebNM@. To avoid statistical bias, 10 different sets of 20 RBD sequences were used (the maximum amount that can be compared with WebNM@ due to computational constraints), each with 10 LPAI and 10 HPAI (randomly selected).
Normal modes were analyzed by the Bhattacharyya coefficient (BC) heatmap, root mean square inner product (RMSIP), dynamic cross-correlation matrices (DCCM) and finally with fluctuation analysis of single amino acids.
As shown by the BC and RMSIP of the 20 chosen sequences, presented in Figure 4 and Figure 5, respectively, the structures sharply segregate across pathogenicity (rather than based on phylogenetic distance, not shown). The color code varies from lighter green (higher divergency) to darker green (closest structures).
However, such results might only apparently be positive.
Indeed, both the BC and RMSIP present a high grade of closeness for all structures, ranging from 0.95 to 1, so we can infer that the dynamics between LPAI and HPAI viruses vary marginally in H7N7, even less than in H5N1, where the reported coefficient is 0.92 [23]. Moreover, models for LPAI and HPAI were obtained using two different templates, and hence, sorting might depend on differences in the template rather than on pathogenicity-related variation.
Therefore, to check the validity of the NMA clustering, and according to previous work, all 20 sequences were modeled on a single template, either 4DJ6 or 6N5A, and this caused pathogenicity-specific sorting to be lost (not shown), suggesting NMA is not applicable to this H7 subtype dataset, possibly because of the aforementioned high closeness values.
Nevertheless, correlation matrices were also calculated for each sequence, and representative pairs for LPAI and HPAI were chosen for visualization purposes. In all instances, correlation matrices were found to be nearly identical, with strongly correlated motions, occurring through the RBD, and weak anticorrelated motions (not shown).
When focusing on the 110-helix, as both an important determinant in the VED region of the hemagglutinin and the region having shown LPAI-HPAI variation in H5N1 [23], once again, no meaningful difference could be highlighted (not shown).

4. Discussion

Sequence divergence does not always correspond to structural divergence; two proteins with high sequence similarity may show very different properties considering recognition of epitopes by specific antibodies, and two proteins may share antigenic properties even with highly divergent sequences if epitopes are conserved. Local changes in accessible surface area, electrostatic potential, hydropathy or hydrophilicity features can change motif functionality.
Sequence-based studies are fundamental to highlight conservation—and differences—of amino acids close in sequence, but are limited in highlighting “group variation”, namely, identifying relationships of multiple mutations, especially when they are far from each other in primary sequence but close in 3D protein structure.
Surface features of biomolecules such as proteins are crucial to finely tune their interactions, and previous work demonstrated that comparative electrostatic and normal mode analyses can unveil functional fingerprints.
Such analyses, performed in this work, used the RBD for comparison as it is the most variable HA region, hence the best candidate to highlight differences across strains. The proteolytic and glycosylation sites were deliberately excluded, as differences at these sites between LPAI and HPAI are already known and reported. Difference in the cleavage site is extensively reported in the literature as the main discriminant for predicting pathogenicity other than direct infection of birds; however, despite its importance, it is unable to predict pathogenicity alone.
RBD “charge redistribution” has been found as a general feature of AI virus evolution and spread. This hallmark was first observed in a 2014 study [17] concerning H5N1 clades, with a denegativization of the VED associated by negativization (or depositivation) of the RBD in still-circulating clades. Due to the strong antigenic feature of both the RBD and VED subdomains, and the fact that local charge concentration is typical for antigenic epitopes, it was speculated that charge redistribution might have contributed to antigenic escape, and hence to their evolutionary success and spread.
The study was replicated in H9N2 viruses [18], with in-depth analysis that revealed structural agreement between H5 and H9 observations, showing similarity between the two HA subtypes.
Charge redistribution and mutation especially occurred in sub-regions important to sialic acid binding, host specificity and immune escape/antigenic drift: 110-helix, 130-loop, 190-helix and 220-loop. Intriguingly, variation at the 110-helix was then found to be also a potential fingerprint for LPAI-HPAI difference in H5N1, and this prompted us to check whether this would also happen with H7N7. The rationale for this is that H5, H9 and H7 viruses are studied as potential pandemic agents, all having been reported to infect mammals and humans; H5 and H7 in particular, can develop HPAI strains.
However, two main problems complicated this analysis when performed with H7. First, a general and structured clade classification for H7N7 is still missing; second, <250 non-redundant, complete nucleotide sequences of H7N7 HA are available at the Influenza Research Database, compared to >6500 H5N1 and >1600 H9N2 sequences. This notwithstanding, results from this work seem to confirm, also in the H7 subtype, that comparative electrostatic analysis can highlight meaningful differences and suggest fingerprints.
Recently, the infectivity and immune escape of another pandemic virus, SARS-CoV-2, were also found to be influenced by variation in surface electrostatics of the spike protein, as reported by comparative analysis of the SARS-CoV-2 variants [47,48,49].Lessons from comparative studies on hemagglutinin and spike protein features suggest that variation in surface electrostatics of the capsid proteins that mediate binding to the host cell functional receptors is a driving force and a functional fingerprint in the evolution of pandemic viruses.
Concerning NMA, negative results with H7 might depend on the poor dataset of sequences, on their very high structural closeness, as well as on bias derived from a very limited number of template structures for modeling. To clarify these aspects, further research entailing the use of a more complete dataset is needed.

5. Conclusions

Surveillance of influenza A virus is crucial, in terms of human and animal health and economy. This work sought to shed light on H7N7 influenza virus hemagglutinin fingerprints. Using an established computational framework, we were able to find some fingerprints. In particular, we noticed some mutations in the shift from LPAI to HPAI, able to change the shape of isocontour in the visual inspection. As for the H5N1 virus, electrostatic clustering sharply splits H7N7 in LPAI and HPAI strains. Moreover, H7N7 exhibits a behavior similar to that of H5N1: the RBD is subjected to “negativization” and “depositivization” events. Again, we can infer that charge redistribution may be involved in pathogenicity and immune escape
This work suggests that the residue 122 near the antigenic site 110-helix and the residue 182 may be involved in the pathogenicity shift from LPAI to HPAI. These findings can be useful for surveillance activity and vaccine design.

Author Contributions

Conceptualization, F.F., I.R.; methodology, I.R.; validation and formal analysis, I.R., F.F.; investigation, G.B.; writing—original draft preparation, G.B., F.F.; writing—review and editing, I.R., F.F.; supervision, F.F., I.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Acknowledgments

We thank Chiara Romualdi for useful suggestions on data analysis and statistics.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sutton, T.C. The Pandemic Threat of Emerging H5 and H7 Avian Influenza Viruses. Viruses 2018, 10, 461. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Poland, G.A.; Jacobson, R.M.; Targonski, P.V. Avian and pandemic influenza: An overview. Vaccine 2007, 25, 3057–3061. [Google Scholar] [CrossRef] [PubMed]
  3. Kessler, S.; Harder, T.C.; Schwemmle, M.; Ciminski, K. Influenza A Viruses and Zoonotic Events-Are We Creating Our Own Reservoirs? Viruses 2021, 13, 2250. [Google Scholar] [CrossRef] [PubMed]
  4. Paget, J.; Spreeuwenberg, P.; Charu, V.; Taylor, R.J.; Iuliano, A.D.; Bresee, J.; Simonsen, L.; Viboud, C.; Global Seasonal Influenza-associated Mortality Collaborator Network Team; GLaMOR Collaborating Team. Global mortality associated with seasonal influenza epidemics: New burden estimates and predictors from the GLaMOR Project. J. Glob. Health 2019, 9, 020421. [Google Scholar] [CrossRef] [PubMed]
  5. Lupiani, B.; Reddy, S.M. The history of avian influenza. Comp. Immunol. Microbiol. Infect. Dis. 2009, 32, 311–323. [Google Scholar] [CrossRef] [PubMed]
  6. Lai, S.; Qin, Y.; Cowling, B.J.; Ren, X.; Wardrop, N.A.; Gilbert, M.; Tsang, T.K.; Wu, P.; Feng, L.; Jiang, H.; et al. Global epidemiology of avian influenza A H5N1 virus infection in humans, 1997-2015: A systematic review of individual case data. Lancet Infect. Dis. 2016, 16, e108–e118. [Google Scholar] [CrossRef] [Green Version]
  7. To, K.K.; Ng, K.H.; Que, T.L.; Chan, J.M.; Tsang, K.Y.; Tsang, A.K.; Chen, H.; Yuen, K.Y. Avian influenza A H5N1 virus: A continuous threat to humans. Emerg. Microbes Infect. 2012, 1, e25. [Google Scholar] [CrossRef]
  8. Gao, H.N.; Lu, H.Z.; Cao, B.; Du, B.; Shang, H.; Gan, J.H.; Lu, S.H.; Yang, Y.D.; Fang, Q.; Shen, Y.Z.; et al. Clinical findings in 111 cases of influenza A (H7N9) virus infection. N. Engl. J. Med. 2013, 368, 2277–2285. [Google Scholar] [CrossRef] [Green Version]
  9. Richard, M.; Fouchier, R.A. Influenza A virus transmission via respiratory aerosols or droplets as it relates to pandemic potential. FEMS Microbiol. Rev. 2016, 40, 68–85. [Google Scholar] [CrossRef]
  10. Obadan, A.O.; Santos, J.; Ferreri, L.; Thompson, A.J.; Carnaccini, S.; Geiger, G.; Gonzalez Reiche, A.S.; Rajão, D.S.; Paulson, J.C.; Perez, D.R. Flexibility In Vitro of Amino Acid 226 in the Receptor-Binding Site of an H9 Subtype Influenza A Virus and Its Effect In Vivo on Virus Replication, Tropism, and Transmission. J. Virol. 2019, 93, e02011–e02018. [Google Scholar] [CrossRef]
  11. Thompson, A.J.; Paulson, J.C. Adaptation of influenza viruses to human airway receptors. J. Biol. Chem. 2021, 296, 100017. [Google Scholar] [CrossRef]
  12. Zhao, C.; Pu, J. Influence of Host Sialic Acid Receptors Structure on the Host Specificity of Influenza Viruses. Viruses 2022, 14, 2141. [Google Scholar] [CrossRef] [PubMed]
  13. Yen, H.L.; Liang, C.H.; Wu, C.Y.; Forrest, H.L.; Ferguson, A.; Choy, K.T.; Jones, J.; Wong, D.D.; Cheung, P.P.; Hsu, C.H.; et al. Hemagglutinin-neuraminidase balance confers respiratory-droplet transmissibility of the pandemic H1N1 influenza virus in ferrets. Proc. Natl. Acad. Sci. USA 2011, 108, 14264–14269. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Snyder, M.H.; Buckler-White, A.J.; London, W.T.; Tierney, E.L.; Murphy, B.R. The avian influenza virus nucleoprotein gene and a specific constellation of avian and human virus polymerase genes each specify attenuation of avian-human influenza A/Pintail/79 reassortant viruses for monkeys. J. Virol. 1987, 61, 2857–2863. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Tumpey, T.M.; Basler, C.F.; Aguilar, P.V.; Zeng, H.; Solórzano, A.; Swayne, D.E.; Cox, N.J.; Katz, J.M.; Taubenberger, J.K.; Palese, P.; et al. Characterization of the reconstructed 1918 Spanish influenza pandemic virus. Science 2005, 310, 77–80. [Google Scholar] [CrossRef] [Green Version]
  16. De Bruin, A.C.M.; Funk, M.; Spronken, M.I.; Gultyaev, A.P.; Fouchier, R.A.M.; Richard, M. Hemagglutinin Subtype Specificity and Mechanisms of Highly Pathogenic Avian Influenza Virus Genesis. Viruses 2022, 14, 1566. [Google Scholar] [CrossRef]
  17. Righetto, I.; Milani, A.; Cattoli, G.; Filippini, F. Comparative structural analysis of haemagglutinin proteins from type A influenza viruses: Conserved and variable features. BMC Bioinform. 2014, 15, 363. [Google Scholar] [CrossRef] [Green Version]
  18. Heidari, A.; Righetto, I.; Filippini, F. Electrostatic Variation of Haemagglutinin as a Hallmark of the Evolution of Avian Influenza Viruses. Sci. Rep. 2018, 8, 1929. [Google Scholar] [CrossRef] [Green Version]
  19. Vines, A.; Wells, K.; Matrosovich, M.; Castrucci, M.R.; Ito, T.; Kawaoka, Y. The role of influenza A virus hemagglutinin residues 226 and 228 in receptor specificity and host range restriction. J. Virol. 1998, 72, 7626–7631. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Gambaryan, A.; Tuzikov, A.; Pazynina, G.; Bovin, N.; Balish, A.; Klimov, A. Evolution of the receptor binding phenotype of influenza A (H5) viruses. Virology 2006, 344, 432–438. [Google Scholar] [CrossRef]
  21. Chutinimitkul, S.; Herfst, S.; Steel, J.; Lowen, A.C.; Ye, J.; van Riel, D.; Schrauwen, E.J.; Bestebroer, T.M.; Koel, B.; Burke, D.F.; et al. Virulence-associated substitution D222G in the hemagglutinin of 2009 pandemic influenza A(H1N1) virus affects receptor binding. J. Virol. 2010, 84, 11802–11813. [Google Scholar] [CrossRef] [Green Version]
  22. Shi, Y.; Wu, Y.; Zhang, W.; Qi, J.; Gao, G.F. Enabling the ‘host jump’: Structural determinants of receptor-binding specificity in influenza A viruses. Nat. Rev. Microbiol. 2014, 12, 822–831. [Google Scholar] [CrossRef]
  23. Righetto, I.; Filippini, F. Normal modes analysis and surface electrostatics of haemagglutinin proteins as fingerprints for high pathogenic type A influenza viruses. BMC Bioinform. 2020, 21 (Suppl. S10), 354. [Google Scholar] [CrossRef] [PubMed]
  24. Righetto, I.; Filippini, F. Pandemic Avian Influenza and Intra/Interhaemagglutinin Subtype Electrostatic Variation among Viruses Isolated from Avian, Mammalian, and Human Hosts. Biomed. Res. Int. 2018, 2018, 3870508. [Google Scholar] [CrossRef] [PubMed]
  25. Zhang, Y.; Aevermann, B.D.; Anderson, T.K.; Burke, D.F.; Dauphin, G.; Gu, Z.; He, S.; Kumar, S.; Larsen, C.N.; Lee, A.J.; et al. Influenza Research Database: An integrated bioinformatics resource for influenza virus research. Nucleic Acids Res. 2017, 45, D466–D474. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Influenza Research Database. Available online: https://fludb.org (accessed on 7 December 2022).
  27. Webb, B.; Sali, A. Comparative Protein Structure Modeling Using MODELLER. Curr. Protoc. Bioinform. 2016, 54, 5.6.1–5.6.37. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Martí-Renom, M.A.; Stuart, A.C.; Fiser, A.; Sánchez, R.; Melo, F.; Sali, A. Comparative protein structure modeling of genes and genomes. Annu. Rev. Biophys. Biomol. Struct. 2000, 29, 291–325. [Google Scholar] [CrossRef] [Green Version]
  29. Melo, F.; Sánchez, R.; Sali, A. Statistical potentials for fold assessment. Protein Sci. 2002, 11, 430–448. [Google Scholar] [CrossRef] [Green Version]
  30. Dong, G.Q.; Fan, H.; Schneidman-Duhovny, D.; Webb, B.; Sali, A. Optimized atomic statistical potentials: Assessment of protein interfaces and loops. Bioinformatics 2013, 29, 3158–3166. [Google Scholar] [CrossRef] [Green Version]
  31. Krivov, G.G.; Shapovalov, M.V.; Dunbrack, R.L., Jr. Improved prediction of protein side-chain conformations with SCWRL4. Proteins 2009, 77, 778–795. [Google Scholar] [CrossRef]
  32. Studer, G.; Rempfer, C.; Waterhouse, A.M.; Gumienny, R.; Haas, J.; Schwede, T. QMEANDisCo-distance constraints applied on model quality estimation. Bioinformatics 2020, 36, 1765–1771. [Google Scholar] [CrossRef]
  33. Benkert, P.; Biasini, M.; Schwede, T. Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics 2011, 27, 343–350. [Google Scholar] [CrossRef] [Green Version]
  34. Pettersen, E.F.; Goddard, T.D.; Huang, C.C.; Couch, G.S.; Greenblatt, D.M.; Meng, E.C.; Ferrin, T.E. UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 2004, 25, 1605–1612. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Mary Rajathei, D.; Selvaraj, S. Analysis of sequence repeats of proteins in the PDB. Comput. Biol. Chem. 2013, 47, 156–166. [Google Scholar] [CrossRef] [PubMed]
  36. Jurrus, E.; Engel, D.; Star, K.; Monson, K.; Brandi, J.; Felberg, L.E.; Brookes, D.H.; Wilson, L.; Chen, J.; Liles, K.; et al. Improvements to the APBS biomolecular solvation software suite. Protein Sci. 2018, 27, 112–128. [Google Scholar] [CrossRef] [Green Version]
  37. APBS-PDB2PQR Software Suite. Available online: https://server.poissonboltzmann.org/ (accessed on 7 December 2022).
  38. Sitkoff, D.; Sharp, K.A.; Honig, B. Accurate Calculation of Hydration Free Energies Using Macroscopic Solvent Models. J. Phys. Chem. 1994, 98, 1978–1988. [Google Scholar] [CrossRef]
  39. Richter, S.; Wenzel, A.; Stein, M.; Gabdoulline, R.R.; Wade, R.C. webPIPSA: A web server for the comparison of protein interaction properties. Nucleic Acids Res. 2008, 36, W276–W280. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. webPIPSA. Available online: https://pipsa.h-its.org/pipsa/ (accessed on 7 December 2022).
  41. Ihaka, R.; Gentleman, R. R: A language for data analysis and graphics. J. Comput. Graph. Stat. 1996, 5, 299–314. [Google Scholar] [CrossRef]
  42. RCSB PDB. Available online: https://www.rcsb.org/ (accessed on 7 December 2022).
  43. Tiwari, S.P.; Fuglebakk, E.; Hollup, S.M.; Skjærven, L.; Cragnolini, T.; Grindhaug, S.H.; Tekle, K.M.; Reuter, N. WEBnm@ v2.0: Web server and services for comparing protein flexibility. BMC Bioinform. 2014, 15, 427. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. UniProtKB. Available online: https://www.uniprot.org/ (accessed on 7 December 2022).
  45. GISAID. Available online: https://gisaid.org/ (accessed on 7 December 2022).
  46. Maiorov, V.N.; Crippen, G.M. Significance of root-mean-square deviation in comparing three-dimensional structures of globular proteins. J. Mol. Biol. 1994, 235, 625–634. [Google Scholar] [CrossRef] [PubMed]
  47. Pawłowski, P.H. Charged amino acids may promote coronavirus SARS-CoV-2 fusion with the host cell. AIMS Biophys. 2021, 8, 111–120. [Google Scholar] [CrossRef]
  48. Pawłowski, P.H. Additional Positive Electric Residues in the Crucial Spike Glycoprotein S Regions of the New SARS-CoV-2 Variants. Infect Drug Resist. 2021, 14, 5099–5105. [Google Scholar] [CrossRef] [PubMed]
  49. Ching, W.Y.; Adhikari, P.; Jawad, B.; Podgornik, R. Effect of Delta and Omicron Mutations on the RBD-SD1 Domain of the Spike Protein in SARS-CoV-2 and the Omicron Mutations on RBD-ACE2 Interface Complex. Int. J. Mol. Sci. 2022, 23, 10091. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Electrostatic distance heatmap for single template (PDB 4DJ6) models.
Figure 1. Electrostatic distance heatmap for single template (PDB 4DJ6) models.
Viruses 15 00305 g001
Figure 2. Electrostatic distance heatmap for mixed template models.
Figure 2. Electrostatic distance heatmap for mixed template models.
Viruses 15 00305 g002
Figure 3. Surface charge distribution of the HA RBDs from four representative HPAI and four LPAI H7N7 strains. Positive, blue; negative, red; neutral, white. Four 90° stepwise orientations are shown. Open circle, VED subregion; open rectangle, region where K122E, K182N mutations result in negativization/depositivization.
Figure 3. Surface charge distribution of the HA RBDs from four representative HPAI and four LPAI H7N7 strains. Positive, blue; negative, red; neutral, white. Four 90° stepwise orientations are shown. Open circle, VED subregion; open rectangle, region where K122E, K182N mutations result in negativization/depositivization.
Viruses 15 00305 g003
Figure 4. Bhattacharyya coefficient (BC) heatmap for HPAI and LPAI RBDs. Uniprot AC for each sequence, followed by pathogenicity and PDB of the template, are reported on the Y axis, while a distance tree is reported on the X axis.
Figure 4. Bhattacharyya coefficient (BC) heatmap for HPAI and LPAI RBDs. Uniprot AC for each sequence, followed by pathogenicity and PDB of the template, are reported on the Y axis, while a distance tree is reported on the X axis.
Viruses 15 00305 g004
Figure 5. RMSIP heatmap for HPAI and LPAI RBDs. Uniprot AC for each sequence, followed by pathogenicity and PDB of the template are reported on the Y axis, while a distance tree is reported on the X axis.
Figure 5. RMSIP heatmap for HPAI and LPAI RBDs. Uniprot AC for each sequence, followed by pathogenicity and PDB of the template are reported on the Y axis, while a distance tree is reported on the X axis.
Viruses 15 00305 g005
Table 1. Sequence identity, RMSD and Q-score for LPAI sequences modeled on PDB 4DJ6.
Table 1. Sequence identity, RMSD and Q-score for LPAI sequences modeled on PDB 4DJ6.
Sequence Identity
structure:12345678910
1AAC40998_low_4dj6.pdb: 0.9760.9610.9660.8630.9510.9220.9660.9660.859
2AAC54383_low_4dj6.pdb:0.976 0.9710.9660.8590.9610.9220.9660.9660.849
3AAC54388_low_4dj6.pdb:0.9610.971 0.9760.8540.9760.9070.9660.9660.849
4AAG10671_low_4dj6.pdb:0.9660.9660.976 0.8490.9660.9070.9610.9610.844
5AAR96248_low_4dj6.pdb:0.8630.8590.8540.849 0.8540.8540.8540.8590.815
6ACN90968_low_4dj6.pdb:0.9510.9610.9760.9660.854 0.9070.9560.9560.834
7ACZ36872_low_4dj6.pdb:0.9220.9220.9070.9070.8540.907 0.9070.9070.844
8AEK84772_low_4dj6.pdb:0.9660.9660.9660.9610.8540.9560.907 0.9900.849
9BAQ21382_low_4dj6.pdb:0.9660.9660.9660.9610.8590.9560.9070.990 0.849
10CAA44429_low_4dj6.pdb:0.8590.8490.8490.8440.8150.8340.8440.8490.849
RMSD
structure:12345678910
1AAC40998_low_4dj6.pdb: 0.1230.1000.1290.1090.1130.1250.1180.1180.156
2AAC54383_low_4dj6.pdb:0.123 0.1430.1350.1500.1490.1600.1530.1500.176
3AAC54388_low_4dj6.pdb:0.1000.143 0.1070.1040.1060.1140.1260.1320.151
4AAG10671_low_4dj6.pdb:0.1290.1350.107 0.1160.1060.1240.1320.1360.173
5AAR96248_low_4dj6.pdb:0.1090.1500.1040.116 0.0980.1140.1280.1460.177
6ACN90968_low_4dj6.pdb:0.1130.1490.1060.1060.098 0.0930.1230.1400.166
7ACZ36872_low_4dj6.pdb:0.1250.1600.1140.1240.1140.093 0.1280.1500.168
8AEK84772_low_4dj6.pdb:0.1180.1530.1260.1320.1280.1230.128 0.0950.195
9BAQ21382_low_4dj6.pdb:0.1180.1500.1320.1360.1460.1400.1500.095 0.183
10CAA44429_low_4dj6.pdb:0.1560.1760.1510.1730.1770.1660.1680.1950.183
Q-score
structure:12345678910
1AAC40998_low_4dj6.pdb: 0.9980.9990.9980.9990.9990.9980.9980.9980.997
2AAC54383_low_4dj6.pdb:0.998 0.9980.9980.9970.9980.9970.9970.9980.997
3AAC54388_low_4dj6.pdb:0.9990.998 0.9990.9990.9990.9990.9980.9980.997
4AAG10671_low_4dj6.pdb:0.9980.9980.999 0.9990.9990.9980.9980.9980.997
5AAR96248_low_4dj6.pdb:0.9990.9970.9990.999 0.9990.9990.9980.9980.997
6ACN90968_low_4dj6.pdb:0.9990.9980.9990.9990.999 0.9990.9980.9980.997
7ACZ36872_low_4dj6.pdb:0.9980.9970.9990.9980.9990.999 0.9980.9980.997
8AEK84772_low_4dj6.pdb:0.9980.9970.9980.9980.9980.9980.998 0.9990.996
9BAQ21382_low_4dj6.pdb:0.9980.9980.9980.9980.9980.9980.9980.999 0.996
10CAA44429_low_4dj6.pdb:0.9970.9970.9970.9970.9970.9970.9970.9960.996
Table 2. Sequence identity, RMSD and Q-score for HPAI sequences modelled on PDB 6N5A.
Table 2. Sequence identity, RMSD and Q-score for HPAI sequences modelled on PDB 6N5A.
Sequence Identity
structure:12345678910
1AAA43152_high_6N5A.pdb: 0.8780.9070.9170.9170.8830.9020.9020.9120.912
2AAA56803_high_6N5A.pdb:0.878 0.8980.9120.9170.9760.8980.8980.8980.898
3AAC54377_high_6N5A.pdb:0.9070.898 0.9660.9710.9020.9710.9710.9610.971
4AAR02639_high_6N5A.pdb:0.9170.9120.966 0.9950.9170.9850.9850.9850.985
5ABI94581_high_6N5A.pdb:0.9170.9170.9710.995 0.9220.9800.9800.9800.980
6AGB51388_high_6N5A.pdb:0.8830.9760.9020.9170.922 0.9020.9020.9020.902
7AGU69978_high_6N5A.pdb:0.9020.8980.9710.9850.9800.902 1.0000.9800.990
8AHF20373_high_6N5A.pdb:0.9020.8980.9710.9850.9800.9021.000 0.9800.990
9EPI168665_high_6N5A.pdb:0.9120.8980.9610.9850.9800.9020.9800.980 0.980
10EPI623939_high_6N5A.pdb:0.9120.8980.9710.9850.9800.9020.9900.9900.980
RMSD
structure:12345678910
1AAA43152_high_6N5A.pdb: 0.1470.1700.1600.1280.1380.2040.2040.1580.141
2AAA56803_high_6N5A.pdb:0.147 0.1070.1270.1150.1270.1400.1400.1450.119
3AAC54377_high_6N5A.pdb:0.1700.107 0.1140.1240.1350.1060.1060.1300.103
4AAR02639_high_6N5A.pdb:0.1600.1270.114 0.1270.1250.1240.1240.1310.117
5ABI94581_high_6N5A.pdb:0.1280.1150.1240.127 0.1080.1330.1330.1070.085
6AGB51388_high_6N5A.pdb:0.1380.1270.1350.1250.108 0.1610.1610.1300.107
7AGU69978_high_6N5A.pdb:0.2040.1400.1060.1240.1330.161 0.0000.1320.116
8AHF20373_high_6N5A.pdb:0.2040.1400.1060.1240.1330.1610.000 0.1320.116
9EPI168665_high_6N5A.pdb:0.1580.1450.1300.1310.1070.1300.1320.132 0.090
10EPI623939_high_6N5A.pdb:0.1410.1190.1030.1170.0850.1070.1160.1160.090
Q-score
structure:12345678910
1AAA43152_high_6N5A.pdb: 0.9980.9970.9970.9980.9980.9950.9950.9970.998
2AAA56803_high_6N5A.pdb:0.998 0.9990.9980.9990.9980.9980.9980.9980.998
3AAC54377_high_6N5A.pdb:0.9970.999 0.9990.9980.9980.9990.9990.9980.999
4AAR02639_high_6N5A.pdb:0.9970.9980.999 0.9980.9980.9980.9980.9980.998
5ABI94581_high_6N5A.pdb:0.9980.9990.9980.998 0.9990.9980.9980.9990.999
6AGB51388_high_6N5A.pdb:0.9980.9980.9980.9980.999 0.9970.9970.9980.999
7AGU69978_high_6N5A.pdb:0.9950.9980.9990.9980.9980.997 1.0000.9980.999
8AHF20373_high_6N5A.pdb:0.9950.9980.9990.9980.9980.9971.000 0.9980.999
9EPI168665_high_6N5A.pdb:0.9970.9980.9980.9980.9990.9980.9980.998 0.999
10EPI623939_high_6N5A.pdb:0.9980.9980.9990.9980.9990.9990.9990.9990.999
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Baggio, G.; Filippini, F.; Righetto, I. Comparative Surface Electrostatics and Normal Mode Analysis of High and Low Pathogenic H7N7 Avian Influenza Viruses. Viruses 2023, 15, 305. https://doi.org/10.3390/v15020305

AMA Style

Baggio G, Filippini F, Righetto I. Comparative Surface Electrostatics and Normal Mode Analysis of High and Low Pathogenic H7N7 Avian Influenza Viruses. Viruses. 2023; 15(2):305. https://doi.org/10.3390/v15020305

Chicago/Turabian Style

Baggio, Giulia, Francesco Filippini, and Irene Righetto. 2023. "Comparative Surface Electrostatics and Normal Mode Analysis of High and Low Pathogenic H7N7 Avian Influenza Viruses" Viruses 15, no. 2: 305. https://doi.org/10.3390/v15020305

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop