Next Article in Journal
Host Jump of an Exotic Fish Rhabdovirus into a New Class of Animals Poses a Disease Threat to Amphibians
Previous Article in Journal
VOGDB—Database of Virus Orthologous Groups
Previous Article in Special Issue
Impact of Missense Mutations on Spike Protein Stability and Binding Affinity in the Omicron Variant
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Semi-Covariance Coefficient Analysis of Spike Proteins from SARS-CoV-2 and Its Variants Omicron, BA.5, EG.5, and JN.1 for Viral Infectivity, Virulence and Immune Escape

by
Botao Zhu
1,
Huancheng Lin
2,
Jun Steed Huang
2,* and
Wandong Zhang
3,4,*
1
Department of Electrical and Computer engineering, Western University, London, ON N6A 5B9, Canada
2
School of Information Technology, Carleton University, Ottawa, ON K1S 5B6, Canada
3
Human Health Therapeutics Research Centre, National Research Council of Canada, 1200 Montreal Road, Building M54, Ottawa, ON K1A 0R6, Canada
4
Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
*
Authors to whom correspondence should be addressed.
Viruses 2024, 16(8), 1192; https://doi.org/10.3390/v16081192
Submission received: 21 May 2024 / Revised: 21 July 2024 / Accepted: 24 July 2024 / Published: 25 July 2024
(This article belongs to the Special Issue Mechanism of Receptor Recognition in Coronavirus)

Abstract

:
Semi-covariance has attracted significant attention in recent years and is increasingly employed to elucidate statistical phenomena exhibiting fluctuations, such as the similarity or difference in charge patterns of spike proteins among coronaviruses. In this study, by examining values above and below the average/mean based on the positive and negative charge patterns of amino acid residues in the spike proteins of SARS-CoV-2 and its current circulating variants, the proposed methods offer profound insights into the nonlinear evolving trends in those viral spike proteins. Our study indicates that the charge span value can predict the infectivity of the virus and the charge density can estimate the virulence of the virus, and both predicated infectivity and virulence appear to be associated with the capability of viral immune escape. This semi-covariance coefficient analysis may be used not only to predict the infectivity, virulence and capability of immune escape for coronaviruses but also to analyze the functionality of other viral proteins. This study improves our understanding of the trend of viral evolution in terms of viral infectivity, virulence or the capability of immune escape, which remains further validated by more future studies and statistical data.

1. Introduction

The Fractal DNA hypothesis (FDH) was introduced in 1994 [1]. The arithmetic data derived from DNA sequences, obtained by counting the number of intervening bases from a specific base to the next one (inter-event data), exhibits a dynamical process characterized by the observation of long-range (fractal) correlations. The reports of fractal long-range correlations in DNA sequences were also made by Peng et al. and others [2,3,4]. Unlike the traditional DNA hypothesis, the analysis of RNA and proteins is based on the fragment length between domains with electrical charges. Notably, it underscores the impact on charge behaviors stemming from differences in information reception, expression lengths, or neighbor status, indicating the presence of a fractal structure in stable DNAs [5]. Recent work on FDH in RNA studies [6] involves converting genetic sequences into binary numbers, with purines converted to −1 and pyrimidines converted to +1 [7]. We have previously used the semi-covariance coefficient method to analyze the correlation between the amino acid composition of spike proteins from SARS-CoV-2 and other coronaviruses for viral evolution trends and characteristics associated with fatality or virulence [8]. The semi-covariance is better to elucidate statistical phenomena exhibiting fluctuations, such as the similarity or difference in charge patterns of spike proteins among coronaviruses. The fluctuation analysis method has been previously used in analyzing the long-range correlation of DNA sequences [9]. To further characterize the charge patterns of spike proteins among coronaviruses, we have then used normalized semi-covariance co-efficiency to analyze the charge of the spike protein composition for viral infectivity and virulence [10]. In this study, our objectives are to investigate the analogous electrical charge-specific relationship (+1 for positive amino acids, −1 for negative counterparts, and 0 for neutral ones; the charges of amino acids are determined as listed in the amino acid table in the biochemistry textbook) in the spike protein sequences of coronaviruses for the viral infectivity and virulence [11] and to attempt to associate the viral infectivity and virulence with the viral capability of immune escape. The SARS-CoV-2 virus stands out as one of the longest positive single-stranded RNA viruses [12], and its protein folding, tertiary structure, and functions are intricately linked to the exhibiting of more positive charges than other proteins for binding to or interacting with the receptor [13,14]. Therefore, it is crucial to scrutinize the charge structure/patterns or nonlinear correlation patterns of the spike proteins of SARS-CoV-2 and its variants in comparison to spike proteins from the original SARS-CoV-2 strain, where the negative charge is predominant [15]. This study on viral protein charges furthers our understanding of viral evolution trends in terms of infectivity and virulence and associates viral infectivity and virulence with the ability of viral immune escape.

2. Literature Review

2.1. Research on Spike Proteins of SARS-CoV-2

The spike protein is a structural protein unique to the surface of coronaviruses. It contains crucial information about the natural evolution of coronaviruses and plays a key role in the viral recognition and invasion of human cells [16,17,18,19,20]. Over the past decade, the spike protein has been one of the most important research subjects in studies of coronaviruses closely related to humans. Following the outbreak of the COVID-19 pandemic, the spike protein quickly became a focal point of research [21]. The authors in [22] utilized pseudo-viruses to monitor the impact of FCS (furin cleavage site)-spike mutations of either Alpha/Omicron, Beta, or Delta variants on viral infectivity and neutralization sensitivity against sera that were drawn from fully vaccinated individuals. In Cordsmeier and colleagues’ study [23], the authors constructed SARS-CoV-2 with a uniform B.1. backbone but with alternative spike proteins to analyze the specific impact of variant spike proteins on infection dynamics. In Emmelot et al.’s study [24], the authors analyzed the functional impact of Omicron BA.4/BA.5 spike mutations on T cell responsiveness to non-conserved epitopes in vaccines. The result showed that several BA.4/BA.5 mutations in the spike protein led to a reduced responsiveness of epitope-specific T cells in subjects that received two doses of an mRNA vaccine based on the ancestral wild-type spike sequence. In the Bains and colleagues’ study [25], the authors assessed the impact of SARS-CoV-2 spike S1-domain glycans and spike proteins from different strains on the ability of pseudotyped lentivirions to undergo DC-SIGN-mediated trans-infection. In Escalera et al.’s study [26], the authors analyzed a set of emerging SARS-CoV-2 variants to investigate how different sets of mutations may impact spike protein processing, and they demonstrated that the mutations in spike protein present in these variants that become epidemiologically prevalent in humans are linked to an increase in spike protein processing and virus transmission. The authors [27] introduced a total of 48 mutations in the spike protein of SARS-CoV-2 variants and demonstrated that several amino acid changes found in Omicron spike proteins impair infectivity. Additionally, numerous alterations in the N-terminal domain (NTD) and receptor-binding domain (RBD) of BA.1 and/or BA.2 spike proteins impact neutralization by sera from individuals vaccinated with BNT/BNT-based vaccines and therapeutic antibodies.

2.2. Research on Electrostatic Feature of SARS-CoV-2 Spike Protein

The spike protein of SARS-CoV-2 is made up of amino acids, each with specific chemical properties. Amino acids like arginine (Arg) and lysine (Lys) have positively charged side chains at physiological pH (around neutral pH). These positively charged residues can be found within different regions of the spike protein, including the RBD and other functional domains. These charges play critical roles in viral attachment, entry, and immune recognition, making them important factors in understanding and combating COVID-19 [28,29,30,31]. The increase in positive charge on spike protein of SARS-CoV-2 variants (such as Omicron) altered the biochemical properties of spike protein and may influence virion survival and promote transmission [32]. Studies found that mutations increased the electrostatic interactions of the Omicron spike protein RBD with ACE2 and promoted infectivity and transmission of the variants [33,34]. Nguyen and colleagues [35] investigated the evolving positive charge of the SARS-CoV-2 spike protein and found that the Omicron variant has enhanced binding rates to negatively charged glycocalyx and significantly stronger interactions with heparan sulfate as compared to Delta. This increased dependence on the heparan sulfate for viral attachment and infection suggests new therapeutic and diagnostic opportunities. Kim et al. [36] compared the spike proteins of SARS-CoV-2 Omicron sublineages with earlier SARS-CoV-2 variants and found that the SARS-CoV-2 Omicron has a higher electric field line density than that of earlier SARS-CoV-2 variants, which indicates a stronger interaction between the Omicron spike protein and the ACE2 receptor. Božič and Podgornik [37] found that the trend of increase in the positive charge on spike protein of SARS-CoV-2 variants has been halted with the appearance of Omicron variants, while these sublineages display a greater diversity in their composition of ionizable amino acids. Pascarella and colleagues [38] examined the spike proteins from B.1.617.1 (Kappa), B.1.617.2 (Delta), and B.1.617.3 and found that these variants exhibit significant changes in the spike protein’s surface electrostatic potential, particularly in Delta, which may enhance the interaction with the negatively charged ACE2 receptor and increase viral transmission. Lu and colleagues [39] calculated the charge distributions of SARS-CoV, SARS-CoV-2, and variants of concern using net charge calculation formulas and found that the SARS-CoV-2 spike protein had more positive charges than that of SARS-CoV. Further analysis showed that variants, particularly the Delta variant, had even higher positive charges in the S1 domain, significantly increasing Coulomb’s force with the negatively charged ACE2 receptor and potentially leading to higher infectivity. Another study [40] investigated the main electrostatic features involved in the interaction between the RBD of the SARS-CoV-2 spike protein and the human receptor ACE2. Using the FORTE approach, which models proton fluctuations and computes free energies for many protein-protein systems, these authors analyzed wild-type and critical variants, focusing on pH-dependent binding affinities, protein charges, charge regulation capacities, and dipole moments. They revealed a linear correlation, termed the “RBD charge rule”, between the total charge of the RBD and its binding affinity to ACE2, providing a quick test for predicting the severity of future SARS-CoV-2 variants.

3. Methodologies

The coronaviral spike protein and RNA sequences used in this research were obtained from NCBI GenBank and GISAID databases, including Wuhan strain SARS-CoV-2, UK variant (B.1.1.7), Delta variant (B.1.617), Omicron, and Omicron subvariants BA.5 (B.1.1.529), EG.5, and JN.1. The viral RNA sequences obtained from the GISAID database were translated into protein sequences before analysis. As said above, a positive-charged amino acid is represented as +1, a negative-charged amino acid is represented as −1, and a neutral amino acid is represented as 0. Assuming two distinct viral protein sequences are converted into sequences or symbols represented as charges, labeled as X = ( x 1 , , x n ) and Y = ( y 1 , , y n ) , with X serving as the baseline, various computational methods are proposed to compare and analyze the correlation between viral protein sequences from multiple dimensions, aiming to better understand the evolution trend of the virus, such as infectivity and virulence. These computational methods are detailed below.

3.1. Pearson Correlation Coefficient

The Pearson correlation coefficient quantifies the linear correlation between two variables, which is calculated as the ratio of the covariance of two variables to the product of their standard deviations. It can also be used to represent the correlation between two viral sequences, which is given by [41]
ρ X , Y = E X E [ X ] Y E [ Y ] E X 2 E X 2 E Y 2 E Y 2 .  

3.2. Semi-Variance Correlation

3.2.1. Semi-Variance Correlation Coefficient

While the Pearson correlation coefficient excels at expressing the correlation among multiple variables, it is limited to capturing only linear correlations and overlooks many other types of relationships or correlations. Inspired by the Pearson correlation coefficient, we propose the semi-covariance coefficient, which can measure the nonlinear correlation between variables and provide more detailed information and insights. As shown in Figure 1, when employing Pearson correlation to measure the relationship between two sequences, it only indicates the linear relationship (positive and negative correlation folded together) between the sequences. However, when using semi-variance correlation to measure the relationship between two sequences, it reveals the nonlinear relationship (positive and negative correlation unfolded separately) between the sequences, which can better uncover the relationship between them.
To shift from linear correlation to nonlinear correlation, the Rectified Linear Unit (ReLU) [42] [https://en.wikipedia.org/wiki/Rectifier_(neural_networks) (accessed on 21 May 2024)] is employed in this study. ReLU is an activation function that introduces nonlinearity, allowing the model to capture or unfold and represent more complex relationships in the data. The basic form of ReLU is given by the following:
R e L U R = max 0 , R ,
where R is a real number. Then, R can be expressed or unfolded as following:
R = R e L U R R e L U R ,
and the Pooling of the expectation of R
E [ R ] = E [ R e L U R R e L U R ] .
Setting R = ( X E [ X ] ) ( Y E [ Y ] ) and substituting it into Equation (1), the semi-covariance coefficient is thus defined as following:
S p o s X , Y = E [ R e L U ( ( X E [ X ] ) ( Y E [ Y ] ) ) ] E X 2 E X 2 E Y 2 E Y 2 ,
S n e g X , Y = E R e L U X E [ X ] Y E [ Y ] E X 2 E X 2 E Y 2 E Y 2 ,
where S p o s is the positive correlation covariance coefficient or the convergent part, and S n e g is the negative correlation covariance coefficient or the divergent part. According to the mean of each two variables, all variables are divided or unfolded into four quadrants. S p o s belongs to the first and third quadrants, while S n e g belongs to the second and fourth quadrants. We also include the following indicators to Semi-variance correlation to assess its advantage.

3.2.2. Quantity of Charge

The quantity of charge of the sequence Y is
Q Y = i = 1 n y i .

3.2.3. Gravity

As R e L U ( R ) is a sequence, denoted as ( R e L U r 1 , R e L U ( r n ) ) , where each is assumed to be the weight, the gravity of converge part of Y is calculated by
C p o s = R o u n d i n R e L U r i I i i n R e L U r i = R o u n d i n R e L U ( ( x i E [ Y ] ) ( y i E [ Y ] ) ) I i i n R e L U ( ( x i E [ Y ] ) ( y i E [ Y ] ) ) ,
where I i is the position of R e L U ( r i ) in the sequence, and Round is the rounding operation. Likewise, the gravity of divergent part of Y is given by
C n e g = R o u n d i n R e L U r i I i i n R e L U r i = R o u n d i n R e L U ( ( x i E [ Y ] ) ( y i E [ Y ] ) ) I i i n R e L U ( ( x i E [ Y ] ) ( y i E [ Y ] ) ) .

3.2.4. Charge Span

The charge span of Y is given by
M Y = C p o s C n e g .

3.2.5. Reproduction Rate

The reproduction rate of Y is calculated by
O Y = M Y O X M X ,
where O X and M X are the reproduction rate and the charge span of virus X, respectively.

3.2.6. Maximal Position

The position of maximal value in S p o s is
P p o s   a r g m a x 1 i n S p o s i .
Likewise, the position of maximal value in S n e g is
P n e g   a r g m a x 1 i n S n e g i .  

3.2.7. Charge Density

The charge density of Y is calculated by
D Y = P p o s P n e g O Y .

3.2.8. Virulence

The virulence of Y is calculated by
V Y = D Y V X D X ,
where V X and D X are the virulence and the charge density of X, respectively.

4. Result and Discussion

To assess the correlation between spike protein sequences of SARS-CoV-2 and its variants using the proposed methods, the Wuhan strain SARS-CoV-2 is employed as the sequence baseline, and the Delta variant is used as the data point reference. We first examine the difference between Pearson correlation and semi-variance correlation in measuring the correlation between BA.5 and Wuhan SARS-CoV-2. As shown in Figure 2a, Pearson correlation can only represent the relationship between these two viral spike protein sequences within the first quadrant of the coordinate axis. However, in Figure 2b, Semi-variance correlation can demonstrate the correlation between these two viral sequences in all four quadrants (unfolded with ReLU function). This result indicates that the proposed semi-variance correlation can uncover the multiple nonlinear relationships between viral sequences, thus facilitating the inference of the evolution between sequences.
The comparison results of different variants are presented in Table 1. The charge span M Y reflects the infectivity of viruses. The higher the M Y value of the viral spike protein, the greater the viral infectivity. The results are consistent with the actual infectivity of the virus and its variants. For instance, Omicron and EG.5 were both major prevalent variants a couple of years ago, while JN.1 is currently the predominant variant and more infectious. EG.5 is more infectious ( M Y value of 551) than Omicron ( M Y value of 270), while JN.1 is more infectious ( M Y value of 565) than EG.5 (Table 1). Furthermore, the charge density D Y represents viral virulence. The higher the value of D Y , the greater the virulence of the virus. Due to the wide campaign of vaccination against SARS-CoV-2 and its variants and the establishment of herd immunity to the virus and its variants, the viral virulence is minimized in the immunized population. The value of D Y may thus correlate with the capability of immune escape for the variants due to the rapid evolution of the variants. Immune escape is one aspect of the evolution of viral variants, and new variants gain more and more capabilities of immune escape [43]. People who received vaccinations or recovered from infections gain immunity or cross-immunity against new variants of the virus. For the survival or evolution of the virus, its new variants or new mutants have to develop the ability of immune escape to evade T cells, innate immunity and population immunity [43]. Mutations in the viral spike proteins allow new variants to emerge with the capabilities of greater transmissibility and immune escape [44]. The values of both predicted infectivity and virulence may thus correlate with the capability of immune escape for these variants. Although the new variants develop a stronger immune escape capability, the cross-immune response established previously in the human body may relieve the symptoms of people who are infected with the new variants. Our analysis results appear to be consistent with the trends of viral infectivity and virulence associated with enhanced immune escape.

5. Conclusions

In this study, we examined the positive charge patterns of spike proteins from SARS-CoV-2 and its current circulating variants. Our analyses indicate that the charge span (MY) value may predict the infectivity of the viral variants, with a high charge span value exhibiting high infectivity. The charge density (DY) may predict the virulence of the viral variants, with a high charge density value displaying more virulence for the original strain of SARS-CoV-2 or more capability of immune escape for current circulating variants derived from Omicron. Our analyses provide a profound understanding of the nonlinear pattern of viral evolution trend and identify the protein mutation characteristics that may be associated with viral infectivity and virulence for developing the immune escape capability of current circulating variants and for predicting the evolving trend of future new variants. More future studies and statistical data will be needed to further validate our findings and the association. This semi-covariance coefficient analysis may be used not only to predict the infectivity, virulence and capability of immune escape for coronaviruses but also to analyze the functionality of other viral proteins. There are limitations to our study. The charges of the amino acids may change under different pH environments, such as more acidic or more basic conditions. These conditions were not considered in our study since the pH is generally maintained from pH 7.35 to pH 7.45 in the human body. The infectivity and virulence of the viruses on the human body are affected by many factors, such as underlying diseases (hypertension, heart and lung diseases, medications of the patients, etc.), vaccinations, immunity of the affected individuals, etc. These conditions were not considered in the study. Another limitation is the Equations of (10) to (15) in our study. To further validate Equations (10) to (15), more research and statistical data are needed in the future.

Author Contributions

B.Z. and H.L. conducted data analyses, drafted the manuscript and prepared the figures. J.S.H. developed the algorithm, conceived the idea, wrote the manuscript and prepared the figures. W.Z. conceived the idea, collected data for coronaviral spike proteins, revised and finalized the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

The research work conducted by Wandong Zhang was supported by funding from the National Research Council of Canada (A1-020810-01-01-01) and by a team grant from the Canadian Institute of Health Research (CIHR #170355) on the Rapid Research Response to COVID-19 Outbreak to Marc-Andre Langlois at the University of Ottawa.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The calculation formulas are provided in this paper, and data will be available upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Peters, E.E. Fractal DNA Analysis: Applying Chaos Theory to Investment and Economics; Wiley: Hoboken, NJ, USA, 1994. [Google Scholar]
  2. Peng, C.K.; Buldyrev, S.V.; Goldberger, A.L.; Havlin, S.; Sciortino, F.; Simons, M.; Stanley, H.E. Long-range correlations in nucleotide sequences. Nature 1992, 356, 168–170. [Google Scholar] [CrossRef] [PubMed]
  3. Voss, R.F. Evolution of long-range fractal correlations and 1/f noise in DNA base sequences. Phys. Rev. Lett. 1992, 68, 3805–3808. [Google Scholar] [CrossRef] [PubMed]
  4. Buldyrev, S.V.; Goldberger, A.L.; Havlin, S.; Mantegna, R.N.; Matsa, M.E.; Peng, C.K.; Simons, M.; Stanley, H.E. Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysis. Phys. Rev. E Stat. Phys. Plasmas Fluids Relat. Interdiscip. Top. 1995, 51, 5084–5091. [Google Scholar] [CrossRef] [PubMed]
  5. Xu, Q.; Bai, Y. Semiparametric statistical inferences for longitudinal data with nonparametric covariance modelling. Statistics 2017, 51, 1280–1303. [Google Scholar] [CrossRef]
  6. Fernandes, T.; Oliveira Filho, J.S.; Silva Lopes, I.M.S. Fractal signature of coronaviruses related to severe acute respiratory syndrome. Res. Biomed. Eng. 2022, 38, 293–297. [Google Scholar] [CrossRef]
  7. Kumar, S.; Karuppanan, K.; Subramaniam, G. Omicron (BA.1) and sub-variants (BA.1.1, BA.2, and BA.3) of SARS-CoV-2 spike infectivity and pathogenicity: A comparative sequence and structural-based computational assessment. J. Med. Virol. 2022, 94, 4780–4791. [Google Scholar] [CrossRef] [PubMed]
  8. Huang, J.S.; Huang, J.M.; Zhang, W. Semi-covariance coefficient analysis of spike proteins from SARS-CoV-2 and other coronaviruses for viral evolution and characteristics associated with fatality. Entropy 2021, 23, 512. [Google Scholar] [CrossRef] [PubMed]
  9. Peng, C.K.; Buldyrev, S.V.; Havlin, S.; Simons, M.; Stanley, H.E.; Goldberger, A.L. Mosaic organization of DNA nucleotides. Phys. Rev. E Stat. Phys. Plasmas Fluids Relat. Interdiscip. Top. 1994, 49, 1685–1689. [Google Scholar] [CrossRef] [PubMed]
  10. Xu, T.; Zhou, S.; Huang, J.S.; Zhang, W. Normalized semi-covariance co-efficiency analysis of spike proteins from SARS-CoV-2 variant Omicron and other coronaviruses for their infectivity and virulence. Arch. Clin. Biomed. Res. 2023, 7, 387–399. [Google Scholar] [CrossRef]
  11. Hathout, R.M.; Kassem, D.H. Positively charged electroceutical spun chitosan nanofibers can protect health care providers from COVID-19 infection: An opinion. Front. Bioeng. Biotechnol. 2020, 8, 885. [Google Scholar] [CrossRef]
  12. Tan, C.W.; Chia, W.N.; Qin, X.; Liu, P.; Chen, M.I.; Tiu, C.; Hu, Z.; Chen, V.C.; Young, B.E.; Sia, W.R.; et al. A SARS-CoV-2 surrogate virus neutralization test based on antibody-mediated blockage of ACE2-spike protein-protein interaction. Nat. Biotechnol. 2020, 38, 1073–1078. [Google Scholar] [CrossRef] [PubMed]
  13. Wang, L.; Zhao, J.; Nguyen, T.; Adkins, J.L.; Schank, M.; Khanal, S.; Nguyen, L.; Dang, X.; Cao, D.; Thakuri, C.; et al. Blockade of SARS-CoV-2 spike protein-mediated cell-cell fusion using COVID-19 convalescent plasma. Sci. Rep. 2021, 11, 5558. [Google Scholar] [CrossRef] [PubMed]
  14. Kim, M.H.; Kim, J.H.; Lee, K.; Gim, Y. The prediction of COVID-19 using LSTM algorithms. Int. J. Netw. Distrib. Comput. 2021, 9, 19–24. [Google Scholar] [CrossRef]
  15. Wu, F.; Zhao, S.; Yu, B.; Chen, Y.M.; Wang, W.; Song, Z.G.; Hu, Y.; Tao, Z.W.; Tian, J.H.; Pei, Y.Y.; et al. A new coronavirus associated with human respiratory disease in China. Nature 2020, 579, 265–269. [Google Scholar] [CrossRef] [PubMed]
  16. Desingu, P.A.; Nagarajan, K.; Dhama, K. SARS-CoV-2 gained a novel spike protein S1-N-Terminal Domain (S1-NTD). Environ. Res. 2022, 211, 113047. [Google Scholar] [CrossRef]
  17. Mittal, A.; Khattri, A.; Verma, V. Structural and antigenic variations in the spike protein of emerging SARS-CoV-2 variants. PLoS Pathog. 2022, 18, e1010260. [Google Scholar] [CrossRef] [PubMed]
  18. Tao, K.; Tzou, P.L.; Nouhin, J.; Gupta, R.K.; Oliveira, T.; Kosakovsky Pond, S.L.; Fera, D.; Shafer, R.W. The biological and clinical significance of emerging SARS-CoV-2 variants. Nat. Rev. Genet. 2021, 22, 757–773. [Google Scholar] [CrossRef]
  19. Banerjee, S.; Wang, X.; Du, S.; Zhu, C.; Jia, Y.; Wang, Y.; Cai, Q. Comprehensive role of SARS-CoV-2 spike glycoprotein in regulating host signaling pathway. J. Med. Virol. 2022, 94, 4071–4087. [Google Scholar] [CrossRef] [PubMed]
  20. Kumavath, R.; Barh, D.; Andrade, B.S.; Imchen, M.; Aburjaile, F.F.; Ch, A.; Rodrigues, D.L.N.; Tiwari, S.; Alzahrani, K.J.; Góes-Neto, A.; et al. The spike of SARS-CoV-2: Uniqueness and applications. Front. Immunol. 2021, 12, 663912. [Google Scholar] [CrossRef] [PubMed]
  21. Cocherie, T.; Zafilaza, K.; Leducq, V.; Marot, S.; Calvez, V.; Marcelin, A.G.; Todesco, E. Epidemiology and characteristics of SARS-CoV-2 variants of concern: The impacts of the spike mutations. Microorganisms 2023, 11, 30. [Google Scholar] [CrossRef]
  22. Kuzmina, A.; Wattad, S.; Engel, S.; Rosenberg, E.; Taube, R. Functional analysis of spike from SARS-CoV-2 variants reveals the role of distinct mutations in neutralization potential and viral infectivity. Viruses 2022, 14, 803. [Google Scholar] [CrossRef] [PubMed]
  23. Cordsmeier, A.; Jungnickl, D.; Herrmann, A.; Korn, K.; Ensser, A. Analysis of SARS-CoV-2 spike protein variants with recombinant reporter viruses created from a bacmid system. Int. J. Mol. Sci. 2023, 24, 8156. [Google Scholar] [CrossRef] [PubMed]
  24. Emmelot, M.E.; Vos, M.; Boer, M.C.; Rots, N.Y.; van Els, C.A.C.M.; Kaaijk, P. SARS-CoV-2 Omicron BA.4/BA.5 mutations in spike leading to T cell escape in recently vaccinated individuals. Viruses 2023, 15, 101. [Google Scholar] [CrossRef] [PubMed]
  25. Bains, A.; Guan, W.; LiWang, P.J. The effect of select SARS-CoV-2 n-linked glycan and variant of concern spike protein mutations on c-type lectin-receptor-mediated infection. Viruses 2023, 15, 1901. [Google Scholar] [CrossRef] [PubMed]
  26. Escalera, A.; Gonzalez-Reiche, A.S.; Aslam, S.; Mena, I.; Laporte, M.; Pearl, R.L.; Fossati, A.; Rathnasinghe, R.; Alshammary, H.; van de Guchte, A.; et al. Mutations in SARS-CoV-2 variants of concern link to increased spike cleavage and virus transmission. Cell Host Microbe 2022, 30, 373–387. [Google Scholar] [CrossRef] [PubMed]
  27. Pastorio, C.; Zech, F.; Noettger, S.; Jung, C.; Jacob, T.; Sanderson, T.; Sparrer, K.M.J.; Kirchhoff, F. Determinants of spike infectivity, processing, and neutralization in SARS-CoV-2 Omicron subvariants BA.1 and BA.2. Cell Host Microbe 2022, 30, 1255–1268. [Google Scholar] [CrossRef]
  28. Zhang, Z.; Zhang, J.; Wang, J. Surface charge changes in spike RBD mutations of SARS-CoV-2 and its variant strains alter the virus evasiveness via HSPGs: A review and mechanistic hypothesis. Front. Public Health 2022, 10, 952916. [Google Scholar] [CrossRef]
  29. Pawlowski, P.H.; Zielenkiewicz, P. The role of electric charge in SARS-CoV-2 and other viral infections. AIMS Biophys. 2024, 11, 166–189. [Google Scholar] [CrossRef]
  30. Magazine, N.; Zhang, T.; Wu, Y.; McGee, M.C.; Veggiani, G.; Huang, W. Mutations and evolution of the SARS-CoV-2 spike protein. Viruses 2022, 14, 640. [Google Scholar] [CrossRef]
  31. Pawłowski, P.H. Additional Positive Electric Residues in the Crucial Spike Glycoprotein S Regions of the New SARS-CoV-2 Variants. Infect. Drug Resist. 2021, 14, 5099–5105. [Google Scholar] [CrossRef]
  32. Cotton, M.; Phan, M.V.T. Evolution of increased positive charge on the SARS-CoV-2 spike protein may be adaptation to human transmission. iScience 2023, 26, 106230. [Google Scholar] [CrossRef] [PubMed]
  33. Jawad, B.; Adhikari, P.; Podgornik, R.; Ching, W.Y. Binding Interactions between Receptor-Binding Domain of Spike Protein and Human Angiotensin Converting Enzyme-2 in Omicron Variant. J. Phys. Chem. Lett. 2022, 13, 3915–3921. [Google Scholar] [CrossRef] [PubMed]
  34. Adhikari, P.; Jawad, B.; Podgornik, R.; Ching, W.Y. Mutations of Omicron Variant at the Interface of the Receptor Domain Motif and Human Angiotensin-Converting Enzyme-2. Int. J. Mol. Sci. 2022, 23, 2870. [Google Scholar] [CrossRef] [PubMed]
  35. Nguyen, H.L.; Lan, P.D.; Thai, N.Q.; Nissley, D.A.; O’Brien, E.P.; Li, M.S. Does SARS-CoV-2 bind to human ACE2 more strongly than does SARS-CoV. J. Phys. Chem. 2020, 124, 7336–7347. [Google Scholar] [CrossRef] [PubMed]
  36. Kim, S.H.; Kearns, F.L.; Rosenfeld, M.A.; Votapka, L.; Casalino, L.; Papanikolas, M.; Rommie, E.; Amaro, R.E.; Freeman, R. Positively bound: Remapping of increased positive charge drives SARS-CoV-2 spike evolution to optimize its binding to cell surface receptors. ChemRxiv 2022. [Google Scholar] [CrossRef]
  37. Božič, A.; Podgornik, R. Changes in total charge on spike protein of SARS-CoV-2 in emerging lineages. Bioinform. Adv. 2024, 4, vbae053. [Google Scholar] [CrossRef] [PubMed]
  38. Pascarella, S.; Ciccozzi, M.; Zella, D.; Bianchi, M.; Benedetti, F.; Benvenuto, D.; Broccolo, F.; Cauda, R.; Caruso, A.; Angeletti, S.; et al. SARS-CoV-2 B.1.617 Indian variants: Are electrostatic potential changes responsible for a higher transmission rate? J. Med. Virol. 2021, 93, 6551–6556. [Google Scholar] [CrossRef] [PubMed]
  39. Lu, Y.; Zhao, T.; Lu, M.; Zhang, Y.; Yao, X.; Wu, G.; Dai, F.; Zhang, F.; Zhang, G. The analyses of high infectivity mechanism of SARS-CoV-2 and its variants. COVID 2021, 1, 666–673. [Google Scholar] [CrossRef]
  40. Barroso da Silva, F.L.; Giron, C.C.; Laaksonen, A. Electrostatic features for the receptor binding domain of SARS-COV-2 wildtype and its variants. Compass to the severity of the future variants with the charge-rule. J. Phys. Chem. B 2022, 126, 6835–6852. [Google Scholar] [CrossRef] [PubMed]
  41. Saccenti, E.; Hendriks, B.; Smilde, A.K. Corruption of the pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models. Sci. Rep. 2020, 10, 438. [Google Scholar] [CrossRef]
  42. Chen, Y.; Dai, X.; Liu, M.; Chen, D.; Yuan, L.; Liu, Z. Dynamic relu. In Proceedings of the European Conference on Computer Vision, Glasgow, UK, 23–28 August 2020; pp. 351–367. [Google Scholar]
  43. Carabelli, A.M.; Peacock, T.P.; Thorne, L.G.; Harvey, W.T.; Hughes, J. SARS-CoV-2 variant biology: Immune escape, transmission and fitness. Nat. Rev. Microbiol. 2023, 21, 162–177. [Google Scholar] [CrossRef] [PubMed]
  44. Castonguay, N.; Zhang, W.; Langlois, M.A. Meta-analysis and structural dynamics of the emergence of genetic variants of SARS-CoV-2. Front. Microbiol. 2021, 12, 676314. [Google Scholar] [CrossRef]
Figure 1. Visual comparison of Pearson correlation and Semi-variance correlation.
Figure 1. Visual comparison of Pearson correlation and Semi-variance correlation.
Viruses 16 01192 g001
Figure 2. Visual comparison of correlations between the spike protein sequences of Wuhan SARS-CoV-2 and BA.5 variant using (a) Pearson correlation and (b) semi-variance correlation. The analyses with semi-variance correlation show multiple nonlinear patterns of viral evolution.
Figure 2. Visual comparison of correlations between the spike protein sequences of Wuhan SARS-CoV-2 and BA.5 variant using (a) Pearson correlation and (b) semi-variance correlation. The analyses with semi-variance correlation show multiple nonlinear patterns of viral evolution.
Viruses 16 01192 g002
Table 1. Comparison of spike proteins of SARS-CoV-2 variants *.
Table 1. Comparison of spike proteins of SARS-CoV-2 variants *.
UK
(B.1.1.7)
Delta
(B.1.617)
OmicronBA.5EG.5JN.1
S p o s 99.58%99.44%98.09%96.04%87.63%63.9%
S n e g 0.32%0.17%1%0.4%1.95%7.75%
ρ 0.99260.99270.97090.95640.85590.5615
Q Y 171518181719
C p o s 626629631665718868
C n e g 655557361568167303
*  M Y 297227097551565
O Y 0.842.087.792.815.8916.3
P p o s 516518518126412621262
P n e g 6146162166839797
*  D Y 10.40.310.660.820.8
V Y 5.33%2.15%1.64%3.51%4.35%4.24%
Actual rate of death (sourced from the Internet)1.3–5.3%0.3–3.4%0.06–0.3%0.06–0.3%/1.81%
* The charge span M Y reflects the infectivity of viruses and the variants (higher value represents high infectivity). The charge density D Y represents the virulence of viruses and the variants. Both viral infectivity and virulence further evolve to correlate with viral immune escape.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhu, B.; Lin, H.; Huang, J.S.; Zhang, W. Semi-Covariance Coefficient Analysis of Spike Proteins from SARS-CoV-2 and Its Variants Omicron, BA.5, EG.5, and JN.1 for Viral Infectivity, Virulence and Immune Escape. Viruses 2024, 16, 1192. https://doi.org/10.3390/v16081192

AMA Style

Zhu B, Lin H, Huang JS, Zhang W. Semi-Covariance Coefficient Analysis of Spike Proteins from SARS-CoV-2 and Its Variants Omicron, BA.5, EG.5, and JN.1 for Viral Infectivity, Virulence and Immune Escape. Viruses. 2024; 16(8):1192. https://doi.org/10.3390/v16081192

Chicago/Turabian Style

Zhu, Botao, Huancheng Lin, Jun Steed Huang, and Wandong Zhang. 2024. "Semi-Covariance Coefficient Analysis of Spike Proteins from SARS-CoV-2 and Its Variants Omicron, BA.5, EG.5, and JN.1 for Viral Infectivity, Virulence and Immune Escape" Viruses 16, no. 8: 1192. https://doi.org/10.3390/v16081192

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop