Next Article in Journal
Crocodile Oil Disrupts Mitochondrial Homeostasis and Exacerbates Diabetic Kidney Injury in Spontaneously Diabetic Torii Rats
Next Article in Special Issue
In Silico Maturation of a Nanomolar Antibody against the Human CXCR2
Previous Article in Journal
Calcium Binds to Transthyretin with Low Affinity
Previous Article in Special Issue
CavitySpace: A Database of Potential Ligand Binding Sites in the Human Proteome
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

CoVM2: Molecular Biological Data Integration of SARS-CoV-2 Proteins in a Macro-to-Micro Method

1
Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
2
Chu Kochen Honors College, Zhejiang University, Hangzhou 310058, China
3
Institute of Hematology, Zhejiang University School of Medicine, The First Affiliated Hospital, Zhejiang University, Hangzhou 310058, China
*
Author to whom correspondence should be addressed.
Biomolecules 2022, 12(8), 1067; https://doi.org/10.3390/biom12081067
Submission received: 8 June 2022 / Revised: 27 July 2022 / Accepted: 29 July 2022 / Published: 2 August 2022

Abstract

:
The COVID-19 pandemic has been a major public health event since 2020. Multiple variant strains of SARS-CoV-2, the causative agent of COVID-19, were detected based on the mutation sites in their sequences. These sequence mutations may lead to changes in the protein structures and affect the binding states of SARS-CoV-2 and human proteins. Experimental research on SARS-CoV-2 has accumulated a large amount of structural data and protein-protein interactions (PPIs), but the studies on the SARS-CoV-2–human PPI networks lack integration of physical associations with possible protein docking information. In addition, the docking structures of variant viral proteins with human receptor proteins are still insufficient. This study constructed SARS-CoV-2–human protein–protein interaction network with data integration methods. Crystal structures were collected to map the interaction pairs. The pairs of direct interactions and physical associations were selected and analyzed for variant docking calculations. The study examined the structures of spike (S) glycoprotein of variants Delta B.1.617.2, Omicron BA.1, and Omicron BA.2. The calculated docking structures of S proteins and potential human receptors were obtained. The study integrated binary protein interactions with 3D docking structures to fulfill an extended view of SARS-CoV-2 proteins from a macro- to micro-scale.

1. Introduction

COVID-19 has sparked a global pandemic since the year 2020. A coronavirus was identified as the causative agent of COVID-19 and named SARS-CoV-2 [1]. Systematic studies including sequencing, protein interactions, and structural docking have been carried out to identify SARS-CoV-2-related proteins. The related virus–human protein interactions occurred in the progression of a SARS-CoV-2 infection [2]. The spike glycoprotein became the focus of research due to its mediating role in virus entry into cells by binding with human ACE2 receptors [3]. When SARS-CoV-2 infects the host cell, the S protein binds to the host receptor ACE2, triggering the virus-cell fusion process [4].
The sequence of SARS-CoV-2 has undergone frequent mutations, and multiple mutant strains have been distinguished based on a large number of mutational characteristics [5]. The mutation site of variant strains Delta B.1.617.2, Omicron BA.1, and Omicron BA.2 have been investigated by multiple studies and recorded in detail in public databases [6], thus, the sequences of the variants can be collected. In addition to ACE2, several human proteins with potential S-protein receptor function have also been experimentally identified [7,8]. Tyrosine-protein kinase receptor UFO (AXL), Neuropilin-1 (NRP1), and CD209 act as SARS-CoV-2 receptors which play critical roles in SARS-CoV-2 invasion and infection [9,10,11,12].
The protein–protein interactions (PPIs) between SARS-CoV-2 and human receptor candidates have been detected by experimental methods. Affinity purification-mass spectrometry (AP-MS) [13] has been a widely-used method to identify PPIs and protein complexes by generating bait-prey data sets [14] and is suitable for detecting virus–host multiprotein complexes [15]. Proximity-dependent biotin identification (BioID-MS) [16] is a method designed for the detection of transient PPI that frequently emerged during the viral infection. Gordon et al. [17] obtained 332 pairs of high-confidence virus–human protein interactions using AP-MS and identified 66 human proteins that could be used for drug therapy. Using AP-MS and BioID-MS experimental methods, Liu et al. [15] mapped the interaction between viral proteins and host proteins, resulting in 693 hub proteins that can be used for drug repurposing. Public databases including IntAct [18], VirHost [19], and RCSB PDB [20] have collected the annotation information and structural data of related proteins. There have been repositories of comprehensive SARS-CoV-2-related information. Ahsan et al. proposed OverCOVID [21], an integrative server including SARS-CoV-2-related databases, web servers, and tools. Satyam et al. established COVIDium [22], which provided similar information, but also added PPI network data sets and classified all the data entries.
The docking structures analyzed in previous studies are based on the sequence of the wildtype (WT) strain of SARS-CoV-2. Wierbowski et al. [23] constructed a 3D-structured SARS-CoV-2 interactome showing the calculated binding interface of multiple proteins within the interaction between viral docking and human interactors. Barh et al. [24] detected gradually increasing transmissibility and declining pathogenicity of SARS-CoV-2 WT and variants Gamma, Delta, and BA.1 through sequence and structure analysis. The docking of BA.2 variant proteins with host receptor proteins has not been systematically analyzed and obtaining the experimental docking structures requires extensive experiments. Furthermore, these datasets require further integration and adoption of the novel mutant sequences of SARS-CoV-2 variants.
In this study, an interaction network with 2385 pairs of interactions between 29 viral proteins and 1144 human proteins was constructed. Among the 196 interaction pairs of the S protein, the crystal structures and protein complexes were collected. On the other hand, for the direct interactions and physical associations that may have structural binding in the integrated network, the structures of the two interacting proteins were collected respectively for docking. In comparison with the crystal structures and protein complexes of the SARS-CoV-2 wildtype (WT) strain, this study obtained some reliable computational docking structures of the S protein of the variant strain and the human receptor protein. The proposed integrated framework of the PPI network (macro view) and the virus–host protein binding (micro view) was named CoVM2. Calculations of affinities for these structures revealed differences from the WT strain proteins. Figure 1 illustrates the workflow of the research.

2. Materials and Methods

2.1. Collection of Datasets and Network Construction

The SARS-CoV-2 and human interaction pairs were obtained from Liu’s work [15], IntAct [18], and VirHost [19]. All the interaction data from different sources were integrated to construct the protein interaction network. The PPI network was integrated and visualized by Cytoscape [25]. Node degree is one of the network’s topological features indicating how many interactions are connected with a protein in the PPI network to identify specific target proteins [26]. The strength of interaction is determined through the detection methods, which can be a measurement of an exact PPI or can imply the affinity of the proteins [27]. Here, the degree was regarded as a necessary parameter to screen the human proteins that are more likely to interact with an individual SARS-CoV-2 protein. Metascape [28] analysis was conducted on the subgroup of S protein’s isolated interactions (node degree is 1) for validation in biological processes.
All protein sequences of SARS-CoV-2 and human are retrieved from UniProt [6]. The canonical sequence of SARS-CoV-2 WT S protein was retrieved from UniProt ID P0DTC2. It was then processed by an in-house python script to replace the amino acids in the mutation site to generate the variant sequences. The 3D structures of full-length proteins or partial regions were retrieved in the Protein Data Bank (PDB) database [20].

2.2. Homology Modeling for SARS-CoV-2 Variants

Other than the main viral proteins with 3D structures that have been solved experimentally, some of the variant strains’ structures have not been solved at the time this study was completed. Modeling of the structurally unknown viral proteins was performed with SWISS-MODEL [29]. Homology templates were sorted with multiple evaluation metrics such as global model quality estimation (GMQE) and sequence identity. The S protein receptor-binding domain (RBD) is composed of 223 residues from 319 to 541 in its sequence. The RBD of Omicron BA.2 was modeled by the template of 7WBP, chain B [30]. Chain B (319–541) of the structures 6M0J, 7WBQ, and 7WBP were extracted in PyMOL as the S protein RBD of WT, Delta, and Omicron BA.1, respectively [30,31].

2.3. Quality Assessment of Protein Structures

Procheck in the Structural Analysis and Verification Server (SAVES) [32] was applied to verify the modeled structure. Previous experiments suggested that structures with over 90% residues in the most favored regions of the Ramachandran plot can be regarded as accurate models [33,34]. However, the cutoff set in our research influenced the availability of models of the required PPIs. There were not sufficient native structures meeting the threshold either based on the resolution. Hence, in this research structures with less than 80% of residues in the most favored regions were filtered out according to Williams et al. [35]. In addition, ProSA-web-Protein Structure Analysis [36] is leveraged to verify whether the predicted structure has a required z-score. Usually the z-score is below zero, and it should be within the range of scores typically found in similar-size proteins solved by X-rays or NMR [35]. The root mean square deviation value (RMSD) is used to measure the structural similarity of the backbone compared with the experimentally solved template structure. Structures with a lower RMSD value are considered reliable [37], and values less than 2 Å would be considered similar. Due to lack of available crystal structures as models, protein structures that satisfy any of the quality assessments, which reach the best of a single parameter will be preserved for downstream docking calculation. We chose to keep the models which at least satisfied the z-score calculation.

2.4. Molecular Docking of SARS-CoV-2–Human Protein Pairs

The docking of the SARS-CoV-2–human protein pairs was performed by the HDOCK standalone package [38]. For each docking pair, the human protein was set as the receptor, and the viral protein was the ligand to generate docking structure models. The models with the lowest docking scores were screened as the candidate docking structures. The candidates were selected by comparison with experimental structures by the RMSD value by the align function in PyMOL. The PRODIGY program [39] was used to measure the binding affinity of different ligands and receptors.

3. Results

3.1. SARS-CoV-2–Human Protein Interaction Network

The integrated SARS-CoV-2–human protein interaction network was constructed and visualized (Figure 2A). Physical associations, direct interactions, and interactions of SARS-CoV-2 and 693 hub human proteins from Liu’s research [15] were integrated. Finally, 2385 pairs of interactions between 29 SARS-CoV-2 proteins and 1144 human proteins were included in the network. Annotations of nodes (proteins), edges (PPIs), and topology features were achieved from the NetworkAnalyzer and UniProt (Table S1).

3.2. Biological Process Enrichment of Spike Glycoprotein Subnetwork

S protein and 196 interacting human proteins were extracted from the whole PPI network (Figure 2B). The isolated interactions of S protein included 46 human proteins with no interactions with other viral proteins, which were under gene ontology enrichment analysis. By biological process, AXL, EGFR, LDLR, NRP1, CLEC4M, WWP1, WWP2, HAVCR1, CD209, and ACE2 were enriched in the term “viral entry into host cell” (GO:0046718), indicating the enriched cluster’s potential role in SARS-CoV-2 invasion into cells (Figure 2B, Table S2). Evidence of these proteins functioning as coronavirus spike receptors has been collected. CD209 is known as the receptor of severe acute respiratory syndrome (SARS) spike protein [40]. CLC4M is another receptor of SARS spike protein [41]. The structures of potential SARS-CoV-2-invasion-related proteins were collected as receptors for the docking calculation (Table 1).

3.3. Crystal Structures of Complexes of Binding Pairs

Forty-two binding structures of PPI pairs from multiple studies [10,30,31,42,43,44,45,46,47,48,49,50,51] were retrieved from the RCSB PDB database (Table S3). Four structures with non-variant SARS-CoV-2 proteins were selected for mapping with the edges in the PPI network as the demonstration of the coronavirus macro-to-micro (CoVM2) view (Figure 3), and the 3D structures were presented by Mol* viewer [52]. Structures of the variant spike glycoproteins bound with ACE2 protein were also collected. The S protein binding structures of the variants Delta B.1.617.2 and Omicron BA.1 (B.1.1.529) are available as 7WBP and 7WBQ from Han’s research [30] where the structure of Omicron BA.2 was not included. Quality assessment was run on the models of SARS-CoV-2 S proteins and the interactors (Table 2). The sequence identity of SARS-CoV-2 Omicron BA.2 S protein with the protein sequence in model 7WBP is 97.76%. The binding affinity values of different variant S proteins with ACE2 were calculated, and binding interfaces were highlighted in Figure 4.

3.4. The Predicted Docking Structures of Potential S-Protein-Binding Pairs

The variant S proteins’ structures were used as ligands to perform the docking processes with the human receptors. Thirty-six docking structures of nine human proteins with S proteins of four SARS-CoV-2 variants were modeled. Each pair of SARS-CoV-2–human docking structures generated 100 models via the HDOCK program. Among the models, the model with the lowest score was selected. The structure of the Omicron BA.2 variant S protein binding with ACE2 had an RMSD value of 0.43 with the experimental structure. The free energy values of the selected docking structures were calculated by PRODIGY (File S1). For similar formations in the same viral–human docking pairs, the binding affinities varied among different variants (Figure 4E). The binding states were assigned to the PPI network for revealing the patterns by different variant subnetworks. These structures can serve as reference data for the docking of SARS-CoV-2–human proteins that have not been solved by experimental methods.

3.5. The Macro-to-Micro View of the SARS-CoV-2–Human Protein Associations

The coronavirus macro-to-micro (CoVM2) view is a PPI network integration with possible docking structures of SARS-CoV-2–human PPI pairs. Experimental and computational docking structures have been mapped to corresponding interactions. The alterations of the SARS-CoV-2 variant S protein interaction subnetwork under the differences in their binding states were achieved. OverCOVID [21] is a web portal for integrating and sharing bioinformatics resources and information on COVID-19. It has included publicly available webservers, databases, and tools associated with SARS-CoV-2 on its resource page. The CoVM2 network and structural datasets are available to access on OverCOVID’s new page (http://bis.zju.edu.cn/overcovid/covm2, accessed on 7 June 2022), which could be a great expansion of this field of research. At the top of the website is the visualization of the integrated SARS-CoV-2–human PPI network. By clicking the panel, users can view the interactive functions such as the protein detail tables. Users can scroll down the page to find the download link of the data sets including network, structural details of virus–human protein binding, and python scripts.

4. Discussion

Most proteomics and structural studies on SARS-CoV-2–host interactions have focused on the spike–ACE2 interaction. More data have accumulated around S protein interactions based on its important mechanism. However, with the discovery of other viral proteins’ mechanisms, more interactions with host cell proteins can be investigated. On the enriched cluster, which potentially plays a critical role in viral invasion, the large-scale analysis of variant protein structure docking has not been conducted yet.
In this study, the binding states of SARS-CoV-2 S protein with more known or potential human receptor proteins were achieved. The influence of variant sequences in the docking structures was also considered. The sequence of the Omicron BA.2 variant as a new ligand was implemented in the SARS-CoV-2–human protein docking. A change in the viral proteins’ binding affinity with human proteins was observed.
The SARS-CoV-2–human PPI network is a macroscopic map of the node–node binary associations. It requires combination with the details of protein docking to assess the actual situation of a particular variant protein on viral invasion from a microscopic perspective [53]. The impact of structural changes and the affinity of docking structures through sequence mutations still require further evaluation. With highly accurate methods of molecular dynamics simulations and hotspot predictions [54], more analyses can be conducted on all SARS-CoV-2 proteins and human proteins related to the infection process.
Once the structures of the proteins of emerging SARS-CoV-2 variants are experimentally solved, the association of these proteins with host proteins will be better understood. The topology of the interaction network of each variant strain and the characteristics of the virus-host protein docking interface will provide more help for the prevention and treatment of SARS-CoV-2.
Gene ontology enrichment analysis can also yield enrichment for subcellular locations of human proteins that also need to be analyzed in more detail. Studies of the SARS-CoV-2 interactome [55] have revealed multiple subcellular localizations associated with viral entry, replication, gene expression, and effects on immune responses. Defining the docking events that occur at these subcellular localizations and mining the characteristics of the docking interface under various conditions could help understand the relevant mechanisms of SARS-CoV-2.
In this study, functional analysis and structural docking calculations were performed on the interaction between the viral protein and the host. It was found that the mutations in sequences may influence the structural affinity. The Omicron BA.2 variant has a stronger spike–ACE2 binding affinity as compared to other variants. In binding with AXL and CD209, the BA.1 spike had stronger affinities than the others as well. Compared with BA.2, the S protein of the Delta variant showed weaker affinity with receptors. However, when the Delta variant was compared to BA.1, the opposite relationship was observed in the different protein dockings. Wu et al. [56] provided similar results showing stronger affinity in the Delta S protein RBD binding with human proteins. The results could be related to different single-point mutations in their sequences [57]. The connections of binding affinity with transmissibility, pathogenicity, or stability require further investigation [24].
These results suggested the existence of potential receptors of viral proteins and their binding state alteration with different variants. However, the affinity of the docking structure could also be affected physically or biologically. Combining some specific experimental conditions and more datasets could improve the analysis and is helpful for SARS-CoV-2 protein receptor identification, drug design, and repurposing.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biom12081067/s1, Table S1: SARS-CoV-2–human protein interaction network and the protein annotations, Table S2: Annotations and enrichment analysis of SARS-CoV-2 S protein interactors; Table S3: Experimental binding structures of SARS-CoV-2–human PPI pairs; File S1: prodigy_results.zip.

Author Contributions

Conceptualization, H.C. and M.C.; methodology, H.C. and X.H.; software, J.Z.; investigation, Y.H.; resources, M.C.; data curation, H.C., X.H. and Y.H.; writing—original draft preparation, H.C. and X.H.; writing—review and editing, H.C. and M.C.; visualization, H.C. and X.H.; supervision, M.C.; project administration, M.C.; funding acquisition, M.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China [2016YFA0501704, 2018YFC0310602]; the National Natural Sciences Foundation of China [31771477, 32070677]; the 151 talent project of Zhejiang Province (first level), Jiangsu Collaborative Innovation Center for Modern Crop Production and Collaborative Innovation Center for Modern Crop Production co-sponsored by province and ministry.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data available at http://bis.zju.edu.cn/overcovid/covm2 (accessed on 6 June 2022).

Acknowledgments

Liangwei Wu and Mengxuan He are acknowledged for their efforts in collection of human receptor proteins. Cong Feng is acknowledged for web page publishing.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hu, B.; Guo, H.; Zhou, P.; Shi, Z.-L. Characteristics of SARS-CoV-2 and COVID-19. Nat. Rev. Microbiol. 2021, 19, 141–154. [Google Scholar] [CrossRef] [PubMed]
  2. Khorsand, B.; Savadi, A.; Naghibzadeh, M. SARS-CoV-2-Human Protein-Protein Interaction Network. Inform. Med. Unlocked 2020, 20, 100413. [Google Scholar] [CrossRef] [PubMed]
  3. Letko, M.; Marzi, A.; Munster, V. Functional Assessment of Cell Entry and Receptor Usage for SARS-CoV-2 and Other Lineage B Betacoronaviruses. Nat. Microbiol. 2020, 5, 562–569. [Google Scholar] [CrossRef]
  4. Huang, Y.; Yang, C.; Xu, X.; Xu, W.; Liu, S. Structural and Functional Properties of SARS-CoV-2 Spike Protein: Potential Antivirus Drug Development for COVID-19. Acta Pharmacol. Sin. 2020, 41, 1141–1149. [Google Scholar] [CrossRef] [PubMed]
  5. Li, J.; Lai, S.; Gao, G.F.; Shi, W. The Emergence, Genomic Diversity and Global Spread of SARS-CoV-2. Nature 2021, 600, 408–418. [Google Scholar] [CrossRef] [PubMed]
  6. The UniProt Consortium UniProt: A Worldwide Hub of Protein Knowledge. Nucleic Acids Res. 2019, 47, D506–D515. [CrossRef] [PubMed]
  7. Singh, M.; Bansal, V.; Feschotte, C. A Single-Cell RNA Expression Map of Human Coronavirus Entry Factors. Cell Rep. 2020, 32. [Google Scholar] [CrossRef]
  8. Baggen, J.; Vanstreels, E.; Jansen, S.; Daelemans, D. Cellular Host Factors for SARS-CoV-2 Infection. Nat. Microbiol. 2021, 6, 1219–1232. [Google Scholar] [CrossRef]
  9. Wang, S.; Qiu, Z.; Hou, Y.; Deng, X.; Xu, W.; Zheng, T.; Wu, P.; Xie, S.; Bian, W.; Zhang, C.; et al. AXL Is a Candidate Receptor for SARS-CoV-2 That Promotes Infection of Pulmonary and Bronchial Epithelial Cells. Cell Res. 2021, 31, 126–140. [Google Scholar] [CrossRef]
  10. Daly, J.L.; Simonetti, B.; Klein, K.; Chen, K.-E.; Williamson, M.K.; Antón-Plágaro, C.; Shoemark, D.K.; Simón-Gracia, L.; Bauer, M.; Hollandi, R.; et al. Neuropilin-1 Is a Host Factor for SARS-CoV-2 Infection. Science 2020, 370, 861–865. [Google Scholar] [CrossRef]
  11. Cantuti-Castelvetri, L.; Ojha, R.; Pedro, L.D.; Djannatian, M.; Franz, J.; Kuivanen, S.; van der Meer, F.; Kallio, K.; Kaya, T.; Anastasina, M.; et al. Neuropilin-1 Facilitates SARS-CoV-2 Cell Entry and Infectivity. Science 2020, 370, 856–860. [Google Scholar] [CrossRef] [PubMed]
  12. Amraei, R.; Yin, W.; Napoleon, M.A.; Suder, E.L.; Berrigan, J.; Zhao, Q.; Olejnik, J.; Chandler, K.B.; Xia, C.; Feldman, J.; et al. CD209L/L-SIGN and CD209/DC-SIGN Act as Receptors for SARS-CoV-2. ACS Cent. Sci. 2021, 7, 1156–1165. [Google Scholar] [CrossRef] [PubMed]
  13. Varjosalo, M.; Sacco, R.; Stukalov, A.; van Drogen, A.; Planyavsky, M.; Hauri, S.; Aebersold, R.; Bennett, K.L.; Colinge, J.; Gstaiger, M.; et al. Interlaboratory Reproducibility of Large-Scale Human Protein-Complex Analysis by Standardized AP-MS. Nat. Methods 2013, 10, 307–314. [Google Scholar] [CrossRef] [PubMed]
  14. Tian, B.; Zhao, C.; Gu, F.; He, Z. A Two-Step Framework for Inferring Direct Protein-Protein Interaction Network from AP-MS Data. BMC Syst. Biol. 2017, 11, 82. [Google Scholar] [CrossRef]
  15. Liu, X.; Huuskonen, S.; Laitinen, T.; Redchuk, T.; Bogacheva, M.; Salokas, K.; Pöhner, I.; Öhman, T.; Tonduru, A.K.; Hassinen, A.; et al. SARS-CoV-2–Host Proteome Interactions for Antiviral Drug Discovery. Mol. Syst. Biol. 2021, 17, e10396. [Google Scholar] [CrossRef]
  16. Roux, K.J.; Kim, D.I.; Raida, M.; Burke, B. A Promiscuous Biotin Ligase Fusion Protein Identifies Proximal and Interacting Proteins in Mammalian Cells. J. Cell Biol. 2012, 196, 801–810. [Google Scholar] [CrossRef] [PubMed]
  17. Gordon, D.E.; Jang, G.M.; Bouhaddou, M.; Xu, J.; Obernier, K.; White, K.M.; O’Meara, M.J.; Rezelj, V.V.; Guo, J.Z.; Swaney, D.L.; et al. A SARS-CoV-2 Protein Interaction Map Reveals Targets for Drug Repurposing. Nature 2020, 583, 459–468. [Google Scholar] [CrossRef]
  18. Orchard, S.; Ammari, M.; Aranda, B.; Breuza, L.; Briganti, L.; Broackes-Carter, F.; Campbell, N.H.; Chavali, G.; Chen, C.; Del-Toro, N. The MIntAct Project—IntAct as a Common Curation Platform for 11 Molecular Interaction Databases. Nucleic Acids Res. 2014, 42, D358–D363. [Google Scholar] [CrossRef]
  19. Guirimand, T.; Delmotte, S.; Navratil, V. VirHostNet 2.0: Surfing on the Web of Virus/Host Molecular Interactions Data. Nucleic Acids Res. 2015, 43, D583–D587. [Google Scholar] [CrossRef]
  20. Berman, H.M.; Westbrook, J.; Feng, Z.; Gilliland, G.; Bhat, T.N.; Weissig, H.; Shindyalov, I.N.; Bourne, P.E. The Protein Data Bank. Nucleic Acids Res. 2000, 28, 235–242. [Google Scholar] [CrossRef]
  21. Ahsan, M.A.; Liu, Y.; Feng, C.; Hofestädt, R.; Chen, M. OverCOVID: An Integrative Web Portal for SARS-CoV-2 Bioinformatics Resources. J. Integr. Bioinforma. 2021, 18, 9–17. [Google Scholar] [CrossRef]
  22. Satyam, R.; Yousef, M.; Qazi, S.; Bhat, A.M.; Raza, K. COVIDium: A COVID-19 Resource Compendium. Database J. Biol. Databases Curation 2021, 2021, baab057. [Google Scholar] [CrossRef]
  23. Wierbowski, S.D.; Liang, S.; Liu, Y.; Chen, Y.; Gupta, S.; Andre, N.M.; Lipkin, S.M.; Whittaker, G.R.; Yu, H. A 3D Structural SARS-CoV-2–Human Interactome to Explore Genetic and Drug Perturbations. Nat. Methods 2021, 18, 1477–1488. [Google Scholar] [CrossRef] [PubMed]
  24. Barh, D.; Tiwari, S.; Gomes, L.G.R.; Pinto, C.H.R.; Andrade, B.S.; Ahmad, S.; Aljabali, A.A.A.; Alzahrani, K.J.; Banjer, H.J.; Hassan, S.S.; et al. SARS-CoV-2 Variants Show a Gradual Declining Pathogenicity and pro-Inflammatory Cytokine Spur, an Increasing Antigenic and Antiinflammatory Cytokine Induction, and Rising Structural Protein Instability. bioRxiv 2022, 2022. [Google Scholar] [CrossRef]
  25. Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef] [PubMed]
  26. Gursoy, A.; Keskin, O.; Nussinov, R. Topological Properties of Protein Interaction Networks from a Structural Perspective. Biochem. Soc. Trans. 2008, 36, 1398. [Google Scholar] [CrossRef] [PubMed]
  27. Vakser, I.A. Challenges in Protein Docking. Curr. Opin. Struct. Biol. 2020, 64, 160–165. [Google Scholar] [CrossRef] [PubMed]
  28. Zhou, Y.; Zhou, B.; Pache, L.; Chang, M.; Khodabakhshi, A.H.; Tanaseichuk, O.; Benner, C.; Chanda, S.K. Metascape Provides a Biologist-Oriented Resource for the Analysis of Systems-Level Datasets. Nat. Commun. 2019, 10, 1523. [Google Scholar] [CrossRef]
  29. Waterhouse, A.; Bertoni, M.; Bienert, S.; Studer, G.; Tauriello, G.; Gumienny, R.; Heer, F.T.; de Beer, T.A.P.; Rempfer, C.; Bordoli, L.; et al. SWISS-MODEL: Homology Modelling of Protein Structures and Complexes. Nucleic Acids Res. 2018, 46, W296–W303. [Google Scholar] [CrossRef]
  30. Han, P.; Li, L.; Liu, S.; Wang, Q.; Zhang, D.; Xu, Z.; Han, P.; Li, X.; Peng, Q.; Su, C.; et al. Receptor Binding and Complex Structures of Human ACE2 to Spike RBD from Omicron and Delta SARS-CoV-2. Cell 2022, 185, 630–640.e10. [Google Scholar] [CrossRef]
  31. Lan, J.; Ge, J.; Yu, J.; Shan, S.; Zhou, H.; Fan, S.; Zhang, Q.; Shi, X.; Wang, Q.; Zhang, L.; et al. Structure of the SARS-CoV-2 Spike Receptor-Binding Domain Bound to the ACE2 Receptor. Nature 2020, 581, 215–220. [Google Scholar] [CrossRef]
  32. Laskowski, R.A.; MacArthur, M.W.; Moss, D.S.; Thornton, J.M. PROCHECK: A Program to Check the Stereochemical Quality of Protein Structures. J. Appl. Crystallogr. 1993, 26, 283–291. [Google Scholar] [CrossRef]
  33. Sumitha, A.; Devi, P.B.; Hari, S.; Dhanasekaran, R. COVID-19-In Silico Structure Prediction and Molecular Docking Studies with Doxycycline and Quinine. Biomed. Pharmacol. J. 2020, 13, 1185–1193. [Google Scholar] [CrossRef]
  34. Rai, D.K.; Rieder, E. Homology Modeling and Analysis of Structure Predictions of the Bovine Rhinitis B Virus RNA Dependent RNA Polymerase (RdRp). Int. J. Mol. Sci. 2012, 13, 8998–9013. [Google Scholar] [CrossRef] [PubMed]
  35. Williams, M.E.; Cloete, R. Molecular Modeling of Subtype-Specific Tat Protein Signatures to Predict Tat-TAR Interactions That May Be Involved in HIV-Associated Neurocognitive Disorders. Front. Microbiol. 2022, 13, 866611. [Google Scholar] [CrossRef] [PubMed]
  36. Wiederstein, M.; Sippl, M.J. ProSA-Web: Interactive Web Service for the Recognition of Errors in Three-Dimensional Structures of Proteins. Nucleic Acids Res. 2007, 35, W407–W410. [Google Scholar] [CrossRef]
  37. Zhang, Y.; Skolnick, J. The Protein Structure Prediction Problem Could Be Solved Using the Current PDB Library. Proc. Natl. Acad. Sci. USA 2005, 102, 1029–1034. [Google Scholar] [CrossRef] [PubMed]
  38. Yan, Y.; Zhang, D.; Zhou, P.; Li, B.; Huang, S.-Y. HDOCK: A Web Server for Protein–Protein and Protein–DNA/RNA Docking Based on a Hybrid Strategy. Nucleic Acids Res. 2017, 45, W365–W373. [Google Scholar] [CrossRef] [PubMed]
  39. Xue, L.C.; Rodrigues, J.P.; Kastritis, P.L.; Bonvin, A.M.; Vangone, A. PRODIGY: A Web Server for Predicting the Binding Affinity of Protein–Protein Complexes. Bioinformatics 2016, 32, 3676–3678. [Google Scholar] [CrossRef] [PubMed]
  40. Jeffers, S.A.; Tusell, S.M.; Gillim-Ross, L.; Hemmila, E.M.; Achenbach, J.E.; Babcock, G.J.; Thomas, W.D.; Thackray, L.B.; Young, M.D.; Mason, R.J.; et al. CD209L (L-SIGN) Is a Receptor for Severe Acute Respiratory Syndrome Coronavirus. Proc. Natl. Acad. Sci. USA 2004, 101, 15748–15753. [Google Scholar] [CrossRef]
  41. Chan, V.S.F.; Chan, K.Y.K.; Chen, Y.; Poon, L.L.M.; Cheung, A.N.Y.; Zheng, B.; Chan, K.-H.; Mak, W.; Ngan, H.Y.S.; Xu, X.; et al. Homozygous L-SIGN (CLEC4M) Plays a Protective Role in SARS Coronavirus Infection. Nat. Genet. 2006, 38, 38–46. [Google Scholar] [CrossRef] [PubMed]
  42. Klemm, T.; Ebert, G.; Calleja, D.J.; Allison, C.C.; Richardson, L.W.; Bernardini, J.P.; Lu, B.G.; Kuchel, N.W.; Grohmann, C.; Shibata, Y.; et al. Mechanism and Inhibition of the Papain-like Protease, PLpro, of SARS-CoV-2. EMBO J. 2020, 39, e106275. [Google Scholar] [CrossRef] [PubMed]
  43. Thoms, M.; Buschauer, R.; Ameismeier, M.; Koepke, L.; Denk, T.; Hirschenberger, M.; Kratzat, H.; Hayn, M.; Mackens-Kiani, T.; Cheng, J.; et al. Structural Basis for Translational Shutdown and Immune Evasion by the Nsp1 Protein of SARS-CoV-2. Science 2020, 369, 1249–1255. [Google Scholar] [CrossRef] [PubMed]
  44. Gao, X.; Zhu, K.; Qin, B.; Olieric, V.; Wang, M.; Cui, S. Crystal Structure of SARS-CoV-2 Orf9b in Complex with Human TOM70 Suggests Unusual Virus-Host Interactions. Nat. Commun. 2021, 12, 2843. [Google Scholar] [CrossRef]
  45. Yan, R.; Zhang, Y.; Li, Y.; Ye, F.; Guo, Y.; Xia, L.; Zhong, X.; Chi, X.; Zhou, Q. Structural Basis for the Different States of the Spike Protein of SARS-CoV-2 in Complex with ACE2. Cell Res. 2021, 31, 717–719. [Google Scholar] [CrossRef]
  46. Szeto, C.; Chatzileontiadou, D.S.M.; Nguyen, A.T.; Sloane, H.; Lobos, C.A.; Jayasinghe, D.; Halim, H.; Smith, C.; Riboldi-Tunnicliffe, A.; Grant, E.J.; et al. The Presentation of SARS-CoV-2 Peptides by the Common HLA-A∗02:01 Molecule. iScience 2021, 24, 102096. [Google Scholar] [CrossRef]
  47. Wu, D.; Kolesnikov, A.; Yin, R.; Guest, J.D.; Gowthaman, R.; Shmelev, A.; Serdyuk, Y.; Dianov, D.V.; Efimov, G.A.; Pierce, B.G.; et al. Structural Assessment of HLA-A2-Restricted SARS-CoV-2 Spike Epitopes Recognized by Public and Private T-Cell Receptors. Nat. Commun. 2022, 13, 19. [Google Scholar] [CrossRef]
  48. Chaurasia, P.; Nguyen, T.H.O.; Rowntree, L.C.; Juno, J.A.; Wheatley, A.K.; Kent, S.J.; Kedzierska, K.; Rossjohn, J.; Petersen, J. Structural Basis of Biased T Cell Receptor Recognition of an Immunodominant HLA-A2 Epitope of the SARS-CoV-2 Spike Protein. J. Biol. Chem. 2021, 297, 101065. [Google Scholar] [CrossRef]
  49. Javorsky, A.; Humbert, P.O.; Kvansakul, M. Structural Basis of Coronavirus E Protein Interactions with Human PALS1 PDZ Domain. Commun. Biol. 2021, 4, 724. [Google Scholar] [CrossRef]
  50. Biswal, M.; Lu, J.; Song, J. SARS-CoV-2 Nucleocapsid Protein Targets a Conserved Surface Groove of the NTF2-like Domain of G3BP1. J. Mol. Biol. 2022, 434, 167516. [Google Scholar] [CrossRef]
  51. McCallum, M.; Czudnochowski, N.; Rosen, L.E.; Zepeda, S.K.; Bowen, J.E.; Walls, A.C.; Hauser, K.; Joshi, A.; Stewart, C.; Dillen, J.R.; et al. Structural Basis of SARS-CoV-2 Omicron Immune Evasion and Receptor Engagement. Science 2022, 375, 864–868. [Google Scholar] [CrossRef] [PubMed]
  52. Sehnal, D.; Bittrich, S.; Deshpande, M.; Svobodová, R.; Berka, K.; Bazgier, V.; Velankar, S.; Burley, S.K.; Koča, J.; Rose, A.S. Mol* Viewer: Modern Web App for 3D Visualization and Analysis of Large Biomolecular Structures. Nucleic Acids Res. 2021, 49, W431–W437. [Google Scholar] [CrossRef]
  53. Padhi, A.K.; Rath, S.L.; Tripathi, T. Accelerating COVID-19 Research Using Molecular Dynamics Simulation. J. Phys. Chem. B 2021, 125, 9078–9091. [Google Scholar] [CrossRef] [PubMed]
  54. Li, Z.; Zhang, J.Z.H. Mutational Effect of Some Major COVID-19 Variants on Binding of the S Protein to ACE2. Biomolecules 2022, 12, 572. [Google Scholar] [CrossRef] [PubMed]
  55. Karunakaran, K.B.; Balakrishnan, N.; Ganapathiraju, M.K. Interactome of SARS-CoV-2 Modulated Host Proteins With Computationally Predicted PPIs: Insights From Translational Systems Biology Studies. Front. Syst. Biol. 2022, 2. [Google Scholar] [CrossRef]
  56. Wu, L.; Zhou, L.; Mo, M.; Liu, T.; Wu, C.; Gong, C.; Lu, K.; Gong, L.; Zhu, W.; Xu, Z. SARS-CoV-2 Omicron RBD Shows Weaker Binding Affinity than the Currently Dominant Delta Variant to Human ACE2. Signal Transduct. Target. Ther. 2022, 7, 8. [Google Scholar] [CrossRef]
  57. Kumar, S.; Thambiraja, T.S.; Karuppanan, K.; Subramaniam, G. Omicron and Delta Variant of SARS-CoV-2: A Comparative Computational Study of Spike Protein. J. Med. Virol. 2022, 94, 1641–1649. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The integration of SARS-CoV-2–human PPI network and virus-host protein structural binding state. The PPI network was merged with experimental PPIs from multiple sources. Binding structures were collected from both experiments and calculations. The 3D view of S protein (Omicron BA.2 variant) and ACE2 protein binding as a demonstration was captured from the structure modeled in this research.
Figure 1. The integration of SARS-CoV-2–human PPI network and virus-host protein structural binding state. The PPI network was merged with experimental PPIs from multiple sources. Binding structures were collected from both experiments and calculations. The 3D view of S protein (Omicron BA.2 variant) and ACE2 protein binding as a demonstration was captured from the structure modeled in this research.
Biomolecules 12 01067 g001
Figure 2. Integrated SARS-CoV-2–human PPI S protein subnetwork and functional enrichment. (A) The subnetwork of S protein with its 196 first neighbors. The isolated interactions (degree = 1) were grouped in blue shadow; (B) GO enrichment of S protein’s isolated interactions. Nodes represent viral (V-shape) or human (ellipse-shape) proteins. Edges represent protein–protein interactions. The nodes are colored according to their degrees. For human proteins, the degree values range from 1 to 9 (from light blue to red shade). Nodes with lower degree values have fewer physical associations or direct interactions with other SARS-CoV-2 proteins.
Figure 2. Integrated SARS-CoV-2–human PPI S protein subnetwork and functional enrichment. (A) The subnetwork of S protein with its 196 first neighbors. The isolated interactions (degree = 1) were grouped in blue shadow; (B) GO enrichment of S protein’s isolated interactions. Nodes represent viral (V-shape) or human (ellipse-shape) proteins. Edges represent protein–protein interactions. The nodes are colored according to their degrees. For human proteins, the degree values range from 1 to 9 (from light blue to red shade). Nodes with lower degree values have fewer physical associations or direct interactions with other SARS-CoV-2 proteins.
Biomolecules 12 01067 g002
Figure 3. The experimental SARS-CoV-2–human binding structures mapped to PPIs and PPI clusters. Mapped structures are: S protein’s receptor-binding domain (RBD) bound with ACE2 (PDB ID: 6M0J [31]); S protein’s RBD bound with NRP1 (PDB ID: 7JJC [10]); ORF9b complex with human TOM70 (PDB ID: 7DHG [44]); Nsp1 bound to the human 40S ribosomal subunit (PDB ID: 6ZLW [43]). PDB 3D structures were presented by Mol* Viewer [52].
Figure 3. The experimental SARS-CoV-2–human binding structures mapped to PPIs and PPI clusters. Mapped structures are: S protein’s receptor-binding domain (RBD) bound with ACE2 (PDB ID: 6M0J [31]); S protein’s RBD bound with NRP1 (PDB ID: 7JJC [10]); ORF9b complex with human TOM70 (PDB ID: 7DHG [44]); Nsp1 bound to the human 40S ribosomal subunit (PDB ID: 6ZLW [43]). PDB 3D structures were presented by Mol* Viewer [52].
Biomolecules 12 01067 g003
Figure 4. States of human proteins binding with SARS-CoV-2 variants S proteins. (AC) Experimental structures of human ACE2 protein binding with S proteins (PDB ID: 6M0J, 7WBQ, and 7WBP); (D) Calculated docking structure of human ACE2 protein binding with Omicron BA.2 S protein. The docking temperature is 37 °C. The binding interfaces are highlighted in deep blue and pink colors; (E) The ΔG values represent free energy of human proteins binding with variants S proteins.
Figure 4. States of human proteins binding with SARS-CoV-2 variants S proteins. (AC) Experimental structures of human ACE2 protein binding with S proteins (PDB ID: 6M0J, 7WBQ, and 7WBP); (D) Calculated docking structure of human ACE2 protein binding with Omicron BA.2 S protein. The docking temperature is 37 °C. The binding interfaces are highlighted in deep blue and pink colors; (E) The ΔG values represent free energy of human proteins binding with variants S proteins.
Biomolecules 12 01067 g004
Table 1. Structures of SARS-CoV-2 S protein’s isolated interactions.
Table 1. Structures of SARS-CoV-2 S protein’s isolated interactions.
Gene NameUniProtSpike BindingTemplate for DockingChainNote
WWP2O00308 4Y07A
NRP1O147867JJC7JJCA
EGFRP00533 1IVOA
AXLP30530 Modeled structure Homology template 5VXZ.1.B
HAVCR1Q96D42 5DZOA
ACE2Q9BYF16M0J, 7WBQ, 7WBP6M0JA
WWP1Q9H0M0 1ND7A
CLEC4MQ9H2X3 1K9JB
CD209Q9NNX6 1SL4A
TMPRSS2O15393 7MEQA
Table 2. Summary of the quality assessment scores for the models of SARS-CoV-2 S proteins and the interactors.
Table 2. Summary of the quality assessment scores for the models of SARS-CoV-2 S proteins and the interactors.
ModelPercentage in Allowed RegionZ-ScoreRMSDGMQESequence Identity
Spike Omicron BA.287.50%−5.590.1540.7597.76%
Spike Omicron BA.184.10%−5.81
Spike Delta83.90%−6.04
Spike WT89.30%−5.87
AXL95.10%−4.490.1680.9395.10%
WWP292.30%−7.68
ACE293.30%−12.87
EGFR66.10%−8.46
HAVR189.10%−4.3
WWP185.20%−8.32
CLC4M89.40%−6.74
CD20991.40%−4.94
TMPS287.90%−7.11
NRP187.60%−5.18
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chen, H.; Hu, X.; Hu, Y.; Zhou, J.; Chen, M. CoVM2: Molecular Biological Data Integration of SARS-CoV-2 Proteins in a Macro-to-Micro Method. Biomolecules 2022, 12, 1067. https://doi.org/10.3390/biom12081067

AMA Style

Chen H, Hu X, Hu Y, Zhou J, Chen M. CoVM2: Molecular Biological Data Integration of SARS-CoV-2 Proteins in a Macro-to-Micro Method. Biomolecules. 2022; 12(8):1067. https://doi.org/10.3390/biom12081067

Chicago/Turabian Style

Chen, Hongjun, Xiaotian Hu, Yanshi Hu, Jiawen Zhou, and Ming Chen. 2022. "CoVM2: Molecular Biological Data Integration of SARS-CoV-2 Proteins in a Macro-to-Micro Method" Biomolecules 12, no. 8: 1067. https://doi.org/10.3390/biom12081067

APA Style

Chen, H., Hu, X., Hu, Y., Zhou, J., & Chen, M. (2022). CoVM2: Molecular Biological Data Integration of SARS-CoV-2 Proteins in a Macro-to-Micro Method. Biomolecules, 12(8), 1067. https://doi.org/10.3390/biom12081067

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop