Next Article in Journal
The Chloronium Cation [(C2H3)2Cl+] and Unsaturated C4-Carbocations with C=C and C≡C Bonds in Their Solid Salts and in Solutions: An H1/C13 NMR and Infrared Spectroscopic Study
Next Article in Special Issue
Differentiation of SARS-CoV-2 Variants Using RT-qPCRs by Targeting Recurrent Mutation Sites: A Diagnostic Laboratory Experience from Multi-Center Regional Study, August 2020–December 2021, Poland
Previous Article in Journal
Antibacterial Soy Protein Isolate Prepared by Quaternization
Previous Article in Special Issue
Structural Evolution of Delta (B.1.617.2) and Omicron (BA.1) Spike Glycoproteins
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Perspective

Role of the Pangolin in Origin of SARS-CoV-2: An Evolutionary Perspective

by
Shishir K. Gupta
1,2,†,
Rashmi Minocha
3,†,
Prithivi Jung Thapa
1,2,
Mugdha Srivastava
2,4,* and
Thomas Dandekar
2,5,*
1
Evolutionary Genomics Group, Center for Computational and Theoretical Biology, University of Würzburg, 97074 Würzburg, Germany
2
Functional Genomics & Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, University of Würzburg, 97074 Würzburg, Germany
3
Department of Biochemistry, All India Institute of Medical Sciences, New Delhi 110029, India
4
Core Unit Systems Medicine, University of Würzburg, 97080 Würzburg, Germany
5
EMBL Heidelberg, BioComputing Unit, 69117 Heidelberg, Germany
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2022, 23(16), 9115; https://doi.org/10.3390/ijms23169115
Submission received: 13 July 2022 / Revised: 10 August 2022 / Accepted: 12 August 2022 / Published: 14 August 2022
(This article belongs to the Special Issue Genomic Variation and Epidemiology of SARS-CoV-2)

Abstract

:
After the recent emergence of SARS-CoV-2 infection, unanswered questions remain related to its evolutionary history, path of transmission or divergence and role of recombination. There is emerging evidence on amino acid substitutions occurring in key residues of the receptor-binding domain of the spike glycoprotein in coronavirus isolates from bat and pangolins. In this article, we summarize our current knowledge on the origin of SARS-CoV-2. We also analyze the host ACE2-interacting residues of the receptor-binding domain of spike glycoprotein in SARS-CoV-2 isolates from bats, and compare it to pangolin SARS-CoV-2 isolates collected from Guangdong province (GD Pangolin-CoV) and Guangxi autonomous regions (GX Pangolin-CoV) of South China. Based on our comparative analysis, we support the view that the Guangdong Pangolins are the intermediate hosts that adapted the SARS-CoV-2 and represented a significant evolutionary link in the path of transmission of SARS-CoV-2 virus. We also discuss the role of intermediate hosts in the origin of Omicron.

1. Introduction

Since the beginning of the coronavirus disease 2019 (COVID-19) pandemic in 2019, as of 26 June 2022, there have been approximately 574 million confirmed cases of COVID-19 worldwide that have claimed the lives of nearly 6.5 million individuals (weekly epidemiological update on COVID-19, see link: who.int (accessed on 3 August 2022)). The causative agent of COVID-19 has been identified as a novel coronavirus named severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). The outbreak of this disease has caused a massive disruption in the global economy, leaving many developing as well as developed nations in financial turmoil. The rollout of different vaccines has eased the pressure of increasing infection rates. However, the emergence of new variants of SARS-CoV-2 is posing more uncertainty on the level of devastation that may occur in the future. Hence, there is a clear need for in-depth understanding of SARS-CoV-2’s evolutionary mechanisms in order to prepare better for zoonotic transmissions and potential pandemics of the future. There is an overwhelming consensus regarding the origin of SARS-CoV-2 from bats. However, its evolutionary history and path of transmission still remains elusive and thus, a better understanding of the cross-species transmission and evolutionary relationship between SARS-CoV-2 and other highly related coronaviruses (CoVs) is required.
The SARS-CoV-2 virus enters into the host cell through a two-step mechanism: (i) host-cell receptor recognition and (ii) host-virus cell membrane fusion. First, the receptor-binding domain (RBD) of the spike (S) glycoprotein in coronavirus SARS-CoV-2 is employed to interact with the functional angiotensin converting enzyme 2 (ACE2) receptors that are known to be expressed in many human organs [1]. ACE2 has also been recognized as a receptor for human coronavirus HCoV-NL63 and SARS-CoV [2]. Second, this ACE2 receptor recognition by the S-glycoprotein of SARS-CoV-2 leads to significant changes in the conformation of S-glycoprotein, consequently causing a fusion of host–virus cell membrane. In order to fight against the SARS-CoV-2 infection, the host cell generates neutralizing antibodies against the S-glycoprotein. Several research labs are attempting to understand how the virus escapes from neutralization. Thus, a systemic understanding of the RBD-ACE2 interface may provide important insights for such a study. This will also help us to determine active regions in the S-glycoprotein-ACE2 interface that can act as potential targets for designing SARS-CoV-2 drugs based on blocking of the S-glycoprotein-ACE2 interaction. Notably, identification of critical amino acid positions involved in this interaction will further provide us significant information on the evolutionary emergence of SARS-CoV-2 and may answer several outstanding questions related to the origin and path of transmission of SARS-CoV-2.

2. Current Understanding on the Intermediate Host of SARS-CoV-2 and Our Perspective

Since the outbreak of SARS-CoV-2, several independent groups have attempted to shed light on its zoonotic origin and emergence. For instance, bat coronavirus (Bat-CoV) RaTG13, a coronavirus isolated from the bat species Rhinolophus affinis obtained from Yunnan Province, has been found to share the closest whole-genome identity of 96% with SARS-CoV-2 [3]. In another study, cryo-EM structures of spikes of CoVs RaTG13 and GX Pangolin-CoV isolated from bat and pangolin species, respectively, were found to be closely related to the SARS-CoV-2 spike structure [4]. Furthermore, sequence differences of the SARS-CoV-2 spike proteins, its ACE2 interacting efficiencies and pathophysiological responses in humans and bats indicate the circulation and evolution of progenitor virus in single or multiple intermediate hosts, as in the previously existing coronaviruses SARS-CoV and MERS-CoV, where the progenitor virus passed through hosts such as minks, camels, etc., before jumping to humans [5,6,7]. The natural and intermediate hosts co-evolve with the virus and act as reservoirs for the virus replication [8]. Recombination and/or horizontal gene transfer with other closely related viruses inside the intermediate hosts can lead to the virus gaining additional survival and pathogenic features [5].
An intermediate host provides a selective environment that allows a zoonotic virus, not only to multiply massively in a second host but also to acquire mutations and evolve so as to become compatible with new hosts. Thus, it is critical to recognize the intermediate host as a dangerous virus reservoir for a new viral outbreak, and by interfering with viral replication in the intermediate host the viral transmission chain can be disrupted, and further spread can be controlled. Identifying intermediate host species not only permits the introduction of risk mitigation public health strategies, but also offers a deeper understanding of the evolution and pathogenesis of zoonotic diseases. According to our current understanding on the transmission chain of SARS-CoV2, bat is its original host while pangolin is likely its intermediate host, although details of the complete transmission process are still unaddressed [9]. Considering the importance of the intermediate host, two independently published Nature articles, first by Lam et al. (2020) [10] and then the work of Xiao et al. (2020) [11], provided a strong statement that Malayan pangolins are the potential intermediate hosts in the emergence of the novel coronavirus: Xiao et al. (2020) [11] reported the presence of a SARS-CoV-2-like virus (named “Pangolin-CoV”) in 17 out of 25 Malayan pangolins collected from Guangdong province in South China. CoV-infected pangolins showed clinical symptoms of the disease, and 14 infected animals died within a span of 1.5 months [11]. By analyzing the RNAseq data of lung samples from pangolins followed by sequence comparisons, the authors demonstrated high sequence identity between Pangolin-CoV and SARS-CoV-2 genes.
Regarding the paper by Xiao et al. (2020) [11], we argue that the article did not focus well to present critical residues in their CoVs sequence comparison (provided in their Figure 7b), and in consequence, we hereby now show new in our alignment how and where amino acid substitutions occur in the host ACE2 receptor. These ACE2 residues are interacting with important residues of the receptor-binding domain (RBD) of spike glycoprotein (S) in the coronavirus. We show and compare for these isolates from bats and virus isolates from pangolins collected from Guangdong province (GD Pangolin-CoV) and the Guangxi autonomous region (GX Pangolin-CoV) in South China.

3. Sequence Comparative Analysis of Coronavirus Isolates from Bat and Pangolins

To validate the CoVs multiple sequence alignment (MSA), we retrieved the exact sequences as used in the analysis of Xiao et al. (2020) [11] and performed genome annotation of GD Pangolin-CoV (GISAID accession ID: EPI_ISL_410721) with VIGOR [12] to extract S-protein sequences.
Other S-protein sequences from human isolate SARS-CoV-2_WIV02 and bat isolates (Bat-CoV_RaTG13, Bat-SARS-CoV_ZC45 and Bat-SARS-CoV_ZXC21) were retrieved from GenBank. We used L-INS-I, the most accurate algorithm of their software suite “MAFFT” for MSA [13]. Out of the five suggested positions (456, 487, 494, 502 and 506), we found that both in SARS-CoV-2_WIV02 and GD Pangolin-CoV, these positions correspond to Phe, Asn, Ser, Gly and Gln, respectively (Figure 1a,b). The differences found in critical positions from Xiao et al. (2020) [11] are depicted in Figure 1b,c. Moreover, the residues we found at these critical positions are in direct agreement with other recent studies [10,14,15].
The receptor binding domain (RBD) is an important and highly relevant target for identification of neutralizing antibodies and vaccine development [16]. There is a wide mutational space in RBD that can maintain proper folding, expression and generation of consistent phenotypes in SARS-CoV-2 [17]. Recent studies have identified 26 important RBD residues [14,15,18] that are involved in the interaction with the ACE2 receptor and collected more samples [10,19] of CoV-like viruses being isolated from pangolin species. Using information from these analyses, we offer an extension to the study by Xiao et al. (2020) [11] to illuminate the role of pangolins in the origin of SARS-CoV-2. To this end, we mapped all the residue positions according to the Wuhan-Hu-1 reference sequence (GISAID accession ID: EPI_ISL_402125). We observed an interesting trend of mutations here (Figure 2): Rhinolophus affinis isolated Bat-CoV_RaTG13 in 2013 needs at least four non-conservative and two semi-conservative mutations in RBD for human adaptation. Pangolin-CoV isolated from Guangxi [10] in 2017 needs three non-conservative and three semi-conservative mutations for human adaptation. Notably, Pangolin-CoV isolated from Guangdong [11,19] in 2019 requires only one semi-conservative mutation for human adaptation. The calculation of binding energy between soluble human ACE2 and RBD both in the SARS-CoV-2 and pangolin isolated CoVs confirmed that GD Pangolin-CoV binds comparatively more strongly to the ACE2 receptor than SARS-CoV-2 [11,17]. This provides first clear evidence that as compared to the GX Pangolin-CoV, the GD Pangolin-CoV is more likely to be involved in the origin of SARS-CoV-2.
Furthermore, in the sequence of Bat-CoV_RaTG13, we observed Arg at position 494 while the other four critical residues highlighted in [11] were homologous to that of SARS-CoV-2_WIV02 and GD Pangolin-CoV (Figure 1b). GX Pangolin-CoV also had Arg at position 494, which was found to be mutated into Ser in Guangdong pangolins (Figure 2). It is noteworthy that Ser494 strengthens the structural stability of the virus-binding hotspot Lys353 at ACE2 surface [18,20]. On the contrary, [11] reported Gln at position 494 in GD Pangolin-CoV (Figure 1c).
In the sequence of GD Pangolin-CoV, at position 498, Glu was observed while in SARS-CoV-2, His was identified at the same position. Reverting this mutation (His to Glu) at position 498 in SARS-CoV-2 causes a 5-fold increase in its human ACE2 binding affinity as compared to the wild type [21]. Although SARS-CoV-2 is not known to infect porcine cells, a 12-fold increase in infection efficiency in porcine cell lines was observed with Q498H mutation in SARS-CoV-2. Another study showed that when the Q498H mutation was introduced to SARS-CoV-2 RBD, an enhancement of its binding capacity to ACE2 homologs in mouse, rat and European hedgehog was observed, suggesting that pangolin CoVs are also capable of infecting humans [21]. In the sequence of GD Pangolin-CoV, at position 498, Glu was observed while in SARS-CoV-2, His was identified at the same position. Reverting this mutation (His to Glu) at position 498 in SARS-CoV-2 causes a 5-fold increase in its human ACE2 binding affinity as compared to the wild type [21]. Although SARS-CoV-2 is not known to infect porcine cells, a 12-fold increase in infection efficiency in porcine cell lines was observed with Q498H mutation in SARS-CoV-2. Another study showed that when the Q498H mutation was introduced to SARS-CoV-2 RBD, an enhancement of its binding capacity to ACE2 homologs in mouse, rat and European hedgehog was observed, suggesting that pangolin CoVs are also capable of infecting humans [21].
In porcine cell lines, mutation in the amino acid position 498 from Glu to His in GD Pangolin-CoV and GX Pangolin-CoV displayed significant reduction in infectivity with human ACE2 by more than 200-fold as well as 160-fold, respectively [21,22]. In addition, mutation at the same residue 498 from Glu to Tyr in Bat-CoV has been shown to enhance the binding affinity of RBD to ACE2 [17]. These observations confirm that the Q498H mutation in SARS-CoV-2 spike could be a potential threat to humans. Both GD Pangolin-CoV and SARS-CoV-2 with the Q498H spike mutation may infect farm pigs. Hence, this high potential of an outbreak in farm pigs and meat industry indicates vice versa also a clear hazard of reverse anthropo-zoonotic infections. Luckily, the Q498H mutation in SARS-CoV-2 strains is not widespread yet and at the time of writing this article, only 78 strains with spike Q498H mutations have been submitted to the GISAID database [23] as of 7 August 2022. These strains were identified from Europe (36), North America (19), Asia (14), Africa (4), Oceania (4), South America (1). Thus, surveillance of such a host range expansion associated mutation is extremely critical as it may lead to a zoonotic transmission between pangolin–human–pig, which may lead to a new type of zoonotic epidemic [21]. Notably, Q498H mutation is also found in coronavirus isolated from Laos horseshoe bats (R. malayanus (virus BANAL-52), R. marshalli (BANAL-236), R. pusillus (BANAL-103)) that has recently been identified as the closest relative of SARS-CoV-2 [24]. These viruses have >95% identity at sequence level to SARS-CoV-2 [25]. Moreover, out of more than 40 mutations in spike proteins, the current SARS-CoV-2 strain of concern Omicron contains Q498R and K417N. Interestingly, Bat-CoV_RaTG13, Beta (B.1.351 lineage) and Delta variant of SARS-CoV-2 consists of K417N. K417N mutation is known to increase the ACE2 binding and escape binding by SARS-CoV-2 neutralizing monoclonal antibodies (mAbs) such as LY-CoV016 (etesevimab) and C105 [26,27].

4. Role of Genomic Recombination between Bat and Pangolin Coronaviruses

It was identified by Li et al. (2020) that the GD Pangolin-CoV clade was more similar (91.2%) to SARS-CoV-2 than the GX Pangolin-CoV clade (85.4%) [28]. As compared to Bat-CoV, despite its high identity region in the RBD region, GD Pangolin-CoV showed reduced overall sequence similarity compared to SARS-CoV-2. This suggests that the SARS-CoV-2 may be originated by possible recombination events primarily between GD Pangolin-CoV and Bat-CoV RaTG13 or other related CoV’s. Recombination is one of the most widespread mechanisms by which virus genomes are known to evolve and adapt themselves to new hosts and new environments. Recombination in viruses occurs when two or more viruses co-infect the same host organism and exchange part of their genetic segments by “crossover” of their nucleic acid strands. In fact, there is growing evidence for recombination events in viruses to be associated with genetic variability, virus adaptation, new virus emergence, increased virulence, alteration of transmission mechanisms and rates, escape from host immune response and increased resistance to anti-viral drugs [29,30]. Thus, understanding patterns and mechanisms of recombination in viruses will help us to identify potential risks associated with virus adaptation. Recombination events between CoVs from different hosts are being studied in order to explore their role in the origin of SARS-CoV-2 [31]. It was found that recombination is one of the crucial factors contributing to the diversity of beta coronaviruses, and these viruses tend to have recombination in high frequencies [32]. There is strong evidence of purifying selection and recombination of bat-CoV with different CoVs along multiple hosts that led to the evolution on present SARS-CoV-2 [28,33]. In one study, three independent recombination events were identified from SARS-CoV-2 strains and proximal outgroups using the RDP4 algorithm. It was hypothesized that two of these recombination events could have altered the two Pangolin-CoVs. We see routes to two isolates, one that evolved from GX Pangolin-CoV and another from GD Pangolin-CoV. These events also indicate that some fragments in the SARS-CoV-2, Bat-CoV-RaTG13 and Pangolin-CoVs were possibly integrated from bat-CoV ZC45-ZXC21 through recombination [26]. Another study using horizontal gene transfer and recombination analysis revealed statistically significant recombination events between RaTG13 and GD Pangolin-CoV in S and N genes [33]. Additionally, more recombination events have been observed in ORF1ab, S, ORF3a, ORF7a, ORF8 and N genes between ancestors of SARS-CoV-2, RaTG13, GD Pangolin CoV and bat-CoV ZC45-ZXC21 [33]. As bat-CoV is able to infect pangolins, pangolins could be a source and reservoir for gene transfer and recombination between RaTG13 and GD Pangolin-CoV as well as other related CoVs [33]. SARS-CoV-2 and RaTG13 share the highest overall sequence similarity in the S gene. But when focusing specifically on the variable loop of the S-gene, a stronger extent of sequence similarity is observed between SARS-CoV-2 and GD Pangolin-CoV compared to SARS-CoV-2 and RaTG13. This might suggest acquisition of a new RBD sequence from the recombination event between the common ancestral lineage of SARS-CoV-2 and RaTG13. Alternatively, this happened after the lineages leading to the SARS-CoV-2 and RaTG13 split with GD Pangolin-CoV, including further subsequent mutations leading to variations in the RBD [34]. It was also shown that the RaTG13 pseudo viruses are unable to properly bind to ACE2 expressing cells, which means that the recombination event in the S region could have led to the SARS-CoV-2 last ancestral lineage to jump to humans as there is a difference of only one non-conservative mutation between RBD of SARS-CoV-2 and GD Pangolin-CoV [15]. Additionally, the antigenicity of GD Pangolin-CoV is very close to SARS-CoV-2 as compared to that of RATG13 and GX Pangolin-CoV. The GD Pangolin-CoV pseudo-typed viruses are neutralized by the neutralizing antibodies produced against SARS-CoV-2, the vaccine-induced serum and convalescent sera from COVID-19 patients with higher activity than the normal SARS-CoV-2 viruses themselves, whereas in the cases of RATG13 and GX Pangolin-CoV, poor cross-neutralization has been observed. We speculate that the RBD of SARS-CoV-2 has emerged by a combination of incremental adaptive mutations. Our observations strongly support the high identity of RBD of SARS-CoV-2 with Guangdong Pangolin-CoV but not with Guangxi pangolins. This further indicates that from Guangdong Pangolin-CoV to SARS-CoV-2 evolution, only few adaptations are required for ACE2 binding. Of note, geographically in comparison to Guangxi, Guangdong is closer to Wuhan.

5. Discussion

Amidst the mad rush of publications addressing the key question of the transmission of SARS-CoV-2, finding the source of intermediate hosts and involvement of several new hosts will be an essential contribution to study and understand the zoonotic COVID-19 outbreak. Moreover, the importance to consider all involved intermediate hosts applies to zoonotic diseases in general, particularly flu virus, coronavirus and Flaviviridae as well as the notorious vector-borne tropical diseases. Herein, we report the transmission process of SARS-CoV-2 from bats to humans through the pangolins, which can be regarded as an intermediate host for the novel coronavirus. In particular, as established by functional data for bats and pangolins, we can refute earlier publications questioning this and putting the whole zoonotic transition process just on bats as the sole culprit for transmission.
The evolutionary goal of a virus is to spread rather than to kill. To increase its transmission, the virus must reduce its infectivity or severity. If a virus is lethal, it will kill its host quickly, which will limit its ability to multiply and infect, which will ultimately [35] reduce its transmission. We speculate that the His to Glu mutation at the 498 position in SARS-CoV-2 is favorable for the virus to better reach this evolutionary optimum. In SARS-CoV-2 infected patients, the highest percentage of virus shedding, and thus transmissibility, occurs 1 to 2 days before the infected person begins to display symptoms [36]. It has been reported that 40–45% of SARS-CoV-2 infected people remain asymptomatic [37] and such carriers may be able to transmit the virus for a longer period. Altogether, these findings might explain the different SARS-CoV-2 evolutionary trade-offs. Furthermore, we illuminate the scenario that current SARS-CoV-2 originated by the recombination between Guangdong Pangolin-CoV and other Bat-CoV such as R. malayanus [38] or some yet to be identified Bat-CoVs that resulted in the addition of cleavage sites for human furin proteases. This plausible phylogeny emphasizes the importance of genomic recombination events in the evolution of these viruses and seconds previous studies that exclusively rely on and claim the highly recombinogenic nature of coronaviruses [39].
Of note, a host’s adaptive immune system also plays a contributing role in the evolution of a virus. Many recent studies have focused on understanding the involvement of intra-host infection dynamics in the emergence of new variants of SARS-CoV-2, especially in case of a chronic infection. For instance, a recent study identified several mutations in the SARS-CoV-2 genomic sequence that accumulated during persistent COVID-19 infection in an immunosuppressed patient [40]. Interestingly, the viral genotype was relatively stable during the first few weeks of infection, while most of the synonymous and non-synonymous mutations started accumulating following the 42nd day onward of persistent infection. In similar other case reports, researchers have demonstrated the emergence of novel multi-mutational viral variants in immunosuppressed patients with chronic viral infection [41,42,43]. It is noteworthy that the newly detected SARS-CoV-2 variants in the immunosuppressed patients showed a partial escape from vaccine-induced humoral immune response and thus could serve as the seeds for a new epidemic. It is also conceivable that the rate at which virus evolves in immunosuppressed COVID-19 patients is much higher than the community-driven rate of vial evolution [44]. Thus, immunocompromised individuals may serve as the secondary intermediate hosts responsible for an accelerated evolutionary jump, thereby posing an obstacle in the eradication of the COVID-19 pandemic.
The high number of spike mutations observed in the Omicron strain [45] again highlights the role of the intermediate host. Omicron’s origin from other human SARS-CoV-2 variants is questionable, as in a short time scale such a heavy number of mutations cannot be fixed by selection pressure. Regarding the origin of Omicron in humans, we consider as a most likely scenario that some chronically infected COVID-19 patient survived for a longer time period with the virus and provided thus a suitable host environment for the virus to mutate and increase adaptation. Phylogenetic analysis has suggested Omicron has evolved in parallel with other variants and diverged from other strains at least before mid-2020 [46,47]. This again indicates there should be a missing intermediate non-human animal host where the virus has accumulated mutations. A recent preprint suggests a possible mouse origin of Omicron [48].
By considering the current scenario of pandemic and transmission-mortality trade-off theory [49], we argue that evolutionary pressures do not make the virus more deadly, so we can expect the future variants of SARS-CoV-2 should be less deadly. Since the evolutionary pressure will favor over time more transmissible but less virulent strains, in the future, with fully vaccinated people, the pandemic will turn into a manageable infection such as common cold or flu. As there will always be the risk of a new virus causing a pandemic, better wildlife protection and restricted zoonotic disease exposure including pangolins will be critical in minimizing the risk of future zoonotic infection transmission.
The emergence of various variants of SARS-CoV-2 in recent times suggests that the virus population has a strong potential to evolve further in future. There is a possibility that genetic material of SARS-CoV-2 can recombine with genetic material of other coronavirus in animal hosts. Individuals with SARS-CoV-2 infections who come in close contact with coronavirus-infected animals can act as intermediate hosts for the development of recombinant viruses. Similarly, animals with coronavirus infections on meeting with SARS-CoV-2 infected individuals can act as intermediate hosts where recombinant strains can evolve for further species jump. If the recombinants are lethal, these can give rise to a new epidemic or pandemic. There is evidence that SARS-CoV-2 strains are already recombining with other SARS-CoV-2 strains [50,51]. With diversification of virulence evolution, the risk of multiple infections may also increase. In addition, the question is whether the first-generation vaccines for COVID-19 would be effective for the upcoming variants. There are many theories in this regard, and studies are still underway before anything can be said with confidence. Thus, it is very important to monitor closely the evolutionary trajectory or the evolutionary dynamics of SARS-CoV-2. Additionally, we have sequence datasets now available based on SARS-CoV-2 genomes [23] that can be used to determine if any specific sites in the viral genomic sequence are more prone to adaptive evolution. All these studies would help us to track the spread of the virus and thus efficiently manage public health strategies and therapeutic interventions for its control.

6. Conclusions

Since the first reports of SARS-CoV-2 virus infection came from Wuhan, there has been a considerable debate on its zoonotic origin and path of transmission among the human population. Through this article, we offer our perspective and an extension to the current understanding on the intermediate host for SARS-CoV-2. To this end, we performed sequence comparative analysis of the host ACE2-interacting residues of the RBD of spike glycoprotein in SARS-CoV-2 isolates from bats compared to the respective residues from pangolin isolated CoVs collected from Guangdong province and Guangxi autonomous regions of South China. Our analysis supports that Guangdong Pangolins are the intermediate hosts, which also provides the evidence that RBD of GD Pangolin-CoV has a stronger affinity for the ACE2 receptor as compared to the SARS-CoV-2. Further surveillance of GD Pangolin-CoV at a larger scale could help us to better understand their path of transmission in more detail. Additionally, it is very important to limit human exposure to wildlife to reduce risk of CoV transmission from animals to humans.

Author Contributions

Initial draft, S.K.G., R.M., P.J.T. and M.S.; figures, S.K.G. and M.S.; final draft, S.K.G., R.M. and T.D.; revision of the draft, S.K.G., R.M., M.S. and T.D. All authors have read and agreed to the published version of the manuscript.

Funding

The authors gratefully acknowledge funding by the Land Bavaria (contribution to DFG Project 324392634-TRR 221/INF).

Institutional Review Board Statement

Not applicable, pure in silico study.

Informed Consent Statement

Not applicable, pure in silico study.

Data Availability Statement

All data are contained in this manuscript, its references and accession numbers mentioned and hence freely available.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ou, X.; Liu, Y.; Lei, X.; Li, P.; Mi, D.; Ren, L.; Guo, L.; Guo, R.; Chen, T.; Hu, J.; et al. Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV. Nat. Commun. 2020, 11, 1620. [Google Scholar] [CrossRef] [PubMed]
  2. Mittal, A.; Manjunath, K.; Ranjan, R.K.; Kaushik, S.; Kumar, S.; Verma, V. COVID-19 pandemic: Insights into structure, function, and hACE2 receptor recognition by SARS-CoV-2. PLoS Pathog. 2020, 16, e1008762. [Google Scholar] [CrossRef] [PubMed]
  3. Zhou, P.; Yang, X.L.; Wang, X.G.; Hu, B.; Zhang, L.; Zhang, W.; Si, H.R.; Zhu, Y.; Li, B.; Huang, C.L.; et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature 2020, 579, 270–273. [Google Scholar] [CrossRef]
  4. Zhang, S.; Qiao, S.; Yu, J.; Zeng, J.; Shan, S.; Tian, L.; Lan, J.; Zhang, L.; Wang, X. Bat and pangolin coronavirus spike glycoprotein structures provide insights into SARS-CoV-2 evolution. Nat. Commun. 2021, 12, 1607. [Google Scholar] [CrossRef] [PubMed]
  5. Wrapp, D.; Wang, N.; Corbett, K.S.; Goldsmith, J.A.; Hsieh, C.L.; Abiona, O.; Graham, B.S.; McLellan, J.S. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science 2020, 367, 1260–1263. [Google Scholar] [CrossRef]
  6. Cui, J.; Li, F.; Shi, Z.L. Origin and evolution of pathogenic coronaviruses. Nat. Rev. Microbiol. 2019, 17, 181–192. [Google Scholar] [CrossRef]
  7. Friend, T.; Stebbing, J. What is the intermediate host species of SARS-CoV-2? Future Virol. 2021, 16, 153–156. [Google Scholar] [CrossRef]
  8. Wang, L.F.; Shi, Z.; Zhang, S.; Field, H.; Daszak, P.; Eaton, B.T. Review of bats and SARS. Emerg. Infect. Dis. 2006, 12, 1834–1840. [Google Scholar] [CrossRef]
  9. Zhao, J.; Cui, W.; Tian, B.P. The Potential Intermediate Hosts for SARS-CoV-2. Front. Microbiol. 2020, 11, 580137. [Google Scholar] [CrossRef] [PubMed]
  10. Lam, T.T.; Jia, N.; Zhang, Y.W.; Shum, M.H.; Jiang, J.F.; Zhu, H.C.; Tong, Y.G.; Shi, Y.X.; Ni, X.B.; Liao, Y.S.; et al. Identifying SARS-CoV-2-related coronaviruses in Malayan pangolins. Nature 2020, 583, 282–285. [Google Scholar] [CrossRef] [PubMed]
  11. Xiao, K.; Zhai, J.; Feng, Y.; Zhou, N.; Zhang, X.; Zou, J.J.; Li, N.; Guo, Y.; Li, X.; Shen, X.; et al. Isolation of SARS-CoV-2-related coronavirus from Malayan pangolins. Nature 2020, 583, 286–289. [Google Scholar] [CrossRef]
  12. Wang, S.; Sundaram, J.P.; Spiro, D. VIGOR, an annotation program for small viral genomes. BMC Bioinform. 2010, 11, 451. [Google Scholar] [CrossRef] [PubMed]
  13. Katoh, K.; Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef]
  14. Wang, Q.; Zhang, Y.; Wu, L.; Niu, S.; Song, C.; Zhang, Z.; Lu, G.; Qiao, C.; Hu, Y.; Yuen, K.Y.; et al. Structural and Functional Basis of SARS-CoV-2 Entry by Using Human ACE2. Cell 2020, 181, 894–904.e9. [Google Scholar] [CrossRef] [PubMed]
  15. Shang, J.; Ye, G.; Shi, K.; Wan, Y.; Luo, C.; Aihara, H.; Geng, Q.; Auerbach, A.; Li, F. Structural basis of receptor recognition by SARS-CoV-2. Nature 2020, 581, 221–224. [Google Scholar] [CrossRef] [PubMed]
  16. Yang, Y.; Zhang, Y.; Qu, Y.; Zhang, C.; Liu, X.W.; Zhao, M.; Mu, Y.; Li, W. Key residues of the receptor binding domain in the spike protein of SARS-CoV-2 mediating the interactions with ACE2: A molecular dynamics study. Nanoscale 2021, 13, 9364–9370. [Google Scholar] [CrossRef]
  17. Starr, T.N.; Greaney, A.J.; Hilton, S.K.; Ellis, D.; Crawford, K.H.D.; Dingens, A.S.; Navarro, M.J.; Bowen, J.E.; Tortorici, M.A.; Walls, A.C.; et al. Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding. Cell 2020, 182, 1295–1310.e20. [Google Scholar] [CrossRef]
  18. Wan, Y.; Shang, J.; Graham, R.; Baric, R.S.; Li, F. Receptor Recognition by the Novel Coronavirus from Wuhan: An Analysis Based on Decade-Long Structural Studies of SARS Coronavirus. J. Virol. 2020, 94, e00127-20. [Google Scholar] [CrossRef]
  19. Liu, P.; Jiang, J.Z.; Wan, X.F.; Hua, Y.; Li, L.; Zhou, J.; Wang, X.; Hou, F.; Chen, J.; Zou, J.; et al. Are pangolins the intermediate host of the 2019 novel coronavirus (SARS-CoV-2)? PLoS Pathog. 2020, 16, e1008421. [Google Scholar] [CrossRef] [PubMed]
  20. Andersen, K.G.; Rambaut, A.; Lipkin, W.I.; Holmes, E.C.; Garry, R.F. The proximal origin of SARS-CoV-2. Nat. Med. 2020, 26, 450–452. [Google Scholar] [CrossRef]
  21. Niu, S.; Wang, J.; Bai, B.; Wu, L.; Zheng, A.; Chen, Q.; Du, P.; Han, P.; Zhang, Y.; Jia, Y.; et al. Molecular basis of cross-species ACE2 interactions with SARS-CoV-2-like viruses of pangolin origin. EMBO J. 2021, 40, e107786. [Google Scholar] [CrossRef]
  22. Nie, J.; Li, Q.; Zhang, L.; Cao, Y.; Zhang, Y.; Li, T.; Wu, J.; Liu, S.; Zhang, M.; Zhao, C.; et al. Functional comparison of SARS-CoV-2 with closely related pangolin and bat coronaviruses. Cell Discov. 2021, 7, 21. [Google Scholar] [CrossRef]
  23. Khare, S.; Gurry, C.; Freitas, L.; Schultz, M.B.; Bach, G.; Diallo, A.; Akite, N.; Ho, J.; Lee, R.T.; Yeo, W.; et al. GISAID’s Role in Pandemic Response. China CDC Wkly. 2021, 3, 1049–1051. [Google Scholar] [CrossRef]
  24. Temmam, S.; Vongphayloth, K.; Baquero, E.; Munier, S.; Bonomi, M.; Regnault, B.; Douangboubpha, B.; Karami, Y.; Chrétien, D.; Sanamxay, D.; et al. Bat coronaviruses related to SARS-CoV-2 and infectious for human cells. Nature 2022, 604, 330–336. [Google Scholar] [CrossRef]
  25. Mallapaty, S. Closest known relatives of virus behind COVID-19 found in Laos. Nature 2021, 597, 603. [Google Scholar] [CrossRef]
  26. Greaney, A.J.; Starr, T.N.; Barnes, C.O.; Weisblum, Y.; Schmidt, F.; Caskey, M.; Gaebler, C.; Cho, A.; Agudelo, M.; Finkin, S.; et al. Mapping mutations to the SARS-CoV-2 RBD that escape binding by different classes of antibodies. Nat. Commun. 2021, 12, 4196. [Google Scholar] [CrossRef]
  27. Cheng, M.H.; Krieger, J.M.; Banerjee, A.; Xiang, Y.; Kaynak, B.; Shi, Y.; Arditi, M.; Bahar, I. Impact of new variants on SARS-CoV-2 infectivity and neutralization: A molecular assessment of the alterations in the spike-host protein interactions. iScience 2022, 25, 103939. [Google Scholar] [CrossRef]
  28. Li, X.; Giorgi, E.E.; Marichannegowda, M.H.; Foley, B.; Xiao, C.; Kong, X.P.; Chen, Y.; Gnanakaran, S.; Korber, B.; Gao, F. Emergence of SARS-CoV-2 through recombination and strong purifying selection. Sci. Adv. 2020, 6, eabb9153. [Google Scholar] [CrossRef]
  29. Patino-Galindo, J.A.; Filip, I.; Rabadan, R. Global Patterns of Recombination across Human Viruses. Mol. Biol. Evol. 2021, 38, 2520–2531. [Google Scholar] [CrossRef]
  30. Perez-Losada, M.; Arenas, M.; Galan, J.C.; Palero, F.; Gonzalez-Candelas, F. Recombination in viruses: Mechanisms, methods of study, and evolutionary consequences. Infect. Genet. Evol. 2015, 30, 296–307. [Google Scholar] [CrossRef]
  31. Zhu, Z.; Meng, K.; Meng, G. Genomic recombination events may reveal the evolution of coronavirus and the origin of SARS-CoV-2. Sci. Rep. 2020, 10, 21617. [Google Scholar] [CrossRef] [PubMed]
  32. Makino, S.; Keck, J.G.; Stohlman, S.A.; Lai, M.M. High-frequency RNA recombination of murine coronaviruses. J. Virol. 1986, 57, 729–737. [Google Scholar] [CrossRef] [PubMed]
  33. Makarenkov, V.; Mazoure, B.; Rabusseau, G.; Legendre, P. Horizontal gene transfer and recombination analysis of SARS-CoV-2 genes helps discover its close relatives and shed light on its origin. BMC Ecol. Evol. 2021, 21, 5. [Google Scholar] [CrossRef]
  34. V’Kovski, P.; Kratzel, A.; Steiner, S.; Stalder, H.; Thiel, V. Coronavirus biology and replication: Implications for SARS-CoV-2. Nat. Rev. Microbiol. 2021, 19, 155–170. [Google Scholar] [CrossRef]
  35. Alizon, S.; Hurford, A.; Mideo, N.; Van Baalen, M. Virulence evolution and the trade-off hypothesis: History, current state of affairs and the future. J. Evol. Biol. 2009, 22, 245–259. [Google Scholar] [CrossRef]
  36. He, X.; Lau, E.H.Y.; Wu, P.; Deng, X.; Wang, J.; Hao, X.; Lau, Y.C.; Wong, J.Y.; Guan, Y.; Tan, X.; et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat. Med. 2020, 26, 672–675. [Google Scholar] [CrossRef]
  37. Oran, D.P.; Topol, E.J. Prevalence of Asymptomatic SARS-CoV-2 Infection: A Narrative Review. Ann. Intern. Med. 2020, 173, 362–367. [Google Scholar] [CrossRef]
  38. Zhou, H.; Chen, X.; Hu, T.; Li, J.; Song, H.; Liu, Y.; Wang, P.; Liu, D.; Yang, J.; Holmes, E.C.; et al. A Novel Bat Coronavirus Closely Related to SARS-CoV-2 Contains Natural Insertions at the S1/S2 Cleavage Site of the Spike Protein. Curr. Biol. 2020, 30, 2196–2203.e3. [Google Scholar] [CrossRef]
  39. Boni, M.F.; Lemey, P.; Jiang, X.; Lam, T.T.; Perry, B.W.; Castoe, T.A.; Rambaut, A.; Robertson, D.L. Evolutionary origins of the SARS-CoV-2 sarbecovirus lineage responsible for the COVID-19 pandemic. Nat. Microbiol. 2020, 5, 1408–1417. [Google Scholar] [CrossRef]
  40. Weigang, S.; Fuchs, J.; Zimmer, G.; Schnepf, D.; Kern, L.; Beer, J.; Luxenburger, H.; Ankerhold, J.; Falcone, V.; Kemming, J.; et al. Within-host evolution of SARS-CoV-2 in an immunosuppressed COVID-19 patient as a source of immune escape variants. Nat. Commun. 2021, 12, 6405. [Google Scholar] [CrossRef]
  41. Choi, B.; Choudhary, M.C.; Regan, J.; Sparks, J.A.; Padera, R.F.; Qiu, X.; Solomon, I.H.; Kuo, H.H.; Boucau, J.; Bowman, K.; et al. Persistence and Evolution of SARS-CoV-2 in an Immunocompromised Host. N. Engl. J. Med. 2020, 383, 2291–2293. [Google Scholar] [CrossRef]
  42. Corey, L.; Beyrer, C.; Cohen, M.S.; Michael, N.L.; Bedford, T.; Rolland, M. SARS-CoV-2 Variants in Patients with Immunosuppression. N. Engl. J. Med. 2021, 385, 562–566. [Google Scholar] [CrossRef]
  43. Kemp, S.A.; Collier, D.A.; Datir, R.P.; Ferreira, I.; Gayed, S.; Jahun, A.; Hosmillo, M.; Rees-Spear, C.; Mlcochova, P.; Lumb, I.U.; et al. SARS-CoV-2 evolution during treatment of chronic infection. Nature 2021, 592, 277–282. [Google Scholar] [CrossRef]
  44. Choudhary, M.C.; Crain, C.R.; Qiu, X.; Hanage, W.; Li, J.Z. Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) Sequence Characteristics of Coronavirus Disease 2019 (COVID-19) Persistence and Reinfection. Clin. Infect. Dis. 2022, 74, 237–245. [Google Scholar] [CrossRef]
  45. Jung, C.; Kmiec, D.; Koepke, L.; Zech, F.; Jacob, T.; Sparrer, K.M.J.; Kirchhoff, F. Omicron: What Makes the Latest SARS-CoV-2 Variant of Concern So Concerning? J. Virol. 2022, 96, e0207721. [Google Scholar] [CrossRef]
  46. Callaway, E. Heavily mutated Omicron variant puts scientists on alert. Nature 2021, 600, 21. [Google Scholar] [CrossRef]
  47. Kupferschmidt, K. Where did ‘weird’ Omicron come from? Science 2021, 374, 1179. [Google Scholar] [CrossRef]
  48. Wei, C.; Shan, K.J.; Wang, W.; Zhang, S.; Huan, Q.; Qian, W. Evidence for a mouse origin of the SARS-CoV-2 Omicron variant. J. Genet. Genom. 2021, 48, 1111–1121. [Google Scholar] [CrossRef]
  49. Bull, J.J.; Lauring, A.S. Theory and empiricism in virulence evolution. PLoS Pathog. 2014, 10, e1004387. [Google Scholar] [CrossRef]
  50. Lytras, S.; Hughes, J.; Martin, D.; Swanepoel, P.; de Klerk, A.; Lourens, R.; Kosakovsky Pond, S.L.; Xia, W.; Jiang, X.; Robertson, D.L. Exploring the Natural Origins of SARS-CoV-2 in the Light of Recombination. Genome Biol. Evol. 2022, 14, evac018. [Google Scholar] [CrossRef]
  51. Pollett, S.; Conte, M.A.; Sanborn, M.; Jarman, R.G.; Lidl, G.M.; Modjarrad, K.; Maljkovic Berry, I. A comparative recombination analysis of human coronaviruses and implications for the SARS-CoV-2 pandemic. Sci. Rep. 2021, 11, 17365. [Google Scholar] [CrossRef]
Figure 1. Analysis of the RBD sequence. (a) Multiple sequence alignment showing RBD of S protein from Pangolin CoV (GISAID accession ID: EPI_ISL_410721), SARS-CoV-2_WIV02 (EPI_ISL_402127), and Bat-CoVs (Bat-CoV_RaTG13, Bat-SARS-CoV_ZC45 and Bat-SARS-CoV_ZXC21). Conserved similar amino acids are colored in the alignment. The five critical residues for human ACE2 binding (as mentioned in [11] with respect to SARS-CoV-2_WIV02 coordinates) are indicated by arrowheads below the alignment. Sequence gaps are indicated with dashes. (b) Central part of the alignment from (a) showing the critical residues. (c) Part of alignment from Xiao et al. [11] Figure 7b. This shows only the mentioned critical residues. Identical residues are indicated with a dot.
Figure 1. Analysis of the RBD sequence. (a) Multiple sequence alignment showing RBD of S protein from Pangolin CoV (GISAID accession ID: EPI_ISL_410721), SARS-CoV-2_WIV02 (EPI_ISL_402127), and Bat-CoVs (Bat-CoV_RaTG13, Bat-SARS-CoV_ZC45 and Bat-SARS-CoV_ZXC21). Conserved similar amino acids are colored in the alignment. The five critical residues for human ACE2 binding (as mentioned in [11] with respect to SARS-CoV-2_WIV02 coordinates) are indicated by arrowheads below the alignment. Sequence gaps are indicated with dashes. (b) Central part of the alignment from (a) showing the critical residues. (c) Part of alignment from Xiao et al. [11] Figure 7b. This shows only the mentioned critical residues. Identical residues are indicated with a dot.
Ijms 23 09115 g001
Figure 2. Detailed comparison of ACE2 binding RBD residues of Bat-CoV_RaTG13, Pangolin CoVs and SARS-CoV-2. The level of conservation is visualized using seqLogo, the letter height is proportional to the level of conservation of each residue in the sequence and the color code indicates similar physicochemical attributes. Residues important for ACE2 binding were obtained from [14,15,18] and mapped to the Wuhan-Hu-1 referenced SARS-CoV-2 S protein sequence (GISAID accession ID: EPI_ISL_402125). To avoid visual misinterpretation, in each block equal number of sequences from the species in comparison were taken. In the seqLogo graphic wherever two amino acids are aligned, SARS-CoV-2 sequence conservation is shown at lower position. Required mutations in bat and pangolin RBD for human adaptation are shown with arrows (Normal arrow: semi-conservative substitution; Dashed arrow: non-conservative substitution and Bold arrow: conservative substitution). Critical residue described in Xiao et al. [11] is marked in red color at the bottom of the figure.
Figure 2. Detailed comparison of ACE2 binding RBD residues of Bat-CoV_RaTG13, Pangolin CoVs and SARS-CoV-2. The level of conservation is visualized using seqLogo, the letter height is proportional to the level of conservation of each residue in the sequence and the color code indicates similar physicochemical attributes. Residues important for ACE2 binding were obtained from [14,15,18] and mapped to the Wuhan-Hu-1 referenced SARS-CoV-2 S protein sequence (GISAID accession ID: EPI_ISL_402125). To avoid visual misinterpretation, in each block equal number of sequences from the species in comparison were taken. In the seqLogo graphic wherever two amino acids are aligned, SARS-CoV-2 sequence conservation is shown at lower position. Required mutations in bat and pangolin RBD for human adaptation are shown with arrows (Normal arrow: semi-conservative substitution; Dashed arrow: non-conservative substitution and Bold arrow: conservative substitution). Critical residue described in Xiao et al. [11] is marked in red color at the bottom of the figure.
Ijms 23 09115 g002
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Gupta, S.K.; Minocha, R.; Thapa, P.J.; Srivastava, M.; Dandekar, T. Role of the Pangolin in Origin of SARS-CoV-2: An Evolutionary Perspective. Int. J. Mol. Sci. 2022, 23, 9115. https://doi.org/10.3390/ijms23169115

AMA Style

Gupta SK, Minocha R, Thapa PJ, Srivastava M, Dandekar T. Role of the Pangolin in Origin of SARS-CoV-2: An Evolutionary Perspective. International Journal of Molecular Sciences. 2022; 23(16):9115. https://doi.org/10.3390/ijms23169115

Chicago/Turabian Style

Gupta, Shishir K., Rashmi Minocha, Prithivi Jung Thapa, Mugdha Srivastava, and Thomas Dandekar. 2022. "Role of the Pangolin in Origin of SARS-CoV-2: An Evolutionary Perspective" International Journal of Molecular Sciences 23, no. 16: 9115. https://doi.org/10.3390/ijms23169115

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop