Next Article in Journal
Gnas Promoter Hypermethylation in the Basolateral Amygdala Regulates Reconsolidation of Morphine Reward Memory in Rats
Previous Article in Journal
Expression of HK2, PKM2, and PFKM Is Associated with Metastasis and Late Disease Onset in Breast Cancer Patients
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Coronavirus Genomes and Unique Mutations in Structural and Non-Structural Proteins in Pakistani SARS-CoV-2 Delta Variants during the Fourth Wave of the Pandemic

by
Muhammad Zeeshan Anwar
,
Madeeha Shahzad Lodhi
*,
Muhammad Tahir Khan
,
Malik Ihsanullah Khan
and
Sumaira Sharif
Institute of Molecular Biology and Biotechnology (IMBB), The University of Lahore, 1 KM Defence Road, Lahore 58 810, Pakistan
*
Author to whom correspondence should be addressed.
Genes 2022, 13(3), 552; https://doi.org/10.3390/genes13030552
Submission received: 9 February 2022 / Revised: 10 March 2022 / Accepted: 18 March 2022 / Published: 21 March 2022
(This article belongs to the Section Molecular Genetics and Genomics)

Abstract

:
Genomic epidemiology of SARS-CoV-2 is imperative to explore the transmission, evolution, and also pathogenicity of viruses. The emergence of SARS-CoV-2 variants of concern posed a severe threat to the global public health efforts. To assess the potential consequence of these emerging variants on public health, continuous molecular epidemiology is of vital importance. The current study has been designed to investigate the major SARS-CoV-2 variants and emerging mutations in virus structural and non-structural proteins (NSP) during the fourth wave in September 2021 from the Punjab province of Pakistan. Twenty SARS-CoV-2 positive samples have been collected from major cities were subjected to next-generation sequencing. Among the 20 whole genomes (GenBank Accession SRR16294858-SRR16294877), 2 samples failed to be completely sequenced. These genome sequences harbored 207 non-synonymous mutations, among which 19 were unique to GISAID. The genome sequences were detected: Delta 21I, 21J variants (B.1.617.2). Mutation’s spike_F157del, spike_P681R, spike_T478K, spike_T19R, spike_L452R, spike_D614G, spike_G142D, spike_E156G, and spike_R158del have been detected in all samples where K1086Q, E554K, and C1250W were unique in spike protein. These genomic sequences also harbored 129 non-synonymous mutations in NSP. The most common were NSP3_P1469S (N = 17), NSP3_A488S (N = 17), NSP3_P1228L (N = 17), NSP4_V167L (N = 17), NSP4_T492I (N = 17), NSP6_T77A (N = 17), NSP14_A394V (N = 17), NSP12_G671S (N = 18), and NSP13_P77L (N = 18). The mutation, F313Y in NSP12, detected in the current study, was found in a single isolate from Belgium. Numerous other unique mutations have been detected in the virus papain-like protease (NSP3), main protease (NSP5), and RNA-dependent RNA polymerase (NSP12). The most common non-synonymous mutations in the spike protein were subjected to stability analysis, exhibiting a stabilizing effect on structures. The presence of Delta variants may affect therapeutic efforts and vaccine efficacy. Continuous genomic epidemiology of SARS-CoV-2 in Pakistan may be useful for better management of SARS-CoV-2 infections.

Graphical Abstract

1. Background

The deadly SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus-2) posed a major effect on public health, disrupting the global healthcare system. SARS-CoV-2 has the ability to exhibit a high mutation frequency due to the presence of single-stranded RNA, posing major issues to public health. Molecular epidemiology is required to study the virus evolutionary stages in specific geographic locations and also to establish how the virus may affect the vaccine efficacy and drug response.
The virus genome encodes 4 structural and 16 non-structural proteins (NSP) [1]. Variations have been reported, occurring very quickly, displacing the virus ancestral strains [2,3,4,5]. At population level, the virus rapid rates of transmission offered an advantage to its replication. The D614G mutation in spike (S) was one of the first discovered, improves viral infectivity, and shifts the spike (S) protein conformation toward binding a fusion-competent state [2,6,7]. Inside the virus S protein, all the variants of concern (VOC) harbored numerous mutations, in the receptor-binding domain (RBD) and in the N-terminus. These VOCs include α (B.1.1.7) (first reported in UK), β (B.1.351) (South Africa), and γ (P.1) in Brazil [4,5,6,7,8]. The Delta variant (B.1.617.2) in India, harbored numerous mutations in RBD SARS-CoV-2 isolates in October 2020 [9]. The latest variant, Omicron (B.1.1.529), was first reported from South Africa on 24 November 2021.
Among the several highly contagious VOCs, a large number of mutations have been detected. The B.1.1.7 lineage has been thought to be 40–80% more contagious [10,11]. However, as compared to P.1 and B.1.351, Delta is exhibiting more transmissible potency than the earlier ones. The VOCs have greater potential in terms of pathogenicity, virulence, and transmission rate, and also exhibit lower antibody neutralization sensitivity [10,12]. Multiple mutations in the S protein and other critical genomic regions of SARS-CoV-2 are resulting in reduced vaccinal response and treatment efficacy. The E484K mutation, present in the receptor-binding ridge, has been detected in S protein in multiple lineages. This mutation has been shown to be decreasing the virus binding to polyclonal sera [13] and protects the virus against treatment with the monoclonal antibodies [14] (p. 19). The P681H mutations have been found in the B.1.1.7, B.1.1.318, and P.3, whereas mutation P681R has been found in the A.23.1 lineages and all B.1.617 variants. The P681H and P681R improve S protein fusion to the host cell [15,16]. The P681H and D614G mutations have been thought to be responsible for the B.1.1.7 increased transmissibility [17].
Among the different SARS-CoV-2 variants, B.1.617 lineage was discovered in India [18,19,20]. This lineage includes three primary subtypes (B1.617.1, B.1.617.2, and B.1.617.3), each of which has mutations in the S protein’s N-terminal domain (NTD) and receptor-binding domain (RBD), possesses the ability to boost the immune evasion capability. The Delta variant is thought to spread more quickly than other variants. The B.1.617.1 is characterized by mutations L452R, P681R, and E484Q in S, whereas the Delta variant is characterized by the presence of mutations L452R, P681R, and T478K in S. The L452R and T478K enhance S protein’s binding affinity with human angiotensin-converting enzyme 2 (ACE2) receptor [21,22].
To ensure a better public health policy, it is critical to continuously monitor and identify the rapidly evolving variations in circulating isolates of SARS-CoV-2 in populations. The current study was designed to investigate the major circulating VOC and also the emerging mutations in structural proteins and NSP among the SARS-COV-2 patients from major cities in the Punjab province of Pakistan. Twenty SARS-CoV-2 positive samples have been collected and whole-genome sequencing was performed through Ion Torrent next-generation technology. All the samples harbored numerous mutations in NSP and structural proteins, including S protein.

2. Materials and Methods

2.1. Ethical Approval

The ethical clearance for the current study was obtained from the Ethical Review Committee for Medical and Biomedical Research at The University of Lahore (IMBB/UOL/21/1379).

2.2. Area of Sample Collection

The nasopharyngeal swab method was used to collect samples from COVID-19 infected individuals, as recommended by the World Health Organization (WHO) [23]. All the samples were collected for whole genome sequencing during the fourth wave in September 2021 from major cities in Punjab province, including Lahore, Sialkot, Okara, and Pak Pattan.

2.3. Processing of Samples

The proper specimen collection procedure under laboratory testing for coronavirus disease (COVID-19) interim guidance by WHO was followed for SARS-CoV-2 samples collection and processing. The nasopharyngeal swabs were collected in 2 mL of vial transport medium (LinkGen, Taizhou, Jiangsu, China) and placed at 4 °C for further use. All these samples were confirmed with TaqPath™ COVID-19 CE-IVD RT-PCR Kit (Thermo Fisher, Waltham, MA, USA).

2.4. RNA Extraction, Quantification and cDNA SYNTHESIS

RNA extraction was performed from nasopharyngeal swabs using a standard volume of 200 ul by MagMAX™ Viral/Pathogen Nucleic Acid Isolation Kit (Thermo Fisher, Waltham, MA, USA) and (Applied Biosystem, Waltham, MA, USA). The extracted RNA was stored at −80 °C for downstream application. The real-time quantification and copy number determination of nucleic acid was performed using TaqPath™ 1-Step RT-qPCR Master Mix (Thermo Fisher, Waltham, MA, USA) and the relevant copy number was determined according to the CT and control reactions in quantification. Only samples with relevant CT of 18–28 values were chosen. Using a minimum input amount of 10 ng/uL of the RNA sample, RNA to cDNA reversed transcription was performed with the SuperScript™ VILO™ cDNA Synthesis Kit (Thermo Fisher, Waltham, MA, USA).

2.5. Library Preparation

The library was prepared manually using Ion AmpliSeq™ Library Kit (Thermo Fisher, Waltham, MA, USA). Amplification was performed with a 2-pool Ion AmpliSeq™ SARS-CoV-2 Research Panel (Thermo Fisher, Waltham, MA, USA). The library was partially digested and ligated with Ion Xpress™ Barcode Adapters 1–96 Kit (Thermo Fisher, Waltham, MA, USA). Library purification and quantification steps were carried out with GenDx-AMPure® XP beads (GENDX, Utrecht, The Netherlands) and TaqMan™ Quantitation Kit (Thermo Fisher, Waltham, MA, USA). The libraries were diluted to a final concentration of 40 pM, loaded on Thermo Fisher Scientific Ion Chef™ Instrument for PCR, and loaded on Ion S5 530 and 510 (Thermo Fisher, Waltham, MA, USA).

2.6. Whole Genome Sequencing and Data Analysis

Sequencing was accomplished on Ion GeneStudio™ S5 System at LabGenetix, Lahore, Pakistan. The fastQ base sequence file quality was assessed using the FastQC (v0.11.8). Trimmomatic tool (v0.39) was used to remove the low-quality reads (Q < 30) and index adapter sequences were utilized to improve sample multiplexing. Sequenced reads were aligned with the reference (NC 045512, using Burrows Wheeler Aligner (BWA, v0.6). The duplicated PCR reads were removed with Picard Tools (v2.21.6). Mapping problems due to the presence of small Indels were removed with a Genome Analysis Toolkit, “RealignerTargetCreator” and “InDelRealigner” (GATK v. 3.3.0) to analyze the mapped read. To improve the accuracy of variant calling, GATK tool “HaplotypeCaller” was applied for realignment of sequenced reads via local de-novo assembly of haplotypes in the regions showing variation.
All the 20 whole genome sequences in fasta format were aligned with reference (NC_045512) using CoVsurver application (https://www.gisaid.org/epiflu-applications/covsurver-mutations-app/ (accessed on 28 September 2021)) on 15–20 September 2021, collected during the fourth wave of infection. The CoVsurver research tool has been developed with GISAID (Global initiative on sharing all influenza data), aiding researchers to identify and interpret the amino acids (aa) changes in coronavirus genomes. The server rapidly aligns the query genome sequences in fasta format with reference SARS-CoV-2 and screen coronavirus genomes for aa changes to identify any special epidemiological relevance. All mutations in structural proteins of SARS-CoV-2 were separated and arranged in the form of excel sheets. The statistical analysis was performed using EpiData Analysis [24] to analyze various non-synonymous mutations in virus structural and NSP.

2.7. Mutations Effect on Virus Structural Proteins

Some of the most common mutations were analyzed for their thermodynamic effect on S protein through DynaMut server [25]. The server implements mutation effect, which can be used to analyze the protein stability and structural flexibility upon point mutation. The server also measures the vibrational entropy changes and the impact of a mutation with graph-based signatures (p-value < 0.001) along with a good resolution of results. To compute the SARS-CoV-2 wild type and mutant protein stability and flexibility, the PDB file of proteins were retrieved from the Protein Data Bank [26] and uploaded to the DynaMut server and a point mutation was inserted at specific site. The impact in the form of total energies and vibrational entropy energies between wild type and mutants was recorded. The high-resolution structures of wilt type and mutant S proteins were retrieved for further processing.
In DynaMut, changes upon point mutation on free energy of protein folding, combining the impacts of mutation stability of protein and also the dynamic properties were computed by ENCoM, Bio3D, and DUET computational servers, generating a more robust predictor of energies.

2.8. Phylogenetic Analysis

Genomic sequences in the current study were subjected to MAFT server [27] and Nextstrain [28] for phylogenetic analysis. These are public servers, containing a pipeline for analysis, and visualization, presenting a real-time view into the evolution of viral pathogens.

3. Results

3.1. SARS-CoV-2 Patient Information

Twenty nasopharyngeal swab samples were collected during the fourth wave of the pandemic in September 2021 from SARS-CoV-2 patients (Table 1). Among these samples, 13 were collected from male suspect and seven from female. All the SARS-CoV-2 patients developed clear signs and symptoms, including fever, cough, headache, fatigue, and also significant loss of smell and taste. Eleven patients were found in age category 1 (20–40), seven were in category 2 (41–60) and two were 65 years old.

3.2. Whole Genome Sequences

Among the 20 samples (GenBank Accession No. SRR16294858-SRR16294877), samples 7 and 10 (Table 2) (Accession No. SRR16294864 and SRR162948667) were not completely sequenced. The genomic sequences were submitted to NCBI BioProject PRJNA770504 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA770504 (accessed on 28 September 2021)). The average length of sequenced genomes was 29,586 nucleotides (9691 amino acids). The sequence information, including length and clade, is shown in Table 2.

3.3. Unique Mutations in Structural Proteins

All the genomic sequences harbored 207 different non-synonymous mutations, including indel, among which 19 were unique to GISAID (S1). All the genomic samples were of GK clade. Mutations E554K (S1 domain of S), K1086Q, and C1250W (S2 domain of S) (Figure 1) were unique to S protein, and in GISAID, present in samples 1, 11, and 19. The remaining unique mutations in all samples are provided in the Supplementary File S1.

3.4. SARS-CoV-2 Variants

All the samples were detected as Delta variants. Mutation details in S and other structures are provided in Supplementary Files S1–S3. These sequences harbored diverse kinds of mutations in the N-terminal and RBD of S protein that may upsurge the immune evasion possibility of Delta variant.

3.5. Mutations in Spike and Other Structural Proteins

Among the 207 mutations, 26 were detected in S protein (Supplementary File S2). The most common non-synonymous and indel mutations, present in all complete genome samples were spike_T478K, spike_T19R, spike_L452R, spike_F157del, spike_E156G, spike_P681R, spike_D614G, spike_R158del, and spike_G142D (Table 3). Some of these mutations were present in receptor-binding domain (RBD) of S protein (Figure 1). Mutations, spike_D950N and spike_T95I were present in twelve samples each.
Similarly, nucleocapsid (N) also harbored nine different kinds of mutations (Supplementary File S2). Among these, the most common were N_D377Y, N_R203M, and N_D63G have been detected in all 18 samples (Table 3). The D63G was present in N-terminal domain (NTD) of N protein, which is also called RNA-binding domain (RBD) (Figure 1 and Figure 2). Two mutations, N_R203M and N_G215C, were present in SR-Linker region of N proteins and one D377Y in C-terminal domain (CTD). No mutation was detected in the envelope the protein where the membrane (M) protein harbored only two non-synonymous mutations (I82T (N = 18) and V70F (N = 1)) in the transmembrane domain.
Delta variant harbors L452R, P681R, and T478K in S protein. These mutations, along with others, present in all genomic samples, exhibited a stabilizing effect on the S protein structure (Figure 3 and Figure 4), and may demonstrate a good binding affinity towards human ACE2 SARS-CoV-2 receptor protein. Phylogenetically, all the isolates were 21I and 21J, subclades of Delta variants (Figure 5).
The spike_G142D mutant exhibited a little destabilizing affect (0.206 kcal/mole when compared with T478K (0.584 kcal/mol), E156G (0.049 kcal/mol), L452R (0.059 kcal/mol), P681R (0.503 kcal/mol), and T19R (0.403 kcal/mol).

3.6. Unique Mutations in Structural Proteins

The sequences harbored 129 mutations in NSP in which 13 were unique to GISAID (Table 4) (Supplementary File S3). The highest number of unique mutations were detected in NSP3 (N = 10), followed by NSP5 (V86L), NSP12 (F313Y), and NS3 (I118L). The remaining unique mutations in the samples are provided in Supplementary File S3.

3.7. Mutations in NSP

The mutation details in NSP are shown in Table 5. In SARS-CoV-2, NSP3 is a large multidomain protein of 1945 amino acids containing a papain-like protease (PLpro) domain (aa746-1060). PLpro, an essential component of the virus replication transcription complex, is highly conserved, present between unique and a nucleic acid-binding domains. Among the unique mutation, NS3_I118L was detected in 14 isolates (Table 4).
The highest frequency of mutations was detected in virus NSP3 (Table 2) in which P1469S (N = 17), A488S (N = 17), and P1228L (N = 17) were the most common. The majority of these mutations were detected in the C-terminal domain (CTD) of NSP3. Deletions with a single frequency in amino acid regions 1500 to 1533 were also observed at CTD. However, the PLpro domain (746–1060) seems highly conserved (Figure 6). Mutations with a single frequency at position L862F, P822L, and H920Y were detected in the PLpro domain of NSP3 (Figure 6D). NSP2, which modulates the host cell survival signaling pathway, also harbored six non-synonymous mutations with a very low frequency. NSP4 harbors four mutations in which V167L in NTD and T492I in CTD were present in 17 isolates.
NSP5 (main protease), harbored three non-synonymous mutations in a very low frequency. Four mutations were detected in NSP6 in which T77A was detected with the highest frequency (N = 17). Virus NSP12, which is also called RdRp (RNA-dependent RNA polymerase), plays an essential role in replication. Mutations NSP12_G671S (N = 18) and NSP12_P323L (N = 16) were the most common, present in the interface and RdRp domain (Figure 6).
A single mutation P77L in NSP13 was detected in all genomic isolates (N = 18). Forty-one mutations were detected in NSP14, including five non-synonymous mutations in which NSP14_A394V (N = 17) NSP14_M72I (N = 6) were the most common.

4. Discussion

The recent emergence of different SARS-CoV-2 variants posed severe health issues. The α is one of the identified variants that first emerged in the UK and became one of the predominant lineages worldwide. Pakistan reported its first case with α variant in December 2020 [29], triggered S protein, and rapidly spread, leading to the third wave in Pakistan [30]. Molecular epidemiology is essential for better management of viral diagnosis and treatment. However, limited genomic data are available on other genetic lineages from all provinces of Pakistan. Further, frequency of mutations in all target’s proteins, including NSP, has not been investigated properly. Surveillance of the population mainly depends on characterizing the differences between reported, emerging, and non-VOC lineages in order to design efficient diagnostic methods, proper therapeutics, and the designing of effective vaccines.
Among the different variants, the Delta variant of SARS-CoV-2 is more deadly and highly transmissible with severe disease signs and symptoms. This variant has been detected in 77 countries as of 24 June 2021 [31,32]. The UK has faced drastic effects from public health measures due to the Delta variant. To prevent the rapid transmission of the virus in the population, there is an urgent need to design more integrated molecular diagnostic systems with the implication for strict rules of quarantines for international travelers.
In the current study, all the genomic samples were detected as Delta variants when analyzed on GISAID CoV-Surver on 15 to 20 September 2021, collected during the fourth wave of infection. Mutations T478K, T19R, L452R, F157del, E156G, P681R, D614G, R158del, and G142D were the most common, present in all genomics isolates. The T478K mutation present in RBD of S protein is involved in interaction with human ACE2 [33,34]. The T478K is unique to the SARS-CoV2 Delta variant, present in the epitope region of potent neutralizing monoclonal antibodies [35]. This point mutation exhibited a structural stabilizing impact on S protein (Figure 3).
A previous study during the third wave from Pakistan reported that the Delta variant is prevalent in 45% of cases followed by β (46%) [36]. Very little information was provided on mutations in structural proteins while NSP mutations were not provided in detail. In the current study, comprehensive details of mutations in the virus, all targeting proteins, including accessory proteins, have been provided for better understanding of the variations in circulating isolates.
Among the structural proteins, the S protein harbored two mutations in the RBD region (L452R and T478K), four in the NTD (T19R, G142D, Δ156–157 and R158del), and one at the furin-cleavage site (P681R) and S2 region (D950N), detected in 12 virus genome sequences (Table 3). This cluster was observed in Indian isolates in October 2020 [37], exhibiting higher pathogenicity than other variants [38]. The T478K mutation at the RBD region of S protein, fell near the E484K that facilitates antibody escape [39]. The L452R is an antibody-escaping mutation and the virus with this mutant is resistant to convalescent plasma and monoclonal antibody therapy [40]. The furin-cleavage site plays an important role in viral pathogenesis [41] and mutation analysis revealed that L452R and P681R increase the ACE2 binding and transmissibility. Consistent with previous study [42] in which variants harbor different mutations, it exhibited increased binding between S protein RBD region and ACE2 receptor. Similar to this previous study, mutants T19R, E156G, L452R, T478K, and P681R in S protein (Figure 3 and Figure 4) exhibited a stabilizing effect on protein structure, facilitating the binding affinity for more stable interactions with human ACE2. This stability effect may increase the virus transmission in populations.
Mutations Arg158 and Phe-157/del were also detected in all genomic isolates (Table 3), present in NTD of S protein. In a more recent study, E156G, Arg158, and Phe-157/del were found, causing rigidity and reduced flexibility, thus providing fitness advantage and immune escape [43].
The Delta S protein P681R mutation has an important role in the variant replacement of the α-to-Delta. The Delta S P681R mutation present at the furin-cleavage site exhibited stabilizing effects (Figure 4), separating the S1 and S2 regions of S protein. Mechanistically, the P681R mutation of Delta improved the full-length cleavage of S1 and S2, facilitated the virus cell surface entry and increased infection 16. These mutations must be regularly monitored in ongoing pandemics for surveillance and virus severity [44].
Considering the circulation of the Delta virus with specific mutations and a high transmissible rate, the Pakistani national health authority needs to take timely measures against it. Further, molecular epidemiological studies with insight into genomic analysis of the virus are needed to screen the most prevalent variants and specific mutations that may affect the diagnosis and vaccine potency, to adopt potential measures in the future.
Emerging mutations in NSP may affect the virus transmission and pathogenicity. Recently, a synonymous mutation (F106F) in NSP3 along with other signature mutations exhibited a virus fitness effect [45]. In CoV-2 NSP3, the PLpro domain is a large protein and an essential member of replication transcription complex [46,47]. PLpro domain has a catalytic domain (Figure 1) for cleavage activity. Mutations in this domain may affect a catalytic process of PLpro. In the current study, three mutations were detected in this domain with many unique to GISAID. The effect of these mutations may be investigated in future studies for better designing of inhibitors against PLpro.
The NSP6_L37F mutation, which was very common in the earlier infections, were associated with asymptomatic cases. NSP6 reduces autophagic ability, important for viral infections and promotes cell death [48]. Mutation NSP6_T77A in the current study (Table 2) seems emerging in the current wave. However, its effect on virus severity is needed to be explored for better management of COVID-19.
Previously, mutation P323L in RdRp was the most common in Pakistani isolates [49]. This mutation is still present in the Delta variants (Table 2), which has a stabilizing effect, while interacting with viral RNA, which lies in the interface domain of RdRp. This domain is the antiviral (filibuvir, Simeprevir) binding site [43], which may show weak interaction if mutations emerge. Mutation G671S in NSP12 detected in the current study has been observed as emerging, present in all 18 isolates, increasing the stability of the protein [50]. However, further investigations are needed to see its effect on virus pathogenicity. A single mutation, NSP13_P77L, which is characteristic only in Delta variants, was detected in all genomic sequences in the current study, and may have a destabilizing effect [51]. NSP14_A394V is the most common mutation in NSP14 (Table 2), and has a role in one of the positive section sites of virus [52].
Vaccine efficacy may be reduced in previous VOCs; however, BNT162b2 retained its potency against β VOC [53]. Omicron harbored a number of known and unique mutation patterns (Supplementary Files) as compared to other VoCs; therefore, the vaccines which are effective against Omicron infections are not clear. Vaccines have been found to be effective against other VOCs. Observational investigation in Qatar and Kaiser Permanente [54,55] found more than 90% vaccine efficacy against the Delta-variant. Data from New York indicates a good efficacy for individuals 65 years and older, exhibiting different levels of efficacy for different vaccines [56].
In conclusion, Delta variants with some unique mutations, are circulating during the fourth wave of the pandemic, in major cities in the Punjab province of Pakistan. The structural proteins and NSP harbored numerous mutations, present in different functional domains. Mutations spike_K1086Q, spike_E554K, and spike_C1250W were unique to GISAID. The most common mutations in the S protein exhibited a stabilizing effect, facilitating the binding affinity for S protein RBD with human ACE2. Geographic specific vaccines and drugs may be designed for better management of COVID-19 in the future. The geo-climate distribution of the mutations may decipher higher uniqueness and disease severity in underdeveloped countries, including Pakistan. The effect of these mutations on virus pathogenicity may be experimentally verified to access the effect on virus severity and vaccine efficacy. Continuous molecular epidemiology is required to screen geographic-specific mutations for better understanding of the virus pathogenicity, diagnosis, and treatment of COVID-19.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes13030552/s1, File S1: Frequency of mutations; File S2: Genome samples summary; File S3: Mutation details in all targets and samples.

Author Contributions

Conceptualization: M.T.K. and M.S.L.; data acquisition: M.T.K. and M.I.K.; data analysis and interpretation: M.Z.A. and S.S.; writing, original draft: M.Z.A. and M.T.K.; writing, revision: M.I.K. and M.S.L.; final approval: M.T.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Written informed consent has been obtained from the patient(s) to publish this paper.

Data Availability Statement

Genomic data are freely available at NCBI GenBank BioProject PRJNA770504 (https://www.ncbi.nlm.nih.gov/bioprject/PRJNA770504) (Accession ID SRR16294858-SRR16294877).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Guo, Y.R.; Cao, Q.D.; Hong, Z.S.; Tan, Y.Y.; Chen, S.D.; Jin, H.J.; Tan, K.S.; Wang, D.Y.; Yan, Y. The origin, transmission and clinical therapies on coronavirus disease 2019 (COVID-19) outbreak–an update on the status. Mil. Med. Res. 2020, 7, 11. [Google Scholar] [CrossRef] [Green Version]
  2. Yurkovetskiy, L.; Wang, X.; Pascal, K.E.; Tomkins-Tinch, C.; Nyalile, T.P.; Wang, Y.; Baum, A.; Diehl, W.E.; Dauphin, A.; Carbone, C.; et al. Structural and functional analysis of the D614G SARS-CoV-2 spike protein variant. Cell 2020, 183, 739–751.e8. [Google Scholar] [CrossRef] [PubMed]
  3. Kemp, S.A.; Harvey, W.T.; Datir, R.P.; Collier, D.A.; Ferreira, I.A.; Carabelli, A.M.; Gupta, R.K.; Meng, B. Recurrent emergence and transmission of a SARS-CoV-2 spike deletion H69/V70. BioRxiv 2021. [Google Scholar] [CrossRef]
  4. Oude Munnink, B.B.; Sikkema, R.S.; Nieuwenhuijse, D.F.; Molenaar, R.J.; Munger, E.; Molenkamp, R.; van der Spek, A.; Tolsma, P.; Rietveld, A.; Brouwer, M.; et al. Transmission of SARS-CoV-2 on mink farms between humans and mink and back to humans. Science 2020, 371, 172–177. [Google Scholar] [CrossRef]
  5. Tegally, H.; Wilkinson, E.; Giovanetti, M.; Iranzadeh, A.; Fonseca, V.; Giandhari, J.; Doolabh, D.; Pillay, S.; San, E.J.; Msomi, N.; et al. Detection of a SARS-CoV-2 variant of concern in South Africa. Nature 2021, 592, 438–443. [Google Scholar] [CrossRef] [PubMed]
  6. Plante, J.A.; Liu, Y.; Liu, J.; Xia, H.; Johnson, B.A.; Lokugamage, K.G.; Zhang, X.; Muruato, A.E.; Zou, J.; Fontes-Garfias, C.R.; et al. Spike mutation D614G alters SARS-CoV-2 fitness. Nature 2020, 592, 116–121. [Google Scholar] [CrossRef]
  7. Weissman, D.; Alameh, M.-G.; de Silva, T.; Collini, P.; Hornsby, H.; Brown, R.; LaBranche, C.C.; Edwards, R.J.; Sutherland, L.; Santra, S.; et al. D614G spike mutation increases SARS CoV-2 susceptibility to neutralization. Cell Host Microbe 2021, 29, 23–31.e4. [Google Scholar] [CrossRef]
  8. Sabino, E.C.; Buss, L.F.; Carvalho, M.P.S.; Prete, C.A., Jr.; Crispim, M.A.E.; Fraiji, N.A.; Pereira, R.H.M.; Parag, K.V.; da Silva Peixoto, P.; Kraemer, M.U.G.; et al. Resurgence of COVID-19 in Manaus, Brazil, despite high seroprevalence. Lancet 2021, 397, 452–455. [Google Scholar] [CrossRef]
  9. Lopez Bernal, J.; Andrews, N.; Gower, C.; Gallagher, E.; Simmons, R.; Thelwall, S.; Stowe, J.; Tessier, E.; Groves, N.; Dabrera, G.; et al. Effectiveness of COVID-19 vaccines against the B.1.617.2 (Delta) variant. N. Engl. J. Med. 2021, 385, 585–594. [Google Scholar] [CrossRef]
  10. Davies, N.G.; Kucharski, A.J.; Eggo, R.M.; Gimma, A.; Edmunds, W.J.; Jombart, T.; O’Reilly, K.; Endo, A.; Hellewell, J.; Nightingale, E.S.; et al. Effects of non-pharmaceutical interventions on COVID-19 cases, deaths, and demand for hospital services in the UK: A modelling study. Lancet Public Health 2020, 5, e375–e385. [Google Scholar] [CrossRef]
  11. Volz, E.; Hill, V.; McCrone, J.T.; Price, A.; Jorgensen, D.; O’Toole, Á.; Southgate, J.; Johnson, R.; Jackson, B.; Nascimento, F.F.; et al. Evaluating the effects of SARS-CoV-2 spike mutation D614G on transmissibility and pathogenicity. Cell 2021, 184, 64–75. [Google Scholar] [CrossRef]
  12. Wang, W.; Xu, Y.; Gao, R.; Lu, R.; Han, K.; Wu, G.; Tan, W. Detection of SARS-CoV-2 in different types of clinical specimens. JAMA 2020, 323, 1843–1844. [Google Scholar] [CrossRef] [Green Version]
  13. Greaney, A.J.; Loes, A.N.; Crawford, K.H.; Starr, T.N.; Malone, K.D.; Chu, H.Y.; Bloom, J.D. Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies. Cell Host Microbe 2021, 29, 463–476.e6. [Google Scholar] [CrossRef] [PubMed]
  14. Starr, T.N.; Greaney, A.J.; Addetia, A.; Hannon, W.W.; Choudhary, M.C.; Dingens, A.S.; Li, J.Z.; Bloom, J.D. Prospective map-ping of viral mutations that escape antibodies used to treat COVID-19. BioRxiv Prepr. Serv. Biol. 2020. [Google Scholar] [CrossRef]
  15. Saito, A.; Nasser, H.; Uriu, K.; Kosugi, Y.; Irie, T.; Shirakawa, K. SARS-CoV-2 spike P681R mutation enhances and accelerates viral fusion. BioRxiv Prepr. 2021. [Google Scholar] [CrossRef]
  16. Liu, Y.; Liu, J.; Johnson, B.A.; Xia, H.; Ku, Z.; Schindewolf, C.; Widen, S.G.; An, Z.; Weaver, S.C.; Menachery, V.D.; et al. Delta spike P681R mutation enhances SARS-CoV-2 fitness over alpha variant. BioRxiv Prepr. Serv. Biol. 2021. [Google Scholar] [CrossRef]
  17. Korber, B.; Fischer, W.M.; Gnanakaran, S.; Yoon, H.; Theiler, J.; Abfalterer, W.; Hengartner, N.; Giorgi, E.E.; Bhattacharya, T.; Foley, B.; et al. Tracking changes in SARS-CoV-2 spike: Evidence that D614G increases infectivity of the COVID-19 virus. Cell 2020, 182, 812–827.e19. [Google Scholar] [CrossRef]
  18. Yadav, P.D.; Sapkal, G.N.; Abraham, P.; Ella, R.; Deshpande, G.; Patil, D.Y.; Nyayanit, D.A.; Gupta, N.; Sahay, R.R.; Shete, A.M.; et al. Neutralization of variant under investigation B.1.617 with sera of BBV152 vaccinees. Clin. Infect. Dis. Off. Public Infect. Dis. Soc. Am. 2021. [Google Scholar] [CrossRef]
  19. Hoffmann, M.; Hofmann-Winkler, H.; Krüger, N.; Kempf, A.; Nehlmeier, I.; Graichen, L.; Arora, P.; Sidarovich, A.; Moldenhauer, A.-S.; Winkler, M.S.; et al. SARS-CoV-2 variant B.1.617 is resistant to bamlanivimab and evades antibodies induced by infection and vaccination. Cell Rep. 2021, 36, 109415. [Google Scholar] [CrossRef]
  20. Cherian, S.; Potdar, V.; Jadhav, S.; Yadav, P.; Gupta, N.; Das, M.; Rakshit, P.; Singh, S.; Abraham, P.; Panda, S.; et al. SARS-CoV-2 spike mutations, L452R, T478K, E484Q and P681R, in the second wave of COVID-19 in Maharashtra, India. Microorganisms 2021, 9, 1542. [Google Scholar] [CrossRef]
  21. Starr, T.N.; Greaney, A.J.; Dingens, A.S.; Bloom, J.D. Complete map of SARS-CoV-2 RBD mutations that escape the monoclonal antibody LY-CoV555 and its cocktail with LY-CoV016. Cell Rep. Med. 2021, 2, 100255. [Google Scholar] [CrossRef] [PubMed]
  22. Zahradník, J.; Marciano, S.; Shemesh, M.; Zoler, E.; Chiaravalli, J.; Meyer, B.; Rudich, Y.; Dym, O.; Elad, N.; Schreiber, G. SARS-CoV-2 RBD in vitro evolution follows contagious mutation spread, yet generates an able infection inhibitor. BioRxiv 2021. [Google Scholar] [CrossRef]
  23. Péré, H.; Podglajen, I.; Wack, M.; Flamarion, E.; Mirault, T.; Goudot, G.; Hauw-Berlemont, C.; Le, L.; Caudron, E.; Carrabin, S.; et al. Nasal swab sampling for SARS-CoV-2: A convenient alternative in times of nasopharyngeal swab shortage. J. Clin. Microbiol. 2020, 58. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Singh, S. Review of epidata entry and analysis freewares. Indian J. Community Med. Off. Public Indian Assoc. Prev. Soc. Med. 2009, 34, 76–77. [Google Scholar] [CrossRef]
  25. Rodrigues, C.H.M.; Pires, D.E.V.; Ascher, D.B. DynaMut: Predicting the impact of mutations on protein conformation, flexibility and stability. Nucleic Acids Res. 2018, 46, W350–W355. [Google Scholar] [CrossRef]
  26. Berman, H.M. The Protein Data Bank: A historical perspective. Acta Crystallogr. Sect. A Found. Crystallogr. 2008, 64, 88–95. [Google Scholar] [CrossRef]
  27. Robinson, O.; Dylus, D.; Dessimoz, C. Phylo.io: Interactive viewing and comparison of large phylogenetic trees on the web. Mol. Biol. Evol. 2016, 33, 2163–2166. [Google Scholar] [CrossRef] [Green Version]
  28. Hadfield, J.; Megill, C.; Bell, S.M.; Huddleston, J.; Potter, B.; Callender, C.; Sagulenko, P.; Bedford, T.; Neher, R.A. Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics 2018, 34, 4121–4123. [Google Scholar] [CrossRef]
  29. Umair, M.; Ikram, A.; Salman, M.; Alam, M.M.; Badar, N.; Rehman, Z.; Tamim, S.; Khurshid, A.; Ahad, A.; Ahmad, H.; et al. Importation of SARS-CoV-2 variant B.1.1.7 in Pakistan. J. Med. Virol. 2021, 93, 2623–2625. [Google Scholar] [CrossRef] [PubMed]
  30. Umair, M.; Salman, M.; Rehman, Z.; Badar, N.; Ali, Q.; Ahad, A.; Ikram, A. Proliferation of SARS-CoV-2 B.1.1.7 variant in Pakistan-A short surveillance account. Front. Public Health 2021, 9, 683378. [Google Scholar] [CrossRef]
  31. Mlcochova, P.; Kemp, S.A.; Dhar, M.S.; Papa, G.; Meng, B.; Ferreira, I.A.T.M.; Datir, R.; Collier, D.A.; Albecka, A.; Singh, S.; et al. SARS-CoV-2 B.1.617.2 Delta variant replication and immune evasion. Nature 2021, 599, 114–119. [Google Scholar] [CrossRef] [PubMed]
  32. Planas, D.; Veyer, D.; Baidaliuk, A.; Staropoli, I.; Guivel-Benhassine, F.; Rajah, M.M.; Planchais, C.; Porrot, F.; Robillard, N.; Puech, J.; et al. Reduced sensitivity of SARS-CoV-2 variant delta to antibody neutralization. Nature 2021, 596, 276–280. [Google Scholar] [CrossRef]
  33. Gurung, A.B.; Ali, M.A.; Lee, J.; Farah, M.A.; Al-Anazi, K.M.; Al-Hemaid, F.; Sami, H. Structural and functional insights into the major mutations of SARS-CoV-2 Spike RBD and its interaction with human ACE2 receptor. J. King Saud Univ.-Sci. 2022, 34, 101773. [Google Scholar] [CrossRef] [PubMed]
  34. Wang, Y.; Liu, C.; Zhang, C.; Wang, Y.; Hong, Q.; Xu, S.; Li, Z.; Yang, Y.; Huang, Z.; Cong, Y. Structural basis for SARS-CoV-2 Delta variant recognition of ACE2 receptor and broadly neutralizing antibodies. Nat. Commun. 2022, 13, 871. [Google Scholar] [CrossRef]
  35. Barnes, C.O.; Jette, C.A.; Abernathy, M.E.; Dam, K.-M.A.; Esswein, S.R.; Gristick, H.B.; Malyutin, A.G.; Sharaf, N.G.; Huey-Tubman, K.E.; Lee, Y.E.; et al. SARS-CoV-2 neutralizing antibody structures inform therapeutic strategies. Nature 2020, 588, 682–687. [Google Scholar] [CrossRef]
  36. Umair, M.; Ikram, A.; Salman, M.; Haider, S.A.; Badar, N.; Rehman, Z.; Ammar, M.; Rana, M.S.; Ali, Q. Genomic surveillance reveals the detection of SARS-CoV-2 delta, beta, and gamma VOCs during the third wave in Pakistan. J. Med. Virol. 2022, 94, 1115–1129. [Google Scholar] [CrossRef]
  37. Winger, A.; Caspari, T. The Spike of Concern—The Novel Variants of SARS-CoV-2. Viruses 2021, 13, 1002. [Google Scholar] [CrossRef]
  38. Yadav, P.D.; Mohandas, S.; Shete, A.M.; Nyayanit, D.A.; Gupta, N.; Patil, D.Y.; Sapkal, G.N.; Potdar, V.; Kadam, M.; Kumar, S. SARS CoV-2 variant B. 1.617. 1 is highly pathogenic in hamsters than B. 1 variant. BioRxiv 2021. [Google Scholar] [CrossRef]
  39. Plante, J.A.; Mitchell, B.M.; Plante, K.S.; Debbink, K.; Weaver, S.C.; Menachery, V.D. The variant gambit: COVID-19′s next move. Cell Host Microbe 2021, 29, 508–515. [Google Scholar] [CrossRef]
  40. Gottlieb, R.L.; Nirula, A.; Chen, P.; Boscia, J.; Heller, B.; Morris, J.; Huhn, G.; Cardona, J.; Mocherla, B.; Stosor, V.; et al. Effect of bamlanivimab as monotherapy or in combination with etesevimab on viral load in patients with mild to moderate COVID-19: A randomized clinical trial. JAMA 2021, 325, 632–644. [Google Scholar] [CrossRef]
  41. Johnson, B.A.; Xie, X.; Kalveram, B.; Lokugamage, K.G.; Muruato, A.; Zou, J.; Zhang, X.; Juelich, T.; Smith, J.K.; Zhang, L. Furin cleavage site is key to SARS-CoV-2 pathogenesis. BioRxiv 2020. [Google Scholar] [CrossRef]
  42. Gobeil, S.M.C.; Janowska, K.; McDowell, S.; Mansouri, K.; Parks, R.; Stalls, V.; Kopp, M.F.; Manne, K.; Li, D.; Wiehe, K.; et al. Effect of natural mutations of SARS-CoV-2 on spike structure, conformation, and antigenicity. Science 2021, 373, eabi6226. [Google Scholar] [CrossRef] [PubMed]
  43. Chaudhari, A.M.; Kumar, D.; Joshi, M.; Patel, A.; Joshi, C. E156G and Arg158, Phe-157/del mutation in NTD of spike protein in B.1.617.2 lineage of SARS-CoV-2 leads to immune evasion through antibody escape. BioRxiv 2021. [Google Scholar] [CrossRef]
  44. Yang, Q.; Xiao, X.; Gu, X.; Liang, D.; Cao, T.; Mou, J.; Huang, C.; Chen, L.; Liu, J. Surveillance of common respiratory infections during the COVID-19 pandemic demonstrates the preventive efficacy of non-pharmaceutical interventions. Int. J. Infect. Dis. 2021, 105, 442–447. [Google Scholar] [CrossRef]
  45. Tomaszewski, T.; DeVries, R.S.; Dong, M.; Bhatia, G.; Norsworthy, M.D.; Zheng, X.; Caetano-Anollés, G. New pathways of mutational change in SARS-CoV-2 Proteomes involve regions of intrinsic disorder important for virus replication and release. Evol. Bioinform. 2020, 16, 1176934320965149. [Google Scholar] [CrossRef]
  46. Khan, M.T.; Zeb, M.T.; Ahsan, H.; Ahmed, A.; Ali, A.; Akhtar, K.; Malik, S.I.; Cui, Z.; Ali, S.; Khan, A.S.; et al. SARS-CoV-2 nucleocapsid and Nsp3 binding: An in silico study. Arch. Microbiol. 2020, 203, 59–66. [Google Scholar] [CrossRef] [PubMed]
  47. Lei, J.; Kusov, Y.; Hilgenfeld, R. Nsp3 of coronaviruses: Structures and functions of a large multi-domain protein. Antivir. Res. 2018, 149, 58–74. [Google Scholar] [CrossRef]
  48. Wang, R.; Chen, J.; Hozumi, Y.; Yin, C.; Wei, G.-W. Decoding asymptomatic COVID-19 infection and transmission. J. Phys. Chem. Lett. 2020, 11, 10007–10015. [Google Scholar] [CrossRef]
  49. Khan, M.T.; Ali, S.; Khan, A.S.; Muhammad, N.; Khalil, F.; Ishfaq, M.; Irfan, M.; Al-Sehemi, A.G.; Muhammad, S.; Malik, A.; et al. SARS-CoV-2 genome from the Khyber Pakhtunkhwa Province of Pakistan. ACS Omega 2021, 6, 6588–6599. [Google Scholar] [CrossRef]
  50. Mazhari, S.; Alavifard, H.; Rahimian, K.; Karimi, Z.; Mahmanzar, M.; Sisakht, M.M.; Bitaraf, M.; Arefian, E. SARS-CoV-2 NSP-12 Mutations Survey during the Pandemic in the World. 2021. Available online: https://doi.org/10.21203/rs.3.rs-877078/v1 (accessed on 28 September 2021).
  51. Fibriani, A.; Stephanie, R.; Alfiantie, A.A.; Siregar, A.L.F.; Pradani, G.A.P.; Yamahoki, N.; Purba, W.S.; Alamanda, C.N.C.; Rahmawati, E.; Rachman, R.W.; et al. Analysis of SARS-CoV-2 genomes from West Java, Indonesia. Viruses 2021, 13, 2097. [Google Scholar] [CrossRef]
  52. Fan, L.-Q.; Hu, X.-Y.; Chen, Y.-Y.; Peng, X.-L.; Fu, Y.-H.; Zheng, Y.-P.; Yu, J.-M.; He, J.-S. Biological significance of the genomic variation and structural dynamics of SARS-CoV-2 B.1.617. Front. Microbiol. 2021, 12, 2837. [Google Scholar] [CrossRef]
  53. Karim, S.S.A.; de Oliveira, T. New SARS-CoV-2 variants—clinical, public health, and vaccine implications. N. Engl. J. Med. 2021, 384, 1866–1868. [Google Scholar] [CrossRef]
  54. Chemaitelly, H.; Tang, P.; Hasan, M.R.; AlMukdad, S.; Yassine, H.M.; Benslimane, F.M.; Al Khatib, H.A.; Coyle, P.; Ayoub, H.H.; Al Kanaani, Z.; et al. Waning of BNT162b2 vaccine protection against SARS-CoV-2 infection in Qatar. N. Engl. J. Med. 2021, 385, e83. [Google Scholar] [CrossRef]
  55. Tartof, S.Y.; Slezak, J.M.; Fischer, H.; Hong, V.; Ackerson, B.K.; Ranasinghe, O.N.; Frankland, T.B.; Ogun, O.A.; Zamparo, J.M.; Gray, S.; et al. Effectiveness of mRNA BNT162b2 COVID-19 vaccine up to 6 months in a large integrated health system in the USA: A retrospective cohort study. Lancet 2021, 398, 1407–1416. [Google Scholar] [CrossRef]
  56. Rosenberg, E.S.; Dorabawila, V.; Easton, D.; Bauer, U.E.; Kumar, J.; Hoen, R.; Hoefer, D.; Wu, M.; Lutterloh, E.; Conroy, M.B.; et al. COVID-19 vaccine effectiveness in New York State. N. Engl. J. Med. 2021, 386, 116–127. [Google Scholar] [CrossRef]
Figure 1. Domain organization of structural proteins. ORF: open reading frame, E: envelope, M: membrane, N: nucleocapsid, SP: signal peptide, NTD: N-terminal domain, RBD: receptor-binding domain, RBM: receptor-binding motif, FP: fusion peptide, HR: heptapeptide repeat sequence, TM: transmembrane, CP: cytoplasmic domain, SR: serine rich, CTD: C-terminal domain.
Figure 1. Domain organization of structural proteins. ORF: open reading frame, E: envelope, M: membrane, N: nucleocapsid, SP: signal peptide, NTD: N-terminal domain, RBD: receptor-binding domain, RBM: receptor-binding motif, FP: fusion peptide, HR: heptapeptide repeat sequence, TM: transmembrane, CP: cytoplasmic domain, SR: serine rich, CTD: C-terminal domain.
Genes 13 00552 g001
Figure 2. Location of most common mutations in S protein. Yellow indicates the receptor-binding motif, present in the RBD region (raspberry color). The majority of these mutations were found in the loop regions.
Figure 2. Location of most common mutations in S protein. Yellow indicates the receptor-binding motif, present in the RBD region (raspberry color). The majority of these mutations were found in the loop regions.
Genes 13 00552 g002
Figure 3. Mutation effect on S protein structure stability and flexibility. Mutants G142D, T478K, E156G, and L452R in S protein, exhibited stabilizing effect. The blue region shows rigidification of structure behind mutations and red shows gain in flexibility.
Figure 3. Mutation effect on S protein structure stability and flexibility. Mutants G142D, T478K, E156G, and L452R in S protein, exhibited stabilizing effect. The blue region shows rigidification of structure behind mutations and red shows gain in flexibility.
Genes 13 00552 g003
Figure 4. Mutation effect on S protein stability and flexibility. Mutants P681R and T19R in S protein, exhibited a stabilizing effect. The blue region shows rigidification of structure behind mutations and red shows gain in flexibility.
Figure 4. Mutation effect on S protein stability and flexibility. Mutants P681R and T19R in S protein, exhibited a stabilizing effect. The blue region shows rigidification of structure behind mutations and red shows gain in flexibility.
Genes 13 00552 g004
Figure 5. SARS-CoV-2 genomic epidemiology from Pakistan. The graph shows an estimate of divergence over the time period within genomes. (A) Current study isolates (labelled Pakistan Delta isolates). (B) Position of current SARS-CoV-2 isolates in radial tree (UOL) was built using Nextstrain 28 (Nextclade (nextstrain.org). Sub-clades have been color coded in the tree, characterized by some specific mutations in structural proteins (https://covariants.org/variants) (accessed on 28 September 2021).
Figure 5. SARS-CoV-2 genomic epidemiology from Pakistan. The graph shows an estimate of divergence over the time period within genomes. (A) Current study isolates (labelled Pakistan Delta isolates). (B) Position of current SARS-CoV-2 isolates in radial tree (UOL) was built using Nextstrain 28 (Nextclade (nextstrain.org). Sub-clades have been color coded in the tree, characterized by some specific mutations in structural proteins (https://covariants.org/variants) (accessed on 28 September 2021).
Genes 13 00552 g005
Figure 6. Structure of NSP12 and NSp3 (PLpro). (A) SARS-CoV-2 RdRp (PDB ID: 6M71) contains an N-terminal β-hairpin (residues 31–50). NiRAN (residues 50–249), interface domain (residues 251–365). The NiRAN domain is comprised of three helices and five β-strands, associated with RdRp domain (residues 366–920). (B) Complex of NSP12, NSP7, and NSP8. (C) Organization of NSP3 protein. (D) Structure of PLpro with mutations labeled NSP3_P822L (P77L), NSP3_L862F (L117F), and H920Y (H175y). PLpro has a small N-terminal ubiquitin-like (Ubl) domain and catalytic domain (thumb–palm–fingers).
Figure 6. Structure of NSP12 and NSp3 (PLpro). (A) SARS-CoV-2 RdRp (PDB ID: 6M71) contains an N-terminal β-hairpin (residues 31–50). NiRAN (residues 50–249), interface domain (residues 251–365). The NiRAN domain is comprised of three helices and five β-strands, associated with RdRp domain (residues 366–920). (B) Complex of NSP12, NSP7, and NSP8. (C) Organization of NSP3 protein. (D) Structure of PLpro with mutations labeled NSP3_P822L (P77L), NSP3_L862F (L117F), and H920Y (H175y). PLpro has a small N-terminal ubiquitin-like (Ubl) domain and catalytic domain (thumb–palm–fingers).
Genes 13 00552 g006
Table 1. SARS-CoV-2 patients’ information.
Table 1. SARS-CoV-2 patients’ information.
Sample IDGender *Age (Years)Location
1F20Okara
2M33Sialkot
3F65Sialkot
4M26Sahiwal
5M31Nankana
6M26Shakot
7M60Pak Pattan
8F25Okara
9M24Okara
10M28Pak Pattan
11F43Okara
12M46Pak Pattan
13F42Pak Pattan
14M53Pak Pattan
15M30Okara
16M65Okara
17F21Lahore
18F42Lahore
19M42Lahore
20M32Okara
* F: female, M: male.
Table 2. Genome length and mutation’s pattern in SARS-CoV-2 sample.
Table 2. Genome length and mutation’s pattern in SARS-CoV-2 sample.
SampleLength #(nt)Length #(aa)Muts #Muts %Unique Muts
129,8229705370.38%2
229,8119705340.35%0
329,8179705380.39%0
429,8239705310.32%0
529,8229705370.38%0
629,7979705330.34%0
829,7749705360.37%1
929,8279705320.33%0
1129,8249705310.32%1
1229,8289705350.36%0
1329,7029669690.71%0
1429,8289705280.29%1
1529,8269705340.35%0
1629,8269705310.32%0
1729,7379705370.38%2
1829,8219705390.40%0
1929,8279705360.37%2
2029,3839650981.02%10
# Mut: mutations, nt: nucleotide, aa: amino acid.
Table 3. Mutational frequency in structural proteins.
Table 3. Mutational frequency in structural proteins.
MutationNo. of Samples/Frequency
Spike_T478K18
Spike_T19R18
Spike_L452R18
Spike_F157del18
Spike_E156G18
Spike_P681R18
Spike_D614G18
Spike_R158del18
Spike_G142D18
N_D377Y18
N_D377Y18
N_R203M18
M_I82T18
N_G215C17
Spike_D950N12
Spike_T95I12
Table 4. Unique mutations in NSP and NS3 SARS-CoV-2 isolates from Punjab province, Pakistan.
Table 4. Unique mutations in NSP and NS3 SARS-CoV-2 isolates from Punjab province, Pakistan.
Sample IDUnique Mutations
20NSP3_V1388A
20NSP3_W1498S
20NSP3_S1495N
20NSP3_Y1535V
20NSP3_S1534R
20NSP3_K1497H
20NSP3_S1494G
20NSP3_D1499P
20NSP12_F313Y *
19NSP5_V86L *
14NS3_I118L
8NSP3_D339Y *
1NSP3_T1303P
* NSP3; Papain-like protease (PLpro), NSP5; Main protease (Mpro), NSP12; RNA-dependent RNA polymerase.
Table 5. Mutations in NSP of SARS-CoV-2 circulating isolates in Punjab province.
Table 5. Mutations in NSP of SARS-CoV-2 circulating isolates in Punjab province.
MutationCountMutationCountMutationCountMutationCount
NSP12_G671S #18NSP3_E391D1NSP3_F1516del1NSP14_I231del1
NSP13_P77L18NSP3_M1529del1NSP3_I1514del1NSP14_V236del1
NSP3_P1469S17NSP3_S1494G1NSP3_L1511del1NSP14_V136I1
NSP3_A488S17NSP3_I1528del1NSP3_L1505del1NSP14_W247del1
NSP3_P1228L17NSP3_F1503del1*NSP3_H920Y1NSP14_P203L1
NSP4_T492I17NSP3_V1522del1NSP3_L1525del1NSP14_G248S1
NSP4_V167L17NSP3_F1510del1NSP3_W1498S1NSP14_S221del1
NSP6_T77A17NSP3_F1519del1NSP3_L1531del1NSP14_Q246del1
NSP14_A394V17NSP3_A1512del1NSP3_L1500del1NSP14_S230del1
NSP12_P323L16NSP3_A1526del1NSP4_A446V1NSP14_I242del1
NSP14_M72I6NSP3_G1524del1NSP4_I377M1NSP14_Y235del1
NSP3_S1370F3NSP3_K1497H1NSP5_S254F1NSP14_C216del1
NSP3_S1285F2NSP3_D339Y1NSP5_V86L1NSP14_T223del1
NSP5_P184T2NSP3_A1507del1NSP6_M92V1NSP14_D243del1
NSP1_H83Y1NSP3_Y1521del1NSP6_T181I1NSP14_T219del1
NSP2_L271F1NSP3_Y1535V1NSP6_V149A1NSP14_M241del1
NSP2_E373K1NSP3_A1527del1NSP10_A20S1NSP14_P239del1
NSP2_D315N1NSP3_Q1530del1NSP12_A16V1NSP14_W227del1
NSP2_S591N1NSP3_S1534R1NSP12_A185V1NSP14_A225del1
NSP2_A306V1NSP3_F1533del1#NSP12_L829F1NSP14_F240del1
NSP2_K81N1NSP3_T1517del1NSP12_S318T1NSP14_Q245del1
NSP3_S1495N1NSP3_R1518del1NSP12_F317Y1NSP14_C226del1
NSP3_L1523del1NSP3_Y1513del1#NSP12_M380I1NSP14_H228del1
NSP3_S1699F1NSP3_W1509del1NSP12_Q357H1NSP14_T215del1
NSP3_E1508del1NSP3_D1499P1NSP12_F313Y1NSP14_G232del1
NSP3_L862F *1NSP3_S211G1NSP14_D222del1NSP14_A220del1
NSP3_P822L *1NSP3_V1506del1NSP14_F233del1NSP14_R213del1
NSP3_A465V1NSP3_F1520del1NSP14_V244del1NSP14_N238del1
NSP3_T1501del1NSP3_V1388A1NSP14_F217del1NSP14_S218del1
NSP3_F1532del1NSP3_A644S1NSP14_D234del1NSP14_Y224del1
NSP3_G1504del1NSP3_L1515del1NSP14_R212del1NSP14_Y237del1
NSP3_A1502del1NSP3_T1303P1NSP14_H229del1NSP14_A214del1
del: deletion, RdRp: RNA-dependent RNA-polymerase, NSP: non-structural protein. * Mutations in PLpro domain. # Mutations in RdRp domain of NSP12.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Anwar, M.Z.; Lodhi, M.S.; Khan, M.T.; Khan, M.I.; Sharif, S. Coronavirus Genomes and Unique Mutations in Structural and Non-Structural Proteins in Pakistani SARS-CoV-2 Delta Variants during the Fourth Wave of the Pandemic. Genes 2022, 13, 552. https://doi.org/10.3390/genes13030552

AMA Style

Anwar MZ, Lodhi MS, Khan MT, Khan MI, Sharif S. Coronavirus Genomes and Unique Mutations in Structural and Non-Structural Proteins in Pakistani SARS-CoV-2 Delta Variants during the Fourth Wave of the Pandemic. Genes. 2022; 13(3):552. https://doi.org/10.3390/genes13030552

Chicago/Turabian Style

Anwar, Muhammad Zeeshan, Madeeha Shahzad Lodhi, Muhammad Tahir Khan, Malik Ihsanullah Khan, and Sumaira Sharif. 2022. "Coronavirus Genomes and Unique Mutations in Structural and Non-Structural Proteins in Pakistani SARS-CoV-2 Delta Variants during the Fourth Wave of the Pandemic" Genes 13, no. 3: 552. https://doi.org/10.3390/genes13030552

APA Style

Anwar, M. Z., Lodhi, M. S., Khan, M. T., Khan, M. I., & Sharif, S. (2022). Coronavirus Genomes and Unique Mutations in Structural and Non-Structural Proteins in Pakistani SARS-CoV-2 Delta Variants during the Fourth Wave of the Pandemic. Genes, 13(3), 552. https://doi.org/10.3390/genes13030552

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop