Next Article in Journal
Does Phage Therapy Need a Pan-Phage?
Previous Article in Journal
MultiTEP-Based Vaccines Targeting SARS-CoV-2 Spike Protein IgG Epitopes Elicit Robust Binding Antibody Titers with Limited Virus-Neutralizing Activity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Epidemiological Characterization and Genetic Variation of the SARS-CoV-2 Delta Variant in Palestine

by
Suheir Ereqat
1,†,
Nabil-Fareed Alikhan
2,†,
Amer Al-Jawabreh
3,4,*,
Michaela Matthews
2,
Ahmed Al-Jawabreh
1,5,
Leonardo de Oliveira Martins
2,
Alexander J. Trotter
2,
Mai Al-Kaila
5,
Andrew J. Page
2,‡,
Mark J. Pallen
2,6,7,‡ and
Abedelmajeed Nasereddin
1,8,‡
1
Biochemistry and Molecular Biology Department, Faculty of Medicine, Al-Quds University, Abu Dis, Jerusalem P.O. Box 51000, Palestine
2
Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
3
Department of Medical Laboratory Sciences, Faculty of Allied Health Sciences, Arab American University, Jenin P.O. Box 240, Palestine
4
Leishmaniases Research Unit, Jericho P5840227, Palestine
5
Ministry of Health of the State of Palestine, Ramallah P6028100, Palestine
6
University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
7
School of Veterinary Medicine, University of Surrey, Guildford GU2 7AL, UK
8
Al-Quds Bard College, Al-Quds University, East Jerusalem P.O. Box 20002, Palestine
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
These authors also contributed equally to this work.
Pathogens 2024, 13(6), 521; https://doi.org/10.3390/pathogens13060521
Submission received: 19 April 2024 / Revised: 29 May 2024 / Accepted: 6 June 2024 / Published: 20 June 2024

Abstract

:
The emergence of new SARS-CoV-2 variants in Palestine highlights the need for continuous genetic surveillance and accurate screening strategies. This case series study aimed to investigate the geographic distribution and genetic variation of the SARS-CoV-2 Delta Variant in Palestine in August 2021. Samples were collected at random in August 2021 (n = 571) from eight districts in the West Bank, Palestine. All samples were confirmed as positive for COVID-19 by RT-PCR. The samples passed the quality control test and were successfully sequenced using the ARTIC protocol. The Delta Variant was revealed to have four dominant lineages: B.1.617 (19%), AY.122 (18%), AY.106 (17%), and AY.121 (13%). The study revealed eight significant purely spatial clusters (p < 0.005) distributed in the northern and southern parts of Palestine. Phylogenetic analysis of SARS-CoV-2 genomes (n = 552) showed no geographically specific clades. The haplotype network revealed three haplogroups without any geographic distribution. Chronologically, the Delta Variant peak in Palestine was shortly preceded by the one in the neighboring Israeli community and shortly followed by the peak in Jordan. In addition, the study revealed an extremely intense transmission network of the Delta Variant circulating between the Palestinian districts as hubs (SHR ≈ 0.5), with Al-Khalil, the district with the highest prevalence of COVID-19, witnessing the highest frequency of transitions. Genetic diversity analysis indicated closely related haplogroups, as haplotype diversity (Hd) is high but has low nucleotide diversity (π). However, nucleotide diversity (π) in Palestine is still higher than the global figures. Neutrality tests were significantly (p < 0.05) low, including Tajima’s D, Fu-Li’s F, and Fu-Li’s D, suggesting one or more of the following: population expansion, selective sweep, and natural negative selection. Wright’s F-statistic (Fst) showed genetic differentiation (Fst > 0.25) with low to medium gene flow (Nm). Recombination events were minimal between clusters (Rm) and between adjacent sites (Rs). The study confirms the utility of the whole genome sequence as a surveillance system to track the emergence of new SARS-CoV-2 variants for any possible geographical association and the use of genetic variation analysis and haplotype networking to delineate any minimal change or slight deviation in the viral genome from a reference strain.

1. Introduction

Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was first reported in Wuhan, PR China in late 2019. In March 2020, the World Health Organization (WHO) declared the COVID-19 outbreak a global pandemic [1]. As of July 2022, there have been over 562 million reported cases and 6.3 million fatalities [2]. Genome sequencing of SARS-CoV-2 positive samples allows for mutations to be identified and variants to be categorized [3,4,5]. Analyses of SARS-CoV-2 genomes have revealed successive “Variants of Concern” (VOCs), characterized by distinctive sets of mutations and renewed epidemic potential, leading to successive waves of the COVID-19 pandemic [6]. Since June 2021, these VOCs have been assigned letters of the Greek alphabet within a WHO naming scheme [7]. The Alpha Variant (PANGO lineage B.1.1.7) was first detected in the UK in September 2020, the Delta Variant (PANGO lineage B.1.617.2) was first detected in India in October 2020, and the Omicron Variant (B.1.1.529) emerged in multiple countries in late 2021 [8]. Emergence of these variants highlights the need for continuous genomic surveillance to identify and track new SARS-CoV-2 variants in real time.
The first cases of COVID-19 were detected in the West Bank, Palestine in March 2020 [9]. By February 2022, over 0.6 million cases and over 5000 deaths had been reported from Palestine [9]. A study conducted in the West Bank, Palestine showed that the country’s first wave in 2020 was caused by close relatives of the original Wuhan strain [10,11]. A subsequent study showed that a second surge of COVID-19 in early 2021 was associated with the Alpha lineage (B.1.1.7) [12]. Here, we have investigated the geographic distribution and genetic diversity of the SARS-CoV-2 Delta Variant whole genomes in Palestine in August 2021 as a prerequisite to elucidating the dynamics of disease transmission and the source of infection in the Palestinian community based on the whole-genome sequencing (WGS) and the observation of mutation dynamics and changes in the viral genome that affect the viral behavior in terms of spread and pathogenicity, in addition to vaccine candidate suitability.

2. Materials and Methods

2.1. Study Samples and Processing

In this case series study, nasopharyngeal swabs were arbitrarily collected from COVID-19 patients residing in different Palestinian districts in August 2021 during the third wave of the pandemic, without consideration of age, gender, place of residence, or symptoms. The patients’ samples were selected from the Ministry of Health (MOH) central laboratory, which serves as a hub for the satellite laboratories throughout the West Bank (Figure 1) and hold all the samples and data in the country. The patients were non-hospitalized individuals from the Palestinian community seeking COVID-19 testing. Hospitalized individuals, non-Palestinians, and those with incomplete data records were excluded from the study. As the population of the West Bank was 3.15 million, and the expected frequency (prevalence) of COVID-19 was 50%, the sample size at a 95% confidence level was calculated by EpiInfo v. 7.2.4.0 (CDC-free software) to be 384. The assumption of 50% prevalence gives a maximum sample size. Extra samples were included to account any possible test failures (n = 571). Viral RNA was extracted from all swabs at the Palestinian Ministry of Health laboratories and confirmed as SARS-CoV-2 cases by real-time (RT) PCR.
cDNA synthesis was performed with LunaScript RT SuperMix (New England Biolabs, Ipswich, MA, USA). DNA library preparation was based on ARTIC v4 amplicon using the IDT ARTIC nCoV-2019 V3 Panel [13] as previously described [14,15]. Libraries were sequenced using the NextSeq 500/550 Mid Output Kit v2.5 (150 Cycles) on the NextSeq 500 platform (Illumina, San Diego, CA, USA) at Quadram Institute Biosciences, Norwich, UK. One positive control containing an extinct lineage (B.1.177) and one negative control were included for each set of 94 samples.

2.2. Genome Sequencing and Strain Typing

A total of 571 RNA samples passed the quality control test indicating high viral loads and were successfully sequenced as follows. The raw reads were demultiplexed using bcl2fastq (v2.20) [16]. The reads were used to generate a consensus sequence using the ARTIC bioinformatics pipeline [15]. Briefly, the reads had adapters trimmed with TrimGalore [17] and were aligned to the Wuhan Hu-1 reference genome (Genbank reference accession MN908947.3) using BWA-MEM (v0.7.17) [18]; the ARTIC amplicons were trimmed, and a consensus was built using iVAR (v1.3.0) [19]. PANGO lineages were assigned using Pangolin (v3.1.20 and v4.0.6) [20].

2.3. Phylogenetic and Haplotype Analyses

The phylogenetic tree was built from 552 sequences using IQTREE2 (v2.2.0-beta) with a UNREST+FO+I+G4 model, chosen by the IQTREE2’s ModelFinder module [21]. The network illustrating state change between different districts was generated using StrainHub [22]. A parsimony ancestral reconstruction step is performed to create links between the tree and its associated metadata. The transmission network was implemented based on the degree centrality metric, which is the number of edges originating from (outdegree) and ending on (indegree) a given node and the “Source Hub Ratio” (SHR), which represents the ratio of all outward transitions from the node over all transitions from and to the node. A node scoring a SHR close to 1 indicates a source, a SHR close to 0.5 a hub, and a SHR close to 0 a sink for SARS-CoV-2. Temporal data were not used in the construction of this network.
In a second step of analysis, a median-joining haplotype network was constructed using the PopArt 1.7 based on single-nucleotide variation (SNV) across the SARS-CoV-2 S-protein domain spike gene, which contains the Delta Variant mutations that play a pivotal role in the infection process, according to default parameters [10,23,24,25]. Within this context, the term “haplotype” refers to the occurrence of a group of SARS-CoV2 sequences clustering in one unit and differing in at least a single nucleotide (single-nucleotide variation, SNV) to identify patterns of intra-population genetic variation. Along the same line, a “haplogroup’ is a group of closely linked haplotypes clustered together on the network and which tend to be inherited together. The network was constructed based on the geographical origin of the viral isolate. The overall number of DNA sequences (n = 552) was filtered by removing all sequences with extensively missing nucleotides, as pointed out by the software.
Based on the assumptions that SARS-CoV-2 has a high mutation rate, probable homoplasy, and a possibility of recombination, the parsimonious phylogenetic tree and median-joining haplotype networking were used side by side to maximize the probability of finding all connections to discern the genetic variation [26].

2.4. Genetic Diversity Analyses

The genetic variation analyses were based on the number of haplogroups produced by the haplotype network analysis using the PopArt 1.7. DnaSP ver. 6.12.03 was used to conduct the analyses, which included nucleotide–haplotype diversity, tests of neutrality, genetic differentiation estimators, and the degree of recombination, as described elsewhere [12,27]. Haplotype diversity (Hd or h) was calculated by DnaSP ver. 6.12.03 using the following formula:
h = n 1 x ^ i 2 / n 1
where n is the number of individuals sampled, and xi is the allele frequency. The hat sign (^) indicates estimation. Nei’s original equation had 2n [28].
Assuming that genetic diversity is driven by the processes of recombination, reassortment, and random mutation within the population, the nucleotide diversity, Pi (π) between ith and jth DNA sequences in the sample, was calculated by DnaSP ver. 6.12.03 using the following formula:
π ^ = n n 1 i j x ^ i x ^ j π i j
where n is the number of DNA sequences examined, xi is the population frequency of the ith type of DNA sequence, xj is the population frequency of the jth type of DNA sequence, and π i j is the number of nucleotide differences per site between the ith and jth sequences. The hat sign (^) indicates estimation.

2.5. Purely Spatial Cluster Mapping

The choropleth, spot, and cluster mapping of COVID-19 cases from Palestine were constructed with statistical inference using the SaTScan™ v9.7 mapping package for the spatial scan statistics of case clustering and the Epi Info™ 7 statistical package (CDC free-software) for accurate mapping. SaTScan™ v9.7 Freeware was used to detect statistical evidence for purely spatial clustering of cases in the study area. The input files included the number of cases per locality, year of infection, population size of the location in the year of infection, and the exact latitude–longitude coordinates of each location. Data were analyzed based on the discrete Poisson model, using Monte Carlo hypothesis. The level of significance used was at p-value ≤ 0.05 [29].

3. Results

3.1. Study Population

Our study included a total of 571 SARS-CoV-2 samples that were obtained in August 2021 from the Ministry of Health (MOH) Central Laboratory in Ramallah, the West Bank. This center serves as the reference laboratory and receives all COVID-19 positive samples from districts across Palestine excluding the Gaza Strip. The median age of study participants was 27 years, ranging from 1 to 84 years. The sex distribution was 50% females compared to 45% males, with 5% unknown. The samples were from 90 Palestinian localities in eight Palestinian districts, including Bethlehem (13.8%), Al-Khalil (32%), Jenin (18%), Ariha (Jericho) (4.7%), East Jerusalem (Al-Quds) (2.6%), Nablus (12%), Ramallah (2%), and Tulkarem (15%) (Figure 1, Supplementary Table S1). Genome sequences were deposited in the GISAID database (EPI_ISL_10033368-EPI_ISL_10033935).

3.2. Identification of Delta Lineages and Sub-Lineages

Our study showed that the most frequent lineage in August 2021 (representing the third wave of the SARS-CoV-2 pandemic) was the Delta Variant, which was detected in 99.9% (n = 570) of cases. The frequency of SARS-CoV-2 lineages/sub-lineages is shown in Table 1. Our results showed that 25 AY lineages were circulating in Palestine (the West Bank), with four dominating in the area: B.1.617.2 (31%), AY.122 (16%), AY.106 (19%), and AY.121 (13%). All study districts contained at least seven lineages (Figure 2).

3.3. Determination of Distribution and Purely Spatial Clusters

Considering the population size of a given locality and the number of COVID-19 cases within it, the study revealed 16 purely spatial clusters in Palestine (the West Bank). However, only eight of these were statistically significant (p < 0.05) (Table 2 and Figure 1). The significant clusters were distributed in the northern and southern parts of Palestine, with the most significant clusters in the village of Beit-Kahil (Al-Khalil district) (RR: 8.2, p < 0.001) and Jalboun village (Jenin district) (RR: 6.2, p < 0.001). The locality with the largest bulk of COVID-19 samples was the city of Al-Khalil (n = 90). However, it did not form a statistically significant cluster.

3.4. Phylogenetic and Haplotype Analyses

We constructed a phylogenetic relationship between the Delta Variant genomes (Figure 2). The phylogenetic tree included 552 genome sequences; these 552 out of 571 were those with less than 50% sequence ambiguity. Delta Variant genomes represent a diverse range of lineages consistent with the PANGO lineages defined (n = 25). No geographically specific clades were defined.
The phylogenetic tree of sequenced SARS-CoV-2 genomes (Figure 2), represented as a transmission network (Figure 3), illustrates frequent exchange between the different districts. However, the direction of transmission is unclear from these data. Indeed, transmission links (Figure 3) are based on the available data, and there maybe additional sources or intermediates. This also suggests there is no single identifiable location source of the Delta Variant of SARS-CoV-2 in Palestine. Full contextual data for samples can be found in Supplementary Table S1. The transmission network shows that all Palestinian districts have a SHR value close to 0.5 (0.375–0.6), except the district of Ramallah (SHR = 0), which indicates that all districts are hubs for SARS-CoV-2 except for Ramallah, which is a sink.
However, the haplotype network showed 67 haplotypes forming three major haplogroups, represented by three large circles: A, B, and C (Figure 4). Haplogroup A is the core cluster, containing 38 haplotypes from all locations of the study area. Haplogroup B is the second largest circle in diameter, with 22 haplotypes. Haplogroup C contained only seven haplotypes originating mainly from the Nablus district. Apart from five, all of the differences between adjacent haplotypes are restricted to one SNV, as indicated by the hatch marks. Each haplogroup had a star-like formation around it, representing subhaplogroups (intermediate circles) or even a single haplotype (smallest circles).

3.5. Genetic Diversity Analyses

The analyses (Table 3) were based on the three major haplogroups (Figure 4) spun off from the PopArt 1.7 analysis, which utilized 462 sequences of the spike region that were fully sequenced. Meanwhile, the rest (109) were excluded from the total bulk (571), as they were rejected by PopArt 1.7. Cluster A (n = 241), the largest of the three, showed the highest number of haplotypes (h) and number of spike mutations (η); however, the ratio of the number of haplotypes to the number of sequences (h:n) remained constant (=0.19) in the three clusters, indicating comparable haplotype distribution across the three clusters. Similarly, the ratio of the number of mutations to the number of sequences for cluster A, B, and C was 0.21, 0.25, and 0.19, respectively. It is obvious that cluster A had the highest Eta:n, which runs parallel with the highest nucleotide diversity (π = 0.0008) and the highest average number of nucleotide differences between any two sequences in the cluster (k = 2.01), specifying cluster A as the most diverse of the three. However, nucleotide diversity (π) (total = 0.0009; ranging = 0.0002–0.0008) is still considered relatively low; simultaneously, haplotype diversity (Hd) (total = 0.87; range = 0.44–0.81) is considered high. High haplotype diversity (Hd) with low nucleotide diversity (π) is indicative of closely related haplotypes. The three neutrality tests, including Tajima’s D, Fu-Li’s F, and Fu-Li’s D, were significantly below zero for individual clusters and the overall bulk (Table 3).
The study showed high levels of genetic differentiation, with Wright’s F-statistic pair genetic differentiation distance (Fst) between any two clusters greater than 0.25 (Fst = 0.47–0.70), together with low (Nm = 0.21) to medium (Nm = 0.36 and 0.57) gene flow between the three SARS-CoV-2 clusters (Table 4). The other genetic differentiation estimators such as the mean coefficient of gene differentiation (Gst) and the number of net nucleotide substitutions per site between populations are relatively low (Gst = 0.16–0.24, Da = 0.000–0.0006) and not in accordance with Fst. In contrast to the elevated Fst statistic, the average number of nucleotide substitutions per site between populations (Dxy), another measure of genetic differentiation, was unexpectedly low (Dxy = 0.0009–0.0011). The average number of nucleotide differences between populations (Kxy) was shown to be low (Kxy = 1.74–2.88). The Hudson–Kreitman–Aguadé (HKA) test was shown to be very low (HKA = 0.000–0.036). The test denotes that under a neutral theory of molecular evolution, it is assumed that polymorphism within a population indicates divergence between populations.
Table 5 shows that the minimum number of recombination events (Rm) in the three clusters was minimal (Rm = 0–1), with negligible recombination events taking place between adjacent sites (Rs = 0.007–0.009). However, considerable recombination within the gene (Rg = 27–105) was evident.

4. Discussion

The Delta Variant, which was first detected in India in October 2020 [36] and spread to more than 170 countries globally (GISAID), was introduced into our region at the beginning of May 2021 [37]. In 2021, Palestine witnessed two COVID-19 peaks: the first was the Alpha Variant peak during March–April, and the second was the Delta Variant during August–October [10,38,39,40]. In mid-September 2021, the number of active COVID-19 cases increased to 32,533, with a mortality rate of 19 per day (3850 deaths), owing to the spread of the Delta Variant. The entire country was enlisted on the list of international high-risk areas [38]. The situation was in line with reports that SARS-CoV-2 genomes from Israeli, Lebanese, and Egyptian patients were dominated by the Delta Variant during July to December 2021 [41,42]. Our study detected 45 cases of the Delta Variant who were Palestinian travelers arriving mainly from Turkey through the border bridge with Jordan and tested for COVID-19 directly upon arrival. However, the purely spatial distribution of COVID-19 cases in Palestine showed eight significant geographical clusters (Figure 1). Phylogenetic analysis (Figure 2) showed the absence of any clear pattern of geographic and genetic clustering (Figure 4). Conversely, haplotype networking displayed three haplogroups, though still without any geographical implications. By contrast, global-level studies have disclosed the SARS-CoV-2 haplotype association with geographic origin and case fatality rates among COVID-19 patients [43]. Compared to a vast geography like India, both analyses showed a high number of haplotypes and lineages, suggesting that the Delta Variant has been imported into and exported from Palestine multiple times [44]. Nearest neighbor analyses suggest that large numbers of Palestinians cross the Palestinian–Israeli borders as close relatives, laborers, or even as regular attendants of the Israeli industrial zones in Palestine and in Turkey, which acts as a regional transport hub. It was clearly observed that the Palestinian and Israeli peaks in 2021 had a sequential pattern in which the Palestinian peaks (maximum number of cases in the highest peak was around 8k) were always preceded by the Israeli ones, with a short period of a couple of weeks to one month in between (maximum number of cases in the highest peak was over 75k) [45] (Supplementary Figure S1). On the other hand, the Jordanian peak in spring was identical to that in Palestine, and the second peak was in December 2021, one month after the second Palestinian peak, ruling out any effect of the Jordanian Delta peak on Palestine, whereas the opposite could have happened [46]. Based on the timing of peaks and the fact that both Palestinians and Israelis are adjacent communities, the transfer of SARS-CoV-2 from the Israelis to the Palestinians is highly probable, including the Delta Variant, especially when knowing that the first Israeli Delta Variant was at the beginning of May, and the first Palestinian Delta Variant was reported two months later, at the end of June 2021 [37,47]. This lag time was enough to effectively spread the Delta Variant in the Palestinian community. Furthermore, the nature of the Delta Variant, which is 63–167% more transmissible and emerges 1.4–2.6 times faster than the Alpha Variant, could have contributed to the spread of the lethal COVID-19 variant [48]. Yet, the recombination between Alpha and Delta SARS-CoV-2 variants is extremely rare; thus, recombination cannot be used to explain the Delta peak that followed the Alpha one [49]. Additionally, in a relatively small geography like the West Bank, the absence of any geographical clustering with a high number of haplotypes and lineages may further indicate active endogenous circulation of the Delta Variant (Figure 3) due to noncompliance with the official preventive regulations such as the lockdown, mass gatherings like weddings, social distancing, and mask wearing. In addition, the transmission network showed that none of the districts formed a source of COVID-19; rather, they were hubs (SHR ≈ 0.5) with equal weights reflected by equal node sizes, except for Ramallah, shown as a sink due to the limited sample size. Although SHR does not indicate which node is the most important in the spread of SAR-CoV-2, Al-Khalil district has the thickest arrows originating from the node (outdegree), indicating a higher frequency of transitions.
In this study, the total nucleotide diversity (π = 0.0009 ± 0.000, Table 3) across the SARS-CoV-2 spike region was relatively high compared to the global nucleotide diversity (π = 0.00044 ± 0.00001) as well as that of the global regions across the whole genome [50]. However, the neutrality tests, Tajima’s D, Fu-Li’s F, and Fu-Li’s F (Table 3), significantly deviated from neutrality to the negative side, leading to a loss of equilibrium. The negativity of neutrality tests can be explained by a recent viral population expansion event or a recent introduction of a new mutation or a group of mutations that have placed themselves in the population and became fixed (selective sweep), or by natural (negative) selection in which deleterious alleles are removed, selectively leading to reduced genetic diversity. However, our results suggest the recent population expansion as the most plausible explanation to the low neutrality test values and low nucleotide diversity ( π ), along with the excess of rare mutations (negative neutrality tests, Table 3). Both forces, population expansion and excess of rare mutations, resulted in low genetic diversity among the viral populations. Moreover, other minor forces are expected to increase diversity, such as co-circulation and population genetic admixture. These results are in congruence with other studies that showed statistically significantly negative values for neutrality tests [51].
The genetic distance between the three SARS-CoV-2 clusters (Fst > 0.25) and low to intermediate gene flow (Nm) supports the clustering of the genetic diversity, with signs of gene flow between cluster A and B (Nm = 0.57) and between I and C (Nm = 0.36), but with low transfer of genetic diversity (Nm = 0.21) between cluster B and C due to extensive movement of hosts. A study in South America showed statistically significant values, indicating slight genetic differentiation [51]. Unlike the Fst, the other genetic differentiation estimators, Gst and Da, were underestimated due to the high mutation rate [33,52]. Furthermore, the low values of Dxy (0.0009–0.0011) combined with high Fst values can be explained by selection sweep, a mutation that increases its frequency and becomes fixed in a population, which ultimately reduces genetic variation after a period of time. The transmission network (Figure 3) partially supports the potential exchange of genes between clusters due to host movement between districts, especially those coming from Al-Khalil, the most COVID-19 prevalent district, forming the node with the thickest outdegree arrows in the transmission network. More evidently, the value of Rg, the recombination parameter for the entire gene, is high (27–105), suggesting that exchange within the gene may be occurring to varying degrees in the three clusters but more so in cluster C.
The study suggests that the three clusters did not completely evolve into isolated distinct units or clusters, which is most probably due to the recent population expansion, where Fst is expected to increase and Nm to decrease over time. The low estimates of HKA suggest very low departure from neutral theory, indicating an almost constant ratio between within-population polymorphism and between-population divergence, explained by neutral selection of mutations and a low recombination rate [35,53]. In addition to the mutation rate, the recombination rate is a major contributor to genetic diversity in viral genomes. Although a low recombination rate is classically explained by genetic hitchhiking and background selection, the spike gene might not be enough to detect recombination compared to WGS, as previously thought [49,54,55]. Further, the high frequency of consecutive SARS-CoV-2 pandemic waves and the genetically similar viral lineages add up to the difficulty of detecting an accurate recombination rate [49,54]. With the conflicting reports on recombination events in SARS-CoV-2, longer time scales are needed for the recombination to become more pronounced and to take effect on the long-term evolution of the virus [54,56].
As a limitation of the study, sampling did not reflect the incidence scenarios in the Palestinian districts, but rather reflected the preparations and activity in collecting samples as well as the degree of efficiency of the surveillance system in that district. Sampling was underrepresented in districts like Ramallah and absent from districts like Salfit and Qalqilya. This is evident in the transmission network, which identifies Ramallah as a sink of SARS-CoV-2 (Figure 3). Therefore, the potential for sampling bias precludes drawing firm conclusions from such analyses.

5. Conclusions

Our study confirmed the utility of genome sequencing analysis in ascertaining and understanding the dynamics of SARS-CoV-2 variants in Palestine in terms of spatial distribution, genetic variation, and forces shaping the creation of new variants, particularly mutation and recombination rates. This study reaffirmed the use of haplotype networking as a complementary technique to phylogenetic analysis, as the latter assumes no recombination, which is not the case in many RNA viruses such as SARS-CoV-2 [57]. This makes haplotype networking a good surrogate, and it is upheld by genetic diversity statistics. In addition, a temporal dimension should be added to accommodate for the rapid increase in the genetic diversity indices over time. The genetic diversity indices of the viral spike region can be amalgamated with vaccine development, in which vaccine efficacy decreases with high genetic diversity, mortality, and morbidity rates, as well as with the clinical picture and virulence. We also stressed the value of sequence deposition in international data repositories in allowing for the evaluation of local epidemiology patterns in a regional and global context.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/pathogens13060521/s1: Table S1. Contextual data for the 573 samples included in the study covering the demographic and test result characteristics. Figure S1. Timeline of the COVID-19 pandemic showing Israeli (IL), Palestinian (IL), and Jordanian (JO) peaks. Dates on the figure show the approximate peak date, and the red lines start from the peak on the Israeli timeline and move down through the Palestinian and Jordanian ones.

Author Contributions

Project administration and supervision: A.J.P., M.J.P., M.A.-K. and A.N. Funding acquisition: M.J.P. Metadata curation and data submission: N.-F.A. and M.M. Writing the manuscript and methodology: S.E., A.A.-J. (Amer Al-Jawabreh), A.N. and A.A.-J. (Ahmed Al-Jawabreh), assisted by N.-F.A. and L.d.O.M. Analyses were performed and figures were drawn by M.M., L.d.O.M., A.A.-J. (Amer Al-Jawabreh), and N.-F.A. Sample processing and assessment was performed by A.J.T. All authors have read and agreed to the published version of the manuscript.

Funding

The Quadram Institute authors gratefully acknowledge the support of the Biotechnology and Biological Sciences Research Council (BBSRC); their research was funded by the BBSRC Institute Strategic Program Microbes in the Food Chain BB/R012504/1 and its constituent project BBS/E/F/000PR10352. The Quadram Institute Bioscience BBSRC also funded the Core Capability Grant (project number BB/CCG1860/1). For the purposes of open access publishing, the author has applied a CC-BY public copyright license to any author-accepted manuscript version arising from this submission.

Institutional Review Board Statement

The study received ethics approval from the Palestinian Ministry of Health according to the material transfer agreement (MTA) dated 12 January 2022.

Informed Consent Statement

All individuals gave full consent to participate in this study. The personal data of all participants were kept anonymous.

Data Availability Statement

The data in this study are publicly available in the GISAID database (EPI_ISL_10033368-EPI_ISL_10033935).

Conflicts of Interest

The authors declare that there are no conflicts of interest.

Abbreviations

SARS-CoV-2Severe acute respiratory syndrome coronavirus 2
COVID-19Coronavirus disease 2019
WGSWhole-genome sequences
VOCVariants of Concern
SNVSingle-nucleotide variation

References

  1. Chen, N.; Zhou, M.; Dong, X.; Qu, J.; Gong, F.; Han, Y.; Qiu, Y.; Wang, J.; Liu, Y.; Wei, Y.; et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: A descriptive study. Lancet 2020, 395, 507–513. [Google Scholar] [CrossRef]
  2. Dong, E.; Du, H.; Gardner, L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 2020, 20, 533–534. [Google Scholar] [CrossRef] [PubMed]
  3. Alm, E.; Broberg, E.K.; Connor, T.; Hodcroft, E.B.; Komissarov, A.B.; Maurer-Stroh, S.; Melidou, A.; Neher, R.A.; O’Toole, A.; Pereyaslov, D.; et al. Geographical and temporal distribution of SARS-CoV-2 clades in the WHO European Region, January to June 2020. Euro Surveill 2020, 25, 2001410. [Google Scholar] [CrossRef] [PubMed]
  4. Year-Letter Genetic Clade Naming for SARS-CoV-2 on Nextstrain.org. Available online: https://nextstrain.org/blog/2020-06-02-SARSCoV2-clade-naming (accessed on 1 February 2024).
  5. Rambaut, A.; Holmes, E.C.; O’Toole, A.; Hill, V.; McCrone, J.T.; Ruis, C.; du Plessis, L.; Pybus, O.G. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat. Microbiol. 2020, 5, 1403–1407. [Google Scholar] [CrossRef]
  6. Volz, E.; Mishra, S.; Chand, M.; Barrett, J.C.; Johnson, R.; Geidelberg, L.; Hinsley, W.R.; Laydon, D.J.; Dabrera, G.; O’Toole, A.; et al. Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature 2021, 593, 266–269. [Google Scholar] [CrossRef]
  7. Konings, F.; Perkins, M.D.; Kuhn, J.H.; Pallen, M.J.; Alm, E.J.; Archer, B.N.; Barakat, A.; Bedford, T.; Bhiman, J.N.; Caly, L.; et al. SARS-CoV-2 Variants of Interest and Concern naming scheme conducive for global discourse. Nat. Microbiol. 2021, 6, 821–823. [Google Scholar] [CrossRef] [PubMed]
  8. Tracking SARS-CoV-2 Variants. Available online: https://www.who.int/activities/tracking-SARS-CoV-2-variants (accessed on 1 February 2024).
  9. COVID-19 Situation Reports. World Health Organization—Regional Office for the Eastern Mediterranean. Available online: http://www.emro.who.int/opt/information-resources/covid-19-situation-reports.html (accessed on 4 February 2024).
  10. Al-Jawabreh, A.; Ereqat, S.; Dumaidi, K.; Al-Jawabreh, H.; Nasereddin, A. Complete genome sequencing of SARS-CoV-2 strains: A pilot survey in Palestine reveals spike mutation H245N. BMC Res. Notes 2021, 14, 466. [Google Scholar] [CrossRef]
  11. Qutob, N.; Salah, Z.; Richard, D.; Darwish, H.; Sallam, H.; Shtayeh, I.; Najjar, O.; Ruzayqat, M.; Najjar, D.; Balloux, F.; et al. Genomic epidemiology of the first epidemic wave of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in Palestine. Microb. Genom. 2021, 7, 000584. [Google Scholar] [CrossRef]
  12. Nasereddin, A.; Al-Jawabreh, A.; Dumaidi, K.; Al-Jawabreh, A.; Al-Jawabreh, H.; Ereqat, S. Tracking of SARS-CoV-2 Alpha variant (B.1.1.7) in Palestine. Infect. Genet. Evol. 2022, 101, 105279. [Google Scholar] [CrossRef]
  13. COVID-19 ARTIC v4.1 Illumina Library Construction and Sequencing Protocol—Tailed Method. Protocols.io. Available online: https://www.protocols.io/view/covid-19-artic-v4-1-illumina-library-construction-b4myqu7w (accessed on 4 February 2024).
  14. Baker, D.J.; Aydin, A.; Le-Viet, T.; Kay, G.L.; Rudder, S.; de Oliveira Martins, L.; Tedim, A.P.; Kolyva, A.; Diaz, M.; Alikhan, N.F.; et al. CoronaHiT: High-throughput sequencing of SARS-CoV-2 genomes. Genome Med. 2021, 13, 21. [Google Scholar] [CrossRef]
  15. ARTIC Network 2019. ARTIC Nanopore Protocol for nCoV2019 Novel Coronavirus. Available online: https://github.com/artic-network/artic-ncov2019 (accessed on 4 October 2021).
  16. bcl2fastq Conversion Software. Available online: https://support.illumina.com/sequencing/sequencing_software/bcl2fastq-conversion-software.html (accessed on 13 March 2024).
  17. FelixKrueger/TrimGalore: v0.6.7. Available online: https://zenodo.org/record/5127899#.YulGiikzY2w (accessed on 13 March 2024).
  18. Heng, L. Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM. arXiv 2013, arXiv:13033997. [Google Scholar]
  19. Grubaugh, N.D.; Gangavarapu, K.; Quick, J.; Matteson, N.L.; De Jesus, J.G.; Main, B.J.; Tan, A.L.; Paul, L.M.; Brackney, D.E.; Grewal, S.; et al. An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar. Genome Biol. 2019, 20, 8. [Google Scholar] [CrossRef]
  20. Preliminary Genomic Characterisation of an Emergent SARS-CoV-2 Lineage in the UK Defined by a Novel Set of Spike Mutations. Available online: https://virological.org/t/preliminary-genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-the-uk-defined-by-a-novel-set-of-spike-mutations/563 (accessed on 13 March 2024).
  21. Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef] [PubMed]
  22. De Bernardi Schneider, A.; Ford, C.T.; Hostager, R.; Williams, J.; Cioce, M.; Catalyurek, U.V.; Wertheim, J.O.; Janies, D. StrainHub: A phylogenetic tool to construct pathogen transmission networks. Bioinformatics 2020, 36, 945–947. [Google Scholar] [CrossRef] [PubMed]
  23. Bandelt, H.J.; Forster, P.; Rohl, A. Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 1999, 16, 37–48. [Google Scholar] [CrossRef] [PubMed]
  24. Clement, M.; Posada, D.; Crandall, K.A. TCS: A computer program to estimate gene genealogies. Mol. Ecol. 2000, 9, 1657–1659. [Google Scholar] [CrossRef]
  25. Huang, Y.; Yang, C.; Xu, X.F.; Xu, W.; Liu, S.W. Structural and functional properties of SARS-CoV-2 spike protein: Potential antivirus drug development for COVID-19. Acta Pharmacol. Sin. 2020, 41, 1141–1149. [Google Scholar] [CrossRef]
  26. Mardulyn, P. Trees and/or networks to display intraspecific DNA sequence variation? Mol. Ecol. 2012, 21, 3385–3390. [Google Scholar] [CrossRef]
  27. Rozas, J.; Ferrer-Mata, A.; Sanchez-DelBarrio, J.C.; Guirao-Rico, S.; Librado, P.; Ramos-Onsins, S.E.; Sanchez-Gracia, A. DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets. Mol. Biol. Evol. 2017, 34, 3299–3302. [Google Scholar] [CrossRef]
  28. Nei, M. Molecular Evolutionary Genetics; Columbia University Press: New York, NY, USA, 1987. [Google Scholar]
  29. Kulldorff, M.; Heffernan, R.; Hartman, J.; Assuncao, R.; Mostashari, F. A space-time permutation scan statistic for disease outbreak detection. PLoS Med. 2005, 2, e59. [Google Scholar] [CrossRef]
  30. Tajima, F. Evolutionary relationship of DNA sequences in finite populations. Genetics 1983, 105, 437–460. [Google Scholar] [CrossRef] [PubMed]
  31. Fu, Y.X.; Li, W.H. Statistical tests of neutrality of mutations. Genetics 1993, 133, 693–709. [Google Scholar] [CrossRef] [PubMed]
  32. Wright, S. The genetical structure of populations. Ann. Eugen. 1951, 15, 323–354. [Google Scholar] [CrossRef] [PubMed]
  33. Wang, J. On the measurements of genetic differentiation among populations. Genet. Res. 2012, 94, 275–289. [Google Scholar] [CrossRef] [PubMed]
  34. Nei, M. Evolution of human races at the gene level. In Human Genetics, Part A: The Unfolding Genome; Onne-Tamir, B., Cohen, T., Goodman, R.M., Eds.; Alan R. Liss: New York, NY, USA, 1982; pp. 167–181. [Google Scholar]
  35. Hudson, R.R.; Kreitman, M.; Aguade, M. A test of neutral molecular evolution based on nucleotide data. Genetics 1987, 116, 153–159. [Google Scholar] [CrossRef] [PubMed]
  36. Singh, J.; Rahman, S.A.; Ehtesham, N.Z.; Hira, S.; Hasnain, S.E. SARS-CoV-2 variants of concern are emerging in India. Nat. Med. 2021, 27, 1131–1133. [Google Scholar] [CrossRef] [PubMed]
  37. Mizrahi, B.; Sudry, T.; Flaks-Manov, N.; Yehezkelli, Y.; Kalkstein, N.; Akiva, P.; Ekka-Zohar, A.; Ben David, S.S.; Lerner, U.; Bivas-Benita, M.; et al. Long covid outcomes at one year after mild SARS-CoV-2 infection: Nationwide cohort study. BMJ 2023, 380, e072529. [Google Scholar] [CrossRef] [PubMed]
  38. Palestinian Ministry of Heath (PMOH) 2021. COVID-19 in Palestine Ramallah: PMOH 2021; Palestinian Ministry of Heath (PMOH): Ramallah, Palestine, 2021. [Google Scholar]
  39. SARS-CoV-2 Variants in Analyzed Sequences. Our World in Data. Available online: https://ourworldindata.org/grapher/covid-variants-area (accessed on 13 March 2024).
  40. Coronavirus: Palestine. Available online: https://www.worldometers.info/coronavirus/country/state-of-palestine/ (accessed on 13 March 2024).
  41. Alotaibi, B.; El-Masry, T.A.; Seadawy, M.G.; El-Harty, B.E.; Saleh, A.; Gad, A.F.; El-Hosseny, M.F.; Mahran, Y.F.; El-Bouseary, M.M. Identification, propagation and molecular characterization of SARS-CoV-2 delta variant isolated from Egyptian COVID-19 patients. Infect. Genet. Evol. 2022, 100, 105278. [Google Scholar] [CrossRef]
  42. Merhi, G.; Amayri, S.; Bitar, I.; Araj, G.F.; Tokajian, S. Whole Genome-Based Characterization of Multidrug Resistant Enterobacter and Klebsiella aerogenes Isolates from Lebanon. Microbiol. Spectr. 2023, 11, e0291722. [Google Scholar] [CrossRef]
  43. Goyal, M.; De Bruyne, K.; van Belkum, A.; West, B. Different SARS-CoV-2 haplotypes associate with geographic origin and case fatality rates of COVID-19 patients. Infect. Genet. Evol. 2021, 90, 104730. [Google Scholar] [CrossRef]
  44. Laskar, R.; Ali, S. Phylo-geo-network and haplogroup analysis of 611 novel coronavirus (SARS-CoV-2) genomes from India. Life Sci. Alliance 2021, 4, e202000925. [Google Scholar] [CrossRef] [PubMed]
  45. Coronavirus: Israel. Available online: https://www.worldometers.info/coronavirus/country/israel/ (accessed on 13 March 2024).
  46. Coronavirus: Jordan. Available online: https://www.worldometers.info/coronavirus/country/jordan/ (accessed on 13 March 2024).
  47. Palestine News & Info Agency. Palestine Logs First Two Cases of Delta Variant of COVID-19; WAFA-Ministry of Health: Ramallah, Palestine, 2021. [Google Scholar]
  48. Earnest, R.; Uddin, R.; Matluk, N.; Renzette, N.; Turbett, S.E.; Siddle, K.J.; Loreth, C.; Adams, G.; Tomkins-Tinch, C.H.; Petrone, M.E.; et al. Comparative transmissibility of SARS-CoV-2 variants Delta and Alpha in New England, USA. Cell Rep. Med. 2022, 3, 100583. [Google Scholar] [CrossRef] [PubMed]
  49. Focosi, D.; Maggi, F. Neutralising antibody escape of SARS-CoV-2 spike protein: Risk assessment for antibody-based COVID-19 therapeutics and vaccines. Rev. Med. Virol. 2021, 31, e2231. [Google Scholar] [CrossRef] [PubMed]
  50. Flores-Alanis, A.; Cruz-Rangel, A.; Rodriguez-Gomez, F.; Gonzalez, J.; Torres-Guerrero, C.A.; Delgado, G.; Cravioto, A.; Morales-Espinosa, R. Molecular Epidemiology Surveillance of SARS-CoV-2: Mutations and Genetic Diversity One Year after Emerging. Pathogens 2021, 10, 184. [Google Scholar] [CrossRef] [PubMed]
  51. Buitrago, S.P.; Garzon-Ospina, D. Genetic diversity of SARS-CoV-2 in South America: Demographic history and structuration signals. Arch. Virol. 2021, 166, 3357–3371. [Google Scholar] [CrossRef] [PubMed]
  52. Verity, R.; Nichols, R.A. What is genetic differentiation, and how should we measure it--GST, D, neither or both? Mol. Ecol. 2014, 23, 4216–4225. [Google Scholar] [CrossRef] [PubMed]
  53. Wright, S.I.; Charlesworth, B. The HKA test revisited: A maximum-likelihood-ratio test of the standard neutral model. Genetics 2004, 168, 1071–1076. [Google Scholar] [CrossRef] [PubMed]
  54. Turakhia, Y.; Thornlow, B.; Hinrichs, A.; McBroome, J.; Ayala, N.; Ye, C.; Smith, K.; De Maio, N.; Haussler, D.; Lanfear, R.; et al. Pandemic-scale phylogenomics reveals the SARS-CoV-2 recombination landscape. Nature 2022, 609, 994–997. [Google Scholar] [CrossRef] [PubMed]
  55. Bobay, L.M.; O’Donnell, A.C.; Ochman, H. Recombination events are concentrated in the spike protein region of Betacoronaviruses. PLoS Genet. 2020, 16, e1009272. [Google Scholar] [CrossRef]
  56. Li, Q.; Wu, J.; Nie, J.; Zhang, L.; Hao, H.; Liu, S.; Zhao, C.; Zhang, Q.; Liu, H.; Nie, L.; et al. The Impact of Mutations in SARS-CoV-2 Spike on Viral Infectivity and Antigenicity. Cell 2020, 182, 1284–1294. [Google Scholar] [CrossRef]
  57. Gribble, J.; Stevens, L.J.; Agostini, M.L.; Anderson-Daniels, J.; Chappell, J.D.; Lu, X.; Pruijssers, A.J.; Routh, A.L.; Denison, M.R. The coronavirus proofreading exoribonuclease mediates extensive viral recombination. PLoS Pathog. 2021, 17, e1009226. [Google Scholar] [CrossRef] [PubMed]
Figure 1. (a) Choropleth map showing density of 552 COVID-19 patients per district as indicated by intensity of blue color. The Arabic numerals in black represent the number of COVID-19 samples collected from each district. Numbers in brackets represent the population of the given district in the year 2021. Red circles with yellow rims represent the statistically significant purely spatial clusters of COVID-19 in the study area. (b) Spot and cluster map of the distribution of samples by locality represented by red circle, which may contain more than one sample per locality. The eight significant clusters are represented by the red circles with yellow rims.
Figure 1. (a) Choropleth map showing density of 552 COVID-19 patients per district as indicated by intensity of blue color. The Arabic numerals in black represent the number of COVID-19 samples collected from each district. Numbers in brackets represent the population of the given district in the year 2021. Red circles with yellow rims represent the statistically significant purely spatial clusters of COVID-19 in the study area. (b) Spot and cluster map of the distribution of samples by locality represented by red circle, which may contain more than one sample per locality. The eight significant clusters are represented by the red circles with yellow rims.
Pathogens 13 00521 g001
Figure 2. Maximum likelihood phylogenetic tree of 552 SARS-CoV-2 whole-genome sequences obtained from eight geographical regions (districts). Nodes are colored by location of collection, as indicated in the key, and the concentric ring indicates PANGO lineage. Tree was rooted to the midpoint. Full contextual data for samples can be found in Supplementary Table S1.
Figure 2. Maximum likelihood phylogenetic tree of 552 SARS-CoV-2 whole-genome sequences obtained from eight geographical regions (districts). Nodes are colored by location of collection, as indicated in the key, and the concentric ring indicates PANGO lineage. Tree was rooted to the midpoint. Full contextual data for samples can be found in Supplementary Table S1.
Pathogens 13 00521 g002
Figure 3. Potential SARS-CoV-2 transmission network based on reported sample location (district). The transmission network was built at StrainHub by mapping the district metadata (Supplementary Table S1) onto a phylogenetic tree in Figure 2. A “source hub ratio, SHR” centrality metric was used. The arrows represent potential transition of SARS-CoV-2, based on the included data, from one location to another location, and line weight reflects the transmission frequency.
Figure 3. Potential SARS-CoV-2 transmission network based on reported sample location (district). The transmission network was built at StrainHub by mapping the district metadata (Supplementary Table S1) onto a phylogenetic tree in Figure 2. A “source hub ratio, SHR” centrality metric was used. The arrows represent potential transition of SARS-CoV-2, based on the included data, from one location to another location, and line weight reflects the transmission frequency.
Pathogens 13 00521 g003
Figure 4. Median-joining haplotype network of the spike region of 462 SARS-CoV2 isolates. Node colors represent district of isolate collection, as indicated in the color map key. Each node represents a haplotype. The diameter of the node circle is proportional to the number of sequences. The number of hatch marks on the connecting lines between nodes indicates the number of mutations between nodes. Three major haplogroups are shown in bold A, B, and C.
Figure 4. Median-joining haplotype network of the spike region of 462 SARS-CoV2 isolates. Node colors represent district of isolate collection, as indicated in the color map key. Each node represents a haplotype. The diameter of the node circle is proportional to the number of sequences. The number of hatch marks on the connecting lines between nodes indicates the number of mutations between nodes. Three major haplogroups are shown in bold A, B, and C.
Pathogens 13 00521 g004
Table 1. Pangolin COVID-19 lineages of the study population, August 2021.
Table 1. Pangolin COVID-19 lineages of the study population, August 2021.
LineageFrequencyPercent
B.1.617.217731.00
AY.10611119.44
AY.1229015.76
AY.1217412.96
AY.122.1325.60
AY.112.3223.85
AY.25.1162.80
AY.126101.75
AY.7361.05
AY.12450.88
AY.4.630.53
AY.46.630.53
AY.530.53
AY.420.35
AY.4220.35
AY.4320.35
AY.4520.35
AY.7120.35
AY.920.35
AY.11110.18
AY.3810.18
AY.46.410.18
AY.6410.18
AY.9.210.18
B.1.1.710.18
None10.18
TOTAL571100.00
Table 2. Statistically significant purely spatial COVID-19 clusters.
Table 2. Statistically significant purely spatial COVID-19 clusters.
LocalityDistrict#Cases/PopulationRRp-Value
Beit-KahilAl-Khalil24/48638.2<0.001
Tulkarem cityTulkarem57/34,2152.9<0.001
DuraAl-Khalil33/21,5432.6<0.001
Jericho cityJericho21/22,2933.1<0.001
Jenin cityJenin36/26,7502.2<0.001
JalbounJenin6/15086.40.018
Nour-shams campTulkarem9/34054.20.016
Bethlehem cityBethlehem22/30,8802.30.022
Ash-Shuhada and Kafr QoudJenin5/2475, 1/16724.70.087
At-TaybehJenin4/23845.40.254
ZwataNablus4/27154.80.388
A’nza and QabatiaJenin2/2044, 13/26,3961.70.899
JalqamousJenin3/28243.40.959
Beit wazanNablus2/14044.60.974
Al-KhalilAl-Khalil82/221,1361.20.990
Beit QadJenin2/16553.90.997
RR, relative risk.
Table 3. Genetic diversity indices of the 462 spike sequences of SARS-CoV-2 from Palestine.
Table 3. Genetic diversity indices of the 462 spike sequences of SARS-CoV-2 from Palestine.
Haplotype–Nucleotide DiversityNeutrality Tests
ClusterNhEtaHd ± SDπ ± SDKSTajima’s DFu-Li’s FFu and Li’s D
Cluster A24144510.81 ± 0.080.0006 ± 0.0001.5750−2.38 **−4.55 **−4.91 **
Cluster B15830400.81 ± 0.0000.0008 ± 0.0002.0140−2.14 *−5.25 **−5.98 **
Cluster C6313120.44 ± 0.0000.0002 ± 0.0000.6212−2.15 *−2.83 *−2.52 *
Total46267760.87 ± 0.010.0009 ± 0.0001.7574−2.43 ***−6.52 **−8.14 **
N, number of sequences; h, number of haplotypes; Hd, haplotype (gene) diversity; π, nucleotide diversity (per site); K, average number of nucleotide differences between two randomly chosen sequences from within the population; S, number of variable/segregating sites; Eta (η), total number of mutations; ***: p < 0.001, **: p < 0.01, *: p < 0.05 [28,30,31].
Table 4. Genetic differentiation indices between the three clusters of the 462 SARS-CoV-2 spike sequences from Palestine.
Table 4. Genetic differentiation indices between the three clusters of the 462 SARS-CoV-2 spike sequences from Palestine.
Pop 1Pop 2FstNmKxyDxyγstDaHKA
Cluster ACluster B0.470.572.230.00110.160.00050.006
Cluster ACluster C0.580.361.740.00090.210.00050.036
Cluster BCluster C0.700.212.880.00140.240.0010.000
Fst, Wright’s F-statistics, pairwise genetic distance, Nm, gene flow, and population migration were calculated among populations. Nm = (1 − Fst)/2Fst haploid, Nm = (1 − Fst)/4Fst diploid [32]; Kxy, average number of nucleotide differences between populations; Dxy, the average number of nucleotide substitutions per site between populations; Da, the number of net nucleotide substitutions per site between populations [33]; γst (Gst), coefficient of genetic differentiation [34]; HKA, a test based on neutral theory that a genome exhibits intra-population polymorphism and inter-population divergence [35].
Table 5. Recombination events in the three clusters of the 462 SARS-CoV-2 spike sequences from Palestine.
Table 5. Recombination events in the three clusters of the 462 SARS-CoV-2 spike sequences from Palestine.
PopRmRgRs
Cluster A0.0>105--
Cluster B1.029.90.009
Cluster C0.027.00.007
R, variance of the average number of nucleotide differences between pairs of sequences. Recombination parameter R = 4 Nr, N: population size and r: recombination rate per sequence (per gene); Rs, recombination parameter between adjacent sites; Rg, recombination parameter for entire gene; Rm, minimum number of recombination events in the history of sample [35].
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ereqat, S.; Alikhan, N.-F.; Al-Jawabreh, A.; Matthews, M.; Al-Jawabreh, A.; de Oliveira Martins, L.; Trotter, A.J.; Al-Kaila, M.; Page, A.J.; Pallen, M.J.; et al. Epidemiological Characterization and Genetic Variation of the SARS-CoV-2 Delta Variant in Palestine. Pathogens 2024, 13, 521. https://doi.org/10.3390/pathogens13060521

AMA Style

Ereqat S, Alikhan N-F, Al-Jawabreh A, Matthews M, Al-Jawabreh A, de Oliveira Martins L, Trotter AJ, Al-Kaila M, Page AJ, Pallen MJ, et al. Epidemiological Characterization and Genetic Variation of the SARS-CoV-2 Delta Variant in Palestine. Pathogens. 2024; 13(6):521. https://doi.org/10.3390/pathogens13060521

Chicago/Turabian Style

Ereqat, Suheir, Nabil-Fareed Alikhan, Amer Al-Jawabreh, Michaela Matthews, Ahmed Al-Jawabreh, Leonardo de Oliveira Martins, Alexander J. Trotter, Mai Al-Kaila, Andrew J. Page, Mark J. Pallen, and et al. 2024. "Epidemiological Characterization and Genetic Variation of the SARS-CoV-2 Delta Variant in Palestine" Pathogens 13, no. 6: 521. https://doi.org/10.3390/pathogens13060521

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop