Next Article in Journal
The Role of Vitamins in the Pathogenesis of Asthma
Previous Article in Journal
Mutations in the Vicinity of the IRAK3 Guanylate Cyclase Center Impact Its Subcellular Localization and Ability to Modulate Inflammatory Signaling in Immortalized Cell Lines
Previous Article in Special Issue
In Silico Design of New Dual Inhibitors of SARS-CoV-2 MPRO through Ligand- and Structure-Based Methods
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Enhancing SARS-CoV-2 Surveillance through Regular Genomic Sequencing in Spain: The RELECOV Network

by
Sonia Vázquez-Morón
1,2,*,
María Iglesias-Caballero
1,
José Antonio Lepe
3,
Federico Garcia
4,5,
Santiago Melón
6,
José M. Marimon
7,
Darío García de Viedma
8,
Maria Dolores Folgueira
9,10,
Juan Carlos Galán
2,11,
Carla López-Causapé
12,
Rafael Benito-Ruesca
13,
Julia Alcoba-Florez
14,
Fernando Gonzalez Candelas
2,15,
María de Toro
16,
Miguel Fajardo
17,
Carmen Ezpeleta
18,
Fernando Lázaro
19,
Sonia Pérez Castro
20,
Isabel Cuesta
21,
Angel Zaballos
22,
Francisco Pozo
1,2,
Inmaculada Casas
1,2,* and
on behalf of RELECOV Network Members
add Show full author list remove Hide full author list
1
Respiratory Viruses and Influenza Unit, National Centre for Microbiology, Instituto de Salud Carlos III, 28222 Majadahonda, Spain
2
CIBER de Epidemiología y Salud Pública (CIBERESP), ISCIII, 28029 Madrid, Spain
3
Microbiology Service, Hospital Universitario Virgen del Rocio, 41013 Sevilla, Spain
4
Microbiology Service, Hospital Universitario San Cecilio, Instituto de Investigación Biosanitaria Ibs. Granada, 18016 Granada, Spain
5
CIBER de Enfermedades Infecciosas (CIBERINFEC), ISCIII, 28029 Madrid, Spain
6
Microbiology Service, Hospital Universitario Central de Asturias, 33011 Oviedo, Spain
7
Microbiology Service, Instituto de Investigación Sanitaria Biodonostia, Hospital Universitario Donostia, 20014 Donostia-San Sebastian, Spain
8
Microbiology Service, Hospital General Universitario Gregorio Marañón, Instituto de Investigación Sanitaria Gregorio Marañón, 28007 Madrid, Spain
9
Microbiology Department, Hospital Universitario 12 de Octubre, Biomedical Research Institute imas12, 28041 Madrid, Spain
10
Department of Medicine, School of Medicine, Universidad Complutense, 28040 Madrid, Spain
11
Microbiology Service, Hospital Universitario Ramón y Cajal, Instituto Ramón y Cajal de Investigación Sanitaria (IRYCIS), 28034 Madrid, Spain
12
Microbiology Service, Hospital Universitario Son Espases, 07120 Palma de Mallorca, Spain
13
Microbiology Service, Hospital Clínico Universitario Lozano Blesa, Departamento de Microbiología, Facultad de Medicina, Instituto de Investigación Sanitaria de Aragón, Universidad de Zaragoza, 50009 Zaragoza, Spain
14
Microbiology Service, Hospital Universitario Ntra. Sra de Candelaria, 38010 Santa Cruz de Tenerife, Spain
15
Joint Research Unit Infection and Public Health FISABIO-University of Valencia, Institute for Integrative Systems Biology (I2SysBio), 46020 Valencia, Spain
16
Plataforma de Genómica y Bioinformática, Centro de Investigación Biomédica de La Rioja (CIBIR), 26006 Logroño, Spain
17
Microbiology Service, Hospital Universitario de Badajoz, 06080 Badajoz, Spain
18
Complejo Hospitalario de Navarra and Navarra De Servicios Y Tecnologías S A (NASERTIC), 31008 Pamplona, Spain
19
Microbiology Service, Hospital Universitario La Paz, 28046 Madrid, Spain
20
Microbiology Service, Complexo Hospitalario Universitario de Vigo, 36204 Vigo, Spain
21
Bioinformatics Unit, Unidades Centrales Científico Técnicas, Instituto de Salud Carlos III, 28222 Majadahonda, Spain
22
Genomics Unit, Unidades Centrales Científico Técnicas, Instituto de Salud Carlos III, 28222 Majadahonda, Spain
*
Authors to whom correspondence should be addressed.
RELECOV Network Members is provided in Supplementary File S2.
Int. J. Mol. Sci. 2023, 24(10), 8573; https://doi.org/10.3390/ijms24108573
Submission received: 8 March 2023 / Revised: 25 April 2023 / Accepted: 3 May 2023 / Published: 10 May 2023
(This article belongs to the Special Issue Genomic Variation of SARS-CoV-2)

Abstract

:
Millions of SARS-CoV-2 whole genome sequences have been generated to date. However, good quality data and adequate surveillance systems are required to contribute to meaningful surveillance in public health. In this context, the network of Spanish laboratories for coronavirus (RELECOV) was created with the main goal of promoting actions to speed up the detection, analyses, and evaluation of SARS-CoV-2 at a national level, partially structured and financed by an ECDC-HERA-Incubator action (ECDC/GRANT/2021/024). A SARS-CoV-2 sequencing quality control assessment (QCA) was developed to evaluate the network’s technical capacity. QCA full panel results showed a lower hit rate for lineage assignment compared to that obtained for variants. Genomic data comprising 48,578 viral genomes were studied and evaluated to monitor SARS-CoV-2. The developed network actions showed a 36% increase in sharing viral sequences. In addition, analysis of lineage/sublineage-defining mutations to track the virus showed characteristic mutation profiles for the Delta and Omicron variants. Further, phylogenetic analyses strongly correlated with different variant clusters, obtaining a robust reference tree. The RELECOV network has made it possible to improve and enhance the genomic surveillance of SARS-CoV-2 in Spain. It has provided and evaluated genomic tools for viral genome monitoring and characterization that make it possible to increase knowledge efficiently and quickly, promoting the genomic surveillance of SARS-CoV-2 in Spain.

1. Introduction

On 30 January 2020, the World Health Organization (WHO) declared the 2019 coronavirus disease (COVID-19) outbreak a public health emergency of international concern (PHEIC) [1]. One week later, the WHO recommended a surveillance approach based on, or similar to, the Global Influenza Surveillance and Response System (GISRS) that facilitates less resource-intensive monitoring [2]. Thus, from the beginning of the pandemic caused by SARS-CoV-2 [3,4,5], the frantic race to sequence this virus began, promoted by rapid technological development. This has led to the description of millions of genome sequences to date from the first SARS-CoV-2 complete genome published from a patient with severe respiratory syndrome on 26 December 2019 [4].
This large amount of information on the SARS-CoV-2 genome has aroused different nomenclature systems, genome repositories, and web servers allowing the exchange of molecular information as well as its analysis. Currently, three nomenclature systems, with their own scientific approaches, are used to classify and track SARS-CoV-2. The Global Initiative on Sharing All Influenza Data (GISAID) offers EpiCoVTM that includes a data repository, nomenclature system, and many tools for viral analysis [6]. Additionally, the NEXSTRAIN web server allows SARS-CoV-2 genome analyses and visualizations of the data [7]. In addition, the third nomenclature system is known as Pangolin which was developed for the implementation of a dynamic nomenclature for SARS-CoV-2. This has been described by Rambaut et al. and is known as “Pango lineage” [8,9]. This tool is used to assign lineages based on complete virus genome sequences, date of detection, and geographical location, allowing us to delve into the viral genome, and its evolution and transmission. However, this system has generated more than a thousand lineages, which constitute a great challenge for their follow-up in public health. In this sense, in May 2021, the WHO announced the assignment of simple names, easy to say and remember for the variants designated as of concern (VOCs) or interest (VOIs), using the Greek alphabet and arguing to avoid stigmatization generated by the relationship between the variant and the country where it was discovered and developing a global risk-monitoring framework [10].
SARS-CoV-2 and other coronaviruses evolve rapidly through point mutations as well as recombination, specifically in the spike region which is both a recombination and a mutation hotspot in coronaviruses [11,12]. SARS-CoV-2 evolution studies estimated a substitution rate of 0.5 × 10−3–1.1 × 10−3 substitutions/site/year, corresponding to a rate of 1.3 to 2.8 substitutions/month for the whole genome, and the increases in substitution rates have been related to the emergence of new variants [13]. However, the spike region, in particular, accumulates mutations much faster than other regions. The emergence of the Omicron variant was a great example, because it accumulated an unusually high number of mutations in the spike region that was not observed previously with any other VOC [12], and led to evidence of a substantial reduction of the antibody neutralizing activity against Omicron [14].
These viral characteristics showed the need to increase molecular knowledge related to the transmission and spread of the virus that could raise concern, which is essential for public health. In this sense, not only is the massive generation of data important, but these must be of good quality and linked to the establishment of an adequate surveillance system in order to contribute to meaningful actions in public health. In this regard, the WHO and European Centre for Disease Prevention and Control (ECDC) guidance on implementing genomic SARS-CoV-2 surveillance were released in January and May 2021, respectively [15,16]. On the other hand, on 17 February 2021, the European Commission proposed immediate action to prepare Europe for the growing threat of coronavirus variants under a new plan promoted by the program “Health Emergency Preparedness and Response Authority (HERA) Incubator” [17]. In Spain, the creation of a network of Spanish laboratories (RELECOV) for genomic surveillance of SARS-CoV-2 was a priority starting from January 2021 and it was reflected in the document “Strategy for the integration of sequencing in the surveillance of SARS-CoV-2” [18]. The RELECOV network was then officially structured under the ECDC/HERA/2021/024 project “Enhancing whole genome sequencing (WGS) and/or reverse transcription polymerase chain reaction (RT-PCR), national infrastructures, and capacities to respond to the COVID-19 pandemic in Spain”. The network, composed initially of 43 member institutions, including some regional nodes with regional networks or well-defined consortia. The RELECOV’s main objectives include promoting actions to speed up SARS-CoV-2 genomic virus detection, analyses, and evaluation in Spain. RELECOV’s efforts to date have been focused on improving and enhancing the SARS-CoV-2 genomic surveillance directed at the circulating virus in Spain, through establishing national quality controls for sequencing performance and methodology implementation, implementing genomic analyses and sharing of the obtained sequences, tracking mutations in the genome and, finally, performing phylogenetic analyses to characterize the sequenced viral genomes to provide a global context to the viruses circulating in Spain. These actions are essential given the potential scenarios for the rise to new pathogens or variants, highlighting the need to continue research on coronaviruses and especially genomic surveillance [19].

2. Results

2.1. Surveillance of Virus from Sequences Deposited in GISAID by the RELECOV Network

The first six months from the structuring of the RELECOV network were evaluated by monitoring viral circulation and the progress in sequence sharing in Spain. For this purpose, three periods were established to observe network’s activity over time: two quarterly periods comprised of 05.09.2021 to 05.12.2021 and 06.12.2021 to 08.03.2022, and their corresponding semi-annual period (05.09.2021 to 08.03.2022).
RELECOV’s complete sequences deposited in GISAID in the period between 05.09.2021 and 05.12.2021 showed a spectrum of up to two hundred different lineages. The classification of the corresponding variants showed 1185 Alpha, 87 Beta, 18,272 Delta, 96 Gamma, 171 Mu, 2 Iota, 8 Omicron, 40 Lambda, 27 Eta, 1 Epsilon, 1 VUM (Variant Under Monitoring), 483 NA (Not applicable, lineages circulating before variant assignment started) 61 with an unassigned lineage, and 3 classified as recombinant (Table 1). The results showed that the Variant of Concern (VOC) with the highest circulation in Spain during this period was Delta, followed to a lesser extent by Alpha, which is consistent with the worldwide patterns at that time.
Between 06.12.2021and 08.03.2022, the virus genomes deposited in GISAID belonged to 172 different lineages, corresponding to the following variant classification: 72 Alpha, 2 Beta, 12,733 Delta, 4 Gamma, 15 Mu, 15,251 Omicron, 1 recombinant (XG), 49 NA, and 16 with an unassigned lineage (Table 2). For this period, the obtained results allowed us to monitor the co-circulation of the Omicron and Delta variants in similar proportions, as well as the quick spreading of the Omicron variant during this period.
The aggregated data for a semi-annual period (05.09.2021 to 08.03.2022) showed that viral sequences deposited in GISAID corresponded to 263 different lineages. In terms of variant classification, we found 1257 sequences belonging to Alpha, 89 Beta, 31,005 Delta, 27 Gamma, 1 Epsilon, 27 Eta, 40 Lambda, 186 Mu, 15,259 Omicron, 4 recombinant, 532 NA, 1 VUM, and 77 sequences with an unassigned lineage. The analysis also allowed us to observe sequences deposited during the study period but corresponding to prior dates, which demonstrates the network boost in data exchange. During 2020, an elevated percentage of virus circulation without variant assignment was observed since this nomenclature system was announced by the WHO in May 2021 [20]. Moreover, there was a high percentage of the Delta variant circulating in 2021 and a high percentage for the Omicron variant in the two first months of 2022 in Spain. Other variants such as Beta, Mu, Lambda, and Gamma circulated in Spain in smaller proportions during 2021 (Figure 1).
The monthly based analysis for the semi-annual period data (05.09.2021 to 08.03.2022) showed co-circulation of the main VOCs and their replacement in Spain. These data highlight the co-circulation of Alpha and Delta variants during June and July 2021, with Delta becoming the dominant variant in August 2021. The co-circulation of Delta and Omicron variants in December 2021 was also detected, concluding with the replacement of Delta by Omicron in February 2022 (Figure 2) reflecting similar dynamics for Alpha, Delta, and Omicron time course distributions in all submitted sequences to GISAID until 15.03.2022 (Supplementary File S3 and Supplementary Figure S1).
Moreover, comparing the data by dividing the semi-annual period by quarter (05.09.2021–05.12.2021 and 06.12.2021–08.03.2022), an increase of 36% in the number of deposited viral sequences from Spain in GISAID was observed, which demonstrates data sharing improvement over time.

2.2. Tracking Changes in Delta and Omicron Variants

To track changes in Delta and Omicron variants, we checked all lineage-defining mutations, considered as non-synonymous substitutions or deletions that occur in >75% of sequences within a lineage or its sublineages according to their definition on the web portal outbreak.info [21].

2.2.1. Delta Variant

The analyses of lineage-defining mutations corresponding to 248 Delta lineages and sublineages available until July 2022 were checked. The analyses showed a total of 441 mutations described for this set corresponding to the Delta variant, of which 14 were present in more than 98% of lineages and sublineages. These mutations were: ORF1b (P314L, G662S, and P1000L); S (T19R, T452R, T478K, D614G, and P681R); ORF3a (S26L); M (I82T); N (D63G, R203M, and D377Y); and ORF8 (S84L). The mutations present in all lineages and sublineages were: T19R, T478K, and D614G in S, I82T in M, and S26L in ORF3a.

2.2.2. Omicron Variant

The analysis for the Omicron variant included a total of 205 lineages and sublineages available until October 2022. A total of 343 mutations were present in this set, with the most frequent being: ORF1a (P3395H and T3255I); ORF1b (P314L and I1566V); S (G142D, G339D, S373P, S375F, S447N, T478K, E484A, Y505H, D614G, H655Y, N679K, P681H, N764K, D796Y, Q954H, and N969K), E (T19I); M (Q19E and A63T); ORF8 (S84L); and N (P13L, R203K, and G204R). However, only the mutations in ORF1a (T3255I and P3395H), S (D614G, H655Y, N679K, D796Y, Q954H, and N969K), and M (A63T) were present in all lineages and sublineages belonging to the Omicron variant. These results showed that 27 mutations were present in more than 95% of lineages/sublineages belonging to the Omicron variant, but only 9 were present in all of them, and most mutations were located in the S gene.
In this analysis, we also observed that the Delta lineages/sublineages had three times more defined lineage mutations (441) than Omicron (161) until July 2022. We suggest that this difference might be associated with the circulation period of each variant until the moment of analysis (Delta: October 2020 to July 2022 and Omicron: November 2021 to July 2002) and could be related to the number of infections, giving rise to a greater evolution of Delta reflected in the changes established as determinants of lineages. A posterior analysis on October 2022 revealed an increase to 342 lineage-defining mutations of Omicron lineages/sublineages, confirming its evolution over time.

2.3. Quality Control Assessment for SARS-CoV-2 Sequencing (QCA)

The QCA of SARS-CoV-2 sequencing was performed between 22.12.2021 and 28.01.2022. In total, 37 out of the 38 RELECOV network participants (97.4%) reported their results for the quality control assessment. Obtaining and sequencing of inactivated and lyophilized SARS-CoV-2 samples was carried out and assessed by the Reference Laboratory, Respiratory Virus and Influenza Unit of the National Centre for Microbiology (CNM), Carlos III Health Institute before dispatch, in order to obtain the correct results for the assignment of lineage and variant for each sample. The sample details for the QCA are shown in Table 3.
In regard to the library preparation kits, the most used by the participants were the DNA prep Tagmentation (35.14%), Nextera XT, and SeqCovid kits (10.81% each for the latter two) on the Illumina platform which was used by 70.27% of the participants. The Ion ampliseq kit for Chef DL8 (8.10%) on the Ion Torrent platform was used by 27.03% of the participants.
The overall results obtained by the RELECOV members, who were individually designated by a unique and random code anonymized, are shown in Figure 3.
The complete hit rates (percentage of assignment) for variant and lineage were 37.8% and 13.51%, respectively, and the correct assignment of both variant and lineage was low 13.51% (5/37). We identified a potential problem with one of the QCA samples, due to a failure in the RNA lyophilization procedure therefore, batches of tubes corresponding to this specific sample were not homogeneous. For analysis of results, hit rates were recalculated without considering that specific sample. After removing this sample, the global results generated a complete correct assignment rate of 94.59% for variant and 64.86% for lineage. In addition, performing a disaggregated analysis by QCA sample, the variant and lineage assignments were above 95% and 85%, respectively, regardless of the described problem with one sample. These results highlight the inherent complexity of comparing lineage assignments from a number of laboratories using different sequencing technologies and bioinformatic pipelines to generate the consensus SARS-CoV-2 sequences.

2.4. Reference SARS-CoV-2 Phylogenetic Analysis Implementation

The phylogenetic analyses were performed in order to establish a reference matrix and tree containing worldwide sequences including different lineages and variants as a tool for surveillance purposes.
A phylogenetic analysis was carried out using FastTree [22] and IQ-TREE 2 [23] from the FASTA files corresponding to 713 worldwide (76 countries and territories) viral sequences retrieved from GISAID using the NCBI Sequence NC_045512.2 as the reference genome for SARS-CoV-2. This analysis revealed the same clusters described by the WHO for variant assignments comprising different lineages and sublineages (Figure 4). An additional analysis using the sequence of the S gene (Figure 5) showed that variant clusters were consistent in both trees. As expected, viral evolution was better resolved using the complete viral genome for this analysis. However, the information provided by the S gene must be taken into consideration in view of the results obtained for variant classifications.

3. Discussion

The main goal in public health is the prevention of disease and promotion of health. For this purpose, different methods have been used to track the progression of diseases, locate outbreaks, and establish containment strategies. In recent years, genomic methods have given a great boost to the surveillance of infectious diseases, allowing us to track their spread in the population, their aetiology, as well as their resistance to drugs and vaccines, and becoming a priority for the surveillance of infectious diseases even before the SARS-CoV-2 pandemic [24].
Historically, genomic surveillance has only been routinely performed in a few countries due to the complexity and cost of the technology. However, these barriers have been changing during the COVID-19 pandemic, and genomic information on pathogens has been seen as crucial for public health decision making [25]. Consequently, in 2022, the WHO launched a strategy to strengthen and expand genomic surveillance worldwide [18]. At present, this information is highly relevant, since it provides knowledge that allows the identification of the pathogen, its circulation, and makes it possible to plan a response to the disease. Therefore, the establishment of genomic surveillance networks play a key role as tool that helps drive rapid public health decisions.
However, as far as we know, there is little data about the establishment of genomic surveillance networks in the SARS-CoV-2 pandemic context that have been published [26,27], the first being The Coronavirus Disease 2019 (COVID-19) Genomics UK Consortium (COG-UK) launched in May 2020, which would lay the groundwork for genomics to serve as core outbreak tracking tool in the future [28]. In Spain, during the initial pandemic situation, the SeqCOVID consortium contributed to the monitoring and study of the evolution of the epidemic [29]. The RELECOV network of laboratories was created to consolidate a national network to enhance the SARS-CoV-2 genomic surveillance, becoming one of the countries that had performed a high level of routine genomic surveillance and high sequencing availability according to the WHO’s Global genomic surveillance strategic objectives for reinforcing the capacities that include sample collection, diagnostics, data sharing, and analysis [30]. This is consistent with the observed increase in shared viral sequences in GISAID [31]. The continuous monitoring and data analysis through the RELECOV network offered a comprehensive knowledge about the virus dynamics and its circulating variants at a national level, increasing this knowledge in Europe and in the European/trans-national dimension.
The benefit of the SARS-CoV-2 genomic surveillance includes the tracking of changes in the viral genome and predicting their implications. Recently published studies have shown the impact of mutations in viral infectivity and immunogenicity [32,33,34,35,36]. The Omicron variant has been the variant with the largest number of mutations described in the S gene. A recently published study showed ten exclusive mutations found for BA.1 and BA.2 and only one mutation in the S gene was shared with the Delta variant [37]. However, we observed that nine mutations in Omicron lineages and sublineages were present in all of them. Furthermore, only five lineage-defining mutations across the viral genome are shared by Omicron and Delta variants in more than 95% of lineages and sublineages of both variants. Additionally, by focusing on the S gene, the Omicron variant has six mutations present in all lineages/sublineages, suggesting that these positions could have an important role in the establishment of this variant worldwide, , reinforcing the need for viral genomic surveillance due to the central role of the spike protein as a target for vaccine design.
Viral genomic surveillance is an essential tool, but the pillars of a molecular surveillance approach require good quality data (sequence and metadata). The quality control assessment is a perfect procedure to determine the performance of individual laboratories, identify procedural issues, establish the effectiveness and intercomparability of the methods used, and identify interlaboratory differences that can provide additional confidence to laboratories for the characterization of SARS-CoV-2. Our results for the QCA showed the inherent complexity to obtain comparable results regarding the lineage versus variant assignment, which would make monitoring difficult in the context of public health. Until July 2022, a totally of 2175 different lineages has been assigned to SARS-CoV-2 using the Pangolin tool [9]. Of these, 212 have been reassigned and some of them have even been eliminated [38]. This fact, in addition to the difficulties in obtaining the whole viral sequence derived from sample and technical issues, increases the SARS-CoV-2 monitoring complexity in the public health context. Even if we consider the success rate for obtaining a high-coverage complete sequence according to GISAID (sequences with less than 1% of undefined bases (NNNs) and insertions and/or deletions verified by the submitter), regarding the worldwide data available from GISAID (queried on 2022.10.16), only 42.4% (5,743,076/13,551,277) of the deposited sequences can be considered whole-genome sequences with high coverage. Furthermore, this does not guarantee the quality in the genomic regions of great interest such as the S gene for sequences with high coverage. In this situation, the establishment of clear criteria based on viral biological characteristics and their impact on human health leading to a consensus nomenclature is required for surveillance at the public health level.
In view of these difficulties, the use of tools for the phylogenetic analysis quickly provides knowledge about the viral molecular evolution reflected in clusters involving molecular changes that could be associated with biological features relevant for viral surveillance in the public health system. In this sense, the nomenclature proposed by the WHO takes on more relevance [20]. The phylogenetic analyses carried out from the complete viral genome and the S gene generated similar results highlighting the relevance of the S gene sequence variation in the public health context. This could be explained because the SARS-CoV-2 S gene presents the highest non-synonymous mutation rates in comparison to the rest of the viral genome [12], as well as their role in the SARS-CoV-2 environmental adaptation [39]. On the other hand, viral surface proteins are highly exposed to pressure by the immune system, leading to more frequent antigenic drift [40]. Since this region is also the target for vaccine design, this makes it a relevant issue for public health surveillance. Thus, although the availability of more information on the viral genome is important, it is more important to obtain this information with higher quality and focus on what may have a direct impact on health. In this sense, the selection of specific viral genome regions to perform phylogenetic analysis have been carried out for a long time in regard to public health surveillance, as is the case with the Influenza virus, where hemagglutinin is the most relevant region for its surveillance [41].
In summary, the establishment of molecular nationwide surveillance networks are essential for addressing science-based public health decisions and for achieving control of infectious diseases, and this should be a priority worldwide. The availability of methodologies for the sequencing of the whole viral genome has been a great advance, but we must not forget that other technologies that require fewer resources can also be used to obtain molecular information useful for surveillance and evaluation of vaccine effectiveness. Future actions should be aimed to ensuring high-quality data, focusing on relevant data for public heath applications and quality–cost evaluations that could favour the inclusion of surveillance in low- and middle-income countries, taking into account the global disparities on sequencing capacity [42] in order to increase the knowledge that is relevant to public health.

4. Materials and Methods

4.1. SARS-CoV-2 Viral Sequences Deposited in GISAID by the RELECOV Network Members

To assess the kick-off activity of the RELECOV network, we studied the level of viral sequence data sharing from Spain samples through GISAID. To disentangle the spectrum of variants circulating over time during the first 6 months of network activity, we established three periods for evaluation: a first quarter between 05.09.2021 and 05.12.2021; a second quarter between 06.12.2021and 08.03.2022; and a semi-annual period comprising both quarters. For this purpose, the sequences available in GISAID from Spain and deposited by RELECOV network members during this period were retrieved. A total of 20,675 viral sequences were deposited in GISAID during the first quarter from Spain, of which 20,435 sequences were deposited by 29 RELECOV network members. Second quarter data comprised 28,151 viral sequences from Spain, of which 28,143 corresponded to sequences deposited by 29 RELECOV network members. Finally, the semi-annual data analyses covered both quarters with a total of 48,578 viral genomes that were deposited in GISAID by 30 RELECOV network members.

4.2. Tracking the Changes in the SARS-CoV-2 Delta and Omicron Variants

Delta and Omicron variants were selected considering their prevalence from 03.09.2021 to 04.11.2022. A database containing information on lineage-defining mutations was generated for all lineages and sublineages of Delta and Omicron variants using information available in the outbreak.info platform [21]. We updated and reviewed the database every two weeks due to the changes that occur in the lineage assignments and lineage-defining mutations. A total of 441 lineages and sublineages were included in the study: 248 corresponding to Delta and 205 to the Omicron variant (Supplementary File S3 and Supplementary Table S1).

4.3. Quality Control Assessment (QCA) of SARS-CoV-2 Viral Genome Sequencing

A QCA of SARS-CoV-2 sequencing and characterization was organized and prepared at the National Center of Microbiology. This QCA was performed between 22.12.2021 and 28.01.2022 in order to evaluate the sequencing capacity of the RELECOV network laboratories and the skills in the bioinformatic assignment of variants and lineages of SARS-CoV-2.
The QCA included ten lyophilized RNA samples of different SARS-CoV-2 viruses isolated from VERO E6 cell cultures. The panel was made up of eight SARS-CoV-2 samples belonging to VOCs, one variant of interest (VOI) and another sample corresponding to the A.28 lineage not classified by the WHO as a VOC, VOI, or Variant Under Monitoring (VUM). The samples included in the QCA were selected based on their relevance in terms of virus circulation in Spain. The preparation of samples, the homogeneity prior to distribution, and the stability and sequencing tests of each virus were carried out at the National Centre of Microbiology and at the central scientific–technical units, the Genomics unit, and the Bioinformatics unit from the Instituto de Salud Carlos III. The characteristics of the panel and the evaluation of homogeneity and stability are shown in Table 4 and Table 5, respectively.
A total of 15 library preparation kits were used by the participants in this panel: ABL DeepChek® Assay WG SC2 V1 (Advanced Biological Laboratories, Luxembourg), Genexus™ Library Strips 1 and 2-AS; Ion AmpliSeq Chef DL8 Kit, Ion Ampliseq Library Kit and Ion AmpliSeq™ SARS-CoV-2 Insight Research Assay-GS Manual (Thermo Fisher Scientific Inc, Waltham, MA, USA); Illumina COVIDseq Kit, Illumina DNA Prep Tagmentation, Nextera DNA Flex, Nextera XT (Illumina Inc. San Diego, CA, USA) NEBNext® ARTIC SARS-CoV-2 FS Library Prep Kit, NEBNext® Fast DNA Library Prep Set for Ion Torrent™, NEBNext® Ultra™ II FS DNA Library Prep Kit for Illumina (New England Biolabs, UK); Oxford Nanopore Sequencing Kit (Oxford Nanopore Technologies, Oxford, UK); and ViroKey SQ FLEX Library Prep Reagents (Vela Diagnostics, The Kendall, Singapur). Ten sequencing systems were used including: Illumina iSeq 100, Illumina MiSeq, Illumina NextSeq, Illumina NovaSeq 6000 (Illumina Inc. San Diego, CA, USA); Ion Torrent, Ion Torrent PGM, Ion Torrent S5, Ion Torrent S5 XL, Ion Torrent Genexus™ Integrated Sequencer (Ion Torrent Systems, Inc., Gilford, NH, USA); and MinION (Oxford Nanopore Technologies, Oxford, UK).

4.4. Phylogenetic Analysis of SARS-CoV-2 Viral Genome Sequences

A total of 2557 worldwide viral sequences corresponding to different lineages corresponding to Alpha, Beta, Gamma, Delta, Eta, Iota, Kappa, Lambda, Mu, Theta, and Omicron were retrieved from GISAID. Reference sequences used in the phylogenetic analysis must agree the following criteria to be included:
  • Correspondence to a specific variant described by the WHO.
  • First described sequences with complete viral genome, high coverage (according to GISAID those sequences with less than 1% of undefined bases (NNNs) and insertions and/or deletions verified by the submitter), and complete collection date.
  • At least 3–6 sequences per lineage and/or sublineage described for each variant.
  • Does not present unresolved nucleotide positions.
  • Size of the fragment between position 55 and 29,674 (ORF10 end) refers to the reference sequence NC_04552 available in NCBI [43]
We used MEGA7 [43] and Bioedit 7.2.5 programs [44,45] in order to ensure compliance with established criteria and to achieve a working matrix with good quality for all retrieved sequences. Finally, a total of 714 sequences were included into the final matrix to obtain a reference tree. The list of selected sequences and related data are available on GISAID and in Supplementary File S1. Omicron lineage BA.3 was not included since no sequence was found meeting all the criteria. The final matrix of selected sequences was aligned using the online service of MAFFT version 7 [46]. The alignment was reviewed and corrected to ensure virus genome biological sense since some nucleotide positions were found that did not meet this criterion. Secondly, we also generate a matrix containing only the S gene fragment (3821 nt) to compare the resolution of both phylogenetic analyses. Subsequently, phylogenetic analysis for the complete genome and the S gene were performed using IQ-TREE 2 [23] and FastTree [22], and resources available from the CIPRES Science Gateway [47]. We inferred a maximum-likelihood tree using the GTR+G model, combining the SH-aLRT test and ultrafast bootstrap with 1000 replicates in IQ-TREE 2, and the fastest GTR+G model and 1000 replicates in FastTree.

5. Conclusions

The RELECOV network actions to date have made it possible to improve and enhance the surveillance of the SARS-CoV-2 viruses that circulate in Spain through sequences analysis, monitoring of virus genome changes, and a quality control assessment of the sequencing at the national level. In addition, the phylogenetic analysis to characterize SARS-CoV-2 allowed us to gain knowledge about the circulation of variants in Spain in an independent and rapid manner in order to assist public health decisions. The described actions in the RELECOV network have comprised efforts directed to control the quality of data obtained and shared, and overall, to understand the data that could provide meaningful biological interpretation to focus on viral molecular surveillance applications for the public health.

Supplementary Materials

The supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms24108573/s1.

Author Contributions

Conceptualization, S.V.-M. and I.C. (Inmaculada Casas); methodology, S.V.-M., M.I.-C. and I.C. (Inmaculada Casas); validation, S.V.-M., M.I.-C., F.P. and I.C. (Inmaculada Casas); formal analysis, S.V.-M. and M.I.-C.; investigation, S.V.-M., M.I.-C., J.A.L., F.G., S.M., J.M.M., D.G.d.V., M.D.F., J.C.G., C.L.-C., R.B.-R., J.A.-F., F.G.C., M.d.T., M.F., C.E., F.L., S.P.C., I.C. (Isabel Cuesta), A.Z., F.P. and RELECOV Network Members; data curation, S.V.-M. and M.I.-C.; writing—original draft preparation, S.V.-M.; writing—review and editing, S.V.-M., M.I.-C., J.A.L., F.G., S.M., J.M.M., D.G.d.V., M.D.F., J.C.G., C.L.-C., R.B.-R., J.A.-F., F.G.C., M.d.T., M.F., C.E., F.L., S.P.C., A.Z., and I.C. (Inmaculada Casas); supervision, S.V.-M. and I.C. (Inmaculada Casas); project administration, I.C. (Inmaculada Casas); funding acquisition, I.C. (Inmaculada Casas). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by ECDC (agreement number ECDC/HERA/2021/024 ECD.12241), Fondo COVID (grant number COV20_00679_MPY 222/20 (ISCIII)), a UNESPA donation-“COVIDSEQ-UNESPA” (grant number MPY 226/22), Cabildo Insular de Tenerife (CGIEU0000219140 and “Apuestas científicas del ITER para colaborar en la lucha contra la COVID-19”), and Fundación Canaria Instituto de Investigación Sanitaria de Canarias (PIFIISC 21/37).

Data Availability Statement

Publicly available datasets were analysed in this study. This data can be found here: https://gisaid.org/ (accessed on 1 March 2023) and https://outbreak.info/ (accessed on 1 October 2022).

Acknowledgments

We thank Noelia Reyes, Silvia Moreno, María José Casas, Sara Camarero, and Mar Molinero from the National Reference Laboratory for their exceptional technical support. We thank Pilar Jimenez, Mercedes Jimenez, Cristina Nieto, Cristina Herranz, Marisa Navarro, and Leticia de la Vega from the Genomics unit, as well as Erika Kvalen, Alberto Lema, Luis Aranda, and Luis Chapado from the Bioinformatics unit at Instituto de Salud Carlos III who collaborated in the genomic sequence analysis and RELECOV platform development. We gratefully acknowledge all data contributors, authors, and their originating laboratories responsible for obtaining the specimens, and their submitting laboratories for generating the genetic sequence and metadata and sharing them via the GISAID Initiative, on which part of this research was based.

Conflicts of Interest

The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. WHO. Statement on the Second Meeting of the International Health Regulations (2005) Emergency Committee Regarding the Outbreak of Novel Coronavirus (2019-nCoV). Available online: https://www.who.int/news/item/30-01-2020-statement-on-the-second-meeting-of-the-international-health-regulations-(2005)-emergency-committee-regarding-the-outbreak-of-novel-coronavirus-(2019-ncov) (accessed on 1 October 2022).
  2. WHO. Responding to Community Spread of COVID-19. Available online: https://apps.who.int/iris/bitstream/handle/10665/331421/WHO-COVID-19-Community_Transmission-2020.1-eng.pdf (accessed on 1 October 2022).
  3. Gorbalenya, A.E.; Baker, S.C.; Baric, R.S.; de Groot, R.J.; Drosten, C.; Gulyaeva, A.A.; Haagmans, B.L.; Lauber, C.; Leontovich, A.M.; Neuman, B.W.; et al. The species Severe acute respiratory syndrome-related coronavirus: Classifying 2019-nCoV and naming it SARS-CoV-2. Nat. Microbiol. 2020, 5, 536–544. [Google Scholar] [CrossRef]
  4. Wu, F.; Zhao, S.; Yu, B.; Chen, Y.M.; Wang, W.; Song, Z.G.; Hu, Y.; Tao, Z.W.; Tian, J.H.; Pei, Y.Y.; et al. A new coronavirus associated with human respiratory disease in China. Nature 2020, 579, 265–269. [Google Scholar] [CrossRef] [PubMed]
  5. Zhou, P.; Yang, X.L.; Wang, X.G.; Hu, B.; Zhang, L.; Zhang, W.; Si, H.R.; Zhu, Y.; Li, B.; Huang, C.L.; et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature 2020, 579, 270–273. [Google Scholar] [CrossRef]
  6. Shu, Y.; McCauley, J. GISAID: Global initiative on sharing all influenza data—From vision to reality. Eurosurveillance 2017, 22, 30494. [Google Scholar] [CrossRef]
  7. Hadfield, J.; Megill, C.; Bell, S.M.; Huddleston, J.; Potter, B.; Callender, C.; Sagulenko, P.; Bedford, T.; Neher, R.A. Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics 2018, 34, 4121–4123. [Google Scholar] [CrossRef]
  8. Rambaut, A.; Holmes, E.C.; O’Toole, A.; Hill, V.; McCrone, J.T.; Ruis, C.; du Plessis, L.; Pybus, O.G. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat. Microbiol. 2020, 5, 1403–1407. [Google Scholar] [CrossRef] [PubMed]
  9. O’Toole, A.; Scher, E.; Underwood, A.; Jackson, B.; Hill, V.; McCrone, J.T.; Colquhoun, R.; Ruis, C.; Abu-Dahab, K.; Taylor, B.; et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 2021, 7, veab064. [Google Scholar] [CrossRef]
  10. Konings, F.; Perkins, M.D.; Kuhn, J.H.; Pallen, M.J.; Alm, E.J.; Archer, B.N.; Barakat, A.; Bedford, T.; Bhiman, J.N.; Caly, L.; et al. SARS-CoV-2 Variants of Interest and Concern naming scheme conducive for global discourse. Nat. Microbiol. 2021, 6, 821–823. [Google Scholar] [CrossRef]
  11. Nikolaidis, M.; Markoulatos, P.; Van de Peer, Y.; Oliver, S.G.; Amoutzias, G.D. The Neighborhood of the Spike Gene Is a Hotspot for Modular Intertypic Homologous and Nonhomologous Recombination in Coronavirus Genomes. Mol. Biol. Evol. 2022, 39, msab292. [Google Scholar] [CrossRef]
  12. Nikolaidis, M.; Papakyriakou, A.; Chlichlia, K.; Markoulatos, P.; Oliver, S.G.; Amoutzias, G.D. Comparative Analysis of SARS-CoV-2 Variants of Concern, Including Omicron, Highlights Their Common and Distinctive Amino Acid Substitution Patterns, Especially at the Spike ORF. Viruses 2022, 14, 707. [Google Scholar] [CrossRef]
  13. Tay, J.H.; Porter, A.F.; Wirth, W.; Duchene, S. The Emergence of SARS-CoV-2 Variants of Concern Is Driven by Acceleration of the Substitution Rate. Mol. Biol. Evol. 2022, 39, msac013. [Google Scholar] [CrossRef]
  14. Cameroni, E.; Bowen, J.E.; Rosen, L.E.; Saliba, C.; Zepeda, S.K.; Culap, K.; Pinto, D.; VanBlargan, L.A.; De Marco, A.; di Iulio, J.; et al. Broadly neutralizing antibodies overcome SARS-CoV-2 Omicron antigenic shift. Nature 2022, 602, 664–670. [Google Scholar] [CrossRef] [PubMed]
  15. WHO. SARS-CoV-2 Genomic Sequencing for Public Health Goals: Interim Guidance, 8 January 2021. Available online: https://www.who.int/publications/i/item/WHO-2019-nCoV-genomic_sequencing-2021.1 (accessed on 1 October 2022).
  16. ECDC. Guidance for Representative and Targeted Genomic SARS-CoV-2 Monitoring. Available online: https://www.ecdc.europa.eu/en/publications-data/guidance-representative-and-targeted-genomic-sars-cov-2-monitoring (accessed on 1 November 2022).
  17. European-Commission. Coronavirus: Preparing Europe for the Increased Threat of Variants. 2021. Available online: https://ec.europa.eu/commission/presscorner/detail/en/ip_21_641 (accessed on 1 October 2022).
  18. Ministerio de Sanidad. Integración de la Secuenciación Genómica en la Vigilancia del SARS-Cov-2. 2021. Available online: https://www.sanidad.gob.es/profesionales/saludPublica/ccayes/alertasActual/nCov/documentos/Integracion_de_la_secuenciacion_genomica-en_la_vigilancia_del_SARS-CoV-2.pdf (accessed on 1 November 2022).
  19. Amoutzias, G.D.; Nikolaidis, M.; Tryfonopoulou, E.; Chlichlia, K.; Markoulatos, P.; Oliver, S.G. The Remarkable Evolutionary Plasticity of Coronaviruses by Mutation and Recombination: Insights for the COVID-19 Pandemic and the Future Evolutionary Paths of SARS-CoV-2. Viruses 2022, 14, 78. [Google Scholar] [CrossRef] [PubMed]
  20. WHO. WHO Announces Simple, Easy-to-Say Labels for SARS-CoV-2 Variants of Interest and Concern. Available online: https://www.who.int/news/item/31-05-2021-who-announces-simple-easy-to-say-labels-for-sars-cov-2-variants-of-interest-and-concern (accessed on 1 October 2022).
  21. Hughes, L.; Gangavarapu, K.; Latif, A.A.; Mullen, J.; Alkuzweny, M.; Hufbauer, E.; Tsueng, G.; Haag, E.; Zeller, M.; Aceves, C.; et al. Outbreak.info genomic reports: Scalable and dynamic surveillance of SARS-CoV-2 variants and mutations. Res. Sq. 2022. [Google Scholar] [CrossRef]
  22. Price, M.N.; Dehal, P.S.; Arkin, A.P. FastTree: Computing large minimum evolution trees with profiles instead of a distance matrix. Mol. Biol. Evol. 2009, 26, 1641–1650. [Google Scholar] [CrossRef] [PubMed]
  23. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
  24. ECDC. ECDC Strategic Framework for the Integration of Molecular and Genomic Typing into European Surveillance and Multi-country Outbreak Investigations. Available online: https://www.ecdc.europa.eu/en/publications-data/ecdc-strategic-framework-integration-molecular-and-genomic-typing-european (accessed on 1 October 2022).
  25. Colijn, C.; Earn, D.J.; Dushoff, J.; Ogden, N.H.; Li, M.; Knox, N.; Van Domselaar, G.; Franklin, K.; Jolly, G.; Otto, S.P. The need for linked genomic surveillance of SARS-CoV-2. Can. Commun. Dis. Rep. 2022, 48, 131–139. [Google Scholar] [CrossRef]
  26. Leite, J.A.; Vicari, A.; Perez, E.; Siqueira, M.; Resende, P.; Motta, F.C.; Freitas, L.; Fernandez, J.; Parra, B.; Castillo, A.; et al. Implementation of a COVID-19 Genomic Surveillance Regional Network for Latin America and Caribbean region. PLoS ONE 2022, 17, e0252526. [Google Scholar] [CrossRef]
  27. Schubert, G.; Achi, V.; Ahuka, S.; Belarbi, E.; Bourhaima, O.; Eckmanns, T.; Johnstone, S.; Kabore, F.; Kra, O.; Mendes, A.; et al. The African Network for Improved Diagnostics, Epidemiology and Management of common infectious Agents. BMC Infect. Dis. 2021, 21, 539. [Google Scholar] [CrossRef]
  28. Smith, D.; Bashton, M. An integrated national scale SARS-CoV-2 genomic surveillance network. Lancet Microbe 2020, 1, e99–e100. [Google Scholar] [CrossRef]
  29. Lopez, M.G.; Chiner-Oms, A.; Garcia de Viedma, D.; Ruiz-Rodriguez, P.; Bracho, M.A.; Cancino-Munoz, I.; D’Auria, G.; de Marco, G.; Garcia-Gonzalez, N.; Goig, G.A.; et al. The first wave of the COVID-19 epidemic in Spain was associated with early introductions and fast spread of a dominating genetic variant. Nat. Genet. 2021, 53, 1405–1414. [Google Scholar] [CrossRef]
  30. WHO. Global Genomic Surveillance Strategy for Pathogens with Pandemic and Epidemic Potential, 2022–2032. Available online: https://www.who.int/publications/i/item/9789240046979 (accessed on 1 February 2023).
  31. Chen, Z.; Azman, A.S.; Chen, X.; Zou, J.; Tian, Y.; Sun, R.; Xu, X.; Wu, Y.; Lu, W.; Ge, S.; et al. Global landscape of SARS-CoV-2 genomic surveillance and data sharing. Nat. Genet. 2022, 54, 499–507. [Google Scholar] [CrossRef] [PubMed]
  32. Li, Q.; Wu, J.; Nie, J.; Zhang, L.; Hao, H.; Liu, S.; Zhao, C.; Zhang, Q.; Liu, H.; Nie, L.; et al. The Impact of Mutations in SARS-CoV-2 Spike on Viral Infectivity and Antigenicity. Cell 2020, 182, 1284–1294. [Google Scholar] [CrossRef]
  33. Lubinski, B.; Fernandes, M.H.V.; Frazier, L.; Tang, T.; Daniel, S.; Diel, D.G.; Jaimes, J.A.; Whittaker, G.R. Functional evaluation of the P681H mutation on the proteolytic activation the SARS-CoV-2 variant B.1.1.7 (Alpha) spike. bioRxiv 2021, 10.1101/2021.04.06.438731. [Google Scholar] [CrossRef] [PubMed]
  34. Liu, Y.; Liu, J.; Plante, K.S.; Plante, J.A.; Xie, X.; Zhang, X.; Ku, Z.; An, Z.; Scharton, D.; Schindewolf, C.; et al. The N501Y spike substitution enhances SARS-CoV-2 infection and transmission. Nature 2022, 602, 294–299. [Google Scholar] [CrossRef] [PubMed]
  35. Saito, A.; Irie, T.; Suzuki, R.; Maemura, T.; Nasser, H.; Uriu, K.; Kosugi, Y.; Shirakawa, K.; Sadamasu, K.; Kimura, I.; et al. Enhanced fusogenicity and pathogenicity of SARS-CoV-2 Delta P681R mutation. Nature 2022, 602, 300–306. [Google Scholar] [CrossRef] [PubMed]
  36. Zhao, Z.; Zhou, J.; Tian, M.; Huang, M.; Liu, S.; Xie, Y.; Han, P.; Bai, C.; Han, P.; Zheng, A.; et al. Omicron SARS-CoV-2 mutations stabilize spike up-RBD conformation and lead to a non-RBM-binding monoclonal antibody escape. Nat. Commun. 2022, 13, 4958. [Google Scholar] [CrossRef]
  37. Ou, J.; Lan, W.; Wu, X.; Zhao, T.; Duan, B.; Yang, P.; Ren, Y.; Quan, L.; Zhao, W.; Seto, D.; et al. Tracking SARS-CoV-2 Omicron diverse spike gene mutations identifies multiple inter-variant recombination events. Signal Transduct. Target Ther. 2022, 7, 138. [Google Scholar] [CrossRef]
  38. Cov-lineages.org. Available online: https://cov-lineages.org/lineage_list.html (accessed on 1 October 2022).
  39. Amicone, M.; Borges, V.; Alves, M.J.; Isidro, J.; Ze-Ze, L.; Duarte, S.; Vieira, L.; Guiomar, R.; Gomes, J.P.; Gordo, I. Mutation rate of SARS-CoV-2 and emergence of mutators during experimental evolution. Evol. Med. Public Health 2022, 10, 142–155. [Google Scholar] [CrossRef]
  40. Bush, R.M. Predicting adaptive evolution. Nat. Rev. Genet. 2001, 2, 387–392. [Google Scholar] [CrossRef]
  41. Influenza Virus Characterization: Summary Report, Europe, September 2022. Copenhagen and Stockholm: World Health Organization Regional Office for Europe and European Centre for Disease Prevention and Control; 2022. Licence: CC BY 3.0 IGO. Available online: https://www.ecdc.europa.eu/sites/default/files/documents/Influenza-ECDC-WHO-report-Sep-2022.pdf (accessed on 1 February 2023).
  42. Brito, A.F.; Semenova, E.; Dudas, G.; Hassler, G.W.; Kalinich, C.C.; Kraemer, M.U.G.; Ho, J.; Tegally, H.; Githinji, G.; Agoti, C.N.; et al. Global disparities in SARS-CoV-2 genomic surveillance. Nat. Commun. 2022, 13, 7003. [Google Scholar] [CrossRef] [PubMed]
  43. Nucleotide, National Library of Medicine. National Center for Biotechnology Information (NCBI). MD, USA. Available online: https://www.ncbi.nlm.nih.gov/nucleotide/ (accessed on 1 July 2022).
  44. Kumar, S.; Stecher, G.; Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol. Biol. Evol. 2016, 33, 1870–1874. [Google Scholar] [CrossRef] [PubMed]
  45. Hall, T.A. BioEdit: A user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp. Ser. 1999, 41, 95–98. [Google Scholar]
  46. Katoh, K.; Rozewicki, J.; Yamada, K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2019, 20, 1160–1166. [Google Scholar] [CrossRef]
  47. Miller, M.A.; Pfeiffer, W.; Schwartz, T. Creating the CIPRES Science Gateway for inference of large phylogenetic trees. In Proceedings of the 2010 Gateway Computing Environments Workshop (GCE), New Orleans, LA, USA, 14 November 2010; pp. 1–8. [Google Scholar]
Figure 1. Percentage of sequences corresponding to different variants by year. SARS-CoV-2 viral sequences from Spain shared in GISAID between 05.09.2021 and 08.03.2022. NA: not applicable (lineages without variant assignment). VUM: Variant Under Monitoring.
Figure 1. Percentage of sequences corresponding to different variants by year. SARS-CoV-2 viral sequences from Spain shared in GISAID between 05.09.2021 and 08.03.2022. NA: not applicable (lineages without variant assignment). VUM: Variant Under Monitoring.
Ijms 24 08573 g001
Figure 2. Time course of variants in Spain between 05.09.2021 and 08.03.2022 by month (sequences shared in GISAID). NA: Not applicable (lineages without variant assignment). VUM: Variant Under Monitoring.
Figure 2. Time course of variants in Spain between 05.09.2021 and 08.03.2022 by month (sequences shared in GISAID). NA: Not applicable (lineages without variant assignment). VUM: Variant Under Monitoring.
Ijms 24 08573 g002
Figure 3. Results for variant (green) and lineage (blue) assignments in ten samples for the RELECOV network participants.
Figure 3. Results for variant (green) and lineage (blue) assignments in ten samples for the RELECOV network participants.
Ijms 24 08573 g003
Figure 4. Phylogenetic tree, showed as cladogram, of the 713 worldwide (76 countries and territories) complete viral genomes retrieved from GISAID and NCBI NC_045512.2 SARS-CoV-2. Each colour represents a cluster associated with a specific variant identified by WHO. Nodes values: SH-aLRT/UFBoot (ultrafast bootstrap) supports.
Figure 4. Phylogenetic tree, showed as cladogram, of the 713 worldwide (76 countries and territories) complete viral genomes retrieved from GISAID and NCBI NC_045512.2 SARS-CoV-2. Each colour represents a cluster associated with a specific variant identified by WHO. Nodes values: SH-aLRT/UFBoot (ultrafast bootstrap) supports.
Ijms 24 08573 g004
Figure 5. Phylogenetic tree, showed as cladogram, for the S gene of the 713 worldwide (76 countries and territories) sequences retrieved from GISAID and NCBI NC_045512.2 SARS-CoV-2 reference. Each colour represents a cluster associated with specific variants identified by WHO. Nodes values: SH-aLRT/UFBoot (ultrafast bootstrap) supports.
Figure 5. Phylogenetic tree, showed as cladogram, for the S gene of the 713 worldwide (76 countries and territories) sequences retrieved from GISAID and NCBI NC_045512.2 SARS-CoV-2 reference. Each colour represents a cluster associated with specific variants identified by WHO. Nodes values: SH-aLRT/UFBoot (ultrafast bootstrap) supports.
Ijms 24 08573 g005
Table 1. SARS-CoV-2 viral sequences from Spain deposited in GISAID between 05.09.2021 and 05.12.2021.
Table 1. SARS-CoV-2 viral sequences from Spain deposited in GISAID between 05.09.2021 and 05.12.2021.
VariantNumber of SequencesNumber of Lineages
Alpha11853
Beta872
Gamma969
Delta18,272126
Mu1713
Omicron84
Lambda402
2 VUM11
Eta271
Epsilon11
1 NA48348
Unassigned lineage61NA
Recombinant3NA
Total20,435200
1 NA: Not applicable (lineages without variant assignment). 2 VUM: Variant Under Monitoring.
Table 2. SARS-CoV-2 viral sequences from Spain deposited in GISAID between 06.12.2021 and 08.03.2022.
Table 2. SARS-CoV-2 viral sequences from Spain deposited in GISAID between 06.12.2021 and 08.03.2022.
VariantNumber of SequencesNumber of Lineages
Alpha722
Beta22
Delta12,733107
Gamma42
Mu153
Omicron15,25148
Unassigned lineage16NA
Recombinant1NA
1 NA498
Total28,143172
1 NA: Not applicable (lineages without variant assignment).
Table 3. Characteristics of the samples and results of the 37 RELECOV network participants reporting results in the QCA.
Table 3. Characteristics of the samples and results of the 37 RELECOV network participants reporting results in the QCA.
SampleVariantLineageVariantLineage
Number of Correct Results (%)Number of Correct Results (%)
#1AlphaB.1.1.737 (100)36 (97.30)
#2BetaB.1.35137 (100)37 (100)
#31 NAA.2836 (97.30)36 (97.30)
#4MuB.1.62137 (100)37 (100)
#5GammaP.137 (100)34 (91.89)
#6DeltaAY.9.237 (100)33 (89.19)
#7DeltaAY.4314 (37.83)11 (29.73)
#8DeltaAY.9436 (97.30)31 (83.78)
#9DeltaAY.9437 (100)32 (86.49)
#10DeltaAY.4337 (100)37 (100)
1 NA: Not applicable.
Table 4. Characteristics of the QCA panel of samples.
Table 4. Characteristics of the QCA panel of samples.
SampleVariantLineagesPre-Shipment 2 CtsPost-Shipment 2 Cts
56 Days96 Days
#1AlphaB.1.1.719.117.821.0
#2BetaB.1.35117.616.019.2
#31 NAA.2822.819.621.6
#4MuB.1.62118.717.018.5
#5GammaP.117.718.720.6
#6DeltaAY.9.225.222.625.7
#7DeltaAY.4321.321.624.7
#8DeltaAY.9416.216.619.4
#9DeltaAY.9416.219.420.5
#10DeltaAY.4321.320.821.9
1 NA: Not applicable; 2 Cts = Cycle threshold.
Table 5. Sample homogeneity and stability.
Table 5. Sample homogeneity and stability.
SampleAverage1 SD2 CIAverage ± CI 2 (95%)
#119.31.331.5019.3 ± 1.50
#217.61.311.4817.6 ± 1.48
#321.31.331.5021.3 ± 1.50
#418.10.770.8718.1 ± 0.87
#519.01.211.3619.0 ± 1.36
#624.51.341.5224.5 ± 1.52
#722.61.521.7222.6 ± 1.72
#817.41.441.6317.4 ± 1.63
#918.71.832.0718.7 ± 2.07
#1021.30.460.5221.3 ± 0.52
1 SD: standard deviation; 2 CI: confidence interval.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Vázquez-Morón, S.; Iglesias-Caballero, M.; Lepe, J.A.; Garcia, F.; Melón, S.; Marimon, J.M.; García de Viedma, D.; Folgueira, M.D.; Galán, J.C.; López-Causapé, C.; et al. Enhancing SARS-CoV-2 Surveillance through Regular Genomic Sequencing in Spain: The RELECOV Network. Int. J. Mol. Sci. 2023, 24, 8573. https://doi.org/10.3390/ijms24108573

AMA Style

Vázquez-Morón S, Iglesias-Caballero M, Lepe JA, Garcia F, Melón S, Marimon JM, García de Viedma D, Folgueira MD, Galán JC, López-Causapé C, et al. Enhancing SARS-CoV-2 Surveillance through Regular Genomic Sequencing in Spain: The RELECOV Network. International Journal of Molecular Sciences. 2023; 24(10):8573. https://doi.org/10.3390/ijms24108573

Chicago/Turabian Style

Vázquez-Morón, Sonia, María Iglesias-Caballero, José Antonio Lepe, Federico Garcia, Santiago Melón, José M. Marimon, Darío García de Viedma, Maria Dolores Folgueira, Juan Carlos Galán, Carla López-Causapé, and et al. 2023. "Enhancing SARS-CoV-2 Surveillance through Regular Genomic Sequencing in Spain: The RELECOV Network" International Journal of Molecular Sciences 24, no. 10: 8573. https://doi.org/10.3390/ijms24108573

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop