Next Article in Journal
Detection and Molecular Characterization of Animal Adenovirus and Astrovirus from Western Maharashtra, India
Next Article in Special Issue
Genomic Analysis of Two Cold-Active Pseudoalteromonas Phages Isolated from the Continental Shelf in the Arctic Ocean
Previous Article in Journal
New Concepts in Therapeutic Manipulation of HIV-1 Transcription and Latency: Latency Reversal versus Latency Prevention
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Molecular Population Genetics of Aspen Mosaic-Associated Virus in Finland and Sweden

by
Shaheen Nourinejhad Zarghani
1,
Rim Al Kubrusli
1,
Serghei Iancev
1,
Risto Jalkanen
2,
Carmen Büttner
1 and
Susanne von Bargen
1,*
1
Division Phytomedicine, Faculty of Life Sciences, Albrecht Daniel Thaer-Institute of Agricultural and Horticultural Sciences, Humboldt-Universität zu Berlin, Lentzeallee 55-57, 14197 Berlin, Germany
2
Silva Lapponica, 96460 Rovaniemi, Finland
*
Author to whom correspondence should be addressed.
Viruses 2023, 15(8), 1678; https://doi.org/10.3390/v15081678
Submission received: 20 July 2023 / Revised: 29 July 2023 / Accepted: 31 July 2023 / Published: 1 August 2023
(This article belongs to the Special Issue Diversity and Evolution of Viruses in Ecosystem)

Abstract

:
Aspen mosaic-associated virus (AsMaV) is a newly identified Emaravirus, in the family Fimoviridae, Bunyavirales, associated with mosaic symptoms in aspen trees (Populus tremula). Aspen trees are widely distributed in Europe and understanding the population structure of AsMaV may aid in the development of better management strategies. The virus genome consists of five negative-sense single-stranded RNA (–ssRNA) molecules. To investigate the genetic diversity and population parameters of AsMaV, different regions of the genome were amplified and analyzed and full-length sequence of the divergent isolates were cloned and sequenced. The results show that RNA3 or nucleoprotein is a good representative for studying genetic diversity in AsMaV. Developed RT–PCR–RFLP was able to identify areas with a higher number of haplotypes and could be applied for screening the large number of samples. In general, AsMaV has a conserved genome and based on the phylogenetic studies, geographical structuring was observed in AsMaV isolates from Sweden and Finland, which could be attributed to founder effects. The genome of AsMaV is under purifying selection but not distributed uniformly on genomic RNAs. Distant AsMaV isolates displayed amino acid sequence variations compared to other isolates, and bioinformatic analysis predicted potential post-translational modification sites in some viral proteins.

1. Introduction

Aspen trees (Populus tremula) are widely distributed across Europe and play an important ecological and economic role in the region. These trees are particularly abundant in Northern Europe, including Finland, and Sweden. Aspens are known for their adaptability, fast growth, and ability to colonize disturbed areas [1]. They also provide a wide range of ecosystem services, including habitats for wildlife, carbon sequestration, soil stabilization, and water regulation [2,3,4]. In addition to their ecological significance, aspen trees have cultural and historical importance to people worldwide. In addition to fibers being used in the production of pulp as strand board, solid wood, and furniture, aspen wood is used in many specialty products, for instance, pencils, skis, sauna benches, and coffins [2].
Recently, it has been reported that aspen mosaic-associated virus (AsMaV) is associated with leaf symptoms, such as mottle, yellow blotching, variegation, and chloroses along veins [5]. Our surveys of many years have led to the conclusion that the virus is probably associated with the decline in aspen trees [5]. AsMaV is a member of the virus genus Emaravirus, which includes other plant pathogens such as the European mountain ash ringspot-associated virus and the rose rosette virus [6]. AsMaV has a segmented negative-sense single-stranded RNA (–ssRNA) genome designated as RNA1–RNA5. The genome of AsMaV consists of five –ssRNA molecules. The RNA1 (7.1 kb) encodes for the viral RNA-dependent RNA polymerase (RdRP, 268.2 kDa) and it is involved in virus replication. The RNA2 (2.3 kb), RNA3 (1.6 kb), RNA4 (1.6 kb), and RNA5 (1.3 kb) encode for the glycoprotein precursor (GPP, 73.5 kDa), viral nucleocapsid protein (NP, 35.6 kDa), a putative movement protein (MP, 41.0 kDa), and a protein of unknown function (P28, 28.1 kDa), respectively [5].
RNA viruses, which account for the majority of plant viruses, have high mutation rates due to the error prone nature of viral replicase or RdRP followed by other mechanisms such as recombination and reassortment [7,8,9,10,11]. The introduced mutations during the replication cycle of a virus are mostly deleterious for the virus, but sometimes they may lead to the generation of new isolates with altered phenotypes, such as host range, symptom severity, and transmission efficiency [12,13,14,15,16,17]. The effect of these phenomena in shaping of plant viruses has been reviewed [18,19,20,21,22,23,24,25,26,27].
The genetic diversity of plant viruses is important for understanding their evolution, spread, and potential of emerging new isolates may that are resistant to existing control measures. The genome of the emaraviruses consist of at least four –ssRNA (RNA1–4, core genome) but could extend to twelve segments [6,28]. The genetic diversity of a few members of emaraviruses, including fig mosaic virus (FMV) [29,30], blackberry leaf mottle-associated virus (BLMaV) [31], maple mottle-associated virus (MaMaV) [32], raspberry leaf blotch virus (RLBV) [33,34], rose rosette virus (RRV) [35], jujube yellow mottle-associated virus (JYMaV) [36], Perilla mosaic virus (PerMV) [37], European mountain ash ringspot-associated virus (EMARaV) [38], and High Plains wheat mosaic virus (HPWMoV) [39], has been studied. These studies have utilized one or more partial or complete viral genomic RNA(s), among which RNA3 has been identified as the main genomic region for studying the genetic diversity of emaraviruses followed by RNA4 and partial RNA1, all belonging to the core genome of emaraviruses. It should be mentioned that NP has been used not only in the genetic diversity of emaraviruses but also in all –ssRNA viruses [40,41,42,43]. Moreover, the NP is important in immunological tests for the identification of –ssRNA viruses and has been used in human [44] and plant viruses such as emaraviruses [45]. The aim of this study is the evaluation of the AsMaV diversity of populations originating from Finland and Sweden where the virus is widely distributed [5], based mainly on RNA3, RNA4, and partial RNA1 which encode for NP, MP, and RdRp, respectively.

2. Materials and Methods

2.1. Plant Source

A total number of 83 leaf samples of aspen trees showing mosaic, mottling, vein yellowing, line patterns, and chlorotic ring spots were collected in the summer of 2019. The samples were gathered from various forest sites in Finland, namely Oikarainen-Rovaniemi, Santa Claus Holiday Village, Hyypiöoja-Rovaniemi, Lustila-Rovaniemi, Kivitaipale-Rovaniemi, Petäjäinen-Rovaniemi, Mattinen-Tervola, Maula-Keminmaa, Peura, Tervalo, Lautiosaari-Keminmaa, Ritikka-Kemi, Kuninkaansaari-Helsinki, Hyypiökivalo, and Vallisaari-Helsinki. Additionally, sampling was conducted in several forest sites in Sweden, including Skede-Kråkshult, Ångelsjön Nature Reserve, Fagertärn-road, Sunnersberg-Fiskbäck, Kiså, Stora-Anrås, and Boliden. These samples were immediately transferred to the lab and kept at −70 °C for later use. The complete list of collected samples, along with their corresponding symptoms and geographical origin, is provided in Supplementary Table S1.

2.2. Primers, Detection of AsMaV by RT–PCR

A minimum of five symptomatic leaves from the same tree were selected and neatly piled up. Then, approximately 1–2 cm2 of the leaves (0.2–0.3 g) were used for RNA extraction. Total RNA was isolated according to the protocol described by Boom et al. [46] and 1.5 µg of the extracted RNA was applied as a template for cDNAs synthesis using random hexamers and Maxima H Minus Reverse Transcriptase kit (Thermo Scientific™, Waltham, MA, USA) following the protocol provided by the manufacturer. The presence of AsMaV in the collected samples was confirmed through a series of RT–PCRs employing specific primers (Supplementary Table S2). These primers were designed to hybridize with AsMaV-RNA3 to RNA5, following established method described elsewhere [5,47]. To design the novel primers, we utilized the RNA1 to RNA5 sequences of AsMaV isolates E55089 (GenBank accession No. LR74246, LR742462–65) as a reference. The detailed characteristics of these newly designed primers are provided in Supplementary Table S2. Notably, the isolate E55089 and its corresponding genomic molecules were used as a standard for nucleotide and amino acid positioning.

2.3. AsMaV–RNA3 RT–PCR–RFLP

The primer PDAP–213 [48], which hybridizes to the conserved 13 terminal nucleotides of all emaraviruses, was used to amplify RNA3 and RNA4 in at least 9 isolates. These PCR products were cloned and sequenced and the resulting data were used for designing specific primers for amplification of almost full-length RNA3 and RNA4 (Supplementary Table S2). Therefore, primer pair AsMaV–R3–18–s/AsMaV–R3–end–as was used for the amplification of almost full length RNA3 in remaining AsMaV isolates. The RT–PCR products were gel-extracted and subjected to restriction fragment length polymorphism (RFLP) using the FastDigest AanI restriction enzyme (Thermo Scientific™, Waltham, MA, USA). Isolates were grouped based on the RFLP profiles (restrictotypes) and sampling location. At least three samples from each location and each restrictotype were selected for downstream analysis.

2.4. Amplification of Different Genomic Regions

The primer pairs AsMaV–R4–12–s/AsMaV–R4–end–as and AsMaV–R1AE–1–s/AsMaV–R1A–1627–as were used for amplification of almost full-length RNA4 and partial RNA1 (nucleotides 1–1637), respectively. The sequences of primers are presented in the Supplementary Table S2. The primer PDAP–213 [48], which is hybridizing to the conserved 13 terminal nucleotides of all emaraviruses, was used to amplify RNA2, RNA3, RNA4, and RNA5 of isolates E58627 and E58632 according to the method described by [48]. RNA1 of isolates E58627 and E58632 was amplified in overlapping fragments using the primer pairs of AsMaV–R1AE–1–s/AsMaV–R1A–1627–as, AsMaV–R1B–1541–s/AsMaV–R1B–3107–as, AsMaV–R1C–2953–s/AsMAV–R1C–4492–as, AsMaV–R1D–4435–s/AsMaV–R1D–6011–as, and AsMaV–R1E–5968–s/AsMaV–R1AE–1–s. All PCRs were set up using the VELOCITY DNA Polymerase, a proof-reading DNA polymerase (BioCat GmbH, Heidelberg, Germany) in a 50 µL volume according to the manufacturer’s recommendations. RT–PCR products were separated by agarose gel electrophoresis, and expected DNA fragments were cut out and isolated using the GeneJET Gel Extraction Kit (Thermo Scientific, Waltham, MA, USA). The isolated fragments were subjected to direct sequencing or Sanger sequencing after cloning.

2.5. Cloning, Sequencing, and Data Analysis

CloneJET PCR Cloning Kit (Thermo Scientific) was used for cloning RT–PCR products following the manufacturer’s protocol. The sequencing was performed at Macrogen Europe (Amsterdam, The Netherlands). Sequence alignments were performed by MUSCLE program implemented into Geneious Prime ver. 2022.2.2 (12 December 2021) (https://www.geneious.com/). The best-fitting model of nucleotide substitution for each alignment was determined using Find best DNA/Protein models implemented in MEGA11 software (12 December 2021) [49]. The neighbor-joining (NJ, as a distance-based method) trees were generated using MEGA11 [49] and consensus maximum parsimony (CMP, as an evolutionary-based method) trees constructed by Phylogeny Inference Package (Phylip) version 3.65 software [50]. A bootstrap value of 1000 replicates for each node of the generated trees was calculated, and the trees were viewed using TreeDyn (available online at http://www.treedyn.org, 12 December 2021). The branches with bootstrap values lower than 70% were collapsed. Population genetic parameters were estimated by DnaSP version 6.12.03 software [51]. The corresponding regions of fig mosaic virus (FMV), pistacia virus B (PiVB), and pigeonpea sterility mosaic virus 2 (PPSMV–2) were used as outgroups in the analyses. RNAfold web server (http://rna.tbi.univie.ac.at//cgi-bin/RNAWebSuite/RNAfold.cgi, 12 April 2023 ) was used for prediction of the secondary structure of RNA based on the minimum free energy (MFE) and Centroid secondary structure with the minimum base pair distance prediction. RDP5 [52] was used for the recombination analysis. SignalP–6.0 (https://services.healthtech.dtu.dk/services/SignalP-6.0/, 12 April 2023), Prediction of Signal Peptides and their cleavage sites in all domains of life, was used for the prediction of cleavage sites of AsMaV GPP and NetNGlyc–1.0 (https://services.healthtech.dtu.dk/services/NetNGlyc-1.0/, 12 April 2023) was used for identification of post-translational modifications of the GPP. PopART software was used for generating the distribution of the restrictotype distribution of a geographical map [53,54,55].

3. Results and Discussion

3.1. Detection of AsMaV and RT–PCR–RFLP

RT–PCR was performed using detection primers for AsMaV [5] on total RNA extracted from collected plant material, resulting in the amplification of expected products of approximately 317, 288, and 672 bp for AsMaV RNA3–5, respectively, in 34 samples from Sweden and 49 samples from Finland. Using the primer set, AsMaV–R3–18–s/AsMaV–R3–end–as and AanI restriction enzyme, electrophoresis of digested RT–PCR products revealed seven different restrictotypes (Table 1). Restrictotype VII was the most prevalent and dominant type found in all tested areas, except in Mattinen-Tervola (Finland) and Skede-Kråkshult (Sweden), where only Type IV and Type VI profiles were found, respectively. Notably, Kuninkaansaari-Helsinki and Vallisaari-Helsinki (Finland), the two small islands connected by a narrow bridge, showed five different restrictotypes (Type II, IV, VI, and VII). Similarly, Kiså (Sweden) showed three different restrictotypes (Type III, V, and VII). Identical restrictotypes were identified in both countries, suggesting the fixation of these mutations in RNA3. At least three samples from each location and each of restrictotypes were selected for sequencing and further analysis (in total 27 isolates). There was no correlation between geographical origin and restrictotype, and the distribution of restrictotypes was not uniform across all studied geographical regions.

3.2. Genetic Diversity and Phylogeny of AsMaV Isolates Based on RNA3

The length of AsMaV–RNA3 in sequenced isolates was 1588–1589 nucleotides in size, with 86.8–100% sequence identity among the isolates. Isolates E58213 (Sweden), E58627, and E57260 (Finland) were the most divergent isolates. The 3′UTR region exhibited the highest level of sequence diversity (Table 2). The length of AsMaV RNA3 was previously reported to be 1587 nucleotides (nts) for isolate E55089 (GenBank Accession number LR742463). In 80% of the sequenced isolates, RNA3 was 1588 nts long, while in the remaining isolates, it was 1589 nucleotides in size. The ORF3 started at nt 101 and ended at nt 1057 with the exception of isolates E58627, E58213, and E57260 which ended at nt 1060 (Figure 1). Consequently, the length of the P3 gene or NP gene was 957 or 960 nucleotides encoding for 318–319 amino acids (aa), respectively. As previously published, RNA3 has a very long 3′ UTR, i.e., 530 nts [5]. Three different stop codons were detected in AsMaV ORF3, with the amber stop codon (UGA) being the most prevalent in the majority of isolates (18 out of 27). The ochre (UAA) and opal (UGA) stop codons were detected in six and three isolates, respectively (Figure 1). Stop codons have varying termination efficiencies (UAA > UAG > UGA). In plant viruses, either amber (UAG) or opal (UGA) stop codons are read through; there are no examples of read-through of the natural ochre (UAA) stop codon [56]. The presence of several stop codons in different frame shifts downstream of the ORF3 might be denoting the importance of the NP size.
The three conserved motifs previously reported for the NP of emaraviruses (NXXSXNXXXA, NXLA, and GYEF) [57,58] were also found in AsMaV–NP. The first motif identified in AsMaV–NP was NIVSFNKACA131–140 which was present in all sequenced isolates, except for E58446 and E58434 where the last amino acid was P instead of A. The motif NRLA and GYEF were found in positions 179–182 and 200–203, respectively. The first motif has not been reported for (PerMV) and motif GYEF was replaced with GVEF159–162. The aa X139 was also R in the motif NXLA138–141 of PerMV. In the AsMaV and PerMV, there is a 21 aa distance between the second and third motif, which appears to be conserved among the emaraviruses and essential for the folding or functioning of the protein. However, the distance between the first and second motif (35–39 amino acids) was not conserved among the emaraviruses.
Sequence data confirmed the presence of AanI site in RNA3 at the positions 627–632, 1069–1074, 1142–1147, 1406–1411, and 1473–1478 based on the nt sequence of the isolate E55089 (GenBank Acc. No. LR742463). Among these restriction sites, only one of them was placed in the ORF3 (nts 627–632), and the rest of the AanI restriction sites were detected in the 3′ UTR. The first nt of AanI restriction site (T628) was placed in the position of nt 627–632 (5′–CTTATAATC–3′) and is the second codon position of aa leucine. Therefore, the second nucleotide position of the AanI site is the third-codon position of leucine in which any changes in that position do not result in non-synonymous substitutions. Interestingly, the nt 627–632 AanI site is adjacent to the emaravirus motif NxLA (Table 1). The rest of the AanI sites are spanned in 3′ UTR which was the largest untranslated region of AsMaV and the most divergent part of RNA3. Searching for the secondary structure of RNA in the long 3′ UTR based on the minimum free energy (MFE) and Centroid secondary structure with the minimum base pair distance prediction (http://rna.tbi.univie.ac.at//cgi-bin/RNAWebSuite/RNAfold.cgi, 12 April 2023) in this region showed the potential complementary regions forming stem-loop structures. Interestingly, nt 1272–1358 in the 3′ UTR was the most conserved AT-rich region. This region did not harbor any secondary structure which might explain the toleration of mutations resulting in variations in this region. A similar sequence was not found when we applied BLAST search.
The substitution models of T92 + G and TN93 + G + I were selected as the best substitution models for AsMaV RNA3–4 and the partial RNA1, respectively, using the Find best DNA/Protein fit model test implemented in MEGA11 [49]. Similar tree topologies were obtained with both neighbor-joining and parsimony methods. Therefore, we only present and discuss the parsimony-generated trees. Based on the maximum parsimony tree generated from almost full-length 26 (18 Finnish and 8 Swedish) AsMaV RNA3 sequenced in this study and one previously reported isolate (LR742463) at the nucleotide level, AsMaV isolates clustered into four main groups. The first group, R3–G1, included E57260, E58627 (both from Finland), and E58213 (Sweden), and the second group, R3–G2, consisted of E58632 from Finland and E58227 from Sweden, group 3 (R3–G3) included the rest of the Swedish isolates (E58191, E58209, E58299, E58247, E58306, and E55089 with the E58491 isolate from Finland, and finally the rest of the Finnish isolates were placed in group 4 (R3–G4), i.e., E58377, E58385, E58386, E58397, E58398, E58405, E58413, E58421, E58433, E58434, and E58426. The identical grouping profile was documented when the tree was drawn based on the coding region of RNA3 (ORF3) at the nucleotide level. A similar tree, but with one difference was obtained when the phylogeny was based on the deduced amino acid sequences. In the latter tree, isolates E58227 (Sweden) and E58632 remained ungrouped. In general, isolates from Finnish and Swedish locations were separated based on their geographical origin with the exception of three isolates E58213 and E58227 from Sweden and E58491 from Finland based on the constructed phylogenetic trees obtained from RNA3. Interestingly, a robust correlation was observed between restrictotypes and phylogenetic groups, whereas no significant correlation was found between the geographical origin of the isolates and restrictotypes. Notably, restrictotypes I and II clustered together in the phylogenetic group R3–G1, restrictotypes III and IV formed a cohesive group in R3–G2, restrictotype V exhibited a close affinity with the Finnish isolates of restrictotype VII in R3–G3, and restrictotype VI demonstrated a close association with the Swedish isolates of restrictotype VII in R3–G4 (Figure 2 and Table 1).

3.3. Genetic Diversity and Phylogeny of AsMaV Isolates Based on RNA4

The length of RNA4 in sequenced isolates was 1557 nucleotides in size and there were no indel events. The length of 5′ UTR, ORF4, and 3′ UTR of RNA 4 were 86, 1086 (361 aa), and 385 nucleotides, respectively. The level of 95.1–99.9% and 95.8–99.4% identity on nucleotide level was estimated in pairwise comparisons of the isolates from Swedish and Finnish locations, respectively. Samples E58227 and E58209 from Sweden and the isolates E58421 and E58491 from Finland were the most divergent sequences of each geographical region. Like RNA3, the 3′ UTR was the most divergent region of the genomic RNA4 (Table 2).
The ORF4 in the sequenced isolates of AsMaV initiates with an AUG at position 87–90 and terminates with a UGA stop codon at position 1170–1172 (numbers based on the GenBank Acc. No. acc. no. LR742464). Interestingly, there were one or two UAA stop codons (in frame) in positions 1176–1181. This might denote that the size of MP plays an important role in the function of this protein. Alignment of the deduced amino acid sequences from published emaraviruses identified the following conserved amino acids or motifs: K58, Y72, R88I89XXXXXX(W/Y)97XP98, D113XR115, V146, D168I169X(K/Y)171(I/V)172, M183, W197XT199, (F/Y)215, E224, and (I/L)326. Similar motifs have been previously reported for PPSMV–1 and PPSMV–2, RLBV and RRV movement proteins and it has been suggested that some of these motifs showed a similarity to membrane-located protein and they might be involved in virus cell-to-cell movement [59,60].
In the CMP tree, constructed based on the AsMaV-RNA4, the 25 isolates sequenced in this study (18 from Finland and 7 from Sweden) and one previously reported isolate E55089 with 1000 bootstrap replicates at the nt level were placed in two main groups (R4–G1 and R4–G2). Isolate E58213 (Sweden) was placed in a separate clade close to the outgroups (FMV, PiVB, and PPSMV–2) (Figure 3). The R4–G1–A includes E58227, E58299, and E55089 from Sweden and an isolate from Finland, E58413. R4–G1–B includes three isolates from Finland, i.e., E58405, E58421, and E58434. The rest of the isolates from Sweden were placed as a clade in R4–G2 and the remaining Finnish isolates were placed in three subclades or remained ungrouped.
Most of the AsMaV isolates remained ungrouped in the CMP tree constructed based on the ORF4 at the amino acid level (data not shown). It might be that these sequences were not parsimoniously or phylogenetically informative. According to the CMD tree drawn based on the nucleotide sequence of ORF4, the Swedish isolates were grouped in two clades, ORF4–G2 and ORF4–G7, and isolate E58213 remained ungrouped. Five Finnish isolates were not placed in any group and isolate E58413 was placed between Swedish isolates in ORF4–G7. The rest of the Finnish isolates were grouped into five groups.

3.4. Genetic Diversity and Phylogeny of AsMaV Isolates Based on Partial RNA1

Based on the consensus maximum parsimony tree generated from 25 partial AsMaV RNA1 nt 20–1588 sequenced in this study (18 from Finland and 7 from Sweden) and one previously reported isolate (GenBank Acc. No. LR742461) at the nucleotide level, AsMaV isolates were placed in five main groups (Figure 3). Swedish isolates clustered in one clade (R1–G6) with the exception of E58213 which was grouped with the two Finnish isolates of E58632 and E58491 in the clade R1–G1. Interestingly, two Finnish isolates, E58632 and E58491, were clustered with the Swedish isolates in R1–G6. The isolates E58426, E58433, E58405, E58434, E57260, and E58385 in group R1–G2, isolates E58386, E58389, and E58446 in group R1–G3, isolates E58397 and E58398 in the group R1–G4, and the isolates E58413, E58421, and E58439 in the group R1–G5 (Figure 3). The phylogenetic tree based on the first 517 deduced aa sequences of AsMaV–ORF1 was not phylogenetically informative.
Among the generated phylogenetic trees for RAN3, RNA4, partial RNA1 at nt level, ORF3, ORF4, and partial ORF1 at both nucleotide and amino acid levels, phylogenetic trees constructed for AsMaV-RNA3 were in accordance with each other with one difference: isolates E58632 and E58227 remained ungrouped. There was no difference between phylogenetic trees generated based on the ORF3 or almost full length of the RNA3 at the nt levels. Therefore, it might be concluded that among investigated regions RNA3-based trees, regardless of full length (at nt level) or coding regions (at nt or aa level), produce almost the same results. AsMaV isolates were better discriminated in constructed phylogenetic trees based on the nt level of the AsMaV–RNA4 and partial RNA1 rather than aa-based trees, because approximately 57% (15 out of 26) and 30% (8 out of 26) of the isolates used for these analyses remained ungrouped in the mentioned phylogenetic trees, respectively. The phylogenetic trees based on the ORF4 in comparison to the almost full-length RNA4-based tree at nt levels had ungrouped isolates (six out of twenty-six). As mentioned before, both CMP trees constructed based on the partial nt sequence of RNA1 were identical. Therefore, using the nt sequences of almost full-length RNA3–4 and longer region(s) of RNA1 to generate phylogenetic trees was more informative than amino acid sequences of related ORFs (Figure 2 and Figure 3).

3.5. Complete Sequence of AsMaV-RNA1, RNA2 and RNA 5 of Distant Isolates

Based on the information provided, it appears that the isolates Kuninkaansaari, Finland), and E58632 (Vallisaari, Finland) were more divergent from compared isolates of AsMaV. This divergence is observed at both the nucleotide and amino acid levels in the pairwise alignment and their position in the constructed phylogenetic trees based on the RNA3, RNA4, and partial RNA1. To investigate this further, the complete RNA1, RNA2, and RNA5 of these isolates were amplified by RT–PCR, cloned, and sequenced.
The size of RNA1 was 7103 in E58627 and 7106 nucleotides in E58632, respectively. An indel event of three nucleotides was observed at positions 4033–4036, resulting in a shorter RdRP protein in E57627. However, the RdRP protein in these isolates still showed 98–99% similarity with the published isolate E55089 from Sweden at the amino acid level. E58632 and E55809 showed 97% identity at the nucleotide level, while E58627 showed 88% identity with E55809 and E58632. The ORF1 in the sequenced isolates of AsMaV initiates with an AUG at position 46–48 and terminates with an UGA stop codon at position 6955–6957 or 6952–6954 (numbers based on the GenBank Acc. No. LR742464). The RdRP was 2302 aa in isolate E58627 and 2303 in isolate 2303, respectively.
Three different domains, N-terminus SRD–1, central region SRD–2, and C-terminus SRD–3 have been reported for RdRP in emaraviruses. The SRD–1 harbor endonuclease sequences like Peribunyaviridae polymerase and probably is involved in “cap-snatching”. It contains four motifs located between amino acids 66 and 242. The motifs RHD98–100 and TPD136–138 are universally conserved in different genera of Peribunyaviridae [61]. AsMaV RdRP sequences obtained in this study revealed that these motifs were in the same position and there were no changes in these motifs. But motifs EVK152–154 and KTDL158–161 vary across different genera [61]. It should be mentioned that the emaraviruses have been grouped based on their phylogenetic relations in four clades (A to D), in which AsMaV is placed in the clade A [6]. In clade A species of emaraviruses, the motif EVK152–154 has been reported for PPSMAV–1 and FMV and instead ELV152–154 was the dominant motif in most of the species including AsMaV (Figure 4). The KTDL158–161 motif was replaced by RKTDL158–161 in the RdRP of AsMaV isolates and could be R/H/N/Q/K158T159D/N160L161 in different species of Clade A species but its function is not yet determined.
The presence of six polymerase motifs (designated preA, A–E) characteristic of negative-strand RNA viruses [6,62] have been identified in the RdRP of AsMaV RNA1. This region was conserved in the sequenced isolates.
A similar domain-like reported motif of DPLTYNYWVMPTN1947–1959 in the C-terminus of PPSMV–1 RdRP in the SRD–3 domain was identified as DSRVYNHWLVPTN in the AsMaV isolates sequenced in this study. Probably, this motif is a cap-binding site (CBS).
The RNA2 was 2287 nucleotides in length and ORF2 spanned on the nucleotide positions 58–1983 encodes for GPP (641 aa). There were no indel events. The RNA2 and GPP in isolates E58627 and E58632 shared 91% and 95% identity with E55089 at the nucleotide and amino acid levels, respectively. The two sequenced isolates shared 98% and 99% identity, respectively. It has been reported that the GPP of emaraviruses contains a N-terminal signal sequence, transmembrane domains, potential glycosylation sites, and a motif also found in the glycoprotein precursors of phleboviruses (family Peribunyaviridae) [37,63] which is processed by a protease into two glycoproteins, a smaller N-terminal one (Gn), and a larger C-terminal one (Gc) (Figure 1). The N-terminal region of the GPP precursor of AsMaV contains a signal peptide cleavage site, located at amino acid position 22 and 23 (AYT↓SIR) (SignalP–6.0 Prediction https://services.healthtech.dtu.dk/services/SignalP-6.0/, 12 April 2023). The similar cleavage site has been reported for PPSMV–1 at the same position but different sequence (AMA↓S) [63]. The glycoprotein contains four transmembrane helices (TMH) at amino acid positions 2–24, 106–128, 170–187, and 580–599 (TMHMM v.2.0 on https://services.healthtech.dtu.dk/services/TMHMM-2.0/, 12 April 2023). However, six glycosylation sites have been reported for emaraviruses, AsMaV had five out of six sites at the amino acid position 62, 201, 239, 328, and 438 which were predicted by NetNGlyc–1.0 (https://services.healthtech.dtu.dk/services/NetNGlyc-1.0/, 12 April 2023). Interestingly, the Mucin-type O-glycosylation site which is a widespread post-translational modification of proteins found in the entire animal kingdom, but also in higher plants, has been detected in divergent isolates of AsMaV identified in this study (E58627 and E58632) at the amino acid position 549. This was not detected in the previously published isolate of AsMaV. Interestingly, Mucin-type O-glycosylation sites have been detected in actinidia virus–2 (AcV–2), another species of clade A emaraviruses, in the positions 209, 254, and 264. It should be mentioned that Mucin-type O-glycosylation is an evolutionarily conserved protein modification reported for membrane-bound and -secreted proteins which are involved in protein secretion, stability, processing, and function [64]. More analyzing should be performed to understand the effects of these variations.
The AsMaV RNA5 was 1324 nts in size in sequenced isolates of E58627 and E58632. In both isolates, the ORF5 initiated with a start codon of AUG at the positions 70–72 and terminated with a stop codon of UGA at the positions 811–813 encoding the P28 (247 aa) with unknown function. There was only one aa change in the divergent isolates in comparison to the reference isolate (E55809), S240 instead of N240. These sequences shared 98% and 99–100% identity with E55089 and each other, respectively, at nt and amino acid levels. It has been reported that the P28 of AsMaV shares a similar structure with ABC proteins [6].

3.6. Relation of Distant Isolates of AsMaV with Clade A Emaraviruses

AsMaV isolates shared a max. of 76%, 50%, 81%, and 80% identity in the pairwise alignment of RdRP, GPP, NP, and MP protein of clade A species of emaravirus in the RdRP, GPP, NP, and MP at the aa level being PiVB, PPSMV–1, PPSMV–2 the closest virus as confirmed with CMP trees. Therefore, the divergent isolates also belong to the AsMaV and did not belong to a new species.
In the phylogenetic tree based on the aa sequences of the encoded protein of AsMaV with corresponding proteins, the sequenced isolates of AsMaV were clustered in a distinct sub-cluster next to the PiVB. The GPP was not parsimoniously informative, since most of the species remained ungrouped. The RdRP-based tree had better discrimination of the species (Figure 5).
To study the phylogeny and genetic diversity of emaraviruses, researchers have utilized different regions of the emaravirus genome, each exhibiting varying degrees of variation among different emaravirus isolates. One particular region of interest is the NP due to its correspondence to the coat protein in non-enveloped viruses [29,30,31,32,33,34,35,36,37,38,39]. In certain emaraviruses, such as HPWMoV, the RNA3 or NP region has been identified as the most divergent genomic region, leading to the division of HPWMoV isolates into subgroups based on this region [28,39,65]. Similarly, for EMARaV, RNA3 exhibited more diversity than RNA4, and a phylogenetic tree based on RNA3 provided better discrimination among EMARaV isolates [38]. The NP has also been effectively used to assess genetic diversity and phylogenetic relationships among Serbian and Finnish isolates of RLBV, revealing a minimum of 93% identity [34]. In our study on AsMaV, our results indicate that the RNA3 or NP region is a promising candidate for investigating genetic diversity, followed by RdRP, which was found to be more phylogenetically informative than RNA4. For FMV, partial sequences of RNA1 to RNA4 were used to assess genetic diversity, revealing that RNA1 and RNA3 exhibited more variation than RNA4 [30]. Partial sequences of RNA1, RNA3, and RNA4 of MaMaV showed limited diversity, with RNA1 having a maximum of 2% variation and no diversity observed in the amplicons of RNA3 and RNA4 [32].
In contrast of our findings, it has been shown that the BLMaV-MP region displayed more variation than NP, possibly due to several detected recombination events in the coding region of BLMaV-MP [31]. Additionally, pairwise nucleotide comparisons of ORF1 to ORF6 of six JYMaV isolates revealed that RNA2 and RNA1 were the most divergent genomic regions when compared to ORF3 to ORF6 [36]. In the case of RRV, sequencing several full-length isolates demonstrated a conserved core genome, with RNA5 and RNA7 being the most divergent RNAs [35]. Notably, it is important to acknowledge that RRV-RNA5 is not equivalent to AsMaV-RNA5 due to inconsistencies in the nomenclature of genomic RNAs and proteins in emaraviruses [6].
The parsimony-based trees of the core genomic proteins of clade A species of emaraviruses indicated that RdRP is a better candidate for higher taxonomic levels (Figure 5). Partial RdRP at the nt level was also able to discriminate the AsMaV isolates. Similarly, most but not all of emaraviruses species could be grouped by amino acid sequences of NP. Therefore, it could be concluded that NP and RdRP are the best candidates for phylogenetic studies of emaraviruses for intra- and inter-species relationships, respectively.
These examples highlight the diversity of genetic patterns across emaraviruses and underscore the uniqueness of each virus species in terms of their genomic characteristics and interactions with hosts. To gain a more comprehensive understanding of the genetic diversity and population structures of emaraviruses, increasing the number of available sequence data from more isolates, especially from distant geographical regions, would be beneficial. Such efforts can provide a clearer and more comprehensive picture of the evolutionary dynamics and relationships among emaraviruses.

3.7. Population Genetic Parameters

The estimated nucleotide diversity parameters for AsMaV populations in Finland and Sweden, including RNA3, RNA4, and partial RNA1 are represented in Table 3. As shown in Table 3, the AsMaV population from Finland had higher numbers of segregating sites compared to those from Sweden in the sequenced regions of the virus genome. This suggests that Finnish isolates are more genetically diverse than those in Sweden. The nucleotide diversity parameters show significant differences between RNA3 and RNA4 and partial RNA1 in both populations. In general, RNA4 ( θ W = 0.031979 and P i T = 0.02738) has a lower level of nucleotide diversity than RNA3 ( θ W = 0.05876 and P i T = 0.04994) and partial RNA1 ( θ W = 0.07509 and P i T = 0.07777). These results also indicate that RNA3 and partial RNA1 populations are more diverse than RNA4 in both Finnish and Swedish populations.
For RNA3, Tajima’s D values were negative for both Sweden (−0.961799) and Finland (−0.961799), with the total also having a negative value (−0.579852). Fu and Li’s D* values were also negative. This suggests a recent population bottleneck or purifying selection, indicating an excess of rare isolates, which could be due to population subdivision or selection.
For RNA4, the population genetic parameters show relatively low levels of genetic diversity in both Swedish and Finnish populations. Again, the values of Theta ( θ W ) and P i T are relatively low (0.02633 to 0.03198), indicating that there is not much genetic variation within these populations based on this region. The Tajima’s D value for Swedish isolates was positive, but it was not confirmed with negative values of Fu and Li’s D test. The Tajima’s D value and Fu and Li’s D test value for the Finnish population were both negative, indicating an excess of low-frequency isolates as a result of recent population expansion or purifying selection (Table 3).
For partial RNA1, the population genetic parameters show relatively high levels of genetic diversity in samples obtained from both countries, Sweden and Finland, with higher values of Theta ( θ W was 0.05930 to 0.07777) and P i T (0.04447 to 0.08018) than in the other two genomic regions (Table 3). For partial RNA1, Tajima’s D values were negative for the Swedish population and positive for the Finnish population, with the total having a positive value. This suggests balancing selection or a recent population expansion. Fu and Li’s D* values were positive for Finnish isolates and negative for Swedish isolates and the total, indicating a possible selective sweep or population subdivision.
Overall, the population genetic parameters suggest that the genetic diversity in these populations was relatively low to moderate, with evidence of an excess of low-frequency isolates in some cases. These patterns could be due to recent population expansion or purifying selection. Additionally, natural selection may be acting to preserve protein function, as suggested by the low dn/ds ratio in some populations.

3.8. Population Differentiation Parameters

The results of population differentiation tests based on three different genomic regions of AsMaV (RNA3, RNA4, and Partial RNA1) among two different geographical groups, Swedish and Finnish isolates, are shown in Table 4. The null hypothesis was rejected for all differentiation tests, KST*, Z*, and Snn, supported by p-values less than 0.05 (Table 4), denoting significant genetic differentiation and structure between the two groups in all the genomic regions. The FST values are moderate (ranging from 0.06 to 0.23), indicating that there is moderate genetic differentiation among populations, but some gene flow is still occurring. Overall, these results suggest that the virus populations of Sweden and Finland are genetically differentiated to some extent, but gene flow is still occurring. The moderate FST values suggest that there may be some environmental factors, such as geographic distance or barriers, influencing the genetic differentiation among the virus populations.

3.9. Recombination Analysis

The unstable position of a few isolates in phylogenetic trees raised the idea that there may be recombination events between the sequenced isolates. Searching for recombination events in the RNA4, RNA3, and partial RNA1 of AsMaV (nucleotides 20–1570) using RDP5 software revealed that there were no recombinant events in RNA4 and RNA3. However, a recombination event with a low probability has been detected by RDP (E-value 3.956 × 10−3), MaxChi (E-value 1.118 × 10−2), and Phylopro (E-value 2.545 × 10−2) in the isolate E58213 (Sweden) with the major parent of E58627 (Finland) and minor parent of E58299 (Sweden) in nts 581–631 and nts 647–915. It should be mentioned again that the E-value and the number of methods that supported this event were not strong. The isolate E58213 was placed in a different clade when the tree was based on the MP. The grouping profile of few isolates such as E58299, E58405, E58413, E58421, and E58434 were different. Since there were no recombination events reported for these isolates, two different scenarios would be possible: (i) there were insufficient data from AsMaV isolates especially from different geographical regions or (ii) the other mechanisms rather than recombination, for example, reassortment or genetic drift, were responsible for the evolution of these isolates.
Several recombination events have been reported in the HPWMoV-RNA3, especially in Australian isolates. It has been shown that recombination and reassortment plays an important role in the diversity of HPWMoV [39]. Therefore, it is concluded that in addition to mutation, the reassortment of gene segments and recombination within gene segments also contribute to the genetic diversity of emaraviruses [28]. Moreover, it has been suggested that certain FMV strains may have emerged through reassortments of genomic RNAs from different isolates [30].

4. Conclusions

Virus evolution involves changes in the genetic structure of a viral population over time, which begins with mutation, recombination, and reassortment. Then, host and vector factors, along with selection pressures, expose plant viruses to genetic drift, leading to the emergence of new viral strains or species with novel biological properties. Some viruses adapt to specific hosts or groups of hosts, while some can adapt to diverse hosts, vectors, and environments [23,24,25]. Therefore, studying the genetic diversity of viruses is important. Our understanding of the role of viruses in wild plant populations is improving, and we are beginning to understand how viruses affect the invasiveness of introduced plant species [66]. Since the wild trees, especially old ones, could survive even after being infected with viruses, we might consider these long interactions as symbiosis or tolerance of the wild trees to cope with these infections. However, viruses could also cause a decline in trees [67]. Most of the reported emaraviruses are attacking forest trees [6].
Our findings resolved that AsMaV–NP is the suitable candidate for genetic diversity studies of this virus since phylogenetic analysis based on full-length RNA3 or NP sequences resulted in similar profiles, and could discriminate the AsMaV isolates better than other genomic regions under investigation in this study. These results support the idea that the NP protein is a good representative of AsMaV genetic diversity studies. Although the RT–PCR–RFLP with AanI profiling did not correlate with the geographical origin of the isolates, it identified different haplotypes in Kuninkaansaari and Vallisaari islands, indicating the presence of multiple virus isolates in these islands as a hotspot for emerging new isolates. In addition, the number of the restriction site of AanI could be a landmark of emerging more divergent isolates of AsMaV since the isolates E58627, E58632, E58213, and E57260 were separated from other isolates of AsMaV in different phylogenetic trees. However, these were signals of recombination or reassortment in the phylogenetic trees based on different genomic regions of AsMaV isolates, thus, RDP analyses were not able to offer strong evidence. There is a need for sequence data from other geographical areas to obtain a better understanding of recombination events in AsMaV isolates. Recombination and reassortment would be also a mechanism involved in emerging new isolates, especially in the case of multipartite plant viruses [68] such as tospoviruses [69], and the cucumber mosaic virus (CMV) [19,20]. There are different reports on the frequency of reassortment in natural populations of plant viruses [19,20]. In the case of emaraviruses, it has been suggested that certain FMV strains may have emerged through reassortments of genomic RNAs from different isolates [30]. It has been shown that recombination and reassortment play an important role in the diversity of HPWMoV [39]. Therefore, it can be concluded that there is a need of sequence data from more AsMaV isolates, especially from more geographical regions.
The genetic diversity parameters suggest that the Finnish population has higher genetic diversity compared to the Swedish population in all three analyzed genomic regions and these regions are under purifying selection. There is evidence of high selection pressure on AsMaV genome, but which was not consistent across different genomic regions. This pressure was almost equal on the N terminal region of RdRP (0.40) and MP (0.49), but it was at least five time less for NP (0.21). The nucleotide diversity of RNA3 and RNA4 of AsMaV were 0.059 and 0.32, respectively, which is higher than those reported for FMV (0.022 and 0.023) [30]. This range of nucleotide diversity was in a similar range for some other woody plant-invading viruses, such as CTV (0.038) and citrus leaf blotch virus (0.021) [30,70,71] but very low in comparison to grapevine fanleaf virus (0.15) [11,30,72,73]. The nucleotide diversity of blackberry leaf mottle-associated virus (BLMaV) was rather low when compared to FMV and European mountain ash ringspot-associated virus (EMARaV) [31]. Therefore, it could be concluded that the genomes of emaraviruses including AsMaV are relatively conserved.
The AsMaV isolates sequenced in this study were separated based on their geographical origin based on the phylogenetic trees. Deviations from the neutral equilibrium model for the three analyzed genes together with a combination of high haplotype diversity, overall low nucleotide diversity within Finnish and Swedish isolates, and negative values of Tajima’s and Fu and Li’s value indicate the existence of population expansions for AsMaV which is expected following genetic drift, which accompanies bottleneck transmission [74]. Evolutionary bottlenecks/founder effects may arise as a result of plant- or vector-associated effects [75]. Root suckering will generally occur in the aspen stands as they deteriorate or as they are disturbed by fire or other events. In addition, some species of gall mites have been identified as a natural vector for a few emaraviruses and for most emaraviruses such a natural vector has not been identified so far [6]. Occurrence of population expansions following a bottleneck event for other multi-segmented plant RNA virus have been reported [76], and suggest that the inferred differentiation between viral populations is because of demographically related events.
The Finnish and Swedish AsMaV isolates sequenced in this study displayed geographical structuring that may have arisen by founder effects. The purifying selection acts on low to moderate genetic diversity of AsMaV isolates to preserve function of the protein. Since emaraviruses have a narrow host range and, in most of reported emaraviruses, they have been reported only from one natural host, it might reduce some factors affecting the shape of the population in comparison with viruses with a wider host range. Only a few reports are available for natural vectors of emaraviruses. The reported mites have also a narrow host range which could subsequently cause the lower genetic diversity of the emaraviruses. In this context, the most diversity in AsMaV isolates was found in Kuninkaansaari and Vallisaari islands. There is a need for investigation of the mite species found on Populous spp. and their potential in transmissions of AsMaV. Distant isolates of AsMaV showed some changes in aa sequences in comparison to other isolates and bioinformatic data predicted a post-translational Mucin-type O-glycosylation site in GPP which could happen because of interactions with the natural vector. Further studies are needed to investigate whether the mutations identified in the sequence variants of AsMaV RNA1, RNA3, and RNA4 affect the virus replication cycle, the severity of symptoms expressed by virus, may contribute to the decline in trees, or have an impact on the interaction with the assumed mite vectors of the virus. The functional characterization of the AsMaV-host–vector relationships will contribute to our understanding of the ecological position of the virus, and are a prerequisite to handle and prevent the damages including the decline that may be caused by the virus, especially when considering additional stress factors such as the predisposition and impacts of climate change.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/v15081678/s1, Table S1: list of collected samples; Table S2: characteristics of the oligonucleotides used in this study; Table S3: (a) the GenBank accession numbers of the isolates sequenced in this study; (b) the GenBank Accession numbers for the complete genome. Figure S1: distribution of AanI restrictotypes of AsMaV-RNA3 on a geographical map.

Author Contributions

Conceptualization, S.N.Z.; methodology, S.N.Z.; software, S.N.Z.; validation, S.N.Z.; formal analysis, S.N.Z.; investigation, S.N.Z., R.A.K. and S.I.; resources, C.B., R.J. and S.v.B.; writing—original draft preparation, S.N.Z.; writing—review and editing, S.N.Z., S.v.B., C.B., R.J., S.I. and R.A.K.; visualization, S.N.Z.; supervision, S.N.Z., S.v.B. and C.B.; project administration, C.B. and S.v.B.; funding acquisition, C.B. and R.A.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Einstein Foundation Berlin, grant numbers EGP–2018–476; EJS–2019–476–2. Financial support of our scientific collaboration with Johanna Witzell (LNU, Växjö, Sweden) through COST action CA20132 (UB3Guard) is kindly acknowledged. The article processing charge was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)—491192747 and the Open Access Publication Fund of Humboldt-Universität zu Berlin.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The sequence data have been submitted to GenBank.

Acknowledgments

We are grateful to Johanna Witzell (LNU, Växjö, Sweden) and Martina Bandte (Humboldt-Universität zu Berlin, Germany) for supporting our sampling especially in the Swedish locations.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Caudullo, G.; De Rigo, D. Populus tremula in Europe: Distribution, habitat, usage and threats. In European Atlas of Forest Tree Species; San-Miguel-Ayanz, J., de Rigo, D., Caudullo, G., Houston Durrant, T., Mauri, A., Eds.; Publications Office of the European Union: Luxembourg, 2016. [Google Scholar]
  2. Rogers, P.C.; Pinno, B.D.; Šebesta, J.; Albrectsen, B.R.; Li, G.; Ivanova, N.; Kusbach, A.; Kuuluvainen, T.; Landhäusser, S.M.; Liu, H.; et al. A global view of aspen: Conservation science for widespread keystone systems. Glob. Ecol. Conserv. 2020, 21, e00828. [Google Scholar] [CrossRef]
  3. Kivinen, S.; Koivisto, E.; Keski-Saari, S.; Poikolainen, L.; Tanhuanpää, T.; Kuzmin, A.; Viinikka, A.; Heikkinen, R.K.; Pykälä, J.; Virkkala, R.; et al. A keystone species, European aspen (Populus tremula L.), in boreal forests: Ecological role, knowledge needs and mapping using remote sensing. For. Ecol. Manag. 2020, 462, 118008. [Google Scholar] [CrossRef]
  4. Jactel, H.; Koricheva, J.; Castagneyrol, B. Responses of forest insect pests to climate change: Not so simple. Curr. Opin. Insect Sci. 2019, 35, 103–108. [Google Scholar] [CrossRef]
  5. von Bargen, S.; Al Kubrusli, R.; Gaskin, T.; Fürl, S.; Hüttner, F.; Blystad, D.R.; Karlin, D.G.; Jalkanen, R.; Büttner, C. Characterisation of a novel Emaravirus identified in mosaic-diseased Eurasian aspen (Populus tremula). Ann. Appl. Biol. 2020, 176, 210–222. [Google Scholar] [CrossRef]
  6. Rehanek, M.; Karlin, D.G.; Bandte, M.; Al Kubrusli, R.; Nourinejhad Zarghani, S.; Candresse, T.; Büttner, C.; von Bargen, S. The Complex World of Emaraviruses—Challenges, Insights, and Prospects. Forests 2022, 13, 1868. [Google Scholar] [CrossRef]
  7. Domingo, E.; Escarmís, C.; Sevilla, N.; Moya, A.; Elena, S.F.; Quer, J.; Novella, I.S.; Holland, J.J. Basic concepts in RNA virus evolution. FASEB J. 1996, 10, 859–864. [Google Scholar] [CrossRef] [PubMed]
  8. Drake, J.W.; Holland, J.J. Mutation rates among RNA viruses. Proc. Natl. Acad. Sci. USA 1999, 96, 13910–13913. [Google Scholar] [CrossRef]
  9. Gago, S.; Elena, S.F.; Flores, R.; Sanjuán, R. Extremely high mutation rate of a hammerhead viroid. Science 2009, 323, 1308. [Google Scholar] [CrossRef] [Green Version]
  10. Nourinejhad Zarghani, S.; Shams-Bakhsh, M.; Zand, N.; Sokhandan-Bashir, N.; Pazhouhandeh, M. Genetic analysis of Iranian population of Potato leafroll virus based on ORF0. Virus Genes. 2012, 45, 567–574. [Google Scholar] [CrossRef]
  11. Nourinejhad Zarghani, S.; Shams-Bakhsh, M.; Bashir, N.S.; Wetzel, T. Molecular Characterization of Whole Genomic RNA2 From Iranian Isolates of Grapevine Fanleaf Virus. J. Phytopathol. 2013, 161, 419–425. [Google Scholar] [CrossRef]
  12. Wang, Y.; Gaba, V.; Yang, J.; Palukaitis, P.; Gal-On, A. Characterization of Synergy Between Cucumber mosaic virus and Potyviruses in Cucurbit Hosts. Phytopathology 2002, 92, 51–58. [Google Scholar] [CrossRef] [Green Version]
  13. Wintermantel, W.M. Co-infection of beet mosaic virus with beet yellowing viruses leads to increased symptom expression on sugar beet. Plant Dis. 2005, 89, 325–331. [Google Scholar] [CrossRef] [Green Version]
  14. Murphy, J.F.; Bowen, K.L. Synergistic disease in pepper caused by the mixed infection of cucumber mosaic virus and Pepper mottle virus. Phytopathology 2006, 96, 240–247. [Google Scholar] [CrossRef] [Green Version]
  15. Untiveros, M.; Fuentes, S.; Salazar, L.F. Synergistic interaction of sweet potato chlorotic stunt virus (Crinivirus) with carla-, cucumo-, ipomo-, and potyviruses infecting sweet potato. Plant Dis. 2007, 91, 669–676. [Google Scholar] [CrossRef] [Green Version]
  16. Syller, J. Facilitative and antagonistic interactions between plant viruses in mixed infections. Mol. Plant Pathol. 2012, 13, 204–216. [Google Scholar] [CrossRef]
  17. Moreno, A.B.; López-Moya, J.J. When viruses play team sports: Mixed infections in plants. Phytopathology 2020, 110, 29–48. [Google Scholar] [CrossRef]
  18. Neumann, G.; Noda, T.; Kawaoka, Y. Emergence and pandemic potential of swine-origin H1N1 influenza virus. Nature 2009, 459, 931–939. [Google Scholar] [CrossRef] [Green Version]
  19. Henderson, W.W.; Monroe, M.C.; St Jeor, S.C.; Thayer, W.P.; Rowe, J.E.; Peters, C.J.; Nichol, S.T. Naturally occurring Sin Nombre virus genetic reassortants. Virology 1995, 214, 602–610. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Fraile, A.; Alonso-Prados, J.L.; Aranda, M.A.; Bernal, J.J.; Malpica, J.M.; Garcia-Arenal, F. Genetic exchange by recombination or reassortment is infrequent in natural populations of a tripartite RNA plant virus. J. Virol. 1997, 71, 934–940. [Google Scholar] [CrossRef] [PubMed]
  21. Schnippenkoetter, W.; Martin, D.; Willment, J.; Rybicki, E. Forced recombination between distinct strains of maize streak virus. J. Gen. Virol. 2001, 82, 3081–3090. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Shepherd, D.N.; Martin, D.P.; Van der Walt, E.; Dent, K.; Varsani, A.; Rybicki, E.P. Maize streak virus: An old and complex ‘emerging’pathogen. Mol. Plant Pathol. 2010, 11, 1–12. [Google Scholar] [CrossRef]
  23. Butković, A.; González, R. A brief view of factors that affect plant virus evolution. Front. Virol. 2022, 2, 994057. [Google Scholar] [CrossRef]
  24. Dolan, P.T.; Whitfield, Z.J.; Andino, R. Mechanisms and Concepts in RNA Virus Population Dynamics and Evolution. Annu. Rev. Virol. 2018, 5, 69–92. [Google Scholar] [CrossRef] [PubMed]
  25. Pagán, I. The diversity, evolution and epidemiology of plant viruses: A phylogenetic view. Infect. Genet. Evol. 2018, 65, 187–199. [Google Scholar] [CrossRef] [PubMed]
  26. Roossinck, M.J. Mechanisms of plantvirus evolution. Annu. Rev. Phytopathol. 1997, 35, 191–209. [Google Scholar] [CrossRef] [PubMed]
  27. Sanjuán, R.; Domingo-Calap, P. Mechanisms of viral mutation. Cell. Mol. Life Sci. 2016, 73, 4433–4448. [Google Scholar] [CrossRef] [Green Version]
  28. Jones, R.A.C.; Vazquez-Iglesias, I.; McGreig, S.; Fox, A.; Gibbs, A.J. Genomic High Plains wheat mosaic virus sequences from Australia: Their phylogenetics and evidence for emaravirus recombination and reassortment. Viruses 2023, 15, 401. [Google Scholar] [CrossRef]
  29. Danesh-Amuz, S.; Rakhshandehroo, F.; Rezaee, S. Prevalence and genetic diversity of fig mosaic virus isolates infecting fig tree in Iran. Acta Virol. 2014, 58, 245–252. [Google Scholar] [CrossRef] [Green Version]
  30. Walia, J.J.; Willemsen, A.; Elci, E.; Caglayan, K.; Falk, B.W.; Rubio, L. Genetic variation and possible mechanisms driving the evolution of worldwide Fig mosaic virus isolates. Phytopathology 2014, 104, 108–114. [Google Scholar] [CrossRef] [Green Version]
  31. Hassan, M.; Tzanetakis, I.E. Population structure, evolution and detection of blackberry leaf mottle-associated virus, an emerging emaravirus. Plant Pathol. 2019, 68, 775–782. [Google Scholar] [CrossRef]
  32. Rumbou, A.; Candresse, T.; von Bargen, S.; Buttner, C. Next-generation sequencing reveals a novel emaravirus in diseased maple trees from a german urban forest. Front. Microbiol. 2020, 11, 621179. [Google Scholar] [CrossRef]
  33. Dong, L.; Lemmetty, A.; Latvala, S.; Samuilova, O.; Valkonen, J.P.T. Occurrence and genetic diversity of raspberry leaf blotch virus (RLBV) infecting cultivated and wild Rubus species in Finland. Ann. Appl. Biol. 2016, 168, 122–132. [Google Scholar] [CrossRef]
  34. Jevremović, D.; Leposavić, A.; Paunović, S.A. Genetic diversity of raspberry leaf blotch emaravirus in red raspberries from Serbia. Span. J. Agric. Res. 2019, 17, e1004. [Google Scholar] [CrossRef]
  35. Verchot, J.; Herath, V.; Jordan, R.; Hammond, J. Genetic diversity among rose rosette virus isolates: A roadmap towards studies of gene function and pathogenicity. Pathogens 2023, 12, 707. [Google Scholar] [CrossRef]
  36. Guo, J.; Wang, Y.; Wang, G.; Hong, J.; Yang, Z.; Bai, J.; Hong, N. Molecular characteristics of jujube yellow mottle-associated virus infecting jujube (Ziziphus jujuba Mill.) Grown at Aksu in Xinjiang of China. Viruses 2020, 13, 25. [Google Scholar] [CrossRef] [PubMed]
  37. Kubota, K.; Usugi, T.; Tomitaka, Y.; Shimomoto, Y.; Takeuchi, S.; Kadono, F.; Yanagisawa, H.; Chiaki, Y.; Tsuda, S. Perilla mosaic virus is a highly divergent emaravirus transmitted by Shevtchenkella sp. (Acari: Eriophyidae). Phytopathology 2020, 110, 1352–1361. [Google Scholar] [CrossRef]
  38. Roßbach, J.; Dieckmann, H.L.; Büttner, T.; Mühlbach, H.-P.; von Bargen, S.; Büttner, C. Genetic variability and phylogeny of european mountain ash ringspot-associated virus RNA3 and RNA4. Forests 2015, 6, 4072–4087. [Google Scholar] [CrossRef] [Green Version]
  39. Tatineni, S.; Hein, G.L. High Plains wheat mosaic virus: An enigmatic disease of wheat and corn causing the High Plains disease. Mol. Plant Pathol. 2021, 22, 1167–1179. [Google Scholar] [CrossRef]
  40. Jacquot, M.; Wallace, M.A.; Streicker, D.G.; Biek, R. Geographic range overlap rather than phylogenetic distance explains rabies virus transmission among closely related bat species. Viruses 2022, 14, 2399. [Google Scholar] [CrossRef]
  41. Kuzmin, I.V.; Hughes, G.J.; Rupprecht, C.E. Phylogenetic relationships of seven previously unclassified viruses within the family Rhabdoviridae using partial nucleoprotein gene sequences. J. Gen. Virol. 2006, 87, 2323–2331. [Google Scholar] [CrossRef]
  42. Pappu, H.R.; du Toit, L.J.; Schwartz, H.F.; Mohan, S.K. Sequence diversity of the nucleoprotein gene of iris yellow spot virus (genus Tospovirus, family Bunyaviridae) isolates from the western region of the United States. Arch. Virol. 2006, 151, 1015–1025. [Google Scholar] [CrossRef]
  43. Herath, V.; Romay, G.; Urrutia, C.D.; Verchot, J. Family level phylogenies reveal relationships of plant viruses within the order Bunyavirales. Viruses 2020, 12, 1010. [Google Scholar] [CrossRef]
  44. Šantak, M.; Matić, Z. The role of nucleoprotein in immunity to human negative-stranded rna viruses—not just another brick in the viral nucleocapsid. Viruses 2022, 14, 521. [Google Scholar]
  45. Mielke-Ehret, N.; Muhlbach, H.P. Emaravirus: A novel genus of multipartite, negative strand RNA plant viruses. Viruses 2012, 4, 1515–1536. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Boom, R.; Sol, C.; Salimans, M.; Jansen, C.; Wertheim-van Dillen, P.; Van der Noordaa, J. Rapid and simple method for purification of nucleic acids. J. Clin. Microbiol. 1990, 28, 495–503. [Google Scholar] [CrossRef] [PubMed]
  47. Elbeaino, T.; Whitfield, A.; Sharma, M.; Digiaro, M. Emaravirus-specific degenerate PCR primers allowed the identification of partial RNA-dependent RNA polymerase sequences of Maize red stripe virus and Pigeonpea sterility mosaic virus. J. Virol. Methods 2013, 188, 37–40. [Google Scholar] [CrossRef] [Green Version]
  48. Di Bello, P.L.; Ho, T.; Tzanetakis, I.E. The evolution of emaraviruses is becoming more complex: Seven segments identified in the causal agent of Rose rosette disease. Virus Res. 2015, 210, 241–244. [Google Scholar] [CrossRef]
  49. Tamura, K.; Stecher, G.; Kumar, S. MEGA11: Molecular evolutionary genetics analysis version 11. Mol. Biol. Evol. 2021, 38, 3022–3027. [Google Scholar] [CrossRef]
  50. Felsenstein, J. PHYLIP (Phylogeny Inference Package), version 3.6, 2004. Distributed by the author; Department of Genome Sciences, University of Washington: Seattle, WA, USA, 2005. [Google Scholar]
  51. Rozas, J.; Ferrer-Mata, A.; Sánchez-DelBarrio, J.C.; Guirao-Rico, S.; Librado, P.; Ramos-Onsins, S.E.; Sánchez-Gracia, A. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol. Biol. Evol. 2017, 34, 3299–3302. [Google Scholar] [CrossRef] [PubMed]
  52. Martin, D.P.; Varsani, A.; Roumagnac, P.; Botha, G.; Maslamoney, S.; Schwab, T.; Kelz, Z.; Kumar, V.; Murrell, B. RDP5: A computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets. Virus Evol. 2020, 7, veaa087. [Google Scholar] [CrossRef]
  53. Bandelt, H.-J.; Forster, P.; Röhl, A. Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 1999, 16, 37–48. [Google Scholar] [CrossRef]
  54. French, N.; Yu, S.; Biggs, P.; Holland, B.; Fearnhead, P.; Binney, B.; Fox, A.; Grove-White, D.; Leigh, J.W.; Miller, W.; et al. Evolution of Campylobacter species in New Zealand. In Campylobacter Ecology and Evolution; Sheppard, S.K., Méric, G., Eds.; Caister Academic Press: Norfolk, England, 2014; pp. 221–240. [Google Scholar]
  55. Clement, M.; Snell, Q.; Walker, P.; Posada, D.; Crandall, K. TCS: Estimating gene genealogies. Parallel Distrib. Process. Symp. Int. Proc. 2002, 2, 184. [Google Scholar]
  56. Hull, R. Matthews’ Plant Virology; Academic Press: San Diego, CA, USA, 2002; Volume 1056, p. 1001. [Google Scholar]
  57. Elbeaino, T.; Digiaro, M.; Uppala, M.; Sudini, H. Deep sequencing of dsRNAs recovered from mosaic-diseased pigeonpea reveals the presence of a novel emaravirus: Pigeonpea sterility mosaic virus 2. Arch. Virol. 2015, 160, 2019–2029. [Google Scholar] [CrossRef] [PubMed]
  58. Elbeaino, T.; Digiaro, M.; Martelli, G.P. Complete nucleotide sequence of four RNA segments of fig mosaic virus. Arch. Virol. 2009, 154, 1719–1727. [Google Scholar] [CrossRef] [PubMed]
  59. Yu, C.; Karlin, D.G.; Lu, Y.; Wright, K.; Chen, J.; MacFarlane, S. Experimental and bioinformatic evidence that raspberry leaf blotch emaravirus P4 is a movement protein of the 30K superfamily. J. Gen. Virol. 2013, 94, 2117–2128. [Google Scholar] [CrossRef]
  60. McGavin, W.J.; Mitchell, C.; Cock, P.J.A.; Wright, K.M.; MacFarlane, S.A. Raspberry leaf blotch virus, a putative new member of the genus Emaravirus, encodes a novel genomic RNA. J. Gen. Virol. 2012, 93, 430–437. [Google Scholar] [CrossRef]
  61. Reguera, J.; Weber, F.; Cusack, S. Bunyaviridae RNA polymerases (L-Protein) have an n-terminal, influenza-like endonuclease domain, essential for viral Cap-dependent transcription. PLoS Path. 2010, 6, e1001101. [Google Scholar] [CrossRef] [PubMed]
  62. Müller, R.; Poch, O.; Delarue, M.; Bishop, D.; Bouloy, M. Rift Valley fever virus L segment: Correction of the sequence and possible functional role of newly identified regions conserved in RNA-dependent polymerases. J. Gen. Virol. 1994, 75, 1345–1352. [Google Scholar] [CrossRef]
  63. Kumar, S.; Subbarao, B.; Hallan, V. Molecular characterization of emaraviruses associated with pigeonpea sterility mosaic disease. Sci. Rep. 2017, 7, 11831. [Google Scholar] [CrossRef] [Green Version]
  64. Tran, D.T.; Ten Hagen, K.G. Mucin-type O-glycosylation during development. J. Biol. Chem. 2013, 288, 6921–6929. [Google Scholar] [CrossRef] [Green Version]
  65. Stewart, L.R. Sequence diversity of wheat mosaic virus isolates. Virus Res. 2016, 213, 299–303. [Google Scholar] [CrossRef] [Green Version]
  66. Roossinck, M.J. Plants, viruses and the environment: Ecology and mutualism. Virology 2015, 479-480, 271–277. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Büttner, C.; Landgraf, M.; Fernandez Colino, H.L.; von Bargen, S.; Bandte, M. Chapter 3-Virus diseases of forest and urban trees. In Forest Microbiology; Asiegbu, F.O., Kovalchuk, A., Eds.; Academic Press: Cambridge, MA, USA, 2023; Volume 3, pp. 61–97. [Google Scholar]
  68. Savory, F.R.; Varma, V.; Ramakrishnan, U. Identifying geographic hot spots of reassortment in a multipartite plant virus. Evol. Appl. 2014, 7, 569–579. [Google Scholar] [CrossRef] [PubMed]
  69. Zheng, K.; Liu, H.; Yin, Y.; Chen, T.-C.; Yeh, S.-D.; Zhang, Z.; Dong, J. Full-length M and L RNA sequences of tospovirus isolate 2009-GZT, which causes necrotic ringspot on tomato in China. Arch. Virol. 2016, 161, 1411–1414. [Google Scholar] [CrossRef] [PubMed]
  70. Kallinen, A.; Lindberg, I.; Tugume, A.; Valkonen, J. Detection, distribution, and genetic variability of European mountain ash ringspot-associated virus. Phytopathology 2009, 99, 344–352. [Google Scholar] [CrossRef] [Green Version]
  71. Thekke-Veetil, T.; Polashock, J.J.; Marn, M.V.; Plesko, I.M.; Schilder, A.C.; Keller, K.E.; Martin, R.R.; Tzanetakis, I.E. Population structure of blueberry mosaic associated virus: Evidence of reassortment in geographically distinct isolates. Virus Res. 2015, 201, 79–84. [Google Scholar] [CrossRef]
  72. Nourinejhad Zarghani, S.; Dupuis-Maguiraga, L.; Bassler, A.; Wetzel, T. Mapping of the exchangeable and dispensable domains of the RNA 2-encoded 2AHP protein of arabis mosaic nepovirus. Virology 2014, 458–459, 106–113. [Google Scholar] [CrossRef]
  73. Sokhandan Bashir, N.; Nourinejhad Zarghani, S.; Hejazi, M.S. Diversity of grapevine fanleaf virus isolates from Iran. Virus Res. 2007, 128, 144–148. [Google Scholar] [CrossRef]
  74. Novella, I.S.; Elena, S.F.; Moya, A.; Domingo, E.; Holland, J.J. Size of genetic bottlenecks leading to virus fitness loss is determined by mean initial population fitness. J. Virol. 1995, 69, 2869–2872. [Google Scholar] [CrossRef]
  75. Bergstrom, C.T.; McElhany, P.; Real, L.A. Transmission bottlenecks as determinants of virulence in rapidly evolving pathogens. Proc. Natl. Acad. Sci. USA 1999, 96, 5095–5100. [Google Scholar] [CrossRef]
  76. Tsompana, M.; Abad, J.; Purugganan, M.; Moyer, J.W. The molecular population genetics of the tomato spotted wilt virus (TSWV) genome. Mol. Ecol. 2005, 14, 53–66. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Nucleotide sequence alignment of AsMaV ORF3 termini reveals the presence of three distinct stop codons in the sequenced isolates. To highlight these stop codons, they have been marked with gray rectangular regions accompanied by the label “Stop” at the top. Notably, an indel event involving the insertion of three nucleotides, specifically “GAG”, has been observed in isolates E58213, E58627, and E57260, occurring immediately prior to the termination codon. These significant positions have been duly identified using asterisks. Furthermore, experimental confirmation of the AanI restriction site downstream of the respective stop codons has been obtained, and their corresponding positions in the alignment are indicated as AanI site.
Figure 1. Nucleotide sequence alignment of AsMaV ORF3 termini reveals the presence of three distinct stop codons in the sequenced isolates. To highlight these stop codons, they have been marked with gray rectangular regions accompanied by the label “Stop” at the top. Notably, an indel event involving the insertion of three nucleotides, specifically “GAG”, has been observed in isolates E58213, E58627, and E57260, occurring immediately prior to the termination codon. These significant positions have been duly identified using asterisks. Furthermore, experimental confirmation of the AanI restriction site downstream of the respective stop codons has been obtained, and their corresponding positions in the alignment are indicated as AanI site.
Viruses 15 01678 g001
Figure 2. Consensus maximum parsimony trees constructed based on the nucleotide sequence of RNA3 (A), the open reading frame at nucleotide level 3 (B) as well as amino acid sequence of NP protein (C). Nodes with bootstrap support of less than 70% collapsed. The RNA3 sequences of pigeonpea sterility mosaic virus 2 (PPSMV–2) (Emaravirus toordali) (NC_030661), fig mosaic virus (FMV) (Emaravirus fici) (NC_029563), and pistacia virus B (PiVB) (Emaravirus pistaciae) (MH727574) were used as out-groups in the analyses. The NP protein sequences of PPSMV–2 (YP_009268864), FMV (YP_009237270), and PiVB (QAR18004) were used as out-groups in the amino acid-based tree of NP protein. The geographical origin and accession number of each isolate are presented next to the isolate name, as well as in Supplementary Table S3. Isolates from Sweden and Finland are shown in red and black colors, respectively, and the out-groups are in blue.
Figure 2. Consensus maximum parsimony trees constructed based on the nucleotide sequence of RNA3 (A), the open reading frame at nucleotide level 3 (B) as well as amino acid sequence of NP protein (C). Nodes with bootstrap support of less than 70% collapsed. The RNA3 sequences of pigeonpea sterility mosaic virus 2 (PPSMV–2) (Emaravirus toordali) (NC_030661), fig mosaic virus (FMV) (Emaravirus fici) (NC_029563), and pistacia virus B (PiVB) (Emaravirus pistaciae) (MH727574) were used as out-groups in the analyses. The NP protein sequences of PPSMV–2 (YP_009268864), FMV (YP_009237270), and PiVB (QAR18004) were used as out-groups in the amino acid-based tree of NP protein. The geographical origin and accession number of each isolate are presented next to the isolate name, as well as in Supplementary Table S3. Isolates from Sweden and Finland are shown in red and black colors, respectively, and the out-groups are in blue.
Viruses 15 01678 g002
Figure 3. The consensus maximum parsimony trees constructed based on the nucleotide sequences of partial RNA1 (left) and RNA4 (right). Nodes with bootstrap support less than 70% were collapsed. Corresponding regions of RNA1 sequences of PPSMV–2, FMV, and PiVB with the accession numbers NC_030660, NC_029560, and MH727572, respectively, as well as those of full-length RNA4 with the accession numbers NC_030663, NC_029564, and MH727575, were used as out-groups in the analyses. The geographical origin and accession number of each isolate are presented next to the isolate name, as well as in Supplementary Table S3. Isolates from Sweden and Finland are shown in red and black colors, respectively. The out-groups are in blue.
Figure 3. The consensus maximum parsimony trees constructed based on the nucleotide sequences of partial RNA1 (left) and RNA4 (right). Nodes with bootstrap support less than 70% were collapsed. Corresponding regions of RNA1 sequences of PPSMV–2, FMV, and PiVB with the accession numbers NC_030660, NC_029560, and MH727572, respectively, as well as those of full-length RNA4 with the accession numbers NC_030663, NC_029564, and MH727575, were used as out-groups in the analyses. The geographical origin and accession number of each isolate are presented next to the isolate name, as well as in Supplementary Table S3. Isolates from Sweden and Finland are shown in red and black colors, respectively. The out-groups are in blue.
Viruses 15 01678 g003
Figure 4. Motifs reported for RdRP of emaraviruses from phylogenetic clade A. SDR–1, SDR–2, and SDR–3 are the three main regions reported for RdRP in emaraviruses. SDR–1 domain contains the endonuclease motifs and might be involved in “cap–snatching”. SDR–2 contains the polymerase motifs designated motifs A–E, and SDR–3 is a Cap-binding domain.
Figure 4. Motifs reported for RdRP of emaraviruses from phylogenetic clade A. SDR–1, SDR–2, and SDR–3 are the three main regions reported for RdRP in emaraviruses. SDR–1 domain contains the endonuclease motifs and might be involved in “cap–snatching”. SDR–2 contains the polymerase motifs designated motifs A–E, and SDR–3 is a Cap-binding domain.
Viruses 15 01678 g004
Figure 5. Constructed maximum parsimony trees based on the amino acid sequences of RdRP (A), GPP (B), NP (C), and MP (D) proteins. The evolutionary history was inferred using the Minimum Evolution method. The bootstrap consensus tree inferred from 1000 replicates is taken to represent the evolutionary history of the taxa analyzed. Branches corresponding to partitions reproduced in less than 70% of bootstrap replicates are collapsed. Trees were constructed by MEGA11 [49]. The AsMaV isolates are presented in red color.
Figure 5. Constructed maximum parsimony trees based on the amino acid sequences of RdRP (A), GPP (B), NP (C), and MP (D) proteins. The evolutionary history was inferred using the Minimum Evolution method. The bootstrap consensus tree inferred from 1000 replicates is taken to represent the evolutionary history of the taxa analyzed. Branches corresponding to partitions reproduced in less than 70% of bootstrap replicates are collapsed. Trees were constructed by MEGA11 [49]. The AsMaV isolates are presented in red color.
Viruses 15 01678 g005
Table 1. Position and frequency of AanI restriction site on AsMaV–RNA3 of 34, 47, and 81 collected samples from Finland (F), Sweden (S), and in total (T), respectively. The position of the nucleotides is based on AsMaV–RNA3 sequence of isolate E55089 (GenBank accession number LR742463). The black line represents the genomic RNA and orange rectangle shows the position of the ORF3, and the vertical red lines are denoting the position of AanI restriction site on the RNA3 (nucleotides: 627, 1069, 1142, 1406, 1474).
Table 1. Position and frequency of AanI restriction site on AsMaV–RNA3 of 34, 47, and 81 collected samples from Finland (F), Sweden (S), and in total (T), respectively. The position of the nucleotides is based on AsMaV–RNA3 sequence of isolate E55089 (GenBank accession number LR742463). The black line represents the genomic RNA and orange rectangle shows the position of the ORF3, and the vertical red lines are denoting the position of AanI restriction site on the RNA3 (nucleotides: 627, 1069, 1142, 1406, 1474).
RestrictotypePosition of AanI Site on cRNA3No. Cut 1No. Fr. 2Size of Fragments (bp)Frequency of Restrictotypes (%) 3
SFT
Type IViruses 15 01678 i001011587 to 15891.26.17.3
Type IIViruses 15 01678 i002121474, and 1150.01.21.2
Type IIIViruses 15 01678 i00312962, and 6272.43.76.1
Type IVViruses 15 01678 i00423627, 521, and 4420.04.94.9
Type VViruses 15 01678 i00534627, 515, 266, and 1821.22.43.7
Type VIViruses 15 01678 i00634627, 442, 338, and 1823.71.24.9
Type VIIViruses 15 01678 i00745627, 442, 264, 182, and 7332.939.072.0
1 number of DNA restriction enzyme sites; 2 number of fragments recorded on electrophoresis after digestion of the RT–PCR with AanI restriction enzyme; 3 the percentages of each restrictotype were calculated based on the total number of collected isolates in this study, the list of which, along with their corresponding restrictotype groups, is presented in Supplementary Table S1 and the geographical distribution of restrictotypes is presented in the Supplementary Figure S1.
Table 2. The identities of the RNA3, RNA4 and partial RNA1 at nucleotide (nt) and amino acid (aa) levels among the 18 Finnish and seven to eight Swedish isolates sequenced in this study.
Table 2. The identities of the RNA3, RNA4 and partial RNA1 at nucleotide (nt) and amino acid (aa) levels among the 18 Finnish and seven to eight Swedish isolates sequenced in this study.
Origin of IsolatesGenomic Region
RNA3RNA4Partial RNA1
5′ UTRORF33′ UTR5′ UTRORF43′ UTR5′ UTRORF1
Finlandnt95–10088.9–10080.8–98.897.6–10095.9–99.982.4–10096.5–10085.2–99.7
aa 88.4–100 96.9–100 96.6–100
Swedennt94–10089.4–99.680–98.810095.9–99.980–98.897.6–10087.4–98.7
aa 91.8–100 97.5–100 97.1–99.7
Values for amino acid level are shown in bold.
Table 3. Results from population parameters estimated based on demographic trends.
Table 3. Results from population parameters estimated based on demographic trends.
Genomic RegionsGeographical GroupmS θ w P i T P i s P i a P i a / P i s Tajima’s DFu and Li’s D
RNA3
Sweden92750.06347 ± (0.02651)0.04735 ± (0.01764)0.119840.021150.176−1.322703−1.56810
Finland182980.05435 ± (0.01881)0.04207 ± (0.01138)0.111800.024320.217−0.961799−0.47563
Total273610.05876 ± (0.01844)0.04994 ± (0.00834)0.128430.026950.210−0.579852−0.60644
RNA4
Sweden81190.02948 ± (0.01286)0.03048 ± (0.00391)0.094850.005420.0570.186391−0.14740
Finland181410.02633 ± (0.00929)0.02479 ± (0.00237)0.089770.004180.047−0.246562−0.29650
Total261900.03198 ± (0.01032)0.02738 ± (0.00187)0.094060.004600.049−0.568813−0.66359
Partial RNA1
Sweden82420.05930 ± (0.02564)0.04447 ± (0.01850)0.196650.009340.048−1.362813−1.50039
Finland184170.07702 ± (0.02673)0.08018 ± (0.00753)0.407300.016030.0390.1747210.10686
Total264510.07509 ± (0.02390)0.07777 ± (0.00663)0.391340.015540.0400.153180−0.04648
The abbreviations used are as follows: m, number of sites; S, number of segregating sites; θ W = Waterson’s Theta; P i T = total nucleotide diversity; P i s = the number of synonymous substitutions per synonymous site; P i a = the number of nonsynonymous substitutions per nonsynonymous site.
Table 4. Geneflow and genetic differentiation parameters.
Table 4. Geneflow and genetic differentiation parameters.
Genomic RegionPopulationsKST*p-ValueZ*p-ValueSnnp-ValueHsp-ValueFST
RNA3S. vs. F0.070920.0000 ***4.510060.0000 ***0.851850.0010 **0.995450.5330 ns0.22767
RNA4S. vs. F0.022880.0080 **4.634220.0050 **0.884620.0030 **1.000001.0000 ns0.06408
RNA1S. vs. F0.038900.0000 ***4.440270.0000 ***0.923080.0010 *1.000001.0000 ns0.23698
ns not significant, * 0.01 < p-value < 0.05; **, 0.001 < p-value < 0.01; ***, p-value < 0.001.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Nourinejhad Zarghani, S.; Al Kubrusli, R.; Iancev, S.; Jalkanen, R.; Büttner, C.; von Bargen, S. Molecular Population Genetics of Aspen Mosaic-Associated Virus in Finland and Sweden. Viruses 2023, 15, 1678. https://doi.org/10.3390/v15081678

AMA Style

Nourinejhad Zarghani S, Al Kubrusli R, Iancev S, Jalkanen R, Büttner C, von Bargen S. Molecular Population Genetics of Aspen Mosaic-Associated Virus in Finland and Sweden. Viruses. 2023; 15(8):1678. https://doi.org/10.3390/v15081678

Chicago/Turabian Style

Nourinejhad Zarghani, Shaheen, Rim Al Kubrusli, Serghei Iancev, Risto Jalkanen, Carmen Büttner, and Susanne von Bargen. 2023. "Molecular Population Genetics of Aspen Mosaic-Associated Virus in Finland and Sweden" Viruses 15, no. 8: 1678. https://doi.org/10.3390/v15081678

APA Style

Nourinejhad Zarghani, S., Al Kubrusli, R., Iancev, S., Jalkanen, R., Büttner, C., & von Bargen, S. (2023). Molecular Population Genetics of Aspen Mosaic-Associated Virus in Finland and Sweden. Viruses, 15(8), 1678. https://doi.org/10.3390/v15081678

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop