Next Article in Journal
Treatment of Plantar Warts with a Nitric-Zinc Complex Solution: A Review of 72 Cases
Previous Article in Journal
Exploring Virus Diversity in the Potato leafhopper (Empoasca fabae), an Economically Important Agricultural Pest
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Study of Endogenous Viruses in the Strawberry Plants

1
School of Life Sciences, Yunnan University, Kunming 650500, China
2
Yunnan International Joint Laboratory of Virology and Immunology, Kunming 650500, China
*
Authors to whom correspondence should be addressed.
Viruses 2024, 16(8), 1306; https://doi.org/10.3390/v16081306
Submission received: 24 July 2024 / Revised: 12 August 2024 / Accepted: 15 August 2024 / Published: 16 August 2024
(This article belongs to the Section Viruses of Plants, Fungi and Protozoa)

Abstract

:
Endogenous viral elements (EVEs) have been reported to exist widely in the genomes of eukaryotic organisms, and they are closely associated with the growth, development, genetics, adaptation, and evolution of their hosts. In this study, two methods—homologous sequence search and genome alignment—were used to explore the endogenous viral sequences in the genomes of Fragaria species. Results revealed abundant endogenous pararetroviruses (EPRVs) in the genomes of Fragaria species, including 786 sequences belonging to five known taxa such as Caulimovirus and other unclassified taxa. Differences were observed in the detected EPRVs between the two methods, with the homologous sequence search having a greater number of EPRVs. On the contrary, genome alignment identified various types and sources of virus-like sequences. Furthermore, through genome alignment, a 267-bp sequence with 95% similarity to the gene encoding the aphid-transmitted protein of Strawberry vein banding virus (Caulimovirus venafragariae) was discovered in the F. chiloensis genome, which was likely a recent insertion. In addition, the statistical analysis of the genome alignment results indicated a remarkably higher abundance of virus-like sequences in the genomes of polyploid strawberries compared with diploid ones. Moreover, the differences in virus-like sequences were observed between the genomes of Fragaria species and those of their close relatives. This study enriched the diversity of viruses that infect strawberries, and laid a theoretical foundation for further research on the origin of endogenous viruses in the strawberry genome, host–virus interactions, adaptation, evolution, and their functions.

1. Introduction

Endogenous viral elements (EVEs) are viral DNA sequences found in the genomes of non-viral organisms, consisting of entire viral genomes or only partial fragments [1,2]. EVEs record past viral infection events, providing “molecular fossils” to study the deep evolutionary history of viruses, and they are an important research tool in the emerging field of “Paleovirology” [3,4]. With the continuous discovery of EVEs, it is known that almost all types of viruses can integrate their nucleic acids into the host genome [5]. A large number of EVEs derived from DNA and RNA viruses are abundant in plant genomes [6,7]. At present, EVEs in plant genomes can be broadly categorized into two groups: those derived from members of the family Caulimoviridae are retrotranscribing double-stranded DNA (dsDNA) viruses, which are known as endogenous pararetroviruses (EPRVs), and those that originated from dsRNA, single-stranded RNA (ssRNA), and ssDNA viruses, which are known as endogenous non-retroviral elements (ENREs) [7,8]. The first reported EVE was a geminivirus-related DNA sequence (GRD) [9], whereas EPRVs derived from plant pararetroviruses (PRVs, Caulimoviridae) were widely abundant in plant genomes [10].
The Caulimoviridae family consists of 11 genera, including Caulimovirus, Rosadnavirus, Petuvirus, Dioscovirus, Badnavirus, Vaccinivirus, Soymovirus, Tungrovirus, Ruflodivirus, Cavemovirus, and Solendovirus (ICTV) [11]. The nucleotide similarity range of the conserved sequences among the genera of Caulimoviridae has been found to be 42–65%, and the amino acid similarity range 27–48%. In addition, the nucleotide similarity threshold for different species has been found to be 80% [12,13,14]. At present, EPRVs have been found in many plant genomes, such as tobacco [15], Musa spp. [16,17], rice [18,19,20], petunia [21], tomato [22], potato [23], pineapple [24], dahlia [25,26], African yam (Dioscorea cayenensis-rotundata complex) [6], Dracaena sanderiana [27], fig [28], eucalyptus [29], Aristotelia chilensis [30], Citrinae [31], red raspberry [32], alfalfa [8], grapevine [12,33], and eggplant [34]. Large-scale surveys have revealed that EPRVs are widespread in all major plant taxa, including club mosses, ferns, gymnosperms, and angiosperms (including monocotyledon and dicotyledon plants) [34,35,36]. Plant ENREs refer to the internalization of nucleic acids from ssDNA, dsRNA, and ssRNA viruses; plant ENREs primarily match the sequences related to the RNA-dependent RNA polymerase or replicase (Rep), movement protein, and coat protein of the associated viruses [7,37]. Apart from the unclassified ENREs, at least 11 families of ENREs have been reported to be distributed in genomes ranging from algae to higher plants [1,38,39,40,41]. Notably, sequences related to giant virus DNA have also been reported to be integrated into plant genomes, such as large nuclear-cytoplasmic DNA virus-like fragments found in the genomes of Physcomitrella patens and Selaginella moellendorffii [42]. Moniruzzaman et al. (2022) also documented the endogenization of giant viruses in the genomes of Chlamydomonas reinhardtii and Chlamydomonas incerta [40].
Plant EVEs are mainly distributed in plant genomes in the form of fragmentation, and most of them are inactive and only act as “gene fossils”; a small number of EVEs with a complete genome sequence can be assembled into infectious virus particles induced by damage [21], tissue culture [43], species hybridization [15], heat stress [44], sunshine, and seasonal variation [15,45]. To date, three plant EVEs have been activated and have triggered episomal viral infection, including eBSV in the genome of Musa spp. [16], eTVCV in the hybrid Nicotiana edwardsonii derived from Nicotiana glutinosa and Nicotiana clevelandii [15], and ePVCV in the hybrid offspring of Petunia integrifolia subsp. inflata and P. axillaris subsp. axillaris [21]. The integration of EVEs into or near host genes can be harmful, for example, the insertion of EVEs into the late blight-resistance gene (R1) of Solanum melongena and its wild relatives has disrupted R1 orthologs [34]. In addition, 9% of plant endogenous viruses in the grape genome are located in introns, which may affect the expression of host genes [12]. However, most EVEs are neutral and degraded by the accumulation of destructive mutations, insertions, or deletions [10,46]. Based on previous reports, long-term preservation of EVEs in plant genomes can provide plant immunity and resistance to related foreign viruses through homology-dependent transcription or post-transcriptional gene silencing [33]. Plant endogenous viral sequences can also be used as markers to clarify the phylogenetic and evolutionary relationship among plant species, providing unique insights into the evolution and biogeography of viruses and hosts [19,31]. In addition, the integration of endogenous viruses may enlarge the gene pool of plants and endow plants with new functions and stronger adaptation to survival and environment [35,38,47,48,49].
The genus Fragaria L. belongs to Rosaceae, and it has about 25 species, most of which are wild species. By contrast, F. × ananassa is a species that is widely cultivated worldwide [50,51]. Considering asexual reproduction and large-scale greenhouse cultivation, strawberry virus diseases occur frequently, and at least 30 viruses are reported to infect strawberries [52,53]. Among them, Strawberry vein banding virus (SVBV, Caulimovirus venafragariae), Strawberry mottle virus (SMoV, Sadwavirus fragariae), strawberry mild yellow edge virus (SMYEV, Potexvirus fragariae), Strawberry crinkle virus (SCV, Cytorhabdovirus fragariarugosus), and Strawberry latent ringspot virus (SLRSV, Stralarivirus fragariae) are the main strains causing strawberry virus diseases worldwide. Viral infections can reduce strawberry fruit yields by 30% to 80% and, in severe cases, by over 80%, posing a threat to strawberry production [54,55]. In addition, Geering et al. (2014) and Tomas et al. (2022) identified endogenous viral sequences integrated into the genomes of Fragaria species [10,12]. At present, data on the types, diversity, and classification of endogenous viral sequences in strawberries are still lacking. Thus, in this study, homologous sequence search and genome alignment were used to identify virus-like sequences in the genomes of strawberry plants. The integration differences of virus-like sequences in different strawberry species and their close relatives were compared. Our research expands the diversity of strawberry viruses and deepens the understanding of the integration of endogenous viral sequences and their effects on plants’ genetic evolution.

2. Materials and Methods

2.1. Materials

The genomes of strawberry and its related genera were obtained first. Considering the genomes of Duchesnea species were not found, the genomes of the species from Potentilla and Rubus were chosen as alternatives. The genomes of 14 Fragaria species, two Potentilla species, and four Rubus species were obtained from the NCBI Genome Database (home-genome-NCBI [nih.gov]) and Rosaceae Genome Database GDR (GDR [rosaceae.org], Supplementary Table S1) [56]. Then, probe sequences, including transcriptase sequences of 11 genera of Caulimoviridae, reverse transcriptase (RT) of Athila4, and Rep sequences of 14 genera of Geminiviridae were obtained from the NCBI protein database (https://www.ncbi.nlm.nih.gov/guide/proteins/ (accessed on 25 May 2023)) and Uniprot protein database (https://www.uniprot.org/ (accessed on 28 June 2023)). In addition, the Rep sequences of non-retroviruses such as SCV, SMoV, and Fragaria chiloensis cryptic virus (FCCV) were used to search for homologs (Supplementary Table S2). Furthermore, the NCBI virus genome database viral.1.1.genomic.fna.gz (Index of/refseq/release/viral [nih.gov]) was used for genome alignment.

2.2. Viral Homologous Sequence Search

Search, extraction, and clustering of virus-like sequences: First, the Rep sequences of 29 viruses (Supplementary Table S2) were used as the query probe, and the genome of strawberry and its relatives was used as the database for the tBLASTN search. The e-value was set at 1 × 10−5. Sequences with at least 50% amino acid similarity and at least 150 amino acid length were selected from the comparison results for further analysis to obtain authentic endogenous viral sequences [12,13,14]. Then, Seq [57] was used to extract viral homologous sequences from each plant genome in batches. Afterward, CD-HIT [58] was used to cluster the extracted sequences with 80% nucleotide similarity to form different clusters.
Phylogenetic analysis: representative sequences from each cluster were selected and added to the prepared RT sequence set (Supplementary Material S1). Then, initial phylogenetic analysis was conducted to eliminate sequences homologous to LTR transposons. The remaining sequences were aligned using MEGA-X software [59], and a maximum likelihood phylogenetic tree was constructed using 500 bootstrap repetitions, with other parameters set to default values. The phylogenetic results served as a classification reference for the extracted endogenous viral sequences. Furthermore, the virus-like sequences in strawberry genomes and the classified EPRVs were counted and analyzed using Excel (Microsoft office 2021).

2.3. Alignment of Viral Genomes with the Strawberry Genomes

Using the strawberry genome as a query probe and the viral genome database as the search target, a BLASTN search with the e-value set to 1 × 10−5 was conducted. To obtain credible endogenous viral sequences, only sequences that had at least 70% nucleotide similarity to viral genomes and a matching length greater than 100 bp were selected for annotation analysis. Seq was used to extract the selected sequences. Then, phylogenetic analysis was performed on the extracted sequences in the genomes of Fragaria species, which are similar to Caulimoviridae viruses, as described in Section 2.2. Meanwhile, statistics and an analysis of virus-like sequences in strawberry genomes and EPRVs with the established taxonomic status were performed using Excel (Office 2021).

3. Results

3.1. Homologous Sequence Search

3.1.1. Integration of Virus-like Sequences in the Genomes of Fragaria and Its Related Genera

In this study, the conserved sequences of viral Reps from 28 genera (six families) (including dsDNA, ssDNA, dsRNA, ssRNA(−), and ssRNA(+) viruses) were used to search for homologs in the genomes of 14 Fragaria species, two Potentilla species, and four Rubus species. The results revealed the presence of Caulimoviridae and Metaviridae virus-like sequences in the genomes of Fragaria, Potentilla, and Rubus. In particular, a Fragaria chiloensis cryptic virus (dsRNA virus) Rep-like sequence was found in the genome of Potentilla sterilis, whereas no similar sequences were found in the genomes of Fragaria and Rubus. Furthermore, the SCV (ssRNA(−) virus) and SMoV (ssRNA(+) virus) Rep-like sequences were not found in the genomes of these three genera of plants (Supplementary Table S3). Metaviridae is an important family of LTR retrotransposons, which widely exist in the genome of eukaryotes and play an important part in the eukaryotic genomes. On the contrary, Caulimoviridae is a family of plant viruses that can infect plants and cause serious diseases, and it is known as plant pararetrovirus because of its ability to encode RT. Therefore, this study focuses on plant pararetrovirus-like sequences.

3.1.2. Integration and Classification of EPRVs in the Strawberry Genomes

In the strawberry genomes of one cultivated species and 13 wild species, PRV-like sequences were found. Among them, the genomes of F. nipponica, F. pentaphylla, F. moupinensis, and F. nubicola contained over 100 sequences, with the F. moupinensis genome having 551 sequences. Conversely, the genomes of F. orientalis and F. vesca had fewer than 50 sequences, with the F. vesca genome harboring the lowest count at 37 sequences (Supplementary Table S4). The virus-like sequences in the genomes of each strawberry species were extracted and clustered with an 80% nucleotide similarity threshold, and a representative sequence was selected from each cluster for phylogenetic analysis to remove the sequences that were similar to retrotransposons, thereby obtaining the EPRV sequences, that is, the effective sequences in this study. A total of 779 EPRV sequences were obtained from strawberry genomes and clustered into 79 clusters. Similarly, the number of EPRVs in the genomes of F. nipponica, F. pentaphylla, F. moupinensis, and F. nubicola exceeded 50, with the F. moupinensis genome containing up to 276 sequences. Consistent with the virus-like sequences, the F. vesca genome harbored the fewest endogenous plant pararetrovirus sequences at 16 (Supplementary Table S4).
A representative sequence was selected from each cluster for phylogenetic analysis to determine the taxonomic position of EPRVs in the strawberry genomes (Figure 1 and Supplementary Figure S1). Based on the phylogenetic analysis results, the 779 EPRV sequences identified in the strawberry genomes were categorized into eight taxonomic groups, including four known endogenous plant pararetrovirus groups (Yendovirus, Zendovirus, Florendovirus, and Xendovirus) and four unknown groups labeled as VOTU1-OTU4 (Supplementary Figure S1). The phylogenetic tree revealed that VOTU1 branched independently, indicating a new taxonomic group. Meanwhile, VOTU2-OTU4 was positioned on the same evolutionary branch as the Zendovirus group, indicating a close relationship between them (Figure 1). Notably, the RT homologs of these plant pararetroviruses did not belong to any known taxonomic group within Caulimoviridae (per ICTV classification).

3.1.3. Diversity and Differences of EPRVs in Strawberry Genomes

The EPRV sequences in strawberry genomes were statistically analyzed. The results indicated that the most abundant EPRVs belonged to Zendovirus, accounting for 60.85%, followed by Florendovirus, accounting for 16.69%. The smallest group was VOTU2, accounting for 0.13%, followed by VOTU1, accounting for 1.93% (Figure 2A). From different strawberry species, Zendovirus and Florendovirus are integrated into the genomes of almost all strawberry plants. Zendovirus is the main group integrated into the genomes of F. orientalis, F. mandschurica, F. daltoniana, F. nipponica, F. pentaphylla, F. moupinensis, F. nubicola, F. vesca, and F. chiloensis. Florendovirus is the main group integrated into the genomes of F. iinumae. Furthermore, the unclassified taxonomic group VOTU4 was identified as a predominant clade in the genomes of F. virginiana and cultivated strawberry F. × ananassa Duch. cv. Wongyo 3115 [60], while being present in the genomes of F. nipponica and F. orientalis. The unknown taxon VOTU3 is the main group integrated into the genome of F. nilgerrensis, and this group is also integrated into the genomes of F. viridis and F. orientalis. Notably, the unknown taxa VOTU2 is only found in the genome of F. nilgerrensis (Figure 2B).

3.2. Genome Alignment Results

3.2.1. Virus-like Sequences and Differences in the Genomes of Strawberries

Genome alignment directly aligned the genomes of strawberries and other related plants to the NCBI viral genome database to identify virus-like sequences. In this study, the genomes of 14 Fragaria species, two Potentilla species, and four Rubus species were aligned with the NCBI viral genome database viral. 1.1. genomic.fna.gz (Index of/refseq/release/viral [nih.gov]), and sequences with at least 100 bp length and 70% similarity were considered as potential viral integration sequences. The results indicate that 15 different viral genome segments were found in 13 strawberry genomes (Figure 3). Among them, five virus-like fragments could be found in almost all strawberry plant genomes, including Finkel-Biskis-Jinkins murine sarcoma virus, Murine osteosarcoma virus, Neodiprion lecontei nucleopolyhedrovirus, and Trichoplusia ni ascovirus 2c. Prochlorococcus phage P-TIM 68-like sequences were also found in the genomes of eight strawberry species. Sequences similar to Cotesia glomerata bracovirus and Glyptapanteles indiensis bracovirus were found in the genomes of F. ananassa Duch. cv. Wongyo 3115, F. chiloensis, and F. virginiana. Sequences similar to Banana streak CA virus were present in the genome of F. nilgerrensis and F. nipponica. Sequences similar to Citrus yellow mosaic virus were found in the genomes of F. orientalis, F. nipponica, and F. vesca. In addition, some viral genome sequences could only be found in a specific strawberry genome, for example, Choristoneura fumiferana granulovirus only had similar fragments in the F. nipponica genome Dishui lake phycodnavirus 1. Similar sequences were only found in the F. nilgerrensis genome. Grapevine roditis leaf discoloration-associated virus (Caulimoviridae) only had similar fragments in the F. pentaphylla genome, and Birch leaf roll-associated virus (Caulimoviridae) only had similar fragments in the F. pentaphylla genome. This study also found fragments similar to the SVBV genome in the F. chiloensis genome, with 95% similarity and 267 bp alignment length. Notably, the complete genome of Escherichia phage phiX174 was assembled from the genome of F. orientalis (Supplementary Material S2 and Supplementary Figure S2).

3.2.2. Comparison of Virus-like Sequences in Different Plant Genomes

The differences and similarities in virus-like sequences among plant genomes of three genera are evident, with distinct yet shared similarities. Statistical analysis revealed variations in the virus-like sequences aligned in the genomes of strawberries and their relatives. Concurrently, some similar virus-like sequences were matched in the genomes of strawberries and their close relatives, including Neodiprion lecontei NPV, Trichoplusia ni ascovirus 2c, and Prochlorococcus phage P-TIM68. With regard to viral nucleic acid types, the virus-like sequences identified in the genomes of the three genera predominantly belonged to dsDNA viruses, with a minority attributed to ssRNA viruses (Figure 3).
Furthermore, differences in the quantity of virus-like segments were found in the genomes of different strawberry species, exhibiting a correlation with the ploidy level of the strawberry species. In particular, the number of virus-like segments in high-ploidy strawberry genomes was higher than that in low-ploidy strawberries. However, no variation in the quantity of virus-like sequences was found in high-ploidy strawberry genomes. An analysis of virus-like sequences in genomes of strawberries with different ploidy levels revealed that the number of virus-like segments in the octoploid strawberries F. virginiana and F. chiloensis, the cultivated F. × ananassa Duch. cv. Wongyo 3115 from Japan, and the tetraploid strawberry F. moupinensis was higher than that in the remaining diploid strawberries. The results from conducting Welch’s t-test and the Wilcoxon rank-sum test analysis to compare the differences in the number of virus-like segments among genomes of strawberries with different ploidy levels indicated significant differences in the number of virus-like sequences between tetraploid and diploid strawberry genomes, as well as between octoploid and diploid strawberry genomes. However, no significant difference was observed in the number of virus-like sequences between tetraploid and octoploid strawberry genomes (Figure 4A). Overall, the number of virus-like segments in the highly ploid strawberry genomes was significantly higher than that of the ordinary diploid strawberry (Figure 4B).

3.2.3. Integration and Classification of EPRVs in the Strawbery Genomes (Identification of EPRVs in the Strawberry Genomes)

Based on the species annotation results, segments similar to pararetroviruses in plant genomes were further analyzed, and the functional regions of these similar segments in the virus genome were determined through alignment. The results revealed that the region 22,846,325–22,846,439 on chromosome 4 of F. nipponica and the region 35,020,921–35,021,035 on chromosome 3 of F. nilgerrensis showed 80% and 81% alignment similarity, respectively, to regions 5570–5684 in the Banana streak CA virus genome. Upon inspecting the virus genome maps, these regions were found to correspond to the RT sequence of Banana streak CA virus. Similarly, the regions 18,447,245–18,447,362 on chromosome 5 of F. orientalis, 10,881,625–10,881,742 on chromosome 1 of F. nipponica, and region 536,392–536,509 on the F. vesca genome sequence GG775297.1 exhibited 79%, 80%, and 79% alignment similarities, respectively, to the RT region 5586–5703 of Citrus yellow mosaic virus. Furthermore, the region 29,284,759–29,284,886 on chromosome 2 of F. pentaphylla showed 78% alignment similarity to the RT of Grapevine roditis leaf discoloration-associated virus, and the region 16,387,420–16,387,565 on chromosome 4 showed 77% alignment similarity to the RT of Birch leaf roll-associated virus. In addition, the region 20,067,530–20,067,795 on chromosome Fchil2-B2 of F. chiloensis exhibited 95% alignment similarity to the aphid-transmission-associated protein sequence of SVBV (Table 1).
Fragments of plant pararetroviruses in the strawberry genomes were extracted, with 100 bp extensions upstream and downstream, for phylogenetic analysis using MEGAx. Phylogenetic analysis revealed that plant pararetroviruses, such as sequences on chromosome 5 of F. orientalis, belonged to the classification group of Zendovirus. The sequences on chromosome 4 of F. nipponica were classified under Yendovirus. The plant pararetroviruses, such as sequences on chromosome 3 of F. nilgerrensis, chromosome 1 of F. nipponica, chromosome 2 and chromosome 4 of F. pentaphylla, and chromosome GG775297.1 of F. vesca were categorized as an unknown classification group (Figure 5).
A segment of the aphid-transmission-associated protein sequence from SVBV was discovered in the F. chiloensis genome. SVBV belongs to the Caulimovirus genus of the Caulimoviridae family, it infects strawberries, and it can be transmitted by aphids. In this study, the region 20,067,530–20,067,795 on chromosome Fchil2-B2 of F. chiloensis shows 95% alignment similarity to the region 1133–1398 of the aphid-transmission-associated protein gene in the genome of SVBV. A phylogenetic analysis was conducted using the aphid-transmission-associated protein sequence belonging to Caulimoviridae viruses. The phylogenetic analysis results indicate that the aphid-transmission-associated protein-like sequence found in the F. chiloensis genome clustered with the aphid-transmission-associated protein sequence of SVBV, indicating that the similar sequence in the F. chiloensis genome represents an integrated sequence of the aphid-transmission-associated protein related to SVBV (Figure 6).

3.3. Comparison of the Two Study Methods

After searching for homologs of viral Reps (sequences with at least 50% amino acid similarity and at least 150 amino acid length were selected), homologs of plant pararetrovirus (Caulimoviridae) RT were identified in the strawberry genomes. These RT homologs found in the genomes of 14 species of Fragaria belong to eight taxonomic groups, totaling 779 sequences. These groups include Yendovirus, Zendovirus, Florendovirus, Xendovirus, and four unclassified groups (VOTU1-OTU4). By directly aligning the strawberry genomes with viral genome databases (sequences with at least 70% nucleotide similarity and at least 100 nucleotides in length were selected), plant pararetrovirus-like and non-retroviral virus-like sequences were identified in the strawberry genomes. Among these, the identified plant pararetroviruses were obtained from Yendovirus, Zendovirus, Caulimovirus, and an unclassified taxon, totaling eight sequences. These sequences were derived from the RT homologs of plant pararetroviruses (Caulimoviridae) and the aphid-transmitted protein homologs of Caulimovirus. In addition, the complete genome of Escherichia phage phiX174 was assembled from the genome of F. orientalis by genome alignment. Overall, the search results differ between the two methods. Homolog searches are more specific, whereas genome alignment is more comprehensive. Therefore, combining multiple methods is necessary for studying plant EVEs.

4. Discussion

Replicase is a conserved sequence of viruses, which is often used in virus phylogenetic analysis and homology searches [36,61]. Rep sequences of Caulimoviridae, Geminiviridae, and other non-retroviruses were used to search for homologs in the genome of Fragaria species. The results indicated the presence of RT homologs of Caulimoviridae viruses in the genomes of strawberry plants. The number of RT homologs in the genomes of different strawberry species varied from 16 to 276, whereas several EPRV fragments were reported from other plants [31,36]. Based on our results, no homologs of Geminiviridae virus or other non-retroviral virus Reps were found in the genomes of Fragaria species, despite their presence in different plant species [39,61,62,63]. Plant pararetroviruses differ from animal retroviruses in that integrating their genomes into the host genome is not essential for their life cycle [10,64]. In addition, most EVEs are discontinuous, rearranged viral gene fragments [12]. Therefore, reconstructing complete viral genomes of plant viruses through homologous sequence searches is challenging [61,65]. Thus, we aligned the strawberry plant genomes directly with the viral genome database to match and assemble the complete viral genome. Following our selection criteria, where sequences below 70% nucleotide similarity and shorter than 100 bp were discarded, a complete genome belonging to the Caulimoviridae virus in the genomes of Fragaria species was not obtained. However, Geering et al. (2014) successfully assembled a complete viral genome in the F. vesca genome [12]. This discrepancy may be attributed to our selected thresholds and the selection of effective sequences. We also managed to assemble the complete genome of Escherichia phage phiX174 [66,67] in the genome of F. orientalis (Supplementary Material S2 and Supplementary Figure S2). As the genome of F. orientalis has only been assembled to the scaffold level [68], this occurrence may have resulted from errors during sequencing and genome assembly, which warrants further validation through subsequent research.
In this study, the differences in virus-like sequences were compared between the genomes of Fragaria species and closely related genera (Potentilla and Rubus). The results indicated differences in virus-like sequences among the genomes of Fragaria, Potentilla, and Rubus species. In addition, some common virus-like sequences were shared between the genomes of Fragaria species and those of Potentilla and Rubus. These virus-like sequences could be ancient viral sequences that infected and integrated into their common ancestral genome [19]. The viral sequences integrated into the host genome can serve as markers to study the phylogeny, adaptation, and evolutionary relationships among plant species, as well as infer the activity of ancient viruses, their geographical origins, and integration history [19,31]. Therefore, the identification of virus-like sequences in the genomes of Fragaria and closely related species could be highly valuable for conducting research on the genetics, adaptation, evolution, phylogeny, and speciation of strawberry species. Furthermore, by aligning the genomes of Fragaria species directly with viral genome databases, the number of virus-like segments in the genomes of polyploid strawberries (such as tetraploid strawberries [F. orientalis and F. nilgerrensis] and octoploid strawberries [F. × ananassa, F. virginiana, and F. chiloensis]) was found to be significantly higher than that in diploid strawberries (such as F. nilgerrensis and F. pentaphylla). At present, the endogenization of viral nucleic acids in plant genomes was due to non-homologous recombination between virus and host genomes, which often occurs during double-strand DNA break repair or transposon-mediated recombination [6,61]. Therefore, polyploids with a number of chromosomes increased the probability of viral nucleic acid insertion, leading to the higher abundance of virus-like segments in polyploid strawberries compared with diploids.
SVBV is a plant pararetrovirus belonging to the Caulimovirus genus that can be transmitted by aphids to infect strawberries [69]. The aphid transmission factor of SVBV plays an important role in its infection and transmission processes [70,71]. Through genome alignment, we identified a homologous sequence of the aphid-transmission-associated protein of SVBV integrated into the genome of F. chiloensis. This segment is 267 bp long and shares 95% similarity with the aphid transmission protein sequence of SVBV, suggesting a recent insertion. We hypothesized that the integration of the homolog of the aphid-transmission-associated protein of SVBV may confer immune resistance to SVBV in F. chiloensis, although the transcriptional activity of this segment needs further verification [14,22,72].
In our report, two methods were used to search for virus-like sequences in the genomes of Fragaria species. In identifying highly homologous EPRVs, sequences with a minimum of 50% amino acid similarity and at least 70% nucleotide similarity were retained as EPRV-like sequences [12,13]. Using this threshold, our homology searches in the strawberry genome identified EPRVs belonging to Yendovirus, Zendovirus, Florendovirus, Xendovirus, and four unknown taxonomic groups VOTU1–OTU4 (with VOTU1 being a potential novel group). Compared with our results, Tungrovirus has been reported to be integrated into strawberry genomes [10]. In contrast to homology searches, genome alignment revealed the presence of Caulimovirus, Yendovirus, Zendovirus, and an unknown taxonomic group. Our results have expanded the diversity of viruses in the genomes of Fragaria species. Our research also suggested that different approaches to searching endogenous viral sequences in plant genomes yield distinct results, emphasizing the importance of combining multiple methods for an accurate and comprehensive characterization of EVEs in plant genomes.

5. Conclusions

In this study, a homologous sequence search and genome alignment were used to study the endogenous viral sequences in strawberry genomes. A notable diversity of EPRVs and potential new taxa of Caulimoviridae were identified in strawberry genomes. By comparing virus-like sequences in the genomes of Fragaria species and those of closely related plants, variations in the composition of virus-like sequences were found among different plant genera, while shared virus-like sequences were observed among closely related plant genera. Furthermore, we reported a nucleotide sequence from the aphid-transmission-associated protein of Strawberry vein banding virus in the genome of F. chiloensis, which may be related to the resistance of F. chiloensis to Strawberry vein banding virus. At present, the impact of the integration of these viral elements on strawberries needs further study.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/v16081306/s1. Supplementary Material S1: RT sequence set for phylogenetic analysis, Supplementary Material S2: Genome sequence of Escherichia phage phiX174 assembled from the genome of Fragaria orientalis, Supplementary Figure S1: the rooted tree of strawberry pararetroviral RT homologs within the Caulimoviridae family, got from homology search, Supplementary Figure S2: Genomic illustration of Escherichia phage phiX174 assembled from the genome of Fragaria orientalis, Supplementary Table S1: The plant genomes of Fragaria and its related genera used in this study, Supplementary Table S2: The probe sequences used for homologous search, Supplementary Table S3: Overview of the search for homologs of viral replicases in the genomes of the Fragaria and its closely related plant genera, Supplementary Table S4: Statistical results of plant pararetrovirus-like sequences and effective sequences identified by homologs search in the strawberry genomes.

Author Contributions

X.C. and J.Y. proposed this research plan, and reviewed the manuscript; Z.W. were responsible for the design and planning of this study; Z.W. and J.L. performed the experiments; Z.W., J.L., X.Q. and D.S. performed the analysis of the data; Z.W. and J.L. wrote the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by grants from the National Natural Science Foundation of China (31960220, 31660089) and the Science and Technology Planning Project in Key Areas of Yunnan Province (202303AP140004).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in NCBI Genome Database(home-genome-NCBI [nih.gov]), reference number [GCA_019022445.1, GCA_000517285.1, GCA_009720345.1, GCA_010134655.1, GCF_000184155.1, GCA_933775445.1, GCA_963682095.1]; NCBI Protein Database (https://www.ncbi.nlm.nih.gov/guide/proteins/ (accessed on 25th May 2023)), reference number [NP_056728.1, YP_007761644.1, Q6XKE6, YP_009553219, AF378081.1, AAP03645.2, QPZ44443.1, QZN83648.1]; Uniprot protein database (https://www.uniprot.org/ (accessed on 28 June 2023)), reference number [A0A2H4N978, A0A0S2A4A7, G5CKJ8, C4QUK2, B8Y871, E5KBV1, Q9QD04, A0A166V1S2, A0A858M4E9, P0CK40, B1P3D6, P14991, C3UV60, T2B2N5, P14988, A0A6C0M7U4, A0A2R4Q8U9, A0A3S9JKY8, Q88888, A0A0S3JNW4]; and Rosaceae Genome Database GDR (GDR [rosaceae.org].

Acknowledgments

We would like to thank the National Natural Science Foundation of China (NSFC) for grant support (NSFC no. 31960220 and 31660089) and the Science and Technology Planning Project in Key Areas of Yunnan Province (202303AP140004).

Conflicts of Interest

We declare that there are no conflicts of interest regarding the publication of this paper in the field of biology. The study did not receive any direct funding from agencies in the public, commercial, or not-for-profit sectors that could influence the research outcomes. All of the authors are committed to maintaining transparency and upholding the principles of scientific integrity. Each contributor has independently participated in the conception, design, execution, and interpretation of the current research. Additionally, no author has any affiliation with any organization with a direct or indirect financial interest in the subject matter discussed in the manuscript. This declaration is aimed at maintaining the credibility and ethical standards of the research process and ensuring objectivity and impartiality in the reported findings.

References

  1. Chu, H.; Jo, Y.; Cho, W.K. Evolution of endogenous non-retroviral genes integrated into plant genomes. Curr. Plant Biol. 2014, 1, 55–59. [Google Scholar] [CrossRef]
  2. Holmes, E.C. The Evolution of Endogenous Viral Elements. Cell Host Microbe 2011, 10, 368–377. [Google Scholar] [CrossRef]
  3. Etienne, L. “Paleovirology”: Looking back in time to better understand and control modern viral infections. Virologie 2017, 21, 245–246. [Google Scholar] [PubMed]
  4. Emerman, M.; Malik, H.S. Paleovirology-Modern Consequences of Ancient Viruses. PLoS Biol. 2010, 8, e1000301. [Google Scholar] [CrossRef]
  5. Feschotte, C.; Gilbert, C. Endogenous viruses: Insights into viral evolution and impact on host biology. Nat. Rev. Genet. 2012, 13, 283–296. [Google Scholar] [CrossRef]
  6. Umber, M.; Filloux, D.; Muller, E.; Laboureau, N.; Galzi, S.; Roumagnac, P.; Iskra-Caruana, M.-L.; Pavis, C.; Teycheney, P.-Y.; Seal, S.E. The genome of African yam (Dioscorea cayenensis-rotundata complex) hosts endogenous sequences from four distinct badnavirus species. Mol. Plant Pathol. 2014, 15, 790–801. [Google Scholar] [CrossRef] [PubMed]
  7. Takahashi, H.; Fukuhara, T.; Kitazawa, H.; Kormelink, R. Virus Latency and the Impact on Plants. Front. Microbiol. 2019, 10, 2764. [Google Scholar] [CrossRef] [PubMed]
  8. Boutanaev, A.M.; Nemchinov, L.G. Genome-wide identification of endogenous viral sequences in alfalfa (Medicago sativa L.). Virol. J. 2021, 18, 185. [Google Scholar] [CrossRef]
  9. Bejarano, E.; Khashoggi, A.; Witty, M.; Lichtenstein, C. Integration of Multiple Repeats of Geminiviral DNA into the Nuclear Genome of Tobacco during Evolution. Proc. Natl. Acad. Sci. USA 1996, 93, 759–764. [Google Scholar] [CrossRef]
  10. Tomás, C.d.; Vicient, C.M. Genome-wide identification of Reverse Transcriptase domains of recently inserted endogenous plant pararetrovirus (Caulimoviridae). Front. Plant Sci. 2022, 13, 1011565. [Google Scholar] [CrossRef]
  11. Teycheney, P.-Y.; Geering, A.D.W.; Dasgupta, I.; Hull, R.; Kreuze, J.F.; Lockhart, B.; Muller, E.; Olszewski, N.; Pappu, H.; Pooggin, M.M.; et al. ICTV Virus Taxonomy Profile: Caulimoviridae. J. Gen. Virol. 2020, 101, 1025–1026. [Google Scholar] [CrossRef] [PubMed]
  12. Geering, A.D.W.; Maumus, F.; Copetti, D.; Choisne, N.; Zwickl, D.J.; Zytnicki, M.; McTaggart, A.R.; Scalabrin, S.; Vezzulli, S.; Wing, R.A.; et al. Endogenous florendoviruses are major components of plant genomes and hallmarks of virus evolution. Nat. Commun. 2014, 5, 5269. [Google Scholar] [CrossRef] [PubMed]
  13. Sukal, A.C.; Kidanemariam, D.B.; Dale, J.L.; Harding, R.M.; James, A.P. Characterization of a novel member of the family Caulimoviridae infecting Dioscorea nummularia in the Pacific, which may represent a new genus of dsDNA plant viruses. PLoS ONE 2018, 13, e0203038. [Google Scholar] [CrossRef]
  14. Valli, A.A.; Gonzalo-Magro, I.; Sanchez, D.H. Rearranged Endogenized Plant Pararetroviruses as Evidence of Heritable RNA-based Immunity. Mol. Biol. Evol. 2022, 40, msac240. [Google Scholar] [CrossRef] [PubMed]
  15. Lockhart, B.E.; Menke, J.; Dahal, G.; Olszewski, N.E. Characterization and genomic analysis of tobacco vein clearing virus, a plant pararetrovirus that is transmitted vertically and related to sequences integrated in the host genome. J. Gen. Virol. 2000, 81, 1579–1585. [Google Scholar] [CrossRef] [PubMed]
  16. Ndowora, T.; Dahal, G.; LaFleur, D.; Harper, G.; Hull, R.; Olszewski, N.E.; Lockhart, B. Evidence That Badnavirus Infection in Musa Can Originate from Integrated Pararetroviral Sequences. Virology 1999, 255, 214–220. [Google Scholar] [CrossRef] [PubMed]
  17. Lheureux, F.; Carreel, F.; Jenny, C.; Lockhart, B.E.L.; Iskra-Caruana, M.L. Identification of genetic markers linked to banana streak disease expression in inter-specific Musa hybrids. Theor. Appl. Genet. 2003, 106, 594–598. [Google Scholar] [CrossRef]
  18. Kunii, M.; Kanda, M.; Nagano, H.; Uyeda, I.; Kishima, Y.; Sano, Y. Reconstruction of putative DNA virus from endogenous rice tungro bacilliform virus-like sequences in the rice genome: Implications for integration and evolution. BMC Genom. 2004, 5, 80. [Google Scholar] [CrossRef] [PubMed]
  19. Chen, S.L.; Liu, R.F.; Koyanagi, K.O.; Kishima, Y. Rice genomes recorded ancient pararetrovirus activities: Virus genealogy and multiple origins of endogenization during rice speciation. Virology 2014, 471, 141–152. [Google Scholar] [CrossRef]
  20. Chen, S.; Zheng, H.; Kishima, Y. Genomic fossils reveal adaptation of non-autonomous pararetroviruses driven by concerted evolution of noncoding regulatory sequences. PLoS Pathog. 2017, 13, e1006413. [Google Scholar] [CrossRef]
  21. Richert-Pöggeler, K.R.; Noreen, F.; Schwarzacher, T.; Harper, G.; Hohn, T. Induction of infectious petunia vein clearing (pararetro) virus from endogenous provirus in petunia. Embo J. 2003, 22, 4836–4845. [Google Scholar] [CrossRef] [PubMed]
  22. Staginnus, C.; Gregor, W.; Mette, M.F.; Teo, C.; Borroto-Fernández, E.; Machado, M.L.; Matzke, M.; Schwarzacher, T. Endogenous pararetroviral sequences in tomato (Solanum lycopersicum) and related species. BMC Plant Biol. 2007, 7, 24. [Google Scholar] [CrossRef] [PubMed]
  23. Hansen, C.N.; Harper, G.; Heslop-Harrison, J.S. Characterisation of pararetrovirus-like sequences in the genome of potato (Solanum tuberosum). Cytogenet. Genome Res. 2005, 110, 559–565. [Google Scholar] [CrossRef] [PubMed]
  24. Gambley, C.F.; Geering, A.D.W.; Steele, V.; Thomas, J.E. Identification of viral and non-viral reverse transcribing elements in pineapple (Ananas comosus), including members of two new badnavirus species. Arch. Virol. 2008, 153, 1599–1604. [Google Scholar] [CrossRef] [PubMed]
  25. Pahalawatta, V.; Druffel, K.; Pappu, H. A new and distinct species in the genus Caulimovirus exists as an endogenous plant pararetroviral sequence in its host, Dahlia variabilis. Virology 2008, 376, 253–257. [Google Scholar] [CrossRef]
  26. Almeyda, C.V.; Raikhy, G.; Pappu, H.R. Characterization and comparative analysis of promoters from three plant pararetroviruses associated with Dahlia (Dahlia variabilis). Virus Genes 2015, 51, 96–104. [Google Scholar] [CrossRef]
  27. Su, L.; Gao, S.; Huang, Y.W.; Ji, C.Q.; Wang, D.; Ma, Y.; Fang, R.X.; Chen, X.Y. Complete genomic sequence of Dracaena mottle virus, a distinct badnavirus. Virus Genes 2007, 35, 423–429. [Google Scholar] [CrossRef]
  28. Laney, A.G.; Hassan, M.; Tzanetakis, I.E. An Integrated Badnavirus Is Prevalent in Fig Germplasm. Phytopathology 2012, 102, 1182–1189. [Google Scholar] [CrossRef]
  29. Marcon, H.S.; Costa-Silva, J.; Lorenzetti, A.P.R.; Marino, C.L.; Domingues, D.S. Genome-wide analysis of EgEVE_1, a transcriptionally active endogenous viral element associated to small RNAs in Eucalyptus genomes. Genet. Mol. Biol. 2017, 40, 217–225. [Google Scholar] [CrossRef]
  30. Villacreses, J.; Rojas-Herrera, M.; Sánchez, C.; Hewstone, N.; Undurraga, S.; Alzate, J.; Manque, P.; Maracaja-Coutinho, V.; Polanco, V. Deep Sequencing Reveals the Complete Genome and Evidence for Transcriptional Activity of the First Virus-Like Sequences Identified in Aristotelia chilensis (Maqui Berry). Viruses 2015, 7, 1685–1699. [Google Scholar] [CrossRef]
  31. Yu, H.; Wang, X.; Lu, Z.; Xu, Y.; Deng, X.; Xu, Q. Endogenous pararetrovirus sequences are widely present in Citrinae genomes. Virus Res. 2019, 262, 48–53. [Google Scholar] [CrossRef]
  32. Diaz-Lara, A.; Mosier, N.J.; Stevens, K.; Keller, K.E.; Martin, R.R. Evidence of Rubus Yellow Net Virus Integration into the Red Raspberry Genome. Cytogenet. Genome Res. 2020, 160, 329–334. [Google Scholar] [CrossRef]
  33. Bertsch, C.; Beuve, M.; Dolja, V.V.; Wirth, M.; Pelsy, F.; Herrbach, E.; Lemaire, O. Retention of the virus-derived sequences in the nuclear genome of grapevine as a potential pathway to virus resistance. Biol. Direct 2009, 4, 21. [Google Scholar] [CrossRef]
  34. Serfraz, S.; Sharma, V.; Maumus, F.; Aubriot, X.; Geering, A.D.W.; Teycheney, P.-Y. Insertion of Badnaviral DNA in the Late Blight Resistance Gene (R1a) of Brinjal Eggplant (Solanum melongena). Front. Plant Sci. 2021, 12, 683681. [Google Scholar] [CrossRef] [PubMed]
  35. Mushegian, A.R.; Elena, S.F. Evolution of plant virus movement proteins from the 30K superfamily and of their homologs integrated in plant genomes. Virology 2015, 476, 304–315. [Google Scholar] [CrossRef] [PubMed]
  36. Diop, S.I.; Geering, A.D.W.; Alfama-Depauw, F.; Loaec, M.; Teycheney, P.-Y.; Maumus, F. Tracheophyte genomes keep track of the deep evolution of the Caulimoviridae. Sci. Rep. 2018, 8, 572. [Google Scholar] [CrossRef] [PubMed]
  37. Liu, H.Q.; Fu, Y.P.; Xie, J.T.; Cheng, J.S.; Ghabrial, S.A.; Li, G.Q.; Yi, X.H.; Jiang, D.H. Discovery of Novel dsRNA Viral Sequences by In Silico Cloning and Implications for Viral Diversity, Host Range and Evolution. PLoS ONE 2012, 7, e42147. [Google Scholar] [CrossRef] [PubMed]
  38. Chiba, S.; Kondo, H.; Tani, A.; Saisho, D.; Sakamoto, W.; Kanematsu, S.; Suzuki, N. Widespread Endogenization of Genome Sequences of Non-Retroviral RNA Viruses into Plant Genomes. PLoS Pathog. 2011, 7, e1002146. [Google Scholar] [CrossRef]
  39. Kondo, H.; Hirano, S.; Chiba, S.; Andika, I.B.; Hirai, M.; Maeda, T.; Tamada, T. Characterization of burdock mottle virus, a novel member of the genus Benyvirus, and the identification of benyvirus-related sequences in the plant and insect genomes. Virus Res. 2013, 177, 75–86. [Google Scholar] [CrossRef]
  40. Moniruzzaman, M.; Erazo-Garcia, M.P.; Aylward, F.O. Endogenous giant viruses contribute to intraspecies genomic variability in the model green alga Chlamydomonas reinhardtii. Virus Evol. 2022, 8, veac102. [Google Scholar] [CrossRef]
  41. Nibert, M.L.; Vong, M.; Fugate, K.K.; Debat, H.J. Evidence for contemporary plant mitoviruses. Virology 2018, 518, 14–24. [Google Scholar] [CrossRef] [PubMed]
  42. Maumus, F.; Epert, A.; Nogué, F.; Blanc, G. Plant genomes enclose footprints of past infections by giant virus relatives. Nat. Commun. 2014, 5, 4268. [Google Scholar] [CrossRef] [PubMed]
  43. Côte, F.X.; Galzi, S.; Folliot, M.; Lamagnère, Y.; Teycheney, P.-Y.; Iskra-Caruana, M.-L. Micropropagation by tissue culture triggers differential expression of infectious endogenous Banana streak virus sequences (eBSV) present in the B genome of natural and synthetic interspecific banana plantains. Mol. Plant Pathol. 2010, 11, 137–144. [Google Scholar] [CrossRef] [PubMed]
  44. Noreen, F.; Akbergenov, R.; Hohn, T.; Richert-Pöggeler, K.R. Distinct expression of endogenous Petunia vein clearing virus and the DNA transposon dTph1 in two Petunia hybrida lines is correlated with differences in histone modification and siRNA production. Plant J. 2007, 50, 219–229. [Google Scholar] [CrossRef] [PubMed]
  45. Hull, R.; Harper, G.; Lockhart, B. Viral sequences integrated into plant genomes. Trends Plant Sci. 2000, 5, 362–365. [Google Scholar] [CrossRef] [PubMed]
  46. Aiewsakun, P.; Katzourakis, A. Endogenous viruses: Connecting recent and ancient viral evolution. Virology 2015, 479–480, 26–37. [Google Scholar] [CrossRef] [PubMed]
  47. Filloux, D.; Murrell, S.; Koohapitagtam, M.; Golden, M.; Julian, C.; Galzi, S.; Uzest, M.; Rodier-Goud, M.; D’Hont, A.; Vernerey, M.S.; et al. The genomes of many yam species contain transcriptionally active endogenous geminiviral sequences that may be functionally expressed. Virus Evol. 2015, 1, vev002. [Google Scholar] [CrossRef] [PubMed]
  48. Liu, H.Q.; Fu, Y.P.; Jiang, D.H.; Li, G.Q.; Xie, J.T.; Cheng, J.S.; Peng, Y.L.; Ghabrial, S.A.; Yi, X.H. Widespread Horizontal Gene Transfer from Double-Stranded RNA Viruses to Eukaryotic Nuclear Genomes. J. Virol. 2010, 84, 11876–11887. [Google Scholar] [CrossRef] [PubMed]
  49. Vassilieff, H.; Geering, A.D.W.; Choisne, N.; Teycheney, P.Y.; Maumus, F. Endogenous Caulimovirids: Fossils, Zombies, and Living in Plant Genomes. Biomolecules 2023, 13, 1069. [Google Scholar] [CrossRef]
  50. Sun, J.; Sun, R.; Liu, H.B.; Chang, L.L.; Li, S.T.; Zhao, M.Z.; Shennan, C.; Lei, J.J.; Dong, J.; Zhong, C.A.F.; et al. Complete chloroplast genome sequencing of ten wild Fragaria species in China provides evidence for phylogenetic evolution of Fragaria. Genomics 2021, 113, 1170–1179. [Google Scholar] [CrossRef]
  51. Li, C.X.; Cai, C.A.; Tao, Y.T.; Sun, Z.S.; Jiang, M.; Chen, L.X.; Li, J.M. Variation and Evolution of the Whole Chloroplast Genomes of Fragaria spp. (Rosaceae). Front. Plant Sci. 2021, 12, 754209. [Google Scholar] [CrossRef] [PubMed]
  52. Davalos, N.; Wells, H.; Ali, A. Preliminary Study of Viruses Infecting Strawberry (Fragaria spp.) in the Midsouthern United States. Plant Health Prog. 2022, 23, 7–13. [Google Scholar] [CrossRef]
  53. Koloniuk, I.; Matyášová, A.; Brázdová, S.; Veselá, J.; Přibylová, J.; Fránová, J.; Elena, S.F. Transmission of Diverse Variants of Strawberry Viruses Is Governed by a Vector Species. Viruses 2022, 14, 1362. [Google Scholar] [CrossRef] [PubMed]
  54. Torrico, A.K.; Salazar, S.M.; Kirschbaum, D.S.; Conci, V.C. Yield losses of asymptomatic strawberry plants infected with Strawberry mild yellow edge virus. Eur. J. Plant Pathol. 2017, 150, 983–990. [Google Scholar] [CrossRef]
  55. Thompson, J.R.; Wetzel, S.; Klerks, M.M.; Vašková, D.; Schoen, C.D.; Špak, J.; Jelkmann, W. Multiplex RT-PCR detection of four aphid-borne strawberry viruses in Fragaria spp. in combination with a plant mRNA specific internal control. J. Virol. Methods 2003, 111, 85–93. [Google Scholar] [CrossRef] [PubMed]
  56. Jung, S.; Lee, T.; Cheng, C.H.; Buble, K.; Zheng, P.; Yu, J.; Humann, J.; Ficklin, S.P.; Gasic, K.; Scott, K.; et al. 15 years of GDR: New data and functionality in the Genome Database for Rosaceae. Nucleic Acids Res. 2019, 47, D1137–D1145. [Google Scholar] [CrossRef] [PubMed]
  57. Shen, W.; Sipos, B.; Zhao, L.Y. SeqKit2: A Swiss army knife for sequence and alignment processing. iMeta 2024, 3. [Google Scholar] [CrossRef]
  58. Fu, L.M.; Niu, B.F.; Zhu, Z.W.; Wu, S.T.; Li, W.Z. CD-HIT: Accelerated for clustering the next-generation sequencing data. Bioinformatics 2012, 28, 3150–3152. [Google Scholar] [CrossRef]
  59. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef] [PubMed]
  60. Lee, H.E.; Manivannan, A.; Lee, S.Y.; Han, K.; Yeum, J.G.; Jo, J.; Kim, J.; Rho, I.; Lee, Y.R.; Lee, E.S.; et al. Chromosome Level Assembly of Homozygous Inbred Line ‘Wongyo 3115′ Facilitates the Construction of a High-Density Linkage Map and Identification of QTLs Associated With Fruit Firmness in Octoploid Strawberry (Fragaria x ananassa). Front. Plant Sci. 2021, 12, 696229. [Google Scholar] [CrossRef]
  61. Sharma, V.; Lefeuvre, P.; Roumagnac, P.; Filloux, D.; Teycheney, P.-Y.; Martin, D.P.; Maumus, F. Large-scale survey reveals pervasiveness and potential function of endogenous geminiviral sequences in plants. Virus Evol. 2020, 6, veaa071. [Google Scholar] [CrossRef]
  62. Mushegian, A.; Shipunov, A.; Elena, S.F. Changes in the composition of the RNA virome mark evolutionary transitions in green plants. BMC Biol. 2016, 14, 68. [Google Scholar] [CrossRef]
  63. Maredza, A.T.; Allie, F.; Plata, G.; Rey, M.E.C. Sequences enhancing cassava mosaic disease symptoms occur in the cassava genome and are associated with South African cassava mosaic virus infection. Mol. Genet. Genom. 2015, 291, 1467–1485. [Google Scholar] [CrossRef] [PubMed]
  64. Staginnus, C.; Richert-Pöggeler, K.R. Endogenous pararetroviruses: Two-faced travelers in the plant genome. Trends Plant Sci. 2006, 11, 485–491. [Google Scholar] [CrossRef]
  65. Gong, Z.; Han, G.-Z. Euphyllophyte Paleoviruses Illuminate Hidden Diversity and Macroevolutionary Mode of Caulimoviridae. J. Virol. 2018, 92, e02043-17. [Google Scholar] [CrossRef] [PubMed]
  66. Romeyer Dherbey, J.; Parab, L.; Gallie, J.; Bertels, F. Stepwise Evolution of E. coli C and PhiX174 Reveals Unexpected Lipopolysaccharide (LPS) Diversity. Mol. Biol. Evol. 2023, 40, msad154. [Google Scholar] [CrossRef]
  67. Labrie, S.J.; Dupuis, M.E.; Tremblay, D.M.; Plante, P.L.; Corbeil, J.; Moineau, S. A New Microviridae Phage Isolated from a Failed Biotechnological Process Driven by Escherichia coli. Appl. Environ. Microbiol. 2014, 80, 6992–7000. [Google Scholar] [CrossRef]
  68. Hirakawa, H.; Shirasawa, K.; Kosugi, S.; Tashiro, K.; Nakayama, S.; Yamada, M.; Kohara, M.; Watanabe, A.; Kishida, Y.; Fujishiro, T.; et al. Dissection of the Octoploid Strawberry Genome by Deep Sequencing of the Genomes of Fragaria Species. DNA Res. 2014, 21, 169–181. [Google Scholar] [CrossRef]
  69. Bonneau, P.; Hogue, R.; Tellier, S.; Fournier, V. Evaluation of Various Sources of Viral Infection in Strawberry Fields of Quebec, Canada. J. Econ. Entomol. 2019, 112, 2577–2583. [Google Scholar] [CrossRef]
  70. Hohn, T. Plant pararetroviruses: Interactions of cauliflower mosaic virus with plants and insects. Curr. Opin. Virol. 2013, 3, 629–638. [Google Scholar] [CrossRef]
  71. Lutz, L.; Raikhy, G.; Leisner, S.M. Cauliflower mosaic virus major inclusion body protein interacts with the aphid transmission factor, the virion-associated protein, and gene VII product. Virus Res. 2012, 170, 150–153. [Google Scholar] [CrossRef] [PubMed]
  72. Mette, M.F.; Kanno, T.; Aufsatz, W.; Jakowitsch, J.; van der Winden, J.; Matzke, M.A.; Matzke, A.J.M. Endogenous viral sequences and their potential contribution to heritable virus resistance in plants. Embo J. 2002, 21, 461–469. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The phylogenetic relationship of strawberry pararetroviral RT homologs within the Caulimoviridae family, as retrieved from a homology search. The maximum likelihood tree was constructed using representative sequences derived from strawberry genomes and 11 genera of exogenous viral species within the Caulimoviridae family (per ICTV classification), along with RT sequences from 7 endogenous viral genera. The phylogenetic analysis was bootstrapped 500 times, and the confidence level of the branch was indicated by the color of the branch. The 11 exogenous viral genera (ICTV) include: Caulimovirus, Rosadnavirus, Petuvirus, Dioscovirus, Badnavirus, Vaccinivirus, Soymovirus, Tungrovirus, Ruflodivirus, Cavemovirus, and Solendovirus. The 7 endogenous viral genera comprise: Florendovirus, Gymnendovirus, Fernendovirus, Yendovirus, Zendovirus, Xendovirus, and Wendovirus. The Ty1-copia and Ty3/Gypsy retrotransposon were selected as outgroups, and the rooted tree constructed by phylogenetic analysis was shown in Supplementary Figure S1.
Figure 1. The phylogenetic relationship of strawberry pararetroviral RT homologs within the Caulimoviridae family, as retrieved from a homology search. The maximum likelihood tree was constructed using representative sequences derived from strawberry genomes and 11 genera of exogenous viral species within the Caulimoviridae family (per ICTV classification), along with RT sequences from 7 endogenous viral genera. The phylogenetic analysis was bootstrapped 500 times, and the confidence level of the branch was indicated by the color of the branch. The 11 exogenous viral genera (ICTV) include: Caulimovirus, Rosadnavirus, Petuvirus, Dioscovirus, Badnavirus, Vaccinivirus, Soymovirus, Tungrovirus, Ruflodivirus, Cavemovirus, and Solendovirus. The 7 endogenous viral genera comprise: Florendovirus, Gymnendovirus, Fernendovirus, Yendovirus, Zendovirus, Xendovirus, and Wendovirus. The Ty1-copia and Ty3/Gypsy retrotransposon were selected as outgroups, and the rooted tree constructed by phylogenetic analysis was shown in Supplementary Figure S1.
Viruses 16 01306 g001
Figure 2. Classification statistics of strawberry EPRVs obtained by homologous search (A); composition of EPRVs in different strawberry genomes (B).
Figure 2. Classification statistics of strawberry EPRVs obtained by homologous search (A); composition of EPRVs in different strawberry genomes (B).
Viruses 16 01306 g002
Figure 3. Statistical diagram of the genome alignment results. Only sequences with at least 70% nucleotide similarity to the viral genome and at least 100 nucleotides in length were counted. Because the F. orientalis genome was only assembled to the scaffold level, to avoid errors the alignment results of its genome were not counted when comparing the differences in similar viral sequences between different species.
Figure 3. Statistical diagram of the genome alignment results. Only sequences with at least 70% nucleotide similarity to the viral genome and at least 100 nucleotides in length were counted. Because the F. orientalis genome was only assembled to the scaffold level, to avoid errors the alignment results of its genome were not counted when comparing the differences in similar viral sequences between different species.
Viruses 16 01306 g003
Figure 4. Welch’s t-test of endogenous virus-like sequences in different ploidy strawberry genomes (A); Wilcoxon rank-sum test analysis of endogenous virus-like sequences in the genomes of polyploid (octaploid and tetraploid) and lowploid (diploid) strawberries (B). The magnitude of the significance p-value was indicated by “*” (“*”, 0.01 < p < 0.05; “**”, 0.001 < p < 0.01).
Figure 4. Welch’s t-test of endogenous virus-like sequences in different ploidy strawberry genomes (A); Wilcoxon rank-sum test analysis of endogenous virus-like sequences in the genomes of polyploid (octaploid and tetraploid) and lowploid (diploid) strawberries (B). The magnitude of the significance p-value was indicated by “*” (“*”, 0.01 < p < 0.05; “**”, 0.001 < p < 0.01).
Viruses 16 01306 g004
Figure 5. Phylogenetic analysis of strawberry endogenous pararetrovirus sequences obtained from genomic alignments. The root tree was constructed using the maximum likelihood method by MEGA-X software. The retrotransposons Ty 1-copia and Ty 3/Gypsy were selected as outgroups. The pararetrovirus sequences extracted from the strawberry genomes are marked with red dots. Bootstrap is indicated by the circle size.
Figure 5. Phylogenetic analysis of strawberry endogenous pararetrovirus sequences obtained from genomic alignments. The root tree was constructed using the maximum likelihood method by MEGA-X software. The retrotransposons Ty 1-copia and Ty 3/Gypsy were selected as outgroups. The pararetrovirus sequences extracted from the strawberry genomes are marked with red dots. Bootstrap is indicated by the circle size.
Viruses 16 01306 g005
Figure 6. Phylogenetic analysis of aphid transmission-related protein gene sequences extracted from the F. chiloensis genome. The root tree was constructed using the maximum likelihood method by MEGA-X software. Virus-like sequences extracted from the F. chiloensis genome are marked with red dots. The sequences of the remaining genes encoding aphid-transmitted proteins were obtained from the NCBI database. “UD” represents the sequence resulting from extending the alignment hit with appropriate lengths upstream and downstream.
Figure 6. Phylogenetic analysis of aphid transmission-related protein gene sequences extracted from the F. chiloensis genome. The root tree was constructed using the maximum likelihood method by MEGA-X software. Virus-like sequences extracted from the F. chiloensis genome are marked with red dots. The sequences of the remaining genes encoding aphid-transmitted proteins were obtained from the NCBI database. “UD” represents the sequence resulting from extending the alignment hit with appropriate lengths upstream and downstream.
Viruses 16 01306 g006
Table 1. Statistical analysis of plant pararetrovirus-like sequences identified by genome alignment in the strawberry genomes.
Table 1. Statistical analysis of plant pararetrovirus-like sequences identified by genome alignment in the strawberry genomes.
Position in the Strawberry GenomesPosition in the Virus GenomesSimilarityLengthFunctional DomainVirus Genome Accession No
Fragaria mandshurica chr5 18,447,245–18,447,362Citrus yellow mosaic virus 5586–570379%119 bpRTNC_003382.1
Fragaria nipponica chr1 10,881,625–10,881,742Citrus yellow mosaic virus 5586–570381%119 bpRTNC_003382.1
Fragaria vesca GG775297.1 536,392–536,509Citrus yellow mosaic virus 5586–570379%119 bpRTNC_003382.1
Fragaria nipponica chr4 22,846,325–22,846,439Banana streak CA virus 5570–568480%119 bpRTNC_015506.1
Fragaria nilgerrensis chr3 35,020,921–35,021,035Banana streak CA virus 5570–568481%119 bpRTNC_015506.1
Fragaria pentaphylla chr2 29,284,759–29,284,886Grapevine roditis leaf discoloration-associated virus 5180–530778%133 bpRTNC_027131.1
Fragaria pentaphylla chr4 16,387,420–16,387,565Birch leaf roll-associated virus 5947–609277%149 bpRTNC_040635.1
Fragaria chiloensis Fchil2-B2 20,067,530–20,067,795Strawberry vein banding virus 1133–139895%267 bpAphid transmission associated proteinNC_001725.1
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, Z.; Liu, J.; Qi, X.; Su, D.; Yang, J.; Cui, X. Study of Endogenous Viruses in the Strawberry Plants. Viruses 2024, 16, 1306. https://doi.org/10.3390/v16081306

AMA Style

Wang Z, Liu J, Qi X, Su D, Yang J, Cui X. Study of Endogenous Viruses in the Strawberry Plants. Viruses. 2024; 16(8):1306. https://doi.org/10.3390/v16081306

Chicago/Turabian Style

Wang, Zongneng, Jian Liu, Xingyang Qi, Daifa Su, Junyu Yang, and Xiaolong Cui. 2024. "Study of Endogenous Viruses in the Strawberry Plants" Viruses 16, no. 8: 1306. https://doi.org/10.3390/v16081306

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop