Next Article in Journal
Controversial Impact of Sirtuins in Chronic Non-Transmissible Diseases and Rehabilitation Medicine
Next Article in Special Issue
Inhibition of LINE-1 Retrotransposition by Capsaicin
Previous Article in Journal
Whole-Body 12C Irradiation Transiently Decreases Mouse Hippocampal Dentate Gyrus Proliferation and Immature Neuron Number, but Does Not Change New Neuron Survival Rate
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Exploring the Remote Ties between Helitron Transposases and Other Rolling-Circle Replication Proteins

Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, CEP 31270-901, Brazil
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2018, 19(10), 3079; https://doi.org/10.3390/ijms19103079
Submission received: 28 August 2018 / Revised: 26 September 2018 / Accepted: 7 October 2018 / Published: 9 October 2018
(This article belongs to the Special Issue Transposable Elements)

Abstract

:
Rolling-circle replication (RCR) elements constitute a diverse group that includes viruses, plasmids, and transposons, present in hosts from all domains of life. Eukaryotic RCR transposons, also known as Helitrons, are found in species from all eukaryotic kingdoms, sometimes representing a large portion of their genomes. Despite the impact of Helitrons on their hosts, knowledge about their relationship with other RCR elements is still elusive. Here, we compared the endonuclease domain sequence of Helitron transposases with the corresponding region from RCR proteins found in a wide variety of mobile genetic elements. To do that, we used a stepwise alignment approach followed by phylogenetic and multidimensional scaling analyses. Although it has been suggested that Helitrons might have originated from prokaryotic transposons or eukaryotic viruses, our results indicate that Helitron transposases share more similarities with proteins from prokaryotic viruses and plasmids instead. We also provide evidence for the division of RCR endonucleases into three groups (Y1, Y2, and Yx), covering the whole diversity of this protein family. Together, these results point to prokaryotic elements as the likely closest ancestors of eukaryotic RCR transposons, and further demonstrate the fluidity that characterizes the boundaries separating viruses, plasmids, and transposons.

Graphical Abstract

1. Introduction

Rolling-circle replication (RCR) proteins are essential components of many genetic elements found in all three domains of life. These proteins can be classified into three different groups according to their main function: (i) Rep proteins (vegetative replication), (ii) Mob proteins/relaxases (conjugation), and (iii) transposases (transposon mobility) [1,2]. Helitrons are the eukaryotic representatives of RCR transposable elements (TEs), found in species from all eukaryotic kingdoms in highly variable copy numbers [3,4]. Their transposition is thought to occur by a mechanism similar to the one proposed for bacterial RCR TEs, like the IS91 family of elements [4,5,6]. Briefly, the Helitron transposase binds to the 5’-end of the element, using one of its two catalytic tyrosines to create a 5′-phosphotyrosine intermediate and a free 3′-OH at the donor site. The leading strand covalently bound to the transposase is displaced, the lagging strand is synthesized, and the second catalytic tyrosine nicks the 3’-end, promoting the formation of a double-strand circle intermediate. The transposase then cleaves the leading strand from the circular intermediate, but this time the second tyrosine cleaves the host’s genome, forming a free 3’-OH which attacks the first 5’-phosphotyrosine linkage. After the 3’-end of the circular intermediate is also joined to the recipient’s free 5’-end, an integrated single-strand “loop” is formed and probably resolved during the host’s genome replication. In addition, it has been recently shown that Helitron transposition shares mechanistic similarities with the replication process used by some circular viruses [7]. Despite some of the differences in their mode of propagation, the main catalytic reaction used by all RCR elements is essentially the same [1].
Helitron transposases are composed of a typical domain, the endonuclease involved in the initiation of RCR (RCRE or Rep), fused to a helicase domain (Hel) from the superfamily 1 (S1H) (Figure 1) [4,8]. This protein, also known as RepHel, belongs to the HUH (named after one of its conserved motifs with two His residues separated by a hydrophobic residue) family of endonucleases [1]. Although HUH endonucleases from eukaryotic viruses and some plasmids also have a helicase domain, they belong to the superfamily 3 (S3H), which is unrelated to the one found in Helitrons. Furthermore, prokaryote viruses only encode a RCRE domain with no helicase (Figure 1) [8,9].
Since Helitrons were discovered [11], a few preliminary suggestions about their evolutive origins have been made. These can be generally divided in two scenarios: the first suggests that Helitrons originated from a prokaryotic ancestral RCR TE [8,11], and the second adds the possibility that Helitrons descended from an ancient eukaryotic viral integration [12]. The first scenario is mainly based on the obvious similarities in the mode of propagation of eukaryotic and prokaryotic RCR TEs, while the second scenario considers the fact that, in contrast to prokaryote RCR TEs, Helitron coding sequences include a helicase domain and sometimes a ssDNA-binding protein, similarly to some RCR proteins from eukaryotic viruses. The fact that many viral copies from geminiviruses were found to be integrated in the tobacco genome [13] was also used to support this hypothesis. In fact, since this scenario was first proposed, several studies showed copies from different eukaryotic RCR viruses in host chromosomes, revealing that viral integrations of these replicons are more common than it was previously thought (reviewed in [14]). In addition, it has been shown that several geminivirus- and parvovirus-related sequences integrated in eukaryote genomes display TE features, and have apparently shifted from a viral to a transposon-like mode of replication [15].
Despite the above considerations, some differences between the RCR proteins of Helitrons and eukaryotic viruses argue against their evolutionary relationship. Firstly, as mentioned before, helicases from these two classes of elements belong to different superfamilies. Also, with the exception of parvoviruses [16], all RCR proteins from eukaryotic ssDNA viruses contain only one tyrosine (Y1) in their catalytic core [9,17], in contrast to the RepHel from Helitrons, which has two (Y2) [4] (Figure 1). Although the number of catalytic tyrosines has been used to tentatively classify RCR proteins between two superfamilies [17], there is currently no phylogenetic support for this distinction. In view of these observations, and considering that domain rearrangements are not uncommon during protein evolution [18], the first scenario (i.e., that Helitrons originated from a prokaryotic ancestral RCR TE) seems to be more parsimonious, as the acquisition of a S1H domain would be the only major evolutionary step in a prokaryotic to eukaryotic RCR TE transition.
The relationship between Helitrons and other RCR genetic elements was initially assessed by Poulter et al. [19]. Although their results did not indicate a relationship between these TEs with specific RCR entities, they provided evidence for an ancient monophyletic origin of Helitrons, which probably occurred early on in the evolution of eukaryotes. However, the evolutionary origin of Helitrons has not been further examined, probably as a consequence of the low sequence identity of RepHel with any other group of RCR proteins [3].
In this study, we investigated the relationship of the Helitron RepHel with other RCR proteins by analyzing the RCRE amino acid sequences from a wide variety of mobile genetic elements, including TEs, plasmids, and viruses. Our results indicate that, despite being eukaryotic TEs, Helitron transposases display more sequence similarities with prokaryotic RCR proteins from bacteriophages and plasmids. In addition, we show that the HUH family of endonucleases can be divided into three major phylogenetic groups comprised of RCR proteins from highly heterogeneous mobile genetic elements.

2. Results and Discussion

2.1. Selecting and Preparing RCRE Domain Sequences

We selected a sample of 13 Helitron RepHel amino acid sequences, representing elements from distantly-related organisms across several phyla and including the main Helitron variants (Table S1). To analyze these TEs in a broad evolutionary context, at least three sequences of each family or group of RCR genetic elements from prokaryotes and eukaryotes were selected. These included single- and double-stranded viruses, plasmids, and TEs (Table S1).
Our analysis was restricted to the RCRE (or HUH) domain of the sequences (Figure 1), which has a central role in starting RCR reactions and is the only region common to all HUH endonucleases [1] (Figure 1). Modular rearrangements often occur during protein evolution [18] which is also the case for several RCR virus lineages [20]. For those reasons, and considering that flanking domains are highly variable amongst RCR elements [1], our restriction to the RCRE domain aimed to avoid spurious evolutionary inferences. Most proteins within the HUH family have three conserved motifs (I, II, and III) in the core region of the RCRE domain, despite the high sequence divergence between groups [1,2,21]. Only amino acid sequences containing all three conserved motifs in their typical arrangement (I-II-III) were selected for our analysis; this is because some HUH endonucleases display their motifs in the reverse order (e.g., III-II-I) [1,2], and these also have highly divergent amino acid sequences, which prevent reliable sequence alignments. A total of 115 amino acid sequences, representing the overall diversity of all known HUH endonucleases, were selected for the analysis (Table S1).
To reduce spurious alignments of the RCRE sequences, we conducted a stepwise alignment approach, which consisted of aligning each group of closely-related sequences separately, excluding segments flanking the RCRE domain and trimming the portions that were exclusive of individual taxa. The resulting sequences (Data S1) were aligned using PSI-Coffee, which is a method considered suitable for highly divergent protein sequences with little or no structural information available [22,23].

2.2. Major RCR Protein Phylogenetic Groups

A phylogenetic analysis was conducted and pairwise divergence values between sequences were used to generate non-metric multidimensional scaling (NMDS) ordinations. As expected for an analysis that includes highly divergent sequences, clade support values between major groups were low, although we observed an overall agreement between our results and the known topology for most of the clades (Figure 2). Our results support the monophyletic nature of all Helitron variants and the lack of any clear relationship of these TEs with other specific groups or families of mobile genetic elements, as previously suggested [19]. Nonetheless, in both the phylogenetic analysis (Figure 2) and NMDS ordinations (Figure 3) we observed an overall distinction between Y1 and Y2 RCR proteins, which we henceforth refer to as Y1 and Y2 groups. An exception is a third clade, composed of elements from both variants, which we refer to here as the Yx group because the number of tyrosines of the catalytic core of its members does not relate with the canonical Y1 and Y2 division. Although the resulting phylogeny revealed a basal segregation of Yx RCR proteins and the rest of the sequences, the Y2 group appears to be more closely related to Y1 RCR proteins, and perhaps constitutes a derivative clade of the Y1 group (Figure 2 and Figure 3).
The topology observed within the Yx group is roughly in agreement with previous results [24], indicating that this clade represents a bona fide phylogenetic cluster composed of archaeal viruses and bacterial TEs. Recent analyses using different methods have also shown that parvoviruses belong to a separate clade from other eukaryotic RCR viruses [25]. However, we did not expect that parvoviral RCR proteins (AAV2, AAV5, and SLP) would group together with Yx elements (Figure 2 and Figure 3). Although structural similarities indicate a distant relationship between parvoviral and other RCR proteins [26], the positioning of these viruses in the Yx group might also be the consequence of long branch attraction [27], so this result should be treated with caution.
As revealed by the results from both analyses, the assignment to a specific catalytic tyrosine group is not contingent on the element class (Figure 2 and Figure 3). For instance, bacterial plasmids, and eukaryotic and archaeal viruses have members in more than one group. Likewise, the element class does not always predict its topology, even within the same tyrosine group. For example, some Y1 viral families are closer to Y1 plasmids than other Y1 viruses, and the same is true in the Y2 group. This phenomenon has been observed in different studies and emphasizes the marked fluidity at the boundaries separating different classes of mobile genetic elements (reviewed in [8,9]). Thus, our results indicate that the tyrosine group division is the only informative phylogenetic feature encompassing the whole HUH endunuclease family.

2.3. Helitron Transposase is More Similar to Prokaryotic Proteins

Even though the Helitron RepHel does not appear to be phylogenetically closer to any single family of proteins, they clustered within the Y2 group which, apart from Helitrons, is exclusively composed of prokaryotic viruses and plasmids (Figure 2 and Figure 3). On the other hand, sequences from prokaryotic TEs clustered within the Yx group, even though some of them (including the IS91 family) have two tyrosines in their catalytic core and share a similar transposition mechanism with Helitrons [4,5,6,28]. It is also notable that RepHel proteins appear to be only distantly related to RCR proteins from eukaryotic viruses, which almost exclusively belong to the Y1 group. These observations indicate that the core domain from Helitron transposases is more similar to proteins from prokaryotic viruses and plasmids than to prokaryotic RCR transposases or to eukaryotic viral proteins.
As we have mentioned, in addition to the RCRE domain, RepHel proteins also have a S1 helicase domain (Figure 1); more specifically, this S1 helicase belongs to the Pif1 family [4]. Although Pif1 helicases are present in essentially all eukaryote genomes, they also have been found in some prokaryotes [29,30]. Because all known prokaryotic Y2 RCR proteins lack a helicase, this domain could have been acquired from a prokaryote host by the Helitron ancestor before it colonized the first eukaryote genome. However, considering that Pif1 helicases are ubiquitous in eukaryote genomes and found less frequently in prokaryotes, it seems more plausible that Helitrons acquired their helicase domain from a eukaryotic host. Indeed, a preliminary analysis of Pif1 sequences from Helitrons, eukaryotes, and prokaryotes indicates that the helicase domain from Helitrons is closely related to fungal proteins (Figure S2). Interestingly, the helicase domains from distinct Helitron variants formed separate clusters with different fungal proteins, suggesting that Helitrons acquired their helicase domain from at least two independent events (Figure S2).
These results support the hypothesis of an ancient origin of Helitrons during the initial radiation of eukaryotes, and suggest that neither prokaryotic TEs, nor eukaryotic viruses, are among their closest relatives. Instead, we provide evidence for a closer relationship of these eukaryotic TEs with prokaryotic viruses and plasmids with Y2 RCR proteins, even though it is not possible to determine which specific family shares the most recent common ancestor with the RepHel (Figure 4). Thus, our proposition is that Helitrons descend from a prokaryotic Y2 mobile element that integrated in the genome of an early eukaryote ancestor. Like all other known prokaryotic Y2 elements, the Helitron progenitor probably coded an RCR protein devoid of a helicase domain and was dependent of its host for correct replication/transposition. Subsequently, each of the incipient Helitron variants acquired a eukaryotic helicase by the recombination of its RCRE domain with a host helicase gene. In any case, a comprehensive understanding of the Helitron origins will probably rely on the future discovery of new groups of RCR genetic elements.
Finally, although the RCRE phylogeny does not coincide with the taxonomic division of distinct genetic elements classes (viruses, plasmids and TEs), we suggest that the HUH family of endonucleases is composed by three major radiation groups (Y1, Y2 and Yx). Interestingly, most of the HUH endonucleases can be assigned to one of these groups simply by having a tyrosine residue at a specific position in the RCRE domain, regardless of the element’s class. The extreme diversity observed in each of these groups underscore the dynamic nature of mobile genetic elements which, in the long term, do not evolve under the usual taxonomic constraints acting upon their hosts.

3. Materials and Methods

3.1. Sequences Retrieval and Selection

RepHel amino acid sequences from Helitrons were retrieved from Repbase (https://www.girinst.org/repbase/) [32] and GenBank (https://www.ncbi.nlm.nih.gov/genbank/) [33], using elements from previous studies as a reference (e.g., [11,19,34]). The structure of these proteins was verified using the Conserved Domain Database (CDD) search tool (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) [10]. RepHel sequences that could be clearly assigned to one of the three main Helitron variants [4] were selected: canonical Helitron (6 sequences), Helitron2 (1 sequence), and Helentron (6 sequences). Sequences representing each family or group of RCR proteins were retrieved on GenBank [33], based on several references (e.g., [9,21,24,35,36,37]). A total of 115 amino acid sequences were selected for the alignment (Table S1).

3.2. Sequence Alignment

Each family or group of sequences were aligned separately using the M-Coffee mode from T-Coffee (http://tcoffee.crg.cat/) [22] before being manually trimmed in order to exclude flanking portions of the RCRE domain and the segments that are exclusive of individual taxa. The trimmed sequences (Data S1) were aligned with PSI-Coffee (http://tcoffee.crg.cat/apps/tcoffee/do:psicoffee) [22] before manual correction. Alignment positions with less than 90% coverage were excluded.

3.3. NMDS and Phylogenetic Analysis

Pairwise evolutionary divergence between sequences was estimated using the Poisson correction model on MEGA7 [38]. The values were used to generate non-metric multidimensional scaling (NMDS) ordinations with the R package vegan [39], representing euclidean distances for two dimensions. NMDS and plotting of ordinations were conducted in RStudio v1.1.442 (Boston, MA, USA) [40]. The best-fit evolutionary model for the alignment (LG+G+I) was determined using MEGA7 [38] and the Smart model selection (SMS) in PhyML (http://www.atgc-montpellier.fr/phyml/) [41]. Maximum Likelihood phylogeny was inferred from 5000 replicates using MEGA7 [38], and the final phylogenetic tree edited using iTOL v4.2.3 (https://itol.embl.de/) [42].

Supplementary Materials

Supplementary materials can be found at https://www.mdpi.com/1422-0067/19/10/3079/s1.

Author Contributions

Conceptualization, P.H. and G.C.S.K.; Methodology, P.H.; Investigation, P.H.; Resources, G.C.S.K.; Data Curation, P.H.; Writing-Original Draft Preparation, P.H.; Writing-Review & Editing, G.C.S.K.; Visualization, P.H.; Funding Acquisition, G.C.S.K.

Funding

This research was funded by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) grant number 404620/2016-7 to G.C.S.K and doctoral fellowship from Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) to P.H.

Acknowledgments

We thank Renan P. de Souza for the insightful discussions that helped to enrich our analysis and “Pró-Reitoria de Pesquisa da Universidade Federal de Minas Gerais” for partially covering the publication costs.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

RCRRolling-circle replication
TETransposable element
RCRERolling-circle replication endonuclease domain
S1HSuperfamily 1 helicase
S3HSuperfamily 3 helicase
RepHelHelitron transposase (Rep/Helicase)
ssDNASingle-strand DNA
NMDSNon-metric multidimensional scaling

References

  1. Chandler, M.; De La Cruz, F.; Dyda, F.; Hickman, A.B.; Moncalian, G.; Ton-Hoang, B. Breaking and joining single-stranded DNA: The HUH endonuclease superfamily. Nat. Rev. Microbiol. 2013, 11, 525–538. [Google Scholar] [CrossRef] [PubMed]
  2. Wawrzyniak, P.; Płucienniczak, G.; Bartosik, D. The Different Faces of Rolling-Circle Replication and Its Multifunctional Initiator Proteins. Front. Microbiol. 2017, 8, 2353. [Google Scholar] [CrossRef] [PubMed]
  3. Kapitonov, V.V.; Jurka, J. Helitrons on a roll: Eukaryotic rolling-circle transposons. Trends Genet. 2007, 23, 521–529. [Google Scholar] [CrossRef] [PubMed]
  4. Thomas, J.; Pritham, E.J. Helitrons, the Eukaryotic Rolling-Circle Transposable Elements in Mobile DNA III, 3rd ed.; ASM Press: Washington, DC, USA, 2015; pp. 893–926. [Google Scholar]
  5. Dias, G.B.; Heringer, P.; Kuhn, G.C. Helitrons in Drosophila: Chromatin modulation and tandem insertions. Mob. Genet. Elements 2016, 6, e1154638. [Google Scholar] [CrossRef] [PubMed]
  6. Grabundzija, I.; Messing, S.A.; Thomas, J.; Cosby, R.L.; Bilic, I.; Miskey, C.; Jurka, J. A Helitron transposon reconstructed from bats reveals a novel mechanism of genome shuffling in eukaryotes. Nat. Commun. 2016, 7, 10716. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Grabundzija, I.; Hickman, A.B.; Dyda, F. Helraiser intermediates provide insight into the mechanism of eukaryotic replicative transposition. Nat. Commun. 2018, 9, 1278. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Koonin, E.V.; Dolja, V.V. Virus world as an evolutionary network of viruses and capsidless selfish elements. Microbiol. Mol. Biol. Rev. 2014, 78, 278–303. [Google Scholar] [CrossRef] [PubMed]
  9. Krupovic, M. Networks of evolutionary interactions underlying the polyphyletic origin of ssDNA viruses. Curr. Opin. Virol. 2013, 3, 578–586. [Google Scholar] [CrossRef] [PubMed]
  10. Marchler-Bauer, A.; Bo, Y.; Han, L.; He, J.; Lanczycki, C.J.; Lu, S.; Chitsaz, F.; Derbyshire, M.K.; Geer, R.C.; Gonzales, N.R.; et al. CDD/SPARCLE: Functional classification of proteins via subfamily domain architectures. Nucleic Acids Res. 2017, 45, D200–D203. [Google Scholar] [CrossRef] [PubMed]
  11. Kapitonov, V.V.; Jurka, J. Rolling-circle transposons in eukaryotes. Proc. Natl. Acad. Sci. USA 2001, 98, 8714–8719. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Feschotte, C.; Wessler, S.R. Treasures in the attic: Rolling circle transposons discovered in eukaryotic genomes. Proc. Natl. Acad. Sci. USA 2001, 98, 8923–8924. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Bejarano, E.R.; Khashoggi, A.; Witty, M.; Lichtenstein, C. Integration of multiple repeats of geminiviral DNA into the nuclear genome of tobacco during evolution. Proc. Natl. Acad. Sci. USA 1996, 93, 759–764. [Google Scholar] [CrossRef] [PubMed]
  14. Krupovic, M.; Forterre, P. Single-stranded DNA viruses employ a variety of mechanisms for integration into host genomes. Ann. N. Y. Acad. Sci. 2015, 1341, 41–53. [Google Scholar] [CrossRef] [PubMed]
  15. Liu, H.; Fu, Y.; Li, B.; Yu, X.; Xie, J.; Cheng, J.; Ghabrial, S.A.; Li, G.; Yi, X.; Jiang, D. Widespread horizontal gene transfer from circular single-stranded DNA viruses to eukaryotic genomes. BMC Evol. Biol. 2011, 11, 276. [Google Scholar] [CrossRef] [PubMed]
  16. Hickman, A.B.; Ronning, D.R.; Kotin, R.M.; Dyda, F. Structural unity among viral origin binding proteins: Crystal structure of the nuclease domain of adeno-associated virus Rep. Mol. Cell 2002, 10, 327–337. [Google Scholar] [CrossRef]
  17. Rosario, K.; Duffy, S.; Breitbart, M. A field guide to eukaryotic circular single-stranded DNA viruses: Insights gained from metagenomics. Arch. Virol. 2012, 157, 1851–1871. [Google Scholar] [CrossRef] [PubMed]
  18. Björklund, Å.K.; Ekman, D.; Light, S.; Frey-Skött, J.; Elofsson, A. Domain rearrangements in protein evolution. J. Mol. Biol. 2005, 353, 911–923. [Google Scholar] [CrossRef] [PubMed]
  19. Poulter, R.T.; Goodwin, T.J.; Butler, M. Vertebrate helentrons and other novel Helitrons. Gene 2003, 313, 201–212. [Google Scholar] [CrossRef]
  20. Kazlauskas, D.; Varsani, A.; Krupovic, M. Pervasive Chimerism in the Replication-Associated Proteins of Uncultured Single-Stranded DNA Viruses. Viruses 2018, 10, 187. [Google Scholar] [CrossRef] [PubMed]
  21. Ilyina, T.V.; Koonin, E.V. Conserved sequence motifs in the initiator proteins for rolling circle DNA replication encoded by diverse replicons from eubacteria, eucaryotes and archaebacteria. Nucleic Acids Res. 1992, 20, 3279–3285. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Di Tommaso, P.; Moretti, S.; Xenarios, I.; Orobitg, M.; Montanyola, A.; Chang, J.M.; Notredame, C. T-Coffee: A web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension. Nucleic Acids Res. 2011, 39, W13–W17. [Google Scholar] [CrossRef] [PubMed]
  23. Taly, J.F.; Magis, C.; Bussotti, G.; Chang, J.M.; Di Tommaso, P.; Erb, I.; Espinosa-Carrasco, J.; Kemena, C.; Notredame, C. Using the T-Coffee package to build multiple sequence alignments of protein, RNA, DNA sequences and 3D structures. Nat. Protoc. 2011, 6, 1669–1682. [Google Scholar] [CrossRef] [PubMed]
  24. Wang, Y.; Chen, B.; Cao, M.; Sima, L.; Prangishvili, D.; Chen, X.; Krupovic, M. Rolling-circle replication initiation protein of haloarchaeal sphaerolipovirus SNJ1 is homologous to bacterial transposases of the IS91 family insertion sequences. J. Gen. Virol. 2018, 99, 416–421. [Google Scholar] [CrossRef] [PubMed]
  25. Aiewsakun, P.; Simmonds, P. The genomic underpinnings of eukaryotic virus taxonomy: Creating a sequence-based framework for family-level virus classification. Microbiome 2018, 6, 38. [Google Scholar] [CrossRef] [PubMed]
  26. Campos-Olivas, R.; Louis, J.M.; Clérot, D.; Gronenborn, B.; Gronenborn, A.M. The structure of a replication initiator unites diverse aspects of nucleic acid metabolism. Proc. Natl. Acad. Sci. USA 2002, 99, 10310–10315. [Google Scholar] [CrossRef] [PubMed]
  27. Bergsten, J. A review of long-branch attraction. Cladistics 2005, 21, 163–193. [Google Scholar] [CrossRef] [Green Version]
  28. Garcillan-Barcia, M.P.; Bernales, I.; Mendiola, M.V.; de la Cruz, F. Single-stranded DNA intermediates in IS91 rolling-circle transposition. Mol. Microbiol. 2001, 39, 494–501. [Google Scholar] [CrossRef]
  29. Bochman, M.L.; Sabouri, N.; Zakian, V.A. Unwinding the functions of the Pif1 family helicases. DNA repair 2010, 9, 237–249. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Bochman, M.L.; Judge, C.P.; Zakian, V.A. The Pif1 family in prokaryotes: What are our helicases doing in your bacteria? Mol. Biol. Cell 2011, 22, 1955–1959. [Google Scholar] [CrossRef] [PubMed]
  31. Carrillo-Tripp, M.; Shepherd, C.M.; Borelli, I.A.; Venkataraman, S.; Lander, G.; Natarajan, P.; Johnson, J.E.; Brooks, C.L.; Reddy, V.S. VIPERdb2: An enhanced and web API enabled relational database for structural virology. Nucleic Acids Res. 2009, 37, D436–D442. [Google Scholar] [CrossRef] [PubMed]
  32. Bao, W.; Kojima, K.K.; Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 2015, 6, 11. [Google Scholar] [CrossRef] [PubMed]
  33. Benson, D.A.; Cavanaugh, M.; Clark, K.; Karsch-Mizrachi, I.; Lipman, D.J.; Ostell, J.; Sayers, E.W. GenBank. Nucleic Acids Res. 2017, 45, D37–D42. [Google Scholar] [CrossRef] [PubMed]
  34. Pritham, E.J.; Feschotte, C. Massive amplification of rolling-circle transposons in the lineage of the bat Myotis lucifugus. Proc. Natl. Acad. Sci. USA 2007, 104, 1895–1900. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Zawar-Reza, P.; Argüello-Astorga, G.R.; Kraberger, S.; Julian, L.; Stainton, D.; Broady, P.A.; Varsani, A. Diverse small circular single-stranded DNA viruses identified in a freshwater pond on the McMurdo Ice Shelf (Antarctica). Infect. Genet. Evol. 2014, 26, 132–138. [Google Scholar] [CrossRef] [PubMed]
  36. Kazlauskas, D.; Dayaram, A.; Kraberger, S.; Goldstien, S.; Varsani, A.; Krupovic, M. Evolutionary history of ssDNA bacilladnaviruses features horizontal acquisition of the capsid gene from ssRNA nodaviruses. Virology 2017, 504, 114–121. [Google Scholar] [CrossRef] [PubMed]
  37. Wang, Y.; Sima, L.; Lv, J.; Huang, S.; Liu, Y.; Wang, J.; Krupovic, M.; Chen, X. Identification, characterization, and application of the replicon region of the halophilic temperate sphaerolipovirus SNJ1. J. Bacteriol. 2016, 198, 1952–1964. [Google Scholar] [CrossRef] [PubMed]
  38. Kumar, S.; Stecher, G.; Tamura, K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 2016, 33, 1870–1874. [Google Scholar] [CrossRef] [PubMed]
  39. Dixon, P. VEGAN, a package of R functions for community ecology. J. Veg. Sci. 2003, 14, 927–930. [Google Scholar] [CrossRef]
  40. RStudio Team. RStudio: Integrated Development for R; RStudio, Inc.: Boston, MA, USA, 2016; Available online: http://www.rstudio.com/ (accessed on 8 October 2018).
  41. Lefort, V.; Longueville, J.E.; Gascuel, O. SMS: Smart model selection in PhyML. Mol. Biol. Evol. 2017, 34, 2422–2424. [Google Scholar] [CrossRef] [PubMed]
  42. Letunic, I.; Bork, P. Interactive tree of life (iTOL) v3: An online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016, 44, W242–W245. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Modular diversity of HUH endonucleases. Schematic representation of the rolling-circle replication (RCR) proteins included in the present analysis. Rolling-circle replication endonuclease (RCRE) domains have the first two motifs (I and II), in addition to the third motif represented by one or two tyrosines (Y) in the catalytic core (dots represent variable amino acid residues). Domains are not drawn to scale, and segments after helicase domains are not represented. Based on information from Chandler et al. [1], Koonin and Dolja [8], and the Conserved Domain Database (CDD) search tool [10].
Figure 1. Modular diversity of HUH endonucleases. Schematic representation of the rolling-circle replication (RCR) proteins included in the present analysis. Rolling-circle replication endonuclease (RCRE) domains have the first two motifs (I and II), in addition to the third motif represented by one or two tyrosines (Y) in the catalytic core (dots represent variable amino acid residues). Domains are not drawn to scale, and segments after helicase domains are not represented. Based on information from Chandler et al. [1], Koonin and Dolja [8], and the Conserved Domain Database (CDD) search tool [10].
Ijms 19 03079 g001
Figure 2. Phylogenetic analysis of RCRE domain sequences. Clade colors indicate each tyrosine group: Y1 (green), Y2 (red), and Yx (blue). Taxa colors represent the family of each element (box on the upper right). See Table S1 for taxa information. Phylogeny inferred by the Maximum Likelihood method (LG+G+I). The same phylogeny, with the numerical support values represented, is shown on Figure S1.
Figure 2. Phylogenetic analysis of RCRE domain sequences. Clade colors indicate each tyrosine group: Y1 (green), Y2 (red), and Yx (blue). Taxa colors represent the family of each element (box on the upper right). See Table S1 for taxa information. Phylogeny inferred by the Maximum Likelihood method (LG+G+I). The same phylogeny, with the numerical support values represented, is shown on Figure S1.
Ijms 19 03079 g002
Figure 3. Non-metric multidimensional scaling (NMDS) of evolutionary divergence between RCRE domains. (A) Ordinations with taxa represented by their sequence abbreviations. Colors indicate the different classes of mobile genetic elements. (B) Same ordinations of (A), with colors indicating the tyrosine group of each taxa. The scaling represents euclidean distances for two dimensions (stress: 0.26382).
Figure 3. Non-metric multidimensional scaling (NMDS) of evolutionary divergence between RCRE domains. (A) Ordinations with taxa represented by their sequence abbreviations. Colors indicate the different classes of mobile genetic elements. (B) Same ordinations of (A), with colors indicating the tyrosine group of each taxa. The scaling represents euclidean distances for two dimensions (stress: 0.26382).
Ijms 19 03079 g003
Figure 4. Proposed scenario for the origin of Helitrons and other RCR elements. Arrows represent putative pathways to explain the observed relationship among RCR elements. Virion images were obtained from VIPERdb (http://viperdb.scripps.edu) [31].
Figure 4. Proposed scenario for the origin of Helitrons and other RCR elements. Arrows represent putative pathways to explain the observed relationship among RCR elements. Virion images were obtained from VIPERdb (http://viperdb.scripps.edu) [31].
Ijms 19 03079 g004

Share and Cite

MDPI and ACS Style

Heringer, P.; Kuhn, G.C.S. Exploring the Remote Ties between Helitron Transposases and Other Rolling-Circle Replication Proteins. Int. J. Mol. Sci. 2018, 19, 3079. https://doi.org/10.3390/ijms19103079

AMA Style

Heringer P, Kuhn GCS. Exploring the Remote Ties between Helitron Transposases and Other Rolling-Circle Replication Proteins. International Journal of Molecular Sciences. 2018; 19(10):3079. https://doi.org/10.3390/ijms19103079

Chicago/Turabian Style

Heringer, Pedro, and Gustavo C. S. Kuhn. 2018. "Exploring the Remote Ties between Helitron Transposases and Other Rolling-Circle Replication Proteins" International Journal of Molecular Sciences 19, no. 10: 3079. https://doi.org/10.3390/ijms19103079

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop