Article

Pan-Genome Analysis of Brazilian Lineage A Amoebal Mimiviruses

1 Instituto de Ciências Biológicas, Departamento de Microbiologia, Laboratório de Vírus, Universidade Federal de Minas Gerais, Belo Horizonte, 31270-901 Minas Gerais, Brazil
2 Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes (URMITE) UM63 CNRS 7278 IRD 198 INSERM U1095, Aix-Marseille Univ., 13385 Marseille, France
3 Department of Biochemistry, Faculty of Science, King Abdulaziz University, 21589 Jeddah, Saudi Arabia
4 Centro de Ciências Biológicas, Departamento de Microbiologia e Parasitologia, Laboratório de Virologia Aplicada, Universidade Federal de Santa Catarina, Florianópolis, 88040-900 Santa Catarina, Brazil
5 Institut Hospitalo-Universitaire (IHU) Méditerranée Infection, Assistance Publique-Hôpitaux de Marseille, Centre Hospitalo-Universitaire Timone, Pôle des Maladies Infectieuses et Tropicales Clinique et Biologique, Fédération de Bactériologie-Hygiène-Virologie, 13385 Marseille, France
* Author to whom correspondence should be addressed.
Viruses 2015, 7(7), 3483-3499; https://doi.org/10.3390/v7072782
Submission received: 3 April 2015 / Revised: 15 June 2015 / Accepted: 18 June 2015 / Published: 26 June 2015
(This article belongs to the Section Viruses of Plants, Fungi and Protozoa)

Abstract

Since the recent discovery of Samba virus, the first representative of the family Mimiviridae from Brazil, prospecting for mimiviruses has been conducted in different environmental conditions in Brazil. Recently, using Acanthamoeba sp., we isolated three new mimiviruses, all of lineage A of amoebal mimiviruses: Kroon virus from urban lake water; Amazonia virus from the Brazilian Amazon river; and Oyster virus from farmed oysters. The aims of this work were to sequence and analyze the genomes of these new Brazilian mimiviruses (mimi-BR) and to update the analysis of the Samba virus genome. The genomes of Samba virus, Amazonia virus and Oyster virus were 97%–99% similar, whereas Kroon virus had a lower similarity (90%–91%) with the other mimi-BR. A total of 3877 proteins encoded by mimi-BR were grouped into 974 orthologous clusters. In addition, we identified three new ORFans in the Kroon virus genome. Additional work is needed to expand our knowledge of the diversity of mimiviruses from Brazil, including whether and why lineage A predominates among amoebal mimiviruses in the Brazilian environment.

1. Introduction

Acanthamoeba polyphaga mimivirus (APMV) was isolated in 1992. In 2003, it was identified as a nucleocytoplasmic large DNA virus (NCLDV) and the first member of the family Mimiviridae [1]. Before that identification, such morphological complexity, virion size (>400 nm), genome length (≈1.2 million base pairs (bp)) and extensive gene content (979 protein coding genes) had not been attributed to any virus. Moreover, functional studies revealed the presence of proteins unknown in other viral genomes, such as aminoacyl tRNA-synthetases and translation factors [2]. Since then, several studies have been conducted to determine how widespread and diverse this new viral family is. Currently, over 50 mimiviruses have been isolated from cooling tower water, freshwater and saltwater, soil, leech and clinical samples collected in England, France, USA, Chile, Brazil and Tunisia [3,4,5,6,7,8,9,10,11]. Additionally, DNA from these viruses was identified in the Sargasso Sea and other ocean samples during metagenomic studies [12,13,14].
The family Mimiviridae can be divided into a main group (Group 1), composed of mimiviruses infecting amoebal species, and a distantly related group (Group 2) comprising Cafeteria roenbergensis virus (CroV; which infects a marine heterotrophic bi-flagellate) [15], organic lake phycodnaviruses and Phaeocystis globosa virus, as recently refined by phylogenomic studies [16]. Mimiviruses infecting amoebal hosts can be divided into three lineages (A [Mimivirus], B [Moumouvirus] and C [Megavirus chilensis]) according to phylogenies based on conserved core genes including, for example, family B DNA polymerase and ribonucleotide reductase encoding genes [8,15,17,18]. Unexpectedly, the discovery of mimivirus allowed the detection of a new type of virus able to infect giant viruses. These new viral entities were named virophages, by analogy with bacteriophages [4]. Of significant importance is the growing body of evidence for the presence of mimiviruses in humans and the recent isolation of two such giant viruses from pneumonia patients [11,19].
Recently, a new member of lineage A of mimiviruses, named Samba virus (SMBV), was isolated in the Brazilian Amazon forest from a water sample collected in 2011 from the surface of the Negro River (3°6′ S 60°1′ W) [8]. A new virophage, named Rio Negro virus (RNV), was also isolated in association with SMBV. SMBV was biologically and molecularly characterized [8]. Its genome was partially (~98%) sequenced, revealing a G+C content of 27.0%. A total of 938 ORFs, ranging in size from 150 to 8835 bp, with an average size of 1002 bp, were predicted. The RNV characterization is in progress (unpublished data), but previous data indicate that it is very similar to the Sputnik virophage. SMBV was the first mimivirus ever isolated in Brazil, which raised the question of the diversity, distribution and specific features of giant viruses from different Brazilian ecosystems. Since then, our Brazilian team has been engaged in prospective studies for giant viruses in Brazil. These studies led to the detection of mimivirus DNA and neutralizing antibodies in serum samples of domestic and wild mammals from the Amazon region [20]. In addition, we isolated three new Brazilian mimiviruses from different environmental sources in different Brazilian regions (Figure 1). The present work describes the complete genome sequencing of these three viruses and a pan-genome analysis of Brazilian mimiviruses. Moreover, we performed an updated analysis of the SMBV genome and of the phylogenetic clustering of Brazilian mimiviruses among other mimiviruses within the NCLDV group, or proposed order Megavirales [21].
Figure 1. Geographical location, coordinates, and biome representation for each Brazilian mimivirus isolate.

2. Materials and Methods

2.1. Mimivirus Origin and Isolation

The viruses described in this work were isolated from the following sources: (A) Amazonia virus (AMAV), from a water sample collected from the Negro River, Amazon forest, in 2011; (B) Kroon virus (KROV), from a water sample collected from an urban lake in Lagoa Santa city, Minas Gerais state, in 2012; and (C) Oyster virus (OYTV), from oyster samples farmed on the Atlantic coast, Florianópolis, Santa Catarina state, in 2013 [22] (Figure 1). The samples were cultured in A. castellanii monolayers and the isolated viruses were grown and purified as described previously [8].

2.2. Genome Sequencing and Assembly

The genomes were sequenced using the Illumina MiSeq instrument (Illumina Inc., San Diego, CA, USA) with the paired-end application. The sequence reads were assembled de novo using ABySS software [23] and the resulting contigs were ordered by the Python-based CONTIGuator.py software [24]. Concurrently, the CLC_Bio software [25] was used for mapping-based genome assembly using the APMV genome (NC_014649.1) as reference. Draft genomes obtained by both strategies were mapped back to check the read assembly and to close gaps. The best assembly for each genome was kept, and the few remaining small gaps were closed by Sanger sequencing.

2.3. Genome Annotation

The gene predictions were performed using RAST (Rapid Annotation using Subsystem Technology) [26] and GeneMarkS [27] tools. Transfer RNA (tRNA) sequences were identified using the tRNAscan-SE tool [28]. The functional annotations were inferred by BLAST searches against the GenBank NCBI non-redundant protein sequence database (nr) (e-value < 1 × 10⁻³), the set of clusters of orthologous groups of proteins (COGs) of the NCLDVs (named NCVOGs) [16] and by searching specialized databases through the Blast2GO platform [29]. Finally, the genome annotation was manually revised and curated. The predicted ORFs that were smaller than 100 amino acids and had no hit in any database were ruled out. The ORFs longer than 100 amino acids without hits in any database (ORFans) were kept.
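The curation rule at the end of this paragraph can be illustrated with a minimal sketch. The `PredictedOrf` record and its field names are hypothetical, and the text does not say how an ORF of exactly 100 amino acids without hits was handled; the sketch assumes it is kept.

```python
from dataclasses import dataclass

# Length cut-off stated in the text (amino acids).
MIN_ORFAN_LENGTH_AA = 100


@dataclass
class PredictedOrf:
    """Hypothetical record for one predicted ORF (field names are illustrative)."""
    name: str
    length_aa: int
    has_database_hit: bool


def classify_orf(orf: PredictedOrf) -> str:
    """Apply the curation rule described in the text.

    - any ORF with a database hit is treated as annotated;
    - an ORF without hits and shorter than the cut-off is discarded;
    - an ORF without hits at or above the cut-off is kept as an ORFan
      (keeping the exact-100 case is an assumption of this sketch).
    """
    if orf.has_database_hit:
        return "annotated"
    if orf.length_aa < MIN_ORFAN_LENGTH_AA:
        return "discarded"
    return "ORFan"


if __name__ == "__main__":
    examples = [
        PredictedOrf("orf_a", 250, True),
        PredictedOrf("orf_b", 60, False),
        PredictedOrf("orf_c", 180, False),
    ]
    for orf in examples:
        print(orf.name, classify_orf(orf))
```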
ORFan transcripts were detected by Sanger sequencing after extraction from a culture supernatant with the RNeasy Mini kit (Qiagen, Hilden, Germany). The TURBO DNA-Free kit (Ambion, Austin, TX, USA) was used to remove contaminating DNA from RNA preparations, as checked by performing PCR with the HotStar Taq DNA polymerase (Qiagen) in the absence of a reverse transcription step. Then, the RNeasy MinElute Cleanup kit (Qiagen) and the SuperScript VILO cDNA synthesis kit (Invitrogen, Carlsbad, CA, USA) were used. Finally, this cDNA template was amplified by PCR with the HotStar Taq DNA polymerase (Qiagen) and then sequenced using the following primers: K22A-F: 5′-TTTCAAACAAATGCAACGTAAAGT; K22A-R: 5′-ACGCTTATTTGAAAACAAACCAAA; K22B-F: 5′-CAATTCTTTCAAACAAATGCAACG; K22B-R: 5′-ACGCTTATTTGAAAACAAACCAAAA; K61A-F: 5′-AGCGCCATGGTGTCCAATAA; K61A-R: 5′-CGAGTGTATGGACAACCGGTAA; K61B-F: 5′-ACCGGTTGTCCATACACTCG; K61B-R: 5′-ATTTCAACCGGATTATTCTTGGG; K933A-F: 5′-ACCATGATGGATATCCGGTGG; K933A-R: 5′-TCAGGATGGATATTTGCCGTGT; K933B-F: 5′-TGGATATCCGGTGGATGAAGA; K933B-R: 5′-ATCAGGATGGATATTTGCCGT. In addition, we evaluated the expression of viral tRNA molecules in cells infected with Brazilian mimiviruses. Total RNA was extracted using the RNeasy kit (Qiagen), and reverse transcription was performed using the MMLV reverse transcriptase (Promega, Madison, WI, USA), following the manufacturers' recommendations. The cDNA was used in qPCR reactions using specific primers (Leucyl-tRNA: forward 5′-GGGATTCGAACCCACGACAT, reverse 5′-ATAAGCAAAGGTGGCGGAGT; Histidyl-tRNA: forward 5′-TTAGTGGTAGAACTACTGTTTGTGG, reverse 5′-TTTTCAAAAATGACCCGTACAGGAA; Cysteinyl-tRNA: forward 5′-ACAGTCAACTGGATCGTTAGC, reverse 5′-AGGATCGTATCAGAATTGAACTGA; Tryptophanyl-tRNA: forward 5′-GTGCAACAATAGACCTGTTAGTTTA, reverse 5′-ACCGGAATCGAACCAGTATCA) with SYBR Green PCR Master Mix (Applied Biosystems, Foster City, CA, USA). Reactions were carried out on the StepOne instrument (Applied Biosystems) and optimized. Relative gene expression was evaluated using the 2^−ΔΔCt method [30] and normalized to 18S ribosomal RNA and viral RNA helicase mRNA.
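The relative expression calculation named here [30] can be sketched as follows. The function name, the argument names and the Ct values are hypothetical. The choice of calibrator (control) condition is not stated in the text and is an assumption of the sketch, as is an amplification efficiency close to 100% for target and reference.

```python
def relative_expression(ct_target_sample: float,
                        ct_reference_sample: float,
                        ct_target_control: float,
                        ct_reference_control: float) -> float:
    """Fold change of a target transcript by the 2^-(ddCt) calculation.

    dCt  = Ct(target) - Ct(reference), computed separately per condition
    ddCt = dCt(sample) - dCt(control)
    fold = 2 ** (-ddCt)

    Assumes roughly 100% amplification efficiency for both amplicons.
    """
    delta_ct_sample = ct_target_sample - ct_reference_sample
    delta_ct_control = ct_target_control - ct_reference_control
    delta_delta_ct = delta_ct_sample - delta_ct_control
    return 2.0 ** (-delta_delta_ct)


if __name__ == "__main__":
    # Invented Ct values, for illustration only:
    # the target amplifies 3 cycles earlier in the sample than in the control,
    # while the reference gene is unchanged, so the fold change is 2**3.
    fold = relative_expression(22.0, 15.0, 25.0, 15.0)
    print(fold)  # 8.0
```

A fold change above 1 indicates higher expression in the sample than in the calibrator; a value of exactly 1 indicates no difference after normalization.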

2.4. Comparative Genomic Analysis

The co-linearity between Brazilian mimiviruses was checked using the MUMmer 3.23 [31] and MAUVE [32] programs. The Proteinortho tool [33] was used to define the bona fide orthologous genes shared among Brazilian mimiviruses and representatives of amoebal mimiviruses of lineages A–C, using the reciprocal best hits strategy with thresholds of 1 × 10⁻³ for the BLASTp e-value, 30% for amino acid sequence identity and 70% for amino acid sequence coverage. The OrthoMCL tool [34] was used to identify the paralog families among Brazilian mimivirus proteins.
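A plain reciprocal best hits filter with the thresholds quoted above can be sketched as below. This is not Proteinortho's implementation; the `Hit` record, the use of bit score to rank hits and the toy protein names are assumptions made for illustration.

```python
from typing import Dict, Iterable, List, NamedTuple, Tuple


class Hit(NamedTuple):
    """Hypothetical summary of one BLASTp alignment."""
    query: str
    subject: str
    evalue: float
    identity: float   # percent identity
    coverage: float   # percent of the sequence covered
    bitscore: float


def best_hits(hits: Iterable[Hit],
              max_evalue: float = 1e-3,
              min_identity: float = 30.0,
              min_coverage: float = 70.0) -> Dict[str, str]:
    """Return the top-scoring subject per query among hits passing the thresholds."""
    best: Dict[str, Hit] = {}
    for hit in hits:
        if (hit.evalue > max_evalue
                or hit.identity < min_identity
                or hit.coverage < min_coverage):
            continue
        current = best.get(hit.query)
        if current is None or hit.bitscore > current.bitscore:
            best[hit.query] = hit
    return {query: hit.subject for query, hit in best.items()}


def reciprocal_best_hits(a_vs_b: Iterable[Hit],
                         b_vs_a: Iterable[Hit]) -> List[Tuple[str, str]]:
    """Pairs (a, b) where b is the best hit of a and a is the best hit of b."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


if __name__ == "__main__":
    a_vs_b = [
        Hit("A1", "B1", 1e-50, 80.0, 95.0, 300.0),
        Hit("A1", "B2", 1e-10, 40.0, 80.0, 90.0),
        Hit("A2", "B2", 1e-5, 25.0, 90.0, 60.0),   # fails the identity threshold
    ]
    b_vs_a = [
        Hit("B1", "A1", 1e-50, 80.0, 95.0, 300.0),
        Hit("B2", "A1", 1e-10, 40.0, 80.0, 90.0),
    ]
    print(reciprocal_best_hits(a_vs_b, b_vs_a))  # [('A1', 'B1')]
```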

2.5. Analysis of the Pan-Genome and Core Genome of Brazilian Mimiviruses

To estimate the size of the pan-genome of Brazilian mimiviruses, their predicted proteins were clustered by the BLASTclust program using an amino acid sequence identity of 30% and sequence coverage of 70% as the parameters. We also described the pan-genome size evolution by stepwise inclusion of each new virus annotation in the pairwise comparisons of the gene contents. The orthologous protein clusters encompassing at least one protein from each virus, obtained by the BLASTclust and Proteinortho5.pl programs, were taken into account to determine the core genome and strict core genome, respectively, of the mimivirus lineage A group.
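The stepwise accumulation described above can be sketched as follows, assuming each virus has already been reduced to the set of protein cluster identifiers it contributes to (the clustering itself, done here with BLASTclust or Proteinortho, is not reproduced). The function name and the toy data are hypothetical.

```python
from typing import List, Optional, Sequence, Set, Tuple


def accumulate_pan_and_core(
    genomes: Sequence[Tuple[str, Set[int]]]
) -> List[Tuple[str, int, int]]:
    """Track pan-genome and core-genome sizes as genomes are added in order.

    The pan-genome is the union of all clusters seen so far; the core genome
    is the intersection, i.e. clusters with at least one protein from every
    genome included so far.
    """
    pan: Set[int] = set()
    core: Optional[Set[int]] = None
    curve: List[Tuple[str, int, int]] = []
    for name, clusters in genomes:
        pan |= clusters
        core = set(clusters) if core is None else core & clusters
        curve.append((name, len(pan), len(core)))
    return curve


if __name__ == "__main__":
    # Toy cluster identifiers, not real data.
    toy = [
        ("virus_1", {1, 2, 3}),
        ("virus_2", {2, 3, 4}),
        ("virus_3", {3, 4, 5}),
    ]
    for name, pan_size, core_size in accumulate_pan_and_core(toy):
        print(name, pan_size, core_size)
```

With this definition the pan-genome size can only grow or stay constant as genomes are added, and the core genome size can only shrink or stay constant. Both sizes at the final step are independent of the order of inclusion, although the intermediate points of the curves are not.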

2.6. Phylogeny

Protein sequences of Brazilian mimiviruses were aligned with those from the other giant viruses previously described in GenBank using the ClustalW program [35]. After visual analyses and manual curation of protein sequence alignments, the phylogeny reconstruction was performed using the maximum likelihood method implemented by MEGA5 software [36].

3. Results

3.1. Genome Annotation of Brazilian Mimiviruses

The genomes of AMAV (GenBank Accession no. KM982403), KROV (KM982402) and OYTV (KM982401) are double-stranded DNA molecules of 1,179,579, 1,221,932 and 1,200,220 bp, respectively. KROV presents the largest DNA genome among lineage A amoebal mimiviruses. Its genome is ~21, 42 and 40 kb larger than those of OYTV, AMAV and SMBV, respectively, and ~30 kb and 40 kb larger than those of Mamavirus and APMV, respectively. These genomes have a mean G+C content of 27%, similar to other mimiviruses. The genomes of AMAV and OYTV contain six regions encoding the tRNA molecules for leucine (3 sequences), histidine, cysteine and tryptophan, whereas the genome of KROV lacks the sequence encoding the tRNA for tryptophan. The assessment of tRNA gene expression was congruent with the genomic analyses: six tRNA molecules were detected in SMBV and OYTV, whereas no tryptophanyl-tRNA expression was detected in KROV.
We predicted 979 ORFs for AMAV, 944 for KROV and 948 for OYTV, with ORF sizes ranging from 113 bp to 8835 bp (mean, 1046 bp). For all genomes, the predicted protein-encoding genes were evenly distributed on both positive and negative DNA strands (in ~51% of cases on the negative strand). A total of 35 clusters consisting of 125 paralogous proteins were identified in the KROV genome, 54 clusters consisting of 174 paralogous proteins were identified in the OYTV genome and 55 clusters consisting of 176 paralogous proteins were identified in the AMAV genome (Table 1). Although the KROV genome is the largest among the lineage A amoebal mimiviruses, it was predicted to encode fewer ORFs, including fewer paralogous proteins. Thus, approximately 13% of the KROV ORFs were paralogs, whereas this proportion was 18% for ORFs encoded by the OYTV, AMAV and SMBV genomes.
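The paralog proportions quoted above follow directly from the counts given in this paragraph. A short sketch of the arithmetic, with a hypothetical helper name:

```python
def paralog_percent(paralogous_proteins: int, total_orfs: int) -> float:
    """Percentage of predicted ORFs that belong to a paralog cluster."""
    return 100.0 * paralogous_proteins / total_orfs


# (paralogous proteins, predicted ORFs) as reported in the text.
counts = {
    "KROV": (125, 944),
    "OYTV": (174, 948),
    "AMAV": (176, 979),
}

if __name__ == "__main__":
    for virus, (paralogs, orfs) in counts.items():
        print(f"{virus}: {paralog_percent(paralogs, orfs):.1f}%")
```

The three values round to about 13% for KROV and about 18% for OYTV and AMAV, matching the proportions stated in the text.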
Table 1. Summary of results for genome annotation from Brazilian mimiviruses.
| Brazilian Mimivirus | Viral Source/Year of Isolation | Genome Size (bp) | G+C % | Number of ORFs | Number of ORFans | Number of APMV Orthologs | Number of Paralogous Proteins/Clusters | tRNAs |
|---|---|---|---|---|---|---|---|---|
| Samba virus | Negro River water, Amazon forest/2011 | 1,181,380 | 28.0 | 971 | 0 | 916 | 178/58 | 6 |
| Amazonia virus | Negro River water, Amazon forest/2011 | 1,179,579 | 27.9 | 979 | 1 * | 905 | 176/55 | 6 |
| Oyster virus | Oyster farmed on the Atlantic coast, Florianópolis, SC/2013 | 1,200,220 | 27.9 | 948 | 1 * | 864 | 174/54 | 6 |
| Kroon virus | Urban lake water, Minas Gerais state/2012 | 1,221,932 | 27.5 | 944 | 3 | 769 | 125/35 | 5 |

APMV, Acanthamoeba polyphaga mimivirus; bp, base pair; ORF, open reading frame; SC, Santa Catarina state. * ORFans with no hit with the following criteria: e-value: <1 × 10⁻³; similarity: >30%; coverage: >50%.
The functional annotation revealed that ~50% of the ORFs in Brazilian mimivirus genomes were hypothetical proteins, i.e., without a defined function. For all the Brazilian mimivirus genomes, the best hits were most frequently proteins from APMV, followed by those from Mamavirus (Figure 2). These viruses belong to lineage A of the amoebal mimiviruses. One ORF in the AMAV genome, three ORFs in the KROV genome and three ORFs in the OYTV genome had no hit against the NCBI nr protein sequence database or the Blast2GO platform. To confirm whether these genes were bona fide ORFans, their nucleotide sequences were submitted to a BLASTn search against the GenBank nucleotide sequence database (nt) of the NCBI. For AMAV, the putative ORFan R931 had hits with other lineage A mimiviruses (coverage of 49%; similarity of 100%; e-value of 1 × 10⁻²⁰ or less). In contrast, the three putative KROV ORFans (L22, L61 and R933 genes) had no significant hits. Transcripts for these three ORFans were detected by Sanger sequencing. For OYTV’s putative ORFans, significant hits against lineage A mimiviruses were only obtained for R123 and L733 genes. For ORFan L25, a hit with an e-value of 0.63, coverage of 83% and identity of 26% was observed against Lausannevirus, a giant amoebal virus from the family Marseilleviridae. Whether these putative ORFans were shared by Brazilian mimiviruses was determined by BLASTn analyses. The putative ORFan of AMAV was shared by the KROV and OYTV genomes, whereas ORFans from KROV were not found in any other Brazilian mimivirus genomes. The OYTV putative ORFans L733 and R123 were shared by KROV and AMAV, whereas ORFan L25 was exclusive to the OYTV genome.
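As an illustration of how the hit criteria given in the footnote of Table 1 would treat the hits reported above, the following sketch assumes that the three criteria are applied together (a hit counts only if all three are met). That reading, the function name and the parameter names are assumptions, not a description of the authors' pipeline.

```python
def is_significant_hit(evalue: float,
                       similarity_pct: float,
                       coverage_pct: float,
                       max_evalue: float = 1e-3,
                       min_similarity: float = 30.0,
                       min_coverage: float = 50.0) -> bool:
    """True only if the hit passes all three Table 1 footnote criteria."""
    return (evalue < max_evalue
            and similarity_pct > min_similarity
            and coverage_pct > min_coverage)


if __name__ == "__main__":
    # AMAV R931: e-value 1e-20, similarity 100%, coverage 49% (from the text).
    # The coverage falls just below the 50% criterion.
    print(is_significant_hit(1e-20, 100.0, 49.0))  # False

    # OYTV L25 versus Lausannevirus: e-value 0.63, coverage 83%.
    # The text reports 26% identity rather than similarity; it is passed here
    # only for illustration, since the hit already fails on e-value.
    print(is_significant_hit(0.63, 26.0, 83.0))    # False
```

Under this reading, the R931 hit is excluded by its 49% coverage despite a very low e-value, which would be consistent with AMAV still being listed with one starred ORFan in Table 1.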

3.2. Samba Virus Genome Update

For this new genome announcement, the SMBV genome was re-sequenced using Illumina Deep Sequencing Technology. The assembled genome obtained consisted of a double-stranded DNA molecule of 1,181,380 bp with a G+C content of 28%, similar to that of other mimivirus isolates. Although shorter (30,300 bp less) than in the first report, the SMBV genome was predicted to encode 971 ORFs (33 more ORFs than in the first report), ranging in size from 113 to 8834 bp, with a mean size of 1068 bp. The ORF density was 0.82 genes/kb, with a coding density of 87.9%. The genome of SMBV was predicted to encode 142 cellular metabolism-associated proteins and up to 180 proteins involved in general metabolic processes, in addition to some regulatory proteins, suggesting a certain level of autonomy for the virus, even as an obligate intracellular parasite. Among the predicted ORFs, 45% were hypothetical proteins, and 85% showed a best hit against APMV. We identified four sequences encoding aminoacyl-tRNA synthetases, a landmark of mimiviruses because no other virus encodes such proteins. Moreover, we were able to identify six tRNA sequences: three cognate to leucine, and one each cognate to histidine, cysteine and tryptophan. The G+C content of these tRNA sequences was 48.6%, far greater than that for the entire genome.
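The ORF density quoted above can be reproduced from the ORF count and genome length given in this paragraph. The sketch below also shows a generic G+C percentage function; both helper names are hypothetical and the example sequence is invented.

```python
def orf_density_per_kb(orf_count: int, genome_length_bp: int) -> float:
    """Number of predicted ORFs per kilobase of genome."""
    return orf_count / (genome_length_bp / 1000.0)


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (case-insensitive)."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # SMBV: 971 predicted ORFs in 1,181,380 bp (values from the text).
    print(f"{orf_density_per_kb(971, 1_181_380):.2f} genes/kb")  # 0.82 genes/kb

    # Invented short sequence, for illustration only.
    print(gc_percent("ATGC"))  # 50.0
```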
All the predicted ORFs matched orthologs in the NCVOG database with an average similarity of 93.7%; thus, new ORFans were not predicted. A total of 58 clusters consisting of 179 paralogous proteins were identified in the SMBV genome, similar to that detected in the APMV and Hirudovirus genomes. The reciprocal best hit analysis identified 917 bona fide orthologous proteins between SMBV and APMV. The same analysis identified 339 bona fide orthologous proteins shared with Moumouvirus (lineage B of amoebal mimiviruses) and 485 shared with Megavirus chilensis (lineage C).
Figure 2. Graphical distribution of the best BLAST hits for Brazilian mimivirus gene contents. The analysis was performed using BLASTp algorithm searching with the predicted ORFs from Brazilian mimiviruses against the NCBI GenBank non-redundant (nr) protein sequence database using the java-based free software Blast2GO [29]. (A) Oyster virus; (B) Amazonia virus; (C) Samba virus; and (D) Kroon virus. The Acanthamoeba polyphaga mimivirus was the predominant target for hits from all Brazilian mimiviruses (896, 588 and 577 best hits to AMAV, OYTV and KROV, respectively), with an average of 76% of the hits.

3.3. Comparative Genomics of Lineage A Mimiviruses

The genome sequence of SMBV was most closely related to APMV, with a similarity level of 99% (and a coverage of 100%), followed by AMAV (similarity: 99%, coverage: 99%), OYTV (similarity: 99%, coverage: 97%) and KROV (similarity: 94%, coverage: 90%). The same tendency was observed among Brazilian mimiviruses and other lineage A mimiviruses, such as Mamavirus (JF801956.1), Terra2 virus (NC_023639.1) and Hirudovirus (KF493731.1) (data not shown). Among the mimi-BR, SMBV, AMAV and OYTV genomes had similarity values ranging from 97% to 99%, whereas KROV had similarity values of 90%–91% with the other mimi-BR.
The gene synteny analysis showed that mimi-BR genomes share a similar architecture with those from the other lineage A mimiviruses, with some reassortments at their extremities, primarily compared to the Terra2 virus and Hirudovirus (Figure 3). While analyzing the mimi-BR alignment, we identified short fragments at the 5′ extremity of the SMBV genome that were related to fragments at the 3′ extremities of the OYTV and AMAV genomes but inverted in orientation. Furthermore, we observed the presence of a large region (~30,000 bp) at the 5′ extremity of the KROV genome that had no similarity with any region in the SMBV, OYTV and AMAV genomes. This singular region in the 5′ extremity of the KROV genome was predicted to encode 25 ORFs, of which 16 (66%) encode ankyrin-like proteins (Figure 3). These ORFs seem to comprise fragmented and/or intron-containing genes, in addition to some duplicated genes, resembling what was described previously at the 5′ extremity of the Mamavirus genome [37]. KROV also presents differences in the structure of some genes, including the major capsid protein-encoding gene (APMV L425), which might be involved in a particular splicing process.
Figure 3. Genome alignment of lineage A amoebal mimiviruses showing their genome architecture and synteny. The genome alignment and schematic were obtained using the Mauve software package [32].
For all mimi-BR genomes, the paralogous genes were predominantly distributed towards both extremities of each genome (59% on average), with few occurrences in the central regions (7% on average). In the central regions, we nevertheless observed genes flanked by paralogs, as in the KROV genome. Although the majority of these paralogs were co-localized, in some cases, genes from paralogous gene pairs were found at different genome extremities and in inverted orientations, as previously described for Mimivirus [38].
The reciprocal best hits (RBH) analyses of protein contents corroborated the genome synteny analyses. SMBV shared 950, 892 and 781 RBHs with AMAV, OYTV and KROV, respectively. AMAV and OYTV shared 886 RBHs, whereas KROV shared 784 and 785 RBHs with AMAV and OYTV, respectively. Moreover, these orthologous genes shared a high level of colinearity when compared with their counterparts from other mimi-BR genomes (Figure 4).
Figure 4. Genomic dot-plots based on a BLASTp analysis for pairs of Brazilian mimiviruses. Each circle shows a pair of orthologous proteins found in each pair of Brazilian mimiviruses. The diameters of the bubbles are proportional to the BLASTp similarity scores, and their positions are relative to the position of each pair in the genome.

3.4. Pan-Genome and Core Genome Analysis

Using the BLASTclust program, we noted that the pan-genome size increased with each newly annotated mimivirus genome, reaching a final total of 1129 clusters, which we identified as the pan-genome of the lineage A mimiviruses infecting amoeba (Figure 5). Taking into account only the pan-genome of the Brazilian mimiviruses, a total of 3877 proteins were grouped into 974 clusters of orthologous proteins. The size of the core genome of mimiviruses of lineage A, including Brazilian mimiviruses, was calculated using all clusters of orthologous proteins and bona fide orthologous proteins created by the BLASTclust and Proteinortho5.pl programs, respectively. These two approaches delineated a core genome comprising 597–644 genes and produced similar curves for the size of the core genome of lineage A amoebal mimiviruses, with the number of orthologs shared by all of these viruses decreasing with each new genome annotation.
Figure 5. Evolution of the pan-genome size of amoebal mimiviruses of lineage A.

3.5. Phylogeny

The phylogenetic tree based on family B DNA polymerase showed that all Brazilian mimiviruses were clustered with other lineage A mimiviruses in the major group of the family Mimiviridae (Figure 6). It is worth mentioning that OYTV and AMAV were more closely related to each other than to other lineage A members, as were SMBV and APMV. Congruent with results from comparative genomics, KROV was more divergent from the other Brazilian mimiviruses, although robustly clustered with other lineage A mimiviruses.
Figure 6. Phylogenetic reconstruction of Brazilian mimiviruses and other megaviruses based on family B DNA polymerase. A phylogenetic tree was generated using MEGA5 software with the Maximum likelihood method. The percentage of trees using 1000 bootstrap replicates in which the associated taxa clustered together is shown next to the branches. Brazilian mimiviruses, highlighted by red markers, are clustered with members of lineage A amoebal mimiviruses. The circles indicate new viruses; the triangle indicates the previously reported Brazilian mimivirus. For each sequence, the GenBank gene identification numbers are indicated. The branches were identified by brackets and family names. Branches corresponding to lineages of amoebal mimiviruses are differentiated by colors. Currently unclassified viruses are highlighted in pink.

4. Discussion

Since the description of the original strain of Mimivirus in 2003, eight mimiviruses infecting Acanthamoeba spp. have been isolated and biologically and molecularly characterized. Here, we have described the genomic characterization of three new amoebal mimiviruses isolated from different environments in Brazil and the re-analysis of the genome of SMBV, the first Mimiviridae isolate from this country. In particular, we studied the gene arsenal and genome architecture of these viruses and their relationship with other members of this viral family. As for other mimivirus genomes, the majority of the ORFs in Brazilian mimivirus genomes were ORFans or putative genes encoding hypothetical proteins, and their annotation was based on similarity with previously described mimiviral genomes. Thus, most of these ORFans are family ORFans, which means that they have no known ortholog in public sequence databases apart from other mimiviral genomes [39]. Hence, the function, significance and evolutionary relationship remain to be deciphered for a significant proportion of putative genes in these mimiviral genomes.
As previously described [8], the SMBV genome is closely related to APMV and has high similarity with other Brazilian mimiviruses such as OYTV and AMAV. The analysis of the clusters of orthologous proteins among these three viruses particularly highlights their proximity. However, although more closely related to Brazilian mimiviruses than to other lineage A mimiviruses, the KROV isolate showed singular features compared to SMBV, OYTV and AMAV. For example, this virus has the largest genome among amoebal mimiviruses of lineage A. Nonetheless, the KROV genome encodes fewer ORFs, including fewer paralogs, than other lineage A amoebal mimiviruses, yet it increases the pan-genome size for these mimiviruses. In addition, the KROV genome lacks the tRNA cognate to tryptophan and is predicted to encode three new genuine ORFans, unknown in any other organism.
It has been suggested that mimiviruses from different geographical areas may be closely related [6]. Examples include Megavirus chilensis, isolated from coastal seawater in Chile, and Courdo11 virus, isolated from freshwater samples in Southeastern France [5,40]. Brazilian ecosystems have a high level of complexity, and the majority of these ecosystems remain to be investigated. Previous results indicated that mimiviruses are common in aquatic environments in Brazil, and the isolation and analysis of new mimiviruses from this country contribute to our knowledge of the diversity and evolution of this viral family. The Brazilian mimiviruses studied here were isolated from different environments, including fresh- and saltwater, but were all clustered in the same lineage of amoebal mimiviruses that includes Mimivirus, Mamavirus and Terra2 viruses isolated in England and France from freshwater or soil [2,4,41]. The presence of mimiviruses in animals and humans is also of interest. Recently, our Brazilian team detected mimiviruses in serum samples from wild and domestic animals [20], while other studies have suggested a role for mimiviruses in pneumonia, which was strengthened by the recent isolation of two mimiviruses in patients with atypical unexplained pneumonia [11,19]. Overall, the exploration of the diversity of mimiviruses in different environments may help to solve many questions about the mimivirus life cycle and their clinical importance.

Acknowledgments

We thank our colleagues from Gepvig and Laboratório de Vírus of Universidade Federal de Minas Gerais and the bioinformatics team of URMITE—Aix Marseille Université for their excellent technical support. We would also like to thank CNPq, CAPES, FAPEMIG and Pro-Reitoria de Pesquisa da Universidade Federal de Minas Gerais (PRPq-UFMG) for financial support.

Author Contributions

Fabio P. Dornas, Kétyllen R. Andrade, Paulo V.M. Boratto and Mariana R. Pilotto performed virus isolation; Catherine Robert performed genome sequencing; Felipe L. Assis and Samia Benamar performed genomic analyses; Leena Bajrai performed ORFan expression experiments; Felipe L. Assis, Jonatas S. Abrahao, Erna G. Kroon, Bernard La Scola and Philippe Colson designed the experiments and analyzed the data. Felipe L. Assis, Jonatas S. Abrahao and Philippe Colson wrote the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. La Scola, B.; Audic, S.; Robert, C.; Jungang, L.; de Lamballerie, X.; Drancourt, M.; Birtles, R.; Claverie, J.M.; Raoult, D. A giant virus in amoebae. Science 2003, 299, 2033.
  2. Raoult, D.; Audic, S.; Robert, C.; Abergel, C.; Renesto, P.; Ogata, H.; La Scola, B.; Suzan, M.; Claverie, J.M. The 1.2-megabase genome sequence of Mimivirus. Science 2004, 306, 1344–1350.
  3. Pagnier, I.; Reteno, D.G.; Saadi, H.; Boughalmi, M.; Gaia, M.; Slimani, M.; Ngounga, T.; Bekliz, M.; Colson, P.; Raoult, D.; et al. A decade of improvements in Mimiviridae and Marseilleviridae isolation from amoeba. Intervirology 2013, 56, 354–363.
  4. La Scola, B.; Desnues, C.; Pagnier, I.; Robert, C.; Barrassi, L.; Fournous, G.; Merchat, M.; Suzan-Monti, M.; Forterre, P.; Koonin, E.; Raoult, D. The virophage as a unique parasite of the giant mimivirus. Nature 2008, 455, 100–104.
  5. Arslan, D.; Legendre, M.; Seltzer, V.; Abergel, C.; Claverie, J.M. Distant Mimivirus relative with a larger genome highlights the fundamental features of Megaviridae. Proc. Natl. Acad. Sci. USA 2011, 108, 17486–17491.
  6. Yoosuf, N.; Yutin, N.; Colson, P.; Shabalina, S.A.; Pagnier, I.; Robert, C.; Azza, S.; Klose, T.; Wong, J.; Rossmann, M.G.; et al. Related giant viruses in distant locations and different habitats: Acanthamoeba polyphaga moumouvirus represents a third lineage of the Mimiviridae that is close to the megavirus lineage. Genome Biol. Evol. 2012, 4, 1324–1330.
  7. Fischer, M.G.; Allen, M.J.; Wilson, W.H.; Suttle, C.A. Giant virus with a remarkable complement of genes infects marine zooplankton. Proc. Natl. Acad. Sci. USA 2010, 107, 19508–19513.
  8. Campos, R.K.; Boratto, P.V.; Assis, F.L.; Aguiar, E.R.; Silva, L.C.; Albarnaz, J.D.; Dornas, F.P.; Trindade, G.S.; Ferreira, P.P.; Marques, J.T.; et al. Samba virus: A novel mimivirus from a giant rain forest, the Brazilian Amazon. Virol. J. 2014, 11.
  9. Boughalmi, M.; Saadi, H.; Pagnier, I.; Colson, P.; Fournous, G.; Raoult, D.; La Scola, B. High-throughput isolation of giant viruses of the Mimiviridae and Marseilleviridae families in the Tunisian environment. Environ. Microbiol. 2012, 15, 2000–2007.
  10. Boughalmi, M.; Pagnier, I.; Aherfi, S.; Colson, P.; Raoult, D.; La Scola, B. First Isolation of a Giant Virus from Wild Hirudo medicinalis Leech: Mimiviridae isolation in Hirudo medicinalis. Viruses 2013, 5, 2920–2930.
  11. Saadi, H.; Pagnier, I.; Colson, P.; Cherif, J.K.; Beji, M.; Boughalmi, M.; Azza, S.; Armstrong, N.; Robert, C.; Fournous, G.; et al. First isolation of Mimivirus in a patient with pneumonia. Clin. Infect. Dis. 2013, 57, e127–e134.
  12. Ghedin, E.; Claverie, J.M. Mimivirus relatives in the Sargasso sea. Virol. J. 2005, 2.
  13. Monier, A.; Larsen, J.B.; Sandaa, R.A.; Bratbak, G.; Claverie, J.M.; Ogata, H. Marine mimivirus relatives are probably large algal viruses. Virol. J. 2008, 5.
  14. Williamson, S.J.; Allen, L.Z.; Lorenzi, H.A.; Fadrosh, D.W.; Brami, D.; Thiagarajan, M.; McCrow, J.P.; Tovchigrechko, A.; Yooseph, S.; Venter, J.C. Metagenomic Exploration of Viruses throughout the Indian Ocean. PLoS ONE 2012, 7, e42047.
  15. Colson, P.; de Lamballerie, X.; Fournous, G.; Raoult, D. Reclassification of giant viruses composing a fourth domain of life in the new order Megavirales. Intervirology 2012, 55, 321–332.
  16. Yutin, N.; Colson, P.; Raoult, D.; Koonin, E.V. Mimiviridae: Clusters of orthologous genes, reconstruction of gene repertoire evolution and proposed expansion of the giant virus family. Virol. J. 2013, 10.
  17. Legendre, M.; Arslan, D.; Abergel, C.; Claverie, J.M. Genomics of Megavirus and the elusive fourth domain of Life. Commun. Integr. Biol. 2012, 5, 102–106.
  18. Boyer, M.; Madoui, M.A.; Gimenez, G.; La Scola, B.; Raoult, D. Phylogenetic and phyletic studies of informational genes in genomes highlight existence of a 4th domain of life including giant viruses. PLoS ONE 2010, 5, e15530.
  19. Colson, P.; La Scola, B.; Raoult, D. Giant viruses of amoebae as potential human pathogens. Intervirology 2013, 56, 376–385.
  20. Dornas, F.P.; Rodrigues, F.P.; Boratto, P.V.; Silva, L.C.; Ferreira, P.C.; Bonjardim, C.A.; Trindade, G.S.; Kroon, E.G.; La Scola, B.; et al. Mimivirus circulation among wild and domestic mammals, Amazon Region, Brazil. Emerg. Infect. Dis. 2014, 20, 469–472.
  21. Colson, P.; de Lamballerie, X.; Yutin, N.; Asgari, S.; Bigot, Y.; Bideshi, D.K.; Cheng, X.W.; Federici, B.A.; Van Etten, J.L.; Koonin, E.V.; et al. “Megavirales”, a proposed new order for eukaryotic nucleocytoplasmic large DNA viruses. Arch. Virol. 2013, 158, 2517–2521.
  22. Andrade, K.R.; Boratto, P.P.; Rodrigues, F.P.; Silva, L.C.; Dornas, F.P.; Pilotto, M.R.; La Scola, B.; Almeida, G.M.; Kroon, E.G.; Abrahao, J.S. Oysters as hot spots for mimivirus isolation. Arch. Virol. 2015, 160, 477–482.
  23. Simpson, J.T.; Wong, K.; Jackman, S.D.; Schein, J.E.; Jones, S.J.; Birol, I. ABySS: A parallel assembler for short read sequence data. Genome Res. 2009, 19, 1117–1123.
  24. Galardini, M.; Biondi, E.G.; Bazzicalupo, M.; Mengoni, A. CONTIGuator: A bacterial genomes finishing tool for structural insights on draft genomes. Source Code Biol. Med. 2011, 6, 11.
  25. CLCbio. Available online: http://www.clcbio.com/index.php?id=28 (accessed on 23 June 2015).
  26. Aziz, R.K.; Bartels, D.; Best, A.A.; DeJongh, M.; Disz, T.; Edwards, R.A.; Formsma, K.; Gerdes, S.; Glass, E.M.; Kubal, M.; et al. The RAST Server: Rapid annotations using subsystems technology. BMC Genomics 2008, 9.
  27. Besemer, J.; Borodovsky, M. GeneMark: Web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res. 2005, 33, W451–W454.
  28. Schattner, P.; Brooks, A.N.; Lowe, T.M. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005, 33, W686–W689.
  29. Conesa, A.; Gotz, S.; Garcia-Gomez, J.M.; Terol, J.; Talon, M.; Robles, M. Blast2GO: A universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 2005, 21, 3674–3676.
  30. Livak, K.J.; Schmittgen, T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 2001, 25, 402–408.
  31. Kurtz, S.; Phillippy, A.; Delcher, A.L.; Smoot, M.; Shumway, M.; Antonescu, C.; Salzberg, S.L. Versatile and open software for comparing large genomes. Genome Biol. 2004, 5.
  32. Darling, A.E.; Mau, B.; Perna, N.T. progressiveMauve: Multiple genome alignment with gene gain, loss and rearrangement. PLoS ONE 2010, 5, e11147.
  33. Lechner, M.; Findeiss, S.; Steiner, L.; Marz, M.; Stadler, P.F.; Prohaska, S.J. Proteinortho: Detection of (co-)orthologs in large-scale analysis. BMC Bioinformatics 2011, 12, 124.
  34. Li, L.; Stoeckert, C.J., Jr.; Roos, D.S. OrthoMCL: Identification of ortholog groups for eukaryotic genomes. Genome Res. 2003, 13, 2178–2189.
  35. Larkin, M.A.; Blackshields, G.; Brown, N.P.; Chenna, R.; McGettigan, P.A.; McWilliam, H.; Valentin, F.; Wallace, I.M.; Wilm, A.; Lopez, R.; et al. Clustal W and Clustal X version 2.0. Bioinformatics 2007, 23, 2947–2948.
  36. Tamura, K.; Peterson, D.; Peterson, N.; Stecher, G.; Nei, M.; Kumar, S. MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 2011, 28, 2731–2739.
  37. Colson, P.; Yutin, N.; Shabalina, S.A.; Robert, C.; Fournous, G.; La Scola, B.; Raoult, D.; Koonin, E.V. Viruses with More Than 1000 Genes: Mamavirus, a New Acanthamoeba polyphaga mimivirus Strain, and Reannotation of Mimivirus Genes. Genome Biol. Evol. 2011, 3, 737–742.
  38. Suhre, K. Gene and genome duplication in Acanthamoeba polyphaga Mimivirus. J. Virol. 2005, 79, 14095–14101.
  39. Boyer, M.; Gimenez, G.; Suzan-Monti, M.; Raoult, D. Classification and determination of possible origins of ORFans through analysis of nucleocytoplasmic large DNA viruses. Intervirology 2010, 53, 310–320.
  40. Yoosuf, N.; Pagnier, I.; Fournous, G.; Robert, C.; La Scola, B.; Raoult, D.; Colson, P. Complete genome sequence of Courdo11 virus, a member of the family Mimiviridae. Virus Genes 2013, 48, 218–223.
  41. Yoosuf, N.; Pagnier, I.; Fournous, G.; Robert, C.; Raoult, D.; La Scola, B.; Colson, P. Draft genome sequences of Terra1 and Terra2 viruses, new members of the family Mimiviridae isolated from soil. Virology 2014, 452–453, 125–132.
