Next Article in Journal
Bacteriophages as Fecal Pollution Indicators
Previous Article in Journal
Replication Kinetics of B.1.351 and B.1.1.7 SARS-CoV-2 Variants of Concern Including Assessment of a B.1.1.7 Mutant Carrying a Defective ORF7a Gene
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Circular Rep-Encoding Single-Stranded DNA Sequences in Milk from Water Buffaloes (Bubalus arnee f. bubalis)

1
Department of Veterinary Sciences, Institute of Food Safety, Faculty of Veterinary Medicine, Ludwig-Maximilians-Universität München, Schönleutnerstraße 8, 85764 Oberschleißheim, Germany
2
Department of Veterinary Sciences, Chair of Virology, Faculty of Veterinary Medicine, Ludwig-Maximilians-Universität München, Veterinärstraße 13, 80539 München, Germany
*
Author to whom correspondence should be addressed.
Viruses 2021, 13(6), 1088; https://doi.org/10.3390/v13061088
Submission received: 3 May 2021 / Revised: 26 May 2021 / Accepted: 3 June 2021 / Published: 7 June 2021
(This article belongs to the Section Animal Viruses)

Abstract

:
Isolation and characterization of circular replicase-encoding single-stranded (ss) DNA from animal, plant and environmental samples are rapidly evolving in virology. We detected 21 circular DNA elements, including one genomoviral sequence, in individual milk samples from domesticated Asian water buffaloes (Bubalus arnee f. bubalis). Most of the obtained genomes are related to Sphinx 1.76 and Sphinx 2.36 sequences and share a high degree of similarity to recently published circular DNAs—named BMMF (bovine meat and milk factors)—that have been isolated from commercial milk, as well as from bovine serum. Characteristic features such as rep genes, tandem repeats and inverted repeats were detected. These BMMF have recently been found to be present in taurine-type dairy cattle breeds descending from the aurochs (Bos primigenius). Importantly, the occurrence of BMMF has been linked to the higher incidence of colorectal and breast cancer in North America and Western Europe compared with Asia. This is the first report of circular ssDNA detected in milk from the domesticated form of the wild Asian water buffalo (B. arnee) belonging to the subfamily Bovinae. This novelty should be taken into account in view of the above-mentioned cancer hypothesis.

1. Introduction

Single-stranded (ss) DNA viruses can be detected throughout all domains of life and are currently classified into thirteen families according to the International Committee on Taxonomy of Viruses (ICTV) [1]. Reports on unclassified virus-like ssDNA have drastically increased in recent years [2,3]. This augmentation is mainly caused by the application of high-throughput metagenomics such as next-generation sequencing (NGS) [4]. Nevertheless, classical approaches comprising rolling circle amplification (RCA) [5] followed by PCR and Sanger sequencing still have their importance for sequence detection and characterization. Recently, these sequences have been assigned to the virus phylum Cressdnaviricota, where CRESS stands for “circular Rep-encoding single-stranded”, to describe functional features of these isolates, and viricota is the suffix for phylum taxa [6]. Besides their circular ssDNA composition, they depend on a replicase (Rep) for rolling circle replication (RCR) of their genomes. Encoded Reps exhibit a nicking/joining activity at the origin of replication (ori) of the ssDNAs. In addition to this endonuclease domain at the N-terminus, in some cases Reps include a superfamily 3 (SF3) helicase domain at their C-terminus, which helps to unwind the double-stranded DNA replicative intermediate [7,8]. In other cases, helicases encoded by the host are recruited during RCR. Sequence analysis of Reps on the genetic as well as on the protein level provides information about the evolutionary history and the phylogenetic relatedness of different CRESS isolates [9]. The presence of a replicase is not restricted to CRESS DNA viruses, with sometimes blurred borders between CRESS viruses and plasmid-/phage-derived sequences. A prominent example of this overlap is the co-purification of Sphinx 1.76 and Sphinx 2.36 from transmissible spongiform encephalopathy (TSE) particles [10]. The first isolate shows significant sequence homology to a plasmid isolated from Acinetobacter baumannii, while the latter one shows similarities to an Acinetobacter sp. phage [11]. The resemblance between these sequences and their common mechanism of replication led to the hypothesis that ssDNA viruses developed from bacterial plasmids via gene transfer with RNA viruses [12,13]. Recently, Sphinx 1.76-related and Sphinx 2.36-related circular DNAs have also been isolated from cattle milk, cattle serum and human brain affected by multiple sclerosis [14,15,16]. With the ongoing identification of these types of ssDNA from bovine samples, the term BMMF (bovine meat and milk factors) has been introduced, and these factors have been suggested to represent a specific class of infectious agents [17]. According to their sequence homologies to known organisms, the factors are divided into four groups. The first two groups consist of isolates resembling the sequences of Sphinx 1.76 (group 1) or Sphinx 2.36 (group 2). Group 3 contains isolates that have been characterized as novel gemycircularviruses, being one of nine genera within the family Genomoviridae. A single isolate with similarity to a Psychrobacter species plasmid forms group 4. The terms HCBI (healthy cattle blood isolate) and CMI (cow milk isolate) are utilized to refer to the origin of each isolate [11]. A common attribute on the DNA level of all group 1 isolates is the presence of an iteron-like tandem repeat, a neighbouring inverted repeat, and at least one large open reading frame (ORF)-encoding Rep. Some of the isolates encode a putative second ORF resembling a mobilization element [18]. In contrast to other animal-derived CRESS viruses, these circular DNAs lack the genetic information for a capsid protein.
From a disease-related point of view, BMMF and Sphinx-like sequences have entered the spotlight due to a potential connection with colorectal and breast cancer as well as neurodegenerative illnesses such as multiple sclerosis [19]. BMMF DNA and Rep proteins have been found in tissue of colon cancer patients and are proposed to cause chronic inflammation [20], thus linking the cancer hypothesis to the consumption of milk and meat from taurine cattle. These are descendants from the European aurochs and are potential carriers of BMMF [21]. In North America and Western Europe, cattle breeds are largely taurine in origin (Bos taurus taurus). In contrast, the Asian cattle population largely consists of cattle breeds derived from the indicine lineage (Bos taurus indicus) [22,23], yaks (Bos mutus) [24] and water buffaloes (Bubalus arnee) [25], which are not supposed to be carriers of BMMF [26]. Especially on the Indian subcontinent and in Mongolia, the above-mentioned cancer incidence is lower than in Western countries [21,27]. However, studies on samples of “Asian-type” cattle investigating the presence of circular replication competent ssDNA sequences are missing in the literature. For this reason, we initiated an in-depth analysis of individual milk samples obtained from Asian water buffaloes (B. arnee f. bubalis).

2. Methods

2.1. Sample Collection

In total, 30 individual milk samples (50 mL) were collected from two different buffalo flocks in Germany, one kept in Lower Saxony (A) and one in Baden-Württemberg (B). A total of 23 samples were obtained from herd A and 7 from herd B. Immediately after collection, the milk was cooled to 4 °C and cold-shipped to the Institute of Food Safety within 24 h where it was immediately processed. Samples from flock A were numbered 8 through 30 and from flock B 42 through 48. In the case of obtaining full-length circular DNA sequences, sample numbers were completed with the term BAMI, which refers to “Bubalus Arnee Milk Isolate” and a number indicating the sequence length.

2.2. DNA Extraction and Rolling Circle Amplification (RCA) of Circular ssDNA

The QIAamp DNA Mini Kit (Qiagen, Hilden, Germany) was applied to isolate total DNA from 200 μL of whole milk according to the manufacturer’s instructions. Approx. 40 ng of purified DNA was subjected to RCA using the TempliPhi Amplification Kit (Cytiva, Marlborough, MA, USA) with random primers according to the instruction manual of the supplier. Isothermal amplification was carried out at 30 °C for 18 h.

2.3. PCR Screening

RCA products were initially screened for the presence of rep-sequences with a set of 10 primer pairs designed via the rep genes of Sphinx 1.76 (GenBank Acc. No. HQ444404.1), Sphinx 2.36 (GenBank Acc. No. HQ444405.1) and related sequences obtained by Blast search. A similar approach was chosen with primers for Genomoviridae. The total volume of all PCR reactions was 50 μL applied on a TProfessional Gradient 96 Thermocycler (Biometra, Jena, Germany). ThermoPrime Plus Polymerase (ThermoScientific, Waltham, MA, USA) was used at 1.25 U/reaction. The dNTPs (ThermoScientific, Waltham, MA, USA) were added to a final concentration of 200 μM each. Detailed information on the primer sequences as well as the individual amplification programmes is summarized in Tables S1 and S2. After agarose-gel electrophoresis, bands were purified with the HighYield PCR Purification/Gel Extraction Kit (SLG, Südlaborbedarf, Gauting, Germany). Sequencing was performed by MWG Eurofins (Ebersberg, Germany). The sequences obtained from the PCR screening were aligned, and primer pairs matching homologous regions of the alignments were designed for inverse PCR in order to obtain full-length sequences of the circular DNA elements (for detailed information see Table S3). Additionally, abutting primers for amplification of BMMF2 sequences published by de Villiers et al. [18] and for BMMF1 sequences published by Whitley et al. [16] were applied.

2.4. Cloning of Inverse PCR Products

Amplification products obtained from inverse PCR were cloned into pCR2.1 TopoTA vector (Invitrogen, Carlsbad, CA, USA). Briefly, the gel-purified PCR product was mixed with the vector at a molar ratio of approx. 3:1 in a total volume of 6 μL and allowed to rest for 30 min at room temperature for ligation. Afterwards, chemically competent Escherichia coli DH5 alpha were transformed with the ligation reaction for 30 min on ice. After a heat shock at 42 °C for 1 min, 1 mL of LB medium was added to the reaction and cells were allowed to recover for 1 h at 37 °C with gentle agitation. Next, E. coli were centrifuged at 6100× g for 5 min. The pellet was suspended in 200 μL LB medium and plated on LB agar with ampicillin (100 μg/mL). After overnight incubation in LB medium with ampicillin, five clones from each plate were subjected to MiniPrep Plasmid isolation via the GenElute Plasmid Miniprep Kit (Sigma Aldrich, Taufkirchen, Germany). A control digest of 500 ng plasmid DNA with EcoRI (New England BioLabs, Ipswich, MA, USA) was performed, and insert-bearing clones were sequenced via the M13for and M13rev primer binding sites present in the vector.

2.5. Data Analysis

Sequences obtained from MWG Eurofins were subjected to BlastSearch against the nucleotide database at NCBI [28]. For multiple sequence alignments and cross-comparison, the DNASTAR Software (DNASTAR Inc., Madison, WI, USA, Version 15.0.0) was used [29]. Phylogenetic trees were calculated in MEGA software (version 5.03) by using the maximum likelihood method, and bootstrap values were computed with 500 replicates [30]. Furthermore, all sequences were checked at Emboss explorer for tandem repeats [31] and inverted repeats [32] upstream the rep gene sequence. Maximum size of tandem repeats was restricted to 30 nucleotides (nt), and the minimum size of inverted repeats was set to 4 nt. In silico detection of potential open reading frames was performed with the ORFfinder software at NCBI [33]. ATG or TTG as an alternative start codon was allowed. The minimum ORF size was restricted to 75 amino acids (aa). For subsequent comparative and functional analysis, only ORFs ≥ 95 aa were included. All full-length genomes and their annotations detected in this study were deposited in the GenBank database (Accession numbers MW828657–MW828677).

3. Results

3.1. PCR Screening and Detection of Circular Single-Stranded DNA Elements

By using RCA followed by several screening PCRs, 83% (19/23) of the individual milk samples from flock A and 100% (7/7) from flock B reacted positive in at least one PCR applied for BMMF and CRESS DNA detection. Inverse PCRs employing novel primers from this study resulted in retrieval of four circular DNA elements, all from flock A (8BAMI.2351, 11BAMI.2199, 12BAMI.2349, 21BAMI.2076). Application of abutting primers published by Whitley et al. [16] and de Villiers et al. [18] yielded 17 additional circular DNAs from samples of both flocks. Interestingly, five of the positive milk samples (8BAMI, 11BAMI, 12BAMI, 21BAMI, 22BAMI) contained two different circular DNA elements. Altogether, 21 unique complete circular ssDNA genomes were detected in the present study, one of them later assigned as gemycircularvirus.

3.2. Characterization of the Sequences on the DNA Level

The overall sequence length of the circular DNA elements ranged from 1972 to 2522 nt. After subjecting all DNAs to BlastSearch, the best two matches were summarized, as shown in Table S4. Nine of the 21 detected circular DNAs are nearly identical to the BMMF1 isolates HCBI6.252 and CMI1.252, showing a query coverage of 100% and more than 99% identity. Seven sequences are similar to C1MI GenBank entries with a query coverage of 71–98% and 77–97% identity, respectively. Four of the DNA elements revealed similarities to C2MI isolates, belonging to the BMMF2 group. One sequence from the present study was assigned to a gemycircularvirus detected in a giant panda with 92.94% identity in BlastSearch but 48% query coverage only and therefore belonged to the BMMF group 3. None of the herein detected isolates belongs to BMMF group 4.
Homologies among sequences from the present study were calculated by additional multiple sequence alignment. Sequences 8BAMI.2351, 12BAMI.2349, 21BAMI.2076 and 44BAMI.2371 are the most remote ssDNA elements showing a maximum of 41% identity to the other sequences, but 59–99% identity among each other. Some of the other circular DNAs (8BAMI.2522, 9BAMI.2522, 15BAMI.2522, 17BAMI.2522, 22BAMI.2521, 22BAMI.2522, 45BAMI.2521, 46BAMI.2522) share ≥ 99% identity despite originating from different flocks. These isolates differ in 6 to 14 nt positions only. A similar observation was made for isolates 11BAMI.2092, 12BAMI.2092, 13BAMI.2092, 21BAMI.2091 and 24BAMI.2092, which possess ≥ 99% identity as well.
Next, a phylogenetic distance tree was generated including Sphinx 1.76, Sphinx 2.36, representatives from the BMMF1 and 2 groups and from the family Genomoviridae as references (Figure 1). This approach allowed the affiliation of the novel sequences to the BMMF1 and 2 groups. Within the BMMF1 group, the isolates 11BAMI.2092, 12BAMI.2092, 13BAMI.2092, 21BAMI.2091 and 24BAMI.2092 form a cluster distinct from any of the sequences published by the group of zur Hausen so far. The Gemycircularvirus-like element detected in sample 11 is the only representative of the BMMF3 group according to the nomenclature of zur Hausen et al. [11].
Further analysis of the 21 circular sequences for typical replication-relevant ssDNA elements included: (i) search for tandem repeats (TR) [31], (ii) search for inverted repeats (IR)/stem loops as a potential origin of replication [32], (iii) search for replicase genes and (iv) search for additional putative ORFs [33].
Except for the circular DNA sequences 8BAMI.2351, 12BAMI.2349, 21BAMI.2076 and 44.BAMI.2371, all isolates exhibited typical tandem repeats that can also be found in related BMMF [11]. The DNA elements lacking a tandem repeat belonged either to the BMMF2 or BMMF3 group. Detailed information on the tandem repeat features is illustrated in Figure 2.
Furthermore, the circular DNAs were analysed for the presence of inverted repeats in proximity to the tandem repeat region, thus potentially able to form a double strand and to function as a replicase docking site. All BMMF1 sequences with TR motifs contain the same palindromic sequence TAAATGCTTTTA (the first and the last 4 nt build the inverted repeat region separated by a 4 nt gap), which is located 49 to 56 nt upstream of the TR region (Table 1) and has also been described by zur Hausen et al. [11].

3.3. In Silico Characterization of BMMF-like DNA Elements on the Protein Level

In addition to the analyses on the nucleotide level, we performed an in silico characterization in order to detect open reading frames (ORFs) [33] and the deduced proteins including a subsequent BlastSearch. Basic features of all circular DNA elements detected in this study are depicted in Figure 3. The only gemycircularvirus found herein is described in Section 3.4.
Each circular DNA element possesses an ORF coding for Rep. Depending on the settings in the ORFfinder tool (“ATG only” versus “ATG and alternative initiation codons”), results slightly differ. Therefore, two isolates with overlapping or nested rep-ORFs were found, starting with either ATG or TTG. In this case, respective protein sequences were named as follows: 8BAMI.2351.RepI and 8BAMI.2351.RepII. Results of BlastSearch analysis are included in Table S5.
The majority (13) of the 22 putative Reps found in this study showed the best Blast hits for BMMF-related replicases. The remaining nine Reps scored closer to Acinetobacter-related replicases or those from even more distant organisms. Remarkably, the rep translation products of 12BAMI.2349 and 18BAMI.2184 are shorter (up to 150 aa) than their best matches after BlastSearch (Table S5). Phylogenetic analysis of the Rep proteins revealed that they belong to the established BMMF groups (Figure 4). Again, the isolates 11BAMI.2092, 12BAMI.2092, 13BAMI.2092, 21BAMI.2091 and 24BAMI.2092 build their own cluster.
All DNA elements were analysed for the potential occurrence of further proteins. Although the software tool detected up to 22 putative ORFs per DNA sequence, the length of the potential proteins was restricted to a minimum of 95 aa to be included in the Results section of the manuscript. For additional putative proteins, the same approach as for the Reps was carried out. Thirteen putative proteins found in the novel sequences were identical or nearly identical to uncharacterized proteins from BMMF1 or BMMF2 isolates. Detailed results of the BlastSearch are summarized in Table S6. 12BAMI.2092 and 25BAMI.1972 do not code for an additional, hypothetical protein. A phylogenetic tree based on the putative protein sequences is given in Figure S1.

3.4. In Silico Characterization of a Novel Gemycircularvirus on the Protein Level

The only CRESS DNA virus isolated herein was recovered from a buffalo milk sample also containing one more circular DNA element belonging to the BMMF1 group. After in silico analyses, this new CRESS DNA virus was named 11BAMI.2199 (GenBank accession number MW828677). Typical characteristics of a gemycircularvirus could be identified. Besides a stem-loop structure at the origin of replication, it includes a capsid (cap) gene on the sense strand and a rep gene on the antisense strand. The putative stem-loop structure in the intergenic region between rep and cap features a nonanucleotide motif (TAATATTAT) that is present at the loop-tip and therefore a further characteristic of Genomoviridae [37,38,39] (Figure 5). Additionally, the isolate comprises two putative proteins, one in sense, the other one in antisense direction.
BlastSearches of the proteins encoded by this virus revealed 60–93% identity to giant panda-associated and plant-associated proteins, inter alia (GenBank entries: QCX29436.1, ASH99166.1, ASH99165.1).
The typical replication mechanism for all CRESS DNA viruses is the rolling circle replication initiated by the Rep. 11BAMI.2199 encodes a Rep that contains well-conserved RCR motifs (Figure 6). Motif I is supposed to have a sequence recognition function [38]. Motif II is characteristic of HUH endonucleases [40]. Motif III is possibly involved in dsDNA nicking [38,41]. The GRS motif has initially been described in geminiviruses by Nash et al. [42].

4. Discussion

BMMF initially described by zur Hausen et al. [11,17] are suspected to correlate with colon and breast cancer incidences as well as neurodegenerative diseases in the “Western” world. BMMF DNA and Rep proteins have been detected in colorectal cancer peritumour and tumour tissues of human patients by immunohistochemistry and Western blot [20]. Thereby and by the means of transfection assays, transcriptional activity of BMMF can be assumed [43]. Lower cancer incidences in countries traditionally breeding cattle evolved from other bovine lineages such as water buffaloes, yaks and zebus suggest that circular ssDNA elements in milk from taurine cattle might be one of several food-borne risk factors that contribute to the aforementioned diseases.
The present study was initiated to screen milk samples from non-taurine cattle for the presence of BMMF and CRESS viruses. Although screening PCR results suggested that 86% (26/30) of the water buffalo milk samples were positive, circular full-length sequences could only be recovered from 53% (16/30). This discrepancy might be caused by the formation of linear by-products, which occur to some extent during RCA. Thus, a PCR screening with primers, amplifying solely subgenomic fragments, might not be a suitable marker for the potential presence of full-length genomes. Of the 21 circular ssDNA elements detected and characterized herein, 20 belong to the BMMF groups 1 and 2. Inverse PCR additionally revealed one sequence belonging to the genus Gemycircularvirus (BMMF3). The further in silico analysis of genomes described herein revealed that two sequences (12BAMI.2349 and 18BAMI.2184) contained Reps substantially reduced in length (Table S5). Their replication capacity might thus be questionable due to a potentially dysfunctional Rep. On the other hand, five of the 21 positive samples harbour two ssDNA elements. Therefore, one must also consider that a functional Rep from one element might be able to complement a dysfunctional Rep from a second sequence, thus enabling RCR of both. However, to answer this question, the functional activity and replication behaviour of these DNA elements will have to be tested in transfection assays, as described by Eilebrecht et al. [43].
From a nutritional point of view, water buffalo milk is of great importance in some Asian countries. Although the multipurpose domestic water buffalo is widely distributed all over the world [44], the largest number of animals kept for dairy production is recorded in India, contributing to approx. 68% of the world’s buffalo milk production [45]. Buffalo milk production in India exceeded 86 million tonnes in 2017–2018; at 49%, this accounts for the largest part of total Indian milk production, compared with cow’s milk at only 47%. The per capita availability of milk adds up to 337 g per day, and the domestic demand for milk and milk products is steadily increasing [46]. From the centre of domestication in the Indian subcontinent, domestic water buffaloes spread westward to Egypt, the Balkans and Italy around the 7th century [25]. Nowadays, significant flocks in Europe are mainly kept in Italy, Romania and Turkey, but they represent only a minor proportion of total livestock [47,48].
Apart from one study in Switzerland that found one gemycircularvirus in the plasma of water buffaloes [49], reports on “Asian-type” cattle harbouring CRESS viruses or even BMMF are scarce. Either the cattle under study descend from the taurine lineage [50,51] or authors do not provide substantial information on the breeds [52,53,54]. None of the previous studies address the occurrence of BMMF in “Asian-type” cattle such as water buffaloes. Hence, this is the first profound survey investigating the appearance of circular ssDNA, particularly that of BMMF.
Although data from the present study underpin that BMMF are not restricted to taurine cattle, one has to keep in mind that water buffaloes from “Western” flocks might have acquired these DNA elements due to their coexistence with taurine cattle by yet unknown routes of transmission. As water buffaloes in Germany typically have access to pasture, the transmission from neighbouring, taurine cattle via vectors such as black flies, for example, cannot be excluded. To clarify this issue, future studies on samples from water buffaloes domiciled in Asia are needed.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/v13061088/s1, Figure S1: Maximum likelihood phylogenetic tree of novel hypothetical protein sequences and uncharacterized BMMF1 and 2 proteins; Table S1: Information on the primer sets applied in the PCR screening for the amplification of Sphinx-like sequences. Table S2: Information on the primer sets applied in the PCR screening for the amplification of Genomoviridae family members. Table S3: Information on the primer sets applied in the inverse PCR for amplification of full-length DNA. Table S4: Summary of the BLASTn results for the individual isolates. Table S5: Summary of the BLASTp results of the in silico generated Reps. Table S6: Summary of the BLASTp results of the in silico generated additional proteins.

Author Contributions

Conceptualization—E.M. and G.S.; Methodology—E.L. and R.F.; Investigation and Visualization—M.-T.K.; Validation and Writing—A.D. All authors have read and agreed to the published version of the manuscript.

Funding

This project was supported by funds from the Federal Ministry of Food and Agriculture (BMEL) based on a decision by the Parliament of the Federal Republic of Germany via the Federal Office for Agriculture and Food (BLE) under the innovation support programme (funding reference: 2819110217).

Institutional Review Board Statement

Ethical review and approval was waived for this study because milk was collected during the daily milking routine on the farm.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. International Committee on Taxonomy of Viruses (ICTV). Available online: https://talk.ictvonline.org (accessed on 28 October 2020).
  2. Rosario, K.; Duffy, S.; Breitbart, M. A field guide to eukaryotic circular single-stranded DNA viruses: Insights gained from metagenomics. Arch. Virol. 2012, 157, 1851–1871. [Google Scholar] [CrossRef] [PubMed]
  3. Tisza, M.J.; Pastrana, D.V.; Welch, N.L.; Stewart, B.; Peretti, A.; Starrett, G.J.; Pang, Y.S.; Krishnamurthy, S.R.; Pesavento, P.A.; McDermott, D.H.; et al. Discovery of several thousand highly diverse circular DNA viruses. Elife 2020, 9. [Google Scholar] [CrossRef]
  4. Dayaram, A.; Galatowitsch, M.L.; Argüello-Astorga, G.R.; Van Bysterveldt, K.; Kraberger, S.; Stainton, D.; Harding, J.S.; Roumagnac, P.; Martin, D.P.; Lefeuvre, P.; et al. Diverse circular replication-associated protein encoding viruses circulating in invertebrates within a lake ecosystem. Infect. Genet. Evol. 2016, 39, 304–316. [Google Scholar] [CrossRef] [PubMed]
  5. Johne, R.; Müller, H.; Rector, A.; van Ranst, M.; Stevens, H. Rolling-circle amplification of viral DNA genomes using phi29 polymerase. Trends Microbiol. 2009, 17, 205–211. [Google Scholar] [CrossRef] [PubMed]
  6. Krupovic, M.; Varsani, A.; Kazlauskas, D.; Breitbart, M.; Delwart, E.; Rosario, K.; Yutin, N.; Wolf, Y.I.; Harrach, B.; Zerbini, F.M.; et al. Cressdnaviricota: A Virus Phylum Unifying Seven Families of Rep-Encoding Viruses with Single-Stranded, Circular DNA Genomes. J. Virol. 2020, 94, 12. [Google Scholar] [CrossRef]
  7. Kazlauskas, D.; Varsani, A.; Koonin, E.V.; Krupovic, M. Multiple origins of prokaryotic and eukaryotic single-stranded DNA viruses from bacterial and archaeal plasmids. Nat. Commun. 2019, 10, 3425. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Zhao, L.; Rosario, K.; Breitbart, M.; Duffy, S. Eukaryotic Circular Rep-Encoding Single-Stranded DNA (CRESS DNA) Viruses: Ubiquitous Viruses With Small Genomes and a Diverse Host Range. Adv. Virus Res. 2019, 103, 71–133. [Google Scholar] [CrossRef]
  9. Rosario, K.; Mettel, K.A.; Benner, B.E.; Johnson, R.; Scott, C.; Yusseff-Vanegas, S.Z.; Baker, C.C.M.; Cassill, D.L.; Storer, C.; Varsani, A.; et al. Virus discovery in all three major lineages of terrestrial arthropods highlights the diversity of single-stranded DNA viruses associated with invertebrates. PeerJ 2018, 6, e5761. [Google Scholar] [CrossRef] [PubMed]
  10. Manuelidis, L. Nuclease resistant circular DNAs copurify with infectivity in scrapie and CJD. J. Neurovirol. 2011, 17, 131–145. [Google Scholar] [CrossRef] [PubMed]
  11. Hausen, H.Z.; Bund, T.; de Villiers, E.-M. Infectious Agents in Bovine Red Meat and Milk and Their Potential Role in Cancer and Other Chronic Diseases. In Viruses, Genes, and Cancer; Hunter, E., Bister, K., Eds.; Springer International Publishing: Cham, Switzerland, 2017; Volume 407, pp. 83–116. ISBN 978-3-319-61803-6. [Google Scholar]
  12. Krupovic, M. Networks of evolutionary interactions underlying the polyphyletic origin of ssDNA viruses. Curr. Opin. Virol. 2013, 3, 578–586. [Google Scholar] [CrossRef]
  13. Koonin, E.V.; Dolja, V.V.; Krupovic, M. Origins and evolution of viruses of eukaryotes: The ultimate modularity. Virology 2015, 479, 2–25. [Google Scholar] [CrossRef] [Green Version]
  14. Funk, M.; Gunst, K.; Lucansky, V.; Muller, H.; Hausen, H.Z.; De Villiers, E.-M. Isolation of Protein-Associated Circular DNA from Healthy Cattle Serum. Genome Announc. 2014, 2, e00846-14. [Google Scholar] [CrossRef] [Green Version]
  15. Lamberto, I.; Gunst, K.; Müller, H.; Hausen, H.Z.; De Villiers, E.-M. Mycovirus-Like DNA Virus Sequences from Cattle Serum and Human Brain and Serum Samples from Multiple Sclerosis Patients. Genome Announc. 2014, 2, e00848-14. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Whitley, C.; Gunst, K.; Müller, H.; Funk, M.; Hausen, H.Z.; De Villiers, E.-M. Novel Replication-Competent Circular DNA Molecules from Healthy Cattle Serum and Milk and Multiple Sclerosis-Affected Human Brain Tissue. Genome Announc. 2014, 2, e00849-14. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Hausen, H.Z.; Bund, T.; De Villiers, E. Specific nutritional infections early in life as risk factors for human colon and breast cancers several decades later. Int. J. Cancer 2018, 144, 1574–1583. [Google Scholar] [CrossRef] [PubMed]
  18. De Villiers, E.-M.; Gunst, K.; Chakraborty, D.; Ernst, C.; Bund, T.; Hausen, H.Z. A specific class of infectious agents isolated from bovine serum and dairy products and peritumoral colon cancer tissue. Emerg. Microbes Infect. 2019, 8, 1205–1218. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Hausen, H.Z. What do breast and CRC cancers and MS have in common? Nat. Rev. Clin. Oncol. 2015, 12, 569–570. [Google Scholar] [CrossRef] [PubMed]
  20. Bund, T.; Nikitina, E.; Chakraborty, D.; Ernst, C.; Gunst, K.; Boneva, B.; Tessmer, C.; Volk, N.; Brobeil, A.; Weber, A.; et al. Analysis of chronic inflammatory lesions of the colon for BMMF Rep antigen expression and CD68 macrophage interactions. Proc. Natl. Acad. Sci. USA 2021, 118, e2025830118. [Google Scholar] [CrossRef] [PubMed]
  21. Hausen, H.Z.; De Villiers, E.-M. Dairy cattle serum and milk factors contributing to the risk of colon and breast cancers. Int. J. Cancer 2015, 137, 959–967. [Google Scholar] [CrossRef] [PubMed]
  22. Decker, J.E.; McKay, S.; Rolf, M.; Kim, J.; Alcalá, A.M.; Sonstegard, T.S.; Hanotte, O.; Gotherstrom, A.; Seabury, C.M.; Praharani, L.; et al. Worldwide Patterns of Ancestry, Divergence, and Admixture in Domesticated Cattle. PLoS Genet. 2014, 10, e1004254. [Google Scholar] [CrossRef] [Green Version]
  23. Amer, P.R.; Kemp, R.A.; Smith, C. Genetic differences among the predominant beef cattle breeds in Canada: An analysis of published results. Can. J. Anim. Sci. 1992, 72, 759–771. [Google Scholar] [CrossRef]
  24. Qi, X.B.; Jianlin, H.; Wang, G.; Rege, J.E.O.; Hanotte, O. Assessment of cattle genetic introgression into domestic yak populations using mitochondrial and microsatellite DNA markers. Anim. Genet. 2010, 41, 242–252. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Zhang, Y.; Colli, L.; Barker, J.S.F. Asian water buffalo: Domestication, history and genetics. Anim. Genet. 2020, 51, 177–191. [Google Scholar] [CrossRef] [PubMed]
  26. Hausen, H.Z.; De Villiers, E.-M. Risikofaktoren für Krebs und MS in Kuhmilch und Rindfleisch? Biol. Unserer Zeit 2016, 46, 26–31. [Google Scholar] [CrossRef]
  27. Ferlay, J.; Soerjomataram, I.; Dikshit, R.; Eser, S.; Mathers, C.; Rebelo, M.; Parkin, D.M.; Forman, D.; Bray, F. Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012. Int. J. Cancer 2015, 136, E359–E386. [Google Scholar] [CrossRef] [PubMed]
  28. Johnson, M.; Zaretskaya, I.; Raytselis, Y.; Merezhuk, Y.; McGinnis, S.; Madden, T.L. NCBI BLAST: A better web interface. Nucleic Acids Res. 2008, 36, W5–W9. [Google Scholar] [CrossRef]
  29. Burland, T.G. DNASTAR’s Lasergene sequence analysis software. Methods Mol. Biol. 2000, 132, 71–91. [Google Scholar] [CrossRef]
  30. Tamura, K.; Peterson, D.; Peterson, N.; Stecher, G.; Nei, M.; Kumar, S. MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 2011, 28, 2731–2739. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  31. The European Molecular Biology Open Software Suite (EMBOSS) A. Available online: http://emboss.bioinformatics.nl/cgi-bin/emboss/equicktandem (accessed on 26 October 2020).
  32. The European Molecular Biology Open Software Suite (EMBOSS) B. Available online: https://www.bioinformatics.nl/cgi-bin/emboss/palindrome (accessed on 26 October 2020).
  33. National Center for Biotechnology Information (NCBI). ORFfinder. Available online: https://www.ncbi.nlm.nih.gov/orffinder/ (accessed on 26 October 2020).
  34. Tamura, K.; Nei, M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol. Biol. Evol. 1993, 10, 512–526. [Google Scholar] [CrossRef] [PubMed]
  35. Felsenstein, J. Confidence limits on phylogenies: An approach using the bootstrap. Evolution 1985, 39, 783–791. [Google Scholar] [CrossRef]
  36. Jones, D.T.; Taylor, W.R.; Thornton, J.M. The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci. 1992, 8, 275–282. [Google Scholar] [CrossRef] [PubMed]
  37. Krupovic, M.; Ghabrial, S.A.; Jiang, D.; Varsani, A. Genomoviridae: A new family of widespread single-stranded DNA viruses. Arch. Virol. 2016, 161, 2633–2643. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Varsani, A.; Krupovic, M. Sequence-based taxonomic framework for the classification of uncultured single-stranded DNA viruses of the family Genomoviridae. Virus Evol. 2017, 3, vew037. [Google Scholar] [CrossRef]
  39. Heyraud-Nitschke, F.; Schumacher, S.; Laufs, J.; Schaefer, S.; Schell, J.; Gronenborn, B. Determination of the origin cleavage and joining domain of geminivirus Rep proteins. Nucleic Acids Res. 1995, 23, 910–916. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. Chandler, M.; de la Cruz, F.; Dyda, F.; Hickman, A.B.; Moncalian, G.; Ton-Hoang, B. Breaking and joining single-stranded DNA: The HUH endonuclease superfamily. Nat. Rev. Microbiol. 2013, 11, 525–538. [Google Scholar] [CrossRef]
  41. Ilyina, T.V.; Koonin, E.V. Conserved sequence motifs in the initiator proteins for rolling circle DNA replication encoded by diverse replicons from eubacteria, eucaryotes and archaebacteria. Nucleic. Acids. Res. 1992, 20, 3279–3285. [Google Scholar] [CrossRef] [PubMed]
  42. Nash, T.E.; Dallas, M.B.; Reyes, M.I.; Buhrman, G.K.; Ascencio-Ibañez, J.T.; Hanley-Bowdoin, L. Functional Analysis of a Novel Motif Conserved across Geminivirus Rep Proteins. J. Virol. 2011, 85, 1182–1192. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Eilebrecht, S.; Hotz-Wagenblatt, A.; Sarachaga, V.; Burk, A.; Falida, K.; Chakraborty, D.; Nikitina, E.; Tessmer, C.; Whitley, C.; Sauerland, C.; et al. Expression and replication of virus-like circular DNA in human cells. Sci. Rep. 2018, 8, 2851. [Google Scholar] [CrossRef] [Green Version]
  44. Borghese, A. Situation and perspectives of buffalo in the world, Europe and Macedonia. Maced. J. Anim. Sci. 2011, 1, 281–296. [Google Scholar]
  45. Habib, M.R.; Haque, M.N.; Rahman, A.; Aftabuzzaman, M.; Ali, M.M.; Shahjahan, M. Dairy buffalo production scenario in Bangladesh: A review. Asian J. Med. Biol. Res. 2017, 3, 305–316. [Google Scholar] [CrossRef] [Green Version]
  46. Government of India. Department of Animal Husbandry and Dairying. Annual Report (2018–2019). Available online: https://dahd.nic.in/documents/reports (accessed on 13 November 2020).
  47. Borghese, A.; Moioli, B. Buffalo: Mediterranean Region. In Reference Module in Food Science; Elsevier: Amsterdam, The Netherlands, 2016. [Google Scholar] [CrossRef]
  48. Borghese, A.; Mazzi, M.; Moiolo, B.; Barile, V.L.; Terzano, G.M.; Terramoccia, S.; Bartocci, S.; Puppo, S.; Tripaldi, C.; Allegrini, S.; et al. Buffalo Production and Research; Food and Agriculture Organization of the United Nations (FAO): Rome, Italy, 2005; ISSN 1020-3737. [Google Scholar]
  49. Lechmann, J.V. A Pilot Study Contributing to the Virome of Water Buffaloes in Switzerland. Ph.D. Thesis, Universität Zürich, Zürich, Switzerland, 2017. [Google Scholar]
  50. Guo, Z.; He, Q.; Tang, C.; Zhang, B.; Yue, H. Identification and genomic characterization of a novel CRESS DNA virus from a calf with severe hemorrhagic enteritis in China. Virus Res. 2018, 255, 141–146. [Google Scholar] [CrossRef] [PubMed]
  51. Ling, Y.; Zhang, X.; Qi, G.; Yang, S.; Jingjiao, L.; Shen, Q.; Wang, X.; Cui, L.; Hua, X.; Deng, X.; et al. Viral metagenomics reveals significant viruses in the genital tract of apparently healthy dairy cows. Arch. Virol. 2019, 164, 1059–1067. [Google Scholar] [CrossRef] [PubMed]
  52. Wang, H.; Li, S.; Mahmood, A.; Yang, S.; Wang, X.; Shen, Q.; Shan, T.; Deng, X.; Li, J.; Hua, X.; et al. Plasma virome of cattle from forest region revealed diverse small circular ssDNA viral genomes. Virol. J. 2018, 15, 11. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Wen, L.; Mao, A.; Fan, Z.; Li, W.; Xiao, Q.; Liu, Q.; Xie, J.; He, K. Porcine circovirus-like virus P1 in cattle, goats and rabbits in China. Transbound. Emerg. Dis. 2018, 65, e217–e218. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Kim, H.K.; Park, S.J.; Nguyen, V.G.; Song, D.S.; Moon, H.J.; Kang, B.K.; Park, B.K. Identification of a novel single-stranded, circular DNA virus from bovine stool. J. Gen. Virol. 2012, 93, 635–639. [Google Scholar] [CrossRef]
Figure 1. Results of the molecular phylogenetic analysis including novel full-length nucleotide sequences and representatives of the BMMF groups 1 to 3. The latter also includes members of the family Genomoviridae. The evolutionary history was inferred by using the maximum likelihood method and a JTT matrix-based model [34]. The bootstrap consensus tree was calculated from 500 replicates. Bootstrap values at the branch points indicate the percentage of replicate trees in which respective isolates clustered [35]. Branch support values lower than 70% were not included. Blue dots mark all sequences found in this study. The scale bar represents the number of substitutions per site. All taxa are indicated by name followed by the corresponding GenBank accession number.
Figure 1. Results of the molecular phylogenetic analysis including novel full-length nucleotide sequences and representatives of the BMMF groups 1 to 3. The latter also includes members of the family Genomoviridae. The evolutionary history was inferred by using the maximum likelihood method and a JTT matrix-based model [34]. The bootstrap consensus tree was calculated from 500 replicates. Bootstrap values at the branch points indicate the percentage of replicate trees in which respective isolates clustered [35]. Branch support values lower than 70% were not included. Blue dots mark all sequences found in this study. The scale bar represents the number of substitutions per site. All taxa are indicated by name followed by the corresponding GenBank accession number.
Viruses 13 01088 g001
Figure 2. Clustal alignment of the three different TR motifs identified. Each of the upper two boxes comprises isolates with identical TR sequences, except for one nucleotide substitution in 14BAMI.2522. The lower box shows one further TR represented by a single isolate only. Vertical black lines indicate the end of a single repeat. The TR size is 22 nt, and the copy number ranges from 3.5 to 3.9.
Figure 2. Clustal alignment of the three different TR motifs identified. Each of the upper two boxes comprises isolates with identical TR sequences, except for one nucleotide substitution in 14BAMI.2522. The lower box shows one further TR represented by a single isolate only. Vertical black lines indicate the end of a single repeat. The TR size is 22 nt, and the copy number ranges from 3.5 to 3.9.
Viruses 13 01088 g002
Figure 3. Schematic illustration of BMMF-like DNA elements recovered from buffalo milk. Red arrows indicate rep genes starting with ATG; orange arrows show rep genes starting with TTG. The light blue arrows represent further potential ORFs. According to the results of the in silico analysis, circular DNA elements were assigned to three groups (AC). Group (A) contains nine related isolates, which could be assigned to the GenBank entries CMI1.252 and HCBI6.252; group (B) comprises seven elements, which exhibit the greatest homologies to other BMMF1 isolates; group (C) includes four circular DNAs belonging to the BMMF2 group.
Figure 3. Schematic illustration of BMMF-like DNA elements recovered from buffalo milk. Red arrows indicate rep genes starting with ATG; orange arrows show rep genes starting with TTG. The light blue arrows represent further potential ORFs. According to the results of the in silico analysis, circular DNA elements were assigned to three groups (AC). Group (A) contains nine related isolates, which could be assigned to the GenBank entries CMI1.252 and HCBI6.252; group (B) comprises seven elements, which exhibit the greatest homologies to other BMMF1 isolates; group (C) includes four circular DNAs belonging to the BMMF2 group.
Viruses 13 01088 g003
Figure 4. Maximum likelihood phylogenetic tree of Rep amino acid sequences and representative Reps from BMMF1 and 2 isolates calculated as described in Figure 1 [36]. The bootstrap consensus tree is based on 500 replicates. Bootstrap values at the branch points indicate the percentage of replicate trees in which respective isolates clustered [35]. Branch support values lower than 70% were not included. Blue dots mark all sequences found in this study. The scale bar represents the number of substitutions per site. All taxa are indicated by name followed by the corresponding GenBank accession number.
Figure 4. Maximum likelihood phylogenetic tree of Rep amino acid sequences and representative Reps from BMMF1 and 2 isolates calculated as described in Figure 1 [36]. The bootstrap consensus tree is based on 500 replicates. Bootstrap values at the branch points indicate the percentage of replicate trees in which respective isolates clustered [35]. Branch support values lower than 70% were not included. Blue dots mark all sequences found in this study. The scale bar represents the number of substitutions per site. All taxa are indicated by name followed by the corresponding GenBank accession number.
Viruses 13 01088 g004
Figure 5. (A) Schematic map of the gemycircularvirus 11BAMI.2199. The in silico generated genes encoding Rep and Cap are shown as red and green arrows, respectively. Further putative ORFs are shown as light blue arrows. The nonanucleotide motif (TAATATTAT) within the potential stem-loop structure at position 1 is depicted at the top. (B) Enlarged view of the stem-loop structure between rep and cap as a potential origin of replication as calculated by the ProbKnot server at the University of Rochester. The nucleic acid type was set to DNA (http://rna.urmc.rochester.edu/RNAstructureWeb/Servers/ProbKnot/ProbKnot.html; accessed on 17 March 2021).
Figure 5. (A) Schematic map of the gemycircularvirus 11BAMI.2199. The in silico generated genes encoding Rep and Cap are shown as red and green arrows, respectively. Further putative ORFs are shown as light blue arrows. The nonanucleotide motif (TAATATTAT) within the potential stem-loop structure at position 1 is depicted at the top. (B) Enlarged view of the stem-loop structure between rep and cap as a potential origin of replication as calculated by the ProbKnot server at the University of Rochester. The nucleic acid type was set to DNA (http://rna.urmc.rochester.edu/RNAstructureWeb/Servers/ProbKnot/ProbKnot.html; accessed on 17 March 2021).
Viruses 13 01088 g005
Figure 6. Conserved motifs in the Rep of the Gemycircularvirus found in this study in accordance with data by Varsani et al. [38].
Figure 6. Conserved motifs in the Rep of the Gemycircularvirus found in this study in accordance with data by Varsani et al. [38].
Viruses 13 01088 g006
Table 1. Inverted repeat motif as a potential origin of replication. Sequences were detected by EMBOSS inverted repeat finder and additionally compared with the sequences described by zur Hausen et al. [11].
Table 1. Inverted repeat motif as a potential origin of replication. Sequences were detected by EMBOSS inverted repeat finder and additionally compared with the sequences described by zur Hausen et al. [11].
IsolateSizeInverted Repeatnt between
IR and TR
nt between
IR and rep
8BAMI.252212TAAATGCTTTTA50193
9BAMI.25221250193
11BAMI.20921256160
12BAMI.20921256160
13BAMI.20921256160
14BAMI.25221250193
15BAMI.25221250193
17BAMI.25221250193
18BAMI.21841249133
21BAMI.20911256192
22BAMI.25221250193
22BAMI.25211250193
24BAMI.20921256160
25BAMI.19721256160
45BAMI.25211250193
46BAMI.25221250193
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

König, M.-T.; Fux, R.; Link, E.; Sutter, G.; Märtlbauer, E.; Didier, A. Circular Rep-Encoding Single-Stranded DNA Sequences in Milk from Water Buffaloes (Bubalus arnee f. bubalis). Viruses 2021, 13, 1088. https://doi.org/10.3390/v13061088

AMA Style

König M-T, Fux R, Link E, Sutter G, Märtlbauer E, Didier A. Circular Rep-Encoding Single-Stranded DNA Sequences in Milk from Water Buffaloes (Bubalus arnee f. bubalis). Viruses. 2021; 13(6):1088. https://doi.org/10.3390/v13061088

Chicago/Turabian Style

König, Marie-T., Robert Fux, Ellen Link, Gerd Sutter, Erwin Märtlbauer, and Andrea Didier. 2021. "Circular Rep-Encoding Single-Stranded DNA Sequences in Milk from Water Buffaloes (Bubalus arnee f. bubalis)" Viruses 13, no. 6: 1088. https://doi.org/10.3390/v13061088

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop