Next Article in Journal
Gene Polymorphisms of Epithelial Cell-Derived Alarmins and Their Effects on Protein Levels and Disease Severity in Patients with COVID-19
Previous Article in Journal
Oxalate (dys)Metabolism: Person-to-Person Variability, Kidney and Cardiometabolic Toxicity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genomic Instability of G-Quadruplex Sequences in Escherichia coli: Roles of DinG, RecG, and RecQ Helicases

by
Virali J. Parekh
1,
Grzegorz Węgrzyn
2,
Véronique Arluison
3,4,* and
Richard R. Sinden
1,*
1
Laboratory of DNA Structure and Mutagenesis, Department of Chemistry, Biology and Health Sciences, South Dakota School of Mines and Technology, Rapid City, SD 57701, USA
2
Department of Molecular Biology, University of Gdansk, Wita Stwosza 59, 80-308 Gdansk, Poland
3
Laboratoire Léon Brillouin LLB, CEA, CNRS UMR12, CEA Saclay, 91191 Gif-sur-Yvette, France
4
UFR Sciences du Vivant, Université Paris Cité, 75006 Paris, France
*
Authors to whom correspondence should be addressed.
Genes 2023, 14(9), 1720; https://doi.org/10.3390/genes14091720
Submission received: 4 August 2023 / Revised: 23 August 2023 / Accepted: 26 August 2023 / Published: 29 August 2023
(This article belongs to the Section Microbial Genetics and Genomics)

Abstract

:
Guanine-rich DNA can fold into highly stable four-stranded DNA structures called G-quadruplexes (G4). Originally identified in sequences from telomeres and oncogene promoters, they can alter DNA metabolism. Indeed, G4-forming sequences represent obstacles for the DNA polymerase, with important consequences for cell life as they may lead to genomic instability. To understand their role in bacterial genomic instability, different G-quadruplex-forming repeats were cloned into an Escherichia coli genetic system that reports frameshifts and complete or partial deletions of the repeat when the G-tract comprises either the leading or lagging template strand during replication. These repeats formed stable G-quadruplexes in single-stranded DNA but not naturally supercoiled double-stranded DNA. Nevertheless, transcription promoted G-quadruplex formation in the resulting R-loop for (G3T)4 and (G3T)8 repeats. Depending on genetic background and sequence propensity for structure formation, mutation rates varied by five orders of magnitude. Furthermore, while in vitro approaches have shown that bacterial helicases can resolve G4, it is still unclear whether G4 unwinding is important in vivo. Here, we show that a mutation in recG decreased mutation rates, while deficiencies in the structure-specific helicases DinG and RecQ increased mutation rates. These results suggest that G-quadruplex formation promotes genetic instability in bacteria and that helicases play an important role in controlling this process in vivo.

1. Introduction

Repeated DNA motifs are common in genomes and can adopt noncanonical B-form conformations. These include hairpins, cruciform, H-DNA, or triplex structures [1,2,3,4]. Among these conformations, a guanine quadruplex (G4) can form a planar assembly of G’s through Hoogsteen hydrogen bonds [5]. Successive G-planes stack on top of each other and form a structure that may be stabilized by the presence of a cation [6]. G4 usually forms from four tracts of at least three guanines [7,8,9]. The topology of these stacking quadruplexes is highly variable, and strands can be arranged in a parallel, antiparallel, or mixed orientation [10]. A single repeat motif may even form multiple structures depending on the ionic conditions [11,12].
Sequences prone to G-quadruplex formation are found throughout all genomes, from archaea to higher eukaryotes. Although G4s were originally identified in eukaryotes telomeres, oncogene promoters, or the DNA region coding for immunoglobulin heavy-chain switch regions, they are also present in bacteria. G4 sequence patterns were mapped in the Escherichia coli genome and found more than 3000 times [13]. In eukaryotes, G4-forming regions often correlate with genomic regions that are unstable, such as in many oncogene promoters, where they could participate in the regulation and expression of these genes [9,14,15,16]. Furthermore, the expression of many genes is influenced by mutations affecting the functions of the RecQ family of helicases, mainly the Bloom syndrome protein (BLM) and the Werner syndrome protein (WRN) helicases that can unwind quadruplexes. This suggests a potential global role for G4-helicases in gene expression regulation [17]. For this reason, the association of G-quadruplex sequences with oncogenes and telomeres has been a focus for the development of therapeutic intervention [18].
Note that in addition to G4 sequences, on the complementary strand of G-quadruplexes, C-rich sequences can also form a four-stranded motif called an i-motif [19,20]. In this motif, cytosines form two interdigitated C•C+ base-paired strands at a low pH [21], as shown in human telomere repeats and several oncogene promoters.
Since G-quadruplexes can be very stable, they can result in DNA replication pausing in vivo. A replication fork blockage requires a subsequent restart pathway, and mutations can occur during this process [22,23]. Thus, DNA helicases that unwind replication-blocking structures, including G-quadruplexes, and can influence genomic instability have been identified [24]. Additionally, proteins other than helicases (ATP independent) may also influence the genetic instability in bacteria [25,26,27]. Therefore, the enzymes involved in replication restart and recombination may be expected to influence the instability of G-quadruplex-forming repeats. One might expect these proteins to be potential therapeutic targets [28].
In this work, we focused our analysis on the role of three helicases in resolving G4 in vivo. Note that we will not discuss the role of E. coli Rep helicase, a protein serving during DNA replication and implicated in resolving E. coli DNA G4, as this has been analyzed previously [27,29]. Moreover, although G4 can be formed in RNA strands and despite the fact that G-quadruplexes might influence translation efficiency, we will not consider these processes as we focused on genetic instability at the level of chromosomal DNA (through monitoring specific deletions).
In Escherichia coli, several helicases and nucleic acid remodeling proteins could potentially affect G-quadruplexes [26]. Few bacterial G4 helicases have been identified, but the best characterized is bacterial RecQ. RecQ belongs to a conserved family of helicases important for genome maintenance [30] (see above, WRM and BLM). The RecQ family of helicases can unwind G-quadruplex structures [31,32]. Nevertheless, while in vitro approaches have shown that bacterial RecQ can resolve G4s, it is unclear whether it plays a role in vivo [27]. Recently, a structure of the RecQ bound to G4 DNA was resolved [33], evidence of a guanine-flipping and sequestration model.
Another helicase that may play a role in G-quadruplex stability is that encoded by the dinG gene. DinG unwinds branched DNA structures from 5′ to 3′ [34,35]. DinG requires a 5′ single-stranded tail of 11–15 nucleotides, and it unwinds both D-loops and R-loops [34]. D-loops and R-loops are comprised of a single-stranded bubble displaced by a DNA or RNA strand hybridized to the complementary strand [36]. DinG is a homolog of the human protein FANCJ that can unwind G4s. DinG and Rep may cooperate to promote G4 unwinding [37].
Finally, RecG is a structure-specific, branched-DNA helicase (translocase) involved in recombination during DNA replication restart [38]. RecG has also been shown to resolve four-stranded DNA structures [38]. Nevertheless, its possible role in resolving G4 in vivo has not been addressed previously.
In this work, to understand the relationships between G-quadruplex formation in cells, helicases, and genomic instability, G-quadruplex forming repeats with different propensities for formation were cloned into the chloramphenicol acetyltransferase gene (cat) in the plasmid pBR325. This genetic system, operating in E. coli cells, reports complete or partial deletion and frameshift mutations, as well as differences due to the location of the G-quadruplex-forming sequence in either the leading or lagging template strand of replication [39]. The analysis of the mutation rates can provide information about the relationship between G-quadruplex formation in cells and genetic instability. Here, we examined the influence of DinG, RecG, and RecQ helicases on bacterial G4-genetic instability.

2. Materials and Methods

2.1. Bacterial Strains and Media

The E. coli strains used were HB101 (supE44 hsdS20(rB-mB-) recA13 ara-14 proA2 lacY1 galK2 rpsL20 xyl-5 mtl), BW25113 (Δ(araD-araB)567, ΔlacZ4787::rrnB-3 λ- rph-1 Δ(rhaD-rhaB)568 hsdR514), JW0784 (BW25113 with ΔdinG771::kan), AB1157 (F- thr-1 leuB6 thi-1 lacY1 galK2 ara14 xyl5 mtl-1 proA2 his4 argE3 rpsl3 (StrR) tsx-33 supE44 kdgK51), DL1104 (AB1157 with ΔrecG263::kan), SWM1003 (AB1157 with ΔrecQ::kan), and N6916 (AB1157 with ΔrecG263::kan ΔrecQ::kan).
K media and LB (Luria-Bertani) broth have been described [23,40] and were supplemented, when needed, with ampicillin at 30 μg/mL. K medium diluted with M9, salts, and thiamine but without glucose or casamino acids was used for some mutation rate assays. To select Cm-resistant (Cmr) bacteria, 25 μg/mL of chloramphenicol (Cm) was added to the LB plates.

2.2. Cloning Quadruplex-Forming Repeats

Complementary oligonucleotides for (G3T)4, (G3T)8, (G3T2)4, (G4T)4, (G4T2)4, and RET G4 DNA inserts were obtained from Integrated DNA Technologies, Coralville, IA, USA. For structural studies, complementary oligonucleotides were hybridized and then ligated in the EcoRI site of pGEM®-Z3 (Promega, Madison, WI, USA). For mutation rate assays, DNA inserts were cloned into pBR325 [40,41] and confirmed by sequencing. The RET G4 repeat [(G4C)3G5C] was 30-bp but contained an in-frame TAG stop codon. This design, in one orientation, should favor the selection of predominantly complete deletions between the flanking EcoRI sites. pBR235 plasmids contained a revered BsrBI fragment, constructed as described in [25]. Plasmids were purified for structural studies from ethidium bromide CsCl (Sigma, Saint Louis, MO, USA) density gradients.

2.3. Mutation Rate Measurement and Analysis of Cmr Revertants

Luria-Delbrück fluctuation analyses were performed to determine the mutation rates [42]. Eighteen parallel 1-mL E. coli cultures (started at ~104 cells/mL) were grown from overnight cultures of a single colony. Viable cell counts were an average of three independent cultures plated on LB-ampicillin plates. Cmr revertants were determined from LB-Cm plates. The MSS estimator [43] was used to calculate mutation rates.
Due to different inherent mutation rates, different media were used for some of the bacterial strains. The inserts with the highest mutation rates had rates too high for analysis using 1 mL cultures of LB or K medium. By limiting glucose and casamino acids (CAA), the final cell density could be adjusted to result in a number of Cmr colonies that could be accurately counted (~200–400). These analyses were performed using 150 μL cultures grown in a 96-well microtiter dish. For strains BW25113 and JW0784, K medium was used for cells containing plasmids with (G3T)4, 0.001 K medium (0.001% glucose and CAA) for (G3T)8, 0.01 K medium (0.01% glucose and CAA) for (G3T2)4, 0.025 K medium (0.025% glucose and CAA) for (G3T)4-dimer, 0.001 K medium for (G4T)4, 0.2 K medium (0.2% glucose and CAA) for (G4T2)4, and 0.001 K medium for G4T-dimer. For AB1157 and derivatives, 0.005 K medium (0.005% glucose and CAA) was used for (G4T)4-dimer, and 0.005 K medium for (G3T)8. Mutation spectra were determined as described [25,40].

3. Results

3.1. Confirmation of the DNA Structure Formation in G-Quadruplex-Forming DNA Repeats

(G3T)4, (G4T)4, (G3T2)4, (G4T2)4, (G3T)8, and the RET oncogene repeat (G4C)3G5C form G-quadruplex structures that have been well characterized structurally [25]. Circular Dichroism (CD) has, for instance, been used to confirm they adopt a specific quadruplex conformation [25,44]. DNA oligonucleotides with (G3T)n and (G4T)n repeats formed parallel quadruplexes with a CD peak at 264 nm and a minimum at 243 nm [44]. The Ret G4 repeat also gave a spectra diagnostic for a parallel strand quadruplex, as reported previously [45]. The CD spectra of (G4T2)4 in NaCl and KCl exhibited peaks at 265 nm and ~292 nm, with a minimum at ~243 nm, which is diagnostic for a 3 + 1 structure [44]. Note that the (G3T2)4 repeat formed a 3 + 1 G-quadruplex in NaCl but a parallel G-quadruplex in KCl. Finally, the complementary C-rich strand could form an i-motif. The (C3A)4 repeat exhibited a CD peak at ~290 nm at pH 4.7, diagnostic for an i-motif, while the spectra at neutral pH were characteristic of single-stranded DNA [44]. All the spectra can be found in and downloaded from our Nucleic Acid Circular Dichroism Database NACDDB https://genesilico.pl/nacddb/ (accessed on 28 August 2023) [44]. Relative stabilities (Tm) of the various structures are given in Supplementary Table S1.

3.2. Plasmids for the Analysis of the Genetic Instability of G-Quadruplex-Forming Repeats

To understand the inherent genetic instability of G-quadruplex-forming DNA repeats, the various sequences were cloned in the EcoRI site of the plasmid pBR325 (Figure S1). As the stability of G-quadruplex structures varies with the sequence, this allows for the analysis of the correlation of inherent stability with the rates of instability. Insertion in the EcoRI site in the gene encoding chloramphenicol acetyltransferase (cat) renders cells chloramphenicol sensitive (Cms). Genetic instability within the G-quadruplex-forming repeats, resulting in complete deletion between flanking EcoRI sites, partial deletion, or simple frameshift mutation, can result in a chloramphenicol-resistant (Cmr) phenotype, as observed in other repeats [40]. To ascertain the genetic instability differences when a G-rich sequence resides in the leading or lagging replication strands, the ampicillin-resistance (bla) gene and unidirectional replication origin ColE1 were reversed for the pBR235 series plasmids [25].

3.3. Measurement of the Mutation Rates of Quadruplex-Forming Repeats in pBR325 and pBR235

Chloramphenicol resistance (Cmr) mutation rates for G-quadruplex-forming repeats were measured in strain BW25113 (wild type), as described in Materials and Methods. In this genetic background, mutation rates varied by more than 105 as a function of the DNA sequence, ranging from 5.5 × 10−5 to 2.7 × 10−10 Cmr revertants per cell per generation (Figure 1). In the three repeats containing (C3A)n, longer repeats showed higher rates of instability, with (C3A)4-dimer > (C3A)8 > (C3A)4. The large increase in mutation rates (factors of 270 and 354) between (C3T)8 and (G3T)8 reflected the possibility of transcription-driven G4 formation when the G-rich strand comprised the non-template strand, as discussed previously [25]. The RET G4 repeat, which can also form a stable parallel G-quadruplex, had a mutation rate similar to that of the (C3A)4-dimer repeat. The two plasmids containing (G4T)4 ((G4T)4 and (G4T)4-dimer) that formed a parallel G-quadruplex in vitro also exhibited high rates of Cmr reversion. Repeats with a TT dinucleotide interspersed between G3 or G4 tracts, which form less-stable structures, including the 3 + 1 G-quadruplex in NaCl, exhibited the lowest rates of Cmr reversion.
Mutation rates varied from 0.26-fold to 8.08-fold, dependent on the orientation of the replication origin ColE1 (Figure 1). Mutation rates were 8.08-fold, 7.44-fold, and 2.71-fold higher for (G4T)4, (G4T2)4, and RET G4 repeats, respectively, when the G-rich strand comprised the lagging template. The mutation rate for the (G4T)4-dimer repeat was 4-fold higher when the G-rich strand comprised the leading template.
Note that the use of scramble sequences to analyze the mutation rate imposed by the genetic state of the host strain, regardless of the sequence with this system, has been evaluated in our previous works [39,40,46,47,48,49]. We concluded from these works that the Cmr reversion frequencies for this series of “scrambled rate” inserts ranged from 10−10 to 10−9. Also, note that ~10−10 approximates the limits of detection in this genetic selection test and that sequences here that are less prone to structure formation (C3A2)4, and (G4T2)4 (Supplementary Table S1) has low mutation rates approaching this lower limit of detection.

3.4. Influence of DinG on the Genetic Instability of Quadruplex-Forming Repeats

To ascertain the involvement of the DinG helicase on G-quadruplex repeat instability, mutation rates were measured in the dinG mutant and isogenic dinG+ parental strains for both orientations of the replication origin. The mutation rates for plasmids in BW25113 (wild type) and JW0784 (ΔdinG) are shown in Table 1. For the (G3T)4 and (G3T)8 repeats, in both orientations, the mutation in dinG resulted in a 3-fold and ~7-fold increase in mutation rates, respectively. Other repeats analyzed showed smaller effects (<2-fold).

3.5. Influence of RecG and RecQ Helicases on the Instability of Quadruplex-Forming Repeats

The (G3T)8 and (G4T)4-dimer repeats were analyzed in AB1157 background strains containing mutations in the recG and recQ genes, encoding RecG or RecQ helicases, respectively, and in a recG recQ double mutant. For the (G3T)8 repeat, the mutation rates ranged from 5.95 × 10−9 to 1.39 × 10−8 in pBR325 when the G-rich tract comprised the leading template and from 3.12 × 10−8 to 1.18 × 10−6 in pBR235 when the G-rich tract comprised the lagging template (Table 2). In this genetic background, a significant dependence on orientation for mutation rates of 5-fold to 174-fold was observed, with a higher mutation rate when the G-rich strand constituted the lagging template.
The role of the RecG and RecQ helicases in the instability of G-quadruplex-forming repeats was assessed by comparing the mutation rates in wild type and mutant strains. For the (G3T)8 repeat, little effect was observed for the stable orientation (pBR325) when the G-rich strand comprised the leading template (C-rich, lagging strand) (Table 2). In contrast, when the G-rich strand comprised the lagging template, the recG mutation resulted in a 3.4-fold decrease in mutation rate, while the recQ mutation resulted in a 1.95-fold increase in this rate. The recG recQ double mutant exhibited a 10.8-fold increase in the mutation rate. Cmr reversion rates for strains containing the (G4T)4-dimer, which, as discussed below, may not form alternative structures in cells, showed less than 2-fold differences in the recG, recQ, and recG recQ mutant strains compared with the parental strain for both orientations (Table 2). This repeat did show leading/lagging asymmetries with a higher rate when the G-rich strand comprised the lagging template.

3.6. Cmr Mutation Spectra for E. coli Containing Quadruplex-Forming Repeats

Cmr revertants were scored for complete or partial deletions using PCR and DNA sequencing, as described previously [25,40]. Different repeats underwent different types of mutation in the cat gene, and this was generally independent of genetic background, although some effects of the specific mutations analyzed were observed.
For the (G3T) series repeats that form the most stable G-quadruplex, different mutation spectra were observed for the 3 repeats. The (G3T)4-dimer underwent complete deletions between EcoRI sites in 24 of 25 revertants analyzed, with one partial deletion to a 21-bp insert. The (G3T)8 repeat predominantly exhibited deletion of (G3T)2, leaving a 30-bp insert with the remaining events being complete deletions. In the combined BW25113 and AB1157 backgrounds, 59/62 and 44/48 events were a (G3T)2 deletion in pBR325 and pBR235, respectively. For the (G3T)4 repeat, when the G-rich strand comprised the lagging template, 67% (6/9) and 100% (10/10) complete deletions occurred in the wild type and dinG mutant cells, respectively. In the opposite orientation, 56% (9/16) and 60% (6/10) complete deletions occurred in the wild type and dinG mutant cells, respectively.
For the (G4T) series repeats, complete deletions occurred for the (G4T)4-dimer in 85% (39/47) of the revertants analyzed in the BW25113 background and 97% (70/72) in the AB1157 derivatives. The (G4T)4 repeat underwent deletion of one (G4T) repeat (at a high rate) in all cases examined (46/46).
For the (G3T2)4 and (G4T2)4 repeats that form the less stable 3 + 1 G-quadruplex and were the most stable genetically, partial deletions occurred preferentially (90–100% in pBR325 and 60–62% in pBR235 plasmids). The deletion mutations analyzed were all consistent with replication slippage. The sequences of major and representative mutations are shown in Figure S2.

4. Discussion

G-quadruplex-forming DNA repeats occur upstream of many human promoters, including those of proto-oncogenes. The correspondence of breaks and chromosomal rearrangements associated with these sequences is consistent with an interpretation that the formation of an alternative DNA structure, specifically a G-quadruplex in duplex DNA, may lead to replication fork blockage and ultimately a double-stranded break that may initiate events responsible for genomic instability [23]. To understand how G-quadruplex structure formation in cells can lead to bacterial genomic instability, we have characterized the relative genetic instability of a series of G-quadruplex-forming repeats in an E. coli model mutagenesis system. The rates of genomic instability and reversion to chloramphenicol resistance by complete or partial deletion are consistent with the interpretation that, for certain repeats, G-quadruplex formation during transcription, replication, or replication restart may promote the genetic instability of the repeats. In addition, the DinG, RecG, and RecQ helicases influence rates of genetic instability associated with these repeats.
Structural analysis, stability, and rate of formation measurements in single-stranded DNA have been completed for simple G-quadruplex forming sequences, including those analyzed here [50]. In general, an increase in loop length decreases the stability of the G-quadruplex [51].
G-quadruplex structures may form in cells by different pathways (Figure 2). First, a G-quadruplex may form driven by the energy inherent in negatively supercoiled DNA. An i-motif may then form in the complementary strand (Figure 2A). Noncanonical structures may form following strand denaturation and slow renaturation [52] or potentially during incubation in a buffer containing K+ that acts to stabilize G-quadruplex structures [53]. Our extensive analyses failed to observe a structural transition in supercoiled DNA in a physiological ionic strength buffer for any of the G-quadruplex-forming repeats studied here, consistent with Sekibo and Fox [54]. Transcription resulting in a G-rich RNA strand can cause the formation of an R-loop, leading to a G-quadruplex in the displaced strand (see Figure 2B), as shown by Duquette et al. and Parekh et al. [25,55,56]. The formation of an RNA-DNA hybrid during the transcription of a G-rich tract appears to be common [57].
Moreover, our results confirmed transcription-driven G-quadruplex formation in the (G3T) series repeats in vitro [25]. In cells with the (G3T)8 repeat in an orientation that supports R-loop formation in vivo, deletion rates are dramatically increased compared with the opposite orientation where R-loop and structure formation do not occur. A replication fork may encounter a G-quadruplex structure once formed within an R-loop. pBR325 and pBR235 would contain the G-quadruplex in the leading and lagging template strands, respectively. Another possibility for G-quadruplex formation involves formation in the lagging strand when it is single-stranded (Figure 2D). As the leading strand generally remains duplex, a lower probability may exist for G-quadruplex formation prior to the arrival of the replication fork (Figure 2C). However, if the helicase were uncoupled from polymerase, a single-stranded region may be generated that could support G-quadruplex formation. If, however, a replication fork blockage occurs at a G-quadruplex-forming repeat during replication due to inherent pausing or an encounter with a DNA secondary structure, then replication restart pathways provide many opportunities in regions of single-stranded DNA for G-quadruplex structure formation, as shown in Figure 3A. If the G-rich repeat comprises the lagging template, replication restart pathways afford fewer opportunities for structure formation (Figure 3B). An additional factor that may influence the probability of formation is that, in pBR325, the direction of replication and transcription are convergent, which may slow or block replication.
Many different mutations within the G-quadruplex region can result in a Cmr phenotype. First, replication slippage between proximal EcoRI sites flanking the G4-forming sequences will result in the complete deletion of the cloned insert, as shown in Figure 2D. Second, replication slippage within a G4-forming repeat tract can restore the reading frame, resulting in a Cmr phenotype. Third, replication slippage that does not involve the G4-forming repeats can restore the reading frame and Cmr phenotype. The rate and type of mutation event observed are often dependent on the secondary DNA structure. Moreover, mutation rates and spectra observed will also reflect the frequency with which an event occurs that leads to a Cmr phenotype. For example, Cmr reversion by a single trinucleotide deletion within (CTG)25•(CAG)25 can occur at a rate of 5 × 10−2, but lower rates in longer repeat tracts reflect the requirement for larger deletions [40].
Complete deletion of palindromic inserts cloned between flanking EcoRI sites can occur at very high frequencies, and this is believed to be associated with a cruciform or hairpin formation [39,46,48]. In addition, imperfect hairpins within (CTG)•(CAG) repeats are believed to promote high frequencies of deletions [40,58]. Moreover, asymmetries in rates of deletions and duplications in leading or lagging replication strands are frequently observed with cloned inserts that can form alternative DNA structures [39,40,46,48,58,59]. A working hypothesis evidences that a G4 structure blocks the replication fork and brings the flanking restriction sites into closer juxtaposition, which then facilitates misalignment between the proximal restriction sites (Figure 2D), and this is responsible for the high frequency of deletion.
Complete deletions were the predominant event for (G3T)4, (G3T)4-dimer, and the (G4T)4-dimer repeats. Complete deletions in (G3T)4 are consistent with replication slippage across a G-quadruplex. In the dinG mutant, when the G-rich tract comprises the lagging template, all revertants showed complete deletions, compared with 67% of the parental wild type strain. This result may be consistent with DinG activity that prevents the accumulation of G4 structures. In the wild type and dinG strains, the (G4T)4-dimer and (G3T)4-dimer exhibited predominantly, or exclusively, complete deletions that occurred at relatively high frequencies. While G-quadruplex structure formation may occur in these longer repeat tracts, it is also possible that the central palindromic EcoRI site may favor the formation of an antiparallel Hoogsteen base-paired hairpin that promotes replication slippage at a high frequency in a fashion analogous to that observed for (CTG)•(CAG) repeats [40].
All partial deletions that restored the reading frame and resulted in a Cmr phenotype were consistent with primer-template slippage between repeats. In the (G4T)4 repeat, deletion of a single (G4T) repeat, resulting in (G4T)3, restored the reading frame resulting in reversion to Cmr. Complete deletions were not observed. This simple slippage mutation happened at a very high frequency with a higher rate when the G-rich strand comprised the lagging template. Moreover, the absence of an effect of the dinG mutation and the inability of this repeat to form G-quadruplex structures on transcription all suggest that slippage is not associated with G-quadruplex structure formation. Remarkably, the simple slippage of a single (G3T) repeat, which restores the reading frame in the most stable G-quadruplex-forming sequence (G3T)4, was only rarely observed; complete deletions were predominant, consistent with structure formation.
The (G3T)8 repeat showed a high Cmr reversion frequency from deletion involving two (G3T) repeats. This long repeat may form stable G4 structures resulting from transcription in cells. Moreover, mutations in the dinG, recQ, and recG genes encoded structure-specific helicases influenced mutation rates, consistent with structure formation. During transcription in vitro, the (G3T)8 can form one or two G4 structures [25]. The formation of a stable parallel (G3T)4 may block DNA polymerase, promoting the deletion of two (G3T) repeats.
The low rates of Cmr reversion for the (G3T2)4 and (G4T2)4 repeats, the predominance of partial deletions, the low thermal stabilities of their G4 structures (3 + 1 and parallel for (G3T2)4 in K+), and the lack of structure formation on transcription may all be consistent with the lack of structure formation in cells.
Complex sequence changes in a single genetic mutation event can be diagnostic of primer-template misalignments that occur during the synthesis of either the leading or the lagging strand [59,60,61,62,63]. All complete and partial deletions shown in Figure S2 could occur by slippage between direct repeats during the replication of either the leading or lagging strand, as denoted by thick underlines with arrowheads facing in both directions. Two sequenced Cmr revertants represented complex mutations that revealed the direction of replication. First, complete deletion of the (G3T)4 (the most stable G-quadruplex) occurred concomitantly with a C to G change 5′ to the EcoRI site (Figure S2), consistent with replication slippage in the lagging strand (in pBR235) following synthesis of the first GGGT repeat. Second, in several independent (G4T)4-dimer revertants, concomitant with the deletion of the entire repeat, the sequence 5′CGTATGG3′ 3′ of the EcoRI site was converted to 5′GGGGTGG3′, consistent with replication slippage following synthesis into the second GGGGT repeat during synthesis of the leading strand (in pBR235). In both cases, polymerase paused after synthesis through the first repeat and initiated a misalignment across the repeat. Replication into a structure could promote pausing and misalignment, as is known to occur during the replication of hairpins [64].
In general, the ease of formation and the stability of alternative DNA structures, which include cruciforms, Z-DNA, and intermolecular triplex structures, increase with the length of the structure-forming DNA repeat [1]. Moreover, the genetic instability (measured biochemically) of the direct repeats that form slipped strand structures can increase as a function of the repeat length [65], although in a genetic assay, a simple slippage that results in a Cmr reversion in a short repeat can occur at a higher rate than a structure-dependent reversion event in a long repeat [40,49]. For the quadruplex-forming sequences, in general, longer repeats exhibited higher mutation rates (Figure 2).
DinG, RecG, and RecQ helicases possess unwinding activities on unusual DNA structures. DinG will unwind branched structures, D-loops, and R-loops [34,35]. RecG and RecQ helicase family members, including the WRN and BLM helicases, will unwind G-quadruplex DNA structures [66,67]. These E. coli proteins are important in replication restart activities in which branched structures associated with replication forks are removed and resolved by them [34,35]. In addition, DinG and RecQ helicases may act to remove R-loops to avoid ectopic replication initiation [37].
A mutation in dinG increased the mutation rates of complete deletions for plasmids containing (G3T)4 and partial deletions in (G3T)8, in either orientation with respect to replication fork progression, by factors of 3.3 and 7, respectively. These results suggest that the DinG helicase may be removing G-quadruplex structures formed in either leading or lagging templates during replication. The mutation rates for the (G3T)4 and (G3T)8 repeats differed by 1000-fold, presumably reflecting the potential for structure formation, which is clearly higher for the longer repeat. The similar influence of the dinG mutation, when the G-rich strand comprised the leading or lagging template strand, may indicate that DinG is actually acting predominantly to remove the G-quadruplex structures formed within transcription-dependent R-loops, as this is replication orientation independent.
Little effect of a mutation in dinG was observed with the (G4T)4 repeats. However, the mutation, a deletion of one (G4T), occurred at a very high rate for this repeat. This high rate of replication slippage may be structure-induced, or, if not, it may mask any structure-dependent mutations that may occur at a much lower rate. While the mutation spectra of predominantly complete deletions for the (G4T)4-dimer are consistent with structure formation, little effect of the dinG mutation was observed. It is also possible that a structure formed with the (G4T)4-dimer repeat was not recognized or resolved by the DinG helicase.
For cells with deficiencies in the RecG and RecQ helicases, a large influence on mutation rates was evidenced for the (G3T)8 repeat. In strains of this AB1157 genetic background, mutation spectra were indicative of structure-related instabilities (complete or partial deletions). For the (G3T)8 repeat, the orientation dependence for mutation rates was highest by factors of 5–176 when the G-rich strand comprised the lagging template strand, where structures may be most likely to form. While a deficiency in RecG caused decreased mutation rates by 3-fold, the deficiency in RecQ and the double mutant showed elevated mutation rates (to a factor of 10), consistent with an interpretation that these helicases may participate in structure removal from the lagging strand template. The (G4T)4-dimer repeat showed a different pattern of response to the RecG and RecQ helicase dysfunctions. First, mutation rates were overall high (10−6 to 4 × 10−5) and highest when the G-rich strand comprised the leading strand template. This pattern was also observed in the BW25113 background and both the wild type and dinG mutant strains. Mutations in recG and recQ had little effect on (G4T)4-dimer repeat Cmr mutation rates. The high proportion of complete deletions is suggestive of replication slippage across a secondary structure, as expected for deletion of inverted repeats forming cruciforms and (CTG)•(CAG) repeats forming hairpins [48]. This may suggest the replication fork pausing during replication of these repeats, as observed for CTG, CGG, and GAA repeats [68]. Replication fork pausing would be associated then with the formation of a complex structure that is not susceptible to removal by the helicases studied here.

5. Conclusions

In summary, our results support the proposal that G-quadruplex structures form in E. coli cells and promote replication misalignment when a replication fork encounters a stable G-quadruplex. These events are influenced by the actions of DNA helicases. Different effects of different helicases on G4 stability in vivo are intriguing, and they might arise from various specificities of particular enzymes that could result in specific actions (and the final results) depending on both DNA structural features and kinetics of biochemical processes (replication, recombination) occurring in certain genome regions. The observed effects could also participate in the rearrangements associated with promoters containing G-quadruplex-forming sequences, which is supported by previous suggestions [55,69,70]. G4 motifs are indeed over-represented in the regulatory regions of bacterial genes, as evidenced by in silico evaluations [13]. The results of this study may also shed new light on the role of G4s in the physiology of pathogenic bacteria, especially as it was demonstrated that G4 results in antigen variation that may help bacteria evade the host immune system [71]. Other features of bacterial pathogenicity may also be affected in this way, especially in the expression of the genes encoding the proteins involved in bacterial virulence or antibiotic resistance or if the replication of plasmids bearing such genes is influenced. In fact, the importance of such regulations in antibiotic resistance has been demonstrated recently [72]. Finally, our analysis may also give new clues in the emerging field of G4 in the mitochondrial genome, as it was shown that in the quadruplex regions of mitochondrial DNA, genetic variation is enhanced [73].

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes14091720/s1, Figure S1: Relationship between (G4T)4 repeats, transcription, and replication fork direction in pBR325 and pBR235; Figure S2: Deletion mutations consistent with primer-template misalignment during replication; Table S1: G-quadruplex-forming DNA repeats.

Author Contributions

Conceptualization, V.A. and R.R.S.; methodology, V.J.P. and R.R.S.; software, R.R.S.; validation, V.J.P. and R.R.S.; formal analysis, V.J.P., V.A., G.W. and R.R.S.; investigation, V.J.P. and R.R.S.; resources, R.R.S.; writing—original draft preparation, G.W., V.A. and R.R.S.; writing—review and editing, V.J.P., G.W., V.A. and R.R.S.; visualization, R.R.S.; supervision, R.R.S.; project administration, R.R.S.; funding acquisition, G.W., V.A. and R.R.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by CNRS & CEA (V.A.) and the University of Gdansk 531-D020-D242-23; (G.W.). This study contributes to the IdEx Université de Paris ANR-18-IDEX-0001 (V.A.).

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding authors.

Acknowledgments

We are grateful to Brittany A. Niccum, Marisa A. Rivera, Amber L. Thomas, Rachna Shah, Vishnu M. Kumarasamy, Jan C. Varada, Glen E. Hamman, Yasir H. Syed, and Mark J. Novak for assistance with the experiments. We also thank K. Mosca (LLB CEA, Kevin and Synchrotron SOLEIL) for his help in SRCD spectra analysis. Synchrotron Radiation Circular Dichroism G4 spectra available in NACDDB were performed on DISCO beamline at SOLEIL Synchrotron, France (proposals 20201304, 20200074, 20210819). We are thankful to F. Wien (Synchrotron SOLEIL) for many fruitful discussions regarding CD of nucleic acids.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sinden, R.R. DNA Structure and Function; Academic Press: Cambridge, MA, USA, 1994. [Google Scholar]
  2. Duardo, R.C.; Guerra, F.; Pepe, S.; Capranico, G. Non-B DNA structures as a booster of genome instability. Biochimie, 2023; in press. [Google Scholar] [CrossRef]
  3. Mirkin, S.M.; Frank-Kamenetskii, M.D. H-DNA and related structures. Annu. Rev. Biophys. Biomol. Struct. 1994, 23, 541–576. [Google Scholar] [CrossRef] [PubMed]
  4. Wang, G.; Vasquez, K.M. Dynamic alternative DNA structures in biology and disease. Nat. Rev. Genet. 2023, 24, 211–234. [Google Scholar] [CrossRef] [PubMed]
  5. Sen, D.; Gilbert, W. Formation of parallel four-stranded complexes by guanine-rich motifs in DNA and its implications for meiosis. Nature 1988, 334, 364–366. [Google Scholar] [CrossRef]
  6. Chen, E.V.; Nicoludis, J.M.; Powell, B.M.; Li, K.S.; Yatsunyk, L.A. Crystal structure of a three-tetrad, parallel, K(+)-stabilized human telomeric G-quadruplex at 1.35 A resolution. Acta Crystallogr. Sect. F Struct. Biol. Commun. 2023, 79, 144–150. [Google Scholar] [CrossRef]
  7. Simonsson, T. G-quadruplex DNA structures variations on a theme. Biol. Chem. 2001, 382, 621–628. [Google Scholar] [CrossRef] [PubMed]
  8. Liano, D.; Monti, L.; Chowdhury, S.; Raguseo, F.; Di Antonio, M. Long-range DNA interactions: Inter-molecular G-quadruplexes and their potential biological relevance. Chem. Commun. 2022, 58, 12753–12762. [Google Scholar] [CrossRef] [PubMed]
  9. Liu, Y.; Li, J.; Zhang, Y.; Wang, Y.; Chen, J.; Bian, Y.; Xia, Y.; Yang, M.H.; Zheng, K.; Wang, K.B.; et al. Structure of the Major G-Quadruplex in the Human EGFR Oncogene Promoter Adopts a Unique Folding Topology with a Distinctive Snap-Back Loop. J. Am. Chem. Soc. 2023, 145, 16228–16237. [Google Scholar] [CrossRef]
  10. Burge, S.; Parkinson, G.N.; Hazel, P.; Todd, A.K.; Neidle, S. Quadruplex DNA: Sequence, topology and structure. Nucleic Acids Res. 2006, 34, 5402–5415. [Google Scholar] [CrossRef] [PubMed]
  11. Phan, A.T.; Kuryavyi, V.; Luu, K.N.; Patel, D.J. Structure of two intramolecular G-quadruplexes formed by natural human telomere sequences in K+ solution. Nucleic Acids Res. 2007, 35, 6517–6525. [Google Scholar] [CrossRef]
  12. Patel, D.J.; Phan, A.T.; Kuryavyi, V. Human telomere, oncogenic promoter and 5′-UTR G-quadruplexes: Diverse higher order DNA and RNA targets for cancer therapeutics. Nucleic Acids Res. 2007, 35, 7429–7455. [Google Scholar] [CrossRef] [PubMed]
  13. Dey, U.; Sarkar, S.; Teronpi, V.; Yella, V.R.; Kumar, A. G-quadruplex motifs are functionally conserved in cis-regulatory regions of pathogenic bacteria: An in-silico evaluation. Biochimie 2021, 184, 40–51. [Google Scholar] [CrossRef] [PubMed]
  14. Deze, O.; Laffleur, B.; Cogne, M. Roles of G4-DNA and G4-RNA in Class Switch Recombination and Additional Regulations in B-Lymphocytes. Molecules 2023, 28, 1159. [Google Scholar] [CrossRef] [PubMed]
  15. Bahls, B.; Aljnadi, I.M.; Emidio, R.; Mendes, E.; Paulo, A. G-Quadruplexes in c-MYC Promoter as Targets for Cancer Therapy. Biomedicines 2023, 11, 969. [Google Scholar] [CrossRef]
  16. Panova, V.V.; Dolinnaya, N.G.; Novoselov, K.A.; Savitskaya, V.Y.; Chernykh, I.S.; Kubareva, E.A.; Alexeevski, A.V.; Zvereva, M.I. Conserved G-Quadruplex-Forming Sequences in Mammalian TERT Promoters and Their Effect on Mutation Frequency. Life 2023, 13, 1478. [Google Scholar] [CrossRef] [PubMed]
  17. Johnson, J.E.; Cao, K.; Ryvkin, P.; Wang, L.S.; Johnson, F.B. Altered gene expression in the Werner and Bloom syndromes is associated with sequences having G-quadruplex forming potential. Nucleic Acids Res. 2010, 38, 1114–1122. [Google Scholar] [CrossRef]
  18. Gunaratnam, M.; Neidle, S. An evaluation cascade for G-quadruplex telomere targeting agents in human cancer cells. Methods Mol. Biol. 2010, 613, 303–313. [Google Scholar] [CrossRef]
  19. Escaja, N.; Mir, B.; Garavis, M.; Gonzalez, C. Non-G Base Tetrads. Molecules 2022, 27, 5287. [Google Scholar] [CrossRef]
  20. Abou Assi, H.; Garavis, M.; Gonzalez, C.; Damha, M.J. i-Motif DNA: Structural features and significance to cell biology. Nucleic Acids Res. 2018, 46, 8038–8056. [Google Scholar] [CrossRef]
  21. Luo, X.; Zhang, J.; Gao, Y.; Pan, W.; Yang, Y.; Li, X.; Chen, L.; Wang, C.; Wang, Y. Emerging roles of i-motif in gene expression and disease treatment. Front. Pharmacol. 2023, 14, 1136251. [Google Scholar] [CrossRef]
  22. Atkinson, J.; McGlynn, P. Replication fork reversal and the maintenance of genome stability. Nucleic Acids Res. 2009, 37, 3475–3492. [Google Scholar] [CrossRef]
  23. Kim, S.H.; Pytlos, M.J.; Sinden, R.R. Replication restart: A pathway for (CTG).(CAG) repeat deletion in Escherichia coli. Mutat. Res. 2006, 595, 5–22. [Google Scholar] [CrossRef]
  24. Piazza, A.; Boule, J.B.; Lopes, J.; Mingo, K.; Largy, E.; Teulade-Fichou, M.P.; Nicolas, A. Genetic instability triggered by G-quadruplex interacting Phen-DC compounds in Saccharomyces cerevisiae. Nucleic Acids Res. 2010, 38, 4337–4348. [Google Scholar] [CrossRef]
  25. Parekh, V.J.; Niccum, B.A.; Shah, R.; Rivera, M.A.; Novak, M.J.; Geinguenaud, F.; Wien, F.; Arluison, V.; Sinden, R.R. Role of Hfq in Genome Evolution: Instability of G-Quadruplex Sequences in E. coli. Microorganisms 2019, 8, 28. [Google Scholar] [CrossRef] [PubMed]
  26. Cueny, R.R.; McMillan, S.D.; Keck, J.L. G-quadruplexes in bacteria: Insights into the regulatory roles and interacting proteins of non-canonical nucleic acid structures. Crit. Rev. Biochem. Mol. Biol. 2022, 57, 539–561. [Google Scholar] [CrossRef] [PubMed]
  27. Paul, T.; Voter, A.F.; Cueny, R.R.; Gavrilov, M.; Ha, T.; Keck, J.L.; Myong, S. E. coli Rep helicase and RecA recombinase unwind G4 DNA and are important for resistance to G4-stabilizing ligands. Nucleic Acids Res. 2020, 48, 6640–6653. [Google Scholar] [CrossRef]
  28. Estep, K.N.; Butler, T.J.; Ding, J.; Brosh, R.M. G4-Interacting DNA Helicases and Polymerases: Potential Therapeutic Targets. Curr. Med. Chem. 2019, 26, 2881–2897. [Google Scholar] [CrossRef] [PubMed]
  29. McGlynn, P. Helicases that underpin replication of protein-bound DNA in Escherichia coli. Biochem. Soc. Trans. 2011, 39, 606–610. [Google Scholar] [CrossRef]
  30. Wu, X.; Maizels, N. Substrate-specific inhibition of RecQ helicase. Nucleic Acids Res. 2001, 29, 1765–1771. [Google Scholar] [CrossRef]
  31. Bagchi, D.; Manosas, M.; Zhang, W.; Manthei, K.A.; Hodeib, S.; Ducos, B.; Keck, J.L.; Croquette, V. Single molecule kinetics uncover roles for E. coli RecQ DNA helicase domains and interaction with SSB. Nucleic Acids Res. 2018, 46, 8500–8515. [Google Scholar] [CrossRef]
  32. Khairnar, N.P.; Maurya, G.K.; Pandey, N.; Das, A.; Misra, H.S. DrRecQ regulates guanine quadruplex DNA structure dynamics and its impact on radioresistance in Deinococcus radiodurans. Mol. Microbiol. 2019, 112, 854–865. [Google Scholar] [CrossRef]
  33. Voter, A.F.; Qiu, Y.; Tippana, R.; Myong, S.; Keck, J.L. A guanine-flipping and sequestration mechanism for G-quadruplex unwinding by RecQ helicases. Nat. Commun. 2018, 9, 4201. [Google Scholar] [CrossRef]
  34. Voloshin, O.N.; Camerini-Otero, R.D. The DinG protein from Escherichia coli is a structure-specific helicase. J. Biol. Chem. 2007, 282, 18437–18447. [Google Scholar] [CrossRef]
  35. Voloshin, O.N.; Vanevski, F.; Khil, P.P.; Camerini-Otero, R.D. Characterization of the DNA damage-inducible helicase DinG from Escherichia coli. J. Biol. Chem. 2003, 278, 28284–28293. [Google Scholar] [CrossRef]
  36. Mackay, R.P.; Xu, Q.; Weinberger, P.M. R-Loop Physiology and Pathology: A Brief Review. DNA Cell Biol. 2020, 39, 1914–1925. [Google Scholar] [CrossRef] [PubMed]
  37. Boubakri, H.; de Septenville, A.L.; Viguera, E.; Michel, B. The helicases DinG, Rep and UvrD cooperate to promote replication across transcription units in vivo. EMBO J. 2010, 29, 145–157. [Google Scholar] [CrossRef]
  38. McGlynn, P.; Lloyd, R.G. RecG helicase activity at three- and four-strand DNA structures. Nucleic Acids Res. 1999, 27, 3049–3056. [Google Scholar] [CrossRef]
  39. Trinh, T.Q.; Sinden, R.R. Preferential DNA secondary structure mutagenesis in the lagging strand of replication in E. coli. Nature 1991, 352, 544–547. [Google Scholar] [CrossRef]
  40. Hashem, V.I.; Rosche, W.A.; Sinden, R.R. Genetic assays for measuring rates of (CAG).(CTG) repeat instability in Escherichia coli. Mutat. Res. 2002, 502, 25–37. [Google Scholar] [CrossRef] [PubMed]
  41. Kim, S.H.; Cai, L.; Pytlos, M.J.; Edwards, S.F.; Sinden, R.R. Generation of long tracts of disease-associated DNA repeats. Biotechniques 2005, 38, 247–253. [Google Scholar] [CrossRef]
  42. Luria, S.E.; Delbruck, M. Mutations of Bacteria from Virus Sensitivity to Virus Resistance. Genetics 1943, 28, 491–511. [Google Scholar] [CrossRef] [PubMed]
  43. Sarkar, S.; Ma, W.T.; Sandri, G.H. On fluctuation analysis: A new, simple and efficient method for computing the expected number of mutants. Genetica 1992, 85, 173–179. [Google Scholar] [CrossRef]
  44. Cappannini, A.; Mosca, K.; Mukherjee, S.; Moafinejad, S.N.; Sinden, R.R.; Arluison, V.; Bujnicki, J.; Wien, F. NACDDB: Nucleic Acid Circular Dichroism Database. Nucleic Acids Res. 2022, 51, D226–D231. [Google Scholar] [CrossRef] [PubMed]
  45. Guo, K.; Pourpak, A.; Beetz-Rogers, K.; Gokhale, V.; Sun, D.; Hurley, L.H. Formation of pseudosymmetrical G-quadruplex and i-motif structures in the proximal promoter region of the RET oncogene. J. Am. Chem. Soc. 2007, 129, 10220–10228. [Google Scholar] [CrossRef] [PubMed]
  46. Trinh, T.Q.; Sinden, R.R. The influence of primary and secondary DNA structure in deletion and duplication between direct repeats in Escherichia coli. Genetics 1993, 134, 409–422. [Google Scholar] [CrossRef]
  47. Rosche, W.A.; Trinh, T.Q.; Sinden, R.R. Differential DNA secondary structure-mediated deletion mutation in the leading and lagging strands. J. Bacteriol. 1995, 177, 4385–4391. [Google Scholar] [CrossRef]
  48. Sinden, R.R.; Zheng, G.X.; Brankamp, R.G.; Allen, K.N. On the deletion of inverted repeated DNA in Escherichia coli: Effects of length, thermal stability, and cruciform formation in vivo. Genetics 1991, 129, 991–1005. [Google Scholar] [CrossRef]
  49. Edwards, S.F.; Hashem, V.I.; Klysik, E.A.; Sinden, R.R. Genetic instabilities of (CCTG).(CAGG) and (ATTCT).(AGAAT) disease-associated repeats reveal multiple pathways for repeat deletion. Mol. Carcinog. 2009, 48, 336–349. [Google Scholar] [CrossRef]
  50. Wang, Y.; Patel, D.J. Solution structure of the Tetrahymena telomeric repeat d(T2G4)4 G-tetraplex. Structure 1994, 2, 1141–1156. [Google Scholar] [CrossRef]
  51. Kumar, N.; Sahoo, B.; Varun, K.A.; Maiti, S.; Maiti, S. Effect of loop length variation on quadruplex-Watson Crick duplex competition. Nucleic Acids Res. 2008, 36, 4433–4442. [Google Scholar] [CrossRef]
  52. Nambiar, M.; Goldsmith, G.; Moorthy, B.T.; Lieber, M.R.; Joshi, M.V.; Choudhary, B.; Hosur, R.V.; Raghavan, S.C. Formation of a G-quadruplex at the BCL2 major breakpoint region of the t(14;18) translocation in follicular lymphoma. Nucleic Acids Res. 2011, 39, 936–948. [Google Scholar] [CrossRef] [PubMed]
  53. Sun, D.; Hurley, L.H. The importance of negative superhelicity in inducing the formation of G-quadruplex and i-motif structures in the c-Myc promoter: Implications for drug targeting and control of gene expression. J. Med. Chem. 2009, 52, 2863–2874. [Google Scholar] [CrossRef]
  54. Sekibo, D.A.T.; Fox, K.R. The effects of DNA supercoiling on G-quadruplex formation. Nucleic Acids Res. 2017, 45, 12069–12079. [Google Scholar] [CrossRef] [PubMed]
  55. Duquette, M.L.; Huber, M.D.; Maizels, N. G-rich proto-oncogenes are targeted for genomic instability in B-cell lymphomas. Cancer Res. 2007, 67, 2586–2594. [Google Scholar] [CrossRef] [PubMed]
  56. Duquette, M.L.; Handa, P.; Vincent, J.A.; Taylor, A.F.; Maizels, N. Intracellular transcription of G-rich DNAs induces formation of G-loops, novel structures containing G4 DNA. Genes. Dev. 2004, 18, 1618–1629. [Google Scholar] [CrossRef] [PubMed]
  57. Belotserkovskii, B.P.; Liu, R.; Tornaletti, S.; Krasilnikova, M.M.; Mirkin, S.M.; Hanawalt, P.C. Mechanisms and implications of transcription blockage by guanine-rich DNA sequences. Proc. Natl. Acad. Sci. USA 2010, 107, 12816–12821. [Google Scholar] [CrossRef]
  58. Hashem, V.I.; Rosche, W.A.; Sinden, R.R. Genetic recombination destabilizes (CTG)n.(CAG)n repeats in E. coli. Mutat. Res. 2004, 554, 95–109. [Google Scholar] [CrossRef]
  59. Hashem, V.I.; Sinden, R.R. Duplications between direct repeats stabilized by DNA secondary structure occur preferentially in the leading strand during DNA replication. Mutat. Res. 2005, 570, 215–226. [Google Scholar] [CrossRef]
  60. Glickman, B.W.; Ripley, L.S. Structural intermediates of deletion mutagenesis: A role for palindromic DNA. Proc. Natl. Acad. Sci. USA 1984, 81, 512–516. [Google Scholar] [CrossRef]
  61. Rosche, W.A.; Trinh, T.Q.; Sinden, R.R. Leading strand specific spontaneous mutation corrects a quasipalindrome by an intermolecular strand switch mechanism. J. Mol. Biol. 1997, 269, 176–187. [Google Scholar] [CrossRef]
  62. Rosche, W.A.; Ripley, L.S.; Sinden, R.R. Primer-template misalignments during leading strand DNA synthesis account for the most frequent spontaneous mutations in a quasipalindromic region in Escherichia coli. J. Mol. Biol. 1998, 284, 633–646. [Google Scholar] [CrossRef] [PubMed]
  63. Viswanathan, M.; Lacirignola, J.J.; Hurley, R.L.; Lovett, S.T. A novel mutational hotspot in a natural quasipalindrome in Escherichia coli. J. Mol. Biol. 2000, 302, 553–564. [Google Scholar] [CrossRef] [PubMed]
  64. Gordenin, D.A.; Lobachev, K.S.; Degtyareva, N.P.; Malkova, A.L.; Perkins, E.; Resnick, M.A. Inverted DNA repeats: A source of eukaryotic genomic instability. Mol. Cell Biol. 1993, 13, 5315–5322. [Google Scholar] [CrossRef] [PubMed]
  65. Rosche, W.A.; Jaworski, A.; Kang, S.; Kramer, S.F.; Larson, J.E.; Geidroc, D.P.; Wells, R.D.; Sinden, R.R. Single-stranded DNA-binding protein enhances the stability of CTG triplet repeats in Escherichia coli. J. Bacteriol. 1996, 178, 5042–5044. [Google Scholar] [CrossRef]
  66. Sun, H.; Karow, J.K.; Hickson, I.D.; Maizels, N. The Bloom’s syndrome helicase unwinds G4 DNA. J. Biol. Chem. 1998, 273, 27587–27592. [Google Scholar] [CrossRef]
  67. London, T.B.; Barber, L.J.; Mosedale, G.; Kelly, G.P.; Balasubramanian, S.; Hickson, I.D.; Boulton, S.J.; Hiom, K. FANCJ is a structure-specific DNA helicase associated with the maintenance of genomic G/C tracts. J. Biol. Chem. 2008, 283, 36132–36139. [Google Scholar] [CrossRef]
  68. Krasilnikova, M.M.; Mirkin, S.M. Replication stalling at Friedreich’s ataxia (GAA)n repeats in vivo. Mol. Cell Biol. 2004, 24, 2286–2295. [Google Scholar] [CrossRef] [PubMed]
  69. Maizels, N. Dynamic roles for G4 DNA in the biology of eukaryotic cells. Nat. Struct. Mol. Biol. 2006, 13, 1055–1059. [Google Scholar] [CrossRef]
  70. Eddy, J.; Maizels, N. Selection for the G4 DNA motif at the 5′ end of human genes. Mol. Carcinog. 2009, 48, 319–325. [Google Scholar] [CrossRef]
  71. Cahoon, L.A.; Seifert, H.S. An alternative DNA structure is necessary for pilin antigenic variation in Neisseria gonorrhoeae. Science 2009, 325, 764–767. [Google Scholar] [CrossRef]
  72. Gaffke, L.; Kubiak, K.; Cyske, Z.; Wegrzyn, G. Differential Chromosome- and Plasmid-Borne Resistance of Escherichia coli hfq Mutants to High Concentrations of Various Antibiotics. Int. J. Mol. Sci. 2021, 22, 8886. [Google Scholar] [CrossRef] [PubMed]
  73. Butler, T.J.; Estep, K.N.; Sommers, J.A.; Maul, R.W.; Moore, A.Z.; Bandinelli, S.; Cucca, F.; Tuke, M.A.; Wood, A.R.; Bharti, S.K.; et al. Mitochondrial genetic variation is enriched in G-quadruplex regions that stall DNA synthesis in vitro. Hum. Mol. Genet. 2020, 29, 1292–1309. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Mutation rates in BW25113. The results are plotted for the G-rich or C-rich tract comprising the lagging template strand during replication. Black bar, G-rich strand in the lagging template; grey bar, C-rich strand in the lagging template. G-rich/C-rich is the ratio of mutation rates. Images of minimal G4 structures are shown above the data that correspond to the different G4 structures that can form in the G-rich strands (also see Supplemental Table S1). Note that (G3T)8 can form one or two G4 structures, as can dimer repeat inserts.
Figure 1. Mutation rates in BW25113. The results are plotted for the G-rich or C-rich tract comprising the lagging template strand during replication. Black bar, G-rich strand in the lagging template; grey bar, C-rich strand in the lagging template. G-rich/C-rich is the ratio of mutation rates. Images of minimal G4 structures are shown above the data that correspond to the different G4 structures that can form in the G-rich strands (also see Supplemental Table S1). Note that (G3T)8 can form one or two G4 structures, as can dimer repeat inserts.
Genes 14 01720 g001
Figure 2. Pathways to quadruplex formation and deletion. Quadruplex structures may form spontaneously in supercoiled DNA (A) in a displaced strand following R-loop formation (B) in the leading strand ahead upstream of a replication fork (if helicases unwind the strands well ahead of the fork (C) or in a single-stranded lagging template strand of replication (D). In (D), complete deletion likely occurs by replication slippage that involves primer-template misalignment between the two proximal EcoRI restriction sites generated when repeats are cloned into the cat gene. A stable G-quadruplex would be expected to block replication (left), thereby promoting replication slippage (right). This is shown in the context of lagging strand replication, although it could happen during replication of either strand.
Figure 2. Pathways to quadruplex formation and deletion. Quadruplex structures may form spontaneously in supercoiled DNA (A) in a displaced strand following R-loop formation (B) in the leading strand ahead upstream of a replication fork (if helicases unwind the strands well ahead of the fork (C) or in a single-stranded lagging template strand of replication (D). In (D), complete deletion likely occurs by replication slippage that involves primer-template misalignment between the two proximal EcoRI restriction sites generated when repeats are cloned into the cat gene. A stable G-quadruplex would be expected to block replication (left), thereby promoting replication slippage (right). This is shown in the context of lagging strand replication, although it could happen during replication of either strand.
Genes 14 01720 g002
Figure 3. G-quadruplex formation during replication pausing and restart as a source of genetic instability. (A) G3T Leading Template. (A) Leading strand replication pausing during synthesis of the G-rich strand of (G3T)n. (B) After pausing, a G-quadruplex may form in the leading template. Replication may continue on the lagging template. (C) RecG-dependent fork reversal could place a G-quadruplex/i-motif ahead of the replication fork. It is not known if DNA repair machinery would work on the G-quadruplex/i-motif structure prior to replication restart. G-quadruplexes are shown in blue, while i-motifs are shown in green. (D) Replication on the ‘chicken foot’ following primer-template misalignment may also encounter a G-quadruplex, which could lead to deletions in the leading template. After branch migration, a G-quadruplex/i-motif could also form in the lagging strand, although it may not block continued fork progression. Note that an alternative restart pathway involving Holliday junction cleavage is not shown (see Kim et al. [23]). (B) G3T Lagging Template. While similar intermediates can be drawn when the G-rich repeat (G3T)n comprises the lagging template, the opportunity for formation of a G-quadruplex within a single strand is limited to late steps of a Holliday junction cleavage, recombination, and replication restart.
Figure 3. G-quadruplex formation during replication pausing and restart as a source of genetic instability. (A) G3T Leading Template. (A) Leading strand replication pausing during synthesis of the G-rich strand of (G3T)n. (B) After pausing, a G-quadruplex may form in the leading template. Replication may continue on the lagging template. (C) RecG-dependent fork reversal could place a G-quadruplex/i-motif ahead of the replication fork. It is not known if DNA repair machinery would work on the G-quadruplex/i-motif structure prior to replication restart. G-quadruplexes are shown in blue, while i-motifs are shown in green. (D) Replication on the ‘chicken foot’ following primer-template misalignment may also encounter a G-quadruplex, which could lead to deletions in the leading template. After branch migration, a G-quadruplex/i-motif could also form in the lagging strand, although it may not block continued fork progression. Note that an alternative restart pathway involving Holliday junction cleavage is not shown (see Kim et al. [23]). (B) G3T Lagging Template. While similar intermediates can be drawn when the G-rich repeat (G3T)n comprises the lagging template, the opportunity for formation of a G-quadruplex within a single strand is limited to late steps of a Holliday junction cleavage, recombination, and replication restart.
Genes 14 01720 g003
Table 1. Effect of a mutation in dinG on Cmr mutation rates.
Table 1. Effect of a mutation in dinG on Cmr mutation rates.
Mutation Rate
QuadruplexG-rich Strand Lagging TemplateC-rich Strand Lagging Template
wtdinGdinG/wtwtdinGdinG/wt
(G3T)49.93 × 10−93.27 × 10−83.297.32 × 10−92.47 × 10−83.37
(G3T)84.99 × 10−63.18 × 10−56.378.28 × 10−65.94 × 10−57.17
(G4T)45.50 × 10−55.50 × 10−51.006.81 × 10−61.04 × 10−51.53
(G4T)4 D1.10 × 10−54.78 × 10−60.434.27 × 10−54.00 × 10−50.94
RET G42.88 × 10−85.04 × 10−81.757.80 × 10−86.58 × 10−80.84
Table 2. Effect of mutations in genes encoding RecG and RecQ on Cmr mutation rates for the (G3T)8 and (G4T)4-dimer repeats.
Table 2. Effect of mutations in genes encoding RecG and RecQ on Cmr mutation rates for the (G3T)8 and (G4T)4-dimer repeats.
GenotypeMutation RateRatioMutant/wt
G-rich LagC-rich LagG-rich/C-richG-richC-rich
(G3T)8 repeat
wild type1.09 × 10−75.95 × 10−918.3411
recG3.12 × 10−86.25 × 10−95.000.291.05
recQ2.12 × 10−71.39 × 10−815.231.952.34
recG recQ1.17 × 10−76.79 × 10−9173.6310.801.14
(G4T)4-dimer repeat
wild type2.70 × 10−51.87 × 10−614.4111
recG3.86 × 10−51.24 × 10−631.121.430.66
recQ1.56 × 10−52.55 × 10−66.110.581.36
recG recQ1.72 × 10−52.38 × 10−67.230.641.27
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Parekh, V.J.; Węgrzyn, G.; Arluison, V.; Sinden, R.R. Genomic Instability of G-Quadruplex Sequences in Escherichia coli: Roles of DinG, RecG, and RecQ Helicases. Genes 2023, 14, 1720. https://doi.org/10.3390/genes14091720

AMA Style

Parekh VJ, Węgrzyn G, Arluison V, Sinden RR. Genomic Instability of G-Quadruplex Sequences in Escherichia coli: Roles of DinG, RecG, and RecQ Helicases. Genes. 2023; 14(9):1720. https://doi.org/10.3390/genes14091720

Chicago/Turabian Style

Parekh, Virali J., Grzegorz Węgrzyn, Véronique Arluison, and Richard R. Sinden. 2023. "Genomic Instability of G-Quadruplex Sequences in Escherichia coli: Roles of DinG, RecG, and RecQ Helicases" Genes 14, no. 9: 1720. https://doi.org/10.3390/genes14091720

APA Style

Parekh, V. J., Węgrzyn, G., Arluison, V., & Sinden, R. R. (2023). Genomic Instability of G-Quadruplex Sequences in Escherichia coli: Roles of DinG, RecG, and RecQ Helicases. Genes, 14(9), 1720. https://doi.org/10.3390/genes14091720

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop