1. Introduction
In eukaryotic cells, RNA molecules are transcribed from DNA by three major RNA polymerases. RNA polymerase I transcribes most of the ribosomal RNAs. RNA polymerase II (Pol II) transcribes messenger RNAs (mRNAs) and most regulatory noncoding RNAs. RNA polymerase III transcribes transfer RNAs, the U6 small nuclear RNA, and the 5S ribosomal RNA [
1].
Due to its critical role in mRNA biosynthesis, Pol II activity is a major determinant of protein homeostasis. Human malignancies, including cancer, autoimmunity, cardiovascular disease, and neurological disorders, are frequently associated with mutations in Pol II regulatory sequences and in proteins and RNAs that recognize such sequences [
2], which are reflected in abnormal protein levels. For example, heart failure has been related to insufficient Pol II recruitment to metabolic gene promoters, possibly due to transcriptional coactivator dysregulation [
3]. On the other hand, cancer frequently involves the up-regulation of transcription factors and other proteins [
2]. As such, inhibition of the Pol II transcription machinery is a potential anticancer strategy [
4]. Pol II may also be directly affected by diseases. Viral infections and DNA damage can modify the Pol II phosphorylation status and cause subunit depletion, both of which severely perturb Pol II function [
5]. Therefore, understanding the functional mechanism of Pol II is valuable for designing innovative strategies for the therapeutic modulation of its transcriptional activity.
The functional mechanism of Pol II is remarkably complex, in part due to the unusually large number of molecular and supra-molecular interactions involved. Pol II is a protein complex containing 12 subunits [
1]. Its activity requires additional and distinct proteins and protein complexes for the initiation, elongation, and termination of transcription [
1]. Pol II function also depends on specific properties, including biomolecular condensation, which drives the reversible clustering of proximal Pol II complexes into nuclear compartments [
6]. Pol II clustering increases transcription levels, which are usually proportional to cluster lifetime [
7]. Nascent RNA also forms nuclear compartments seeded by noncoding RNA molecules at the transcription sites [
8]. This mechanism promotes rapid local increases in RNA concentration and a competitive adaptation to environmental cues. Nevertheless, it remains poorly understood if, in single compartments, Pol II and nascent RNA are homogeneously distributed or physically segregated. Aberrations in biomolecular condensation processes have been related to human diseases [
9]. As such, testing whether the Pol II and nascent RNA subcellular compartments are physically associated or segregated might uncover Pol II properties whose modulation could affect its transcriptional activity.
One of the strategies for Pol II transcriptional modulation is the binding of catalytically dead CRISPR-associated system 9 (dCas9) to a protein-coding gene [
10,
11]. In
Escherichia coli, dCas9 binding to a gene could physically block mRNA transcription and decrease protein levels [
10,
11]. Although this effect is less efficient in eukaryotic cells [
10], several promising applications have been developed. Multiple bindings of dCas9 to repetitive DNA sequences in human cells blocked the transcription of toxic RNAs related to microsatellite expansion diseases, including Huntington’s disease, myotonic dystrophy types 1 and 2, and others [
12]. In Chinese hamster ovary cells, dCas9 binding to an essential gene increased the mRNA levels and the purification yield of a protein expressed from an adjacent recombinant gene [
13]. In
Saccharomyces cerevisiae, dCas9 binding to DNA modified chromatin condensation and generated novel sites of sense and antisense transcription [
14]. Moreover, dCas9 binding could block not only DNA replication in
E. coli and
S. cerevisiae [
15], but also the repair of DNA double-strand breaks in mouse embryonic stem cells [
16]. These findings demonstrate that dCas9 binding to DNA can modulate not only gene expression but also DNA metabolism and repair, with promising applications in biomedicine and biotechnology. Notably, the use of dCas9 allows all these applications without modifying the genome.
dCas9 binding to DNA requires the hybridization of its attached guide RNA (gRNA) with one of the DNA strands. In both bacterial and human cells, dCas9-mediated transcriptional blocking was more effective when the gRNA hybridized with the coding strand of the gene [
10,
11,
12,
17]. Such a requirement might be exploited to strengthen unidirectional RNA transcription [
17]. Nevertheless, the molecular basis of the decreased dCas9 blocking efficiency upon gRNA hybridization with the template strand remains poorly understood. In vitro protein assays demonstrated that DNA-bound dCas9 is a highly asymmetric roadblock [
18]. Changing the gRNA-hybridized DNA strand rotates dCas9 180 degrees, possibly determining its fate in the event of colliding with Pol II. Consistently, gRNA hybridization with the template strand correlated with dCas9 removal by RNA polymerases from
E. coli [
18] and the T7 phage [
16]. Nevertheless, it remains unclear if gRNA hybridization with the template strand induces the removal of dCas9 and/or Pol II from the gene in human cells.
Here, we studied the transcriptional mechanisms of human Pol II. To determine if the Pol II and nascent RNA compartments are physically associated or segregated, we visualized their supra-molecular arrangements from an array of short identical recombinant genes. To understand the decreased dCas9 transcriptional blocking upon gRNA hybridization with the template strand of a gene, we targeted dCas9 to different gene regions and monitored Pol II distribution throughout the gene body and nascent RNA levels.
3. Discussion
In the present research, we studied two features of human Pol II. One was the extensive compartmentalization of its supra-molecular interactions with genes and nascent RNA. This property could be required for efficient mRNA transcription and may be affected by human disease. The second was the Pol II capacity to maintain transcriptional activity in the presence of DNA-bound dCas9. This capacity demonstrates not only the versatility of Pol II in overcoming a physical obstacle but also the limitation of dCas9 as a tool for inducing transcriptional interference. The potential implications of these findings are discussed below.
We visualized the supra-molecular interactions among an array of ~335 recombinant human β-globin genes, Pol II, and their nascent RNA molecules. We quantified both the longest diameters and intensities of fluorescent signals to search for correlations between signal pairs. While the measurement of the signal diameter is more reproducible among independent microscopy experiments, signal intensity provides a more direct readout of the number of fluorescent molecules bound to the target. Although all genes in the array were identical, they were not uniformly transcribed (
Figure 2B,
Figure 3B and
Figures S2A and S4A). Furthermore, Pol II did not simultaneously overlap with all of them (
Figure 4B and
Figure S5A). The confined localization of Pol II within the gene array implies that Pol II is not equally transcribing all the genes simultaneously. In our work, nascent RNA also formed compartments with extensive overlap with Pol II (
Figure 5B,
Figure 6B and
Figures S6A and S7A). The extensive segregation between Pol II and nascent RNA is striking, considering the relatively short length of each nascent RNA molecule (about 3.7 kbp) and the previously characterized interactions between nascent RNA and the C-terminal domain of Pol II [
26,
27]. Such extensive segregation might result from the high affinity between nascent RNA and the molecules responsible for its processing. Super-resolution microscopy studies also identified clusters of transcribing Pol II and partially overlapping nanoscale domains of nascent RNA [
28]. However, the labeling method used was not specific for short mRNA molecules, as in our work. In our system, as a likely consequence of the segregation between Pol II and nascent RNA, nascent RNA also exhibited a partial overlap with the gene array (
Figure 2B,
Figure 3B, and
Figures S2A and S4A). Consistently, super-resolution microscopy studies also visualized segregation between nascent RNA and nucleosomes [
28].
Similarly to Pol II, RNA polymerase I has also been shown to cluster at nucleolar transcription sites [
29]. Nucleoli exhibit DNA–RNA segregation as a consequence of several partial gene silencing mechanisms, including DNA methylation [
30], histone modifications [
31], chromatin remodeling [
32,
33], and interactions with intergenic RNAs [
34]. Nevertheless, these mechanisms are likely absent from our gene array. We propose that the biomolecular condensation properties of Pol II and nascent RNA could promote DNA–RNA segregation in the absence of partial gene silencing mechanisms.
The formation of chromatin loops during RNA transcription has been extensively documented [
35]. These loops are not exclusive to long DNA regions; they are also found in transcribed sequences as short as 1 kbp [
36]. Each gene copy in our array is about 5.5 kbp long, compatible with the formation of chromatin loops. Transcription from chromatin loops was also observed in RNA polymerase I-transcribed ribosomal RNA in human and mouse cells [
37]. In our gene array, Pol II was likely able to cluster through its biomolecular condensation properties. Pol II clusters could have been formed at chromatin loops, further promoting their pulling out from the gene array to facilitate mRNA transcription (
Figure 10A). The formation of chromatin loops in central regions of the gene array could have resulted in fully overlapped signals between the gene array and Pol II. On the other hand, chromatin loops might have been pulled out from peripheral regions of the gene array, resulting in partially overlapped signals (
Figure 10A). In chromatin loops, Pol II clusters likely promote the physical separation, not only between Pol II and the gene array but also between Pol II and its nascent RNA (
Figure 10B). This is supported by transcriptional activity affecting chromatin mobility independently of its condensation state [
38]. The decreased overlap of Pol II with genes and their nascent RNAs could be a feature of human mRNA transcription. Alterations in the segregation extent between Pol II and genes and/or between Pol II and nascent RNA might affect the rates and fidelity of transcription, with possible implications for human disease.
We have also addressed the transcriptional consequences of dCas9 binding to DNA. The binding of dCas9-GFP to exon 2 of a recombinant β-globin gene (using a gRNA hybridizing the template strand of the gene) was stable over time, both in the absence (
Figure 7B) and presence (
Figure 7C) of high Pol II transcription levels. Minor or negligible recovery of FRAP signals has also been observed in other studies, including dCas9-GFP binding to mouse B2-type short interspersed nuclear elements [
39] and telomeric repeats [
40,
41]. These observations further support long-lasting binding to DNA as a general property of dCas9, with its average residence time between three and four hours [
42]. Stable binding of dCas9 to a gene indicates that it was likely not removed by transcribing Pol II. Moreover, this property potentially enables dCas9 to function as an effective roadblock to Pol II. This hypothesis was tested in a cell line carrying a single copy of a recombinant β-globin gene by ChIP and ddRT-PCR. In comparison with telomeric DNA binding (
Figure 8A), when dCas9 was bound to the first, second, or third exons of the recombinant β-globin gene (
Figure 8C,E,G), there was no significant accumulation of transcriptionally paused Pol II in the promoter/exon 1 region of the gene (
Figure 8D,F,H). In addition, the levels of nascent RNA were increased only upon binding of dCas9-GFP to exon 3 (
Figure 9B). When the gRNA was switched to the coding strand of exon 1 (
Figure 8I), the accumulation of transcriptionally paused Pol II increased (
Figure 8J), but the levels of nascent RNA (
Figure 9D) were not significantly affected. Targeting the gRNA to the coding strand has been correlated with higher efficiency of transcriptional interference in both bacterial and human cells [
10,
11,
12]. Nevertheless, the efficiency of Cas9/dCas9 biotechnological applications usually depends on the gRNA binding sequence within a specific DNA region. This observation might explain the transcriptional interference in our gene system upon gRNA hybridization with the coding strand. Importantly, multiple bindings of dCas9 to repetitive DNA sequences were sufficient to robustly block mRNA transcription in human cells [
12]. In our recombinant gene, the binding of a single dCas9 molecule was not sufficient for the same purpose, indicating that multiple bindings are critical. Furthermore, these findings uncover a limitation of using dCas9 to block transcription of toxic RNA. Our observations indicate that Pol II is able to maintain its transcriptional activity in the presence of DNA-bound dCas9. Here, we assessed the impact of dCas9 binding on Pol II distribution and nascent RNA levels, providing direct measurements of the response at the transcriptional level. We conclude that, in our recombinant gene, dCas9 was unable to block a high flow of transcribing Pol II (induced with 1 µg/mL doxycycline) while remaining bound to its target, with a negligible impact on transcription (
Figure 10C). This observation is consistent with the Pol II capacity for transcribing through nucleosomes with minor histone displacement [
43], in which FRAP has been used to monitor the binding dynamics between DNA and histones [
44].
In addition to the above-described findings, our work also introduces a cell system for studying transcriptional architecture. Our cell line, which contains an array of ~335 identical β-globin gene copies, enabled the fluorescent imaging of a portion of chromatin composed of short identical recombinant genes. Here, we have only visualized its associated Pol II and nascent RNA. Nevertheless, additional factors could be fluorescently labeled and visualized in future studies, including transcription factors, splicing factors, noncoding RNAs, and other molecules associated with mRNA transcription. Moreover, this cell line enabled visualizing the relative positioning of DNA, Pol II, and nascent RNA without the need for super-resolution microscopy. Notably, the simultaneous visualization of the gene array and its MS2-labeled nascent RNA was achieved in living cells. When live-cell visualization is attainable, dynamic processes can be tracked over time. However, a preliminary experiment (
Figure S3) has shown that this cell line reveals detectable mRNA transcription signals in the absence of transcription induction with doxycycline. This significant basal transcription, which contrasts with early observations in a distinct doxycycline-inducible system [
45], limits kinetic studies and the feasibility of visualizing the supra-molecular structures associated with full transcription silencing or shorty after transcription induction, such as differences in signal overlap. Furthermore, the recombinant gene integration sites have not, so far, been experimentally determined in single-copy or multiple-copy cell lines. Additional limitations of our work include the limited spatial resolution of ChIP, which might have contributed to masking minor Pol II blocking events (
Figure 8F). Future steps could include the visualization of the total Pol II levels (including its unphosphorylated and serine 5-phosphorylated forms) by immunofluorescence in the multiple-copy cell line, which could provide a more complete depiction of Pol II interactions with the gene array. Concerning the potential blocking of transcriptional elongation by dCas9, a possible future test could be the simultaneous targeting of the three gene exons by dCas9. This approach should create a more robust barrier to Pol II transcriptional elongation and contribute to better understanding the absence of Pol II blocking observed herein. In addition, it could be useful to address, by sequencing approaches, if nascent RNA was fragmented in the presence of dCas9 binding to the β-globin gene.
In conclusion, we have shown extensive compartmentalization among an array of short identical recombinant genes, Pol II, and nascent RNA. Moreover, Pol II could retain its transcriptional activity in the presence of DNA-bound dCas9. These findings contribute insight into the mechanism of RNA transcription by Pol II, which could be useful for designing therapeutic modulation strategies. These strategies could aim at changing Pol II transcription levels by affecting its compartmentalization or by effectively blocking its activity.
4. Materials and Methods
4.1. Cell Lines
All cell lines in this study were cultured in Dulbecco’s Modified Eagle Medium (DMEM; Gibco, New York, NY, USA) supplemented with 10% fetal bovine serum (Gibco, New York, NY, USA). The cell line carrying a single recombinant human β-globin gene copy was previously generated [
21]. The cell line carrying multiple copies of a recombinant human β-globin gene was generated in this study using genetically modified U2OS cells (Clontech, Mountain View, CA, USA), as described below. Cell lines were authenticated through the determination of the respective recombinant human β-globin gene copy number.
4.2. Cloning of gRNA-Expressing Sequences
For gRNA expression, the PX459 plasmid (Addgene plasmid #62988) was modified with an optimized gRNA design [
23]. Its wild-type Cas9-coding gene was removed by restriction digestion, gel extraction, and religation of the Cas9-free plasmid. For cloning of each gRNA-expressing construct, the modified PX459 plasmid was digested with BbsI (New England Biolabs, Ipswich, MA, USA), dephosphorylated, and gel-purified. DNA oligonucleotides (
Table S1) were phosphorylated, annealed, and ligated into the digested plasmid using T4 DNA ligase (Thermo Fisher Scientific, Waltham, MA, USA). Ligation products were transformed into competent
E. coli DH5α cells and purified using the NZYtech plasmid miniprep kit (NZYtech, Lisbon, Portugal). Plasmids were eluted in 10 mM Tris-HCl pH 8.0 prepared in water deionized in a Milli-Q system (Millipore Corporation, Billerica, MA, USA). Insert ligation was confirmed by Sanger sequencing.
For stable expression of dCas9-GFP and a gRNA in cell lines containing a single recombinant human β-globin gene copy, gRNA sequences recognizing the gene were re-cloned into a lentiviral gRNA vector with optimized design [
23]. gRNAs were PCR-amplified from the modified PX459 vector with the primers listed in
Table S2. Phusion DNA Polymerase (Thermo Fisher Scientific, Waltham, MA, USA) was used, according to the enzyme’s manual. PCR products and the lentiviral gRNA-expressing plasmid were digested with BstXI and XhoI and purified. Digested PCR products were ligated into the lentiviral gRNA vector (which originally contained a telomeric repeat recognition sequence) using T4 DNA ligase (Thermo Fisher Scientific, Waltham, MA, USA). Insert ligation was confirmed by Sanger sequencing.
4.3. Generation of a Human Cell Line Carrying Multiple Copies of a Recombinant Human β-globin Gene
The human β-globin genomic construct used consists of the human gene from cytogenetic band 11p15.4, carrying a deletion of 593 bp on intron 2–3 between RsaI and SspI restriction sites. The construct was previously modified on intron 1 (at the BpiI restriction site) by insertion of an array of 25 ʎN-BoxB binding sites and on intron 2 (at the Psp5II restriction site) by insertion of an array of 24 MS2-coat binding sites. This construct was previously detailed [
21]. A blunted NcoI-Acc65I insert containing the genomic construct, from initiation codon to approximately 1800 bp past the poly(A) site (at B
glII restriction site), was recombined with the blunted Acc65I site of the pTRE-tight vector (Clontech, Mountain View, CA, USA).
The construct described above was transfected into Tet-On U2OS-derived cells (Clontech; Mountain View, CA, USA), which provides a regulated, high-level gene expression system [
46] upon doxycycline induction. These cells have a transgenic insertion of the plasmid pTet-On, which confers resistance to the G418 antibiotic (or neomycin). This gene drives the expression of the rtTA regulatory protein. The construct was linearized by digestion with the PvuI restriction enzyme and further co-transfected with pTK. Hyg plasmid linearized with VspI in a 9:1 proportion. The recombinant cells were selected by culture on growth medium containing 0.6 mg/mL hygromycin (Roche, Basel, Switzerland). Clones were screened by expression of the β-globin construct after induction with 6 mg/mL doxycycline (Clontech, Mountain View, CA, USA) by RT-PCR.
The pCAG-dCas9-GFP plasmid [
22] was further transfected using the Neon electroporation kit (Life Technologies, Waltham, MA, USA). 4 × 10
5 cells were resuspended in 120 µL Neon buffer R, and 5 µg pCAG-dCas9-GFP plasmid was added. Cells were electroporated by applying three 10 ms pulses of 1400 V through a 100 µL Neon tip. Electroporated cells were seeded onto a 15 cm culture plate, and 10 µg/mL blasticidin S (Thermo Fisher Scientific, Waltham, MA, USA) were added after 24 h for the selection of positive clones.
4.4. Stable Integration of dCas9-GFP and gRNA into a Human Cell Line Carrying a Single Copy of a Recombinant Human β-globin Gene
Cells stably expressing dCas9-GFP and a gRNA were generated by lentiviral infection of the cell line carrying a single copy of the recombinant human β-globin gene [
21]. Lentiviral suspensions used for infection expressed dCas9-GFP, the Tet-ON-3G transactivation factor, and a gRNA recognizing telomeric repeats or either human β-globin exon 1, exon 2, or exon 3 in the template strand or exon 1 in the coding strand (
Table S2). The lentiviral gRNA vector also expresses mCherry.
To produce each lentiviral suspension, HEK293T cells were grown in a T75 flask (Thermo Fisher Scientific, Waltham, MA, USA) to 50–70% confluency and transfected with 1 µg pMD2.G, 8 µg pCMV-dR8.91, and 9 µg transfer vector (either dCas9-GFP, Tet-ON-3G, or a lentiviral gRNA plasmid) using Lipofectamine 2000 (Invitrogen, Waltham, MA, USA). On day 2, medium was replaced (12 mL/T75 flask). On day 3, 12 mL lentiviral suspension were collected from each flask and stored at 4 ºC; 12 mL fresh medium were added to each flask. On day 4, a second portion of 12 mL lentiviral suspension was collected from each flask, pooled with the corresponding suspension collected on day 3, filtered through a 0.45 µm filter, and concentrated by ultracentrifugation. Ultracentrifugation was performed at 25,000 rpm for 2 h at 4 °C in a Beckman SW41 rotor. The supernatant was drained, and lentiviral pellets were resuspended in fresh medium by pipetting. Each 24 mL portion of centrifuged lentiviral suspension was resuspended in 2 mL fresh medium.
For the infection of the cell line carrying a single recombinant human β-globin gene copy, cells were seeded on 3.5-cm plates and grown to 50–70% confluency. They were infected with 1 mL dCas9-GFP, 1 mL Tet-ON-3G, and 2 mL gRNA lentiviral suspensions, in the presence of 5 µg/mL polybrene (Sigma-Aldrich, St. Louis, MO, USA). On day 5, 2 µg/mL puromycin (Sigma-Aldrich, St. Louis, MO, USA) was added to select for transduced cells. The antibiotic was kept for 5–7 days, and on day 8 after lentiviral infection, cells exhibiting simultaneous GFP and mCherry fluorescence were sorted by flow cytometry at the Flow Cytometry Facility of Instituto de Medicina Molecular João Lobo Antunes.
4.5. Recombinant β-globin Copy Number Determination by ddPCR
For recombinant β-globin gene copy number determination, genomic DNA was isolated and analyzed by ddPCR. Cells were harvested and resuspended in phosphate-buffered saline (PBS) buffer pH 7.4. Histones were degraded by overnight incubation with 200 µg/mL Proteinase K (Thermo Fisher Scientific, Waltham, MA, USA). Genomic DNA was extracted in a phenol/chloroform/isoamyl alcohol solution (Thermo Fisher Scientific, Waltham, MA, USA), precipitated in ammonium acetate and ethanol, and dissolved in deionized water. Isolated genomic DNA was analyzed by ddPCR assay using the QX200™ Droplet Digital™ PCR System (Bio-Rad, Hercules, CA, USA) and EvaGreen Digital PCR Supermix according to the manufacturer’s instructions. Genomic DNA was analyzed with primer sets that were specific to the stem-loops in the recombinant gene as well as to the endogenous human β-globin only (taking advantage of a partial deletion in the second intron of the recombinant gene, between
RsaI and
SspI restriction sites). As a negative control, genomic DNA from plain U2OS cells was also analyzed. Primers are listed in
Table S3.
Purified genomic DNA was digested overnight with the HaeIII restriction enzyme (New England BioLabs, Ipswitch, MA, USA) before concentration determination in NanoDrop 2000 UV-Vis spectrometer (Thermo Fisher Scientific, Waltham, MA, USA). For ddPCR, 20 μL reaction mixtures were prepared, containing 2 ng of template DNA, 0.5 μL of HaeIII 10 U/μL, 2× ddPCR EvaGreen Supermix (Bio-Rad, Hercules, CA, USA), and 0.1 µM primers forward and reverse. The reaction mixtures were partitioned into a droplet emulsion with the QX200 droplet generator (Bio-Rad, Hercules, CA, USA), then transferred into a 96-well PCR plate, heat sealed, and placed in a Veriti™ Thermal Cycler (Applied Biosystems, Foster City, CA, USA). The thermal cycling program was 95 °C for 10 min; 10 cycles of 94 °C for 40 s and 68 °C for 110 s, with a decrease of 0.5 °C each step; followed by 50 cycles of 94 °C for 40 s and 63 °C for 110 s; then 90 °C for 5 min, and holding at 4 °C. The PCR product was read on the QX200 droplet reader (Bio-Rad, Hercules, CA, USA), and data were analyzed with the QuantaSoft software (Bio-Rad, Hercules, CA, USA). Recombinant β-globin copy number was calculated as described in the legend of
Figure 1.
4.6. Transient Transfection, Immunofluorescence, and RNA FISH
The cell line carrying multiple copies of a recombinant human β-globin gene was grown to ~70% confluency on MatTek (Ashland, MA, USA) #1.5 glass-bottom 35 mm plates (for live-cell microscopy) coated with poly-lysine or on Menzel-Gläser (Braunschweig, Germany) #1.5 glass coverslips (for immunofluorescence and/or RNA FISH).
Cells were transfected using Lipofectamine 2000 (Invitrogen, Waltham, MA, USA) and 600 ng of a gRNA-expressing plasmid and 400 ng of MCP-GFP or MCP-mCherry-expressing plasmid. Recombinant β-globin transcription was induced with 1 µg/mL doxycycline (Clontech, Mountain View, CA, USA) for 16–24 h. For live-cell microscopy, medium was changed to DMEM/F12 (1:1) (Gibco, New York, NY, USA), and imaging was performed.
An optimized Pol II immunofluorescence protocol was followed [
47,
48]. About 24 h after treatment with 1 µg/mL doxycycline, cells were fixed in 125–250 mM HEPES pH 7.6 containing 4% formaldehyde (10 min) and 8% formaldehyde (50 min) at 4ºC, followed by permeabilization in PBS pH 7.4 containing 0.5% Triton X-100 for 30 min at 4ºC with gentle rocking. Blocking was performed in PBS pH 7.4 containing 1% bovine serum albumin, 0.2% bovine skin gelatin, and 0.1% bovine milk casein for 1 h at room temperature. A rabbit anti-Pol II C-terminal domain repeats YSPTSPS phospho S2 primary antibody (IgG polyclonal; Abcam [Cambridge, UK]; catalogue number: ab5095) was incubated in a 1:1000 dilution for 2 h at room temperature in blocking solution, followed by 3x 1 h washes in the same solution. The secondary antibody (1:500–1:2000 dilution) was incubated 1 h at room temperature, and slides were rinsed overnight at 4 °C in blocking solution. Slides were washed 3× 5 min in PBS pH 7.4 0.1% Tween20, followed by 2× 5min in PBS pH 7.4 and mounted in Vectashield mounting medium (Vector Laboratories, Newark, NJ, USA).
For RNA FISH, about 24 h after treatment with 1 µg/mL doxycycline, cells were fixed in PBS pH 7.4 containing 3.7% formaldehyde for 10 min at room temperature and permeabilized in PBS PH 7.4 containing 0.5% Triton X-100 for 10 min at room temperature. They were subsequently washed 3× 5 min in PBS pH 7.4, followed by 2× 5 min in 2× saline-sodium citrate (SSC) buffer pH 7.0, 0.05% Tween20, at room temperature. Samples were blocked in 2× SSC containing 0.05% Tween20, 1% bovine serum albumin, 1 mg/mL tRNA, 2 mM vanadyl ribonucleoside complex (VRC) for 30 min at room temperature. They were hybridized overnight at 30 °C in 2× SSC containing 10% formamide, 25% dextran sulfate, and 10 nM of a FISH probe recognizing human β-globin exon 2. The FISH probe (Sigma-Aldrich, St. Louis, MO, USA) has the 5′ to 3′ sequence GCCCATAACAGCATCAGGAG and was modified with a cyanine 3 (Cy3) fluorophore at both ends of the oligonucleotide. Before hybridization, the probe was denatured for 5 min at 75 °C. After overnight incubation, coverslips were washed 2× 30 min in 2× SSC with 0.05% Tween20 and 10% formamide, followed by 2× 5 min washes in the same buffer without formamide, at 30 °C. Coverslips were further washed in PBS pH 7.4 and fixed again in PBS pH 7.4 containing 3.7% formaldehyde, washed 2× 5 min in PBS pH 7.4, and mounted in Vectashield mounting medium (Vector Laboratories, Newark, NJ, USA).
For immunofluorescence combined with RNA FISH, the FISH protocol described above was followed, with the following changes: the primary antibody was added to the hybridization solution containing probe (after denaturation and cooling down), and the secondary antibody was added during the first post-hybridization washing step, which was extended to 50 min.
4.7. Confocal Microscopy and FRAP
Microscopy experiments were performed at the Bioimaging Unit of Instituto de Medicina Molecular João Lobo Antunes. For simultaneous visualization of recombinant β-globin and nascent transcripts, transfected cells were visualized in a 3i Marianas spinning disk confocal microscope (Intelligent Imaging Innovations, Denver, CO, USA) equipped with a Plan-apochromat 100× 1.40 NA oil objective (Carl Zeiss, Jena, Germany). Images were recorded using an Evolve 512 EMCCD camera (Photometrics, Tucson, AZ, USA). Exposure times were 30–100 ms, and laser power was kept between 10–70%. Intensification was 200. Unless mentioned otherwise, all images shown are single-plan micrographs obtained from z-stack images with a 0.27 µm step size.
3D reconstructions were performed using the IMARIS 7.0 software (Bitplane, Zurich, Switzerland). Z-stack images of two elements among gene array, Pol II and nascent RNA, were tightly cropped around the green and red signals. Using the original micrographs as references, intensity thresholds were manually adjusted in each case in order to minimize background while keeping relative size, shape, and overlap between signals. Reconstructions were manually divided into classes based on the extent of overlap. Micrographs and 3D reconstruction images were color-corrected (red to magenta) using Adobe Photoshop CS3.
The total fluorescence intensities of the gene array, nascent RNA, and Pol II signals were determined using ImageJ 1.53e. A polygon region of interest (ROI) was placed over the fluorescent signal or adjacent nucleoplasmic background. For quantification of Pol II signal intensity, nucleoli (the only nucleoplasmic regions depleted of Pol II) were selected as the background. The integrated (total) fluorescence intensity of the signal ROI, the ROI area, and the mean fluorescence intensity of the background were determined from single-channel, maximum-intensity projection images containing only the slices where each signal was found. The total corrected fluorescence intensity of each signal was calculated as:
For FRAP experiments, cells were seeded and transfected as described above, using 1 μg of gRNA-expressing plasmid. For FRAP of dCas9-GFP bound to the recombinant β-globin operator or nucleoli, cells were co-transfected with dCas9-GFP-expressing plasmid to increase dCas9-GFP levels in the nucleoplasm and nucleoli. A Zeiss 880 point scanning confocal microscope equipped with a 1.46 NA 63× or 1.40 NA 100× oil objective (Carl Zeiss, Oberkochen, Germany) was used. A series of 40 images (frame size, 512 × 512; pixel width, 56 nm; pixel time, 1.03 μs) were acquired at regular intervals with an open pinhole. Intervals were 15 s (for dCas9-GFP bound to recombinant β-globin) or 2 s (for dCas9-GFP bound to nucleoli). Photobleaching was induced at 100% laser transmission on a circular ROI with a diameter of 30 pixels (0.8 μm radius) for 723 ms (20 iterations). For imaging, the transmission was attenuated to 3% of the bleaching intensity.
Fluorescence intensity quantification was carried out automatically for each FRAP time series using a macro created in ImageJ (
https://imagej.net/ij/index.html; accessed on 30 March 2023). Briefly, time-lapse frames were background subtracted and corrected for cell displacement during image acquisition using a rigid body registration algorithm (
http://bigwww.epfl.ch/thevenaz/stackreg/; accessed on 30 March 2023). The average fluorescence I(t) in the bleached region was then measured using the Time Series Analyzer plugin for ImageJ (
http://rsbweb.nih.gov/ij/plugins/time-series.html; accessed on 30 March 2023), taking advantage of the Recenter function to automatically coincide the geometric center of the measuring ROI with the center of mass of the green-labeled transcription sites, thus correcting for any minor intranuclear movement of the gene array. In each time frame, the nuclear region was subsequently identified using k-means clustering (
https://github.com/ij-plugins/ijp-toolkit/wiki/k%E2%80%90means-Clustering; accessed on 30 March 2023), and the average fluorescence in the nucleus T(t) at time t after bleaching was measured to normalize FRAP recovery curves, as previously described [
49].
4.8. ChIP and qPCR Analysis
A previously published protocol [
50] was followed. Cells stably expressing a single copy of a recombinant human β-globin gene, dCas9-GFP, Tet-ON 3G, and a gRNA-recognizing telomeric repeat, β-globin exon 1, exon 2, or exon 3, were grown on 10-cm plates. When confluency reached 50–70%, recombinant human β-globin expression was induced with 1 µg/mL doxycycline. About 24 h later, chromatin elements were cross-linked with 1% formaldehyde (diluted in PBS pH 7.4) for 10 min at room temperature, followed by quenching with 140 mM glycine for 5 min at room temperature. Cells were scraped, harvested in PBS pH 7.4, and resuspended in 700 µL 1% SDS, 50 mM Tris-HCl, 10 mM EDTA, pH 8.0, supplemented with 1/8 tablet/mL Complete, EDTA-free Protease Inhibitor Cocktail (Roche, Basel, Switzerland). After an incubation period of 10–30 min on ice, cell suspensions were sonicated 5× 20 s in Soniprep 150 (Sanyo, Osaka, Japan) operating at 10 micron amplitude. The lysate was cleared by centrifugation, and a 50 µL aliquot was taken for total DNA purification (input).
A total of 8 µL samples of chromatin fragments (either before or after sonication) were analyzed in a 2% agarose gel stained with Midori Green (Nippon Genetics, Tokyo, Japan). 1 kbp Plus DNA Ladder (Thermo Fisher Scientific, Waltham, MA, USA) was used as a molecular mass marker.
For immunoprecipitation, the leftover was 2-fold diluted with dilution buffer (167 mM NaCl, 17 mM Tris-HCl, 1.1% Triton X-100, 1.2 mM EDTA, 0.01% SDS, pH 8.0) and incubated with 20 µL suspension of magnetic DynabeadsTM Protein G (Invitrogen, Waltham, MA, USA) for 1 h at 4 °C with rotation. After Dynabeads harvesting in a magnetic rack, each sample was split in two and further diluted to 1.3 mL with dilution buffer. Dilutions were incubated overnight at 4 °C with rotation in the presence of 3 µg mouse anti-GFP (from IgG1k clones 7.1 and 13.1; Roche [Basel, Switzerland]; catalogue number 11814460001) or 3 µg rabbit anti-Pol II C-terminal domain repeats YSPTSPS phospho S5 (IgG polyclonal; Abcam [Cambridge, UK]; catalog number: ab5131) antibodies. After incubation, 50 µL dynabead suspension were added to each sample and incubated for 3 h at 4 °C with rotation and subjected to four washes. First wash was in 20 mM Tris-HCl pH 8.0, 0.1% SDS, 1% Triton X-100, 2 mM EDTA containing 150 mM NaCl. The second wash was in the same buffer containing 500 mM NaCl. The third wash was in 0.25 mM LiCl, Tris-HCl, pH 8.0, 1% Nonidet P-40, 1% sodium deoxycholate, 1 mM EDTA. The fourth wash was in 10 mM Tris-HCl pH 8.0, 1 mM EDTA. Buffer was completely removed, and Dynabeads boiled for 10 min in a 10% Chelex-100 (Bio-Rad, Hercules, CA, USA) suspension, incubated for 45 min at 55 °C with 20 µg proteinase K (Thermo Fisher Scientific, Waltham, MA, USA), and re-boiled for 10 min. Immunoprecipitated DNA fragments were collected from the supernatant in a total volume of 160 µL.
For total DNA purification, 50 µL sample was added 450 µL dilution buffer and 20 µL 5 M NaCl and incubated overnight at 65 °C. 10 µL EDTA, 20 µL Tris-HCl pH 6.8, 20 µg proteinase K (Thermo Fisher Scientific, Waltham, MA, USA) were added and incubated 90 min at 45 °C. Total DNA was purified by extraction in phenol-chloroform-isoamyl alcohol (Thermo Fisher Scientific, Waltham, MA, USA) and chloroform partitioning, followed by precipitation and dissolution in 50 µL DNase/RNase-free water.
Samples and 5-fold diluted inputs were analyzed by quantitative PCR using 5 µL sample or diluted input and 10 µL of a mixture containing iTaq Universal SYBR Green Supermix (Bio-Rad, Hercules, CA, USA) and the primer pairs listed in
Table S5. Amplification was monitored in real time during 40 cycles using SYBR Green standard protocol. Each ChIP sample (or 5-fold diluted input) was amplified in triplicate, and the corresponding cycle threshold (Ct) was averaged. Then, the averaged Ct of each diluted input was subtracted from the averaged Ct of the corresponding dCas9 and Pol II samples. The ΔCt value of each sample was obtained. The fold-over input of each sample was calculated as 2
^(−ΔCt). Additional ChIP assays were performed, and results were combined.
4.9. Isolation of Nascent RNA and ddRT-PCR
Cells stably expressing dCas9-GFP and a gRNA-recognizing telomeric repeat, β-globin exon 1, exon 2, or exon 3, were cultured in 10-cm plates. Recombinant β-globin expression was induced overnight with 1 µg/mL doxycycline. Cells were harvested, resuspended in culture medium, distributed into two aliquots, centrifuged, and resuspended in PBS pH 7.4. The fractionation procedure was performed at 4 °C. Samples were centrifuged 3 min at 1500× g, and pellets resuspended in 1 mL 10 mM Tris pH 7.4, 10 mM NaCl, 3 mM MgCl2. After centrifuging 3 min at 800× g, pellets were resuspended in 200 µL 10 mM Tris-HCl pH 7.4, 10 mM NaCl, 3 mM MgCl2, 10% glycerol, 0.5% Nonidet P-40, 0.5 mM dithiothreitol (DTT), 100 units/mL RNAse inhibitor (Bioline, Toronto, ON, Canada) and centrifuged 3 min at 4500× g. Pellets were resuspended in 25 µL 20 mM tris-HCl pH 8.0, 75 mM NaCl, 0.5 mM EDTA, 0.85 mM DTT, 0.2125 mM PMSF, 50% glycerol, 100 units/mL RNase inhibitor (Bioline, Toronto, ON, Canada). 225 µL 10 mM HEPES pH 7.6, 300 mM NaCl, 0.2 mM EDTA, 1 mM DTT, 7.5 mM MgCl2, 1 M urea, 1% Nonidet P-40, 100 units/mL RNase inhibitor (Bioline, Toronto, ON, Canada) were added. Samples were centrifuged at 16,000× g for 2 min and resuspended in 200 µL 10 mM Tris-HCl pH 7.5, 500 mM NaCl, 10 mM MgCl2, 100 units/mL DNaseI (Roche, Basel, Switzerland), 100 units/mL RNase inhibitor (Bioline, Toronto, ON, Canada), and incubated 30 min at 37 °C. Chromatin-associated RNA was extracted with PureZOL RNA isolation reagent (Bio-Rad, Hercules, CA, USA), followed by chloroform partitioning, and dissolved in 15 µL RNase-free water.
For ddRT-PCR, One-Step RT-ddPCR Advanced Kit for Probes (Bio-Rad, Hercules, CA, USA) was used. For each condition, technical duplicates were prepared by mixing on ice 10 µL Supermix, 4 µL reverse transcriptase, 2 µL 300 mM DTT, and 4 µL of each primer and probe set. Primers and probes used in ddRT-PCR are listed in
Table S4. 20 µL chromatin-associated RNA diluted to 0.25 ng/µL were added to each mix (final volume: 40 µL), which was split into two aliquots. Droplets were prepared using 20 µL mix and 70 µL Droplet Generation Oil for Probes (Bio-Rad, Hercules, CA, USA). Thermal cycling conditions included reverse transcription (60 min at 50 °C), enzyme activation (10 min at 95 °C), followed by 50 cycles of denaturation (30 s at 95 °C), and annealing/extension (1 min and 40 s at 65 °C). Finally, enzymes were heat-inactivated (10 min at 98 °C). After droplet reading, results were visualized in QuantaSoft 1.7.4.0917 software, where the threshold between negative (void of amplified target) and positive (containing at least one amplified target copy) droplets was manually adjusted. For each reaction, the number of copies/µL for both β-globin and GAPDH was calculated by the QuantaSoft software (Bio-Rad, Hercules, CA, USA). Results from each technical duplicate were combined, and levels of recombinant β-globin nascent RNA were normalized by dividing by the corresponding number of GAPDH RNA copies. Additional ddRT-PCR assays were performed, and results were combined.
4.10. Statistics
Data processing calculations, coefficients of determination, as well as line and pie graphs, were obtained in Microsoft Excel 2016. Scatter plots and statistical analyses were performed in GraphPad Prism 8.0.2 (GraphPad Software, San Diego, CA, USA). In each cell line, the recombinant human β-globin gene copy number was determined as the mean ± standard deviation of the copy numbers obtained using each of the two stem-loop-specific primer pairs (
Figure 1C,D). In all confocal microscopy assays, each experimental measurement corresponds to a single cell or a single 3D reconstruction. In ChIP and ddRT-PCR assays, each measurement corresponds to the cell population from a single cell growth plate. FRAP quantification data were represented as the mean ± standard deviation. All other numerical data were assumed to have a non-Gaussian distribution and represented as the median ± interquartile range. In 3D reconstructions, ChIP, and ddRT-PCR data, statistical significance was assessed using the Mann–Whitney test, in which one-tailed (ChIP) or two-tailed (3D reconstructions and ddRT-PCR)
p-values < 0.05 were considered statistically significant.