Next Article in Journal
Ferroptosis and Senescence: A Systematic Review
Next Article in Special Issue
Involvement of the H3.3 Histone Variant in the Epigenetic Regulation of Gene Expression in the Nervous System, in Both Physiological and Pathological Conditions
Previous Article in Journal
Enzyme Conditioning of Chicken Collagen and Taguchi Design of Experiments Enhancing the Yield and Quality of Prepared Gelatins
Previous Article in Special Issue
Targeting Chromatin-Remodeling Factors in Cancer Cells: Promising Molecules in Cancer Therapy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Unveiling the Machinery behind Chromosome Folding by Polymer Physics Modeling

Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, Complesso Universitario di Monte Sant’Angelo, 80126 Naples, Italy
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2023, 24(4), 3660; https://doi.org/10.3390/ijms24043660
Submission received: 14 January 2023 / Revised: 6 February 2023 / Accepted: 9 February 2023 / Published: 11 February 2023
(This article belongs to the Special Issue Chromatin Architecture: A Flexible Foundation for Gene Expression)

Abstract

:
Understanding the mechanisms underlying the complex 3D architecture of mammalian genomes poses, at a more fundamental level, the problem of how two or multiple genomic sites can establish physical contacts in the nucleus of the cells. Beyond stochastic and fleeting encounters related to the polymeric nature of chromatin, experiments have revealed specific, privileged patterns of interactions that suggest the existence of basic organizing principles of folding. In this review, we focus on two major and recently proposed physical processes of chromatin organization: loop-extrusion and polymer phase-separation, both supported by increasing experimental evidence. We discuss their implementation into polymer physics models, which we test against available single-cell super-resolution imaging data, showing that both mechanisms can cooperate to shape chromatin structure at the single-molecule level. Next, by exploiting the comprehension of the underlying molecular mechanisms, we illustrate how such polymer models can be used as powerful tools to make predictions in silico that can complement experiments in understanding genome folding. To this aim, we focus on recent key applications, such as the prediction of chromatin structure rearrangements upon disease-associated mutations and the identification of the putative chromatin organizing factors that orchestrate the specificity of DNA regulatory contacts genome-wide.

1. Introduction

Mammalian genomes are highly compartmentalized in the nucleus of cells [1,2,3,4,5,6,7], as revealed by independent sequencing-based technologies such as Hi-C [8], GAM [9,10] or SPRITE [11,12], and microscopy approaches [13,14,15,16,17,18,19,20,21,22,23,24]. From the gene up to the nucleus scale, chromosomes form a multilayered architecture of genomic interactions, comprising DNA loops [25,26], megabase-sized topological domains with enhanced levels of self-contacts (e.g., TADs) [27,28,29], higher-order structures (such as meta-TADs [30] and lamina-associated domains [31]), A/B compartments [8] and nuclear territories [32]. Such a complex three-dimensional (3D) organization has been increasingly linked to crucial functional roles, as, for example, DNA regulatory sequences (e.g., enhancers) can control the transcriptional output of distal genes by establishing specific, long-range contacts with their target promoters [1,33,34]. Those interactions are typically favored inside TADs, which are then thought to constrain the spatial communication between functional elements [2,3,35,36]. Disruption of TAD boundaries, indeed, has been shown to affect the network of gene-regulator contacts by altering the 3D organization of the genome, thus resulting in gene misexpression and disease [37,38,39]. On the other hand, recent advances in microscopy-based approaches, such as super-resolution multiplexed FISH techniques, pushed the investigation of those chromatin structures at the single-cell level [13,19,20,23]. Those studies, by allowing a direct visualization of chromatin conformations in individual nuclei, highlighted, for example, the abundance of TAD-like domains in single cells and their broad cell-to-cell structural heterogeneity, thus adding important constraints on chromosome folding beyond population-averaged contacts [13,40]. Additionally, recent live-cell imaging reports are enabling to track in time the dynamic of specific chromatin loci and the transient behavior of DNA functional interactions at the single-molecule level, unveiling the fourth dimension of genome topologies with a resolution of very few seconds [41,42].
However, despite those technological advancements, a clear understanding of the molecular mechanisms responsible for chromatin architectures is still lacking. In the last years, many different computational strategies have been developed to reproduce with high accuracy the complex patterns of chromosomal interactions as detected, e.g., in bulk contact data [43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60], albeit those methods provide only poor mechanistic insights on the processes underlying their origin and formation. To tackle such a major challenge, principled models from polymer physics, investigated within simplified yet rigorous theoretical frameworks, have been extensively used. In particular, two main classes of physics models, which rely on radically different biological scenarios of DNA contact formation, have been increasingly sustained by recent experimental evidence: loop-extrusion (LE) and polymer phase-separation [61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94].
Here, we review the key molecular ingredients of those different mechanisms of folding and describe their implementation into polymer physics models. We discuss their structural predictions at the single-molecule level that we test against recent super-resolution imaging data [13]. We show that both mechanisms are consistent with chromatin structure in single cells and that they can coexist to shape chromosome folding at the scale of TADs and functional interactions [64,65]. Next, we show that those polymer models can be employed to predict in silico the patterns of structural rearrangements upon disease-associated genomic mutations, clarifying how chromatin architecture impacts gene regulation in the case of specific structural variants [78,95]. Finally, we examine the molecular determinants of folding envisaged by the different models and show that, in the emerging picture, a specific combination of genome organizing factors, including but not limited to CTCF and cohesin, links 3D chromatin architecture to 1D chromatin states [95]. Overall, as they provide access to the underlying molecular processes, polymer physics models are shown to be crucial to start unveiling the complex machinery shaping chromatin spatial organization and valuable predictive tools for real-world applications, for example in biomedicine.

2. Results

2.1. Loop-Extrusion and Polymer Phase-Separation as Mechanisms of DNA Contact Formation

In this section, we describe the basic ingredients of the LE and phase-separation models of chromosome folding. By using recent chromatin imaging data from multiplexed FISH experiments [13], we discuss how they perform in capturing the structural properties of specific human genomic loci at both the population-average and single-cell level, showing that they can cooperate simultaneously in single molecules to establish chromatin architecture.
The LE envisages a biological scenario in which the spatial proximity between non-adjacent anchor sites is achieved by the translocation of SMC molecular complexes (e.g., cohesin rings) that extrude DNA loops along the chromatin chain, in a non-equilibrium process that requires energy burning, such as ATP hydrolysis (Figure 1a) [61,62,73,89]. The extrusion process halts in correspondence with specific blocking anchors, i.e., convergently-oriented CTCF binding sites, which mark the boundaries of consecutive genomic regions [61,62]. In different variants of the model, the extruding complexes are subject, for instance, to passive diffusion [91] or pushed, e.g., by transcription-induced supercoiling [84], or their positions along the chain are averaged out in order to build effective equilibrium models consistent with explicit-extruder approaches [96]. Polymer simulations based on LE have been used to explain, for example, the formation and compaction of mitotic chromosomes [97], organization of TADs in interphase [61,62], or the structural effects of CTCF/cohesin degradation at the cell population-average level [73,98]. Furthermore, recent single-molecule imaging experiments, although in simplified conditions, provided direct evidence of a cohesin or condensin-based extrusion activity in vitro [99,100,101,102]. Yet, besides those observations, other reports highlighted the role of additional chromatin architecture factors, including but not limited to CTCF and cohesin [103,104], in shaping chromosome structure. Those studies showed, for instance, that loop-extrusion is not essential for establishing functional (e.g., enhancer–promoter) interactions and controlling gene regulation [105,106,107,108], hinting that chromatin contacts can arise from distinct mechanisms.
A different class of polymer models, exemplified, e.g., by the Strings and Binders (SBS) model [75,77], which relies on phase-separation mechanisms, represents the biological scenario whereby diffusing binders can drive the formation of a chromatin loop by bridging distal cognate DNA binding sites (Figure 1b, distinct types of binding sites are visually represented by different colors). The binders can represent single multivalent proteins, such as Transcription Factors (TFs), or, more generally, aggregates of several different molecules. In this framework, homotypic DNA interactions drive the spontaneous self-assembly of the system into specific globular domains, each associated with the local abundance of cognate binding sites [64,79]. The folding process in this case does not involve external energy input as it is sustained by the thermal bath. Such a mechanism is typically referred to as polymer–polymer (or chromatin-driven) phase-separation, as chromatin-associated proteins can induce the formation of collapsed polymer globules by cross-linking different cognate DNA segments [109]. This class of models can also include DNA binding molecules which have specific multivalent interactions with each other: in this case, the system undergoes a different folding process, known as liquid–liquid phase-separation, in which the interactions among soluble molecules promote the assembly of liquid-like protein droplets around the corresponding binding sites on chromatin [109]. Those nuclear droplets can also be stable in the absence of the polymer scaffold and are consistent with those observed in recent experiments, such as combinations of Pol-II, TFs, nuclear bodies, and coactivators linked to transcriptional control [110,111,112,113,114,115,116]. This scenario can be theoretically investigated, e.g., in SBS-like polymer models by allowing specific binder–binder interactions that could mimic, for example, those observed between many proteins, chromatin modulators, and RNA molecules in the cell [117,118,119]. In a more refined model, DNA-binding proteins could also switch between active and inactive binding states, providing a theoretical system to explore the possible role of a time-dependent switch of chromatin epigenetics on folding [120]. Polymer models that investigate the interplay between the formation of liquid-like protein aggregates and protein-DNA binding in shaping chromatin architecture are extensively discussed, e.g., in [109,121,122]. Notably, these models predict that even when a direct mutual interaction of the binders is not included, the binders can cluster and form phase-separated molecular aggregates under suitable conditions, via a process named bridging-induced attraction, recently observed also by microscopy experiments [82,108,123,124]. Finally, a model with direct polymer-to-polymer attractive interactions (that mimic, e.g., internucleosomal interactions between histone molecules bound to DNA), rather than protein-mediated, reproduces similar behaviors as dictated by polymer thermodynamics [65,125].
Different, and complementary, strategies have been developed to identify the putative binding sites of phase-separation-based polymer models, which mainly differ in using or not a priori knowledge of biological information, such as epigenetic marks and binding molecules. The first strategy, in which the model binding sites are derived, e.g., from epigenetics, is useful to test the molecular hypothesis, albeit by definition it cannot identify novel, additional molecular factors beyond those used to inform the model [60,63,68,93]. In the second approach, the binding domains are inferred with no prior epigenetics information and derived, e.g., from available contact data only. Such a strategy is exemplified, for instance, by the polymer-based recursive statistical computational procedure (PRISMR) [78], which infers the optimal SBS polymer model to best-fit input pairwise contact data (e.g., bulk Hi-C or GAM) [95,126]. Supported by increasing experimental evidence that traces back to phase-separation mechanisms [108,110,112], those models provided, overall, a quantitative explanation of contact formation across genomic scales, from the sub-TAD scale up to nuclear compartments [67,71,73,77,79,81,93,94], and also at the single-cell level [64]. To set a reference within such a broad spectrum of models, we consider hereafter the SBS as a representative phase-separation-based polymer model.
The technical implementation of those distinct folding mechanisms (loop-extrusion and phase-separation) into polymer models is broadly discussed in specialized papers (see, e.g., [61,64,73,79,127]) and for brevity will not be addressed here. Briefly, in those models, chromatin is represented as a coarse-grained polymer chain of consecutive, non-overlapping beads subject to classic physical potentials (i.e., FENE or harmonic and properly truncated Lennard-Jones functions) and regulated by a stochastic Langevin dynamics investigated by massive Molecular Dynamics (MD) simulations in the steady-state [128,129,130]. In the case of the LE model, the extruders stochastically bind to the polymer binding sites and translocate on them until they encounter a convergently-oriented pair of blocking sites or another extruder, or they stochastically dissociate from the chain. In the SBS model, the binders, initially randomly located in the system, establish homotypic attractive interactions with their cognate sites, hence driving a phase transition of the polymer into an equilibrium phase-separated state in which the chain is partitioned into specific and spatially segregated globular domains. MD simulations are typically performed in cubic boxes with periodic boundary conditions and optimized by using a high-performance toolkit, such as OpenMM [131], LAMMPS [132], or HOOMD [133].

2.1.1. The Polymer Models Are Consistent with the Structure of Chromatin at the Population-Average Level

To illustrate the predictive power of the models in explaining chromatin conformations, we consider as a case study a 2.5 Mb wide genomic region (Chr21: 34.6–37.1 Mb, hg38) in a human colon cancer cell line (HCT116) where single-cell microscopy data are available [13] (Figure 2, top panel).
In the LE model of the locus (Figure 2, middle), the blocking anchor sites correspond to the genomic locations of CTCF ChIP-seq peaks available in the studied chromatin region from the ENCODE database [134]. Furthermore, to consider the epigenetic diversity of single-cell landscapes, those anchor sites are present in each single-polymer molecule with a specific probability value (roughly in the range of 20–80%), chosen to best match the average (e.g., Hi-C) contact data of the locus [65,81]. Similarly, since a principled approach to parameterize the model is missing [73], other LE parameters, e.g., the processivity of the cohesin rings along the chain and their number, are selected upon a systematic parameter sweep suited to best reproduce average contact or distance data [65,73]. In the studied locus, the processivity, that is the extrusion velocity divided by the extruder dissociation rate, is 700 kb, while 10 is the number of extruding factors [65]. We point out, however, that fresh data from live-cell imaging technologies could help in a more precise and quantitative calibration of the LE dynamic parameters, as also discussed in recent studies [41,42].
On the other hand, the SBS model of the locus (Figure 2, bottom) has four distinct types of binding domains (visually represented by different colors), which are associated with the major TAD-like domains visible in the experimental median distance map of the locus. Each type of binding site (i.e., each color) of the model has been shown to correlate significantly with a specific, distinct combination of epigenetic factors (e.g., H3K4me3, H3K27ac, H3K27me3) or architectural proteins (such as CTCF/Cohesin and Pol-II) [64], hinting that a combinatorial action of different molecular factors shapes the polymer phase-separation of the studied chromatin region [64,95]. The energy affinities between the binders and the binding sites of the SBS chain are set in the weak biochemical energy scale (i.e., 1–10 KBT, KB is the Boltzmann constant and T the temperature), while binder concentrations are in the range of tens nmol/l (i.e., 50–500 nmol/L) [64,79]. Within such a robust range of energy/concentration parameters, the SBS chain undergoes a thermodynamic phase transition from an initial coil (i.e., randomly folded) conformation to an equilibrium globule phase-separated state (where the polymer self-assembles into segregated micro-globular structures each enriched of cognate binding sites) [64].
As a first benchmark of the LE and SBS models, their median spatial distance matrix (i.e., the median of the distance maps of each single-polymer conformation) has been compared against the corresponding locus map from imaging data [65]. It was found that both models faithfully recapitulate the ensemble-average conformation of the HCT116 locus, as they capture its main TAD structures and loop interactions (the latter particularly enriched upstream, e.g., in correspondence of the RUNX1 and SETD4 genes, Figure 2). The high degree of similarity between the model and experimental distance matrices is quantified, e.g., by their high distance-corrected Pearson correlation coefficients [78]: r’ = 0.87 for LE and r’ = 0.84 for SBS (Figure 2). As a further check of the significance of the results, a null control model, for instance, a self-avoiding chain with no mechanism in place, has been shown to return only poorer r’ correlations, i.e., roughly 7 times lower [135]. Similar findings on the performance of the models have been also reported in other genomic loci and in different cell lines, e.g., in human fibroblast cells where super-resolution microscopy data are available [13,19,64,65].
Taken together, those results highlight that active processes, such as DNA loop-extrusion, and passive mechanisms, such as polymer phase-separation, both work well in recapitulating the structural properties of chromatin architecture, e.g., TADs and loops, at the cell population level.

2.1.2. Loop-Extrusion and Polymer Phase-Separation Cooperate in Determining Heterogeneity of Chromatin Architectures in Single Cells

Next, we aim to discuss the loop-extrusion and phase-separation scenarios of folding at the single-cell level. To this aim, the statistical ensemble of single-polymer conformations predicted by the LE and SBS models [65] was compared against the imaged single chromatin structures of the locus in HCT116 cells (Figure 3a) [13]. To address whether the different models do provide a statistically significant, bonafide, representation of the microscopy conformations, the root-mean-square deviation (RMSD) criterion was used to associate each experimental 3D structure to a corresponding best-match single-polymer conformation of the models [65,136]. In brief, by performing a roto-translational alignment of model and microscopy structures to optimize the RMSD of their spatial coordinates, the criterion identifies, for each imaged single-cell structure, the corresponding model conformation with the least RMSD [65,137]. A visual example of the RMSD best-match procedure is reported in Figure 3b (top panel): a considered microscopy structure (cell 75 is shown as an example) has a single-cell distance matrix with two, spatially separated, TAD-like domains, reflected in two main segregated globules in 3D space; such a specific topology is steadily found in the corresponding best-matches predicted, respectively, by the LE and SBS models (Figure 3b, top panel), highlighting, at least visually, the consistency of the method. To properly assess the significance of the RMSD procedure, the distribution of RMSD values between random pairs of microscopy conformations was set as a control [64]. It was found that the RMSD distribution of the best-matching experiment–model pairs only poorly overlaps the control (two-sided Mann–Whitney p-value < 10−3, Figure 3b, middle), with more than 90% of the LE and SBS best matches falling below 10% of control (pie charts in Figure 3b, bottom). Similar results are found by taking control polymer models without LE- or SBS-based mechanisms (e.g., chains with only self-avoidance effects) [138].
Overall, the structural comparison via the RMSD criterion shows that the single-molecule conformations predicted by loop-extrusion and polymer phase-separation are statistically consistent with chromatin imaged structures at the single-cell level [65].
Finally, in order to understand whether the different models could provide a theoretical framework to explain the origin of the microscopy-reported cell-to-cell variability of the locus [13], the level of structural heterogeneity in imaging data has been quantified by computing the distribution of r’ correlations between all pairs of experimental single-cell distance matrices [64,139]. Consistent with the substantial variability of the observed single-locus conformations [13], the experimental distribution has a broad variance (Var(r’) = 0.20, Figure 3c, blue distribution) that is overall well reproduced by the LE and SBS models (respectively, in orange and green). However, the non-zero average value of the experimental distribution (r’ = 0.27, vertical dashed line in Figure 3c) signals that, albeit broadly varying, the imaged conformations have a residual structural correlation, which is, respectively, under- and over-estimated by models relying solely on loop-extrusion or phase-separation [65] (Figure 3c). Those small discrepancies could be related to the different nature of the underlying LE or SBS processes, as, for instance, the first tends to favor the formation of strong, yet more fleeting (i.e., less correlated) loop contacts, whereas the second tends to establish spatially segregated, stable (i.e., more correlated) globular domains. Consistent with such a picture, a polymer model that combines simultaneously both mechanisms at the single-molecule level (LE + SBS, Figure 3c) returns an average value consistent with the data, indicating that loop-extrusion and phase-separation can indeed coexist to establish chromatin architecture and shape its full range of heterogenous structural realizations in single cells [65].
Summarizing, those recent studies suggest that the observed cell-to-cell variability of chromatin conformations is consistent with a more complex scenario of folding in which loop-extrusion and polymer phase-separation cooperatively act at the single-molecule level to shape genome structure. Furthermore, as different phase-separation mechanisms may take place in the nucleus, an interesting perspective would be to dissect their specific roles in chromatin folding, for instance by considering more sophisticated models with a prominent liquid phase along with a chromatin-driven phase and active LE. The exact interplay and dynamics of this cooperation are yet to be comprehensively explored and could be one of the grand challenges of the field for both experimental and theoretical tests.

2.2. Further Validation and Key Applications of Chromatin Polymer Models

A central role for validated models from polymer physics is their ability to make accurate predictions on chromatin architecture that can be tested against independent experiments.
In this respect, applications of polymer models have been crucial, for example, in understanding the role of chromatin architecture in human genetic diseases. Recent studies have shown, in particular, that the SBS model can accurately predict the effects on chromatin 3D structure of disease-associated genomic structural variants (SVs), such as deletions, inversions, and duplications, thus helping to understand their contribution in driving disease [78,95,140,141,142,143]. To illustrate this point, in the following subsection, we focus on recent applications at a crucial chromatin locus, Sox9, linked to embryonic development [95]. On the other hand, while the phase-separation-based SBS model can well explain the effects of large mutations such as SVs, the LE model has been shown to be particularly suited to predict the effects of much smaller genomic rearrangements, such as single CTCF sites mutations [62]. Interestingly, however, an SBS polymer model including explicit interactions between preferential CTCF sites, as in the LE, has been shown to slightly improve predictions about the effects of SVs, indicating that the cooperation of the two mechanisms, consistent with the findings discussed above, can better explain chromatin folding also in disease [78].
Another relevant aspect of chromatin polymer models is that they are helping to spread light on the molecular factors that organize chromatin folding across chromosomal scales. Indeed, different studies based on polymer modeling and supported by experiments have revealed that the formation of different chromatin sub-compartments and specific folding patterns are driven, e.g., by homotypic interactions between chromatin sites sharing similar chromatin occupancy features, including histone modifications, active and poised gene promoter states, and TF binding sites [64,67,68,81,93,95]. To further elaborate on this point, we focus in the last subsection on a recent study that, by employing the SBS model, identifies the key putative chromatin organizing factors that are involved in the specificity of DNA regulatory contacts genome-wide [95].

2.2.1. Prediction of Genomic Mutations Effects

To exemplify the ability of the SBS model to predict the effects of disease-associated mutations on chromosome conformation, we take here as a case study the developmental-related Sox9 locus, in which different and partially overlapping SVs were shown to produce different phenotypes [144]. The SBS model specific to the locus has been built based on wild-type (WT) cHi-C data in mouse E12.5 limb buds (Figure 4a) by using the PRISMR approach [78,95], which infers the SBS model that best describes the locus contact matrix, i.e., the minimal number and positioning of the binding site types in order to obtain an ensemble of polymer 3D structures consistent with input data. The SVs are implemented in silico on the WT SBS model and their corresponding contact matrices, derived from polymer physics with no fitting parameters whatsoever, are compared to independent cHi-C data from cells that carry those precise mutations [95].
As exemplificative cases, in Figure 4b we show the result of a 0.4 Mb duplication (Dup-S) in the non-coding DNA region within the Sox9 gene TAD (intra-TAD duplication) associated with female-to-male sex reversal in humans, whereas in Figure 4c we illustrate the result for a 1.6 Mb duplication (Dup-L) encompassing the neighboring TAD boundary (inter-TAD duplication) with no phenotypic effects. The Pearson and distance-corrected Pearson coefficients between the model-predicted and cHi-C contact matrices reflect their good degree of similarity being, respectively, r = 0.95 and r’ = 0.76 in Dup-S and r = 0.92 and r’ = 0.63 in Dup-L. Specifically, and consistent with experimental data, the model predicts that while the intra-TAD duplication Dup-S does not affect the overall TAD structure, the inter-TAD duplication Dup-L produces the formation of a separate chromatin domain termed “neo-TAD” [144]. However, the neo-TAD results are spatially insulated from its neighboring regions thus preventing ectopic interactions between Sox9 and its regulatory sequence, explaining the lack of pathogenicity in humans carrying this type of mutation.
The polymer model of the Sox9 locus allows dissecting of the interactions of the duplicated from the original DNA sequence, information not accessible through only Hi-C data [95]. Importantly, while DupS is fully included within the TAD encompassing Sox9, in DupL the duplicated region spans two TADs (Figure 4b,c). Since a TAD, in the model, derives from the presence of a dominant type of binding sites in that region (see, for example, Figure 2, bottom), the duplicated and original sequence in DupS share many homologous binding sites producing an excess of interactions localized around the mutated region, without altering the overall contact pattern. The 3D conformation of the Sox9 DupS locus (Figure 4b) shows indeed that the duplicated region (colored in red in the figure) remains well embedded into the original sequence (in brown). On the other hand, in the inter-TAD duplication DupL, since different prevailing binding sites are present, the portion of the duplication within the Sox9 TAD interacts with itself more than with its neighboring regions, remaining partially isolated from the rest of the locus. The snapshot of the 3D structure of DupL locus shows, indeed, that the region corresponding to the neo-TAD (in red) is spatially isolated.
Summarizing, in addition to demonstrating that the SBS model can be successfully used to predict the effect of disease-associated mutations on genome 3D structure, those results clarify how mutations differently impact chromatin architecture and gene regulation, resulting in distinct phenotypes.

2.2.2. A Combinatorial Epigenetic Code Linking 1D and 3D Chromatin Organization

The genomic size of the chromatin regions discussed so far is in the range of a few millions of base pairs. However, the folding machinery also acts at much larger scales, up to the size of whole chromosomes [7,30]. Chromosome-wide polymer models are then fundamental to test the validity of the underlying physical mechanisms. Here, we show that the SBS model can be successfully employed genome-wide and discuss how that allows to both identify the location and combination of the putative binding sites underlying chromatin contacts and to derive a first characterization of their molecular features.
The ability of the SBS model in describing the folding at larger genomic length scales has been tested, e.g., by using high resolution (5 kb) in situ Hi-C data in the human lymphoblastoid cell line GM12878 [25]. For each chromosome, PRISMR [78,95] has been used to infer the SBS model that best describes its corresponding contact matrix. Importantly, PRISMR exploits only Hi-C data as input, with no prior knowledge of binding factors. As an example, Figure 5a shows the results of the PRISMR procedure for chromosome 20, where experimental and model contact patterns show a quantitative degree of similarity (r’ = 0.85). The position and abundance of the different types of binding sites (binding domains) are also shown (Figure 5a, middle panel). Interestingly, the arrangement of the binding site types along the polymer chain is highly non-trivial: rather than being restricted to small, contiguous genomic regions (as found, e.g., in common and much simpler block-copolymer models), the binding domains do overlap with each other and are spread across several mega-bases, hence capturing contacts occurring up to the chromosomal scale [95]. These results indicate that the basic molecular ingredients considered by the model are sufficient to explain contact patterns across genomic scales.
Since the model uses as input only the GM12878 Hi-C contact matrices, its inferred binding sites have been characterized a posteriori, by crossing their position along the genome with independent biological information. In particular, five key histone marks available from the ENCODE database in the same cell line [134] have been considered and the correlation between their genomic signals and the genomic location of the model binding domains computed, thus obtaining for each domain its epigenetic signature. By grouping the domain histone profiles across chromosomes via hierarchical clustering algorithms, it has been discovered that there are 9 statistically different groups (or epigenetic classes) [95]. Figure 5b shows the average histone profile per class. Interestingly, each class correlates with a specific combination of different epigenetic factors rather than with a single one. For instance, although there are three classes that strongly correlate with active chromatin marks (Active 1, Active 2, and Active 3 in Figure 5b), class 1 is enriched for only active marks, while classes 2 and 3 are both enriched in H3K9me3 (also found in heterochromatin) and class 3, in particular, shows a stronger correlation with H3K4me1, a histone mark associated especially with active enhancer regions [145]. That suggests that the binding domains corresponding to the first three classes produce contacts between transcribed and regulatory regions. Importantly, the first six classes correlate with CTCF (Figure 5b), confirming its significance in regulating chromatin architecture and gene activity [146], but also indicating that its role can be modulated by different sets of histone marks and molecular factors as discussed above.
Summarizing, the binding domains inferred genome-wide have a specific barcode of histone marks and fall in epigenetic classes that match well the chromatin states found by previous segmentation studies [14,145,147,148,149]. However, while the latter are, by definition, non-overlapping segments, the binding domains do overlap with each other along the genome, providing each DNA window with a distinctive set of binding site types that can be read as a combinatorial code linking 1D and 3D chromatin organization [95].

3. Discussion

In this work, we discussed principled models of polymer physics to explore the complex machinery underlying chromosome spatial organization. To this aim, we reviewed two major classes of models, i.e., loop-extrusion and polymer phase-separation, that envisage distinct biological pictures of folding. In the first class, chromatin structures, e.g., loops and TADs, are established by molecular motors (such as SMC proteins) that extrude DNA loops in an off-equilibrium, cell-energy-consuming, process [61,73,89]. In the second, those topological structures are spontaneously shaped by molecular diffusing binders (corresponding to specific combinations of chromatin organizing factors, such as CTCF/cohesin, histone marks, TFs, active/poised Pol-II) that bridge distal DNA cognate sites via equilibrium polymer thermodynamics mechanisms [64,77,79]. The phase behaviors described in such models are typical of complex systems of physics [150,151,152,153,154]. In the emerging scenario, consistent with single-cell imaging data, chromatin contacts at the megabase-scale arise from the interplay of those distinct (respectively, active and passive) physical processes, which can coexist and act simultaneously at the single-molecule level [65].
The models we considered, albeit simplified, enable quantitative predictions on real chromatin structures that can be tested against independent experiments. For example, within the framework of the SBS (i.e., phase-separation-based) model, we discussed how specific genomic mutations at key chromatin loci (such as the Sox9 locus) perturb 3D chromatin structure, producing ectopic contacts consistent with real data (e.g., via specific enhancer-hijacking events) that result in distinct phenotypes [95]. Importantly, such predictions can help the interpretation of human genetic variants in driving severe diseases, such as congenital disorders and cancers, which is currently a challenging question in many clinical cases. Recently, and interestingly, those validated models have been also shown, in a reverse approach, to serve as ground truth to benchmark independent experiments, for instance by assessing advantages and limitations of technologies for the determination of chromosomal structure [138,155], therefore expanding the spectrum of their useful applications in silico.
Nevertheless, the search for the different mechanisms controlling chromosome architecture is still open and debated, and many long-standing questions of DNA biology, such as the molecular rules shaping enhancer–promoter communication, are only poorly addressed from first principles. In this respect, a few models of long-range transcriptional control have been recently proposed, invoking, e.g., longer-lived promoter states [36], “activity-by-contact” models relying on chromatin state measurements [156], or TF-grounded-activity models based on local 3D gradients of chemical signals [157], yet a comprehensive understanding at the molecular level is yet far from being reached. For that reason, validated theories from polymer physics that rely on robust organizing principles can be crucial to elucidate the relationship between genome architecture and function, which we expect to be among the most critical challenges of the field in the nearest future.

Author Contributions

Conceptualization, M.C., A.E., F.V., A.A. and S.B.; writing—original draft preparation, M.C., A.E. and S.B.; visualization, M.C. and A.E.; supervision, S.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Acknowledgments

S.B. acknowledges support from the CINECA ISCRA grant ID HP10CCZ4KN. We acknowledge computer resources from INFN, CINECA, ENEA CRESCO/ENEAGRID [158] and Scope/ReCAS/Ibisco at the University of Naples.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Dekker, J.; Mirny, L. The 3D Genome as Moderator of Chromosomal Communication. Cell 2016, 164, 1110–1121. [Google Scholar] [CrossRef]
  2. Dixon, J.R.; Gorkin, D.U.; Ren, B. Chromatin Domains: The Unit of Chromosome Organization. Mol. Cell 2016, 62, 668–680. [Google Scholar] [CrossRef] [PubMed]
  3. Bickmore, W.A.; Van Steensel, B. Genome Architecture: Domain Organization of Interphase Chromosomes. Cell 2013, 152, 1270–1284. [Google Scholar] [CrossRef] [PubMed]
  4. Kempfer, R.; Pombo, A. Methods for Mapping 3D Chromosome Architecture. Nat. Rev. Genet. 2020, 21, 207–226. [Google Scholar] [CrossRef] [PubMed]
  5. Sexton, T.; Cavalli, G. The Role of Chromosome Domains in Shaping the Functional Genome. Cell 2015, 160, 1049–1059. [Google Scholar] [CrossRef] [PubMed]
  6. Misteli, T. The Self-Organizing Genome: Principles of Genome Architecture and Function. Cell 2020, 183, 28–45. [Google Scholar] [CrossRef] [PubMed]
  7. Bonev, B.; Cavalli, G. Organization and Function of the 3D Genome. Nat. Rev. Genet. 2016, 17, 661–678. [Google Scholar] [CrossRef] [PubMed]
  8. Lieberman-Aiden, E.; Van Berkum, N.L.; Williams, L.; Imakaev, M.; Ragoczy, T.; Telling, A.; Amit, I.; Lajoie, B.R.; Sabo, P.J.; Dorschner, M.O.; et al. Comprehensive Mapping of Long-Range Interactions Reveals Folding Principles of the Human Genome. Science 2009, 26, 289–293. [Google Scholar] [CrossRef] [PubMed]
  9. Beagrie, R.A.; Scialdone, A.; Schueler, M.; Kraemer, D.C.A.; Chotalia, M.; Xie, S.Q.; Barbieri, M.; De Santiago, I.; Lavitas, L.M.; Branco, M.R.; et al. Complex Multi-Enhancer Contacts Captured by Genome Architecture Mapping. Nature 2017, 543, 519–524. [Google Scholar] [CrossRef] [PubMed]
  10. Winick-Ng, W.; Kukalev, A.; Harabula, I.; Zea-Redondo, L.; Szabó, D.; Meijer, M.; Serebreni, L.; Zhang, Y.; Bianco, S.; Chiariello, A.M.; et al. Cell-Type Specialization Is Encoded by Specific Chromatin Topologies. Nature 2021, 599, 684–691. [Google Scholar] [CrossRef]
  11. Quinodoz, S.A.; Ollikainen, N.; Tabak, B.; Palla, A.; Schmidt, J.M.; Detmar, E.; Lai, M.M.; Shishkin, A.A.; Bhat, P.; Takei, Y.; et al. Higher-Order Inter-Chromosomal Hubs Shape 3D Genome Organization in the Nucleus. Cell 2018, 174, 744–757.e24. [Google Scholar] [CrossRef]
  12. Arrastia, M.V.; Jachowicz, J.W.; Ollikainen, N.; Curtis, M.S.; Lai, C.; Quinodoz, S.A.; Selck, D.A.; Ismagilov, R.F.; Guttman, M. Single-Cell Measurement of Higher-Order 3D Genome Organization with ScSPRITE. Nat. Biotechnol. 2021, 40, 64–73. [Google Scholar] [CrossRef] [PubMed]
  13. Bintu, B.; Mateo, L.J.; Su, J.-H.; Sinnott-Armstrong, N.A.; Parker, M.; Kinrot, S.; Yamaya, K.; Boettiger, A.N.; Zhuang, X. Super-Resolution Chromatin Tracing Reveals Domains and Cooperative Interactions in Single Cells. Science 2018, 362, eaau1783. [Google Scholar] [CrossRef]
  14. Boettiger, A.N.; Bintu, B.; Moffitt, J.R.; Wang, S.; Beliveau, B.J.; Fudenberg, G.; Imakaev, M.; Mirny, L.A.; Wu, C.T.; Zhuang, X. Super-Resolution Imaging Reveals Distinct Chromatin Folding for Different Epigenetic States. Nature 2016, 529, 418–422. [Google Scholar] [CrossRef]
  15. Szabo, Q.; Donjon, A.; Jerković, I.; Papadopoulos, G.L.; Cheutin, T.; Bonev, B.; Nora, E.P.; Bruneau, B.G.; Bantignies, F.; Cavalli, G. Regulation of Single-Cell Genome Organization into TADs and Chromatin Nanodomains. Nat. Genet. 2020, 52, 1151–1157. [Google Scholar] [CrossRef]
  16. Nguyen, H.Q.; Chattoraj, S.; Castillo, D.; Nguyen, S.C.; Nir, G.; Lioutas, A.; Hershberg, E.A.; Martins, N.M.C.; Reginato, P.L.; Hannan, M.; et al. 3D Mapping and Accelerated Super-Resolution Imaging of the Human Genome Using in Situ Sequencing. Nat. Methods 2020, 17, 822–832. [Google Scholar] [CrossRef]
  17. Wang, S.; Su, J.H.; Beliveau, B.J.; Bintu, B.; Moffitt, J.R.; Wu, C.T.; Zhuang, X. Spatial Organization of Chromatin Domains and Compartments in Single Chromosomes. Science 2016, 353, 598–602. [Google Scholar] [CrossRef] [PubMed]
  18. Finn, E.H.; Pegoraro, G.; Brandão, H.B.; Valton, A.L.; Oomen, M.E.; Dekker, J.; Mirny, L.; Misteli, T. Extensive Heterogeneity and Intrinsic Variation in Spatial Genome Organization. Cell 2019, 176, P1502–P1515.E10. [Google Scholar] [CrossRef] [PubMed]
  19. Takei, Y.; Zheng, S.; Yun, J.; Shah, S.; Pierson, N.; White, J.; Schindler, S.; Tischbirek, C.H.; Yuan, G.C.; Cai, L. Single-Cell Nuclear Architecture across Cell Types in the Mouse Brain. Science 2021, 374, 586–594. [Google Scholar] [CrossRef]
  20. Takei, Y.; Yun, J.; Zheng, S.; Ollikainen, N.; Pierson, N.; White, J.; Shah, S.; Thomassie, J.; Suo, S.; Eng, C.H.L.; et al. Integrated Spatial Genomics Reveals Global Architecture of Single Nuclei. Nature 2021, 590, 344–350. [Google Scholar] [CrossRef]
  21. Mateo, L.J.; Sinnott-Armstrong, N.; Boettiger, A.N. Tracing DNA Paths and RNA Profiles in Cultured Cells and Tissues with ORCA. Nat. Protoc. 2021, 16, 1647–1713. [Google Scholar] [CrossRef] [PubMed]
  22. Cardozo Gizzi, A.M.; Cattoni, D.I.; Fiche, J.B.; Espinola, S.M.; Gurgo, J.; Messina, O.; Houbron, C.; Ogiyama, Y.; Papadopoulos, G.L.; Cavalli, G.; et al. Microscopy-Based Chromosome Conformation Capture Enables Simultaneous Visualization of Genome Organization and Transcription in Intact Organisms. Mol. Cell 2019, 74, 212–222.e5. [Google Scholar] [CrossRef] [PubMed]
  23. Su, J.H.; Zheng, P.; Kinrot, S.S.; Bintu, B.; Zhuang, X. Genome-Scale Imaging of the 3D Organization and Transcriptional Activity of Chromatin. Cell 2020, 182, 1641–1659.e26. [Google Scholar] [CrossRef] [PubMed]
  24. Mateo, L.J.; Murphy, S.E.; Hafner, A.; Cinquini, I.S.; Walker, C.A.; Boettiger, A.N. Visualizing DNA Folding and RNA in Embryos at Single-Cell Resolution. Nature 2019, 568, 49–54. [Google Scholar] [CrossRef]
  25. Rao, S.S.P.; Huntley, M.H.; Durand, N.C.; Stamenova, E.K.; Bochkov, I.D.; Robinson, J.T.; Sanborn, A.L.; Machol, I.; Omer, A.D.; Lander, E.S.; et al. A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping. Cell 2014, 159, 1665–1680. [Google Scholar] [CrossRef]
  26. Rao, S.S.P.; Huang, S.C.; Glenn St Hilaire, B.; Engreitz, J.M.; Perez, E.M.; Kieffer-Kwon, K.R.; Sanborn, A.L.; Johnstone, S.E.; Bascom, G.D.; Bochkov, I.D.; et al. Cohesin Loss Eliminates All Loop Domains. Cell 2017, 171, 305–320.e24. [Google Scholar] [CrossRef]
  27. Dixon, J.R.; Selvaraj, S.; Yue, F.; Kim, A.; Li, Y.; Shen, Y.; Hu, M.; Liu, J.S.; Ren, B. Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions. Nature 2012, 485, 376–380. [Google Scholar] [CrossRef]
  28. Nora, E.P.; Lajoie, B.R.; Schulz, E.G.; Giorgetti, L.; Okamoto, I.; Servant, N.; Piolot, T.; Van Berkum, N.L.; Meisig, J.; Sedat, J.; et al. Spatial Partitioning of the Regulatory Landscape of the X-Inactivation Centre. Nature 2012, 5, 381–385. [Google Scholar] [CrossRef] [PubMed]
  29. Szabo, Q.; Bantignies, F.; Cavalli, G. Principles of Genome Folding into Topologically Associating Domains. Sci. Adv. 2019, 5, eaaw1668. [Google Scholar] [CrossRef]
  30. Fraser, J.; Ferrai, C.; Chiariello, A.M.; Schueler, M.; Rito, T.; Laudanno, G.; Barbieri, M.; Moore, B.L.; Kraemer, D.C.; Aitken, S.; et al. Hierarchical Folding and Reorganization of Chromosomes Are Linked to Transcriptional Changes in Cellular Differentiation. Mol. Syst. Biol. 2015, 11, 852. [Google Scholar] [CrossRef]
  31. van Steensel, B.; Belmont, A.S. Lamina-Associated Domains: Links with Chromosome Architecture, Heterochromatin, and Gene Repression. Cell 2017, 169, 780–791. [Google Scholar] [CrossRef]
  32. Cremer, T.; Cremer, C. Chromosome Territories, Nuclear Architecture and Gene Regulation in Mammalian Cells. Nat. Rev. Genet. 2001, 2, 292–301. [Google Scholar] [CrossRef]
  33. Kubo, N.; Ishii, H.; Xiong, X.; Bianco, S.; Meitinger, F.; Hu, R.; Hocker, J.D.; Conte, M.; Gorkin, D.; Yu, M.; et al. Promoter-Proximal CTCF Binding Promotes Distal Enhancer-Dependent Gene Activation. Nat. Struct. Mol. Biol. 2021, 28, 152–161. [Google Scholar] [CrossRef]
  34. Furlong, E.E.M.; Levine, M. Developmental Enhancers and Chromosome Topology. Science 2018, 361, 1341–1345. [Google Scholar] [CrossRef]
  35. Huang, H.; Zhu, Q.; Jussila, A.; Han, Y.; Bintu, B.; Kern, C.; Conte, M.; Zhang, Y.; Bianco, S.; Chiariello, A.M.; et al. CTCF Mediates Dosage- and Sequence-Context-Dependent Transcriptional Insulation by Forming Local Chromatin Domains. Nat. Genet. 2021, 53, 1064–1074. [Google Scholar] [CrossRef]
  36. Zuin, J.; Roth, G.; Zhan, Y.; Cramard, J.; Redolfi, J.; Piskadlo, E.; Mach, P.; Kryzhanovska, M.; Tihanyi, G.; Kohler, H.; et al. Nonlinear Control of Transcription through Enhancer–Promoter Interactions. Nature 2022, 604, 571–577. [Google Scholar] [CrossRef]
  37. Spielmann, M.; Lupiáñez, D.G.; Mundlos, S. Structural Variation in the 3D Genome. Nat. Rev. Genet. 2018, 19, 453–467. [Google Scholar] [CrossRef]
  38. Valton, A.L.; Dekker, J. TAD Disruption as Oncogenic Driver. Curr. Opin. Genet. Dev. 2016, 36, 34–40. [Google Scholar] [CrossRef]
  39. Wang, X.; Xu, J.; Zhang, B.; Hou, Y.; Song, F.; Lyu, H.; Yue, F. Genome-Wide Detection of Enhancer-Hijacking Events from Chromatin Interaction Data in Rearranged Genomes. Nat. Methods 2021, 18, 661–668. [Google Scholar] [CrossRef]
  40. Finn, E.H.; Misteli, T. Molecular Basis and Biological Function of Variability in Spatial Genome Organization. Science 2019, 365, eaaw9498. [Google Scholar] [CrossRef]
  41. Gabriele, M.; Brandão, H.B.; Grosse-Holz, S.; Jha, A.; Dailey, G.M.; Cattoglio, C.; Hsieh, T.-H.S.; Mirny, L.; Zechner, C.; Hansen, A.S. Dynamics of CTCF- and Cohesin-Mediated Chromatin Looping Revealed by Live-Cell Imaging. Science 2022, 376, 496–501. [Google Scholar] [CrossRef]
  42. Mach, P.; Kos, P.I.; Zhan, Y.; Cramard, J.; Gaudin, S.; Tünnermann, J.; Marchi, E.; Eglinger, J.; Zuin, J.; Kryzhanovska, M.; et al. Cohesin and CTCF Control the Dynamics of Chromosome Folding. Nat. Genet. 2022, 54, 1907–1918. [Google Scholar] [CrossRef]
  43. Tjong, H.; Li, W.; Kalhor, R.; Dai, C.; Hao, S.; Gong, K.; Zhou, Y.; Li, H.; Zhou, X.J.; Le Gros, M.A.; et al. Population-Based 3D Genome Structure Analysis Reveals Driving Forces in Spatial Genome Organization. Proc. Natl. Acad. Sci. USA 2016, 113, E1663–E1672. [Google Scholar] [CrossRef]
  44. Lesne, A.; Riposo, J.; Roger, P.; Cournac, A.; Mozziconacci, J. 3D Genome Reconstruction from Chromosomal Contacts. Nat. Methods 2014, 11, 1141–1143. [Google Scholar] [CrossRef]
  45. Lin, D.; Bonora, G.; Yardimci, G.G.; Noble, W.S. Computational Methods for Analyzing and Modeling Genome Structure and Organization. Wiley Interdiscip. Rev. Syst. Biol. Med. 2018, 11, e1435. [Google Scholar] [CrossRef]
  46. Nir, G.; Farabella, I.; Pérez Estrada, C.; Ebeling, C.G.; Beliveau, B.J.; Sasaki, H.M.; Lee, S.H.; Nguyen, S.C.; McCole, R.B.; Chattoraj, S.; et al. Walking along Chromosomes with Super-Resolution Imaging, Contact Maps, and Integrative Modeling. PLoS Genet. 2018, 14, e1007872. [Google Scholar] [CrossRef] [PubMed]
  47. Serra, F.; Baù, D.; Goodstadt, M.; Castillo, D.; Filion, G.; Marti-Renom, M.A. Automatic Analysis and 3D-Modelling of Hi-C Data Using TADbit Reveals Structural Features of the Fly Chromatin Colors. PLoS Comput. Biol. 2017, 13, e1005665. [Google Scholar] [CrossRef]
  48. Kim, H.J.; Yardımcı, G.G.; Bonora, G.; Ramani, V.; Liu, J.; Qiu, R.; Lee, C.; Hesson, J.; Ware, C.B.; Shendure, J.; et al. Capturing Cell Type-Specific Chromatin Compartment Patterns by Applying Topic Modeling to Single-Cell Hi-C Data. PLoS Comput. Biol. 2020, 16, e1008173. [Google Scholar] [CrossRef]
  49. Qi, Y.; Zhang, B. Predicting Three-Dimensional Genome Organization with Chromatin States. PLoS Comput. Biol. 2019, 15, e1007024. [Google Scholar] [CrossRef]
  50. Boninsegna, L.; Yildirim, A.; Polles, G.; Zhan, Y.; Quinodoz, S.A.; Finn, E.H.; Guttman, M.; Zhou, X.J.; Alber, F. Integrative Genome Modeling Platform Reveals Essentiality of Rare Contact Events in 3D Genome Organizations. Nat. Methods 2022, 19, 938–949. [Google Scholar] [CrossRef]
  51. Yildirim, A.; Boninsegna, L.; Zhan, Y.; Alber, F. Uncovering the Principles of Genome Folding by 3D Chromatin Modeling. Cold Spring Harb. Perspect. Biol. 2022, 14, a039693. [Google Scholar] [CrossRef]
  52. International Nucleome Consortium; Belokopytova, P.; Viesná, E.; Chiliński, M.; Qi, Y.; Salari, H.; Di Stefano, M.; Esposito, A.; Conte, M.; Chiariello, A.M.; et al. 3DGenBench: A Web-Server to Benchmark Computational Models for 3D Genomics. Nucleic Acids Res. 2022, 50, W4–W12. [Google Scholar] [CrossRef]
  53. Fudenberg, G.; Kelley, D.R.; Pollard, K.S. Predicting 3D Genome Folding from DNA Sequence with Akita. Nat. Methods 2020, 17, 1111–1117. [Google Scholar] [CrossRef]
  54. Zhang, S.; Chasman, D.; Knaack, S.; Roy, S. In Silico Prediction of High-Resolution Hi-C Interaction Matrices. Nat. Commun. 2019, 10, 5449. [Google Scholar] [CrossRef]
  55. Wang, Y.; Zhang, Y.; Zhang, R.; van Schaik, T.; Zhang, L.; Sasaki, T.; Peric-Hupkes, D.; Chen, Y.; Gilbert, D.M.; van Steensel, B.; et al. SPIN Reveals Genome-Wide Landscape of Nuclear Compartmentalization. Genome Biol. 2021, 22, 36. [Google Scholar] [CrossRef]
  56. Schwessinger, R.; Gosden, M.; Downes, D.; Brown, R.C.; Oudelaar, A.M.; Telenius, J.; Teh, Y.W.; Lunter, G.; Hughes, J.R. DeepC: Predicting 3D Genome Folding Using Megabase-Scale Transfer Learning. Nat. Methods 2020, 17, 1131–1138. [Google Scholar] [CrossRef] [PubMed]
  57. Shi, G.; Thirumalai, D. From Hi-C Contact Map to Three-Dimensional Organization of Interphase Human Chromosomes. Phys. Rev. X 2021, 11, 011051. [Google Scholar] [CrossRef]
  58. Li, Q.; Tjong, H.; Li, X.; Gong, K.; Zhou, X.J.; Chiolo, I.; Alber, F. The Three-Dimensional Genome Organization of Drosophila Melanogaster through Data Integration. Genome Biol. 2017, 18, 145. [Google Scholar] [CrossRef] [PubMed]
  59. Di Stefano, M.; Paulsen, J.; Lien, T.G.; Hovig, E.; Micheletti, C. Hi-C-Constrained Physical Models of Human Chromosomes Recover Functionally-Related Properties of Genome Organization. Sci. Rep. 2016, 6, 35985. [Google Scholar] [CrossRef]
  60. Di Stefano, M.; Paulsen, J.; Jost, D.; Marti-Renom, M.A. 4D Nucleome Modeling. Curr. Opin. Genet. Dev. 2021, 67, 25–32. [Google Scholar] [CrossRef] [PubMed]
  61. Fudenberg, G.; Imakaev, M.; Lu, C.; Goloborodko, A.; Abdennur, N.; Mirny, L.A. Formation of Chromosomal Domains by Loop Extrusion. Cell Rep. 2016, 15, 2038–2049. [Google Scholar] [CrossRef] [PubMed]
  62. Sanborn, A.L.; Rao, S.S.P.; Huang, S.C.; Durand, N.C.; Huntley, M.H.; Jewett, A.I.; Bochkov, I.D.; Chinnappan, D.; Cutkosky, A.; Li, J.; et al. Chromatin Extrusion Explains Key Features of Loop and Domain Formation in Wild-Type and Engineered Genomes. Proc. Natl. Acad. Sci. USA 2015, 112, E6456–E6465. [Google Scholar] [CrossRef] [PubMed]
  63. Brackley, C.A.; Brown, J.M.; Waithe, D.; Babbs, C.; Davies, J.; Hughes, J.R.; Buckle, V.J.; Marenduzzo, D. Predicting the Three-Dimensional Folding of Cis-Regulatory Regions in Mammalian Genomes Using Bioinformatic Data and Polymer Models. Genome Biol. 2016, 17, 59. [Google Scholar] [CrossRef]
  64. Conte, M.; Fiorillo, L.; Bianco, S.; Chiariello, A.M.; Esposito, A.; Nicodemi, M. Polymer Physics Indicates Chromatin Folding Variability across Single-Cells Results from State Degeneracy in Phase Separation. Nat. Commun. 2020, 11, 3289. [Google Scholar] [CrossRef]
  65. Conte, M.; Irani, E.; Chiariello, A.M.; Abraham, A.; Bianco, S.; Esposito, A.; Nicodemi, M. Loop-Extrusion and Polymer Phase-Separation Can Co-Exist at the Single-Molecule Level to Shape Chromatin Folding. Nat. Commun. 2022, 13, 4070. [Google Scholar] [CrossRef]
  66. Fiorillo, L.; Bianco, S.; Esposito, A.; Conte, M.; Sciarretta, R.; Musella, F.; Chiariello, A.M. A Modern Challenge of Polymer Physics: Novel Ways to Study, Interpret, and Reconstruct Chromatin Structure. Wiley Interdiscip. Rev. Comput. Mol. Sci. 2020, 10, e1454. [Google Scholar] [CrossRef]
  67. Di Pierro, M.; Zhang, B.; Aiden, E.L.; Wolynes, P.G.; Onuchic, J.N. Transferable Model for Chromosome Architecture. Proc. Natl. Acad. Sci. USA 2016, 113, 12168–12173. [Google Scholar] [CrossRef]
  68. Barbieri, M.; Xie, S.Q.S.Q.; Torlai Triglia, E.; Chiariello, A.M.A.M.; Bianco, S.; De Santiago, I.; Branco, M.R.M.R.; Rueda, D.; Nicodemi, M.; Pombo, A. Active and Poised Promoter States Drive Folding of the Extended HoxB Locus in Mouse Embryonic Stem Cells. Nat. Struct. Mol. Biol. 2017, 24, 515–524. [Google Scholar] [CrossRef]
  69. Shi, G.; Liu, L.; Hyeon, C.; Thirumalai, D. Interphase Human Chromosome Exhibits out of Equilibrium Glassy Dynamics. Nat. Commun. 2018, 9, 3161. [Google Scholar] [CrossRef] [PubMed]
  70. Salari, H.; Di Stefano, M.; Jost, D. Spatial Organization of Chromosomes Leads to Heterogeneous Chromatin Motion and Drives the Liquid- or Gel-like Dynamical Behavior of Chromatin. Genome Res. 2022, 32, 28–43. [Google Scholar] [CrossRef]
  71. Bianco, S.; Annunziatella, C.; Andrey, G.; Chiariello, A.M.; Esposito, A.; Fiorillo, L.; Prisco, A.; Conte, M.; Campanile, R.; Nicodemi, M. Modeling Single-Molecule Conformations of the HoxD Region in Mouse Embryonic Stem and Cortical Neuronal Cells. Cell Rep. 2019, 28, 1574–1583.e4. [Google Scholar] [CrossRef]
  72. Plewczynski, D.; Kadlof, M. Computational Modelling of Three-Dimensional Genome Structure. Methods 2020, 181–182, 1–4. [Google Scholar] [CrossRef]
  73. Nuebler, J.; Fudenberg, G.; Imakaev, M.; Abdennur, N.; Mirny, L.A. Chromatin Organization by an Interplay of Loop Extrusion and Compartmental Segregation. Proc. Natl. Acad. Sci. USA 2018, 115, E6697–E6706. [Google Scholar] [CrossRef] [PubMed]
  74. Nicodemi, M.; Pombo, A. Models of Chromosome Structure. Curr. Opin. Cell Biol. 2014, 28, 90–95. [Google Scholar] [CrossRef]
  75. Nicodemi, M.; Prisco, A. Thermodynamic Pathways to Genome Spatial Organization in the Cell Nucleus. Biophys. J. 2009, 96, 2168–2177. [Google Scholar] [CrossRef] [PubMed]
  76. Bohn, M.; Heermann, D.W. Diffusion-Driven Looping Provides a Consistent Provides a Consistent Framework for Chromatin Organization. PLoS ONE 2010, 5, e12218. [Google Scholar] [CrossRef]
  77. Barbieri, M.; Chotalia, M.; Fraser, J.; Lavitas, L.-M.L.M.; Dostie, J.; Pombo, A.; Nicodemi, M. Complexity of Chromatin Folding Is Captured by the Strings and Binders Switch Model. Proc. Natl. Acad. Sci. USA 2012, 109, 16173–16178. [Google Scholar] [CrossRef] [PubMed]
  78. Bianco, S.; Lupiáñez, D.G.; Chiariello, A.M.; Annunziatella, C.; Kraft, K.; Schöpflin, R.; Wittler, L.; Andrey, G.; Vingron, M.; Pombo, A.; et al. Polymer Physics Predicts the Effects of Structural Variants on Chromatin Architecture. Nat. Genet. 2018, 50, 662–667. [Google Scholar] [CrossRef] [PubMed]
  79. Chiariello, A.M.; Annunziatella, C.; Bianco, S.; Esposito, A.; Nicodemi, M. Polymer Physics of Chromosome Large-Scale 3D Organisation. Sci. Rep. 2016, 6, 29775. [Google Scholar] [CrossRef]
  80. Esposito, A.; Annunziatella, C.; Bianco, S.; Chiariello, A.M.; Fiorillo, L.; Nicodemi, M. Models of Polymer Physics for the Architecture of the Cell Nucleus. Wiley Interdiscip. Rev. Syst. Biol. Med. 2019, 11, e1444. [Google Scholar] [CrossRef]
  81. Buckle, A.; Brackley, C.A.; Boyle, S.; Marenduzzo, D.; Gilbert, N. Polymer Simulations of Heteromorphic Chromatin Predict the 3D Folding of Complex Genomic Loci. Mol. Cell 2018, 72, 786–797.e11. [Google Scholar] [CrossRef]
  82. Brackley, C.A.; Taylor, S.; Papantonis, A.; Cook, P.R.; Marenduzzo, D. Nonspecific Bridging-Induced Attraction Drives Clustering of DNA-Binding Proteins and Genome Organization. Proc. Natl. Acad. Sci. USA 2013, 110, E3605–E3611. [Google Scholar] [CrossRef] [PubMed]
  83. Conte, M.; Fiorillo, L.; Annunziatella, C.; Esposito, A.; Musella, F.; Abraham, A.; Bianco, S.; Chiariello, A.M. Dynamic and Equilibrium Properties of Finite-Size Polymer Models of Chromosome Folding. Phys. Rev. E 2021, 104, 054402. [Google Scholar] [CrossRef] [PubMed]
  84. Racko, D.; Benedetti, F.; Dorier, J.; Stasiak, A. Transcription-Induced Supercoiling as the Driving Force of Chromatin Loop Extrusion during Formation of TADs in Interphase Chromosomes. Nucleic Acids Res. 2018, 46, 1648–1660. [Google Scholar] [CrossRef] [PubMed]
  85. Bianco, S.; Chiariello, A.M.; Conte, M.; Esposito, A.; Fiorillo, L.; Musella, F.; Nicodemi, M. Computational Approaches from Polymer Physics to Investigate Chromatin Folding. Curr. Opin. Cell Biol. 2020, 64, 10–17. [Google Scholar] [CrossRef] [PubMed]
  86. Chiariello, A.M.; Bianco, S.; Oudelaar, A.M.; Esposito, A.; Annunziatella, C.; Fiorillo, L.; Conte, M.; Corrado, A.; Prisco, A.; Larke, M.S.C.; et al. A Dynamic Folded Hairpin Conformation Is Associated with α-Globin Activation in Erythroid Cells. Cell Rep. 2020, 30, 2125–2135.e5. [Google Scholar] [CrossRef]
  87. Esposito, A.; Abraham, A.; Conte, M.; Vercellone, F.; Prisco, A.; Bianco, S.; Chiariello, A.M. The Physics of DNA Folding: Polymer Models and Phase-Separation. Polymers 2022, 14, 1918. [Google Scholar] [CrossRef]
  88. Nicodemi, M.; Panning, B.; Prisco, A. A Thermodynamic Switch for Chromosome Colocalization. Genetics 2008, 179, 717–721. [Google Scholar] [CrossRef]
  89. Banigan, E.J.; Mirny, L.A. Loop Extrusion: Theory Meets Single-Molecule Experiments. Curr. Opin. Cell Biol. 2020, 64, 124–138. [Google Scholar] [CrossRef]
  90. Goloborodko, A.; Marko, J.F.; Mirny, L.A. Chromosome Compaction by Active Loop Extrusion. Biophys. J. 2016, 110, 2162–2168. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  91. Brackley, C.A.; Johnson, J.; Michieletto, D.; Morozov, A.N.; Nicodemi, M.; Cook, P.R.; Marenduzzo, D. Nonequilibrium Chromosome Looping via Molecular Slip Links. Phys. Rev. Lett. 2017, 119, 138101. [Google Scholar] [CrossRef]
  92. Chiariello, A.M.; Bianco, S.; Esposito, A.; Fiorillo, L.; Conte, M.; Irani, E.; Musella, F.; Abraham, A.; Prisco, A.; Nicodemi, M. Physical Mechanisms of Chromatin Spatial Organization. FEBS J. 2021, 289, 1180–1190. [Google Scholar] [CrossRef] [PubMed]
  93. Jost, D.; Carrivain, P.; Cavalli, G.; Vaillant, C. Modeling Epigenome Folding: Formation and Dynamics of Topologically Associated Chromatin Domains. Nucleic Acids Res. 2014, 42, 9553–9561. [Google Scholar] [CrossRef]
  94. Zhang, B.; Wolynes, P.G. Topology, Structures, and Energy Landscapes of Human Chromosomes. Proc. Natl. Acad. Sci. USA 2015, 112, 6062–6067. [Google Scholar] [CrossRef]
  95. Esposito, A.; Bianco, S.; Chiariello, A.M.; Abraham, A.; Fiorillo, L.; Conte, M.; Campanile, R.; Nicodemi, M. Polymer Physics Reveals a Combinatorial Code Linking 3D Chromatin Architecture to 1D Chromatin States. Cell Rep. 2022, 38, 110601. [Google Scholar] [CrossRef]
  96. Crippa, M.; Zhan, Y.; Tiana, G. Effective Model of Loop Extrusion Predicts Chromosomal Domains. Phys. Rev. E 2020, 102, 032414. [Google Scholar] [CrossRef]
  97. Gibcus, J.H.; Samejima, K.; Goloborodko, A.; Samejima, I.; Naumova, N.; Nuebler, J.; Kanemaki, M.T.; Xie, L.; Paulson, J.R.; Earnshaw, W.C.; et al. A Pathway for Mitotic Chromosome Formation. Science 2018, 359, eaao6135. [Google Scholar] [CrossRef]
  98. Schwarzer, W.; Abdennur, N.; Goloborodko, A.; Pekowska, A.; Fudenberg, G.; Loe-Mie, Y.; Fonseca, N.A.; Huber, W.; Haering, C.H.; Mirny, L.; et al. Two Independent Modes of Chromatin Organization Revealed by Cohesin Removal. Nature 2017, 551, 51–56. [Google Scholar] [CrossRef] [PubMed]
  99. Ganji, M.; Shaltiel, I.A.; Bisht, S.; Kim, E.; Kalichava, A.; Haering, C.H.; Dekker, C. Real-Time Imaging of DNA Loop Extrusion by Condensin. Science 2018, 360, 102–105. [Google Scholar] [CrossRef]
  100. Kim, Y.; Shi, Z.; Zhang, H.; Finkelstein, I.J.; Yu, H. Human Cohesin Compacts DNA by Loop Extrusion. Science 2019, 366, 1345–1349. [Google Scholar] [CrossRef]
  101. Kong, M.; Cutts, E.E.; Pan, D.; Beuron, F.; Kaliyappan, T.; Xue, C.; Morris, E.P.; Musacchio, A.; Vannini, A.; Greene, E.C. Human Condensin I and II Drive Extensive ATP-Dependent Compaction of Nucleosome-Bound DNA. Mol. Cell 2020, 79, 99–114.e9. [Google Scholar] [CrossRef]
  102. Davidson, I.F.; Bauer, B.; Goetz, D.; Tang, W.; Wutz, G.; Peters, J.-M. DNA Loop Extrusion by Human Cohesin. Science 2019, 366, 1338–1345. [Google Scholar] [CrossRef]
  103. Cremer, M.; Brandstetter, K.; Maiser, A.; Rao, S.S.P.; Schmid, V.J.; Guirao-Ortiz, M.; Mitra, N.; Mamberti, S.; Klein, K.N.; Gilbert, D.M.; et al. Cohesin Depleted Cells Rebuild Functional Nuclear Compartments after Endomitosis. Nat. Commun. 2020, 11, 6146. [Google Scholar] [CrossRef] [PubMed]
  104. Taylor, T.; Sikorska, N.; Shchuka, V.M.; Chahar, S.; Ji, C.; Macpherson, N.N.; Moorthy, S.D.; de Kort, M.A.C.; Mullany, S.; Khader, N.; et al. Transcriptional Regulation and Chromatin Architecture Maintenance Are Decoupled Functions at the Sox2 Locus. Genes Dev. 2022, 36, 699–717. [Google Scholar] [CrossRef]
  105. Aljahani, A.; Hua, P.; Karpinska, M.A.; Quililan, K.; Davies, J.O.J.; Oudelaar, A.M. Analysis of Sub-Kilobase Chromatin Topology Reveals Nano-Scale Regulatory Interactions with Variable Dependence on Cohesin and CTCF. Nat. Commun. 2022, 13, 2139. [Google Scholar] [CrossRef]
  106. Chakraborty, S.; Kopitchinski, N.; Eraso, A.; Awasthi, P.; Chari, R.; Rocha, P.P. High Affinity Enhancer-Promoter Interactions Can Bypass CTCF/Cohesin-Mediated Insulation and Contribute to Phenotypic Robustness. bioRxiv 2021. [Google Scholar] [CrossRef]
  107. Ahn, J.H.; Davis, E.S.; Daugird, T.A.; Zhao, S.; Quiroga, I.Y.; Uryu, H.; Li, J.; Storey, A.J.; Tsai, Y.H.; Keeley, D.P.; et al. Phase Separation Drives Aberrant Chromatin Looping and Cancer Development. Nature 2021, 595, 591–595. [Google Scholar] [CrossRef]
  108. Ryu, J.K.; Bouchoux, C.; Liu, H.W.; Kim, E.; Minamino, M.; de Groot, R.; Katan, A.J.; Bonato, A.; Marenduzzo, D.; Michieletto, D.; et al. Bridging-Induced Phase Separation Induced by Cohesin SMC Protein Complexes. Sci. Adv. 2021, 7, eabe5905. [Google Scholar] [CrossRef]
  109. Erdel, F.; Rippe, K. Formation of Chromatin Subcompartments by Phase Separation. Biophys. J. 2018, 114, 2262–2270. [Google Scholar] [CrossRef]
  110. Boija, A.; Klein, I.A.; Sabari, B.R.; Dall’Agnese, A.; Coffey, E.L.; Zamudio, A.V.; Li, C.H.; Shrinivas, K.; Manteiga, J.C.; Hannett, N.M.; et al. Transcription Factors Activate Genes through the Phase-Separation Capacity of Their Activation Domains. Cell 2018, 175, 1842–1855.e16. [Google Scholar] [CrossRef] [Green Version]
  111. Cho, W.K.; Spille, J.H.; Hecht, M.; Lee, C.; Li, C.; Grube, V.; Cisse, I.I. Mediator and RNA Polymerase II Clusters Associate in Transcription-Dependent Condensates. Science 2018, 361, 412–415. [Google Scholar] [CrossRef] [PubMed]
  112. Sabari, B.R.; Dall’Agnese, A.; Boija, A.; Klein, I.A.; Coffey, E.L.; Shrinivas, K.; Abraham, B.J.; Hannett, N.M.; Zamudio, A.V.; Manteiga, J.C.; et al. Coactivator Condensation at Super-Enhancers Links Phase Separation and Gene Control. Science 2018, 361, eaar3958. [Google Scholar] [CrossRef] [PubMed]
  113. Chong, S.; Dugast-Darzacq, C.; Liu, Z.; Dong, P.; Dailey, G.M.; Cattoglio, C.; Heckert, A.; Banala, S.; Lavis, L.; Darzacq, X.; et al. Imaging Dynamic and Selective Low-Complexity Domain Interactions That Control Gene Transcription. Science 2018, 361, eaar2555. [Google Scholar] [CrossRef] [PubMed]
  114. Guo, Y.E.; Manteiga, J.C.; Henninger, J.E.; Sabari, B.R.; Dall’Agnese, A.; Hannett, N.M.; Spille, J.-H.; Afeyan, L.K.; Zamudio, A.V.; Shrinivas, K.; et al. Pol II Phosphorylation Regulates a Switch between Transcriptional and Splicing Condensates. Nature 2019, 572, 543–548. [Google Scholar] [CrossRef]
  115. Larson, A.G.; Elnatan, D.; Keenen, M.M.; Trnka, M.J.; Johnston, J.B.; Burlingame, A.L.; Agard, D.A.; Redding, S.; Narlikar, G.J. Liquid Droplet Formation by HP1α Suggests a Role for Phase Separation in Heterochromatin. Nature 2017, 547, 236–240. [Google Scholar] [CrossRef]
  116. Strom, A.R.; Emelyanov, A.V.; Mir, M.; Fyodorov, D.V.; Darzacq, X.; Karpen, G.H. Phase Separation Drives Heterochromatin Domain Formation. Nature 2017, 547, 241–245. [Google Scholar] [CrossRef]
  117. Banani, S.F.; Lee, H.O.; Hyman, A.A.; Rosen, M.K. Biomolecular Condensates: Organizers of Cellular Biochemistry. Nat. Rev. Mol. Cell Biol. 2017, 18, 285–298. [Google Scholar] [CrossRef] [PubMed]
  118. Brangwynne, C.P.; Tompa, P.; Pappu, R.V. Polymer Physics of Intracellular Phase Transitions. Nat. Phys. 2015, 11, 899–904. [Google Scholar] [CrossRef]
  119. Hnisz, D.; Shrinivas, K.; Young, R.A.; Chakraborty, A.K.; Sharp, P.A. A Phase Separation Model for Transcriptional Control. Cell 2017, 169, 13–23. [Google Scholar] [CrossRef] [PubMed]
  120. Brackley, C.A.; Liebchen, B.; Michieletto, D.; Mouvet, F.; Cook, P.R.; Marenduzzo, D. Ephemeral Protein Binding to DNA Shapes Stable Nuclear Bodies and Chromatin Domains. Biophys. J. 2017, 112, 1085–1093. [Google Scholar] [CrossRef] [Green Version]
  121. Chiariello, A.M.; Corberi, F.; Salerno, M. The Interplay between Phase Separation and Gene-Enhancer Communication: A Theoretical Study. Biophys. J. 2020, 119, 873–883. [Google Scholar] [CrossRef] [PubMed]
  122. Ng, W.S.; Sielaff, H.; Zhao, Z.W. Phase Separation-Mediated Chromatin Organization and Dynamics: From Imaging-Based Quantitative Characterizations to Functional Implications. Int. J. Mol. Sci. 2022, 23, 8039. [Google Scholar] [CrossRef]
  123. Ryu, J.K.; Hwang, D.E.; Choi, J.M. Current Understanding of Molecular Phase Separation in Chromosomes. Int. J. Mol. Sci. 2021, 22, 10736. [Google Scholar] [CrossRef]
  124. Brackey, C.A.; Marenduzzo, D.; Gilbert, N. Mechanistic Modeling of Chromatin Folding to Understand Function. Nat. Methods 2020, 17, 767–775. [Google Scholar] [CrossRef] [PubMed]
  125. De Gennes, P.G. Scaling Concepts in Polymer Physics; Cornell University Press: Ithaca, NY, USA, 1979. [Google Scholar] [CrossRef]
  126. Fiorillo, L.; Bianco, S.; Chiariello, A.M.; Barbieri, M.; Esposito, A.; Annunziatella, C.; Conte, M.; Corrado, A.; Prisco, A.; Pombo, A.; et al. Inference of Chromosome 3D Structures from GAM Data by a Physics Computational Approach. Methods 2019, 181–182, 70–79. [Google Scholar] [CrossRef]
  127. Conte, M.; Esposito, A.; Fiorillo, L.; Campanile, R.; Annunziatella, C.; Corrado, A.; Chiariello, M.G.; Bianco, S.; Chiariello, A.M. Efficient Computational Implementation of Polymer Physics Models to Explore Chromatin Structure. Int. J. Parallel Emergent Distrib. Syst. 2019, 37, 91–102. [Google Scholar] [CrossRef]
  128. Kremer, K.; Grest, G.S. Dynamics of Entangled Linear Polymer Melts: A Molecular-Dynamics Simulation. J. Chem. Phys. 1990, 92, 5057–5086. [Google Scholar] [CrossRef]
  129. Rosa, A.; Everaers, R. Structure and Dynamics of Interphase Chromosomes. PLoS Comput. Biol. 2008, 4, e1000153. [Google Scholar] [CrossRef] [PubMed]
  130. Conte, M.; Esposito, A.; Fiorillo, L.; Annunziatella, C.; Corrado, A.; Musella, F.; Sciarretta, R.; Chiariello, A.M.; Bianco, S. Hybrid Machine Learning and Polymer Physics Approach to Investigate 3D Chromatin Structure. In Proceedings of the Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer Nature: Cham, Switzerland, 2020; Volume 11997 LNCS. [Google Scholar]
  131. Eastman, P.; Friedrichs, M.S.; Chodera, J.D.; Radmer, R.J.; Bruns, C.M.; Ku, J.P.; Beauchamp, K.A.; Lane, T.J.; Wang, L.-P.; Shukla, D.; et al. OpenMM 4: A Reusable, Extensible, Hardware Independent Library for High Performance Molecular Simulation. J. Chem. Theory Comput. 2013, 9, 461–469. [Google Scholar] [CrossRef]
  132. Plimpton, S. Fast Parallel Algorithms for Short-Range Molecular Dynamics. J. Comput. Phys. 1995, 117, 1–19. [Google Scholar] [CrossRef]
  133. Anderson, J.A.; Glaser, J.; Glotzer, S.C. HOOMD-Blue: A Python Package for High-Performance Molecular Dynamics and Hard Particle Monte Carlo Simulations. Comput. Mater. Sci. 2020, 173, 109363. [Google Scholar] [CrossRef]
  134. Dunham, I.; Kundaje, A.; Aldred, S.F.; Collins, P.J.; Davis, C.A.; Doyle, F.; Epstein, C.B.; Frietze, S.; Harrow, J.; Kaul, R.; et al. An Integrated Encyclopedia of DNA Elements in the Human Genome. Nature 2012, 489, 57–74. [Google Scholar] [CrossRef]
  135. Conte, M.; Chiariello, A.M.; Abraham, A.; Bianco, S.; Esposito, A.; Nicodemi, M.; Matteuzzi, T.; Vercellone, F. Polymer Models of Chromatin Imaging Data in Single Cells. Algorithms 2022, 15, 330. [Google Scholar] [CrossRef]
  136. Stevens, T.J.; Lando, D.; Basu, S.; Atkinson, L.P.; Cao, Y.; Lee, S.F.; Leeb, M.; Wohlfahrt, K.J.; Boucher, W.; O’Shaughnessy-Kirwan, A.; et al. 3D Structures of Individual Mammalian Genomes Studied by Single-Cell Hi-C. Nature 2017, 544, 59–64. [Google Scholar] [CrossRef]
  137. Theobald, D.L. Rapid Calculation of RMSDs Using a Quaternion-Based Characteristic Polynomial. Acta Crystallogr. Sect. A Found. Crystallogr. 2005, 61, 478–480. [Google Scholar] [CrossRef] [PubMed]
  138. Fiorillo, L.; Musella, F.; Conte, M.; Kempfer, R.; Chiariello, A.M.; Bianco, S.; Kukalev, A.; Irastorza-Azcarate, I.; Esposito, A.; Abraham, A.; et al. Comparison of the Hi-C, GAM and SPRITE Methods Using Polymer Models of Chromatin. Nat. Methods 2021, 18, 482–490. [Google Scholar] [CrossRef]
  139. Götz, M.; Messina, O.; Espinola, S.; Fiche, J.-B.; Nollmann, M. Multiple Parameters Shape the 3D Chromatin Structure of Single Nuclei at the Doc Locus in Drosophila. Nat. Commun. 2022, 13, 5375. [Google Scholar] [CrossRef] [PubMed]
  140. Kragesteen, B.K.; Spielmann, M.; Paliou, C.; Heinrich, V.; Schöpflin, R.; Esposito, A.; Annunziatella, C.; Bianco, S.; Chiariello, A.M.; Jerković, I.; et al. Dynamic 3D Chromatin Architecture Contributes to Enhancer Specificity and Limb Morphogenesis. Nat. Genet. 2018, 50, 1463–1473. [Google Scholar] [CrossRef] [PubMed]
  141. Esposito, A.; Chiariello, A.M.; Conte, M.; Fiorillo, L.; Musella, F.; Sciarretta, R.; Bianco, S. Higher-Order Chromosome Structures Investigated by Polymer Physics in Cellular Morphogenesis and Differentiation. J. Mol. Biol. 2020, 432, 701–711. [Google Scholar] [CrossRef]
  142. Conte, M.; Fiorillo, L.; Bianco, S.; Chiariello, A.M.; Esposito, A.; Musella, F.; Flora, F.; Abraham, A.; Nicodemi, M. A Polymer Physics Model to Dissect Genome Organization in Healthy and Pathological Phenotypes. In Methods in Molecular Biology; Springer Nature: Cham, Switzerland, 2022; Volume 2301. [Google Scholar]
  143. Esposito, A.; Bianco, S.; Fiorillo, L.; Conte, M.; Abraham, A.; Musella, F.; Nicodemi, M.; Prisco, A.; Chiariello, A.M. Polymer Models Are a Versatile Tool to Study Chromatin 3d Organization. Biochem. Soc. Trans. 2021, 49, 1675–1684. [Google Scholar] [CrossRef]
  144. Franke, M.; Ibrahim, D.M.; Andrey, G.; Schwarzer, W.; Heinrich, V.; Schöpflin, R.; Kraft, K.; Kempfer, R.; Jerković, I.; Chan, W.L.; et al. Formation of New Chromatin Domains Determines Pathogenicity of Genomic Duplications. Nature 2016, 538, 265–269. [Google Scholar] [CrossRef]
  145. Ho, J.W.K.; Jung, Y.L.; Liu, T.; Alver, B.H.; Lee, S.; Ikegami, K.; Sohn, K.-A.; Minoda, A.; Tolstorukov, M.Y.; Appert, A.; et al. Comparative Analysis of Metazoan Chromatin Organization. Nature 2014, 512, 449–452. [Google Scholar] [CrossRef]
  146. Tang, Z.; Luo, O.J.; Li, X.; Zheng, M.; Zhu, J.J.; Szalaj, P.; Trzaskoma, P.; Magalska, A.; Wlodarczyk, J.; Ruszczycki, B.; et al. CTCF-Mediated Human 3D Genome Architecture Reveals Chromatin Topology for Transcription. Cell 2015, 163, 1611–1627. [Google Scholar] [CrossRef]
  147. Ernst, J.; Kheradpour, P.; Mikkelsen, T.S.; Shoresh, N.; Ward, L.D.; Epstein, C.B.; Zhang, X.; Wang, L.; Issner, R.; Coyne, M.; et al. Mapping and Analysis of Chromatin State Dynamics in Nine Human Cell Types. Nature 2011, 473, 43–49. [Google Scholar] [CrossRef]
  148. Gifford, C.A.; Ziller, M.J.; Gu, H.; Trapnell, C.; Donaghey, J.; Tsankov, A.; Shalek, A.K.; Kelley, D.R.; Shishkin, A.A.; Issner, R.; et al. Transcriptional and Epigenetic Dynamics during Specification of Human Embryonic Stem Cells. Cell 2013, 153, 1149–1163. [Google Scholar] [CrossRef]
  149. Javierre, B.M.; Burren, O.S.; Wilder, S.P.; Kreuzhuber, R.; Hill, S.M.; Sewitz, S.; Cairns, J.; Wingett, S.W.; Várnai, C.; Thiecke, M.J.; et al. Lineage-Specific Genome Architecture Links Enhancers and Non-Coding Disease Variants to Target Gene Promoters. Cell 2016, 167, 1369–1384.e19. [Google Scholar] [CrossRef]
  150. Nicodemi, M.; Coniglio, A.; Herrmann, H.J. Compaction and Force Propagation in Granular Packings. Phys. A Stat. Mech. its Appl. 1997, 240, 405–418. [Google Scholar] [CrossRef]
  151. Nicodemi, M.; Jensen, H.J. Creep of Superconducting Vortices in the Limit of Vanishing Temperature: A Fingerprint of off-Equilibrium Dynamics. Phys. Rev. Lett. 2001, 86, 4378–4381. [Google Scholar] [CrossRef]
  152. Grebenkov, D.S.; Ciamarra, M.P.; Nicodemi, M.; Coniglio, A. Flow, Ordering, and Jamming of Sheared Granular Suspensions. Phys. Rev. Lett. 2008, 100, 078001. [Google Scholar] [CrossRef]
  153. Oliveira, L.P.; Jensen, H.J.; Nicodemi, M.; Sibani, P. Record Dynamics and the Observed Temperature Plateau in the Magnetic Creep-Rate of Type-II Superconductors. Phys. Rev. B-Condens. Matter Mater. Phys. 2005, 71, 104526. [Google Scholar] [CrossRef]
  154. Tarzia, M.; De Candia, A.; Fierro, A.; Nicodemi, M.; Coniglio, A. Glass Transition in Granular Media. Europhys. Lett. 2004, 66, 531–537. [Google Scholar] [CrossRef]
  155. Marti-Renom, M.A. Benchmarking Experiments with Polymer Modeling. Nat. Methods 2021, 18, 456–457. [Google Scholar] [CrossRef]
  156. Fulco, C.P.; Nasser, J.; Jones, T.R.; Munson, G.; Bergman, D.T.; Subramanian, V.; Grossman, S.R.; Anyoha, R.; Doughty, B.R.; Patwardhan, T.A.; et al. Activity-by-Contact Model of Enhancer–Promoter Regulation from Thousands of CRISPR Perturbations. Nat. Genet. 2019, 51, 1664–1669. [Google Scholar] [CrossRef]
  157. Karr, J.P.; Ferrie, J.J.; Tjian, R.; Darzacq, X. The Transcription Factor Activity Gradient (TAG) Model: Contemplating a Contact-Independent Mechanism for Enhancer–Promoter Communication. Genes Dev. 2022, 36, 7–16. [Google Scholar] [CrossRef]
  158. Iannone, F.; Ambrosino, F.; Bracco, G.; De Rosa, M.; Funel, A.; Guarnieri, G.; Migliori, S.; Palombi, F.; Ponti, G.; Santomauro, G.; et al. CRESCO ENEA HPC Clusters: A Working Example of a Multifabric GPFS Spectrum Scale Layout. In Proceedings of the 2019 International Conference on High Performance Computing and Simulation, HPCS 2019, Dublin, Ireland, 15–19 July 2019. [Google Scholar]
Figure 1. Two main distinct physical mechanisms of DNA contact formation. (a) Cartoon of the loop-extrusion model: active motors (e.g., cohesin rings, depicted in orange in the figure) translocate along the chromosome and extrude DNA loops, hence defining a physical contact between distal, blocking, anchor sites (e.g., convergently-oriented CTCF sites, in red and light blue) where translocation halts. (b) Cartoon of phase-separation-based polymer models: diffusing binders, such as TFs, enable spatial proximity by bridging pairs (or multiplets) of distal cognate sites (each type represented by a different specific color), thus driving the spontaneous self-assembly of the chain into phase-separated clusters of sites with enriched levels of self-interactions.
Figure 1. Two main distinct physical mechanisms of DNA contact formation. (a) Cartoon of the loop-extrusion model: active motors (e.g., cohesin rings, depicted in orange in the figure) translocate along the chromosome and extrude DNA loops, hence defining a physical contact between distal, blocking, anchor sites (e.g., convergently-oriented CTCF sites, in red and light blue) where translocation halts. (b) Cartoon of phase-separation-based polymer models: diffusing binders, such as TFs, enable spatial proximity by bridging pairs (or multiplets) of distal cognate sites (each type represented by a different specific color), thus driving the spontaneous self-assembly of the chain into phase-separated clusters of sites with enriched levels of self-interactions.
Ijms 24 03660 g001
Figure 2. Loop extrusion (LE) and Strings and Binders (SBS) models of a 2.5 Mb wide chromatin locus in human HCT116. The experimental median distance matrix of the locus (top) [13] is consistently recapitulated by the LE (middle) and SBS (bottom) models, as highlighted by their high distance-corrected correlation coefficients (respectively, r’ = 0.87 and r’ = 0.84), indicating that both mechanisms significantly capture the spatial conformation of the locus at the ensemble-average level. Adapted from [64,65].
Figure 2. Loop extrusion (LE) and Strings and Binders (SBS) models of a 2.5 Mb wide chromatin locus in human HCT116. The experimental median distance matrix of the locus (top) [13] is consistently recapitulated by the LE (middle) and SBS (bottom) models, as highlighted by their high distance-corrected correlation coefficients (respectively, r’ = 0.87 and r’ = 0.84), indicating that both mechanisms significantly capture the spatial conformation of the locus at the ensemble-average level. Adapted from [64,65].
Ijms 24 03660 g002
Figure 3. LE and SBS cooperate in determining the heterogeneity of chromatin architectures in single cells. (a) Schematic of the all-against-all structural comparison between microscopy [13] and model [65] 3D structures. (b) Top: example of experiment–model best-match association via the RMSD criterion. Middle: the identified RMSD best matches are statistically significant in the LE and SBS models (two-sided Mann–Whitney p-value < 10−3, RMSD are z-scored on the x-axis). The control is the distribution of RMSD values for randomly taken pairs of experimental conformations. Bottom: more than 90% of the model best matches fall below the first decile of the control distribution. (c) The degree of cell-to-cell structural variability is quantified by an all-against-all pair correlation of single-molecule distance matrices. While LE (orange distribution) and SBS (green), respectively, under- and over-estimate the experimental distribution (blue) of pair correlations, a model combining both processes at the single-cell level (LE + SBS, light blue) best explains the data, hinting that LE and SBS cooperate in determining chromatin structure and its observed cell-to-cell heterogeneity. The dashed vertical line is the average value of the experimental distribution. Adapted from [64,65].
Figure 3. LE and SBS cooperate in determining the heterogeneity of chromatin architectures in single cells. (a) Schematic of the all-against-all structural comparison between microscopy [13] and model [65] 3D structures. (b) Top: example of experiment–model best-match association via the RMSD criterion. Middle: the identified RMSD best matches are statistically significant in the LE and SBS models (two-sided Mann–Whitney p-value < 10−3, RMSD are z-scored on the x-axis). The control is the distribution of RMSD values for randomly taken pairs of experimental conformations. Bottom: more than 90% of the model best matches fall below the first decile of the control distribution. (c) The degree of cell-to-cell structural variability is quantified by an all-against-all pair correlation of single-molecule distance matrices. While LE (orange distribution) and SBS (green), respectively, under- and over-estimate the experimental distribution (blue) of pair correlations, a model combining both processes at the single-cell level (LE + SBS, light blue) best explains the data, hinting that LE and SBS cooperate in determining chromatin structure and its observed cell-to-cell heterogeneity. The dashed vertical line is the average value of the experimental distribution. Adapted from [64,65].
Ijms 24 03660 g003
Figure 4. Prediction of the impact of genomic mutations. (a) Hi-C contact matrix [144] of the wild-type (WT) Sox9 genomic region (top) and its corresponding SBS model (middle) show high similarity (r’ = 0.87). Bottom: snapshot of a single-molecule conformation in the WT predicted by the model. (b) The model contact matrix (middle) in the Dup-S case, predicted by implementing the duplication on the WT model. It shows a high correlation (r’ = 0.76) with the corresponding matrix derived from an independent experiment (top) in cells bearing the Dup-S mutation. The black arrows highlight the effect of the duplication on the contact pattern. Bottom: snapshot of a model-predicted single-molecule conformation in Dup-S (note, in particular, how the duplicated region, highlighted in red, is well embedded into the original sequence, colored in brown). (c) The model contact matrix (middle) in the Dup-L case, predicted by implementing the duplication on the WT model. It shows a correlation with a matrix from an independent experiment (top) in cells bearing the Dup-L mutation as high as r’ = 0.63. The black arrows highlight the effect of the duplication on the contact pattern. Differently from the Dup-S case, a snapshot of the 3D structure in Dup-L (bottom) shows that a fraction of the duplicated region (neo-TAD, colored in red) results to be spatially isolated. This clarifies that different mutations result in different 3D structures and distinct enhancer-hijackings, explaining their phenotypes. Adapted from [95].
Figure 4. Prediction of the impact of genomic mutations. (a) Hi-C contact matrix [144] of the wild-type (WT) Sox9 genomic region (top) and its corresponding SBS model (middle) show high similarity (r’ = 0.87). Bottom: snapshot of a single-molecule conformation in the WT predicted by the model. (b) The model contact matrix (middle) in the Dup-S case, predicted by implementing the duplication on the WT model. It shows a high correlation (r’ = 0.76) with the corresponding matrix derived from an independent experiment (top) in cells bearing the Dup-S mutation. The black arrows highlight the effect of the duplication on the contact pattern. Bottom: snapshot of a model-predicted single-molecule conformation in Dup-S (note, in particular, how the duplicated region, highlighted in red, is well embedded into the original sequence, colored in brown). (c) The model contact matrix (middle) in the Dup-L case, predicted by implementing the duplication on the WT model. It shows a correlation with a matrix from an independent experiment (top) in cells bearing the Dup-L mutation as high as r’ = 0.63. The black arrows highlight the effect of the duplication on the contact pattern. Differently from the Dup-S case, a snapshot of the 3D structure in Dup-L (bottom) shows that a fraction of the duplicated region (neo-TAD, colored in red) results to be spatially isolated. This clarifies that different mutations result in different 3D structures and distinct enhancer-hijackings, explaining their phenotypes. Adapted from [95].
Ijms 24 03660 g004
Figure 5. A combinatorial epigenetic code links 1D and 3D chromatin structures. (a) Top: in situ Hi-C data [25] of chromosome 20 at 5 kb of resolution in the human GM12878 cell line. Middle: position and abundance of the different binding domains along chromosome 20. The colors reflect the specific association of each domain with a set of key epigenetic marks (see panel b). Bottom: the contact matrix inferred from the model shows a contact pattern similar to Hi-C (r’ = 0.85). Similar results are found across chromosomes [95]. (b) According to their correlation with key histone marks, the genome-wide binding domains of the model cluster in nine epigenetic classes matching well the chromatin states found in epigenetic segmentation studies [95]. The heatmap shows the histone profile of the centroid of each class. Interestingly, some but not all the classes also correlate with CTCF and cohesin, suggesting that a specific combination of genome organizing factors, including but not limited to CTCF and cohesin, links 3D chromatin architecture to 1D chromatin states. Adapted from [95].
Figure 5. A combinatorial epigenetic code links 1D and 3D chromatin structures. (a) Top: in situ Hi-C data [25] of chromosome 20 at 5 kb of resolution in the human GM12878 cell line. Middle: position and abundance of the different binding domains along chromosome 20. The colors reflect the specific association of each domain with a set of key epigenetic marks (see panel b). Bottom: the contact matrix inferred from the model shows a contact pattern similar to Hi-C (r’ = 0.85). Similar results are found across chromosomes [95]. (b) According to their correlation with key histone marks, the genome-wide binding domains of the model cluster in nine epigenetic classes matching well the chromatin states found in epigenetic segmentation studies [95]. The heatmap shows the histone profile of the centroid of each class. Interestingly, some but not all the classes also correlate with CTCF and cohesin, suggesting that a specific combination of genome organizing factors, including but not limited to CTCF and cohesin, links 3D chromatin architecture to 1D chromatin states. Adapted from [95].
Ijms 24 03660 g005
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Conte, M.; Esposito, A.; Vercellone, F.; Abraham, A.; Bianco, S. Unveiling the Machinery behind Chromosome Folding by Polymer Physics Modeling. Int. J. Mol. Sci. 2023, 24, 3660. https://doi.org/10.3390/ijms24043660

AMA Style

Conte M, Esposito A, Vercellone F, Abraham A, Bianco S. Unveiling the Machinery behind Chromosome Folding by Polymer Physics Modeling. International Journal of Molecular Sciences. 2023; 24(4):3660. https://doi.org/10.3390/ijms24043660

Chicago/Turabian Style

Conte, Mattia, Andrea Esposito, Francesca Vercellone, Alex Abraham, and Simona Bianco. 2023. "Unveiling the Machinery behind Chromosome Folding by Polymer Physics Modeling" International Journal of Molecular Sciences 24, no. 4: 3660. https://doi.org/10.3390/ijms24043660

APA Style

Conte, M., Esposito, A., Vercellone, F., Abraham, A., & Bianco, S. (2023). Unveiling the Machinery behind Chromosome Folding by Polymer Physics Modeling. International Journal of Molecular Sciences, 24(4), 3660. https://doi.org/10.3390/ijms24043660

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop