Review

Centromeres under Pressure: Evolutionary Innovation in Conflict with Conserved Function

1
Dipartimento di Biologia e Biotecnologie “Charles Darwin”, Sapienza Università di Roma, 00185 Roma, Italy
2
Laboratory of Chromosome and Cell Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA
*
Author to whom correspondence should be addressed.
Genes 2020, 11(8), 912; https://doi.org/10.3390/genes11080912
Submission received: 7 July 2020 / Revised: 4 August 2020 / Accepted: 4 August 2020 / Published: 10 August 2020
(This article belongs to the Special Issue The Role of Centromeres in Genome Stability)

Abstract

Centromeres are essential genetic elements that enable spindle microtubule attachment for chromosome segregation during mitosis and meiosis. While this function is preserved across species, centromeres display an array of dynamic features, including: (1) rapidly evolving DNA; (2) wide evolutionary diversity in size, shape and organization; (3) evidence of mutational processes to generate homogenized repetitive arrays that characterize centromeres in several species; (4) tolerance to changes in position, as in the case of neocentromeres; and (5) intrinsic fragility derived from sequence composition and secondary DNA structures. Centromere drive underlies rapid centromere DNA evolution due to the “selfish” pursuit to bias meiotic transmission and promote the propagation of stronger centromeres. Yet, the origins of other dynamic features of centromeres remain unclear. Here, we review our current understanding of centromere evolution and plasticity. We also detail the mutagenic processes proposed to shape the divergent genetic nature of centromeres. Changes to centromeres are not simply evolutionary relics, but ongoing shifts that on the one hand promote centromere flexibility, but on the other can undermine centromere integrity and function, with potential pathological implications such as genome instability.

1. An Introduction to Centromere Diversity

In 1882, Walter Flemming observed the central structure that forms the primary constriction on mitotic chromosomes [1], later named the centromere [2]. Despite its early cytological discovery, the centromere remains a fascinating and rather mysterious region of the genome. A hundred years after Flemming’s observation, the smallest centromere, suitably named “point centromere”, was characterized by Louise Clarke and John Carbon in the budding yeast Saccharomyces cerevisiae [3], made of a single centromere-specific nucleosome [4]. Already from these early studies, two key and apparently contrasting aspects of centromere biology emerged: great heterogeneity in centromere DNA size, organization and structure across species [5,6], while holding an essential and evolutionarily conserved function in enabling chromosome segregation [7]. Centromeres can be broadly classified into different types (Table 1) based on relative size: (1) point centromeres, which are rare and only found in fungi; (2) regional centromeres, which are the most common type of centromere where a specific genomic region defines the centromere location (because regional centromeres can vary widely in size, a further sub-classification has been proposed between short (<40 kb) and long (>40 kb) regional centromeres [8]); (3) holocentric centromeres, which are diffused and encompass the entire chromosome (recently, single base pair resolution data have shown that holocentric organisms like C. elegans in reality consist of hundreds of budding yeast-like point centromeres in a “polycentric” set up); and (4) meta-polycentric centromeres, which are a recently-added, rare category where the centromeres are alternated and thus extended to cover a section of the chromosome. These categories that highlight the genetic diversity of centromeres are recapitulated in Table 1, and described in detail below.
A unified consensus for the centromere can be reached when describing its conserved and essential role: centromeres are necessary for the correct inheritance of genetic material by enabling chromosome attachment to the spindle microtubules during each round of cell division [22,23]. Centromeres as conditio sine qua non for genome inheritance are highlighted by the quest to engineer human artificial chromosomes (HACs). HACs require centromeric DNA, or centromere chromatin, in order to be stably transmitted over cellular generations [24].
Centromere specialization is primarily determined by a unique chromatin environment founded on the presence of a centromere-specific nucleosome containing the histone H3 variant protein centromere protein A (CENP-A) that serves as a docking template for centromere factor binding and mitotic kinetochore assembly, and epigenetically encodes the transgenerational inheritance and propagation of the centromeric locus [25]. Underscoring its essential and evolutionarily conserved function, homologs for CENP-A are found in many species throughout evolution and are studied in a variety of laboratory model organisms (Table 2) [26,27].
The centromere histone is preserved in flies as the centromere identifier (Cid) [32,43], in worms as histone H3-like centromeric protein (HCP-3) [36], in plants and fungi as centromeric histone 3 (CenH3), in fission yeast as centromere-specific histone H3 (Cnp1) [30], in budding yeast as chromosome segregation 4 (Cse4) [28], in mouse as Cenp-a [38] and in human as CENP-A [40,41], as well as in other species (Table 2). The ubiquity and conservation of the centromere-specific histone variant prompted the suggestion for a common designation of CenH3/CENP-A [44]. As more model organisms are being studied, our understanding of centromere epigenetic specification and its diversity broadens. Recent work in the garden pea Pisum sativum shows that it contains multiple copies of CenH3 protein to generate an extended primary constriction, defined as a “meta-polycentric centromere” with alternated CenH3 domains [16,45]. Similar scattered features are also seen in other organisms [46]. CENP-A is interspersed with the canonical H3-containing nucleosome in a way that is conserved from flies to humans [46], and forms high-density islands/sub-domains of CENP-A across the human centromeres, as reported from analyses of stretched chromatin fibers [47]. The peculiar centromere structure of Pisum was also found in another legume, Lathyrus, which has an additional copy of the CENH3 gene that was not seen in other phylogenetically-related species [48], underscoring centromere genetic and epigenetic diversity across even closely related species. While a structural relationship exists between human centromere proteins that mark the functional “centrochromatin” [49], CENP-A and other centromere proteins are amongst the fastest changing during evolution, with hyper-variable regions, divergence in length (Table 2) and divergence in overall sequence and domain composition (Figure 1).
Intriguingly, new evidence has demonstrated the absence of the largely conserved centromere histone in some organisms. CenH3-independent centromeres were found in the African sleeping sickness parasite Trypanosoma brucei [53] and in four lineages of insects, underscoring an ancient transition associated with a switch from regional or point centromeres to holocentric centromeres that was accompanied by loss of the centromere-specific histone [54]. This raises the question as to why some holocentric organisms retain a centromere-specific histone while others do not. Partly, it may relate to the conservation of kinetochore proteins present among holocentric and monocentric centromeres even in species where CenH3 is lost [55]. Retaining kinetochore assembly is the ultimate goal to enable centromere activity [56]. In the case of specific insect lineages, the holocentric centromeres devoid of CenH3 still present canonical kinetochore proteins, especially in the outer part where the kinetochore interfaces with microtubules [54,57]. Trypanosoma brucei remains to date an exception, showcasing extremely divergent outer kinetochore components defined as an “unconventional” kinetochore, which is made up of 20 apomorphic kinetoplastid kinetochore proteins (KKT1–20) not conserved across the other flagellated members of the monophyletic group of Euglenozoa [53,58]. The Trypanosoma “exception” challenges the assumption that centromere function is founded on its epigenetic specification. Other systems may exist where chromosome segregation is free from the imposed presence of CenH3, or even “canonical” kinetochore constraints [59]. Further investigations into CenH3 divergent evolution, the holocentromere condition and cases that lack epigenetic specification for centromeres will shed light on essential and universal requirements for chromosome segregation.
The wide diversity of centromere proteinaceous constituents is paralleled by the progressive mutability of underlying centromere DNA [60]. At the genetic level, centromere sequences are characterized by repetitive DNA, often rich in A/T nucleotides and arranged in tandem units as found in many organisms. The high representation of repeats across species implies a bias for reiterated DNA in supporting centromere formation and function [61]. Yet the finding by Voullaire et al. (1993) of an ectopic human centromere, a so-called neocentromere, on a marker chromosome 10 deprived of repetitive DNA brought the requirement for DNA repeats at centromeres under scrutiny [62]. Neocentromeres seem to have a sequence-independent formation [63], underscoring the epigenetic foundation of centromeres [64,65,66]. Alphoid-less centromeres likely originated from neocentromeres. An absence of satellite repeats was seen in the horse centromere on chromosome 11 (Equus caballus 11, ECA11) [67], in zebra for chromosomes 2, 5, 7, 13, 18–21 [68] and in the donkey centromeres 11 and 16 [69]. These satellite-free centromeres form primary constrictions and still guarantee segregation fidelity [70]. In particular, ECA11 is well conserved in the syntenic region in other mammals and its two internal regions of 136 and 99 kb both bind CENP-A and CENP-B [71], respectively, suggesting robust propagation even in the absence of satellite DNA repeats.
A reconciliation regarding the functionality of repetitive centromere sequences was offered by recent data pointing to a role for CENP-B in fulfilling centromere specification by stabilizing and partly recruiting CENP-C directly to the centromere in human cells depleted of CENP-A [72,73]. CENP-B is recruited to a specific consensus sequence, the CENP-B box present within human α-satellite repeats [74]. Thus, CENP-B-containing centromeres are specified by a concerted contribution of both CENP-A loading, in a sequence independent manner, and of CENP-B recruitment to the CENP-B box [75]. So, while epigenetically CENP-A is necessary and sufficient to establish a centromere in proliferating somatic cells [76], whether it is on a HAC [24] in an ectopic location [63,77] or on a lactose operon (LacO) array [78], recent evidence shows that CENP-B may be able to fully compensate for CENP-A in enabling centromere specification, formation, positioning and transgenerational inheritance [72,73] (Daniele Fachinetti and Sebastian Hoffman; personal communication).
Cis-acting α-satellite sequences are not sufficient to define a functional centromere. Indeed, “non-alphoid centromeres” have been found in plants [79,80], in birds [81], among Equidae subspecies (e.g., speciation between horse and donkey) [69,82,83], in different primate species [84] and in humans [85]. This means that new centromere sites are generated without a corresponding alteration in DNA organization and they are still undergoing repositioning. Indeed, new centromere formation could represent a way to insert inter- and intra-species diversity [86,87,88].
Ectopic centromere formation represents an opportunity to re-localize the centromere to a new position outside the endogenous site, giving rise to a functional neocentromere which enables cell division upon disruption of the endogenous centromere. The neocentromere can form at a distance from the endogenous centromere, as found within inverted duplications between a breakpoint and a telomere end [89]. The ability for kinetochore protein assembly on the new locus is assisted by CENP-A recruitment to the neocentromere [90]. Interestingly, chromosomes containing active neocentromeres can be maintained over generations, implying that the chromosomal positioning of the centromere region retains flexibility in its localization and can promote sister chromatid separation even when decentered or greatly shifted from the endogenous locus. Thus, the pliability in accommodating centromere functionality over diverse sequences and variable overall size also extends to adaptability to different locations along the chromosome [91]. Just as gene duplication is the first step toward divergence and functional innovation, the establishment of a new, competent centromere site outside of the endogenous locus offers flexibility and sustained functionality. Amongst the many plausible mechanisms for neocentromere formation, the recently reported ectopic CENP-A loading [92] and/or transient binding to DNA double strand breaks (DSBs) [93] may represent favorable sites for the initiation of neocentromere formation, the establishment of a functional de novo centromere [94,95,96,97] and for its stabilization during subsequent generations [98]. Leo et al. offer a detailed review in this Genes Centromere Stability special issue of the different models of neocentromere formation [99].
Following the evolutionary footsteps of centromere sequences and proteins can help unravel some of the aforementioned riddles and paradoxes in centromere biology. Here, we delve into the conflict between evolutionary and ongoing mutagenesis in centromere DNA and ask whether these processes may impact the conserved and essential functions of centromeres. How these seemingly detrimental mechanisms converge to undermine centromere function while also being important contributors to centromere biology and evolution will be discussed (Section 2).

2. Centromere Organizational Diversity in Light of Evolution

From the smallest and simplest centromere of Saccharomyces cerevisiae to the large and complex ones found in higher eukaryotes, including human megabase-sized ones, the evolutionary compulsion to sustain variability in order to exploit this locus for chromosome segregation is evident [100].
A case in point is the fast evolving “point” centromere of budding yeast S. cerevisiae with as little as ~125 bp (base pair) consensus AT-rich sequences [4,28,101].
Each centromere has three centromere DNA elements (centromere determining elements, CDEs) for the association of the centromere DNA binding protein complex: CDEI (~8 bp), CDEII (~78–86 bp) and CDEIII (~25 bp) (Figure 2A) [102,103,104,105,106]. Cse4 maps on the CDEII DNA element and forms a modified histone octamer, with different studies proposing a variety of models for this nucleosome: homotypic tetrasomes (Cse4/H4)2 [107], hexasomes with non-histone proteins (Cse4-H4/Scm3)2 [108], asymmetric/mixed octasomes (Cse4/H3/(H4/H2B/H2A)2) [109] and single right-handed hemisomes (CenH3/H4/H2A/H2B) wrapping the ~80 bp of DNA centromeric sequence [110,111].
In fission yeast Schizosaccharomyces pombe, the centromeric region is large relative to the total genome size, spanning 35–110 kb, of which ~4 kb represents a unique central sequence (cnt) flanked by two inverted repetitive sequences (ImrL and ImrR) (Figure 2B) [10,112].
Next based on overall size is the 420 kb repetitive centromere of Drosophila melanogaster, composed of over 85% satellite DNA interrupted by the presence of transposable elements (TE) (Figure 2C) [113].
A very similar composition of satellite DNA and centromeric transposable elements was also found in plants, such as Arabidopsis thaliana [114], Oryza sativa [115] and Zea mays [13]. Diversity among these plant satellite DNAs lies in the size of the basic unit and in the number of reiterations of these units that make up the centromeres, which range from 400 kb to 1.4 Mb. For instance, the Arabidopsis centromere has a 180 bp monomer (Figure 2D) [11,116], rice has a 155 bp satellite CentO unit [12] and maize contains a 156 bp satellite unit named CentC [13]. These repeated units, while divergent, all specifically bind well-characterized centromere proteins. Satellite sequences found in the mouse centromere also contain repetitive domains with distinct unit sizes [15,117]. The mouse centromere is organized into minor satellite DNA with a 120 bp homogenized unit that constitutes the core centromere region, and flanking major satellite DNA of pericentromeric heterochromatin that is made up of less-ordered 234 bp units (Figure 2E) [15,118]. In humans, the centromere is also distinct from the flanking pericentromere. The former is made up of tandemly organized repeats, called α-satellite DNA, while the latter is made of monomeric α-satellite units and other types of repeats. Within the core centromere, the 171 bp monomeric units of α-satellite DNA arranged in tandem share between 50% and 70% sequence identity. Several repeat units form a higher order repeat (HOR) block that is reiterated with a similarity of 97–100% to make up a homogenized array spanning several megabases, usually 2–5 Mb (Figure 2F) [16,61]. Notably, each human chromosome has a different number of monomers that make up its HOR, with some chromosome-specific sequences contained within the homogenized array. Thus, sequence diversity is not only found across species but also within species, across the karyotype.
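To give a sense of scale for the human figures quoted above, the following minimal sketch counts how many 171 bp monomers, and how many HOR copies, would fit in arrays of 2 Mb and 5 Mb. The 12-monomer HOR is a hypothetical value chosen only for illustration, since the number of monomers per HOR differs between chromosomes; the function and constant names are likewise illustrative.

```python
# Back-of-the-envelope arithmetic for a homogenized alpha-satellite array.
MONOMER_BP = 171      # alpha-satellite monomer length, from the text
HOR_MONOMERS = 12     # ASSUMPTION: hypothetical HOR size, varies by chromosome


def copies_in_array(array_bp: int, unit_bp: int) -> int:
    """Number of complete repeat units that fit in an array of array_bp."""
    return array_bp // unit_bp


hor_bp = HOR_MONOMERS * MONOMER_BP  # length of one hypothetical HOR block

for array_bp in (2_000_000, 5_000_000):  # the 2-5 Mb range from the text
    monomers = copies_in_array(array_bp, MONOMER_BP)
    hors = copies_in_array(array_bp, hor_bp)
    print(f"{array_bp} bp array: {monomers} monomers, {hors} HOR copies")
```

Under these assumptions, a typical array holds on the order of ten to thirty thousand monomers, which illustrates how many near-identical recombination partners a single centromere offers.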
In addition to the aforementioned regional centromeres, large or small (which we categorized as short and long regional centromeres, as in Table 1), there are other kinds of centromere genetic structures with less common organization, including those of organisms that have multiple or diffused centromeres. A striking example of a centromere that is intermediate between a monocentric (single) and a polycentric centromere is that of the garden pea, P. sativum. Similarly to other species equipped with satellite DNA, the P. sativum centromere is constructed on tandem repeated domains of 13 individual families of satellite DNA and one family of Ty3/gypsy retrotransposons (Figure 2G). The Pisum meta-polycentric centromere is then made up of 1–5 domains. This arrangement is reminiscent of the multiple centromeric arrays found on human chromosomes, where, however, only one array represents the active centromere that forms the kinetochore. Notably, the garden pea centromere is considered polycentric because multiple active arrays contribute to a linear-like kinetochore [17], unlike other centromeres where only one of the repetitive arrays is functional [119].
In addition to the monocentromere and meta-polycentric centromere described above, which have a defined site on each chromosome, the holocentromere is dispersed along the entire length of the chromosome with a non-localized kinetochore. The holocentric condition is found in several phyla, implying multiple distinct and independent occurrences during evolution [120]. Caenorhabditis elegans is a prime example of a holocentric organism, where the centromere encompasses the full length of the chromosome (14–21 Mb) (Figure 2H) [21], yet is still dependent on the H3-like centromere histone HCP-3 for chromosome segregation during mitosis [36,121]. Through the evolutionary lens, centromere organization looks somewhat stochastic, with different species having evolved their own particular way to adapt a centromere locus for chromosome segregation. Importantly, while centromeres can exist in different forms and arrangements, their purpose to achieve accurate division of genetic material is always accomplished [22,122].
Indeed, primary constriction size appears invariant and with a constant scale of magnitude from yeast to human [123]. Thus, despite the great evolutionary diversity and organization across eukaryotes, centromere function in chromosome segregation remains conserved.

Centromere Drive: From Conflicts to Benefits

A rapid and heterogeneous evolution of centromere components across eukaryotes appears at odds with their vital and conserved function [7,124]. Yet, these mutagenic changes must be in accord with a synchronized shift of centromeric elements that provide an evolutionary advantage. A plausible reason for this fast centromere evolution–adaptation paradox is elegantly provided by the “centromere drive” hypothesis formulated by Malik and Henikoff [7,125], where centromere DNA and protein components co-evolve under genetic conflict [126]. Centromere drive sees centromeres not only as essential regions of the genome during cell division, but also as “selfish genetic elements” that have an opportunity to play tug-of-war during the first asymmetric division (MI) in female meiosis and bias their transmission [126,127]. In fact, in the centromere drive model, the stronger centromeres segregate preferentially over their competitors. Their ability to exploit the asymmetry of oocyte meiosis, overthrowing Mendelian genetic laws [127], means that there is a Darwinian selection between centromeric variants for their transmission to the gametes and consequently for their inheritance, which underlies the constant genetic changes as a continued quest toward improved strength and favored inheritance. There are several examples supporting the validity of the centromere drive hypothesis. Recent elegant proofs were provided by the Lampson lab using crosses between mouse strains with different amounts of centromere proteins. The “stronger” centromere was preferentially inherited during female meiosis due to increased levels of kinetochore proteins contributing to the likelihood of transmission to the egg [128]. The presence of mutational changes in centromeric sequences is reconciled with simultaneous conformational changes in centromeric proteins, generating more microtubule attachment sites [129,130,131].
Lampson and collaborators set up a system to investigate the implication of changes in satellite DNA in recruiting the kinetochore complex. They found a 6–10-fold increment of minor satellite mouse centromeric repeats in “strong” centromeres compared to the “weaker” centromere mouse strain [129,132]. The size difference translates into increased retention of CENP-B protein on its DNA binding motifs, CENP-B box present on the minor satellite that consequentially recruits additional CENP-A proteins [133] and, in turn, is responsible for the robust assembly of the outer kinetochore for robust attachment to the asymmetric meiotic spindle [129]. The stronger centromeres are able to orient towards the egg pole and remain in the mature oocyte, winning a spot in self-propagation [128,133]. In addition to centromere DNA changes, meiosis can also be biased by other features, including spindle asymmetries [128].
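The population-level consequence of such biased transmission can be illustrated with a minimal deterministic sketch. It assumes a single autosomal locus with a "strong" (S) and a "weak" (W) centromere variant, random mating, no fitness cost, Mendelian transmission in males, and heterozygous females that pass S to the egg with probability k. The value k = 0.6 and all function names are hypothetical and are not taken from the studies cited above.

```python
# Toy model of centromere drive through female meiosis.
# ASSUMPTIONS: one locus, two variants (S strong, W weak), random mating,
# no fitness cost; k is a hypothetical transmission probability.

def next_generation(p_egg: float, p_sperm: float, k: float) -> tuple[float, float]:
    """Return S frequencies in eggs and sperm produced by the next generation."""
    ss = p_egg * p_sperm                                  # S/S zygotes
    sw = p_egg * (1 - p_sperm) + p_sperm * (1 - p_egg)    # S/W zygotes
    new_egg = ss + k * sw        # heterozygous females transmit S with prob. k
    new_sperm = ss + 0.5 * sw    # males transmit S with Mendelian prob. 0.5
    return new_egg, new_sperm


def simulate(p0: float, k: float, generations: int) -> list[float]:
    """Trajectory of the mean S frequency among gametes across generations."""
    p_egg = p_sperm = p0
    trajectory = [p0]
    for _ in range(generations):
        p_egg, p_sperm = next_generation(p_egg, p_sperm, k)
        trajectory.append((p_egg + p_sperm) / 2)
    return trajectory


if __name__ == "__main__":
    mendelian = simulate(p0=0.1, k=0.5, generations=100)
    driven = simulate(p0=0.1, k=0.6, generations=100)
    print(f"k=0.5: final S frequency {mendelian[-1]:.3f}")
    print(f"k=0.6: final S frequency {driven[-1]:.3f}")
```

With k = 0.5 the frequency of S stays at its starting value, whereas any k above 0.5 raises it every generation. The sketch deliberately omits the deleterious effects discussed next, which would counteract this spread.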
Even though this evidence elucidates the advantage of centromere evolutionary changes, deleterious effects must also be taken into consideration, including unbalanced segregation that could generate incompatible post-zygotic hybrids contributing to speciation [124,134].
Centromere rearrangements are protagonists in karyotypic divergence, as in the case of the horse and donkey. Changes in centromere repositioning created chromosomal structural variations that act like a “genetic barrier” between these two species due to the odd rate of meiotic chromosome recombination, which causes the gametogenic failure in mules [135].
To counter this constraint, CenH3 gene duplications are positively evolving, with the vast majority becoming pseudogenes and fixing in the population as they are able to adapt to the selection imposed by changes in centromeric sequences [136]. For instance, Mimulus aurantiacus displays many CenH3 duplication events under a divergent process in which paralogs differentiate with distinct sub-specialized functions [136]. CenH3 duplication and divergence are also seen in Drosophila, where five duplications of the Cid gene correlate with tissue-specific expression [60,137,138].
Thus, similarly to other evolutionary changes, centromere DNA and centromeric genes use duplications as a mechanism to mitigate rapid mutagenesis. Notably, this rapid evolution of centromere sequences and/or proteins is an irreversible process and on some occasions, it might turn into chromosomal instability [139].
In addition to the issue of speciation, changes at the centromere are not simply evolutionary relics that are now settled, but ongoing shifts in the context of centromere drive. Centromeres may be unstable regions of the genome not just on an evolutionary timescale, but also within the cellular lifetime [140]. Indeed, recombination and rearrangements were found to happen within a single cell cycle in human primary epithelial cells [141]. In Section 3, we will review the mutagenic processes that occurred to form the peculiar genetic structures of centromeres during evolution, and that may continue to undermine centromere stability during cell division.

3. Mapping Mutagenic Mechanisms by Following Their Evolutionary Footsteps on Centromere DNA

Centromere DNA is one of the fastest evolving sequences found within the eukaryotic genome. The repetitive nature of centromeres, often in head-to-tail orientation, implies that the repeat units were subjected to expansion and reiteration, followed by other rounds of mutagenesis, to enable formation of the region as we observe it today. To reconstruct the repetitive array, several simulations have been proposed to understand how mutagenesis acts on centromeres to shape their genetic structure. Recombination at the centromere seems obvious yet has remained counter-intuitive. Starting 80 years ago, evidence has accumulated demonstrating the negative effects of meiotic recombination within the centromere region [142,143] in different organisms [144]. A reduced level of recombination events at centromeric and immediately flanking sequences during meiosis has long been established, giving a reputation to centromeres as “cold” spots to recombination, as described by Andy Choo, who asked the question: “Why is the centromere so cold (to recombination)?” [145]. Highly condensed chromatin [146], as well as DNA methylation [147], has been thought to repress recombination in order to avoid instability within centromere DNA repeats. Extreme linkage disequilibrium for single nucleotide polymorphisms (SNPs) found at centromeres is another indicator of a low rate of recombination and crossing over events [148,149,150]. Yet, centromere DNA structure and the high degree of homology between satellites across chromosomes are strongly indicative of recombination-driven homogenization and evolution. In addition to evolutionary processes, recombination has been shown to happen at centromeres at relatively high levels during a single cellular generation, with specific factors contributing to its (at least partial) suppression [141,147].
Sister chromatid exchanges were detected in mouse [147] and in human cells [141] using a technique called Centromere-Chromosome Orientation-Fluorescent in situ Hybridization (Cen-CO-FISH) [151], and centromere proteins including human CENP-A contribute to repressing centromere rearrangements [141]. Intriguingly, recombination and other mutagenic processes may be promoted by intrinsic features of centromere repetitive DNA. Given the exceptional flexibility of centromeric repeats, altered topological conformation and secondary structures are likely to occur [142,152,153]. Emerging roles for centromere chromatin in mitigating centromere instability by reducing recombination [141] and transposition events, and possibly by suppressing DNA damage formation, indicate an interesting balance between intrinsic or programmed mutagenesis and epigenetic stabilization at the centromere.
On an evolutionary timescale, homogenization of centromeric repeats has been speculated to emerge precisely through short and long-range stochastic unequal exchange (Figure 3A) between sister chromatids. These were described in the Smith model [154] as non-reciprocal recombination between homologous sequences that are neutral to selection [155,156,157]. Similarly, the mechanism of gene conversion (GC) (Figure 3B) [158] is a unidirectional transfer of genetic information from an intact to a broken strand, and can readily account for centromere expansion driven by DNA damage. Depending on the length of GC tracts, they can be called short-tract gene conversions (STGC) for DNA segments ranging between 50 and 200 bp [159,160] or long-tract gene conversions (LTGC) for segments over 1 kb [161,162], with LTGC likely playing a role at large centromeres.
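The effect of an out-of-register exchange on a tandem array can be sketched with a toy example. Here each repeat unit is represented by a label, the two sister chromatids start as identical arrays, and the breakpoints are chosen by hand rather than drawn at random. The function name and unit labels are hypothetical, and this is an illustration of the principle rather than an implementation of the Smith model [154].

```python
# Toy illustration of unequal sister chromatid exchange on a tandem repeat array.
# ASSUMPTION: repeat units are abstracted as labels; breakpoints fall between units.

def unequal_exchange(chromatid_a: list[str], chromatid_b: list[str],
                     break_a: int, break_b: int) -> tuple[list[str], list[str]]:
    """Exchange the distal parts of two arrays broken at different registers.

    break_a and break_b are the number of units kept from the start of each
    chromatid. When they differ, one product gains units and the other loses them.
    """
    recombinant_1 = chromatid_a[:break_a] + chromatid_b[break_b:]
    recombinant_2 = chromatid_b[:break_b] + chromatid_a[break_a:]
    return recombinant_1, recombinant_2


# Five-unit array in which unit "u3*" carries a hypothetical sequence variant.
sister = ["u1", "u2", "u3*", "u4", "u5"]

# Misaligned by one unit: chromatid A breaks after unit 3, chromatid B after unit 2.
expanded, contracted = unequal_exchange(sister, sister, break_a=3, break_b=2)

print(expanded)    # six units, the variant unit is now present twice
print(contracted)  # four units, the variant unit has been lost
```

The total number of units is conserved across the two products, but one array expands and duplicates the variant while the other contracts and loses it. Repeated over many generations, events of this kind can spread or eliminate a variant across an array, which is the intuition behind recombination-driven homogenization.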
Generally, homology tracts are templates for the resolution of double Holliday junctions (HJ) and synthesis-dependent strand annealing (SDSA) during gene conversion. Both these intermediates are implicated as downstream processing for the resolution of DNA double-strand breaks (DSBs) through DNA damage repair (DDR) pathways. The origins of DSBs within centromere repeats remain unknown. We speculate that stochastic damage can be exacerbated by the intrinsic fragility of centromeres [140]. Another interesting source of DNA damage is represented by transposons. The occurrence of non-allelic gene conversion between duplicated TEs has been demonstrated [163,164] and, while CENP-A nucleosomes seem to play a role in suppressing these TE-mediated mutagenic events, TEs are thought to retain an active role that impacts the centromere genomic landscape [165,166]. The insertion of TEs and post-insertion events are thought to produce the homogenization of arrays seen among non-homologous chromosomes within the same cell [165]. Indeed, recent evidence in Monopterus albus shows that two TEs, called GYPSY5-ZM_I retrotransposable element of Zea mays and MuDR-13_VV DNA transposable element of Vitis vinifera, gave rise to the Monopterus albus satDNA repeats MALREP (MALREP-A, MALREP-B, and MALREP-C) through unequal crossing-over [167]. The same mechanism was previously observed in the P. sativum tandem repeat satellite PisTR-A, in which the long terminal repeats (LTRs) of the Ty3/gypsy Ogre retrotransposons represent the template for the amplification of satDNA arrays [168] and, thus, contribute to the origin of species-specific centromeric satellites [48,169]. Generation of a new centromere site has also been correlated with the pervasive transcription of TEs that recruit CENP-A through small RNAs called centromere repeat-associated short interacting RNAs (crasiRNAs) [166,169].
Given the recently appreciated role of centromere transcripts and transcription in centromere function [170], it is possible that TEs operate by inducing breaks and/or by inducing transcription, and both these processes may converge to promote centromere formation.
Gene conversion events are overrepresented in palindromic and reversed repetitive sequences [164]. DNA palindromes appear to be a feature of centromeres and pericentromeres in different species [171,172,173]. Palindromes also have the intrinsic potential to adopt non-B-DNA conformations, including Z-DNA, triplex, quadruplex and cruciform structures [174], again suggesting a multi-step challenge associated with DNA-based transactions like replication, transcription and repair processes at centromere repeats [175]. In addition to palindromes, there is a multitude of alternative DNA secondary structures that centromere repetitive DNA can assume, including non-B-DNA [153], triplexes and G-quadruplexes (G4) [176,177,178], i-motifs [179,180], hairpins [181] and loops found at human α-satellites [152]. These and other possible arrangements for three-dimensional DNA folding are expected to directly hinder the replication process as physical barriers. These impediments can also lower the affinity of DNA polymerase for the newly synthesized strand, causing out-of-register “replication slippage” (Figure 3C) [182]. Replication slippage has been speculated to contribute to centromere repeat amplification, and can provoke either replication fork stalling or collapse, generating a DSB and further promoting mutagenesis [183,184]. DSBs can be repaired through different pathways with specialized protein cascades and diverse outcomes. While DSB repair pathways have been extensively detailed, information on centromeric DSB repair is still lacking. Generally, non-homologous end joining (NHEJ) is a primary pathway of repair utilized throughout the cell cycle that promotes the rapid re-ligation of broken DNA ends without requiring extensive processing. NHEJ comprises canonical NHEJ (c-NHEJ) and alternative NHEJ (a-NHEJ). The latter can utilize micro-homology between the two broken ends for alignment between sequences of 1–16 nucleotides before rejoining [185].
NHEJ represents an error-prone repair solution, which leaves behind a mutational scar, but such a signature is not obviously observed within the available centromere sequences. Only once a homologous sequence is available after replication can the damaged locus be repaired by homologous recombination (HR). In S-phase and G2, approximately half of all DSBs become substrates for HR using the sister template. To date, it is unclear how HR is suppressed in G1 to prevent centromere recombination with homologous sequences in other chromosomes or within the same chromatid. Activation of HR relies on the generation of single-stranded DNA as the DSB is resected. HR, or homology-directed repair (HDR), encompasses different sub-pathways but commonly initiates with DNA resection, followed by strand invasion mediated by RecA (in bacteria) or Rad51 (in eukaryotes), which leads to the formation of a displacement loop (D-loop) and then a Holliday junction. A conservative form of HDR is synthesis-dependent strand annealing (SDSA) [186]. SDSA fills DSBs and inhibits crossing over [187]. Because centromeres actively undergo recombination during the mitotic cell cycle [141,147,188] and short- and long-range recombination events are speculated to drive centromere formation and evolution, HR likely represents an active mode of repair for centromere damage. However, this poses important questions on how the true sister sequence is faithfully recognized and distinguished from the many identical and matching sequences within the same chromatid or across chromosomes. Aberrant recombination would give rise to non-allelic exchanges, as we reviewed previously [140].
There are other forms of DNA damage repair whose mutational signatures have been associated with centromere DNAs. Replication fork failure, regression into so-called chicken foot structures and other stalled or collapsed fork conformations can also produce unusual HR substrates, where resolution of the one-ended DSB can be achieved through activation of break-induced replication (BIR) (Figure 3D) [189], or microhomology-mediated break-induced replication (MMBIR) in the case of non-sister templates [190,191]. BIR pathway activation on repetitive sequences can cause an out-of-register invasion, and the resolution of the D-loop leads to expansions and/or contractions of repeat arrays [192]. According to a recent report, centromere sequences seem to carry a mutational signature compatible with BIR [183].
As an alternative, circular 3′ ssDNA (single-stranded DNA) templates generated at the D-loop lead to the induction of rolling circle replication (RCR) (Figure 3E), which occurs preferentially within inverted repeat arrays, generating concatemers [193]. As a result, DNA repair protein RAD51 homolog 1 (RAD51) plays a central role in processing the Holliday junction (HJ) loop [194], principally with the aim of inhibiting single-strand annealing (SSA), an error-prone mechanism that anneals the homologous DNA sequences at the break without a gap, causing a sequence deletion (Figure 3F) [187,195]. SSA results in loss of DNA, where the 25-nucleotide strand annealing is followed only by polymerase filling and intermediate ligation [196,197]. Because many of these repair pathways are error-prone, they induce mutagenesis that may favor the evolution of centromere DNA (Figure 3).
Indeed, Rice [183] assigned a contribution to both BIR and SSA pathways in the plasticity of HORs. Contrary to Smith [146], intermingled alternation of CENP-A-enriched/centric core expansion by the BIR pathway during replication, and the length-eroding SSA pathway during the repair of DSBs have converged to enable the formation of homogenized HORs. The latter repair pathway (SSA) appears quite infrequently in centromeric and pericentromeric regions [183]. The large size of the HORs underscores this expansion [189,198]. Furthermore, there is a corresponding increase in CENP-A with expansion of HOR sequence arrays, which in turn leads to increased CENP-A deposition in the form of a positive feedback loop [199,200].
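The opposing effects of BIR and SSA on array length can be illustrated with a deliberately simplified sketch. It assumes a tandem array written as a string in which each character stands for one monomer type, so that "ABC" is a three-monomer HOR; the function names, the string encoding and the choice of the closest matching pair for annealing are illustrative assumptions, not part of the model in [183].

```python
def bir_out_of_register(array: str, break_idx: int, offset: int) -> str:
    """Toy break-induced replication on a tandem array (hypothetical helper).

    The strand broken at `break_idx` re-invades the sister template at
    `break_idx + offset` and copies to the end of the array. A negative
    offset re-copies units (expansion), a positive offset skips units
    (contraction), and zero restores the original array.
    """
    restart = break_idx + offset
    if not (0 <= break_idx <= len(array) and 0 <= restart <= len(array)):
        raise ValueError("break or restart position lies outside the array")
    return array[:break_idx] + array[restart:]


def ssa_deletion(array: str, break_idx: int) -> str:
    """Toy single-strand annealing at a break before position `break_idx`.

    Assumption: the closest pair of identical units flanking the break
    anneals. One copy is kept; the other copy and everything between the
    two are lost. If no identical pair flanks the break, nothing changes.
    """
    if not 0 <= break_idx <= len(array):
        raise ValueError("break position lies outside the array")
    best = None
    for i in range(break_idx - 1, -1, -1):        # units left of the break
        for j in range(break_idx, len(array)):    # units right of the break
            if array[i] == array[j] and (best is None or j - i < best[1] - best[0]):
                best = (i, j)
    if best is None:
        return array
    i, j = best
    return array[:i + 1] + array[j + 1:]


if __name__ == "__main__":
    hor_array = "ABCABCABC"                          # three copies of a toy HOR
    print(bir_out_of_register(hor_array, 5, -3))     # one HOR copy gained
    print(bir_out_of_register(hor_array, 5, 3))      # one HOR copy lost
    print(ssa_deletion(hor_array, 4))                # one HOR copy lost
```

In this sketch, out-of-register BIR can move the array length in either direction while preserving the HOR periodicity, whereas SSA can only shorten it, which mirrors the expansion and erosion roles described above.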
The aforementioned processes cause amplification, expansion and large-scale remodeling of the genomic landscape at the centromere. However, they must also be intersected by localized mutagenesis, including that which triggers divergence between monomers. In the example of the human centromere, individual α-satellite monomers share only 50–70% sequence identity with each other, while HOR blocks are nearly identical. Thus, large-scale processes may be rarer and have operated on a longer timescale than small-scale changes and micro-mutations that may continue to shape and diverge centromeres. Notably, BIR seems sufficient to create mutations within the replicated sequences (at a rate around 1000-fold higher than DNA replication without out-of-register forks [183,201]) and results in both long- and short-range changes.
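The contrast between divergent monomers and nearly identical HOR blocks can be made concrete with a small sketch. It assumes pre-aligned monomers of equal length and uses invented 12-nucleotide placeholder sequences rather than real α-satellite data; the function names are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of matching positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def identity_by_offset(monomers: list, offset: int) -> float:
    """Mean percent identity between monomers `offset` positions apart in the array."""
    if not 0 < offset < len(monomers):
        raise ValueError("offset must be between 1 and len(monomers) - 1")
    pairs = [(monomers[i], monomers[i + offset]) for i in range(len(monomers) - offset)]
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Invented placeholder monomers forming one toy three-monomer HOR.
    toy_hor = ["ACGTACGTACGT", "ACGAACGTTCGT", "TCGTACCTACGA"]
    array = toy_hor * 4  # four identical HOR copies in tandem
    print(identity_by_offset(array, 1))  # neighbouring monomers: divergent
    print(identity_by_offset(array, 3))  # monomers one HOR period apart: identical
```

Comparing monomers at an offset equal to the HOR period is one simple way to expose the periodicity: identity is high at multiples of the period and lower elsewhere.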
A supplementary mechanism to accomplish concomitant mutagenizing and homogenizing of the centromeric repeats is based on inter-chromosomal translocations guided by the organization and proximity of spatial repeats. A high percentage of translocation events has been demonstrated in centromeric homology inverted repeats (HIRs) of common progenitors of C. albicans and C. tropicalis, in which the loss of these inverted repeats provokes the formation of a new centromere. When the essential function of centromeric HIRs is missing, a CENP-A-rich zone influences the seeding of evolutionary new centromeres (ENCs) in order to reestablish the eroded centromere region [202]. The plasticity of the centromere in establishing itself at a completely new location adds another layer of complexity in tracking sequence generation through mutagenic processes, where sequences may originate from diverse and changing ancestral seeding DNA. Yet, these fitting simulations represent important points of reflection to gain a more profound and complete appreciation of the complexity in sustaining centromere evolution and maintenance. Much-needed empirical evidence from current sequencing efforts will uncover which of these processes operate within the repetitive satellites. Because mechanisms to suppress processes like HR are emerging [141,147], mutagenic processes, along with their mitigating pathways, will reveal how centromere DNA stability and evolution are maintained.

Formation of Human Centromeres through Evolutionary Mutagenesis

The DNA organization at human centromeres is a notable example of repeat amplification, homogenization and mutagenesis. One of the first studies on the evolution of human satellite DNA was advanced by Smith in 1976 [153], with the unequal sister crossover model used to describe the dynamic mutability shown by α-satellite repeats. In this model, the diverse nature of these repetitive sequences is driven by the balance among the rate of mitotic sister chromatid recombination (r), the rate of base pair mutation (u), and the minimum match length (m) required for unequal crossover [203,204].
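The interplay between r and u can be explored with a minimal simulation sketch. It is a toy under stated assumptions, not Smith's original formulation: repeat units are treated as atomic integer labels, so the minimum match length m is not modeled; mutation is applied per unit per generation; and only one of the two crossover products is followed. The function names and default parameter values are hypothetical.

```python
import random
from itertools import count


def unequal_crossover(array, rng, max_shift=3):
    """Return one product of an unequal sister exchange on a tandem array.

    The sisters are misaligned by `shift` units and exchange at a random
    breakpoint `b`; the followed product either gains or loses `shift` units.
    """
    n = len(array)
    if n < 2:
        return list(array)
    shift = rng.randint(1, min(max_shift, n - 1))
    b = rng.randrange(0, n - shift + 1)
    if rng.random() < 0.5:
        return array[:b + shift] + array[b:]   # units b..b+shift-1 duplicated
    return array[:b] + array[b + shift:]       # units b..b+shift-1 deleted


def simulate(n_units=20, generations=2000, r=0.1, u=0.001, seed=0):
    """Evolve an array of initially distinct units under mutation and unequal crossover."""
    rng = random.Random(seed)
    fresh = count(n_units)                     # supplies new, unseen variant labels
    array = list(range(n_units))
    for _ in range(generations):
        # Mutation: each unit becomes a new variant with probability u.
        array = [next(fresh) if rng.random() < u else unit for unit in array]
        # Recombination: one unequal exchange with probability r.
        if rng.random() < r:
            array = unequal_crossover(array, rng)
    return array


if __name__ == "__main__":
    final = simulate()
    print(len(final), "units,", len(set(final)), "distinct variants")
```

Under these assumptions, repeated duplication and deletion tends to reduce the number of distinct variants when r is large relative to u, whereas a larger u keeps introducing new variants; varying the two rates gives a qualitative feel for the homogenization the model describes.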
More recent advances in methodologies and sequencing have allowed the construction of centromere phylogenies to compare centromeres among different organisms, as well as within the same species. Intra- and inter-species analyses are very helpful tools for recognizing ancestral and new properties of centromere repeats, exposing evolutionary constraints and adaptive changes over different timescales [201]. In fact, even if the base substitution rate between chimpanzees and humans is only 1.2% in non-centromeric regions (whether or not there is over-repeated and non-repeated DNA [205]), there is a continuous rapid divergence that has been demonstrated through the hybridization of human centromeric DNA probes on the orthologous chimpanzee centromere sequences, suggesting that centromeres have a higher degree of divergence [206,207,208]. α-satellite DNA has been found in Old World Monkeys [209,210,211], in New World Monkeys [212,213] and in prosimians [214,215], where it maintains a monomeric, more disordered α-satellite organization [216,217,218]. Instead, α-satellite higher order structure (as found in human centromeres) is also present in our Great Ape relatives such as chimpanzees, gorillas [218,219], and orangutans [218,220]. This may reflect a very recent evolution of monomeric satellites into an upper level organization through homogenized HORs [221]. This is particularly interesting as pericentromeres retain monomeric, seemingly ancestral, α-satellite DNA interspersed with long interspersed nuclear elements (LINEs), short interspersed nuclear elements (SINEs) and other repetitive elements, suggesting that monomeric α-satellites served as an early template for the HOR homogenization that followed.
Alexandrov and colleagues advanced a very interesting model for the formation of HORs in Great Apes from an old ancestral monomer in lower primates [209]. Supposedly, the divergence of old monomers prior to the split among human, chimpanzee and gorilla gave rise to a monomer type able to bind CENP-B, creating three supra-chromosomal families (SF) in which the old and new monomers alternate [222,223]. In Great Apes, the new type of monomer is present in all chromosomes with some exceptions (e.g., the Y chromosome in humans), although these peculiar cases also have condensed structural organization [224]. In this model, HOR expansion and homogenization could arise through two different mechanisms: improper replication with the creation of multiple copies (such as rolling circle replication, Figure 3E) [225] and unequal crossovers/gene conversion events (Figure 3A,B) ([154] and [226], respectively). Given the shared layers of α-satellites between chromosomes, it is possible that the newest-born centromere within an old centromere promotes the sliding to the side of the old monomers [227]. New SF arrays, homogenized in chromosome-specific HORs, may facilitate the maintenance of higher order structure through the concomitant recruitment of DNA binding proteins [228]. The integration of the CENP-B box within the HOR array could facilitate kinetochore assembly, yet the reason for its absence from the Y chromosome remains unclear [229]. The kinetochore-associated recombination machine (KARM) is proposed to have a role in homogenizing functional centromeres through topoisomerase II-induced breaks that are subsequently repaired by recombination [227].
While evolutionary processes underlying centromere divergence remain unclear [7], a new attractive model was recently provided by Rice [183] by assigning a contribution to all cellular processes involved in the plasticity of HORs, as if HORs have their own molecularly encoded life cycle. The steady drafting of HOR array extension and organization promotes a continued expansion, rather than shrinkage, to generate megabases of homogenized HORs, while SSA contributes to diversity between the individual units [183]. For the longest centromere, the overall size can reach up to 8 Mb [230]. This rapid increment in HOR size cannot be justified solely through antiparallel and unbalanced exchanges between sister chromatids, first due to the exceptional variation found in sex chromosomes and second due to the conserved head-to-tail orientation in all centromeric HORs. Their homogenization seems principally due to replication-associated repair processes that contribute to length diversification and homogenization of the HOR array [183].
The model’s structural frame is based on the spatial organization of three types of ~170 bp monomeric repeat units [231,232] that are predicted to influence centromere strength (i.e., the level of outer kinetochore proteins): (1) one with a protein-binding sequence at its 5′ end (the 17 bp b-box that binds CENP-B), (2) a second that is identical to the first except that the CENP-B-box is mutated so that it no longer binds CENP-B, and (3) a third lacking CENP-B docking site altogether [193].
Among these three monomeric repetitive units, intra-array competition exists. It is based on the capability of centromeric core repeats to extend and migrate towards the flanking heterochromatin region, counteracting it. Thus, this new and interesting model highlights the contrasting forces and high level of evolution caused by the amplification (BIR process), shrinking (SSA process) and homogenization of HORs [183].
Inside human HORs, the number of monomers ranges from two (as in chromosome 1 [233]) to 34 (as in chromosome Y) [224,234]. The sequence of monomers has up to 35% variability among chromosomes and within the same chromosome [235], indicating that the formation of HORs followed a different mutagenic process than HOR amplification through homogenization. Although the human HOR on the Y chromosome possesses alphoid DNA sequences, it differs from the other HORs on autosomes and X chromosomes because it lacks CENP-B boxes [235], indicating that CENP-B is not essential for a functional centromere [72,219]. Notably, some younger HORs with more homogenized monomers [236] that have yet to accumulate additional mutations and SNPs are shared among non-homologous autosomes [237], as for the chromosome groups 1, 5, 19-13, 21-14 and 22 [202]. Some of these sequences are regarded as “pan-centromeric” and are often used for the rapid detection of multiple centromeres in different chromosomes. The fact that we can distinguish between younger and older HORs based on mutational burden implies that either: (1) centromeres are exposed to genetic changes at a high rate, or (2) mechanisms that protect centromeres mitigate these events yet are not foolproof, leading to the progressive accumulation of mutations.
While chromosomes can contain more than one centromere array with its own set of HORs [238], Sullivan and colleagues have highlighted the striking example of metastable epialleles found on chromosome 17, where three contiguous unique Chr17-specific α-satellite HOR arrays (D17Z1, D17Z1-B, and D17Z1-C) are found within the centromeric region, but only one array is active at any given time [239]. This helps to prevent errors in nucleating the kinetochore and segregating chromosomes during cell division. Interestingly, all arrays still have the ability to recruit CENP-A, acting like epialleles. Yet in the majority of individuals across the human population, the active centromere forms on the main array containing less inter-HOR variation [239]. These data indicate that the homogenization of HOR is functionally important to support centromere function [119,154,239]. As the homogenization of HORs relies on replication fork collapse and re-initiation of replication through BIR and SSA repair processes [183,202], there could be a process in place for a continuous HOR life cycle, beginning with the expansion of α-satellite units as monomers, dimers, and multimeric units, up to full HOR amplification.
Even if these processes are valid and important attempts at placing repetitive pieces of a large puzzle together, experimental evidence is needed to validate their action within human centromeres.

4. Changing Identity: Pathological Consequences of Rapid Centromere Evolution

Changes in centromere DNA can reasonably induce chromosome segregation errors and result in chromosome instability (CIN) [140,240]. However, evidence for a direct link between changes in centromere DNA and segregation errors is lacking. Recent work points to centromere size not being a determining feature in aneuploidy, while centromere-specific DNA features such as the presence and density of CENP-B boxes play a more important role in contributing to centromere function in chromosome segregation [241]. Thus, rapid centromere sequence changes, putative mutagenic processes and intrinsic fragility that converge to undermine centromere repeats would reasonably need to be mitigated to prevent functional disruptions. For instance, a dramatically eroded centromere may no longer support chromosome segregation, although no defined threshold currently exists for an “optimal” centromere length to perform its function, nor for ribosomal DNA (rDNA) or telomeres. Additionally, DNA changes may impact the recruitment and retention of CENP-A, and may disrupt the CENP-B box needed to recruit CENP-B and other essential centromeric components. In addition to size and sequence composition, there are other perilous features, like secondary structure and RNA:DNA hybrids (R-loops), that may need addressing to maintain centromere stability [140,152]. Interestingly, in addition to representing burdens for replication, single-stranded DNA in R-loops facilitates homologous sequence matching for BIR crossover [242], eventually resulting in gross chromosomal rearrangements (GCRs) [243], a series of processes that may be happening at centromeres.
The pericentromere region is responsible for sister chromatid cohesion and has been found to contribute to centromere integrity [240]. Except for the pericentromeric region of S. cerevisiae, where there is a cohesin enrichment [244,245], the pericentromere sequences of fission yeast and other organisms possess heterochromatic characteristics such as high methylation, H3 Lys 9 methylation, cohesin enrichment and the presence of heterochromatin protein 1 (HP1) [246]. Heterochromatin condensation inhibits Transcription elongation factor S-II (Tfs1)-promoted transcription, preventing deleterious transcription–replication conflicts, R-loops and centromere rearrangements [243]. This is reminiscent of mouse centromere recombination being suppressed by the DNA methyltransferases 3 α and β (Dnmt3a/b) that contribute to heterochromatin silencing [147], yet it remains unclear how the cross-talk between the centromere and pericentromere occurs. Hypomethylation of centromeric repeats, due to loss-of-function of the de novo methyltransferase DNMT3b, increases DSBs following nucleotide excision repair (NER), which specializes in removing R-loop structures [247]. Mutations in DNMT3b are associated with human immunodeficiency–centromeric instability–facial anomalies (ICF) syndrome [248], as are mutations in other genes such as CDCA7 [249,250,251], HELLS [249,250,251] and ZBTB24 [252]. DNA hypomethylation of the pericentric heterochromatin of chromosomes 1, 9 and 16 gives rise to peculiar stretched centromeres and chromosome instability in ICF patients [253]. Collectively, this evidence suggests a relationship between transcription, R-loops and other epigenetic features that could either facilitate or undermine the maintenance of centromere stability.
Centromere epigenetics may directly influence sequence stability, similarly to how a delicate balance between methylation and acetylation aids CENP-A loading within the centromeric domain [254]. Epigenetic state, nucleosome histone dynamics and changes in specific post-translational modifications (PTMs) impact centromere stability. Because loss of CENPs, especially CENP-A, CENP-C and CENP-T/W, triggers centromere rearrangements [141], maintenance of the epigenetic and proteinaceous components of centromeres is key to the stability of DNA repeats. Notably, the proper localization and methylation of CENP-A is essential for cell growth and for the prevention of chromosome instability, together with p53 [255]. In addition, centromere α-satellite stability is compromised in cancer cell lines and in primary cells undergoing senescence [141].
Recent evidence indicates that a shift in CENP-A localization or its depletion leads to a change in chromatin status that could interfere with local and long-range transcription processes [243]. This was seen during cellular senescence and leads to CENP-A mislocalization and mitotic arrest [255] in aging cells [256,257,258,259], in cells overexpressing Myc proto-oncogene protein (MYC), under an ectopic interaction with CENP-A [260], and with a corresponding de-repression of centromeric TEs (frequently observed in many cancers) [261,262]. The aberrant transcription of retrotransposons in pericentromeric human satellite II (HSATII) repeats leads to an increased accumulation of centromere RNAs [263], often seen in cancer [169,264,265,266,267]. Thus, temporal and spatial control of transcription may limit the emergence of breaks at centromeres [267,268]. Seemingly, replication timing of the centromere region may be evolutionarily set for spatial purposes [228]. In accordance with DuPraw’s model [269], the centromere is a late replicating region [92,270] and both the centromere-specific histone H3-like protein CENP-A [271] and CENP-B [272] may contribute to completing the replication process. Yet, the origins, mechanisms and consequences of the replication dynamics at these repetitive regions are poorly understood, and whether replication stress may in turn lead to breakage and, as previously described, trigger catastrophic rearrangements, is unclear [140]. Once a break is generated, fork stalling and template switching (FoSTeS), non-allelic homologous recombination (NAHR), BIR and MMBIR pathways may repair the chromosomal break and produce unbalanced translocations, isochromosomes, acentric chromosomes generating fragment loss, ring chromosomes, dicentric chromosomes, Robertsonian translocations, pseudo-dicentric chromosomes and other gross chromosomal rearrangements, leading to aneuploidy (for further information, see [140,240]).
Because these genomic aberrations represent a potential source of instability with numerical and structural alterations found in multiple cancers [240], fully understanding their molecular origins is of great importance.

5. Conclusions

Centromeres hold multiple paradoxes, including rapidly evolving DNA and molecular players, tolerance to changes in position and size, evidence of profound mutational processes, and intrinsic fragility dictated by repetitive DNA and possibly secondary structures, all while maintaining a fundamental and conserved function. Interestingly, centromere sequences reveal the preferential accumulation of tandem repeats and a conserved epigenetic identity as the driving force for maintaining centromere function in spite of their high mutational rate. Such molecular processes affect the clustering of satellite DNA and its higher order assembly, but it is not exactly clear at which level they operate and especially what the mechanisms are that preserve centromere DNA stability and mitigate ongoing mutagenesis. Because of its genetic variability, the unifying definition of a centromere refers to its functionality in enabling chromosome segregation. Under evolutionary forces, each organism modulated this essential structure based on its evolutionary benefits, mitigating drawbacks and tolerating adaptation. Thus, the changes in DNA content, size and positional shifts, as well as the three-dimensional arrangements of increasing complexity that can converge to sustain a loop of mutagenesis that feeds centromere evolution, are either an advantage or a hindrance depending on the timescale snapshot (within a few generations or selected over millions of years). Looking at centromere abnormalities widely found in cancer and other disorders, the precarious equilibrium between rapid changes and functional preservation in the quest for the sustained propagation of centromeres likely comes at a cost in conflict with DNA stability. We do not yet know the complete journey that makes a centromere so, but it is certainly an exciting and eventful one that we hope will soon fully emerge.

Funding

This research was supported by La Sapienza University grants 2017/2018 to E.B. S.G.’s work is supported by an NIH Centromere Stability R01 grant to the Laboratory of Chromosome and Cell Biology, Rockefeller University.

Acknowledgments

The author is deeply thankful to Laura Fanti for critical reading and disclosing unpublished information. We thank Dani Fachinetti and Sebastian Hoffman for sharing unpublished data. Thanks to Alistair Field, Seneca Jason and Sofia Elisabeth for assistance with the writing of this manuscript during the SARS-CoV-2 pandemic.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Flemming, W. Zellsubstanz, Kern und Zelltheilung. DMW—Dtsch. Med. Wochenschr. 1883, 9, 342.
  2. Darlington, C.D.; Hallpike, C.S.; Hartridge, H.; Rawdon-Smith, A.F. The external mechanics of the chromosomes I—The scope of enquiry. Proc. R. Soc. Lond. Ser. B: Biol. Sci. 1936, 121, 264–273.
  3. Carbon, J.; Clarke, L. Structural and Functional Analysis of a Yeast Centromere (CEN3). J. Cell Sci. 1984, 1984, 43–58.
  4. Furuyama, S.; Biggins, S. Centromere identity is specified by a single centromeric nucleosome in budding yeast. Proc. Natl. Acad. Sci. USA 2007, 104, 14706–14711.
  5. Willard, H.F. Centromeres: The missing link in the development of human artificial chromosomes. Curr. Opin. Genet. Dev. 1998, 8, 219–225.
  6. Malik, H.S. Conflict begets complexity: The evolution of centromeres. Curr. Opin. Genet. Dev. 2002, 12, 711–718.
  7. Henikoff, S.; Ahmad, K.; Malik, H.S. The Centromere Paradox: Stable Inheritance with Rapidly Evolving DNA. Science 2001, 293, 1098–1102.
  8. Mandal, S.S. Gene Regulation, Epigenetics and Hormone Signaling; Wiley-VCH Verlag GmbH & Co. KGaA: Weinheim, Germany, 2017.
  9. Sanyal, K.; Baum, M.; Carbon, J. Centromeric DNA sequences in the pathogenic yeast Candida albicans are all different and unique. Proc. Natl. Acad. Sci. USA 2004, 101, 11374–11379.
  10. Wood, V.; Gwilliam, R.; Rajandream, M.A.; Lyne, M.; Lyne, R.; Stewart, A.; Sgouros, J.; Peat, N.; Hayles, J.; Baker, S.; et al. The genome sequence of Schizosaccharomyces pombe. Nature 2002, 415, 871–880.
  11. Copenhaver, G.P. Genetic Definition and Sequence Analysis of Arabidopsis Centromeres. Science 1999, 286, 2468–2474.
  12. Cheng, Z.; Dong, F.; Langdon, T.; Ouyang, S.; Buell, C.R.; Gu, M.; Blattner, F.R.; Jiang, J. Functional Rice Centromeres Are Marked by a Satellite Repeat and a Centromere-Specific Retrotransposon. Plant Cell 2002, 14, 1691–1704.
  13. Ananiev, E.V.; Phillips, R.L.; Rines, H.W. Chromosome-specific molecular organization of maize (Zea mays L.) centromeric regions. Proc. Natl. Acad. Sci. USA 1998, 95, 13073–13078.
  14. Murphy, T.D.; Karpen, G.H. Localization of Centromere Function in a Drosophila Minichromosome. Cell 1995, 82, 599–609.
  15. Kipling, D.; Ackford, H.E.; Taylor, B.A.; Cooke, H.J. Mouse minor satellite DNA genetically maps to the centromere and is physically linked to the proximal telomere. Genomics 1991, 11, 235–241.
  16. Lo, A.W.I.; Craig, J.M.; Saffery, R.; Kalitsis, P.; Irvine, D.V.; Earle, E.; Magliano, D.J.; Choo, K.H.A. A 330 kb CENP-A binding domain and altered replication timing at a human neocentromere. EMBO J. 2001, 20, 2087–2096.
  17. Neumann, P.; Navrátilová, A.; Schroeder-Reiter, E.; Koblížková, A.; Steinbauerova, V.; Chocholová, E.; Novak, P.; Wanner, G.; Macas, J. Stretching the Rules: Monocentric Chromosomes with Multiple Centromere Domains. PLoS Genet. 2012, 8, e1002777.
  18. Barlow, P.W.; Nevin, D. Quantitative karyology of some species of Luzula. Plant Syst. Evol. 1976, 125, 77–86.
  19. The International Silkworm Genome; The International Silkworm Genome Consortium. The genome of a lepidopteran model insect, the silkworm Bombyx mori. Insect Biochem. Mol. Biol. 2008, 38, 1036–1045.
  20. Kawamoto, M.; Jouraku, A.; Toyoda, A.; Yokoi, K.; Minakuchi, Y.; Katsuma, S.; Fujiyama, A.; Kiuchi, T.; Yamamoto, K.; Shimada, T. High-quality genome assembly of the silkworm, Bombyx mori. Insect Biochem. Mol. Biol. 2019, 107, 53–62.
  21. Waterston, R. Genome Sequence of the Nematode C. elegans: A Platform for Investigating Biology. Science 1998, 282, 2012–2018.
  22. McKinley, K.L.; Cheeseman, I.M. The molecular basis for centromere identity and function. Nat. Rev. Mol. Cell Biol. 2015, 17, 16–29.
  23. Talbert, P.B.; Henikoff, S. What makes a centromere? Exp. Cell Res. 2020, 389, 111895.
  24. Logsdon, G.A.; Gambogi, C.W.; Liskovykh, M.A.; Barrey, E.J.; Larionov, V.; Miga, K.H.; Heun, P.; Black, B.E. Human Artificial Chromosomes that Bypass Centromeric DNA. Cell 2019, 178, 624–639.
  25. Sullivan, K.F. A solid foundation: Functional specialization of centromeric chromatin. Curr. Opin. Genet. Dev. 2001, 11, 182–188.
  26. Talbert, P.B.; Henikoff, S. Histone variants—Ancient wrap artists of the epigenome. Nat. Rev. Mol. Cell Biol. 2010, 11, 264–275.
  27. Saffery, R.; Earle, E.; Irvine, D.V.; Kalitsis, P.; Choo, K.H.A. Conservation of centromere protein in vertebrates. Chromosom. Res. 1999, 7, 261–265.
  28. Meluh, P.B.; Yang, P.; Glowczewski, L.; Koshland, D.; Smith, M. Cse4p Is a Component of the Core Centromere of Saccharomyces cerevisiae. Cell 1998, 94, 607–613.
  29. UniProt. Available online: https://www.uniprot.org/uniprot/P36012 (accessed on 17 June 2020).
  30. Takahashi, K.; Chen, E.S.; Yanagida, M. Requirement of Mis6 Centromere Connector for Localizing a CENP-A-Like Protein in Fission Yeast. Science 2000, 288, 2215–2219.
  31. UniProt. Available online: https://www.uniprot.org/uniprot/Q9Y812 (accessed on 17 June 2020).
  32. Henikoff, S.; Ahmad, K.; Platero, J.S.; Van Steensel, B. Heterochromatic deposition of centromeric histone H3-like proteins. Proc. Natl. Acad. Sci. USA 2000, 97, 716–721.
  33. UniProt. Available online: https://www.uniprot.org/uniprot/Q9V6Q2 (accessed on 17 June 2020).
  34. Lermontova, I.; Schubert, V.; Fuchs, J.; Klatte, S.; Macas, J.; Schubert, I. Loading of Arabidopsis Centromeric Histone CENH3 Occurs Mainly during G2 and Requires the Presence of the Histone Fold Domain. Plant Cell 2006, 18, 2443–2451.
  35. UniProt. Available online: https://www.uniprot.org/uniprot/Q8RVQ9-1 (accessed on 17 June 2020).
  36. Buchwitz, B.J.; Ahmad, K.; Moore, L.L.; Roth, M.B.; Henikoff, S. A histone-H3-like protein in C. elegans. Nature 1999, 401, 547–548.
  37. UniProt. Available online: https://www.uniprot.org/uniprot/P34470 (accessed on 17 June 2020).
  38. Kalitsis, P.; Macdonald, A.C.; Newson, A.J.; Hudson, D.F.; Choo, K. Gene Structure and Sequence Analysis of Mouse Centromere Proteins A and C. Genomics 1998, 47, 108–114.
  39. UniProt. Available online: https://www.uniprot.org/uniprot/O35216 (accessed on 17 June 2020).
  40. Sullivan, K.F.; Hechenberger, M.; Masri, K. Human CENP-A contains a histone H3 related histone fold domain that is required for targeting to the centromere. J. Cell Biol. 1994, 127, 581–592.
  41. Earnshaw, W.C.; Rothfield, N. Identification of a family of human centromere proteins using autoimmune sera from patients with scleroderma. Chromosoma 1985, 91, 313–321.
  42. UniProt. Available online: https://www.uniprot.org/uniprot/P49450 (accessed on 17 June 2020).
  43. Mellone, B.G.; Grive, K.J.; Shteyn, V.; Bowers, S.R.; Oderberg, I.; Karpen, G. Assembly of Drosophila Centromeric Chromatin Proteins during Mitosis. PLoS Genet. 2011, 7, e1002068.
  44. Talbert, P.B.; Henikoff, S. Phylogeny as the basis for naming histones. Trends Genet. 2013, 29, 499–500.
  45. Neumann, P.; Pavlíková, Z.; Koblížková, A.; Fuková, I.; Jedličková, V.; Novak, P.; Macas, J. Centromeres off the Hook: Massive Changes in Centromere Size and Structure Following Duplication of CenH3 Gene in Fabeae Species. Mol. Biol. Evol. 2015, 32, 1862–1879.
  46. Blower, M.D.; Sullivan, B.A.; Karpen, G. Conserved Organization of Centromeric Chromatin in Flies and Humans. Dev. Cell 2002, 2, 319–330.
  47. Sullivan, B.A.; Karpen, G.H. Centromeric chromatin exhibits a histone modification pattern that is distinct from both euchromatin and heterochromatin. Nat. Struct. Mol. Biol. 2004, 11, 1076–1083.
  48. Robledillo, L.Á.; Neumann, P.; Koblížková, A.; Novák, P.; Vrbová, I.; Macas, J. Extraordinary Sequence Diversity and Promiscuity of Centromeric Satellites in the Legume Tribe Fabeae. Mol. Biol. Evol. 2020.
  49. Earnshaw, W.C.; Migeon, B.R. Three related centromere proteins are absent from the inactive centromere of a stable isodicentric chromosome. Chromosoma 1985, 92, 290–296.
  50. Black, B.E.; Foltz, D.R.; Chakravarthy, S.; Luger, K.; Woods, V.L.; Cleveland, D.W. Structural determinants for generating centromeric chromatin. Nature 2004, 430, 578–582.
  51. Goutte-Gattat, D.; Shuaib, M.; Ouararhni, K.; Gautier, T.; Skoufias, D.A.; Hamiche, A.; Dimitrov, S. Phosphorylation of the CENP-A amino-terminus in mitotic centromeric chromatin is required for kinetochore function. Proc. Natl. Acad. Sci. USA 2013, 110, 8579–8584.
  52. Incenp. Available online: https://incenp.org/research/cenpa.html (accessed on 26 June 2020).
  53. Akiyoshi, B.; Gull, K. Discovery of unconventional kinetochores in kinetoplastids. Cell 2014, 156, 1247–1258. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Drinnenberg, I.A.; deYoung, D.; Henikoff, S.; Malik, H.S. Recurrent loss of CenH3 is associated with independent transitions to holocentricity in insects. eLife 2014, 3. [Google Scholar] [CrossRef]
  55. Oegema, K.A.; Hyman, A. Cell division. WormBook 2006, 1–40. [Google Scholar] [CrossRef]
  56. Cortes-Silva, N.; Ulmer, J.; Kiuchi, T.; Hsieh, E.; Cornilleau, G.; Ladid, I.; Dingli, F.; Loew, D.; Katsuma, S.; Drinnenberg, I.A. CenH3-Independent Kinetochore Assembly in Lepidoptera Requires CCAN, Including CENP-T. Curr. Biol. 2020, 30, 561–572.e10. [Google Scholar] [CrossRef]
  57. Ross, B.D.; Rosin, L.; Thomae, A.; Hiatt, M.A.; Vermaak, D.; De La Cruz, A.F.A.; Imhof, A.; Mellone, B.G.; Malik, H.S. Stepwise Evolution of Essential Centromere Function in a Drosophila Neogene. Science 2013, 340, 1211–1214. [Google Scholar] [CrossRef] [Green Version]
  58. Nerusheva, O.; Akiyoshi, B. Divergent polo box domains underpin the unique kinetoplastid kinetochore. Open Biol. 2016, 6, 150206. [Google Scholar] [CrossRef] [Green Version]
  59. D’Archivio, S.; Wickstead, B. Trypanosome outer kinetochore proteins suggest conservation of chromosome segregation machinery across eukaryotes. J. Cell Biol. 2016, 216, 379–391. [Google Scholar] [CrossRef]
  60. Talbert, P.B.; Bryson, T.D.; Henikoff, S. Adaptive evolution of centromere proteins in plants and animals. J. Biol. 2004, 3, 18. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  61. Aldrup-MacDonald, M.E.; Sullivan, B.A. The Past, Present, and Future of Human Centromere Genomics. Genes 2014, 5, 33–50. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  62. Voullaire, L.E.; Slater, H.R.; Petrovic, V.; Choo, K.H. A functional marker centromere with no detectable α-satellite, satellite III, or CENP-B protein: Activation of a latent centromere? Am. J. Hum. Genet. 1993, 52, 1153–1163. [Google Scholar] [PubMed]
  63. Marshall, O.J.; Chueh, A.; Wong, L.H.; Choo, K.H.A. Neocentromeres: New Insights into Centromere Structure, Disease Development, and Karyotype Evolution. Am. J. Hum. Genet. 2008, 82, 261–282. [Google Scholar] [CrossRef] [Green Version]
  64. Westhorpe, F.G.; Straight, A.F. The Centromere: Epigenetic Control of Chromosome Segregation during Mitosis. Cold Spring Harb. Perspect. Biol. 2014, 7, a015818. [Google Scholar] [CrossRef] [Green Version]
  65. Sullivan, K.F.; Glass, C.A. CENP-B is a highly conserved mammalian centromere protein with homology to the helix-loop-helix family of proteins. Chromosoma 1991, 100, 360–370. [Google Scholar] [CrossRef]
  66. Tomkiel, J.; Cooke, C.A.; Saitoh, H.; Bernat, R.L.; Earnshaw, W.C. CENP-C is required for maintaining proper kinetochore size and for a timely transition to anaphase. J. Cell Biol. 1994, 125, 531–545. [Google Scholar] [CrossRef]
  67. Giulotto, E.; Raimondi, E.; Sullivan, K.F. The Unique DNA Sequences Underlying Equine Centromeres. Adv. Biochem. Eng. Biotechnol. 2017, 56, 337–354. [Google Scholar] [CrossRef]
  68. Piras, F.M.; Nergadze, S.G.; Magnani, E.; Bertoni, L.; Attolini, C.; Khoriauli, L.; Raimondi, E.M.C.; Giulotto, E. Uncoupling of Satellite DNA and Centromeric Function in the Genus Equus. PLoS Genet. 2010, 6, e1000845. [Google Scholar] [CrossRef] [Green Version]
  69. Nergadze, S.G.; Piras, F.M.; Gamba, R.; Corbo, M.; Cerutti, F.; McCarter, J.G.; Cappelletti, E.; Gozzo, F.; Harman, R.M.; Antczak, D.F.; et al. Birth, evolution, and transmission of satellite-free mammalian centromeric domains. Genome Res. 2018, 28, 789–799. [Google Scholar] [CrossRef] [Green Version]
  70. Roberti, A.; Bensi, M.; Mazzagatti, A.; Piras, F.M.; Nergadze, S.G.; Giulotto, E.; Raimondi, E.M.C. Satellite DNA at the Centromere is Dispensable for Segregation Fidelity. Genes 2019, 10, 469. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Wade, C.M.; Giulotto, E.; Sigurdsson, S.; Zoli, M.; Gnerre, S.; Imsland, F.; Lear, T.L.; Adelson, D.L.; Bailey, E.; Bellone, R.R.; et al. Genome Sequence, Comparative Analysis, and Population Genetics of the Domestic Horse. Science 2009, 326, 865–867. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  72. Fachinetti, D.; Han, J.S.; McMahon, M.A.; Ly, P.; Abdullah, A.; Wong, A.J.; Cleveland, D.W. DNA Sequence-Specific Binding of CENP-B Enhances the Fidelity of Human Centromere Function. Dev. Cell 2015, 33, 314–327. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  73. Hoffmann, S.; Dumont, M.; Barra, V.; Ly, P.; Nechemia-Arbely, Y.; McMahon, M.A.; Hervé, S.; Cleveland, D.W.; Fachinetti, D. CENP-A Is Dispensable for Mitotic Centromere Function after Initial Centromere/Kinetochore Assembly. Cell Rep. 2016, 17, 2394–2404. [Google Scholar] [CrossRef] [Green Version]
  74. Earnshaw, W.C.; Sullivan, K.F.; Machlin, P.; Cooke, C.; Kaiser, D.; Pollard, T.; Rothfield, N.; Cleveland, D.W. Molecular cloning of cDNA for CENP-B, the major human centromere autoantigen. J. Cell Biol. 1987, 104, 817–829. [Google Scholar] [CrossRef] [Green Version]
  75. Black, B.E.; Cleveland, D.W. Epigenetic Centromere Propagation and the Nature of CENP-A Nucleosomes. Cell 2011, 144, 471–479. [Google Scholar] [CrossRef] [Green Version]
  76. Mendiburo, M.J.; Padeken, J.; Fülöp, S.; Schepers, A.; Heun, P. Drosophila CENH3 Is Sufficient for Centromere Formation. Science 2011, 334, 686–690. [Google Scholar] [CrossRef]
  77. Hori, T.; Shang, W.H.; Takeuchi, K.; Fukagawa, T. The CCAN recruits CENP-A to the centromere and forms the structural core for kinetochore assembly. J. Cell Biol. 2012, 200, 45–60. [Google Scholar] [CrossRef]
  78. Barnhart, M.C.; Kuich, P.H.J.L.; Stellfox, M.E.; Ward, J.A.; Bassett, E.A.; Black, B.E.; Foltz, D.R. HJURP is a CENP-A chromatin assembly factor sufficient to form a functional de novo kinetochore. J. Cell Biol. 2011, 194, 229–243. [Google Scholar] [CrossRef] [Green Version]
  79. Wang, L.; Zeng, Z.; Zhang, W.; Jiang, J. Three Potato Centromeres Are Associated with Distinct Haplotypes with or Without Megabase-Sized Satellite Repeat Arrays. Genetics 2013, 196, 397–401. [Google Scholar] [CrossRef] [Green Version]
  80. Wang, K.; Wu, Y.; Zhang, W.; Dawe, R.K.; Jiang, J. Maize centromeres expand and adopt a uniform size in the genetic background of oat. Genome Res. 2013, 24, 107–116. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  81. Kasai, F.; Garcia, C.B.; Arruga, M.V.; Ferguson-Smith, M.A. Chromosome homology between chicken (Gallus gallus domesticus) and the red-legged partridge (Alectoris rufa); evidence of the occurrence of a neocentromere during evolution. Cytogenet. Genome Res. 2003, 102, 326–330. [Google Scholar] [CrossRef] [PubMed]
  82. Yang, F.; Fu, B.; O’Brien, P.C.M.; Nie, W.; Ryder, O.A.; Ferguson-Smith, M.A. Refined genome-wide comparative map of the domestic horse, donkey and human based on cross-species chromosome painting: Insight into the occasional fertility of mules. Chromosom. Res. 2004, 12, 65–76. [Google Scholar] [CrossRef] [PubMed]
  83. Purgato, S.; Belloni, E.; Piras, F.M.; Zoli, M.; Badiale, C.; Cerutti, F.; Mazzagatti, A.; Perini, G.; Della Valle, G.; Nergadze, S.G.; et al. Centromere sliding on a mammalian chromosome. Chromosoma 2014, 124, 277–287. [Google Scholar] [CrossRef] [Green Version]
  84. Ventura, M.; Archidiacono, N.; Rocchi, M. Centromere Emergence in Evolution. Genome Res. 2001, 11, 595–599. [Google Scholar] [CrossRef] [Green Version]
  85. du Sart, D.; Cancilla, M.R.; Earle, E.; Mao, J.I.; Saffery, R.; Tainton, K.M.; Kalitsis, P.; Martyn, J.; Barry, A.E.; Choo, K.H.A. A functional neo-centromere formed through activation of a latent human centromere and consisting of non-α-satellite DNA. Nat. Genet. 1997, 16, 144–153. [Google Scholar] [CrossRef]
  86. Montefalcone, G.; Tempesta, S.; Rocchi, M.; Archidiacono, N. Centromere Repositioning. Genome Res. 1999, 9, 1184–1188. [Google Scholar] [CrossRef] [Green Version]
  87. Rocchi, M.; Archidiacono, N.; Schempp, W.; Capozzi, O.; Stanyon, R. Centromere repositioning in mammals. Heredity 2011, 108, 59–67. [Google Scholar] [CrossRef]
  88. Amor, D.J.; Bentley, K.; Ryan, J.; Perry, J.K.; Wong, L.H.; Slater, H.; Choo, K.H.A. Human centromere repositioning “in progress”. Proc. Natl. Acad. Sci. USA 2004, 101, 6542–6547. [Google Scholar] [CrossRef] [Green Version]
  89. Amor, D.J.; Choo, K.H.A. Neocentromeres: Role in Human Disease, Evolution, and Centromere Study. Am. J. Hum. Genet. 2002, 71, 695–714. [Google Scholar] [CrossRef] [Green Version]
  90. Burrack, L.S.; Berman, J. Neocentromeres and epigenetically inherited features of centromeres. Chromosom. Res. 2012, 20, 607–619. [Google Scholar] [CrossRef] [Green Version]
  91. Stimpson, K.M.; Matheny, J.E.; Sullivan, B.A. Dicentric chromosomes: Unique models to study centromere function and inactivation. Chromosom. Res. 2012, 20, 595–605. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  92. Nechemia-Arbely, Y.; Miga, K.H.; Shoshani, O.; Aslanian, A.; McMahon, M.A.; Lee, A.Y.; Fachinetti, D.; Yates, J.R.; Ren, B.; Cleveland, D.W. DNA replication acts as an error correction mechanism to maintain centromere identity by restricting CENP-A to centromeres. Nature 2019, 21, 743–754. [Google Scholar] [CrossRef]
  93. Zeitlin, S.G.; Baker, N.M.; Chapados, B.R.; Soutoglou, E.; Wang, J.Y.J.; Berns, M.W.; Cleveland, D.W. Double-strand DNA breaks recruit the centromeric histone CENP-A. Proc. Natl. Acad. Sci. USA 2009, 106, 15762–15767. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  94. Williams, B.C.; Murphy, T.D.; Goldberg, M.L.; Karpen, G. Neocentromere activity of structurally acentric mini-chromosomes in Drosophila. Nat. Genet. 1998, 18, 30–38. [Google Scholar] [CrossRef] [PubMed]
  95. Maggert, K.A.; Karpen, G.H. The activation of a neocentromere in Drosophila requires proximity to an endogenous centromere. Genetics 2001, 158, 1615–1628. [Google Scholar]
  96. Olszak, A.M.; Van Essen, M.; Pereira, A.; Diehl, S.; Manke, T.; Maiato, H.; Saccani, S.; Heun, P. Heterochromatin boundaries are hotspots for de novo kinetochore formation. Nature 2011, 13, 799–808. [Google Scholar] [CrossRef]
  97. Piacentini, L.; Marchetti, M.; Bucciarelli, E.; Casale, A.M.; Cappucci, U.; Bonifazi, P.; Renda, F.; Fanti, L. A role of the Trx-G complex in Cid/CENP-A deposition at Drosophila melanogaster centromeres. Chromosoma 2019, 128, 503–520. [Google Scholar] [CrossRef]
  98. Topp, C.N.; Okagaki, R.; Melo, J.; Kynast, R.; Phillips, R.; Dawe, R. Identification of a maize neocentromere in an oat-maize addition line. Cytogenet. Genome Res. 2009, 124, 228–238. [Google Scholar] [CrossRef] [Green Version]
  99. Leo, L.; Marchetti, M.; Giunta, S.; Fanti, L. Epigenetics as an Evolutionary Tool for Centromere Flexibility. Genes 2020, 11, 809. [Google Scholar] [CrossRef]
  100. Malik, H.S.; Henikoff, S. Major Evolutionary Transitions in Centromere Complexity. Cell 2009, 138, 1067–1082. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  101. Henikoff, S.; Henikoff, J.G. “Point” centromeres of Saccharomyces harbor single centromere-specific nucleosomes. Genetics 2012, 190, 1575–1577. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  102. Hegemann, J.H.; Fleig, U.N. The centromere of budding yeast. BioEssays 1993, 15, 451–460. [Google Scholar] [CrossRef] [PubMed]
  103. Krassovsky, K.; Henikoff, J.G.; Henikoff, S. Tripartite organization of centromeric chromatin in budding yeast. Proc. Natl. Acad. Sci. USA 2011, 109, 243–248. [Google Scholar] [CrossRef] [Green Version]
  104. Tatchell, K.; Van Holde, K.E. Nucleosome reconstitution: Effect of DNA length on nucleosome structure. Biochemistry 1979, 18, 2871–2880. [Google Scholar] [CrossRef]
  105. Brogaard, K.; Xi, L.; Wang, J.P.; Widom, J. A map of nucleosome positions in yeast at base-pair resolution. Nature 2012, 486, 496–501. [Google Scholar] [CrossRef]
  106. Henikoff, S.; Ramachandran, S.; Krassovsky, K.; Bryson, T.D.; Codomo, C.A.; Brogaard, K.; Widom, J.; Wang, J.P.; Henikoff, J.G. The budding yeast Centromere DNA Element II wraps a stable Cse4 hemisome in either orientation in vivo. Abstract 2014, 3, e01861. [Google Scholar] [CrossRef]
  107. Aravamudhan, P.; Felzer-Kim, I.; Joglekar, A. The budding yeast point centromere associates with two Cse4 molecules during mitosis. Curr. Biol. 2013, 23, 770–774. [Google Scholar] [CrossRef] [Green Version]
  108. Mizuguchi, G.; Xiao, H.; Wiśniewski, J.; Smith, M.M.; Wu, C. Nonhistone Scm3 and Histones CenH3-H4 Assemble the Core of Centromere-Specific Nucleosomes. Cell 2007, 129, 1153–1164. [Google Scholar] [CrossRef] [Green Version]
  109. Lochmann, B.; Ivanov, D. Histone H3 Localizes to the Centromeric DNA in Budding Yeast. PLoS Genet. 2012, 8, e1002739. [Google Scholar] [CrossRef] [Green Version]
  110. Henikoff, S.; Furuyama, T. The unconventional structure of centromeric nucleosomes. Chromosoma 2012, 121, 341–352. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  111. Furuyama, T.; Codomo, C.A.; Henikoff, S. Reconstitution of hemisomes on budding yeast centromeric DNA. Nucleic Acids Res. 2013, 41, 5769–5783. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  112. Nakaseko, Y.; Adachi, Y.; Funahashi, S.I.; Niwa, O.; Yanagida, M. Chromosome walking shows a highly homologous repetitive sequence present in all the centromere regions of fission yeast. EMBO J. 1986, 5, 1011–1021. [Google Scholar] [CrossRef] [PubMed]
  113. Sun, X.; Wahlstrom, J.; Karpen, G.H. Molecular Structure of a Functional Drosophila Centromere. Cell 1997, 91, 1007–1019. [Google Scholar] [CrossRef] [Green Version]
  114. Round, E.K.; Flowers, S.K.; Richards, E.J. Arabidopsis thaliana Centromere Regions: Genetic Map Positions and Repetitive DNA Structure. Genome Res. 1997, 7, 1045–1053. [Google Scholar] [CrossRef] [Green Version]
  115. Dong, F.; Miller, J.T.; Jackson, S.A.; Wang, G.L.; Ronald, P.; Jiang, J. Rice (Oryza sativa) centromeric regions consist of complex DNA. Proc. Natl. Acad. Sci. USA 1998, 95, 8135–8140. [Google Scholar] [CrossRef] [Green Version]
  116. Nagaki, K.; Talbert, P.B.; Zhong, C.X.; Dawe, R.K.; Henikoff, S.; Jiang, J. Chromatin immunoprecipitation reveals that the 180-bp satellite repeat is the key functional DNA element of Arabidopsis thaliana centromeres. Genetics 2003, 163, 1221–1225. [Google Scholar]
  117. Vig, B.K.; Latour, D.; Frankovich, J. Dissociation of minor satellite from the centromere in mouse. J. Cell Sci. 1994, 107, 3091–3095. [Google Scholar]
  118. Joseph, A.; Mitchell, A.; Miller, O. The organization of the mouse satellite DNA at centromeres. Exp. Cell Res. 1989, 183, 494–500. [Google Scholar] [CrossRef]
  119. Sullivan, L.L.; Chew, K.; Sullivan, B.A. α satellite DNA variation and function of the human centromere. Nucleus 2017, 8, 331–339. [Google Scholar] [CrossRef] [Green Version]
  120. Melters, D.; Paliulis, L.V.; Korf, I.F.; Chan, S.W.L. Holocentric chromosomes: Convergent evolution, meiotic adaptations, and genomic analysis. Chromosom. Res. 2012, 20, 579–593. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  121. Stinchcomb, D.T.; Shaw, J.E.; Carr, S.H.; Hirsh, D. Extrachromosomal DNA transformation of Caenorhabditis elegans. Mol. Cell. Biol. 1985, 5, 3484–3496. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  122. Henikoff, S.; Dalal, Y. Centromeric chromatin: What makes it unique? Curr. Opin. Genet. Dev. 2005, 15, 177–184. [Google Scholar] [CrossRef] [PubMed]
  123. Lawrimore, J.; Bloom, K. The regulation of chromosome segregation via centromere loops. Crit. Rev. Biochem. Mol. Biol. 2019, 54, 352–370. [Google Scholar] [CrossRef]
  124. Roach, K.C.; Ross, B.; Malik, H.S. Rapid evolution of centromeres and centromeric/kinetochore proteins. In Rapidly Evolving Genes and Genetic Systems; Oxford University Press (OUP): Oxford, UK, 2012; pp. 83–93. [Google Scholar]
  125. Malik, H.S.; Bayes, J. Genetic conflicts during meiosis and the evolutionary origins of centromere complexity. Biochem. Soc. Trans. 2006, 34, 569–573. [Google Scholar] [CrossRef]
  126. Kursel, L.E.; Malik, H.S. The cellular mechanisms and consequences of centromere drive. Curr. Opin. Cell Biol. 2018, 52, 58–65. [Google Scholar] [CrossRef]
  127. Henikoff, S.; Malik, H.S. Centromeres: Selfish drivers. Nature 2002, 417, 227. [Google Scholar] [CrossRef]
  128. Akera, T.; Chmatal, L.; Trimm, E.; Yang, K.; Aonbangkhen, C.; Chenoweth, D.M.; Janke, C.; Schultz, R.M.; Lampson, M.A. Spindle asymmetry drives non-Mendelian chromosome segregation. Science 2017, 358, 668–672. [Google Scholar] [CrossRef] [Green Version]
  129. Lampson, M.A.; Black, B.E. Cellular and Molecular Mechanisms of Centromere Drive. Cold Spring Harb. Symp. Quant. Biol. 2017, 82, 249–257. [Google Scholar] [CrossRef]
  130. Malik, H.S. The Centromere-Drive Hypothesis: A Simple Basis for Centromere Complexity. Silicon Biominer. 2009, 48, 33–52. [Google Scholar] [CrossRef]
  131. Vermaak, D.; Hayden, H.S.; Henikoff, S. Centromere Targeting Element within the Histone Fold Domain of Cid. Mol. Cell. Biol. 2002, 22, 7553–7561. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  132. Iwata-Otsubo, A.; Dawicki-McKenna, J.M.; Akera, T.; Falk, S.J.; Chmátal, L.; Yang, K.; Sullivan, B.A.; Schultz, R.M.; Lampson, M.A.; Black, B.E. Expanded Satellite Repeats Amplify a Discrete CENP-A Nucleosome Assembly Site on Chromosomes that Drive in Female Meiosis. Curr. Biol. 2017, 27, 2365–2373. [Google Scholar] [CrossRef] [PubMed]
  133. Das, A.; Smoak, E.M.; Linares-Saldana, R.; Lampson, M.A.; Black, B.E. Centromere inheritance through the germline. Chromosoma 2017, 126, 595–604. [Google Scholar] [CrossRef] [PubMed]
  134. Maheshwari, S.; Tan, E.H.; West, A.; Franklin, F.C.H.; Comai, L.; Chan, S.W.L. Naturally Occurring Differences in CENH3 Affect Chromosome Segregation in Zygotic Mitosis of Hybrids. PLoS Genet. 2015, 11, e1004970. [Google Scholar] [CrossRef] [Green Version]
  135. Carbone, L.; Nergadze, S.G.; Magnani, E.; Misceo, D.; Cardone, M.F.; Roberto, R.; Bertoni, L.; Attolini, C.; Piras, M.F.; de Jong, P.; et al. Evolutionary movement of centromeres in horse, donkey, and zebra. Genomics 2006, 87, 777–782. [Google Scholar] [CrossRef] [Green Version]
  136. Finseth, F.R.; Dong, Y.; Saunders, A.; Fishman, L. Duplication and Adaptive Evolution of a Key Centromeric Protein in Mimulus, a Genus with Female Meiotic Drive. Mol. Biol. Evol. 2015, 32, 2694–2706. [Google Scholar] [CrossRef] [Green Version]
  137. Kursel, L.E.; Welsh, F.C.; Malik, H.S. Ancient Coretention of Paralogs of Cid Centromeric Histones and Cal1 Chaperones in Mosquito Species. Mol. Biol. Evol. 2020, 37, 1949–1963. [Google Scholar] [CrossRef]
  138. Kursel, L.E.; Malik, H.S. Recurrent Gene Duplication Leads to Diverse Repertoires of Centromeric Histones in Drosophila Species. Mol. Biol. Evol. 2017, 34, 1445–1462. [Google Scholar] [CrossRef] [Green Version]
  139. Brown, J.D.; O’Neill, R.J. Chromosomes, Conflict, and Epigenetics: Chromosomal Speciation Revisited. Annu. Rev. Genom. Hum. Genet. 2010, 11, 291–316. [Google Scholar] [CrossRef]
  140. Black, E.M.; Giunta, S. Repetitive Fragile Sites: Centromere Satellite DNA as a Source of Genome Instability in Human Diseases. Genes 2018, 9, 615. [Google Scholar] [CrossRef] [Green Version]
  141. Giunta, S.; Funabiki, H. Integrity of the human centromere DNA repeats is protected by CENP-A., CENP-C., and CENP-T. Proc. Natl. Acad. Sci. USA 2017, 114, 1928–1933. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  142. Sax, K. Chromosome structure and the mechanism of crossing over. J. Arnold. Arb. 1930, 13, 180–212. [Google Scholar]
  143. Mather, K. CROSSING-OVER. Biol. Rev. 1938, 13, 252–292. [Google Scholar] [CrossRef]
  144. Mahtani, M.M.; Willard, H.F. A primary genetic map of the pericentromeric region of the human X chromosome. Genomics 1988, 2, 294–301. [Google Scholar] [CrossRef]
  145. Choo, K.H.A. Why Is the Centromere So Cold? Genome Res. 1998, 8, 81–82. [Google Scholar] [CrossRef] [Green Version]
  146. Roberts, P.A. Difference in the Behaviour of Eu- and Hetero-chromatin: Crossing-over. Nature 1965, 205, 725–726. [Google Scholar] [CrossRef]
  147. Jaco, I.; Canela, A.; Vera, E.; Blasco, M.A. Centromere mitotic recombination in mammalian cells. J. Cell Biol. 2008, 181, 885–892. [Google Scholar] [CrossRef] [Green Version]
  148. Roizès, G. Human centromeric alphoid domains are periodically homogenized so that they vary substantially between homologues. Mechanism and implications for centromere functioning. Nucleic Acids Res. 2006, 34, 1912–1924. [Google Scholar] [CrossRef] [Green Version]
  149. Pironon, N.; Puechberty, J.; Roizès, G. Molecular and evolutionary characteristics of the fraction of human α satellite DNA associated with CENP-A at the centromeres of chromosomes 1, 5, 19, and 21. BMC Genom. 2010, 11, 195. [Google Scholar] [CrossRef] [Green Version]
  150. Langley, S.A.; Miga, K.H.; Karpen, G.H.; Langley, C.H. Haplotypes spanning centromeric regions reveal persistence of large blocks of archaic DNA. eLife 2019. [Google Scholar] [CrossRef]
  151. Giunta, S. Centromere Chromosome Orientation Fluorescent in situ Hybridization (Cen-CO-FISH) Detects Sister Chromatid Exchange at the Centromere in Human Cells. Bio Protoc. 2018, 8. [Google Scholar] [CrossRef] [PubMed]
  152. Aze, A.; Sannino, V.; Soffientini, P.; Bachi, A.; Costanzo, V. Centromeric DNA replication reconstitution reveals DNA loops and ATR checkpoint suppression. Nature 2016, 18, 684–691. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  153. Kasinathan, S.; Henikoff, S. Non-B-Form DNA Is Enriched at Centromeres. Mol. Biol. Evol. 2018, 35, 949–962. [Google Scholar] [CrossRef] [Green Version]
  154. Smith, G. Evolution of repeated DNA sequences by unequal crossover. Science 1976, 191, 528–535. [Google Scholar] [CrossRef] [PubMed]
  155. Talbert, P.B.; Henikoff, S. Centromeres Convert but Don’t Cross. PLoS Biol. 2010, 8, e1000326. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  156. Shi, J.; Wolf, S.E.; Burke, J.M.; Presting, G.G.; Ross-Ibarra, J.; Dawe, R.K. Widespread Gene Conversion in Centromere Cores. PLoS Biol. 2010, 8, e1000327. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  157. Brown, S.D.M.; Dover, G.A. Conservation of segmental variants of satellite DNA of Mus musculus in a related species: Mus spretus. Nature 1980, 285, 47–49. [Google Scholar] [CrossRef] [PubMed]
  158. Stahl, F.W. Gene Conversion. In Brenner’s Encyclopedia of Genetics, 2nd ed.; Academic Press: San Diego, CA, USA, 2013; ISBN 9780080961569. [Google Scholar]
  159. Taghian, D.G.; Nickoloff, J.A. Chromosomal double-strand breaks induce gene conversion at high frequency in mammalian cells. Mol. Cell. Biol. 1997, 17, 6386–6393. [Google Scholar] [CrossRef] [Green Version]
  160. Elliott, B.; Richardson, C.; Winderbaum, J.; Nickoloff, J.A.; Jasin, M. Gene Conversion Tracts from Double-Strand Break Repair in Mammalian Cells. Mol. Cell. Biol. 1998, 18, 93–101. [Google Scholar] [CrossRef] [Green Version]
  161. Richardson, C.; Moynahan, M.E.; Jasin, M. Double-strand break repair by interchromosomal recombination: Suppression of chromosomal translocations. Genes Dev. 1998, 12, 3831–3842. [Google Scholar] [CrossRef] [Green Version]
  162. Johnson, R.D.; Jasin, M. Sister chromatid gene conversion is a prominent double-strand break repair pathway in mammalian cells. EMBO J. 2000, 19, 3398–3407. [Google Scholar] [CrossRef] [PubMed]
  163. Fawcett, J.A.; Innan, H. Neutral and Non-Neutral Evolution of Duplicated Genes with Gene Conversion. Genes 2011, 2, 191–209. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  164. Chen, J.M.; Cooper, D.N.; Chuzhanova, N.; Férec, C.; Patrinos, G.P. Gene conversion: Mechanisms, evolution and human disease. Nat. Rev. Genet. 2007, 8, 762–775. [Google Scholar] [CrossRef] [PubMed]
  165. Birchler, J.A.; Presting, G.G. Retrotransposon insertion targeting: A mechanism for homogenization of centromere sequences on nonhomologous chromosomes. Genes Dev. 2012, 26, 638–640. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  166. Klein, S.J.; O’Neill, R.J. Transposable elements: Genome innovation, chromosome diversity, and centromere conflict. Chromosom. Res. 2018, 26, 5–23. [Google Scholar] [CrossRef] [Green Version]
  167. Suntronpong, A.; Singchat, W.; Kruasuwan, W.; Prakhongcheep, O.; Sillapaprayoon, S.; Muangmai, N.; Somyong, S.; Indananda, C.; Kraichak, E.; Peyachoknagul, S.; et al. Characterization of centromeric satellite DNAs (MALREP) in the Asian swamp eel (Monopterus albus) suggests the possible origin of repeats from transposable elements. Genomics 2020. [Google Scholar] [CrossRef]
  168. Macas, J.; Koblízková, A.; Navrátilová, A.; Neumann, P. Hypervariable 3′ UTR region of plant LTR-retrotransposons as a source of novel satellite repeats. Gene 2009, 448, 198–206. [Google Scholar] [CrossRef]
  169. Carone, D.M.; Zhang, C.; Hall, L.E.; Obergfell, C.; Carone, B.R.; O’Neill, M.J.; O’Neill, R.J. Hypermorphic expression of centromeric retroelement-encoded small RNAs impairs CENP-A loading. Chromosom. Res. 2013, 21, 49–62. [Google Scholar] [CrossRef]
  170. Corless, S.; Höcker, S.; Erhardt, S. Centromeric RNA and Its Function at and Beyond Centromeric Chromatin. J. Mol. Biol. 2020. [Google Scholar] [CrossRef]
  171. Niedenthal, R.; Stoll, R.; Hegemann, J.H. In vivo characterization of the Saccharomyces cerevisiae centromere DNA element I, a binding site for the helix-loop-helix protein CPF1. Mol. Cell. Biol. 1991, 11, 3545–3553. [Google Scholar] [CrossRef] [Green Version]
  172. Kheiavi, E.K.; Ahmadikhah, A. Genome Mining of Rice (Oryza sativa subsp. indica) for Detection and Characterization of Long Palindromic Sequences. J. Data Min. Genom. Proteom. 2016, 7. [Google Scholar] [CrossRef] [Green Version]
  173. Méndez-Lago, M.; Bergman, C.M.; De Pablos, B.; Tracey, A.; Whitehead, S.L.; Villasante, A. A Large Palindrome with Interchromosomal Gene Duplications in the Pericentromeric Region of the D. melanogaster Y Chromosome. Mol. Biol. Evol. 2011, 28, 1967–1971. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  174. Chuzhanova, N.; Chen, J.M.; Bacolla, A.; Patrinos, G.P.; Férec, C.; Wells, R.D.; Cooper, D.N. Gene conversion causing human inherited disease: Evidence for involvement of non-B-DNA-forming sequences and recombination-promoting motifs in DNA breakage and repair. Hum. Mutat. 2009, 30, 1189–1198. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  175. Walsh, J.B. Persistence of Tandem Arrays: Implications for Satellite and Simple-Sequence Dnas. Genetics 1987, 115, 553–567. [Google Scholar] [PubMed]
  176. Zhao, J.; Bacolla, A.; Wang, G.; Vasquez, K.M. Non-B DNA structure-induced genetic instability and evolution. Cell. Mol. Life Sci. 2009, 67, 43–62. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  177. Zhu, L.; Chou, S.H.; Reid, B.R. A single G-to-C change causes human centromere TGGAA repeats to fold back into hairpins. Proc. Natl. Acad. Sci. USA 1996, 93, 12159–12164. [Google Scholar] [CrossRef] [Green Version]
  178. Ohno, M.; Fukagawa, T.; Lee, J.S.; Ikemura, T. Triplex-forming DNAs in the human interphase nucleus visualized in situ by polypurine/polypyrimidine DNA probes and antitriplex antibodies. Chromosoma 2002, 111, 201–213. [Google Scholar] [CrossRef]
  179. Garavís, M.; Escaja, N.; Gabelica, V.; Villasante, A.; Gonzalez, C. Centromeric α-Satellite DNA Adopts Dimeric i-Motif Structures Capped by AT Hoogsteen Base Pairs. Chem. Eur. J. 2015, 21, 9816–9824. [Google Scholar] [CrossRef] [Green Version]
  180. Garavís, M.; Méndez-Lago, M.; Gabelica, V.; Whitehead, S.L.; Gonzalez, C.; Villasante, A. The structure of an endogenous Drosophila centromere reveals the prevalence of tandemly repeated sequences able to form i-motifs. Sci. Rep. 2015, 5, 13307.
  181. Jonstrup, A.T.; Thomsen, T.; Wang, Y.; Knudsen, B.R.; Koch, J.; Andersen, A.H. Hairpin structures formed by α satellite DNA of human centromeres are cleaved by human topoisomerase II. Nucleic Acids Res. 2008, 36, 6165–6174.
  182. Madireddy, A.; Gerhardt, J. Replication through Repetitive DNA Elements and Their Role in Human Diseases. In Retinal Degenerative Diseases; Springer Science and Business Media LLC: Berlin, Germany, 2017; pp. 549–581.
  183. Available online: https://www.biorxiv.org/content/10.1101/731471v1 (accessed on 10 August 2019).
  184. Maccaroni, K.; Balzano, E.; Mirimao, F.; Giunta, S.; Pelliccia, F. Impaired Replication Timing Promotes Tissue-Specific Expression of Common Fragile Sites. Genes 2020, 11, 326.
  185. Chang, H.H.Y.; Pannunzio, N.R.; Adachi, N.; Lieber, M.R. Non-homologous DNA end joining and alternative pathways to double-strand break repair. Nat. Rev. Mol. Cell Biol. 2017, 18, 495–506.
  186. Scully, R.; Panday, A.; Elango, R.; Willis, N.A. DNA double-strand break repair-pathway choice in somatic mammalian cells. Nat. Rev. Mol. Cell Biol. 2019, 20, 698–714.
  187. Ceccaldi, R.; Rondinelli, B.; D’Andrea, A. Repair Pathway Choices and Consequences at the Double-Strand Break. Trends Cell Biol. 2015, 26, 52–64.
  188. Callen, E.; Zong, D.; Wu, W.; Wong, N.; Stanlie, A.; Ishikawa, M.; Pavani, R.; Dumitrache, L.C.; Byrum, A.K.; Mendez-Dorantes, C.; et al. 53BP1 Enforces Distinct Pre- and Post-resection Blocks on Homologous Recombination. Mol. Cell 2020, 77, 26–38.
  189. Malkova, A.; Ira, G. Break-induced replication: Functions and molecular mechanism. Curr. Opin. Genet. Dev. 2013, 23, 271–279.
  190. Zhang, F.; Khajavi, M.; Connolly, A.M.; Towne, C.F.; Batish, S.D.; Lupski, J.R. The DNA replication FoSTeS/MMBIR mechanism can generate genomic, genic and exonic complex rearrangements in humans. Nat. Genet. 2009, 41, 849–853.
  191. Hastings, P.J.; Ira, G.; Lupski, J.R. A Microhomology-Mediated Break-Induced Replication Model for the Origin of Human Copy Number Variation. PLoS Genet. 2009, 5, e1000327.
  192. Llorente, B.; Smith, C.E.; Symington, L.S. Break-induced replication: What is it and what is it for? Cell Cycle 2008, 7, 859–864.
  193. Bertelsen, A.H.; Humayun, M.Z.; Karfopoulos, S.G.; Rush, M.G. Molecular characterization of small polydisperse circular DNA from an African green monkey cell line. Biochemistry 1982, 21, 2076–2085.
  194. Baumann, P.; West, S.C. Role of the human RAD51 protein in homologous recombination and double-stranded-break repair. Trends Biochem. Sci. 1998, 23, 247–251.
  195. Available online: https://www.biorxiv.org/content/10.1101/768887v1.full (accessed on 23 September 2019).
  196. Bhargava, R.; Onyango, D.O.; Stark, J.M. Regulation of Single-Strand Annealing and its Role in Genome Maintenance. Trends Genet. 2016, 32, 566–575.
  197. Ramakrishnan, S.; Kockler, Z.; Evans, R.; Downing, B.D.; Malkova, A. Single-strand annealing between inverted DNA repeats: Pathway choice, participating proteins, and genome destabilizing consequences. PLoS Genet. 2018, 14, e1007543.
  198. Lo, A.W.; Liao, G.C.C.; Rocchi, M.; Choo, K.H.A. Extreme Reduction of Chromosome-Specific α-Satellite Array Is Unusually Common in Human Chromosome 21. Genome Res. 1999, 9, 895–908.
  199. Okamoto, Y.; Nakano, M.; Ohzeki, J.I.; Larionov, V.; Masumoto, H. A minimal CENP-A core is required for nucleation and maintenance of a functional human centromere. EMBO J. 2007, 26, 1279–1291.
  200. Bodor, D.L.; Mata, J.F.; Sergeev, M.; David, A.F.; Salimian, K.J.; Panchenko, T.; Cleveland, D.W.; Black, B.E.; Shah, J.V.; Jansen, L.E.T. The quantitative architecture of centromeric chromatin. eLife 2014, 3, e2137.
  201. Available online: https://www.biorxiv.org/content/10.1101/731430v1 (accessed on 10 August 2019).
  202. Guin, K.; Chen, Y.; Mishra, R.; Muzaki, S.R.B.M.; Thimmappa, B.C.; O’Brien, C.E.; Butler, G.; Sanyal, A.; Sanyal, K. Spatial inter-centromeric interactions facilitated the emergence of evolutionary new centromeres. eLife 2020, 9.
  203. Stephan, W. Tandem-repetitive noncoding DNA: Forms and forces. Mol. Biol. Evol. 1989, 6, 198–212.
  204. Stephan, W.; Cho, S. Possible Role of Natural Selection in the Formation of Tandem-Repetitive Noncoding DNA. Genetics 1994, 136, 333–341.
  205. Britten, R.J. Divergence between samples of chimpanzee and human DNA sequences is 5%, counting indels. Proc. Natl. Acad. Sci. USA 2002, 99, 13633–13635.
  206. Haaf, T.; Willard, H.F. Chromosome-specific α-satellite DNA from the centromere of chimpanzee chromosome 4. Chromosoma 1997, 106, 226–232.
  207. Archidiacono, N.; Antonacci, R.; Marzella, R.; Finelli, P.; Lonoce, A.; Rocchi, M. Comparative mapping of human alphoid sequences in great apes using fluorescence in situ hybridization. Genomics 1995, 25, 477–484.
  208. Haaf, T.; Mater, A.G.; Wienberg, J.; Ward, D.C.; Matera, A.A. Presence and abundance of CENP-B box sequences in great ape subsets of primate-specific α-satellite DNA. J. Mol. Evol. 1995, 41, 487–491.
  209. Alexandrov, I.; Kazakov, A.; Tumeneva, I.; Shepelev, V.; Yurov, Y.B. α-satellite DNA of primates: Old and new families. Chromosoma 2001, 110, 253–266.
  210. Cacheux, L.; Ponger, L.; Gerbault-Seureau, M.; Loll, F.; Gey, D.; Richard, F.A.; Escudé, C. The Targeted Sequencing of α Satellite DNA in Cercopithecus pogonias Provides New Insight into the Diversity and Dynamics of Centromeric Repeats in Old World Monkeys. Genome Biol. Evol. 2018, 10, 1837–1851.
  211. Cacheux, L.; Ponger, L.; Gerbault-Seureau, M.; Richard, F.A.; Escudé, C. Diversity and distribution of α satellite DNA in the genome of an Old World monkey: Cercopithecus solatus. BMC Genom. 2016, 17, 1–14.
  212. Alves, G.; Seuánez, H.N.; Fanning, T. A Clade of New World Primates with Distinctive Alphoid Satellite DNAs. Mol. Phylogenetics Evol. 1998, 9, 220–224.
  213. Alves, G.; Seuánez, H.N.; Fanning, T. α satellite DNA in neotropical primates (Platyrrhini). Chromosoma 1994, 103, 262–267.
  214. Maio, J.J.; Brown, F.L.; Musich, P.R. Toward a molecular paleontology of primate genomes. Chromosoma 1981, 83, 103–125.
  215. Musich, P.R.; Brown, F.L.; Maio, J.J. Highly repetitive component α and related alphoid DNAs in man and monkeys. Chromosoma 1980, 80, 331–348.
  216. Baldini, A.; Miller, D.A.; Miller, O.J.; Ryder, O.A.; Mitchell, A.R. A chimpanzee-derived chromosome-specific α satellite DNA sequence conserved between chimpanzee and human. Chromosoma 1991, 100, 156–161.
  217. Warburton, P.E.; Haaf, T.; Gosden, J.; Lawson, D.; Willard, H.F. Characterization of a Chromosome-Specific Chimpanzee α Satellite Subset: Evolutionary Relationship to Subsets on Human Chromosomes. Genomics 1996, 33, 220–228.
  218. Waye, J.S.; Willard, H.F. Chromosome specificity of satellite DNAs: Short- and long-range organization of a diverged dimeric subset of human α satellite from chromosome 3. Chromosoma 1989, 97, 475–480.
  219. Durfy, S.J.; Willard, H.F. Concerted evolution of primate α satellite DNA. Evidence for an ancestral sequence shared by gorilla and human X chromosome α satellite. J. Mol. Biol. 1990, 216, 555–566.
  220. Haaf, T.; Willard, H.F. Orangutan α-satellite monomers are closely related to the human consensus sequence. Mamm. Genome 1998, 9, 440–447.
  221. Rudd, M.K.; Matera, A.G.; Willard, H.F.; Hunt, P.A.; Schwartz, S.; Tartakoff, A. Organization, Evolution and Function of α Satellite DNA at Human Centromeres. Ph.D. Thesis, Case Western Reserve University, Cleveland, OH, USA, 2005.
  222. Willard, H.F.; Waye, J.S. Hierarchical order in chromosome-specific human α satellite DNA. Trends Genet. 1987, 3, 192–198.
  223. Romanova, L.; Deriagin, G.; Mashkova, T.; Tumeneva, I.; Mushegian, A.R.; Kisselev, L.; Alexandrov, I. Evidence for Selection in Evolution of α Satellite DNA: The Central Role of CENP-B/pJα Binding Region. J. Mol. Biol. 1996, 261, 334–340.
  224. Tyler-Smith, C.; Brown, W.R. Structure of the major block of alphoid satellite DNA on the human Y chromosome. J. Mol. Biol. 1987, 195, 457–470.
  225. Laurent, A.; Puechberty, J.; Roizès, G. Hypothesis: For the worst and for the best, L1Hs retrotransposons actively participate in the evolution of the human centromeric alphoid sequences. Chromosom. Res. 1999, 7, 305–317.
  226. Schindelhauer, D.; Schwarz, T. Evidence for a Fast, Intrachromosomal Conversion Mechanism from Mapping of Nucleotide Variants within a Homogeneous α-Satellite DNA Array. Genome Res. 2002, 12, 1815–1826.
  227. Shepelev, V.A.; Alexandrov, A.A.; Yurov, Y.B.; Alexandrov, I.A. The Evolutionary Origin of Man Can Be Traced in the Layers of Defunct Ancestral α Satellites Flanking the Active Centromeres of Human Chromosomes. PLoS Genet. 2009, 5, e1000641.
  228. Csink, A.K.; Henikoff, S. Something from nothing: The evolution and utility of satellite repeats. Trends Genet. 1998, 14, 200–204.
  229. Otake, K.; Ohzeki, J.I.; Shono, N.; Kugou, K.; Okazaki, K.; Nagase, T.; Yamakawa, H.; Kouprina, N.; Larionov, V.; Kimura, H.; et al. CENP-B creates alternative epigenetic chromatin states permissive for CENP-A or heterochromatin assembly. J. Cell Sci. 2020.
  230. Miga, K.H.; Newton, Y.; Jain, M.; Altemose, N.; Willard, H.F.; Kent, W.J. Centromere reference models for human chromosomes X and Y satellite arrays. Genome Res. 2014, 24, 697–707.
  231. Choo, K.H.; Vissel, B.; Nagy, A.; Earle, E.; Kalitsis, P. A survey of the genomic distribution of α satellite DNA on all the human chromosomes, and derivation of a new consensus sequence. Nucleic Acids Res. 1991, 19, 1179.
  232. Rudd, M.K.; Wray, G.A.; Willard, H.F. The evolutionary dynamics of α satellite. Genome Res. 2005, 16, 88–96.
  233. Carine, K.; Jacquemin-Sablon, A.; Waltzer, E.; Mascarello, J.; Scheffler, I.E. Molecular characterization of human minichromosomes with centromere from chromosome 1 in human-hamster hybrid cells. Somat. Cell Mol. Genet. 1989, 15, 445–460.
  234. Willard, H.F.; Waye, J.S. Chromosome-specific subsets of human α satellite DNA: Analysis of sequence divergence within and between chromosomal subsets and evidence for an ancestral pentameric repeat. J. Mol. Evol. 1987, 25, 207–214.
  235. Fukagawa, T.; Earnshaw, W.C. The Centromere: Chromatin Foundation for the Kinetochore Machinery. Dev. Cell 2014, 30, 496–508.
  236. Hartley, G.; O’Neill, R.J. Centromere Repeats: Hidden Gems of the Genome. Genes 2019, 10, 223.
  237. Glunčić, M.; Vlahović, I.; Paar, V. Discovery of 33mer in chromosome 21—The largest α satellite higher order repeat unit among all human somatic chromosomes. Sci. Rep. 2019, 9, 12629.
  238. Ziccardi, W.; Zhao, C.; Shepelev, V.; Uralsky, L.; Alexandrov, I.; Andreeva, T.; Rogaev, E.; Bun, C.; Miller, E.; Putonti, C.; et al. Clusters of α satellite on human chromosome 21 are dispersed far onto the short arm and lack ancient layers. Chromosom. Res. 2016, 24, 421–436.
  239. Aldrup-MacDonald, M.E.; Kuo, M.E.; Sullivan, L.L.; Chew, K.; Sullivan, B.A. Genomic variation within α satellite DNA influences centromere location on human chromosomes with metastable epialleles. Genome Res. 2016, 26, 1301–1311.
  240. Barra, V.; Fachinetti, D. The dark side of centromeres: Types, causes and consequences of structural abnormalities implicating centromeric DNA. Nat. Commun. 2018, 9, 4340.
  241. Dumont, M.; Gamba, R.; Gestraud, P.; Klaasen, S.; Worrall, J.T.; De Vries, S.G.; Boudreau, V.; Salinas-Luypaert, C.; Maddox, P.S.; Lens, S.M.; et al. Human chromosome-specific aneuploidy is influenced by DNA-dependent centromeric features. EMBO J. 2019, 39, e102924.
  242. Amon, J.D.; Koshland, D. RNase H enables efficient repair of R-loop induced DNA damage. eLife 2016, 5, 115.
  243. Okita, A.K.; Zafar, F.; Su, J.; Weerasekara, D.; Kajitani, T.; Takahashi, T.S.; Kimura, H.; Murakami, Y.; Masukata, H.; Nakagawa, T. Heterochromatin suppresses gross chromosomal rearrangements at centromeres by repressing Tfs1/TFIIS-dependent transcription. Commun. Biol. 2019, 2, 17.
  244. Blat, Y.; Kleckner, N. Cohesins Bind to Preferential Sites along Yeast Chromosome III, with Differential Regulation along Arms versus the Centric Region. Cell 1999, 98, 249–259.
  245. Weber, S.A.; Gerton, J.L.; Polancic, J.E.; DeRisi, J.L.; Koshland, D.; Megee, P.C. The Kinetochore Is an Enhancer of Pericentric Cohesin Binding. PLoS Biol. 2004, 2, e260.
  246. González-Barrios, R.; Soto-Reyes, E.; Herrera, L.A. Assembling pieces of the centromere epigenetics puzzle. Epigenetics 2012, 7, 3–13.
  247. Available online: https://www.biorxiv.org/content/10.1101/2020.06.04.133272v1 (accessed on 4 June 2020).
  248. Wijmenga, C.; Scott Hansen, R.; Gimelli, G.; Björck, E.J.; Graham Davies, E.; Valentine, D.; Belohradsky, B.H.; Van Dongen, J.J.; Smeets, D.F.C.M.; Van Den Heuvel, L.P.W.J.; et al. Genetic variation in ICF syndrome: Evidence for genetic heterogeneity. Hum. Mutat. 2000.
  249. Thijssen, P.E.; Ito, Y.; Grillo, G.; Wang, J.; Velasco, G.; Nitta, H.; Unoki, M.; Yoshihara, M.; Suyama, M.; Sun, Y.; et al. Mutations in CDCA7 and HELLS cause immunodeficiency–centromeric instability–facial anomalies syndrome. Nat. Commun. 2015, 6, 7870.
  250. Unoki, M.; Funabiki, H.; Velasco, G.; Francastel, C.; Sasaki, H. CDCA7 and HELLS mutations undermine nonhomologous end joining in centromeric instability syndrome. J. Clin. Investig. 2018, 129, 78–92.
  251. Jenness, C.; Giunta, S.; Müller, M.M.; Kimura, H.; Muir, T.W.; Funabiki, H. HELLS and CDCA7 comprise a bipartite nucleosome remodeling complex defective in ICF syndrome. Proc. Natl. Acad. Sci. USA 2018, 115, E876–E885.
  252. De Greef, J.C.; Wang, J.; Balog, J.; Dunnen, J.T.D.; Frants, R.R.; Straasheijm, K.R.; Aytekin, C.; Van Der Burg, M.; Duprez, L.; Ferster, A.; et al. Mutations in ZBTB24 Are Associated with Immunodeficiency, Centromeric Instability, and Facial Anomalies Syndrome Type 2. Am. J. Hum. Genet. 2011, 88, 796–804.
  253. Weemaes, C.M.R.; Van Tol, M.J.; Wang, J.; Dam, M.M.V.O.T.; Van Eggermond, M.C.; Thijssen, P.E.; Aytekin, C.; Brunetti-Pierri, N.; Van Der Burg, M.; Davies, E.G.; et al. Heterogeneous clinical presentation in ICF syndrome: Correlation with underlying gene defects. Eur. J. Hum. Genet. 2013, 21, 1219–1225.
  254. Ohzeki, J.I.; Bergmann, J.H.; Kouprina, N.; Noskov, V.N.; Nakano, M.; Kimura, H.; Earnshaw, W.C.; Larionov, V.; Masumoto, H. Breaking the HAC Barrier: Histone H3K9 acetyl/methyl balance regulates CENP-A assembly. EMBO J. 2012, 31, 2391–2402.
  255. Sathyan, K.M.; Fachinetti, D.; Foltz, D.R. α-amino trimethylation of CENP-A by NRMT is required for full recruitment of the centromere. Nat. Commun. 2017, 8, 14678.
  256. Hedouin, S.; Grillo, G.; Ivkovic, I.; Velasco, G.; Francastel, C. CENP-A chromatin disassembly in stressed and senescent murine cells. Sci. Rep. 2017, 7, 42520.
  257. Lee, S.-H.; Itkin-Ansari, P.; Levine, F. CENP-A, a protein required for chromosome segregation in mitosis, declines with age in islet but not exocrine cells. Aging 2010, 2, 785–790.
  258. Ly, D.H.; Lockhart, D.J.; Lerner, R.A.; Schultz, P.G. Mitotic Misregulation and Human Aging. Science 2000, 287, 2486–2492.
  259. Narita, M.; Narita, M.; Krizhanovsky, V.; Nuñez, S.; Chicas, A.; Hearn, S.A.; Myers, M.P.; Lowe, S.W. A Novel Role for High-Mobility Group A Proteins in Cellular Senescence and Heterochromatin Formation. Cell 2006, 126, 503–514.
  260. Nye, J.; Sturgill, D.; Athwal, R.; Dalal, Y. HJURP antagonizes CENP-A mislocalization driven by the H3.3 chaperones HIRA and DAXX. PLoS ONE 2018, 13, e205948.
  261. Zaratiegui, M.; Vaughn, M.; Irvine, D.V.; Goto, D.; Watt, S.; Bähler, J.; Arcangioli, B.; Martienssen, R.A. CENP-B preserves genome integrity at replication forks paused by retrotransposon LTR. Nature 2010, 469, 112–115.
  262. Anwar, S.L.; Wulaningsih, W.; Lehmann, U. Transposable Elements in Human Cancer: Causes and Consequences of Deregulation. Int. J. Mol. Sci. 2017, 18, 974.
  263. Ting, D.T.; Lipson, R.; Paul, S.; Brannigan, B.W.; Akhavanfard, S.; Coffman, E.J.; Contino, G.; Deshpande, V.; Iafrate, A.J.; Letovsky, S.; et al. Aberrant Overexpression of Satellite Repeats in Pancreatic and Other Epithelial Cancers. Science 2011, 331, 593–596.
  264. Bersani, F.; Lee, E.; Kharchenko, P.V.; Xu, A.W.; Liu, M.; Xega, K.; MacKenzie, O.C.; Brannigan, B.W.; Wittner, B.S.; Jung, H.; et al. Pericentromeric satellite repeat expansions through RNA-derived DNA intermediates in cancer. Proc. Natl. Acad. Sci. USA 2015, 112, 15148–15153.
  265. Kishikawa, T.; Otsuka, M.; Yoshikawa, T.; Ohno, M.; Yamamoto, K.; Yamamoto, N.; Kotani, A.; Koike, K. Quantitation of circulating satellite RNAs in pancreatic cancer patients. JCI Insight 2016, 1, e86646.
  266. Zhu, Q.; Hoong, N.; Aslanian, A.; Hara, T.; Benner, C.; Heinz, S.; Miga, K.H.; Ke, E.; Verma, S.; Soroczynski, J.; et al. Heterochromatin-Encoded Satellite RNAs Induce Breast Cancer. Mol. Cell 2018, 70, 842–853.
  267. Kim, N.; Jinks-Robertson, S. Transcription as a source of genome instability. Nat. Rev. Genet. 2012, 13, 204–214.
  268. Liu, Y.; Su, H.; Zhang, J.; Liu, Y.; Feng, C.; Han, F. Back-spliced RNA from retrotransposon binds to centromere and regulates centromeric chromatin loops in maize. PLoS Biol. 2020, 18, e3000582.
  269. DuPraw, E.J. Cell and Molecular Biology; Academic Press: Cambridge, MA, USA, 1968; ISBN-10: 012224950X.
  270. Hagen, K.G.T.; Gilbert, D.M.; Willard, H.F.; Cohen, S.N. Replication timing of DNA sequences associated with human centromeres and telomeres. Mol. Cell. Biol. 1990, 10, 6348–6355.
  271. Shelby, R.D.; Vafa, O.; Sullivan, K.F. Assembly of CENP-A into Centromeric Chromatin Requires a Cooperative Array of Nucleosomal DNA Contact Sites. J. Cell Biol. 1997, 136, 501–513.
  272. Erliandri, I.; Fu, H.; Nakano, M.; Kim, J.H.; Miga, K.H.; Liskovykh, M.; Earnshaw, W.C.; Masumoto, H.; Kouprina, N.; Aladjem, M.I.; et al. Replication of α-satellite DNA arrays in endogenous human centromeric regions and in human artificial chromosome. Nucleic Acids Res. 2014, 42, 11502–11516.
Figure 1. CenH3 protein alignments, conservation and diversity across species. The structural elements of CenH3 proteins are illustrated, with conserved residues in blue. The histogram above the sequences shows the conserved regions: the carboxyl terminal domain and its components (L1 and α-helix) are highly preserved across eukaryotes. The shared CENP-A Targeting Domain (CATD) drives the association between proteins and centromeres [50]. Despite the variability of the amino terminal tail, this domain contains a phosphorylatable serine for CenH3 mitotic function [51]. This image is courtesy of Damien Goutte-Gattat [52].
Figure 2. Centromere structures in different eukaryotes. (A) The S. cerevisiae point centromere is 125 bp in size and is composed of three centromere DNA elements (CDEs): CDEI, CDEII and CDEIII. (B) The S. pombe centromere is made of inner (ImrL and ImrR) and outer (dg and dh) inverted repetitive sequences that flank a central unique sequence (Cnt). (C) The two main satellite domains (AATAT and AAGAG) of the D. melanogaster centromere are interspersed with transposable elements (black lines). (D) A. thaliana has a 180 bp repeat unit intermingled with retrotransposons (black lines). (E) The mouse centromere is made up of major satellite sequences (MaSat) of 234 bp monomers (spanning ~6 Mb; green arrows) and minor satellite sequences (MiSat) of 120 bp monomers (spanning ~600 kb; blue arrows). (F) Human centromeres contain tandem repeats of α-satellite 171 bp monomers organized head to tail into higher order repeats (HORs). (G) The meta-polycentric centromere of P. sativum is a very long centromere of 13 families of satellite DNA repeats and one family of Ty3/gypsy retrotransposons, organized into 3–5 domains containing CenH3. (H) The polycentric or holocentric centromere of C. elegans covers the entire length of the chromosome, along which there are several points for microtubule attachment. Despite this great diversity, all these centromeres faithfully mediate chromosome segregation.
Figure 3. Mutagenic processes that may operate at centromere sequences and have contributed to their repetitive origins. (A) Unequal exchange following recombination can cause gain or loss of tandem repeats and DNA rearrangements. (B) Gene conversion causes the unidirectional transfer of genetic information among homologous repetitive DNA sequences and can result in reciprocal or non-reciprocal exchange (the latter is depicted). (C) Replication slippage on misaligned repeated DNA strands during replication is thought to induce centromere expansion or contraction, depending on whether the hairpin (depicted)/distortion is found on the newly synthesized strand (blue repeats) or the bulge (depicted)/distortion is on the template DNA (green repeats). (D) Break-induced replication (BIR) repairs a one-ended double-stranded break (DSB) substrate produced by replication fork collapse. (E) Rolling circle replication occurs when the 3′ end circularizes, and its replication produces repeated concatemers. (F) Single strand annealing (SSA) repairs DSBs through the annealing of complementary ssDNA strands, followed by DNA tail end digestion and ligation. These repair pathways are essential for maintaining genome stability, yet when operating on repetitive sequences (especially those arranged in tandem and sharing a high degree of sequence homology, as at the centromere), they may generate mutagenic variability that contributes to ongoing DNA evolution and shaping.
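As an illustration of panel (A) of Figure 3, the following minimal Python sketch models unequal exchange between two misaligned tandem-repeat arrays. It is a toy model under stated assumptions, not a description of the molecular mechanism: each array is a list of repeat-unit labels, and the function name `unequal_exchange` and the break-point indices are hypothetical choices made only for this example.

```python
def unequal_exchange(array_a, array_b, break_a, break_b):
    """Swap the tails of two tandem-repeat arrays at the given break points.

    Hypothetical toy model: break_a and break_b are the numbers of repeat
    units kept from the start of each array. When they differ, the arrays
    were misaligned and the exchange is unequal.
    """
    if not (0 <= break_a <= len(array_a) and 0 <= break_b <= len(array_b)):
        raise ValueError("break points must fall within the arrays")
    # Head of A joined to tail of B, and head of B joined to tail of A.
    product_1 = array_a[:break_a] + array_b[break_b:]
    product_2 = array_b[:break_b] + array_a[break_a:]
    return product_1, product_2


# Two arrays of six repeat units each (labels are illustrative only).
sister_1 = ["A1", "A2", "A3", "A4", "A5", "A6"]
sister_2 = ["B1", "B2", "B3", "B4", "B5", "B6"]

# Misalignment by two repeat units: one product gains two units, the other loses two.
expanded, contracted = unequal_exchange(sister_1, sister_2, 4, 2)
print(len(expanded), len(contracted))
```

In this sketch the total number of repeat units is conserved across the two products, so one array expands exactly as much as the other contracts, which matches the gain or loss of tandem repeats described in the caption.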
Table 1. Centromere structure in different species.

| Centromere Type | Species | Size | References |
|---|---|---|---|
| Point centromere | fungi | | [4] |
| | Saccharomyces cerevisiae | ~125 bp | |
| Short regional centromere | fungi | | [9,10] |
| | Candida albicans | ~3–5 kb | |
| | Schizosaccharomyces pombe | ~35–110 kb | |
| Long regional centromere | viridiplantae | | [11,12,13] |
| | Arabidopsis thaliana | ~400 kb–1.4 Mb | |
| | Oryza sativa | ~65 kb–2 Mb | |
| | Zea mays | ~180 kb | |
| | metazoa | | [14,15,16] |
| | Drosophila melanogaster | ~420 kb | |
| | Mus musculus | ~1 Mb | |
| | Homo sapiens | ~0.5 to 5 Mb | |
| Meta-polycentric centromere | tracheobionta | | [17] |
| | Pisum sativum | ~69–107 Mb | |
| Holocentromere | viridiplantae | | [18] |
| | Luzula nivea | ~100 Mb | |
| | metazoa | | [19,20,21] |
| | Bombyx mori | ~8–21 Mb | |
| | Caenorhabditis elegans | ~14–21 Mb | |
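To give a sense of scale for the sizes in Table 1, the short Python sketch below estimates how many repeat monomers would fit in an array of a given length, using the 171 bp human α-satellite and 120 bp mouse minor satellite monomer sizes from Figure 2. It assumes, as a simplification, that the array consists of uninterrupted monomers; the function name `monomer_count` is a hypothetical helper introduced only for this example.

```python
def monomer_count(array_bp: int, monomer_bp: int) -> int:
    """Number of whole monomers that fit in an array of array_bp base pairs.

    Simplifying assumption: the array is made only of back-to-back monomers,
    with no transposable elements or other interruptions.
    """
    if array_bp < 0 or monomer_bp <= 0:
        raise ValueError("array_bp must be >= 0 and monomer_bp must be > 0")
    return array_bp // monomer_bp


# Human centromeres: ~0.5 to 5 Mb of 171 bp alpha-satellite monomers.
human_low = monomer_count(500_000, 171)
human_high = monomer_count(5_000_000, 171)

# Mouse minor satellite: ~600 kb of 120 bp monomers.
mouse_minor = monomer_count(600_000, 120)

print(human_low, human_high, mouse_minor)
```

Under these assumptions, a human centromere would hold on the order of a few thousand to a few tens of thousands of monomers, and the mouse minor satellite about five thousand.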
Table 2. H3-like centromeric protein A homologues in different model organisms.

| H3-Like Centromeric Protein A Homologues | Model Organism | Size |
|---|---|---|
| Chromosome segregation 4 (Cse4) | Saccharomyces cerevisiae [28] | ~26 kDa [29] |
| Centromere-specific histone H3 (Cnp1) | Schizosaccharomyces pombe [30] | ~13 kDa [31] |
| Centromere identifier (Cid) | Drosophila melanogaster [32] | ~25 kDa [33] |
| Centromeric histone 3 (CenH3) | Arabidopsis thaliana [34] | ~19 kDa [35] |
| Histone H3-like centromeric protein (HCP-3) | Caenorhabditis elegans [36] | ~32 kDa [37] |
| Centromeric protein A (Cenpa) | Mus musculus [38] | ~15 kDa [39] |
| Centromeric protein A (CENP-A) | Homo sapiens [40,41] | ~15 kDa [42] |

MDPI and ACS Style

Balzano, E.; Giunta, S. Centromeres under Pressure: Evolutionary Innovation in Conflict with Conserved Function. Genes 2020, 11, 912. https://doi.org/10.3390/genes11080912
