Next Article in Journal
Differential Alternative Splicing Genes in Response to Boron Deficiency in Brassica napus
Next Article in Special Issue
Does the Presence of Transposable Elements Impact the Epigenetic Environment of Human Duplicated Genes?
Previous Article in Journal
The Genetic Variability of APOE in Different Human Populations and Its Implications for Longevity
Previous Article in Special Issue
The Integrity of piRNA Clusters is Abolished by Insulators in the Drosophila Germline
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Centromere Repeats: Hidden Gems of the Genome

by
Gabrielle Hartley
1 and
Rachel J. O’Neill
2,*
1
Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269, USA
2
Department of Molecular and Cell Biology and Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
*
Author to whom correspondence should be addressed.
Genes 2019, 10(3), 223; https://doi.org/10.3390/genes10030223
Submission received: 8 February 2019 / Revised: 7 March 2019 / Accepted: 11 March 2019 / Published: 16 March 2019
(This article belongs to the Special Issue Repetitive DNA Sequences)

Abstract

:
Satellite DNAs are now regarded as powerful and active contributors to genomic and chromosomal evolution. Paired with mobile transposable elements, these repetitive sequences provide a dynamic mechanism through which novel karyotypic modifications and chromosomal rearrangements may occur. In this review, we discuss the regulatory activity of satellite DNA and their neighboring transposable elements in a chromosomal context with a particular emphasis on the integral role of both in centromere function. In addition, we discuss the varied mechanisms by which centromeric repeats have endured evolutionary processes, producing a novel, species-specific centromeric landscape despite sharing a ubiquitously conserved function. Finally, we highlight the role these repetitive elements play in the establishment and functionality of de novo centromeres and chromosomal breakpoints that underpin karyotypic variation. By emphasizing these unique activities of satellite DNAs and transposable elements, we hope to disparage the conventional exemplification of repetitive DNA in the historically-associated context of ‘junk’.

1. Introduction

Specific types of repetitive segments within eukaryotic genomes are now recognized as critical to maintaining subspecialized genomic functions. Common elements within repetitive segments include both transposable elements (TEs) and satellite DNA [1], collectively representing a large portion of eukaryotic genomes [2,3]. Unlike TEs that are capable of moving within a genome and thus are often found dispersed (albeit not randomly; reviewed in [4]), satellite DNA consists of short stationary DNA sequences that tandemly repeat to form a larger array, often restricted to specific sub-regions of chromosomes [1,5]. Ranging from just a few base pairs to several megabases in length, satellite repetitive units comprise up to 10% of the human genome [6]; across eukaryotes, variation in copy number and satellite family diversity contributes to differences in total satellite DNA content among taxa, often with dramatic total satellite content differentials [5]. Despite the high degree of variation among species in both sequence diversity and overall content, satellite DNAs are collectively found most highly concentrated in the centromeric and pericentromeric regions of chromosomes [7]. While the exact functions of satellite DNA have not been fully realized, this incommensurate distribution of satellite DNA within the genome highlights the importance of satellite DNA in chromosome inheritance through participation in centromere function.
First described in the context of DNA content in eukaryotes by Kit et al. [8] and Seuoka et al. [9] in 1961, satellite DNA was discovered via ultracentrifugation of genomic DNA—Note: the first use of the term satellite as a genetic descriptor is attributed to Sergius Navashin in his 1912 study of secondary constrictions on the chromosomes of a hyacinth [10]. Following the centrifugation of DNA from animal tissue extracts across a cesium chloride density gradient, Kit et al. [8] described a satellite band that was clearly differentiated from the major band of genomic DNA. Due to the repetitive nature of the DNA within this band, this fraction displayed an observable shift in density and led to the first description of satellite DNA. Despite this traditional descriptor, the phrase satellite DNA has been used more broadly to describe all tandemly arranged repetitive DNA sequences [11] irrespective of their resolution on density gradients. Since their discovery, a number of different methods have been used to characterize tandem repeats, including C0T analysis, in which the rate of re-association of complementary DNA strands is used to identify the frequency of repetitive elements [12], and separation following restriction endonuclease treatment, in which digested genomic DNA is separated via electrophoresis on an agarose gel [13]. Modern molecular techniques including next-generation sequencing (NGS) and fluorescence in situ hybridization (FISH) have provided additional clarity in the identification and physical characterization of satellite DNA sequences. This review includes emerging discoveries about satellite array characteristics and the other types of repeats found within, model systems proven useful for studying the role of satellite DNA in genome evolution, and the intimate relationship between satellite DNA and TEs. In addition, this review examines the paradoxical link between divergent satellite DNA and conserved centromere function as well as the connection between repeats and the emergence of new centromeres during chromosome evolution.

2. A Brief Primer on Satellite DNA in a Chromosomal Context

While the term satellite DNA encompasses all tandem nucleotide repeats, this large category can be further divided into a number of different subcategories and families. In addition to larger tandem repeats, one such grouping of smaller repeats can be created based on the number of nucleotides existing in the core repetitive segment. Microsatellites, for example [14], consist of repeating units less than 10 nucleotides in length and constitute up to 3% of the human genome [15]. Minisatellites, often referred to as variable number tandem repeats (VNTRs) [16], consist of a 10 to 100 nucleotide unit repeating up to several hundred times. With several thousand minisatellite loci distributed throughout the human genome [17], minisatellites are found at a high frequency in telomeric regions [18]. Telomeres are also enriched for a specific microsatellite, (TTAGGG)n, which constitutes the bulk of telomeric sequences, extending for 9–15 kb on human chromosomes [19,20]. Nucleoproteins (TRF1, TRF2, and POT1) bind to these telomeric satellites to form the shelterin complex [21], which interacts with the ribonucleoprotein telomerase that contains the enzyme component telomerase reverse transcriptase (TERT) [22], and an RNA (TERRA) [23]. The resulting ‘cap’ distinguishes chromosome ends from DNA breaks requiring repair and thus protects the chromosome from end-degradation and interchromosomal fusions [24].
Perhaps the most notable satellite families in the human genome are those located at both pericentromeric and centromeric regions: α satellites. α satellites, found ubiquitously at all human centromeres, are a ~171 base pair unit, known as a monomer, with sequences that are 50–80% identical among all monomers within an array (repeated monomers in tandem) [25]. The core of the centromere, where the kinetochore will form and mediate microtubule attachment and faithful chromosome segregation, is functionally defined by the assembly of centromeric nucleosomes containing the centromere-specific histone 3, CENP-A [26]. In humans, this core is enriched for α satellite DNA [25]. While found as solo repetitive units scattered among other satellites in the pericentromeric regions of human chromosomes without higher organization, α satellites within human centromeres are tandemly repeated to form a block of satellites, called a higher order repeat (HOR). HORs are comprised of a set number of monomers that varies from 2 to 34 monomers ([25,27,28,29,30] and reviewed in [31]) in a largely chromosome-specific arrangement (Figure 1a). For example, the α satellite HOR blocks on chromosome 1 consist of 2 monomers [32], referred to as 2-mers, the HOR blocks on chromosome 7 consist of 6-mers [33], and the HOR blocks on the Y chromosome are 34-mers [34]. These HOR blocks are further repeated to form HOR arrays than can span megabases. Because of the highly repetitive nature of these centromeric HOR arrays with identities among HOR blocks nearing 99% in some cases [35] and high HOR copy numbers [36], centromeric regions have historically been refractive to characterization, at least in the context of genome assemblies [6].
Despite challenges associated with characterizing highly repetitive stretches of DNA, groups are uncovering variation in satellite DNA, both within the human reference genome and among different individuals, and identifying the functional consequences of these variants. α satellite monomers are classified into 12 consensus monomers (J1, J2, D1, D2, W1, W2, W3, W4, W5, M1, R1, and R2) [29,30,37,38,39], which are further grouped into five suprachromosomal families (SF1-5) [29,30,40]. A specific strata of satellites within each human chromosome was revealed by fine-scale mapping and sequence annotation of monomers and HORs [30,40], wherein highly homogenized and recently derived monomers are organized into HORs within the functional core of the centromere and the older, divergent monomers are organized further from the core and into the pericentromere (Figure 1b). In other words, the closer a satellite stratum is to the functionally defined core of the centromere, the younger and more homogenized the monomers within those HORs will be. It has thus been proposed that the α satellite strata are a phylogenetic record of the evolution of human centromeres, with the younger and more homogenized monomers closer to the functionally defined core of the centromere and older centromere remnants orbiting the central core, indicating the location and/or abandoned sequence of long-dead centromeres shared with our primate ancestors [40] (Figure 1c).
Once a satellite variant becomes dominant in a species, there is subsequent intrachromosomal homogenization that further distinguishes chromosome-specific arrays. Recent work in humans has also revealed that there is variation of chromosome-specific α satellite arrays among different individuals in the human population [41,42]. Aldrup-MacDonald et al. [41] describe variation within the α satellite DNA arrays of human chromosome 17 first characterized by several groups over the past few decades [40,43,44,45]. At this chromosome, the centromeric region contains three unique α satellite arrays arranged adjacently: D17Z1, D17Z1-B, and D17Z1-C (Figure 2). Among these three arrays, only one acts as the functional centromere and recruits CENP-A histones; thus, multiple, potentially functional arrays on one chromosome are known as epialleles [46]. In roughly 70% of individuals, the centromere is assembled at the 16-mer D17Z1 locus, while the remaining 30% of individuals display differential centromere assembly at the D17Z1 locus of one homolog and the 14-mer D17Z1-B locus of the other [46] (Figure 2a). While the D17Z1-B epiallele can support centromere assembly in human artificial chromosomes, no individual homozygous for this allele has yet been identified. Because of this, it is purported that those homozygous for the D17Z1-B epiallele represent a rare, yet functionally viable, variant in the human population [41]. Similarly, Miga et al. [42] have identified size and sequence satellite array variants on human chromosomes X and Y via their utilization of whole-genome shotgun sequencing in efforts to create centromeric reference models [47]. This ongoing work continues to build upon the foundational understanding of satellite array variation that has been characterized by others [48,49] and suggests that centromeric HOR variants are not a phenomenon exclusive to human chromosome 17.
While the underlying molecular foundation for the formation of centromeric epialleles remains unknown, Aldrup-MacDonald et al. [41] propose, based on their work with somatic cell hybrid lines, that genomic variation of satellite DNA is an influential factor dictating which epiallele will assemble centromeric nucleosomes [46]. Using restriction enzyme digestion and Southern blotting to identify variation in D17Z1, D17Z1-B, and D17Z1-C epialleles, Aldrup-MacDonald et al. [41] determined that larger D17Z1 satellite arrays were more likely to be both homogenous (wild type for the canonical 16-mer HOR) and the active site of centromere assembly. By using cytogenetic techniques like fluorescence in situ hybridization (FISH) and monitoring chromosome stability, it was determined that centromeres assembling at a highly variant D17Z1 array locus (containing a number of different HOR variants) were unstable while those assembling at the D17Z1-B locus remained stable despite D17Z1 variability [41] (Figure 2b). Furthermore, it was determined that these unstable centromeric locations had about half of the amount of centromeric proteins CENP-A and CENP-C present in comparison to stable centromeres [41]. These studies suggest that variant chromosome 17 epialleles do not perform equally and thus highlight the important role variation of satellite DNA might play in the maintenance of proper chromosome segregation.

3. Centromere Repeats Endure Unique Evolutionary Processes

Although the presence of satellite DNA in centromeres is a shared characteristic found among many eukaryotic taxonomic groups, as is the protein cascade required for faithful chromosome segregation mediated by CENP-A, the underlying sequence of this satellite DNA is highly variable and largely species-specific [50,51,52,53,54,55]. Tandemly arrayed satellites within a single chromosome experience high rates of sequence turnover via concerted evolution, a non-independent process of molecular drive [56] (Figure 3a). Several mechanisms have been invoked to explain this observation, including nonhomologous and/or unequal crossing over [57], replication slippage [58], gene conversion [59], and rolling circle amplification and subsequent reinsertion ([60], reviewed in [61]). Such mechanisms impact sequence homogenization across an array as well as variation in overall array length.
While tandemly arrayed sequences are not capable of transposition, a family of arrays appears to spread from one chromosome to another, rendering the centromere repeats of non-homologous chromosomes within a karyotype highly similar and phylogenetically closely related. For example, several pairs of human chromosomes share the same satellite arrays: chromosomes 1, 5, and 19 [62,63], 13 and 21 [64], and 14 and 22 [65]. Interestingly, chromosomes 13 and 21 in the chimpanzee share the same satellite array as is observed on the homologous chromosomes 13 and 21 in humans, but the 13/21 arrays of these two species are not orthologous [65,66], indicating some chromosomes efficiently evoke inter-chromosomal recombination in independent lineages [28]. How this occurs or why this appears restricted to a subset of chromosomes is not known.
Homogenization of arrays is not linked specifically to the presence of tandem repeats. In fact, the only stratum of satellites across the centromere/pericentromere that experience forces of homogenization across an array, and thus carry HORs and high identity repeat units, is that of the recently derived and functional core [40] (Figure 3a). In other words, only the satellites that serve as the foundation for the kinetochore undergo continual homogenization, linking the assembly of the kinetochore to the homogenization process [37], and consequently, rapid evolution. It has been proposed that proteins facilitating homogenization, known as a kinetochore-associated recombination machine (KARM), have become integrated into the kinetochore complex, fostering this core-satellite specific homogenization process [28,40]. One candidate for this machine is topoisomerase II [40], a DNA decatenating enzyme that resides in the kinetochore during mitosis and initiates homologous recombination following the induction of DNA breaks [67].
What is the source material for new satellite arrays that seed within older arrays, eventually pushing them to outer, non-homogenized and highly variable strata? The library hypothesis [68] provides one explanation for how satellite DNA content at the centromere may diverge rapidly among closely related species (Figure 3a). In this scenario, extant but distinct centromeric repeats, representing a satellite library for a species, may independently expand or contract in copy number in different evolutionary lineages (be they chromosomes or species within a complex). If a repeat from this library finds itself in the core of the centromere, associated homogenization and expansion could result in the establishment of what appears to be a new satellite array [69,70,71,72,73,74]. In some cases, the seeding of a centromere from such a library is facilitated by chromosome rearrangement [75,76,77].
Another mechanism proposed to give rise to the variability of satellite sequences in different species is a meiotic drive model, known as centromere drive [78] (Figure 3b). As predicted by this model, satellite arrays attract more microtubules during female meiosis if the arrays experience accretion [79,80]. Preferentially sorted into the egg, these expanded satellite sequences are predicted to promote increased rates of evolution of centromere proteins, particularly CENP-A, which directly interacts with satellite DNA, through genetic conflict. Eventually, these divergent centromere proteins become highly prevalent in the population as they evolve to restore parity in meiosis [52,81]. In fact, this model is supported not only by the rapid evolution and variability of satellites in a variety of species, but by the positive selection of nucleic-acid interacting centromere proteins like CENP-A and CENP-C in plants, primates, and others [82,83,84,85,86]. This model is further supported by evidence that Robertsonian fusions with a single centromere are preferentially segregated due to a higher recruitment of CENP-A, Ncd80, and microtubules than their unfused mates [87]. Heterozygosity for these fusions has been observed to reduce male fertility, creating a selective pressure for the fixation of a new karyotype via a fitness cost ([88], reviewed in [89]).
Despite the ability of the centromere drive model to explain the high variation observed in satellite sequence from one species to the next, this model does not offer a complete mechanism by which chromosomal evolution and karyotypic changes may occur, particularly when considering the circumstances of de novo centromere formation. Described in human patients presenting with an abnormal karyotype (reviewed in [90]), a neocentromere forms on an ectopic site on a chromosome when the original centromere is lost or inactivated, or the entire karyotype is unstable, as in cancer (e.g., [91]) —Note: It has been argued that the term neocentromere is incorrectly used to describe de novo centromeres that are kinetochore-competent [92]. The original use of the term neocentromere is attributed to describe subtelomeric heterochromatin blocks that behave similarly to centromeres but do not build a traditional kinetochore [93]. While stable neocentromeres can be fully functional in kinetochore assembly and thus maintain proper chromosome segregation, most lack the typical satellite DNA characteristic of centromeric regions [94,95,96,97,98,99]. Not only are functional neocentromeres devoid of satellite DNA, but in some cases, the original inactive centromere retains satellite arrays yet they no longer recruit centromere proteins (and thus are rendered non-functional) (reviewed in [90,100]). The identification of functional neocentromeres lacking satellite DNA spawned the prediction that satellite DNA is neither sufficient nor required for centromere function [101], despite its apparent ubiquity across taxonomic groups.
Neocentromeres are not restricted to clinical cases of chromosome instability; shifts in centromere location with no discernable change in intervening gene order distinguish species-specific karyotypes in many eukaryotic taxa. Formerly referred to as centric shifts [102,103], these evolutionary new centromeres (ENCs) [104] (Figure 4) have been characterized in primates, horses, cattle, marsupials, plants, insects, and many other species complexes (see [102,103,104,105,106,107,108] for examples). Moreover, several groups have noted a lack of higher order satellite arrays in newly emerged, functional centromeres, indicating that the formation of homogenized arrays succeeds centromere fixation in a population [109]. It has been proposed that following the fixation of a novel centromere in a species, satellite arrays accumulate to further stabilize the centromere [110,111]. Successive interchromosomal homogenization further support the establishment of large, stable regional centromeres that are rendered species-specific [109,112,113,114,115]. Thus, ENCs accumulate satellite DNA arrays across successive generations as they phylogenetically age, while their immature counterparts lack these types of repetitive sequences (Figure 4).
Not only has it been established that some recently emerged centromeres lack the higher order satellite arrays characteristic of functional centromeres in a wide variety of organisms, but species in the Equus genus carry several centromeres that lack satellite DNA altogether [110,111,116]. Included in those devoid of satellite DNA are ENCs, repositioned to a non-centromeric location following the loss of function at the original centromere [108]. Based on the emerging ENC hypothesis, the recently diverged Equus genus, estimated to share a last common ancestor with other genera just 2–3 million years ago despite considerable karyotypic variation, would be predicted to contain de novo centromeres helping drive karyotypic variation that lack satellite DNA. Immuno-FISH experiments using satellite DNA and antibodies against CENP-A completed by Piras et al. [111] identified both functional centromeres lacking satellite DNA as well as satellite repeats present at non-centromeric locations, suggesting the presence of both immature centromeres and ancestral yet inactive centromeric locations, respectively. The identification of a fixed, satellite-free centromere on chromosome 11 in Equus caballus presented a distinctive opportunity to test whether there was detectable variability in kinetochore assembly localization on an ENC. ChIP-on-chip analyses in five Equus individuals using an antibody against CENP-A revealed at least seven functional centromere epialleles on chromosome 11 dispersed across a region of 500 kb and extending between 80 to 160 kb [117]. The results of these experiments, and recent work in Equus asinus [110], demonstrate significant plasticity in CENP-A binding domains among individuals and suggest the potential for centromeres across mammalian species to positionally ‘slide’, resulting in the formation of variable functional epialleles [110,111].
Genome sequencing efforts have further revealed that many eukaryotic species lack centromeres enriched for satellite arrays. For example, sequencing following chromatin immunoprecipitation with antibodies to centromeric proteins CENP-A and CREST, Johnson et al. [118] report a lack of satellite arrays in the centromeres of the recently characterized koala (Phascolarctos cinereus) genome, an observation also described in gibbon centromeres and suggestive of the recent evolution of new elements associated with centromere function [119]. Furthermore, this observation has also been documented in a number of other species with small centromeres, ranging from plant species like rice [120] and potato [74] to marsupials like the tammar wallaby [121,122], and fungal species such as Candida albicans [123,124]. Taken collectively, new centromere formation is likely not initiated by satellite DNAs; however, satellite DNA is a shared feature of regional centromeres and thus likely promotes their stability. While the introduction of α satellite arrays in human cells can result in the formation of a functional neocentromere, supporting the proposal that satellite DNA is foundational to centromere activity [125,126], the seeding of new ectopic neocentromeres appears to occur in the absence of satellite DNA.

4. Satellites and Their Party Friends—Transposable Elements

While satellite DNA is pervasive in the stable, regional centromeres of many species, another class of repetitive element is found within satellite-rich centromeres, ENCs, and neocentromeres: TEs. TEs are repetitive sequences that are able to alter their location in the genome and thus are often considered selfish elements [1,127,128]. Originally characterized by cytogeneticist Barbara McClintock [129], transposable elements can be divided into two categories based on mobility; transposons alter their position directly via a cut and paste mechanism, while retrotransposons move via a copy and paste mechanism through which an RNA intermediate is first created before being reverse transcribed into an identical DNA sequence inserted at a particular genomic locus [130,131].
Transposons moving via a cut and paste mechanism, also called type II transposable elements, require a self-encoded enzyme, transposase, in order to move from one locus to another [130,131]. The transposon, flanked by terminal inverted repeats, is recognized by transposase which removes the transposon before reintegrating it at a target location. The gap left behind by transposon excision can be repaired either with, or without, the addition of a replacement transposon. Dissimilarly, retrotransposons, also called type I transposable elements, rely on the transcription of an RNA intermediate as part of their transposition [130,131]. Following transcription, retrotransposon RNA intermediates are reverse transcribed into identical DNA sequences and integrated into a target locus [130,131]. Unlike transposase-mediated mobility, the number of retrotransposons present in a genome increases in number each time they undergo transposition.
Like satellite DNA, transposable elements form a significant portion of eukaryotic genomes. In fact, due to the ability for many subfamilies to multiply during retrotransposition, TEs can occupy a significant majority of eukaryotic genomes [132,133,134], constituting up to 85% of the maize genome [134] and nearly 50% of the human genome [135]. Historically believed to simply self-propagate, it is now understood that these elements not only comprise a bulk of eukaryotic DNA but also contribute significantly to a wide range of regulatory functions within a genome. Unsilenced TEs have been observed to contain cis-regulatory sequences that, due to their motility, have been dispersed broadly throughout the human genome [136,137]. These cis-regulatory elements have been shown by several groups to act as promoters, enhancers, and repressors of transcription [138,139,140,141,142]. Using human and mouse cell lines, Sundaram et al. [136] found that 20% of transcription factor binding sites were embedded within transposable elements. Similarly, Cao et al. [142] identified widespread enhancer-like repeats throughout the human genome, many of which were enriched in the mammalian-wide interspersed repeat (MIR) family of short interspersed nuclear elements (SINEs) and the L2 family of long interspersed nuclear elements (LINEs). Moreover, Makarevitch et al. [143] suggest the potential for TEs to provide a mechanism for the upregulation of particular genetic transcripts following abiotic stress in maize via their enhancer-like activity. These studies represent just a fraction of the mounting evidence suggesting that TEs provide necessary regulatory functions within a genome (e.g., see [4,144,145] for reviews).
Despite the high frequency of transposable elements in both human and other eukaryotic genomes, the majority of transposable elements are not actively moving from one genomic locus to another. While mutations have rendered many transposable elements inactive, some have been epigenetically silenced through various mechanisms, such as post transcriptional modifications via RNA interference, DNA and chromatin modifications, and germline silencing. Epigenetic silencing prevents TEs from producing the proteins required for mobility despite a lack of change to the underlying DNA sequence (reviewed in [109,146,147]).
While satellite DNA is characteristic of centromeres across eukaryotic organisms, the surrounding regions of pericentric heterochromatin are often enriched in TE content. For example, while human centromere cores are enriched for tandem repeat stretches of α satellite DNA, the surrounding heterochromatin regions consist of shorter satellites (e.g., satellites I and II) and primarily two different types of retrotransposons: LINEs and SINEs. Emerging models of centromeric contigs have shown that TE insertions are also found within HOR arrays of the centromere core of all human chromosomes [148,149,150]. This characteristic, coupled with the observations that TEs are often found at both neocentromeres [99,151,152] and ENCs that are devoid of any satellite content [117,118,153,154], suggests a potential role for TEs in centromere function independent of resident satellite DNA.
While the exact role TEs play in centromere function is not currently known, several features of centromeric TEs have been revealed. For example, epigenetic silencing of transposable elements appears to be critical in maintaining proper centromere function and chromosome segregation [147,155,156]. In mice, activating regularly silenced long terminal repeat (LTR) and non-LTR retrotransposons at centromeric regions has led to defects in both meiosis and chromosomal segregation, suggesting the necessity of epigenetically silent transposable elements for appropriate centromere function [156]. Undermethylation of centromeric retroelements in interspecific hybrids led to centromere destabilization and chromosome instability [157,158], indicating that tight regulation of TE activity underlies centromere stability. Moreover, studies have suggested a link between centromeric retrotransposons and the silencing of satellite DNA in the centromere, as well as a link between satellite DNA and the silencing of retrotransposons. May et al. [159] describe this relationship in Arabidopsis thaliana, in which satellite-derived transcripts are epigenetically silenced in part due to the insertion of transposable elements. Phylogenetic analyses and TE annotations have led to the observation that species-specific [118] and recently active [160] or hot TEs [152,161] are often the type of element found within centromere cores, while divergent and ancestral TEs are relegated to the hypermutated satellites [30,40,149] found in the outer strata of the centromere and pericentromere (Figure 1a,b).
It has been suggested that a close evolutionary relationship exists between centromeric TEs and the birth of new satellite families (Figure 3a). In the plant species Aegilops speltoides, a 250 base pair repeat satellite array family is present at centromeres [162]. While not identical to that of a transposable element, this satellite DNA sequence shares high similarity to portions of a transposable element: Ty3/gypsy-like retrotransposons. Furthermore, this phenomenon has been observed in other model species as well, including members of the Arabidopsis [163], Drosophila [164], and Cetacean [165] genera. A recent study observed that tandem dimers of TEs form during bursts of TE activity and may serve as fodder for the evolution of satellite arrays, as was found for the hobo element in Drosophila [166]. It has been proposed that large-scale mutations, insertions and deletions within centromeric TEs followed by unequal crossing over or even seeding across chromosomes, may give rise to novel tandem repetitive elements found highly enriched at centromeres [112,167,168,169,170]. These processes are thought to act as part of the host-defense mechanisms to inactivate mobile elements ([171,172] but see [173]) or prevent non-allelic homologous recombination ([174,175] and reviewed in [109,176,177]).
Within plants, allopolyploidy presents a unique opportunity for the evolution of centromeric sequences from resident TEs. Following allopolyploidization, and during the genomic instability that ensues, centromeric TEs from the different progenitor genomes may become activated [178]. Evidence has been found in Gossypium (cottons) that such activation likely occurred, resulting in the integration of TEs from one genome into another, and subsequent proliferation among centromeres [179]. This activity, coupled with the exposure of new genomic material in the polyploid state, provides an opportunity for competition among multiple, newly emerged centromere repeats and the possible selection for repeats that are more conducive to supporting centromere nucleosome structure [180].

5. Transcription in the Centromere—Let’s Get the Party Started!

While a function for satellite DNAs in kinetochore assembly and/or stability has been inferred since their discovery (e.g., [181,182,183]), a common misconception has been that these sequences were not actively transcribed into RNAs (but see [184] and references therein from the 1960s). Undoubtedly, the discovery that satellite DNAs are transcriptionally viable has led to a shift in how we view centromeric chromatin [185,186]. Soon after the discovery of satellite DNA in cesium gradients, electron microscopy revealed RNA at plant and animal centromeres [187,188], although satellites themselves were not directly linked to active transcription. Furthermore, examination of the linear organization of histones within centromeres using chromatin fiber FISH revealed that CENP-A nucleosome domains were interrupted by nucleosomes containing H3K4me2 [189,190] and H3K36me2 [191], epigenetic marks of active transcription. Using chromatin immunoprecipitation, Choi et al. reported the detection of RNA polymerase II at centromeres in the fission yeast Schizosaccharomyces pombe [192]. Further analyses have identified the presence of RNA polymerase II at centromeres in humans [193], flies [194,195], and budding yeast [196], among others. The presence of RNA polymerase II at these sites suggested active transcription occurring from the DNA present at centromeres: satellites and TEs.
Transcripts originating from centromeric satellite DNA and TEs have now been observed in a variety of species across eukaryotes [122,159,186,194,196,197,198,199,200,201,202,203,204], while some species, such as S. pombe, exhibit transcription of boundary elements (e.g., tRNAs [205]). Thus, centromeric RNAs are a conserved component of the centromere, despite a lack of sequence conservation across these regions. Recent work implicates centromere transcription as integral to centromere function, impacting the pivotal event in centromere assembly: the loading of newly synthesized CENP-A histones. For example, in budding yeast [196] and human artificial chromosomes [191,206,207], transcriptional silencing of centromeric DNA has been shown to lead to a failure to maintain proper centromere function. In human artificial chromosomes, this malfunction was attributed to the inability to load new CENP-A during mitosis to G1. Conversely, upregulation of satellite transcripts is also detrimental to centromere function, causing the removal of the CENP-A histone variant [196,200,207,208] as well as cellular instability [196,200,209,210,211,212,213]. Intriguingly, several proteins involved in the kinetochore assembly cascade are either RNA binding proteins or have been demonstrated to associate with RNAs in a complex, including CENP-A [199], CENP-C [202,214], and KNL2/M18BP1 [215]. While the transcriptional framework underlying centromere assembly is not fully understood (but see [109,186]), several mechanisms have been proposed that can promote transcription within regional centromeres. Early work in plants [199] and marsupials [122] supported the hypothesis that centromeric TEs promote transcription, and their ability to transcribe neighboring satellites is implied by the presence of bi-directional promoters within these TEs [200,216,217,218]. More recently, it has been hypothesized that non-B form DNA facilitated by dyad symmetries and CENP-B binding within centromeres may facilitate transcription [219]. While there is a clear connection between transcription and centromere nucleosome assembly [194,195,199,200,220,221], how and when this occurs during the cell cycle remains elusive.
Transcription has also been linked to the emergence of new centromeres. In a human neocentromere case, a L1 was found transcribed and actively demarcating the CENP-A domain of the new centromere [152,222]. Given earlier work demonstrating that demethylation of centromeric TEs led to increased activity [158], release of ectopic TEs from a silenced state may facilitate their transcription and subsequent recruitment of CENP-A nucleosomes, leading to the rescue of acentric chromosome fragments following the inactivation of the native centromere. How an ectopic site becomes activated, enabling the recruitment of CENP-A nucleosomes in the absence of chromosome damage, as is implied by centromere repositioning events, is unknown. If multiple inversion events, insertion events by active TEs, or simply deletions of part of an HOR array lead to the interruption of the native satellite array, destabilization of the kinetochore assembly cascade may follow, necessitating a rescue centromere elsewhere on the chromosome. Perhaps the most recent TE insertions in a genome allow ectopic centromere formation as such elements have yet to experience silencing by host defense mechanisms. Under this model, some mechanism must prevent the activation of ectopic centromeres at these hot elements when native centromeres are still functional to prevent the formation of dicentric chromosomes and subsequent breakage-fusion-bridge cycles [223,224,225].

6. Conclusions

The influence of repeated DNAs on eukaryotic genomes is often presented in the framework of the logical fallacy that repeated DNA should no longer be considered inconsequential ‘junk DNA’. Contextualizing repeated DNAs under such as false descriptor, even when presented as an oft challenged and subsequently defeated cliché, undercuts not only the long-standing validity of studying repeated DNAs, but the growing impact the field of repeat DNA biology has had on our understanding of eukaryotic genome biology and evolution. The repeats found at centromeres are an excellent case in point. There is little doubt that centromeric repeats, including both satellites and TEs, are integral to centromere function and stability as well as the evolution of novel karyotypes. The models discussed herein are not all-inclusive yet demonstrate the unique processes that have allowed for significant species-specific variation among repetitive DNAs despite a simultaneously foundational role in genome stability and regulation. As we gain an understanding of the evolutionary forces that influence the constitution of centromeric DNA, we can start to unravel the impact centromeric sequences have on both maintaining chromosome stability within a species and karyotypic change during species evolution.

Author Contributions

Contributions to this review are as follows: conceptualization, writing and editing, G.H. and R.O.; funding acquisition, R.O.

Funding

This research was funded by the National Science Foundation, grant number 1613806.

Acknowledgments

The authors thank Kate Castellano, Judy Brown, and Mike O’Neill for critical comments on the manuscript. Chromosome ideogram schematic in Figure 1 and Figure 2 made by Mysid, based on http://ghr.nlm.nih.gov/.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Biscotti, M.A.; Olmo, E.; Heslop-Harrison, J.S. Repetitive DNA in eukaryotic genomes. Chromosome Res. 2015, 23, 415–420. [Google Scholar] [CrossRef] [PubMed]
  2. Charlesworth, B.; Sniegowski, P.; Stephan, W. The evolutionary dynamics of repetitive DNA in eukaryotes. Nature 1994, 371, 215–220. [Google Scholar] [CrossRef] [PubMed]
  3. Lopez-Flores, I.; Garrido-Ramos, M.A. The repetitive DNA content of eukaryotic genomes. Genome Dyn. 2012, 7, 1–28. [Google Scholar] [CrossRef] [PubMed]
  4. Bourque, G.; Burns, K.H.; Gehring, M.; Gorbunova, V.; Seluanov, A.; Hammell, M.; Imbeault, M.; Izsvak, Z.; Levin, H.L.; Macfarlan, T.S.; et al. Ten things you should know about transposable elements. Genome Biol. 2018, 19, 199. [Google Scholar] [CrossRef] [PubMed]
  5. Garrido-Ramos, M.A. Satellite DNA: An Evolving Topic. Genes 2017, 8, 230. [Google Scholar] [CrossRef] [PubMed]
  6. Sullivan, L.L.; Chew, K.; Sullivan, B.A. α satellite DNA variation and function of the human centromere. Nucleus 2017, 8, 331–339. [Google Scholar] [CrossRef] [Green Version]
  7. Jagannathan, M.; Cummings, R.; Yamashita, Y.M. A conserved function for pericentromeric satellite DNA. eLife 2018, 7, e34122. [Google Scholar] [CrossRef] [Green Version]
  8. Kit, S. Equilibrium sedimentation in density gradients of DNA preparations from animal tissues. J. Mol. Biol. 1961, 3, 711–716. [Google Scholar] [CrossRef]
  9. Sueoka, N. Variation and heterogeneity of base composition of deoxyribonucleic acids: A compilation of old and new data. J. Mol. Biol. 1961, 3, 31–40. [Google Scholar] [CrossRef]
  10. Navashin, S. On the nuclear dimorphism in somatic cells of Galtonia candicans. Bull. Acad. Imp. Sci 1912, 6, 375–385. [Google Scholar]
  11. Singer, M.F. Highly repeated sequences in mammalian genomes. Int. Rev. Cytol. 1982, 76, 67–112. [Google Scholar] [PubMed]
  12. Waring, M.; Britten, R.J. Nucleotide sequence repetition: A rapidly reassociating fraction of mouse DNA. Science 1966, 154, 791–794. [Google Scholar] [CrossRef]
  13. Horz, W.; Zachau, H.G. Characterization of distinct segments in mouse satellite DNA by restriction nucleases. Eur. J. Biochem. 1977, 73, 383–392. [Google Scholar] [CrossRef] [PubMed]
  14. Vieira, M.L.C.; Santini, L.; Diniz, A.L.; Munhoz Cde, F. Microsatellite markers: What they mean and why they are so useful. Genet. Mol. Biol. 2016, 39, 312–328. [Google Scholar] [CrossRef] [PubMed]
  15. Subramanian, S.; Mishra, R.K.; Singh, L. Genome-wide analysis of microsatellite repeats in humans: Their abundance and density in specific genomic regions. Genome Biol. 2003, 4, R13. [Google Scholar] [CrossRef] [PubMed]
  16. Ramel, C. Mini- and microsatellites. Environ Health Perspect 1997, 105, 781–789. [Google Scholar] [PubMed]
  17. Naslund, K.; Saetre, P.; von Salome, J.; Bergstrom, T.F.; Jareborg, N.; Jazin, E. Genome-wide prediction of human VNTRs. Genomics 2005, 85, 24–35. [Google Scholar] [CrossRef] [PubMed]
  18. Vergnaud, G.; Gauguier, D.; Schott, J.J.; Lepetit, D.; Lauthier, V.; Mariat, D.; Buard, J. Detection, cloning, and distribution of minisatellites in some mammalian genomes. EXS 1993, 67, 47–57. [Google Scholar]
  19. O’Sullivan, R.J.; Karlseder, J. Telomeres: Protecting chromosomes against genome instability. Nat. Rev. Mol. Cell Biol. 2010, 11, 171. [Google Scholar] [CrossRef]
  20. Moyzis, R.K.; Buckingham, J.M.; Cram, L.S.; Dani, M.; Deaven, L.L.; Jones, M.D.; Meyne, J.; Ratliff, R.L.; Wu, J.R. A highly conserved repetitive DNA sequence, (TTAGGG)n, present at the telomeres of human chromosomes. Proc. Natl. Acad. Sci. USA 1988, 85, 6622–6626. [Google Scholar] [CrossRef]
  21. Bandaria, J.N.; Qin, P.; Berk, V.; Chu, S.; Yildiz, A. Shelterin protects chromosome ends by compacting telomeric chromatin. Cell 2016, 164, 735–746. [Google Scholar] [CrossRef]
  22. Wyatt, H.D.; West, S.C.; Beattie, T.L. InTERTpreting telomerase structure and function. Nucleic Acids Res. 2010, 38, 5609–5622. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Cusanelli, E.; Chartrand, P. Telomeric repeat-containing RNA TERRA: A noncoding RNA connecting telomere biology to genome integrity. Front. Genet. 2015, 6, 143. [Google Scholar] [CrossRef]
  24. Maddar, H.; Ratzkovsky, N.; Krauskopf, A. Role for telomere cap structure in meiosis. Mol. Biol. Cell. 2001, 12, 3191–3203. [Google Scholar] [CrossRef] [PubMed]
  25. Willard, H.F. Chromosome-specific organization of human α satellite DNA. Am. J. Hum. Genet. 1985, 37, 524–532. [Google Scholar] [PubMed]
  26. Van Hooser, A.A.; Ouspenski, I.I.; Gregson, H.C.; Starr, D.A.; Yen, T.J.; Goldberg, M.L.; Yokomori, K.; Earnshaw, W.C.; Sullivan, K.F.; Brinkley, B.R. Specification of kinetochore-forming chromatin by the histone H3 variant CENP-A. J. Cell Sci. 2001, 114, 3529–3542. [Google Scholar] [PubMed]
  27. Willard, H.F.; Waye, J.S.; Skolnick, M.H.; Schwartz, C.E.; Powers, V.E.; England, S.B. Detection of restriction fragment length polymorphisms at the centromeres of human chromosomes by using chromosome-specific α satellite DNA probes: Implications for development of centromere-based genetic linkage maps. Proc. Natl. Acad. Sci. USA 1986, 83, 5611–5615. [Google Scholar] [CrossRef]
  28. Alexandrov, I.; Kazakov, A.; Tumeneva, I.; Shepelev, V.; Yurov, Y. α-satellite DNA of primates: Old and new families. Chromosoma 2001, 110, 253–266. [Google Scholar] [CrossRef] [PubMed]
  29. Alexandrov, I.A.; Medvedev, L.I.; Mashkova, T.D.; Kisselev, L.L.; Romanova, L.Y.; Yurov, Y.B. Definition of a new α satellite suprachromosomal family characterized by monomeric organization. Nucleic Acids Res. 1993, 21, 2209–2215. [Google Scholar] [CrossRef] [PubMed]
  30. Shepelev, V.A.; Uralsky, L.I.; Alexandrov, A.A.; Yurov, Y.B.; Rogaev, E.I.; Alexandrov, I.A. Annotation of suprachromosomal families reveals uncommon types of α satellite organization in pericentromeric regions of hg38 human genome assembly. Genom. Data 2015, 5, 139–146. [Google Scholar] [CrossRef]
  31. McNulty, S.M.; Sullivan, B.A. α satellite DNA biology: Finding function in the recesses of the genome. Chromosome Res 2018, 26, 115–138. [Google Scholar] [CrossRef] [PubMed]
  32. Carine, K.; Jacquemin-Sablon, A.; Waltzer, E.; Mascarello, J.; Scheffler, I.E. Molecular characterization of human minichromosomes with centromere from chromosome 1 in human-hamster hybrid cells. Somat. Cell Mol. Genet. 1989, 15, 445–460. [Google Scholar] [CrossRef] [PubMed]
  33. Waye, J.S.; England, S.B.; Willard, H.F. Genomic organization of α satellite DNA on human chromosome 7: Evidence for two distinct alphoid domains on a single chromosome. Mol. Cell. Biol. 1987, 7, 349–356. [Google Scholar] [CrossRef]
  34. Tyler-Smith, C.; Brown, W.R. Structure of the major block of alphoid satellite DNA on the human Y chromosome. J. Mol. Biol. 1987, 195, 457–470. [Google Scholar] [CrossRef]
  35. Roizès, G. Human centromeric alphoid domains are periodically homogenized so that they vary substantially between homologues. Mechanism and implications for centromere functioning. Nucleic Acids Res. 2006, 34, 1912–1924. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Aldrup-Macdonald, M.E.; Sullivan, B.A. The past, present, and future of human centromere genomics. Genes 2014, 5, 33–50. [Google Scholar] [CrossRef]
  37. Alexandrov, I.A.; Mashkova, T.D.; Akopian, T.A.; Medvedev, L.I.; Kisselev, L.L.; Mitkevich, S.P.; Yurov, Y.B. Chromosome-specific α satellites: Two distinct families on human chromosome 18. Genomics 1991, 11, 15–23. [Google Scholar] [CrossRef]
  38. Alexandrov, I.A.; Mitkevich, S.P.; Yurov, Y.B. The phylogeny of human chromosome specific α satellites. Chromosoma 1988, 96, 443–453. [Google Scholar] [CrossRef]
  39. Rosandic, M.; Paar, V.; Basar, I.; Gluncic, M.; Pavin, N.; Pilas, I. CENP-B box and pJalpha sequence distribution in human α satellite higher-order repeats (HOR). Chromosome Res 2006, 14, 735–753. [Google Scholar] [CrossRef]
  40. Shepelev, V.A.; Alexandrov, A.A.; Yurov, Y.B.; Alexandrov, I.A. The evolutionary origin of man can be traced in the layers of defunct ancestral α satellites flanking the active centromeres of human chromosomes. PLoS Genet. 2009, 5, e1000641. [Google Scholar] [CrossRef]
  41. Aldrup-MacDonald, M.E.; Kuo, M.E.; Sullivan, L.L.; Chew, K.; Sullivan, B.A. Genomic variation within α satellite DNA influences centromere location on human chromosomes with metastable epialleles. Genome Res. 2016, 26, 1301–1311. [Google Scholar] [CrossRef] [PubMed]
  42. Miga, K.H.; Newton, Y.; Jain, M.; Altemose, N.; Willard, H.F.; Kent, W.J. Centromere reference models for human chromosomes X and Y satellite arrays. Genome Res. 2014, 24, 697–707. [Google Scholar] [CrossRef] [PubMed]
  43. Waye, J.S.; Willard, H.F. Structure, organization, and sequence of α satellite DNA from human chromosome 17: Evidence for evolution by unequal crossing-over and an ancestral pentamer repeat shared with the human X chromosome. Mol. Cell. Biol. 1986, 6, 3156–3165. [Google Scholar] [CrossRef]
  44. Rudd, M.K.; Willard, H.F. Analysis of the centromeric regions of the human genome assembly. Trends Genet. 2004, 20, 529–533. [Google Scholar] [CrossRef]
  45. Warburton, P.E.; Willard, H.F. Interhomologue sequence variation of α satellite DNA from human chromosome 17: Evidence for concerted evolution along haplotypic lineages. J. Mol. Evol. 1995, 41, 1006–1015. [Google Scholar] [CrossRef] [PubMed]
  46. Maloney, K.A.; Sullivan, L.L.; Matheny, J.E.; Strome, E.D.; Merrett, S.L.; Ferris, A.; Sullivan, B.A. Functional epialleles at an endogenous human centromere. Proc. Natl. Acad. Sci. USA 2012, 109, 13704–13709. [Google Scholar] [CrossRef] [Green Version]
  47. Jain, M.; Olsen, H.E.; Turner, D.J.; Stoddart, D.; Bulazel, K.V.; Paten, B.; Haussler, D.; Willard, H.F.; Akeson, M.; Miga, K.H. Linear assembly of a human centromere on the Y chromosome. Nat. Biotechnol. 2018, 36, 321–323. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Durfy, S.J.; Willard, H.F. Molecular analysis of a polymorphic domain of α satellite from the human X chromosome. Am. J. Hum. Genet. 1987, 41, 391–401. [Google Scholar] [PubMed]
  49. Schindelhauer, D.; Schwarz, T. Evidence for a fast, intrachromosomal conversion mechanism from mapping of nucleotide variants within a homogeneous α-satellite DNA array. Genome Res. 2002, 12, 1815–1826. [Google Scholar] [CrossRef]
  50. Alkan, C.; Cardone, M.F.; Catacchio, C.R.; Antonacci, F.; O’Brien, S.J.; Ryder, O.A.; Purgato, S.; Zoli, M.; Della Valle, G.; Eichler, E.E.; et al. Genome-wide characterization of centromeric satellites from multiple mammalian genomes. Genome Res. 2011, 21, 137–145. [Google Scholar] [CrossRef]
  51. Henikoff, S.; Ahmad, K.; Malik, H. The centromere paradox: Stable inheritance with rapidly evolving DNA. Science 2001, 293, 1098–1102. [Google Scholar] [CrossRef] [PubMed]
  52. Malik, H.S.; Henikoff, S. Conflict begets complexity: The evolution of centromeres. Curr. Opin. Genet. Dev. 2002, 12, 711–718. [Google Scholar] [CrossRef]
  53. Melters, D.P.; Bradnam, K.R.; Young, H.A.; Telis, N.; May, M.R.; Ruby, J.G.; Sebra, R.; Peluso, P.; Eid, J.; Rank, D.; et al. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution. Genome Biol. 2013, 14, R10. [Google Scholar] [CrossRef]
  54. Plohl, M.; Mestrovic, N.; Mravinac, B. Centromere identity from the DNA point of view. Chromosoma 2014, 123, 313–325. [Google Scholar] [CrossRef] [Green Version]
  55. Plohl, M.; Mestrovic, N.; Mravinac, B. Satellite DNA evolution. Genome Dyn. 2012, 7, 126–152. [Google Scholar] [CrossRef] [PubMed]
  56. Dover, G. Molecular drive: A cohesive mode of species evolution. Nature 1982, 299, 111–117. [Google Scholar] [CrossRef] [PubMed]
  57. Smith, G.P. Evolution of repeated DNA sequences by unequal crossover. Science 1976, 191, 528–535. [Google Scholar] [CrossRef]
  58. Walsh, J.B. Persistence of tandem arrays: Implications for satellite and simple-sequence DNAs. Genetics 1987, 115, 553–567. [Google Scholar]
  59. Shi, J.; Wolf, S.E.; Burke, J.M.; Presting, G.G.; Ross-Ibarra, J.; Dawe, R.K. Widespread gene conversion in centromere cores. PLoS Biol. 2010, 8, e1000327. [Google Scholar] [CrossRef]
  60. Bertelsen, A.H.; Humayun, M.Z.; Karfopoulos, S.G.; Rush, M.G. Molecular characterization of small polydisperse circular deoxyribonucleic acid from an African green monkey cell line. Biochemistry 1982, 21, 2076–2085. [Google Scholar] [CrossRef]
  61. Gaubatz, J.W. Extrachromosomal circular DNAs and genomic sequence plasticity in eukaryotic cells. Mutat. Res. 1990, 237, 271–292. [Google Scholar] [CrossRef]
  62. Baldini, A.; Smith, D.I.; Rocchi, M.; Miller, O.J.; Miller, D.A. A human alphoid DNA clone from the EcoRI dimeric family: Genomic and internal organization and chromosomal assignment. Genomics 1989, 5, 822–828. [Google Scholar] [CrossRef]
  63. Pironon, N.; Puechberty, J.; Roizès, G. Molecular and evolutionary characteristics of the fraction of human α satellite DNA associated with CENP-A at the centromeres of chromosomes 1, 5, 19, and 21. BMC Genom. 2010, 11, 195. [Google Scholar] [CrossRef]
  64. Greig, G.M.; Warburton, P.E.; Willard, H.F. Organization and evolution of an α satellite DNA subset shared by human chromosomes 13 and 21. J. Mol. Evol. 1993, 37, 464–475. [Google Scholar] [CrossRef] [PubMed]
  65. Jorgensen, A.L.; Kolvraa, S.; Jones, C.; Bak, A.L. A subfamily of alphoid repetitive DNA shared by the NOR-bearing human chromosomes 14 and 22. Genomics 1988, 3, 100–109. [Google Scholar] [CrossRef]
  66. Jorgensen, A.L.; Laursen, H.B.; Jones, C.; Bak, A.L. Evolutionarily different alphoid repeat DNA on homologous chromosomes in human and chimpanzee. Proc. Natl. Acad. Sci. USA 1992, 89, 3310–3314. [Google Scholar] [CrossRef]
  67. Sabourin, M.; Nitiss, J.L.; Nitiss, K.C.; Tatebayashi, K.; Ikeda, H.; Osheroff, N. Yeast recombination pathways triggered by topoisomerase II-mediated DNA breaks. Nucleic Acids Res. 2003, 31, 4373–4384. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  68. Salser, W.; Bowen, S.; Browne, D.; el-Adli, F.; Fedoroff, N.; Fry, K.; Heindell, H.; Paddock, G.; Poon, R.; Wallace, B.; et al. Investigation of the organization of mammalian chromosomes at the DNA sequence level. Fed. Proc. 1976, 35, 23–35. [Google Scholar]
  69. Cacheux, L.; Ponger, L.; Gerbault-Seureau, M.; Loll, F.; Gey, D.; Richard, F.A.; Escude, C. The targeted sequencing of α satellite DNA in Cercopithecus pogonias provides new insight into the diversity and dynamics of centromeric repeats in old world monkeys. Genome Biol. Evol. 2018, 10, 1837–1851. [Google Scholar] [CrossRef] [PubMed]
  70. da Silva, E.L.; Busso, A.F.; Parise-Maltempi, P.P. Characterization and genome organization of a repetitive element associated with the nucleolus organizer region in Leporinus elongatus (Anostomidae: Characiformes). Cytogenet. Genome Res. 2013, 139, 22–28. [Google Scholar] [CrossRef] [PubMed]
  71. Mestrovic, N.; Plohl, M.; Mravinac, B.; Ugarkovic, D. Evolution of satellite DNAs from the genus Palorus--experimental evidence for the “library” hypothesis. Mol. Biol. Evol. 1998, 15, 1062–1068. [Google Scholar] [CrossRef] [PubMed]
  72. Lee, H.R.; Zhang, W.; Langdon, T.; Jin, W.; Yan, H.; Cheng, Z.; Jiang, J. Chromatin immunoprecipitation cloning reveals rapid evolutionary patterns of centromeric DNA in Oryza species. Proc. Natl. Acad. Sci. USA 2005, 102, 11793–11798. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  73. Faravelli, M.; Moralli, D.; Bertoni, L.; Attolini, C.; Chernova, O.; Raimondi, E.; Giulotto, E. Two extended arrays of a satellite DNA sequence at the centromere and at the short-arm telomere of Chinese hamster chromosome 5. Cytogenet. Cell Genet. 1998, 83, 281–286. [Google Scholar] [CrossRef]
  74. Gong, Z.; Wu, Y.; Koblizkova, A.; Torres, G.A.; Wang, K.; Iovene, M.; Neumann, P.; Zhang, W.; Novak, P.; Buell, C.R.; et al. Repeatless and repeat-based centromeres in potato: Implications for centromere evolution. Plant Cell 2012, 24, 3559–3574. [Google Scholar] [CrossRef] [PubMed]
  75. Bulazel, K.; Ferreri, G.C.; Eldridge, M.D.; O’Neill, R.J. Species-specific shifts in centromere sequence composition are coincident with breakpoint reuse in karyotypically divergent lineages. Genome Biol. 2007, 8, R170. [Google Scholar] [CrossRef] [PubMed]
  76. Chaves, R.; Adega, F.; Heslop-Harrison, J.S.; Guedes-Pinto, H.; Wienberg, J. Complex satellite DNA reshuffling in the polymorphic t(1;29) Robertsonian translocation and evolutionarily derived chromosomes in cattle. Chromosome Res. 2003, 11, 641–648. [Google Scholar] [CrossRef] [PubMed]
  77. Chaves, R.; Guedes-Pinto, H.; Heslop-Harrison, J.; Schwarzacher, T. The species and chromosomal distribution of the centromeric α-satellite I sequence from sheep in the tribe Caprini and other Bovidae. Cytogenet. Cell Genet. 2000, 91, 62–66. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  78. Malik, H.S. The centromere-drive hypothesis: A simple basis for centromere complexity. Prog. Mol. Subcell. Biol. 2009, 48, 33–52. [Google Scholar] [CrossRef]
  79. Iwata-Otsubo, A.; Dawicki-McKenna, J.M.; Akera, T.; Falk, S.J.; Chmatal, L.; Yang, K.; Sullivan, B.A.; Schultz, R.M.; Lampson, M.A.; Black, B.E. Expanded satellite repeats amplify a discrete CENP-A nucleosome assembly site on chromosomes that drive in female meiosis. Curr. Biol. 2017, 27, 2365–2373.e8. [Google Scholar] [CrossRef]
  80. Drpic, D.; Almeida, A.C.; Aguiar, P.; Renda, F.; Damas, J.; Lewin, H.A.; Larkin, D.M.; Khodjakov, A.; Maiato, H. Chromosome segregation is biased by kinetochore size. Curr. Biol. 2018, 28, 1344–1356. [Google Scholar] [CrossRef]
  81. Zwick, M.E.; Salstrom, J.L.; Langley, C.H. Genetic variation in rates of nondisjunction: Association of two naturally occurring polymorphisms in the chromokinesin nod with increased rates of nondisjunction in Drosophila melanogaster. Genetics 1999, 152, 1605–1614. [Google Scholar] [PubMed]
  82. Hirsch, C.D.; Wu, Y.; Yan, H.; Jiang, J. Lineage-specific adaptive evolution of the centromeric protein CENH3 in diploid and allotetraploid Oryza species. Mol. Biol. Evol. 2009, 26, 2877–2885. [Google Scholar] [CrossRef] [PubMed]
  83. Schueler, M.G.; Swanson, W.; Thomas, P.J.; Green, E.D. Adaptive evolution of foundation kinetochore proteins in primates. Mol. Biol. Evol. 2010, 27, 1585–1597. [Google Scholar] [CrossRef] [PubMed]
  84. Talbert, P.B.; Masuelli, R.; Tyagi, A.P.; Comai, L.; Henikoff, S. Centromeric localization and adaptive evolution of an Arabidopsis histone H3 variant. Plant Cell 2002, 14, 1053–1066. [Google Scholar] [CrossRef] [PubMed]
  85. Talbert, P.B.; Bryson, T.D.; Henikoff, S. Adaptive evolution of centromere proteins in plants and animals. J. Biol. 2004, 3, 18. [Google Scholar] [CrossRef] [PubMed]
  86. Zedek, F.; Bures, P. Evidence for centromere drive in the holocentric chromosomes of Caenorhabditis. PLoS ONE 2012, 7, e30496. [Google Scholar] [CrossRef] [PubMed]
  87. Chmatal, L.; Gabriel, S.I.; Mitsainas, G.P.; Martinez-Vargas, J.; Ventura, J.; Searle, J.B.; Schultz, R.M.; Lampson, M.A. Centromere strength provides the cell biological basis for meiotic drive and karyotype evolution in mice. Curr. Biol. 2014, 24, 2295–2300. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  88. Pardo-Manuel de Villena, F.; Sapienza, C. Female meiosis drives karyotypic evolution in mammals. Genetics 2001, 159, 1179–1189. [Google Scholar] [PubMed]
  89. Rosin, L.F.; Mellone, B.G. Centromeres drive a hard bargain. Trends Genet. 2017, 33, 101–117. [Google Scholar] [CrossRef] [PubMed]
  90. Scott, K.C.; Sullivan, B.A. Neocentromeres: A place for everything and everything in its place. Trends Genet. 2014, 30, 66–74. [Google Scholar] [CrossRef]
  91. Garsed, D.W.; Marshall, O.J.; Corbin, V.D.; Hsu, A.; Di Stefano, L.; Schroder, J.; Li, J.; Feng, Z.P.; Kim, B.W.; Kowarsky, M.; et al. The architecture and evolution of cancer neochromosomes. Cancer Cell 2014, 26, 653–667. [Google Scholar] [CrossRef] [PubMed]
  92. Schubert, I. What is behind “centromere repositioning”? Chromosoma 2018, 127, 229–234. [Google Scholar] [CrossRef] [PubMed]
  93. Rhoades, M.M.; Vilkomerson, H. On the anaphase movement of chromosomes. Proc. Natl. Acad. Sci. USA 1942, 28, 433–436. [Google Scholar] [CrossRef] [PubMed]
  94. Alonso, A.; Fritz, B.; Hasson, D.; Abrusan, G.; Cheung, F.; Yoda, K.; Radlwimmer, B.; Ladurner, A.G.; Warburton, P.E. Co-localization of CENP-C and CENP-H to discontinuous domains of CENP-A chromatin at human neocentromeres. Genome Biol. 2007, 8, R148. [Google Scholar] [CrossRef] [PubMed]
  95. Alonso, A.; Mahmood, R.; Li, S.; Cheung, F.; Yoda, K.; Warburton, P.E. Genomic microarray analysis reveals distinct locations for the CENP-A binding domains in three human chromosome 13q32 neocentromeres. Hum. Mol. Genet. 2003, 12, 2711–2721. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  96. Voullaire, L.E.; Slater, H.R.; Petrovic, V.; Choo, K.H. A functional marker centromere with no detectable α-satellite, satellite III, or CENP-B protein: Activation of a latent centromere? Am. J. Hum. Genet. 1993, 52, 1153–1163. [Google Scholar]
  97. du Sart, D.; Cancilla, M.R.; Earle, E.; Mao, J.I.; Saffery, R.; Tainton, K.M.; Kalitsis, P.; Martyn, J.; Barry, A.E.; Choo, K.H. A functional neo-centromere formed through activation of a latent human centromere and consisting of non-α-satellite DNA. Nat. Genet. 1997, 16, 144–153. [Google Scholar] [CrossRef] [PubMed]
  98. Amor, D.J.; Bentley, K.; Ryan, J.; Perry, J.; Wong, L.; Slater, H.; Choo, K.H. Human centromere repositioning “in progress”. Proc. Natl. Acad. Sci. USA 2004, 101, 6542–6547. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  99. Barry, A.E.; Howman, E.V.; Cancilla, M.R.; Saffery, R.; Choo, K.H. Sequence analysis of an 80 kb human neocentromere. Hum. Mol. Genet. 1999, 8, 217–227. [Google Scholar] [CrossRef] [Green Version]
  100. Amor, D.J.; Choo, K.H.A. Neocentromeres: Role in human disease, evolution, and centromere study. Am. J. Hum. Genet. 2002, 71, 695–714. [Google Scholar] [CrossRef]
  101. Warburton, P.E. Chromosomal dynamics of human neocentromere formation. Chromosome Res. 2004, 12, 617–626. [Google Scholar] [CrossRef] [PubMed]
  102. Eldridge, M.D.; Close, R.L. Radiation of chromosome shuffles. Curr. Opin. Genet. Dev. 1993, 3, 915–922. [Google Scholar] [CrossRef]
  103. Suja, J.A.; Camacho, J.P.M.; Cabrero, J.; Rufas, J.S. Analysis of a centric shift in the S11 chromosome of Aiolopus strepens (Orthoptera: Acrididae). Genetica 1986, 70, 211–216. [Google Scholar] [CrossRef]
  104. Ventura, M.; Archidiacono, N.; Rocchi, M. Centromere emergence in evolution. Genome Res. 2001, 11, 595–599. [Google Scholar] [CrossRef]
  105. Iannuzzi, L.; Di Meo, G.P.; Perucatti, A.; Incarnato, D.; Schibler, L.; Cribiu, E.P. Comparative FISH mapping of bovid X chromosomes reveals homologies and divergences between the subfamilies bovinae and caprinae. Cytogenet. Cell Genet. 2000, 89, 171–176. [Google Scholar] [CrossRef] [PubMed]
  106. Wang, K.; Wu, Y.; Zhang, W.; Dawe, R.K.; Jiang, J. Maize centromeres expand and adopt a uniform size in the genetic background of oat. Genome Res. 2014, 24, 107–116. [Google Scholar] [CrossRef] [PubMed]
  107. Rothfels, K.H.; Mason, G.F. Achiasmate meiosis and centromere shift in Eusimulium aureum (Diptera-Simuliidae). Chromosoma 1975, 51, 111–124. [Google Scholar] [CrossRef] [PubMed]
  108. Carbone, L.; Nergadze, S.G.; Magnani, E.; Misceo, D.; Francesca Cardone, M.; Roberto, R.; Bertoni, L.; Attolini, C.; Francesca Piras, M.; de Jong, P.; et al. Evolutionary movement of centromeres in horse, donkey, and zebra. Genomics 2006, 87, 777–782. [Google Scholar] [CrossRef] [Green Version]
  109. Klein, S.J.; O’Neill, R.J. Transposable elements: Genome innovation, chromosome diversity, and centromere conflict. Chromosome Res. 2018, 26, 5–23. [Google Scholar] [CrossRef]
  110. Nergadze, S.G.; Piras, F.M.; Gamba, R.; Corbo, M.; Cerutti, F.; McCarter, J.G.W.; Cappelletti, E.; Gozzo, F.; Harman, R.M.; Antczak, D.F.; et al. Birth, evolution, and transmission of satellite-free mammalian centromeric domains. Genome Res. 2018, 28, 789–799. [Google Scholar] [CrossRef]
  111. Piras, F.M.; Nergadze, S.G.; Magnani, E.; Bertoni, L.; Attolini, C.; Khoriauli, L.; Raimondi, E.; Giulotto, E. Uncoupling of satellite DNA and centromeric function in the genus Equus. PLoS Genet. 2010, 6, e1000845. [Google Scholar] [CrossRef] [PubMed]
  112. Birchler, J.A.; Presting, G.G. Retrotransposon insertion targeting: A mechanism for homogenization of centromere sequences on nonhomologous chromosomes. Genes Dev. 2012, 26, 638–640. [Google Scholar] [CrossRef] [PubMed]
  113. Rocchi, M.; Archidiacono, N.; Schempp, W.; Capozzi, O.; Stanyon, R. Centromere repositioning in mammals. Heredity 2012, 108, 59–67. [Google Scholar] [CrossRef] [PubMed]
  114. Rocchi, M.; Stanyon, R.; Archidiacono, N. Evolutionary new centromeres in primates. Prog. Mol. Subcell. Biol. 2009, 48, 103–152. [Google Scholar] [CrossRef] [PubMed]
  115. O’Neill, R.J.; Eldridge, M.D.; Metcalfe, C.J. Centromere dynamics and chromosome evolution in marsupials. J. Hered. 2004, 95, 375–381. [Google Scholar] [CrossRef] [PubMed]
  116. Wade, C.; Giulotto, E.; Sigurdsson, S.; Zoli, M.; Gnerre, S.; Imsland, F.; Lear, T.; Adelson, D.; Bailey, E.; Bellone, R.; et al. Genome sequence, comparative analysis and population genetics of the domestic horse (Equus caballus). Science 2009, 326, 865–867. [Google Scholar] [CrossRef] [PubMed]
  117. Purgato, S.; Belloni, E.; Piras, F.M.; Zoli, M.; Badiale, C.; Cerutti, F.; Mazzagatti, A.; Perini, G.; Della Valle, G.; Nergadze, S.G.; et al. Centromere sliding on a mammalian chromosome. Chromosoma 2015, 124, 277–287. [Google Scholar] [CrossRef] [PubMed]
  118. Johnson, R.N.; O’Meally, D.; Chen, Z.; Etherington, G.J.; Ho, S.Y.W.; Nash, W.J.; Grueber, C.E.; Cheng, Y.; Whittington, C.M.; Dennison, S.; et al. Adaptation and conservation insights from the koala genome. Nat. Genet. 2018, 50, 1102–1111. [Google Scholar] [CrossRef]
  119. Carbone, L.; Harris, R.A.; Gnerre, S.; Veeramah, K.R.; Lorente-Galdos, B.; Huddleston, J.; Meyer, T.J.; Herrero, J.; Roos, C.; Aken, B.; et al. Gibbon genome and the fast karyotype evolution of small apes. Nature 2014, 513, 195–201. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  120. Nagaki, K.; Cheng, Z.; Ouyang, S.; Talbert, P.B.; Kim, M.; Jones, K.M.; Henikoff, S.; Buell, C.R.; Jiang, J. Sequencing of a rice centromere uncovers active genes. Nat. Genet. 2004, 36, 138–145. [Google Scholar] [CrossRef] [Green Version]
  121. Renfree, M.B.; Papenfuss, A.T.; Deakin, J.E.; Lindsay, J.; Heider, T.; Belov, K.; Rens, W.; Waters, P.D.; Pharo, E.A.; Shaw, G.; et al. Genome sequence of an Australian kangaroo, Macropus eugenii, provides insight into the evolution of mammalian reproduction and development. Genome Biol. 2011, 12, R81. [Google Scholar] [CrossRef] [PubMed]
  122. Carone, D.; Longo, M.; Ferreri, G.; Hall, L.; Harris, M.; Shook, N.; Bulazel, K.; Carone, B.; Obergfell, C.; O’Neill, M.; et al. A new class of retroviral and satellite encoded small RNAs emanates from mammalian centromeres. Chromosoma 2009, 118, 113–125. [Google Scholar] [CrossRef] [PubMed]
  123. Sanyal, K.; Baum, M.; Carbon, J. Centromeric DNA sequences in the pathogenic yeast Candida albicans are all different and unique. Proc. Natl. Acad. Sci. USA 2004, 101, 11374–11379. [Google Scholar] [CrossRef] [PubMed]
  124. Mishra, P.K.; Baum, M.; Carbon, J. Centromere size and position in Candida albicans are evolutionarily conserved independent of DNA sequence heterogeneity. Mol. Genet. Genom. 2007, 278, 455–465. [Google Scholar] [CrossRef] [PubMed]
  125. Harrington, J.J.; Van Bokkelen, G.; Mays, R.W.; Gustashaw, K.; Willard, H.F. Formation of de novo centromeres and construction of first-generation human artificial microchromosomes. Nat. Genet. 1997, 15, 345–355. [Google Scholar] [CrossRef] [PubMed]
  126. Ebersole, T.A.; Ross, A.; Clark, E.; McGill, N.; Schindelhauer, D.; Cooke, H.; Grimes, B. Mammalian artificial chromosome formation from circular alphoid input DNA does not require telomere repeats. Hum. Mol. Genet. 2000, 9, 1623–1631. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  127. Doolittle, W.F.; Sapienza, C. Selfish genes, the phenotype paradigm and genome evolution. Nature 1980, 284, 601–603. [Google Scholar] [CrossRef]
  128. Orgel, L.E.; Crick, F.H.C. Selfish DNA: the ultimate parasite. Nature 1980, 284, 604–607. [Google Scholar] [CrossRef]
  129. Ravindran, S. Barbara McClintock and the discovery of jumping genes. Proc. Natl. Acad. Sci. USA 2012. [Google Scholar] [CrossRef]
  130. Wessler, S.R. Transposable elements and the evolution of eukaryotic genomes. Proc. Natl. Acad. Sci. USA 2006, 103, 17600–17601. [Google Scholar] [CrossRef] [Green Version]
  131. Craig, N.L.; Craigie, R.; Gellert, M.; Lambowitz, A.M. Mobile DNA II; American Society for Microbiology Press: Washington, DC, USA, 2002. [Google Scholar]
  132. Howe, K.; Clark, M.D.; Torroja, C.F.; Torrance, J.; Berthelot, C.; Muffato, M.; Collins, J.E.; Humphray, S.; McLaren, K.; Matthews, L.; et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature 2013, 496, 498–503. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  133. Sotero-Caio, C.G.; Platt, R.N.; Suh, A.; Ray, D.A. Evolution and diversity of transposable elements in vertebrate genomes. Genome Biol. Evol. 2017, 9, 161–177. [Google Scholar] [CrossRef] [PubMed]
  134. Schnable, P.S.; Ware, D.; Fulton, R.S.; Stein, J.C.; Wei, F.; Pasternak, S.; Liang, C.; Zhang, J.; Fulton, L.; Graves, T.A.; et al. The B73 maize genome: Complexity, diversity, and dynamics. Science 2009, 326, 1112–1115. [Google Scholar] [CrossRef] [PubMed]
  135. Mills, R.E.; Bennett, E.A.; Iskow, R.C.; Devine, S.E. Which transposable elements are active in the human genome? Trends Genet. 2007, 23, 183–191. [Google Scholar] [CrossRef] [PubMed]
  136. Sundaram, V.; Cheng, Y.; Ma, Z.; Li, D.; Xing, X.; Edge, P.; Snyder, M.P.; Wang, T. Widespread contribution of transposable elements to the innovation of gene regulatory networks. Genome Res. 2014, 24, 1963–1976. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  137. Wittkopp, P.J.; Kalay, G. Cis-regulatory elements: Molecular mechanisms and evolutionary processes underlying divergence. Nat. Rev. Genet. 2011, 13, 59–69. [Google Scholar] [CrossRef]
  138. Jacques, P.E.; Jeyakani, J.; Bourque, G. The majority of primate-specific regulatory sequences are derived from transposable elements. PLoS Genet. 2013, 9, e1003504. [Google Scholar] [CrossRef]
  139. Bourque, G.; Leong, B.; Vega, V.B.; Chen, X.; Lee, Y.L.; Srinivasan, K.G.; Chew, J.L.; Ruan, Y.; Wei, C.L.; Ng, H.H.; et al. Evolution of the mammalian transcription factor binding repertoire via transposable elements. Genome Res. 2008, 18, 1752–1762. [Google Scholar] [CrossRef] [Green Version]
  140. Wang, T.; Zeng, J.; Lowe, C.B.; Sellers, R.G.; Salama, S.R.; Yang, M.; Burgess, S.M.; Brachmann, R.K.; Haussler, D. Species-specific endogenous retroviruses shape the transcriptional network of the human tumor suppressor protein p53. Proc. Natl. Acad. Sci. USA 2007, 104, 18613–18618. [Google Scholar] [CrossRef] [Green Version]
  141. Roman, A.C.; Benitez, D.A.; Carvajal-Gonzalez, J.M.; Fernandez-Salguero, P.M. Genome-wide B1 retrotransposon binds the transcription factors dioxin receptor and Slug and regulates gene expression in vivo. Proc. Natl. Acad. Sci. USA 2008, 105, 1632–1637. [Google Scholar] [CrossRef] [Green Version]
  142. Cao, Y.; Chen, G.; Wu, G.; Zhang, X.; McDermott, J.; Chen, X.; Xu, C.; Jiang, Q.; Chen, Z.; Zeng, Y.; et al. Widespread roles of enhancer-like transposable elements in cell identity and long-range genomic interactions. Genome Res. 2018. [Google Scholar] [CrossRef]
  143. Makarevitch, I.; Waters, A.J.; West, P.T.; Stitzer, M.; Hirsch, C.N.; Ross-Ibarra, J.; Springer, N.M. Transposable elements contribute to activation of maize genes in response to abiotic stress. PLoS Genet. 2015, 11, e1004915. [Google Scholar] [CrossRef]
  144. Feschotte, C. Transposable elements and the evolution of regulatory networks. Nat. Rev. Genet. 2008, 9, 397–405. [Google Scholar] [CrossRef] [PubMed]
  145. Chuong, E.B.; Elde, N.C.; Feschotte, C. Regulatory activities of transposable elements: From conflicts to benefits. Nat. Rev. Genet. 2017, 18, 71–86. [Google Scholar] [CrossRef] [PubMed]
  146. Slotkin, R.K.; Martienssen, R. Transposable elements and the epigenetic regulation of the genome. Nat. Rev. Genet. 2007, 8, 272–285. [Google Scholar] [CrossRef] [PubMed]
  147. Ariumi, Y. Guardian of the human genome: host defense mechanisms against LINE-1 retrotransposition. Front. Chem. 2016, 4, 28. [Google Scholar] [CrossRef] [PubMed]
  148. Schueler, M.G.; Higgins, A.W.; Rudd, M.K.; Gustashaw, K.; Willard, H.F. Genomic and genetic definition of a functional human centromere. Science 2001, 294, 109–115. [Google Scholar] [CrossRef] [PubMed]
  149. Kazakov, A.E.; Shepelev, V.A.; Tumeneva, I.G.; Alexandrov, A.A.; Yurov, Y.B.; Alexandrov, I.A. Interspersed repeats are found predominantly in the “old” α satellite families. Genomics 2003, 82, 619–627. [Google Scholar] [CrossRef]
  150. Rosenbloom, K.R.; Armstrong, J.; Barber, G.P.; Casper, J.; Clawson, H.; Diekhans, M.; Dreszer, T.R.; Fujita, P.A.; Guruvadoo, L.; Haeussler, M.; et al. The UCSC Genome Browser database: 2015 update. Nucleic Acids Res. 2015, 43, D670–D681. [Google Scholar] [CrossRef]
  151. Burrack, L.S.; Berman, J. Neocentromeres and epigenetically inherited features of centromeres. Chromosome Res. 2012, 20, 607–619. [Google Scholar] [CrossRef] [Green Version]
  152. Chueh, A.C.; Northrop, E.L.; Brettingham-Moore, K.H.; Choo, K.H.; Wong, L.H. LINE retrotransposon RNA is an essential structural and functional epigenetic component of a core neocentromeric chromatin. PLoS Genet. 2009, 5, e1000354. [Google Scholar] [CrossRef]
  153. Carbone, L.; Harris, R.A.; Mootnick, A.R.; Milosavljevic, A.; Martin, D.I.; Rocchi, M.; Capozzi, O.; Archidiacono, N.; Konkel, M.K.; Walker, J.A.; et al. Centromere remodeling in Hoolock leuconedys (Hylobatidae) by a new transposable element unique to the gibbons. Genome Biol. Evol. 2012, 4, 648–658. [Google Scholar] [CrossRef] [PubMed]
  154. Tolomeo, D.; Capozzi, O.; Stanyon, R.R.; Archidiacono, N.; D’Addabbo, P.; Catacchio, C.R.; Purgato, S.; Perini, G.; Schempp, W.; Huddleston, J.; et al. Epigenetic origin of evolutionary novel centromeres. Sci. Rep. 2017, 7, 41980. [Google Scholar] [CrossRef] [Green Version]
  155. Kato, M.; Takashima, K.; Kakutani, T. Epigenetic control of CACTA transposon mobility in Arabidopsis thaliana. Genetics 2004, 168, 961–969. [Google Scholar] [CrossRef] [PubMed]
  156. Bourc’his, D.; Bestor, T.H. Meiotic catastrophe and retrotransposon reactivation in male germ cells lacking Dnmt3L. Nature 2004, 431, 96–99. [Google Scholar] [CrossRef] [PubMed]
  157. Metcalfe, C.J.; Bulazel, K.V.; Ferreri, G.C.; Schroeder-Reiter, E.; Wanner, G.; Rens, W.; Obergfell, C.; Eldridge, M.D.; O’Neill, R.J. Genomic instability within centromeres of interspecific marsupial hybrids. Genetics 2007, 177, 2507–2517. [Google Scholar] [CrossRef]
  158. O’Neill, R.J.; O’Neill, M.J.; Graves, J.A. Undermethylation associated with retroelement activation and chromosome remodelling in an interspecific mammalian hybrid. Nature 1998, 393, 68–72. [Google Scholar] [CrossRef] [PubMed]
  159. May, B.P.; Lippman, Z.B.; Fang, Y.; Spector, D.L.; Martienssen, R.A. Differential regulation of strand-specific transcripts from Arabidopsis centromeric satellite repeats. PLoS Genet. 2005, 1, e79. [Google Scholar] [CrossRef] [PubMed]
  160. Ferreri, G.C.; Brown, J.D.; Obergfell, C.; Jue, N.; Finn, C.E.; O’Neill, M.J.; O’Neill, R.J. Recent amplification of the kangaroo endogenous retrovirus, KERV, limited to the centromere. J. Virol. 2011, 85, 4761–4771. [Google Scholar] [CrossRef] [PubMed]
  161. Contreras-Galindo, R.; Kaplan, M.H.; He, S.; Contreras-Galindo, A.C.; Gonzalez-Hernandez, M.J.; Kappes, F.; Dube, D.; Chan, S.M.; Robinson, D.; Meng, F.; et al. HIV infection reveals widespread expansion of novel centromeric human endogenous retroviruses. Genome Res. 2013, 23, 1505–1513. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  162. Cheng, Z.J.; Murata, M. A centromeric tandem repeat family originating from a part of Ty3/gypsy-retroelement in wheat and its relatives. Genetics 2003, 164, 665–672. [Google Scholar] [PubMed]
  163. Kapitonov, V.V.; Jurka, J. Molecular paleontology of transposable elements from Arabidopsis thaliana. Genetica 1999, 107, 27–37. [Google Scholar] [CrossRef] [PubMed]
  164. Heikkinen, E.; Launonen, V.; Muller, E.; Bachmann, L. The pvB370 BamHI satellite DNA family of the Drosophila virilis group and its evolutionary relation to mobile dispersed genetic pDv elements. J. Mol. Evol. 1995, 41, 604–614. [Google Scholar] [CrossRef] [PubMed]
  165. Kapitonov, V.V.; Holmquist, G.P.; Jurka, J. L1 repeat is a basic unit of heterochromatin satellites in cetaceans. Mol. Biol. Evol. 1998, 15, 611–612. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  166. McGurk, M.P.; Barbash, D.A. Double insertion of transposable elements provides a substrate for the evolution of satellite DNA. Genome Res. 2018, 28, 714–725. [Google Scholar] [CrossRef] [PubMed]
  167. Dias, G.B.; Svartman, M.; Delprat, A.; Ruiz, A.; Kuhn, G.C. Tetris is a foldback transposon that provided the building blocks for an emerging satellite DNA of Drosophila virilis. Genome Biol. Evol. 2014, 6, 1302–1313. [Google Scholar] [CrossRef] [PubMed]
  168. Mestrovic, N.; Mravinac, B.; Pavlek, M.; Vojvoda-Zeljko, T.; Satovic, E.; Plohl, M. Structural and functional liaisons between transposable elements and satellite DNAs. Chromosome Res. 2015, 23, 583–596. [Google Scholar] [CrossRef] [PubMed]
  169. Ahmed, M.; Liang, P. Transposable elements are a significant contributor to tandem repeats in the human genome. Comp. Funct. Genom. 2012, 2012, 947089. [Google Scholar] [CrossRef]
  170. Satovic, E.; Vojvoda Zeljko, T.; Luchetti, A.; Mantovani, B.; Plohl, M. Adjacent sequences disclose potential for intra-genomic dispersal of satellite DNA repeats and suggest a complex network with transposable elements. BMC Genom. 2016, 17, 997. [Google Scholar] [CrossRef]
  171. McLaughlin, R.N., Jr.; Malik, H.S. Genetic conflicts: The usual suspects and beyond. J. Exp. Biol. 2017, 220, 6–17. [Google Scholar] [CrossRef]
  172. Yoder, J.A.; Walsh, C.P.; Bestor, T.H. Cytosine methylation and the ecology of intragenomic parasites. Trends Genet. 1997, 13, 335–340. [Google Scholar] [CrossRef]
  173. Fedoroff, N.V. Presidential address. Transposable elements, epigenetics, and genome evolution. Science 2012, 338, 758–767. [Google Scholar] [CrossRef]
  174. Symer, D.E.; Connelly, C.; Szak, S.T.; Caputo, E.M.; Cost, G.J.; Parmigiani, G.; Boeke, J.D. Human l1 retrotransposition is associated with genetic instability in vivo. Cell 2002, 110, 327–338. [Google Scholar] [CrossRef]
  175. Gilbert, N.; Lutz, S.; Morrish, T.A.; Moran, J.V. Multiple fates of L1 retrotransposition intermediates in cultured human cells. Mol. Cell. Biol. 2005, 25, 7780–7795. [Google Scholar] [CrossRef]
  176. Kazazian, H.H., Jr.; Moran, J.V. Mobile DNA in health and disease. N. Engl. J. Med. 2017, 377, 361–370. [Google Scholar] [CrossRef]
  177. Beck, C.R.; Garcia-Perez, J.L.; Badge, R.M.; Moran, J.V. LINE-1 elements in structural variation and disease. Annu. Rev. Genom. Hum. Genet. 2011, 12, 187–215. [Google Scholar] [CrossRef]
  178. Divashuk, M.G.; Khuat, T.M.; Kroupin, P.Y.; Kirov, I.V.; Romanov, D.V.; Kiseleva, A.V.; Khrustaleva, L.I.; Alexeev, D.G.; Zelenin, A.S.; Klimushina, M.V.; et al. Variation in copy number of Ty3/Gypsy centromeric retrotransposons in the genomes of Thinopyrum intermedium and its diploid progenitors. PLoS ONE 2016, 11, e0154241. [Google Scholar] [CrossRef]
  179. Han, J.; Masonbrink, R.E.; Shan, W.; Song, F.; Zhang, J.; Yu, W.; Wang, K.; Wu, Y.; Tang, H.; Wendel, J.F.; et al. Rapid proliferation and nucleolar organizer targeting centromeric retrotransposons in cotton. Plant J. 2016, 88, 992–1005. [Google Scholar] [CrossRef]
  180. Yang, X.; Zhao, H.; Zhang, T.; Zeng, Z.; Zhang, P.; Zhu, B.; Han, Y.; Braz, G.T.; Casler, M.D.; Schmutz, J.; et al. Amplification and adaptation of centromeric repeats in polyploid switchgrass species. New Phytol. 2018, 218, 1645–1657. [Google Scholar] [CrossRef]
  181. Pardue, M.L.; Gall, J.G. Chromosomal localization of mouse satellite DNA. Science 1970, 168, 1356–1358. [Google Scholar] [CrossRef]
  182. Jones, K.W. Chromosomal and nuclear location of mouse satellite DNA in individual cells. Nature 1970, 225, 912–915. [Google Scholar] [CrossRef] [PubMed]
  183. Yunis, J.J.; Yasmineh, W.G. Heterochromatin, satellite DNA, and cell function. Structural DNA of eucaryotes may support and protect genes and aid in speciation. Science 1971, 174, 1200–1209. [Google Scholar] [CrossRef] [PubMed]
  184. Britten, R.J.; Kohne, D.E. Repeated sequences in DNA. Hundreds of thousands of copies of DNA sequences have been incorporated into the genomes of higher organisms. Science 1968, 161, 529–540. [Google Scholar] [CrossRef]
  185. Hall, L.E.; Mitchell, S.E.; O’Neill, R.J. Pericentric and centromeric transcription: A perfect balance required. Chromosome Res. 2012, 20, 535–546. [Google Scholar] [CrossRef] [PubMed]
  186. Talbert, P.B.; Henikoff, S. Transcribing centromeres: noncoding RNAs and kinetochore assembly. Trends Genet. 2018, 34, 587–599. [Google Scholar] [CrossRef] [PubMed]
  187. Braselton, J.P. Ribonucleoprotein staining of Allium cepa kinetochores. Cytobiologie 1975, 12, 148–151. [Google Scholar]
  188. Rieder, C.L. Ribonucleoprotein staining of centrioles and kinetochores in newt lung cell spindles. J. Cell Biol. 1979, 80, 1–9. [Google Scholar] [CrossRef] [Green Version]
  189. Blower, M.D.; Sullivan, B.A.; Karpen, G.H. Conserved organization of centromeric chromatin in flies and humans. Dev. Cell 2002, 2, 319–330. [Google Scholar] [CrossRef]
  190. Sullivan, B.A.; Karpen, G.H. Centromeric chromatin exhibits a histone modification pattern that is distinct from both euchromatin and heterochromatin. Nat. Struct. Mol. Biol. 2004, 11, 1076–1083. [Google Scholar] [CrossRef] [Green Version]
  191. Bergmann, J.H.; Rodriguez, M.G.; Martins, N.M.; Kimura, H.; Kelly, D.A.; Masumoto, H.; Larionov, V.; Jansen, L.E.; Earnshaw, W.C. Epigenetic engineering shows H3K4me2 is required for HJURP targeting and CENP-A assembly on a synthetic human kinetochore. EMBO J. 2011, 30, 328–340. [Google Scholar] [CrossRef] [PubMed]
  192. Choi, E.S.; Stralfors, A.; Castillo, A.G.; Durand-Dubief, M.; Ekwall, K.; Allshire, R.C. Identification of noncoding transcripts from within CENP-A chromatin at fission yeast centromeres. J. Biol. Chem. 2011, 286, 23600–23607. [Google Scholar] [CrossRef] [PubMed]
  193. Chan, F.L.; Marshall, O.J.; Saffery, R.; Kim, B.W.; Earle, E.; Choo, K.H.; Wong, L.H. Active transcription and essential role of RNA polymerase II at the centromere during mitosis. Proc. Natl. Acad. Sci. USA 2012, 109, 1979–1984. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  194. Rosic, S.; Kohler, F.; Erhardt, S. Repetitive centromeric satellite RNA is essential for kinetochore formation and cell division. J. Cell Biol. 2014, 207, 335–349. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  195. Chen, C.C.; Bowers, S.; Lipinszki, Z.; Palladino, J.; Trusiak, S.; Bettini, E.; Rosin, L.; Przewloka, M.R.; Glover, D.M.; O’Neill, R.J.; et al. Establishment of centromeric chromatin by the CENP-A assembly factor CAL1 requires FACT-mediated transcription. Dev. Cell 2015, 34, 73–84. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  196. Ohkuni, K.; Kitagawa, K. Endogenous transcription at the centromere facilitates centromere activity in budding yeast. Curr. Biol. 2011, 21, 1695–1703. [Google Scholar] [CrossRef]
  197. Chan, F.L.; Wong, L.H. Transcription in the maintenance of centromere chromatin identity. Nucleic Acids Res. 2012, 40, 11178–11188. [Google Scholar] [CrossRef]
  198. McNulty, S.M.; Sullivan, L.L.; Sullivan, B.A. Human centromeres produce chromosome-specific and array-specific α satellite transcripts that are complexed with CENP-A and CENP-C. Dev. Cell 2017, 42, 226–240. [Google Scholar] [CrossRef]
  199. Topp, C.N.; Zhong, C.X.; Dawe, R.K. Centromere-encoded RNAs are integral components of the maize kinetochore. Proc. Natl. Acad. Sci. USA 2004, 101, 15986–15991. [Google Scholar] [CrossRef] [Green Version]
  200. Carone, D.M.; Zhang, C.; Hall, L.E.; Obergfell, C.; Carone, B.R.; O’Neill, M.J.; O’Neill, R.J. Hypermorphic expression of centromeric retroelement-encoded small RNAs impairs CENP-A loading. Chromosome Res. 2013, 21, 49–62. [Google Scholar] [CrossRef]
  201. Saffery, R.; Sumer, H.; Hassan, S.; Wong, L.H.; Craig, J.M.; Todokoro, K.; Anderson, M.; Stafford, A.; Choo, K.H. Transcription within a functional human centromere. Mol. Cell 2003, 12, 509–516. [Google Scholar] [CrossRef]
  202. Wong, L.H.; Brettingham-Moore, K.H.; Chan, L.; Quach, J.M.; Anderson, M.A.; Northrop, E.L.; Hannan, R.; Saffery, R.; Shaw, M.L.; Williams, E.; et al. Centromere RNA is a key component for the assembly of nucleoproteins at the nucleolus and centromere. Genome Res. 2007, 17, 1146–1160. [Google Scholar] [CrossRef] [Green Version]
  203. Ugarkovic, D. Functional elements residing within satellite DNAs. EMBO Rep. 2005, 6, 1035–1039. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  204. Neumann, P.; Yan, H.; Jiang, J. The centromeric retrotransposons of rice are transcribed and differentially processed by RNA interference. Genetics 2007, 176, 749–761. [Google Scholar] [CrossRef] [PubMed]
  205. Scott, K.C.; White, C.V.; Willard, H.F. An RNA polymerase III-dependent heterochromatin barrier at fission yeast centromere 1. PLoS ONE 2007, 2, e1099. [Google Scholar] [CrossRef]
  206. Cardinale, S.; Bergmann, J.H.; Kelly, D.; Nakano, M.; Valdivia, M.M.; Kimura, H.; Masumoto, H.; Larionov, V.; Earnshaw, W.C. Hierarchical inactivation of a synthetic human kinetochore by a chromatin modifier. Mol. Biol. Cell 2009, 20, 4194–4204. [Google Scholar] [CrossRef] [PubMed]
  207. Nakano, M.; Cardinale, S.; Noskov, V.N.; Gassmann, R.; Vagnarelli, P.; Kandels-Lewis, S.; Larionov, V.; Earnshaw, W.C.; Masumoto, H. Inactivation of a human kinetochore by specific targeting of chromatin modifiers. Dev. Cell 2008, 14, 507–522. [Google Scholar] [CrossRef] [PubMed]
  208. Bergmann, J.H.; Jakubsche, J.N.; Martins, N.M.; Kagansky, A.; Nakano, M.; Kimura, H.; Kelly, D.A.; Turner, B.M.; Masumoto, H.; Larionov, V.; et al. Epigenetic engineering: Histone H3K9 acetylation is compatible with kinetochore structure and function. J. Cell Sci. 2012, 125, 411–421. [Google Scholar] [CrossRef] [PubMed]
  209. Ting, D.T.; Lipson, D.; Paul, S.; Brannigan, B.W.; Akhavanfard, S.; Coffman, E.J.; Contino, G.; Deshpande, V.; Iafrate, A.J.; Letovsky, S.; et al. Aberrant overexpression of satellite repeats in pancreatic and other epithelial cancers. Science 2011, 331, 593–596. [Google Scholar] [CrossRef]
  210. Hill, A.; Bloom, K. Genetic manipulation of centromere function. Mol. Cell. Biol. 1987, 7, 2397–2405. [Google Scholar] [CrossRef]
  211. Bouzinba-Segard, H.; Guais, A.; Francastel, C. Accumulation of small murine minor satellite transcripts leads to impaired centromeric architecture and function. Proc. Natl. Acad. Sci. USA 2006, 103, 8709–8714. [Google Scholar] [CrossRef] [Green Version]
  212. Ferri, F.; Bouzinba-Segard, H.; Velasco, G.; Hube, F.; Francastel, C. Non-coding murine centromeric transcripts associate with and potentiate Aurora B kinase. Nucleic Acids Res. 2009, 37, 5071–5080. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  213. Chan, D.Y.L.; Moralli, D.; Khoja, S.; Monaco, Z.L. Noncoding centromeric RNA expression impairs chromosome stability in human and murine stem cells. Dis. Mark. 2017, 2017, 7506976. [Google Scholar] [CrossRef] [PubMed]
  214. Du, Y.; Topp, C.N.; Dawe, R.K. DNA binding of centromere protein C (CENPC) is stabilized by single-stranded RNA. PLoS Genet. 2010, 6, e1000835. [Google Scholar] [CrossRef]
  215. Sandmann, M.; Talbert, P.; Demidov, D.; Kuhlmann, M.; Rutten, T.; Conrad, U.; Lermontova, I. Targeting of Arabidopsis KNL2 to centromeres depends on the conserved CENPC-k motif in its C terminus. Plant Cell 2017, 29, 144–155. [Google Scholar] [CrossRef] [PubMed]
  216. Dawe, R.K. RNA interference, transposons, and the centromere. Plant Cell 2003, 15, 297–301. [Google Scholar] [CrossRef] [PubMed]
  217. Wong, L.H.; Choo, K.H. Evolutionary dynamics of transposable elements at the centromere. Trends Genet. 2004, 20, 611–616. [Google Scholar] [CrossRef] [PubMed]
  218. O’Neill, R.J.; Carone, D.M. The role of ncRNA in centromeres: A lesson from marsupials. Prog. Mol. Subcell. Biol. 2009, 48, 77–101. [Google Scholar] [CrossRef] [PubMed]
  219. Kasinathan, S.; Henikoff, S. Non-B-form DNA is enriched at centromeres. Mol. Biol. Evol. 2018, 35, 949–962. [Google Scholar] [CrossRef]
  220. Bobkov, G.O.M.; Gilbert, N.; Heun, P. Centromere transcription allows CENP-A to transit from chromatin association to stable incorporation. J. Cell Biol. 2018, 217, 1957–1972. [Google Scholar] [CrossRef] [Green Version]
  221. Quenet, D.; Dalal, Y. A long non-coding RNA is required for targeting centromeric protein A to the human centromere. Elife 2014, 3, e03254, Erratum in 2018, 7, e41593. [Google Scholar] [CrossRef]
  222. Chueh, A.C.; Wong, L.H.; Wong, N.; Choo, K.H. Variable and hierarchical size distribution of L1-retroelement-enriched CENP-A clusters within a functional human neocentromere. Hum. Mol. Genet. 2005, 14, 85–93. [Google Scholar] [CrossRef] [PubMed]
  223. McClintock, B. The behaviour of successive nuclear divisions of a chromosome broken at meiosis. Proc. Natl. Acad. Sci. USA 1939, 25, 405–416. [Google Scholar] [CrossRef] [PubMed]
  224. McClintock, B. The stability of broken ends of chromosomes in Zea Mays. Genetics 1941, 26, 234–282. [Google Scholar] [PubMed]
  225. McClintock, B. The production of homozygous deficient tissues with mutant characteristics by means of the aberrant mitotic behavior of ring-shaped chromosomes. Genetics 1938, 23, 315–376. [Google Scholar] [PubMed]
Figure 1. Overview of satellite DNA structure in a human centromere/pericentromere. (a) α satellite monomers (colored solid arrows) are organized into a repeating unit, called a higher order repeat (HOR) (red dashed arrows). In this example, 10 monomers are in each HOR (10-mers). HOR units are repeated in a chromosome-specific manner 100–1000 s of times within a functional centromere core. Within a single HOR, monomers share anywhere from 50–80% sequence identity with one another. The same monomer within different HORs in the same array may share up to 99% identity. Solo monomers (solid arrows) are found in the pericentromeric region and are highly variable in terms of sequence and orientation. Within the centromere, transposable elements (TE) insertions typically include recently active or active (hot) elements, while the TE insertions found in the pericentromere are older, inactive elements. (b) The core centromere structure (red dot, chromosome schematic) of human chromosomes (a generic chromosome ideogram is indicated, middle) consists of different α satellite arrays arranged in HORs (dashed arrows). Each HOR array may contain a different monomer number; in this example, the functional centromere (i.e., assembles CENP-A nucleosomes) at a 10-mer HOR (red dashed arrows). A 7-mer HOR is found nearby but is an inactive epiallele. Both HORs are separated by non-centromeric DNA, which may contain genes. α satellites are also found throughout the pericentromere (bottom schematic, different colored blocks). (c) Representative cladogram of the phylogenetic relationship of the non-HOR α monomers shown in (b). In this example, strata of newer satellites are closer to the HOR arrays, while older satellites are found more distally. Relative age of satellites is indicated by tree branch length; shorter branches are younger elements and deeper branches are older.
Figure 1. Overview of satellite DNA structure in a human centromere/pericentromere. (a) α satellite monomers (colored solid arrows) are organized into a repeating unit, called a higher order repeat (HOR) (red dashed arrows). In this example, 10 monomers are in each HOR (10-mers). HOR units are repeated in a chromosome-specific manner 100–1000 s of times within a functional centromere core. Within a single HOR, monomers share anywhere from 50–80% sequence identity with one another. The same monomer within different HORs in the same array may share up to 99% identity. Solo monomers (solid arrows) are found in the pericentromeric region and are highly variable in terms of sequence and orientation. Within the centromere, transposable elements (TE) insertions typically include recently active or active (hot) elements, while the TE insertions found in the pericentromere are older, inactive elements. (b) The core centromere structure (red dot, chromosome schematic) of human chromosomes (a generic chromosome ideogram is indicated, middle) consists of different α satellite arrays arranged in HORs (dashed arrows). Each HOR array may contain a different monomer number; in this example, the functional centromere (i.e., assembles CENP-A nucleosomes) at a 10-mer HOR (red dashed arrows). A 7-mer HOR is found nearby but is an inactive epiallele. Both HORs are separated by non-centromeric DNA, which may contain genes. α satellites are also found throughout the pericentromere (bottom schematic, different colored blocks). (c) Representative cladogram of the phylogenetic relationship of the non-HOR α monomers shown in (b). In this example, strata of newer satellites are closer to the HOR arrays, while older satellites are found more distally. Relative age of satellites is indicated by tree branch length; shorter branches are younger elements and deeper branches are older.
Genes 10 00223 g001
Figure 2. Chromosome 17 epialleles. (a) Ideogram of chromosome 17 (top). Zoom inset of epialleles showing monomer number for HORs and orientation. D17Z1B HORs carry 14 monomers, as do D17Z1C HORs. D17Z1 HORs are variable in the human population, with wild type epialleles containing 16-mer, 15-mer, and 14-mer HORs (pink) and variant epialleles containing wild type HORs in addition to 13-mer and 12-mer HORs (green). (b) Variation of the D17Z1 epiallele is linked to centromere activity. When the variation in D17Z1 increases, CENP-A nucleosomes (red) decrease; when variation exceeds 80%, the centromere assembles on the D17Z1B epiallele.
Figure 2. Chromosome 17 epialleles. (a) Ideogram of chromosome 17 (top). Zoom inset of epialleles showing monomer number for HORs and orientation. D17Z1B HORs carry 14 monomers, as do D17Z1C HORs. D17Z1 HORs are variable in the human population, with wild type epialleles containing 16-mer, 15-mer, and 14-mer HORs (pink) and variant epialleles containing wild type HORs in addition to 13-mer and 12-mer HORs (green). (b) Variation of the D17Z1 epiallele is linked to centromere activity. When the variation in D17Z1 increases, CENP-A nucleosomes (red) decrease; when variation exceeds 80%, the centromere assembles on the D17Z1B epiallele.
Genes 10 00223 g002
Figure 3. Schematic of the evolutionary mechanisms that impact centromere repeats. (a) Two models for the derivation of species-specific satellites are shown: (left) A satellite array evolves from a library of satellites, culminating in a dominant satellite; (right) TE insertion(s) followed by mutations, such as deletions, lead to the evolution of new satellites. In both cases, a homogenized array evolves through molecular drive mechanisms, such as intra-array concerted evolution. Stabilization of the arrays into HOR arrays defines the active centromere core, where CENP-A nucleosomes (red) are assembled. Other events, such as inter-array conversion, can lead to the spread of new HORs or changes in HOR copy number (bottom). (b) Two homologous chromosomes share the same satellite repeat (red), but one homolog experiences an expansion of that repeat through de novo mutations. During female meiosis, the larger centromere attracts more microtubules, resulting in the loss of the homolog with the weaker centromere into the polar body during meiosis I. The larger centromere is preferentially driven to the viable egg following unequal distribution of chromatids during meiosis II.
Figure 3. Schematic of the evolutionary mechanisms that impact centromere repeats. (a) Two models for the derivation of species-specific satellites are shown: (left) A satellite array evolves from a library of satellites, culminating in a dominant satellite; (right) TE insertion(s) followed by mutations, such as deletions, lead to the evolution of new satellites. In both cases, a homogenized array evolves through molecular drive mechanisms, such as intra-array concerted evolution. Stabilization of the arrays into HOR arrays defines the active centromere core, where CENP-A nucleosomes (red) are assembled. Other events, such as inter-array conversion, can lead to the spread of new HORs or changes in HOR copy number (bottom). (b) Two homologous chromosomes share the same satellite repeat (red), but one homolog experiences an expansion of that repeat through de novo mutations. During female meiosis, the larger centromere attracts more microtubules, resulting in the loss of the homolog with the weaker centromere into the polar body during meiosis I. The larger centromere is preferentially driven to the viable egg following unequal distribution of chromatids during meiosis II.
Genes 10 00223 g003
Figure 4. The hypothetical evolution of new centromeres. The ancestral chromosome in this example is submetacentric (the centromere is indicated with red ‘nucleosomes’). The active locus (black dot) carries satellite arrays. Some individual(s) in a population experience the destabilization of the active centromere and formation of a neocentromere, perhaps through the activation of a new TE, resulting in a centric shift (CS). The new centromere is indicated with a black dot, while the latent centromere is indicated with an open circle. The new centromere becomes fixed in a population and eventually gains new satellite arrays (orange), either by interchromosomal seeding from the old centromere (grey) or from the TE itself. Over time, the latent centromere loses its HORs while the new centromere becomes stabilized. In some cases, the ENC can lead to a new species karyotype.
Figure 4. The hypothetical evolution of new centromeres. The ancestral chromosome in this example is submetacentric (the centromere is indicated with red ‘nucleosomes’). The active locus (black dot) carries satellite arrays. Some individual(s) in a population experience the destabilization of the active centromere and formation of a neocentromere, perhaps through the activation of a new TE, resulting in a centric shift (CS). The new centromere is indicated with a black dot, while the latent centromere is indicated with an open circle. The new centromere becomes fixed in a population and eventually gains new satellite arrays (orange), either by interchromosomal seeding from the old centromere (grey) or from the TE itself. Over time, the latent centromere loses its HORs while the new centromere becomes stabilized. In some cases, the ENC can lead to a new species karyotype.
Genes 10 00223 g004

Share and Cite

MDPI and ACS Style

Hartley, G.; O’Neill, R.J. Centromere Repeats: Hidden Gems of the Genome. Genes 2019, 10, 223. https://doi.org/10.3390/genes10030223

AMA Style

Hartley G, O’Neill RJ. Centromere Repeats: Hidden Gems of the Genome. Genes. 2019; 10(3):223. https://doi.org/10.3390/genes10030223

Chicago/Turabian Style

Hartley, Gabrielle, and Rachel J. O’Neill. 2019. "Centromere Repeats: Hidden Gems of the Genome" Genes 10, no. 3: 223. https://doi.org/10.3390/genes10030223

APA Style

Hartley, G., & O’Neill, R. J. (2019). Centromere Repeats: Hidden Gems of the Genome. Genes, 10(3), 223. https://doi.org/10.3390/genes10030223

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop