Galaxy Dnpatterntools for Computational Analysis of Nucleosome Positioning Sequence Patterns †
Abstract
:1. Introduction
1.1. Features of Nucleosomal DNA Sequences
1.2. Patterns of Dinucleotide Distributions in the Ensembles of Nucleosomal DNA Sequences
1.3. Computational Tools
2. Results
2.1. Implementation
2.2. Workflow
- computation of distribution of frequency of dinucleotide occurrences in a batch of aligned sequences;
- determination of nucleosome position in the sequences;
- selection and symmetrization of dinucleotide frequency profiles from the determined interval;
- computation of frequency profiles of composite dinucleotides WW/SS (W = A or T and S = C or G) and RR/YY (R = A or G and Y = C or T);
- normalization and smoothing of the frequency profiles to remove noise
- computation of the periodograms.
2.2.1. Computation of Distributions of Dinucleotide Frequencies along Nucleosomal DNA from a Batch of Sequences
2.2.2. Determination of Nucleosome Position Using Dyad-Symmetry of Dinucleotide Frequency Profiles
2.2.3. Correlations between Forward and Reverse Patterns
2.2.4. Patterns of Dinucleotide Frequency Distributions and Their Periodograms
2.2.5. Symmetrization
2.2.6. Periodicity of Dinucleotide Steps
2.2.7. Dinucleotide Shuffling
3. Mapping of Nucleosome Positions in Sequence by Pattern
3.1. Mapping Algorithm
3.2. Mapping Application in Nucleosome DNA Sequences of 17 Organisms
4. Discussion
5. Materials and Methods
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
NPS | Nucleosome Positioning Sequence |
SHL | Super Helical Locations |
NCP | Nucleosome Core Particle |
GEO | Gene Expression Omnibus |
NCBI | National Center of Biotechnology Information |
NAC | Nucleus Accumbens Cells |
fw | forward |
rc | reverse complement |
DANPOS | Dynamic analysis of nucleosome position and occupancy by sequencing |
Hi-C | high-throughput chromosome conformation capture |
ATAC-seq | Assay for Transposase-Accessible Chromatin with high-throughput sequencing |
MNase-seq | micrococcal nuclease digestion with deep sequencing |
References
- Iyer, V.R. Nucleosome positioning: Bringing order to the eukaryotic genome. Trends Cell Biol. 2012, 22, 250–256. [Google Scholar] [CrossRef] [Green Version]
- Lowary, P.T.; Widom, J. New DNA sequence rules for high affinity binding to histone octamer and sequence-directed nucleosome positioning. J. Mol. Biol. 1998, 276, 19–42. [Google Scholar] [CrossRef] [PubMed]
- Segal, M.R. Re-cracking the nucleosome positioning code. Stat. Appl. Genet. Mol. Biol. 2008, 7, 14. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Scipioni, A.; De Santis, P. Predicting nucleosome positioning in genomes: Physical and bioinformatic approaches. Biophys. Chem. 2011, 155, 53–64. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Chung, H.R.; Vingron, M. Sequence-dependent nucleosome positioning. J. Mol. Biol. 2009, 386, 1411–1422. [Google Scholar] [CrossRef] [PubMed]
- Ioshikhes, I.; Hosid, S.; Pugh, F.B. Variety of genomic DNA patterns for nucleosome positioning. Genome Res. 2011, 21, 1863–1871. [Google Scholar] [CrossRef] [Green Version]
- Hughes, A.L.; Rando, O.J. Mechanisms underlying nucleosome positioning in vivo. Annu. Rev. Biophys. 2014, 43, 41–63. [Google Scholar] [CrossRef]
- Snyder, M.W.; Kircher, M.; Hill, A.J.; Daza, R.M.; Shendure, J. Cell-free DNA comprises an in vivo nucleosome footprint that informs its tissues-of-origin. Cell 2016, 164, 57–68. [Google Scholar] [CrossRef] [Green Version]
- Clapier, C.R.; Chacravarthy, S.; Petosa, C.; Fernandez-Tornero, C.; Luger, K.; Muller, C.W. Structure of the Drosophila nucleosome core particle highlights evolutionary constraints on the H2A-H2B histone dimer. Proteins 2008, 71, 1–7. [Google Scholar] [CrossRef]
- Teif, V.B.; Clarkson, C.T. Nucleosome positioning. In Encyclopedia of Bioinformatics and Computational Biology; Ranganathan, S., Gribskov, M., Schonbach, C., Eds.; Elsevier: Amsterdam, The Netherlands, 2019; Volume 2, pp. 308–317. ISBN 978-0-12-811432-2. [Google Scholar]
- Struhl, K.; Segal, E. Determinants of nucleosome positioning. Nat. Struct. Mol. Biol. 2013, 20, 267–273. [Google Scholar] [CrossRef]
- Cui, F.; Zhurkin, V. Structure-based Analysis of DNA Sequence Patterns Guiding Nucleosome Positioning in vitro. J. Biomol. Struct. Dyn. 2010, 27, 821–841. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Luque, A.; Ozer, G.; Schlick, T. Correlation among DNA linker length, linker histone concentration, and histone tails in chromatin. Biophys. J. 2016, 110, 2309–2319. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Gaffney, D.J.; McVicker, G.; Pai, A.A.; Fondufe-Mittendorf, Y.N.; Lewellen, N.; Michelini, K.; Widom, J.; Gilad, Y.; Pritchard, J.K. Controls of nucleosome positioning in the human genome. PLoS Genet. 2012, 8, e1003036. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Giancarlo, R.; Rombo, S.E.; Utro, F. In vitro versus in vivo compositional landscapes of histone sequence preferences in eucaryotic genomes. Bioinformatics 2018, 34, 3454–3460. [Google Scholar] [CrossRef] [PubMed]
- Tompitak, M.; Vaillant, C.; Schiessel, H. Genomes of multicellular organisms have evolved to attract nucleosomes to promoter regions. Biophys. J. 2017, 112, 505–511. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Pranckeviciene, E.; Hosid, S.; Liang, N.; Ioshikhes, I. Nucleosome sequence positioning patterns as packing or regulatory. PLoS Comput. Biol. 2020, 16, e1007365. [Google Scholar] [CrossRef] [Green Version]
- Gu, Z.; Gu, L.; Eils, R.; Schlesner, M.; Brors, B. Circlize implements and enhances circular visualization in R. Bioinformatics 2014, 30, 2811–2812. [Google Scholar] [CrossRef] [Green Version]
- Trifonov, E.N. Nucleosome Positioning by Sequence, State of the Art and Apparent Finale. J. Biomol. Struct. Dyn. 2010, 27, 741–746. [Google Scholar] [CrossRef]
- Polishko, A.; Ponts, N.; Le Roch, K.G.; Lonardi, S. NOrMAL: Accurate nucleosome positioning using a modified Gaussian mixture model. Bioinformatics 2012, 28, i242–i249. [Google Scholar] [CrossRef]
- Quintales, L.; Vazquez, E.; Antequera, F. Comparative analysis of methods for genome-wide nucleosome cartography. Brief Bioinform. 2015, 4, 576–587. [Google Scholar] [CrossRef] [Green Version]
- Chen, K.; Xi, Y.; Pan, X.; Li, Z.; Kaestner, K.; Tyler, J.; Dent, S.; He, X.; Li, W. DANPOS: Dynamic analysis of nucleosome position and occupancy by sequencing. Genome Res. 2013, 23, 341–351. [Google Scholar] [CrossRef] [Green Version]
- Zhang, Y.; Shin, H.; Song, J.S.; Lei, Y.; Liu, X.S. Identifying positioned nucleosomes with epigenetic marks in human from ChIP-Seq. BMC Genom. 2008, 9, 537. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Tirosh, I. Computational analysis of nucleosome positioning. Methods Mol. Biol. 2012, 833, 443–449. [Google Scholar]
- Nellore, A.; Bobkov, K.; Howe, E.; Pankov, A.; Diaz, A.; Song, J.S. NSeq: A multithreaded Java application for finding positioned nucleosomes from sequencing data. Front. Genet. 2012, 3, 320. [Google Scholar] [CrossRef] [Green Version]
- Zhou, X.; Blocker, A.W.; Airoldi, E.M.; O’Shea, E.K. A computational approach to map nucleosome positions and alternative chromatin states with base pair resolution. eLife 2016, 5, e16970. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Brunelle, M.; Rodrigue, S.; Jacques, P.E.; Gevry, N. Hgh-resolution genome-wide mapping of nucleosome positioning and occupancy level using paired-end sequencing technology. Methods Mol. Biol. 2017, 1528, 229–243. [Google Scholar]
- Zhang, J.; Peng, W.; Wang, L. LeNup: Learning nucleosome positioning from DNA sequences with improved convolutional neural networks. Bioinformatics 2018, 30, 1705–1712. [Google Scholar] [CrossRef] [Green Version]
- Teif, V.B. Nucleosome positioning: Resources and tools online. Brief Bioinform. 2016, 17, 745–757. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Moser, C.; Gupta, M. A generalized hidden Markov model for determining sequence-based predictors of nucleosome positioning. Stat. Appl. Genet. Mol. Biol. 2012, 11, 1–21. [Google Scholar] [CrossRef] [Green Version]
- Xi, L.; Fondufe-Mittendorf, Y.; Xia, L.; Flatow, J.; Widom, J.; Wang, J.P. Predicting nucleosome positioning using a duration Hidden Markov Model. BMC Bioinform. 2010, 11, 346. [Google Scholar] [CrossRef] [Green Version]
- Afgan, E.; Baker, D.; Batut, B.; van den Beek, M.; Bouvier, D.; Cech, M.; Chilton, J.; Clements, D.; Coraor, N.; Gruning, B.; et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. 2018, 46, W537–W544. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Hosid, S.; Ioshikhes, I. Apoptotic Lymphocytes of H. Sapiens lose nucleosomes in CG-rich promoters. PLoS Comput. Biol. 2014, 10, e1003760. [Google Scholar] [CrossRef] [PubMed]
- Doring, A.; Weese, D.; Rausch, T.; Reinert, K. SeqAn an efficient generic C++ library for sequence analysis. BMC Bioinform. 2008, 9, 11. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Grüning, B.; Dale, R.; Sjödin, A.; Chapman, B.A.; Rowe, J.; Tomkins-Tinch, C.H.; Valieris, R.; Köster, J.; The Bioconda Team. Bioconda: Sustainable and comprehensive software distribution for the life sciences. Nat. Methods 2018, 15, 475–476. [Google Scholar] [CrossRef] [PubMed]
- Jiang, M.; Anderson, J.; Gillespie, J.; Mayne, M. uShuffle: A useful tool for shuffling biological sequences while preserving the k-let counts. BMC Bioinform. 2008, 9, 192. [Google Scholar] [CrossRef] [Green Version]
- Pranckeviciene, E.; Ioshikhes, I. Dnpatterntools suite for nucleosome positioning sequence. In Proceedings of the Bioinformatics Community Conference 2020 (BCC2020), Online, 19–21 July 2020. [Google Scholar]
- Schones, D.E.; Cui, K.; Cuddapah, S.; Roh, T.; Barski, A.; Wang, Z.; Wei, G.; Zhao, K. Dynamic regulation of nucleosome positioning in the human genome. Cell 2008, 132, 887–898. [Google Scholar] [CrossRef] [Green Version]
- Bettecken, T.; Frenkel, Z.M.; Altmuller, J.; Nurnberg, P.; Trifonov, E.N. Apoptotic cleavage of DNA in human lymphocyte chromatin shows high sequence specificity. J. Biomol. Struct. Dyn. 2012, 30, 211–216. [Google Scholar] [CrossRef]
- Sun, H.; Damez-Werno, D.; Scobie, K.N.; Shao, N.; Dias, C.; Rabkin, J.; Koo, J.W.; Korb, E.; Bagot, R.C.; Ahn, F.H.; et al. ACF chromatin-remodeling complex mediates stress-induced depressive-like behavior. Nat. Med. 2015, 21, 1146–1153. [Google Scholar] [CrossRef] [Green Version]
- Zhang, Z.; Pugh, B.F. High-resolution genome-wide mapping of the primary structure of chromatin. Cell 2011, 144, 175–186. [Google Scholar] [CrossRef] [Green Version]
- Brogaard, K.R.; Xi, L.; Wang, J.P.; Widom, J. A chemical approach to mapping nucleosomes at base pair resolution in yeast. Methods Enzymol. 2012, 513, 315–334. [Google Scholar]
- Chereji, R.V.; Ramachandran, S.; Bryson, T.D.; Henikoff, S. Precise genome-wide mapping of single nucleosomes and linkers in vivo. Genome Biol. 2018, 19, 1–20. [Google Scholar] [CrossRef] [PubMed]
- Zhong, J.; Luo, K.; Winter, P.S.; Crawford, G.E.; Iversen, E.S.; Hartemink, A.J. Mapping nucleosome positions using DNase-seq. Genome Res. 2016, 26, 351–364. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Giresi, P.G.; Lieb, J.D. Isolation of active regulatory elements from eukaryotic chromatin using FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements). Methods 2009, 48, 233–239. [Google Scholar] [CrossRef] [Green Version]
- Luger, K.; Mader, A.W.; Richmond, R.K.; Sargent, D.F.; Richmond, T.J. Crystal structure of the nucleosome core particle at 2.8 A resolution. Nature 1997, 389, 251–260. [Google Scholar] [CrossRef] [PubMed]
- Beh, L.Y.; Muller, M.M.; Muir, T.W.; Kaplan, N.; Landweber, L.F. DNA-guided establishment of nucleosome patterns with coding regions of a eukaryotic genome. Genome Res. 2015, 25, 1727–1738. [Google Scholar] [CrossRef] [Green Version]
- Bolshoy, A.; Ioshikhes, I.; Trifonov, E.N. Applicability of the multiple alignment algorithm for detection of weak patterns: Periodically distributed DNA pattern as a study case. Comput. Appl. Biosci. 1996, 12, 383–389. [Google Scholar] [CrossRef] [Green Version]
- Eslami-Mossalam, B.; Shiessel, H.; van Noort, J. Nucleosome dynamics: Sequence maters. Adv. Colloid Interface Sci. 2016, 232, 101–113. [Google Scholar] [CrossRef]
- Bolshoy, A. CC dinucleotides contribute to the bending of DNA in chromatin. Nat. Struct. Biol. 1995, 2, 446–448. [Google Scholar] [CrossRef]
- Bettecken, T.; Trifonov, E.N. Repertoires of the nucleosome positioning dinucleotides. PLoS ONE 2009, 4, e7654. [Google Scholar] [CrossRef]
- Radman-Livaja, M.; Rando, O. Nucleosome positioning: How is it established, and why does it matter? Dev. Biol. 2010, 339, 258–266. [Google Scholar] [CrossRef] [Green Version]
- Yang, D.; Ioshikhes, I. Drosophila H2A and H2A.Z nucleosome sequences reveal different nucleosome positioning sequence patterns. J. Comput. Biol. 2017, 24, 289–298. [Google Scholar] [CrossRef] [PubMed]
- Ioshikhes, I.P.; Albert, I.; Zanton, S.J.; Pugh, B.F. Nucleosome positions predicted through comparative genomics. Nat. Genet. 2006, 38, 1210–1215. [Google Scholar] [CrossRef] [PubMed]
- Ioshikhes, I.; Bolshoy, A.; Derenshteyn, K.; Borodovsky, M.; Trifonov, E.N. Nucleosome DNA sequence pattern revealed by multiple alignment of experimentally mapped sequences. J. Mol. Biol. 1996, 262, 129–139. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Servant, N.; Varoquaux, N.; Lajoie, B.R.; Viara, E.; Chen, C.J.; Vert, J.P.; Heard, E.; Dekker, J.; Barillot, E. HiC-Pro: An optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015, 16, 259. [Google Scholar] [CrossRef] [Green Version]
- Buenrostro, J.D.; Giresi, P.G.; Zaba, L.C.; Chang, H.Y.; Greenleaf, W.J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 2013, 10, 1213–1218. [Google Scholar] [CrossRef]
- Lai, B.; Gao, W.; Cui, K.; Xie, W.; Tang, Q.; Jin, W.; Hu, G.; Ni, B.; Zhao, K. Principles of nucleosome organization revealed by single-cell micrococcal nuclease sequencing. Nature 2018, 562, 281–285. [Google Scholar] [CrossRef]
Core Utilities |
---|
C/C++ binary tools (bioconda package) |
Compute binary strings from sequences (dnp-binstrings). |
Compute dinucleotide frequencies in sequences (dnp-diprofile). |
Compute correlation between forward and reverse complement profiles (dnp-corrprofile). |
Compute periodogram, normalization and smoothing (dnp-fourier). |
Mapping_CC, map nucleosome by pattern in a given sequence (dnp-mapping). |
Helper Utilities |
Shell scripts |
Binary strings for multiple dinucleotides. |
Frequency profiles of all dinucleotides. |
Correlations for all dinucleotides. |
Select profiles within interval. |
Composite WW/SS and RR/YY dinucleotide profiles. |
Symmetrization of frequency profiles. |
Smoothing by moving average. |
Periodogram for all dinucleotides. |
Gnuplot of selected columns. |
Mapping nucleosomes in multiple FASTA sequences by multiple patterns. |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Pranckeviciene, E.; Hosid, S.; Maziukas, I.; Ioshikhes, I. Galaxy Dnpatterntools for Computational Analysis of Nucleosome Positioning Sequence Patterns. Int. J. Mol. Sci. 2022, 23, 4869. https://doi.org/10.3390/ijms23094869
Pranckeviciene E, Hosid S, Maziukas I, Ioshikhes I. Galaxy Dnpatterntools for Computational Analysis of Nucleosome Positioning Sequence Patterns. International Journal of Molecular Sciences. 2022; 23(9):4869. https://doi.org/10.3390/ijms23094869
Chicago/Turabian StylePranckeviciene, Erinija, Sergey Hosid, Indiras Maziukas, and Ilya Ioshikhes. 2022. "Galaxy Dnpatterntools for Computational Analysis of Nucleosome Positioning Sequence Patterns" International Journal of Molecular Sciences 23, no. 9: 4869. https://doi.org/10.3390/ijms23094869