Next Article in Journal
Difunctional Magnetic Nanoparticles Employed in Immunochromatographic Assay for Rapid and Quantitative Detection of Carcinoembryonic Antigen
Next Article in Special Issue
The First Five Mitochondrial Genomes for the Family Nidulariaceae Reveal Novel Gene Rearrangements, Intron Dynamics, and Phylogeny of Agaricales
Previous Article in Journal
miR-143-3p Promotes Ovarian Granulosa Cell Senescence and Inhibits Estradiol Synthesis by Targeting UBE2E3 and LHCGR
Previous Article in Special Issue
Transcriptomic Responses of Salvia hispanica to the Infestation of Red Spider Mites (Tetranychus neocaledonicus)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Classification of Promoter Sequences from Human Genome

1
Bach Institute of Biochemistry, Federal Research Center of Biotechnology of the Russian Academy of Sciences, 119071 Moscow, Russia
2
Institute of Bioengineering, Federal Research Center of Biotechnology of the Russian Academy of Sciences, 119071 Moscow, Russia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(16), 12561; https://doi.org/10.3390/ijms241612561
Submission received: 2 June 2023 / Revised: 28 July 2023 / Accepted: 3 August 2023 / Published: 8 August 2023
(This article belongs to the Collection Feature Papers in Molecular Genetics and Genomics)

Abstract

We have developed a new method for promoter sequence classification based on a genetic algorithm and the MAHDS sequence alignment method. We have created four classes of human promoters, combining 17,310 sequences out of the 29,598 present in the EPD database. We searched the human genome for potential promoter sequences (PPSs) using dynamic programming and position weight matrices representing each of the promoter sequence classes. A total of 3,065,317 potential promoter sequences were found. Only 1,241,206 of them were located in unannotated parts of the human genome. Every other PPS found intersected with either true promoters, transposable elements, or interspersed repeats. We found a strong intersection between PPSs and Alu elements as well as transcript start sites. The number of false positive PPSs is estimated to be 3 × 10−8 per nucleotide, which is several orders of magnitude lower than for any other promoter prediction method. The developed method can be used to search for PPSs in various eukaryotic genomes.
Keywords: human genome; promoter; genetic algorithm; multiple alignment human genome; promoter; genetic algorithm; multiple alignment

Share and Cite

MDPI and ACS Style

Zaytsev, K.; Fedorov, A.; Korotkov, E. Classification of Promoter Sequences from Human Genome. Int. J. Mol. Sci. 2023, 24, 12561. https://doi.org/10.3390/ijms241612561

AMA Style

Zaytsev K, Fedorov A, Korotkov E. Classification of Promoter Sequences from Human Genome. International Journal of Molecular Sciences. 2023; 24(16):12561. https://doi.org/10.3390/ijms241612561

Chicago/Turabian Style

Zaytsev, Konstantin, Alexey Fedorov, and Eugene Korotkov. 2023. "Classification of Promoter Sequences from Human Genome" International Journal of Molecular Sciences 24, no. 16: 12561. https://doi.org/10.3390/ijms241612561

APA Style

Zaytsev, K., Fedorov, A., & Korotkov, E. (2023). Classification of Promoter Sequences from Human Genome. International Journal of Molecular Sciences, 24(16), 12561. https://doi.org/10.3390/ijms241612561

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop