Next Article in Journal
VR Traffic Dataset on Broad Range of End-User Activities
Previous Article in Journal
Towards Action-State Process Model Discovery
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Data Descriptor

Draft Genome Sequence Data of Streptomyces anulatus, Strain K-31

by
Andrey P. Bogoyavlenskiy
*,
Madina S. Alexyuk
,
Amankeldi K. Sadanov
,
Vladimir E. Berezin
,
Lyudmila P. Trenozhnikova
and
Gul B. Baymakhanova
Research and Production Center for Microbiology and Virology, Almaty 050010, Kazakhstan
*
Author to whom correspondence should be addressed.
Data 2023, 8(8), 131; https://doi.org/10.3390/data8080131
Submission received: 1 July 2023 / Revised: 25 July 2023 / Accepted: 8 August 2023 / Published: 10 August 2023

Abstract

:
Streptomyces anulatus is a typical representative of the Streptomyces genus synthesizing a large number of biologically active compounds. In this study, the draft genome of Streptomyces anulatus, strain K-31 is presented, generated from Illumina reads by SPAdes software. The size of the assembled genome was 8.548838 Mb. Annotation of the S. anulatus genome assembly identified C. hemipterus genome 7749 genes, including 7149 protein-coding genes and 92 RNA genes. This genome will be helpful to further understand Streptomyces genetics and evolution and can be useful for obtained biological active compounds.
Dataset: The raw genome sequencing data of Illumina MiSeq were submitted to NCBI SRA database in FASTQ format with BioSample: SAMN35538655, under BioProject PRJNA977889. The assembled genome available in NCBI GeneBank under CP127386.1.
Dataset License: CC0

1. Summary

Actinobacteria are typical representatives of soil and water samples [1,2,3]. They take part in the cycle of organic substances and in the carbon cycle. In addition, more than half of clinically used antibiotics, including aminoglycosides, lactamybean anthracyclines, macrolides, etc., are synthesized by representatives of actinomycetes. To date, more than 7600 compounds are known to be synthesized by this group of microorganisms [1,2,3]. The genome Streptomycetes have biosynthesis gene clusters encode enzymes that are responsible for the synthesis of several classes of compounds, including nonribosomal peptides, polyketides, phenazines, terpenes, etc. [4]. Streptomyces anulatus can produces actinomycin C, cactinomycin, endophenazine A, endophenazine B, endophenazine C, epocarbazolin A, epocarbazolin B, tubermycin B, dextranase, and telomestatin; therefore, a draft genome of a new strain may be of interest to many groups of investigators.

2. Data Description (Required)

Streptomyces anulatus, strain k-31 was isolated from soil “Solonetzes”, Kazakhstan, Kostanay region “52.179274 N 64.069596 E”. The draft genome characteristics are summarized in Table 1. The size of the genome was 8548838 bp. Based on genome annotation, there were 7749 genes, including 92 RNA genes and 27 numbers of subsystem sequences (Table 1, Figure 1).
Additionally, in the genome Streptomyces anulatus, strain K-31, the presence of gene clusters synthesizing secondary metabolites was determined by antiSMASH 7.0 program [9]. Thirty-two gene clusters were determined (Table 2). The presented genes included type I, II, and III polyketides synthase (PKS) and non-ribosomal polypeptide synthetase (NRPS). Polyketide synthases (PKSs) produce highly diverse polyketide chains by sequential condensation of the starter units with extender units, followed by reduction, cyclization, aromatization, and tailoring reactions to synthesis aromatic core with several annulated rings. There is coelimycin, coelichelin, borrelidin, steffimycin, etc.
The 16S rRNA phylogenetic analysis revealed that Streptomyces anulatus K-31 is closely related to the CP 110539 Streptomyces anulatus strain YUW1 (97.07%), CP061072 Streptomyces sp. (97.07%), and CP096281 Streptomyces sp. WAC00303 (100%).
This genome will be helpful in further understanding the evolutions of Streptomyces genetics and can be useful for obtained biological active compounds.

3. Methods

3.1. Sample Collection and Isolation of Streptomyces

Natural soil sample (muds and rhizosphere) was collected from the extreme habitats of northern (Kostanay region, 52.179274 N 64.069596 E) Kazakhstan. The soil sample collected was of the solonetz type. Solonetz are soils with a high content of exchangeable sodium and/or magnesium ions. The soil sample was collected at the depth of 10 cm (~4 inches). The samples were packed in sterile plastic containers, transported to the laboratory, and refrigerated at 4 °C (39 °F) until ready for analysis [10].
Soil samples were plated using the standard method. Actinomycetes samples were isolated on two variants of modified Bennett’s agar: glucose (0.2%), peptone (0.2%), yeast extract (0.1%), and agar (2.0%), with 5% NaCl, pH 7.2 (variant #1), or with 0.5% Na2CO3, pH 9.0 (variant #2). The medium was adjusted with NaOH to pH 9.0 after sterilization. The purity of the isolated strains was confirmed using the standard microbiological method. The culture was stored in 20% glycerol stock at −80 °C.

3.2. DNA Isolation, Genome Sequencing, Assembly, and Annotation

Genomic DNA was isolated using the PureLink Genomic DNA Mini Kit according to the manufacturer’s instructions (ThermoFisher Scientific, Waltham, MA, USA). A whole-genome sequencing library was prepared using the Nextera XT DNA library preparation kit following the manufacturer’s instructions (Illumina, Cambridge, UK). The libraries were sequenced using the Miseq platform (Illumina, Cambridge, UK) to generate 2 × 300 paired end reads. The raw reads adapters were trimmed by Trimmomatic version 0.38.0 [11]. Low-quality sequences (<Q30) were deleted. After trimming, reads were contained from 50 to 250 bp. Genome assembly was performed using SPAdes version 3.12.0 [6]. After assembly, the quality of the genome was checked by the software Geneious Prime 2023 by mapping to the reference genome [12]. The annotation genome was determined by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP), GeneMarkS-2+, RAST, and Bacta [5,7,8]. The secondary metabolites biosynthetic gene clusters of the strain K-31 draft genome were identified using antiSMASH version 7.0 [9].

Author Contributions

Conceptualization, A.P.B.; methodology, M.S.A.; software, M.S.A.; validation, G.B.B. and L.P.T.; formal analysis, A.P.B.; strain isolation and cultivation, L.P.T. and G.B.B.; data curation, M.S.A.; writing—original draft preparation, A.P.B.; writing—review and editing, all authors; visualization, A.P.B.; supervision, A.K.S. and V.E.B.; funding acquisition, A.K.S. and V.E.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Ministry of Education and Science of the Republic of Kazakhstan (grant number BR10965178 “Development of original domestic drugs with antiviral activity efficient against COVID-19 and influenza”).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw genome sequencing data of Illumina MiSeq deposited in NCBI in FASTQ format with BioSample: SAMN35538655, under BioProject PRJNA977889. The assembled genome is available in NCBI GeneBank under CP127386.1.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Jagannathan, S.V.; Manemann, E.M.; Rowe, S.E.; Callender, M.C.; Soto, W. Marine Actinomycetes, New Sources of Biotechnological Products. Mar. Drugs 2021, 19, 365. [Google Scholar] [CrossRef] [PubMed]
  2. Singh, L.S.; Sharma, H.; Talukdar, N.C. Production of potent antimicrobial agent by actinomycete, Streptomyces sannanensis strain SU118 isolated from phoomdi in Loktak Lake of Manipur, India. BMC Microbiol. 2014, 14, 278. [Google Scholar] [CrossRef] [PubMed]
  3. Salih, S.I.; Maaroof, M.N. Isolation and identification some species of Streptomyces bacteria producing antibiotics and molecular detection of gene 16srrna and alignment nucleotide sequences with the NCBI. Ann. For. Res. 2022, 65, 4000–4013. [Google Scholar]
  4. Yang, Y.; Kessler, M.G.C.; Marchán-Rivadeneira, M.R.; Han, Y. Combating Antimicrobial Resistance in the Post-Genomic Era: Rapid Antibiotic Discovery. Molecules 2023, 28, 4183. [Google Scholar] [CrossRef] [PubMed]
  5. Aziz, R.K.; Bartels, D.; Best, A.A.; DeJongh, M.; Disz, T.; Edwards, R.A.; Formsma, K.; Gerdes, S.; Glass, E.M.; Kubal, M.; et al. The RAST Server: Rapid Annotations using Subsystems Technology. BMC Genom. 2008, 9, 75. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Bankevich, A.; Nurk, S.; Antipov, D.; Gurevich, A.A.; Dvorkin, M.; Kulikov, A.S.; Lesin, V.M.; Nikolenko, S.I.; Pham, S.; Prjibelski, A.D.; et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 2012, 19, 455–477. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Lomsadze, A.; Gemayel, K.; Tang, S.; Borodovsky, M. Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes. Genome Res. 2018, 29, 1079–1089. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Schwengers, O.; Jelonek, L.; Dieckmann, M.A.; Beyvers, S.; Blom, J.; Goesmann, A. Bakta: Rapid and standardized annotation of bacterial genomes via alignment-free sequence identification. Microb. Genom. 2021, 7, 000685. [Google Scholar] [CrossRef] [PubMed]
  9. Blin, K.; Shaw, S.; Augustijn, H.; Reitz, Z.; Biermann, F.; Alanjary, M.; Fetter, A.; Terlouw, B.; Metcalf, W.; Helfrich, E.; et al. antiSMASH 7.0: New and improved predictions for detection, regulation, chemical structures and visualization. Nucleic Acids Res. 2023, 51, W46–W50. [Google Scholar] [CrossRef] [PubMed]
  10. Trenozhnikova, L.; Azizan, A. Discovery of Actinomycetes from Extreme Environments with Potential to Produce Novel Antibiotics. Cent. Asian J. Glob. Health 2018, 7, 337. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Subsystem statistics information of K-31 using RAST annotation [5]. The subsystems category and corresponding feature counts were shown in the legend.
Figure 1. Subsystem statistics information of K-31 using RAST annotation [5]. The subsystems category and corresponding feature counts were shown in the legend.
Data 08 00131 g001
Table 1. Characteristics of draft genome assembly of Streptomyces anulatus, strain K-31 [5,6,7,8].
Table 1. Characteristics of draft genome assembly of Streptomyces anulatus, strain K-31 [5,6,7,8].
Total genome size8,548,838
Number of contigs about 100,000 bp 1754
N50 contig number9871
Genes (total)7749
CDSs (total)7657
CDSs (with protein)7143
Genes (RNA)92
tRNAs71
Table 2. Putative region clusters coding for secondary metabolites detected by antiSMASH annotation of Streptomyces anulatus K-31 [9].
Table 2. Putative region clusters coding for secondary metabolites detected by antiSMASH annotation of Streptomyces anulatus K-31 [9].
RegionTypefromtoSimilarityMIBiG AccessionShort Description
Region 1butyrolactone40,58351,5278%BGC0000038coelimycin P1, Polyketide, biosynthetic gene cluster from Streptomyces coelicolor A3(2)
Region 2terpene85,010105,914100%BGC0001181geosmin biosynthetic gene cluster from Streptomyces coelicolor A3(2)
Region 3NRP-metallophore232,803291,36581%BGC0000325coelichelin biosynthetic gene cluster from Streptomyces coelicolor A3(2)
Region 4T3PKS 359,289398,962100%BGC0001310naringenin biosynthetic gene cluster from Streptomyces clavuligerus ATCC 27064
Region 5NRPS 655,387698,4499%BGC0000031borrelidin biosynthetic gene cluster from Streptomyces parvulus
Region 6PKS-like 717,893758,9515%BGC0001036polyoxypeptin biosynthetic gene cluster from Streptomyces sp. MK498-98F14
Region 7Anthracycline 1,035,0681,054,28819%BGC0000273steffimycin D biosynthetic gene cluster from Streptomyces steffisburgensis
Region 8ectoine 1,509,0351,519,433100%BGC0000853ectoine biosynthetic gene cluster from Streptomyces anulatus
Region 9lanthipeptide-class-ii, iii2,543,0342,574,3579%BGC0000603cyclothiazomycin biosynthetic gene cluster from Streptomyces hygroscopicus
Region 10NI-siderophore 2,629,5092,639,74480%BGC0000941desferrioxamin B biosynthetic gene cluster from Streptomyces griseus subsp. griseus NBRC 13350
Region 11lanthipeptide-class-i 3,978,5524,003,814
Region 12NRPS 4,330,2994,424,13063%BGC0002676skyllamycin D biosynthetic gene cluster from Streptomyces anulatus
Region 13arylpolyene 4,769,1414,810,54113%BGC0000698hygromycin A biosynthetic gene cluster from Streptomyces hygroscopicus
Region 14NRPS-like 5,353,5075,406,85510%BGC0000341enduracidin biosynthetic gene cluster from Streptomyces fungicidicus
Region 15thiopeptide 5,418,6385,447,54011%BGC0002376bombyxamycin A biosynthetic gene cluster from Streptomyces sp
Region 16melanin 5,465,6245,474,333100%BGC0000912melanin biosynthetic gene cluster from Streptomyces griseus subsp. griseus NBRC 13350
Region 17lanthipeptide-class-iii 5,516,2825,538,933100%BGC0000496AmfS biosynthetic gene cluster from Streptomyces griseus subsp. griseus NBRC 13350
Region 18terpene 5,922,7045,942,363
Region 19NI-siderophore 6,342,9626,357,68320%BGC0002683schizokinen biosynthetic gene cluster from Nostoc sp. PCC 7120 = FACHB-418
Region 20NRPS 6,543,2376,624,40797%BGC0001438warkmycin CS1 biosynthetic gene cluster from Streptomyces sp. CS057
Region 21Polyketide
6,931,0206,950,55411%BGC0001590formicamycins A m biosynthetic gene cluster from Streptomyces sp. KY5
Region 22terpene 7,370,0547,396,45161%BGC0000663hopene biosynthetic gene cluster from Streptomyces coelicolor A3(2)
Region 23linaridin 7,463,3397,483,9479%BGC0001735pentostatine biosynthetic gene cluster from Streptomyces antibioticus
Region 24T1PKS 7,546,7937,595,910100%BGC0001043SGR PTMs biosynthetic gene cluster from Streptomyces griseus subsp. griseus NBRC 13350
Region 25RiPP-like 7,611,0067,621,2673%BGC0000163tetronasin biosynthetic gene cluster from Streptomyces longisporoflavus
Region 26T1PKS 7,720,8167,765,65816%BGC0001584C-1027 biosynthetic gene cluster from Streptomyces sp. CB02366
Region 27RiPP-like 7,854,0787,865,44558%BGC0002115streptamidine biosynthetic gene cluster from Streptomyces albidoflavus J1074
Region 28melanin 7,866,2887,876,767100%BGC0000911melanin biosynthetic gene cluster from Streptomyces griseus subsp. griseus NBRC 13350
Region 29T3PKS 7,911,5017,952,553100%BGC0000282alkylresorcinol biosynthetic gene cluster from Streptomyces griseus subsp. griseus NBRC 13350
Region 30NRPS-like 7,991,6668,068,94761%BGC0000175cycloheximide biosynthetic gene cluster from Streptomyces sp. YIM 56141
Region 31T1PKS 8,129,2758,182,15911%BGC0000290A-47934 biosynthetic gene cluster from Streptomyces toyocaensis
Region 32NRPS 8,242,7278,329,466100%BGC0000664isorenieratene biosynthetic gene cluster from Streptomyces griseus subsp. griseus NBRC 13350
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bogoyavlenskiy, A.P.; Alexyuk, M.S.; Sadanov, A.K.; Berezin, V.E.; Trenozhnikova, L.P.; Baymakhanova, G.B. Draft Genome Sequence Data of Streptomyces anulatus, Strain K-31. Data 2023, 8, 131. https://doi.org/10.3390/data8080131

AMA Style

Bogoyavlenskiy AP, Alexyuk MS, Sadanov AK, Berezin VE, Trenozhnikova LP, Baymakhanova GB. Draft Genome Sequence Data of Streptomyces anulatus, Strain K-31. Data. 2023; 8(8):131. https://doi.org/10.3390/data8080131

Chicago/Turabian Style

Bogoyavlenskiy, Andrey P., Madina S. Alexyuk, Amankeldi K. Sadanov, Vladimir E. Berezin, Lyudmila P. Trenozhnikova, and Gul B. Baymakhanova. 2023. "Draft Genome Sequence Data of Streptomyces anulatus, Strain K-31" Data 8, no. 8: 131. https://doi.org/10.3390/data8080131

Article Metrics

Back to TopTop