Next Article in Journal
Responses to Drought Stress Modulate the Susceptibility to Plasmopara viticola in Vitis vinifera Self-Rooted Cuttings
Next Article in Special Issue
Validation of Reference Genes for Studying Different Abiotic Stresses in Oat (Avena sativa L.) by RT-qPCR
Previous Article in Journal
Involvement of Polyamine Metabolism in the Response of Medicago truncatula Genotypes to Salt Stress
Previous Article in Special Issue
Natural Variation and Domestication Selection of ZmCKX5 with Root Morphological Traits at the Seedling Stage in Maize
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Sinbase 2.0: An Updated Database to Study Multi-Omics in Sesamum indicum

1
Key Laboratory of Biology and Genetic Improvement of Oil Crops, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Ministry of Agriculture, No.2 Xudong Second Road, Wuhan 430062, China
2
Shandong Luyan Agricultural Co., LTC, Jinan 250100, China
3
Boyce Thompson Institute, Ithaca, NY 14853, USA
*
Author to whom correspondence should be addressed.
Equal contributors.
Plants 2021, 10(2), 272; https://doi.org/10.3390/plants10020272
Submission received: 31 December 2020 / Revised: 27 January 2021 / Accepted: 27 January 2021 / Published: 30 January 2021
(This article belongs to the Collection Crop Genomics and Breeding)

Abstract

:
Sesame is one of the oldest oil crops in the world and is widely grown in the tropical and subtropical areas of Asia, Africa and America. Upon the completion of the sesame reference genome version 1.0, we launched Sinbase 1.0 as an integrated database for genomic and bioinformatics analyses. Recently, an upgraded version (version 2.0) of the genome sequence was released. In addition, large numbers of multi-omics data have been generated on sesame, but a comprehensive database that integrates these resources for the community has been lacking until now. Here, we developed an interactive and comprehensive sesame multi-omics database, Sinbase 2.0, which provides information of the sesame updated genome containing 13 chromosomes, 3 genetic linkage maps, 5 intra- and 6 inter-species comparative genomics, 1 genomic variation analysis, 5 transcriptome data, 1 proteome, 31 functional markers, 175 putative functional genes, and 54 QTLs detected for important agronomic traits. Moreover, Sinbase 2.0 has been enriched with novel user-friendly computational tools. All datasets of Sinbase 2.0 can be downloaded online conveniently. Sinbase 2.0 will be updated regularly with new available sesame multi-omics data and can be accessed freely via Sinbase 2.—Sesame Muti-Omics Database. We expect that Sinbase 2.0, similarly to the previous version, will continue to make a major contribution to advance sesame research towards a better understanding of its biology and genetic improvement, as well as comparative genomics and evolutionary biology.

1. Introduction

Sesame (Sesamum indicum L., 2n = 26) is one of the oldest oil crops in the world and is widely grown in the tropical and subtropical areas of Asia, Africa and America [1,2]. Sesame becomes a prized cash crop for small-scale farmers in developing countries because of its low production costs and high sale price. In addition, sesame is a very resilient crop able to provide yields and generate incomes on marginal lands and extreme climatic conditions. Therefore, sesame production experiences a strong growth and has attracted various studies from the scientific community on the genetic basis of important agronomic traits, essential for the improvement of the productivity and yield [3].
Over the past decade, the development of high-throughput sequencing technologies has promoted the study of sesame biology, generating invaluable multi-omics data. Foremost, the completion of the genome sequencing project of the modern cultivar “Zhongzhi13” (version 1.0) has ushered sesame research into a new era [4]. As sesame is gradually graduating from an “orphan crop” to a “genomic resource-rich crop”, several multi-omics data have been generated recently [3]. Different genome sequences of sesame landraces and modern cultivars have been released, providing opportunity for comparative genomics and pan-genome analysis [5,6,7]. Recently, the transcriptome data of sesame plants following drought stress and recovering periods was generated [8]. Another important abiotic stress was investigated by Wang et al. (2016) who reported large gene expression data under waterlogging stress in two contrasting sesame genotypes [9].
Several public databases including sesame genomics database (Sinbase 1.0) [10], sesame functional genomics database (SesameFG) [11] and sesame microsatellite marker database (SisatBase) [12], were then built to share sesame open data resources. However, SesameFG focused on some phenotypic and genotypic information, with SisatBase containing the information of microsatellite markers in sesame genome. So a comprehensive database that integrates the available multi-omics information for the community has still been lacking. Recently, the reference genome sequence was upgraded to reach 13 chromosomes (version 2.0), 94.3% of the estimated genome size and 97.2% of the predicted gene models [13]. Therefore, a new integrative and analytic web platform based on the version 2.0 of the sesame genome is critically needed. Here, we developed an interactive and comprehensive sesame multi-omics database, Sinbase 2.0 (http://www.sesame-bioinfo.org/Sinbase2.0), which provides information of the sesame updated genome (version 2.0), genetic linkage maps, comparative genomics, genomic variation, transcriptomics, proteomics, functional markers, genes, and QTLs. Moreover, Sinbase2.0 supplies user-friendly visualized and searchable tools, which can help users to retrieve sesame multi-omics information easily.

2. Implementation

Sinbase 2.0 was implemented in a CentOS operation system with the Apache HTTP server and MySQL relational database management system. Web interfaces were developed by Perl, JavaScript and HTML programming languages. The graphical views of sesame chromosomes, interspecies comparative genomics, co-expressed genes in different transcriptomic experiments, and protein–protein interaction in proteomic experiment were drawn by Perl GD module from the Comprehensive Perl Archive Network (http://www.cpan.org/), Circos [14], WGCNA [15], and Cytoscape [16], respectively (Figure 1A). Basic datasets of sesame multi-omics were collected and analyzed by in-house Perl and Python scripts. These datasets were stored in the MySQL relational database, which can help users to retrieve useful information conveniently (Figure 1B).

3. Contents and Functions

In Sinbase 2.0, we integrated sesame updated genomics; genetics; comparative genomics; transcriptomics; proteomics; and functional markers, genes and QTLs using multiple-layer methods according to different multi-omics data formats.

3.1. Genomics

Concerning the genomic layer, we collected the sesame newly assembled genome version 2.0 and added newly annotated functionalities of putative protein-coding genes for genome-wide analysis [13]. We displayed genome components on sesame chromosomes including 27,148 protein-coding genes, 207,167 transposable elements and 1748 non-coding RNAs (Table 1). Each genome component was provided with detailed information containing basic description and sequence information (Figure 2A). All protein-coding genes were annotated by InterPro [17], Gene Ontology [18], KEGG [19], and MetaCyc [20] open-resource databases (Table 2). Moreover, for each protein-coding gene, the detailed information of transcriptomic and proteomic expression values, as well as homologous genes among different plant species based on comparative genomics analysis were supplied.

3.2. Genetics Linkage Analysis

For the genetic linkage maps, we supplied three types of molecular markers anchored on the sesame 13 chromosomes including 975 specific-locus amplified fragment sequencing (SLAF-seq), 347 simple sequence repeats (SSR) and 1522 bin markers. Each marker was given the genomic location information, two paired-ends primer sequences, product sizes, and upstream and downstream flanking genes.

3.3. Intra- and Inter-Species Comparative Genomics

Sinbase 2.0 also supplies intra-species and interspecies comparison for comparative genomic studies. Until now, there have been five released genomes of sesame including two landraces: Baizhima and Mishuozhima [6]; three modern cultivars: Zhongzhi13 [4], Yuzhi11 [7] and Swetha [5]. After curation of these genomes, we employed Burrows–Wheeler Aligner’s Smith–Waterman Alignment (BWA-SW) v0.7.15 to perform collinear analysis among the four genome-sequenced sesame varieties compared to the reference genome, which can help users to investigate the genomic variation within sesame varieties [21]. We used model plants and sesame close relative species to perform interspecies comparative genomics including Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), sorghum (Sorghum bicolor), tomato (Solanum lycopersicum), potato (Solanum tuberosum), and grape (Vitis vinifera). OrthoMCL v2.0.9 was employed to get orthologous gene groups among the seven plant species [22], and MCScanX was used to detect collinear genomic regions with collinear gene pairs between sesame and other plant species [23]. These analysis improved sesame gene functional annotations by mean of homologous relationships between the seven species.

3.4. Genomic Variation Analysis

Based on population genetic analysis of 705 sesame accessions, we obtained 12,833,863 single nucleotide polymorphism sites (SNPs) on the reference genome [6]. In Sinbase 2.0, SNPs can be searched by two ways including setting a range and a scale on sesame chromosomes. For each SNP, the location, reference and alternative sites, annotation, the gene to which it belongs to, and upstream/downstream gene list were supplied in detail.

3.5. Transcriptomics

For the transcriptomic module, we collected five transcriptome data released by our group including sesame waterlogging resistance [9], drought tolerance [8], three color seeds [4], determinate and indeterminate growth habit, and seed development transcriptomes. For each transcriptomic experiment, we supplied the phenotype information under different experimental conditions, expression analysis of sesame genes among different samples, as well as co-expression analysis of up- and down-regulated genes among different sesame samples employing WGCNA [15] (Figure 2B). Expression values of co-expressed genes among different samples and corresponding functions including COG [24], SWISS-PROT/TrEMBL [25], and nr annotations were further provided.

3.6. Proteomics

The proteomic module concerns proteomics data collected from sesame salt sensitive (SS) and tolerant (ST) cultivars during five time points (0, 2, 6, 12, 24 h) with three biological replicates based on iTRAQ analysis [26]. We supplied phenotype information of sesame SS and ST cultivars, expression analysis of proteins among different samples, and protein–protein interaction of up- and down-regulated proteins during the different time points (Figure 2C). Furthermore, users can retrieve the expression values of each sesame protein in different samples and protein–protein interactive networks of up- and down-regulated genes.

3.7. Functional Markers, Genes and QTLs

At last, 31 functional markers, 175 functional genes and 54 functional QTLs in the sesame genome, which control different agronomic traits including coloration, disease resistance, growth cycle, morphological characteristics, yield component, oil content and quality traits, and abiotic stress resistance, were gathered from the literature and mapped onto the sesame updated genome. We added the genomic location for each functional marker, two paired-ends primer sequences, the corresponding agronomic traits, as well as the upstream and downstream genes. We also joined hyperlink to the details of every functional gene. Next, we provided the location of functional QTLs and the gene list contained within each QTLs.

3.8. Useful Tools

To extract useful information and perform customized analyses on sesame multi-omics data, we developed and embedded several novel user-friendly tools in Sinbase 2.0 as compared with the previous version. General search was developed to get basic information of sesame genome components, gene functional annotation, homologous genes in relative species, and gene expression values in different experiments (Figure 2D). Customized BLAST was also embedded for users to get homologous genes or regions in the sesame reference and non-reference genomes by supplying protein or DNA sequence [27]. Localized MISAweb was included to identify SSR in the sesame reference and non-reference genomes [28]. Customized GBrowse [29] and GBrowse_syn [30] were embedded to display genomic components in sesame genome and collinear genomic regions between sesame and its close relatives, as well as model plants.

4. Utility

Browse

Sinbase 2.0 supplied a multi-layer browsing function for users to extract useful information of sesame multi-omics. Users can browse sesame multi-omics information through “Resource” function of Sinbase 2.0, which includes genomics, genetics linkage analysis, intra- and inter-species comparative genomics, genomic variation analysis, transcriptomics analysis, proteomics analysis, markers, genes, and QTLs. Sesame different multi-omics information were collected in this section, which supplied a comprehensive annotation platform for functional exploration of sesame multi-omics data. For sesame genomics, users can get the latest sesame assembled genome sequences and function annotation with the latest versions of public open-resource functional databases, three types of genetic linkage groups based on sesame newly assembled genome sequences, intra- and inter-species comparative genomics, as well as genomic variation analysis among 705 sesame varieties from all over the world. For sesame transcriptomics, users can browse transcriptomic expression information of sesame protein-coding genes under different stress conditions, phenotypes, and growth and development stages. Moreover, users can get the co-expressed gene clusters and identical or interactive genes of interested sesame protein-coding genes focusing on same biological functions, key phenotypes or traits. For sesame proteomics, users can browse expression quantification of sesame proteins among sesame SS and ST samples. Moreover, users can get protein interactive information of interested sesame proteins under different treatments or different time periods.

5. Discussion

Sinbase 1.0 was the first practical and integrated database that focuses on the S. indicum genomics, genetics, and comparative genomics with its relatives and other important species in the plant community. It quickly became a reference genomic database for the scientific community working on sesame and related species. By providing the upgraded genomic sequences and more accurate genomic component annotation in the version 2.0 of Sinbase, we expect that users can perform functional genomics and comparative genomics studies in sesame with high precision and obtain more meaningful results. Compared with the Sinbase 1.0, Sinbase 2.0 supplies more comprehensive multi-omics datasets to allow users to study sesame thoroughly at the genetic and genomic levels. The genetic maps and molecular markers will be valuable resources for gene cloning and quantitative trait locus (QTL) detection. Transcriptome data and gene co-expression results will assist in genome-wide association studies and candidate gene mining. Since sesame entered the molecular breeding era, we believe that the functional genes, molecular markers and QTLs supplied in Sinbase 2.0 will assist breeders in the development of improved cultivars. Additionally, the user-friendly interface will allow users to access Sinbase 2.0 more conveniently. This database will be improved and updated continuously with new features, improvements to genome annotation and genomic sequences, and availability of more omics data.

6. Conclusions and Perspective

In summary, Sinbase 2.0 comprehensively integrates sesame multi-omics data including genomics, transcriptomics and proteomics; improves gene functional annotation; and provides a user-friendly functional analysis platform, which will bring more convenience for the scientific community. Functional modules of Sinbase 2.0 are useful for studying important issues of molecular biology, comparative and functional genomics, and molecular breeding in sesame. Sinbase 2.0 will be regularly updated with newly released sesame multi-omics data, and its gene functions will be refined with new versions of the sesame reference genome. As such, we hope that Sinbase 2.0, similarly to the previous version, will continue to make a major contribution to advance sesame research towards a better understanding of its biology and genetic improvement, as well as comparative genomics and evolutionary biology.

Author Contributions

L.W. (Liwen Wang) and J.Y. (Jingyin Yu) contributed to the design of the research. Y.Z., J.Y. (Jun You) and X.Z. participated in the study design. L.W. (Liwen Wang), J.Y. (Jingyin Yu) and L.W. (Linhai Wang) participated in finalization of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Natural Science Foundation of Hubei Province (2019CFB574), the Agricultural Science and Technology Innovation Project of the Chinese Academy of Agricultural Sciences (CAAS-ASTIP-2016-OCRI), the Science and Technology Innovation Project of Hubei province (201620000001048), and the Fundamental Research Funds for Central Non-profit Scientific Institution (1610172019004, Y2019XK15-02).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bedigian, D. Evolution of sesame revisited: Domestication, diversity and prospects. Genet. Resour. Crop. Evol. 2003, 50, 779–787. [Google Scholar]
  2. Bedigian, D. History and lore of sesame in Southwest Asia. Econ. Bot. 2004, 58, 330–353. [Google Scholar] [CrossRef]
  3. Dossa, K.; Diouf, D.; Wang, L.; Wei, X.; Zhang, Y.; Niang, M.; Fonceka, D.; Yu, J.; Mmadi, M.A.; Yehouessi, L.W.; et al. The Emerging Oilseed Crop Sesamum indicum Enters the “Omics” Era. Front. Plant Sci. 2017, 8, 1154. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Wang, L.; Yu, S.; Tong, C.; Zhao, Y.; Liu, Y.; Song, C.; Zhang, Y.; Zhang, X.; Wang, Y.; Wei, H.; et al. Genome sequencing of the high oil crop sesame provides insight into oil biosynthesis. Genome Biol. 2014, 15, R39. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Kitts, P.A.; Church, D.M.; Thibaud-Nissen, F.; Choi, J.; Hem, V.; Sapojnikov, V.; Smith, R.G.; Tatusova, T.A.; Xiang, C.; Zherikov, A.; et al. Assembly: A resource for assembled genomes at NCBI. Nucleic Acids Res. 2016, 44, D73–D80. [Google Scholar] [CrossRef]
  6. Wei, X.; Liu, K.; Zhang, Y.; Feng, Q.; Wang, L.; Zhao, Y.; Li, D.; Zhao, Q.; Zhu, X.; Zhu, X.; et al. Genetic discovery for oil production and quality in sesame. Nat. Commun. 2015, 6, 8609. [Google Scholar] [CrossRef] [Green Version]
  7. Zhang, H.; Miao, H.; Wang, L.; Qu, L.-B.; Liu, H.; Wang, Q.; Yue, M. Genome sequencing of the important oilseed crop Sesamum indicum L. Genome Biol. 2013, 14, 401. [Google Scholar] [CrossRef] [Green Version]
  8. Dossa, K.; Li, D.; Wang, L.; Zheng, X.; Liu, A.; Yu, J.; Wei, X.; Zhou, R.; Foncéka, D.; Diouf, D.; et al. Transcriptomic, biochemical and physio-anatomical investigations shed more light on responses to drought stress in two contrasting sesame genotypes. Sci. Rep. 2017, 7, 8755. [Google Scholar] [CrossRef] [Green Version]
  9. Wang, L.; Li, N.; Zhang, Y.; Gao, Y.; Yu, J.; Wei, X.; Zhang, X. Tolerant and Susceptible Sesame Genotypes Reveal Waterlogging Stress Response Patterns. PLoS ONE 2016, 11, e0149912. [Google Scholar] [CrossRef] [Green Version]
  10. Wang, L.; Yu, J.; Li, D.; Zhang, X. Sinbase: An integrated database to study genomics, genetics and comparative genomics in Sesamum indicum. Plant Cell Physiol. 2015, 56, e2. [Google Scholar] [CrossRef]
  11. Wei, X.; Gong, H.; Yu, J.; Liu, P.; Wang, L.; Zhang, Y.; Zhang, X. SesameFG: An integrated database for the functional genomics of sesame. Sci. Rep. 2017, 7, 2342. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Dossa, K.; Yu, J.; Liao, B.; Cisse, N.; Zhang, X. Development of Highly Informative Genome-Wide Single Sequence Repeat Markers for Breeding Applications in Sesame and Construction of a Web Resource: SisatBase. Front. Plant Sci. 2017, 8, 1470. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Wang, L.; Xia, Q.; Zhang, Y.; Zhu, X.; Zhu, X.; Li, D.; Ni, X.; Gao, Y.; Xiang, H.; Wei, X.; et al. Updated sesame genome assembly and fine mapping of plant height and seed coat color QTLs using a new high-density genetic map. BMC Genom. 2016, 17, 31. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Krzywinski, M.; Schein, J.; Birol, I.; Connors, J.; Gascoyne, R.; Horsman, D.; Jones, S.J.; Marra, M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009, 19, 1639–1645. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Langfelder, P.; Horvath, S. WGCNA: An R package for weighted correlation network analysis. BMC Bioinform. 2008, 9, 559. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef] [PubMed]
  17. Quevillon, E.; Silventoinen, V.; Pillai, S.; Harte, N.; Mulder, N.J.; Apweiler, R.; Lopez, R. InterProScan: Protein domains identifier. Nucleic Acids Res. 2005, 33, W116–W120. [Google Scholar] [CrossRef] [Green Version]
  18. Ashburner, M.; Ball, C.A.; Blake, J.A.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T.; et al. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000, 25, 25–29. [Google Scholar] [CrossRef] [Green Version]
  19. Kanehisa, M.; Araki, M.; Goto, S.; Hattori, M.; Hirakawa, M.; Itoh, M.; Katayama, T.; Kawashima, S.; Okuda, S.; Tokimatsu, T.; et al. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2007, 36, D480–D484. [Google Scholar] [CrossRef]
  20. Caspi, R.; Billington, R.; Fulcher, C.A.; Keseler, I.M.; Kothari, A.; Krummenacker, M.; Latendresse, M.; Midford, P.E.; Ong, Q.; Ong, W.K.; et al. The MetaCyc database of metabolic pathways and enzymes. Nucleic Acids Res. 2018, 46, D633–D639. [Google Scholar] [CrossRef] [Green Version]
  21. Li, H.; Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 2010, 26, 589–595. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Li, L.; Stoeckert, C.J., Jr.; Roos, D.S. OrthoMCL: Identification of ortholog groups for eukaryotic genomes. Genome Res. 2003, 13, 2178–2189. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Wang, Y.; Tang, H.; DeBarry, J.D.; Tan, X.; Li, J.; Wang, X.; Lee, T.-H.; Jin, H.; Marler, B.; Guo, H.; et al. MCScanX: A toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012, 40, e49. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Makarova, K.S.; Wolf, Y.I.; Koonin, E.V. Archaeal Clusters of Orthologous Genes (arCOGs): An Update and Application for Analysis of Shared Features between Thermococcales, Methanococcales, and Methanobacteriales. Life 2015, 5, 818–840. [Google Scholar] [CrossRef] [Green Version]
  25. O’Donovan, C.; Martin, M.J.; Gattiker, A.; Gasteiger, E.; Bairoch, A.; Apweiler, R. High-quality protein knowledge resource: SWISS-PROT and TrEMBL. Briefings Bioinform. 2002, 3, 275–284. [Google Scholar] [CrossRef]
  26. Ross, P.L.; Huang, Y.N.; Marchese, J.N.; Williamson, B.; Parker, K.; Hattan, S.; Khainovski, N.; Pillai, S.; Dey, S.; Daniels, S.; et al. Multiplexed protein quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents. Mol. Cell. Proteom. 2004, 3, 1154–1169. [Google Scholar] [CrossRef] [Green Version]
  27. Altschul, S.F.; Madden, T.L.; Schäffer, A.A.; Zhang, J.; Zhang, Z.; Miller, W.; Lipman, D.J. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997, 25, 3389–3402. [Google Scholar] [CrossRef] [Green Version]
  28. Yu, J.; Dossa, K.; Wang, L.; Zhang, Y.; Wei, X.; Liao, B.; Zhang, X. PMDBase: A database for studying microsatellite DNA and marker development in plants. Nucleic Acids Res. 2017, 45, D1046–D1053. [Google Scholar] [CrossRef] [Green Version]
  29. Donlin, M.J. Using the Generic Genome Browser (GBrowse); John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2019; p. Unit 9.9. [Google Scholar]
  30. McKay, S.J.; Vergara, I.A.; Stajich, J.E. Using the Generic Synteny Browser. Curr. Protoc. Bioinform. 2010, 31, 9.12.1–9.12.25. [Google Scholar] [CrossRef]
Figure 1. Framework for data integration and database implementation. (A) Data integration for sesame multi-omics data including the data of genomics; transcriptomics; proteomics; and functional markers, genes and QTLs. (B) Implementation of Sinbase 2.0 including data integration, data storage and visualization.
Figure 1. Framework for data integration and database implementation. (A) Data integration for sesame multi-omics data including the data of genomics; transcriptomics; proteomics; and functional markers, genes and QTLs. (B) Implementation of Sinbase 2.0 including data integration, data storage and visualization.
Plants 10 00272 g001
Figure 2. Screenshots of main functional pages in Sinbase 2.0. (A) Sesame genome components including protein-coding genes, repeat elements and non-coding RNAs. (B) Sesame transcriptomics data of waterlogging resistance including phenotype for waterlogging resistance, expression analysis among different samples in sesame and co-expression analysis by WGCNA. (C) Sesame proteomics data of salty tolerance including phenotype for sesame samples under salt stress, expression analysis among different samples in sesame by iTRAQ, and protein–protein interaction analysis by STRING database. (D) Keyword searching function including genome components and functional identifiers.
Figure 2. Screenshots of main functional pages in Sinbase 2.0. (A) Sesame genome components including protein-coding genes, repeat elements and non-coding RNAs. (B) Sesame transcriptomics data of waterlogging resistance including phenotype for waterlogging resistance, expression analysis among different samples in sesame and co-expression analysis by WGCNA. (C) Sesame proteomics data of salty tolerance including phenotype for sesame samples under salt stress, expression analysis among different samples in sesame by iTRAQ, and protein–protein interaction analysis by STRING database. (D) Keyword searching function including genome components and functional identifiers.
Plants 10 00272 g002
Table 1. Statistics of genome components of Sesamum indicum genome v2.0.
Table 1. Statistics of genome components of Sesamum indicum genome v2.0.
CategoriesProtein-Coding GenesRepeat ElementsNon-Coding RNAsTotal
Chromosome 1216214,1429216,396
Chromosome 2192211,9227613,920
Chromosome 3267816,62712119,426
Chromosome 4199815,7318517,814
Chromosome 5150913,8858915,483
Chromosome 6269617,93212420,752
Chromosome 7162513,02022214,867
Chromosome 8269217,98414920,825
Chromosome 9250213,82618116,509
Chromosome 10195814,46812916,555
Chromosome 11140310,9196312,385
Chromosome 12180710,94711212,866
Chromosome 13142612,58913014,145
Chromosome Un77023,17517524,120
Total27,148207,1671748236,063
Table 2. Statistics of functional annotation of protein-coding genes in Sesamum indicum genome v2.0.
Table 2. Statistics of functional annotation of protein-coding genes in Sesamum indicum genome v2.0.
CategoriesTotal Sesame GenesAnnotated Sesame GenesPercentage (%)Involved Terms or Identifiers
InterPro27,14820,00973.705865
Gene Ontology14,47753.331780
KEGG19857.31122
MetaCyc13514.98577
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wang, L.; Yu, J.; Zhang, Y.; You, J.; Zhang, X.; Wang, L. Sinbase 2.0: An Updated Database to Study Multi-Omics in Sesamum indicum. Plants 2021, 10, 272. https://doi.org/10.3390/plants10020272

AMA Style

Wang L, Yu J, Zhang Y, You J, Zhang X, Wang L. Sinbase 2.0: An Updated Database to Study Multi-Omics in Sesamum indicum. Plants. 2021; 10(2):272. https://doi.org/10.3390/plants10020272

Chicago/Turabian Style

Wang, Liwen, Jingyin Yu, Yanxin Zhang, Jun You, Xiurong Zhang, and Linhai Wang. 2021. "Sinbase 2.0: An Updated Database to Study Multi-Omics in Sesamum indicum" Plants 10, no. 2: 272. https://doi.org/10.3390/plants10020272

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop