ijms-logo

Journal Browser

Journal Browser

Data Analysis and Integration in Cancer Research 2.0

A special issue of International Journal of Molecular Sciences (ISSN 1422-0067). This special issue belongs to the section "Molecular Informatics".

Deadline for manuscript submissions: closed (31 December 2020) | Viewed by 17155

Special Issue Editors


E-Mail Website
Guest Editor
Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE)–Faculty of Medicine and Health Technology, Tampere University, 33100 Tampere, Finland
Interests: network medicine; predictive pharmacology; human diseases; drug repositioning; systems biology; multi-omics data analysis; biomarker discovery; disease subtype discovery
Special Issues, Collections and Topics in MDPI journals

E-Mail
Guest Editor
Faculty of Medicine and Life Sciences, University of Tampere, Tampere, Finland
Interests: omics data analysis; integration and modelling; multi-omics approaches in the study of adverse outcome pathways; epigenomic therapeutic targets of human cancers; epidemiological epigenomics
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear colleagues,

Recent technological advancements in genomics have paved the way to a deeper understanding of the mechanisms driving the onset and progression of cancer, providing an unprecedented source of information for the investigation of the features determining treatment outcomes.

Recent studies carried out by big consortia, such as The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC), have generated comprehensive catalogues of molecular aberrations linked to tumorigenesis for large cohorts of cancer patients and across a wide range of cancer types.

The amount and diversity of currently available data poses an important challenge: the ability to meaningfully analyze and integrate information from different biological sources, in order to obtain a more detailed and complete picture of the complex dynamics behind biological events related to cancer.

In this regard, advances in bioinformatics and computational biology techniques, along with dedicated tools, are essential for the integrative analysis of high-dimensional data.

This Special Issue, “Data Analysis and Integration in Cancer Research”, will cover a selection of research articles, review articles, and commentaries reporting recent methodological advancements on the analysis and integration of high-dimensional data, with a particular focus on (multi-)omics technologies in the study of molecular dynamics in cancer.

Dr. Antonio Federico
Dr. Giovanni Scala
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. International Journal of Molecular Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. There is an Article Processing Charge (APC) for publication in this open access journal. For details about the APC please see here. Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • Biological big data
  • Precision oncology
  • Cancer therapy
  • Biological data integration
  • Cancer research
  • Multi-omics approaches
  • Next-generation sequencing
  • High-throughput technologies
  • Systems biology
  • Cancer genomics
  • Network analysis
  • Transcriptome analysis
  • Epigenomics analysis
  • Non-coding RNAs

Related Special Issue

Published Papers (17 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

Jump to: Review

13 pages, 2367 KiB  
Article
GEM-Based Metabolic Profiling for Human Bone Osteosarcoma under Different Glucose and Glutamine Availability
by Ewelina Weglarz-Tomczak, Demi J. Rijlaarsdam, Jakub M. Tomczak and Stanley Brul
Int. J. Mol. Sci. 2021, 22(3), 1470; https://doi.org/10.3390/ijms22031470 - 2 Feb 2021
Cited by 6 | Viewed by 3443
Abstract
Cancer cell metabolism is dependent on cell-intrinsic factors, such as genetics, and cell-extrinsic factors, such nutrient availability. In this context, understanding how these two aspects interact and how diet influences cellular metabolism is important for developing personalized treatment. In order to achieve this [...] Read more.
Cancer cell metabolism is dependent on cell-intrinsic factors, such as genetics, and cell-extrinsic factors, such nutrient availability. In this context, understanding how these two aspects interact and how diet influences cellular metabolism is important for developing personalized treatment. In order to achieve this goal, genome-scale metabolic models (GEMs) are used; however, genetics and nutrient availability are rarely considered together. Here, we propose integrated metabolic profiling, a framework that allows enriching GEMs with metabolic gene expression data and information about nutrients. First, the RNA-seq is converted into Reaction Activity Score (RAS) to further scale reaction bounds. Second, nutrient availability is converted to Maximal Uptake Rate (MUR) to modify exchange reactions in a GEM. We applied our framework to the human osteosarcoma cell line (U2OS). Osteosarcoma is a common and primary malignant form of bone cancer with poor prognosis, and, as indicated in our study, a glutamine-dependent type of cancer. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research 2.0)
Show Figures

Figure 1

21 pages, 2335 KiB  
Article
Multi-Omics Characterization of Inflammatory Bowel Disease-Induced Hyperplasia/Dysplasia in the Rag2−/−/Il10−/− Mouse Model
by Qiyuan Han, Thomas J. Y. Kono, Charles G. Knutson, Nicola M. Parry, Christopher L. Seiler, James G. Fox, Steven R. Tannenbaum and Natalia Y. Tretyakova
Int. J. Mol. Sci. 2021, 22(1), 364; https://doi.org/10.3390/ijms22010364 - 31 Dec 2020
Cited by 9 | Viewed by 3151
Abstract
Epigenetic dysregulation is hypothesized to play a role in the observed association between inflammatory bowel disease (IBD) and colon tumor development. In the present work, DNA methylome, hydroxymethylome, and transcriptome analyses were conducted in proximal colon tissues harvested from the Helicobacter hepaticus ( [...] Read more.
Epigenetic dysregulation is hypothesized to play a role in the observed association between inflammatory bowel disease (IBD) and colon tumor development. In the present work, DNA methylome, hydroxymethylome, and transcriptome analyses were conducted in proximal colon tissues harvested from the Helicobacter hepaticus (H. hepaticus)-infected murine model of IBD. Reduced representation bisulfite sequencing (RRBS) and oxidative RRBS (oxRRBS) analyses identified 1606 differentially methylated regions (DMR) and 3011 differentially hydroxymethylated regions (DhMR). These DMR/DhMR overlapped with genes that are associated with gastrointestinal disease, inflammatory disease, and cancer. RNA-seq revealed pronounced expression changes of a number of genes associated with inflammation and cancer. Several genes including Duox2, Tgm2, Cdhr5, and Hk2 exhibited changes in both DNA methylation/hydroxymethylation and gene expression levels. Overall, our results suggest that chronic inflammation triggers changes in methylation and hydroxymethylation patterns in the genome, altering the expression of key tumorigenesis genes and potentially contributing to the initiation of colorectal cancer. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research 2.0)
Show Figures

Figure 1

14 pages, 933 KiB  
Article
Identification of Novel Potential Genes Involved in Cancer by Integrated Comparative Analyses
by Francesco Monticolo, Emanuela Palomba and Maria Luisa Chiusano
Int. J. Mol. Sci. 2020, 21(24), 9560; https://doi.org/10.3390/ijms21249560 - 15 Dec 2020
Cited by 2 | Viewed by 1904
Abstract
The main hallmarks of cancer diseases are the evasion of programmed cell death, uncontrolled cell division, and the ability to invade adjacent tissues. The explosion of omics technologies offers challenging opportunities to identify molecular agents and processes that may play relevant roles in [...] Read more.
The main hallmarks of cancer diseases are the evasion of programmed cell death, uncontrolled cell division, and the ability to invade adjacent tissues. The explosion of omics technologies offers challenging opportunities to identify molecular agents and processes that may play relevant roles in cancer. They can support comparative investigations, in one or multiple experiments, exploiting evidence from one or multiple species. Here, we analyzed gene expression data from induction of programmed cell death and stress response in Homo sapiens and compared the results with Saccharomyces cerevisiae gene expression during the response to cell death. The aim was to identify conserved candidate genes associated with Homo sapiens cell death, favored by crosslinks based on orthology relationships between the two species. We identified differentially-expressed genes, pathways that are significantly dysregulated across treatments, and characterized genes among those involved in induced cell death. We investigated on co-expression patterns and identified novel genes that were not expected to be associated with death pathways, that have a conserved pattern of expression between the two species. Finally, we analyzed the resulting list by HumanNet and identified new genes predicted to be involved in cancer. The data integration and the comparative approach between distantly-related reference species that were here exploited pave the way to novel discoveries in cancer therapy and also contribute to detect conserved genes potentially involved in programmed cell death. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research 2.0)
Show Figures

Figure 1

17 pages, 5635 KiB  
Article
CSI NGS Portal: An Online Platform for Automated NGS Data Analysis and Sharing
by Omer An, Kar-Tong Tan, Ying Li, Jia Li, Chan-Shuo Wu, Bin Zhang, Leilei Chen and Henry Yang
Int. J. Mol. Sci. 2020, 21(11), 3828; https://doi.org/10.3390/ijms21113828 - 28 May 2020
Cited by 18 | Viewed by 7941
Abstract
Next-generation sequencing (NGS) has been a widely-used technology in biomedical research for understanding the role of molecular genetics of cells in health and disease. A variety of computational tools have been developed to analyse the vastly growing NGS data, which often require bioinformatics [...] Read more.
Next-generation sequencing (NGS) has been a widely-used technology in biomedical research for understanding the role of molecular genetics of cells in health and disease. A variety of computational tools have been developed to analyse the vastly growing NGS data, which often require bioinformatics skills, tedious work and a significant amount of time. To facilitate data processing steps minding the gap between biologists and bioinformaticians, we developed CSI NGS Portal, an online platform which gathers established bioinformatics pipelines to provide fully automated NGS data analysis and sharing in a user-friendly website. The portal currently provides 16 standard pipelines for analysing data from DNA, RNA, smallRNA, ChIP, RIP, 4C, SHAPE, circRNA, eCLIP, Bisulfite and scRNA sequencing, and is flexible to expand with new pipelines. The users can upload raw data in FASTQ format and submit jobs in a few clicks, and the results will be self-accessible via the portal to view/download/share in real-time. The output can be readily used as the final report or as input for other tools depending on the pipeline. Overall, CSI NGS Portal helps researchers rapidly analyse their NGS data and share results with colleagues without the aid of a bioinformatician. The portal is freely available at: https://csibioinfo.nus.edu.sg/csingsportal. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research 2.0)
Show Figures

Graphical abstract

17 pages, 1874 KiB  
Article
Open Data for Differential Network Analysis in Glioma
by Claire Jean-Quartier, Fleur Jeanquartier and Andreas Holzinger
Int. J. Mol. Sci. 2020, 21(2), 547; https://doi.org/10.3390/ijms21020547 - 15 Jan 2020
Cited by 8 | Viewed by 3604
Abstract
The complexity of cancer diseases demands bioinformatic techniques and translational research based on big data and personalized medicine. Open data enables researchers to accelerate cancer studies, save resources and foster collaboration. Several tools and programming approaches are available for analyzing data, including annotation, [...] Read more.
The complexity of cancer diseases demands bioinformatic techniques and translational research based on big data and personalized medicine. Open data enables researchers to accelerate cancer studies, save resources and foster collaboration. Several tools and programming approaches are available for analyzing data, including annotation, clustering, comparison and extrapolation, merging, enrichment, functional association and statistics. We exploit openly available data via cancer gene expression analysis, we apply refinement as well as enrichment analysis via gene ontology and conclude with graph-based visualization of involved protein interaction networks as a basis for signaling. The different databases allowed for the construction of huge networks or specified ones consisting of high-confidence interactions only. Several genes associated to glioma were isolated via a network analysis from top hub nodes as well as from an outlier analysis. The latter approach highlights a mitogen-activated protein kinase next to a member of histondeacetylases and a protein phosphatase as genes uncommonly associated with glioma. Cluster analysis from top hub nodes lists several identified glioma-associated gene products to function within protein complexes, including epidermal growth factors as well as cell cycle proteins or RAS proto-oncogenes. By using selected exemplary tools and open-access resources for cancer research and differential network analysis, we highlight disturbed signaling components in brain cancer subtypes of glioma. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Graphical abstract

23 pages, 11422 KiB  
Article
An Integrated Pan-Cancer Analysis and Structure-Based Virtual Screening of GPR15
by Yanjing Wang, Xiangeng Wang, Yi Xiong, Cheng-Dong Li, Qin Xu, Lu Shen, Aman Chandra Kaushik and Dong-Qing Wei
Int. J. Mol. Sci. 2019, 20(24), 6226; https://doi.org/10.3390/ijms20246226 - 10 Dec 2019
Cited by 16 | Viewed by 5705
Abstract
G protein-coupled receptor 15 (GPR15, also known as BOB) is an extensively studied orphan G protein-coupled receptors (GPCRs) involving human immunodeficiency virus (HIV) infection, colonic inflammation, and smoking-related diseases. Recently, GPR15 was deorphanized and its corresponding natural ligand demonstrated an ability to inhibit [...] Read more.
G protein-coupled receptor 15 (GPR15, also known as BOB) is an extensively studied orphan G protein-coupled receptors (GPCRs) involving human immunodeficiency virus (HIV) infection, colonic inflammation, and smoking-related diseases. Recently, GPR15 was deorphanized and its corresponding natural ligand demonstrated an ability to inhibit cancer cell growth. However, no study reported the potential role of GPR15 in a pan-cancer manner. Using large-scale publicly available data from the Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression (GTEx) databases, we found that GPR15 expression is significantly lower in colon adenocarcinoma (COAD) and rectal adenocarcinoma (READ) than in normal tissues. Among 33 cancer types, GPR15 expression was significantly positively correlated with the prognoses of COAD, neck squamous carcinoma (HNSC), and lung adenocarcinoma (LUAD) and significantly negatively correlated with stomach adenocarcinoma (STAD). This study also revealed that commonly upregulated gene sets in the high GPR15 expression group (stratified via median) of COAD, HNSC, LUAD, and STAD are enriched in immune systems, indicating that GPR15 might be considered as a potential target for cancer immunotherapy. Furthermore, we modelled the 3D structure of GPR15 and conducted structure-based virtual screening. The top eight hit compounds were screened and then subjected to molecular dynamics (MD) simulation for stability analysis. Our study provides novel insights into the role of GPR15 in a pan-cancer manner and discovered a potential hit compound for GPR15 antagonists. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

14 pages, 2090 KiB  
Article
RankerGUI: A Computational Framework to Compare Differential Gene Expression Profiles Using Rank Based Statistics
by Amarinder Singh Thind, Kumar Parijat Tripathi and Mario Rosario Guarracino
Int. J. Mol. Sci. 2019, 20(23), 6098; https://doi.org/10.3390/ijms20236098 - 3 Dec 2019
Cited by 6 | Viewed by 5431
Abstract
The comparison of high throughput gene expression datasets obtained from different experimental conditions is a challenging task. It provides an opportunity to explore the cellular response to various biological events such as disease, environmental conditions, and drugs. There is a need for tools [...] Read more.
The comparison of high throughput gene expression datasets obtained from different experimental conditions is a challenging task. It provides an opportunity to explore the cellular response to various biological events such as disease, environmental conditions, and drugs. There is a need for tools that allow the integration and analysis of such data. We developed the “RankerGUI pipeline”, a user-friendly web application for the biological community. It allows users to use various rank based statistical approaches for the comparison of full differential gene expression profiles between the same or different biological states obtained from different sources. The pipeline modules are an integration of various open-source packages, a few of which are modified for extended functionality. The main modules include rank rank hypergeometric overlap, enriched rank rank hypergeometric overlap and distance calculations. Additionally, preprocessing steps such as merging differential expression profiles of multiple independent studies can be added before running the main modules. Output plots show the strength, pattern, and trends among complete differential expression profiles. In this paper, we describe the various modules and functionalities of the developed pipeline. We also present a case study that demonstrates how the pipeline can be used for the comparison of differential expression profiles obtained from multiple platforms’ data of the Gene Expression Omnibus. Using these comparisons, we investigate gene expression patterns in kidney and lung cancers. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

20 pages, 4000 KiB  
Article
A Gene Signature of Survival Prediction for Kidney Renal Cell Carcinoma by Multi-Omic Data Analysis
by Fuyan Hu, Wenying Zeng and Xiaoping Liu
Int. J. Mol. Sci. 2019, 20(22), 5720; https://doi.org/10.3390/ijms20225720 - 14 Nov 2019
Cited by 36 | Viewed by 3798
Abstract
Kidney renal cell carcinoma (KIRC), which is the most common subtype of kidney cancer, has a poor prognosis and a high mortality rate. In this study, a multi-omics analysis is performed to build a multi-gene prognosis signature for KIRC. A combination of a [...] Read more.
Kidney renal cell carcinoma (KIRC), which is the most common subtype of kidney cancer, has a poor prognosis and a high mortality rate. In this study, a multi-omics analysis is performed to build a multi-gene prognosis signature for KIRC. A combination of a DNA methylation analysis and a gene expression data analysis revealed 863 methylated differentially expressed genes (MDEGs). Seven MDEGs (BID, CCNF, DLX4, FAM72D, PYCR1, RUNX1, and TRIP13) were further screened using LASSO Cox regression and integrated into a prognostic risk score model. Then, KIRC patients were divided into high- and low-risk groups. A univariate cox regression analysis revealed a significant association between the high-risk group and a poor prognosis. The time-dependent receiver operating characteristic (ROC) curve shows that the risk group performs well in predicting overall survival. Furthermore, the risk group is contained in the best multivariate model that was obtained by a multivariate stepwise analysis, which further confirms that the risk group can be used as a potential prognostic biomarker. In addition, a nomogram was established for the best multivariate model and shown to perform well in predicting the survival of KIRC patients. In summary, a seven-MDEG signature is a powerful prognosis factor for KIRC patients and may provide useful suggestions for their personalized therapy. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

15 pages, 4788 KiB  
Article
Molecular Inverse Comorbidity between Alzheimer’s Disease and Lung Cancer: New Insights from Matrix Factorization
by Alessandro Greco, Jon Sanchez Valle, Vera Pancaldi, Anaïs Baudot, Emmanuel Barillot, Michele Caselle, Alfonso Valencia, Andrei Zinovyev and Laura Cantini
Int. J. Mol. Sci. 2019, 20(13), 3114; https://doi.org/10.3390/ijms20133114 - 26 Jun 2019
Cited by 10 | Viewed by 3999
Abstract
Matrix factorization (MF) is an established paradigm for large-scale biological data analysis with tremendous potential in computational biology. Here, we challenge MF in depicting the molecular bases of epidemiologically described disease–disease (DD) relationships. As a use case, we focus on the inverse comorbidity [...] Read more.
Matrix factorization (MF) is an established paradigm for large-scale biological data analysis with tremendous potential in computational biology. Here, we challenge MF in depicting the molecular bases of epidemiologically described disease–disease (DD) relationships. As a use case, we focus on the inverse comorbidity association between Alzheimer’s disease (AD) and lung cancer (LC), described as a lower than expected probability of developing LC in AD patients. To this day, the molecular mechanisms underlying DD relationships remain poorly explained and their better characterization might offer unprecedented clinical opportunities. To this goal, we extend our previously designed MF-based framework for the molecular characterization of DD relationships. Considering AD–LC inverse comorbidity as a case study, we highlight multiple molecular mechanisms, among which we confirm the involvement of processes related to the immune system and mitochondrial metabolism. We then distinguish mechanisms specific to LC from those shared with other cancers through a pan-cancer analysis. Additionally, new candidate molecular players, such as estrogen receptor (ER), cadherin 1 (CDH1) and histone deacetylase (HDAC), are pinpointed as factors that might underlie the inverse relationship, opening the way to new investigations. Finally, some lung cancer subtype-specific factors are also detected, also suggesting the existence of heterogeneity across patients in the context of inverse comorbidity. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

13 pages, 2464 KiB  
Article
Chromogranin-A Expression as a Novel Biomarker for Early Diagnosis of Colon Cancer Patients
by Xueli Zhang, Hong Zhang, Bairong Shen and Xiao-Feng Sun
Int. J. Mol. Sci. 2019, 20(12), 2919; https://doi.org/10.3390/ijms20122919 - 14 Jun 2019
Cited by 35 | Viewed by 4633
Abstract
Colon cancer is one of the major causes of cancer death worldwide. The five-year survival rate for the early-stage patients is more than 90%, and only around 10% for the later stages. Moreover, half of the colon cancer patients have been clinically diagnosed [...] Read more.
Colon cancer is one of the major causes of cancer death worldwide. The five-year survival rate for the early-stage patients is more than 90%, and only around 10% for the later stages. Moreover, half of the colon cancer patients have been clinically diagnosed at the later stages. It is; therefore, of importance to enhance the ability for the early diagnosis of colon cancer. Taking advantages from our previous studies, there are several potential biomarkers which have been associated with the early diagnosis of the colon cancer. In order to investigate these early diagnostic biomarkers for colon cancer, human chromogranin-A (CHGA) was further analyzed among the most powerful diagnostic biomarkers. In this study, we used a logistic regression-based meta-analysis to clarify associations of CHGA expression with colon cancer diagnosis. Both healthy populations and the normal mucosa from the colon cancer patients were selected as the double normal controls. The results showed decreased expression of CHGA in the early stages of colon cancer as compared to the normal controls. The decline of CHGA expression in the early stages of colon cancer is probably a new diagnostic biomarker for colon cancer diagnosis with high predicting possibility and verification performance. We have also compared the diagnostic powers of CHGA expression with the typical oncogene KRAS, classic tumor suppressor TP53, and well-known cellular proliferation index MKI67, and the CHGA showed stronger ability to predict early diagnosis for colon cancer than these other cancer biomarkers. In the protein–protein interaction (PPI) network, CHGA was revealed to share some common pathways with KRAS and TP53. CHGA might be considered as a novel, promising, and powerful biomarker for early diagnosis of colon cancer. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

21 pages, 2308 KiB  
Article
A Prediction Model for Preoperative Risk Assessment in Endometrial Cancer Utilizing Clinical and Molecular Variables
by Erin A. Salinas, Marina D. Miller, Andreea M. Newtson, Deepti Sharma, Megan E. McDonald, Matthew E. Keeney, Brian J. Smith, David P. Bender, Michael J. Goodheart, Kristina W. Thiel, Eric J. Devor, Kimberly K. Leslie and Jesus Gonzalez Bosquet
Int. J. Mol. Sci. 2019, 20(5), 1205; https://doi.org/10.3390/ijms20051205 - 9 Mar 2019
Cited by 13 | Viewed by 3406
Abstract
The utility of comprehensive surgical staging in patients with low risk disease has been questioned. Thus, a reliable means of determining risk would be quite useful. The aim of our study was to create the best performing prediction model to classify endometrioid endometrial [...] Read more.
The utility of comprehensive surgical staging in patients with low risk disease has been questioned. Thus, a reliable means of determining risk would be quite useful. The aim of our study was to create the best performing prediction model to classify endometrioid endometrial cancer (EEC) patients into low or high risk using a combination of molecular and clinical-pathological variables. We then validated these models with publicly available datasets. Analyses between low and high risk EEC were performed using clinical and pathological data, gene and miRNA expression data, gene copy number variation and somatic mutation data. Variables were selected to be included in the prediction model of risk using cross-validation analysis; prediction models were then constructed using these variables. Model performance was assessed by area under the curve (AUC). Prediction models were validated using appropriate datasets in The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. A prediction model with only clinical variables performed at 88%. Integrating clinical and molecular data improved prediction performance up to 97%. The best prediction models included clinical, miRNA expression and/or somatic mutation data, and stratified pre-operative risk in EEC patients. Integrating molecular and clinical data improved the performance of prediction models to over 95%, resulting in potentially useful clinical tests. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

10 pages, 1827 KiB  
Communication
Population Substructure Has Implications in Validating Next-Generation Cancer Genomics Studies with TCGA
by Marina D. Miller, Eric J. Devor, Erin A. Salinas, Andreea M. Newtson, Michael J. Goodheart, Kimberly K. Leslie and Jesus Gonzalez-Bosquet
Int. J. Mol. Sci. 2019, 20(5), 1192; https://doi.org/10.3390/ijms20051192 - 8 Mar 2019
Cited by 6 | Viewed by 2638
Abstract
In the era of large genetic and genomic datasets, it has become crucially important to validate results of individual studies using data from publicly available sources, such as The Cancer Genome Atlas (TCGA). However, how generalizable are results from either an independent or [...] Read more.
In the era of large genetic and genomic datasets, it has become crucially important to validate results of individual studies using data from publicly available sources, such as The Cancer Genome Atlas (TCGA). However, how generalizable are results from either an independent or a large public dataset to the remainder of the population? The study presented here aims to answer that question. Utilizing next generation sequencing data from endometrial and ovarian cancer patients from both the University of Iowa and TCGA, genomic admixture of each population was analyzed using STRUCTURE and ADMIXTURE software. In our independent data set, one subpopulation was identified, whereas in TCGA 4–6 subpopulations were identified. Data presented here demonstrate how different the genetic substructures of the TCGA and University of Iowa populations are. Validation of genomic studies between two different population samples must be aware of, account for and be corrected for background genetic substructure. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

15 pages, 2045 KiB  
Communication
Molecular Characterization of Non-responders to Chemotherapy in Serous Ovarian Cancer
by Megan E. McDonald, Erin A. Salinas, Eric J. Devor, Andreea M. Newtson, Kristina W. Thiel, Michael J. Goodheart, David P. Bender, Brian J. Smith, Kimberly K. Leslie and Jesus Gonzalez-Bosquet
Int. J. Mol. Sci. 2019, 20(5), 1175; https://doi.org/10.3390/ijms20051175 - 7 Mar 2019
Cited by 11 | Viewed by 3441
Abstract
Nearly one-third of patients with high-grade serous ovarian cancer (HGSC) do not respond to initial treatment with platinum-based therapy. Genomic and clinical characterization of these patients may lead to potential alternative therapies. Here, the objective is to classify non-responders into subsets using clinical [...] Read more.
Nearly one-third of patients with high-grade serous ovarian cancer (HGSC) do not respond to initial treatment with platinum-based therapy. Genomic and clinical characterization of these patients may lead to potential alternative therapies. Here, the objective is to classify non-responders into subsets using clinical and molecular features. Using patients from The Cancer Genome Atlas (TCGA) dataset with platinum-resistant or platinum-refractory HGSC, we performed a genome-wide unsupervised cluster analysis that integrated clinical data, gene copy number variations, gene somatic mutations, and DNA promoter methylation. Pathway enrichment analysis was performed for each cluster to identify the targetable processes. Following the unsupervised cluster analysis, three distinct clusters of non-responders emerged. Cluster 1 had overrepresentation of the stage IV disease and suboptimal debulking, under-expression of miRNAs and mRNAs, hypomethylated DNA, “loss of function” TP53 mutations, and the overexpression of genes in the PDGFR pathway. Cluster 2 had low miRNA expression, generalized hypermethylation, MUC17 mutations, and significant activation of the HIF-1 signaling pathway. Cluster 3 had more optimally cytoreduced stage III patients, overexpression of miRNAs, mixed methylation patterns, and “gain of function” TP53 mutations. However, the survival for all clusters was similar. Integration of genomic and clinical data from patients that do not respond to chemotherapy has identified different subgroups or clusters. Pathway analysis further identified the potential alternative therapeutic targets for each cluster. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

Review

Jump to: Research

15 pages, 472 KiB  
Review
Knowledge Generation with Rule Induction in Cancer Omics
by Giovanni Scala, Antonio Federico, Vittorio Fortino, Dario Greco and Barbara Majello
Int. J. Mol. Sci. 2020, 21(1), 18; https://doi.org/10.3390/ijms21010018 - 18 Dec 2019
Cited by 7 | Viewed by 3481
Abstract
The explosion of omics data availability in cancer research has boosted the knowledge of the molecular basis of cancer, although the strategies for its definitive resolution are still not well established. The complexity of cancer biology, given by the high heterogeneity of cancer [...] Read more.
The explosion of omics data availability in cancer research has boosted the knowledge of the molecular basis of cancer, although the strategies for its definitive resolution are still not well established. The complexity of cancer biology, given by the high heterogeneity of cancer cells, leads to the development of pharmacoresistance for many patients, hampering the efficacy of therapeutic approaches. Machine learning techniques have been implemented to extract knowledge from cancer omics data in order to address fundamental issues in cancer research, as well as the classification of clinically relevant sub-groups of patients and for the identification of biomarkers for disease risk and prognosis. Rule induction algorithms are a group of pattern discovery approaches that represents discovered relationships in the form of human readable associative rules. The application of such techniques to the modern plethora of collected cancer omics data can effectively boost our understanding of cancer-related mechanisms. In fact, the capability of these methods to extract a huge amount of human readable knowledge will eventually help to uncover unknown relationships between molecular attributes and the malignant phenotype. In this review, we describe applications and strategies for the usage of rule induction approaches in cancer omics data analysis. In particular, we explore the canonical applications and the future challenges and opportunities posed by multi-omics integration problems. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

22 pages, 1488 KiB  
Review
TCGA-TCIA Impact on Radiogenomics Cancer Research: A Systematic Review
by Mario Zanfardino, Katia Pane, Peppino Mirabelli, Marco Salvatore and Monica Franzese
Int. J. Mol. Sci. 2019, 20(23), 6033; https://doi.org/10.3390/ijms20236033 - 29 Nov 2019
Cited by 38 | Viewed by 5565
Abstract
In the last decade, the development of radiogenomics research has produced a significant amount of papers describing relations between imaging features and several molecular ‘omic signatures arising from next-generation sequencing technology and their potential role in the integrated diagnostic field. The most vulnerable [...] Read more.
In the last decade, the development of radiogenomics research has produced a significant amount of papers describing relations between imaging features and several molecular ‘omic signatures arising from next-generation sequencing technology and their potential role in the integrated diagnostic field. The most vulnerable point of many of these studies lies in the poor number of involved patients. In this scenario, a leading role is played by The Cancer Genome Atlas (TCGA) and The Cancer Imaging Archive (TCIA), which make available, respectively, molecular ‘omic data and linked imaging data. In this review, we systematically collected and analyzed radiogenomic studies based on TCGA-TCIA data. We organized literature per tumor type and molecular ‘omic data in order to discuss salient imaging genomic associations and limitations of each study. Finally, we outlined the potential clinical impact of radiogenomics to improve the accuracy of diagnosis and the prediction of patient outcomes in oncology. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

27 pages, 5023 KiB  
Review
Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets
by Nicolas Sompairac, Petr V. Nazarov, Urszula Czerwinska, Laura Cantini, Anne Biton, Askhat Molkenov, Zhaxybay Zhumadilov, Emmanuel Barillot, Francois Radvanyi, Alexander Gorban, Ulykbek Kairov and Andrei Zinovyev
Int. J. Mol. Sci. 2019, 20(18), 4414; https://doi.org/10.3390/ijms20184414 - 7 Sep 2019
Cited by 51 | Viewed by 7402
Abstract
Independent component analysis (ICA) is a matrix factorization approach where the signals captured by each individual matrix factors are optimized to become as mutually independent as possible. Initially suggested for solving source blind separation problems in various fields, ICA was shown to be [...] Read more.
Independent component analysis (ICA) is a matrix factorization approach where the signals captured by each individual matrix factors are optimized to become as mutually independent as possible. Initially suggested for solving source blind separation problems in various fields, ICA was shown to be successful in analyzing functional magnetic resonance imaging (fMRI) and other types of biomedical data. In the last twenty years, ICA became a part of the standard machine learning toolbox, together with other matrix factorization methods such as principal component analysis (PCA) and non-negative matrix factorization (NMF). Here, we review a number of recent works where ICA was shown to be a useful tool for unraveling the complexity of cancer biology from the analysis of different types of omics data, mainly collected for tumoral samples. Such works highlight the use of ICA in dimensionality reduction, deconvolution, data pre-processing, meta-analysis, and others applied to different data types (transcriptome, methylome, proteome, single-cell data). We particularly focus on the technical aspects of ICA application in omics studies such as using different protocols, determining the optimal number of components, assessing and improving reproducibility of the ICA results, and comparison with other popular matrix factorization techniques. We discuss the emerging ICA applications to the integrative analysis of multi-level omics datasets and introduce a conceptual view on ICA as a tool for defining functional subsystems of a complex biological system and their interactions under various conditions. Our review is accompanied by a Jupyter notebook which illustrates the discussed concepts and provides a practical tool for applying ICA to the analysis of cancer omics datasets. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

12 pages, 569 KiB  
Review
Immune Checkpoint Blockade for Advanced NSCLC: A New Landscape for Elderly Patients
by Fabio Perrotta, Danilo Rocco, Fabiana Vitiello, Raffaele De Palma, Germano Guerra, Antonio De Luca, Neal Navani and Andrea Bianco
Int. J. Mol. Sci. 2019, 20(9), 2258; https://doi.org/10.3390/ijms20092258 - 7 May 2019
Cited by 32 | Viewed by 4457
Abstract
The therapeutic scenario for elderly patients with advanced NSCLC has been limited to radiotherapy and chemotherapy. Recently, a novel therapeutic approach based on targeting the immune-checkpoints has showed noteworthy results in advanced NSCLC. PD1/PD-L1 pathway is co-opted by tumor cells through the expression [...] Read more.
The therapeutic scenario for elderly patients with advanced NSCLC has been limited to radiotherapy and chemotherapy. Recently, a novel therapeutic approach based on targeting the immune-checkpoints has showed noteworthy results in advanced NSCLC. PD1/PD-L1 pathway is co-opted by tumor cells through the expression of PD-L1 on the tumor cell surface and on cells within the microenvironment, leading to suppression of anti-tumor cytolytic T-cell activity by the tumor. The success of immune-checkpoints inhibitors in clinical trials led to rapid approval by the FDA and EMA. Currently, data regarding efficacy and safety of ICIs in older subjects is limited by the poor number of elderly recruited in clinical trials. Careful assessment and management of comorbidities is essential to achieve better outcomes and limit the immune related adverse events in elderly NSCLC patients. Full article
(This article belongs to the Special Issue Data Analysis and Integration in Cancer Research)
Show Figures

Figure 1

Back to TopTop