Next Article in Journal
Validated and Predictive Processing of Gas Chromatography-Mass Spectrometry Based Metabolomics Data for Large Scale Screening Studies, Diagnostics and Metabolite Pattern Verification
Next Article in Special Issue
A Comprehensive Workflow of Mass Spectrometry-Based Untargeted Metabolomics in Cancer Metabolic Biomarker Discovery Using Human Plasma and Urine
Previous Article in Journal
Characterization of the Interaction Between the Small Regulatory Peptide SgrT and the EIICBGlc of the Glucose-Phosphotransferase System of E. coli K-12
Previous Article in Special Issue
Separation Technique for the Determination of Highly Polar Metabolites in Biological Samples
Metabolites 2012, 2(4), 775-795; doi:10.3390/metabo2040775

A Guideline to Univariate Statistical Analysis for LC/MS-Based Untargeted Metabolomics-Derived Data

1,2,3,* , 1,3
1 Metabolomics Platform, Campus Sescelades, Edifici N2, Rovira i Virgili University, Tarragona 43007, Spain 2 Spanish Biomedical Research Center in Diabetes and Associated Metabolic Disorders (CIBERDEM), Passeig Bonanova 69, Barcelona 08017, Spain 3 Institut d’Investigació Biomédica Pere Virgili (IISPV), C/Sant Llorenç, 21, Reus 43201, Spain 4 Institute for Research in Biomedicine (IRB Barcelona), Barcelona 08028, Spain 5 Department of Biochemistry and Molecular Biology, University of Barcelona, Barcelona 08028, Spain
* Authors to whom correspondence should be addressed.
Received: 2 August 2012 / Revised: 2 October 2012 / Accepted: 10 October 2012 / Published: 18 October 2012
(This article belongs to the Special Issue Analytical Techniques in Metabolomics)
View Full-Text   |   Download PDF [420 KB, 19 October 2012; original version 18 October 2012]   |   Browse Figures


Several metabolomic software programs provide methods for peak picking, retention time alignment and quantification of metabolite features in LC/MS-based metabolomics. Statistical analysis, however, is needed in order to discover those features significantly altered between samples. By comparing the retention time and MS/MS data of a model compound to that from the altered feature of interest in the research sample, metabolites can be then unequivocally identified. This paper reports on a comprehensive overview of a workflow for statistical analysis to rank relevant metabolite features that will be selected for further MS/MS experiments. We focus on univariate data analysis applied in parallel on all detected features. Characteristics and challenges of this analysis are discussed and illustrated using four different real LC/MS untargeted metabolomic datasets. We demonstrate the influence of considering or violating mathematical assumptions on which univariate statistical test rely, using high-dimensional LC/MS datasets. Issues in data analysis such as determination of sample size, analytical variation, assumption of normality and homocedasticity, or correction for multiple testing are discussed and illustrated in the context of our four untargeted LC/MS working examples.
Keywords: univariate; metabolomics; mass spectrometry univariate; metabolomics; mass spectrometry
This is an open access article distributed under the Creative Commons Attribution License (CC BY 3.0).

Share & Cite This Article

Further Mendeley | CiteULike
Export to BibTeX |
EndNote |
MDPI and ACS Style

Vinaixa, M.; Samino, S.; Saez, I.; Duran, J.; Guinovart, J.J.; Yanes, O. A Guideline to Univariate Statistical Analysis for LC/MS-Based Untargeted Metabolomics-Derived Data. Metabolites 2012, 2, 775-795.

View more citation formats

Related Articles

Article Metrics

For more information on the journal, click here


[Return to top]
Metabolites EISSN 2218-1989 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert