Next Article in Journal
An Integrative Approach to Assessing Diet–Cancer Relationships
Previous Article in Journal
Comparative Metabolomics and Molecular Phylogenetics of Melon (Cucumis melo, Cucurbitaceae) Biodiversity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Deriving Lipid Classification Based on Molecular Formulas

by
Joshua M. Mitchell
1,2,3,
Robert M. Flight
1,2,3 and
Hunter N.B. Moseley
1,2,3,4,5,*
1
Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA
2
Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
3
Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
4
Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40536, USA
5
Center for Clinical and Translational Science, University of Kentucky, Lexington, KY 40536, USA
*
Author to whom correspondence should be addressed.
Metabolites 2020, 10(3), 122; https://doi.org/10.3390/metabo10030122
Submission received: 20 January 2020 / Revised: 2 March 2020 / Accepted: 21 March 2020 / Published: 24 March 2020

Abstract

Despite instrument and algorithmic improvements, the untargeted and accurate assignment of metabolites remains an unsolved problem in metabolomics. New assignment methods such as our SMIRFE algorithm can assign elemental molecular formulas to observed spectral features in a highly untargeted manner without orthogonal information from tandem MS or chromatography. However, for many lipidomics applications, it is necessary to know at least the lipid category or class that is associated with a detected spectral feature to derive a biochemical interpretation. Our goal is to develop a method for robustly classifying elemental molecular formula assignments into lipid categories for an application to SMIRFE-generated assignments. Using a Random Forest machine learning approach, we developed a method that can predict lipid category and class from SMIRFE non-adducted molecular formula assignments. Our methods achieve high average predictive accuracy (>90%) and precision (>83%) across all eight of the lipid categories in the LIPIDMAPS database. Classification performance was evaluated using sets of theoretical, data-derived, and artifactual molecular formulas. Our methods enable the lipid classification of non-adducted molecular formula assignments generated by SMIRFE without orthogonal information, facilitating the biochemical interpretation of untargeted lipidomics experiments. This lipid classification appears insufficient for validating single-spectrum assignments, but could be useful in cross-spectrum assignment validation.
Keywords: SMIRFE; lipidomics; metabolomics; lipid category; machine learning; Random Forest SMIRFE; lipidomics; metabolomics; lipid category; machine learning; Random Forest

Share and Cite

MDPI and ACS Style

Mitchell, J.M.; Flight, R.M.; Moseley, H.N.B. Deriving Lipid Classification Based on Molecular Formulas. Metabolites 2020, 10, 122. https://doi.org/10.3390/metabo10030122

AMA Style

Mitchell JM, Flight RM, Moseley HNB. Deriving Lipid Classification Based on Molecular Formulas. Metabolites. 2020; 10(3):122. https://doi.org/10.3390/metabo10030122

Chicago/Turabian Style

Mitchell, Joshua M., Robert M. Flight, and Hunter N.B. Moseley. 2020. "Deriving Lipid Classification Based on Molecular Formulas" Metabolites 10, no. 3: 122. https://doi.org/10.3390/metabo10030122

APA Style

Mitchell, J. M., Flight, R. M., & Moseley, H. N. B. (2020). Deriving Lipid Classification Based on Molecular Formulas. Metabolites, 10(3), 122. https://doi.org/10.3390/metabo10030122

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop