Next Article in Journal
Metabolic Hormone Levels in Infants Fed Formulas with Age-Adapted Protein Concentrations from Birth to 12 Months
Previous Article in Journal
Ultra-Processed Food Consumption and Biological Aging in Italian Adults from the Moli-Sani Study Cohort
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Abstract

Machine Learning Tools Can Pinpoint High-Risk Water Pollutants †

1
Department of Materials and Environmental Chemistry, Stockholm University, Svante Arrhenius väg 16, 106 91 Stockholm, Sweden
2
Department of Environmental Science, Stockholm University, Svante Arrhenius väg 8, 106 91 Stockholm, Sweden
3
Institute of Biodiversity, Faculty of Biological Science, Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, 07743 Jena, Germany
*
Author to whom correspondence should be addressed.
Presented at the International Conference EcoBalt 2023 “Chemicals & Environment”, Tallinn, Estonia, 9–11 October 2023.
This author is Presenting author.
Proceedings 2023, 92(1), 68; https://doi.org/10.3390/proceedings2023092068
Published: 30 November 2023
(This article belongs to the Proceedings of International Conference EcoBalt 2023 "Chemicals & Environment")
Liquid chromatography–high-resolution mass spectrometry (LC/HRMS) is a powerful tool for detecting chemicals that are present in low concentrations. While this technique has revealed thousands of ionizable pollutants in environmental samples [1,2], the expanding list of emerging contaminants highlights the urgency to speed up their risk assessment [3,4]. Generally, the risk assessment workflow starts with structural identification, followed by obtaining the analytical standard for confirmation, toxicity assessment, and quantification with a calibration curve. To speed up the process, machine learning has found use in predicting toxicity and ionization efficiency; however, most of the models in use still require a chemical structure as an input. Therefore, detected but unidentified chemicals are frequently discarded from further analysis and the bioactivity of samples often remains partially unexplained [5]. Still, the fragmentation spectrum provides information about the structure which can be related to the properties of the chemical. We developed a workflow for estimating the risk of chemicals detected in non-target screening based on their MS2 data. Two prediction models, MS2Quant [6] for ionization efficiency and MS2Tox [7] for acute fish toxicity, were trained based on structural fingerprints. While structural fingerprints can be calculated from a structure, the recently developed SIRIUS+CSI:FingerID software [8] offers the possibility to predict these fingerprints based on the MS2 spectrum, and therefore predict chemical properties without structural assignment. Based on the validation set, the root mean square errors of MS2Quant and MS2Tox were 5.9× (39 chemicals) and 7.8× (219 chemicals), respectively. These models were applied in a non-target screening workflow regarding wastewater analysis. The preliminary results show that MS2Quant and MS2Tox help to pinpoint chemicals that pose a higher risk compared to a top five approach. Therefore, this approach provides the possibility to evaluate the risk of unidentified LC/HRMS features and prioritize high-risk chemicals in identification.

Author Contributions

H.S., P.P. and A.K. designed the research study. H.S. and P.P. developed the models and wrote the code. L.J., H.S. and L.M. performed the measurements. A.K., M.P. and M.M. (Michael McLachlan) performed supervision. A.K., J.M., M.M. (Matthew MacLeod) and M.B. acquired funding for the project. All authors have read and agreed to the published version of the manuscript.

Funding

The funding was generously provided by the Swedish Research Council for Sustainable Development, grant 2020-01511.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data will be made available together with the full publication.

Acknowledgments

The authors would like to thank Claudia Möckel and Merle Plassmann for their technical support.

Conflicts of Interest

The authors declare no competing financial interests.

References

  1. Papazian, S.; D’Agostino, L.A.; Sadiktsis, I.; Froment, J.; Bonnefille, B.; Sdougkou, K.; Xie, H.; Athanassiadis, I.; Budhavant, K.; Dasari, S.; et al. Nontarget Mass Spectrometry and in Silico Molecular Characterization of Air Pollution from the Indian Subcontinent. Commun. Earth Environ. 2022, 3, 35. [Google Scholar] [CrossRef]
  2. Schymanski, E.L.; Singer, H.P.; Slobodnik, J.; Ipolyi, I.M.; Oswald, P.; Krauss, M.; Schulze, T.; Haglund, P.; Letzel, T.; Grosse, S.; et al. Non-Target Screening with High-Resolution Mass Spectrometry: Critical Review Using a Collaborative Trial on Water Analysis. Anal. Bioanal. Chem. 2015, 407, 6237–6255. [Google Scholar] [CrossRef] [PubMed]
  3. Richardson, S.D.; Ternes, T.A. Water Analysis: Emerging Contaminants and Current Issues. Anal. Chem. 2022, 94, 382–416. [Google Scholar] [CrossRef] [PubMed]
  4. McCord, J.P.; Groff, L.C.; Sobus, J.R. Quantitative Non-Targeted Analysis: Bridging the Gap between Contaminant Discovery and Risk Characterization. Environ. Int. 2022, 158, 107011. [Google Scholar] [CrossRef] [PubMed]
  5. Houtman, C.J.; ten Broek, R.; van Oorschot, Y.; Kloes, D.; van der Oost, R.; Rosielle, M.; Lamoree, M.H. High Resolution Effect-Directed Analysis of Steroid Hormone (Ant)Agonists in Surface and Wastewater Quality Monitoring. Environ. Toxicol. Pharmacol. 2020, 80, 103460. [Google Scholar] [CrossRef] [PubMed]
  6. Sepman, H.; Malm, L.; Peets, P.; MacLeod, M.; Martin, J.; Breitholtz, M.; Kruve, A. Bypassing the Identification: MS2Quant for Concentration Estimations of Chemicals Detected with Nontarget LC-HRMS from MS 2 Data. Anal. Chem. 2023, 95, 12329–12338. [Google Scholar] [CrossRef] [PubMed]
  7. Peets, P.; Wang, W.-C.; MacLeod, M.; Breitholtz, M.; Martin, J.W.; Kruve, A. MS2Tox Machine Learning Tool for Predicting the Ecotoxicity of Unidentified Chemicals in Water by Nontarget LC-HRMS. Environ. Sci. Technol. 2022, 56, 15508–15517. [Google Scholar] [CrossRef]
  8. Dührkop, K.; Fleischauer, M.; Ludwig, M.; Aksenov, A.A.; Melnik, A.V.; Meusel, M.; Dorrestein, P.C.; Rousu, J.; Böcker, S. SIRIUS 4: A Rapid Tool for Turning Tandem Mass Spectra into Metabolite Structure Information. Nat. Methods 2019, 16, 299–302. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sepman, H.; Peets, P.; Jonsson, L.; Malm, L.; Posselt, M.; MacLeod, M.; Martin, J.; Breitholtz, M.; McLachlan, M.; Kruve, A. Machine Learning Tools Can Pinpoint High-Risk Water Pollutants. Proceedings 2023, 92, 68. https://doi.org/10.3390/proceedings2023092068

AMA Style

Sepman H, Peets P, Jonsson L, Malm L, Posselt M, MacLeod M, Martin J, Breitholtz M, McLachlan M, Kruve A. Machine Learning Tools Can Pinpoint High-Risk Water Pollutants. Proceedings. 2023; 92(1):68. https://doi.org/10.3390/proceedings2023092068

Chicago/Turabian Style

Sepman, Helen, Pilleriin Peets, Lisa Jonsson, Louise Malm, Malte Posselt, Matthew MacLeod, Jonathan Martin, Magnus Breitholtz, Michael McLachlan, and Anneli Kruve. 2023. "Machine Learning Tools Can Pinpoint High-Risk Water Pollutants" Proceedings 92, no. 1: 68. https://doi.org/10.3390/proceedings2023092068

APA Style

Sepman, H., Peets, P., Jonsson, L., Malm, L., Posselt, M., MacLeod, M., Martin, J., Breitholtz, M., McLachlan, M., & Kruve, A. (2023). Machine Learning Tools Can Pinpoint High-Risk Water Pollutants. Proceedings, 92(1), 68. https://doi.org/10.3390/proceedings2023092068

Article Metrics

Back to TopTop