Article

Machine Learning-Based Non-Invasive Prediction of Metabolic Dysfunction-Associated Steatohepatitis in Obese Patients: A Retrospective Study

Jie Chen, Bo Zhang, Yong Cheng, Yuanchen Jia and Biao Zhou

1 Department of Ultrasound, China-Japan Friendship Hospital, Beijing 100029, China
2 School of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
3 Department of General Surgery & Obesity and Metabolic Disease Center, China-Japan Friendship Hospital, Beijing 100029, China
* Author to whom correspondence should be addressed.
Diagnostics 2025, 15(9), 1096; https://doi.org/10.3390/diagnostics15091096
Submission received: 12 March 2025 / Revised: 19 April 2025 / Accepted: 22 April 2025 / Published: 25 April 2025
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

Abstract

Objectives: We aimed to develop and validate machine learning (ML) models that integrate clinical and laboratory data for the non-invasive prediction of metabolic dysfunction-associated steatohepatitis (MASH) in an obese population. Methods: In this retrospective study, clinical and laboratory data were collected from obese patients undergoing bariatric surgery. The cohort was divided into training and validation sets using stratified random sampling, and optimal features were selected with SHapley Additive exPlanations (SHAP). Eight ML models (K-nearest neighbors, linear support vector machine, radial basis function support vector machine, Gaussian process, random forest, multilayer perceptron, adaptive boosting, and naïve Bayes) were developed through cross-validation and hyperparameter tuning. Diagnostic performance was assessed via the area under the curve (AUC) in both the training and validation sets. Results: A total of 558 patients were analyzed, with 390 in the training set and 168 in the validation set. In the training cohort, the median age was 35 years, the median body mass index (BMI) was 39.8 kg/m², 39.0% were male, 37.9% had diabetes mellitus, and 62.8% were diagnosed with MASH. In the validation cohort, the median age was 34.1 years, the median BMI was 42.5 kg/m², 41.7% were male, 32.7% had diabetes mellitus, and 39.9% were diagnosed with MASH. The random forest achieved the highest performance, with AUC values of 0.94 in the training set and 0.88 in the validation set. The Gaussian process model attained an AUC of 0.97 in the training cohort but only 0.79 in the validation cohort, while the other models achieved AUC values ranging from 0.63 to 0.88 in the training cohort and from 0.62 to 0.75 in the validation cohort. Conclusions: ML models, particularly the random forest, effectively predict MASH using readily available data, offering a promising non-invasive alternative to conventional serological scoring. Prospective studies and external validations are needed to further establish clinical utility.
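
The two evaluation steps named in the Methods, stratified random sampling and AUC, can be illustrated with a minimal, dependency-free sketch. This is not the authors' pipeline: the function names `stratified_split` and `roc_auc`, the 30% hold-out fraction, and the toy labels and scores are all assumptions made for illustration, and a real analysis would more likely rely on an established ML library.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.3, seed=0):
    """Split sample indices so each class keeps roughly the same share in both sets.

    Hypothetical helper: returns (train_idx, test_idx) as sorted index lists.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)

    train_idx, test_idx = [], []
    for y in sorted(by_class):
        idx = by_class[y][:]
        rng.shuffle(idx)  # random assignment within each class
        n_test = round(len(idx) * test_fraction)
        test_idx.extend(idx[:n_test])
        train_idx.extend(idx[n_test:])
    return sorted(train_idx), sorted(test_idx)


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (assumed, not from the study): 60 positives and 40 negatives.
toy_labels = [1] * 60 + [0] * 40
train_idx, test_idx = stratified_split(toy_labels, test_fraction=0.3, seed=1)
print(len(train_idx), len(test_idx))  # 70 30

# Four toy predictions: three of the four positive/negative pairs are ranked correctly.
print(roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))  # 0.75
```

The pairwise definition of AUC used here is quadratic in the sample size, which is acceptable for a cohort of a few hundred patients; rank-based implementations give the same value more efficiently.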
Keywords: metabolic dysfunction-associated steatohepatitis (MASH); metabolic dysfunction-associated fatty liver disease (MAFLD); machine learning; non-invasive diagnosis

Share and Cite

MDPI and ACS Style

Chen, J.; Zhang, B.; Cheng, Y.; Jia, Y.; Zhou, B. Machine Learning-Based Non-Invasive Prediction of Metabolic Dysfunction-Associated Steatohepatitis in Obese Patients: A Retrospective Study. Diagnostics 2025, 15, 1096. https://doi.org/10.3390/diagnostics15091096

