[2510.22293] Predicting Metabolic Dysfunction-Associated Steatotic Liver Disease using Machine Learning Methods: A Retrospective Cohort Study
Summary
This study presents a machine learning model for predicting Metabolic Dysfunction-Associated Steatotic Liver Disease (MASLD) using clinical data from a large cohort, aiming to enhance early detection and address disparities in prediction accuracy across demographics.
Why It Matters
MASLD is a prevalent condition affecting a significant portion of the U.S. adult population, often leading to severe health complications. This research contributes to the field by offering a predictive model that can be integrated into clinical practice, potentially improving patient outcomes and addressing healthcare disparities.
Key Takeaways
- The study developed a machine learning model (MASER) for MASLD prediction using a large electronic health record database.
- LASSO logistic regression was chosen for its interpretability and performance, achieving an AUROC of 0.84.
- Fairness adjustments improved overall accuracy but reduced sensitivity, highlighting the trade-offs in model performance.
- The model is designed for clinical implementation, facilitating early detection in primary care settings.
- Addressing disparities in prediction accuracy is crucial for equitable healthcare delivery.
Computer Science > Machine Learning arXiv:2510.22293 (cs) [Submitted on 25 Oct 2025 (v1), last revised 23 Feb 2026 (this version, v3)] Title:Predicting Metabolic Dysfunction-Associated Steatotic Liver Disease using Machine Learning Methods: A Retrospective Cohort Study Authors:Mary E. An, Paul Griffin, Jonathan G. Stine, Ramakrishna Balakrishnan, Soundar Kumara View a PDF of the paper titled Predicting Metabolic Dysfunction-Associated Steatotic Liver Disease using Machine Learning Methods: A Retrospective Cohort Study, by Mary E. An and 4 other authors View PDF Abstract:Background and Aims: Metabolic Dysfunction-Associated Steatotic Liver Disease (MASLD) affects 30-40% of U.S. adults and is the most common chronic liver disease. Although often asymptomatic, progression can lead to cirrhosis. We developed a prediction model to assist with early detection of MASLD. Approach and Results: We evaluated LASSO logistic regression, random forest, XGBoost, and a neural network model for MASLD prediction using clinical feature subsets from a large electronic health record (EHR) database, including the top 10 ranked features. To reduce disparities in true positive rates across racial and ethnic subgroups, we applied an equal opportunity postprocessing method in a prediction model called MASLD EHR Static Risk Prediction (MASER). This retrospective cohort study included 59,492 participants in the training data, 24,198 in the validating data, and 25,188 in the testing data. The LASSO lo...