Ensemble learning predicts multiple sclerosis disease course in the SUMMIT study

The rate of disability accumulation varies across multiple sclerosis (MS) patients. Machine learning techniques may offer more powerful means to predict disease course in MS patients. In our study, 724 patients from the Comprehensive Longitudinal Investigation in MS at Brigham and Women’s Hospital (...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:NPJ digital medicine Jg. 3; H. 1; S. 135 - 8
Hauptverfasser: Zhao, Yijun, Wang, Tong, Bove, Riley, Cree, Bruce, Henry, Roland, Lokhande, Hrishikesh, Polgar-Turcsanyi, Mariann, Anderson, Mark, Bakshi, Rohit, Weiner, Howard L., Chitnis, Tanuja
Format: Journal Article
Sprache:Englisch
Veröffentlicht: London Nature Publishing Group UK 16.10.2020
Nature Publishing Group
Nature Portfolio
Schlagworte:
ISSN:2398-6352, 2398-6352
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The rate of disability accumulation varies across multiple sclerosis (MS) patients. Machine learning techniques may offer more powerful means to predict disease course in MS patients. In our study, 724 patients from the Comprehensive Longitudinal Investigation in MS at Brigham and Women’s Hospital (CLIMB study) and 400 patients from the EPIC dataset, University of California, San Francisco, were included in the analysis. The primary outcome was an increase in Expanded Disability Status Scale ( EDSS ) ≥ 1.5 (worsening) or not (non-worsening) at up to 5 years after the baseline visit. Classification models were built using the CLIMB dataset with patients’ clinical and MRI longitudinal observations in first 2 years, and further validated using the EPIC dataset. We compared the performance of three popular machine learning algorithms ( SVM, Logistic Regression, and Random Forest ) and three ensemble learning approaches ( XGBoost, LightGBM , and a Meta-learner L ). A “threshold” was established to trade-off the performance between the two classes. Predictive features were identified and compared among different models. Machine learning models achieved 0.79 and 0.83 AUC scores for the CLIMB and EPIC datasets, respectively, shortly after disease onset. Ensemble learning methods were more effective and robust compared to standalone algorithms. Two ensemble models, XGBoost and LightGBM were superior to the other four models evaluated in our study. Of variables evaluated, EDSS, Pyramidal Function , and Ambulatory Index were the top common predictors in forecasting the MS disease course. Machine learning techniques, in particular ensemble methods offer increased accuracy for the prediction of MS disease course.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2398-6352
2398-6352
DOI:10.1038/s41746-020-00338-8