Machine learning for propensity score estimation: A systematic review and reporting guidelines

Machine learning (ML) has become a common approach for estimating propensity scores (PSs) for quasi-experimental research using matching, weighting, or stratification on the PS. This systematic review examined 179 applications of ML for PS estimation across different fields, such as health, educatio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Psychological methods
Hauptverfasser: Leite, Walter, Zhang, Huibin, Collier, Zachary, Chawla, Kamal, Kong, Lingchen, Lee, YongSeok, Quan, Jia, Soyoye, Olushola
Format: Journal Article
Sprache:Englisch
Veröffentlicht: United States 16.10.2025
ISSN:1939-1463, 1939-1463
Online-Zugang:Weitere Angaben
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Machine learning (ML) has become a common approach for estimating propensity scores (PSs) for quasi-experimental research using matching, weighting, or stratification on the PS. This systematic review examined 179 applications of ML for PS estimation across different fields, such as health, education, social sciences, and business over 40 years. The results show that the gradient boosting machine (GBM) is the most frequently used method, followed by random forest. Classification and regression trees, neural networks, and the super learner were also used in more than 5% of studies. The most frequently used packages to estimate PSs were twang, gbm, and randomforest in the R statistical software. The review identified that critical steps of the propensity score analysis are frequently underreported. Covariate balance evaluation was not reported by 48.04% of articles. Also, improper use of values for covariate balance evaluation was identified in 13.97% of the studies. Only 22.8% of studies performed a sensitivity analysis. Many hyperparameter configurations were used for ML methods, but only 46.9% of studies reported the hyperparameters used. A set of guidelines for reporting the use of ML for PS estimation is provided. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1939-1463
1939-1463
DOI:10.1037/met0000789