Machine learning in policy evaluation: new tools for causal inference

N Kreif, K DiazOrdaz - arXiv preprint arXiv:1903.00402, 2019 - arxiv.org
While machine learning (ML) methods have received a lot of attention in recent years, these
methods are primarily for prediction. Empirical researchers conducting policy evaluations …

Automated data-adaptive analytics for electronic healthcare data to study causal treatment effects

S Schneeweiss - Clinical epidemiology, 2018 - Taylor & Francis
Background Decision makers in health care increasingly rely on nonrandomized database
analyses to assess the effectiveness, safety, and value of medical products. Health care data …

Using super learner prediction modeling to improve high-dimensional propensity score estimation

R Wyss, S Schneeweiss, M Van Der Laan… - …, 2018 - journals.lww.com
The high-dimensional propensity score is a semiautomated variable selection algorithm that
can supplement expert knowledge to improve confounding control in nonexperimental …

[HTML][HTML] Application of targeted maximum likelihood estimation in public health and epidemiological studies: a systematic review

MJ Smith, RV Phillips, MA Luque-Fernandez… - Annals of …, 2023 - Elsevier
Purpose The targeted maximum likelihood estimation (TMLE) statistical data analysis
framework integrates machine learning, statistical theory, and statistical inference to provide …

High‐dimensional propensity scores for empirical covariate selection in secondary database studies: planning, implementation, and reporting

JA Rassen, P Blin, S Kloss… - … and Drug Safety, 2023 - Wiley Online Library
Real‐world evidence used for regulatory, payer, and clinical decision‐making requires
principled epidemiology in design and analysis, applying methods to minimize confounding …

[HTML][HTML] Analyses of child cardiometabolic phenotype following assisted reproductive technologies using a pragmatic trial emulation approach

JY Huang, S Cai, Z Huang, MT Tint, WL Yuan… - Nature …, 2021 - nature.com
Assisted reproductive technologies (ART) are increasingly used, however little is known
about the long-term health of ART-conceived offspring. Weak selection of comparison …

When can nonrandomized studies support valid inference regarding effectiveness or safety of new medical treatments?

JM Franklin, R Platt, NA Dreyer… - Clinical …, 2022 - Wiley Online Library
The randomized controlled trial (RCT) is the gold standard for evaluating the causal effects
of medications. Limitations of RCTs have led to increasing interest in using real‐world …

Selective machine learning of doubly robust functionals

Y Cui, EJ Tchetgen Tchetgen - Biometrika, 2024 - academic.oup.com
While model selection is a well-studied topic in parametric and nonparametric regression or
density estimation, selection of possibly high-dimensional nuisance parameters in …

Machine learning for improving high‐dimensional proxy confounder adjustment in healthcare database studies: An overview of the current literature

R Wyss, C Yanover, T El‐Hay, D Bennett… - … and drug safety, 2022 - Wiley Online Library
Purpose Supplementing investigator‐specified variables with large numbers of empirically
identified features that collectively serve as 'proxies' for unspecified or unmeasured factors …

Collaborative-controlled LASSO for constructing propensity score-based estimators in high-dimensional data

C Ju, R Wyss, JM Franklin… - … methods in medical …, 2019 - journals.sagepub.com
Propensity score-based estimators are increasingly used for causal inference in
observational studies. However, model selection for propensity score estimation in high …