Double reinforcement learning for efficient off-policy evaluation in markov decision processes

N Kallus, M Uehara - Journal of Machine Learning Research, 2020 - jmlr.org
Off-policy evaluation (OPE) in reinforcement learning allows one to evaluate novel decision
policies without needing to conduct exploration, which is often costly or otherwise infeasible …

On the role of surrogates in the efficient estimation of treatment effects with limited outcome data

N Kallus, X Mao - Journal of the Royal Statistical Society Series …, 2024 - academic.oup.com
In many experimental and observational studies, the outcome of interest is often difficult or
expensive to observe, reducing effective sample sizes for estimating average treatment …

Doubly-valid/doubly-sharp sensitivity analysis for causal inference with unmeasured confounding

J Dorn, K Guo, N Kallus - Journal of the American Statistical …, 2024 - Taylor & Francis
We consider the problem of constructing bounds on the average treatment effect (ATE) when
unmeasured confounders exist but have bounded influence. Specifically, we assume that …

[HTML][HTML] Nonparametric bootstrap inference for the targeted highly adaptive least absolute shrinkage and selection operator (LASSO) estimator

W Cai, M van der Laan - The international journal of biostatistics, 2020 - degruyter.com
Abstract The Highly-Adaptive least absolute shrinkage and selection operator (LASSO)
Targeted Minimum Loss Estimator (HAL-TMLE) is an efficient plug-in estimator of a pathwise …

[PDF][PDF] hal9001: Scalable highly adaptive lasso regression inR

NS Hejazi, JR Coyle, MJ van der Laan - Journal of Open Source …, 2020 - joss.theoj.org
The hal9001 R package provides a computationally efficient implementation of the highly
adaptive lasso (HAL), a flexible nonparametric regression and machine learning algorithm …

Efficient estimation of pathwise differentiable target parameters with the undersmoothed highly adaptive lasso

MJ van der Laan, D Benkeser, W Cai - The International Journal of …, 2023 - degruyter.com
We consider estimation of a functional parameter of a realistically modeled data distribution
based on observing independent and identically distributed observations. The highly …

Multivariate trend filtering for lattice data

V Sadhanala, YX Wang, AJ Hu… - arXiv preprint arXiv …, 2021 - arxiv.org
We study a multivariate version of trend filtering, called Kronecker trend filtering or KTF, for
the case in which the design points form a lattice in $ d $ dimensions. KTF is a natural …

Estimation of time‐specific intervention effects on continuously distributed time‐to‐event outcomes by targeted maximum likelihood estimation

HCW Rytgaard, F Eriksson, MJ van der Laan - Biometrics, 2023 - Wiley Online Library
This work considers targeted maximum likelihood estimation (TMLE) of treatment effects on
absolute risk and survival probabilities in classical time‐to‐event settings characterized by …

Nonparametric inverse‐probability‐weighted estimators based on the highly adaptive lasso

A Ertefaie, NS Hejazi, MJ van der Laan - Biometrics, 2023 - Wiley Online Library
Inverse‐probability‐weighted estimators are the oldest and potentially most commonly used
class of procedures for the estimation of causal effects. By adjusting for selection biases via …

Adaptive debiased machine learning using data-driven model selection techniques

L van der Laan, M Carone, A Luedtke… - arXiv preprint arXiv …, 2023 - arxiv.org
Debiased machine learning estimators for nonparametric inference of smooth functionals of
the data-generating distribution can suffer from excessive variability and instability. For this …