A review of off-policy evaluation in reinforcement learning

M Uehara, C Shi, N Kallus - arXiv preprint arXiv:2212.06355, 2022 - arxiv.org
Reinforcement learning (RL) is one of the most vibrant research frontiers in machine
learning and has been recently applied to solve a number of challenging problems. In this …

Doubly robust difference-in-differences estimators

PHC Sant'Anna, J Zhao - Journal of econometrics, 2020 - Elsevier
This article proposes doubly robust estimators for the average treatment effect on the treated
(ATT) in difference-in-differences (DID) research designs. In contrast to alternative DID …

Difference-in-differences with multiple time periods

B Callaway, PHC Sant'Anna - Journal of econometrics, 2021 - Elsevier
In this article, we consider identification, estimation, and inference procedures for treatment
effect parameters using Difference-in-Differences (DiD) with (i) multiple time periods,(ii) …

Double/debiased machine learning for treatment and structural parameters

V Chernozhukov, D Chetverikov, M Demirer, E Duflo… - 2018 - academic.oup.com
We revisit the classic semi‐parametric problem of inference on a low‐dimensional
parameter θ0 in the presence of high‐dimensional nuisance parameters η0. We depart from …

Locally robust semiparametric estimation

V Chernozhukov, JC Escanciano, H Ichimura… - …, 2022 - Wiley Online Library
Many economic and causal parameters depend on nonparametric or high dimensional first
steps. We give a general construction of locally robust/orthogonal moment functions for …

Orthogonal statistical learning

DJ Foster, V Syrgkanis - The Annals of Statistics, 2023 - projecteuclid.org
Orthogonal statistical learning Page 1 The Annals of Statistics 2023, Vol. 51, No. 3, 879–908
https://doi.org/10.1214/23-AOS2258 © Institute of Mathematical Statistics, 2023 ORTHOGONAL …

Double reinforcement learning for efficient off-policy evaluation in markov decision processes

N Kallus, M Uehara - Journal of Machine Learning Research, 2020 - jmlr.org
Off-policy evaluation (OPE) in reinforcement learning allows one to evaluate novel decision
policies without needing to conduct exploration, which is often costly or otherwise infeasible …

Program evaluation and causal inference with high‐dimensional data

A Belloni, V Chernozhukov, I Fernandez‐Val… - …, 2017 - Wiley Online Library
In this paper, we provide efficient estimators and honest confidence bands for a variety of
treatment effects including local average (LATE) and local quantile treatment effects (LQTE) …

Debiased machine learning without sample-splitting for stable estimators

Q Chen, V Syrgkanis, M Austern - Advances in Neural …, 2022 - proceedings.neurips.cc
Estimation and inference on causal parameters is typically reduced to a generalized method
of moments problem, which involves auxiliary functions that correspond to solutions to a …

Applied nonparametric instrumental variables estimation

JL Horowitz - Econometrica, 2011 - Wiley Online Library
Instrumental variables are widely used in applied econometrics to achieve identification and
carry out estimation and inference in models that contain endogenous explanatory …