Offline reinforcement learning: Tutorial, review, and perspectives on open problems
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …
started on research on offline reinforcement learning algorithms: reinforcement learning …
Machine learning methods that economists should know about
We discuss the relevance of the recent machine learning (ML) literature for economics and
econometrics. First we discuss the differences in goals, methods, and settings between the …
econometrics. First we discuss the differences in goals, methods, and settings between the …
The impact of machine learning on economics
S Athey - The economics of artificial intelligence: An agenda, 2018 - degruyter.com
I believe that machine learning (ML) will have a dramatic impact on the field of economics
within a short time frame. Indeed, the impact of ML on economics is already well underway …
within a short time frame. Indeed, the impact of ML on economics is already well underway …
Introduction to multi-armed bandits
A Slivkins - Foundations and Trends® in Machine Learning, 2019 - nowpublishers.com
Multi-armed bandits a simple but very powerful framework for algorithms that make
decisions over time under uncertainty. An enormous body of work has accumulated over the …
decisions over time under uncertainty. An enormous body of work has accumulated over the …
Neurosymbolic programming
We survey recent work on neurosymbolic programming, an emerging area that bridges the
areas of deep learning and program synthesis. Like in classic machine learning, the goal …
areas of deep learning and program synthesis. Like in classic machine learning, the goal …
Q-learning: Theory and applications
J Clifton, E Laber - Annual Review of Statistics and Its …, 2020 - annualreviews.org
Q-learning, originally an incremental algorithm for estimating an optimal decision strategy in
an infinite-horizon decision problem, now refers to a general class of reinforcement learning …
an infinite-horizon decision problem, now refers to a general class of reinforcement learning …
Constrained Bayesian optimization with noisy experiments
Constrained Bayesian Optimization with Noisy Experiments Page 1 Bayesian Analysis (2019)
14, Number 2, pp. 495–519 Constrained Bayesian Optimization with Noisy Experiments …
14, Number 2, pp. 495–519 Constrained Bayesian Optimization with Noisy Experiments …
Double reinforcement learning for efficient off-policy evaluation in markov decision processes
Off-policy evaluation (OPE) in reinforcement learning allows one to evaluate novel decision
policies without needing to conduct exploration, which is often costly or otherwise infeasible …
policies without needing to conduct exploration, which is often costly or otherwise infeasible …
Kinematic state abstraction and provably efficient rich-observation reinforcement learning
We present an algorithm, HOMER, for exploration and reinforcement learning in rich
observation environments that are summarizable by an unknown latent state space. The …
observation environments that are summarizable by an unknown latent state space. The …
Balanced linear contextual bandits
Contextual bandit algorithms are sensitive to the estimation method of the outcome model as
well as the exploration method used, particularly in the presence of rich heterogeneity or …
well as the exploration method used, particularly in the presence of rich heterogeneity or …