A novel evaluation methodology for assessing off-policy learning methods in contextual bandits

N Hassanpour, R Greiner - … on Artificial Intelligence, Canadian AI 2018 …, 2018 - Springer
We propose a novel evaluation methodology for assessing off-policy learning methods in
contextual bandits. In particular, we provide a way to use data from any given Randomized …

[PDF][PDF] A Novel Evaluation Methodology for Assessing Off-Policy Learning Methods in Contextual Bandits

N Hassanpour, R Greiner - academia.edu
A Novel Evaluation Methodology for Assessing Off-Policy Learning Methods in Contextual
Bandits Page 1 Synthesize many Observational Studies (2.B) Compute Counterfactual …