所有版本 - 学术资源搜索

文章

学术资源搜索

获得 2 条结果（用时0.02秒）

A novel evaluation methodology for assessing off-policy learning methods in contextual bandits

N Hassanpour, R Greiner - … on Artificial Intelligence, Canadian AI 2018 …, 2018 - Springer

We propose a novel evaluation methodology for assessing off-policy learning methods in
contextual bandits. In particular, we provide a way to use data from any given Randomized …

被引用次数：4 相关文章

[PDF] academia.edu

[PDF][PDF] A Novel Evaluation Methodology for Assessing Off-Policy Learning Methods in Contextual Bandits

N Hassanpour, R Greiner - academia.edu

A Novel Evaluation Methodology for Assessing Off-Policy Learning Methods in Contextual
Bandits Page 1 Synthesize many Observational Studies (2.B) Compute Counterfactual …