Counterfactual learning of stochastic policies with continuous actions: from models to offline...

文章

学术资源搜索

获得 3 条结果（用时0.02秒）

我的图书馆

Counterfactual learning of stochastic policies with continuous actions: from models to offline...

在引用文章中搜索

[PDF] researchgate.net

Off-policy learning over heterogeneous information for recommendation

X Wang, Q Li, D Yu, G Xu - Proceedings of the ACM Web Conference …, 2022 - dl.acm.org

Reinforcement learning has recently become an active topic in recommender system
research, where the logged data that records interactions between items and users …

被引用次数：9 相关文章所有 6 个版本

[PDF] hal.science

Efficient methods in counterfactual policy learning and sequential decision making

H Zenati - 2023 - theses.hal.science

Because logged data has become ubiquitous in wide-range applications and since
onlineexploration may be sensitive, counterfactual methods have gained significant …

[PDF][PDF] Counterfactual Estimation from Logged Data

R Féraud - 2023 - researchgate.net

Counterfactual Estimation from Logged Data Page 1 Counterfactual Estimation from Logged
Data Raphaël Féraud ORANGE Innovation March 2023 Raphaël Féraud (Orange Innovation) …