Off-policy learning over heterogeneous information for recommendation

X Wang, Q Li, D Yu, G Xu - Proceedings of the ACM Web Conference …, 2022 - dl.acm.org
Reinforcement learning has recently become an active topic in recommender system
research, where the logged data that records interactions between items and users …

Efficient methods in counterfactual policy learning and sequential decision making

H Zenati - 2023 - theses.hal.science
Because logged data has become ubiquitous in wide-range applications and since
online exploration may be sensitive, counterfactual methods have gained significant …

[PDF] Counterfactual Estimation from Logged Data

R Féraud - 2023 - researchgate.net
Counterfactual Estimation from Logged Data. Raphaël Féraud, Orange Innovation, March 2023 …