Offline a/b testing for recommender systems

E Zangerle, C Bauer - ACM Computing Surveys, 2022 - dl.acm.org

The comprehensive evaluation of the performance of a recommender system is a complex
endeavor: many facets need to be considered in configuring an adequate and effective …

被引用次数：166 相关文章所有 17 个版本

[PDF] arxiv.org

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

被引用次数：2082 相关文章所有 3 个版本

[HTML] sciencedirect.com

[HTML][HTML] Advances and challenges in conversational recommender systems: A survey

C Gao, W Lei, X He, M de Rijke, TS Chua - AI open, 2021 - Elsevier

Recommender systems exploit interaction history to estimate user preference, having been
heavily used in a wide range of industry applications. However, static recommendation …

被引用次数：292 相关文章所有 8 个版本

[PDF] arxiv.org

Top-k off-policy correction for a REINFORCE recommender system

M Chen, A Beutel, P Covington, S Jain… - Proceedings of the …, 2019 - dl.acm.org

Industrial recommender systems deal with extremely large action spaces--many millions of
items to recommend. Moreover, they need to serve billions of users, who are unique at any …

被引用次数：532 相关文章所有 10 个版本

[PDF] arxiv.org

Disentangling user interest and conformity for recommendation with causal embedding

Y Zheng, C Gao, X Li, X He, Y Li, D Jin - Proceedings of the Web …, 2021 - dl.acm.org

Recommendation models are usually trained on observational interaction data. However,
observational interaction data could result from users' conformity towards popular items …

被引用次数：340 相关文章所有 8 个版本

[PDF] arxiv.org

KuaiRec: A fully-observed dataset and insights for evaluating recommender systems

C Gao, S Li, W Lei, J Chen, B Li, P Jiang, X He… - Proceedings of the 31st …, 2022 - dl.acm.org

The progress of recommender systems is hampered mainly by evaluation as it requires real-
time interactions between humans and systems, which is too laborious and expensive. This …

被引用次数：113 相关文章所有 8 个版本

[PDF] arxiv.org

Unbiased recommender learning from missing-not-at-random implicit feedback

Y Saito, S Yaginuma, Y Nishino, H Sakata… - Proceedings of the 13th …, 2020 - dl.acm.org

Recommender systems widely use implicit feedback such as click data because of its
general availability. Although the presence of clicks signals the users' preference to some …

被引用次数：291 相关文章所有 6 个版本

[PDF] arxiv.org

Reinforcement learning to optimize long-term user engagement in recommender systems

L Zou, L Xia, Z Ding, J Song, W Liu, D Yin - Proceedings of the 25th ACM …, 2019 - dl.acm.org

Recommender systems play a crucial role in our daily lives. Feed streaming mechanism has
been widely used in the recommender system, especially on the mobile Apps. The feed …

被引用次数：281 相关文章所有 4 个版本

[PDF] arxiv.org

Kuairand: an unbiased sequential recommendation dataset with randomly exposed videos

C Gao, S Li, Y Zhang, J Chen, B Li, W Lei… - Proceedings of the 31st …, 2022 - dl.acm.org

Recommender systems deployed in real-world applications can have inherent exposure
bias, which leads to the biased logged data plaguing the researchers. A fundamental way to …

被引用次数：80 相关文章所有 8 个版本

[PDF] mlr.press

Doubly robust joint learning for recommendation on data missing not at random

X Wang, R Zhang, Y Sun, J Qi - International Conference on …, 2019 - proceedings.mlr.press

In recommender systems, usually the ratings of a user to most items are missing and a
critical problem is that the missing ratings are often missing not at random (MNAR) in reality …

被引用次数：251 相关文章所有 5 个版本