Evaluating recommender systems: survey and framework

E Zangerle, C Bauer - ACM Computing Surveys, 2022 - dl.acm.org
The comprehensive evaluation of the performance of a recommender system is a complex
endeavor: many facets need to be considered in configuring an adequate and effective …

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

[HTML][HTML] Advances and challenges in conversational recommender systems: A survey

C Gao, W Lei, X He, M de Rijke, TS Chua - AI open, 2021 - Elsevier
Recommender systems exploit interaction history to estimate user preference, having been
heavily used in a wide range of industry applications. However, static recommendation …

Top-k off-policy correction for a REINFORCE recommender system

M Chen, A Beutel, P Covington, S Jain… - Proceedings of the …, 2019 - dl.acm.org
Industrial recommender systems deal with extremely large action spaces--many millions of
items to recommend. Moreover, they need to serve billions of users, who are unique at any …

Disentangling user interest and conformity for recommendation with causal embedding

Y Zheng, C Gao, X Li, X He, Y Li, D Jin - Proceedings of the Web …, 2021 - dl.acm.org
Recommendation models are usually trained on observational interaction data. However,
observational interaction data could result from users' conformity towards popular items …

KuaiRec: A fully-observed dataset and insights for evaluating recommender systems

C Gao, S Li, W Lei, J Chen, B Li, P Jiang, X He… - Proceedings of the 31st …, 2022 - dl.acm.org
The progress of recommender systems is hampered mainly by evaluation as it requires real-
time interactions between humans and systems, which is too laborious and expensive. This …

Unbiased recommender learning from missing-not-at-random implicit feedback

Y Saito, S Yaginuma, Y Nishino, H Sakata… - Proceedings of the 13th …, 2020 - dl.acm.org
Recommender systems widely use implicit feedback such as click data because of its
general availability. Although the presence of clicks signals the users' preference to some …

Reinforcement learning to optimize long-term user engagement in recommender systems

L Zou, L Xia, Z Ding, J Song, W Liu, D Yin - Proceedings of the 25th ACM …, 2019 - dl.acm.org
Recommender systems play a crucial role in our daily lives. Feed streaming mechanism has
been widely used in the recommender system, especially on the mobile Apps. The feed …

Kuairand: an unbiased sequential recommendation dataset with randomly exposed videos

C Gao, S Li, Y Zhang, J Chen, B Li, W Lei… - Proceedings of the 31st …, 2022 - dl.acm.org
Recommender systems deployed in real-world applications can have inherent exposure
bias, which leads to the biased logged data plaguing the researchers. A fundamental way to …

Doubly robust joint learning for recommendation on data missing not at random

X Wang, R Zhang, Y Sun, J Qi - International Conference on …, 2019 - proceedings.mlr.press
In recommender systems, usually the ratings of a user to most items are missing and a
critical problem is that the missing ratings are often missing not at random (MNAR) in reality …