Competing for shareable arms in multi-player multi-armed bandits

R Xu, H Wang, X Zhang, B Li… - … Conference on Machine …, 2023 - proceedings.mlr.press
Competitions for shareable and limited resources have long been studied with strategic
agents. In reality, agents often have to learn and maximize the rewards of the resources at …

User welfare optimization in recommender systems with competing content creators

F Yao, Y Liao, M Wu, C Li, Y Zhu, J Yang, J Liu… - Proceedings of the 30th …, 2024 - dl.acm.org
Driven by the new economic opportunities created by the creator economy, an increasing
number of content creators rely on and compete for revenue generated from online content …

Learning from Imperfect Human Feedback: a Tale from Corruption-Robust Dueling

Y Cheng, F Yao, X Liu, H Xu - arXiv preprint arXiv:2405.11204, 2024 - arxiv.org
This paper studies Learning from Imperfect Human Feedback (LIHF), motivated by humans'
potential irrationality or imperfect perception of true preference. We revisit the classic dueling …