Competing for shareable arms in multi-player multi-armed bandits
Competitions for shareable and limited resources have long been studied with strategic
agents. In reality, agents often have to learn and maximize the rewards of the resources at …
agents. In reality, agents often have to learn and maximize the rewards of the resources at …
User welfare optimization in recommender systems with competing content creators
Driven by the new economic opportunities created by the creator economy, an increasing
number of content creators rely on and compete for revenue generated from online content …
number of content creators rely on and compete for revenue generated from online content …
Learning from Imperfect Human Feedback: a Tale from Corruption-Robust Dueling
This paper studies Learning from Imperfect Human Feedback (LIHF), motivated by humans'
potential irrationality or imperfect perception of true preference. We revisit the classic dueling …
potential irrationality or imperfect perception of true preference. We revisit the classic dueling …