Distributed bandits with heterogeneous agents

L Yang, YZJ Chen, S Pasteris… - Advances in …, 2021 - proceedings.neurips.cc

This paper studies a cooperative multi-armed bandit problem with $ M $ agents cooperating
together to solve the same instance of a $ K $-armed stochastic bandit problem with the goal …

被引用次数：25 相关文章所有 13 个版本

[PDF] neurips.cc

Fair exploration via axiomatic bargaining

J Baek, V Farias - Advances in Neural Information …, 2021 - proceedings.neurips.cc

Motivated by the consideration of fairly sharing the cost of exploration between multiple
groups in learning problems, we develop the Nash bargaining solution in the context of multi …

被引用次数：31 相关文章所有 8 个版本

[PDF] mlr.press

On-demand communication for asynchronous multi-agent bandits

YZJ Chen, L Yang, X Wang, X Liu… - International …, 2023 - proceedings.mlr.press

This paper studies a cooperative multi-agent multi-armed stochastic bandit problem where
agents operate asynchronously–agent pull times and rates are unknown, irregular, and …

被引用次数：9 相关文章所有 8 个版本

[PDF] mlr.press

Exploration for free: how does reward heterogeneity improve regret in cooperative multi-agent bandits?

X Wang, L Yang, YZJ Chen, X Liu… - Uncertainty in …, 2023 - proceedings.mlr.press

This paper studies a cooperative multi-agent bandit scenario in which the rewards observed
by agents are heterogeneous—one agent's meat can be another agent's poison …

被引用次数：1 相关文章所有 9 个版本

[PDF] arxiv.org

Cooperative multi-agent bandits: Distributed algorithms with optimal individual regret and constant communication costs

L Yang, X Wang, M Hajiesmaili, L Zhang, J Lui… - arXiv preprint arXiv …, 2023 - arxiv.org

Recently, there has been extensive study of cooperative multi-agent multi-armed bandits
where a set of distributed agents cooperatively play the same multi-armed bandit game. The …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Adversarial Attacks on Cooperative Multi-agent Bandits

J Zuo, Z Zhang, X Wang, C Chen, S Li, J Lui… - arXiv preprint arXiv …, 2023 - arxiv.org

Cooperative multi-agent multi-armed bandits (CMA2B) consider the collaborative efforts of
multiple agents in a shared multi-armed bandit game. We study latent vulnerabilities …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org