Pervasive AI for IoT applications: A survey on resource-efficient distributed artificial intelligence

E Baccour, N Mhaisen, AA Abdellatif… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org
Artificial intelligence (AI) has witnessed a substantial breakthrough in a variety of Internet of
Things (IoT) applications and services, spanning from recommendation systems and speech …

Distributed multi-player bandits-a game of thrones approach

I Bistritz, A Leshem - Advances in Neural Information …, 2018 - proceedings.neurips.cc
We consider a multi-armed bandit game where N players compete for K arms for T turns.
Each player has different expected rewards for the arms, and the instantaneous rewards are …

Social learning in multi agent multi armed bandits

A Sankararaman, A Ganesh, S Shakkottai - Proceedings of the ACM on …, 2019 - dl.acm.org
Motivated by emerging need of learning algorithms for large scale networked and
decentralized systems, we introduce a distributed version of the classical stochastic Multi …

Decentralized cooperative stochastic bandits

D Martínez-Rubio, V Kanade… - Advances in Neural …, 2019 - proceedings.neurips.cc
We study a decentralized cooperative stochastic multi-armed bandit problem with K arms on
a network of N agents. In our model, the reward distribution of each arm is the same for each …

Fast distributed bandits for online recommendation systems

K Mahadik, Q Wu, S Li, A Sabne - Proceedings of the 34th ACM …, 2020 - dl.acm.org
Contextual bandit algorithms are commonly used in recommender systems, where content
popularity can change rapidly. These algorithms continuously learn latent mappings …

Distributed bandit learning: Near-optimal regret with efficient communication

Y Wang, J Hu, X Chen, L Wang - arXiv preprint arXiv:1904.06309, 2019 - arxiv.org
We study the problem of regret minimization for distributed bandits learning, in which $ M $
agents work collaboratively to minimize their total regret under the coordination of a central …

Learning with limited rounds of adaptivity: Coin tossing, multi-armed bandits, and ranking from pairwise comparisons

A Agarwal, S Agarwal, S Assadi… - … on Learning Theory, 2017 - proceedings.mlr.press
In many learning settings, active/adaptive querying is possible, but the number of rounds of
adaptivity is limited. We study the relationship between query complexity and adaptivity in …

Linear bandits with limited adaptivity and learning distributional optimal design

Y Ruan, J Yang, Y Zhou - Proceedings of the 53rd Annual ACM SIGACT …, 2021 - dl.acm.org
Motivated by practical needs such as large-scale learning, we study the impact of adaptivity
constraints to linear contextual bandits, a central problem in online learning and decision …

Near-optimal collaborative learning in bandits

C Réda, S Vakili, E Kaufmann - Advances in Neural …, 2022 - proceedings.neurips.cc
This paper introduces a general multi-agent bandit model in which each agent is facing a
finite set of arms and may communicate with other agents through a central controller in …

Beyond regret for decentralized bandits in matching markets

S Basu, KA Sankararaman… - … on Machine Learning, 2021 - proceedings.mlr.press
We design decentralized algorithms for regret minimization in the two sided matching market
with one-sided bandit feedback that significantly improves upon the prior works (Liu et al …