Pervasive AI for IoT applications: A survey on resource-efficient distributed artificial intelligence
Artificial intelligence (AI) has witnessed a substantial breakthrough in a variety of Internet of
Things (IoT) applications and services, spanning from recommendation systems and speech …
Things (IoT) applications and services, spanning from recommendation systems and speech …
Distributed multi-player bandits-a game of thrones approach
I Bistritz, A Leshem - Advances in Neural Information …, 2018 - proceedings.neurips.cc
We consider a multi-armed bandit game where N players compete for K arms for T turns.
Each player has different expected rewards for the arms, and the instantaneous rewards are …
Each player has different expected rewards for the arms, and the instantaneous rewards are …
Social learning in multi agent multi armed bandits
A Sankararaman, A Ganesh, S Shakkottai - Proceedings of the ACM on …, 2019 - dl.acm.org
Motivated by emerging need of learning algorithms for large scale networked and
decentralized systems, we introduce a distributed version of the classical stochastic Multi …
decentralized systems, we introduce a distributed version of the classical stochastic Multi …
Decentralized cooperative stochastic bandits
D Martínez-Rubio, V Kanade… - Advances in Neural …, 2019 - proceedings.neurips.cc
We study a decentralized cooperative stochastic multi-armed bandit problem with K arms on
a network of N agents. In our model, the reward distribution of each arm is the same for each …
a network of N agents. In our model, the reward distribution of each arm is the same for each …
Fast distributed bandits for online recommendation systems
Contextual bandit algorithms are commonly used in recommender systems, where content
popularity can change rapidly. These algorithms continuously learn latent mappings …
popularity can change rapidly. These algorithms continuously learn latent mappings …
Distributed bandit learning: Near-optimal regret with efficient communication
We study the problem of regret minimization for distributed bandits learning, in which $ M $
agents work collaboratively to minimize their total regret under the coordination of a central …
agents work collaboratively to minimize their total regret under the coordination of a central …
Learning with limited rounds of adaptivity: Coin tossing, multi-armed bandits, and ranking from pairwise comparisons
In many learning settings, active/adaptive querying is possible, but the number of rounds of
adaptivity is limited. We study the relationship between query complexity and adaptivity in …
adaptivity is limited. We study the relationship between query complexity and adaptivity in …
Linear bandits with limited adaptivity and learning distributional optimal design
Motivated by practical needs such as large-scale learning, we study the impact of adaptivity
constraints to linear contextual bandits, a central problem in online learning and decision …
constraints to linear contextual bandits, a central problem in online learning and decision …
Near-optimal collaborative learning in bandits
This paper introduces a general multi-agent bandit model in which each agent is facing a
finite set of arms and may communicate with other agents through a central controller in …
finite set of arms and may communicate with other agents through a central controller in …
Beyond regret for decentralized bandits in matching markets
S Basu, KA Sankararaman… - … on Machine Learning, 2021 - proceedings.mlr.press
We design decentralized algorithms for regret minimization in the two sided matching market
with one-sided bandit feedback that significantly improves upon the prior works (Liu et al …
with one-sided bandit feedback that significantly improves upon the prior works (Liu et al …