Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

Synergistic integration between machine learning and agent-based modeling: A multidisciplinary review

W Zhang, A Valencia, NB Chang - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Agent-based modeling (ABM) involves developing models in which agents make adaptive
decisions in a changing environment. Machine-learning (ML) based inference models can …

A sharp analysis of model-based reinforcement learning with self-play

Q Liu, T Yu, Y Bai, C Jin - International Conference on …, 2021 - proceedings.mlr.press
Abstract Model-based algorithms—algorithms that explore the environment through building
and utilizing an estimated model—are widely used in reinforcement learning practice and …

V-Learning--A Simple, Efficient, Decentralized Algorithm for Multiagent RL

C Jin, Q Liu, Y Wang, T Yu - arXiv preprint arXiv:2110.14555, 2021 - arxiv.org
A major challenge of multiagent reinforcement learning (MARL) is the curse of multiagents,
where the size of the joint action space scales exponentially with the number of agents. This …

Model-based multi-agent rl in zero-sum markov games with near-optimal sample complexity

K Zhang, S Kakade, T Basar… - Advances in Neural …, 2020 - proceedings.neurips.cc
Abstract Model-based reinforcement learning (RL), which finds an optimal policy using an
empirical model, has long been recognized as one of the cornerstones of RL. It is especially …

Provable self-play algorithms for competitive reinforcement learning

Y Bai, C Jin - International conference on machine learning, 2020 - proceedings.mlr.press
Self-play, where the algorithm learns by playing against itself without requiring any direct
supervision, has become the new weapon in modern Reinforcement Learning (RL) for …

Reinforcement learning with general value function approximation: Provably efficient approach via bounded eluder dimension

R Wang, RR Salakhutdinov… - Advances in Neural …, 2020 - proceedings.neurips.cc
Value function approximation has demonstrated phenomenal empirical success in
reinforcement learning (RL). Nevertheless, despite a handful of recent progress on …

Near-optimal reinforcement learning with self-play

Y Bai, C Jin, T Yu - Advances in neural information …, 2020 - proceedings.neurips.cc
This paper considers the problem of designing optimal algorithms for reinforcement learning
in two-player zero-sum games. We focus on self-play algorithms which learn the optimal …

Learning zero-sum simultaneous-move markov games using function approximation and correlated equilibrium

Q Xie, Y Chen, Z Wang, Z Yang - Conference on learning …, 2020 - proceedings.mlr.press
In this work, we develop provably efficient reinforcement learning algorithms for two-player
zero-sum Markov games with simultaneous moves. We consider a family of Markov games …

Breaking the curse of multiagency: Provably efficient decentralized multi-agent rl with function approximation

Y Wang, Q Liu, Y Bai, C Jin - The Thirty Sixth Annual …, 2023 - proceedings.mlr.press
A unique challenge in Multi-Agent Reinforcement Learning (MARL) is the\emph {curse of
multiagency}, where the description length of the game as well as the complexity of many …