Feature-based q-learning for two-player stochastic games

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

被引用次数：1638 相关文章所有 8 个版本

[PDF] ieee.org

Synergistic integration between machine learning and agent-based modeling: A multidisciplinary review

W Zhang, A Valencia, NB Chang - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Agent-based modeling (ABM) involves developing models in which agents make adaptive
decisions in a changing environment. Machine-learning (ML) based inference models can …

被引用次数：65 相关文章所有 4 个版本

[PDF] mlr.press

A sharp analysis of model-based reinforcement learning with self-play

Q Liu, T Yu, Y Bai, C Jin - International Conference on …, 2021 - proceedings.mlr.press

Abstract Model-based algorithms—algorithms that explore the environment through building
and utilizing an estimated model—are widely used in reinforcement learning practice and …

被引用次数：156 相关文章所有 6 个版本

[PDF] arxiv.org

V-Learning--A Simple, Efficient, Decentralized Algorithm for Multiagent RL

C Jin, Q Liu, Y Wang, T Yu - arXiv preprint arXiv:2110.14555, 2021 - arxiv.org

A major challenge of multiagent reinforcement learning (MARL) is the curse of multiagents,
where the size of the joint action space scales exponentially with the number of agents. This …

被引用次数：118 相关文章所有 3 个版本

[PDF] neurips.cc

Model-based multi-agent rl in zero-sum markov games with near-optimal sample complexity

K Zhang, S Kakade, T Basar… - Advances in Neural …, 2020 - proceedings.neurips.cc

Abstract Model-based reinforcement learning (RL), which finds an optimal policy using an
empirical model, has long been recognized as one of the cornerstones of RL. It is especially …

被引用次数：153 相关文章所有 12 个版本

[PDF] mlr.press

Provable self-play algorithms for competitive reinforcement learning

Y Bai, C Jin - International conference on machine learning, 2020 - proceedings.mlr.press

Self-play, where the algorithm learns by playing against itself without requiring any direct
supervision, has become the new weapon in modern Reinforcement Learning (RL) for …

被引用次数：188 相关文章所有 6 个版本

[PDF] neurips.cc

Reinforcement learning with general value function approximation: Provably efficient approach via bounded eluder dimension

R Wang, RR Salakhutdinov… - Advances in Neural …, 2020 - proceedings.neurips.cc

Value function approximation has demonstrated phenomenal empirical success in
reinforcement learning (RL). Nevertheless, despite a handful of recent progress on …

被引用次数：183 相关文章所有 6 个版本

[PDF] neurips.cc

Near-optimal reinforcement learning with self-play

Y Bai, C Jin, T Yu - Advances in neural information …, 2020 - proceedings.neurips.cc

This paper considers the problem of designing optimal algorithms for reinforcement learning
in two-player zero-sum games. We focus on self-play algorithms which learn the optimal …

被引用次数：162 相关文章所有 8 个版本

[PDF] mlr.press

Learning zero-sum simultaneous-move markov games using function approximation and correlated equilibrium

Q Xie, Y Chen, Z Wang, Z Yang - Conference on learning …, 2020 - proceedings.mlr.press

In this work, we develop provably efficient reinforcement learning algorithms for two-player
zero-sum Markov games with simultaneous moves. We consider a family of Markov games …

被引用次数：161 相关文章所有 6 个版本

[PDF] mlr.press

Breaking the curse of multiagency: Provably efficient decentralized multi-agent rl with function approximation

Y Wang, Q Liu, Y Bai, C Jin - The Thirty Sixth Annual …, 2023 - proceedings.mlr.press

A unique challenge in Multi-Agent Reinforcement Learning (MARL) is the\emph {curse of
multiagency}, where the description length of the game as well as the complexity of many …

被引用次数：31 相关文章所有 4 个版本