Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer
The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

Qplex: Duplex dueling multi-agent q-learning

J Wang, Z Ren, T Liu, Y Yu, C Zhang - arXiv preprint arXiv:2008.01062, 2020 - arxiv.org
We explore value-based multi-agent reinforcement learning (MARL) in the popular
paradigm of centralized training with decentralized execution (CTDE). CTDE has an …

Rode: Learning roles to decompose multi-agent tasks

T Wang, T Gupta, A Mahajan, B Peng… - arXiv preprint arXiv …, 2020 - arxiv.org
Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …

Celebrating diversity in shared multi-agent reinforcement learning

C Li, T Wang, C Wu, Q Zhao… - Advances in Neural …, 2021 - proceedings.neurips.cc
Recently, deep multi-agent reinforcement learning (MARL) has shown the promise to solve
complex cooperative tasks. Its success is partly because of parameter sharing among …

Roma: Multi-agent reinforcement learning with emergent roles

T Wang, H Dong, V Lesser, C Zhang - arXiv preprint arXiv:2003.08039, 2020 - arxiv.org
The role concept provides a useful tool to design and understand complex multi-agent
systems, which allows agents with a similar role to share similar behaviors. However …

[PDF][PDF] A survey of multi-agent reinforcement learning with communication

C Zhu, M Dastani, S Wang - arXiv preprint arXiv:2203.08975, 2022 - researchgate.net
Communication is an effective mechanism for coordinating the behavior of multiple agents.
In the field of multi-agent reinforcement learning, agents can improve the overall learning …

Contrasting centralized and decentralized critics in multi-agent reinforcement learning

X Lyu, Y Xiao, B Daley, C Amato - arXiv preprint arXiv:2102.04402, 2021 - arxiv.org
Centralized Training for Decentralized Execution, where agents are trained offline using
centralized information but execute in a decentralized manner online, has gained popularity …

Dop: Off-policy multi-agent decomposed policy gradients

Y Wang, B Han, T Wang, H Dong… - … conference on learning …, 2020 - openreview.net
Multi-agent policy gradient (MAPG) methods recently witness vigorous progress. However,
there is a significant performance discrepancy between MAPG methods and state-of-the-art …

Towards a standardised performance evaluation protocol for cooperative marl

R Gorsane, O Mahjoub, RJ de Kock… - Advances in …, 2022 - proceedings.neurips.cc
Multi-agent reinforcement learning (MARL) has emerged as a useful approach to solving
decentralised decision-making problems at scale. Research in the field has been growing …

Pac: Assisted value factorization with counterfactual predictions in multi-agent reinforcement learning

H Zhou, T Lan, V Aggarwal - Advances in Neural …, 2022 - proceedings.neurips.cc
Multi-agent reinforcement learning (MARL) has witnessed significant progress with the
development of value function factorization methods. It allows optimizing a joint action-value …