Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer
The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Contrasting centralized and decentralized critics in multi-agent reinforcement learning

X Lyu, Y Xiao, B Daley, C Amato - arXiv preprint arXiv:2102.04402, 2021 - arxiv.org
Centralized Training for Decentralized Execution, where agents are trained offline using
centralized information but execute in a decentralized manner online, has gained popularity …

[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey

P Hernandez-Leal, B Kartal, ME Taylor - learning, 2018 - researchgate.net
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Solving multi-agent routing problems using deep attention mechanisms

G Bono, JS Dibangoye, O Simonin… - IEEE Transactions …, 2020 - ieeexplore.ieee.org
Routing delivery vehicles to serve customers in dynamic and uncertain environments like
dense city centers is a challenging task that requires robustness and flexibility. Most existing …

[HTML][HTML] A survey on multi-agent reinforcement learning and its application

Z Ning, L Xie - Journal of Automation and Intelligence, 2024 - Elsevier
Multi-agent reinforcement learning (MARL) has been a rapidly evolving field. This paper
presents a comprehensive survey of MARL and its applications. We trace the historical …

Sample and communication-efficient decentralized actor-critic algorithms with finite-time analysis

Z Chen, Y Zhou, RR Chen… - … Conference on Machine …, 2022 - proceedings.mlr.press
Actor-critic (AC) algorithms have been widely used in decentralized multi-agent systems to
learn the optimal joint control policy. However, existing decentralized AC algorithms either …

On centralized critics in multi-agent reinforcement learning

X Lyu, A Baisero, Y Xiao, B Daley, C Amato - Journal of Artificial Intelligence …, 2023 - jair.org
Abstract Centralized Training for Decentralized Execution, where agents are trained offline
in a centralized fashion and execute online in a decentralized manner, has become a …

A deeper understanding of state-based critics in multi-agent reinforcement learning

X Lyu, A Baisero, Y Xiao, C Amato - … of the AAAI conference on artificial …, 2022 - ojs.aaai.org
Abstract Centralized Training for Decentralized Execution, where training is done in a
centralized offline fashion, has become a popular solution paradigm in Multi-Agent …

HSVI can solve zero-sum partially observable stochastic games

A Delage, O Buffet, JS Dibangoye… - Dynamic Games and …, 2024 - Springer
State-of-the-art methods for solving 2-player zero-sum imperfect information games rely on
linear programming or regret minimization, though not on dynamic programming (DP) or …