A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arXiv preprint arXiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

[HTML][HTML] A review of scalable and privacy-preserving multi-agent frameworks for distributed energy resources

X Huo, H Huang, KR Davis, HV Poor, M Liu - Advances in Applied Energy, 2024 - Elsevier
Distributed energy resources (DERs) are gaining prominence due to their advantages in
improving energy efficiency, reducing carbon emissions, and enhancing grid resilience …

Multi-agent reinforcement learning for autonomous driving: A survey

R Zhang, J Hou, F Walter, S Gu, J Guan… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement Learning (RL) is a potent tool for sequential decision-making and has
achieved performance surpassing human capabilities across many challenging real-world …

On the Hardness of Constrained Cooperative Multi-Agent Reinforcement Learning

Z Chen, Y Zhou, H Huang - The Twelfth International Conference on …, 2024 - openreview.net
Constrained cooperative multi-agent reinforcement learning (MARL) is an emerging
learning framework that has been widely applied to manage multi-agent systems, and many …

Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium

Z Li, N Azizan - arXiv preprint arXiv:2411.15036, 2024 - arxiv.org
Multi-agent reinforcement learning (MARL) has achieved notable success in cooperative
tasks, demonstrating impressive performance and scalability. However, deploying MARL …

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP

S Mukherjee, JP Hanna, R Nowak - arXiv preprint arXiv:2406.02165, 2024 - arxiv.org
In this paper, we study safe data collection for the purpose of policy evaluation in tabular
Markov decision processes (MDPs). In policy evaluation, we are given a\textit {target} policy …

Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning

Z Liu, X Yang, Z Liu, Y Xia, W Jiang, Y Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Multi-agent reinforcement learning (MARL) is employed to develop autonomous agents that
can learn to adopt cooperative or competitive strategies within complex environments …

[PDF][PDF] On the duality gap of constrained cooperative multi-agent reinforcement learning

ZY Chen, Y Zhou, H Huang - … of the 12th International Conference on …, 2024 - sci.utah.edu
Constrained cooperative multi-agent reinforcement learning (MARL) is an emerging
learning framework that has been widely applied to manage multi-agent systems, and many …

On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

A Barakat, S Chakraborty, P Yu, P Tokekar… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement learning with general utilities has recently gained attention thanks to its
ability to unify several problems, including imitation learning, pure exploration, and safe RL …

Scalable multi-agent reinforcement learning with general utilities

D Ying, Y Ding, A Koppel… - 2023 American Control …, 2023 - ieeexplore.ieee.org
We study the scalable multi-agent reinforcement learning (MARL) with general utilities,
defined as nonlinear functions of the team's long-term state-action occupancy measure. The …