Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org
Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Fully decentralized multi-agent reinforcement learning with networked agents

K Zhang, Z Yang, H Liu, T Zhang… - … conference on machine …, 2018 - proceedings.mlr.press
We consider the fully decentralized multi-agent reinforcement learning (MARL) problem,
where the agents are connected via a time-varying and possibly sparse communication …

Communication-efficient and distributed learning over wireless networks: Principles and applications

J Park, S Samarakoon, A Elgabli, J Kim… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Machine learning (ML) is a promising enabler for the fifth-generation (5G) communication
systems and beyond. By imbuing intelligence into the network edge, edge nodes can …

Learning multiagent communication with backpropagation

S Sukhbaatar, R Fergus - Advances in neural information …, 2016 - proceedings.neurips.cc
Many tasks in AI require the collaboration of multiple agents. Typically, the communication
protocol between agents is manually specified and not altered during training. In this paper …

Independent policy gradient for large-scale markov potential games: Sharper rates, function approximation, and game-agnostic convergence

D Ding, CY Wei, K Zhang… - … Conference on Machine …, 2022 - proceedings.mlr.press
We examine global non-asymptotic convergence properties of policy gradient methods for
multi-agent reinforcement learning (RL) problems in Markov potential games (MPGs). To …

Global convergence of multi-agent policy gradient in markov potential games

S Leonardos, W Overman, I Panageas… - arXiv preprint arXiv …, 2021 - arxiv.org
Potential games are arguably one of the most important and widely studied classes of
normal form games. They define the archetypal setting of multi-agent coordination as all …

Reinforcement learning approach for optimal distributed energy management in a microgrid

E Foruzan, LK Soh, S Asgarpoor - IEEE Transactions on Power …, 2018 - ieeexplore.ieee.org
In this paper, a multiagent-based model is used to study distributed energy management in
a microgrid (MG). The suppliers and consumers of electricity are modeled as autonomous …

Multi-UAV trajectory planning for energy-efficient content coverage: A decentralized learning-based approach

C Zhao, J Liu, M Sheng, W Teng… - IEEE Journal on …, 2021 - ieeexplore.ieee.org
In next-generation wireless networks, high-mobility unmanned aerial vehicles (UAVs) are
promising to provide content coverage, where users can receive sufficient requested content …