Reinforcement learning to play an optimal Nash equilibrium in team Markov games

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

被引用次数：1638 相关文章所有 8 个版本

[PDF] arxiv.org

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org

Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

被引用次数：340 相关文章所有 2 个版本

[PDF] jair.org

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

被引用次数：320 相关文章所有 9 个版本

[PDF] mlr.press

Fully decentralized multi-agent reinforcement learning with networked agents

K Zhang, Z Yang, H Liu, T Zhang… - … conference on machine …, 2018 - proceedings.mlr.press

We consider the fully decentralized multi-agent reinforcement learning (MARL) problem,
where the agents are connected via a time-varying and possibly sparse communication …

被引用次数：720 相关文章所有 8 个版本

[PDF] oulu.fi

Communication-efficient and distributed learning over wireless networks: Principles and applications

J Park, S Samarakoon, A Elgabli, J Kim… - Proceedings of the …, 2021 - ieeexplore.ieee.org

Machine learning (ML) is a promising enabler for the fifth-generation (5G) communication
systems and beyond. By imbuing intelligence into the network edge, edge nodes can …

被引用次数：193 相关文章所有 9 个版本

[PDF] neurips.cc

Learning multiagent communication with backpropagation

S Sukhbaatar, R Fergus - Advances in neural information …, 2016 - proceedings.neurips.cc

Many tasks in AI require the collaboration of multiple agents. Typically, the communication
protocol between agents is manually specified and not altered during training. In this paper …

被引用次数：1380 相关文章所有 9 个版本

[PDF] mlr.press

Independent policy gradient for large-scale markov potential games: Sharper rates, function approximation, and game-agnostic convergence

D Ding, CY Wei, K Zhang… - … Conference on Machine …, 2022 - proceedings.mlr.press

We examine global non-asymptotic convergence properties of policy gradient methods for
multi-agent reinforcement learning (RL) problems in Markov potential games (MPGs). To …

被引用次数：85 相关文章所有 8 个版本

[PDF] arxiv.org

Global convergence of multi-agent policy gradient in markov potential games

S Leonardos, W Overman, I Panageas… - arXiv preprint arXiv …, 2021 - arxiv.org

Potential games are arguably one of the most important and widely studied classes of
normal form games. They define the archetypal setting of multi-agent coordination as all …

被引用次数：141 相关文章所有 7 个版本

Reinforcement learning approach for optimal distributed energy management in a microgrid

E Foruzan, LK Soh, S Asgarpoor - IEEE Transactions on Power …, 2018 - ieeexplore.ieee.org

In this paper, a multiagent-based model is used to study distributed energy management in
a microgrid (MG). The suppliers and consumers of electricity are modeled as autonomous …

被引用次数：297 相关文章所有 4 个版本

Multi-UAV trajectory planning for energy-efficient content coverage: A decentralized learning-based approach

C Zhao, J Liu, M Sheng, W Teng… - IEEE Journal on …, 2021 - ieeexplore.ieee.org

In next-generation wireless networks, high-mobility unmanned aerial vehicles (UAVs) are
promising to provide content coverage, where users can receive sufficient requested content …

被引用次数：111 相关文章所有 2 个版本