Decision-theoretic planning under anonymity in agent populations

P Doshi, P Gmytrasiewicz, E Durfee - Artificial Intelligence, 2020 - Elsevier

Individuals exhibit theory of mind, attributing beliefs, intent, and mental states to others as
explanations of observed actions. Dennett's intentional stance offers an analogous …

被引用次数：32 相关文章所有 3 个版本

[PDF] mlr.press

Learning while playing in mean-field games: Convergence and optimality

Q Xie, Z Yang, Z Wang, A Minca - … Conference on Machine …, 2021 - proceedings.mlr.press

We study reinforcement learning in mean-field games. To achieve the Nash equilibrium,
which consists of a policy and a mean-field state, existing algorithms require obtaining the …

被引用次数：42 相关文章所有 5 个版本

[PDF] mlr.press

Oracle-free reinforcement learning in mean-field games along a single sample path

MAU Zaman, A Koppel, S Bhatt… - … Conference on Artificial …, 2023 - proceedings.mlr.press

We consider online reinforcement learning in Mean-Field Games (MFGs). Unlike traditional
approaches, we alleviate the need for a mean-field oracle by developing an algorithm that …

被引用次数：21 相关文章所有 5 个版本

[PDF] neurips.cc

Learning regularized monotone graphon mean-field games

F Zhang, V Tan, Z Wang… - Advances in Neural …, 2024 - proceedings.neurips.cc

This paper studies two fundamental problems in regularized Graphon Mean-Field Games
(GMFGs). First, we establish the existence of a Nash Equilibrium (NE) of any $\lambda …

被引用次数：7 相关文章所有 5 个版本

[PDF] arxiv.org

Provable fictitious play for general mean-field games

Q Xie, Z Yang, Z Wang, A Minca - arXiv preprint arXiv:2010.04211, 2020 - arxiv.org

We propose a reinforcement learning algorithm for stationary mean-field games, where the
goal is to learn a pair of mean-field state and stationary policy that constitutes the Nash …

被引用次数：20 相关文章所有 3 个版本

[PDF] aaai.org

Scalable decision-theoretic planning in open and typed multiagent systems

A Eck, M Shah, P Doshi, LK Soh - Proceedings of the AAAI Conference on …, 2020 - aaai.org

In open agent systems, the set of agents that are cooperating or competing changes over
time and in ways that are nontrivial to predict. For example, if collaborative robots were …

被引用次数：22 相关文章所有 14 个版本

[PDF] mlr.press

Reinforcement learning in many-agent settings under partial observability

K He, P Doshi, B Banerjee - Uncertainty in Artificial …, 2022 - proceedings.mlr.press

Recent renewed interest in multi-agent reinforcement learning (MARL) has generated an
impressive array of techniques that leverage deep RL, primarily actor-critic architectures …

被引用次数：3 相关文章所有 4 个版本

[PDF] arxiv.org

Regularization of the policy updates for stabilizing Mean Field Games

T Algumaei, R Solozabal, R Alami, H Hacid… - Pacific-Asia Conference …, 2023 - Springer

This work studies non-cooperative Multi-Agent Reinforcement Learning (MARL) where
multiple agents interact in the same environment and whose goal is to maximize the …

被引用次数：1 相关文章所有 5 个版本

[PDF] kcl.ac.uk

Catastrophe by design in population games: a mechanism to destabilize inefficient locked-in technologies

S Leonardos, J Sakos, C Courcoubetis… - ACM Transactions on …, 2023 - dl.acm.org

In multi-agent environments in which coordination is desirable, the history of play often
causes lock-in at sub-optimal outcomes. Notoriously, technologies with significant …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

Many Agent Reinforcement Learning Under Partial Observability

K He, P Doshi, B Banerjee - arXiv preprint arXiv:2106.09825, 2021 - arxiv.org

Recent renewed interest in multi-agent reinforcement learning (MARL) has generated an
impressive array of techniques that leverage deep reinforcement learning, primarily actor …

被引用次数：1 相关文章所有 2 个版本