Recursively modeling other agents for decision making: A research perspective

P Doshi, P Gmytrasiewicz, E Durfee - Artificial Intelligence, 2020 - Elsevier
Individuals exhibit theory of mind, attributing beliefs, intent, and mental states to others as
explanations of observed actions. Dennett's intentional stance offers an analogous …

Learning while playing in mean-field games: Convergence and optimality

Q Xie, Z Yang, Z Wang, A Minca - … Conference on Machine …, 2021 - proceedings.mlr.press
We study reinforcement learning in mean-field games. To achieve the Nash equilibrium,
which consists of a policy and a mean-field state, existing algorithms require obtaining the …

Oracle-free reinforcement learning in mean-field games along a single sample path

MAU Zaman, A Koppel, S Bhatt… - … Conference on Artificial …, 2023 - proceedings.mlr.press
We consider online reinforcement learning in Mean-Field Games (MFGs). Unlike traditional
approaches, we alleviate the need for a mean-field oracle by developing an algorithm that …

Learning regularized monotone graphon mean-field games

F Zhang, V Tan, Z Wang… - Advances in Neural …, 2024 - proceedings.neurips.cc
This paper studies two fundamental problems in regularized Graphon Mean-Field Games
(GMFGs). First, we establish the existence of a Nash Equilibrium (NE) of any $\lambda …

Provable fictitious play for general mean-field games

Q Xie, Z Yang, Z Wang, A Minca - arXiv preprint arXiv:2010.04211, 2020 - arxiv.org
We propose a reinforcement learning algorithm for stationary mean-field games, where the
goal is to learn a pair of mean-field state and stationary policy that constitutes the Nash …

Scalable decision-theoretic planning in open and typed multiagent systems

A Eck, M Shah, P Doshi, LK Soh - Proceedings of the AAAI Conference on …, 2020 - aaai.org
In open agent systems, the set of agents that are cooperating or competing changes over
time and in ways that are nontrivial to predict. For example, if collaborative robots were …

Reinforcement learning in many-agent settings under partial observability

K He, P Doshi, B Banerjee - Uncertainty in Artificial …, 2022 - proceedings.mlr.press
Recent renewed interest in multi-agent reinforcement learning (MARL) has generated an
impressive array of techniques that leverage deep RL, primarily actor-critic architectures …

Regularization of the policy updates for stabilizing Mean Field Games

T Algumaei, R Solozabal, R Alami, H Hacid… - Pacific-Asia Conference …, 2023 - Springer
This work studies non-cooperative Multi-Agent Reinforcement Learning (MARL) where
multiple agents interact in the same environment and whose goal is to maximize the …

Catastrophe by design in population games: a mechanism to destabilize inefficient locked-in technologies

S Leonardos, J Sakos, C Courcoubetis… - ACM Transactions on …, 2023 - dl.acm.org
In multi-agent environments in which coordination is desirable, the history of play often
causes lock-in at sub-optimal outcomes. Notoriously, technologies with significant …

Many Agent Reinforcement Learning Under Partial Observability

K He, P Doshi, B Banerjee - arXiv preprint arXiv:2106.09825, 2021 - arxiv.org
Recent renewed interest in multi-agent reinforcement learning (MARL) has generated an
impressive array of techniques that leverage deep reinforcement learning, primarily actor …