Deep multiagent reinforcement learning: Challenges and directions
This paper surveys the field of deep multiagent reinforcement learning (RL). The
combination of deep neural networks with RL has gained increased traction in recent years …
combination of deep neural networks with RL has gained increased traction in recent years …
A survey of learning in multiagent environments: Dealing with non-stationarity
The key challenge in multiagent learning is learning a best response to the behaviour of
other agents, which may be non-stationary: if the other agents adapt their strategy as well …
other agents, which may be non-stationary: if the other agents adapt their strategy as well …
A deep bayesian policy reuse approach against non-stationary agents
In multiagent domains, coping with non-stationary agents that change behaviors from time to
time is a challenging problem, where an agent is usually required to be able to quickly …
time is a challenging problem, where an agent is usually required to be able to quickly …
Detecting and learning against unknown opponents for automated negotiations
Learning in automated negotiations, while successful for many tasks in recent years, is still
hard when coping with different types of opponents with unknown strategies. It is critically …
hard when coping with different types of opponents with unknown strategies. It is critically …
Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households
Despite the significant demand for assistive technology among vulnerable groups (eg the
elderly children and the disabled) in daily tasks research into advanced AI-driven assistive …
elderly children and the disabled) in daily tasks research into advanced AI-driven assistive …
Conditional imitation learning for multi-agent games
While advances in multi-agent learning have enabled the training of increasingly complex
agents, most existing techniques produce a final policy that is not designed to adapt to a …
agents, most existing techniques produce a final policy that is not designed to adapt to a …
Towards cooperation in sequential prisoner's dilemmas: a deep multiagent reinforcement learning approach
The Iterated Prisoner's Dilemma has guided research on social dilemmas for decades.
However, it distinguishes between only two atomic actions: cooperate and defect. In real …
However, it distinguishes between only two atomic actions: cooperate and defect. In real …
[HTML][HTML] Efficiently detecting switches against non-stationary opponents
P Hernandez-Leal, Y Zhan, ME Taylor… - Autonomous Agents and …, 2017 - Springer
Interactions in multiagent systems are generally more complicated than single agent ones.
Game theory provides solutions on how to act in multiagent scenarios; however, it assumes …
Game theory provides solutions on how to act in multiagent scenarios; however, it assumes …
Research progress of opponent modeling based on deep reinforcement learning
H Xu, L Qin, J Zeng, Y Hu… - Journal of …, 2023 - dc-china-simulation …
Deep reinforcement learning is an agent modeling method with both deep learning feature
extraction ability and reinforcement learning sequence decision-making ability, which can …
extraction ability and reinforcement learning sequence decision-making ability, which can …
Learning others' intentional models in multi-agent settings using interactive POMDPs
Y Han, P Gmytrasiewicz - Advances in Neural Information …, 2018 - proceedings.neurips.cc
Interactive partially observable Markov decision processes (I-POMDPs) provide a principled
framework for planning and acting in a partially observable, stochastic and multi-agent …
framework for planning and acting in a partially observable, stochastic and multi-agent …