Efficient multi-agent communication via self-supervised information aggregation
Utilizing messages from teammates can improve coordination in cooperative Multi-agent
Reinforcement Learning (MARL). To obtain meaningful information for decision-making …
ACE: Cooperative multi-agent Q-learning with bidirectional action-dependency
Multi-agent reinforcement learning (MARL) suffers from the non-stationarity problem, which
is the ever-changing targets at every iteration when multiple agents update their policies at …
Order matters: Agent-by-agent policy optimization
While multi-agent trust region algorithms have achieved great success empirically in solving
coordination tasks, most of them, however, suffer from a non-stationarity problem since …
DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing Problems
The min-max vehicle routing problem (min-max VRP) traverses all given customers by
assigning several routes and aims to minimize the length of the longest route. Recently …
Robust cooperative multi-agent reinforcement learning via multi-view message certification
Many multi-agent scenarios require message sharing among agents to promote
coordination, hastening the robustness of multi-agent communication when policies are …
Robust Multi-agent Communication via Multi-view Message Certification
Many multi-agent scenarios require message sharing among agents to promote
coordination, hastening the robustness of multi-agent communication when policies are …
Efficient Communication via Self-Supervised Information Aggregation for Online and Offline Multiagent Reinforcement Learning
Utilizing messages from teammates can improve coordination in cooperative multiagent
reinforcement learning (MARL). Previous works typically combine raw messages of …