Partially observable markov decision processes in robotics: A survey
Noisy sensing, imperfect control, and environment changes are defining characteristics of
many real-world robot tasks. The partially observable Markov decision process (POMDP) …
many real-world robot tasks. The partially observable Markov decision process (POMDP) …
Model-based multi-agent reinforcement learning: Recent progress and prospects
Significant advances have recently been achieved in Multi-Agent Reinforcement Learning
(MARL) which tackles sequential decision-making problems involving multiple participants …
(MARL) which tackles sequential decision-making problems involving multiple participants …
A survey of progress on cooperative multi-agent reinforcement learning in open environment
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …
has made progress in various fields. Specifically, cooperative MARL focuses on training a …
Asynchronous actor-critic for multi-agent reinforcement learning
Synchronizing decisions across multiple agents in realistic settings is problematic since it
requires agents to wait for other agents to terminate and communicate about termination …
requires agents to wait for other agents to terminate and communicate about termination …
Modeling, replicating, and predicting human behavior: a survey
Given the popular presupposition of human reasoning as the standard for learning and
decision making, there have been significant efforts and a growing trend in research to …
decision making, there have been significant efforts and a growing trend in research to …
[HTML][HTML] Learning scalable and efficient communication policies for multi-robot collision avoidance
Decentralized multi-robot systems typically perform coordinated motion planning by
constantly broadcasting their intentions to avoid collisions. However, the risk of collision …
constantly broadcasting their intentions to avoid collisions. However, the risk of collision …
[HTML][HTML] Efficient and scalable reinforcement learning for large-scale network control
The primary challenge in the development of large-scale artificial intelligence (AI) systems
lies in achieving scalable decision-making—extending the AI models while maintaining …
lies in achieving scalable decision-making—extending the AI models while maintaining …
Centralized model and exploration policy for multi-agent RL
Reinforcement learning (RL) in partially observable, fully cooperative multi-agent settings
(Dec-POMDPs) can in principle be used to address many real-world challenges such as …
(Dec-POMDPs) can in principle be used to address many real-world challenges such as …
Decentralized, unlabeled multi-agent navigation in obstacle-rich environments using graph neural networks
We propose a decentralized, learning-based solution to the challenging problem of
unlabeled multi-agent navigation among obstacles, where robots need to simultaneously …
unlabeled multi-agent navigation among obstacles, where robots need to simultaneously …
Learning pneumatic non-prehensile manipulation with a mobile blower
We investigate pneumatic non-prehensile manipulation (ie, blowing) as a means of
efficiently moving scattered objects into a target receptacle. Due to the chaotic nature of …
efficiently moving scattered objects into a target receptacle. Due to the chaotic nature of …