Robot learning in the era of foundation models: A survey
The proliferation of Large Language Models (LLMs) has s fueled a shift in robot learning
from automation towards general embodied Artificial Intelligence (AI). Adopting foundation …
from automation towards general embodied Artificial Intelligence (AI). Adopting foundation …
Distilling and retrieving generalizable knowledge for robot manipulation via language corrections
Today's robot policies exhibit subpar performance when faced with the challenge of
generalizing to novel environments. Human corrective feedback is a crucial form of …
generalizing to novel environments. Human corrective feedback is a crucial form of …
Prompt a robot to walk with large language models
Large language models (LLMs) pre-trained on vast internet-scale data have showcased
remarkable capabilities across diverse domains. Recently, there has been escalating …
remarkable capabilities across diverse domains. Recently, there has been escalating …
Robot utility models: General policies for zero-shot deployment in new environments
Robot models, particularly those trained with large amounts of data, have recently shown a
plethora of real-world manipulation and navigation capabilities. Several independent efforts …
plethora of real-world manipulation and navigation capabilities. Several independent efforts …
Prompt, plan, perform: Llm-based humanoid control via quantized imitation learning
In recent years, reinforcement learning and imitation learning have shown great potential for
controlling humanoid robots' motion. However, these methods typically create simulation …
controlling humanoid robots' motion. However, these methods typically create simulation …
Towards efficient llm grounding for embodied multi-agent collaboration
Grounding the reasoning ability of large language models (LLMs) for embodied tasks is
challenging due to the complexity of the physical world. Especially, LLM planning for multi …
challenging due to the complexity of the physical world. Especially, LLM planning for multi …
Closed-loop open-vocabulary mobile manipulation with gpt-4v
Autonomous robot navigation and manipulation in open environments require reasoning
and replanning with closed-loop feedback. We present COME-robot, the first closed-loop …
and replanning with closed-loop feedback. We present COME-robot, the first closed-loop …
Self-recovery prompting: Promptable general purpose service robot system with foundation models and self-recovery
M Shirasaka, T Matsushima… - … on Robotics and …, 2024 - ieeexplore.ieee.org
A general-purpose service robot (GPSR), which can execute diverse tasks in various
environments, requires a system with high generalizability and adaptability to tasks and …
environments, requires a system with high generalizability and adaptability to tasks and …
LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation
The convergence of embodied agents and large language models (LLMs) has brought
significant advancements to embodied instruction following. Particularly, the strong …
significant advancements to embodied instruction following. Particularly, the strong …
Autonomous interactive correction MLLM for robust robotic manipulation
The ability to reflect on and correct failures is crucial for robotic systems to interact stably with
real-life objects. Observing the generalization and reasoning capabilities of Multimodal …
real-life objects. Observing the generalization and reasoning capabilities of Multimodal …