Text2motion: From natural language instructions to feasible plans

K Lin, C Agia, T Migimatsu, M Pavone, J Bohg - Autonomous Robots, 2023 - Springer
Abstract We propose Text2Motion, a language-based planning framework enabling robots
to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural …

Sayplan: Grounding large language models using 3d scene graphs for scalable task planning

K Rana, J Haviland, S Garg, J Abou-Chakra, ID Reid… - CoRR, 2023 - openreview.net
Large language models (LLMs) have demonstrated impressive results in developing
generalist planning agents for diverse tasks. However, grounding these plans in expansive …

Learning multi-object dynamics with compositional neural radiance fields

D Driess, Z Huang, Y Li, R Tedrake… - Conference on robot …, 2023 - proceedings.mlr.press
We present a method to learn compositional multi-object dynamics models from image
observations based on implicit object encoders, Neural Radiance Fields (NeRFs), and …

Algorithms and systems for manipulating multiple objects

Z Pan, A Zeng, Y Li, J Yu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Robot manipulation of multiple objects is an important topic for applications including
warehouse automation, service robots performing cleaning, and large-scale object sorting …

Sayplan: Grounding large language models using 3d scene graphs for scalable robot task planning

K Rana, J Haviland, S Garg, J Abou-Chakra… - … Conference on Robot …, 2023 - openreview.net
Large language models (LLMs) have demonstrated impressive results in developing
generalist planning agents for diverse tasks. However, grounding these plans in expansive …

Robocook: Long-horizon elasto-plastic object manipulation with diverse tools

H Shi, H Xu, S Clarke, Y Li, J Wu - arXiv preprint arXiv:2306.14447, 2023 - arxiv.org
Humans excel in complex long-horizon soft body manipulation tasks via flexible tool use:
bread baking requires a knife to slice the dough and a rolling pin to flatten it. Often regarded …

Taskography: Evaluating robot task planning over large 3d scene graphs

C Agia, KM Jatavallabhula, M Khodeir… - … on Robot Learning, 2022 - proceedings.mlr.press
Abstract 3D scene graphs (3DSGs) are an emerging description; unifying symbolic,
topological, and metric scene representations. However, typical 3DSGs contain hundreds of …

Learning neuro-symbolic relational transition models for bilevel planning

R Chitnis, T Silver, JB Tenenbaum… - 2022 IEEE/RSJ …, 2022 - ieeexplore.ieee.org
In robotic domains, learning and planning are complicated by continuous state spaces,
continuous action spaces, and long task horizons. In this work, we address these challenges …

PDDLGym: Gym environments from PDDL problems

T Silver, R Chitnis - arXiv preprint arXiv:2002.06432, 2020 - arxiv.org
We present PDDLGym, a framework that automatically constructs OpenAI Gym
environments from PDDL domains and problems. Observations and actions in PDDLGym …

Efficient and interpretable robot manipulation with graph neural networks

Y Lin, AS Wang, E Undersander… - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
Manipulation tasks, like loading a dishwasher, can be seen as a sequence of spatial
constraints and relationships between different objects. We aim to discover these rules from …