Instruction-driven history-aware policies for robotic manipulations

PL Guhur, S Chen, RG Pinel… - … on Robot Learning, 2023 - proceedings.mlr.press
In human environments, robots are expected to accomplish a variety of manipulation tasks
given simple natural language instructions. Yet, robotic manipulation is extremely …

A survey on explainable reinforcement learning: Concepts, algorithms, challenges

Y Qing, S Liu, J Song, H Wang, M Song - arXiv preprint arXiv:2211.06665, 2022 - arxiv.org
Reinforcement Learning (RL) is a popular machine learning paradigm where intelligent
agents interact with the environment to fulfill a long-term goal. Driven by the resurgence of …

Skilldiffuser: Interpretable hierarchical planning via skill abstractions in diffusion-based task execution

Z Liang, Y Mu, H Ma, M Tomizuka… - Proceedings of the …, 2024 - openaccess.thecvf.com
Diffusion models have demonstrated strong potential for robotic trajectory planning.
However generating coherent trajectories from high-level instructions remains challenging …

Monotonic location attention for length generalization

JR Chowdhury, C Caragea - International Conference on …, 2023 - proceedings.mlr.press
We explore different ways to utilize position-based cross-attention in seq2seq networks to
enable length generalization in algorithmic tasks. We show that a simple approach of …

DiffVL: scaling up soft body manipulation using vision-language driven differentiable physics

Z Huang, F Chen, Y Pu, C Lin… - Advances in Neural …, 2023 - proceedings.neurips.cc
Combining gradient-based trajectory optimization with differentiable physics simulation is an
efficient technique for solving soft-body manipulation problems. Using a well-crafted …

One-shot imitation in a non-stationary environment via multi-modal skill

S Shin, D Lee, M Yoo, WK Kim… - … Conference on Machine …, 2023 - proceedings.mlr.press
One-shot imitation is to learn a new task from a single demonstration, yet it is a challenging
problem to adopt it for complex tasks with the high domain diversity inherent in a non …

Large language models can implement policy iteration

E Brooks, L Walls, RL Lewis… - Advances in Neural …, 2024 - proceedings.neurips.cc
In this work, we demonstrate a method for implementing policy iteration using a large
language model. While the application of foundation models to RL has received …

In-context policy iteration

E Brooks, LA Walls, R Lewis, S Singh - NeurIPS 2022 Foundation …, 2022 - openreview.net
This work presents In-Context Policy Iteration, an algorithm for performing Reinforcement
Learning (RL), in-context, using foundation models. While the application of foundation …

Grounding Language Plans in Demonstrations Through Counterfactual Perturbations

Y Wang, TH Wang, J Mao, M Hagenow… - arXiv preprint arXiv …, 2024 - arxiv.org
Grounding the common-sense reasoning of Large Language Models in physical domains
remains a pivotal yet unsolved problem for embodied AI. Whereas prior works have focused …

Language-Conditioned Affordance-Pose Detection in 3D Point Clouds

T Nguyen, MN Vu, B Huang, T Van Vo… - arXiv preprint arXiv …, 2023 - arxiv.org
Affordance detection and pose estimation are of great importance in many robotic
applications. Their combination helps the robot gain an enhanced manipulation capability …