Rt-1: Robotics transformer for real-world control at scale

A Brohan, N Brown, J Carbajal, Y Chebotar… - arXiv preprint arXiv …, 2022 - arxiv.org
By transferring knowledge from large, diverse, task-agnostic datasets, modern machine
learning models can solve specific downstream tasks either zero-shot or with small task …

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

Mt-opt: Continuous multi-task robotic reinforcement learning at scale

D Kalashnikov, J Varley, Y Chebotar… - arXiv preprint arXiv …, 2021 - arxiv.org
General-purpose robotic systems must master a large repertoire of diverse skills to be useful
in a range of daily tasks. While reinforcement learning provides a powerful framework for …

Language conditioned imitation learning over unstructured data

C Lynch, P Sermanet - arXiv preprint arXiv:2005.07648, 2020 - arxiv.org
Natural language is perhaps the most flexible and intuitive way for humans to communicate
tasks to a robot. Prior work in imitation learning typically requires each task be specified with …

Learning generalizable robotic reward functions from" in-the-wild" human videos

AS Chen, S Nair, C Finn - arXiv preprint arXiv:2103.16817, 2021 - arxiv.org
We are motivated by the goal of generalist robots that can complete a wide range of tasks
across many environments. Critical to this is the robot's ability to acquire some metric of task …

Skilldiffuser: Interpretable hierarchical planning via skill abstractions in diffusion-based task execution

Z Liang, Y Mu, H Ma, M Tomizuka… - Proceedings of the …, 2024 - openaccess.thecvf.com
Diffusion models have demonstrated strong potential for robotic trajectory planning.
However generating coherent trajectories from high-level instructions remains challenging …

Deep reinforcement learning from demonstrations to assist service restoration in islanded microgrids

Y Du, D Wu - IEEE Transactions on Sustainable Energy, 2022 - ieeexplore.ieee.org
Microgrids can be operated in island mode during utility grid outages to support service
restoration and improve system resilience. To schedule and dispatch distributed energy …

Imitation learning by estimating expertise of demonstrators

M Beliaev, A Shih, S Ermon, D Sadigh… - International …, 2022 - proceedings.mlr.press
Many existing imitation learning datasets are collected from multiple demonstrators, each
with different expertise at different parts of the environment. Yet, standard imitation learning …

Towards more generalizable one-shot visual imitation learning

Z Mandi, F Liu, K Lee, P Abbeel - … International Conference on …, 2022 - ieeexplore.ieee.org
A general-purpose robot should be able to master a wide range of tasks and quickly learn a
novel one by leveraging past experiences. One-shot imitation learning (OSIL) approaches …

Aligning robot and human representations

A Bobu, A Peng, P Agrawal, J Shah… - arXiv preprint arXiv …, 2023 - arxiv.org
To act in the world, robots rely on a representation of salient task aspects: for example, to
carry a coffee mug, a robot may consider movement efficiency or mug orientation in its …