Scalable multi-task imitation learning with autonomous improvement

A Brohan, N Brown, J Carbajal, Y Chebotar… - arXiv preprint arXiv …, 2022 - arxiv.org

By transferring knowledge from large, diverse, task-agnostic datasets, modern machine
learning models can solve specific downstream tasks either zero-shot or with small task …

被引用次数：883 相关文章所有 3 个版本

[PDF] arxiv.org

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org

While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

被引用次数：163 相关文章所有 2 个版本

[PDF] arxiv.org

Mt-opt: Continuous multi-task robotic reinforcement learning at scale

D Kalashnikov, J Varley, Y Chebotar… - arXiv preprint arXiv …, 2021 - arxiv.org

General-purpose robotic systems must master a large repertoire of diverse skills to be useful
in a range of daily tasks. While reinforcement learning provides a powerful framework for …

被引用次数：168 相关文章所有 2 个版本

[PDF] arxiv.org

Language conditioned imitation learning over unstructured data

C Lynch, P Sermanet - arXiv preprint arXiv:2005.07648, 2020 - arxiv.org

Natural language is perhaps the most flexible and intuitive way for humans to communicate
tasks to a robot. Prior work in imitation learning typically requires each task be specified with …

被引用次数：242 相关文章所有 8 个版本

[PDF] arxiv.org

Learning generalizable robotic reward functions from" in-the-wild" human videos

AS Chen, S Nair, C Finn - arXiv preprint arXiv:2103.16817, 2021 - arxiv.org

We are motivated by the goal of generalist robots that can complete a wide range of tasks
across many environments. Critical to this is the robot's ability to acquire some metric of task …

被引用次数：117 相关文章所有 7 个版本

[PDF] thecvf.com

Skilldiffuser: Interpretable hierarchical planning via skill abstractions in diffusion-based task execution

Z Liang, Y Mu, H Ma, M Tomizuka… - Proceedings of the …, 2024 - openaccess.thecvf.com

Diffusion models have demonstrated strong potential for robotic trajectory planning.
However generating coherent trajectories from high-level instructions remains challenging …

被引用次数：25 相关文章所有 3 个版本

Deep reinforcement learning from demonstrations to assist service restoration in islanded microgrids

Y Du, D Wu - IEEE Transactions on Sustainable Energy, 2022 - ieeexplore.ieee.org

Microgrids can be operated in island mode during utility grid outages to support service
restoration and improve system resilience. To schedule and dispatch distributed energy …

被引用次数：69 相关文章所有 2 个版本

[PDF] mlr.press

Imitation learning by estimating expertise of demonstrators

M Beliaev, A Shih, S Ermon, D Sadigh… - International …, 2022 - proceedings.mlr.press

Many existing imitation learning datasets are collected from multiple demonstrators, each
with different expertise at different parts of the environment. Yet, standard imitation learning …

被引用次数：51 相关文章所有 8 个版本

[PDF] arxiv.org

Towards more generalizable one-shot visual imitation learning

Z Mandi, F Liu, K Lee, P Abbeel - … International Conference on …, 2022 - ieeexplore.ieee.org

A general-purpose robot should be able to master a wide range of tasks and quickly learn a
novel one by leveraging past experiences. One-shot imitation learning (OSIL) approaches …

被引用次数：65 相关文章所有 5 个版本

[PDF] arxiv.org

Aligning robot and human representations

A Bobu, A Peng, P Agrawal, J Shah… - arXiv preprint arXiv …, 2023 - arxiv.org

To act in the world, robots rely on a representation of salient task aspects: for example, to
carry a coffee mug, a robot may consider movement efficiency or mug orientation in its …

被引用次数：30 相关文章所有 3 个版本