Affordances from human videos as a versatile representation for robotics

S Bahl, R Mendonca, L Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Building a robot that can understand and learn to interact by watching humans has inspired
several vision problems. However, despite some successful results on static datasets, it …

Combo: Conservative offline model-based policy optimization

T Yu, A Kumar, R Rafailov… - Advances in neural …, 2021 - proceedings.neurips.cc
Abstract Model-based reinforcement learning (RL) algorithms, which learn a dynamics
model from logged experience and perform conservative planning under the learned model …

Rambo-rl: Robust adversarial model-based offline reinforcement learning

M Rigter, B Lacerda, N Hawes - Advances in neural …, 2022 - proceedings.neurips.cc
Offline reinforcement learning (RL) aims to find performant policies from logged data without
further environment interaction. Model-based algorithms, which learn a model of the …

Mildly conservative q-learning for offline reinforcement learning

J Lyu, X Ma, X Li, Z Lu - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Offline reinforcement learning (RL) defines the task of learning from a static logged dataset
without continually interacting with the environment. The distribution shift between the …

Zero-shot robotic manipulation with pretrained image-editing diffusion models

K Black, M Nakamoto, P Atreya, H Walke… - arXiv preprint arXiv …, 2023 - arxiv.org
If generalist robots are to operate in truly unstructured environments, they need to be able to
recognize and reason about novel objects and scenarios. Such objects and scenarios might …

How to leverage unlabeled data in offline reinforcement learning

T Yu, A Kumar, Y Chebotar… - International …, 2022 - proceedings.mlr.press
Offline reinforcement learning (RL) can learn control policies from static datasets but, like
standard RL methods, it requires reward annotations for every transition. In many cases …

Online and offline reinforcement learning by planning with a learned model

J Schrittwieser, T Hubert, A Mandhane… - Advances in …, 2021 - proceedings.neurips.cc
Learning efficiently from small amounts of data has long been the focus of model-based
reinforcement learning, both for the online case when interacting with the environment, and …

Conservative data sharing for multi-task offline reinforcement learning

T Yu, A Kumar, Y Chebotar… - Advances in …, 2021 - proceedings.neurips.cc
Offline reinforcement learning (RL) algorithms have shown promising results in domains
where abundant pre-collected data is available. However, prior methods focus on solving …

A workflow for offline model-free robotic reinforcement learning

A Kumar, A Singh, S Tian, C Finn, S Levine - arXiv preprint arXiv …, 2021 - arxiv.org
Offline reinforcement learning (RL) enables learning control policies by utilizing only prior
experience, without any online interaction. This can allow robots to acquire generalizable …

Fitvid: Overfitting in pixel-level video prediction

M Babaeizadeh, MT Saffar, S Nair, S Levine… - arXiv preprint arXiv …, 2021 - arxiv.org
An agent that is capable of predicting what happens next can perform a variety of tasks
through planning with no additional training. Furthermore, such an agent can internally …