How to train your robot with deep reinforcement learning: lessons we have learned
Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …
acquiring complex behaviors from low-level sensor observations. Although a large portion of …
Survey on large language model-enhanced reinforcement learning: Concept, taxonomy, and methods
With extensive pretrained knowledge and high-level general capabilities, large language
models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in …
models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in …
On the opportunities and risks of foundation models
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
Vip: Towards universal visual reward and representation via value-implicit pre-training
Reward and representation learning are two long-standing challenges for learning an
expanding set of robot manipulation skills from sensory observations. Given the inherent …
expanding set of robot manipulation skills from sensory observations. Given the inherent …
What matters in learning from offline human demonstrations for robot manipulation
Imitating human demonstrations is a promising approach to endow robots with various
manipulation capabilities. While recent advances have been made in imitation learning and …
manipulation capabilities. While recent advances have been made in imitation learning and …
Learning language-conditioned robot behavior from offline data and crowd-sourced annotation
We study the problem of learning a range of vision-based manipulation tasks from a large
offline dataset of robot interaction. In order to accomplish this, humans need easy and …
offline dataset of robot interaction. In order to accomplish this, humans need easy and …
Language conditioned imitation learning over unstructured data
C Lynch, P Sermanet - arXiv preprint arXiv:2005.07648, 2020 - arxiv.org
Natural language is perhaps the most flexible and intuitive way for humans to communicate
tasks to a robot. Prior work in imitation learning typically requires each task be specified with …
tasks to a robot. Prior work in imitation learning typically requires each task be specified with …
Can foundation models perform zero-shot task specification for robot manipulation?
Task specification is at the core of programming autonomous robots. A low-effort modality for
task specification is critical for engagement of non-expert end users and ultimate adoption of …
task specification is critical for engagement of non-expert end users and ultimate adoption of …
Vision-language models as success detectors
Detecting successful behaviour is crucial for training intelligent agents. As such,
generalisable reward models are a prerequisite for agents that can learn to generalise their …
generalisable reward models are a prerequisite for agents that can learn to generalise their …
How to leverage unlabeled data in offline reinforcement learning
Offline reinforcement learning (RL) can learn control policies from static datasets but, like
standard RL methods, it requires reward annotations for every transition. In many cases …
standard RL methods, it requires reward annotations for every transition. In many cases …