Towards continual reinforcement learning: A review and perspectives
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …
Large language models for robotics: A survey
The human ability to learn, generalize, and control complex manipulation tasks through multi-
modality feedback suggests a unique capability, which we refer to as dexterity intelligence …
LM-Nav: Robotic navigation with large pre-trained models of language, vision, and action
Goal-conditioned policies for robotic navigation can be trained on large, unannotated
datasets, providing for good generalization to real-world settings. However, particularly in …
On the opportunities and risks of foundation models
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
Q-Transformer: Scalable offline reinforcement learning via autoregressive Q-functions
In this work, we present a scalable reinforcement learning method for training multi-task
policies from large offline datasets that can leverage both human demonstrations and …
BridgeData V2: A dataset for robot learning at scale
We introduce BridgeData V2, a large and diverse dataset of robotic manipulation behaviors
designed to facilitate research in scalable robot learning. BridgeData V2 contains 53,896 …
Contrastive learning as goal-conditioned reinforcement learning
In reinforcement learning (RL), it is easier to solve a task if given a good representation.
While deep RL should automatically acquire such good representations, prior work often …
PlayFusion: Skill acquisition via diffusion from language-annotated play
Learning from unstructured and uncurated data has become the dominant paradigm for
generative approaches in language or vision. Such unstructured and unguided behavior …
Planning to explore via self-supervised world models
Reinforcement learning allows solving complex tasks, however, the learning tends to be task-
specific and the sample efficiency remains a challenge. We present Plan2Explore, a self …
Causal machine learning: A survey and open problems
Causal Machine Learning (CausalML) is an umbrella term for machine learning methods
that formalize the data-generation process as a structural causal model (SCM). This …