Accelerating reinforcement learning with learned skill priors

K Pertsch, Y Lee, J Lim - Conference on robot learning, 2021 - proceedings.mlr.press
Intelligent agents rely heavily on prior experience when learning a new task, yet most
modern reinforcement learning (RL) approaches learn every task from scratch. One …

Latent plans for task-agnostic offline reinforcement learning

E Rosete-Beas, O Mees, G Kalweit… - … on Robot Learning, 2023 - proceedings.mlr.press
Everyday tasks of long-horizon and comprising a sequence of multiple implicit subtasks still
impose a major challenge in offline robot control. While a number of prior methods aimed to …

Guided reinforcement learning with learned skills

K Pertsch, Y Lee, Y Wu, JJ Lim - arXiv preprint arXiv:2107.10253, 2021 - arxiv.org
Demonstration-guided reinforcement learning (RL) is a promising approach for learning
complex behaviors by leveraging both reward feedback and a set of target task …

Compile: Compositional imitation learning and execution

T Kipf, Y Li, H Dai, V Zambaldi… - International …, 2019 - proceedings.mlr.press
Abstract We introduce Compositional Imitation Learning and Execution (CompILE): a
framework for learning reusable, variable-length segments of hierarchically-structured …

Stochastic image-to-video synthesis using cinns

M Dorkenwald, T Milbich, A Blattmann… - Proceedings of the …, 2021 - openaccess.thecvf.com
Video understanding calls for a model to learn the characteristic interplay between static
scene content and its dynamics: Given an image, the model must be able to predict a future …

READ: Retrieval-Enhanced Asymmetric Diffusion for Motion Planning

T Oba, M Walter, N Ukita - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Abstract This paper proposes Retrieval-Enhanced Asymmetric Diffusion (READ) for image-
based robot motion planning. Given an image of the scene READ retrieves an initial motion …

Long-horizon visual planning with goal-conditioned hierarchical predictors

K Pertsch, O Rybkin, F Ebert, S Zhou… - Advances in …, 2020 - proceedings.neurips.cc
The ability to predict and plan into the future is fundamental for agents acting in the world. To
reach a faraway goal, we predict trajectories at multiple timescales, first devising a coarse …

Variational temporal abstraction

T Kim, S Ahn, Y Bengio - Advances in Neural Information …, 2019 - proceedings.neurips.cc
We introduce a variational approach to learning and inference of temporally hierarchical
structure and representation for sequential data. We propose the Variational Temporal …

Efficient reinforcement learning for autonomous driving with parameterized skills and priors

L Wang, J Liu, H Shao, W Wang, R Chen, Y Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
When autonomous vehicles are deployed on public roads, they will encounter countless and
diverse driving situations. Many manually designed driving policies are difficult to scale to …

Model-based reinforcement learning via latent-space collocation

O Rybkin, C Zhu, A Nagabandi… - International …, 2021 - proceedings.mlr.press
The ability to plan into the future while utilizing only raw high-dimensional observations,
such as images, can provide autonomous agents with broad and general capabilities …