Generative predecessor models for sample-efficient imitation learning

G Papamakarios, E Nalisnick, DJ Rezende… - Journal of Machine …, 2021 - jmlr.org

Normalizing flows provide a general mechanism for defining expressive probability
distributions, only requiring the specification of a (usually simple) base distribution and a …

被引用次数：1653 相关文章所有 8 个版本

[PDF] neurips.cc

Goal-conditioned imitation learning

Y Ding, C Florensa, P Abbeel… - Advances in neural …, 2019 - proceedings.neurips.cc

Abstract Designing rewards for Reinforcement Learning (RL) is challenging because it
needs to convey the desired task, be efficient to optimize, and be easy to compute. The latter …

被引用次数：246 相关文章所有 9 个版本

[PDF] arxiv.org

Coarse-to-fine imitation learning: Robot manipulation from a single demonstration

E Johns - 2021 IEEE international conference on robotics and …, 2021 - ieeexplore.ieee.org

We introduce a simple new method for visual imitation learning, which allows a novel robot
manipulation task to be learned from a single human demonstration, without requiring any …

被引用次数：107 相关文章所有 4 个版本

[PDF] arxiv.org

Deep generative models for offline policy learning: Tutorial, survey, and perspectives on future directions

J Chen, B Ganguly, Y Xu, Y Mei, T Lan… - arXiv preprint arXiv …, 2024 - arxiv.org

Deep generative models (DGMs) have demonstrated great success across various domains,
particularly in generating texts, images, and videos using models trained from offline data …

被引用次数：4 相关文章所有 3 个版本

Variational imitation learning with diverse-quality demonstrations

V Tangkaratt, B Han, ME Khan… - … on Machine Learning, 2020 - proceedings.mlr.press

Learning from demonstrations can be challenging when the quality of demonstrations is
diverse, and even more so when the quality is unknown and there is no additional …

被引用次数：49 相关文章所有 3 个版本

[PDF] mlr.press

Intrinsic reward driven imitation learning via generative model

X Yu, Y Lyu, I Tsang - International conference on machine …, 2020 - proceedings.mlr.press

Imitation learning in a high-dimensional environment is challenging. Most inverse
reinforcement learning (IRL) methods fail to outperform the demonstrator in such a high …

被引用次数：47 相关文章所有 8 个版本

[PDF] mlr.press

A coupled flow approach to imitation learning

GJ Freund, E Sarafian, S Kraus - … Conference on Machine …, 2023 - proceedings.mlr.press

In reinforcement learning and imitation learning, an object of central importance is the state
distribution induced by the policy. It plays a crucial role in the policy gradient theorem, and …

被引用次数：10 相关文章所有 7 个版本

[PDF] ed.ac.uk

Neural density estimation and likelihood-free inference

G Papamakarios - arXiv preprint arXiv:1910.13233, 2019 - arxiv.org

I consider two problems in machine learning and statistics: the problem of estimating the
joint probability density of a collection of random variables, known as density estimation, and …

被引用次数：51 相关文章所有 4 个版本

[PDF] arxiv.org

Goal-aware generative adversarial imitation learning from imperfect demonstration for robotic cloth manipulation

Y Tsurumine, T Matsubara - Robotics and Autonomous Systems, 2022 - Elsevier

Abstract Generative Adversarial Imitation Learning (GAIL) can learn policies without
explicitly defining the reward function from demonstrations. GAIL has the potential to learn …

被引用次数：12 相关文章所有 6 个版本

[PDF] neurips.cc

Forethought and hindsight in credit assignment

V Chelu, D Precup… - Advances in Neural …, 2020 - proceedings.neurips.cc

We address the problem of credit assignment in reinforcement learning and explore
fundamental questions regarding the way in which an agent can best use additional …

被引用次数：34 相关文章所有 7 个版本