Reinforcement learning with hierarchies of machines

S Pateria, B Subagdja, A Tan, C Quek - ACM Computing Surveys (CSUR …, 2021 - dl.acm.org

Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of
challenging long-horizon decision-making tasks into simpler subtasks. During the past …

被引用次数：307 相关文章所有 5 个版本

[PDF] jair.org Find it @ GRI

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

被引用次数：247 相关文章所有 9 个版本

[PDF] arxiv.org

Transfer learning in deep reinforcement learning: A survey

Z Zhu, K Lin, AK Jain, J Zhou - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org

Reinforcement learning is a learning paradigm for solving sequential decision-making
problems. Recent years have witnessed remarkable progress in reinforcement learning …

被引用次数：492 相关文章所有 9 个版本

[PDF] neurips.cc

Data-efficient hierarchical reinforcement learning

O Nachum, SS Gu, H Lee… - Advances in neural …, 2018 - proceedings.neurips.cc

Hierarchical reinforcement learning (HRL) is a promising approach to extend traditional
reinforcement learning (RL) methods to solve more complex tasks. Yet, the majority of …

被引用次数：941 相关文章所有 10 个版本

[PDF] nowpublishers.com

Neural approaches to conversational AI

J Gao, M Galley, L Li - The 41st international ACM SIGIR conference on …, 2018 - dl.acm.org

This tutorial surveys neural approaches to conversational AI that were developed in the last
few years. We group conversational systems into three categories:(1) question answering …

被引用次数：819 相关文章所有 17 个版本

[PDF] neurips.cc Find it @ GRI

Deep hierarchical planning from pixels

D Hafner, KH Lee, I Fischer… - Advances in Neural …, 2022 - proceedings.neurips.cc

Intelligent agents need to select long sequences of actions to solve complex tasks. While
humans easily break down tasks into subgoals and reach them through millions of muscle …

被引用次数：59 相关文章所有 6 个版本

[PDF] arxiv.org

Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning

A Gupta, V Kumar, C Lynch, S Levine… - arXiv preprint arXiv …, 2019 - arxiv.org

We present relay policy learning, a method for imitation and reinforcement learning that can
solve multi-stage, long-horizon robotic tasks. This general and universally-applicable, two …

被引用次数：341 相关文章所有 7 个版本

[PDF] arxiv.org

From motor control to team play in simulated humanoid football

S Liu, G Lever, Z Wang, J Merel, SMA Eslami… - Science Robotics, 2022 - science.org

Learning to combine control at the level of joint torques with longer-term goal-directed
behavior is a long-standing challenge for physically embodied artificial agents. Intelligent …

被引用次数：110 相关文章所有 6 个版本

[PDF] universityofgalway.ie

Cooperative multi-agent control using deep reinforcement learning

JK Gupta, M Egorov, M Kochenderfer - … Best Papers, São Paulo, Brazil, May …, 2017 - Springer

This work considers the problem of learning cooperative policies in complex, partially
observable domains without explicit communication. We extend three classes of single …

被引用次数：1093 相关文章所有 3 个版本

[PDF] mlr.press

Feudal networks for hierarchical reinforcement learning

AS Vezhnevets, S Osindero, T Schaul… - International …, 2017 - proceedings.mlr.press

Abstract We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical
reinforcement learning. Our approach is inspired by the feudal reinforcement learning …

被引用次数：1048 相关文章所有 9 个版本