Hierarchical reinforcement learning: A comprehensive survey

S Pateria, B Subagdja, A Tan, C Quek - ACM Computing Surveys (CSUR …, 2021 - dl.acm.org
Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of
challenging long-horizon decision-making tasks into simpler subtasks. During the past …

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Transfer learning in deep reinforcement learning: A survey

Z Zhu, K Lin, AK Jain, J Zhou - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org
Reinforcement learning is a learning paradigm for solving sequential decision-making
problems. Recent years have witnessed remarkable progress in reinforcement learning …

Data-efficient hierarchical reinforcement learning

O Nachum, SS Gu, H Lee… - Advances in neural …, 2018 - proceedings.neurips.cc
Hierarchical reinforcement learning (HRL) is a promising approach to extend traditional
reinforcement learning (RL) methods to solve more complex tasks. Yet, the majority of …

Neural approaches to conversational AI

J Gao, M Galley, L Li - The 41st international ACM SIGIR conference on …, 2018 - dl.acm.org
This tutorial surveys neural approaches to conversational AI that were developed in the last
few years. We group conversational systems into three categories:(1) question answering …

Deep hierarchical planning from pixels

D Hafner, KH Lee, I Fischer… - Advances in Neural …, 2022 - proceedings.neurips.cc
Intelligent agents need to select long sequences of actions to solve complex tasks. While
humans easily break down tasks into subgoals and reach them through millions of muscle …

Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning

A Gupta, V Kumar, C Lynch, S Levine… - arXiv preprint arXiv …, 2019 - arxiv.org
We present relay policy learning, a method for imitation and reinforcement learning that can
solve multi-stage, long-horizon robotic tasks. This general and universally-applicable, two …

From motor control to team play in simulated humanoid football

S Liu, G Lever, Z Wang, J Merel, SMA Eslami… - Science Robotics, 2022 - science.org
Learning to combine control at the level of joint torques with longer-term goal-directed
behavior is a long-standing challenge for physically embodied artificial agents. Intelligent …

Cooperative multi-agent control using deep reinforcement learning

JK Gupta, M Egorov, M Kochenderfer - … Best Papers, São Paulo, Brazil, May …, 2017 - Springer
This work considers the problem of learning cooperative policies in complex, partially
observable domains without explicit communication. We extend three classes of single …

Feudal networks for hierarchical reinforcement learning

AS Vezhnevets, S Osindero, T Schaul… - International …, 2017 - proceedings.mlr.press
Abstract We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical
reinforcement learning. Our approach is inspired by the feudal reinforcement learning …