Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

[HTML][HTML] Deliberation for autonomous robots: A survey

F Ingrand, M Ghallab - Artificial Intelligence, 2017 - Elsevier
Autonomous robots facing a diversity of open environments and performing a variety of tasks
and interactions need explicit deliberation in order to fulfill their missions. Deliberation is …

An introduction to deep reinforcement learning

V François-Lavet, P Henderson, R Islam… - … and Trends® in …, 2018 - nowpublishers.com
Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …

A deep hierarchical approach to lifelong learning in minecraft

C Tessler, S Givony, T Zahavy, D Mankowitz… - Proceedings of the …, 2017 - ojs.aaai.org
We propose a lifelong learning system that has the ability to reuse and transfer knowledge
from one task to another while efficiently retaining the previously learned knowledge-base …

Graying the black box: Understanding dqns

T Zahavy, N Ben-Zrihem… - … conference on machine …, 2016 - proceedings.mlr.press
In recent years there is a growing interest in using deep representations for reinforcement
learning. In this paper, we present a methodology and tools to analyze Deep Q-networks …

Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning

RS Sutton, D Precup, S Singh - Artificial intelligence, 1999 - Elsevier
Learning, planning, and representing knowledge at multiple levels of temporal abstraction
are key, longstanding challenges for AI. In this paper we consider how these challenges can …

Hierarchical reinforcement learning with the MAXQ value function decomposition

TG Dietterich - Journal of artificial intelligence research, 2000 - jair.org
This paper presents a new approach to hierarchical reinforcement learning based on
decomposing the target Markov decision process (MDP) into a hierarchy of smaller MDPs …

Decision-theoretic planning: Structural assumptions and computational leverage

C Boutilier, T Dean, S Hanks - Journal of Artificial Intelligence Research, 1999 - jair.org
Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …

Optimal behavioral hierarchy

A Solway, C Diuk, N Córdova, D Yee… - PLoS computational …, 2014 - journals.plos.org
Human behavior has long been recognized to display hierarchical structure: actions fit
together into subtasks, which cohere into extended goal-directed activities. Arranging …

State abstractions for lifelong reinforcement learning

D Abel, D Arumugam, L Lehnert… - … on Machine Learning, 2018 - proceedings.mlr.press
In lifelong reinforcement learning, agents must effectively transfer knowledge across tasks
while simultaneously addressing exploration, credit assignment, and generalization. State …