Towards continual reinforcement learning: A review and perspectives
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …
[HTML][HTML] Deliberation for autonomous robots: A survey
Autonomous robots facing a diversity of open environments and performing a variety of tasks
and interactions need explicit deliberation in order to fulfill their missions. Deliberation is …
and interactions need explicit deliberation in order to fulfill their missions. Deliberation is …
An introduction to deep reinforcement learning
Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …
learning. This field of research has been able to solve a wide range of complex …
A deep hierarchical approach to lifelong learning in minecraft
We propose a lifelong learning system that has the ability to reuse and transfer knowledge
from one task to another while efficiently retaining the previously learned knowledge-base …
from one task to another while efficiently retaining the previously learned knowledge-base …
Graying the black box: Understanding dqns
T Zahavy, N Ben-Zrihem… - … conference on machine …, 2016 - proceedings.mlr.press
In recent years there is a growing interest in using deep representations for reinforcement
learning. In this paper, we present a methodology and tools to analyze Deep Q-networks …
learning. In this paper, we present a methodology and tools to analyze Deep Q-networks …
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
Learning, planning, and representing knowledge at multiple levels of temporal abstraction
are key, longstanding challenges for AI. In this paper we consider how these challenges can …
are key, longstanding challenges for AI. In this paper we consider how these challenges can …
Hierarchical reinforcement learning with the MAXQ value function decomposition
TG Dietterich - Journal of artificial intelligence research, 2000 - jair.org
This paper presents a new approach to hierarchical reinforcement learning based on
decomposing the target Markov decision process (MDP) into a hierarchy of smaller MDPs …
decomposing the target Markov decision process (MDP) into a hierarchy of smaller MDPs …
Decision-theoretic planning: Structural assumptions and computational leverage
Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …
decision making, and has been addressed by researchers in many different fields, including …
Optimal behavioral hierarchy
Human behavior has long been recognized to display hierarchical structure: actions fit
together into subtasks, which cohere into extended goal-directed activities. Arranging …
together into subtasks, which cohere into extended goal-directed activities. Arranging …
State abstractions for lifelong reinforcement learning
In lifelong reinforcement learning, agents must effectively transfer knowledge across tasks
while simultaneously addressing exploration, credit assignment, and generalization. State …
while simultaneously addressing exploration, credit assignment, and generalization. State …