Hierarchical solution of Markov decision processes using macro-actions

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

被引用次数：324 相关文章所有 9 个版本

[HTML] sciencedirect.com

[HTML][HTML] Deliberation for autonomous robots: A survey

F Ingrand, M Ghallab - Artificial Intelligence, 2017 - Elsevier

Autonomous robots facing a diversity of open environments and performing a variety of tasks
and interactions need explicit deliberation in order to fulfill their missions. Deliberation is …

被引用次数：419 相关文章所有 9 个版本

[PDF] nowpublishers.com

An introduction to deep reinforcement learning

V François-Lavet, P Henderson, R Islam… - … and Trends® in …, 2018 - nowpublishers.com

Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …

被引用次数：1932 相关文章所有 16 个版本

[PDF] aaai.org

A deep hierarchical approach to lifelong learning in minecraft

C Tessler, S Givony, T Zahavy, D Mankowitz… - Proceedings of the …, 2017 - ojs.aaai.org

We propose a lifelong learning system that has the ability to reuse and transfer knowledge
from one task to another while efficiently retaining the previously learned knowledge-base …

被引用次数：468 相关文章所有 15 个版本

[PDF] mlr.press

Graying the black box: Understanding dqns

T Zahavy, N Ben-Zrihem… - … conference on machine …, 2016 - proceedings.mlr.press

In recent years there is a growing interest in using deep representations for reinforcement
learning. In this paper, we present a methodology and tools to analyze Deep Q-networks …

被引用次数：350 相关文章所有 11 个版本

[PDF] sciencedirect.com

Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning

RS Sutton, D Precup, S Singh - Artificial intelligence, 1999 - Elsevier

Learning, planning, and representing knowledge at multiple levels of temporal abstraction
are key, longstanding challenges for AI. In this paper we consider how these challenges can …

被引用次数：4684 相关文章所有 39 个版本

[PDF] jair.org

Hierarchical reinforcement learning with the MAXQ value function decomposition

TG Dietterich - Journal of artificial intelligence research, 2000 - jair.org

This paper presents a new approach to hierarchical reinforcement learning based on
decomposing the target Markov decision process (MDP) into a hierarchy of smaller MDPs …

被引用次数：2212 相关文章所有 29 个版本

[PDF] jair.org

Decision-theoretic planning: Structural assumptions and computational leverage

C Boutilier, T Dean, S Hanks - Journal of Artificial Intelligence Research, 1999 - jair.org

Planning under uncertainty is a central problem in the study of automated sequential
decision making, and has been addressed by researchers in many different fields, including …

被引用次数：1600 相关文章所有 27 个版本

[PDF] plos.org

Optimal behavioral hierarchy

A Solway, C Diuk, N Córdova, D Yee… - PLoS computational …, 2014 - journals.plos.org

Human behavior has long been recognized to display hierarchical structure: actions fit
together into subtasks, which cohere into extended goal-directed activities. Arranging …

被引用次数：241 相关文章所有 18 个版本

[PDF] mlr.press

State abstractions for lifelong reinforcement learning

D Abel, D Arumugam, L Lehnert… - … on Machine Learning, 2018 - proceedings.mlr.press

In lifelong reinforcement learning, agents must effectively transfer knowledge across tasks
while simultaneously addressing exploration, credit assignment, and generalization. State …

被引用次数：160 相关文章所有 9 个版本