Basis refinement strategies for linear value function approximation in MDPs

PS Castro, T Kastner… - Advances in Neural …, 2021 - proceedings.neurips.cc

We present a new behavioural distance over the state space of a Markov decision process,
and demonstrate the use of this distance as an effective means of shaping the learnt …

被引用次数：67 相关文章所有 6 个版本

[PDF] plos.org

Reward-predictive representations generalize across tasks in reinforcement learning

L Lehnert, ML Littman, MJ Frank - PLoS computational biology, 2020 - journals.plos.org

In computer science, reinforcement learning is a powerful framework with which artificial
agents can learn to maximize their performance for any given Markov decision process …

被引用次数：39 相关文章所有 19 个版本

[PDF] springer.com

A taxonomy for similarity metrics between Markov decision processes

J García, Á Visús, F Fernández - Machine Learning, 2022 - Springer

Although the notion of task similarity is potentially interesting in a wide range of areas such
as curriculum learning or automated planning, it has mostly been tied to transfer learning …

被引用次数：13 相关文章所有 7 个版本

[PDF] researchgate.net

Runtime Probabilistic Analysis of Self-Adaptive Systems via Formal Approximation Techniques

MA Nia - 2022 - search.proquest.com

Self-adaptive systems provide the ability of autonomous decision-making for handling the
changes affecting the functionalities of cyber-physical systems. A self-adaptive system …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

[图书][B] Learning Representations Using Reinforcement Learning

S Bose - 2019 - search.proquest.com

The framework of reinforcement learning is a powerful suite of algorithms that can learn
generalized solutions to complex decision making problems. However, the applications of …

[引用][C] A taxonomy for similarity metrics between Markov decision processes

FJ García Polo, Á Visús, F Fernández Rebollo - 2022 - Springer