MICo: Improved representations via sampling-based state similarity for Markov decision processes
We present a new behavioural distance over the state space of a Markov decision process,
and demonstrate the use of this distance as an effective means of shaping the learnt …
and demonstrate the use of this distance as an effective means of shaping the learnt …
Reward-predictive representations generalize across tasks in reinforcement learning
In computer science, reinforcement learning is a powerful framework with which artificial
agents can learn to maximize their performance for any given Markov decision process …
agents can learn to maximize their performance for any given Markov decision process …
A taxonomy for similarity metrics between Markov decision processes
J García, Á Visús, F Fernández - Machine Learning, 2022 - Springer
Although the notion of task similarity is potentially interesting in a wide range of areas such
as curriculum learning or automated planning, it has mostly been tied to transfer learning …
as curriculum learning or automated planning, it has mostly been tied to transfer learning …
Runtime Probabilistic Analysis of Self-Adaptive Systems via Formal Approximation Techniques
MA Nia - 2022 - search.proquest.com
Self-adaptive systems provide the ability of autonomous decision-making for handling the
changes affecting the functionalities of cyber-physical systems. A self-adaptive system …
changes affecting the functionalities of cyber-physical systems. A self-adaptive system …
Reward-Predictive Clustering
Recent advances in reinforcement-learning research have demonstrated impressive results
in building algorithms that can out-perform humans in complex tasks. Nevertheless, creating …
in building algorithms that can out-perform humans in complex tasks. Nevertheless, creating …
[图书][B] Reinforcement learning in partially observed and multi-agent systems
J Subramanian - 2020 - search.proquest.com
In this thesis we investigate the problem of reinforcement learning in partially observed and
multi-agent systems. The belief state, which is the most common information state used in …
multi-agent systems. The belief state, which is the most common information state used in …
MDP Autoencoder
S Bose, M Huber - … on Systems, Man and Cybernetics (SMC), 2019 - ieeexplore.ieee.org
This paper proposes a novel deep reinforcement learning (RL) architecture, which learns a
dynamics model in latent space that is behaviorally grounded to the observed space and …
dynamics model in latent space that is behaviorally grounded to the observed space and …
[图书][B] Learning Representations Using Reinforcement Learning
S Bose - 2019 - search.proquest.com
The framework of reinforcement learning is a powerful suite of algorithms that can learn
generalized solutions to complex decision making problems. However, the applications of …
generalized solutions to complex decision making problems. However, the applications of …