Learning state representations from random deep action-conditional predictions

J Modayil, Z Abbas - arXiv preprint arXiv:2311.02215, 2023 - arxiv.org

Conventional reinforcement learning (RL) algorithms exhibit broad generality in their
theoretical formulation and high performance on several challenging domains when …

被引用次数：2 相关文章所有 2 个版本

[PDF] neurips.cc

Exploring through random curiosity with general value functions

A Ramesh, L Kirsch, S van Steenkiste… - Advances in Neural …, 2022 - proceedings.neurips.cc

Efficient exploration in reinforcement learning is a challenging problem commonly
addressed through intrinsic rewards. Recent prominent approaches are based on state …

被引用次数：10 相关文章所有 11 个版本

[PDF] mlr.press

Towards a better understanding of representation dynamics under TD-learning

Y Tang, R Munos - International Conference on Machine …, 2023 - proceedings.mlr.press

TD-learning is a foundation reinforcement learning (RL) algorithm for value prediction.
Critical to the accuracy of value predictions is the quality of state representations. In this …

被引用次数：1 相关文章所有 6 个版本

[PDF] mlr.press

Auxiliary task discovery through generate-and-test

B Rafiee, S Ghiassian, J Jin, R Sutton… - Conference on …, 2023 - proceedings.mlr.press

In this paper, we explore an approach to auxiliary task discovery in reinforcement learning
based on ideas from representation learning. Auxiliary tasks tend to improve data efficiency …

被引用次数：1 相关文章所有 5 个版本

[PDF] github.io

[PDF][PDF] Predictions Predicting Predictions

M Schlegel, M White - The 5th Multi-disciplinary Conference on …, 2022 - mkschleg.github.io

Predicting the sensorimotor stream has consistently been a key component for building
general learning agents. Whether through predicting a reward signal to select the best …

被引用次数：1 相关文章

[PDF] openreview.net

Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning

T McInroe, L Schäfer, SV Albrecht - Transactions on Machine Learning … - openreview.net

Learning control from pixels is difficult for reinforcement learning (RL) agents because
representation learning and policy learning are intertwined. Previous approaches remedy …

被引用次数：1 相关文章