Towards model-free RL algorithms that scale well with unstructured data

J Modayil, Z Abbas - arXiv preprint arXiv:2311.02215, 2023 - arxiv.org
Conventional reinforcement learning (RL) algorithms exhibit broad generality in their
theoretical formulation and high performance on several challenging domains when …

Exploring through random curiosity with general value functions

A Ramesh, L Kirsch, S van Steenkiste… - Advances in Neural …, 2022 - proceedings.neurips.cc
Efficient exploration in reinforcement learning is a challenging problem commonly
addressed through intrinsic rewards. Recent prominent approaches are based on state …

Towards a better understanding of representation dynamics under TD-learning

Y Tang, R Munos - International Conference on Machine …, 2023 - proceedings.mlr.press
TD-learning is a foundation reinforcement learning (RL) algorithm for value prediction.
Critical to the accuracy of value predictions is the quality of state representations. In this …

Auxiliary task discovery through generate-and-test

B Rafiee, S Ghiassian, J Jin, R Sutton… - Conference on …, 2023 - proceedings.mlr.press
In this paper, we explore an approach to auxiliary task discovery in reinforcement learning
based on ideas from representation learning. Auxiliary tasks tend to improve data efficiency …

[PDF][PDF] Predictions Predicting Predictions

M Schlegel, M White - The 5th Multi-disciplinary Conference on …, 2022 - mkschleg.github.io
Predicting the sensorimotor stream has consistently been a key component for building
general learning agents. Whether through predicting a reward signal to select the best …

Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning

T McInroe, L Schäfer, SV Albrecht - Transactions on Machine Learning … - openreview.net
Learning control from pixels is difficult for reinforcement learning (RL) agents because
representation learning and policy learning are intertwined. Previous approaches remedy …