Stabilizing transformers for reinforcement learning

E Parisotto, F Song, J Rae, R Pascanu… - International …, 2020 - proceedings.mlr.press
Owing to their ability to both effectively integrate information over long time horizons and
scale to massive amounts of data, self-attention architectures have recently shown …

[PDF][PDF] Meta reinforcement learning through memory

E Parisotto - Pittsburgh: Carnegie Mellon University, 2021 - kilthub.cmu.edu
Modern deep reinforcement learning (RL) algorithms, despite being at the forefront of
artificial intelligence capabilities, typically require a prohibitive amount of training samples to …