Integrating behavior cloning and reinforcement learning for improved performance in dense and sparse reward environments

VG Goecks, GM Gremillion, VJ Lawhern… - arXiv preprint arXiv …, 2019 - arxiv.org
This paper investigates how to efficiently transition and update policies, trained initially with
demonstrations, using off-policy actor-critic reinforcement learning. It is well known that …

Attentive multi-task deep reinforcement learning

T Bräm, G Brunner, O Richter, R Wattenhofer - Machine Learning and …, 2020 - Springer
Sharing knowledge between tasks is vital for efficient learning in a multi-task setting.
However, most research so far has focused on the easier case where knowledge transfer is …