Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards

M Vecerik, T Hester, J Scholz, F Wang… - arXiv preprint arXiv …, 2017 - arxiv.org
We propose a general and model-free approach for Reinforcement Learning (RL) on real
robotics with sparse rewards. We build upon the Deep Deterministic Policy Gradient (DDPG) …

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

M Vecerik, T Hester, J Scholz, F Wang… - arXiv e …, 2017 - ui.adsabs.harvard.edu
We propose a general and model-free approach for Reinforcement Learning (RL) on real
robotics with sparse rewards. We build upon the Deep Deterministic Policy Gradient (DDPG) …