Action priors for large action spaces in robotics

O Biza, D Wang, R Platt, JW van de Meent… - arXiv preprint arXiv …, 2021 - arxiv.org
In robotics, it is often not possible to learn useful policies using pure model-free
reinforcement learning without significant reward shaping or curriculum learning. As a …

Lecture notes of 6.8200 Computational Sensorimotor Learning

ZW Hong, P Agrawal - williamd4112.github.io
Consider the problem of designing a recommendation system for a competitor to the popular
music service Spotify called MusicApp. To maximize earnings, the app's goal is not just to …