Novelty search in representational space for sample efficient exploration

RY Tao, V François-Lavet… - Advances in Neural …, 2020 - proceedings.neurips.cc
Advances in Neural Information Processing Systems, 2020 - proceedings.neurips.cc
Abstract
We present a new approach for efficient exploration that leverages a low-dimensional encoding of the environment learned with a combination of model-based and model-free objectives. Our approach gauges novelty with intrinsic rewards based on the distance to nearest neighbors in the low-dimensional representational space. We then use these intrinsic rewards for sample-efficient exploration, with planning routines in representational space, on hard exploration tasks with sparse rewards. One key element of our approach is the use of information-theoretic principles to shape our representations so that our novelty reward goes beyond pixel similarity. We test our approach on a number of maze tasks as well as a control problem, and show that it is more sample-efficient than strong baselines.
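To make the nearest-neighbor novelty idea concrete, here is a minimal sketch in plain Python. It is not the paper's implementation: the abstract does not give the exact reward formula, so this assumes the reward is the mean Euclidean distance from an encoded state to its k nearest neighbors among previously visited encodings. The function name `knn_novelty`, the parameter `k`, and the 2-D encodings are all hypothetical, and the learned encoder itself is omitted.

```python
import math
from typing import List, Sequence


def knn_novelty(z: Sequence[float], memory: List[Sequence[float]], k: int = 3) -> float:
    """Mean Euclidean distance from encoding z to its k nearest neighbors in memory.

    Assumption: this averaging rule is illustrative; the abstract only says the
    reward is based on nearest-neighbor distance in representational space.
    """
    if not memory:
        raise ValueError("memory must contain at least one encoded state")
    # Distances from z to every stored encoding, smallest first.
    distances = sorted(math.dist(z, m) for m in memory)
    nearest = distances[:k]  # fewer than k entries if memory is small
    return sum(nearest) / len(nearest)


# Hypothetical 2-D encodings of previously visited states.
visited = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1)]

familiar = knn_novelty((0.05, 0.05), visited, k=2)  # inside the visited cluster
novel = knn_novelty((2.0, 2.0), visited, k=2)       # far from anything seen so far
```

Under this assumption, an encoding far from everything in memory (`novel`) receives a larger intrinsic reward than one surrounded by visited states (`familiar`), which is what pushes an agent toward unexplored regions of the representational space.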