Deep exploration via randomized value functions- 学术资源搜索

文章

学术资源搜索

Deep exploration via randomized value functions

I Osband, B Van Roy, DJ Russo, Z Wen - Journal of Machine Learning …, 2019 - jmlr.org

Journal of Machine Learning Research, 2019•jmlr.org

We study the use of randomized value functions to guide deep exploration in reinforcement
learning. This offers an elegant means for synthesizing statistically and computationally
efficient exploration with common practical approaches to value function learning. We
present several reinforcement learning algorithms that leverage randomized value functions
and demonstrate their efficacy through computational studies. We also prove a regret bound
that establishes statistical efficiency with a tabular representation.

Abstract

We study the use of randomized value functions to guide deep exploration in reinforcement learning. This offers an elegant means for synthesizing statistically and computationally efficient exploration with common practical approaches to value function learning. We present several reinforcement learning algorithms that leverage randomized value functions and demonstrate their efficacy through computational studies. We also prove a regret bound that establishes statistical efficiency with a tabular representation.

jmlr.org

展开收起

被引用次数：358 相关文章所有 9 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果