The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning

Citing articles: 1 result

Can Q-learning be improved with advice?

N Golowich, A Moitra - Conference on Learning Theory, 2022 - proceedings.mlr.press

Despite rapid progress in theoretical reinforcement learning (RL) over the last few years,
most of the known guarantees are worst-case in nature, failing to take advantage of structure …

Cited by 11 · 4 versions