Offline actor-critic reinforcement learning scales to large models

文章

学术资源搜索

获得 2 条结果（用时0.02秒）

我的图书馆

Offline actor-critic reinforcement learning scales to large models

在引用文章中搜索

[PDF] arxiv.org

Stop regressing: Training value functions via classification for scalable deep rl

J Farebrother, J Orbay, Q Vuong, AA Taïga… - arXiv preprint arXiv …, 2024 - arxiv.org

Value functions are a central component of deep reinforcement learning (RL). These
functions, parameterized by neural networks, are trained using a mean squared error …

被引用次数：11 相关文章所有 3 个版本

[PDF] washington.edu

When Models Meet Data: Pragmatic Robot Learning with Model-based Optimization

M Bhardwaj - 2024 - digital.lib.washington.edu

Autonomous robots operating in complex and dynamic real-world scenarios must exhibit fast
and reactive behaviors to adapt to environment changes, and learn to improve their …