Delays in reinforcement learning

文章

学术资源搜索

获得 1 条结果（用时0.02秒）

我的图书馆

Delays in reinforcement learning

在引用文章中搜索

[PDF] arxiv.org

Variational Delayed Policy Optimization

Q Wu, SS Zhan, Y Wang, Y Wang, CW Lin, C Lv… - arXiv preprint arXiv …, 2024 - arxiv.org

In environments with delayed observation, state augmentation by including actions within
the delay window is adopted to retrieve Markovian property to enable reinforcement learning …

被引用次数：1 相关文章所有 2 个版本