Posterior sampling with delayed feedback for reinforcement learning with linear function...- 学术资源搜索

文章

学术资源搜索

Posterior sampling with delayed feedback for reinforcement learning with linear function approximation

NL Kuang, M Yin, M Wang… - Advances in Neural …, 2023 - proceedings.neurips.cc

Recent studies in reinforcement learning (RL) have made significant progress by leveraging
function approximation to alleviate the sample complexity hurdle for better performance.
Despite the success, existing provably efficient algorithms typically rely on the accessibility
of immediate feedback upon taking actions. The failure to account for the impact of delay in
observations can significantly degrade the performance of real-world systems due to the
regret blow-up. In this work, we tackle the challenge of delayed feedback in RL with linear …

被引用次数：4 相关文章所有 5 个版本

Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation

N Lijing Kuang, M Yin, M Wang, YX Wang… - arXiv e …, 2023 - ui.adsabs.harvard.edu

以上显示的是最相近的搜索结果。查看全部搜索结果