Posterior sampling networks

文章

学术资源搜索

获得 4 条结果（用时0.01秒）

我的图书馆

在引用文章中搜索

[PDF] arxiv.org

Langevin dqn

V Dwaracherla, B Van Roy - arXiv preprint arXiv:2002.07282, 2020 - arxiv.org

Algorithms that tackle deep exploration--an important challenge in reinforcement learning--
have relied on epistemic uncertainty representation through ensembles or other …

被引用次数：6 相关文章所有 3 个版本

[PDF] aaai.org

Parameterized indexed value function for efficient exploration in reinforcement learning

T Tan, Z Xiong, VR Dwaracherla - … of the AAAI Conference on Artificial …, 2020 - ojs.aaai.org

It is well known that quantifying uncertainty in the action-value estimates is crucial for
efficient exploration in reinforcement learning. Ensemble sampling offers a relatively …

被引用次数：7 相关文章所有 6 个版本

[图书][B] Data-Driven Adaptive Traffic Signal Control via Deep Reinforcement Learning

T Tan - 2020 - search.proquest.com

Adaptive traffic signal control (ATSC) system serves a significant role for relieving urban
traffic congestion. The system is capable of adjusting signal phases and timings of all traffic …

[图书][B] Posterior Sampling for Efficient Reinforcement Learning

VR Dwaracherla - 2021 - search.proquest.com

Reinforcement learning has shown tremendous success over the past few years. Much of
this recent success can be attributed to agents learning from an inordinate amount of data in …