Model-based reinforcement learning with value-targeted regression

N Le, VS Rathour, K Yamazaki, K Luu… - Artificial Intelligence …, 2022 - Springer

Deep reinforcement learning augments the reinforcement learning framework and utilizes
the powerful representation of deep neural networks. Recent works have demonstrated the …

被引用次数：180 相关文章所有 10 个版本

[PDF] sciencedirect.com

Deep reinforcement learning in medical imaging: A literature review

SK Zhou, HN Le, K Luu, HV Nguyen, N Ayache - Medical image analysis, 2021 - Elsevier

Deep reinforcement learning (DRL) augments the reinforcement learning framework, which
learns a sequence of actions that maximizes the expected reward, with the representative …

被引用次数：150 相关文章所有 9 个版本

[PDF] arxiv.org

Dive into deep learning

A Zhang, ZC Lipton, M Li, AJ Smola - arXiv preprint arXiv:2106.11342, 2021 - arxiv.org

This open-source book represents our attempt to make deep learning approachable,
teaching readers the concepts, the context, and the code. The entire book is drafted in …

被引用次数：1122 相关文章所有 9 个版本

[PDF] mlr.press

Nearly minimax optimal reinforcement learning for linear mixture markov decision processes

D Zhou, Q Gu, C Szepesvari - Conference on Learning …, 2021 - proceedings.mlr.press

We study reinforcement learning (RL) with linear function approximation where the
underlying transition probability kernel of the Markov decision process (MDP) is a linear …

被引用次数：227 相关文章所有 7 个版本

[PDF] mlr.press

Nearly minimax optimal reinforcement learning for linear markov decision processes

J He, H Zhao, D Zhou, Q Gu - International Conference on …, 2023 - proceedings.mlr.press

We study reinforcement learning (RL) with linear function approximation. For episodic time-
inhomogeneous linear Markov decision processes (linear MDPs) whose transition …

被引用次数：55 相关文章所有 7 个版本

[PDF] mlr.press

Reward-free rl is no harder than reward-aware rl in linear markov decision processes

AJ Wagenmaker, Y Chen… - International …, 2022 - proceedings.mlr.press

Reward-free reinforcement learning (RL) considers the setting where the agent does not
have access to a reward function during exploration, but must propose a near-optimal policy …

被引用次数：62 相关文章所有 7 个版本

[PDF] mlr.press

Logarithmic regret for reinforcement learning with linear function approximation

J He, D Zhou, Q Gu - International Conference on Machine …, 2021 - proceedings.mlr.press

Reinforcement learning (RL) with linear function approximation has received increasing
attention recently. However, existing work has focused on obtaining $\sqrt {T} $-type regret …

被引用次数：105 相关文章所有 5 个版本

[PDF] mlr.press

Provably efficient reinforcement learning for discounted mdps with feature mapping

D Zhou, J He, Q Gu - International Conference on Machine …, 2021 - proceedings.mlr.press

Modern tasks in reinforcement learning have large state and action spaces. To deal with
them efficiently, one often uses predefined feature mapping to represent states and actions …

被引用次数：142 相关文章所有 5 个版本

[PDF] neurips.cc

Computationally efficient horizon-free reinforcement learning for linear mixture mdps

D Zhou, Q Gu - Advances in neural information processing …, 2022 - proceedings.neurips.cc

Recent studies have shown that episodic reinforcement learning (RL) is not more difficult
than bandits, even with a long planning horizon and unknown state transitions. However …

被引用次数：41 相关文章所有 8 个版本

[PDF] mlr.press

Nearly minimax optimal reinforcement learning with linear function approximation

P Hu, Y Chen, L Huang - International Conference on …, 2022 - proceedings.mlr.press

We study reinforcement learning with linear function approximation where the transition
probability and reward functions are linear with respect to a feature mapping $\boldsymbol …

被引用次数：36 相关文章所有 4 个版本