相关文章- 学术资源搜索

Fast reinforcement learning with generalized policy updates

A Barreto, S Hou, D Borsa, D Silver… - Proceedings of the …, 2020 - National Acad Sciences

The combination of reinforcement learning with deep learning is a promising approach to
tackle important sequential decision-making problems that are currently intractable. One …

被引用次数：136 相关文章所有 8 个版本

Approximate policy-based accelerated deep reinforcement learning

X Wang, Y Gu, Y Cheng, A Liu… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

In recent years, the deep reinforcement learning (DRL) algorithms have been developed
rapidly and have achieved excellent performance in many challenging tasks. However, due …

被引用次数：52 相关文章所有 4 个版本

[PDF] aaai.org

Multi-step reinforcement learning: A unifying algorithm

K De Asis, J Hernandez-Garcia, G Holland… - Proceedings of the AAAI …, 2018 - ojs.aaai.org

Unifying seemingly disparate algorithmic ideas to produce better performing algorithms has
been a longstanding goal in reinforcement learning. As a primary example, TD (λ) elegantly …

被引用次数：132 相关文章所有 7 个版本

[PDF] mlr.press

Can increasing input dimensionality improve deep reinforcement learning?

K Ota, T Oiki, D Jha, T Mariyama… - … on machine learning, 2020 - proceedings.mlr.press

Deep reinforcement learning (RL) algorithms have recently achieved remarkable successes
in various sequential decision making tasks, leveraging advances in methods for training …

被引用次数：55 相关文章所有 9 个版本

[PDF] stanford.edu

Using inaccurate models in reinforcement learning

P Abbeel, M Quigley, AY Ng - … of the 23rd international conference on …, 2006 - dl.acm.org

In the model-based policy search approach to reinforcement learning (RL), policies are
found using a model (or" simulator") of the Markov decision process. However, for high …

被引用次数：307 相关文章所有 16 个版本

[PDF] mlr.press

Learning dynamics and generalization in deep reinforcement learning

C Lyle, M Rowland, W Dabney… - International …, 2022 - proceedings.mlr.press

Solving a reinforcement learning (RL) problem poses two competing challenges: fitting a
potentially discontinuous value function, and generalizing well to new observations. In this …

被引用次数：20 相关文章所有 2 个版本

[PDF] mlr.press

Dueling network architectures for deep reinforcement learning

Z Wang, T Schaul, M Hessel… - International …, 2016 - proceedings.mlr.press

In recent years there have been many successes of using deep representations in
reinforcement learning. Still, many of these applications use conventional architectures …

被引用次数：4860 相关文章所有 8 个版本

[PDF] mlr.press

Near optimal reward-free reinforcement learning

Z Zhang, S Du, X Ji - International Conference on Machine …, 2021 - proceedings.mlr.press

We study the reward-free reinforcement learning framework, which is particularly suitable for
batch reinforcement learning and scenarios where one needs policies for multiple reward …

被引用次数：26 相关文章所有 3 个版本

Taxonomy of reinforcement learning algorithms

H Zhang, T Yu - Deep reinforcement learning: Fundamentals, research …, 2020 - Springer

In this chapter, we introduce and summarize the taxonomy and categories for reinforcement
learning (RL) algorithms. Figure 3.1 presents an overview of the typical and popular …

被引用次数：64 相关文章所有 3 个版本

[PDF] berkeley.edu

[图书][B] Shaping and policy search in reinforcement learning

AY Ng - 2003 - search.proquest.com

To make reinforcement learning algorithms run in a reasonable amount of time, it is
frequently necessary to use a well-chosen reward function that gives appropriate “hints” to …

被引用次数：153 相关文章所有 7 个版本

Fast reinforcement learning with generalized policy updates

Approximate policy-based accelerated deep reinforcement learning

Multi-step reinforcement learning: A unifying algorithm

Can increasing input dimensionality improve deep reinforcement learning?

Using inaccurate models in reinforcement learning

Learning dynamics and generalization in deep reinforcement learning

Dueling network architectures for deep reinforcement learning

Near optimal reward-free reinforcement learning

Taxonomy of reinforcement learning algorithms

[图书][B] Shaping and policy search in reinforcement learning

相关搜索

高级搜索

引用