Illuminating generalization in deep reinforcement learning through procedural level generation....

K Souchleris, GK Sidiropoulos, GA Papakostas - Applied Sciences, 2023 - mdpi.com

This article focuses on the recent advances in the field of reinforcement learning (RL) as well
as the present state–of–the–art applications in games. First, we give a general panorama of …

被引用次数：20 相关文章所有 2 个版本

[PDF] neurips.cc

Why generalization in rl is difficult: Epistemic pomdps and implicit partial observability

D Ghosh, J Rahme, A Kumar, A Zhang… - Advances in neural …, 2021 - proceedings.neurips.cc

Generalization is a central challenge for the deployment of reinforcement learning (RL)
systems in the real world. In this paper, we show that the sequential structure of the RL …

被引用次数：106 相关文章所有 10 个版本

[PDF] mlr.press

Decoupling value and policy for generalization in reinforcement learning

R Raileanu, R Fergus - International Conference on …, 2021 - proceedings.mlr.press

Standard deep reinforcement learning algorithms use a shared representation for the policy
and value function, especially when training directly from images. However, we argue that …

被引用次数：103 相关文章所有 6 个版本

[PDF] neurips.cc

Automatic data augmentation for generalization in reinforcement learning

R Raileanu, M Goldstein, D Yarats… - Advances in …, 2021 - proceedings.neurips.cc

Deep reinforcement learning (RL) agents often fail to generalize beyond their training
environments. To alleviate this problem, recent work has proposed the use of data …

被引用次数：101 相关文章所有 7 个版本

[PDF] neurips.cc

On the importance of exploration for generalization in reinforcement learning

Y Jiang, JZ Kolter, R Raileanu - Advances in Neural …, 2024 - proceedings.neurips.cc

Existing approaches for improving generalization in deep reinforcement learning (RL) have
mostly focused on representation learning, neglecting RL-specific aspects such as …

被引用次数：9 相关文章所有 5 个版本

[PDF] arxiv.org

Secant: Self-expert cloning for zero-shot generalization of visual policies

L Fan, G Wang, DA Huang, Z Yu, L Fei-Fei… - arXiv preprint arXiv …, 2021 - arxiv.org

Generalization has been a long-standing challenge for reinforcement learning (RL). Visual
RL, in particular, can be easily distracted by irrelevant factors in high-dimensional …

被引用次数：61 相关文章所有 6 个版本

[PDF] mdpi.com

Transformer Decoder-Based Enhanced Exploration Method to Alleviate Initial Exploration Problems in Reinforcement Learning

D Kyoung, Y Sung - Sensors, 2023 - mdpi.com

In reinforcement learning, the epsilon (ε)-greedy strategy is commonly employed as an
exploration technique This method, however, leads to extensive initial exploration and …

被引用次数：1 相关文章所有 8 个版本

[PDF] arxiv.org

The Generalization Gap in Offline Reinforcement Learning

I Mediratta, Q You, M Jiang, R Raileanu - arXiv preprint arXiv:2312.05742, 2023 - arxiv.org

Despite recent progress in offline learning, these methods are still trained and tested on the
same environment. In this paper, we compare the generalization abilities of widely used …

被引用次数：1 相关文章所有 3 个版本

[PDF] mdpi.com

Continuous Autonomous Ship Learning Framework for Human Policies on Simulation

J Kim, J Park, K Cho - Applied Sciences, 2022 - mdpi.com

Considering autonomous navigation in busy marine traffic environments (including harbors
and coasts), major study issues to be solved for autonomous ships are avoidance of static …

被引用次数：4 相关文章所有 6 个版本

[PDF] openreview.net

A Study of Generalization in Offline Reinforcement Learning

I Mediratta, Q You, M Jiang… - NeurIPS 2023 Workshop …, 2023 - openreview.net

Despite the recent progress in offline reinforcement learning (RL) algorithms, agents are
usually trained and tested on the same environment. In this paper, we perform an in-depth …

被引用次数：1 相关文章