Rotation, translation, and cropping for zero-shot generalization

DJ Strouse, K McKee, M Botvinick… - Advances in …, 2021 - proceedings.neurips.cc

Collaborating with humans requires rapidly adapting to their individual strengths,
weaknesses, and preferences. Unfortunately, most standard multi-agent reinforcement …

被引用次数：158 相关文章所有 6 个版本

[PDF] neurips.cc

Emergent complexity and zero-shot transfer via unsupervised environment design

M Dennis, N Jaques, E Vinitsky… - Advances in neural …, 2020 - proceedings.neurips.cc

A wide range of reinforcement learning (RL) problems---including robustness, transfer
learning, unsupervised RL, and emergent complexity---require specifying a distribution of …

被引用次数：206 相关文章所有 13 个版本

[PDF] arxiv.org

Contrastive behavioral similarity embeddings for generalization in reinforcement learning

R Agarwal, MC Machado, PS Castro… - arXiv preprint arXiv …, 2021 - arxiv.org

Reinforcement learning methods trained on few environments rarely learn policies that
generalize to unseen environments. To improve generalization, we incorporate the inherent …

被引用次数：195 相关文章所有 11 个版本

[PDF] neurips.cc

The nethack learning environment

H Küttler, N Nardelli, A Miller… - Advances in …, 2020 - proceedings.neurips.cc

Abstract Progress in Reinforcement Learning (RL) algorithms goes hand-in-hand with the
development of challenging environments that test the limits of current methods. While …

被引用次数：166 相关文章所有 8 个版本

[PDF] mlr.press

Decoupling value and policy for generalization in reinforcement learning

R Raileanu, R Fergus - International Conference on …, 2021 - proceedings.mlr.press

Standard deep reinforcement learning algorithms use a shared representation for the policy
and value function, especially when training directly from images. However, we argue that …

被引用次数：103 相关文章所有 6 个版本

[PDF] arxiv.org

Learning with amigo: Adversarially motivated intrinsic goals

A Campero, R Raileanu, H Küttler… - arXiv preprint arXiv …, 2020 - arxiv.org

A key challenge for reinforcement learning (RL) consists of learning in environments with
sparse extrinsic rewards. In contrast to current RL methods, humans are able to learn new …

被引用次数：149 相关文章所有 5 个版本

[PDF] acm.org

Neuroevolution of self-interpretable agents

Y Tang, D Nguyen, D Ha - Proceedings of the 2020 Genetic and …, 2020 - dl.acm.org

Inattentional blindness is the psychological phenomenon that causes one to miss things in
plain sight. It is a consequence of the selective attention in perception that lets us remain …

被引用次数：132 相关文章所有 5 个版本

[PDF] neurips.cc

Automatic data augmentation for generalization in reinforcement learning

R Raileanu, M Goldstein, D Yarats… - Advances in …, 2021 - proceedings.neurips.cc

Deep reinforcement learning (RL) agents often fail to generalize beyond their training
environments. To alleviate this problem, recent work has proposed the use of data …

被引用次数：101 相关文章所有 7 个版本

[PDF] neurips.cc

On the importance of exploration for generalization in reinforcement learning

Y Jiang, JZ Kolter, R Raileanu - Advances in Neural …, 2024 - proceedings.neurips.cc

Existing approaches for improving generalization in deep reinforcement learning (RL) have
mostly focused on representation learning, neglecting RL-specific aspects such as …

被引用次数：9 相关文章所有 5 个版本

[PDF] springer.com

Warmth and competence in human-agent cooperation

KR McKee, X Bai, ST Fiske - Autonomous Agents and Multi-Agent Systems, 2024 - Springer

Interaction and cooperation with humans are overarching aspirations of artificial intelligence
research. Recent studies demonstrate that AI agents trained with deep reinforcement …

被引用次数：30 相关文章所有 8 个版本