Discovering general reinforcement learning algorithms with adversarial environment design

MT Jackson, C Lu, L Kirsch, RT Lange… - arXiv preprint arXiv …, 2024 - arxiv.org

Recent advancements in meta-learning have enabled the automatic discovery of novel
reinforcement learning algorithms parameterized by surrogate objective functions. To …

被引用次数：16 相关文章所有 4 个版本

[PDF] arxiv.org

Can Learned Optimization Make Reinforcement Learning Less Difficult?

AD Goldie, C Lu, MT Jackson, S Whiteson… - arXiv preprint arXiv …, 2024 - arxiv.org

While reinforcement learning (RL) holds great potential for decision making in the real world,
it suffers from a number of unique difficulties which often need specific consideration. In …

被引用次数：1 相关文章

[PDF] arxiv.org

Learning to optimize for reinforcement learning

Q Lan, AR Mahmood, S Yan, Z Xu - arXiv preprint arXiv:2302.01470, 2023 - arxiv.org

In recent years, by leveraging more data, computation, and diverse tasks, learned optimizers
have achieved remarkable success in supervised learning, outperforming classical hand …

被引用次数：8 相关文章所有 3 个版本

[PDF] arxiv.org

Learning Curricula in Open-Ended Worlds

M Jiang - arXiv preprint arXiv:2312.03126, 2023 - arxiv.org

Deep reinforcement learning (RL) provides powerful methods for training optimal sequential
decision-making agents. As collecting real-world interactions can entail additional costs and …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks

M Matthews, M Beukman, C Lu, J Foerster - arXiv preprint arXiv …, 2024 - arxiv.org

While large models trained with self-supervised learning on offline datasets have shown
remarkable capabilities in text and image domains, achieving the same generalisation for …

Black box meta-learning intrinsic rewards for sparse-reward environments

O Pappalardo, R Ramele, JM Santos - arXiv preprint arXiv:2407.21546, 2024 - arxiv.org

Despite the successes and progress of deep reinforcement learning over the last decade,
several challenges remain that hinder its broader application. Some fundamental aspects to …

Procedural generation of meta-reinforcement learning tasks

T Miconi - arXiv preprint arXiv:2302.05583, 2023 - arxiv.org

Open-endedness stands to benefit from the ability to generate an infinite variety of diverse,
challenging environments. One particularly interesting type of challenge is meta-learning (" …

被引用次数：1 相关文章所有 3 个版本

[PDF] uni-hannover.de

Reinforcing automated machine learning-bridging AutoML and reinforcement learning

T Eimer - 2024 - repo.uni-hannover.de

Reinforcement learning is a machine learning paradigm that allows learning through
interaction. It intertwines data collection and model training into a single problem statement …

Higher Order and Self-Referential Evolution for Population-based Methods

S Coward, C Lu, A Letcher, M Jiang… - … Exploring Meta-Learning … - openreview.net

Due to their simplicity and support of high levels of parallelism, evolutionary algorithms have
regained popularity in machine learning applications such as curriculum generation for …