Discovering temporally-aware reinforcement learning algorithms

MT Jackson, C Lu, L Kirsch, RT Lange… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in meta-learning have enabled the automatic discovery of novel
reinforcement learning algorithms parameterized by surrogate objective functions. To …

Can Learned Optimization Make Reinforcement Learning Less Difficult?

AD Goldie, C Lu, MT Jackson, S Whiteson… - arXiv preprint arXiv …, 2024 - arxiv.org
While reinforcement learning (RL) holds great potential for decision making in the real world,
it suffers from a number of unique difficulties which often need specific consideration. In …

Learning to optimize for reinforcement learning

Q Lan, AR Mahmood, S Yan, Z Xu - arXiv preprint arXiv:2302.01470, 2023 - arxiv.org
In recent years, by leveraging more data, computation, and diverse tasks, learned optimizers
have achieved remarkable success in supervised learning, outperforming classical hand …

Learning Curricula in Open-Ended Worlds

M Jiang - arXiv preprint arXiv:2312.03126, 2023 - arxiv.org
Deep reinforcement learning (RL) provides powerful methods for training optimal sequential
decision-making agents. As collecting real-world interactions can entail additional costs and …

Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks

M Matthews, M Beukman, C Lu, J Foerster - arXiv preprint arXiv …, 2024 - arxiv.org
While large models trained with self-supervised learning on offline datasets have shown
remarkable capabilities in text and image domains, achieving the same generalisation for …

Black box meta-learning intrinsic rewards for sparse-reward environments

O Pappalardo, R Ramele, JM Santos - arXiv preprint arXiv:2407.21546, 2024 - arxiv.org
Despite the successes and progress of deep reinforcement learning over the last decade,
several challenges remain that hinder its broader application. Some fundamental aspects to …

Procedural generation of meta-reinforcement learning tasks

T Miconi - arXiv preprint arXiv:2302.05583, 2023 - arxiv.org
Open-endedness stands to benefit from the ability to generate an infinite variety of diverse,
challenging environments. One particularly interesting type of challenge is meta-learning (" …

Reinforcing automated machine learning-bridging AutoML and reinforcement learning

T Eimer - 2024 - repo.uni-hannover.de
Reinforcement learning is a machine learning paradigm that allows learning through
interaction. It intertwines data collection and model training into a single problem statement …

Higher Order and Self-Referential Evolution for Population-based Methods

S Coward, C Lu, A Letcher, M Jiang… - … Exploring Meta-Learning … - openreview.net
Due to their simplicity and support of high levels of parallelism, evolutionary algorithms have
regained popularity in machine learning applications such as curriculum generation for …