A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arXiv preprint arXiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

Approximate shielding of atari agents for safe exploration

AW Goodall, F Belardinelli - arXiv preprint arXiv:2304.11104, 2023 - arxiv.org
Balancing exploration and conservatism in the constrained setting is an important problem if
we are to use reinforcement learning for meaningful tasks in the real world. In this paper, we …

LuckyMera: a modular AI framework for building hybrid NetHack agents

L Quarantiello, S Marzeddu, A Guzzi… - Intelligenza …, 2023 - content.iospress.com
In the last few decades we have witnessed a significant development in Artificial Intelligence
(AI) thanks to the availability of a variety of testbeds, mostly based on simulated …

Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem

M Wołczyk, B Cupiał, M Ostaszewski… - arXiv preprint arXiv …, 2024 - arxiv.org
Fine-tuning is a widespread technique that allows practitioners to transfer pre-trained
capabilities, as recently showcased by the successful applications of foundation models …

Multiagent Continual Coordination via Progressive Task Contextualization

L Yuan, L Li, Z Zhang, F Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Cooperative multiagent reinforcement learning (MARL) has attracted significant attention
and has the potential for many real-world applications. Previous arts mainly focus on …

Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory

Z Zhang, C Chow, Y Zhang, Y Sun, H Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Lifelong reinforcement learning (RL) has been developed as a paradigm for extending
single-task RL to more realistic, dynamic settings. In lifelong RL, the" life" of an RL agent is …

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement

Z Wang, L Zhang, W Wu, Y Zhu, D Zhao… - arXiv preprint arXiv …, 2024 - arxiv.org
A longstanding goal of artificial general intelligence is highly capable generalists that can
learn from diverse experiences and generalize to unseen tasks. The language and vision …

The Role of Forgetting in Fine-Tuning Reinforcement Learning Models

M Wolczyk, B Cupiał, M Ostaszewski, M Bortkiewicz… - 2023 - openreview.net
Fine-tuning is a widespread technique that allows practitioners to transfer pre-trained
capabilities, as recently showcased by the successful applications of foundation models …

Retaining skills under distribution shifts: sequential Bayesian inference, reinforcement learning and applications

SC Kessler - 2023 - ora.ox.ac.uk
Modern machine learning models, such as neural networks which are the focus of this
thesis, have been shown to be extremely powerful tools for learning function mappings from …

[PDF][PDF] Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal

L Li, R Chen, Z Zhang, Z Wu, YC Li, C Guan, Y Yu… - lamda.nju.edu.cn
Multi-objective reinforcement learning (MORL) approaches address real-world problems
with multiple objectives by learning policies maximizing returns weighted by different user …