- 学术资源搜索

A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arXiv preprint arXiv:2312.01058, 2023 - arxiv.org

Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

被引用次数：13 相关文章所有 2 个版本

[PDF] arxiv.org

Approximate shielding of atari agents for safe exploration

AW Goodall, F Belardinelli - arXiv preprint arXiv:2304.11104, 2023 - arxiv.org

Balancing exploration and conservatism in the constrained setting is an important problem if
we are to use reinforcement learning for meaningful tasks in the real world. In this paper, we …

被引用次数：3 相关文章所有 3 个版本

[PDF] arxiv.org

LuckyMera: a modular AI framework for building hybrid NetHack agents

L Quarantiello, S Marzeddu, A Guzzi… - Intelligenza …, 2023 - content.iospress.com

In the last few decades we have witnessed a significant development in Artificial Intelligence
(AI) thanks to the availability of a variety of testbeds, mostly based on simulated …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem

M Wołczyk, B Cupiał, M Ostaszewski… - arXiv preprint arXiv …, 2024 - arxiv.org

Fine-tuning is a widespread technique that allows practitioners to transfer pre-trained
capabilities, as recently showcased by the successful applications of foundation models …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Multiagent Continual Coordination via Progressive Task Contextualization

L Yuan, L Li, Z Zhang, F Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Cooperative multiagent reinforcement learning (MARL) has attracted significant attention
and has the potential for many real-world applications. Previous arts mainly focus on …

被引用次数：2 相关文章所有 5 个版本

[PDF] arxiv.org

Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory

Z Zhang, C Chow, Y Zhang, Y Sun, H Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Lifelong reinforcement learning (RL) has been developed as a paradigm for extending
single-task RL to more realistic, dynamic settings. In lifelong RL, the" life" of an RL agent is …

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement

Z Wang, L Zhang, W Wu, Y Zhu, D Zhao… - arXiv preprint arXiv …, 2024 - arxiv.org

A longstanding goal of artificial general intelligence is highly capable generalists that can
learn from diverse experiences and generalize to unseen tasks. The language and vision …

The Role of Forgetting in Fine-Tuning Reinforcement Learning Models

M Wolczyk, B Cupiał, M Ostaszewski, M Bortkiewicz… - 2023 - openreview.net

Fine-tuning is a widespread technique that allows practitioners to transfer pre-trained
capabilities, as recently showcased by the successful applications of foundation models …

被引用次数：2 相关文章所有 2 个版本

[PDF] ox.ac.uk

Retaining skills under distribution shifts: sequential Bayesian inference, reinforcement learning and applications

SC Kessler - 2023 - ora.ox.ac.uk

Modern machine learning models, such as neural networks which are the focus of this
thesis, have been shown to be extremely powerful tools for learning function mappings from …

[PDF] nju.edu.cn

[PDF][PDF] Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal

L Li, R Chen, Z Zhang, Z Wu, YC Li, C Guan, Y Yu… - lamda.nju.edu.cn

Multi-objective reinforcement learning (MORL) approaches address real-world problems
with multiple objectives by learning policies maximizing returns weighted by different user …