Tesseract: Tensorised actors for multi-agent reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer

Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

被引用次数：102 相关文章所有 4 个版本

[PDF] neurips.cc

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

B Ellis, J Cook, S Moalla… - Advances in …, 2024 - proceedings.neurips.cc

The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …

被引用次数：85 相关文章所有 6 个版本

[PDF] arxiv.org

Model-based multi-agent reinforcement learning: Recent progress and prospects

X Wang, Z Zhang, W Zhang - arXiv preprint arXiv:2203.10603, 2022 - arxiv.org

Significant advances have recently been achieved in Multi-Agent Reinforcement Learning
(MARL) which tackles sequential decision-making problems involving multiple participants …

被引用次数：38 相关文章所有 2 个版本

[PDF] neurips.cc

TTOpt: A maximum volume quantized tensor train-based optimization and its application to reinforcement learning

K Sozykin, A Chertkov, R Schutski… - Advances in …, 2022 - proceedings.neurips.cc

We present a novel procedure for optimization based on the combination of efficient
quantized tensor train representation and a generalized maximum matrix volume principle …

被引用次数：34 相关文章所有 8 个版本

[PDF] mlr.press

Uneven: Universal value exploration for multi-agent reinforcement learning

T Gupta, A Mahajan, B Peng… - International …, 2021 - proceedings.mlr.press

VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a
centralized action value function as a monotonic mixing of per-agent utilities. While this …

被引用次数：54 相关文章所有 11 个版本

[PDF] neurips.cc

Shaq: Incorporating shapley value theory into multi-agent q-learning

J Wang, Y Zhang, Y Gu, TK Kim - Advances in Neural …, 2022 - proceedings.neurips.cc

Value factorisation is a useful technique for multi-agent reinforcement learning (MARL) in
global reward game, however, its underlying mechanism is not yet fully understood. This …

被引用次数：28 相关文章所有 14 个版本

One4all: Manipulate one agent to poison the cooperative multi-agent reinforcement learning

H Zheng, X Li, J Chen, J Dong, Y Zhang, C Lin - Computers & Security, 2023 - Elsevier

Reinforcement Learning (RL) has achieved a plenty of breakthroughs in the past decade.
Notably, existing studies have shown that RL is suffered from poisoning attack, which results …

被引用次数：10 相关文章所有 2 个版本

[PDF] mlr.press

Efficient model-based multi-agent reinforcement learning via optimistic equilibrium computation

PG Sessa, M Kamgarpour… - … Conference on Machine …, 2022 - proceedings.mlr.press

We consider model-based multi-agent reinforcement learning, where the environment
transition model is unknown and can only be learned via expensive interactions with the …

被引用次数：12 相关文章所有 7 个版本

[PDF] neurips.cc

Dual self-awareness value decomposition framework without individual global max for cooperative MARL

Z Xu, B Zhang, G Zhou, Z Zhang… - Advances in Neural …, 2023 - proceedings.neurips.cc

Value decomposition methods have gained popularity in the field of cooperative multi-agent
reinforcement learning. However, almost all existing methods follow the principle of …

被引用次数：2 相关文章所有 4 个版本

[PDF] neurips.cc

Towards understanding cooperative multi-agent q-learning with value factorization

J Wang, Z Ren, B Han, J Ye… - Advances in Neural …, 2021 - proceedings.neurips.cc

Value factorization is a popular and promising approach to scaling up multi-agent
reinforcement learning in cooperative settings, which balances the learning scalability and …

被引用次数：38 相关文章所有 6 个版本