A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer
Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

B Ellis, J Cook, S Moalla… - Advances in …, 2024 - proceedings.neurips.cc
The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …

Model-based multi-agent reinforcement learning: Recent progress and prospects

X Wang, Z Zhang, W Zhang - arXiv preprint arXiv:2203.10603, 2022 - arxiv.org
Significant advances have recently been achieved in Multi-Agent Reinforcement Learning
(MARL) which tackles sequential decision-making problems involving multiple participants …

TTOpt: A maximum volume quantized tensor train-based optimization and its application to reinforcement learning

K Sozykin, A Chertkov, R Schutski… - Advances in …, 2022 - proceedings.neurips.cc
We present a novel procedure for optimization based on the combination of efficient
quantized tensor train representation and a generalized maximum matrix volume principle …

Uneven: Universal value exploration for multi-agent reinforcement learning

T Gupta, A Mahajan, B Peng… - International …, 2021 - proceedings.mlr.press
VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a
centralized action value function as a monotonic mixing of per-agent utilities. While this …

Shaq: Incorporating shapley value theory into multi-agent q-learning

J Wang, Y Zhang, Y Gu, TK Kim - Advances in Neural …, 2022 - proceedings.neurips.cc
Value factorisation is a useful technique for multi-agent reinforcement learning (MARL) in
global reward game, however, its underlying mechanism is not yet fully understood. This …

One4all: Manipulate one agent to poison the cooperative multi-agent reinforcement learning

H Zheng, X Li, J Chen, J Dong, Y Zhang, C Lin - Computers & Security, 2023 - Elsevier
Reinforcement Learning (RL) has achieved a plenty of breakthroughs in the past decade.
Notably, existing studies have shown that RL is suffered from poisoning attack, which results …

Efficient model-based multi-agent reinforcement learning via optimistic equilibrium computation

PG Sessa, M Kamgarpour… - … Conference on Machine …, 2022 - proceedings.mlr.press
We consider model-based multi-agent reinforcement learning, where the environment
transition model is unknown and can only be learned via expensive interactions with the …

Dual self-awareness value decomposition framework without individual global max for cooperative MARL

Z Xu, B Zhang, G Zhou, Z Zhang… - Advances in Neural …, 2023 - proceedings.neurips.cc
Value decomposition methods have gained popularity in the field of cooperative multi-agent
reinforcement learning. However, almost all existing methods follow the principle of …

Towards understanding cooperative multi-agent q-learning with value factorization

J Wang, Z Ren, B Han, J Ye… - Advances in Neural …, 2021 - proceedings.neurips.cc
Value factorization is a popular and promising approach to scaling up multi-agent
reinforcement learning in cooperative settings, which balances the learning scalability and …