A survey on model-based reinforcement learning
Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …
making problems via a trial-and-error approach. Errors are always undesirable in real-world …
Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning
The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …
Model-based multi-agent reinforcement learning: Recent progress and prospects
Significant advances have recently been achieved in Multi-Agent Reinforcement Learning
(MARL) which tackles sequential decision-making problems involving multiple participants …
(MARL) which tackles sequential decision-making problems involving multiple participants …
TTOpt: A maximum volume quantized tensor train-based optimization and its application to reinforcement learning
We present a novel procedure for optimization based on the combination of efficient
quantized tensor train representation and a generalized maximum matrix volume principle …
quantized tensor train representation and a generalized maximum matrix volume principle …
Uneven: Universal value exploration for multi-agent reinforcement learning
VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a
centralized action value function as a monotonic mixing of per-agent utilities. While this …
centralized action value function as a monotonic mixing of per-agent utilities. While this …
Shaq: Incorporating shapley value theory into multi-agent q-learning
Value factorisation is a useful technique for multi-agent reinforcement learning (MARL) in
global reward game, however, its underlying mechanism is not yet fully understood. This …
global reward game, however, its underlying mechanism is not yet fully understood. This …
One4all: Manipulate one agent to poison the cooperative multi-agent reinforcement learning
Reinforcement Learning (RL) has achieved a plenty of breakthroughs in the past decade.
Notably, existing studies have shown that RL is suffered from poisoning attack, which results …
Notably, existing studies have shown that RL is suffered from poisoning attack, which results …
Efficient model-based multi-agent reinforcement learning via optimistic equilibrium computation
PG Sessa, M Kamgarpour… - … Conference on Machine …, 2022 - proceedings.mlr.press
We consider model-based multi-agent reinforcement learning, where the environment
transition model is unknown and can only be learned via expensive interactions with the …
transition model is unknown and can only be learned via expensive interactions with the …
Dual self-awareness value decomposition framework without individual global max for cooperative MARL
Value decomposition methods have gained popularity in the field of cooperative multi-agent
reinforcement learning. However, almost all existing methods follow the principle of …
reinforcement learning. However, almost all existing methods follow the principle of …
Towards understanding cooperative multi-agent q-learning with value factorization
Value factorization is a popular and promising approach to scaling up multi-agent
reinforcement learning in cooperative settings, which balances the learning scalability and …
reinforcement learning in cooperative settings, which balances the learning scalability and …