关注
Matteo Gallici
标题
引用次数
引用次数
年份
Jaxmarl: Multi-agent rl environments in jax
A Rutherford, B Ellis, M Gallici, J Cook, A Lupu, G Ingvarsson, T Willi, ...
arXiv preprint arXiv:2311.10090, 2023
202023
TransfQMix: Transformers for leveraging the graph structure of multi-agent reinforcement learning problems
M Gallici, M Martin, I Masmitja
arXiv preprint arXiv:2301.05334, 2023
82023
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
A Rutherford, B Ellis, M Gallici, J Cook, A Lupu, G Ingvarsson, T Willi, ...
Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024
62024
Simplifying Deep Temporal Difference Learning
M Gallici, M Fellows, B Ellis, B Pou, I Masmitja, JN Foerster, M Martin
arXiv preprint arXiv:2407.04811, 2024
12024
系统目前无法执行此操作,请稍后再试。
文章 1–4