QVMix and QVMix-Max: Extending the Deep Quality-Value Family of Algorithms to Cooperative Multi-Agent Reinforcement Learning P Leroy, D Ernst, P Geurts, G Louppe, J Pisane, M Sabatelli arXiv preprint arXiv:2012.12062, 2020 | 6 | 2020 |
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL P Leroy, PG Morato, J Pisane, A Kolios, D Ernst Thirty-seventh Conference on Neural Information Processing Systems Datasets …, 2023 | 2 | 2023 |
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition P Leroy, J Pisane, D Ernst arXiv preprint arXiv:2211.11886, 2022 | 2 | 2022 |
Contributions to Multi-agent Reinforcement Learning P Leroy ULiège-Université de Liège [School of Engineering], Liège, Belgium, 2024 | | 2024 |
Master thesis: Automatic defect recognition in x-ray imaging by machine learning P Leroy Université de Liège, Liège, Belgique, 2018 | | 2018 |