关注
Ruan John de Kock
Ruan John de Kock
Research Engineer, InstaDeep Ltd
在 instadeep.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
Towards a standardised performance evaluation protocol for cooperative marl
R Gorsane, O Mahjoub, RJ de Kock, R Dubb, S Singh, A Pretorius
Advances in Neural Information Processing Systems 35, 5510-5521, 2022
372022
Jumanji: a diverse suite of scalable reinforcement learning environments in jax
C Bonnet, D Luo, D Byrne, S Surana, S Abramowitz, P Duckworth, ...
arXiv preprint arXiv:2306.09884, 2023
23*2023
Mava: A research framework for distributed multi-agent reinforcement learning
A Pretorius, K Tessera, AP Smit, C Formanek, SJ Grimbly, K Eloff, ...
arXiv e-prints, arXiv: 2107.01460, 2021
19*2021
Efficiently Quantifying Individual Agent Importance in Cooperative MARL
O Mahjoub, R de Kock, S Singh, W Khlifi, A Vall, K Tessera, A Pretorius
arXiv preprint arXiv:2312.08466, 2023
22023
On Diagnostics for Understanding Agent Training Behaviour in Cooperative MARL
W Khlifi, S Singh, O Mahjoub, R de Kock, A Vall, R Gorsane, A Pretorius
arXiv preprint arXiv:2312.08468, 2023
12023
How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning
S Singh, O Mahjoub, R de Kock, W Khlifi, A Vall, K Tessera, A Pretorius
arXiv preprint arXiv:2312.08463, 2023
2023
Generalisable Agents for Neural Network Optimisation
K Tessera, CR Tilbury, S Abramowitz, R de Kock, O Mahjoub, B Rosman, ...
arXiv preprint arXiv:2311.18598, 2023
2023
Jumanji: a diverse suite of scalable reinforcement learning environments in jax
C Bonnet, D Luo, D Byrne, S Surana, S Abramowitz, P Duckworth, ...
arXiv preprint arXiv:2306.09884, 2023
2023
Generalisable Agents for Neural Network Optimisation
CR Tilbury, S Abramowitz, RJ de Kock, O Mahjoub, B Rosman, S Hooker, ...
OPT 2023: Optimization for Machine Learning, 2023
2023
Generalisable Agents for Neural Network Optimisation
R de Kock, O Mahjoub, B Rosman, S Hooker, A Pretorius
Past to Present: Assessing Evaluation in Multi-agent Reinforcement Learning
S Singh, O Mahjoub, RJ de Kock, A Pretorius, W Khlifi, A Vall
Deep Learning Indaba 2023, 0
系统目前无法执行此操作,请稍后再试。
文章 1–11