关注
Carlo Alfano
Carlo Alfano
在 linacre.ox.ac.uk 的电子邮件经过验证
标题
引用次数
引用次数
年份
A novel framework for policy mirror descent with general parameterization and linear convergence
C Alfano, R Yuan, P Rebeschini
Advances in Neural Information Processing Systems 36, 2024
172024
Linear convergence for natural policy gradient with log-linear policy parametrization
C Alfano, P Rebeschini
arXiv preprint arXiv:2209.15382, 2022
142022
Dimension-free rates for natural policy gradient in multi-agent reinforcement learning
C Alfano, P Rebeschini
arXiv preprint arXiv:2109.11692, 2021
72021
Meta-learning the mirror map in policy mirror descent
C Alfano, S Towers, S Sapora, C Lu, P Rebeschini
arXiv preprint arXiv:2402.05187, 2024
22024
系统目前无法执行此操作,请稍后再试。
文章 1–4