Victor Gabillon: Academic Profile

Citations

	All	Since 2019
Citations	1145	766
h-index	14	12
i10-index	15	13

Citations per year

Year	Citations
2012	18
2013	30
2014	50
2015	59
2016	74
2017	72
2018	68
2019	103
2020	128
2021	161
2022	135
2023	141
2024	96

Public access

Available	7 articles
Not available	0 articles

Based on funding mandates

Co-authors

Mohammad Ghavamzadeh	Amazon	Verified email at amazon.com
Alessandro Lazaric	Research Scientist, Facebook Artificial Intelligence Research	Verified email at inria.fr
Peter Bartlett	Professor, EECS and Statistics, UC Berkeley	Verified email at cs.berkeley.edu
Michal Valko	Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind	Verified email at meta.com
Matthieu Geist	Cohere (ex Google, on leave of Professor, Université de Lorraine)	Verified email at univ-lorraine.fr
Yasin Abbasi Yadkori	Google DeepMind	Verified email at google.com
Brian Eriksson	Adobe	Verified email at adobe.com
Branislav Kveton	Amazon	Verified email at amazon.com
S Muthukrishnan	Rutgers Univ	Verified email at cs.rutgers.edu
Zheng Wen	Google DeepMind	Verified email at google.com
Alan Malek	MIT	Verified email at mit.edu
Sebastien Bubeck	VP GenAI Research, Microsoft AI	Verified email at microsoft.com
Fabio Maria Carlucci	Meta	Verified email at meta.com
Antoine Yang	Google DeepMind	Verified email at google.com
Pedro M Esperança	Machine Learning Engineer	Verified email at samsung.com
Hang Xu	Researcher, Huawei Noah's Ark Lab	Verified email at huawei.com
Ronald Ortner	Montanuniversität Leoben	Verified email at unileoben.ac.at
Jun Wang	Professor, Computer Science, University College London	Verified email at cs.ucl.ac.uk
Jennifer Healey	Senior Research Scientist, Adobe	Verified email at adobe.com
Haitham Bou-Ammar	RL-Team Leader, BO-Team Leader, MAS-Team Leader @ Huawei London & H. Assistant Professor @ UCL	Verified email at huawei.com

Victor Gabillon

Unknown affiliation

No verified email address

Research interests: machine learning, learning theory, reinforcement learning, online learning, multi-armed bandits


Title	Cited by	Year
Best arm identification: A unified approach to fixed budget and fixed confidence. V Gabillon, M Ghavamzadeh, A Lazaric. NIPS, Neural Information Processing Systems, 2012	346	2012
Approximate modified policy iteration and its application to the game of Tetris. B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist. JMLR, Journal of Machine Learning Research 16, 2015	154	2015
Multi-bandit best arm identification. V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck. NIPS, Neural Information Processing Systems, 2011	129	2011
Approximate dynamic programming finally performs well in the game of Tetris. V Gabillon, M Ghavamzadeh, B Scherrer. NIPS, Neural Information Processing Systems, 2013	78	2013
Adaptive submodular maximization in bandit setting. V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan. NIPS, Neural Information Processing Systems, 2013	67	2013
Approximate modified policy iteration. B Scherrer, V Gabillon, M Ghavamzadeh, M Geist. ICML, International Conference on Machine Learning, 2012	63	2012
Best of both worlds: Stochastic & adversarial best-arm identification. Y Abbasi-Yadkori, P Bartlett, V Gabillon, A Malek, M Valko. Conference on Learning Theory, 918-949, 2018	54	2018
Improved learning complexity in combinatorial pure exploration bandits. V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett. AISTATS, Artificial Intelligence and Statistics, 2016	45	2016
A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption. PL Bartlett, V Gabillon, M Valko. ALT, Algorithmic Learning Theory, 2019	39	2019
MANAS: Multi-agent neural architecture search. V Lopes, FM Carlucci, PM Esperança, M Singh, V Gabillon, A Yang, H Xu, ... arXiv preprint arXiv:1909.01051, 2019	31*	2019
Classification-based policy iteration with a critic. V Gabillon, A Lazaric, M Ghavamzadeh, B Scherrer. ICML, International Conference on Machine Learning, 2011	30	2011
Hit-and-Run for Sampling and Planning in Non-Convex Spaces. Y Abbasi-Yadkori, PL Bartlett, V Gabillon, A Malek. AISTATS, Artificial Intelligence and Statistics, 2017	26	2017
Near Minimax Optimal Players for the Finite-Time 3-Expert Prediction Problem. Y Abbasi-Yadkori, PL Bartlett, V Gabillon. NIPS, Neural Information Processing Systems, 2017	17	2017
Large-Scale Optimistic Adaptive Submodularity. V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan. AAAI, Association for the Advancement of Artificial Intelligence, 2014	17	2014
Rollout allocation strategies for classification-based policy iteration. V Gabillon, A Lazaric, M Ghavamzadeh. Workshop on Reinforcement Learning and Search in Very Large Spaces, 2010	14	2010
Adaptive multi-fidelity optimization with fast learning rates. C Fiegel, V Gabillon, M Valko. International Conference on Artificial Intelligence and Statistics, 3493-3502, 2020	7	2020
Derivative-Free & Order-Robust Optimisation. V Gabillon, R Tutunov, M Valko, HB Ammar. AISTATS, Artificial Intelligence and Statistics, 2020	7*	2020
Scale-free adaptive planning for deterministic dynamics & discounted rewards. P Bartlett, V Gabillon, J Healey, M Valko. ICML, International Conference on Machine Learning, 495-504, 2019	7	2019
Multi-media content-recommender system that learns how to elicit user preferences. VF Gabillon, B Kveton, B Eriksson. US Patent App. 14/489,703, 2016	5	2016
Machine learning tools for online advertisement. V Gabillon. Technical report, INRIA Lille, France, 2009	5	2009
