Best arm identification: A unified approach to fixed budget and fixed confidence V Gabillon, M Ghavamzadeh, A Lazaric NIPS, Neural Information Processing Systems, 2012 | 346 | 2012 |
Approximate modified policy iteration and its application to the game of Tetris. B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist JMLR, Journal of Machine Learning Research 16, 2015 | 154 | 2015 |
Multi-bandit best arm identification V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck NIPS, Neural Information Processing Systems, 2011 | 129 | 2011 |
Approximate dynamic programming finally performs well in the game of Tetris V Gabillon, M Ghavamzadeh, B Scherrer NIPS, Neural Information Processing systems, 2013 | 78 | 2013 |
Adaptive submodular maximization in bandit setting V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan NIPS, Neural Information Processing Systems, 2013 | 67 | 2013 |
Approximate modified policy iteration B Scherrer, V Gabillon, M Ghavamzadeh, M Geist ICML, International Conference on Machine Learning, 2012 | 63 | 2012 |
Best of both worlds: Stochastic & adversarial best-arm identification Y Abbasi-Yadkori, P Bartlett, V Gabillon, A Malek, M Valko Conference on learning theory, 918-949, 2018 | 54 | 2018 |
Improved learning complexity in combinatorial pure exploration bandits V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett AISTATS, Artificial Intelligence and Statistics, 2016 | 45 | 2016 |
A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption PL Bartlett, V Gabillon, M Valko ALT, Algorithmic Learning Theory, 2019 | 39 | 2019 |
MANAS: Multi-agent neural architecture search V Lopes, FM Carlucci, PM Esperança, M Singh, V Gabillon, A Yang, H Xu, ... arXiv preprint arXiv:1909.01051, 2019 | 31* | 2019 |
Classification-based policy iteration with a critic V Gabillon, A Lazaric, M Ghavamzadeh, B Scherrer ICML, International Conference on Machine Learning, 2011 | 30 | 2011 |
Hit-and-Run for Sampling and Planning in Non-Convex Spaces Y Abbasi-Yadkori, PL Bartlett, V Gabillon, A Malek AISTATS, Artificial Intelligence and Statistics, 2017 | 26 | 2017 |
Near Minimax Optimal Players for the Finite-Time 3-Expert Prediction Problem Y Abbasi-Yadkori, PL Bartlett, V Gabillon NIPS, Neural Information Processing Systems, 2017 | 17 | 2017 |
Large-Scale Optimistic Adaptive Submodularity. V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan AAAI, Association for the Advancement of Artificial Intelligence, 2014 | 17 | 2014 |
Rollout allocation strategies for classification-based policy iteration V Gabillon, A Lazaric, M Ghavamzadeh Workshop on Reinforcement Learning and Search in Very Large Spaces, 2010 | 14 | 2010 |
Adaptive multi-fidelity optimization with fast learning rates C Fiegel, V Gabillon, M Valko International Conference on Artificial Intelligence and Statistics, 3493-3502, 2020 | 7 | 2020 |
Derivative-Free & Order-Robust Optimisation V Gabillon, R Tutunov, M Valko, HB Ammar AISTATS, Artificial Intelligence and Statistics, 2020 | 7* | 2020 |
Scale-free adaptive planning for deterministic dynamics & discounted rewards P Bartlett, V Gabillon, J Healey, M Valko ICML, International Conference on Machine Learning, 495-504, 2019 | 7 | 2019 |
Multi-media content-recommender system that learns how to elicit user preferences VF Gabillon, B Kveton, B Eriksson US Patent App. 14/489,703, 2016 | 5 | 2016 |
Machine learning tools for online advertisement V Gabillon Technical report, INRIA Lille, France, 2009 | 5 | 2009 |