Regret analysis of stochastic and nonstochastic multi-armed bandit problems S Bubeck, N Cesa-Bianchi Foundations and Trends in Machine Learning 5, 1-122, 2012 | 3093 | 2012 |
Sparks of artificial general intelligence: Early experiments with gpt-4 S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ... arXiv preprint arXiv:2303.12712, 2023 | 2459 | 2023 |
Convex optimization: Algorithms and complexity S Bubeck Foundations and Trends in Machine Learning 8, 231-357, 2014 | 2382 | 2014 |
Best arm identification in multi-armed bandits JY Audibert, S Bubeck, R Munos COLT 2010, 2010 | 924 | 2010 |
Is Q-learning provably efficient? C Jin, Z Allen-Zhu, S Bubeck, MI Jordan Advances in neural information processing systems 31, 2018 | 904 | 2018 |
Benefits, limits, and risks of GPT-4 as an AI chatbot for medicine P Lee, S Bubeck, J Petro New England Journal of Medicine 388 (13), 1233-1239, 2023 | 799 | 2023 |
Pure exploration in multi-armed bandits problems S Bubeck, R Munos, G Stoltz Algorithmic Learning Theory, 23-37, 2009 | 595 | 2009 |
Provably robust deep learning via adversarially trained smoothed classifiers H Salman, J Li, I Razenshteyn, P Zhang, H Zhang, S Bubeck, G Yang Advances in neural information processing systems 32, 2019 | 557 | 2019 |
X-armed bandits S Bubeck, R Munos, G Stoltz, C Szepesvári Journal of Machine Learning Research 12, 1587-1627, 2011 | 498 | 2011 |
Minimax policies for adversarial and stochastic bandits JY Audibert, S Bubeck COLT 2009, 2009 | 491 | 2009 |
lil'UCB: An Optimal Exploration Algorithm for Multi-Armed Bandits K Jamieson, M Malloy, R Nowak, S Bubeck COLT 2014, 2013 | 464 | 2013 |
Optimal algorithms for smooth and strongly convex distributed optimization in networks K Scaman, F Bach, S Bubeck, YT Lee, L Massoulié international conference on machine learning, 3027-3036, 2017 | 352 | 2017 |
Pure exploration in finitely-armed and continuous-armed bandits S Bubeck, R Munos, G Stoltz Theoretical Computer Science 412, 1832-1852, 2010 | 310 | 2010 |
Bandits with heavy tail S Bubeck, N Cesa-Bianchi, G Lugosi IEEE Transactions on Information Theory 59 (11), 7711-7717, 2013 | 307 | 2013 |
Online Optimization in X-Armed Bandits S Bubeck, G Stoltz, C Szepesvári, R Munos Advances in Neural Information Processing Systems 21, 201-208, 2008 | 270 | 2008 |
Regret bounds and minimax policies under partial monitoring JY Audibert, S Bubeck The Journal of Machine Learning Research 11, 2635-2686, 2010 | 268 | 2010 |
The best of both worlds: Stochastic and adversarial bandits S Bubeck, A Slivkins COLT 2012, 2012 | 256 | 2012 |
Regret in online combinatorial optimization JY Audibert, S Bubeck, G Lugosi Mathematics of Operations Research 39 (1), 31-45, 2014 | 249 | 2014 |
Textbooks are all you need S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ... arXiv preprint arXiv:2306.11644, 2023 | 245 | 2023 |
Adversarial examples from computational constraints S Bubeck, YT Lee, E Price, I Razenshteyn International Conference on Machine Learning, 831-840, 2019 | 236 | 2019 |