Improved algorithms for linear stochastic bandits Y Abbasi-Yadkori, C Szepesvári, D Pál Advances in Neural Information Processing Systems, 2312-2320, 2011 | 1966 | 2011 |
Regret Bounds for the Adaptive Control of Linear Quadratic Systems. Y Abbasi-Yadkori, C Szepesvári COLT, 1-26, 2011 | 424 | 2011 |
Fast approximate nearest-neighbor search with k-nearest neighbor graph K Hajebi, Y Abbasi-Yadkori, H Shahbazi, H Zhang Twenty-Second International Joint Conference on Artificial Intelligence, 2011 | 278 | 2011 |
Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits. Y Abbasi-Yadkori, D Pál, C Szepesvári AISTATS 22, 1-9, 2012 | 188 | 2012 |
Sharp Convergence Rates for Langevin Dynamics in the Nonconvex Setting X Cheng, NS Chatterji, Y Abbasi-Yadkori, PL Bartlett, MI Jordan arXiv preprint arXiv:1805.01648, 2018 | 182 | 2018 |
POLITEX: Regret bounds for policy iteration using expert prediction Y Abbasi-Yadkori, P Bartlett, K Bhatia, N Lazic, C Szepesvári, G Weisz Proceedings of the 36th International Conference on Machine Learning 97 …, 2019 | 138 | 2019 |
POLITEX: Regret Bounds for Policy Iteration Using Expert Prediction Y Abbasi-Yadkori, PL Bartlett, K Bhatia, N Lazic, C Szepesvári, G Weisz | 138 | 2019 |
Conservative contextual linear bandits A Kazerouni, M Ghavamzadeh, YA Yadkori, B Van Roy Advances in Neural Information Processing Systems, 3910-3919, 2017 | 117 | 2017 |
Model-Free Linear Quadratic Control via Reduction to Expert Prediction Y Abbasi-Yadkori, N Lazic, C Szepesvári The 22nd International Conference on Artificial Intelligence and Statistics, 2019 | 114* | 2019 |
Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions Y Abbasi-Yadkori, P Bartlett, V Kanade, Y Seldin, C Szepesvári Neural Information Processing Systems, 2013 | 99 | 2013 |
Model selection in contextual stochastic bandit problems A Pacchiano, M Phan, Y Abbasi Yadkori, A Rao, J Zimmert, T Lattimore, ... Advances in Neural Information Processing Systems 33, 10328-10337, 2020 | 98 | 2020 |
Online Learning for Linearly Parametrized Control Problems Y Abbasi-Yadkori University of Alberta, 2012 | 85 | 2012 |
Online least squares estimation with self-normalized processes: An application to bandit problems Y Abbasi-Yadkori, D Pál, C Szepesvári arXiv preprint arXiv:1102.2670, 2011 | 74 | 2011 |
Prediction with limited advice and multiarmed bandits with paid observations Y Seldin, P Bartlett, K Crammer, Y Abbasi-Yadkori International Conference on Machine Learning, 280-287, 2014 | 73 | 2014 |
Offline Evaluation of Ranking Policies with Click Models S Li, Y Abbasi-Yadkori, B Kveton, S Muthukrishnan, V Vinay, Z Wen Proceedings of the 24th ACM SIGKDD International Conference on Knowledge …, 2018 | 71 | 2018 |
Bayesian Optimal Control of Smoothly Parameterized Systems Y Abbasi-Yadkori, C Szepesvári Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2015 | 70* | 2015 |
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments. Y Seldin, C Szepesvári, P Auer, Y Abbasi-Yadkori EWRL, 103-116, 2012 | 63 | 2012 |
Bootstrapping upper confidence bound B Hao, YA Yadkori, Z Wen, G Cheng Advances in Neural Information Processing Systems, 12123-12133, 2019 | 61 | 2019 |
Linear Programming for Large-Scale Markov Decision Problems Y Abbasi-Yadkori, P Bartlett, A Malek Proceedings of the 31st International Conference on Machine Learning (ICML …, 2014 | 56* | 2014 |