Tor Lattimore 个人学术档案

引用次数

	总计	2019 年至今
引用	6765	6298
h 指数	38	35
i10 指数	67	62

1600

800

400

1200

20132014201520162017201820192020202120222023202424 26 53 56 95 179 350 766 1216 1443 1591 928

开放获取的出版物数量

查看全部

21 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Csaba SzepesvariDeepMind & University of Alberta在 cs.ualberta.ca 的电子邮件经过验证
Marcus HutterResearcher@DeepMind & Professor at ANU在 anu.edu.au 的电子邮件经过验证
Botao HaoOpenAI在 openai.com 的电子邮件经过验证
Andras GyorgyDeepMind在 google.com 的电子邮件经过验证
Laurent OrseauResearch Scientist at Google DeepMind在 google.com 的电子邮件经过验证
Branislav KvetonAmazon在 amazon.com 的电子邮件经过验证
Eren SezenerDeepMind在 google.com 的电子邮件经过验证
Ian OsbandOpenAI在 openai.com 的电子邮件经过验证
Christoph DannResearch Scientist, Google在 google.com 的电子邮件经过验证
Emma BrunskillAssociate Professor of Computer Science, Stanford University在 cs.stanford.edu 的电子邮件经过验证
Julian ZimmertGoogle Research在 google.com 的电子邮件经过验证
Mengdi WangCenter for Statistics & Machine Learning, ECE, Princeton University在 princeton.edu 的电子邮件经过验证
Joel VenessGoogle DeepMind在 google.com 的电子邮件经过验证
Benjamin Van RoyStanford University在 stanford.edu 的电子邮件经过验证
Satinder SinghGoogle DeepMind / U. of Michigan在 umich.edu 的电子邮件经过验证
Johannes KirschnerSwiss Data Science Center, ETH Zurich在 sdsc.ethz.ch 的电子邮件经过验证
Dale SchuurmansUniversity of Alberta, Google DeepMind在 cs.ualberta.ca 的电子邮件经过验证
Avishkar BhoopchandResearch Engineer, DeepMind在 google.com 的电子邮件经过验证
Agnieszka Grabska BarwińskaDeepMind在 google.com 的电子邮件经过验证
Peter TothAI Research在 techcombank.com.vn 的电子邮件经过验证

关注

Tor Lattimore

DeepMind

在 google.com 的电子邮件经过验证 - 首页

machine learning learning theory reinforcement learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Bandit algorithms T Lattimore, C Szepesvári Cambridge University Press, 2020	2760	2020
Unifying PAC and regret: Uniform PAC bounds for episodic reinforcement learning C Dann, T Lattimore, E Brunskill Advances in Neural Information Processing Systems 30, 2017	304	2017
Causal bandits: Learning good interventions via causal inference F Lattimore, T Lattimore, MD Reid Advances in neural information processing systems 29, 2016	264*	2016
Degenerate feedback loops in recommender systems R Jiang, S Chiappa, T Lattimore, A György, P Kohli Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 383-390, 2019	223	2019
Learning with good feature representations in bandits and in rl with a generative model T Lattimore, C Szepesvari, G Weisz International conference on machine learning, 5662-5670, 2020	182	2020
Behaviour suite for reinforcement learning I Osband, Y Doron, M Hessel, J Aslanides, E Sezener, A Saraiva, ... arXiv preprint arXiv:1908.03568, 2019	178	2019
PAC bounds for discounted MDPs T Lattimore, M Hutter Algorithmic Learning Theory: 23rd International Conference, ALT 2012, Lyon …, 2012	140	2012
The end of optimism? an asymptotic analysis of finite-armed linear bandits T Lattimore, C Szepesvari Artificial Intelligence and Statistics, 728-737, 2017	134	2017
Conservative bandits Y Wu, R Shariff, T Lattimore, C Szepesvári International Conference on Machine Learning, 1254-1262, 2016	122	2016
On explore-then-commit strategies A Garivier, T Lattimore, E Kaufmann Advances in Neural Information Processing Systems 29, 2016	117	2016
A geometric perspective on optimal representations for reinforcement learning M Bellemare, W Dabney, R Dadashi, A Ali Taiga, PS Castro, N Le Roux, ... Advances in neural information processing systems 32, 2019	99	2019
Model selection in contextual stochastic bandit problems A Pacchiano, M Phan, Y Abbasi Yadkori, A Rao, J Zimmert, T Lattimore, ... Advances in Neural Information Processing Systems 33, 10328-10337, 2020	94	2020
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh International Conference on Machine Learning, 3601-3610, 2019	77	2019
Toprank: A practical algorithm for online stochastic ranking T Lattimore, B Kveton, S Li, C Szepesvari Advances in Neural Information Processing Systems 31, 2018	71	2018
Linear bandits with stochastic delayed feedback C Vernade, A Carpentier, T Lattimore, G Zappella, B Ermis, M Brueckner International Conference on Machine Learning, 9712-9721, 2020	70	2020
The sample-complexity of general reinforcement learning T Lattimore, M Hutter, P Sunehag International Conference on Machine Learning, 28-36, 2013	70	2013
Near-optimal PAC bounds for discounted MDPs T Lattimore, M Hutter Theoretical Computer Science 558, 125-143, 2014	69	2014
Bounded Regret for Finite-Armed Structured Bandits T Lattimore, R Munos	68	2014
Adaptive exploration in linear contextual bandit B Hao, T Lattimore, C Szepesvari International Conference on Artificial Intelligence and Statistics, 3536-3545, 2020	65	2020
An information-theoretic approach to minimax regret in partial monitoring T Lattimore, C Szepesvári Conference on Learning Theory, 2111-2139, 2019	64	2019

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用