Alessandro Lazaric 个人学术档案

引用次数

	总计	2019 年至今
引用	7009	5243
h 指数	46	39
i10 指数	105	94

1400

700

350

1050

2008200920102011201220132014201520162017201820192020202120222023202423 24 52 88 130 136 188 179 258 279 366 475 678 998 1179 1318 592

开放获取的出版物数量

查看全部

19 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Matteo PirottaResearch Scientist, Meta (FAIR)在 fb.com 的电子邮件经过验证
Mohammad GhavamzadehAmazon在 amazon.com 的电子邮件经过验证
Marcello RestelliAssociate Professor, Politecnico di Milano在 polimi.it 的电子邮件经过验证
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind在 meta.com 的电子邮件经过验证
Rémi MunosGoogle DeepMind在 inria.fr 的电子邮件经过验证
Andrea BonariniFull Professor, Politecnico di Milano, Dipartimento di Eletronica, Informazione e Biongegneria, AI在 polimi.it 的电子邮件经过验证
Emma BrunskillAssociate Professor of Computer Science, Stanford University在 cs.stanford.edu 的电子邮件经过验证
Daniele CalandrielloResearch Scientist, DeepMind在 google.com 的电子邮件经过验证
Jean TarbouriechGoogle DeepMind在 google.com 的电子邮件经过验证
Marc AbeilleCriteo在 ens-cachan.fr 的电子邮件经过验证
Andrea TirinzoniMeta在 fb.com 的电子邮件经过验证
Ronan FruitPhD candidate, Inria Lille, SequeL team在 inria.fr 的电子邮件经过验证
Evrard GarcelonFacebook AI Research在 fb.com 的电子邮件经过验证
Andrea ZanetteAssistant Professor, Carnegie Mellon University在 andrew.cmu.edu 的电子邮件经过验证
Marta SoareUniversité d'Orléans在 univ-orleans.fr 的电子邮件经过验证
Denis YaratsCofounder and CTO, Perplexity AI在 perplexity.ai 的电子邮件经过验证
Lerrel PintoNew York University在 cs.nyu.edu 的电子邮件经过验证
Anima AnandkumarCalifornia Institute of Technology and NVIDIA在 caltech.edu 的电子邮件经过验证
Kamyar AzizzadenesheliNvidia在 nvidia.com 的电子邮件经过验证
Amir SaniTechstars在 amirsani.com 的电子邮件经过验证

关注

Alessandro Lazaric

Research Scientist, Facebook Artificial Intelligence Research

在 inria.fr 的电子邮件经过验证 - 首页

Machine Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Transfer in reinforcement learning: a framework and a survey A Lazaric Reinforcement Learning: State-of-the-Art, 143-173, 2012	362	2012
Best arm identification: A unified approach to fixed budget and fixed confidence V Gabillon, M Ghavamzadeh, A Lazaric Advances in Neural Information Processing Systems 25, 2012	344	2012
Linear thompson sampling revisited M Abeille, A Lazaric Artificial Intelligence and Statistics, 176-184, 2017	262	2017
Mastering visual continuous control: Improved data-augmented reinforcement learning D Yarats, R Fergus, A Lazaric, L Pinto arXiv preprint arXiv:2107.09645, 2021	244	2021
Learning near optimal policies with low inherent bellman error A Zanette, A Lazaric, M Kochenderfer, E Brunskill International Conference on Machine Learning, 10978-10989, 2020	223	2020
Reinforcement learning with prototypical representations D Yarats, R Fergus, A Lazaric, L Pinto International Conference on Machine Learning, 11920-11931, 2021	206	2021
Best-arm identification in linear bandits M Soare, A Lazaric, R Munos Advances in Neural Information Processing Systems 27, 2014	206	2014
Transfer of samples in batch reinforcement learning A Lazaric, M Restelli, A Bonarini Proceedings of the 25th international conference on Machine learning, 544-551, 2008	205	2008
Reinforcement learning in continuous action spaces through sequential monte carlo methods A Lazaric, M Restelli, A Bonarini Advances in neural information processing systems 20, 2007	194	2007
Risk-aversion in multi-armed bandits A Sani, A Lazaric, R Munos Advances in neural information processing systems 25, 2012	184	2012
Bayesian multi-task reinforcement learning A Lazaric, M Ghavamzadeh ICML-27th international conference on machine learning, 599-606, 2010	142	2010
Frequentist regret bounds for randomized least-squares value iteration A Zanette, D Brandfonbrener, E Brunskill, M Pirotta, A Lazaric International Conference on Artificial Intelligence and Statistics, 1954-1964, 2020	141	2020
Reinforcement learning of pomdps using spectral methods K Azizzadenesheli, A Lazaric, A Anandkumar Conference on Learning Theory, 193-256, 2016	135	2016
Finite-sample analysis of least-squares policy iteration A Lazaric, M Ghavamzadeh, R Munos Journal of Machine Learning Research 13, 3041-3074, 2012	130	2012
Upper-confidence-bound algorithms for active learning in multi-armed bandits A Carpentier, A Lazaric, M Ghavamzadeh, R Munos, P Auer International Conference on Algorithmic Learning Theory, 189-203, 2011	127	2011
Multi-bandit best arm identification V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck Advances in Neural Information Processing Systems 24, 2011	124	2011
Sequential transfer in multi-armed bandit with finite set of models A Lazaric, E Brunskill Advances in Neural Information Processing Systems 26, 2013	116	2013
Efficient bias-span-constrained exploration-exploitation in reinforcement learning R Fruit, M Pirotta, A Lazaric, R Ortner International Conference on Machine Learning, 1578-1586, 2018	112	2018
Improved regret bounds for thompson sampling in linear quadratic control problems M Abeille, A Lazaric International Conference on Machine Learning, 1-9, 2018	105	2018
A truthful learning mechanism for contextual multi-slot sponsored search auctions with externalities N Gatti, A Lazaric, F Trovo Proceedings of the 13th ACM Conference on Electronic Commerce, 605-622, 2012	95	2012

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用