John Quan 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	17897	16524
h 指数	18	18
i10 指数	20	20

3800

1900

950

2850

20162017201820192020202120222023202473 274 951 1574 2189 2887 3213 3789 2869

开放获取的出版物数量

查看全部

1 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

关注

John Quan

Google DeepMind

在 google.com 的电子邮件经过验证

Reinforcement Learning Deep Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Overcoming catastrophic forgetting in neural networks J Kirkpatrick, R Pascanu, N Rabinowitz, J Veness, G Desjardins, AA Rusu, ... Proceedings of the national academy of sciences 114 (13), 3521-3526, 2017	7530	2017
Prioritized experience replay T Schaul, J Quan, I Antonoglou, D Silver arXiv preprint arXiv:1511.05952, 2015	5068	2015
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1262	2018
Starcraft ii: A new challenge for reinforcement learning O Vinyals, T Ewalds, S Bartunov, P Georgiev, AS Vezhnevets, M Yeo, ... arXiv preprint arXiv:1708.04782, 2017	1078	2017
Distributed prioritized experience replay D Horgan, J Quan, D Budden, G Barth-Maron, M Hessel, H Van Hasselt, ... arXiv preprint arXiv:1803.00933, 2018	894	2018
Distral: Robust multitask reinforcement learning Y Teh, V Bapst, WM Czarnecki, J Quan, J Kirkpatrick, R Hadsell, N Heess, ... Advances in neural information processing systems 30, 2017	611	2017
Recurrent experience replay in distributed reinforcement learning S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney International conference on learning representations, 2018	552	2018
Transfer in deep reinforcement learning using successor features and generalised policy improvement A Barreto, D Borsa, J Quan, T Schaul, D Silver, M Hessel, D Mankowitz, ... International Conference on Machine Learning, 501-510, 2018	195	2018
The DeepMind JAX Ecosystem I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ... URL http://github.com/google-deepmind, 2020	184*	2020
Observe and look further: Achieving consistent performance on atari T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ... arXiv preprint arXiv:1805.11593, 2018	139	2018
Universal successor features approximators D Borsa, A Barreto, J Quan, D Mankowitz, R Munos, H Van Hasselt, ... arXiv preprint arXiv:1812.07626, 2018	130	2018
The value-improvement path: Towards better representations for reinforcement learning W Dabney, A Barreto, M Rowland, R Dadashi, J Quan, MG Bellemare, ... Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7160-7168, 2021	70	2021
Unicorn: Continual learning with a universal, off-policy agent DJ Mankowitz, A Žídek, A Barreto, D Horgan, M Hessel, J Quan, J Oh, ... arXiv preprint arXiv:1802.08294, 2018	48	2018
Training neural networks using a prioritized experience memory T Schaul, J Quan, D Silver US Patent 10,650,310, 2020	25	2020
Podracer architectures for scalable reinforcement learning M Hessel, M Kroiss, A Clark, I Kemaev, J Quan, T Keck, F Viola, ... arXiv preprint arXiv:2104.06272, 2021	21	2021
DQN Zoo: Reference implementations of DQN-based agents J Quan, G Ostrovski URL http://github.com/google-deepmind/dqn_zoo, 2020	21*	2020
Reply to Huszár: The elastic weight consolidation penalty is empirically valid J Kirkpatrick, R Pascanu, N Rabinowitz, J Veness, G Desjardins, AA Rusu, ... Proceedings of the National Academy of Sciences 115 (11), E2498-E2498, 2018	21	2018
The phenomenon of policy churn T Schaul, A Barreto, J Quan, G Ostrovski Advances in Neural Information Processing Systems 35, 2537-2549, 2022	19	2022
Reinforcement learning using distributed prioritized replay D Budden, G Barth-Maron, J Quan, DG Horgan US Patent 11,625,604, 2023	11	2023
General non-linear bellman equations H van Hasselt, J Quan, M Hessel, Z Xu, D Borsa, A Barreto arXiv preprint arXiv:1907.03687, 2019	11	2019

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

引用