Zeyu Zheng 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	1913	1857
h 指数	9	9
i10 指数	9	9

1100

550

275

825

2017201820192020202120222023202412 44 91 136 192 168 209 1058

开放获取的出版物数量

查看全部

5 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Satinder SinghGoogle DeepMind / U. of Michigan在 umich.edu 的电子邮件经过验证
Junhyuk OhResearch Scientist, DeepMind在 google.com 的电子邮件经过验证
Will DabneyDeepMind在 google.com 的电子邮件经过验证
Eric XingPresident at Mohamed bin Zayed University of AI, Professor of Computer Science, Carnegie Mellon U在 cs.cmu.edu 的电子邮件经过验证
Hao ZhangUC San Diego在 ucsd.edu 的电子邮件经过验证
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCL在 google.com 的电子邮件经过验证
Razvan PascanuGoogle DeepMind在 google.com 的电子邮件经过验证
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind在 meta.com 的电子邮件经过验证
Rémi MunosGoogle DeepMind在 inria.fr 的电子邮件经过验证
Zhaohan Daniel GuoDeepMind在 google.com 的电子邮件经过验证
Yunhao TangResearch Scientist, DeepMind在 columbia.edu 的电子邮件经过验证
Daniele CalandrielloResearch Scientist, DeepMind在 google.com 的电子邮件经过验证
Wenfei FanProfessor of Web Data Management, University of Edinburgh在 inf.ed.ac.uk 的电子邮件经过验证
Clare LyleGoogle DeepMind在 deepmind.com 的电子邮件经过验证
Richard L. LewisProfessor of Psychology, Linguistics and Cognitive Science, University of Michigan在 umich.edu 的电子邮件经过验证
Zhongwen XuTencent在 tencent.com 的电子邮件经过验证
David SilverDeepMind, UCL在 google.com 的电子邮件经过验证
Matteo HesselResearch Engineer, Google DeepMind在 google.com 的电子邮件经过验证
Haozhu WangAmazon在 amazon.com 的电子邮件经过验证
Chengang JiPhD, University of Michigan-Ann Arbor在 umich.edu 的电子邮件经过验证

关注

Zeyu Zheng

DeepMind

在 deepmind.com 的电子邮件经过验证 - 首页

artificial intelligence machine learning reinforcement learning deep learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	924	2023
Poseidon: An efficient communication architecture for distributed deep learning on {GPU} clusters H Zhang, Z Zheng, S Xu, W Dai, Q Ho, X Liang, Z Hu, J Wei, P Xie, ... 2017 USENIX Annual Technical Conference (USENIX ATC 17), 181-193, 2017	410	2017
On learning intrinsic rewards for policy gradient methods Z Zheng, J Oh, S Singh Advances in Neural Information Processing Systems, 4644-4654, 2018	200	2018
Parallelizing sequential graph computations W Fan, J Xu, Y Wu, W Yu, J Jiang, Z Zheng, B Zhang, Y Cao, C Tian Proceedings of the 2017 ACM International Conference on Management of Data …, 2017	119	2017
What Can Learned Intrinsic Rewards Capture? Z Zheng, J Oh, M Hessel, Z Xu, M Kroiss, H Van Hasselt, D Silver, S Singh International Conference on Machine Learning, 11436-11446, 2020	90	2020
Automated multi-layer optical design via deep reinforcement learning H Wang, Z Zheng, C Ji, LJ Guo Machine Learning: Science and Technology 2 (2), 025013, 2021	60	2021
Understanding plasticity in neural networks C Lyle, Z Zheng, E Nikishin, BA Pires, R Pascanu, W Dabney International Conference on Machine Learning, 23190-23211, 2023	44	2023
Generalized Preference Optimization: A Unified Approach to Offline Alignment Y Tang, ZD Guo, Z Zheng, D Calandriello, R Munos, M Rowland, ... arXiv preprint arXiv:2402.05749, 2024	21	2024
Understanding the performance gap between online and offline alignment algorithms Y Tang, DZ Guo, Z Zheng, D Calandriello, Y Cao, E Tarassov, R Munos, ... arXiv preprint arXiv:2405.08448, 2024	13	2024
Disentangling the Causes of Plasticity Loss in Neural Networks C Lyle, Z Zheng, K Khetarpal, H van Hasselt, R Pascanu, J Martens, ... arXiv preprint arXiv:2402.18762, 2024	8	2024
Adaptive Pairwise Weights for Temporal Credit Assignment Z Zheng, R Vuorio, R Lewis, S Singh Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9225-9232, 2022	7*	2022
Learning State Representations from Random Deep Action-conditional Predictions Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh Advances in Neural Information Processing Systems 34, 23679-23691, 2021	6	2021
Towards multi‐agent reinforcement learning‐driven over‐the‐counter market simulations N Vadori, L Ardon, S Ganesh, T Spooner, S Amrouni, J Vann, M Xu, ... Mathematical Finance 34 (2), 262-347, 2024	5	2024
GrASP: Gradient-Based Affordance Selection for Planning V Veeriah, Z Zheng, R Lewis, S Singh arXiv preprint arXiv:2202.04772, 2022	4	2022
Human Alignment of Large Language Models through Online Preference Optimisation D Calandriello, D Guo, R Munos, M Rowland, Y Tang, BA Pires, ... arXiv preprint arXiv:2403.08635, 2024	2	2024
Advances in Deep Reinforcement Learning: Intrinsic Rewards, Temporal Credit Assignment, State Representations, and Value-equivalent Models Z Zheng		2022
Reinforcement learning using meta-learned intrinsic rewards Z Zheng, J Oh, SS Baveja US Patent App. 17/033,410, 2021		2021
Towards Perpetually Trainable Neural Networks C Lyle, Z Zheng, K Khetarpal, R Pascanu, J Martens, H van Hasselt, ...
Supplementary Material: On Learning Intrinsic Rewards for Policy Gradient Methods Z Zheng, J Oh, S Singh

系统目前无法执行此操作，请稍后再试。

文章 1–19

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用