Hongming Zhang 个人学术档案

引用次数

	总计	2019 年至今
引用	367	367
h 指数	5	5
i10 指数	4	4

140

105

202020212022202320246 53 83 129 94

开放获取的出版物数量

查看全部

1 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Hao DongAssistant Professor at Peking University在 pku.edu.cn 的电子邮件经过验证
Zihan DingPrinceton University在 princeton.edu 的电子邮件经过验证
Fengshuo BaiShanghai Jiao Tong University在 sjtu.edu.cn 的电子邮件经过验证
Martin MüllerProfessor, Computing Science, University of Alberta在 ualberta.ca 的电子邮件经过验证
Jun JinAssistant Professor at the University of Alberta在 ualberta.ca 的电子邮件经过验证
Ke Sun (孙科)University of Alberta在 ualberta.ca 的电子邮件经过验证
Linglong KongProfessor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, Amii在 ualberta.ca 的电子邮件经过验证
Dale SchuurmansUniversity of Alberta, Google DeepMind在 cs.ualberta.ca 的电子邮件经过验证
Tongzheng RenCitadel Securities在 utexas.edu 的电子邮件经过验证
Bo DaiGoogle Brain & Georgia Tech在 google.com 的电子邮件经过验证
Chao GaoHuawei Canada Research Center在 huawei.com 的电子邮件经过验证

关注

Hongming Zhang

University of Alberta

在 ualberta.ca 的电子邮件经过验证 - 首页

reinforcement learning tree search statistical machine learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Deep Reinforcement Learning: Fundamentals, Research, and Applications H Dong, Z Ding, S Zhang, H Yuan, H Zhang, J Zhang, Y Huang, T Yu, ... Springer Singapore, 2020	244	2020
Taxonomy of reinforcement learning algorithms H Zhang, T Yu Deep reinforcement learning: Fundamentals, research and applications, 125-133, 2020	77	2020
AlphaZero H Zhang, T Yu Deep Reinforcement Learning: Fundamentals, Research and Applications, 391-415, 2020	21	2020
Efficient reinforcement learning development with rlzoo Z Ding, T Yu, H Zhang, Y Huang, G Li, Q Guo, L Mai, H Dong Proceedings of the 29th ACM International Conference on Multimedia, 3759-3762, 2021	12*	2021
Picor: Multi-task deep reinforcement learning with policy correction F Bai, H Zhang, T Tao, Z Wu, Y Wang, B Xu Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 6728-6736, 2023	7	2023
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay H Zhang, C Xiao, H Wang, J Jin, B Xu, M Müller The Eleventh International Conference on Learning Representations, 2023	2	2023
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning H Zhang, T Ren, C Xiao, D Schuurmans, B Dai Forty-first International Conference on Machine Learning, 2024	1	2024
A Simple Unified Framework for Anomaly Detection in Deep Reinforcement Learning H Zhang, K Sun, B Xu, L Kong, M Müller arXiv preprint arXiv:2109.09889, 2021	1	2021
Combine Deep Q-Networks with Actor-Critic H Zhang, T Yu, R Huang Deep Reinforcement Learning: Fundamentals, Research and Applications, 213-245, 2020	1	2020
A logarithmic barrier method for proximal policy optimization C Zeng, H Zhang arXiv preprint arXiv:1812.06502, 2018	1	2018
Monte Carlo Tree Search in the Presence of Transition Uncertainty F Kohankhaki, K Aghakasiri, H Zhang, TH Wei, C Gao, M Müller Proceedings of the AAAI Conference on Artificial Intelligence 38 (18), 20151 …, 2024		2024
Build generally reusable agent-environment interaction models J Jin, H Zhang, J Luo arXiv preprint arXiv:2211.08234, 2022		2022
Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning J Long, H Zhang, T Yu, B Xu arXiv preprint arXiv:1908.06758, 2019		2019
RevCuT Tree Search Method in Complex Single-player Game with Continuous Search Space H Zhang, F Cheng, B Xu, F Chen, J Liu, W Wu 2019 International Joint Conference on Neural Networks (IJCNN), 1-8, 2019		2019

系统目前无法执行此操作，请稍后再试。

文章 1–14

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用