Hengshuai Yao 个人学术档案

引用次数

	总计	2019 年至今
引用	1068	994
h 指数	18	17
i10 指数	25	24

320

160

240

201120122013201420152016201720182019202020212022202320244 8 2 2 5 15 11 17 37 88 152 234 301 177

开放获取的出版物数量

查看全部

5 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Linglong KongProfessor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, Amii在 ualberta.ca 的电子邮件经过验证
Csaba SzepesvariDeepMind & University of Alberta在 cs.ualberta.ca 的电子邮件经过验证
Shangtong ZhangUniversity of Virginia在 virginia.edu 的电子邮件经过验证
Bei JiangAssociate Professor of Statistics, University of Alberta在 ualberta.ca 的电子邮件经过验证
Richard S. SuttonKeen, Amii, and University of Alberta在 richsutton.com 的电子邮件经过验证
Shalabh BhatnagarProfessor in the Department of Computer Science and Automation, Indian Institute of Science在 iisc.ac.in 的电子邮件经过验证
Randy GoebelProfessor of Computing Science, University of Alberta在 ualberta.ca 的电子邮件经过验证
Borislav MavrinUniversity of Alberta在 ualberta.ca 的电子邮件经过验证
Masoud S. Nosrati, PhDSenior ML Software Engineer, Meta (Facebook)在 fb.com 的电子邮件经过验证
Shahin AtakishiyevPhD Candidate in Computing Science, University of Alberta在 ualberta.ca 的电子邮件经过验证
Martha WhiteUniversity of Alberta在 ualberta.ca 的电子邮件经过验证
Peyman YadmellatWoven Planet Holdings在 woven-planet.global 的电子邮件经过验证
Martin JagersandUniversity of Alberta在 cs.ualberta.ca 的电子邮件经过验证
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo在 cs.ox.ac.uk 的电子邮件经过验证
Bo LiuPhD, AAAI SM, IEEE SM在 cs.umass.edu 的电子邮件经过验证
Amir-massoud FarahmandUniversity of Toronto在 cs.toronto.edu 的电子邮件经过验证
Mennatullah SiamOntario Tech University在 ontariotechu.ca 的电子邮件经过验证
Naren DoraiswamyUniversity of Michigan在 umich.edu 的电子邮件经过验证
Boris N. OreshkinPrincipal Scientist at Amazon在 amazon.com 的电子邮件经过验证
Jincheng MeiResearch Scientist, Google Brain在 google.com 的电子邮件经过验证

关注

Hengshuai Yao

Sony AI

在 ualberta.ca 的电子邮件经过验证 - 首页

Deep Representation Decision Boundary SGD Reinforcement Learning step-size adaptation


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Explainable artificial intelligence for autonomous driving: A comprehensive overview and field guide for future research directions S Atakishiyev, M Salameh, H Yao, R Goebel arXiv preprint arXiv:2112.11561, 2021	119	2021
Distributional Reinforcement Learning for Efficient Exploration B Mavrin, S Zhang, H Yao, K Kong, Linglong, Wu, Y Yu https://arxiv.org/abs/1905.06125, 2019	93	2019
Negative log likelihood ratio loss for deep neural network classification H Yao, D Zhu, B Jiang, P Yu Proceedings of the Future Technologies Conference (FTC) 2019: Volume 1, 276-282, 2020	91	2020
Discounted reinforcement learning is not an optimization problem A Naik, R Shariff, N Yasui, H Yao, RS Sutton arXiv preprint arXiv:1910.02140, 2019	65	2019
Mapless navigation among dynamics with social-safety-awareness: a reinforcement learning approach from 2d laser scans J Jin, NM Nguyen, N Sakib, D Graves, H Yao, M Jagersand 2020 IEEE international conference on robotics and automation (ICRA), 6979-6985, 2020	60	2020
Provably convergent two-timescale off-policy actor-critic with function approximation S Zhang, B Liu, H Yao, S Whiteson International Conference on Machine Learning, 11204-11213, 2020	54	2020
Weakly supervised few-shot object segmentation using co-attention with visual and semantic embeddings M Siam, N Doraiswamy, BN Oreshkin, H Yao, M Jagersand arXiv preprint arXiv:2001.09540, 2020	49	2020
Universal Option Models H Yao, C Szepesvari, R Sutton, S Bhatnagar, J Modayil	45*	2014
Breaking the deadly triad with a target network S Zhang, H Yao, S Whiteson International Conference on Machine Learning, 12621-12631, 2021	42	2021
A multi-component framework for the analysis and design of explainable artificial intelligence MY Kim, S Atakishiyev, HKB Babiker, N Farruque, R Goebel, OR Zaïane, ... Machine Learning and Knowledge Extraction 3 (4), 900-921, 2021	41	2021
Method of prediction of a state of an object in the environment using an action model of a neural network H Yao, SM Nosrati, H Chen, P Yadmellat, Y Zhang US Patent 10,997,491, 2021	41	2021
Multi-step dyna planning for policy evaluation and control H Yao, S Bhatnagar, D Diao Advances in neural information processing systems 22, 2009	35*	2009
Quota: The quantile option architecture for reinforcement learning S Zhang, H Yao Proceedings of the AAAI conference on artificial intelligence 33 (01), 5797-5804, 2019	32	2019
Ace: An actor ensemble algorithm for continuous control with tree search S Zhang, H Yao Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 5789-5796, 2019	31	2019
Pseudo-MDPs and Factored Linear Action Models H Yao, C Szepesvari, BA Pires, X Zhang IEEE ADPRL, 2014	27	2014
Method of selection of an action for an object using a neural network H Yao, H Chen, SM Nosrati, P Yadmellat, Y Zhang US Patent 10,935,982, 2021	24	2021
Approximate policy iteration with linear action models H Yao, C Szepesvári Proceedings of the AAAI Conference on Artificial Intelligence 26 (1), 1212-1218, 2012	18	2012
Preconditioned temporal difference learning H Yao, ZQ Liu Proceedings of the 25th international conference on Machine learning, 1208-1215, 2008	18	2008
Hill climbing on value estimates for search-control in Dyna Y Pan, H Yao, A Farahmand, M White arXiv preprint arXiv:1906.07791, 2019	17	2019
Towards practical hierarchical reinforcement learning for multi-lane autonomous driving MS Nosrati, EA Abolfathi, M Elmahgiubi, P Yadmellat, J Luo, Y Zhang, ...	16	2018

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用