Han Wang 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	58	58
h 指数	3	3
i10 指数	2	2

0

32

16

2022202320241 21 31

Han Wang

Han Wang

University of Alberta

在 ualberta.ca 的电子邮件经过验证

Reinforcement learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
The in-sample softmax for offline reinforcement learning C Xiao, H Wang, Y Pan, A White, M White arXiv preprint arXiv:2302.14372, 2023	27	2023
Investigating the properties of neural network representations in reinforcement learning H Wang, E Miahi, M White, MC Machado, Z Abbas, R Kumaraswamy, ... Artificial Intelligence 330, 104100, 2024	19	2024
No more pesky hyperparameters: Offline hyperparameter tuning for RL H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ... arXiv preprint arXiv:2205.08716, 2022	7	2022
Measuring and mitigating interference in reinforcement learning V Liu, H Wang, RY Tao, K Javed, A White, M White Conference on Lifelong Learning Agents, 781-795, 2023	2	2023
Replay memory as an empirical MDP: Combining conservative estimation with experience replay H Zhang, C Xiao, H Wang, J Jin, M Müller The Eleventh International Conference on Learning Representations, 2023	2	2023
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization Y Luo, Y Pan, H Wang, P Torr, P Poupart arXiv preprint arXiv:2403.11062, 2024	1	2024
Offline Reinforcement Learning via Tsallis Regularization L Zhu, MK Schlegel, H Wang, M White Transactions on Machine Learning Research, 0

系统目前无法执行此操作，请稍后再试。

文章 1–7

共建清朗的网络空间,如遇有害信息,请举报。
本站数据皆整合自互联网公开资源索引,方便科研学术方面查询,并不存储相关数据资源;如对此有异议,请联系我们解决.
© 2023 学术资源搜索 @联系我们 | 申请短期会员 | 数据源提交