Han Wang
Verified email at ualberta.ca
Title
Cited by
Year
The in-sample softmax for offline reinforcement learning
C Xiao, H Wang, Y Pan, A White, M White
arXiv preprint arXiv:2302.14372, 2023
Cited by: 27; Year: 2023
Investigating the properties of neural network representations in reinforcement learning
H Wang, E Miahi, M White, MC Machado, Z Abbas, R Kumaraswamy, ...
Artificial Intelligence 330, 104100, 2024
Cited by: 19; Year: 2024
No more pesky hyperparameters: Offline hyperparameter tuning for RL
H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ...
arXiv preprint arXiv:2205.08716, 2022
Cited by: 7; Year: 2022
Measuring and mitigating interference in reinforcement learning
V Liu, H Wang, RY Tao, K Javed, A White, M White
Conference on Lifelong Learning Agents, 781-795, 2023
Cited by: 2; Year: 2023
Replay memory as an empirical MDP: Combining conservative estimation with experience replay
H Zhang, C Xiao, H Wang, J Jin, M Müller
The Eleventh International Conference on Learning Representations, 2023
Cited by: 2; Year: 2023
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Y Luo, Y Pan, H Wang, P Torr, P Poupart
arXiv preprint arXiv:2403.11062, 2024
Cited by: 1; Year: 2024
Offline Reinforcement Learning via Tsallis Regularization
L Zhu, MK Schlegel, H Wang, M White
Transactions on Machine Learning Research, 0
Articles 1–7