The in-sample softmax for offline reinforcement learning. C Xiao, H Wang, Y Pan, A White, M White. arXiv preprint arXiv:2302.14372, 2023 | 27 | 2023 |
Investigating the properties of neural network representations in reinforcement learning. H Wang, E Miahi, M White, MC Machado, Z Abbas, R Kumaraswamy, ... Artificial Intelligence 330, 104100, 2024 | 19 | 2024 |
No more pesky hyperparameters: Offline hyperparameter tuning for RL. H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ... arXiv preprint arXiv:2205.08716, 2022 | 7 | 2022 |
Measuring and mitigating interference in reinforcement learning. V Liu, H Wang, RY Tao, K Javed, A White, M White. Conference on Lifelong Learning Agents, 781-795, 2023 | 2 | 2023 |
Replay memory as an empirical MDP: Combining conservative estimation with experience replay. H Zhang, C Xiao, H Wang, J Jin, M Müller. The Eleventh International Conference on Learning Representations, 2023 | 2 | 2023 |
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization. Y Luo, Y Pan, H Wang, P Torr, P Poupart. arXiv preprint arXiv:2403.11062, 2024 | 1 | 2024 |
Offline Reinforcement Learning via Tsallis Regularization. L Zhu, MK Schlegel, H Wang, M White. Transactions on Machine Learning Research | | |