Deep Reinforcement Learning: Fundamentals, Research, and Applications H Dong, Z Ding, S Zhang, H Yuan, H Zhang, J Zhang, Y Huang, T Yu, ... Springer Singapore, 2020 | 244 | 2020 |
Taxonomy of reinforcement learning algorithms H Zhang, T Yu Deep reinforcement learning: Fundamentals, research and applications, 125-133, 2020 | 77 | 2020 |
AlphaZero H Zhang, T Yu Deep Reinforcement Learning: Fundamentals, Research and Applications, 391-415, 2020 | 21 | 2020 |
Efficient reinforcement learning development with rlzoo Z Ding, T Yu, H Zhang, Y Huang, G Li, Q Guo, L Mai, H Dong Proceedings of the 29th ACM International Conference on Multimedia, 3759-3762, 2021 | 12* | 2021 |
Picor: Multi-task deep reinforcement learning with policy correction F Bai, H Zhang, T Tao, Z Wu, Y Wang, B Xu Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 6728-6736, 2023 | 7 | 2023 |
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay H Zhang, C Xiao, H Wang, J Jin, B Xu, M Müller The Eleventh International Conference on Learning Representations, 2023 | 2 | 2023 |
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning H Zhang, T Ren, C Xiao, D Schuurmans, B Dai Forty-first International Conference on Machine Learning, 2024 | 1 | 2024 |
A Simple Unified Framework for Anomaly Detection in Deep Reinforcement Learning H Zhang, K Sun, B Xu, L Kong, M Müller arXiv preprint arXiv:2109.09889, 2021 | 1 | 2021 |
Combine Deep Q-Networks with Actor-Critic H Zhang, T Yu, R Huang Deep Reinforcement Learning: Fundamentals, Research and Applications, 213-245, 2020 | 1 | 2020 |
A logarithmic barrier method for proximal policy optimization C Zeng, H Zhang arXiv preprint arXiv:1812.06502, 2018 | 1 | 2018 |
Monte Carlo Tree Search in the Presence of Transition Uncertainty F Kohankhaki, K Aghakasiri, H Zhang, TH Wei, C Gao, M Müller Proceedings of the AAAI Conference on Artificial Intelligence 38 (18), 20151 …, 2024 | | 2024 |
Build generally reusable agent-environment interaction models J Jin, H Zhang, J Luo arXiv preprint arXiv:2211.08234, 2022 | | 2022 |
Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning J Long, H Zhang, T Yu, B Xu arXiv preprint arXiv:1908.06758, 2019 | | 2019 |
RevCuT Tree Search Method in Complex Single-player Game with Continuous Search Space H Zhang, F Cheng, B Xu, F Chen, J Liu, W Wu 2019 International Joint Conference on Neural Networks (IJCNN), 1-8, 2019 | | 2019 |