Exploration in deep reinforcement learning: From single-agent to multiagent domain J Hao, T Yang, H Tang, C Bai, J Liu, Z Meng, P Liu, Z Wang IEEE Transactions on Neural Networks and Learning Systems, 2023 | 172* | 2023 |
EUCLID: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model Y Yuan, J Hao, F Ni, Y Mu, Y Zheng, Y Hu, J Liu, Y Chen, C Fan arXiv preprint arXiv:2210.00498, 2022 | 10 | 2022 |
FIGCPS: Effective failure-inducing input generation for cyber-physical systems with deep reinforcement learning S Zhang, S Liu, J Sun, Y Chen, W Huang, J Liu, J Liu, J Hao 2021 36th IEEE/ACM International Conference on Automated Software …, 2021 | 10 | 2021 |
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles K Zhao, Y Ma, J Liu, J Hao, Y Zheng, Z Meng ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, 2023 | 8* | 2023 |
OVD-Explorer: Optimism should not be the sole pursuit of exploration in noisy environments J Liu, Z Wang, Y Zheng, J Hao, C Bai, J Ye, Z Wang, H Piao, Y Sun Proceedings of the AAAI Conference on Artificial Intelligence 38 (12), 13954 …, 2024 | 3 | 2024 |
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models Y Chen, Y Yuan, Z Zhang, Y Zheng, J Liu, F Ni, J Hao arXiv preprint arXiv:2403.03636, 2024 | 2 | 2024 |
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback Y Yuan, J Hao, Y Ma, Z Dong, H Liang, J Liu, Z Feng, K Zhao, Y Zheng arXiv preprint arXiv:2402.02423, 2024 | 2 | 2024 |
A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning J Liu, Y Ma, J Hao, Y Hu, Y Zheng, T Lv, C Fan Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024 | 1 | 2024 |
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models J Liu, Y Yuan, J Hao, F Ni, L Fu, Y Chen, Y Zheng arXiv preprint arXiv:2402.14245, 2024 | 1 | 2024 |
ED2: an environment dynamics decomposition framework for world model construction C Wang, T Yang, J Hao, Y Zheng, H Tang, F Barez, J Liu, J Peng, ... | 1 | 2021 |
A Policy-Decoupled Method for High-Quality Data Augmentation in Offline Reinforcement Learning S Lian, Y Ma, J Liu, J Hao, Y Zheng, Z Meng ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems | 1* | |
CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis Y Xiao, J Liu, Y Zheng, X Xie, J Hao, M Li, R Wang, F Ni, Y Li, J Luo, ... bioRxiv, 2024.05.13.593861, 2024 | | 2024 |
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement Y Zhu, J Liu, W Wei, Q Fu, Y Hu, Z Fang, B An, J Hao, T Lv, C Fan arXiv preprint arXiv:2405.08638, 2024 | | 2024 |
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles K Zhao, J Hao, Y Ma, J Liu, Y Zheng, Z Meng Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024 | | 2024 |
vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms Y Zhu, J Liu, W Wei, Q Fu, Y Hu, Z Fang, B An, J Hao, T Lv, C Fan Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024 | | 2024 |
OSCAR: OOD State-Conservative Offline Reinforcement Learning for Sequential Decision Making Y Ma, C Wang, C Chen, J Liu, Z Meng, Y Zheng, J Hao CAAI Artificial Intelligence Research 2, 2023 | | 2023 |
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning J Liu, Y Ma, J Hao, Y Hu, Y Zheng, T Lv, C Fan Data-centric Machine Learning Research (DMLR) Workshop at ICML 2023, 2023 | | 2023 |