Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization H Xu, L Jiang, J Li, Z Yang, Z Wang, VWK Chan, X Zhan ICLR 2023, 2023 | 51 | 2023 |
A Policy-Guided Imitation Approach for Offline Reinforcement Learning H Xu, L Jiang, J Li, X Zhan NeurIPS 2022, 2022 | 36 | 2022 |
When data geometry meets deep function: Generalizing offline reinforcement learning J Li, X Zhan, H Xu, X Zhu, J Liu, YQ Zhang ICLR 2023, 2023 | 27* | 2023 |
Offline Reinforcement Learning with Soft Behavior Regularization H Xu, X Zhan, J Li, H Yin NeurIPS 2021 Offline Reinforcement Learning Workshop, 2021 | 26 | 2021 |
Mind the Gap: Offline Policy Optimization for Imperfect Rewards J Li*, X Hu*, H Xu, J Liu, X Zhan, QS Jia, YQ Zhang ICLR 2023, 2023 | 16 | 2023 |
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning J Li, X Hu, H Xu, J Liu, X Zhan, YQ Zhang arXiv preprint arXiv:2305.15669, 2023 | 12 | 2023 |
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model Y Zheng*, J Li*, D Yu, Y Yang, SE Li, X Zhan, J Liu ICLR 2024, 2024 | 8 | 2024 |
Query-Policy Misalignment in Preference-Based Reinforcement Learning X Hu*, J Li*, X Zhan, QS Jia, YQ Zhang ICLR 2024, 2023 | 7 | 2023 |
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning J Li*, J Zheng*, Y Zheng*, L Mao, X Hu, S Cheng, H Niu, J Liu, Y Liu, J Liu, ... ICML 2024, 2024 | 1 | 2024 |
A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning J Li*, S Lin*, T Shi, C Tian, Y Mei, J Song, X Zhan, R Li arXiv preprint arXiv:2311.15920, 2023 | 1 | 2023 |
Instruction-Guided Visual Masking J Zheng*, J Li*, S Cheng, Y Zheng, J Li, J Liu, Y Liu, J Liu, X Zhan MFM-EAI@ICML2024 workshop, 2024 | | 2024 |
Vehicle Extreme Control based on Offline Reinforcement Leaning S Zhao, J Li, X Hu, J Zhang, C He CAC 2022, 4539-4543, 2022 | | 2022 |