SMARTS: Scalable Multi-agent Reinforcement Learning Training School for Autonomous Driving M Zhou, J Luo, J Villella, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ... CoRL 2020 (Best System Paper Award), 2020 | 183* | 2020 |
Diffusion Models for Reinforcement Learning: A Survey Z Zhu, H Zhao, H He, Y Zhong, S Zhang, Y Yu, W Zhang arXiv preprint arXiv:2311.01223, 2023 | 15 | 2023 |
MADiff: Offline Multi-agent Learning with Diffusion Models Z Zhu, M Liu, L Mao, B Kang, M Xu, Y Yu, S Ermon, W Zhang arXiv preprint arXiv:2305.17330, 2023 | 13 | 2023 |
Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization M Liu, Z Zhu, Y Zhuang, W Zhang, J Hao, Y Yu, J Wang ICML 2022, 2022 | 8 | 2022 |
Weakly-supervised Reconstruction of 3D Objects with Large Shape Variation from Single In-the-Wild Images S Sun, Z Zhu, X Dai, Q Zhao, J Li ACCV 2020, 2020 | 5 | 2020 |
RITA: Boost Driving Simulators with Realistic Interactive Traffic Flow Z Zhu, S Zhang, Y Zhuang, Y Liu, M Liu, L Mao, Z Gong, S Kai, Q Gu, ... DAI 2023 (Best Student Paper Award), 2023 | 2* | 2023 |
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems Z Zhu, R Qin, J Huang, X Dai, Y Yu, Y Yu, W Zhang ACM Transactions on Information Systems 42 (4), 2024 | 1 | 2024 |
Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning H Zhao, X Han, Z Zhu, M Liu, Y Yu, W Zhang arXiv preprint arXiv:2405.19189, 2024 | | 2024 |
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning Y Shan, Z Zhu, T Long, Q Liang, Y Chang, W Zhang, L Yin arXiv preprint arXiv:2402.02772, 2024 | | 2024 |
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching G Li, Y Shan, Z Zhu, T Long, W Zhang ICML 2024, 2024 | | 2024 |
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents M Liu, Z Zhu, M Zhu, Y Zhuang, W Zhang, J Hao arXiv preprint arXiv:2212.09033, 2022 | | 2022 |
Imitation Learning via Multi-Step Occupancy Measure Matching M Liu, H Wang, Y Zhang, M Xu, Z Zhu, W Zhang | | |