关注
Zhilong Zhang
Zhilong Zhang
在 lamda.nju.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Flow to better: Offline preference-based reinforcement learning via preferred trajectory generation
Z Zhang, Y Sun, J Ye, TS Liu, J Zhang, Y Yu
The Twelfth International Conference on Learning Representations, 2023
52023
UDCA may promote COVID-19 recovery: a cohort study with AI-aided analysis
Y Yu, G Yu, LY Han, J Li, ZL Zhang, TS Liu, MF Li, DC Zhan, SQ Tang, ...
MedRxiv, 2023.05. 02.23289410, 2023
12023
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
XH Liu, TS Liu, S Jiang, R Chen, Z Zhang, X Chen, Y Yu
arXiv preprint arXiv:2407.12448, 2024
2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
C Jia, P Wang, Z Li, YC Li, Z Zhang, N Tang, Y Yu
arXiv preprint arXiv:2405.17039, 2024
2024
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning
H Lin, YY Xu, Y Sun, Z Zhang, YC Li, C Jia, J Ye, J Zhang, Y Yu
arXiv preprint arXiv:2405.17031, 2024
2024
Limited Preference Aided Imitation Learning from Imperfect Demonstrations
X Cao, FM Luo, J Ye, T Xu, Z Zhang, Y Yu
Forty-first International Conference on Machine Learning, 0
系统目前无法执行此操作,请稍后再试。
文章 1–6