Kai Yan
Verified email at illinois.edu
Title
Cited by
Year
Language agent tree search unifies reasoning acting and planning in language models
A Zhou, K Yan, M Shlapentokh-Rothman, H Wang, YX Wang
arXiv preprint arXiv:2310.04406, 2023
Cited by: 94; Year: 2023
Algorithms in multi-agent systems: a holistic perspective from reinforcement learning and game theory
Y Lu, K Yan
arXiv preprint arXiv:2001.06487, 2020
Cited by: 18; Year: 2020
A Surrogate Objective Framework for Prediction+Programming with Soft Constraints
K Yan, J Yan, C Luo, L Chen, Q Lin, D Zhang
Advances in Neural Information Processing Systems 34, 21520-21532, 2021
Cited by: 6; Year: 2021
CEIP: combining explicit and implicit priors for reinforcement learning with demonstrations
K Yan, A Schwing, YX Wang
Advances in Neural Information Processing Systems 35, 7614-7627, 2022
Cited by: 3; Year: 2022
A simple solution for offline imitation from observations and examples with possibly incomplete trajectories
K Yan, A Schwing, YX Wang
Advances in Neural Information Processing Systems 36, 2024
Cited by: 2; Year: 2024
A microscopic pandemic simulator for pandemic prediction using scalable million-agent reinforcement learning
Z Tang, K Yan, L Sun, W Zhan, C Liu
arXiv preprint arXiv:2108.06589, 2021
Cited by: 2; Year: 2021
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
K Yan, AG Schwing, YX Wang
arXiv preprint arXiv:2410.24108, 2024
Year: 2024
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
K Yan, AG Schwing, YX Wang
arXiv preprint arXiv:2311.01331, 2023
Year: 2023