Title | Authors | Venue | Cited by | Year |
Language agent tree search unifies reasoning acting and planning in language models | A Zhou, K Yan, M Shlapentokh-Rothman, H Wang, YX Wang | arXiv preprint arXiv:2310.04406, 2023 | 94 | 2023 |
Algorithms in multi-agent systems: a holistic perspective from reinforcement learning and game theory | Y Lu, K Yan | arXiv preprint arXiv:2001.06487, 2020 | 18 | 2020 |
A Surrogate Objective Framework for Prediction+ Programming with Soft Constraints | K Yan, J Yan, C Luo, L Chen, Q Lin, D Zhang | Advances in Neural Information Processing Systems 34, 21520-21532, 2021 | 6 | 2021 |
CEIP: combining explicit and implicit priors for reinforcement learning with demonstrations | K Yan, A Schwing, YX Wang | Advances in Neural Information Processing Systems 35, 7614-7627, 2022 | 3 | 2022 |
A simple solution for offline imitation from observations and examples with possibly incomplete trajectories | K Yan, A Schwing, YX Wang | Advances in Neural Information Processing Systems 36, 2024 | 2 | 2024 |
A microscopic pandemic simulator for pandemic prediction using scalable million-agent reinforcement learning | Z Tang, K Yan, L Sun, W Zhan, C Liu | arXiv preprint arXiv:2108.06589, 2021 | 2 | 2021 |
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers | K Yan, AG Schwing, YX Wang | arXiv preprint arXiv:2410.24108, 2024 | | 2024 |
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching | K Yan, AG Schwing, YX Wang | arXiv preprint arXiv:2311.01331, 2023 | | 2023 |