关注
Xingzhou Lou
Xingzhou Lou
Institution of Automation, Chinese Academy of Sciences
在 ia.ac.cn 的电子邮件经过验证
标题
引用次数
引用次数
年份
Pecan: Leveraging policy ensemble for context-aware zero-shot human-ai coordination
X Lou, J Guo, J Zhang, J Wang, K Huang, Y Du
arXiv preprint arXiv:2301.06387, 2023
132023
Offline reinforcement learning with representations for actions
X Lou, Q Yin, J Zhang, C Yu, Z He, N Cheng, K Huang
Information Sciences 610, 746-758, 2022
62022
An efficient end-to-end training approach for zero-shot human-AI coordination
X Yan, J Guo, X Lou, J Wang, H Zhang, Y Du
Advances in Neural Information Processing Systems 36, 2024
32024
Position: Foundation Agents as the Paradigm Shift for Decision Making
X Liu, X Lou, J Jiao, J Zhang
arXiv preprint arXiv:2405.17009, 2024
12024
Leveraging Joint-action Embedding in Multi-agent Reinforcement Learning for Cooperative Games
X Lou, J Zhang, Y Du, C Yu, Z He, K Huang
IEEE Transactions on Games, 2023
12023
SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling
X Lou, J Zhang, J Xie, L Liu, D Yan, K Huang
arXiv preprint arXiv:2405.12739, 2024
2024
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
X Lou, J Zhang, TJ Norman, K Huang, Y Du
Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17496 …, 2024
2024
Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models
X Lou, J Zhang, Z Wang, K Huang, Y Du
arXiv preprint arXiv:2401.07553, 2024
2024
SPO: Multi-Dimensional Preference Alignment With Implicit Reward Modeling
X Lou, J Zhang, J Xie, L Liu, D Yan, K Huang
系统目前无法执行此操作,请稍后再试。
文章 1–9