Pecan: Leveraging policy ensemble for context-aware zero-shot human-ai coordination X Lou, J Guo, J Zhang, J Wang, K Huang, Y Du arXiv preprint arXiv:2301.06387, 2023 | 13 | 2023 |
Offline reinforcement learning with representations for actions X Lou, Q Yin, J Zhang, C Yu, Z He, N Cheng, K Huang Information Sciences 610, 746-758, 2022 | 6 | 2022 |
An efficient end-to-end training approach for zero-shot human-AI coordination X Yan, J Guo, X Lou, J Wang, H Zhang, Y Du Advances in Neural Information Processing Systems 36, 2024 | 3 | 2024 |
Position: Foundation Agents as the Paradigm Shift for Decision Making X Liu, X Lou, J Jiao, J Zhang arXiv preprint arXiv:2405.17009, 2024 | 1 | 2024 |
Leveraging Joint-action Embedding in Multi-agent Reinforcement Learning for Cooperative Games X Lou, J Zhang, Y Du, C Yu, Z He, K Huang IEEE Transactions on Games, 2023 | 1 | 2023 |
SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling X Lou, J Zhang, J Xie, L Liu, D Yan, K Huang arXiv preprint arXiv:2405.12739, 2024 | | 2024 |
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient X Lou, J Zhang, TJ Norman, K Huang, Y Du Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17496 …, 2024 | | 2024 |
Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models X Lou, J Zhang, Z Wang, K Huang, Y Du arXiv preprint arXiv:2401.07553, 2024 | | 2024 |
SPO: Multi-Dimensional Preference Alignment With Implicit Reward Modeling X Lou, J Zhang, J Xie, L Liu, D Yan, K Huang | | |