Xingzhou Lou 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	24	24
h 指数	3	3
i10 指数	1	1

0

14

7

2023202410 14

开放获取的出版物数量

0 篇文章

2 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Junge ZHANGNational Laboratory of Pattern Recognition, CASIA在 nlpr.ia.ac.cn 的电子邮件经过验证
kaiqi huangNLPR,CASIA在 nlpr.ia.ac.cn 的电子邮件经过验证
Yali DuTuring Fellow, Associate professor, King's College London在 kcl.ac.uk 的电子邮件经过验证
Jiaxian GuoPh.D. Student, Computer Science Department, The University of Sydney在 uni.sydney.edu.au 的电子邮件经过验证
Jun WangProfessor, Computer Science, University College London在 cs.ucl.ac.uk 的电子邮件经过验证

Xingzhou Lou

Xingzhou Lou

Institution of Automation, Chinese Academy of Sciences

在 ia.ac.cn 的电子邮件经过验证

Deep Learning Reinforcement Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Pecan: Leveraging policy ensemble for context-aware zero-shot human-ai coordination X Lou, J Guo, J Zhang, J Wang, K Huang, Y Du arXiv preprint arXiv:2301.06387, 2023	13	2023
Offline reinforcement learning with representations for actions X Lou, Q Yin, J Zhang, C Yu, Z He, N Cheng, K Huang Information Sciences 610, 746-758, 2022	6	2022
An efficient end-to-end training approach for zero-shot human-AI coordination X Yan, J Guo, X Lou, J Wang, H Zhang, Y Du Advances in Neural Information Processing Systems 36, 2024	3	2024
Position: Foundation Agents as the Paradigm Shift for Decision Making X Liu, X Lou, J Jiao, J Zhang arXiv preprint arXiv:2405.17009, 2024	1	2024
Leveraging Joint-action Embedding in Multi-agent Reinforcement Learning for Cooperative Games X Lou, J Zhang, Y Du, C Yu, Z He, K Huang IEEE Transactions on Games, 2023	1	2023
SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling X Lou, J Zhang, J Xie, L Liu, D Yan, K Huang arXiv preprint arXiv:2405.12739, 2024		2024
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient X Lou, J Zhang, TJ Norman, K Huang, Y Du Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17496 …, 2024		2024
Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models X Lou, J Zhang, Z Wang, K Huang, Y Du arXiv preprint arXiv:2401.07553, 2024		2024
SPO: Multi-Dimensional Preference Alignment With Implicit Reward Modeling X Lou, J Zhang, J Xie, L Liu, D Yan, K Huang

系统目前无法执行此操作，请稍后再试。

文章 1–9

共建清朗的网络空间,如遇有害信息,请举报。
本站数据皆整合自互联网公开资源索引,方便科研学术方面查询,并不存储相关数据资源;如对此有异议,请联系我们解决.
© 2023 学术资源搜索 @联系我们 | 申请短期会员 | 数据源提交