Bei Peng 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	2159	1978
h 指数	16	15
i10 指数	19	16

640

320

160

480

201520162017201820192020202120222023202416 36 43 76 56 139 313 513 632 322

开放获取的出版物数量

查看全部

14 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Matthew E. TaylorAssociate Professor, University of Alberta在 ualberta.ca 的电子邮件经过验证
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo在 cs.ox.ac.uk 的电子邮件经过验证
David L. RobertsAssociate Professor, Assistant Director Undergraduate Programs, Interim Director Digital Games在 csc.ncsu.edu 的电子邮件经过验证
Michael LittmanBrown University在 brown.edu 的电子邮件经过验证
Robert LoftinLecturer, University of Sheffield在 sheffield.ac.uk 的电子邮件经过验证
James MacGlashanSony AI在 sony.com 的电子邮件经过验证
Wendelin BöhmerSequential Decision Making Group, Delft University of Technology在 tudelft.nl 的电子邮件经过验证
Tabish RashidMicrosoft Research在 microsoft.com 的电子邮件经过验证
Christian Schroeder de WittUniversity of Oxford在 robots.ox.ac.uk 的电子邮件经过验证
Tarun GuptaUniversity of Oxford, Microsoft Research在 microsoft.com 的电子邮件经过验证
Jeff HuangBrown University在 jeffhuang.com 的电子邮件经过验证
Philip TorrProfessor, University of Oxford在 eng.ox.ac.uk 的电子邮件经过验证
Anuj MahajanAmazon在 cs.ox.ac.uk 的电子邮件经过验证
Sanmit NarvekarResearch Scientist, Waymo在 cs.utexas.edu 的电子邮件经过验证
Peter StoneProfessor of Computer Science, The University of Texas at Austin在 cs.utexas.edu 的电子邮件经过验证
Jivko SinapovAssistant Professor, Tufts University在 cs.tufts.edu 的电子邮件经过验证
Matteo LeonettiDepartment of Informatics, King's College London在 kcl.ac.uk 的电子邮件经过验证
Gregory FarquharDeepMind在 google.com 的电子邮件经过验证
Tonghan WangEcon CS group, Harvard University在 g.harvard.edu 的电子邮件经过验证
Shariq IqbalResearch Scientist, Deepmind在 deepmind.com 的电子邮件经过验证

关注

Bei Peng

Lecturer (Assistant Professor), University of Liverpool

在 liverpool.ac.uk 的电子邮件经过验证 - 首页

Machine Learning Reinforcement Learning Interactive Learning Multi-Agent Systems Curriculum Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey S Narvekar, B Peng, M Leonetti, J Sinapov, ME Taylor, P Stone Journal of Machine Learning Research (JMLR 2020) 21, 1-50, 2020	475	2020
Weighted QMIX: Expanding Monotonic Value Function Factorisation T Rashid, G Farquhar, B Peng, S Whiteson Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020), 2020	332*	2020
Interactive learning from policy-dependent human feedback J MacGlashan, MK Ho, R Loftin, B Peng, G Wang, DL Roberts, ME Taylor, ... 34th International Conference on Machine Learning (ICML 2017), 2285-2294, 2017	314	2017
RODE: Learning Roles to Decompose Multi-Agent Tasks T Wang, T Gupta, A Mahajan, B Peng, S Whiteson, C Zhang International Conference on Learning Representations (ICLR 2021), 2020	189	2020
FACMAC: Factored Multi-Agent Centralised Policy Gradients B Peng, T Rashid, CAS de Witt, PA Kamienny, PHS Torr, W Böhmer, ... 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021	182	2021
Learning behaviors via human-delivered discrete feedback: modeling implicit feedback strategies to speed up learning R Loftin, B Peng, J MacGlashan, ML Littman, ME Taylor, J Huang, ... Autonomous agents and multi-agent systems (JAAMAS 2016) 30 (1), 30-59, 2016	125	2016
A strategy-aware technique for learning behaviors from discrete human feedback RT Loftin, J MacGlashan, B Peng, ME Taylor, ML Littman, J Huang, ... Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI 2014), 2014	80	2014
Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control CS de Witt, B Peng (equal contribution), PA Kamienny, P Torr, W Böhmer, ... arXiv preprint arXiv:2003.06709, 2020	76	2020
Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning S Iqbal, CAS de Witt, B Peng, W Böhmer, S Whiteson, F Sha 38th International Conference on Machine Learning (ICML 2021), 2021	74*	2021
A need for speed: Adapting agent action speed to improve task learning from non-expert humans B Peng, J MacGlashan, R Loftin, ML Littman, DL Roberts, ME Taylor Autonomous Agents and Multiagent Systems (AAMAS 2016), 2016	57	2016
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning T Gupta, A Mahajan, B Peng, W Böhmer, S Whiteson 38th International Conference on Machine Learning (ICML 2021), 2021	49	2021
Optimistic Exploration even with a Pessimistic Initialisation T Rashid, B Peng, W Böhmer, S Whiteson International Conference on Learning Representations (ICLR 2020), 2020	47	2020
Regularized Softmax Deep Multi-Agent Q-Learning L Pan, T Rashid, B Peng, L Huang, S Whiteson 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021	30*	2021
Learning something from nothing: Leveraging implicit human feedback strategies R Loftin, B Peng, J MacGlashan, ML Littman, ME Taylor, J Huang, ... The 23rd IEEE international symposium on robot and human interactive …, 2014	30	2014
Training an agent to ground commands with reward and punishment J MacGlashan, M Littman, R Loftin, B Peng, D Roberts, M Taylor Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014	25	2014
Curriculum Design for Machine Learners in Sequential Decision Tasks B Peng, J MacGlashan, R Loftin, ML Littman, DL Roberts, ME Taylor IEEE Transactions on Emerging Topics in Computational Intelligence 2 (4 …, 2018	18	2018
An empirical study of non-expert curriculum design for machine learners B Peng, J MacGlashan, R Loftin, ML Littman, DL Roberts, ME Taylor Proceedings of the IJCAI Interactive Machine Learning Workshop, 2016	14	2016
Convergent Actor Critic by Humans J MacGlashan, ML Littman, DL Roberts, R Loftin, B Peng, ME Taylor International Conference on Intelligent Robots and Systems (IROS 2016), 2016	12	2016
Towards integrating real-time crowd advice with reinforcement learning GV de la Cruz, B Peng, WS Lasecki, ME Taylor Proceedings of the 20th International Conference on Intelligent User …, 2015	10	2015
Generating real-time crowd advice to improve reinforcement learning agents GV de la Cruz, B Peng, WS Lasecki, ME Taylor Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015	4	2015

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用