Transfer learning for reinforcement learning domains: A survey. ME Taylor, P Stone Journal of Machine Learning Research 10 (7), 2009 | 2306 | 2009 |
Deep recurrent q-learning for partially observable mdps M Hausknecht, P Stone 2015 aaai fall symposium series, 2015 | 2184 | 2015 |
Multiagent systems: A survey from a machine learning perspective P Stone, M Veloso Autonomous Robots 8, 345-383, 2000 | 2007 | 2000 |
A multiagent approach to autonomous intersection management K Dresner, P Stone Journal of artificial intelligence research 31, 591-656, 2008 | 1538 | 2008 |
Layered learning in multiagent systems PH Stone Carnegie Mellon University, 1998 | 1034* | 1998 |
Artificial intelligence and life in 2030: the one hundred year study on artificial intelligence P Stone, R Brooks, E Brynjolfsson, R Calo, O Etzioni, G Hager, ... arXiv preprint arXiv:2211.06318, 2022 | 1009* | 2022 |
Multiagent traffic management: A reservation-based intersection control mechanism K Dresner, P Stone Autonomous Agents and Multiagent Systems, International Joint Conference on …, 2004 | 882 | 2004 |
Policy gradient reinforcement learning for fast quadrupedal locomotion N Kohl, P Stone IEEE International Conference on Robotics and Automation, 2004. Proceedings …, 2004 | 813 | 2004 |
Behavioral cloning from observation F Torabi, G Warnell, P Stone arXiv preprint arXiv:1805.01954, 2018 | 686 | 2018 |
Task decomposition, dynamic role assignment, and low-bandwidth communication for real-time strategic teamwork P Stone, M Veloso Artificial Intelligence 110 (2), 241-273, 1999 | 652 | 1999 |
Scalable training of artificial neural networks with adaptive sparse connectivity inspired by network science DC Mocanu, E Mocanu, P Stone, PH Nguyen, M Gibescu, A Liotta Nature communications 9 (1), 2383, 2018 | 620 | 2018 |
Reinforcement learning for robocup soccer keepaway P Stone, RS Sutton, G Kuhlmann Adaptive Behavior 13 (3), 165-188, 2005 | 601 | 2005 |
Interactively shaping agents via human reinforcement: The TAMER framework WB Knox, P Stone Proceedings of the fifth international conference on Knowledge capture, 9-16, 2009 | 592 | 2009 |
Autonomous agents modelling other agents: A comprehensive survey and open problems SV Albrecht, P Stone Artificial Intelligence 258, 66-95, 2018 | 537 | 2018 |
The RoboCup synthetic agent challenge 97 H Kitano, M Tambe, P Stone, M Veloso, S Coradeschi, E Osawa, ... RoboCup-97: Robot Soccer World Cup I 1, 62-73, 1998 | 495 | 1998 |
Curriculum learning for reinforcement learning domains: A framework and survey S Narvekar, B Peng, M Leonetti, J Sinapov, ME Taylor, P Stone Journal of Machine Learning Research 21 (181), 1-50, 2020 | 477 | 2020 |
Ad hoc autonomous agent teams: Collaboration without pre-coordination P Stone, G Kaminka, S Kraus, J Rosenschein Proceedings of the AAAI Conference on Artificial Intelligence 24 (1), 1504-1509, 2010 | 451 | 2010 |
PAC subset selection in stochastic multi-armed bandits. S Kalyanakrishnan, A Tewari, P Auer, P Stone ICML 12, 655-662, 2012 | 414 | 2012 |
Deep reinforcement learning in parameterized action space M Hausknecht, P Stone arXiv preprint arXiv:1511.04143, 2015 | 386 | 2015 |
Multiagent traffic management: An improved intersection control mechanism K Dresner, P Stone Proceedings of the fourth international joint conference on Autonomous …, 2005 | 374 | 2005 |