Model-agnostic meta-learning for fast adaptation of deep networks C Finn, P Abbeel, S Levine International conference on machine learning, 1126-1135, 2017 | 12656 | 2017 |
Denoising diffusion probabilistic models J Ho, A Jain, P Abbeel Advances in neural information processing systems 33, 6840-6851, 2020 | 10286 | 2020 |
Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor T Haarnoja, A Zhou, P Abbeel, S Levine International conference on machine learning, 1861-1870, 2018 | 8362 | 2018 |
Trust region policy optimization J Schulman, S Levine, P Abbeel, M Jordan, P Moritz International conference on machine learning, 1889-1897, 2015 | 8191 | 2015 |
Infogan: Interpretable representation learning by information maximizing generative adversarial nets X Chen, Y Duan, R Houthooft, J Schulman, I Sutskever, P Abbeel Advances in neural information processing systems 29, 2016 | 5312 | 2016 |
Multi-agent actor-critic for mixed cooperative-competitive environments R Lowe, YI Wu, A Tamar, J Harb, OAI Pieter Abbeel, I Mordatch Advances in neural information processing systems 30, 2017 | 4858 | 2017 |
Apprenticeship learning via inverse reinforcement learning P Abbeel, AY Ng Proceedings of the twenty-first international conference on Machine learning, 1, 2004 | 4191 | 2004 |
End-to-end training of deep visuomotor policies S Levine, C Finn, T Darrell, P Abbeel Journal of Machine Learning Research 17 (39), 1-40, 2016 | 3913 | 2016 |
High-dimensional continuous control using generalized advantage estimation J Schulman, P Moritz, S Levine, M Jordan, P Abbeel arXiv preprint arXiv:1506.02438, 2015 | 3627 | 2015 |
Domain randomization for transferring deep neural networks from simulation to the real world J Tobin, R Fong, A Ray, J Schneider, W Zaremba, P Abbeel 2017 IEEE/RSJ international conference on intelligent robots and systems …, 2017 | 3191 | 2017 |
Hindsight experience replay M Andrychowicz, F Wolski, A Ray, J Schneider, R Fong, P Welinder, ... Advances in neural information processing systems 30, 2017 | 2753 | 2017 |
Soft actor-critic algorithms and applications T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ... arXiv preprint arXiv:1812.05905, 2018 | 2564 | 2018 |
Benchmarking deep reinforcement learning for continuous control Y Duan, X Chen, R Houthooft, J Schulman, P Abbeel International conference on machine learning, 1329-1338, 2016 | 2016 | 2016 |
A simple neural attentive meta-learner N Mishra, M Rohaninejad, X Chen, P Abbeel arXiv preprint arXiv:1707.03141, 2017 | 1510 | 2017 |
Sim-to-real transfer of robotic control with dynamics randomization XB Peng, M Andrychowicz, W Zaremba, P Abbeel 2018 IEEE international conference on robotics and automation (ICRA), 3803-3810, 2018 | 1479 | 2018 |
Constrained policy optimization J Achiam, D Held, A Tamar, P Abbeel International conference on machine learning, 22-31, 2017 | 1422 | 2017 |
Reinforcement learning with deep energy-based policies T Haarnoja, H Tang, P Abbeel, S Levine International conference on machine learning, 1352-1361, 2017 | 1406 | 2017 |
Decision transformer: Reinforcement learning via sequence modeling L Chen, K Lu, A Rajeswaran, K Lee, A Grover, M Laskin, P Abbeel, ... Advances in neural information processing systems 34, 15084-15097, 2021 | 1300 | 2021 |
Guided cost learning: Deep inverse optimal control via policy optimization C Finn, S Levine, P Abbeel International conference on machine learning, 49-58, 2016 | 1112 | 2016 |
RL: Fast Reinforcement Learning via Slow Reinforcement Learning Y Duan, J Schulman, X Chen, PL Bartlett, I Sutskever, P Abbeel arXiv preprint arXiv:1611.02779, 2016 | 1111 | 2016 |