A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning K Khetarpal, ZD Guo, BA Pires, Y Tang, C Lyle, M Rowland, N Heess, ... arXiv preprint arXiv:2406.02035, 2024 | | 2024 |
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice Y Jiao, F Ling, S Heydari, N Heess, J Merel, E Kanso arXiv preprint arXiv:2405.11457, 2024 | | 2024 |
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning D Tirumala, M Wulfmeier, B Moran, S Huang, J Humplik, G Lever, ... arXiv preprint arXiv:2405.02425, 2024 | | 2024 |
Learning agile soccer skills for a bipedal robot with deep reinforcement learning T Haarnoja, B Moran, G Lever, SH Huang, D Tirumala, J Humplik, ... Science Robotics 9 (89), eadi8022, 2024 | 56 | 2024 |
The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models NY Siegel, OM Camburu, N Heess, M Perez-Ortiz arXiv preprint arXiv:2404.03189, 2024 | | 2024 |
TacticAI: an AI assistant for football tactics Z Wang, P Veličković, D Hennes, N Tomašev, L Prince, M Kaisers, ... Nature communications 15 (1), 1906, 2024 | 9 | 2024 |
Genie: Generative Interactive Environments J Bruce, M Dennis, A Edwards, J Parker-Holder, Y Shi, E Hughes, M Lai, ... arXiv preprint arXiv:2402.15391, 2024 | 23 | 2024 |
Data-efficient reinforcement learning for continuous control tasks M Riedmiller, R Hafner, M Vecerik, TP Lillicrap, T Lampe, I Popov, ... US Patent App. 18/351,440, 2024 | | 2024 |
Learning to Learn Faster from Human Feedback with Language Model Predictive Control J Liang, F Xia, W Yu, A Zeng, MG Arenas, M Attarian, M Bauza, M Bennice, ... arXiv preprint arXiv:2402.11450, 2024 | 5 | 2024 |
NfgTransformer: Equivariant Representation Learning for Normal-form Games S Liu, L Marris, G Piliouras, I Gemp, N Heess arXiv preprint arXiv:2402.08393, 2024 | 3 | 2024 |
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs S Nasiriany, F Xia, W Yu, T Xiao, J Liang, I Dasgupta, A Xie, D Driess, ... arXiv preprint arXiv:2402.07872, 2024 | 18 | 2024 |
Offline Actor-Critic Reinforcement Learning Scales to Large Models JT Springenberg, A Abdolmaleki, J Zhang, O Groth, M Bloesch, T Lampe, ... arXiv preprint arXiv:2402.05546, 2024 | 2 | 2024 |
Selecting reinforcement learning actions using a low-level controller NMO Heess, TP Lillicrap, GD Wayne, Y Tassa US Patent 11,875,258, 2024 | | 2024 |
Neural Population Learning beyond Symmetric Zero-sum Games S Liu, L Marris, M Lanctot, G Piliouras, JZ Leibo, N Heess arXiv preprint arXiv:2401.05133, 2024 | 1 | 2024 |
Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots T Lampe, A Abdolmaleki, S Bechtle, SH Huang, JT Springenberg, ... arXiv preprint arXiv:2312.11374, 2023 | 1 | 2023 |
Foundations for Transfer in Reinforcement Learning: A Taxonomy of Knowledge Modalities M Wulfmeier, A Byravan, S Bechtle, K Hausman, N Heess arXiv preprint arXiv:2312.01939, 2023 | 1 | 2023 |
Replay across Experiments: A Natural Extension of Off-Policy RL D Tirumala, T Lampe, JE Chen, T Haarnoja, S Huang, G Lever, B Moran, ... arXiv preprint arXiv:2311.15951, 2023 | | 2023 |
Open X-Embodiment: Robotic Learning Datasets and RT-X Models Q Vuong, S Levine, HR Walke, K Pertsch, A Singh, R Doshi, C Xu, J Luo, ... Towards Generalist Robots: Learning Paradigms for Scalable Skill Acquisition …, 2023 | 11 | 2023 |
Reinforcement and imitation learning for a task S Tunyasuvunakool, Y Zhu, J Merel, J Kramár, Z Wang, NMO Heess US Patent App. 18/306,711, 2023 | | 2023 |
Open X-Embodiment: Robotic Learning Datasets and RT-X Models A Padalkar, A Pooley, A Jain, A Bewley, A Herzog, A Irpan, A Khazatsky, ... arXiv preprint arXiv:2310.08864, 2023 | 127 | 2023 |