Nicolas Heess
DeepMind
Verified email at google.com
Title
Cited by
Year
A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning
K Khetarpal, ZD Guo, BA Pires, Y Tang, C Lyle, M Rowland, N Heess, ...
arXiv preprint arXiv:2406.02035, 2024
2024
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice
Y Jiao, F Ling, S Heydari, N Heess, J Merel, E Kanso
arXiv preprint arXiv:2405.11457, 2024
2024
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning
D Tirumala, M Wulfmeier, B Moran, S Huang, J Humplik, G Lever, ...
arXiv preprint arXiv:2405.02425, 2024
2024
Learning agile soccer skills for a bipedal robot with deep reinforcement learning
T Haarnoja, B Moran, G Lever, SH Huang, D Tirumala, J Humplik, ...
Science Robotics 9 (89), eadi8022, 2024
Cited by 56, 2024
The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models
NY Siegel, OM Camburu, N Heess, M Perez-Ortiz
arXiv preprint arXiv:2404.03189, 2024
2024
TacticAI: an AI assistant for football tactics
Z Wang, P Veličković, D Hennes, N Tomašev, L Prince, M Kaisers, ...
Nature communications 15 (1), 1906, 2024
Cited by 9, 2024
Genie: Generative Interactive Environments
J Bruce, M Dennis, A Edwards, J Parker-Holder, Y Shi, E Hughes, M Lai, ...
arXiv preprint arXiv:2402.15391, 2024
Cited by 23, 2024
Data-efficient reinforcement learning for continuous control tasks
M Riedmiller, R Hafner, M Vecerik, TP Lillicrap, T Lampe, I Popov, ...
US Patent App. 18/351,440, 2024
2024
Learning to Learn Faster from Human Feedback with Language Model Predictive Control
J Liang, F Xia, W Yu, A Zeng, MG Arenas, M Attarian, M Bauza, M Bennice, ...
arXiv preprint arXiv:2402.11450, 2024
Cited by 5, 2024
NfgTransformer: Equivariant Representation Learning for Normal-form Games
S Liu, L Marris, G Piliouras, I Gemp, N Heess
arXiv preprint arXiv:2402.08393, 2024
Cited by 3, 2024
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
S Nasiriany, F Xia, W Yu, T Xiao, J Liang, I Dasgupta, A Xie, D Driess, ...
arXiv preprint arXiv:2402.07872, 2024
Cited by 18, 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
JT Springenberg, A Abdolmaleki, J Zhang, O Groth, M Bloesch, T Lampe, ...
arXiv preprint arXiv:2402.05546, 2024
Cited by 2, 2024
Selecting reinforcement learning actions using a low-level controller
NMO Heess, TP Lillicrap, GD Wayne, Y Tassa
US Patent 11,875,258, 2024
2024
Neural Population Learning beyond Symmetric Zero-sum Games
S Liu, L Marris, M Lanctot, G Piliouras, JZ Leibo, N Heess
arXiv preprint arXiv:2401.05133, 2024
Cited by 1, 2024
Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
T Lampe, A Abdolmaleki, S Bechtle, SH Huang, JT Springenberg, ...
arXiv preprint arXiv:2312.11374, 2023
Cited by 1, 2023
Foundations for Transfer in Reinforcement Learning: A Taxonomy of Knowledge Modalities
M Wulfmeier, A Byravan, S Bechtle, K Hausman, N Heess
arXiv preprint arXiv:2312.01939, 2023
Cited by 1, 2023
Replay across Experiments: A Natural Extension of Off-Policy RL
D Tirumala, T Lampe, JE Chen, T Haarnoja, S Huang, G Lever, B Moran, ...
arXiv preprint arXiv:2311.15951, 2023
2023
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Q Vuong, S Levine, HR Walke, K Pertsch, A Singh, R Doshi, C Xu, J Luo, ...
Towards Generalist Robots: Learning Paradigms for Scalable Skill Acquisition …, 2023
Cited by 11, 2023
Reinforcement and imitation learning for a task
S Tunyasuvunakool, Y Zhu, J Merel, J Kramár, Z Wang, NMO Heess
US Patent App. 18/306,711, 2023
2023
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
A Padalkar, A Pooley, A Jain, A Bewley, A Herzog, A Irpan, A Khazatsky, ...
arXiv preprint arXiv:2310.08864, 2023
Cited by 127, 2023