Reinforcement Learning: An Overview

K Murphy - arXiv preprint arXiv:2412.05265, 2024 - arxiv.org
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …

Vid2robot: End-to-end video-conditioned policy learning with cross-attention transformers

V Jain, M Attarian, NJ Joshi, A Wahid, D Driess… - arXiv preprint arXiv …, 2024 - arxiv.org
While large-scale robotic systems typically rely on textual instructions for tasks, this work
explores a different approach: can robots infer the task directly from observing humans? This …

Flare: Achieving masterful and adaptive robot policies with large-scale reinforcement learning fine-tuning

J Hu, R Hendrix, A Farhadi, A Kembhavi… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, the Robotics field has initiated several efforts toward building generalist
robot policies through large-scale multi-task Behavior Cloning. However, direct deployments …

Rt-affordance: Affordances are versatile intermediate representations for robot manipulation

S Nasiriany, S Kirmani, T Ding, L Smith, Y Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
We explore how intermediate policy representations can facilitate generalization by
providing guidance on how to perform manipulation tasks. Existing representations such as …

Natural language reinforcement learning

X Feng, Z Wan, H Fu, B Liu, M Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement Learning (RL) mathematically formulates decision-making with Markov
Decision Process (MDP). With MDPs, researchers have achieved remarkable breakthroughs …

MotionGlot: A Multi-Embodied Motion Generation Model

S Harithas, S Sridhar - arXiv preprint arXiv:2410.16623, 2024 - arxiv.org
This paper introduces MotionGlot, a model that can generate motion across multiple
embodiments with different action dimensions, such as quadruped robots and human …

Neural Scaling Laws for Embodied AI

S Sartor, N Thompson - arXiv preprint arXiv:2405.14005, 2024 - arxiv.org
Scaling laws have driven remarkable progress across machine learning domains like
language modeling and computer vision. However, the exploration of scaling laws in …

JUICER: Data-Efficient Imitation Learning for Robotic Assembly

L Ankile, A Simeonov, I Shenfeld, P Agrawal - arXiv preprint arXiv …, 2024 - arxiv.org
While learning from demonstrations is powerful for acquiring visuomotor policies, high-
performance imitation without large demonstration datasets remains challenging for tasks …

Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory

N Zubić, F Soldá, A Sulser, D Scaramuzza - arXiv preprint arXiv …, 2024 - arxiv.org
Deep learning models have achieved significant success across various applications but
continue to struggle with tasks requiring complex reasoning over sequences, such as …

DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots

M Bauza, JE Chen, V Dalibard, N Gileadi… - arXiv preprint arXiv …, 2024 - arxiv.org
We present DemoStart, a novel auto-curriculum reinforcement learning method capable of
learning complex manipulation behaviors on an arm equipped with a three-fingered robotic …