Reinforcement Learning: An Overview
K Murphy - arXiv preprint arXiv:2412.05265, 2024 - arxiv.org
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …
learning and sequential decision making, covering value-based RL, policy-gradient …
Vid2robot: End-to-end video-conditioned policy learning with cross-attention transformers
While large-scale robotic systems typically rely on textual instructions for tasks, this work
explores a different approach: can robots infer the task directly from observing humans? This …
explores a different approach: can robots infer the task directly from observing humans? This …
Flare: Achieving masterful and adaptive robot policies with large-scale reinforcement learning fine-tuning
In recent years, the Robotics field has initiated several efforts toward building generalist
robot policies through large-scale multi-task Behavior Cloning. However, direct deployments …
robot policies through large-scale multi-task Behavior Cloning. However, direct deployments …
Rt-affordance: Affordances are versatile intermediate representations for robot manipulation
We explore how intermediate policy representations can facilitate generalization by
providing guidance on how to perform manipulation tasks. Existing representations such as …
providing guidance on how to perform manipulation tasks. Existing representations such as …
Natural language reinforcement learning
Reinforcement Learning (RL) mathematically formulates decision-making with Markov
Decision Process (MDP). With MDPs, researchers have achieved remarkable breakthroughs …
Decision Process (MDP). With MDPs, researchers have achieved remarkable breakthroughs …
MotionGlot: A Multi-Embodied Motion Generation Model
S Harithas, S Sridhar - arXiv preprint arXiv:2410.16623, 2024 - arxiv.org
This paper introduces MotionGlot, a model that can generate motion across multiple
embodiments with different action dimensions, such as quadruped robots and human …
embodiments with different action dimensions, such as quadruped robots and human …
Neural Scaling Laws for Embodied AI
S Sartor, N Thompson - arXiv preprint arXiv:2405.14005, 2024 - arxiv.org
Scaling laws have driven remarkable progress across machine learning domains like
language modeling and computer vision. However, the exploration of scaling laws in …
language modeling and computer vision. However, the exploration of scaling laws in …
JUICER: Data-Efficient Imitation Learning for Robotic Assembly
While learning from demonstrations is powerful for acquiring visuomotor policies, high-
performance imitation without large demonstration datasets remains challenging for tasks …
performance imitation without large demonstration datasets remains challenging for tasks …
Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory
N Zubić, F Soldá, A Sulser, D Scaramuzza - arXiv preprint arXiv …, 2024 - arxiv.org
Deep learning models have achieved significant success across various applications but
continue to struggle with tasks requiring complex reasoning over sequences, such as …
continue to struggle with tasks requiring complex reasoning over sequences, such as …
DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots
We present DemoStart, a novel auto-curriculum reinforcement learning method capable of
learning complex manipulation behaviors on an arm equipped with a three-fingered robotic …
learning complex manipulation behaviors on an arm equipped with a three-fingered robotic …