Understanding the complexity gains of single-task rl with a curriculum

Q Li, Y Zhai, Y Ma, S Levine - International Conference on …, 2023 - proceedings.mlr.press
Reinforcement learning (RL) problems can be challenging without well-shaped rewards.
Prior work on provably efficient RL methods generally proposes to address this issue with …

Cross-domain policy adaptation via value-guided data filtering

K Xu, C Bai, X Ma, D Wang, B Zhao… - Advances in …, 2023 - proceedings.neurips.cc
Generalizing policies across different domains with dynamics mismatch poses a significant
challenge in reinforcement learning. For example, a robot learns the policy in a simulator …

Rapidly evolving soft robots via action inheritance

S Liu, W Yao, H Wang, W Peng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The automatic design of soft robots characterizes as jointly optimizing structure and control.
As reinforcement learning is gradually used to optimize control, the time-consuming …

Mirage: Cross-Embodiment Zero-Shot Policy Transfer with Cross-Painting

LY Chen, K Hari, K Dharmarajan, C Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
The ability to reuse collected data and transfer trained policies between robots could
alleviate the burden of additional data collection and training. While existing approaches …

Differentiable soft-robot generation

F Cochevelou, D Bonner, MP Schmidt - Proceedings of the Genetic and …, 2023 - dl.acm.org
Soft robots have multiple potential applications for artificial life, ergonomics and human
interaction but they also present many design and control challenges. One of these …

Transferability in the automatic off-line design of robot swarms: from sim-to-real to embodiment and design-method transfer across different platforms

M Kegeleirs, DG Ramos, K Hasselmann… - IEEE Robotics and …, 2024 - ieeexplore.ieee.org
Automatic off-line design is an attractive approach to implementing robot swarms. In this
approach, a designer specifies a mission to be accomplished by the swarm, and an …

Herd: Continuous human-to-robot evolution for learning from human demonstration

X Liu, D Pathak, KM Kitani - arXiv preprint arXiv:2212.04359, 2022 - arxiv.org
The ability to learn from human demonstration endows robots with the ability to automate
various tasks. However, directly learning from human demonstration is challenging since the …

Diff-Transfer: Model-based Robotic Manipulation Skill Transfer via Differentiable Physics Simulation

Y Xiang, F Chen, Q Wang, Y Gang, X Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
The capability to transfer mastered skills to accomplish a range of similar yet novel tasks is
crucial for intelligent robots. In this work, we introduce $\textit {Diff-Transfer} $, a novel …

A system for morphology-task generalization via unified representation and behavior distillation

H Furuta, Y Iwasawa, Y Matsuo, SS Gu - arXiv preprint arXiv:2211.14296, 2022 - arxiv.org
The rise of generalist large-scale models in natural language and vision has made us
expect that a massive data-driven approach could achieve broader generalization in other …

Meta-evolve: Continuous robot evolution for one-to-many policy transfer

X Liu, D Pathak, D Zhao - arXiv preprint arXiv:2405.03534, 2024 - arxiv.org
We investigate the problem of transferring an expert policy from a source robot to multiple
different robots. To solve this problem, we propose a method named $ Meta $-$ Evolve …