Reinforcement learning for versatile, dynamic, and robust bipedal locomotion control

Z Li, XB Peng, P Abbeel, S Levine, G Berseth… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents a comprehensive study on using deep reinforcement learning (RL) to
create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single …

Bundled gradients through contact via randomized smoothing

HJT Suh, T Pang, R Tedrake - IEEE Robotics and Automation …, 2022 - ieeexplore.ieee.org
The empirical success of derivative-free methods in reinforcement learning for planning
through contact seems at odds with the perceived fragility of classical gradient-based …

Robust and versatile bipedal jumping control through reinforcement learning

Z Li, XB Peng, P Abbeel, S Levine, G Berseth… - arXiv preprint arXiv …, 2023 - arxiv.org
This work aims to push the limits of agility for bipedal robots by enabling a torque-controlled
bipedal robot to perform robust and versatile dynamic jumps in the real world. We present a …

Trajectory optimization with optimization-based dynamics

TA Howell, S Le Cleac'h, S Singh… - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
We present a framework for bi-level trajectory optimization in which a system's dynamics are
encoded as the solution to a constrained optimization problem and smooth gradients of this …

Trajectotree: Trajectory optimization meets tree search for planning multi-contact dexterous manipulation

C Chen, P Culbertson, M Lepert… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org
Dexterous manipulation tasks often require contact switching, where fingers make and break
contact with the object. We propose a method that plans trajectories for dexterous …

Learning diverse and physically feasible dexterous grasps with generative model and bilevel optimization

A Wu, M Guo, CK Liu - arXiv preprint arXiv:2207.00195, 2022 - arxiv.org
To fully utilize the versatility of a multi-fingered dexterous robotic hand for executing diverse
object grasps, one must consider the rich physical constraints introduced by hand-object …

Optimizing dynamic trajectories for robustness to disturbances using polytopic projections

H Ferrolho, W Merkt, V Ivan, W Wolfslag… - 2020 IEEE/RSJ …, 2020 - ieeexplore.ieee.org
This paper focuses on robustness to disturbance forces and uncertain payloads. We present
a novel formulation to optimize the robustness of dynamic trajectories. A straightforward …

Autogenerated manipulation primitives

E Huang, X Cheng, Y Mao, A Gupta… - … Journal of Robotics …, 2023 - journals.sagepub.com
The central theme in robotic manipulation is that of the robot interacting with the world
through physical contact. We tend to describe that physical contact using specific words that …

Contact-implicit trajectory optimization with learned deformable contacts using bilevel optimization

Y Zhu, Z Pan, K Hauser - 2021 IEEE International Conference …, 2021 - ieeexplore.ieee.org
We present a bilevel, contact-implicit trajectory optimization (TO) formulation that searches
for robot trajectories with learned soft contact models. On the lower-level, contact forces are …

Graph reinforcement learning for network control via bi-level optimization

D Gammelli, J Harrison, K Yang, M Pavone… - arXiv preprint arXiv …, 2023 - arxiv.org
Optimization problems over dynamic networks have been extensively studied and widely
used in the past decades to formulate numerous real-world problems. However,(1) …