A survey on policy search algorithms for learning robot controllers in a handful of trials

K Chatzilygeroudis, V Vassiliades… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Most policy search (PS) algorithms require thousands of training episodes to find an
effective policy, which is often infeasible with a physical robot. This survey article focuses on …

Bayesian optimization using domain knowledge on the ATRIAS biped

A Rai, R Antonova, S Song, W Martin… - … on Robotics and …, 2018 - ieeexplore.ieee.org
Robotics controllers often consist of expert-designed heuristics, which can be hard to tune in
higher dimensions. Simulation can aid in optimizing these controllers if parameters learned …

Robot learning with crash constraints

A Marco, D Baumann, M Khadiv… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
In the past decade, numerous machine learning algorithms have been shown to
successfully learn optimal policies to control real robotic systems. However, it is common to …

Using simulation to improve sample-efficiency of Bayesian optimization for bipedal robots

A Rai, R Antonova, F Meier, CG Atkeson - Journal of machine learning …, 2019 - jmlr.org
Learning for control can acquire controllers for novel robotic tasks, paving the path for
autonomous agents. Such controllers can be expert-designed policies, which typically …

Using parameterized black-box priors to scale up model-based policy search for robotics

K Chatzilygeroudis, JB Mouret - 2018 IEEE international …, 2018 - ieeexplore.ieee.org
The most data-efficient algorithms for reinforcement learning in robotics are model-based
policy search algorithms, which alternate between learning a dynamical model of the robot …

Robust walking based on MPC with viability guarantees

MH Yeganegi, M Khadiv, A Del Prete… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Model predictive control (MPC) has shown great success for controlling complex systems,
such as legged robots. However, when closing the loop, the performance and feasibility of …

Deep kernels for optimizing locomotion controllers

R Antonova, A Rai, CG Atkeson - Conference on Robot …, 2017 - proceedings.mlr.press
Sample efficiency is important when optimizing parameters of locomotion controllers, since
hardware experiments are time consuming and expensive. Bayesian Optimization, a sample …

Beyond basins of attraction: Quantifying robustness of natural dynamics

S Heim, A Spröwitz - IEEE Transactions on Robotics, 2019 - ieeexplore.ieee.org
Properly designing a system to exhibit favorable natural dynamics can greatly simplify
designing or learning the control policy. However, it is still unclear what constitutes favorable …

On the design of LQR kernels for efficient controller learning

A Marco, P Hennig, S Schaal… - 2017 IEEE 56th Annual …, 2017 - ieeexplore.ieee.org
Finding optimal feedback controllers for nonlinear dynamic systems from data is hard.
Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct …

Learning to control highly accelerated ballistic movements on muscular robots

D Büchler, R Calandra, J Peters - Robotics and Autonomous Systems, 2023 - Elsevier
High-speed and high-acceleration movements are inherently hard to control. Applying
learning to the control of such motions on anthropomorphic robot arms can improve the …