A survey on policy search algorithms for learning robot controllers in a handful of trials
K Chatzilygeroudis, V Vassiliades… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Most policy search (PS) algorithms require thousands of training episodes to find an
effective policy, which is often infeasible with a physical robot. This survey article focuses on …
Bayesian optimization using domain knowledge on the ATRIAS biped
Robot controllers often consist of expert-designed heuristics, which can be hard to tune in
higher dimensions. Simulation can aid in optimizing these controllers if parameters learned …
Robot learning with crash constraints
In the past decade, numerous machine learning algorithms have been shown to
successfully learn optimal policies to control real robotic systems. However, it is common to …
Using simulation to improve sample-efficiency of Bayesian optimization for bipedal robots
Learning for control can acquire controllers for novel robotic tasks, paving the way for
autonomous agents. Such controllers can be expert-designed policies, which typically …
Using parameterized black-box priors to scale up model-based policy search for robotics
K Chatzilygeroudis, JB Mouret - 2018 IEEE international …, 2018 - ieeexplore.ieee.org
The most data-efficient algorithms for reinforcement learning in robotics are model-based
policy search algorithms, which alternate between learning a dynamical model of the robot …
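The alternation described in this abstract (fit a dynamics model to all data collected so far, then optimize the policy against that model before the next real episode) can be sketched generically as below. This is not the paper's algorithm: the toy linear system standing in for the robot, the GP dynamics model, the linear policy, and the random-search policy update are all illustrative assumptions.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

X_DIM, U_DIM, HORIZON = 2, 1, 30
A = np.array([[1.0, 0.1], [0.0, 1.0]])   # toy "robot" dynamics, unknown to the learner
B = np.array([[0.0], [0.1]])

def reward(x, u):
    return -float(x @ x + 0.01 * u @ u)   # keep the state near the origin

def run_episode(params, x0=np.array([1.0, 0.0])):
    """Run the linear policy u = W x on the toy robot and log transitions."""
    W = params.reshape(U_DIM, X_DIM)
    xs, us, xnext, x = [], [], [], x0.copy()
    for _ in range(HORIZON):
        u = W @ x
        xp = A @ x + B @ u + 0.01 * np.random.randn(X_DIM)
        xs.append(x); us.append(u); xnext.append(xp)
        x = xp
    return xs, us, xnext

def rollout_on_model(model, params, x0=np.array([1.0, 0.0])):
    """Evaluate a candidate policy purely on the learned dynamics model."""
    W = params.reshape(U_DIM, X_DIM)
    x, ret = x0.copy(), 0.0
    for _ in range(HORIZON):
        u = W @ x
        x = model.predict(np.concatenate([x, u])[None, :])[0]   # predicted next state
        ret += reward(x, u)
    return ret

data_in, data_out, params = [], [], np.zeros(X_DIM * U_DIM)
for episode in range(5):
    xs, us, xnext = run_episode(params)                          # 1) interact with the robot
    data_in += [np.concatenate([x, u]) for x, u in zip(xs, us)]
    data_out += xnext
    model = GaussianProcessRegressor(normalize_y=True).fit(      # 2) learn a dynamics model
        np.array(data_in), np.array(data_out))
    cands = params + 0.3 * np.random.randn(64, params.size)      # 3) policy search on the model
    params = cands[int(np.argmax([rollout_on_model(model, c) for c in cands]))]
```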
Robust walking based on MPC with viability guarantees
Model predictive control (MPC) has shown great success for controlling complex systems,
such as legged robots. However, when closing the loop, the performance and feasibility of …
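The receding-horizon structure this entry refers to can be made concrete with a bare-bones sketch: solve a finite-horizon optimal control problem at every step, apply only the first input, and re-plan from the measured state. This is only a toy illustration (double-integrator model, arbitrary weights and bounds, cvxpy as the solver); it does not implement the viability or feasibility guarantees studied in the paper.

```python
import numpy as np
import cvxpy as cp

A = np.array([[1.0, 0.1], [0.0, 1.0]])     # toy double-integrator dynamics
B = np.array([[0.005], [0.1]])
N, U_MAX = 20, 1.0                          # planning horizon and input bound

def mpc_step(x0):
    """Solve one finite-horizon problem and return the first planned input."""
    x = cp.Variable((2, N + 1))
    u = cp.Variable((1, N))
    cost, constraints = 0, [x[:, 0] == x0]
    for k in range(N):
        cost += cp.sum_squares(x[:, k + 1]) + 0.1 * cp.sum_squares(u[:, k])
        constraints += [x[:, k + 1] == A @ x[:, k] + B @ u[:, k],
                        cp.abs(u[:, k]) <= U_MAX]
    cp.Problem(cp.Minimize(cost), constraints).solve()
    return u.value[:, 0]

x = np.array([1.0, 0.0])
for t in range(50):                         # closing the loop: re-plan at every step
    u0 = mpc_step(x)
    x = A @ x + B @ u0 + 0.005 * np.random.randn(2)   # plant with a small disturbance
```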
Deep kernels for optimizing locomotion controllers
Sample efficiency is important when optimizing parameters of locomotion controllers, since
hardware experiments are time consuming and expensive. Bayesian Optimization, a sample …
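For context on the sample-efficiency argument, the sketch below shows a plain Bayesian optimization loop for tuning a low-dimensional controller parameter vector: a Matérn GP surrogate, an expected-improvement acquisition, and a placeholder cost function standing in for a hardware or simulation experiment. The deep or domain-informed kernels these papers propose would replace the plain kernel here; everything else (dimensions, budgets, the candidate-set acquisition optimizer) is an arbitrary illustrative choice.

```python
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

def controller_cost(theta):
    """Placeholder for an expensive hardware or simulation experiment."""
    return float(np.sum((theta - 0.3) ** 2) + 0.01 * np.random.randn())

DIM, N_INIT, N_ITER = 2, 5, 20
X = np.random.rand(N_INIT, DIM)                  # initial controller parameters in [0, 1]^d
y = np.array([controller_cost(x) for x in X])

for _ in range(N_ITER):
    gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), normalize_y=True).fit(X, y)
    cand = np.random.rand(2000, DIM)             # cheap stand-in for an inner acquisition optimizer
    mu, sigma = gp.predict(cand, return_std=True)
    best = y.min()
    z = (best - mu) / np.maximum(sigma, 1e-9)
    ei = (best - mu) * norm.cdf(z) + sigma * norm.pdf(z)   # expected improvement (minimization)
    x_next = cand[int(np.argmax(ei))]
    X = np.vstack([X, x_next])
    y = np.append(y, controller_cost(x_next))

print("best parameters:", X[int(np.argmin(y))], "with cost", y.min())
```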
Beyond basins of attraction: Quantifying robustness of natural dynamics
S Heim, A Spröwitz - IEEE Transactions on Robotics, 2019 - ieeexplore.ieee.org
Properly designing a system to exhibit favorable natural dynamics can greatly simplify
designing or learning the control policy. However, it is still unclear what constitutes favorable …
On the design of LQR kernels for efficient controller learning
Finding optimal feedback controllers for nonlinear dynamic systems from data is hard.
Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct …
Learning to control highly accelerated ballistic movements on muscular robots
High-speed and high-acceleration movements are inherently hard to control. Applying
learning to the control of such motions on anthropomorphic robot arms can improve the …