Tuning legged locomotion controllers via safe bayesian optimization

D Widmer, D Kang, B Sukhija… - … on Robot Learning, 2023 - proceedings.mlr.press
This paper presents a data-driven strategy to streamline the deployment of model-based
controllers in legged robotic hardware platforms. Our approach leverages a model-free safe …

Safe reinforcement learning of dynamic high-dimensional robotic tasks: navigation, manipulation, interaction

P Liu, K Zhang, D Tateo, S Jauhri, Z Hu… - … on Robotics and …, 2023 - ieeexplore.ieee.org
Safety is a fundamental property for the real-world deployment of robotic platforms. Any
control policy should avoid dangerous actions that could harm the environment, humans, or …

Transductive active learning: Theory and applications

J Hübotter, B Sukhija, L Treven… - 38th Annual …, 2024 - research-collection.ethz.ch
We study a generalization of classical active learning to real-world settings with concrete
prediction targets where sampling is restricted to an accessible region of the domain, while …

Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications

P Liu, H Bou-Ammar, J Peters, D Tateo - arXiv preprint arXiv:2404.09080, 2024 - arxiv.org
Integrating learning-based techniques, especially reinforcement learning, into robotics is
promising for solving complex problems in unstructured environments. However, most …

Event-triggered safe Bayesian optimization on quadcopters

A Holzapfel, P Brunzema… - 6th Annual Learning for …, 2024 - proceedings.mlr.press
Bayesian optimization (BO) has proven to be a powerful tool for automatically tuning control
parameters without requiring knowledge of the underlying system dynamics. Safe BO …

No-Regret Algorithms for Safe Bayesian Optimization with Monotonicity Constraints

A Losalka, J Scarlett - International Conference on Artificial …, 2024 - proceedings.mlr.press
We consider the problem of sequentially maximizing an unknown function $ f $ over a set of
actions of the form $(s, x) $, where the selected actions must satisfy a safety constraint with …

Efficient safe learning for controller tuning with experimental validation

M Zagorowska, C König, H Yu, EC Balta… - arXiv preprint arXiv …, 2023 - arxiv.org
Optimization-based controller tuning is challenging because it requires formulating
optimization problems explicitly as functions of controller parameters. Safe learning …

MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization

B Sukhija, S Coros, A Krause, P Abbeel… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement learning (RL) algorithms aim to balance exploiting the current best strategy
with exploring new options that could lead to higher rewards. Most common RL algorithms …

ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning

Y As, B Sukhija, L Treven, C Sferrazza, S Coros… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement learning (RL) is ubiquitous in the development of modern AI systems.
However, state-of-the-art RL agents require extensive, and potentially unsafe, interactions …

Benefits of monotonicity in safe exploration with Gaussian processes

A Losalka, J Scarlett - Uncertainty in Artificial Intelligence, 2023 - proceedings.mlr.press
We consider the problem of sequentially maximising an unknown function over a set of
actions while ensuring that every sampled point has a function value below a given safety …