Tuning legged locomotion controllers via safe bayesian optimization
This paper presents a data-driven strategy to streamline the deployment of model-based
controllers in legged robotic hardware platforms. Our approach leverages a model-free safe …
controllers in legged robotic hardware platforms. Our approach leverages a model-free safe …
Safe reinforcement learning of dynamic high-dimensional robotic tasks: navigation, manipulation, interaction
Safety is a fundamental property for the real-world deployment of robotic platforms. Any
control policy should avoid dangerous actions that could harm the environment, humans, or …
control policy should avoid dangerous actions that could harm the environment, humans, or …
Transductive active learning: Theory and applications
We study a generalization of classical active learning to real-world settings with concrete
prediction targets where sampling is restricted to an accessible region of the domain, while …
prediction targets where sampling is restricted to an accessible region of the domain, while …
Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications
Integrating learning-based techniques, especially reinforcement learning, into robotics is
promising for solving complex problems in unstructured environments. However, most …
promising for solving complex problems in unstructured environments. However, most …
Event-triggered safe Bayesian optimization on quadcopters
A Holzapfel, P Brunzema… - 6th Annual Learning for …, 2024 - proceedings.mlr.press
Bayesian optimization (BO) has proven to be a powerful tool for automatically tuning control
parameters without requiring knowledge of the underlying system dynamics. Safe BO …
parameters without requiring knowledge of the underlying system dynamics. Safe BO …
No-Regret Algorithms for Safe Bayesian Optimization with Monotonicity Constraints
A Losalka, J Scarlett - International Conference on Artificial …, 2024 - proceedings.mlr.press
We consider the problem of sequentially maximizing an unknown function $ f $ over a set of
actions of the form $(s, x) $, where the selected actions must satisfy a safety constraint with …
actions of the form $(s, x) $, where the selected actions must satisfy a safety constraint with …
Efficient safe learning for controller tuning with experimental validation
M Zagorowska, C König, H Yu, EC Balta… - arXiv preprint arXiv …, 2023 - arxiv.org
Optimization-based controller tuning is challenging because it requires formulating
optimization problems explicitly as functions of controller parameters. Safe learning …
optimization problems explicitly as functions of controller parameters. Safe learning …
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization
Reinforcement learning (RL) algorithms aim to balance exploiting the current best strategy
with exploring new options that could lead to higher rewards. Most common RL algorithms …
with exploring new options that could lead to higher rewards. Most common RL algorithms …
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Reinforcement learning (RL) is ubiquitous in the development of modern AI systems.
However, state-of-the-art RL agents require extensive, and potentially unsafe, interactions …
However, state-of-the-art RL agents require extensive, and potentially unsafe, interactions …
Benefits of monotonicity in safe exploration with Gaussian processes
A Losalka, J Scarlett - Uncertainty in Artificial Intelligence, 2023 - proceedings.mlr.press
We consider the problem of sequentially maximising an unknown function over a set of
actions while ensuring that every sampled point has a function value below a given safety …
actions while ensuring that every sampled point has a function value below a given safety …