A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Reinforcement learning (RL) has achieved tremendous success in many complex decision
making tasks. When it comes to deploying RL in the real world, safety concerns are usually …

Safe learning in robotics: From learning-based control to safe reinforcement learning

L Brunke, M Greeff, AW Hall, Z Yuan… - Annual Review of …, 2022 - annualreviews.org
The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …

Learning-based model predictive control: Toward safe learning in control

L Hewing, KP Wabersich, M Menner… - Annual Review of …, 2020 - annualreviews.org
Recent successes in the field of machine learning, as well as the availability of increased
sensing and computational capabilities in modern control systems, have led to a growing …

[HTML][HTML] All you need to know about model predictive control for buildings

J Drgoňa, J Arroyo, IC Figueroa, D Blum… - Annual Reviews in …, 2020 - Elsevier
It has been proven that advanced building control, like model predictive control (MPC), can
notably reduce the energy use and mitigate greenhouse gas emissions. However, despite …

Model-based reinforcement learning: A survey

TM Moerland, J Broekens, A Plaat… - … and Trends® in …, 2023 - nowpublishers.com
Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …

Data-driven model predictive control with stability and robustness guarantees

J Berberich, J Köhler, MA Müller… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
We propose a robust data-driven model predictive control (MPC) scheme to control linear
time-invariant systems. The scheme uses an implicit model description based on behavioral …

Efficient and accurate estimation of lipschitz constants for deep neural networks

M Fazlyab, A Robey, H Hassani… - Advances in neural …, 2019 - proceedings.neurips.cc
Tight estimation of the Lipschitz constant for deep neural networks (DNNs) is useful in many
applications ranging from robustness certification of classifiers to stability analysis of closed …

A tour of reinforcement learning: The view from continuous control

B Recht - Annual Review of Control, Robotics, and Autonomous …, 2019 - annualreviews.org
This article surveys reinforcement learning from the perspective of optimization and control,
with a focus on continuous control applications. It reviews the general formulation …

Safety-critical model predictive control with discrete-time control barrier function

J Zeng, B Zhang, K Sreenath - 2021 American Control …, 2021 - ieeexplore.ieee.org
The optimal performance of robotic systems is usually achieved near the limit of state and
input bounds. Model predictive control (MPC) is a prevalent strategy to handle these …

Safe model-based reinforcement learning with stability guarantees

F Berkenkamp, M Turchetta… - Advances in neural …, 2017 - proceedings.neurips.cc
Reinforcement learning is a powerful paradigm for learning optimal policies from
experimental data. However, to find optimal policies, most reinforcement learning algorithms …