A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Reinforcement learning (RL) has achieved tremendous success in many complex decision
making tasks. When it comes to deploying RL in the real world, safety concerns are usually …

Safe learning in robotics: From learning-based control to safe reinforcement learning

L Brunke, M Greeff, AW Hall, Z Yuan… - Annual Review of …, 2022 - annualreviews.org
The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …

Learning-based model predictive control: Toward safe learning in control

L Hewing, KP Wabersich, M Menner… - Annual Review of …, 2020 - annualreviews.org
Recent successes in the field of machine learning, as well as the availability of increased
sensing and computational capabilities in modern control systems, have led to a growing …

Physics-informed machine learning: A survey on problems, methods and applications

Z Hao, S Liu, Y Zhang, C Ying, Y Feng, H Su… - arXiv preprint arXiv …, 2022 - arxiv.org
Recent advances of data-driven machine learning have revolutionized fields like computer
vision, reinforcement learning, and many scientific and engineering domains. In many real …

The safety filter: A unified view of safety-critical control in autonomous systems

KC Hsu, H Hu, JF Fisac - Annual Review of Control, Robotics …, 2023 - annualreviews.org
Recent years have seen significant progress in the realm of robot autonomy, accompanied
by the expanding reach of robotic technologies. However, the emergence of new …

End-to-end safe reinforcement learning through barrier functions for safety-critical continuous control tasks

R Cheng, G Orosz, RM Murray, JW Burdick - Proceedings of the AAAI …, 2019 - aaai.org
Reinforcement Learning (RL) algorithms have found limited success beyond simulated
applications, and one main reason is the absence of safety guarantees during the learning …

Recovery RL: Safe reinforcement learning with learned recovery zones

B Thananjeyan, A Balakrishna, S Nair… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
Safety remains a central obstacle preventing widespread use of RL in the real world:
learning new tasks in uncertain environments requires extensive exploration, but safety …

Data-enabled predictive control: In the shallows of the DeePC

J Coulson, J Lygeros, F Dörfler - 2019 18th European Control …, 2019 - ieeexplore.ieee.org
We consider the problem of optimal trajectory tracking for unknown systems. A novel
data-enabled predictive control (DeePC) algorithm is presented that computes optimal and safe …

Natural policy gradient primal-dual method for constrained Markov decision processes

D Ding, K Zhang, T Basar… - Advances in Neural …, 2020 - proceedings.neurips.cc
We study sequential decision-making problems in which each agent aims to maximize the
expected total reward while satisfying a constraint on the expected total utility. We employ …

Disturbance observers and extended state observers for marine vehicles: A survey

N Gu, D Wang, Z Peng, J Wang, QL Han - Control Engineering Practice, 2022 - Elsevier
The operation performance of marine vehicles (MVs) is significantly vulnerable to external
disturbances induced by wind, waves, and ocean currents in complex marine environments …