A review of safe reinforcement learning: Methods, theory and applications
Reinforcement learning (RL) has achieved tremendous success in many complex decision-making
tasks. When it comes to deploying RL in the real world, safety concerns are usually …
Constrained variational policy optimization for safe reinforcement learning
Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before
deploying them to safety-critical applications. Previous primal-dual style approaches suffer …
CRPO: A new approach for safe reinforcement learning with convergence guarantee
In safe reinforcement learning (SRL) problems, an agent explores the environment to
maximize an expected total reward and meanwhile avoids violation of certain constraints on …
Provably efficient safe exploration via primal-dual policy optimization
We study the safe reinforcement learning problem using the constrained Markov decision
processes in which an agent aims to maximize the expected total reward subject to a safety …
Sauté RL: Almost surely safe reinforcement learning using state augmentation
A Sootla, AI Cowen-Rivers, T Jafferjee… - International …, 2022 - proceedings.mlr.press
Satisfying safety constraints almost surely (or with probability one) can be critical for the
deployment of Reinforcement Learning (RL) in real-life applications. For example, plane …
Long-term fairness with unknown dynamics
While machine learning can myopically reinforce social inequalities, it may also be used to
dynamically seek equitable outcomes. In this paper, we formalize long-term fairness as an …
Penalized proximal policy optimization for safe reinforcement learning
Safe reinforcement learning aims to learn the optimal policy while satisfying safety
constraints, which is essential in real-world applications. However, current algorithms still …
Achieving zero constraint violation for constrained reinforcement learning via primal-dual approach
Reinforcement learning is widely used in applications where one needs to perform
sequential decisions while interacting with the environment. The problem becomes more …
Model-free safe reinforcement learning through neural barrier certificate
Safety is a critical concern when applying reinforcement learning (RL) to real-world control
tasks. However, existing safe RL works either only consider expected safety constraint …
Safe policies for reinforcement learning via primal-dual methods
S Paternain, M Calvo-Fullana… - … on Automatic Control, 2022 - ieeexplore.ieee.org
In this article, we study the design of controllers in the context of stochastic optimal control
under the assumption that the model of the system is not available. This is, we aim to control …