POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning
Multi-constraint offline reinforcement learning (RL) promises to learn policies that satisfy
both cumulative and state-wise costs from offline datasets. This arrangement provides an …
both cumulative and state-wise costs from offline datasets. This arrangement provides an …
Guard: A safe reinforcement learning benchmark
Due to the trial-and-error nature, it is typically challenging to apply RL algorithms to safety-
critical real-world applications, such as autonomous driving, human-robot interaction, robot …
critical real-world applications, such as autonomous driving, human-robot interaction, robot …
The Feasibility of Constrained Reinforcement Learning Algorithms: A Tutorial Study
Satisfying safety constraints is a priority concern when solving optimal control problems
(OCPs). Due to the existence of infeasibility phenomenon, where a constraint-satisfying …
(OCPs). Due to the existence of infeasibility phenomenon, where a constraint-satisfying …
[HTML][HTML] SafeRPlan: Safe deep reinforcement learning for intraoperative planning of pedicle screw placement
Spinal fusion surgery requires highly accurate implantation of pedicle screw implants, which
must be conducted in critical proximity to vital structures with a limited view of the anatomy …
must be conducted in critical proximity to vital structures with a limited view of the anatomy …
Learn with imagination: Safe set guided state-wise constrained policy optimization
Deep reinforcement learning (RL) excels in various control tasks, yet the absence of safety
guarantees hampers its real-world applicability. In particular, explorations during learning …
guarantees hampers its real-world applicability. In particular, explorations during learning …
Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium
Multi-agent reinforcement learning (MARL) has achieved notable success in cooperative
tasks, demonstrating impressive performance and scalability. However, deploying MARL …
tasks, demonstrating impressive performance and scalability. However, deploying MARL …
Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning
Deep reinforcement learning (DRL) has demonstrated remarkable performance in many
continuous control tasks. However, a significant obstacle to the real-world application of …
continuous control tasks. However, a significant obstacle to the real-world application of …
POLICEd RL: Learning Closed-Loop Robot Control Policies with Provable Satisfaction of Hard Constraints
In this paper, we seek to learn a robot policy guaranteed to satisfy state constraints. To
encourage constraint satisfaction, existing RL algorithms typically rely on Constrained …
encourage constraint satisfaction, existing RL algorithms typically rely on Constrained …
Learning to Provably Satisfy High Relative Degree Constraints for Black-Box Systems
In this paper, we develop a method for learning a control policy guaranteed to satisfy an
affine state constraint of high relative degree in closed loop with a black-box system …
affine state constraint of high relative degree in closed loop with a black-box system …