Lyapunov barrier policy optimization

Model-based safe deep reinforcement learning via a constrained proximal policy optimization algorithm

AK Jayant, S Bhatnagar - Advances in Neural Information …, 2022 - proceedings.neurips.cc

During initial iterations of training in most Reinforcement Learning (RL) algorithms, agents
perform a significant number of random exploratory steps. In the real world, this can limit the …

被引用次数：30 相关文章所有 7 个版本

[PDF] neurips.cc

Learning barrier certificates: Towards safe reinforcement learning with zero training-time violations

Y Luo, T Ma - Advances in Neural Information Processing …, 2021 - proceedings.neurips.cc

Training-time safety violations have been a major concern when we deploy reinforcement
learning algorithms in the real world. This paper explores the possibility of safe RL …

被引用次数：45 相关文章所有 6 个版本

Autonomous driving based on approximate safe action

X Wang, J Zhang, D Hou… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Safety limits the application of traditional reinforcement learning (RL) methods to
autonomous driving. To address the challenge of safe exploration in autonomous driving …

被引用次数：9 相关文章所有 3 个版本

[PDF] mlr.press

Safe driving via expert guided policy optimization

Z Peng, Q Li, C Liu, B Zhou - Conference on Robot Learning, 2022 - proceedings.mlr.press

When learning common skills like driving, beginners usually have domain experts standing
by to ensure the safety of the learning process. We formulate such learning scheme under …

被引用次数：32 相关文章所有 5 个版本

Efficient off-policy safe reinforcement learning using trust region conditional value at risk

D Kim, S Oh - IEEE Robotics and Automation Letters, 2022 - ieeexplore.ieee.org

This letter aims to solve a safe reinforcement learning (RL) problem with risk measure-based
constraints. As risk measures, such as conditional value at risk (CVaR), focus on the tail …

被引用次数：17 相关文章所有 3 个版本

[PDF] mlr.press

Learning off-policy with online planning

H Sikchi, W Zhou, D Held - Conference on Robot Learning, 2022 - proceedings.mlr.press

Reinforcement learning (RL) in low-data and risk-sensitive domains requires performant and
flexible deployment policies that can readily incorporate constraints during deployment. One …

被引用次数：31 相关文章所有 8 个版本

On the feasibility guarantees of deep reinforcement learning solutions for distribution system operation

MM Hosseini, M Parvania - IEEE Transactions on Smart Grid, 2023 - ieeexplore.ieee.org

Deep reinforcement learning (DRL) has scored unprecedented success in finding near-
optimal solutions in high-dimensional stochastic problems, leading to its extensive use in …

被引用次数：9 相关文章

[PDF] arxiv.org

Improving safety in deep reinforcement learning using unsupervised action planning

HL Hsu, Q Huang, S Ha - 2022 International Conference on …, 2022 - ieeexplore.ieee.org

One of the key challenges to deep reinforcement learning (deep RL) is to ensure safety at
both training and testing phases. In this work, we propose a novel technique of …

被引用次数：16 相关文章所有 8 个版本

[PDF] arxiv.org

Safe exploration by solving early terminated mdp

H Sun, Z Xu, M Fang, Z Peng, J Guo, B Dai… - arXiv preprint arXiv …, 2021 - arxiv.org

Safe exploration is crucial for the real-world application of reinforcement learning (RL).
Previous works consider the safe exploration problem as Constrained Markov Decision …

被引用次数：14 相关文章所有 2 个版本

[PDF] arxiv.org

Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications

P Liu, H Bou-Ammar, J Peters, D Tateo - arXiv preprint arXiv:2404.09080, 2024 - arxiv.org

Integrating learning-based techniques, especially reinforcement learning, into robotics is
promising for solving complex problems in unstructured environments. However, most …

被引用次数：2 相关文章所有 2 个版本