- 学术资源搜索

Constrained decision transformer for offline safe reinforcement learning

Z Liu, Z Guo, Y Yao, Z Cen, W Yu… - International …, 2023 - proceedings.mlr.press

Safe reinforcement learning (RL) trains a constraint satisfaction policy by interacting with the
environment. We aim to tackle a more challenging problem: learning a safe policy from an …

被引用次数：56 相关文章所有 7 个版本

[PDF] jmlr.org

Omnisafe: An infrastructure for accelerating safe reinforcement learning research

J Ji, J Zhou, B Zhang, J Dai, X Pan, R Sun… - Journal of Machine …, 2024 - jmlr.org

AI systems empowered by reinforcement learning (RL) algorithms harbor the immense
potential to catalyze societal advancement, yet their deployment is often impeded by …

被引用次数：36 相关文章所有 4 个版本

[PDF] mlr.press

Constrained variational policy optimization for safe reinforcement learning

Z Liu, Z Cen, V Isenbaev, W Liu, S Wu… - International …, 2022 - proceedings.mlr.press

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before
deploying them to safety-critical applications. Previous primal-dual style approaches suffer …

被引用次数：86 相关文章所有 6 个版本

[PDF] arxiv.org

Trustworthy reinforcement learning against intrinsic vulnerabilities: Robustness, safety, and generalizability

M Xu, Z Liu, P Huang, W Ding, Z Cen, B Li… - arXiv preprint arXiv …, 2022 - arxiv.org

A trustworthy reinforcement learning algorithm should be competent in solving challenging
real-world problems, including {robustly} handling uncertainties, satisfying {safety} …

被引用次数：48 相关文章所有 2 个版本

[PDF] neurips.cc

Safebench: A benchmarking platform for safety evaluation of autonomous vehicles

C Xu, W Ding, W Lyu, Z Liu, S Wang… - Advances in …, 2022 - proceedings.neurips.cc

As shown by recent studies, machine intelligence-enabled systems are vulnerable to test
cases resulting from either adversarial manipulation or natural distribution shifts. This has …

被引用次数：45 相关文章所有 7 个版本

[PDF] neurips.cc

Constraint-conditioned policy optimization for versatile safe reinforcement learning

Y Yao, Z Liu, Z Cen, J Zhu, W Yu… - Advances in Neural …, 2024 - proceedings.neurips.cc

Safe reinforcement learning (RL) focuses on training reward-maximizing agents subject to
pre-defined safety constraints. Yet, learning versatile safe policies that can adapt to varying …

被引用次数：11 相关文章所有 8 个版本

[PDF] arxiv.org

On the robustness of safe reinforcement learning under observational perturbations

Z Liu, Z Guo, Z Cen, H Zhang, J Tan, B Li… - arXiv preprint arXiv …, 2022 - arxiv.org

Safe reinforcement learning (RL) trains a policy to maximize the task reward while satisfying
safety constraints. While prior works focus on the performance optimality, we find that the …

被引用次数：36 相关文章所有 6 个版本

[PDF] aaai.org

Conservative and adaptive penalty for model-based safe reinforcement learning

YJ Ma, A Shen, O Bastani, J Dinesh - Proceedings of the AAAI …, 2022 - ojs.aaai.org

Reinforcement Learning (RL) agents in the real world must satisfy safety constraints in
addition to maximizing a reward objective. Model-based RL algorithms hold promise for …

被引用次数：26 相关文章所有 10 个版本

[PDF] mlr.press

Learning off-policy with online planning

H Sikchi, W Zhou, D Held - Conference on Robot Learning, 2022 - proceedings.mlr.press

Reinforcement learning (RL) in low-data and risk-sensitive domains requires performant and
flexible deployment policies that can readily incorporate constraints during deployment. One …

被引用次数：38 相关文章所有 8 个版本

[PDF] mlr.press

Towards robust and safe reinforcement learning with benign off-policy data

Z Liu, Z Guo, Z Cen, H Zhang, Y Yao… - International …, 2023 - proceedings.mlr.press

Previous work demonstrates that the optimal safe reinforcement learning policy in a noise-
free environment is vulnerable and could be unsafe under observational attacks. While …

被引用次数：5 相关文章所有 5 个版本