A CMDP-within-online framework for meta-safe reinforcement learning

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

被引用次数：226 相关文章所有 2 个版本

[PDF] neurips.cc

Constraint-conditioned policy optimization for versatile safe reinforcement learning

Y Yao, Z Liu, Z Cen, J Zhu, W Yu… - Advances in Neural …, 2024 - proceedings.neurips.cc

Safe reinforcement learning (RL) focuses on training reward-maximizing agents subject to
pre-defined safety constraints. Yet, learning versatile safe policies that can adapt to varying …

被引用次数：6 相关文章所有 8 个版本

[PDF] openreview.net

Meta inverse constrained reinforcement learning: Convergence guarantee and generalization analysis

S Liu, M Zhu - The Twelfth International Conference on Learning …, 2023 - openreview.net

This paper considers the problem of learning the reward function and constraints of an
expert from few demonstrations. This problem can be considered as a meta-learning …

被引用次数：5 相关文章

[PDF] neurips.cc

Online constrained meta-learning: provable guarantees for generalization

S Xu, M Zhu - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc

Meta-learning has attracted attention due to its strong ability to learn experiences from
known tasks, which can speed up and enhance the learning process for new tasks …

被引用次数：2 相关文章所有 5 个版本

[PDF] neurips.cc

Multi-agent meta-reinforcement learning: sharper convergence rates with task similarity

W Mao, H Qiu, C Wang, H Franke… - Advances in …, 2024 - proceedings.neurips.cc

Multi-agent reinforcement learning (MARL) has primarily focused on solving a single task in
isolation, while in practice the environment is often evolving, leaving many related tasks to …

Gradient shaping for multi-constraint safe reinforcement learning

Y Yao, Z Liu, Z Cen, P Huang… - … Annual Learning for …, 2024 - proceedings.mlr.press

Online safe reinforcement learning (RL) involves training a policy that maximizes task
efficiency while satisfying constraints via interacting with the environments. In this paper, our …

被引用次数：1 相关文章所有 2 个版本

[PDF] berkeley.edu

Local analysis of entropy-regularized stochastic soft-max policy gradient methods

Y Ding, J Zhang, J Lavaei - 2023 European Control Conference …, 2023 - ieeexplore.ieee.org

Entropy regularization is an efficient technique for encouraging exploration and preventing a
premature convergence of (vanilla) policy gradient methods in reinforcement learning (RL) …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Preparing for Black Swans: The Antifragility Imperative for Machine Learning

M Jin - arXiv preprint arXiv:2405.11397, 2024 - arxiv.org

Operating safely and reliably despite continual distribution shifts is vital for high-stakes
machine learning applications. This paper builds upon the transformative concept …

被引用次数：1 相关文章所有 2 个版本

[PDF] psu.edu

Safe machine learning for intelligent multi-robot systems

Z Yuan - 2024 - etda.libraries.psu.edu

Recent advances in embedded computing and mobile sensing have led to pervasive use of
robotic systems in both civil and military applications. With single autonomous robots for …

[PDF] osu.edu

[PDF][PDF] Robust Position Estimation using Range Measurements from Transmitters with Inaccurate Positions

A Sel, S Hayek, ZM Kassas - people.engineering.osu.edu

The problem of position estimation using range measurements from transmitters with
inaccurately known positions is considered. The true position of each transmitter is assumed …