LCRL: Certified policy synthesis via logically-constrained reinforcement learning

A Lavaei, S Soudjani, A Abate, M Zamani - Automatica, 2022 - Elsevier

Stochastic hybrid systems have received significant attentions as a relevant modeling
framework describing many systems, from engineering to the life sciences: they enable the …

被引用次数：110 相关文章所有 9 个版本

[PDF] arxiv.org

Certified reinforcement learning with logic guidance

H Hasanbeig, D Kroening, A Abate - Artificial Intelligence, 2023 - Elsevier

Reinforcement Learning (RL) is a widely employed machine learning architecture that has
been applied to a variety of control problems. However, applications in safety-critical …

被引用次数：66 相关文章所有 12 个版本

[PDF] jair.org Full View

Symbolic task inference in deep reinforcement learning

H Hasanbeig, NY Jeppu, A Abate, T Melham… - Journal of Artificial …, 2024 - jair.org

This paper proposes DeepSynth, a method for effective training of deep reinforcement
learning agents when the reward is sparse or non-Markovian, but at the same time progress …

被引用次数：3 相关文章所有 3 个版本

[PDF] arxiv.org

Allure: A systematic protocol for auditing and improving llm-based evaluation of text using iterative in-context-learning

H Hasanbeig, H Sharma, L Betthauser… - arXiv preprint arXiv …, 2023 - arxiv.org

From grading papers to summarizing medical documents, large language models (LLMs)
are evermore used for evaluation of text generated by humans and AI alike. However …

被引用次数：11 相关文章

[PDF] aaai.org

Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

R Mitta, H Hasanbeig, J Wang, D Kroening… - Proceedings of the …, 2024 - ojs.aaai.org

This paper addresses the problem of maintaining safety during training in Reinforcement
Learning (RL), such that the safety constraint violations are bounded at any point during …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Probabilistic Counterexample Guidance for Safer Reinforcement Learning

X Ji, A Filieri - International Conference on Quantitative Evaluation of …, 2023 - Springer

Safe exploration aims at addressing the limitations of Reinforcement Learning (RL) in safety-
critical scenarios, where failures during trial-and-error learning may incur high costs. Several …

被引用次数：1 相关文章所有 4 个版本

[PDF] arxiv.org

Sample-Efficient Reinforcement Learning with Temporal Logic Objectives: Leveraging the Task Specification to Guide Exploration

Y Kantaros, J Wang - IEEE Transactions on Automatic Control, 2024 - ieeexplore.ieee.org

This paper addresses the problem of learning optimal control policies for systems with
uncertain dynamics and high-level control objectives specified as Linear Temporal Logic …

System III: Learning with domain knowledge for safety constraints

F Barez, H Hasanbieg, A Abbate - arXiv preprint arXiv:2304.11593, 2023 - arxiv.org

Reinforcement learning agents naturally learn from extensive exploration. Exploration is
costly and can be unsafe in $\textit {safety-critical} $ domains. This paper proposes a novel …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications

J Wang, H Hasanbeig, K Tan, Z Sun… - arXiv preprint arXiv …, 2023 - arxiv.org

This paper addresses the problem of designing optimal control policies for mobile robots
with mission and safety requirements specified using Linear Temporal Logic (LTL). We …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

Verified Compositional Neuro-Symbolic Control for Stochastic Systems with Temporal Logic Tasks

J Wang, H Chen, Z Sun, Y Kantaros - arXiv preprint arXiv:2311.10863, 2023 - arxiv.org

Several methods have been proposed recently to learn neural network (NN) controllers for
autonomous agents, with unknown and stochastic dynamics, tasked with complex missions …