Automated verification and synthesis of stochastic hybrid systems: A survey

A Lavaei, S Soudjani, A Abate, M Zamani - Automatica, 2022 - Elsevier
Stochastic hybrid systems have received significant attentions as a relevant modeling
framework describing many systems, from engineering to the life sciences: they enable the …

Certified reinforcement learning with logic guidance

H Hasanbeig, D Kroening, A Abate - Artificial Intelligence, 2023 - Elsevier
Reinforcement Learning (RL) is a widely employed machine learning architecture that has
been applied to a variety of control problems. However, applications in safety-critical …

Symbolic task inference in deep reinforcement learning

H Hasanbeig, NY Jeppu, A Abate, T Melham… - Journal of Artificial …, 2024 - jair.org
This paper proposes DeepSynth, a method for effective training of deep reinforcement
learning agents when the reward is sparse or non-Markovian, but at the same time progress …

Allure: A systematic protocol for auditing and improving llm-based evaluation of text using iterative in-context-learning

H Hasanbeig, H Sharma, L Betthauser… - arXiv preprint arXiv …, 2023 - arxiv.org
From grading papers to summarizing medical documents, large language models (LLMs)
are evermore used for evaluation of text generated by humans and AI alike. However …

Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

R Mitta, H Hasanbeig, J Wang, D Kroening… - Proceedings of the …, 2024 - ojs.aaai.org
This paper addresses the problem of maintaining safety during training in Reinforcement
Learning (RL), such that the safety constraint violations are bounded at any point during …

Probabilistic Counterexample Guidance for Safer Reinforcement Learning

X Ji, A Filieri - International Conference on Quantitative Evaluation of …, 2023 - Springer
Safe exploration aims at addressing the limitations of Reinforcement Learning (RL) in safety-
critical scenarios, where failures during trial-and-error learning may incur high costs. Several …

Sample-Efficient Reinforcement Learning with Temporal Logic Objectives: Leveraging the Task Specification to Guide Exploration

Y Kantaros, J Wang - IEEE Transactions on Automatic Control, 2024 - ieeexplore.ieee.org
This paper addresses the problem of learning optimal control policies for systems with
uncertain dynamics and high-level control objectives specified as Linear Temporal Logic …

System III: Learning with domain knowledge for safety constraints

F Barez, H Hasanbieg, A Abbate - arXiv preprint arXiv:2304.11593, 2023 - arxiv.org
Reinforcement learning agents naturally learn from extensive exploration. Exploration is
costly and can be unsafe in $\textit {safety-critical} $ domains. This paper proposes a novel …

Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications

J Wang, H Hasanbeig, K Tan, Z Sun… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper addresses the problem of designing optimal control policies for mobile robots
with mission and safety requirements specified using Linear Temporal Logic (LTL). We …

Verified Compositional Neuro-Symbolic Control for Stochastic Systems with Temporal Logic Tasks

J Wang, H Chen, Z Sun, Y Kantaros - arXiv preprint arXiv:2311.10863, 2023 - arxiv.org
Several methods have been proposed recently to learn neural network (NN) controllers for
autonomous agents, with unknown and stochastic dynamics, tasked with complex missions …