Control synthesis from linear temporal logic specifications using model-free reinforcement learning

AK Bozkurt, Y Wang, MM Zavlanos… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
We present a reinforcement learning (RL) frame-work to synthesize a control policy from a
given linear temporal logic (LTL) specification in an unknown stochastic environment that …

Omega-regular objectives in model-free reinforcement learning

EM Hahn, M Perez, S Schewe, F Somenzi… - … conference on tools and …, 2019 - Springer
We provide the first solution for model-free reinforcement learning of ω-regular objectives for
Markov decision processes (MDPs). We present a constructive reduction from the almost …

Limit-deterministic Büchi automata for linear temporal logic

S Sickert, J Esparza, S Jaax, J Křetínský - International Conference on …, 2016 - Springer
Limit-deterministic Büchi automata can replace deterministic Rabin automata in probabilistic
model checking algorithms, and can be significantly smaller. We present a direct …

JANI: quantitative model and tool interaction

CE Budde, C Dehnert, EM Hahn, A Hartmanns… - … 2017, Held as Part of the …, 2017 - Springer
The formal analysis of critical systems is supported by a vast space of modelling formalisms
and tools. The variety of incompatible formats and tools however poses a significant …

Policy synthesis and reinforcement learning for discounted LTL

R Alur, O Bastani, K Jothimurugan, M Perez… - … on Computer Aided …, 2023 - Springer
The difficulty of manually specifying reward functions has led to an interest in using linear
temporal logic (LTL) to express objectives for reinforcement learning (RL). However, LTL …

Optimal probabilistic motion planning with potential infeasible LTL constraints

M Cai, S Xiao, Z Li, Z Kan - IEEE Transactions on Automatic …, 2021 - ieeexplore.ieee.org
This paper studies optimal motion planning subject to motion and environment uncertainties.
By modeling the system as a probabilistic labeled Markov decision process (PL-MDP), the …

Multi-objective ω-regular reinforcement learning

EM Hahn, M Perez, S Schewe, F Somenzi… - Formal Aspects of …, 2023 - dl.acm.org
The expanding role of reinforcement learning (RL) in safety-critical system design has
promoted ω-automata as a way to express learning requirements—often non-Markovian …

Policy optimization with linear temporal logic constraints

C Voloshin, H Le, S Chaudhuri… - Advances in Neural …, 2022 - proceedings.neurips.cc
We study the problem of policy optimization (PO) with linear temporal logic (LTL) constraints.
The language of LTL allows flexible description of tasks that may be unnatural to encode as …

Mungojerrie: Linear-time objectives in model-free reinforcement learning

EM Hahn, M Perez, S Schewe, F Somenzi… - … Conference on Tools …, 2023 - Springer
Mungojerrie is an extensible tool that provides a framework to translate linear-time
objectives into reward for reinforcement learning (RL). The tool provides convergent RL …

Translating omega-regular specifications to average objectives for model-free reinforcement learning

M Kazemi, M Perez, F Somenzi, S Soudjani… - Proc. of the 21st …, 2022 - par.nsf.gov
Recent success in reinforcement learning (RL) has brought renewed attention to the design
of reward functions by which agent behavior is reinforced or deterred. Manually designing …