Control synthesis from linear temporal logic specifications using model-free reinforcement learning
We present a reinforcement learning (RL) frame-work to synthesize a control policy from a
given linear temporal logic (LTL) specification in an unknown stochastic environment that …
given linear temporal logic (LTL) specification in an unknown stochastic environment that …
Omega-regular objectives in model-free reinforcement learning
We provide the first solution for model-free reinforcement learning of ω-regular objectives for
Markov decision processes (MDPs). We present a constructive reduction from the almost …
Markov decision processes (MDPs). We present a constructive reduction from the almost …
Limit-deterministic Büchi automata for linear temporal logic
Limit-deterministic Büchi automata can replace deterministic Rabin automata in probabilistic
model checking algorithms, and can be significantly smaller. We present a direct …
model checking algorithms, and can be significantly smaller. We present a direct …
JANI: quantitative model and tool interaction
The formal analysis of critical systems is supported by a vast space of modelling formalisms
and tools. The variety of incompatible formats and tools however poses a significant …
and tools. The variety of incompatible formats and tools however poses a significant …
Policy synthesis and reinforcement learning for discounted LTL
The difficulty of manually specifying reward functions has led to an interest in using linear
temporal logic (LTL) to express objectives for reinforcement learning (RL). However, LTL …
temporal logic (LTL) to express objectives for reinforcement learning (RL). However, LTL …
Optimal probabilistic motion planning with potential infeasible LTL constraints
This paper studies optimal motion planning subject to motion and environment uncertainties.
By modeling the system as a probabilistic labeled Markov decision process (PL-MDP), the …
By modeling the system as a probabilistic labeled Markov decision process (PL-MDP), the …
Multi-objective ω-regular reinforcement learning
The expanding role of reinforcement learning (RL) in safety-critical system design has
promoted ω-automata as a way to express learning requirements—often non-Markovian …
promoted ω-automata as a way to express learning requirements—often non-Markovian …
Policy optimization with linear temporal logic constraints
We study the problem of policy optimization (PO) with linear temporal logic (LTL) constraints.
The language of LTL allows flexible description of tasks that may be unnatural to encode as …
The language of LTL allows flexible description of tasks that may be unnatural to encode as …
Mungojerrie: Linear-time objectives in model-free reinforcement learning
Mungojerrie is an extensible tool that provides a framework to translate linear-time
objectives into reward for reinforcement learning (RL). The tool provides convergent RL …
objectives into reward for reinforcement learning (RL). The tool provides convergent RL …
Translating omega-regular specifications to average objectives for model-free reinforcement learning
Recent success in reinforcement learning (RL) has brought renewed attention to the design
of reward functions by which agent behavior is reinforced or deterred. Manually designing …
of reward functions by which agent behavior is reinforced or deterred. Manually designing …