Deep reinforcement learning verification: a survey

M Landers, A Doryab - ACM Computing Surveys, 2023 - dl.acm.org
Deep reinforcement learning (DRL) has proven capable of superhuman performance on
many complex tasks. To achieve this success, DRL algorithms train a decision-making agent …

Safe reinforcement learning using probabilistic shields

N Jansen, B Könighofer, S Junges… - 31st International …, 2020 - drops.dagstuhl.de
This paper concerns the efficient construction of a safety shield for reinforcement learning.
We specifically target scenarios that incorporate uncertainty and use Markov decision …

Verifying reinforcement learning up to infinity

E Bacci, M Giacobbe, D Parker - Proceedings of the International Joint …, 2021 - ora.ox.ac.uk
Formally verifying that reinforcement learning systems act safely is increasingly important,
but existing methods only verify over finite time. This is of limited use for dynamical systems …

COOL-MC: a comprehensive tool for reinforcement learning and model checking

D Gross, N Jansen, S Junges, GA Pérez - International Symposium on …, 2022 - Springer
This paper presents COOL-MC, a tool that integrates state-of-the-art reinforcement learning
(RL) and model checking. Specifically, the tool builds upon the OpenAI gym and the …

Verifiable strategy synthesis for multiple autonomous agents: a scalable approach

R Gu, PG Jensen, DB Poulsen, C Seceleanu… - International Journal on …, 2022 - Springer
Path planning and task scheduling are two challenging problems in the design of multiple
autonomous agents. Both problems can be solved by the use of exhaustive search …

Strategy Synthesis for Autonomous Driving in a Moving Block Railway System with Uppaal Stratego

D Basile, MH ter Beek, A Legay - International Conference on Formal …, 2020 - Springer
Moving block railway systems are the next generation signalling systems currently under
development as part of the Shift2Rail European initiative, including autonomous driving …

Analyzing neural network behavior through deep statistical model checking

TP Gros, H Hermanns, J Hoffmann, M Klauck… - International Journal on …, 2023 - Springer
Neural networks (NN) are taking over ever more decisions thus far taken by humans, even
though verifiable system-level guarantees are far out of reach. Neither is the verification …

Shielded reinforcement learning for hybrid systems

AH Brorholt, PG Jensen, KG Larsen, F Lorber… - … Conference on Bridging …, 2023 - Springer
Safe and optimal controller synthesis for switched-controlled hybrid systems, which combine
differential equations and discrete changes of the system's state, is known to be intricately …

Verified probabilistic policies for deep reinforcement learning

E Bacci, D Parker - NASA Formal Methods Symposium, 2022 - Springer
Deep reinforcement learning is an increasingly popular technique for synthesising policies
to control an agent's interaction with its environment. There is also growing interest in …

[HTML][HTML] Correctness-guaranteed strategy synthesis and compression for multi-agent autonomous systems

R Gu, PG Jensen, C Seceleanu, E Enoiu… - Science of Computer …, 2022 - Elsevier
Planning is a critical function of multi-agent autonomous systems, which includes path
finding and task scheduling. Exhaustive search-based methods such as model checking …