Teaching stratego to play ball: Optimal synthesis for continuous space MDPs

M Landers, A Doryab - ACM Computing Surveys, 2023 - dl.acm.org

Deep reinforcement learning (DRL) has proven capable of superhuman performance on
many complex tasks. To achieve this success, DRL algorithms train a decision-making agent …

被引用次数：23 相关文章所有 2 个版本

[PDF] dagstuhl.de

Safe reinforcement learning using probabilistic shields

N Jansen, B Könighofer, S Junges… - 31st International …, 2020 - drops.dagstuhl.de

This paper concerns the efficient construction of a safety shield for reinforcement learning.
We specifically target scenarios that incorporate uncertainty and use Markov decision …

被引用次数：114 相关文章所有 8 个版本

[PDF] ox.ac.uk

Verifying reinforcement learning up to infinity

E Bacci, M Giacobbe, D Parker - Proceedings of the International Joint …, 2021 - ora.ox.ac.uk

Formally verifying that reinforcement learning systems act safely is increasingly important,
but existing methods only verify over finite time. This is of limited use for dynamical systems …

被引用次数：26 相关文章所有 12 个版本

[PDF] arxiv.org

COOL-MC: a comprehensive tool for reinforcement learning and model checking

D Gross, N Jansen, S Junges, GA Pérez - International Symposium on …, 2022 - Springer

This paper presents COOL-MC, a tool that integrates state-of-the-art reinforcement learning
(RL) and model checking. Specifically, the tool builds upon the OpenAI gym and the …

被引用次数：17 相关文章所有 11 个版本

[PDF] springer.com

Verifiable strategy synthesis for multiple autonomous agents: a scalable approach

R Gu, PG Jensen, DB Poulsen, C Seceleanu… - International Journal on …, 2022 - Springer

Path planning and task scheduling are two challenging problems in the design of multiple
autonomous agents. Both problems can be solved by the use of exhaustive search …

被引用次数：13 相关文章所有 7 个版本

[HTML] nih.gov

Strategy Synthesis for Autonomous Driving in a Moving Block Railway System with Uppaal Stratego

D Basile, MH ter Beek, A Legay - International Conference on Formal …, 2020 - Springer

Moving block railway systems are the next generation signalling systems currently under
development as part of the Shift2Rail European initiative, including autonomous driving …

被引用次数：26 相关文章所有 13 个版本

[PDF] springer.com

Analyzing neural network behavior through deep statistical model checking

TP Gros, H Hermanns, J Hoffmann, M Klauck… - International Journal on …, 2023 - Springer

Neural networks (NN) are taking over ever more decisions thus far taken by humans, even
though verifiable system-level guarantees are far out of reach. Neither is the verification …

被引用次数：7 相关文章所有 7 个版本

[PDF] arxiv.org

Shielded reinforcement learning for hybrid systems

AH Brorholt, PG Jensen, KG Larsen, F Lorber… - … Conference on Bridging …, 2023 - Springer

Safe and optimal controller synthesis for switched-controlled hybrid systems, which combine
differential equations and discrete changes of the system's state, is known to be intricately …

被引用次数：8 相关文章所有 8 个版本

[PDF] arxiv.org

Verified probabilistic policies for deep reinforcement learning

E Bacci, D Parker - NASA Formal Methods Symposium, 2022 - Springer

Deep reinforcement learning is an increasingly popular technique for synthesising policies
to control an agent's interaction with its environment. There is also growing interest in …

被引用次数：9 相关文章所有 15 个版本

[HTML] sciencedirect.com

[HTML][HTML] Correctness-guaranteed strategy synthesis and compression for multi-agent autonomous systems

R Gu, PG Jensen, C Seceleanu, E Enoiu… - Science of Computer …, 2022 - Elsevier

Planning is a critical function of multi-agent autonomous systems, which includes path
finding and task scheduling. Exhaustive search-based methods such as model checking …

被引用次数：9 相关文章所有 10 个版本