A survey of algorithms for black-box safety validation of cyber-physical systems

A Corso, R Moss, M Koren, R Lee… - Journal of Artificial …, 2021 - jair.org
Autonomous cyber-physical systems (CPS) can improve safety and efficiency for safety-
critical applications, but require rigorous testing before deployment. The complexity of these …

Deep reinforcement learning for demand fulfillment in online retail

Y Wang, S Minner - International Journal of Production Economics, 2024 - Elsevier
A distinctive feature of online retail is the flexibility to ship items to customers from different
distribution centers (DCs). This creates interdependence between DCs and poses new …

A multi-agent cooperative reinforcement learning model using a hierarchy of consultants, tutors and workers

BH Abed-alguni, SK Chalup, FA Henskens… - Vietnam Journal of …, 2015 - Springer
The hierarchical organisation of distributed systems can provide an efficient decomposition
for machine learning. This paper proposes an algorithm for cooperative policy construction …

Control design for bounded partially controlled TPNs using timed extended reachability graphs and MDP

D Lefebvre, C Daoui - IEEE Transactions on Systems, Man, and …, 2018 - ieeexplore.ieee.org
This paper is about the design of control sequences for discrete event systems (DESs)
modeled with bounded partially controlled timed Petri nets (PC-TPNs) including a set of …

Swarm Intelligence in Cooperative Environments: N-Step Dynamic Tree Search Algorithm Overview

M Espinós Longa, A Tsourdos, G Inalhan - Journal of Aerospace …, 2023 - arc.aiaa.org
Reinforcement learning tree-based planning methods have been gaining popularity in the
last few years due to their success in single-agent domains, where a perfect simulator model …

Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm

M Espinós Longa, G Inalhan, A Tsourdos - AIAA SciTech 2022 Forum, 2022 - arc.aiaa.org
View Video Presentation: https://doi.org/10.2514/6.2022-1839.vid Uncertainty and partial or
unknown information about environment dynamics have led reward-based methods to play …

Optimal control in Markov decision processes via distributed optimization

J Fu, S Han, U Topcu - 2015 54th IEEE Conference on Decision …, 2015 - ieeexplore.ieee.org
Optimal control synthesis in stochastic systems with respect to quantitative temporal logic
constraints can be formulated as linear programming problems. However, centralized …

Swarm Intelligence in Cooperative Environments: N-Step Dynamic Tree Search Algorithm Extended Analysis

ME Longa, A Tsourdos… - 2022 American Control …, 2022 - ieeexplore.ieee.org
Reinforcement learning tree-based planning methods have been gaining popularity in the
last few years due to their success in single-agent domains, where a perfect simulator model …

Decomposition Methods for Solving Finite‐Horizon Large MDPs

B el Akraoui, C Daoui, A Larach… - Journal of …, 2022 - Wiley Online Library
Conventional algorithms for solving Markov decision processes (MDPs) become intractable
for large finite state and action spaces. Several studies have been devoted to this issue …

Approximated timed reachability graphs for the robust control of discrete event systems

D Lefebvre - Discrete Event Dynamic Systems, 2019 - Springer
This paper is about control sequences design for Discrete Event Systems (DES) modeled
with Time Petri nets (TPN) including a set of temporal specifications. Petri nets are known as …