A survey of algorithms for black-box safety validation of cyber-physical systems
Autonomous cyber-physical systems (CPS) can improve safety and efficiency for safety-
critical applications, but require rigorous testing before deployment. The complexity of these …
critical applications, but require rigorous testing before deployment. The complexity of these …
Deep reinforcement learning for demand fulfillment in online retail
A distinctive feature of online retail is the flexibility to ship items to customers from different
distribution centers (DCs). This creates interdependence between DCs and poses new …
distribution centers (DCs). This creates interdependence between DCs and poses new …
A multi-agent cooperative reinforcement learning model using a hierarchy of consultants, tutors and workers
The hierarchical organisation of distributed systems can provide an efficient decomposition
for machine learning. This paper proposes an algorithm for cooperative policy construction …
for machine learning. This paper proposes an algorithm for cooperative policy construction …
Control design for bounded partially controlled TPNs using timed extended reachability graphs and MDP
D Lefebvre, C Daoui - IEEE Transactions on Systems, Man, and …, 2018 - ieeexplore.ieee.org
This paper is about the design of control sequences for discrete event systems (DESs)
modeled with bounded partially controlled timed Petri nets (PC-TPNs) including a set of …
modeled with bounded partially controlled timed Petri nets (PC-TPNs) including a set of …
Swarm Intelligence in Cooperative Environments: -Step Dynamic Tree Search Algorithm Overview
M Espinós Longa, A Tsourdos, G Inalhan - Journal of Aerospace …, 2023 - arc.aiaa.org
Reinforcement learning tree-based planning methods have been gaining popularity in the
last few years due to their success in single-agent domains, where a perfect simulator model …
last few years due to their success in single-agent domains, where a perfect simulator model …
Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm
M Espinós Longa, G Inalhan, A Tsourdos - AIAA SciTech 2022 Forum, 2022 - arc.aiaa.org
View Video Presentation: https://doi. org/10.2514/6.2022-1839. vid Uncertainty and partial or
unknown information about environment dynamics have led reward-based methods to play …
unknown information about environment dynamics have led reward-based methods to play …
Optimal control in Markov decision processes via distributed optimization
Optimal control synthesis in stochastic systems with respect to quantitative temporal logic
constraints can be formulated as linear programming problems. However, centralized …
constraints can be formulated as linear programming problems. However, centralized …
Swarm Intelligence in Cooperative Environments: N-Step Dynamic Tree Search Algorithm Extended Analysis
ME Longa, A Tsourdos… - 2022 American Control …, 2022 - ieeexplore.ieee.org
Reinforcement learning tree-based planning methods have been gaining popularity in the
last few years due to their success in single-agent domains, where a perfect simulator model …
last few years due to their success in single-agent domains, where a perfect simulator model …
Decomposition Methods for Solving Finite‐Horizon Large MDPs
B el Akraoui, C Daoui, A Larach… - Journal of …, 2022 - Wiley Online Library
Conventional algorithms for solving Markov decision processes (MDPs) become intractable
for a large finite state and action spaces. Several studies have been devoted to this issue …
for a large finite state and action spaces. Several studies have been devoted to this issue …
Approximated timed reachability graphs for the robust control of discrete event systems
D Lefebvre - Discrete Event Dynamic Systems, 2019 - Springer
This paper is about control sequences design for Discrete Event Systems (DES) modeled
with Time Petri nets (TPN) including a set of temporal specifications. Petri nets are known as …
with Time Petri nets (TPN) including a set of temporal specifications. Petri nets are known as …