- 学术资源搜索

A survey of algorithms for black-box safety validation of cyber-physical systems

A Corso, R Moss, M Koren, R Lee… - Journal of Artificial …, 2021 - jair.org

Autonomous cyber-physical systems (CPS) can improve safety and efficiency for safety-
critical applications, but require rigorous testing before deployment. The complexity of these …

被引用次数：210 相关文章所有 9 个版本

Deep reinforcement learning for demand fulfillment in online retail

Y Wang, S Minner - International Journal of Production Economics, 2024 - Elsevier

A distinctive feature of online retail is the flexibility to ship items to customers from different
distribution centers (DCs). This creates interdependence between DCs and poses new …

被引用次数：4 相关文章所有 3 个版本

[PDF] springer.com

A multi-agent cooperative reinforcement learning model using a hierarchy of consultants, tutors and workers

BH Abed-alguni, SK Chalup, FA Henskens… - Vietnam Journal of …, 2015 - Springer

The hierarchical organisation of distributed systems can provide an efficient decomposition
for machine learning. This paper proposes an algorithm for cooperative policy construction …

被引用次数：58 相关文章所有 10 个版本

Control design for bounded partially controlled TPNs using timed extended reachability graphs and MDP

D Lefebvre, C Daoui - IEEE Transactions on Systems, Man, and …, 2018 - ieeexplore.ieee.org

This paper is about the design of control sequences for discrete event systems (DESs)
modeled with bounded partially controlled timed Petri nets (PC-TPNs) including a set of …

被引用次数：22 相关文章

[PDF] aiaa.org

Swarm Intelligence in Cooperative Environments: -Step Dynamic Tree Search Algorithm Overview

M Espinós Longa, A Tsourdos, G Inalhan - Journal of Aerospace …, 2023 - arc.aiaa.org

Reinforcement learning tree-based planning methods have been gaining popularity in the
last few years due to their success in single-agent domains, where a perfect simulator model …

被引用次数：3 相关文章所有 2 个版本

[PDF] cranfield.ac.uk

Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm

M Espinós Longa, G Inalhan, A Tsourdos - AIAA SciTech 2022 Forum, 2022 - arc.aiaa.org

View Video Presentation: https://doi. org/10.2514/6.2022-1839. vid Uncertainty and partial or
unknown information about environment dynamics have led reward-based methods to play …

被引用次数：5 相关文章所有 3 个版本

[PDF] arxiv.org

Optimal control in Markov decision processes via distributed optimization

J Fu, S Han, U Topcu - 2015 54th IEEE Conference on Decision …, 2015 - ieeexplore.ieee.org

Optimal control synthesis in stochastic systems with respect to quantitative temporal logic
constraints can be formulated as linear programming problems. However, centralized …

被引用次数：19 相关文章所有 9 个版本

Swarm Intelligence in Cooperative Environments: N-Step Dynamic Tree Search Algorithm Extended Analysis

ME Longa, A Tsourdos… - 2022 American Control …, 2022 - ieeexplore.ieee.org

Reinforcement learning tree-based planning methods have been gaining popularity in the
last few years due to their success in single-agent domains, where a perfect simulator model …

被引用次数：3 相关文章

[PDF] wiley.com Full View

Decomposition Methods for Solving Finite‐Horizon Large MDPs

B el Akraoui, C Daoui, A Larach… - Journal of …, 2022 - Wiley Online Library

Conventional algorithms for solving Markov decision processes (MDPs) become intractable
for a large finite state and action spaces. Several studies have been devoted to this issue …

被引用次数：3 相关文章所有 5 个版本

Approximated timed reachability graphs for the robust control of discrete event systems

D Lefebvre - Discrete Event Dynamic Systems, 2019 - Springer

This paper is about control sequences design for Discrete Event Systems (DES) modeled
with Time Petri nets (TPN) including a set of temporal specifications. Petri nets are known as …

被引用次数：10 相关文章所有 4 个版本