Diagnosis of asynchronous discrete-event systems: a net unfolding approach

A Benveniste, E Fabre, S Haar… - IEEE Transactions on …, 2003 - ieeexplore.ieee.org
In this paper, we consider the diagnosis of asynchronous discrete event systems. We follow
a so-called true concurrency approach, in which no global state and no global time is …

Distributed monitoring of concurrent and asynchronous systems

E Fabre, A Benveniste, S Haar, C Jard - Discrete Event Dynamic Systems, 2005 - Springer
In this paper we study the diagnosis of distributed asynchronous systems with concurrency.
Diagnosis is performed by a peer-to-peer distributed architecture of supervisors. Our …

Machine learning approaches to early fault detection and identification in NFV architectures

A Elmajed, A Aghasaryan… - 2020 6th IEEE Conference …, 2020 - ieeexplore.ieee.org
Virtualization technologies become pervasive in networking, as a way to better exploit
hardware capabilities and to quickly deploy tailored networking solutions for customers. But …

Distributed monitoring of concurrent and asynchronous systems

A Benveniste, S Haar, E Fabre, C Jard - International Conference on …, 2003 - Springer
Developing applications over a distributed and asynchronous architecture without the need
for synchronization services is going to become a central track for distributed computing …

Identification of discrete event systems for fault detection purposes

S Klein - Doctorat de l'Ecole Normale Supérieure de Cachan …, 2005 - Citeseer
Increasing the availability of production systems is an important economic issue. The
challenge is to reduce the downtime of the systems by avoiding failures. This can be …

Root Cause Analysis for Cloud-native Applications

B Żurkowski, K Zieliński - IEEE Transactions on Cloud …, 2024 - ieeexplore.ieee.org
Root cause analysis (RCA) is a critical component in maintaining the reliability and
performance of modern cloud applications. However, due to the inherent complexity of cloud …

UniFAFF: a unified framework for implementing autonomic fault management and failure detection for self‐managing networks

R Chaparadza - International Journal of Network Management, 2009 - Wiley Online Library
Today's network management, as known within the Fault, Configuration, Accounting,
Performance, Security (FCAPS) management framework, is moving towards the definition …

Stimulus-based sandbox for learning resource dependencies in virtualized distributed applications

A Aghasaryan, M Bouzid… - 2017 20th Conference on …, 2017 - ieeexplore.ieee.org
In this paper, we present an approach for automated profiling of cloud-based distributed
applications. The failure dependencies within or between application nodes can be …

Distributed and asynchronous discrete event systems diagnosis

A Benveniste, S Haar, E Fabre… - 42nd IEEE International …, 2003 - ieeexplore.ieee.org
This paper deals with distributed and asynchronous discrete event systems diagnosis. This
paper has proposed an unfolding approach to the distributed diagnosis of concurrent and …

UML Specification of a generic model for fault diagnosis of telecommunication networks

A Aghasaryan, C Jard, J Thomas - International Conference on …, 2004 - Springer
This document presents a generic model capturing the essential structural and behavioral
characteristics of network components in the light of fault management. The generic model is …