[PDF][PDF] Architecting fault tolerant systems

H Muccini, A Romanovsky - School of Computing Science …, 2007 - researchgate.net
As building trustworthy (dependable) systems is one of the major challenges faced by
software developers, dealing with various threats (such as errors, faults and failures) is …

[图书][B] Fault injection techniques and tools for embedded systems reliability evaluation

A Benso, P Prinetto - 2003 - books.google.com
Our society is faced with an increasing dependence on computing systems, not only in high
tech consumer applications but also in areas (eg, air and railway traffic control, nuclear plant …

[图书][B] A discrete event systems approach to failure diagnosis

M Sampath - 1995 - search.proquest.com
Failure diagnosis in industrial systems is a crucial and challenging task. Accurate and timely
diagnosis of system failures can enhance the safety, reliability, availability, quality, and …

Fault analysis and debugging of microservice systems: Industrial survey, benchmark system, and empirical study

X Zhou, X Peng, T Xie, J Sun, C Ji… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
The complexity and dynamism of microservice systems pose unique challenges to a variety
of software engineering tasks such as fault analysis and debugging. In spite of the …

Intelligent prognostics tools and e-maintenance

J Lee, J Ni, D Djurdjanovic, H Qiu, H Liao - Computers in industry, 2006 - Elsevier
In today's global competitive marketplace, there is intense pressure for manufacturing
industries to continuously reduce and eliminate costly, unscheduled downtime and …

[PDF][PDF] Contemporary approaches to fault tolerance

A Wright - Communications of the ACM, 2009 - dl.acm.org
Contemporary approaches to fault tolerance Page 1 news july 2009 | vol. 52 | no. 7 |
communications of the acm 13 Science | DOI:10.1145/1538788.1538794 Alex Wright …

Supporting model-based safety analysis for safety-critical IoT systems

F Ihirwe, D Di Ruscio, K Di Blasio… - Journal of Computer …, 2024 - Elsevier
Dependability is regarded as the ability of the system to provide services that can be trusted
within a specific period. As the complexity and heterogeneity of Internet of Things (IoT) …

[PDF][PDF] Data-driven design of fault diagnosis systems

S Yin - 2012 - duepublico2.uni-due.de
Due to the increasing demands on system performance, production quality as well as
economic operation, modern technical processes become more complicated and the …

IoTRepair: Flexible fault handling in diverse IoT deployments

M Norris, ZB Celik, P Venkatesh, S Zhao… - ACM Transactions on …, 2022 - dl.acm.org
IoT devices can be used to complete a wide array of physical tasks, but due to factors such
as low computational resources and distributed physical deployment, they are susceptible to …

REDTag: a predictive maintenance framework for parcel delivery services

S Proto, E Di Corso, D Apiletti, L Cagliero… - IEEE …, 2020 - ieeexplore.ieee.org
The overwhelming increase of parcel transports has prompted the need for effective and
scalable intelligent logistics systems. In parallel, with the advent of Industry 4.0, a tight …