Predicting faults in high performance computing systems: An in-depth survey of the state-of-the-practice

D Jauk, D Yang, M Schulz - … of the International Conference for High …, 2019 - dl.acm.org
As we near exascale, resilience remains a major technical hurdle. Any technique with the
goal of achieving resilience suffers from having to be reactive, as failures can appear at any …

Development of models in resilient computing

O Drozd, V Kharchenko, A Rucinski… - 2019 10th …, 2019 - ieeexplore.ieee.org
The article analyzes the concept of" Resilience" in relation to the development of computing.
The strategy for reacting to perturbations in this process can be based either on" harsh …

[PDF][PDF] Patterns for things that fail

A Ramadas, G Domingues, JP Dias, A Aguiar… - Proceedings of the 24th …, 2017 - hillside.net
Internet of Things is a paradigm that empowers the Internet-connected heterogeneous
devices alongside with their capabilities to sense the physical world and act on it. Internet of …

A heterogeneous fault diagnosis approach to enhance performance of connected vehicles

B Ranjan Senapati, S Swain… - International Journal …, 2023 - Wiley Online Library
Evolution of wireless access technology, availability of smart sensors, and reduction in the
size of the set up of the communication system have engrossed many researchers toward …

A survey on software/hardware fault injection tools and techniques

NK Salih, D Satyanarayana… - … IEEE Symposium on …, 2022 - ieeexplore.ieee.org
Computational systems need to be dependable, but they are vulnerable to defects, errors,
failure and faults, and they affect their behaviour. Fault injection is one of the useful …

Real-time fault detection and diagnosis using intelligent monitoring and supervision systems

GP Alvarez - Fault detection, diagnosis and prognosis, 2020 - books.google.com
In monitoring and supervision schemes, fault detection and diagnosis characterize high
efficiency and quality production systems. To achieve such properties, these structures are …

[PDF][PDF] Ws-diamond: An approach to web services-diagnosability, monitoring and diagnosis

L Console, WSD Team - International e-Challenges Conference …, 2007 - researchgate.net
Self-healing software is one of the challenges issues for IST research. The WS-Diamond
project aims at making a step in this direction by developing a framework for self-healing …

Cascading failures in internet of things: review and perspectives on reliability and resilience

L Xing - IEEE Internet of Things Journal, 2020 - ieeexplore.ieee.org
In the Internet of Things (IoT), various devices operate collaboratively in collecting data,
relaying information to one another, and processing information intelligently. Due to …

[图书][B] Soft computing in condition monitoring and diagnostics of electrical and mechanical systems

H Malik, A Iqbal, AK Yadav - 2020 - Springer
First of all, we are thankful to the contributors of this edited book. We are indebted to 44
author experts in the fields of soft computing, condition monitoring, and fault diagnosis of …

Maintenance assistance application of Engineering to Order manufacturing equipment: A Product Service System (PSS) approach

D Mourtzis, J Angelopoulos, N Boli - IFAC-PapersOnLine, 2018 - Elsevier
Following nowadays shift to Servitization and Digitalization, which is aligned with
technological advances in mobile technologies and mixed reality, this paper introduces an …