A rollback in the history of communication-induced checkpointing

IC Garcia, G Vieira, LE Buzato - arXiv preprint arXiv:1702.06167, 2017 - arxiv.org
The literature on communication-induced checkpointing presents a family of protocols that
use logical clocks to control whether forced checkpoints must be taken. Efficiency of these …

Autonomic web services based on different adaptive quasi-asynchronous checkpointing techniques

M Vargas-Santiago, L Morales-Rosales, R Monroy… - Applied Sciences, 2020 - mdpi.com
Companies, organizations and individuals use Web services to build complex business
functionalities. Web services must operate properly in the unreliable Internet infrastructure …

Autonomic web services enhanced by asynchronous checkpointing

M Vargas-Santiago, L Morales-Rosales… - IEEE …, 2017 - ieeexplore.ieee.org
The evolution of business software technologies is constant and is becoming increasingly
complex which leads to a great probability of software/hardware failures. Business …

Fault tolerance approach based on checkpointing towards dependable business processes

MV Santiago, SEP Hernandez… - IEEE Latin America …, 2016 - ieeexplore.ieee.org
The Business Process Execution Language (BPEL) has become one of the predominant
standards for Web services compositions oriented to business processes. An important …

An efficient validation approach for quasi-synchronous checkpointing oriented to distributed diagnosability

H Khlif, HH Kacem, SEP Hernandez, AH Kacem… - Journal of Systems and …, 2016 - Elsevier
The autonomic computing paradigm is oriented towards enabling complex distributed
systems to manage themselves, even in faulty situations. The diagnosability analysis is a …

A graph transformation-based approach for the validation of checkpointing algorithms in distributed systems

H Khlif, HH Kacem, SEP Hernández… - 2014 IEEE 23rd …, 2014 - ieeexplore.ieee.org
Autonomic Computing Systems are oriented to prevent the human intervention and to
enable distributed systems to manage themselves. One of their challenges is the efficient …

[PDF][PDF] MCSR: A graph transformation based approach for Minimal and Compact Set Representation of Causal Dependencies in Distributed Systems.

H Khlif, HH Kacem, SEP Hernández - TACC, 2023 - ceur-ws.org
Causal ordering is an important property in distributed systems. Several algorithms have
been developed over this principle. For example, there are solutions for roll-back recovery …

Infrastructure hardening: a competitive coevolutionary methodology inspired by neo-darwinian arms races

T Service, D Tauritz, W Siever - 31st Annual International …, 2007 - ieeexplore.ieee.org
The world is increasingly dependent on critical infrastructures such as the electric power
grid, water, gas, and oil transport systems, which are susceptible to cascading failures that …

Reducing Overhead of Distributed Checkpointing with Group Communication

J Ahn - Journal of advanced information technology and …, 2020 - dbpia.co.kr
A protocol HMNR, was proposed to utilize control information of every other process
piggybacked on each sent message for minimizing the number of forced checkpoints. Then …

[PDF][PDF] Dependability for ESB systems in critical environments based on checkpointing and self-healing principles

MV Santiago, SE Hernández, HH Kacem, LA Rosales - 2015 - 192.100.172.221
IBM created a new paradigm called Autonomic Computing (AC); where systems are seen as
self-manageable. To attack the problem of intercommunication and heterogeneity the …