The φ accrual failure detector
The detection of failures is a fundamental issue for fault-tolerance in distributed systems.
Recently, many people have come to realize that failure detection ought to be provided as …
A short introduction to failure detectors for asynchronous distributed systems
M Raynal - ACM SIGACT News, 2005 - dl.acm.org
Since the first version of Chandra and Toueg's seminal paper titled "Unreliable failure
detectors for reliable distributed systems" in 1991, the failure detector concept has been …
From static distributed systems to dynamic systems
A noteworthy advance in distributed computing is due to the recent development of peer-to-
peer systems. These systems are essentially dynamic in the sense that no process can get a …
The alpha of indulgent consensus
R Guerraoui, M Raynal - The Computer Journal, 2007 - ieeexplore.ieee.org
This paper presents a simple framework unifying a family of consensus algorithms that can
tolerate process crash failures and asynchronous periods of the network, also called …
The theta-model: achieving synchrony without clocks
We present a novel partially synchronous system model, which augments the asynchronous
model by a (possibly unknown) bound Θ on the ratio of longest and shortest end-to-end …
Definition and specification of accrual failure detectors
X Défago, P Urbán, N Hayashibara… - … Systems and Networks …, 2005 - ieeexplore.ieee.org
For many years, people have been advocating the development of failure detection as a
basic service, but, unfortunately, without meeting much success so far. We believe that this …
Automated rule-based diagnosis through a distributed monitor system
G Khanna, MY Cheng, P Varadharajan… - … on Dependable and …, 2007 - ieeexplore.ieee.org
In today's world where distributed systems form many of our critical infrastructures,
dependability outages are becoming increasingly common. In many situations, it is …
Time-free and timer-based assumptions can be combined to obtain eventual leadership
A Mostefaoui, M Raynal… - IEEE Transactions on …, 2006 - ieeexplore.ieee.org
Leader-based protocols rest on a primitive able to provide the processes with the same
unique leader. Such protocols are very common in distributed computing to solve …
Eventual leader election with weak assumptions on initial knowledge, communication reliability, and synchrony
This report considers the eventual leader election problem in asynchronous message-
passing systems where an arbitrary number $t$ of processes can crash ($t < n$, where $n$ …
A time-free assumption to implement eventual leadership
A Mostefaoui, E Mourgaya, M Raynal… - Parallel Processing …, 2006 - World Scientific
Leader-based protocols rest on a primitive able to provide the processes with the same
unique leader. Such protocols are very common in distributed computing to solve …