Software fault tolerance: A tutorial

W Torres-Pomales - 2000 - ntrs.nasa.gov
Because of our present inability to produce error-free software, software fault tolerance is
and will continue to be an important consideration in software systems. The root cause of …

Data diversity: An approach to software fault tolerance

PE Ammann, JC Knight - Ieee transactions on computers, 1988 - ieeexplore.ieee.org
Data diversity is described, and the results of a pilot study are presented. The regions of the
input space that cause failure for certain experimental programs are discussed, and data …

Definition and analysis of hardware-and-software fault-tolerant architectures

JC Laprie, J Arlat, C Beounes, K Kanoun - Predictably Dependable …, 1995 - Springer
This paper is devoted to the definition and the analysis of architectures aimed at tolerating
hardware faults and software faults. The paper is composed of fourth sections. The second …

Analysis of preventive maintenance in transactions based software systems

S Garg, A Puliafito, M Telek… - IEEE transactions on …, 1998 - ieeexplore.ieee.org
Preventive maintenance of operational software systems, a novel technique for software
fault tolerance, is used specifically to counteract the phenomenon of software" aging" …

Improving the reliability of commodity operating systems

MM Swift, BN Bershad, HM Levy - ACM Transactions on Computer …, 2005 - dl.acm.org
Despite decades of research in extensible operating system technology, extensions such as
device drivers remain a significant cause of system failures. In Windows XP, for example …

Software fault tolerance: A tutorial

T Wilfredo - 2000 - dl.acm.org
Because of our present inability to produce error-free software, software fault tolerance is
and will continue to be an important consideration in software systems. The root cause of …

Dependability modeling and evaluation of software fault-tolerant systems

J Arlat, K Kanoun, JC Laprie - IEEE Transactions on Computers, 1990 - ieeexplore.ieee.org
Dependability modeling and evaluation (encompassing reliability and safety issues) of the
two major fault tolerance software approaches-recovery blocks (RBs) and N version …

Executable assertions for detecting data errors in embedded control systems

M Hiller - … Conference on Dependable Systems and Networks …, 2000 - ieeexplore.ieee.org
In order to be able to tolerate the effects of faults, we must first detect the symptoms of faults,
ie the errors. This paper evaluates the error detection properties of an error detection …

Modeling of correlated failures and community error recovery in multiversion software

VF Nicola, A Goyal - IEEE Transactions on Software …, 1990 - ieeexplore.ieee.org
Three aspects of the modeling of multiversion software are considered. First, the beta-
binomial distribution is proposed for modeling correlated failures in multiversion software …

Software fault mitigation and availability assurance techniques

KS Trivedi, M Grottke, E Andrade - International Journal of System …, 2010 - Springer
Companies are expected to keep their systems up and running and make data continuously
available. Several recent studies have established that most system outages are due to …