Toward a scalable, transactional, fault-tolerant message passing interface for petascale and exascale machines

A Hassani - 2016 - search.proquest.com
Increases in the scale of computing machines directly correlate with the rate of failures. High
Performance Computing (HPC) applications provide fault-tolerance through redundancy in …

[CITATION][C] FA-MPI: fault-aware MPI specification and concept of operations

A Skjellum, PV Bangalore, YS Dandass - University of Alabama at Birmingham, Tech …, 2012