Toward a scalable, transactional, fault-tolerant message passing interface for petascale and exascale machines
A Hassani - 2016 - search.proquest.com
Increases in the scale of computing machines directly correlate with the rate of failures. High
Performance Computing (HPC) applications provide fault-tolerance through redundancy in …
Performance Computing (HPC) applications provide fault-tolerance through redundancy in …
[引用][C] FA-MPI: fault-aware MPI specification and concept of operations
A Skjellum, PV Bangalore, YS Dandass - University of Alabama at Birmingham, Tech …, 2012