Data-centric reliability management in gpus

G Kadam, E Smirni, A Jog - 2021 51st Annual IEEE/IFIP …, 2021 - ieeexplore.ieee.org
Graphics Processing Units (GPUs) have become the default choice of acceleration in a wide
range of application domains. To keep up with computational demands, the GPU memory …

[图书][B] Low-Overhead Techniques for Secure and Reliable GPU Computing

G Kadam - 2021 - search.proquest.com
Abstract In recent years, Graphics Processing Units (GPUs) have become a de facto choice
to accelerate the computations in various domains such as machine learning, security …

Language support for reliable memory regions

S Hukerikar, C Engelmann - … and Compilers for Parallel Computing: 29th …, 2017 - Springer
The path to exascale computational capabilities in high-performance computing (HPC)
systems is challenged by the inadequacy of present software technologies to adapt to the …

[PDF][PDF] A Survey on Rule-Based Systems and the significance of Fault Tolerance for High-Performance Computing

G Sreenivasulu, PVS Srinivas - International Journal of Computer …, 2018 - academia.edu
The high-performance computing (HPC) systems participate an significant responsibility in
many highly computational applications and systems. Understanding the failure behavior of …