Smartly handling renewable energy instability in supporting a cloud datacenter
The size and energy consumption of datacenters have been increasing significantly over the
past years. As a result, datacenters' increasing electricity monetary cost, energy …
past years. As a result, datacenters' increasing electricity monetary cost, energy …
FT-CNN: Algorithm-based fault tolerance for convolutional neural networks
Convolutional neural networks (CNNs) are becoming more and more important for solving
challenging and critical problems in many fields. CNN inference applications have been …
challenging and critical problems in many fields. CNN inference applications have been …
ePVF: An enhanced program vulnerability factor methodology for cross-layer resilience analysis
B Fang, Q Lu, K Pattabiraman… - 2016 46th Annual …, 2016 - ieeexplore.ieee.org
The Program Vulnerability Factor (PVF) has been proposed as a metric to understand the
impact of hardware faults on software. The PVF is calculated by identifying the program bits …
impact of hardware faults on software. The PVF is calculated by identifying the program bits …
GreenMM: energy efficient GPU matrix multiplication through undervolting
The current trend of ever-increasing performance in scientific applications comes with
tremendous growth in energy consumption. In this paper, we present GreenMM framework …
tremendous growth in energy consumption. In this paper, we present GreenMM framework …
An instability-resilient renewable energy allocation system for a cloud datacenter
Renewable energy supply is a promising solution for datacenters' increasing electricity
monetary cost, energy consumption and harmful gas emissions. However, due to the …
monetary cost, energy consumption and harmful gas emissions. However, due to the …
Winograd convolution: A perspective from fault tolerance
Winograd convolution is originally proposed to reduce the computing overhead by
converting multiplication in neural network (NN) with addition via linear transformation. Other …
converting multiplication in neural network (NN) with addition via linear transformation. Other …
TSM2: optimizing tall-and-skinny matrix-matrix multiplication on GPUs
Linear algebra operations have been widely used in big data analytics and scientific
computations. Many works have been done on optimizing linear algebra operations on …
computations. Many works have been done on optimizing linear algebra operations on …
What does power consumption behavior of hpc jobs reveal?: Demystifying, quantifying, and predicting power consumption characteristics
T Patel, A Wagenhäuser, C Eibel… - 2020 IEEE …, 2020 - ieeexplore.ieee.org
As we approach exascale computing, large-scale HPC systems are becoming increasingly
power-constrained, requiring them to run HPC workloads in an energy-efficient manner. The …
power-constrained, requiring them to run HPC workloads in an energy-efficient manner. The …
New-sum: A novel online abft scheme for general iterative methods
Emerging high-performance computing platforms, with large component counts and lower
power margins, are anticipated to be more susceptible to soft errors in both logic circuits and …
power margins, are anticipated to be more susceptible to soft errors in both logic circuits and …
Artificial intelligence: An energy efficiency tool for enhanced high performance computing
AH Kelechi, MH Alsharif, OJ Bameyi, PJ Ezra… - Symmetry, 2020 - mdpi.com
Power-consuming entities such as high performance computing (HPC) sites and large data
centers are growing with the advance in information technology. In business, HPC is used to …
centers are growing with the advance in information technology. In business, HPC is used to …