Smartly handling renewable energy instability in supporting a cloud datacenter

J Gao, H Wang, H Shen - 2020 IEEE international parallel and …, 2020 - ieeexplore.ieee.org
The size and energy consumption of datacenters have been increasing significantly over the
past years. As a result, datacenters' increasing electricity monetary cost, energy …

FT-CNN: Algorithm-based fault tolerance for convolutional neural networks

K Zhao, S Di, S Li, X Liang, Y Zhai… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) are becoming more and more important for solving
challenging and critical problems in many fields. CNN inference applications have been …

ePVF: An enhanced program vulnerability factor methodology for cross-layer resilience analysis

B Fang, Q Lu, K Pattabiraman… - 2016 46th Annual …, 2016 - ieeexplore.ieee.org
The Program Vulnerability Factor (PVF) has been proposed as a metric to understand the
impact of hardware faults on software. The PVF is calculated by identifying the program bits …

GreenMM: energy efficient GPU matrix multiplication through undervolting

H Zamani, Y Liu, D Tripathy, L Bhuyan… - Proceedings of the ACM …, 2019 - dl.acm.org
The current trend of ever-increasing performance in scientific applications comes with
tremendous growth in energy consumption. In this paper, we present GreenMM framework …

An instability-resilient renewable energy allocation system for a cloud datacenter

H Shen, H Wang, J Gao, R Buyya - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Renewable energy supply is a promising solution for datacenters' increasing electricity
monetary cost, energy consumption and harmful gas emissions. However, due to the …

Winograd convolution: A perspective from fault tolerance

X Xue, H Huang, C Liu, T Luo, L Zhang… - Proceedings of the 59th …, 2022 - dl.acm.org
Winograd convolution is originally proposed to reduce the computing overhead by
converting multiplication in neural network (NN) with addition via linear transformation. Other …

TSM2: optimizing tall-and-skinny matrix-matrix multiplication on GPUs

J Chen, N Xiong, X Liang, D Tao, S Li… - Proceedings of the …, 2019 - dl.acm.org
Linear algebra operations have been widely used in big data analytics and scientific
computations. Many works have been done on optimizing linear algebra operations on …

What does power consumption behavior of hpc jobs reveal?: Demystifying, quantifying, and predicting power consumption characteristics

T Patel, A Wagenhäuser, C Eibel… - 2020 IEEE …, 2020 - ieeexplore.ieee.org
As we approach exascale computing, large-scale HPC systems are becoming increasingly
power-constrained, requiring them to run HPC workloads in an energy-efficient manner. The …

New-sum: A novel online abft scheme for general iterative methods

D Tao, SL Song, S Krishnamoorthy, P Wu… - Proceedings of the 25th …, 2016 - dl.acm.org
Emerging high-performance computing platforms, with large component counts and lower
power margins, are anticipated to be more susceptible to soft errors in both logic circuits and …

Artificial intelligence: An energy efficiency tool for enhanced high performance computing

AH Kelechi, MH Alsharif, OJ Bameyi, PJ Ezra… - Symmetry, 2020 - mdpi.com
Power-consuming entities such as high performance computing (HPC) sites and large data
centers are growing with the advance in information technology. In business, HPC is used to …