[HTML][HTML] A review on the decarbonization of high-performance computing centers
High-performance computing relies on performance-oriented infrastructures with access to
powerful computing resources to complete tasks that contribute to solve complex problems …
powerful computing resources to complete tasks that contribute to solve complex problems …
Toward sustainable hpc: Carbon footprint estimation and environmental implications of hpc systems
The rapid growth in demand for HPC systems has led to a rise in carbon footprint, which
requires urgent intervention. In this work, we present a comprehensive analysis of the …
requires urgent intervention. In this work, we present a comprehensive analysis of the …
Dynamic GPU energy optimization for machine learning training workloads
GPUs are widely used to accelerate the training of machine learning workloads. As modern
machine learning models become increasingly larger, they require a longer time to train …
machine learning models become increasingly larger, they require a longer time to train …
Design and Analysis of an APU for Exascale Computing
T Vijayaraghavan, Y Eckert, GH Loh… - … Symposium on High …, 2017 - ieeexplore.ieee.org
The challenges to push computing to exaflop levels are difficult given desired targets for
memory capacity, memory bandwidth, power efficiency, reliability, and cost. This paper …
memory capacity, memory bandwidth, power efficiency, reliability, and cost. This paper …
GPGPU power modeling for multi-domain voltage-frequency scaling
Dynamic Voltage and Frequency Scaling (DVFS) on Graphics Processing Units (GPUs)
components is one of the most promising power management strategies, due to its potential …
components is one of the most promising power management strategies, due to its potential …
A simple model for portable and fast prediction of execution time and power consumption of GPU kernels
L Braun, S Nikas, C Song, V Heuveline… - ACM Transactions on …, 2020 - dl.acm.org
Characterizing compute kernel execution behavior on GPUs for efficient task scheduling is a
non-trivial task. We address this with a simple model enabling portable and fast predictions …
non-trivial task. We address this with a simple model enabling portable and fast predictions …
Understanding the future of energy efficiency in multi-module gpus
As Moore's law slows down, GPUs must pivot towards multi-module designs to continue
scaling performance at historical rates. Prior work on multi-module GPUs has focused on …
scaling performance at historical rates. Prior work on multi-module GPUs has focused on …
Corf: Coalescing operand register file for gpus
The Register File (RF) in GPUs is a critical structure that maintains the state for thousands of
threads that support the GPU processing model. The RF organization substantially affects …
threads that support the GPU processing model. The RF organization substantially affects …
Modeling and decoupling the GPU power consumption for cross-domain DVFS
Dynamic voltage and frequency scaling (DVFS) is a popular technique to improve the
energy-efficiency of high-performance computing systems. It allows placing the devices into …
energy-efficiency of high-performance computing systems. It allows placing the devices into …
BOW: Breathing operand windows to exploit bypassing in GPUs
The Register File (RF) is a critical structure in Graphics Processing Units (GPUs) responsible
for a large portion of the area and power. To simplify the architecture of the RF, it is …
for a large portion of the area and power. To simplify the architecture of the RF, it is …