Adaptive contention management for fine-grained synchronization on commodity GPUs

L Gao, J Wang, W Zhang - ACM Transactions on Architecture and Code …, 2022 - dl.acm.org
As more emerging applications are moving to GPUs, fine-grained synchronization has
become imperative. However, their performance can be severely impaired in case of …

Gpu-ether: Gpu-native packet i/o for gpu applications on commodity ethernet

C Jung, S Kim, I Yeom, H Woo… - IEEE INFOCOM 2021 …, 2021 - ieeexplore.ieee.org
Despite the advent of various network enhancement technologies, it is yet a challenge to
provide high-performance networking for GPU-accelerated applications on commodity …

General Purpose Computing on Graphics Processing Units for Accelerated Deep Learning in Neural Networks

C Helmick - 2022 - digitalcommons.liberty.edu
Graphics processing units (GPUs) contain a significant number of cores relative to central
processing units (CPUs), allowing them to handle high levels of parallelization in …

Convolutional neural network adaptation and optimization method in SIMT computing mode

冯臻夫, 张亚英, 杨乐乐, 邢立冬 - 中国邮电高校学报(英文), 2024 - jcupt.bupt.edu.cn
For studying and optimizing the performance of general-purpose computing on graphics
processing units (GPGPU) based on single instruction multiple threads (SIMT) processor …