Parallel programming models for heterogeneous many-cores: a comprehensive survey
Heterogeneous many-cores are now an integral part of modern computing systems ranging
from embedding systems to supercomputers. While heterogeneous many-core design offers …
from embedding systems to supercomputers. While heterogeneous many-core design offers …
Graviton: Trusted execution environments on {GPUs}
We propose Graviton, an architecture for supporting trusted execution environments on
GPUs. Graviton enables applications to offload security-and performance-sensitive kernels …
GPUs. Graviton enables applications to offload security-and performance-sensitive kernels …
Multi2Sim: A simulation framework for CPU-GPU computing
Accurate simulation is essential for the proper design and evaluation of any computing
platform. Upon the current move toward the CPU-GPU heterogeneous computing era …
platform. Upon the current move toward the CPU-GPU heterogeneous computing era …
Duality cache for data parallel acceleration
Duality Cache is an in-cache computation architecture that enables general purpose data
parallel applications to run on caches. This paper presents a holistic approach of building …
parallel applications to run on caches. This paper presents a holistic approach of building …
Gdev:{First-Class}{GPU} Resource Management in the Operating System
Graphics processing units (GPUs) have become a very powerful platform embracing a
concept of heterogeneous many-core computing. However, application domains of GPUs …
concept of heterogeneous many-core computing. However, application domains of GPUs …
A performance analysis framework for identifying potential benefits in GPGPU applications
Tuning code for GPGPU and other emerging many-core platforms is a challenge because
few models or tools can precisely pinpoint the root cause of performance bottlenecks. In this …
few models or tools can precisely pinpoint the root cause of performance bottlenecks. In this …
A survey on techniques for cooperative CPU-GPU computing
K Raju, NN Chiplunkar - Sustainable Computing: Informatics and Systems, 2018 - Elsevier
Abstract Graphical Processing Unit provides massive parallelism due to the presence of
hundreds of cores. Usage of GPUs for general purpose computation (GPGPU) has resulted …
hundreds of cores. Usage of GPUs for general purpose computation (GPGPU) has resulted …
GKLEE: concolic verification and test generation for GPUs
Programs written for GPUs often contain correctness errors such as races, deadlocks, or
may compute the wrong result. Existing debugging tools often miss these errors because of …
may compute the wrong result. Existing debugging tools often miss these errors because of …
Transparent CPU-GPU collaboration for data-parallel kernels on heterogeneous systems
Heterogeneous computing on CPUs and GPUs has traditionally used fixed roles for each
device: the GPU handles data parallel work by taking advantage of its massive number of …
device: the GPU handles data parallel work by taking advantage of its massive number of …
OCCA: A unified approach to multi-threading languages
The inability to predict lasting languages and architectures led us to develop OCCA, a C++
library focused on host-device interaction. Using run-time compilation and macro …
library focused on host-device interaction. Using run-time compilation and macro …