A survey of CPU-GPU heterogeneous computing techniques

S Mittal, JS Vetter - ACM Computing Surveys (CSUR), 2015 - dl.acm.org
As both CPUs and GPUs become employed in a wide range of applications, it has been
acknowledged that both of these Processing Units (PUs) have their unique features and …

Parallel programming models for heterogeneous many-cores: a comprehensive survey

J Fang, C Huang, T Tang, Z Wang - CCF Transactions on High …, 2020 - Springer
Heterogeneous many-cores are now an integral part of modern computing systems ranging
from embedding systems to supercomputers. While heterogeneous many-core design offers …

Telekine: Secure computing with cloud {GPUs}

T Hunt, Z Jia, V Miller, A Szekely, Y Hu… - … USENIX Symposium on …, 2020 - usenix.org
GPUs have become ubiquitous in the cloud due to the dramatic performance gains they
enable in domains such as machine learning and computer vision. However, offloading …

Fluidic kernels: Cooperative execution of opencl programs on multiple heterogeneous devices

P Pandit, R Govindarajan - … IEEE/ACM International Symposium on Code …, 2014 - dl.acm.org
Programming heterogeneous computing systems with Graphics Processing Units (GPU) and
multi-core CPUs in them is complex and time-consuming. OpenCL has emerged as an …

A complete and efficient CUDA-sharing solution for HPC clusters

AJ Pena, C Reaño, F Silla, R Mayo, ES Quintana-Ortí… - Parallel Computing, 2014 - Elsevier
In this paper we detail the key features, architectural design, and implementation of rCUDA,
an advanced framework to enable remote and transparent GPGPU acceleration in HPC …

High performance in silico virtual drug screening on many-core processors

S McIntosh-Smith, J Price… - … journal of high …, 2015 - journals.sagepub.com
Drug screening is an important part of the drug development pipeline for the pharmaceutical
industry. Traditional, lab-based methods are increasingly being augmented with …

An automatic input-sensitive approach for heterogeneous task partitioning

K Kofler, I Grasso, B Cosenza, T Fahringer - Proceedings of the 27th …, 2013 - dl.acm.org
Unleashing the full potential of heterogeneous systems, consisting of multi-core CPUs and
GPUs, is a challenging task due to the difference in processing capabilities, memory …

IRIS: A portable runtime system exploiting multiple heterogeneous programming systems

J Kim, S Lee, B Johnston… - 2021 IEEE High …, 2021 - ieeexplore.ieee.org
Across embedded, mobile, enterprise, and high performance computing systems, computer
architectures are becoming more heterogeneous and complex. This complexity is causing a …

CEDR: A compiler-integrated, extensible DSSoC runtime

J Mack, S Hassan, N Kumbhare… - ACM Transactions on …, 2023 - dl.acm.org
In this work, we present a C ompiler-integrated, E xtensible D omain Specific System on
Chip R untime (CEDR) ecosystem to facilitate research toward addressing the challenges of …

An asynchronous dataflow-driven execution model for distributed accelerator computing

P Salzmann, F Knorr, P Thoman… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
While domain-specific HPC software packages continue to thrive and are vital to many
scientific communities, a general purpose high-productivity GPU cluster programming model …