A survey of CPU-GPU heterogeneous computing techniques

S Mittal, JS Vetter - ACM Computing Surveys (CSUR), 2015 - dl.acm.org
As both CPUs and GPUs become employed in a wide range of applications, it has been
acknowledged that both of these Processing Units (PUs) have their unique features and …

Load balancing in a changing world: dealing with heterogeneity and performance variability

M Boyer, K Skadron, S Che, N Jayasena - Proceedings of the ACM …, 2013 - dl.acm.org
Fully utilizing the power of modern heterogeneous systems requires judiciously dividing
work across all of the available computational devices. Existing approaches for partitioning …

Simplifying programming and load balancing of data parallel applications on heterogeneous systems

B Pérez, JL Bosque, R Beivide - … of the 9th Annual Workshop on General …, 2016 - dl.acm.org
Heterogeneous architectures have experienced a great development thanks to their
excellent cost/performance ratio and low power consumption. But heterogeneity significantly …

[HTML][HTML] Sigmoid: An auto-tuned load balancing algorithm for heterogeneous systems

B Pérez, E Stafford, JL Bosque, R Beivide - Journal of Parallel and …, 2021 - Elsevier
A challenge that heterogeneous system programmers face is leveraging the performance of
all the devices that integrate the system. This paper presents Sigmoid, a new load balancing …

AEML: An acceleration engine for multi-GPU load-balancing in distributed heterogeneous environment

Z Tang, L Du, X Zhang, L Yang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
For the rapid growth computation requirements in big data and artificial intelligence area,
CPU-GPU heterogeneous clusters can provide more powerful computing capacity …

A dynamic multi–objective approach for dynamic load balancing in heterogeneous systems

A Cabrera, A Acosta, F Almeida… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Modern standards in High Performance Computing (HPC) have started to consider energy
consumption and power draw as a limiting factor. New and more complex architectures have …

A new GPU bundle adjustment method for large-scale data

M Zheng, S Zhou, X Xiong, J Zhu - … Engineering & Remote …, 2017 - ingentaconnect.com
We developed a fast and effective bundle adjustment method for large-scale datasets. The
preconditioned conjugate gradient (PCG) algorithm and GPU parallel computing technology …

Efficient large Pearson correlation matrix computing using hybrid MPI/CUDA

E Kijsipongse, U Suriya, C Ngamphiw… - … Joint Conference on …, 2011 - ieeexplore.ieee.org
The calculation of pairwise correlation coefficient on a dataset, known as the correlation
matrix, is often used in data analysis, signal processing, pattern recognition, image …

A fuzzy neural network based dynamic data allocation model on heterogeneous multi-GPUs for large-scale computations

CL Zhang, YP Xu, ZJ Xu, J He, J Wang… - International Journal of …, 2018 - Springer
The parallel computation capabilities of modern graphics processing units (GPUs) have
attracted increasing attention from researchers and engineers who have been conducting …

Towards the dynamic load balancing on heterogeneous multi-GPU systems

A Acosta, V Blanco, F Almeida - 2012 IEEE 10th International …, 2012 - ieeexplore.ieee.org
The advent of multicore systems, joined to the potential acceleration of the graphics
processing units, alleviates some well known important architectural problems at the …