A survey of CPU-GPU heterogeneous computing techniques
As both CPUs and GPUs become employed in a wide range of applications, it has been
acknowledged that both of these Processing Units (PUs) have their unique features and …
acknowledged that both of these Processing Units (PUs) have their unique features and …
Load balancing in a changing world: dealing with heterogeneity and performance variability
Fully utilizing the power of modern heterogeneous systems requires judiciously dividing
work across all of the available computational devices. Existing approaches for partitioning …
work across all of the available computational devices. Existing approaches for partitioning …
Simplifying programming and load balancing of data parallel applications on heterogeneous systems
Heterogeneous architectures have experienced a great development thanks to their
excellent cost/performance ratio and low power consumption. But heterogeneity significantly …
excellent cost/performance ratio and low power consumption. But heterogeneity significantly …
[HTML][HTML] Sigmoid: An auto-tuned load balancing algorithm for heterogeneous systems
A challenge that heterogeneous system programmers face is leveraging the performance of
all the devices that integrate the system. This paper presents Sigmoid, a new load balancing …
all the devices that integrate the system. This paper presents Sigmoid, a new load balancing …
AEML: An acceleration engine for multi-GPU load-balancing in distributed heterogeneous environment
Z Tang, L Du, X Zhang, L Yang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
For the rapid growth computation requirements in big data and artificial intelligence area,
CPU-GPU heterogeneous clusters can provide more powerful computing capacity …
CPU-GPU heterogeneous clusters can provide more powerful computing capacity …
A dynamic multi–objective approach for dynamic load balancing in heterogeneous systems
Modern standards in High Performance Computing (HPC) have started to consider energy
consumption and power draw as a limiting factor. New and more complex architectures have …
consumption and power draw as a limiting factor. New and more complex architectures have …
A new GPU bundle adjustment method for large-scale data
M Zheng, S Zhou, X Xiong, J Zhu - … Engineering & Remote …, 2017 - ingentaconnect.com
We developed a fast and effective bundle adjustment method for large-scale datasets. The
preconditioned conjugate gradient (PCG) algorithm and GPU parallel computing technology …
preconditioned conjugate gradient (PCG) algorithm and GPU parallel computing technology …
Efficient large Pearson correlation matrix computing using hybrid MPI/CUDA
E Kijsipongse, U Suriya, C Ngamphiw… - … Joint Conference on …, 2011 - ieeexplore.ieee.org
The calculation of pairwise correlation coefficient on a dataset, known as the correlation
matrix, is often used in data analysis, signal processing, pattern recognition, image …
matrix, is often used in data analysis, signal processing, pattern recognition, image …
A fuzzy neural network based dynamic data allocation model on heterogeneous multi-GPUs for large-scale computations
The parallel computation capabilities of modern graphics processing units (GPUs) have
attracted increasing attention from researchers and engineers who have been conducting …
attracted increasing attention from researchers and engineers who have been conducting …
Towards the dynamic load balancing on heterogeneous multi-GPU systems
The advent of multicore systems, joined to the potential acceleration of the graphics
processing units, alleviates some well known important architectural problems at the …
processing units, alleviates some well known important architectural problems at the …