Design of self‐adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution

R Reddy Manumachu… - … : Practice and Experience, 2019 - Wiley Online Library
Self‐adaptability is a highly preferred feature in HPC applications. A crucial building block of
a self‐adaptable application is a data partitioning algorithm that must possess several …

A fuzzy neural network based dynamic data allocation model on heterogeneous multi-GPUs for large-scale computations

CL Zhang, YP Xu, ZJ Xu, J He, J Wang… - International Journal of …, 2018 - Springer
The parallel computation capabilities of modern graphics processing units (GPUs) have
attracted increasing attention from researchers and engineers who have been conducting …

SpMV and BiCG-Stab sparse solver on Multi-GPUs for reservoir simulation

M Al-Mouhamed, L Firdaus, AH Khan… - Multimedia Tools and …, 2024 - Springer
This paper is on a multi-GPU simulation of a petroleum reservoir using a 3D structured grid,
where each point is represented by its state. Using the Darcy model for porous media, each …

PK-OMLP: An OMLP based k-exclusion real-time locking protocol for multi-GPU sharing under partitioned scheduling

M Yang, H Lei, Y Liao, F Rabbe - 2013 IEEE 11th International …, 2013 - ieeexplore.ieee.org
With rapid development of Graphics Processing Units (GPU) technologies, GPUs are
strongly motivated to be adopted in many real-time applications. However, it is still a …

Plb-hec: A profile-based load-balancing algorithm for heterogeneous cpu-gpu clusters

L Sant'Ana, D Cordeiro… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
The use of GPU clusters for scientific applications in areas such as physics, chemistry and
bioinformatics is becoming more widespread. These clusters frequently have different types …

Parallel data partitioning algorithms for optimization of data-parallel applications on modern extreme-scale multicore platforms for performance and energy

RR Manumachu, A Lastovetsky - IEEE Access, 2018 - ieeexplore.ieee.org
Data partitioning algorithms aiming to minimize the execution time and the energy of
computations in self-adaptable data-parallel applications on modern extreme-scale …

Dataflow management, dynamic load balancing, and concurrent processing for real‐time embedded vision applications using Quasar

B Goossens - International Journal of Circuit Theory and …, 2018 - Wiley Online Library
Programming modern embedded vision systems brings various challenges, due to the steep
learning curve for programmers and the different characteristics of the devices. Quasar, a …

PLB-HAC: dynamic load-balancing for heterogeneous accelerator clusters

L Sant'Ana, D Cordeiro, RY de Camargo - Euro-Par 2019: Parallel …, 2019 - Springer
Efficient usage of Heterogeneous clusters containing combinations of CPUs and
accelerators, such as GPUs and Xeon Phi boards requires balancing the computational load …

Astrophysical n-body simulations on grape-4 special-purpose computer

J Makino, M Taiji - Proceedings of the 1995 ACM/IEEE conference on …, 1995 - dl.acm.org
We report on resent astrophysical N-body simulations performed on the GRAPE-4 (GRAvity
PipE 4) system, a special-purpose computer for astrophysical N-body simulations. We first …

Accelerating the phylogenetic parsimony function on heterogeneous systems

S Santander‐Jiménez, A Ilic, L Sousa… - Concurrency and …, 2017 - Wiley Online Library
The availability of heterogeneous CPU+ GPU systems has opened the door to new
opportunities for the development of parallel solutions to tackle complex biological …