Software Resource Disaggregation for HPC with Serverless Computing
Aggregated HPC resources have rigid allocation systems and programming models which
struggle to adapt to diverse and changing workloads. Consequently, HPC systems fail to …
struggle to adapt to diverse and changing workloads. Consequently, HPC systems fail to …
MPI-based Remote OpenMP Offloading: A More Efficient and Easy-to-use Implementation
MPI+ X is the most popular hybrid programming model for distributed computation on
modern heterogeneous HPC systems. Nonetheless, for simplicity, HPC developers ideally …
modern heterogeneous HPC systems. Nonetheless, for simplicity, HPC developers ideally …
OpenMP kernel language extensions for performance portable GPU codes
In contemporary high-performance computing architectures, the integration of GPU
accelerators has become increasingly prevalent. To harness the full potential of these …
accelerators has become increasingly prevalent. To harness the full potential of these …
Evaluation of Programming Models and Performance for Stencil Computation on Current GPU Architectures
B Shan, M Araya-Polo - arXiv preprint arXiv:2404.04441, 2024 - arxiv.org
Accelerated computing is widely used in high-performance computing. Therefore, it is crucial
to experiment and discover how to better utilize GPUGPUs latest generations on relevant …
to experiment and discover how to better utilize GPUGPUs latest generations on relevant …
Evaluation of Programming Models and Performance for Stencil Computation on GPGPUs
B Shan, M Araya-Polo - 2024 IEEE International Parallel and …, 2024 - ieeexplore.ieee.org
GPGPUs are widely used in high-performance computing. Therefore, it is crucial to
experiment and discover how to better utilize their latest generations of relevant …
experiment and discover how to better utilize their latest generations of relevant …
Towards a Scalable and Efficient PGAS-Based Distributed OpenMP
MPI+ X has been the de facto standard for distributed memory parallel programming. It is
widely used primarily as an explicit two-sided communication model, which often leads to …
widely used primarily as an explicit two-sided communication model, which often leads to …
Evaluation of Directive-Based Programming Models for Stencil Computation on Current GPGPU Architectures
Stencil calculations are a widely-used computing pattern, and tracking the performance of
such computing pattern on modern GPGPUs is of interest to the computational community. In …
such computing pattern on modern GPGPUs is of interest to the computational community. In …
Transparent Remote OpenMP Offloading Based on MPI
IK Kasmeridis, S Mantelos, A Piperis… - … Conference on Parallel …, 2023 - Springer
In this work, we present an efficient mechanism which allows unmodified OpenMP
applications to leverage the computational resources of any node in a cluster through the …
applications to leverage the computational resources of any node in a cluster through the …
A Multi-purpose Framework for Efficient Parallelized Execution of Charged Particle Tracking
G Mania - 2023 - ediss.sub.uni-hamburg.de
Complex particle tracking software used in High Energy Physics experiments already
pushes the edges of computing resources with demanding requirements for speed and …
pushes the edges of computing resources with demanding requirements for speed and …
An Exploration of Task-Based Programming for Scientific Applications
E Raut - 2022 - search.proquest.com
This dissertation explores the state-of-the-art in task-based programming. One of the main
motivating examples is Reverse Time Migration (RTM), a seismic imaging technique …
motivating examples is Reverse Time Migration (RTM), a seismic imaging technique …