Parallel programming models for heterogeneous many-cores: a comprehensive survey
Heterogeneous many-cores are now an integral part of modern computing systems ranging
from embedded systems to supercomputers. While heterogeneous many-core design offers …
Programming languages for data-intensive HPC applications: A systematic mapping study
A major challenge in modelling and simulation is the need to combine expertise in both
software technologies and a given scientific domain. When High-Performance Computing …
SPBench: a framework for creating benchmarks of stream processing applications
In a fast-changing data-driven world, real-time data processing systems are becoming
ubiquitous in everyday applications. The increasing data we produce, such as audio, video …
The NAS parallel benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures
The NAS Parallel Benchmarks (NPB), originally implemented mostly in Fortran, is a
consolidated suite containing several benchmarks extracted from Computational Fluid …
Efficient NAS benchmark kernels with C++ parallel programming
Benchmarking is a way to study the performance of new architectures and parallel
programming frameworks. Well-established benchmark suites such as the NAS Parallel …
Algorithmic skeletons and parallel design patterns in mainstream parallel programming
This paper discusses the impact of structured parallel programming methodologies in state-
of-the-art industrial and research parallel programming frameworks. We first recap the main …
Simplifying self-adaptive and power-aware computing with Nornir
Self-adaptation is an emerging requirement in parallel computing. It enables the dynamic
selection of resources to allocate to the application in order to meet performance and power …
Stream parallelism with ordered data constraints on multi-core systems
It is often a challenge to keep input/output tasks/results in order for parallel computations
over data streams, particularly when stateless task operators are replicated to increase …
Accelerating actor-based applications with parallel patterns
L Rinaldi, M Torquati, G Mencagli… - 2019 27th Euromicro …, 2019 - ieeexplore.ieee.org
Parallel programmers mandate high-level parallel programming tools allowing to reduce the
effort of the efficient parallelization of their applications. Parallel programming leveraging …
Assessing Application Efficiency and Performance Portability in Single-Source Programming for Heterogeneous Parallel Systems
A Ernstsson, D Griebler, C Kessler - International Journal of Parallel …, 2023 - Springer
We analyze the performance portability of the skeleton-based, single-source multi-backend
high-level programming framework SkePU across multiple different CPU–GPU …