Parallel programming models for heterogeneous many-cores: a comprehensive survey

J Fang, C Huang, T Tang, Z Wang - CCF Transactions on High …, 2020 - Springer
Heterogeneous many-cores are now an integral part of modern computing systems ranging
from embedded systems to supercomputers. While heterogeneous many-core design offers …

Programming languages for data-Intensive HPC applications: A systematic mapping study

V Amaral, B Norberto, M Goulão, M Aldinucci… - Parallel Computing, 2020 - Elsevier
A major challenge in modelling and simulation is the need to combine expertise in both
software technologies and a given scientific domain. When High-Performance Computing …

SPBench: a framework for creating benchmarks of stream processing applications

AM Garcia, D Griebler, C Schepke, LG Fernandes - Computing, 2023 - Springer
In a fast-changing data-driven world, real-time data processing systems are becoming
ubiquitous in everyday applications. The increasing data we produce, such as audio, video …

The NAS parallel benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures

J Löff, D Griebler, G Mencagli, G Araujo… - Future Generation …, 2021 - Elsevier
The NAS Parallel Benchmarks (NPB), originally implemented mostly in Fortran, is a
consolidated suite containing several benchmarks extracted from Computational Fluid …

Efficient NAS benchmark kernels with C++ parallel programming

D Griebler, J Loff, G Mencagli… - 2018 26th Euromicro …, 2018 - ieeexplore.ieee.org
Benchmarking is a way to study the performance of new architectures and parallel
programming frameworks. Well-established benchmark suites such as the NAS Parallel …

Algorithmic skeletons and parallel design patterns in mainstream parallel programming

M Danelutto, G Mencagli, M Torquati… - International Journal of …, 2021 - Springer
This paper discusses the impact of structured parallel programming methodologies in state-
of-the-art industrial and research parallel programming frameworks. We first recap the main …

Simplifying self-adaptive and power-aware computing with Nornir

D De Sensi, T De Matteis, M Danelutto - Future Generation Computer …, 2018 - Elsevier
Self-adaptation is an emerging requirement in parallel computing. It enables the dynamic
selection of resources to allocate to the application in order to meet performance and power …

Stream parallelism with ordered data constraints on multi-core systems

D Griebler, RB Hoffmann, M Danelutto… - The Journal of …, 2019 - Springer
It is often a challenge to keep input/output tasks/results in order for parallel computations
over data streams, particularly when stateless task operators are replicated to increase …

Accelerating actor-based applications with parallel patterns

L Rinaldi, M Torquati, G Mencagli… - 2019 27th Euromicro …, 2019 - ieeexplore.ieee.org
Parallel programmers mandate high-level parallel programming tools allowing to reduce the
effort of the efficient parallelization of their applications. Parallel programming leveraging …

Assessing Application Efficiency and Performance Portability in Single-Source Programming for Heterogeneous Parallel Systems

A Ernstsson, D Griebler, C Kessler - International Journal of Parallel …, 2023 - Springer
We analyze the performance portability of the skeleton-based, single-source multi-backend
high-level programming framework SkePU across multiple different CPU–GPU …