A survey of algorithmic skeleton frameworks: high‐level structured parallel programming enablers

H González‐Vélez, M Leyton - Software: Practice and …, 2010 - Wiley Online Library
Structured parallel programs ought to be conceived as two separate and complementary
entities: computation, which expresses the calculations in a procedural manner, and …

Toward using higher-level abstractions to teach parallel computing

C Ferner, B Wilkinson, B Heath - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
We have developed two new approaches to teaching parallel computing to undergraduates
using higher level tools that lead to ease of programming, good software design, and …

A framework for an automatic hybrid MPI+OpenMP code generation

K Hamidouche, J Falcou, D Etiemble - SpringSim (hpc), 2011 - academia.edu
Clusters of symmetric multiprocessors (SMPs) are the most currently used architecture for
large scale applications and combining MPI and OpenMP models is regarded as a suitable …

AAP4All: An Adaptive Auto Parallelization of Serial Code for HPC Systems

MU Ashraf, FA Eassa, LJ Osterweil… - … Automation & Soft …, 2021 - researchgate.net
High Performance Computing (HPC) technologies are emphasizing to increase the system
performance across many disciplines. The primary challenge in HPC systems is how to …

Automatic code generation for GPUs in llc

R Reyes, F de Sande - The Journal of Supercomputing, 2011 - Springer
llc is a C-based language where parallelism is expressed using compiler directives.
In this paper, we present a new backend of an llc compiler that produces code for GPUs. We …

Optimization strategies in different CUDA architectures using llCoMP

R Reyes, F de Sande - Microprocessors and Microsystems, 2012 - Elsevier
Due to the current proliferation of GPU devices in HPC environments, scientists and
engineers spend much of their time optimizing codes for these platforms. At the same time …

Using patterns to teach parallel computing

C Ferner, B Wilkinson, B Heath - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
In this paper, we describe the results of teaching a parallel programming course using a
pattern programming approach in a course taught across the State of North Carolina on a …

Implementation and performance analysis of sequential versus parallelized algorithms for solving multivariate equations of CFD Poisson processes

S Sbîrnă, LS Sbîrnă - 2023 27th International Conference on …, 2023 - ieeexplore.ieee.org
The present work aims to find computationally-efficient models for solving discretized partial
differential equations. To accomplish that, we implement and compare the performance of a …

A framework for the development of parallel and distributed real-time embedded systems

R Garibay-Martínez, LL Ferreira… - 2012 38th Euromicro …, 2012 - ieeexplore.ieee.org
Embedded real-time applications increasingly present high computation requirements,
which need to be completed within specific deadlines. But, those applications present highly …

Toward optimised skeletons for heterogeneous parallel architecture with performance cost model

KA Armih - 2013 - ros.hw.ac.uk
High performance architectures are increasingly heterogeneous with shared and distributed
memory components, and accelerators like GPUs. Programming such architectures is …