A new approach to latency insensitive design

MR Casu, L Macchiarulo - Proceedings of the 41st Annual Design …, 2004 - dl.acm.org
Latency Insensitive Protocols have been proposed as a viable mean to speed up large
Systems-on-Chip where the limit in clock frequency is given by long global wires connecting …

Fork-join and data-driven execution models on multi-core architectures: Case study of the FMM

A Amer, N Maruyama, M Pericàs, K Taura… - … Conference, ISC 2013 …, 2013 - Springer
Extracting maximum performance of multi-core architectures is a difficult task primarily due to
bandwidth limitations of the memory subsystem and its complex hierarchy. In this work, we …

[PDF][PDF] Enabling task parallelism for many-core architectures

PR Atkinson - 2021 - research-information.bris.ac.uk
The requirements placed on computer architectures from modern computational workloads
have driven constant performance improvements. In the last 15 years, the largest source of …

Scaling fmm with data-driven openmp tasks on multicore architectures

A Amer, S Matsuoka, M Pericas, N Maruyama… - … : Memory, Devices, and …, 2016 - Springer
Poor scalability on parallel architectures can be attributed to several factors, among which
idle times, data movement, and runtime overhead are predominant. Conventional parallel …

Tapas: an implicitly parallel programming framework for hierarchical n-body algorithms

K Fukuda, M Matsuda, N Maruyama… - 2016 IEEE 22nd …, 2016 - ieeexplore.ieee.org
Tapas is our new C++ programming framework for hierarchical algorithms such as N-body,
on large scale heterogeneous supercomputers. Although N-body and their variants are …

[PDF][PDF] Adaptation de codes industriels de simulation en Calcul Haute Performance aux architectures modernes de supercalculateurs

N Möller - 2019 - researchgate.net
Computational science is at the core of many scientific domains as much in academic
research than in industrial research and development. Computer simulations are used in a …

Industrial Code Modernization of High Performance Computing simulations on modern supercomputer architectures

N Möller - 2019 - theses.hal.science
For many years, the stability of the architecture paradigm has facilitated the performance
portability of large HPC codes from one generation of supercomputers to another. The …

[PDF][PDF] Tapas: An Implicitly Parallel ProgrammingFramework For Hierarchical

K Fukuda, M Matsuda, N Maruyama… - Book name The 22nd … - t2r2.star.titech.ac.jp
Tapas is our new C++ programming framework for hierarchical algorithms such as n-body,
on large scale heterogeneous supercomputers. Although n-body and their variants are …