A new approach to latency insensitive design
MR Casu, L Macchiarulo - Proceedings of the 41st Annual Design …, 2004 - dl.acm.org
Latency Insensitive Protocols have been proposed as a viable mean to speed up large
Systems-on-Chip where the limit in clock frequency is given by long global wires connecting …
Systems-on-Chip where the limit in clock frequency is given by long global wires connecting …
Fork-join and data-driven execution models on multi-core architectures: Case study of the FMM
Extracting maximum performance of multi-core architectures is a difficult task primarily due to
bandwidth limitations of the memory subsystem and its complex hierarchy. In this work, we …
bandwidth limitations of the memory subsystem and its complex hierarchy. In this work, we …
[PDF][PDF] Enabling task parallelism for many-core architectures
PR Atkinson - 2021 - research-information.bris.ac.uk
The requirements placed on computer architectures from modern computational workloads
have driven constant performance improvements. In the last 15 years, the largest source of …
have driven constant performance improvements. In the last 15 years, the largest source of …
Scaling fmm with data-driven openmp tasks on multicore architectures
Poor scalability on parallel architectures can be attributed to several factors, among which
idle times, data movement, and runtime overhead are predominant. Conventional parallel …
idle times, data movement, and runtime overhead are predominant. Conventional parallel …
Tapas: an implicitly parallel programming framework for hierarchical n-body algorithms
K Fukuda, M Matsuda, N Maruyama… - 2016 IEEE 22nd …, 2016 - ieeexplore.ieee.org
Tapas is our new C++ programming framework for hierarchical algorithms such as N-body,
on large scale heterogeneous supercomputers. Although N-body and their variants are …
on large scale heterogeneous supercomputers. Although N-body and their variants are …
[PDF][PDF] Adaptation de codes industriels de simulation en Calcul Haute Performance aux architectures modernes de supercalculateurs
N Möller - 2019 - researchgate.net
Computational science is at the core of many scientific domains as much in academic
research than in industrial research and development. Computer simulations are used in a …
research than in industrial research and development. Computer simulations are used in a …
Industrial Code Modernization of High Performance Computing simulations on modern supercomputer architectures
N Möller - 2019 - theses.hal.science
For many years, the stability of the architecture paradigm has facilitated the performance
portability of large HPC codes from one generation of supercomputers to another. The …
portability of large HPC codes from one generation of supercomputers to another. The …
[PDF][PDF] Tapas: An Implicitly Parallel ProgrammingFramework For Hierarchical
K Fukuda, M Matsuda, N Maruyama… - Book name The 22nd … - t2r2.star.titech.ac.jp
Tapas is our new C++ programming framework for hierarchical algorithms such as n-body,
on large scale heterogeneous supercomputers. Although n-body and their variants are …
on large scale heterogeneous supercomputers. Although n-body and their variants are …