An overview of MPI characteristics of exascale proxy applications

B Klenk, H Fröning - … Computing: 32nd International Conference, ISC High …, 2017 - Springer
The scale of applications and computing systems is tremendously increasing and needs to
increase even more to realize exascale systems. As the number of nodes keeps growing …

Towards millions of communicating threads

HV Dang, M Snir, W Gropp - Proceedings of the 23rd European MPI …, 2016 - dl.acm.org
Proceedings of the 23rd European MPI Users' Group Meeting: Towards millions of
communicating threads. Hoang-Vu …

Measuring multithreaded message matching misery

W Schonbein, MGF Dosanjh, RE Grant… - Euro-Par 2018: Parallel …, 2018 - Springer
MPI usage patterns are changing as applications move towards fully-multithreaded
runtimes. However, the impact of these patterns on MPI message matching is not well …

Why is MPI so slow? Analyzing the fundamental limits in implementing MPI-3.1

K Raffenetti, A Amer, L Oden, C Archer… - Proceedings of the …, 2017 - dl.acm.org
This paper provides an in-depth analysis of the software overheads in the MPI performance-
critical path and exposes mandatory performance overheads that are unavoidable based on …

Partitioned collective communication

DJ Holmes, A Skjellum, J Jaeger… - 2021 Workshop on …, 2021 - ieeexplore.ieee.org
Partitioned point-to-point communication and persistent collective communication were both
recently standardized in MPI-4.0. Each offers performance and scalability advantages over …

CMB: a configurable messaging benchmark to explore fine-grained communication

WP Marts, DA Kruse, MGF Dosanjh… - 2024 IEEE 24th …, 2024 - ieeexplore.ieee.org
Modern communication APIs provide increased ability to specify when, where, and how to
send data between processes. One recent innovation is fine-grained communication, where …

Adaptive and dynamic design for MPI tag matching

M Bayatpour, H Subramoni… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
The Message Passing Interface (MPI) standard specifies the use of a (source, tag,
communicator) tuple to identify whether an incoming message is what the receiver process …

Accelerating MPI message matching through FPGA offload

Q Xiong, A Skjellum… - 2018 28th International …, 2018 - ieeexplore.ieee.org
The Message Passing Interface (MPI) is the de facto communication standard for distributed-
memory High-Performance Computing (HPC) systems. Ultra-low latency communication in …

Relaxations for high-performance message passing on massively parallel SIMT processors

B Klenk, H Fröning, H Eberle… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Accelerators, such as GPUs, have proven to be highly successful in reducing execution time
and power consumption of compute-intensive applications. Even though they are already …

Characterizing MPI matching via trace-based simulation

KB Ferreira, S Levy, K Pedretti, RE Grant - Proceedings of the 24th …, 2017 - dl.acm.org
With the increased scale expected on future leadership-class systems, detailed information
about the resource usage and performance of MPI message matching provides important …