An overview of MPI characteristics of exascale proxy applications
The scale of applications and computing systems is tremendously increasing and needs to
increase even more to realize exascale systems. As the number of nodes keeps growing …
increase even more to realize exascale systems. As the number of nodes keeps growing …
Towards millions of communicating threads
Proceedings of the 23rd European MPI Users' Group Meeting: Towards millions of
communicating threads Page 1 Towards millions of communicating threads Hoang-Vu …
communicating threads Page 1 Towards millions of communicating threads Hoang-Vu …
Measuring multithreaded message matching misery
MPI usage patterns are changing as applications move towards fully-multithreaded
runtimes. However, the impact of these patterns on MPI message matching is not well …
runtimes. However, the impact of these patterns on MPI message matching is not well …
Why is MPI so slow? analyzing the fundamental limits in implementing MPI-3.1
This paper provides an in-depth analysis of the software overheads in the MPI performance-
critical path and exposes mandatory performance overheads that are unavoidable based on …
critical path and exposes mandatory performance overheads that are unavoidable based on …
Partitioned collective communication
Partitioned point-to-point communication and persistent collective communication were both
recently standardized in MPI-4.0. Each offers performance and scalability advantages over …
recently standardized in MPI-4.0. Each offers performance and scalability advantages over …
Cmb: a configurable messaging benchmark to explore fine-grained communication
WP Marts, DA Kruse, MGF Dosanjh… - 2024 IEEE 24th …, 2024 - ieeexplore.ieee.org
Modern communication APIs provide increased ability to specify when, where, and how to
send data between processes. One recent innovation is fine-grained communication, where …
send data between processes. One recent innovation is fine-grained communication, where …
Adaptive and dynamic design for MPI tag matching
M Bayatpour, H Subramoni… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
The Message Passing Interface (MPI) standard specifies the use of (source, tag,
communicator) tuple to identify whether an incoming message is what the receiver process …
communicator) tuple to identify whether an incoming message is what the receiver process …
Accelerating MPI message matching through FPGA offload
Q Xiong, A Skjellum… - 2018 28th International …, 2018 - ieeexplore.ieee.org
The Message Passing Interface (MPI) is the de facto communication standard for distributed-
memory High-Performance Computing (HPC) systems. Ultra-low latency communication in …
memory High-Performance Computing (HPC) systems. Ultra-low latency communication in …
Relaxations for high-performance message passing on massively parallel SIMT processors
Accelerators, such as GPUs, have proven to be highly successful in reducing execution time
and power consumption of compute-intensive applications. Even though they are already …
and power consumption of compute-intensive applications. Even though they are already …
Characterizing MPI matching via trace-based simulation
With the increased scale expected on future leadership-class systems, detailed information
about the resource usage and performance of MPI message matching provides important …
about the resource usage and performance of MPI message matching provides important …