Enabling efficient multithreaded MPI communication through a library-based implementation...

B Acun, K Chandrasekar… - 2019 Tenth International …, 2019 - ieeexplore.ieee.org

Dynamic voltage and frequency scaling (DVFS) is a well-known technique to reduce the
power and/or energy consumption of various applications. While most processors provide …

被引用次数：36 相关文章所有 6 个版本

[PDF] nsf.gov

Give MPI threading a fair chance: A study of multithreaded MPI designs

T Patinyasakdikul, D Eberius… - … Conference on Cluster …, 2019 - ieeexplore.ieee.org

The Message Passing Interface (MPI) has been one of the most prominent programming
paradigms in high-performance computing (HPC) for the past decade. Lately, with changes …

被引用次数：29 相关文章所有 6 个版本

[PDF] acm.org

Improving MPI multi-threaded RMA communication performance

N Hjelm, MGF Dosanjh, RE Grant, T Groves… - Proceedings of the 47th …, 2018 - dl.acm.org

One-sided communication is crucial to enabling communication concurrency. As core counts
have increased, particularly with many-core architectures, one-sided (RMA) communication …

被引用次数：35 相关文章所有 2 个版本

[PDF] acm.org

How I learned to stop worrying about user-visible endpoints and love MPI

R Zambre, A Chandramowliswharan… - Proceedings of the 34th …, 2020 - dl.acm.org

MPI+ threads is gaining prominence as an alternative to the traditional" MPI everywhere"
model in order to better handle the disproportionate increase in the number of cores …

被引用次数：24 相关文章所有 5 个版本

[PDF] arxiv.org

Lessons learned on MPI+ threads communication

R Zambre… - … Conference for High …, 2022 - ieeexplore.ieee.org

Hybrid MPI+ threads programming is gaining prominence, but, in practice, applications
perform slower with it compared to the MPI everywhere model. The most critical challenge to …

被引用次数：7 相关文章所有 8 个版本

[PDF] acm.org

Mpi sessions: Leveraging runtime infrastructure to increase scalability of applications at exascale

D Holmes, K Mohror, RE Grant, A Skjellum… - Proceedings of the 23rd …, 2016 - dl.acm.org

MPI includes all processes in MPI_COMM_WORLD; this is untenable for reasons of scale,
resiliency, and overhead. This paper offers a new approach, extending MPI with a new …

被引用次数：38 相关文章所有 4 个版本

Exampi: A modern design and implementation to accelerate message passing interface innovation

A Skjellum, M Rüfenacht, N Sultana, D Schafer… - … Computing: 6th Latin …, 2020 - Springer

The difficulty of deep experimentation with Message Passing Interface (MPI)
implementations—which are quite large and complex—substantially raises the cost and …

被引用次数：19 相关文章所有 2 个版本

[PDF] osti.gov

RMA-MT: a benchmark suite for assessing MPI multi-threaded RMA performance

MGF Dosanjh, T Groves, RE Grant… - 2016 16th IEEE/ACM …, 2016 - ieeexplore.ieee.org

Reaching Exascale will require leveraging massive parallelism while potentially leveraging
asynchronous communication to help achieve scalability at such large levels of concurrency …

被引用次数：36 相关文章所有 7 个版本

[PDF] pitt.edu

Process-in-process: techniques for practical address-space sharing

A Hori, M Si, B Gerofi, M Takagi, J Dayal… - Proceedings of the 27th …, 2018 - dl.acm.org

The two most common parallel execution models for many-core CPUs today are
multiprocess (eg, MPI) and multithread (eg, OpenMP). The multiprocess model allows each …

被引用次数：28 相关文章所有 9 个版本

[PDF] osti.gov

Fuzzy matching: Hardware accelerated mpi communication middleware

MGF Dosanjh, W Schonbein, RE Grant… - 2019 19th IEEE/ACM …, 2019 - ieeexplore.ieee.org

Contemporary parallel scientific codes often rely on message passing for inter-process
communication. However, inefficient coding practices or multithreading (eg, via …

被引用次数：16 相关文章所有 5 个版本