The MPI bugs initiative: a framework for MPI verification tools evaluation

M Laurent, E Saillard, M Quinson - 2021 IEEE/ACM 5th …, 2021 - ieeexplore.ieee.org
Ensuring the correctness of MPI programs becomes as challenging and important as
achieving the best performance. Many tools have been proposed in the literature to detect …

OpenACC errors classification and static detection techniques

AM Alghamdi, FE Eassa - IEEE Access, 2019 - ieeexplore.ieee.org
With the continued increase of usage of High-Performance Computing (HPC) in scientific
fields, the need for programming models in a heterogeneous architecture with less …

MPI Errors Detection using GNN Embedding and Vector Embedding over LLVM IR

J El Karchi, H Chen, A TehraniJamsaz… - 2024 IEEE …, 2024 - ieeexplore.ieee.org
Identifying errors in parallel MPI programs is a challenging task. Despite the growing
number of verification tools, debugging parallel programs remains a significant challenge …

Enhancing scalability of a matrix-free eigensolver for studying many-body localization

R Van Beeumen, KZ Ibrahim… - … Journal of High …, 2022 - journals.sagepub.com
We propose several techniques to enhance the parallel scalability of a matrix-free
eigensolver designed for studying many-body localization (MBL) of quantum spin chain …

Runtime correctness checking for emerging programming paradigms

J Protze, C Terboven, MS Müller, S Petiton… - Proceedings of the First …, 2017 - dl.acm.org
With rapidly increasing concurrency, the HPC community is looking for new parallel
programming paradigms to make best use of current and up-coming machines. Under the …

[BOOK][B] Modular techniques and interfaces for data race detection in multi-paradigm parallel programming

J Protze - 2021 - publications.rwth-aachen.de
The demand for ever-growing computing capabilities in scientific computing and simulation
has led to heterogeneous computing systems with multiple parallelism levels. The …

ACC_TEST: Hybrid testing approach for OpenACC-based programs

FE Eassa, AM Alghamdi, S Haridi… - IEEE …, 2020 - ieeexplore.ieee.org
In recent years, OpenACC has been used in many supercomputers and attracted many
non-computer science specialists for parallelizing their programs in different scientific fields …

Partial aggregation for collective communication in distributed memory machines

R Kowalewski - 2021 - edoc.ub.uni-muenchen.de
High Performance Computing (HPC) systems interconnect a large number of
Processing Elements (PEs) in high-bandwidth networks to simulate complex scientific …