Predictive reliability and fault management in exascale systems: State of the art and perspectives
Performance and power constraints come together with Complementary Metal Oxide
Semiconductor technology scaling in future Exascale systems. Technology scaling makes …
Semiconductor technology scaling in future Exascale systems. Technology scaling makes …
Nu: Achieving {Microsecond-Scale} resource fungibility with logical processes
Datacenters waste significant compute and memory resources today because they lack
resource fungibility: the ability to reassign resources quickly and without disruption. We …
resource fungibility: the ability to reassign resources quickly and without disruption. We …
Terabase-scale metagenome coassembly with MetaHipMer
Metagenome sequence datasets can contain terabytes of reads, too many to be
coassembled together on a single shared-memory computer; consequently, they have only …
coassembled together on a single shared-memory computer; consequently, they have only …
Spatially distributed infection increases viral load in a computational model of SARS-CoV-2 lung infection
A key question in SARS-CoV-2 infection is why viral loads and patient outcomes vary
dramatically across individuals. Because spatial-temporal dynamics of viral spread and …
dramatically across individuals. Because spatial-temporal dynamics of viral spread and …
Embracing Irregular Parallelism in HPC with YGM
YGM is a general-purpose asynchronous distributed computing library for C++/MPI,
designed to handle the irregular data access patterns and small messages of graph …
designed to handle the irregular data access patterns and small messages of graph …
A Fine-grained Asynchronous Bulk Synchronous parallelism model for PGAS applications
Abstract The Partitioned Global Address Space (PGAS) model is well suited for executing
irregular applications on cluster-based systems, due to its efficient support for short, one …
irregular applications on cluster-based systems, due to its efficient support for short, one …
Static local concurrency errors detection in MPI-RMA programs
E Saillard, M Sergent, CTA Kaci… - 2022 IEEE/ACM Sixth …, 2022 - ieeexplore.ieee.org
Communications are a critical part of HPC simulations, and one of the main focuses of
application developers when scaling on supercomputers. While classical message passing …
application developers when scaling on supercomputers. While classical message passing …
ECP software technology capability assessment report
The Exascale Computing Project (ECP) Software Technology (ST) Focus Area is
responsible for developing critical software capabilities that will enable successful execution …
responsible for developing critical software capabilities that will enable successful execution …
Towards efficient remote openmp offloading
On modern heterogeneous HPC systems, the most popular way to realize distributed
computation is the hybrid programming model of MPI+ X (X being OpenMP/CUDA/etc.), as it …
computation is the hybrid programming model of MPI+ X (X being OpenMP/CUDA/etc.), as it …
Devastator: A Scalable Parallel Discrete Event Simulation Framework for Modern C++
Parallel discrete event simulation is a fundamental simulation technology that is essential to
the parallelization of event-based models including hardware and transportation systems …
the parallelization of event-based models including hardware and transportation systems …