ECP software technology capability assessment report
The Exascale Computing Project (ECP) Software Technology (ST) Focus Area is
responsible for developing critical software capabilities that will enable successful execution …
responsible for developing critical software capabilities that will enable successful execution …
X-OpenMP—eXtreme fine-grained tasking using lock-less work stealing
Processors with 100s of threads of execution are among the state-of-the-art in high-end
computing systems. This transition to many-core computing has required the community to …
computing systems. This transition to many-core computing has required the community to …
Finer-lru: A scalable page management scheme for hpc manycore architectures
In HPC systems, the increasing need for a higher level of concurrency has led to packing
more cores within a single chip. However, since multiple processes share memory space …
more cores within a single chip. However, since multiple processes share memory space …
Software combining to mitigate multithreaded MPI contention
Efforts to mitigate lock contention from concurrent threaded accesses to MPI have reduced
contention through fine-grained locking, avoided locking altogether by offloading …
contention through fine-grained locking, avoided locking altogether by offloading …
Runtime techniques for efficient execution of virtualized, migratable MPI ranks
S White - 2022 - ideals.illinois.edu
Abstract The Message Passing Interface (MPI) is the dominant programming system for
scientific applications that run on distributed memory parallel computers. MPI is a library …
scientific applications that run on distributed memory parallel computers. MPI is a library …
ECP Software Technology Capability Assessment Report V3. 0
The Exascale Computing Project (ECP) Software Technology (ST) focus area is responsible
for (1) developing critical software capabilities that will enable the successful execution of …
for (1) developing critical software capabilities that will enable the successful execution of …
A Survey on Minimizing Lock Contention in Shared Resources in Linux Kernel
Many programs in multi-core environment use shared-memory parallelism using multi-
threading. The multiple threads typically use locks to coordinate access the shared …
threading. The multiple threads typically use locks to coordinate access the shared …
I/O Performance Optimization Schemes for Manycore HPC Systems
방지우 - 2023 - s-space.snu.ac.kr
High-performance computing (HPC) systems are composed of thousands of compute nodes,
storage systems, and high-speed networks, which provide multiple layers of I/O stacks with …
storage systems, and high-speed networks, which provide multiple layers of I/O stacks with …
A Fine-Grained Page Management Scheme For Hpc Manycore I/O Systems
In HPC systems, the increasing need for a higher level of concurrency has led to packing
more cores within a single chip. However, since multiple processes share memory space …
more cores within a single chip. However, since multiple processes share memory space …
Partial aggregation for collective communication in distributed memory machines
R Kowalewski - 2021 - edoc.ub.uni-muenchen.de
Abstract High Performance Computing (HPC) systems interconnect a large number of
Process-ing Elements (PEs) in high-bandwidth networks to simulate complex scientific …
Process-ing Elements (PEs) in high-bandwidth networks to simulate complex scientific …