Advanced thread synchronization for multithreaded MPI implementations
Concurrent multithreaded access to the Message Passing Interface (MPI) is gaining
importance to support emerging hybrid MPI applications. The interoperability between …
Systemwide power management with Argo
The Argo project is a DOE initiative for designing a modular operating system/runtime for the
next generation of supercomputers. A key focus area in this project is power management …
An efficient abortable-locking protocol for multi-level NUMA systems
The popularity of Non-Uniform Memory Access (NUMA) architectures has led to numerous
locality-preserving hierarchical lock designs, such as HCLH, HMCS, and cohort locks …
Software combining to mitigate multithreaded MPI contention
Efforts to mitigate lock contention from concurrent threaded accesses to MPI have reduced
contention through fine-grained locking, avoided locking altogether by offloading …
Lock contention management in multithreaded MPI
In this article, we investigate contention management in lock-based thread-safe MPI
libraries. Specifically, we make two assumptions: (1) locks are the only form of …
A Distributed Version of Syrup
A portfolio SAT solver has to share clauses in order to be efficient. In a distributed
environment, such sharing implies additional problems: more information has to be …
Level-synchronous BFS algorithm implemented in Java using PCJ library
M Ryczkowska, M Nowicki… - … on Computational Science …, 2016 - ieeexplore.ieee.org
Graph processing is used in many fields of science such as sociology, risk prediction or
biology. Although analysis of graphs is important it also poses numerous challenges …
Towards data-flow parallelization for adaptive mesh refinement applications
Adaptive Mesh Refinement (AMR) is a prevalent method used by distributed-memory
simulation applications to adapt the accuracy of their solutions depending on the turbulent …
Analyzing the performance trade-off in implementing user-level threads
User-level threads have been widely adopted as a means of achieving lightweight
concurrent execution without the costs of OS-level threads. Nevertheless, the costs of …
Lessons learned from analyzing dynamic promotion for user-level threading
A performance vs. practicality trade-off exists between user-level threading techniques. The
community has settled mostly on a black-and-white perspective; fully fledged threads …